Professor Paul Clough
BEng (York), PhD (Sheffield)
Information School
Professor of Search and Analytics
+44 114 222 2664
Full contact details
Information School
Room C600
The Wave
2 Whitham Road
Sheffield
S10 2AH
- Profile
-
I joined British Telecommunications Plc (BT) in 1991 on their technician training scheme and worked for 10 years at BT’s research centre called Adastral Park. During my training, I worked in various research groups in BT and studied telecommunications, electronics and software engineering at Suffolk College. During my time at BT I worked on various projects developing novel hardware and software solutions, ranging from problems in telecommunications to information management.
In 1995 I was sponsored by BT to study Computer Science at the University of York and after graduating decided to pursue an academic career in computing. I left BT and joined the University of Sheffield in 1999 working as a Research Assistant (RA) in the Department of Computer Science in collaboration with the Journalism Department and the British Press Association on a project entitled “Measuring Text Reuse”.
During this time I also completed my PhD under the supervision of Prof. Yorick Wilks. Following various interests in NLP and Information Retrieval (IR) I worked as an RA on a range of research projects until 2005 when I became a lecturer in the Information School. I now head the Information Retrieval research group in the Information School and have continued teaching and researching various aspects of data management and information storage and retrieval.
University Responsibilities
- Head of Information Retrieval Research Group.
- REF Coordinator
- Co-Director of Digital Societies Network.
- Member of the Departmental Research Committee.
- Programme coordinator for MSc Data Science.
- Deputy programme coordinator for MSc Digital Library Management.
- Module Coordinator:
- Information Retrieval
- Information Systems in Health.
- Staff Review and Development Scheme reviewer.
- Research interests
-
My research interests focus on developing effective retrieval technologies that support users as they seek to fulfil their information needs. Specifically I have carried out research in the areas of multilingual search, retrieval of images, geo-spatial search, analysis of transaction logs, text re-use and plagiarism detection, and the evaluation of search systems.
I have published over 100 peer-reviewed articles, including a co-authored Springer book on multilingual information retrieval. My background in natural language processing, gained during my PhD, has allowed me to develop more sophisticated approaches to accessing information. In addition to developing techniques, I have also built up an understanding of the users of information access systems and their information needs, taking a more user-oriented view to my research.
A further theme of my research has been to create re-usable evaluation resources (corpora and test collections) for the wider research community, such as computational linguistics and information retrieval. I have been involved in coordinating activities at three international evaluation campaigns: the Cross Language Evaluation Form (CLEF) in Europe, the Text Retrieval Conference (TREC) in the US and the Forum for Information Retrieval Evaluation (FIRE) in India.
I am head of the Information Retrieval research group.
- Publications
-
Books
- Geographic Information Retrieval: Progress and Challenges in Spatial Search of Text. Now Publishers.
- Multilingual Information Retrieval: From Research To Practice. Berlin/Heidelberg: Springer-Verlag Berlin Heidelberg.
- Preface.
Edited books
- ImageCLEF: Experimental Evaluation in Visual Information Retrieval. Springer-Verlag New York Incorporated.
Journal articles
- Automated evaluation of comments to aid software maintenance. Journal of Software: Evolution and Process, 34(7).
- Investigating the usage of IoT-based smart parking services in the Borough of Westminster. Journal of Global Information Management, 29(6). View this article in WRRO
- Paraphrase type identification for plagiarism detection using contexts and word embeddings. International Journal of Educational Technology in Higher Education, 18. View this article in WRRO
- Investigating clickbait in Chinese social media : a study of WeChat. Online Social Networks and Media, 19.
- Guest editorial : special section on “social and cultural biases in information, algorithms, and systems”. Online Information Review, 44(2), 321-323. View this article in WRRO
- Characterising online museum users: a study of the National Museums Liverpool museum website. International Journal on Digital Libraries, 21(1), 75-87. View this article in WRRO
- Representing search tasks in an information use environment : a case of English primary schools. Journal of Documentation, 75(6), 1370-1395. View this article in WRRO
- Investigating behavioural and computational approaches for defining imprecise regions. Spatial Cognition and Computation, 19(2), 146-171. View this article in WRRO
- View this article in WRRO Using classroom talk to understand children’s search processes for tasks with different goals. Information Research, 24(1).
- How the information use environment influences search activities: a case of English primary schools. Journal of Documentation. View this article in WRRO
- Embedded, added, cocreated: Revisiting the value of information in an age of data. Journal of the Association for Information Science and Technology, 69(5), 744-748. View this article in WRRO
- Users and uses of a global union catalog: A mixed‐methods study of WorldCat.org. Journal of the American Society for Information Science and Technology, 68(9), 2166-2181. View this article in WRRO
- Affective Experiences of International and Home Students During the Information Search Process. New Review of Academic Librarianship. View this article in WRRO
- Predicting meeting participants’ note-taking from previously uttered dialogue acts. Journal of Systems and Information Technology, 18(2), 170-185.
- An IR-Based Approach Utilizing Query Expansion for Plagiarism Detection in MEDLINE. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 14(4), 796-804. View this article in WRRO
- View this article in WRRO Graph Literacy and Business Intelligence: Investigating User Understanding of Dashboard Data Visualizations. Business Intelligence Journal, 20(4), 8-19.
- SIGIR 2014. ACM SIGIR Forum, 49(1), 16-19.
- How do children reformulate their search queries?. INFORMATION RESEARCH-AN INTERNATIONAL ELECTRONIC JOURNAL, 20(1).
- An overview of semantic search evaluation initiatives. Journal of Web Semantics, 30, 82-105. View this article in WRRO
- CLEF 2014. ACM SIGIR Forum, 48(2), 56-62.
- Cognitive styles within an exploratory search system for digital libraries. Journal of Documentation, 70, 970-996. View this article in WRRO
- Evaluating hierarchical organisation structures for exploring digital libraries. Information Retrieval, 17(4), 351-379. View this article in WRRO
- A User Evaluation Study: Do Participants' Personal Notes Help Us to Summarise Meetings?. Knowledge and Process Management, 21(2), 122-133.
- SIGIR 2014 workshop on gathering efficient assessments of relevance (GEAR). Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrieval - SIGIR '14.
- Comparing Medline citations using modified N-grams. Journal of the American Medical Informatics Association, 21(1), 105-110.
- Investigating Religious Information Searching through the Analysis of a Search Engine Log. Journal of the American Society for Information Science and Technology, 64(12), 2492-2506.
- View this article in WRRO Evaluating the performance of information retrieval systems using test collections. Information Research, 18(2).
- Will we be lost without paper maps in the digital age?. Journal of Information Science, 39(1), 48-60.
- Investigating the information-seeking behaviour of genealogists and family historians. Journal of Information Science, 39(1), 73-84.
- ENRICH 2013. ACM SIGIR Forum, 47(2), 68-73.
- Can Social Tagging Assist Information Literacy Practices in Academic Libraries?, 408-414.
- How do healthcare professionals select the medical images they need?. Aslib Proceedings, 65(1), 54-72.
- Examining the limits of crowdsourcing for relevance assessment. IEEE Internet Computing, 17(4), 32-38.
- Computing Similarity between Items in a Digital Library of Cultural Heritage. Journal on Computing and Cultural Heritage, 5(4), 1-19.
- "Readers who borrowed this also borrowed...": Recommender systems in UK libraries. Library Hi Tech, 30(1), 134-150. View this article in WRRO
- Participants' personal note-taking in meetings and its value for automatic meeting summarisation.. Inf. Technol. Manag., 13, 39-57.
- How do health care professionals select medical images they need?. Aslib Proceedings, 64(4), 437-456.
- Socio-Technical Lifelogging: Deriving Design Principles for a Future Proof Digital Past. Human–Computer Interaction, 27(1-2), 37-62.
- Retrieving candidate plagiarised documents using query expansion. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 7224 LNCS, 207-218.
- Analysing User's Queries for Cross-Language Image Retrieval from Digital Library Collections.. The Electronic Library, 30, 197-219.
- Developing a corpus of plagiarised short answers. Language Resources and Evaluation, 45(1), 5-24. View this article in WRRO
- CLEF 2011: conference on multilingual and multimodal information access evaluation.. SIGIR Forum, 45, 32-37.
- If we build it, will they come? Recommendations and WorldCat. Proceedings of the American Society for Information Science and Technology, 48(1), 1-3.
- Developing metrics to characterize Flickr groups. Journal of the American Society for Information Science and Technology, 62(3), 493-506. View this article in WRRO
- Report on the ECIR 2011 workshop on information retrieval over query sessions.. SIGIR Forum, 45, 76-80.
- Evaluating multi-query sessions. SIGIR'11 - Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 1053-1062.
- Medical image resources used by, health care professionals. Aslib Proceedings, 63(6), 570-585.
- Linking archival data to location: a case study at the UK National Archives. Aslib Proceedings, 63(2-3), 127-147.
- Special issue on image and video retrieval evaluation.. Comput. Vis. Image Underst., 114, 409-410.
- Highlights from GIR 2010: The 6th Workshop on Geographic Information Retrieval (Zurich, Switzerland - February 18--19, 2010).. ACM SIGSPATIAL Special, 2, 17-23.
- Investigating Language Skills and Field of Knowledge on Multilingual Information Access in Digital Libraries.. International Journal of Digital Library Systems, 1, 89-103.
- View this article in WRRO Easy on that trigger dad: a study of long term family photo retrieval.. Pers. Ubiquitous Comput., 14, 31-43.
- Report on the TrebleCLEF query log analysis workshop 2009.. SIGIR Forum, 43, 71-77.
- View this article in WRRO A Study on the Relevance Criteria for Medical Images. Pattern Recognition Letters, 29, 2046-2057.
- View this article in WRRO Flickr: A first look at user behaviour in the context of photography as serious leisure. Information Research, 13(1).
- Exploring the relationship between feature and perceptual visual spaces.. Journal of the American Society for Information Science and Technology, 59, 770-784.
- Modelling vague places with knowledge from the Web. International Journal of Geographical Information Science, 22(10), 1045-1065. View this article in WRRO
- View this article in WRRO The design and implementation of SPIRIT: a spatially aware search engine for information retrieval on the Internet.. Int. J. Geogr. Inf. Sci., 21, 717-745.
- User experiments with the Eurovision cross-language image retrieval system.. Journal of the American Society for Information Science and Technology, 57, 697-708. View this article in WRRO
- Image retrieval: Large-scale evaluation of cross-language image retrieval systems. Bulletin of the American Society for Information Science and Technology, 33(3), 18-21.
Chapters
- Smart Manufacturing, Smart Connected World (pp. 141-169). Springer International Publishing View this article in WRRO
- Multi-Lingual Retrieval of Pictures in ImageCLEF, Information Retrieval Evaluation in a Changing World (pp. 217-230). Springer International Publishing
- Collecting Comparable Corpora, Using Comparable Corpora for Under-Resourced Areas of Machine Translation (pp. 55-87). Springer International Publishing
- Cross-Language Comparability and Its Applications for MT, Using Comparable Corpora for Under-Resourced Areas of Machine Translation (pp. 13-53). Springer International Publishing
- Appendices, Using Comparable Corpora for Under-Resourced Areas of Machine Translation (pp. 291-323). Springer International Publishing
- Mining search logs for usage patterns, Text Mining and Visualization: Case Studies Using Open-Source Tools (pp. 153-172).
- View this article in WRRO Supporting Exploration and Use of Digital Cultural Heritage Materials: the PATHS Perspective In Ruthven I & Chowdhury GG (Ed.), Cultural Heritage Information Access and Management (pp. 197-220). Facet Publishing
- Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics): Preface In Kanoulas E, Lupu M, Clough P, Sanderson M, Hall M, Hanbury A & Toms E (Ed.), Information Access Evaluation. Multilinguality, Multimodality, and Interaction 5th International Conference of the CLEF Initiative, CLEF 2014, Sheffield, UK, September 15-18, 2014. Proceedings (pp. v-vi). Springer International Publishing
- Building and Using Comparable Corpora Springer Berlin Heidelberg
- Methods for Collection and Evaluation of Comparable Documents, Building and Using Comparable Corpora (pp. 93-112). Springer Berlin Heidelberg
- Investigating Language Skills and Field of Knowledge on Multilingual Information Access in Digital Libraries, Multimedia Storage and Retrieval Innovations for Digital Library Systems (pp. 85-100). IGI Global
- User-related issues in multilingual access to multimedia collections In Dobreva M, O'Dwyer A & Feliciati P (Ed.), User Studies for Digital Library Development
- Data Sets Created in ImageCLEF, ImageCLEF (pp. 19-43). Springer Berlin Heidelberg
- Seven Years of Image Retrieval Evaluation, ImageCLEF (pp. 3-18). Springer Berlin Heidelberg
- Corpora and text re-use, CORPUS LINGUISTICS, PART 2 (pp. 1249-1271).
- Measuring text reuse in the news industry, Copyright and Piracy (pp. 247-259). Cambridge University Press
Conference proceedings papers
- Efficiency of Large Language Models to scale up Ground Truth: Overview of the IRSE Track at Forum for Information Retrieval 2023. Proceedings of the 15th Annual Meeting of the Forum for Information Retrieval Evaluation
- Can we predict useful comments in source codes? - Analysis of findings from Information Retrieval in Software Engineering Track @ FIRE 2022. Proceedings of the 14th Annual Meeting of the Forum for Information Retrieval Evaluation
- Conducting Information Science research during pandemics: Experience and Reflections of the Information Retrieval Research Group in Sheffield. Information Science Trends - ASIS&T European Chapter Series, 9 June 2021 - 11 June 2021. View this article in WRRO
- Integrating fate/critical data studies into data science curricula: Where are we going and how do we get there?. FAT* 2020 - Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency (pp 425-435) View this article in WRRO
- Analysis of transaction logs from National Museums Liverpool. TPDL 2019 Proceedings : Digital Libraries for Open Knowledge (pp 84-98). Oslo, Norway, 9 September 2019 - 12 September 2019. View this article in WRRO
- Investigating User Perception of Gender Bias in Image Search. The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval
- Preface. CEUR Workshop Proceedings, Vol. 2103 (pp i-ii)
- Competent Men and Warm Women: Gender Stereotypes and Backlash in Image Search Results. Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, 6 May 2017 - 11 May 2017. View this article in WRRO
- Europeana: What Users Search for and Why (pp 207-219) View this article in WRRO
- The Ghost in the Museum Website: Investigating the General Public’s Interactions with Museum Websites (pp 434-445) View this article in WRRO
- Using Section Headings to Compute Cross-Lingual Similarity of Wikipedia Articles (pp 633-639) View this article in WRRO
- Plagiarism Detection in Texts Obfuscated with Homoglyphs (pp 669-675) View this article in WRRO
- ACHS'16. Proceedings of the 16th ACM/IEEE-CS on Joint Conference on Digital Libraries - JCDL '16, 19 June 2016 - 23 June 2016.
- Investigating Cluster Stability when Analyzing Transaction Logs. Proceedings of the 16th ACM/IEEE-CS on Joint Conference on Digital Libraries - JCDL '16, 19 June 2016 - 23 June 2016. View this article in WRRO
- Evaluating Retrieval over Sessions. Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval - SIGIR '16, 17 July 2016 - 21 July 2016.
- Report on ECIR 2016. ACM SIGIR Forum, Vol. 50(1) (pp 12-27)
- View this article in WRRO Exploring entity-centric methods in the UK Government Web Archive. Proceedings of the First International Workshop on Accessing Cultural Heritage at Scale (ACHS’16), Vol. 1611, 22 June 2016 - 22 June 2016.
- Preface of the proceedings of ACHS'16 the first international workshop on accessing cultural heritage at Scale. CEUR Workshop Proceedings, Vol. 1611
- User categories for digital cultural heritage. CEUR Workshop Proceedings, Vol. 1611
- Determining the Optimal Session Interval for Transaction Log Analysis of an Online Library Catalogue (pp 703-708)
- Evaluation. Proceedings of the Forum for Information Retrieval Evaluation on - FIRE '14, 5 December 2014 - 7 December 2014. View this article in WRRO
- The short stories corpus. CEUR Workshop Proceedings, Vol. 1391
- The short stories corpus. CEUR Workshop Proceedings, Vol. 1391
- The Short Stories Corpus: Notebook for PAN at CLEF 2015.. CLEF (Working Notes), Vol. 1391
- View this article in WRRO query reformulation
- Integrating Mixed-Methods for Evaluating Information Access Systems (pp 306-311)
- View this article in WRRO Overview of the SBS 2015 Interactive Track. CEUR Workshop Proceedings
- Unfair Means: Use Cases Beyond Plagiarism (pp 229-234) View this article in WRRO
- Examining New Event Detection. Proceedings of the 2014 Australasian Document Computing Symposium on - ADCS '14, 27 November 2014 - 28 November 2014.
- View this article in WRRO PATHS in context: User characteristics and the construction of cultural heritage narratives. iConference Proceedings 2014
- SIGIR 2014 Workshop on Gathering Efficient Assessments of Relevance (GEAR). SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (pp 1293-1293)
- Hashing and merging heuristics for text reuse detection: Notebook for PAN at CLEF-2014. CEUR Workshop Proceedings, Vol. 1180 (pp 939-946)
- Hashing and merging heuristics for text reuse detection: Notebook for PAN at CLEF-2014. CEUR Workshop Proceedings, Vol. 1180 (pp 939-946)
- Hashing and Merging Heuristics for Text Reuse Detection.. CLEF (Working Notes), Vol. 1180 (pp 939-946)
- Personalised PageRank for making recommendations in digital cultural heritage collections. IEEE/ACM Joint Conference on Digital Libraries, 8 September 2014 - 12 September 2014.
- Categorising search sessions. Proceedings of the 5th Information Interaction in Context Symposium on - IIiX '14, 26 August 2014 - 30 August 2014.
- Investigating the potential impact of non-personalized recommendations in the OPAC. Proceedings of the 5th Information Interaction in Context Symposium on - IIiX '14, 26 August 2014 - 30 August 2014.
- Implementing Recommendations in the PATHS System (pp 169-173)
- A Comparison of Approaches for Measuring Cross-Lingual Similarity of Wikipedia Articles (pp 424-429)
- Supporting Information Access and Sensemaking in Digital Cultural Heritage Environments (pp 143-154)
- Exploration, navigation and retrieval of information in cultural heritage. Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval - SIGIR '13, 28 July 2013 - 1 August 2013.
- PATHS: A System for Accessing Cultural Heritage Collections.. ACL (Conference System Demonstrations) (pp 151-156)
- Shefleld submission to the CHiC interactive task: Exploring digital cultural heritage. CEUR Workshop Proceedings, Vol. 1179
- Search or browse? Casual information access to a cultural heritage collection. CEUR Workshop Proceedings, Vol. 1033 (pp 19-22)
- Information seeking in digital cultural heritage with PATHS. Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval - SIGIR '13, 28 July 2013 - 1 August 2013.
- Exploring Large Digital Library Collections Using a Map-Based Visualisation (pp 216-227)
- Regional Effects on Query Reformulation Patterns (pp 382-385)
- Selecting success criteria: Experiences with an academic library catalogue. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 8138 LNCS (pp 59-70)
- PAN@FIRE: Overview of the Cross-Language !ndian Text Re-Use Detection Competition (pp 59-70)
- Evaluating the Use of Clustering for Automatically Organising Digital Library Collections (pp 323-334)
- PATHS – Exploring Digital Cultural Heritage Spaces (pp 500-503)
- View this article in WRRO Correlation between Similarity Measures for Inter-Language Linked Wikipedia Articles. Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC 2012), Istanbul, Turkey.. Istanbul, Turkey, 21 May 2012 - 27 May 2012.
- The Sheffield and Basque Country universities entry to CHiC: Using random walks and similarity to access cultural heritage. CEUR Workshop Proceedings, Vol. 1178
- Collecting and using comparable corpora for statistical machine translation. Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012 (pp 438-445)
- User-centred design to support exploration and path creation in cultural heritage collections. CEUR Workshop Proceedings, Vol. 909 (pp 75-78)
- PATHS: Personalising access to cultural heritage spaces. 2012 18th International Conference on Virtual Systems and Multimedia, 2 September 2012 - 5 September 2012.
- Enabling the discovery of digital cultural heritage objects through wikipedia. Proceedings of the 6th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities, LaTeCH 2012 at the 13th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2012 (pp 94-100)
- Detecting text reuse with modified and weighted N-grams. *SEM 2012 - 1st Joint Conference on Lexical and Computational Semantics, Vol. 1 (pp 54-58)
- Overview of the TREC 2011 session track. NIST Special Publication
- Searching for Islamic and Qur'anic Information on the Web: A Mixed-Methods Approach.. 7th Asia Information Retrieval Societies Conference, Vol. 7097 (pp 181-192)
- Advances in Information Retrieval - 33rd European Conference on IR Research, ECIR 2011, Dublin, Ireland, April 18-21, 2011. Proceedings. ECIR, Vol. 6611
- View this article in WRRO External Plagiarism Detection using Information Retrieval and Sequence Alignment - Notebook for PAN at CLEF 2011.. CLEF (Notebook Papers/Labs/Workshop), Vol. 1177
- Introduction to the CLEF 2011 Labs.. CLEF (Notebook Papers/Labs/Workshop), Vol. 1177
- Quantitative analysis of individual differences in note-taking and talking behavior in meetings. Proceedings of the IADIS International Conference ICT, Society and Human Beings 2010, Part of the IADIS Multi Conference on Computer Science and Information Systems 2010, MCCSIS 2010 (pp 172-176)
- Overview of the TREC 2010 session track. NIST Special Publication
- Images and perceptions of neighbourhood extents.. Proceedings of the 6th Workshop on Geographic Information Retrieval
- View this article in WRRO University of Sheffield: Lab Report for PAN at CLEF 2010. Proceedings of the 4th International Workshop on Uncovering Plagiarism, Authorship, and Social Software Misuse
- Proceedings of the 6th Workshop on Geographic Information Retrieval, GIR'10: Foreword. Proceedings of the 6th Workshop on Geographic Information Retrieval, GIR'10
- Do user preferences and evaluation measures line up?. SIGIR 2010 Proceedings - 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp 555-562) View this article in WRRO
- Overview of iCLEF 2009: Exploring Search Behaviour in a Multilingual Folksonomy Environment.. 10th Workshop of the Cross-Language Evaluation Forum, Vol. 6242 (pp 13-20) View this article in WRRO
- View this article in WRRO Diversity in Photo Retrieval: Overview of the ImageCLEFPhoto Task 2009.. CLEF (2), Vol. 6242 (pp 45-59)
- View this article in WRRO Extending Domain-Specific Resources to Enable Semantic Access to Cultural Heritage Data.. Journal of Digital Information, Vol. 10(6)
- Multiple approaches to analysing query diversity.. Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval (pp 734-735) View this article in WRRO
- What else is there? Search diversity examined. Proceedings of the European Conference on Information Retrieval (ECIR'09), Vol. 5478 (pp 562-569)
- Building a diversity featured search system by fusing existing tools. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 5706 LNCS (pp 560-567) View this article in WRRO
- Building a diversity featured search system by fusing existing tools. CEUR Workshop Proceedings, Vol. 1174
- Overview of iCLEF 2008: Search Log Analysis for Multilingual Image Retrieval.. 9th Workshop of the Cross-Language Evaluation Forum, Vol. 5706 (pp 227-235) View this article in WRRO
- Mapping geographic coverage of the web.. 16th ACM SIGSPATIAL International Symposium on Advances in Geographic Information Systems (pp 19-19)
- Relevance judgments between TREC and Non-TREC assessors.. Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp 683-684) View this article in WRRO
- The good and the bad system: does the test collection predict users' effectiveness?. Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp 59-66) View this article in WRRO
- Multimodal indexing of digital audio-visual documents: A case study for cultural heritage data. International Workshop on Content-Based Multimedia Indexing (pp 93-100) View this article in WRRO
- Exploring the Effects of Language Skills on Multilingual Web Search.. Proceedings of the 30th European Conference on IR Research, Vol. 4956 (pp 126-137) View this article in WRRO
- Key design issues with visualising images using Google Earth. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 4956 LNCS (pp 570-574) View this article in WRRO
- Overview of the ImageCLEFphoto 2008 Photographic Retrieval Task.. 9th Workshop of the Cross-Language Evaluation Forum, Vol. 5706 (pp 500-511) View this article in WRRO
- PAN@FIRE. Proceedings of the 5th 2013 Forum on Information Retrieval Evaluation - FIRE '13, 4 December 2013 - 6 December 2013.
- Multilingual needs of cultural heritage Web site visitors: A case study of tate online. ICHIM07 - International Cultural Heritage Informatics Meeting, Proceedings
- The CLEF 2005 Automatic Medical Image Annotation Task.. Int. J. Comput. Vis., Vol. 74 (pp 51-58)
- Large-scale evaluation of cross-language image retrieval systems. Bulletin of the American Society for Information Science and Technology, Vol. 33(3)
- Overview of the ImageCLEFphoto 2007 photographic retrieval task. CEUR Workshop Proceedings, Vol. 1173
- Overview of the ImageCLEFphoto 2007 Photographic Retrieval Task.. 8th Workshop of the Cross-Language Evaluation Forum, Vol. 5152 (pp 433-444) View this article in WRRO
- View this article in WRRO Know the Right People? Recommender Systems for Web 2.0.. LWA 2007 (pp 330-337)
- Overview of the ImageCLEFmed 2006 medical retrieval and medical annotation tasks. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 4730 LNCS (pp 595-608) View this article in WRRO
- Visualising the south Yorkshire floods of '07.. Proceedings of the 4th ACM Workshop On Geographic Information Retrieval (pp 93-94) View this article in WRRO
- The relationship between IR effectiveness measures and user satisfaction.. Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp 773-774) View this article in WRRO
- Geo-tagging for imprecise regions of different sizes.. Proceedings of the 4th ACM Workshop on Geographical Information Retrieval (pp 77-82) View this article in WRRO
- View this article in WRRO Web-based delineation of imprecise regions.. Comput. Environ. Urban Syst., Vol. 30 (pp 436-459)
- Using heterogeneous annotation and visual information for the benchmarking of image retrieval systems. Internet Imaging VII
- ICLEF 2006 overview: Searching the Flickr WWW photo-sharing repository. CEUR Workshop Proceedings, Vol. 1172
- Providing multilingual access to FLICKR for Arabic users. CEUR Workshop Proceedings, Vol. 1172
- Overview of the ImageCLEF 2006 photographic retrieval and object annotation tasks. CEUR Workshop Proceedings, Vol. 1172
- View this article in WRRO Variation of Relevance Assessments for Medical Image Retrieval.. Adaptive Multimedia Retrieval, Vol. 4398 (pp 232-246)
- iCLEF 2006 Overview: Searching the Flickr WWW Photo-Sharing Repository.. Proceedings of the 7th Workshop of the Cross-Language Evaluation Forum, Vol. 4730 (pp 186-194) View this article in WRRO
- View this article in WRRO Users' Effectiveness and Satisfaction for Image Retrieval.. LWA, Vol. 1/2006 (pp 84-88)
- View this article in WRRO The Eurovision St Andrews collection of photographs.. SIGIR Forum, Vol. 40 (pp 21-30)
- Providing Multilingual Access to FLICKR for Arabic Users.. 7th Workshop of the Cross-Language Evaluation Forum, Vol. 4730 (pp 205-216) View this article in WRRO
- Overview of the ImageCLEF 2006 Photographic Retrieval and Object Annotation Tasks.. 7th Workshop of the Cross-Language Evaluation Forum, Vol. 4730 (pp 579-594) View this article in WRRO
- Judging the Spatial Relevance of Documents for GIR.. Proceedings of the 28th European Conference on IR Research, Vol. 3936 (pp 548-552) View this article in WRRO
- Reading between the lines: Attitudinal expressions in text. AAAI Spring Symposium - Technical Report, Vol. SS-04-07 (pp 82-85)
- Overview of the CLEF 2005 interactive track. CEUR Workshop Proceedings, Vol. 1171
- Towards a topic complexity measure for cross-language image retrieval. CEUR Workshop Proceedings, Vol. 1171
- Concept hierarchy across languages in text-based image retrieval: A user evaluation. CEUR Workshop Proceedings, Vol. 1171
- View this article in WRRO Evaluation axes for medical image retrieval systems: the imageCLEF experience.. ACM Multimedia (pp 1014-1022)
- Extracting metadata for spatially-aware information retrieval on the internet.. Proceedings of the 2005 Workshop On Geographic Information Retrieval (pp 25-30) View this article in WRRO
- Linguistic Estimation of Topic Difficulty in Cross-Language Image Retrieval.. 6th Workshop of the Cross-Language Evalution Forum, Vol. 4022 (pp 558-566)
- The CLEF 2005 Cross-Language Image Retrieval Track.. 6th Workshop of the Cross-Language Evalution Forum, Vol. 4022 (pp 535-557) View this article in WRRO
- GeoCLEF: The CLEF 2005 Cross-Language Geographic Information Retrieval Track Overview.. Proceedings of the 6th Workshop of the Cross-Language Evalution Forum, Vol. 4022 (pp 908-919) View this article in WRRO
- GeoCLEF: The CLEF 2005 cross-language geographic information retrieval track overview. CEUR Workshop Proceedings, Vol. 1171
- The CLEF 2005 cross-language image retrieval track. CEUR Workshop Proceedings, Vol. 1171
- Caption vs. Query translation for cross-language image retrieval. CEUR Workshop Proceedings, Vol. 1170
- The CLEF cross language image retrieval track (ImageCLEF) 2004. CEUR Workshop Proceedings, Vol. 1170
- View this article in WRRO EuroWordNet as a Resource for Cross-language Information Retrieval. Proceedings of the Fourth International Conference on Language Resources and Evaluation (pp 777-780). Lisbon, Portugal
- Measuring pseudo relevance feedback & CLIR.. Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp 484-485) View this article in WRRO
- Caption and Query Translation for Cross-Language Image Retrieval.. 5th Workshop of the Cross-Language Evaluation Forum, Vol. 3491 (pp 614-625)
- The CLEF Cross Language Image Retrieval Track (ImageCLEF) 2004.. Conference on Image and Video Retrieval, Vol. 3115 (pp 243-251) View this article in WRRO
- Relevance Feedback for Cross Language Image Retrieval.. 26th European Conference on IR Research, Vol. 2997 (pp 238-252) View this article in WRRO
- Measuring a Cross Language Image Retrieval System.. 26th European Conference on IR Research, Vol. 2997 (pp 353-363) View this article in WRRO
- Cross-language information retrieval using EuroWordNet and word sense disambiguation. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 2997 (pp 327-337)
- The CLEF 2004 Cross-Language Image Retrieval Track.. 5th Workshop of the Cross-Language Evaluation Forum, Vol. 3491 (pp 597-613) View this article in WRRO
- Sheffield at Image CLEF 2003. CEUR Workshop Proceedings, Vol. 1169
- The CLEF 2003 cross language image retrieval task. CEUR Workshop Proceedings, Vol. 1169
- View this article in WRRO Assessing the effectiveness of pen-based input queries.. SIGIR (pp 437-438)
- View this article in WRRO Evaluating the Contribution of EuroWordNet and Word Sense Disambiguation to Cross-language Information Retrieval. Proceedings of GWC 2004: The Second Global Wordnet Conference (pp 97-105)
- The CLEF 2003 Cross Language Image Retrieval Track.. 4th Workshop of the Cross-Language Evaluation Forum, Vol. 3237 (pp 581-593) View this article in WRRO
- Assessing Translation Quality for Cross Language Image Retrieval.. Comparative Evaluation of Multilingual Information Access Systems, Vol. 3237 (pp 594-610) View this article in WRRO
- View this article in WRRO Building and annotating a corpus for the study of journalistic text reuse.. LREC
- View this article in WRRO METER: MEasuring TExt Reuse. Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (pp 152-159)
- View this article in WRRO Bletchley Park: an untold story in Information Science. Trends in Information Science. Bletchley Park: an untold story in Information Science.
- ImageCLEF and ImageCLEFmed: Toward standard test collections for image storage and retrieval research. Proceedings of the American Society for Information Science and Technology, Vol. 43(1) (pp 1-6)
- Clustering and Classifying Users from the National Museums Liverpool Website. TPDL 2021, 25th International Conference on Theory and Practice of Digital Libraries, Vol. LNCS 12866 (pp 1-13). Virtual conference, 13 September 2021 - 17 September 2021. View this article in WRRO
Datasets
Preprints
- Teaching interests
-
I enjoy my role in helping students learn about topics related to data and information management at both undergraduate and postgraduate levels. Currently I coordinate modules in the Information School on the topics of Information Retrieval and Information Systems in Healthcare. I am also developing the new MSc Data Science programme that will launch 2014 and for which I will be overall coordinator.
In 2010 I became coordinator of the Information Retrieval module and revised its content and methods of assessment. Martin White (Intranet Focus and visiting Professor in the Information School) mentioned the module in his O’Reilly book on Enterprise Search as an example of the type of training teams supporting enterprise search should receive. I also deliver lectures on several other courses in the Information School, including Database Design, Digital Multimedia and Business Intelligence.
I have been invited as guest lecturer on several occasions to external organisations, including the 2013 European Summer School in Information Retrieval, the TrebleCLEF cross language search summer school in Pisa in 2009 and Universidad Nacional de Educación a Distancia in 2008. In 2008 I completed a Postgraduate Certificate in Higher Education (PGCHE) and since 2010 have been a Fellow of the UK Higher Education Academy.
- Teaching activities
-
INF110 - Data Science Foundations and Contexts
INF113 - Data Driven Organisations
INF214 - Using Data for Responsible Decision Making
INF6027 - Introduction to Data Science