Dr Mark Stevenson

School of Computer Science

Senior Lecturer

Member of the Natural Language Processing research group

Mark Stevenson heashot
Profile picture of Mark Stevenson heashot
mark.stevenson@sheffield.ac.uk
+44 114 222 1921

Full contact details

Dr Mark Stevenson
School of Computer Science
Regent Court (DCS)
211 Portobello
Sheffield
S1 4DP
Profile

Mark Stevenson is a Senior Lecturer in Computer Science. He is a member of the Natural Language Processing group which he joined in 1995. His PhD, on Word Sense Disambiguation, was published as a monograph.

He has been Principal Investigator of projects funded by a range of sources including the EU, EPSRC and Google. He was an EPSRC Advanced Research Fellow (2006-2011) and co-ordinator of the EU-funded project PATHS.

He has also worked in a range of commercial and academic organisations including Reuters Ltd (where he was involved in the production and dissemination of the widely used Reuters Corpus), Adastral Park (British Telecom’s research lab) and the Center for the Study of Language and Information, Stanford University.

Research interests

Mark Stevenson’s research focusses on Natural Language Processing and Information Retrieval. Topics he has worked on include word sense disambiguation, Information Extraction, plagiarism/reuse detection, lexicon adaptation, cross-lingual information retrieval and exploratory search.

His research includes applications of these technologies to a range of areas including biomedical journal articles (interpretation of documents, extraction of information from them and data mining information from corpora), cultural heritage (automatic organisation of corpora, exploratory search interfaces) and software testing (generation of realistic test suites).

Publications

Books

  • (2007) Words and Intelligence I: Selected Papers by Yorick Wilks. Springer. RIS download Bibtex download
  • (2007) Words and Intelligence II: Essays in Honour of Yorick Wilks. Springer. RIS download Bibtex download
  • Stevenson M (2003) Word Sense Disambiguation: The Case for Combinations of Knowledge Sources. Stanford, CA.: CSLI Publications. RIS download Bibtex download

Journal articles

Chapters

  • Clough P, Hall M, Goodale P & Stevenson (2015) Supporting Exploration and Use of Digital Cultural Heritage Materials: the PATHS Perspective In Ruthven I & Chowdhury GG (Ed.), Cultural Heritage Information Access and Management (pp. 197-220). Facet Publishing View this article in WRRO RIS download Bibtex download
  • Stevenson M & Agirre E (2010) Word Sense Disambiguation In Mitkov R (Ed.), Oxford Handbook of Computational Linguistics Oxford University Press RIS download Bibtex download
  • Rose T & Stevenson M (2009) Natural Language Processing and Information Retrieval In Davies J, Göker A & Graham M (Ed.), Information Retrieval: Searching in the 21st Century (pp. 215-232-215-232). Wiley RIS download Bibtex download
  • Rayson P & Stevenson M (2008) Sense Tagging In Ludeling A, Kyto M & McEnery T (Ed.), Handbook of Corpus Linguistics Mouton de Gruyter RIS download Bibtex download
  • Agirre E & Stevenson M (2007) Knowledge Sources for WSD, Text, Speech and Language Technology (pp. 217-251). Springer Netherlands RIS download Bibtex download
  • Ahmad K, Brewster C & Stevenson M (2007) Words and Intelligence II Essays in Honor of Yorick Wilks Introduction, WORDS AND INTELLIGENCE II: ESSAYS IN HONOR OF YORICK WILKS (pp. XI-XIV). RIS download Bibtex download
  • Agirre E & Stevenson M (2006) Knowledge Sources for WSD, WORD SENSE DISAMBIGUATION: ALGORITHMS AND APPLICATIONS (pp. 217-251). RIS download Bibtex download
  • Agirre E & Stevenson M (2005) Knowledge Sources for Word Sense Disambiguation In Agirre E & Edmonds P (Ed.), Word Sense Disambiguation: Algorithms, Applications and Trends Kluwer RIS download Bibtex download
  • Stevenson M & Wilks Y (2003) Word Sense Disambiguation In Mitkov R (Ed.), Oxford Handbook of Computational Linguistics (pp. 249-265-249-265). Oxford University Press RIS download Bibtex download
  • Wilks Y & Stevenson M (2000) Combining Independent Knowledge Sources for Word Sense Disambiguation In Nicolov N & Mitkov R (Ed.), Recent Advances in Natural Language Processing (pp. 74-86-74-86). John Benjamins Publishers RIS download Bibtex download
  • Stevenson M & Wilks Y (1999) Large Vocabulary Word Sense Disambiguation In Ravin Y & Leacock C (Ed.), Polysemy: Theoretical and Computational Contributions (pp. 161-177-161-177). Oxford: Oxford University Press. RIS download Bibtex download

Conference proceedings papers

  • Stevenson R & Bin-Hezam R (2024) Stopping Methods Based on Point Processes: Recent Developments. Proceedings of the 3rd Workshop on Augmented Intelligence for Technology-Assisted Reviews Systems (ALTARS 2024) co-located with the 46th European Conference on Information Retrieval (ECIR 2024) RIS download Bibtex download
  • Bin Hezam R & Stevenson R (2024) RLStop: A Reinforcement Learning Stopping Method for TAR. Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval RIS download Bibtex download
  • Peng X, Lin C & Stevenson R (2021) Cross-Lingual Word Embedding Refinement by ℓ1 Norm Optimisation. Proceedings of the 2021 Annual Conference of the North American Chapter of the Association for Computational Linguistics RIS download Bibtex download
  • Maronikolakis A, Schutze H & Stevenson R (2021) Identifying Automatically Generated Headlines using Transformers. Proceedings of the Fourth Workshop on NLP for Internet Freedom: Censorship, Disinformation, and Propaganda RIS download Bibtex download
  • Zhang H, Ganchev I, Nikolov NS & Stevenson M (2021) UserReg : a simple but strong model for rating prediction. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing. Toronto, ON, Canada, 6 June 2021 - 11 June 2021. RIS download Bibtex download
  • Peng X, Chen G, Lin C & Stevenson M (2021) Highly Efficient Knowledge Graph Embedding Learning with Orthogonal Procrustes Analysis. NAACL-HLT 2021 - 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference (pp 2364-2375) RIS download Bibtex download
  • Zhang H, Sneyd A & Stevenson R (2020) Robustness and Reliability of Gender Bias Assessment in Word Embeddings: The Role of Base Pairs. Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing RIS download Bibtex download
  • Paik H, Yoo S, Nam H, Stevenson M & No A (2020) DTMBIO 2020. Proceedings of the 29th ACM International Conference on Information & Knowledge Management RIS download Bibtex download
  • Soares F, Stevenson M, Bartolome D & Zaretskaya A (2020) ParaPat: The multi-million sentences parallel corpus of patents abstracts. LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings (pp 3769-3774) RIS download Bibtex download
  • Alokaili A, Aletras N & Stevenson M (2020) Automatic Generation of Topic Labels.. SIGIR (pp 1965-1968) RIS download Bibtex download
  • McDonald T, Dong ZQ, Zhang Y, Hampson R, Young J, Cao Q, Leidner JL & Stevenson M (2020) The University of Sheffield at CheckThat! 2020: Claim Identification and Verification on Twitter. CEUR Workshop Proceedings, Vol. 2696 RIS download Bibtex download
  • Alokaili A, Aletras N & Stevenson M (2020) Automatic Generation of Topic Labels. Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval View this article in WRRO RIS download Bibtex download
  • Sneyd A & Stevenson M (2020) Modelling stopping criteria for search results using poisson processes. EMNLP-IJCNLP 2019 - 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing, Proceedings of the Conference (pp 3484-3489) View this article in WRRO RIS download Bibtex download
  • Alharbi A & Stevenson R (2019) Improving ranking for systematic reviews using query adaptation. CLEF 2019 Proceedings : Experimental IR Meets Multilinguality, Multimodality, and Interaction (pp 141-148). Lugarno, Switzerland, 9 September 2019 - 12 September 2019. View this article in WRRO RIS download Bibtex download
  • Alharbi A & Stevenson M (2019) Ranking studies for systematic reviews using query adaptation : University of Sheffield's approach to CLEF eHealth 2019 task 2 working notes for CLEF 2019. Working Notes of CLEF 2019 - Conference and Labs of the Evaluation Forum, Vol. 2380. Lugano, Switzerland, 9 September 2019 - 12 September 2019. View this article in WRRO RIS download Bibtex download
  • Alharbi A & Stevenson R (2019) A Dataset of Systematic Reviews Updates. The 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval (pp 1257-1260), 21 July 2019 - 25 July 2019. View this article in WRRO RIS download Bibtex download
  • Roller R, Vashisth G, Thomas P, Wang H, Mikhailov M & Stevenson M (2019) Graph-KD: Exploring relational information for knowledge discovery. CEUR Workshop Proceedings, Vol. 2456 (pp 257-260) RIS download Bibtex download
  • Alharbi A, Briggs W & Stevenson M (2018) Retrieving and ranking studies for systematic reviews: University of Sheffield's approach to CLEF eHealth 2018 Task 2. CEUR Workshop Proceedings, Vol. 2125 View this article in WRRO RIS download Bibtex download
  • Sari Y, Stevenson M & Vlachos A (2018) Topic or Style? Exploring the Most Useful Features for Authorship Attribution.. COLING (pp 343-353) RIS download Bibtex download
  • Alharbi A & Stevenson M (2017) Ranking abstracts to identify relevant evidence for systematic reviews: The University of Sheffield's approach to CLEF eHealth 2017 Task 2: Working notes for CLEF 2017. CEUR Workshop Proceedings, Vol. 1866 View this article in WRRO RIS download Bibtex download
  • Poulston A, Waseem Z & Stevenson M (2017) Using TF-IDF n-gram and word embedding cluster ensembles for author profiling: Notebook for PAN at CLEF 2017. CEUR Workshop Proceedings, Vol. 1866 View this article in WRRO RIS download Bibtex download
  • Poulston A, Stevenson M & Bontcheva K (2017) Hyperlocal home location identification of Twitter profiles. HT 2017 - Proceedings of the 28th ACM Conference on Hypertext and Social Media (pp 45-54) View this article in WRRO RIS download Bibtex download
  • Sari Y, Vlachos A & Stevenson RM (2017) Continuous N-gram Representations for Authorship Attribution. Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers (pp 267-273). Valencia, Spain, 3 April 2017 - 7 April 2017. View this article in WRRO RIS download Bibtex download
  • Alvi F, Stevenson M & Clough P (2017) Plagiarism Detection in Texts Obfuscated with Homoglyphs (pp 669-675) View this article in WRRO RIS download Bibtex download
  • Paisley S, Seva J, Stevenson M, Archer R, Preston L, Chilcott J & Thornhill M (2016) Identifying Potential Early Biomarkers Of Acute Myocardial Infarction In The Biomedical Literature: A Comparison Of Text Mining And Manual Sifting Techniques. Value in Health, Vol. 19(7) (pp A367-A367) RIS download Bibtex download
  • Poulston A, Stevenson M & Bontcheva K (2016) User profiling with geo-located posts and demographic data. Proceedings of the First Workshop on NLP and Computational Social Science, November 2016 - November 2016. RIS download Bibtex download
  • Sari Y & Stevenson M (2016) ExploringWord embeddings and character N-Grams for author clustering. CEUR Workshop Proceedings, Vol. 1609 (pp 984-991) RIS download Bibtex download
  • Aletras N, Lau JH, Baldwin T & Stevenson M (2015) TM 2015 -- Topic Models. Proceedings of the 24th ACM International on Conference on Information and Knowledge Management - CIKM '15, 18 October 2015 - 23 October 2015. RIS download Bibtex download
  • Roller R, Agirre E, Soroa A & Stevenson M (2015) Improving distant supervision using inference learning. Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), July 2015 - July 2015. View this article in WRRO RIS download Bibtex download
  • Aletras N & Stevenson M (2015) A Hybrid Distributional and Knowledge-based Model of Lexical Semantics. Proceedings of the Fourth Joint Conference on Lexical and Computational Semantics, June 2015 - June 2015. RIS download Bibtex download
  • Sari Y & Stevenson M (2015) A machine learning-based intrinsic method for cross-topic and cross-genre authorship verification notebook for PAN at CLEF 2015. CEUR Workshop Proceedings, Vol. 1391 RIS download Bibtex download
  • Poulston A, Stevenson M & Bontcheva K (2015) Topic models and n-gram language models for author profiling. CEUR Workshop Proceedings, Vol. 1391 RIS download Bibtex download
  • Alvi F, Stevenson M & Clough P (2015) The short stories corpus. CEUR Workshop Proceedings, Vol. 1391 RIS download Bibtex download
  • Poulston A, Stevenson M & Bontcheva K (2015) Topic models and n-gram language models for author profiling. CEUR Workshop Proceedings, Vol. 1391 RIS download Bibtex download
  • Alvi F, Stevenson M & Clough P (2015) The short stories corpus. CEUR Workshop Proceedings, Vol. 1391 RIS download Bibtex download
  • Alamri A & Stevensony M (2015) Automatic identification of potentially contradictory claims to support systematic reviews. 2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 9 November 2015 - 12 November 2015. RIS download Bibtex download
  • Alvi F, Stevenson M & Clough PD (2015) The Short Stories Corpus: Notebook for PAN at CLEF 2015.. CLEF (Working Notes), Vol. 1391 RIS download Bibtex download
  • Poulston A, Stevenson M & Bontcheva K (2015) Topic Models and n-gram Language Models for Author Profiling - Notebook for PAN at CLEF 2015.. CLEF (Working Notes), Vol. 1391 RIS download Bibtex download
  • Sari Y & Stevenson M (2015) A machine learning-based intrinsic method for cross-topic and cross-genre authorship verification notebook for PAN at CLEF 2015. CEUR Workshop Proceedings, Vol. 1391 RIS download Bibtex download
  • Roller R & Stevenson M (2015) Making the most of limited training data using distant supervision. ACL-IJCNLP 2015 - BioNLP 2015: Workshop on Biomedical Natural Language Processing, Proceedings of the Workshop (pp 12-20) RIS download Bibtex download
  • Roller R & Stevenson M (2015) Held-out versus Gold Standard: Comparison of Evaluation Strategies for Distantly Supervised Relation Extraction from Medline abstracts. EMNLP 2015 - 6th International Workshop on Health Text Mining and Information Analysis, LOUHI 2015 - Proceedings of the Workshop (pp 97-102) View this article in WRRO RIS download Bibtex download
  • Alamri A & Stevenson M (2015) Automatic Detection of Answers to Research Questions from Medline Abstracts. ACL-IJCNLP 2015 - BioNLP 2015: Workshop on Biomedical Natural Language Processing, Proceedings of the Workshop (pp 141-146) RIS download Bibtex download
  • Aletras N & Stevenson M (2014) Labelling Topics using Unsupervised Graph-based Methods. Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL 2014), Vol. 2 (pp 631-636) RIS download Bibtex download
  • Aletras N & Stevenson M (2014) Measuring the Similarity between Automatically Generated Topics. Proceedings of the 14th Conference of the European Chapter of $ the Association for Computational Linguistics (pp 22-27) RIS download Bibtex download
  • Smith J, Hall MM, Goodale P, Clough P & Stevenson R (2014) PATHS in context: User characteristics and the construction of cultural heritage narratives. iConference Proceedings 2014 View this article in WRRO RIS download Bibtex download
  • Alvi F, Stevenson M & Clough P (2014) Hashing and merging heuristics for text reuse detection: Notebook for PAN at CLEF-2014. CEUR Workshop Proceedings, Vol. 1180 (pp 939-946) RIS download Bibtex download
  • Alvi F, Stevenson M & Clough P (2014) Hashing and merging heuristics for text reuse detection: Notebook for PAN at CLEF-2014. CEUR Workshop Proceedings, Vol. 1180 (pp 939-946) RIS download Bibtex download
  • Alvi F, Stevenson M & Clough PD (2014) Hashing and Merging Heuristics for Text Reuse Detection.. CLEF (Working Notes), Vol. 1180 (pp 939-946) RIS download Bibtex download
  • Aletras N, Baldwin T, Lau JH & Stevenson M (2014) Representing topics labels for exploring digital libraries. IEEE/ACM Joint Conference on Digital Libraries, 8 September 2014 - 12 September 2014. RIS download Bibtex download
  • Goodale P, Clough P, Hall M, Stevenson M, Fernie K & Griffiths J (2014) Supporting Information Access and Sensemaking in Digital Cultural Heritage Environments (pp 143-154) RIS download Bibtex download
  • Roller R & Stevenson M (2014) Self-supervised Relation Extraction Using UMLS (pp 116-127) RIS download Bibtex download
  • Roller R & Stevenson M (2014) Applying UMLS for Distantly Supervised Relation Detection. Proceedings of the The Fifth International Workshop on Health Text Mining and Information Analysis (pp 80-84) RIS download Bibtex download
  • Agirre E, Aletras N, Clough PD, Fernando S, Goodale P, Hall MM, Soroa A & Stevenson M (2013) PATHS: A System for Accessing Cultural Heritage Collections.. ACL (Conference System Demonstrations) (pp 151-156) RIS download Bibtex download
  • Aletras N & Stevenson M (2013) Evaluating topic coherence using distributional semantics. Proceedings of the 10th International Conference on Computational Semantics, IWCS 2013 - Long Papers RIS download Bibtex download
  • Preiss J & Stevenson M (2013) Distinguishing Common and Proper Nouns. *SEM 2013 - 2nd Joint Conference on Lexical and Computational Semantics, Vol. 1 (pp 80-84) RIS download Bibtex download
  • Agirre E, Aletras N, Gonzalez-Agirre A, Rigau G & Stevenson M (2013) UBC UOS-TYPED: Regression for Typed-similarity. *SEM 2013 - 2nd Joint Conference on Lexical and Computational Semantics, Vol. 1 (pp 132-137) RIS download Bibtex download
  • Preiss J & Stevenson M (2013) Unsupervised domain tuning to improve word sense disambiguation. NAACL HLT 2013 - 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Main Conference (pp 680-684) RIS download Bibtex download
  • Aletras N & Stevenson M (2013) Representing topics using images. NAACL HLT 2013 - 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Main Conference (pp 158-167) RIS download Bibtex download
  • Hall MM, Clough PD, Fernando S, Goodale P, Stevenson M, Agirre E, Otegi A, Soroa A, Fernie K & Griffiths J (2013) Information seeking in digital cultural heritage with PATHS. SIGIR 2013 - Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval (pp 1105-1106) RIS download Bibtex download
  • Afshan S, McMinn P & Stevenson M (2013) Evolving readable string test inputs using a natural language model to reduce human oracle cost. Proceedings - IEEE 6th International Conference on Software Testing, Verification and Validation, ICST 2013 (pp 352-361) RIS download Bibtex download
  • Preiss J & Stevenson M (2013) Unsupervised Domain Tuning to Improve Word Sense Disambiguation. Proceedings of the 2nd Workshop on Computational Linguistics for Literature, CLfL 2013 at the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2013 (pp 680-684) RIS download Bibtex download
  • Agirre E, Aletras N, Gonzalez-Agirre A, Rigau G & Stevenson M (2013) UBC UOS-TYPED: Regression for Typed-similarity. SEM 2013 - 2nd Joint Conference on Lexical and Computational Semantics, Proceedings of the Main Conference and the Shared Task: Semantic Textual SimilaritySEM 2013 - 2nd Joint Conference on Lexical and Computational Semantics, Proceedings of the Main Conference and the Shared Task: Semantic Textual Similarity (pp 132-137) RIS download Bibtex download
  • Aletras N & Stevenson M (2013) Representing Topics Using Images. Proceedings of the 2nd Workshop on Computational Linguistics for Literature, CLfL 2013 at the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2013 (pp 158-167) RIS download Bibtex download
  • Preiss J & Stevenson M (2013) Distinguishing Common and Proper Nouns. SEM 2013 - 2nd Joint Conference on Lexical and Computational Semantics, Proceedings of the Main Conference and the Shared Task: Semantic Textual SimilaritySEM 2013 - 2nd Joint Conference on Lexical and Computational Semantics, Proceedings of the Main Conference and the Shared Task: Semantic Textual Similarity (pp 80-84) RIS download Bibtex download
  • Preiss J & Stevenson M (2013) DALE: A Word Sense Disambiguation System for Biomedical Documents Trained using Automatically Labeled Examples. 2013 Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2013 - Demonstration Session (pp 1-4) RIS download Bibtex download
  • Roller R & Stevenson M (2013) Identification of Genia Events using Multiple Classifiers. Proceedings of the Annual Meeting of the Association for Computational Linguistics, Vol. 2013-October (pp 125-129) RIS download Bibtex download
  • Fernando S, Goodale P, Clough P, Stevenson M, Hall M & Agirre E (2013) Generating Paths through Cultural Heritage Collections. Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp 1-10) RIS download Bibtex download
  • Barron-Cedeno A, Rosso P, Devi S, Clough P & Stevenson M (2012) PAN@FIRE: Overview of the Cross-Language !ndian Text Re-Use Detection Competition. Forum for Information Retrieval Evaluation (FIRE) Working Notes. Bombay, India RIS download Bibtex download
  • Hall M, Agirre E, Aletras N, Bergheim R, Chandrinos K, Clough P, Fernando S, Fernie K, Goodale P, Griffiths J , Lopez de Lacalle O et al (2012) PATHS - Exploring Digital Cultural Heritage Spaces. Theory and Practice of Digital Libraries 2012. Cyprus RIS download Bibtex download
  • Hall M, Clough P & Stevenson M (2012) Evaluating the use of clustering for automatically organising digital library collections. Theory and Practice of Digital Libraries 2012. Cyprus RIS download Bibtex download
  • Shahbaz M, Mcminn P & Stevenson M (2012) Automated Discovery of Valid Test Strings using Dynamic Regular Expressions Collation and Tailored Web Searches. Proceedings of the 12th International Conference on Quality Software (QSIC 2012). Xi’an, China RIS download Bibtex download
  • Cheng W, Preiss J & Stevenson M (2012) Scaling up WSD with Automatically Generated Examples. BioNLP: Proceedings of the 2012 Workshop on Biomedical Natural Language Processing (pp 231-239). Montréal, Canada RIS download Bibtex download
  • Fernando S & Stevenson M (2012) Adapting Wikification to Cultural Heritage. Proceedings of the 6th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (pp 101-106). Avignon, France RIS download Bibtex download
  • Aletras N & Stevenson M (2012) Computing Similarity between Cultural Heritage Items using Multimodal Features. Proceedings of the 6th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (pp 85-93). Avignon, France RIS download Bibtex download
  • Agirre E, Clough P, Fernando S, Hall M, Otegi A & Stevenson M (2012) The Sheffield and Basque Country universities entry to CHiC: Using random walks and similarity to access cultural heritage. CEUR Workshop Proceedings, Vol. 1178 RIS download Bibtex download
  • Goodale P, Clough P, Ford N, Hall M, Stevenson M, Fernando S, Aletras N, Fernie K, Archer P & De Polo A (2012) User-centred design to support exploration and path creation in cultural heritage collections. CEUR Workshop Proceedings, Vol. 909 (pp 75-78) RIS download Bibtex download
  • Fernie K, Griffiths J, Stevenson M, Clough P, Goodale P, Hall M, Archer P, Chandrinos K, Agirre E, De Lacalle OL , De Polo A et al (2012) PATHS: Personalising access to cultural heritage spaces. Proceedings of the 2012 18th International Conference on Virtual Systems and Multimedia, VSMM 2012: Virtual Systems in the Information Society (pp 469-474) RIS download Bibtex download
  • Shahbaz M, McMinn P & Stevenson M (2012) Automated discovery of valid test strings from the web using dynamic regular expressions collation and natural language processing. Proceedings - International Conference on Quality Software (pp 79-88) RIS download Bibtex download
  • McMinn P, Shahbaz M & Stevenson M (2012) Search-based test input generation for string data types using the results of web queries. Proceedings - IEEE 5th International Conference on Software Testing, Verification and Validation, ICST 2012 (pp 141-150) RIS download Bibtex download
  • Clough P, Ford N & Stevenson M (2011) Personalising access to cultural heritage collections using pathways. PATCH 2011 : 3rd International Workshop on Personalized Access To Cultural Heritage (pp 12-19-12-19) RIS download Bibtex download
  • Nawab RMA, Stevenson M & Clough PD (2011) External Plagiarism Detection using Information Retrieval and Sequence Alignment - Notebook for PAN at CLEF 2011.. CLEF (Notebook Papers/Labs/Workshop), Vol. 1177 View this article in WRRO RIS download Bibtex download
  • Stevenson M & Guo Y (2010) The Effect of Ambiguity on the Automated Acquisition of WSD Examples. Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics (pp 353-356-353-356). Los Angeles, California RIS download Bibtex download
  • Swampillai K & Stevenson M (2010) Inter-sentential Relations in Information Extraction Corpora. Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC-2010). Valetta, Malta RIS download Bibtex download
  • Plaza L, Stevenson M & Diaz A (2010) Improving Summarization of Biomedical Documents using Word Sense Disambiguation. Proceedings of the 2010 Workshop on Biomedical Natural Language Processing (pp 55-63-55-63). Uppsala, Sweden RIS download Bibtex download
  • Fernando S & Stevenson M (2010) Aligning WordNet Synsets and Wikipedia Articles. Proceedings of the AAAI-2010 Workshop on Collaboratively-built Knowledge Sources and Artificial Intelligence. Atlanta, Georgia RIS download Bibtex download
  • Reddy S, Inumella A, McCarthy D & Stevenson M (2010) IIITH: Domain Specific Word Sense Disambiguation. Proceedings of the 5th International Workshop on Semantic Evaluation. Uppsala, Sweden RIS download Bibtex download
  • Nawab R, Stevenson M & Clough P (2010) University of Sheffield: Lab Report for PAN at CLEF 2010. Proceedings of the 4th International Workshop on Uncovering Plagiarism, Authorship, and Social Software Misuse View this article in WRRO RIS download Bibtex download
  • McMinn P, Stevenson M & Harman M (2010) Reducing qualitative human oracle costs associated with automatically generated test data. 1st International Workshop on Software Test Output Validation, STOV 2010, in Conjunction with the 2010 International Conference on Software Testing and Analysis, ISSTA 2010 (pp 1-4) RIS download Bibtex download
  • Stevenson M, Guo Y, Alamri A & Gaizauskas R (2009) Disambiguation of Biomedical Abbreviations. Proceedings of the BioNLP 2009 Workshop (pp 71-79). Boulder, Colorado RIS download Bibtex download
  • Stevenson M, Alamri A, Guo Y & Gaizauskas R (2009) A Corpus of Biomedical Abbreviations. Proceedings of Corpus Linguistics 2009. Liverpool, UK RIS download Bibtex download
  • Clough P & Stevenson M (2009) Designing a Corpus of Plagiarised Academic Texts. Proceedings of Corpus Linguistics 2009. Liverpool, UK RIS download Bibtex download
  • Stevenson M, Guo Y, Gaizauskas R & Martinez D (2008) Knowledge sources for word sense disambiguation of biomedical text. Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing - BioNLP '08, 19 June 2008 - 19 June 2008. RIS download Bibtex download
  • Stevenson M, Guo Y, Gaizauskas R & Martinez D (2008) Knolwedge Sources for Word Sense Disambiguation of Biomedical Text. Proceedings of the workshop “BioNLP 2008" held in conjunction with the 46th Annual Meeting of the Association for Computational Linguistics (pp 80-87-80-87). Columbus, OH. RIS download Bibtex download
  • Stevenson M, Guo Y & Gaizauskas R (2008) Acquiring Sense Tagged Examples using Relevance Feedback. Proceedings of the 22nd International Conference on Computational Linguistics (COLING-08). Manchester, UK RIS download Bibtex download
  • Fernando S & Stevenson M (2008) A Semantic Approach to Paraphrase Identification. Proceedings of the 11th Annual Research Colloquium of the UK Special-interest group for Computational Lingusitics. Oxford, England RIS download Bibtex download
  • Gupta P, Clough P, Rosso P, Stevenson M & Banchs RE (2007) PAN@FIRE. Proceedings of the 5th 2013 Forum on Information Retrieval Evaluation - FIRE '13, 4 December 2013 - 6 December 2013. RIS download Bibtex download
  • Greenwood M & Stevenson M (2007) A Semi-supervised Approach to Learning Relevant Protein-Protein Interaction Articles. Proceedings of BioCreative II workshop (pp 175-177-175-177). Madrid, Spain RIS download Bibtex download
  • Greenwood M & Stevenson M (2007) A Task-based Comparison of Information Extraction Pattern Models. Proceedings of the Workshop “Deep Linguistic Processing” held in conjunction with the 45th Annual Meeting of the Association for Computational Linguistics (pp 81-88) RIS download Bibtex download
  • Specia L, Stevenson M & Nunes MGV (2007) Learning Expressive Models for Word Sense Disambiguation. 45th Annual Meeting of the Association of Computational Linguistics (pp 41-48) RIS download Bibtex download
  • Greenwood M & Stevenson M (2006) Improving Semi-supervised Acquisition of Relation Extraction Patterns. Proceedings of the Workshop “Information Extraction Beyond The Document” held in conjunction with 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics (pp 29-35-29-35). Sydney, Australia RIS download Bibtex download
  • Stevenson M & Greenwood M (2006) Comparing Information Extraction Pattern Models. Proceedings of the Workshop “Information Extraction Beyond The Document” held in conjunction with 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics (pp 12-19-12-19). Sydney, Australia RIS download Bibtex download
  • Specia L, Nunes M, Stevenson M & Ribeiro G (2006) Multilingual versus Monolingual WSD. Proceedings of the workshop "Making Sense of Sense" held in conjunction with the Eleventh Conference of the European Chapter of the Association for Computational Lingusitics (pp 33-40-33-40). Trento, Italy RIS download Bibtex download
  • Specia L, Nunes M & Stevenson M (2006) Translation Context Sensitive WSD. Proceedings of the European Association for Machine Transaltion 11th Annual Conference (EAMT-2006) (pp 227-232-227-232). Oslo, Norway RIS download Bibtex download
  • Specia L, Ribeiro GCB, Nunes MDV & Stevenson M (2006) The need for application-dependent WSD strategies: A case study in NIT. COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROCEEDINGS, Vol. 3960 (pp 233-237) RIS download Bibtex download
  • Greenwood MA, Stevenson M & Gaizauskas RJ (2006) The University of Sheffield's TREC 2006 Q&A Experiments.. TREC, Vol. 500-272 RIS download Bibtex download
  • Specia L, Nunes M & Stevenson M (2005) Mining Rules for Word Sense Disambiguation. III TIL - Workshop em Tecnologia da Informacao e da Linguagem Humana, XXV Congresso da SBC. Sao Leopoldo, Brasil RIS download Bibtex download
  • Specia L, Neto S, Nunes M & Stevenson M (2005) An Automatic Approach to Creating a Sense Tagged Corpus for Word Sense Disambiguation in Machine Translation. Second Workshop Organised by the MEANING project (MEANING-2005) (pp 31-36). Trento, Italy RIS download Bibtex download
  • Greenwood M, Stevenson M, Guo Y, Harkema H & Roberts A (2005) Automatically Acquiring a Linguistically Motivated Genic Interaction Extraction System. Proceedings of the workshop “Learning Language in Logic (LLL 05)” held in conjunction the 22nd International Conference on Machine Learning (ICML 05). Bonn, Germany RIS download Bibtex download
  • Specia L, Nunes MGV & Stevenson M (2005) Exploiting Parallel Texts to Produce a Multilingual Sense Tagged Corpus for Word Sense Disambiguation. Recent Advances in Natural Language Processing (pp 525-531) RIS download Bibtex download
  • Stevenson M & Greenwood M (2005) A Semantic Approach to IE Pattern Induction. Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (pp 379-386-379-386). Ann Arbour, MI RIS download Bibtex download
  • Stevenson M & Greenwood MA (2005) Learning Information Extraction Patterns Using WordNet. GWC 2006: THIRD INTERNATIONAL WORDNET CONFERENCE, PROCEEDINGS (pp 95-102) RIS download Bibtex download
  • Stevenson M (2004) An Unsupervised WordNet-based Algorithm for Relation Extraction. Proceedings of the “Beyond Named Entity” workshop at the Fourth International Conference on Language Resources and Evalutaion (LREC-04) (pp 37-42-37-42). Lisbon, Portugal RIS download Bibtex download
  • Stevenson M & Clough P (2004) EuroWordNet as a Resource for Cross-language Information Retrieval. Proceedings of the Fourth International Conference on Language Resources and Evaluation (pp 777-780). Lisbon, Portugal View this article in WRRO RIS download Bibtex download
  • Stevenson M (2004) Information Extraction from Single and Multiple Sentences. Proceedings of the Twentieth International Conference on Computational Linguistics (COLING-04) (pp 875-881-875-881). Geneva, Switzerland RIS download Bibtex download
  • Clough P & Stevenson M (2004) Cross-language information retrieval using EuroWordNet and word sense disambiguation. ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, Vol. 2997 (pp 327-337) RIS download Bibtex download
  • Cimiano P, Ciravegna F, Domingue J, Handschuh S, Lavelli A, Staab S & Stevenson M (2003) Requirements for Information Extraction for Knowledge Management. Knowledge Management and Semantic Annotation Workshop at Second International Semantic Web Conference (ISWC-2003) (pp 89-94-89-94). Sanibel, FL. RIS download Bibtex download
  • Stevenson M & Ciravegna F (2003) Information Extraction as a Semantic Web Technology: Requirements and Promises. Proceedings of the 14th European Conference on Machine Learning (ECML 2003) workshop “Adaptive Text Extraction and Mining”. Cavtat-Dubrovnik, Croatia RIS download Bibtex download
  • Clough P & Stevenson M (2003) Evaluating the Contribution of EuroWordNet and Word Sense Disambiguation to Cross-language Information Retrieval. GWC 2004: SECOND INTERNATIONAL WORDNET CONFERENCE, PROCEEDINGS (pp 97-105) View this article in WRRO RIS download Bibtex download
  • Stevenson M (2002) Combining Disambiguation Techniques to Enrich an Ontology. Proceedings of the 15th European Conference on Artificial Intelligence (ECAI-02) workshop “Machine Learning and Natural Language Processing for Ontology Engineering” (pp 43-50-43-50). Lyon, France RIS download Bibtex download
  • Rose T, Stevenson M & Whitehead M (2002) The Reuters Corpus – from Yesterday’s News to Tomorrow’s Language Resources. Proceedings of the Third International Conference on Language Resources and Evaluation (LREC-02) (pp 827-832-827-832). Las Palmas, Canary Islands RIS download Bibtex download
  • Stevenson M (2002) Augmenting Noun Taxonomies by Combining Lexical Similarity Metrics. Proceedings of the 19th International Conference on Computational Linguistics (COLING-02) (pp 953-959-953-959). Taipei, Taiwan RIS download Bibtex download
  • Stevenson M (2001) Adding Thesaural Information to Noun Taxonomies (poster). Proceedings of the Second International Conference on Recent Advances in Natural Language Processing (RANLP-01) (pp 297-299-297-299). Tzigov Chark, Bulgaria RIS download Bibtex download
  • Stevenson M & Gaizauskas R (2000) Improving Named Entity Recognition using Annotated Corpora. Proceedings of the Second International Conference on Language Resources and Evaluation (LREC-2000) workshop “Information Extraction meets Corpus Linguistics” (pp 26-32-26-32). Athens, Greece RIS download Bibtex download
  • Stevenson M & Gaizauskas RJ (2000) Using Corpus-derived Name Lists for Named Entity Recognition.. ANLP (pp 290-295) RIS download Bibtex download
  • Stevenson M & Gaizauskas RJ (2000) Experiments on Sentence Boundary Detection.. ANLP (pp 84-89) RIS download Bibtex download
  • Renals S, Goto Y, Gaizauskas R & Stevenson M (1999) Baseline IE-NE Experiments using the SPRACH/LASIE System. Proceedings of the DARPA HUB-4 Workshop (pp 47-50-47-50). Herndon, Virginia RIS download Bibtex download
  • Stevenson M & Wilks Y (1999) Combining weak knowledge sources for sense disambiguation. IJCAI-99: PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 & 2 (pp 884-889) RIS download Bibtex download
  • Stevenson M (1999) A corpus-based approach to deriving lexical mappings. NINTH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS (pp 285-286) RIS download Bibtex download
  • Basili R, Catizone R, Pazienza M, Stevenson M, Velardi P & Wilks Y (1998) An Empirical Approach to Lexical Tuning. First International Conference on Language Resources and Evaluation (LREC-98) Workshop on Adapting Lexical and Corpus Resources to Sublanguages and Applications (pp 27-33-27-33). Granada, Spain RIS download Bibtex download
  • Cunningham H, Stevenson M & Wilks Y (1998) Implementing a Sense Tagger within a General Architecture for Text Engineering. Proceedings of the New Methods in Language Processing Conference (NeMLaP-3) (pp 59-72-59-72). Sydney, Australia RIS download Bibtex download
  • Wilks Y & Stevenson M (1998) Word Sense Disambiguation using Optimised Combinations of Knowledge Sources. Proceedings of the 17th International Conference on Computational Linguistics and the 36th Annual Meeting of the Association for Computational Linguistics (COLING-ACL-98) (pp 1398-1402-1398-1402). Montreal, Canada RIS download Bibtex download
  • Stevenson M (1998) Extracting Syntactic Relations using Heuristics. Proceedings of the European Summer School in Logic, Language and Information (ESSLLI-98) (pp 248-256-248-256). Saarbrücken, Germany RIS download Bibtex download
  • Stevenson M, Cunningham H & Wilks Y (1998) Sense tagging and language engineering. ECAI 1998: 13TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS (pp 185-189) RIS download Bibtex download
  • Wilks Y & Stevenson M (1997) Sense Tagging: Semantic Tagging with a Lexicon. Fifth Conference on Applied Natural Language Processing (ANLP-1997) Workshop “Tagging Text with Lexical Semantics: Why, What and How?” (pp 47-51-47-51). Washington, D.C. RIS download Bibtex download
  • Wilks Y & Stevenson M (1997) Combining Independent Knowledge Sources for Word Sense Disambiguation. Proceedings of Recent Advances in Natural Language Processing (RANLP-97) (pp 1-7-1-7). Tzigov Chark, Bulgaria RIS download Bibtex download
  • Zhang H, Chen Q, Zou Y, Pan Y, Wang J & Stevenson R () Document Set Expansion with Positive-Unlabelled Learning Using Intractable Density Estimation. Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation RIS download Bibtex download
  • Bin-Hezam R & Stevenson R () Combining counting processes and classification improves a stopping rule for technology assisted review. Findings of the Association for Computational Linguistics: EMNLP 2023. Singapore, 6 December 2023 - 6 December 2023. View this article in WRRO RIS download Bibtex download
  • Peng X, Zhang Y, Yang J & Stevenson R () On the Vulnerabilities of Text-to-SQL Models. Proceedings of the 34th IEEE International Symposium on Software Reliability Engineering RIS download Bibtex download
  • Wood G & Demirbag M () Introduction RIS download Bibtex download
  • Preiss J & Stevenson RM () HiDE: A Tool for Unrestricted Literature Based Discovery. Proceedings of the 27th International Conference on Computational Linguistics (COLING 2018) View this article in WRRO RIS download Bibtex download
  • Agirre E, Barrena A, Lopez de Lacalla O, Soroa A, Stevenson M & Fernando S () Matching Cultural Heritage items to Wikipedia. Proceedings of the 8th International Conference on Language Resources and Evaluation. Istanbul, Turkey RIS download Bibtex download
  • Fernando S & Stevenson M () Mapping WordNet synsets to Wikipedia articles. Proceedings of the 8th International Conference on Language Resources and Evaluation. Istanbul, Turkey RIS download Bibtex download
  • Nawab R, Stevenson M & Clough P () Detecting Text Reuse with Modified and Weighted N-grams. *SEM 2012: The First Joint Conference on Lexical and Computational Semantics – Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation (SemEval 2012) (pp 54-58). Montréal, Canada RIS download Bibtex download
  • Biggins S, Mohammed S, Oakley S, Stringer L, Stevenson M & Preiss J () University_Of_Sheffield: Two Approaches to Semantic Text Similarity. *SEM 2012: The First Joint Conference on Lexical and Computational Semantics – Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation (SemEval 2012) (pp 655-661). Montréal, Canada RIS download Bibtex download
  • Peng X, Chen G, Lin C & Stevenson M () Highly Efficient Knowledge Graph Embedding Learning with Orthogonal Procrustes Analysis RIS download Bibtex download
  • Sneyd A & Stevenson R () Stopping Criteria for Technology Assisted Reviews based on Counting Processes. Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’21), 11 July 2021 - 15 July 2021. RIS download Bibtex download

Reports

  • Greenwood M & Stevenson M (2006) On the Expressiveness of Information Extraction Patterns RIS download Bibtex download
  • Stevenson M (2003) Evaluating the Single Sentence Assumption in Information Extraction RIS download Bibtex download
  • Stevenson M (1998) Shallow Parsing using Heuristics RIS download Bibtex download
  • Wilks Y & Stevenson M (1996) Sense Tagging: Semantic Tagging with a Lexicon RIS download Bibtex download

Preprints

Grants

Current Grants

  • Distinguishing Common and Proper Nouns, Industrial, 03/2011 - 12/2022, £31,847 as PI

Previous Grants

  • Automatically mapping and assessing inequalities in public health research, NIHR, 04/2021 - 12/2021, £48,764, as PI
  • Institute of Coding, HEFCE, 11/2017 - 03/2021, £957,000, as Co-PI
  • Digital Sensitivity Review, Industrial, 11/2018 - 03/2019, £39,880, as PI
  • Data Analytics, Royal Academy of Engineering, 09/2017 - 09/2020, £30,000 as PI
  • Recommendation Algorithm, Industrial, 04/2017 - 10/2017, £60,600 as PI
  • HiDE: A Tool for Unrestricted Literature Based Discovery, Government, 01/2016 - 06/2016, £66,584 as PI
  • InPuT: Individual Profiling using Text Analysis, Government, 09/2014 - 09/2015, £10,746 as PI
  • Information Processing and Sensemaking: An Exploratory Search System for Document Collections, Government, 09/2014 - 08/2015, £77,840 as PI
  • Connected Marketplace, Industrial, 01/2014 - 08/2014, £5,000 as PI
  • PUMP: Developing a Data Set of Textual and Visual Topic Labels, EPSRC, 09/2013 - 10/2013, £1,540 as PI
  • Language Processing for Literature Based Discovery in Medicine, EPSRC, 06/2012 - 05/2015, £293,127 as PI
  • PATHS: Personalised Access to Cultural Heritage Spaces, EC FP7, 01/2011 - 12/2013, £709,407 as PI
Professional activities and memberships
  • Area chair for EACL 2017 track ``Document analysis including text categorisation, topic models, and retrieval’’
    Winner of best paper award at CLEF 2004 (with Roland Roller)
  • Keynote speaker at RANLP 2013
  • Area chair for EMNLP 2013 track “semantics”
  • Assistant Director of Advanced Computing Research Centre
  • Co-ordinator of EU-funded project (PATHS)
  • Member of ACL SIGLEX board (2010-2013 and 2013-2016)
  • EPSRC Advanced Research Fellow (2006-2011)
  • Member of editorial board of Computational Linguistics (2008-2010)
  • Member of the Natural Language Processing research group