Dr Mark Hepple
MSc, PhD
School of Computer Science
Reader
Member of the Natural Language Processing (NLP) research group


+44 114 222 1829
Full contact details
School of Computer Science
Regent Court (DCS)
211 Portobello
Sheffield
S1 4DP
- Profile
-
Mark Hepple is a Reader in Computer Science. He studied Psychology at Sheffield University (BSc, 1986), and Cognitive Science at Edinburgh University (MSc, 1987; PhD, 1990). Thereafter, he was a Research Associate at Cambridge University (1990-92), and a Postdoctoral Research Fellow at the University of Pennsylvania (1992-93).
He joined the Department of Computer Science at Sheffield University in 1993, as a Lecturer, and as a member of the Natural Language Processing group.
- Research interests
-
Dr Hepple has wide-ranging interests across Computational Linguistics and Natural Language Processing, and has published on many topics, including formal grammar and parsing, information extraction, clinical text mining, temporal information processing, robust dialogue processing, and efficient storage of large-scale linguistic data.
- Publications
-
Journal articles
- Toward an effective Igbo part-of-speech tagger. ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), 18(4). View this article in WRRO
- A Basic Language Resource Kit Implementation for the IgboNLPProject. ACM Transactions on Asian and Low-Resource Language Information Processing, 17(2), 1-23. View this article in WRRO
- Sub-story detection in Twitter with hierarchical Dirichlet processes. Information Processing & Management, 53(4), 989-1003. View this article in WRRO
- Mining clinical relationships from patient narratives.. BMC Bioinformatics, 9 Suppl 11, S3. View this article in WRRO
- A web service for biomedical term look-up.. Comp Funct Genomics, 6(1-2), 86-93. View this article in WRRO
- View this article in WRRO
- Feature-based formalism for two-level phonology: A description and implementation. Computer Speech and Language, 7(4), 333-358.
Chapters
- Using Semantic Inferences for Temporal Annotation Comparison, The Language Of Time (pp. 575-584). Oxford University PressOxford
- Machine Learning Approaches to Human Dialogue Modelling, Advances in Natural Multimodal Dialogue Systems (pp. 355-370). Springer Netherlands
- Two Functional Approaches For Interpreting D-Tree Grammar Derivations, Studies in Linguistics and Philosophy (pp. 185-204). Springer Netherlands
- Grammatical relations and the Lambek calculus, Discontinuous Constituency DE GRUYTER
Conference proceedings papers
- Multi-task projected embedding for Igbo. Text, Speech, and Dialogue : 21st International Conference, Proceedings (pp 285-294). Brno, Czech Republic, 11 September 2018 - 14 September 2018. View this article in WRRO
- Igbo Diacritic Restoration using Embedding Models. Proceedings of the 2018 Conference of the North American Chapter of
the Association for Computational Linguistics: Student Research
Workshop, June 2018 - June 2018.
- The SENSEI Overview of Newspaper Readers’ Comments. Advances in Information Retrieval. ECIR 2017. Lecture Notes in Computer Science, vol 10193. Springer (pp 758-761) View this article in WRRO
- Lexical Disambiguation of Igbo using Diacritic Restoration. Proceedings of the 1st Workshop on Sense, Concept and Entity
Representations and their Applications, April 2017 - April 2017. View this article in WRRO
- Automatic Label Generation for News Comment Clusters. Proceedings of the 9th International Natural Language Generation Conference (pp 61-69), 5 September 2016 - 8 September 2016. View this article in WRRO
- Automatic Restoration of Diacritics for Igbo Language. Text, Speech, and Dialogue, Vol. 9924 (pp 198-205), 12 September 2016 - 16 September 2016. View this article in WRRO
- Predicting Morphologically-Complex Unknown Words in Igbo. Text, Speech, and Dialogue, Vol. 9924 (pp 206-214), 12 September 2016 - 16 September 2016. View this article in WRRO
- The SENSEI Annotated Corpus: Human Summaries of Reader Comment Conversations in On-line News. Proceedings of the 17th Annual Meeting of the Special Interest Group on Discourse and Dialogue (pp 42-52), 13 September 2016 - 15 September 2016. View this article in WRRO
- View this article in WRRO
- View this article in WRRO
- A Graph-Based Approach to Topic Clustering for Online Comments to News. Advances in Information Retrieval (pp 15-29), 20 March 2016 - 23 March 2016. View this article in WRRO
- View this article in WRRO
- Comment-to-Article Linking in the Online News Domain. Proceedings of the 16th Annual Meeting of the Special Interest Group on Discourse and Dialogue, September 2015 - September 2015. View this article in WRRO
- Part-of-speech Tagset and Corpus Development for Igbo, an African Language. Proceedings of LAW VIII - The 8th Linguistic Annotation Workshop, August 2014 - August 2014. View this article in WRRO
- View this article in WRRO
- SemEval-2007 task 15. Proceedings of the 4th International Workshop on Semantic Evaluations - SemEval '07, 23 June 2007 - 24 June 2007.
- Task-Oriented Extraction of Temporal Information: The Case of Clinical Narratives.. TIME (pp 188-195)
- SUPPLE. Proceedings of the Ninth International Workshop on Parsing Technology - Parsing '05, 9 October 2005 - 10 October 2005.
- Independence and commitment. Proceedings of the 38th Annual Meeting on Association for Computational Linguistics - ACL '00, 3 October 2000 - 6 October 2000.
- View this article in WRRO
- Memoisation for glue language deduction and categorial parsing. Proceedings of the 17th international conference on Computational linguistics -, 10 August 1998 - 14 August 1998.
- Maximal incrementality in linear categorial deduction. Proceedings of the eighth conference on European chapter of the Association for Computational Linguistics -, 7 July 1997 - 12 July 1997.
Preprints
- Toward an effective Igbo part-of-speech tagger. ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), 18(4). View this article in WRRO
- Grants
-
Research Grants
- SENSEI: Making Sense of Human - Human Conversation, EC FP7, 11/2013 - 10/2016, £459,034, as Co-PI
- uComp: Embedded Human Computation for Knowledge Extraction and Evaluation, EPSRC, 11/2012 - 05/2016, £375,621, as Co-PI
- Reveal II, GCHQ, 10/2008 - 03/2010, £141,763, as PI
- CA4NLP: Engineering Natural Language Interfaces: can CA help?, EPSRC, 04/2008 - 03/2009, £49,480, as PI
- CLEF-Services, MRC, 01/2005 - 06/2008, £401,021, as Co-PI
- CLEF: Clinical E-Science Framework, MRC, 10/2002 - 01/2006, £280,725, as Co-PI
- POESIA: Public Open-source Environment for a Safer Internet, EC FP6, 02/2002 to 02/2004, £89,129, as PI
- Professional activities and memberships
-
Member of the Natural Language Processing research group