Dr Xingyi Song

School of Computer Science

Lecturer in Computational Media Analysis, Natural Language Processing

Outreach Support

Member of the Natural Language Processing research group

x.song@sheffield.ac.uk

Regent Court (CS)

Full contact details

Dr Xingyi Song
School of Computer Science
Regent Court (CS)
211 Portobello
Sheffield
S1 4DP

Profile

Dr. Xingyi Song is a Lecturer in Computational Media Analysis in the School of Computer Science at the University of Sheffield, where he is a core member of the Natural Language Processing (NLP) research group and the GATE team.

His research focuses on enabling AI to understand and interpret complex data, ranging from human language and digital media to physical machinery and industrial systems, with a particular emphasis on building AI systems that people can trust.

Dr. Song actively translates academic research into practical solutions that benefit society and industry. Through knowledge exchange, he co-developed a text analytics platform for the National Health Service (NHS) to help process clinical data and improve patient insights, and collaborated with the International Food Policy Research Institute (IFPRI) to build a media analysis tool for its Food Security Portal. He also holds a research leadership role at Sentient Machines, where he drives the development of trustworthy AI applications in the commercial and fintech sectors.

Prior to his current roles, Dr. Song worked as a machine translation specialist at Iconic Translation Machines (RWS) and as a Research Associate on major EU-funded projects (KConnect, KNOWMAK, and RISIS2) at the University of Sheffield. He holds a BEng in Control Engineering, and an MSc and PhD in Natural Language Processing.

Publications

Journal articles

Kaur L, Griffiths AW, Harrison J, Song X & Blackburn D (2026) Navigating diagnosis: UK informal caregivers’ experiences of the dementia assessment journey. Aging & Mental Health.
Jiang Y, Wang T, Xu X, Wang Y, Song X & Maynard D (2025) Cross-modal augmentation for few-shot multimodal fake news detection. Engineering Applications of Artificial Intelligence, 142, 109931-109931.
Razuvayevskaya O, Wu B, Leite JA, Heppell F, Srba I, Scarton C, Bontcheva K & Song X (2024) Comparison between parameter-efficient techniques and full fine-tuning: a case study on multilingual news article classification. PLoS ONE, 19(5).
Mu Y, Jin M, Bontcheva K & Song X (2024) Examining temporalities on stance detection towards COVID-19 vaccination. 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings, 6732-6738.
Mu Y, Wu BP, Thorne W, Robinson A, Aletras N, Scarton C, Bontcheva K & Song X (2024) Navigating prompt complexity for zero-shot classification: a study of large language models in computational social science. Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), 12074-12086.
Mu Y, Song X, Bontcheva K & Aletras N (2024) Examining the limitations of computational rumor detection models trained on static datasets. 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings, 6739-6751.
Mu Y, Dong C, Bontcheva K & Song X (2024) Large language models offer an alternative to the traditional approach of topic modelling. Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), 10160-10171.
Mu Y, Jin M, Song X & Aletras N (2024) Enhancing Data Quality through Simple De-duplication: Navigating Responsible Computational Social Science Research. CoRR, abs/2410.03545.
Scarton C, Prescott C, Bayliss C, Oakley C, Wright J, Wrigley S & Song X (2024) Message from the Organising Committee. Proceedings of the 25th Annual Conference of the European Association for Machine Translation, EAMT 2024, 1, iv-v.
Scarton C, Oakley C, Prescott C, Wright J, Bayliss C, Wrigley S & Song X (2024) Message from the Organising Committee. Proceedings of the 25th Annual Conference of the European Association for Machine Translation, EAMT 2024, 2, iv-v.
Wu B, Li Y, Mu Y, Scarton C, Bontcheva K & Song X (2023) Don’t waste a single annotation: improving single-label classifiers through soft labels. Findings of the Association for Computational Linguistics: EMNLP 2023, 5347-5355.
Li Y, Scarton C, Song X & Bontcheva K (2023) Classifying COVID-19 vaccine narratives. Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing, 648-657.
Jiang Y, Yu X, Wang Y, Xu X, Song X & Maynard D (2023) Similarity-aware multimodal prompt learning for fake news detection. Information Sciences, 647.
Mu Y, Jin M, Grimshaw C, Scarton C, Bontcheva K & Song X (2023) VaxxHesitancy: A dataset for studying hesitancy towards COVID-19 vaccination on Twitter. Proceedings of the International AAAI Conference on Web and Social Media, 17(1), 1052-1062.
Zhang Z & Song X (2023) An exploratory study on utilising the web of linked data for product data mining. SN Computer Science, 4(1).
Chilman N, Song X, Roberts A, Tolani E, Stewart R, Chui Z, Birnie K, Harber-Aschan L, Gazard B, Chandran D, Sanyal J et al (2021) Text mining occupations from the mental health electronic health record: a natural language processing approach using records from the Clinical Record Interactive Search (CRIS) platform in south London, UK. BMJ Open, 11(3), e042274-e042274.
Song X, Petrak J, Jiang Y, Singh I, Maynard D & Bontcheva K (2021) Classification aware neural topic model for COVID-19 disinformation categorisation. PLoS ONE, 16(2).
Maynard D, Lepori B, Petrak J, Song X & Laredo P (2020) Using ontologies to map between research data and policymakers’ presumptions: the experience of the KNOWMAK project. Scientometrics, 125(2), 1275-1290.
He L, Gilbert M & Song X (2019) A Python script for adaptive layout optimization of trusses. Structural and Multidisciplinary Optimization, 60(2), 835-847.
Jackson R, Kartoglu I, Stringer C, Gorrell G, Roberts A, Song X, Wu H, Agrawal A, Lui K, Groza T, Lewsley D et al (2018) CogStack - experiences of deploying integrated information retrieval and extraction services in a large National Health Service Foundation Trust hospital. BMC Medical Informatics and Decision Making, 18(1).
Jackson R, Kartoglu IE, Agrawal A, Lui K, Wu H, Groza T, Roberts A, Gorrell G, Song X, Lewsley D, Northwood D et al (2017) CogStack - Experiences Of Deploying Integrated Information Retrieval And Extraction Services In A Large National Health Service Foundation Trust Hospital.
Cook O, Mu Y, Yang X, Song X & Bontcheva K () A Dataset for Analysing News Framing in Chinese Media. Proceedings of the International AAAI Conference on Web and Social Media, 19, 2402-2412.
Jiang Y, Yu X, Wang Y, Xu X, Song X & Maynard D () Similarity-Aware Multimodal Prompt Learning for Fake News Detection. SSRN Electronic Journal.

Book chapters

Barrón-Cedeño A, Alam F, Chakraborty T, Elsayed T, Nakov P, Przybyła P, Struß JM, Haouari F, Hasanain M, Ruggeri F, Song X et al (2024) The CLEF-2024 CheckThat! Lab: Check-Worthiness, Subjectivity, Persuasion, Roles, Authorities, and Adversarial Robustness, Lecture Notes in Computer Science (pp. 449-458). Springer Nature Switzerland
Barrón-Cedeño A, Alam F, Struß JM, Nakov P, Chakraborty T, Elsayed T, Przybyła P, Caselli T, Da San Martino G, Haouari F, Hasanain M et al (2024) Overview of the CLEF-2024 CheckThat! Lab: Check-Worthiness, Subjectivity, Persuasion, Roles, Authorities, and Adversarial Robustness, Lecture Notes in Computer Science (pp. 28-52). Springer Nature Switzerland

Conference proceedings

Cook O, Grimshaw C, Wu BP, Dillon S, Hicks J, Jones L, Smith T, Szert M & Song X (2025) Efficient Annotator Reliability Assessment and Sample Weighting for Knowledge-Based Misinformation Detection on Social Media. Findings of the Association for Computational Linguistics: NAACL 2025 (pp 3348-3358), April 2025 - April 2025.
Cook O, Vasilakes JA, Roberts I & Song X (2025) Efficient Annotator Reliability Assessment with EffiARA. Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations) (pp 542-550), July 2025 - July 2025.
Singh I, Scarton C, Song X & Bontcheva K (2025) Breaking Language Barriers with MMTweets: Advancing Cross-Lingual Debunked Narrative Retrieval for Fact-Checking. CEUR Workshop Proceedings, Vol. 4070 (pp 1-19)
Stolfo A, Wu B, Gurnee W, Belinkov Y, Song X, Sachan M & Nanda N (2024) Confidence regulation neurons in language models. Advances in Neural Information Processing Systems. Vancouver, Canada, 10 December 2024 - 10 December 2024.
Yang X, Mu Y, Bontcheva K & Song X (2024) Optimising LLM-driven machine translation with context-aware sliding windows. Proceedings of the Ninth Conference on Machine Translation (pp 1004-1010). Miami, Florida, USA, 15 November 2024 - 15 November 2024.
Hughes A & Song X (2024) Identifying and Aligning Medical Claims Made on Social Media with Medical Evidence. 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings (pp 8580-8593)
Mu Y, Wu BP, Thorne W, Robinson A, Aletras N, Scarton C, Bontcheva K & Song X (2024) Navigating Prompt Complexity for Zero-Shot Classification: A Study of Large Language Models in Computational Social Science. LREC/COLING (pp 12074-12086)
Mu Y, Dong C, Bontcheva K & Song X (2024) Large Language Models Offer an Alternative to the Traditional Approach of Topic Modelling. LREC/COLING (pp 10160-10171)
Mu Y, Song X, Bontcheva K & Aletras N (2024) Examining the Limitations of Computational Rumor Detection Models Trained on Static Datasets. LREC/COLING (pp 6739-6751)
Mu Y, Jin M, Bontcheva K & Song X (2024) Examining Temporalities on Stance Detection towards COVID-19 Vaccination. LREC/COLING (pp 6732-6738)
Grimshaw C, Bontcheva K & Song X (2024) SheffieldVeraAI at SemEval-2024 Task 4: Prompting and fine-tuning a Large Vision-Language Model for Binary Classification of Persuasion Techniques in Memes. Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024) (pp 2051-2056), June 2024 - June 2024.
Przybyła P, Wu B, Shvets A, Mu Y, Sheang KC, Song X & Saggion H (2024) Overview of the CLEF-2024 CheckThat! Lab Task 6 on Robustness of Credibility Assessment with Adversarial Examples (InCrediblAE). CEUR Workshop Proceedings, Vol. 3740 (pp 321-338)
Mu Y, Jin M, Song X & Aletras N (2024) Enhancing Data Quality through Simple De-duplication: Navigating Responsible Computational Social Science Research. Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (pp 12477-12492), November 2024 - November 2024.
Gibbons M, Mi M, Song X & Villavicencio A (2024) ShefCDTeam at SemEval-2024 Task 4: A Text-to-Text Model for Multi-Label Classification. Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024) (pp 1860-1867), June 2024 - June 2024.
Jiang Y, Song X, Scarton C, Singh I, Aker A & Bontcheva K (2023) Categorising fine-to-coarse grained misinformation: an empirical study of the COVID-19 infodemic. Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing (pp 556-567). Varna, Bulgaria, 8 September 2023 - 8 September 2023.
Wilby D, Karmakharm T, Roberts I, Song X & Bontcheva K (2023) GATE Teamware 2: An open-source tool for collaborative document classification annotation. Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations (pp 145-151), May 2023 - May 2023.
Wu B, Razuvayevskaya O, Heppell F, Leite JA, Scarton C, Bontcheva K & Song X (2023) SheffieldVeraAI at SemEval-2023 Task 3: Mono and Multilingual Approaches for News Genre, Topic and Persuasion Technique Classification. Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023) (pp 1995-2008), July 2023 - July 2023.
Mu Y, Jiang Y, Heppell F, Singh I, Scarton C, Bontcheva K & Song X (2023) A Large-Scale Comparative Study of Accurate COVID-19 Information versus Misinformation. CoRR, Vol. abs/2304.04811
Li Y, Scarton C, Song X & Bontcheva K (2023) Classifying COVID-19 Vaccine Narratives. RANLP (pp 648-657)
Wu B, Li Y, Mu Y, Scarton C, Bontcheva K & Song X (2023) Don't waste a single annotation: improving single-label classifiers through soft labels. EMNLP (Findings) (pp 5347-5355)
Singh I, Bontcheva K, Song X & Scarton C (2022) Comparative analysis of engagement, themes, and causality of Ukraine-related debunks and disinformation. Social Informatics: 13th International Conference, SocInfo 2022, Glasgow, UK, October 19–21, 2022, Proceedings (pp 128-143). Glasgow, UK, 19 October 2022 - 19 October 2022.
Singh I, Bontcheva K, Song X & Scarton C (2022) Comparative Analysis of Engagement, Themes, and Causality of Ukraine-Related Debunks and Disinformation. SocInfo, Vol. 13618 (pp 128-143)
Jiang Y, Wang Y, Song X & Maynard D (2020) Comparing topic-aware neural networks for bias detection of news. Proceedings of 24th European Conference on Artificial Intelligence (ECAI 2020), Vol. 325 (pp 2054-2061). Santiago de Compostela, Spain, 29 August 2020 - 29 August 2020.
Gao J, Han S, Song X & Ciravegna F (2020) RP-DNN: a Tweet level propagation context based deep neural networks for early rumor detection in social media. Proceedings of the 12th Language Resources and Evaluation Conference (LREC 2020) (pp 6094-6105). Marseille, France, 11 May 2020 - 11 May 2020.
Song X, Downs J, Velupillai S, Holden R, Kikoler M, Bontcheva K, Dutta R & Roberts A (2020) Using deep neural networks with intra- and inter-sentence context to classify suicidal behaviour. LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings (pp 1303-1310)
Jiang Y, Petrak J, Song X, Bontcheva K & Maynard D (2019) Team Bertha von Suttner at SemEval-2019 Task 4: Hyperpartisan News Detection using ELMo Sentence Representation Convolutional Network. Proceedings of the 13th International Workshop on Semantic Evaluation (pp 840-844). Minneapolis, Minnesota, USA, 6 July 2019 - 6 July 2019.
Jiang Y, Petrak J, Song X, Bontcheva K & Maynard D (2019) Team Bertha von Suttner at SemEval-2019 Task 4: Hyperpartisan News Detection using ELMo Sentence Representation Convolutional Network. Proceedings of the 13th International Workshop on Semantic Evaluation, June 2019 - June 2019.
Song X, Petrak J & Roberts A (2018) A deep neural network sentence level classification method with context information. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (pp 900-904). Brussels, Belgium, 31 October 2018 - 31 October 2018.
Song X, Petrak J & Roberts A (2018) A Deep Neural Network Sentence Level Classification Method with Context Information. 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP 2018) (pp 900-904)
Jiang Y, Song X, Harrison J, Quegan S & Maynard DG (2017) Comparing Attitudes to Climate Change in the Media using sentiment analysis based on Latent Dirichlet Allocation. Proc. of EMNLP Workshop "Natural Language Meets Journalism"
Blain F, Song X & Specia L (2016) Sheffield Systems for the English-Romanian Translation Task. Proceedings of the Annual Meeting of the Association for Computational Linguistics, Vol. 2 (pp 259-263)
Song X, Specia L & Cohn T (2014) Data selection for discriminative training in statistical machine translation. Proceedings of the 17th Annual Conference of the European Association for Machine Translation, EAMT 2014 (pp 45-52)
Song X, Cohn T & Specia L (2013) BLEU deconstructed: Designing a Better MT Evaluation Metric. Proceedings of the 14th International Conference on Intelligent Text Processing and Computational Linguistics (CICLING)
Song X & Cohn T (2011) Regression and Ranking based Optimisation for Sentence Level Machine Translation Evaluation. WMT 2011: 6th Workshop on Statistical Machine Translation, Proceedings of the Workshop (pp 123-129)
Sun S, Wu BP, Jin M, Bai P, Zhang H & Song X () ESG-Bench: Benchmarking Long-Context ESG Reports for Hallucination Mitigation. Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 40(46) (pp 39322-39330)
Liang T, Mu Y, Kim S, Kengne Kuate DL, Lang J, Vos R & Song X () Classification-Aware Neural Topic Model Combined With Interpretable Analysis - For Conflict Classification. Proceedings of the Conference Recent Advances in Natural Language Processing - Large Language Models for Natural Language Processings (pp 666-672)

Datasets

Gao J, Han S & Song X Trained RPDNN LOO-CV models for early rumor detection.

Preprints

Qin X, Song X, Liu T, Laalej H, Liu Z, Zhu Y & He L (2026) LoRM: Learning the Language of Rotating Machinery for Self-Supervised Condition Monitoring, arXiv.
Cook O, Vasilakes J, Roberts I & Song X (2025) Efficient Annotator Reliability Assessment with EffiARA, arXiv.
Cook O, Mu Y, Yang X, Song X & Bontcheva K (2025) A Dataset for Analysing News Framing in Chinese Media, arXiv.
Cook O, Grimshaw C, Wu B, Dillon S, Hicks J, Jones L, Smith T, Szert M & Song X (2025) Efficient Annotator Reliability Assessment and Sample Weighting for Knowledge-Based Misinformation Detection on Social Media, arXiv.
Mu Y, Jin M, Song X & Aletras N (2024) Enhancing Data Quality through Simple De-duplication: Navigating Responsible Computational Social Science Research, arXiv.
Jiang Y, Wang T, Xu X, Wang Y, Song X & Maynard D (2024) Cross-Modal Augmentation for Few-Shot Multimodal Fake News Detection, arXiv.
Stolfo A, Wu B, Gurnee W, Belinkov Y, Song X, Sachan M & Nanda N (2024) Confidence Regulation Neurons in Language Models, arXiv.
Hughes A & Song X (2024) Identifying and Aligning Medical Claims Made on Social Media with Medical Evidence, arXiv.
Mu Y, Bai P, Bontcheva K & Song X (2024) Addressing Topic Granularity and Hallucination in Large Language Models for Topic Modelling, arXiv.
Mu Y, Dong C, Bontcheva K & Song X (2024) Large Language Models Offer an Alternative to the Traditional Approach of Topic Modelling, arXiv.
Wu B, Li Y, Mu Y, Scarton C, Bontcheva K & Song X (2023) Don't Waste a Single Annotation: Improving Single-Label Classifiers Through Soft Labels, arXiv.
Korda A, Heide M, Nag A, Trulley V-N, Rogg H-V, Avram M, Eickhoff S, Jauch-Chara K, Wehkamp K, Song X, Martinetz T et al (2023) Suicide prediction with natural language processing of electronic health records, Cold Spring Harbor Laboratory.
Mu Y, Song X, Bontcheva K & Aletras N (2023) Examining the Limitations of Computational Rumor Detection Models Trained on Static Datasets, arXiv.
Liang T, Mu Y, Kim S, Kuate DLK, Lang J, Vos R & Song X (2023) Classification-Aware Neural Topic Model Combined With Interpretable Analysis -- For Conflict Classification, arXiv.
Razuvayevskaya O, Wu B, Leite JA, Heppell F, Srba I, Scarton C, Bontcheva K & Song X (2023) Comparison between parameter-efficient techniques and full fine-tuning: A case study on multilingual news article classification, arXiv.
Robinson A, Thorne W, Wu BP, Pandor A, Essat M, Stevenson M & Song X (2023) Bio-SIEVE: Exploring Instruction Tuning Large Language Models for Systematic Review Automation, arXiv.
Singh I, Scarton C, Song X & Bontcheva K (2023) Breaking Language Barriers with MMTweets: Advancing Cross-Lingual Debunked Narrative Retrieval for Fact-Checking, arXiv.
Mu Y, Wu BP, Thorne W, Robinson A, Aletras N, Scarton C, Bontcheva K & Song X (2023) Navigating Prompt Complexity for Zero-Shot Classification: A Study of Large Language Models in Computational Social Science, arXiv.
Mu Y, Jiang Y, Heppell F, Singh I, Scarton C, Bontcheva K & Song X (2023) A Large-Scale Comparative Study of Accurate COVID-19 Information versus Misinformation, arXiv.
Mu Y, Jin M, Bontcheva K & Song X (2023) Examining Temporalities on Stance Detection towards COVID-19 Vaccination, arXiv.
Jiang Y, Yu X, Wang Y, Xu X, Song X & Maynard D (2023) Similarity-Aware Multimodal Prompt Learning for Fake News Detection, arXiv.
Wu B, Razuvayevskaya O, Heppell F, Leite JA, Scarton C, Bontcheva K & Song X (2023) SheffieldVeraAI at SemEval-2023 Task 3: Mono and multilingual approaches for news genre, topic and persuasion technique classification, arXiv.
Mu Y, Jin M, Grimshaw C, Scarton C, Bontcheva K & Song X (2023) VaxxHesitancy: A Dataset for Studying Hesitancy towards COVID-19 Vaccination on Twitter, arXiv.
Singh I, Bontcheva K, Song X & Scarton C (2022) Comparative Analysis of Engagement, Themes, and Causality of Ukraine-Related Debunks and Disinformation, arXiv.
Li Y, Scarton C, Song X & Bontcheva K (2022) Classifying COVID-19 vaccine narratives, arXiv.
Zhang Z & Song X (2022) An Exploratory Study on Utilising the Web of Linked Data for Product Data Mining, arXiv.
Jiang Y, Song X, Scarton C, Singh I, Aker A & Bontcheva K (2022) Categorising Fine-to-Coarse Grained Misinformation: An Empirical Study of the COVID-19 Infodemic, Research Square Platform LLC.
Jiang Y, Song X, Scarton C, Aker A & Bontcheva K (2021) Categorising Fine-to-Coarse Grained Misinformation: An Empirical Study of COVID-19 Infodemic, arXiv.
Gao J, Han S, Song X & Ciravegna F (2020) RP-DNN: A Tweet level propagation context based deep neural networks for early rumor detection in Social Media, arXiv.
Gorrell G, Song X & Roberts A (2018) Bio-YODIE: A Named Entity Linking System for Biomedical Text, arXiv.
Song X, Petrak J & Roberts A (2018) A Deep Neural Network Sentence Level Classification Method with Context Information, arXiv.

Grants

Multi-DocVerify: A Multimodal Benchmark for Misinformation Detection and Hallucination Mitigation in Long-Context Reporting, AI Hub in Generative Models, 06/2026 - 05/2027, £95,623, as PI
AI-based models for spatio-temporal analysis of online disinformation, EPSRC, 04/2026 - 03/2027, £54,373, as PI
ASIMOV: AI-as-a-service, Innovate UK, 01/2024 - 03/2025, £142,691, as PI
vera.ai (Verification assisted by AI), Horizon Europe, 09/2022 - 11/2025, £901,250, as Co-I
