Dr Xingyi Song

School of Computer Science

Lecturer in Computational Media Analysis, Natural Language Processing

Member of the Natural Language Processing research group

Profile photo of Xingyi Song
Profile picture of Profile photo of Xingyi Song
x.song@sheffield.ac.uk
+44 114 222 1867

Full contact details

Dr Xingyi Song
School of Computer Science
Regent Court (DCS)
211 Portobello
Sheffield
S1 4DP
Profile

Dr Xingyi Song, a Lecturer in Computational Media Analysis at the Department of Computer Science, University of Sheffield. He is a member of the Natural Language Processing group and GATE team (https://gate.ac.uk/)

Previously he worked as a machine translation specialist at Iconic Translation Machine (2015-2016) and Research Associate for several EU funded projects such as Kconnect, Knowmak and Risis2 (from 2016-2021)) at the University of Sheffield. 

He completed his MSc and PhD in Natural Language Processing group at the University of Sheffield. His research interests are in Natural Language Processing, Computational Social Science, sentiment analysis and Bio-medical text processing. 

Publications

Journal articles

Chapters

Conference proceedings papers

  • Hughes A & Song X (2024) Identifying and Aligning Medical Claims Made on Social Media with Medical Evidence. 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings (pp 8580-8593) RIS download Bibtex download
  • Mu Y, Wu BP, Thorne W, Robinson A, Aletras N, Scarton C, Bontcheva K & Song X (2024) Navigating Prompt Complexity for Zero-Shot Classification: A Study of Large Language Models in Computational Social Science.. LREC/COLING (pp 12074-12086) RIS download Bibtex download
  • Mu Y, Dong C, Bontcheva K & Song X (2024) Large Language Models Offer an Alternative to the Traditional Approach of Topic Modelling.. LREC/COLING (pp 10160-10171) RIS download Bibtex download
  • Mu Y, Song X, Bontcheva K & Aletras N (2024) Examining the Limitations of Computational Rumor Detection Models Trained on Static Datasets.. LREC/COLING (pp 6739-6751) RIS download Bibtex download
  • Mu Y, Jin M, Bontcheva K & Song X (2024) Examining Temporalities on Stance Detection towards COVID-19 Vaccination.. LREC/COLING (pp 6732-6738) RIS download Bibtex download
  • Grimshaw C, Bontcheva K & Song X (2024) SheffieldVeraAI at SemEval-2024 Task 4: Prompting and fine-tuning a Large Vision-Language Model for Binary Classification of Persuasion Techniques in Memes. Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024), June 2024 - June 2024. RIS download Bibtex download
  • Przybyła P, Wu B, Shvets A, Mu Y, Sheang KC, Song X & Saggion H (2024) Overview of the CLEF-2024 CheckThat! Lab Task 6 on Robustness of Credibility Assessment with Adversarial Examples (InCrediblAE). CEUR Workshop Proceedings, Vol. 3740 (pp 321-338) RIS download Bibtex download
  • Yang X, Mu Y, Bontcheva K & Song X (2024) Optimising LLM-Driven Machine Translation with Context-Aware Sliding Windows. Proceedings of the Ninth Conference on Machine Translation (pp 1004-1010), November 2024 - November 2024. RIS download Bibtex download
  • Mu Y, Jin M, Song X & Aletras N (2024) Enhancing Data Quality through Simple De-duplication: Navigating Responsible Computational Social Science Research. Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (pp 12477-12492), November 2024 - November 2024. RIS download Bibtex download
  • Wilby D, Karmakharm T, Roberts I, Song X & Bontcheva K (2023) GATE Teamware 2: An open-source tool for collaborative document classification annotation. EACL 2023 - 17th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of System Demonstrations (pp 145-151) RIS download Bibtex download
  • Wu B, Razuvayevskaya O, Heppell F, Leite JA, Scarton C, Bontcheva K & Song X (2023) SheffieldVeraAI at SemEval-2023 Task 3: Mono and Multilingual Approaches for News Genre, Topic and Persuasion Technique Classification. Proceedings of the The 17th International Workshop on Semantic Evaluation (SemEval-2023), July 2023 - July 2023. RIS download Bibtex download
  • Liang T, Mu Y, Kim S, Kuate DLK, Lang J, Vos R & Song X (2023) Classification-Aware Neural Topic Model CombinedWith Interpretable Analysis - For Conflict Classification. International Conference Recent Advances in Natural Language Processing, RANLP (pp 666-672) RIS download Bibtex download
  • Jiang Y, Song X, Scarton C, Singh I, Aker A & Bontcheva K (2023) Categorising Fine-to-Coarse Grained Misinformation: An Empirical Study of the COVID-19 Infodemic. International Conference Recent Advances in Natural Language Processing, RANLP (pp 556-567) RIS download Bibtex download
  • Mu Y, Jiang Y, Heppell F, Singh I, Scarton C, Bontcheva K & Song X (2023) A Large-Scale Comparative Study of Accurate COVID-19 Information versus Misinformation.. CoRR, Vol. abs/2304.04811 RIS download Bibtex download
  • Li Y, Scarton C, Song X & Bontcheva K (2023) Classifying COVID-19 Vaccine Narratives.. RANLP (pp 648-657) RIS download Bibtex download
  • Wu B, Li Y, Mu Y, Scarton C, Bontcheva K & Song X (2023) Don't waste a single annotation: improving single-label classifiers through soft labels.. EMNLP (Findings) (pp 5347-5355) RIS download Bibtex download
  • Singh I, Bontcheva K, Song X & Scarton C (2022) Comparative Analysis of Engagement, Themes, and Causality of Ukraine-Related Debunks and Disinformation (pp 128-143) RIS download Bibtex download
  • Singh I, Bontcheva K, Song X & Scarton C (2022) Comparative Analysis of Engagement, Themes, and Causality of Ukraine-Related Debunks and Disinformation.. SocInfo, Vol. 13618 (pp 128-143) RIS download Bibtex download
  • Jiang Y, Wang Y, Maynard D & Song X (2020) Comparing topic-aware neural networks for bias detection of news. Proceedings of 24th European Conference on Artificial Intelligence (ECAI 2020), Vol. 325 (pp 2054-2061). Santiago de Compostela, Spain, 29 August 2020 - 2 September 2020. View this article in WRRO RIS download Bibtex download
  • Song X, Downs J, Velupillai S, Holden R, Kikoler M, Bontcheva K, Dutta R & Roberts A (2020) Using deep neural networks with intra- And inter-sentence context to classify suicidal behaviour. LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings (pp 1303-1310) RIS download Bibtex download
  • Gao J, Han S, Song X & Ciravegna F (2020) RP-DNN: A tweet level propagation context based deep neural networks for early rumor detection in social media. LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings (pp 6094-6105) View this article in WRRO RIS download Bibtex download
  • Jiang Y, Petrak J, Song X, Bontcheva K & Maynard D (2019) Team Bertha von Suttner at SemEval-2019 Task 4: Hyperpartisan News Detection using ELMo Sentence Representation Convolutional Network. Proceedings of the 13th International Workshop on Semantic Evaluation, June 2019 - June 2019. RIS download Bibtex download
  • Jiang Y, Petrak J, Song X, Bontcheva K & Maynard D (2019) Team Bertha von Suttner at SemEval-2019 Task 4: Hyperpartisan News Detection using ELMo Sentence Representation Convolutional Network. Proceedings of the 13th International Workshop on Semantic Evaluation. Minneapolis, Minnesota, USA, 6 June 2019 - 7 June 2019. View this article in WRRO RIS download Bibtex download
  • Song X, Petrak J & Roberts A (2018) A deep neural network sentence level classification method with context information. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (pp 900-904). Brussels, Belgium, 31 October 2018 - 4 November 2018. View this article in WRRO RIS download Bibtex download
  • Song X, Petrak J & Roberts A (2018) A Deep Neural Network Sentence Level Classification Method with Context Information. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018) (pp 900-904) RIS download Bibtex download
  • Jiang Y, Song X, Harrison J, Quegan S & Maynard D (2017) Comparing Attitudes to Climate Change in the Media using sentiment analysis based on Latent Dirichlet Allocation. Proceedings of the 2017 EMNLP Workshop: Natural Language Processing meets Journalism, September 2017 - September 2017. RIS download Bibtex download
  • Blain F, Song X & Specia L (2016) Sheffield Systems for the English-Romanian WMT Translation Task. Proceedings of the First Conference on Machine Translation RIS download Bibtex download
  • Song X, Specia L & Cohn T (2014) Data selection for discriminative training in statistical machine translation. Proceedings of the 17th Annual Conference of the European Association for Machine Translation, EAMT 2014 (pp 45-52) RIS download Bibtex download
  • Song X, Cohn T & Specia L (2013) BLEU deconstructed: Designing a Better MT Evaluation Metric. Proceedings of the 14th International Conference on Intelligent Text Processing and Computational Linguistics (CICLING) RIS download Bibtex download
  • Song X & Cohn T (2011) Regression and Ranking based Optimisation for Sentence Level Machine Translation Evaluation. Proceedings of the Sixth Workshop on Statistical Machine Translation. Edinburgh, UK RIS download Bibtex download

Datasets

Preprints

Grants

ASIMOV: AI-as-a-service, Innovate UK, 01/2024 - 03/2025, £142,691, as PI.