Professor Nikos Aletras

School of Computer Science

Professor of Natural Language Processing

Head of the Natural Language Processing (NLP) research group

n.aletras@sheffield.ac.uk

Regent Court (CS)

Full contact details

Professor Nikos Aletras
School of Computer Science
Regent Court (CS)
211 Portobello
Sheffield
S1 4DP

Profile: I am a Professor of Natural Language Processing (NLP), currently leading the NLP Group at the School of Computer Science, University of Sheffield. Previously, I gained industrial experience as a Scholar and Applied Scientist at Amazon. Before Amazon, I was a research associate in the Department of Computer Science at UCL, after completing a PhD in Natural Language Processing in the Department of Computer Science at the University of Sheffield.

Research interests

NLP
Computational Social Science
Legal NLP
Data Science
Machine Learning

Publications

Journal articles

Cao M, Tan X, Akhter ME, Valentino M, Liakata M, Wang X & Aletras N (2026) Fundamental Reasoning Paradigms Induce Out-of-Domain Generalization in Language Models. CoRR, abs/2602.08658.
Yamaguchi A, Mi M & Aletras N (2026) Enhancing Linguistic Competence of Language Models through Pre-training with Language Learning Tasks. CoRR, abs/2601.03448.
Karouzos C, Tan X & Aletras N (2026) An Empirical Study on Preference Tuning Generalization and Diversity Under Domain Shift. CoRR, abs/2601.05882.
Yamaguchi A, Morishita T, Villavicencio A & Aletras N (2025) Mitigating Catastrophic Forgetting in Target Language Adaptation of LLMs via Source-Shielded Updates. CoRR, abs/2512.04844.
Yamaguchi A, Villavicencio A & Aletras N (2025) How can we effectively expand the vocabulary of LLMs with 0.01GB of target language text?. Computational Linguistics.
Yamaguchi A, Morishita T, Villavicencio A & Aletras N (2025) Adapting chat language models using only target unlabeled language data. Transactions on Machine Learning Research, 2025(09).
Alajrami A, Tan X & Aletras N (2025) Fine-Tuning on Noisy Instructions: Effects on Generalization and Performance. CoRR, abs/2510.03528.
Lewis-Lim S, Tan X, Zhao Z & Aletras N (2025) Can Confidence Estimates Decide When Chain-of-Thought Is Necessary for LLMs?. CoRR, abs/2510.21007.
Cao M, Wang X & Aletras N (2025) Progressive Depth Up-scaling via Optimal Transport. CoRR, abs/2508.08011.
Fang J, Peng Y, Zhang X, Wang Y, Yi X, Zhang G, Xu Y, Wu B, Liu S, Li Z, Ren Z et al (2025) A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems. CoRR, abs/2508.07407.
Lewis-Lim S, Tan X, Zhao Z & Aletras N (2025) Analysing Chain of Thought Dynamics: Active Guidance or Unfaithful Post-hoc Rationalisation?. CoRR, abs/2508.19827.
Meng C, Tonolini F, Mo F, Aletras N, Yilmaz E & Kazai G (2025) Bridging the gap: from ad-hoc to proactive search in conversations. SIGIR '25: Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 64-74.
Chlapanis OS, Galanis D, Aletras N & Androutsopoulos I (2025) GreekBarBench: A Challenging Benchmark for Free-Text Legal Reasoning and Citations. CoRR, abs/2505.17267.
Williams M, Chrysostomou G, Jeronymo V & Aletras N (2025) Compressing Language Models for Specialized Domains. CoRR, abs/2502.18424.
Mu Y, Niu P, Bontcheva K & Aletras N (2024) Predicting and analyzing the popularity of false rumors in Weibo. Expert Systems with Applications, 243.
Chrysostomou G, Zhao Z, Williams M & Aletras N (2024) Investigating hallucinations in pruned large language models for abstractive summarization. Transactions of the Association for Computational Linguistics, 12, 1163-1181.
Zhao Z & Aletras N (2024) Comparing explanation faithfulness between multilingual and monolingual fine-tuned language models. Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 1, 3226-3244.
Mu Y, Song X, Bontcheva K & Aletras N (2024) Examining the limitations of computational rumor detection models trained on static datasets. 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings, 6739-6751.
Mu Y, Wu BP, Thorne W, Robinson A, Aletras N, Scarton C, Bontcheva K & Song X (2024) Navigating prompt complexity for zero-shot classification: a study of large language models in computational social science. Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), 12074-12086.
Yamaguchi A, Villavicencio A & Aletras N (2024) An Empirical Study on Cross-lingual Vocabulary Adaptation for Efficient Language Model Inference. Findings of the Association for Computational Linguistics: EMNLP 2024, 6760-6785.
Villegas DS, Preoţiuc-Pietro D & Aletras N (2024) Improving Multimodal Classification of Social Media Posts by Leveraging Image-Text Auxiliary Tasks. EACL 2024 - 18th Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2024, 1126-1137.
Jin M, Preoţiuc-Pietro D, Doğruöz AS & Aletras N (2024) Who is bragging more online? A large scale analysis of bragging in social media. 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings, 17575-17587.
Yamaguchi A, Villavicencio A & Aletras N (2024) Vocabulary Expansion for Low-resource Cross-lingual Transfer. CoRR, abs/2406.11477.
De Clercq O & Aletras N (2024) Introduction. EACL 2024 - 18th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of System Demonstrations, IV.
Mu Y, Jin M, Song X & Aletras N (2024) Enhancing Data Quality through Simple De-duplication: Navigating Responsible Computational Social Science Research. CoRR, abs/2410.03545.
Hughes A, Aletras N & Ma N (2024) How Private are Language Models in Abstractive Summarization?. CoRR, abs/2412.12040.
Yamaguchi A, Morishita T, Villavicencio A & Aletras N (2024) Vocabulary Expansion of Chat Models with Unlabeled Target Language Data. CoRR, abs/2412.11704.
Lalmas M, Zhang M, Santos R, Yilmaz E, Joho H, Ding W, Hauff C, Aletras N, Jaidka K & Alhoori H (2023) CIKM'23 Program Chairs' Welcome. International Conference on Information and Knowledge Management Proceedings, v.
Dommett K, Mensah SA, Zhu J, Stafford T & Aletras N (2023) Is there a permanent campaign for online political advertising? Investigating partisan and non-party campaign activity in the UK between 2018–2021. Journal of Political Marketing, 24(2), 143-161.
Vickers P, Barrault L, Monti E & Aletras N (2023) We Need to Talk About Classification Evaluation Metrics in NLP. Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (Volume 1: Long Papers).
Williams M & Aletras N (2023) Frustratingly Simple Memory Efficiency for Pre-trained Language Models via Dynamic Embedding Pruning. CoRR, abs/2309.08708.
Xue H & Aletras N (2023) Pit One Against Many: Leveraging Attention-head Embeddings for Parameter-efficient Multi-head Attention. Findings of the Association for Computational Linguistics: EMNLP 2023, 10355-10373.
Sánchez Villegas D, Goanta C & Aletras N (2023) A Multimodal Analysis of Influencer Content on Twitter. Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), 225-240.
Alajrami A, Margatina K & Aletras N (2023) Understanding the Role of Input Token Characters in Language Models: How Does Information Loss Affect Performance?. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 9085-9108.
Goanta C, Aletras N, Chalkidis I, Ranchordás S & Spanakis G (2023) Regulation and NLP (RegNLP): Taming Large Language Models. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 8712-8724.
Margatina K, Schick T, Aletras N & Dwivedi-Yu J (2023) Active Learning Principles for In-Context Learning with Large Language Models. Findings of the Association for Computational Linguistics: EMNLP 2023, 5011-5034.
Williams M & Aletras N (2023) How Does Calibration Data Affect the Post-training Pruning and Quantization of Large Language Models?. CoRR, abs/2311.09755.
Zhang L, Song H, Aletras N & Lu H (2022) Node-feature convolution for graph convolutional networks. Pattern Recognition, 128.
Li W & Aletras N (2022) Improving Graph-Based Text Representations with Character and Word Level N-grams. Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), 228-233.
Mu Y, Niu P & Aletras N (2022) Identifying and Characterizing Active Citizens who Refute Misinformation in Social Media. CoRR, abs/2204.10080.
Mu Y & Aletras N (2020) Identifying Twitter users who repost unreliable news sources with linguistic information. PeerJ Computer Science, 6.
Fomicheva M, Sun S, Yankovskaya L, Blain F, Guzmán F, Fishel M, Aletras N, Chaudhary V & Specia L (2020) Unsupervised quality estimation for neural machine translation. Transactions of the Association for Computational Linguistics, 8(2020), 539-555.
Aletras N, Tsarapatsanis D, Preoţiuc-Pietro D & Lampos V (2016) Predicting Judicial Decisions of the European Court of Human Rights: A Natural Language Processing Perspective. PeerJ Computer Science, 2.
Aletras N, Baldwin T, Lau J & Stevenson M (2015) Evaluating Topic Representations for Exploring Document Collections. Journal of the Association for Information Science and Technology.
Gonzalez-Agirre A, Rigau G, Agirre E, Aletras N & Stevenson M (2015) Why are these similar? Investigating item similarity types in a large Digital Library. Journal of the Association for Information Science and Technology.
Preoţiuc-Pietro D, Volkova S, Lampos V, Bachrach Y & Aletras N (2015) Studying user income through language, behaviour and affect in social media. PLoS ONE, 10(9).
Aletras N, Stevenson M & Clough P (2012) Computing similarity between items in a digital library of cultural heritage. Journal of Computing and Cultural Heritage, 5(4).
Chrysostomou G & Aletras N (2022) Flexible Instance-Specific Rationalization of NLP Models. Proceedings of the AAAI Conference on Artificial Intelligence, 36(10), 10545-10553.

Conference proceedings

Williams M & Aletras N (2025) Vocabulary-level Memory Efficiency for Language Model Fine-tuning. Proceedings of the 10th Workshop on Representation Learning for NLP (RepL4NLP-2025) (pp 185-196), May 2025.
Williams M, Chrysostomou G & Aletras N (2025) Self-calibration for Language Model Quantization and Pruning. Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers) (pp 10149-10167), April 2025.
Tan X, Valentino M, Akhter ME, Liakata M & Aletras N (2025) Enhancing Logical Reasoning in Language Models via Symbolically-Guided Monte Carlo Process Supervision. Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (pp 31874-31888), November 2025.
Meng C, Tonolini F, Mo F, Aletras N, Yilmaz E & Kazai G (2025) Bridging the Gap: From Ad-hoc to Proactive Search in Conversations. SIGIR (pp 64-74)
Hughes A, Aletras N & Ma N (2025) How Private are Language Models in Abstractive Summarization?. EMNLP (pp 30112-30130)
Chlapanis OS, Galanis D, Aletras N & Androutsopoulos I (2025) GreekBarBench: A Challenging Benchmark for Free-Text Legal Reasoning and Citations. EMNLP (Findings) (pp 25099-25119)
Alajrami A, Tan X & Aletras N (2025) Fine-Tuning on Noisy Instructions: Effects on Generalization and Performance. IJCNLP-AACL (long papers) (pp 728-742)
Tan X, Valentino M, Akhter ME, Liakata M & Aletras N (2025) Enhancing Logical Reasoning in Language Models via Symbolically-Guided Monte Carlo Process Supervision. EMNLP (pp 31886-31900)
Xue H, Moosavi NS & Aletras N (2025) Deconstructing Attention: Investigating Design Principles for Effective Language Modeling. IJCNLP-AACL (long papers) (pp 708-727)
Lewis-Lim S, Tan X, Zhao Z & Aletras N (2025) Analysing Chain of Thought Dynamics: Active Guidance or Unfaithful Post-hoc Rationalisation?. EMNLP (pp 29838-29853)
Yamaguchi A, Villavicencio A & Aletras N (2024) An empirical study on cross-lingual vocabulary adaptation for efficient language model inference. Findings of the Association for Computational Linguistics: EMNLP 2024 (pp 6760-6785). Miami, Florida, USA, 12 November 2024.
Jin M, Preoţiuc-Pietro D, Doğruöz AS & Aletras N (2024) Who Is Bragging More Online? A Large Scale Analysis of Bragging in Social Media. LREC/COLING (pp 17575-17587)
Soun R, Neerkaje A, Sawhney R, Aletras N & Nakov P (2024) RISE: Robust Early-exiting Internal Classifiers for Suicide Risk Evaluation. 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings (pp 14134-14145)
(2024) Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2024 - System Demonstrations, St. Julians, Malta, March 17-22, 2024. EACL (Demonstrations)
Mu Y, Wu BP, Thorne W, Robinson A, Aletras N, Scarton C, Bontcheva K & Song X (2024) Navigating Prompt Complexity for Zero-Shot Classification: A Study of Large Language Models in Computational Social Science. LREC/COLING (pp 12074-12086)
Villegas DS, Preoţiuc-Pietro D & Aletras N (2024) Improving Multimodal Classification of Social Media Posts by Leveraging Image-Text Auxiliary Tasks. EACL (Findings) (pp 1126-1137)
Mu Y, Song X, Bontcheva K & Aletras N (2024) Examining the Limitations of Computational Rumor Detection Models Trained on Static Datasets. LREC/COLING (pp 6739-6751)
Williams M & Aletras N (2024) On the Impact of Calibration Data in Post-training Quantization and Pruning. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (pp 10100-10118), August 2024.
Zhao Z & Aletras N (2024) Comparing Explanation Faithfulness between Multilingual and Monolingual Fine-tuned Language Models. NAACL-HLT (pp 3226-3244)
Tonolini F, Aletras N, Massiah J & Kazai G (2024) Bayesian Prompt Ensembles: Model Uncertainty Estimation for Black-Box Large Language Models. Findings of the Association for Computational Linguistics: ACL 2024 (pp 12229-12272), August 2024.
Mu Y, Jin M, Song X & Aletras N (2024) Enhancing Data Quality through Simple De-duplication: Navigating Responsible Computational Social Science Research. Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (pp 12477-12492), November 2024.
De Clercq O & Aletras N (2024) Introduction. Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations (pp IV)
Sun K, Zhang R, Mensah S, Aletras N, Mao Y & Liu X (2023) Self-training through Classifier Disagreement for Cross-Domain Opinion Target Extraction. Proceedings of the ACM Web Conference 2023 (pp 1594-1603)
Mu Y, Bontcheva K & Aletras N (2023) It’s about Time: Rethinking Evaluation on Rumor Detection Benchmarks using Chronological Splits. EACL 2023 - 17th Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2023 (pp 724-731)
Shi Z, Tonolini F, Aletras N, Yilmaz E, Kazai G & Jiao Y (2023) Rethinking Semi-supervised Learning with Language Models. Findings of the Association for Computational Linguistics: ACL 2023 (pp 5614-5634), July 2023.
Margatina K & Aletras N (2023) On the Limitations of Simulating Active Learning. Findings of the Association for Computational Linguistics: ACL 2023 (pp 4402-4419), July 2023.
Mensah S, Sun K & Aletras N (2023) Trading Syntax Trees for Wordpieces: Target-oriented Opinion Words Extraction with Wordpieces and Aspect Enhancement. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers) (pp 999-1007), July 2023.
Zhao Z & Aletras N (2023) Incorporating attribution importance for improving faithfulness metrics. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Vol. 1 (pp 4732-4745). Toronto, Canada, 9 July 2023.
Feng Y, Jiao Y, Prasad A, Aletras N, Yilmaz E & Kazai G (2023) Schema-Guided User Satisfaction Modeling for Task-Oriented Dialogues. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (pp 2079-2091), July 2023.
Mu Y, Bontcheva K & Aletras N (2023) It’s about Time: Rethinking Evaluation on Rumor Detection Benchmarks using Chronological Splits. Findings of the Association for Computational Linguistics: EACL 2023 (pp 736-743), May 2023.
Tonolini F, Aletras N, Jiao Y & Kazai G (2023) Robust Weak Supervision with Variational Auto-Encoders. Proceedings of Machine Learning Research, Vol. 202 (pp 34394-34408)
Xue H & Aletras N (2023) Pit One Against Many: Leveraging Attention-head Embeddings for Parameter-efficient Multi-head Attention. EMNLP (Findings) (pp 10355-10373)
Alajrami A, Margatina K & Aletras N (2023) Understanding the Role of Input Token Characters in Language Models: How Does Information Loss Affect Performance?. EMNLP (pp 9085-9108)
Margatina K, Schick T, Aletras N & Dwivedi-Yu J (2023) Active Learning Principles for In-Context Learning with Large Language Models. EMNLP (Findings) (pp 5011-5034)
Goanta C, Aletras N, Chalkidis I, Ranchordás S & Spanakis G (2023) Regulation and NLP (RegNLP): Taming Large Language Models. EMNLP (pp 8712-8724)
Zhao Z & Aletras N (2023) Incorporating Attribution Importance for Improving Faithfulness Metrics. ACL (1) (pp 4732-4745)
Villegas DS, Goanta C & Aletras N (2023) A Multimodal Analysis of Influencer Content on Twitter. IJCNLP (1) (pp 225-240)
Vickers P, Barrault L, Monti E & Aletras N (2023) We Need to Talk About Classification Evaluation Metrics in NLP. IJCNLP (1) (pp 498-510)
Feng Y, Jiao Y, Prasad A, Aletras N, Yilmaz E & Kazai G (2023) Schema-Guided User Satisfaction Modeling for Task-Oriented Dialogues. ACL (1) (pp 2079-2091)
Sawhney R, Agarwal S, Neerkaje AT, Aletras N, Nakov P & Flek L (2022) Towards Suicide Ideation Detection Through Online Conversational Context. Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (pp 1716-1727)
Mu Y, Niu P & Aletras N (2022) Identifying and Characterizing Active Citizens who Refute Misinformation in Social Media. 14th ACM Web Science Conference 2022 (pp 401-410)
Fomicheva M, Specia L & Aletras N (2022) Translation Error Detection as Rationale Extraction. Findings of the Association for Computational Linguistics: ACL 2022 (pp 4148-4159), May 2022.
Margatina K, Barrault L & Aletras N (2022) On the Importance of Effectively Adapting Pretrained Language Models for Active Learning. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers) (pp 825-836), May 2022.
Chalkidis I, Jana A, Hartung D, Bommarito M, Androutsopoulos I, Katz D & Aletras N (2022) LexGLUE: A Benchmark Dataset for Legal Language Understanding in English. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), May 2022.
Alajrami A & Aletras N (2022) How does the pre-training objective affect what large language models learn about linguistic properties?. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers) (pp 131-147), May 2022.
Bose T, Aletras N, Illina I & Fohr D (2022) Dynamically Refined Regularization for Improving Cross-corpora Hate Speech Detection. Findings of the Association for Computational Linguistics: ACL 2022 (pp 372-382), May 2022.
Jin M, Preoţiuc-Pietro D, Doğruöz AS & Aletras N (2022) Automatic Identification and Classification of Bragging in Social Media. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (pp 3945-3959), May 2022.
Chrysostomou G & Aletras N (2022) An Empirical Study on Explanations in Out-of-Domain Settings. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), May 2022.
Ao X, Sánchez Villegas D, Preoţiuc-Pietro D & Aletras N (2022) Combining Humor and Sarcasm for Improving Political Parody Detection. Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, July 2022.
Xue H & Aletras N (2022) HashFormers: Towards Vocabulary-independent Pre-trained Transformers. Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (pp 7862-7874), December 2022.
Fomicheva M, Specia L & Aletras N (2022) Translation Error Detection as Rationale Extraction. ACL (Findings) (pp 4148-4159)
Wen J, Zhu Y, Zhang J, Zhou J & Huang M (2022) AutoCAD: Automatically Generate Counterfactuals for Mitigating Shortcut Learning. Findings of the Association for Computational Linguistics: EMNLP 2022 (pp 2302-2317), December 2022.
Zhao Z, Chrysostomou G, Bontcheva K & Aletras N (2022) On the Impact of Temporal Concept Drift on Model Explanations. Findings of the Association for Computational Linguistics: EMNLP 2022 (pp 4039-4054), December 2022.
(2022) Proceedings of the Natural Legal Language Processing Workshop, NLLP@EMNLP 2022, Abu Dhabi, United Arab Emirates (Hybrid), December 8, 2022. NLLP@EMNLP
Chalkidis I, Jana A, Hartung D, Bommarito MJ II, Androutsopoulos I, Katz DM & Aletras N (2022) LexGLUE: A Benchmark Dataset for Legal Language Understanding in English. ACL (1) (pp 4310-4330)
Bose T, Aletras N, Illina I & Fohr D (2022) Domain Classification-based Source-specific Term Penalization for Domain Adaptation in Hate-speech Detection. Proceedings - International Conference on Computational Linguistics, COLING, Vol. 29(1) (pp 6656-6666)
Alajrami A & Aletras N (2022) How does the pre-training objective affect what large language models learn about linguistic properties?. ACL (2) (pp 131-147)
Bose T, Aletras N, Illina I & Fohr D (2022) Dynamically Refined Regularization for Improving Cross-corpora Hate Speech Detection. ACL (Findings) (pp 372-382)
Bose T, Aletras N, Illina I & Fohr D (2022) Domain Classification-based Source-specific Term Penalization for Domain Adaptation in Hate-speech Detection. COLING (pp 6656-6666)
Li W & Aletras N (2022) Improving Graph-Based Text Representations with Character and Word Level N-grams. AACL/IJCNLP (2) (pp 228-233)
Li M, Chen J, Mensah S, Aletras N, Yang X & Ye Y (2022) A Hierarchical N-Gram Framework for Zero-Shot Link Prediction. Findings of the Association for Computational Linguistics: EMNLP 2022 (pp 2498-2509), December 2022.
Chrysostomou G & Aletras N (2022) An Empirical Study on Explanations in Out-of-Domain Settings. ACL (1) (pp 6920-6938)
Ao X, Villegas DS, Preoţiuc-Pietro D & Aletras N (2022) Combining Humor and Sarcasm for Improving Political Parody Detection. NAACL-HLT (pp 1800-1807)
Jin M, Preoţiuc-Pietro D, Doğruöz AS & Aletras N (2022) Automatic Identification and Classification of Bragging in Social Media. ACL (1) (pp 3945-3959)
Li M, Chen J, Mensah S, Aletras N, Yang X & Ye Y (2022) A Hierarchical N-Gram Framework for Zero-Shot Link Prediction. EMNLP (Findings) (pp 2498-2509)
Gajbhiye A, Fomicheva M, Alva-Manchego F, Blain F, Obamuyide A, Aletras N & Specia L (2021) Knowledge distillation for quality estimation. Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 (pp 5091-5099). Bangkok, Thailand (virtual conference), 1 August 2021.
Sánchez Villegas D & Aletras N (2021) Point-of-Interest Type Prediction using Text and Images. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (pp 7785-7797), November 2021.
Yamaguchi A, Chrysostomou G, Margatina K & Aletras N (2021) Frustratingly Simple Pretraining Alternatives to Masked Language Modeling. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (pp 3116-3125), November 2021.
Chrysostomou G & Aletras N (2021) Enjoy the Salience: Towards Better Transformer-based Faithful Explanations with Word Salience. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (pp 8189-8200), November 2021.
Margatina K, Vernikos G, Barrault L & Aletras N (2021) Active Learning by Acquiring Contrastive Examples. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, November 2021.
Aletras N, Androutsopoulos I, Barrett L, Goantă C, Preoţiuc-Pietro D, Agnoloni T, Ash E, Baldwin B, Blair-Stanek A, Borchmann L, Chalkidis I et al (2021) Introduction. Natural Legal Language Processing, NLLP 2021 - Proceedings of the 2021 Workshop (pp III)
Chalkidis I, Fergadiotis M, Tsarapatsanis D, Aletras N, Androutsopoulos I & Malakasiotis P (2021) Paragraph-level Rationale Extraction through Regularization: A case study on European Court of Human Rights Cases. NAACL-HLT 2021 - 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference (pp 226-241)
Jin M & Aletras N (2021) Modeling the Severity of Complaints in Social Media. NAACL-HLT 2021 - 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference (pp 2264-2274)
Tsarapatsanis D & Aletras N (2021) On the Ethical Limits of Natural Language Processing on Legal Text. Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 (pp 3590-3599)
Vickers P, Aletras N, Monti E & Barrault L (2021) In Factuality: Efficient Integration of Relevant Facts for Visual Question Answering. Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 2: Short Papers) (pp 468-475), August 2021.
Chrysostomou G & Aletras N (2021) Improving the Faithfulness of Attention-based Explanations with Task-specific Information for Text Classification. ACL/IJCNLP (1) (pp 477-488)
Villegas DS, Mokaram S & Aletras N (2021) Analyzing Online Political Advertisements. Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 (pp 3669-3680)
Mensah S, Sun K & Aletras N (2021) An Empirical Study on Leveraging Position Embeddings for Target-oriented Opinion Words Extraction. EMNLP 2021 - 2021 Conference on Empirical Methods in Natural Language Processing, Proceedings (pp 9174-9179)
(2021) Proceedings of the Natural Legal Language Processing Workshop 2021, NLLP@EMNLP 2021, Punta Cana, Dominican Republic, November 10, 2021. NLLP@EMNLP
Chrysostomou G & Aletras N (2021) Enjoy the Salience: Towards Better Transformer-based Faithful Explanations with Word Salience. EMNLP (1) (pp 8189-8200)
Margatina K, Vernikos G, Barrault L & Aletras N (2021) Active Learning by Acquiring Contrastive Examples. EMNLP (1) (pp 650-663)
Yamaguchi A, Chrysostomou G, Margatina K & Aletras N (2021) Frustratingly Simple Pretraining Alternatives to Masked Language Modeling. EMNLP (1) (pp 3116-3125)
Villegas DS & Aletras N (2021) Point-of-Interest Type Prediction using Text and Images. EMNLP (1) (pp 7785-7797)
Mensah S, Sun K & Aletras N (2021) An Empirical Study on Leveraging Position Embeddings for Target-oriented Opinion Words Extraction. EMNLP (1) (pp 9174-9179)
Gargett A, Firth R & Aletras N (2020) LegalOps: A Summarization Corpus of Legal Opinions. 2020 IEEE International Conference on Big Data (Big Data) (pp 2117-2120), 10 December 2020 - 13 December 2020.
Alokaili A, Aletras N & Stevenson M (2020) Automatic generation of topic labels. Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (pp 1965-1968). Online conference, 25 July 2020.
Blain F, Aletras N & Specia L (2020) Quality In, Quality Out: Learning from Actual Mistakes. Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, EAMT 2020 (pp 145-154)
Maronikolakis A, Villegas DS, Preoţiuc-Pietro D & Aletras N (2020) Analyzing political parody in social media. Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp 4373-4384)
Aletras N, Androutsopoulos I, Barrett L, Meyers A & Preoţiuc-Pietro D (2020) Introduction to the NLLP 2020 workshop. CEUR Workshop Proceedings, Vol. 2645
Alokaili A, Aletras N & Stevenson M (2020) Automatic Generation of Topic Labels. SIGIR (pp 1965-1968)
Chalkidis I, Fergadiotis M, Malakasiotis P, Aletras N & Androutsopoulos I (2020) LEGAL-BERT: "Preparing the Muppets for Court". EMNLP (Findings), Vol. EMNLP 2020 (pp 2898-2904)
Chalkidis I, Fergadiotis M, Kotitsas S, Malakasiotis P, Aletras N & Androutsopoulos I (2020) An Empirical Study on Large-Scale Multi-Label Text Classification Including Few and Zero-Shot Labels. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) (pp 7503-7515)
Jin M & Aletras N (2020) Complaint Identification in Social Media with Transformer Networks. COLING 2020 - 28th International Conference on Computational Linguistics, Proceedings of the Conference (pp 1765-1771)
Sánchez Villegas D, Preoţiuc-Pietro D & Aletras N (2020) Point-of-Interest Type Inference from Social Media Text. Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing (pp 804-810), December 2020.
Chalkidis I, Fergadiotis M, Malakasiotis P, Aletras N & Androutsopoulos I (2020) LEGAL-BERT: The Muppets straight out of Law School. Findings of the Association for Computational Linguistics: EMNLP 2020
Chalkidis I, Androutsopoulos I & Aletras N (2019) Neural Legal Judgment Prediction in English. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (pp 4317-4323). Florence, Italy, 28 July 2019 - 2 August 2019.
Preoţiuc-Pietro D, Gaman M & Aletras N (2019) Automatically identifying complaints in social media. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (pp 5008-5019). Florence, Italy, 28 July 2019 - 2 August 2019.
Chalkidis I, Fergadiotis E, Malakasiotis P, Aletras N & Androutsopoulos I (2019) Extreme multi-label legal text classification: a case study in EU legislation. Proceedings of the Natural Legal Language Processing Workshop 2019 (pp 78-87). Minneapolis, Minnesota, USA, 7 June 2019 - 7 June 2019. View this article in WRRO
Alokaili A, Aletras N & Stevenson M (2019) Re-ranking words to improve interpretability of automatically generated topics. Proceedings of 13th International Conference on Computational Semantics (IWCS) (pp 43-54). Gothenburg, Sweden, 23 May 2019 - 23 May 2019. View this article in WRRO
Karmakharm T, Aletras N & Bontcheva K (2019) Journalist-in-the-Loop: Continuous Learning as a Service for Rumour Analysis. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP): System Demonstrations (pp 115-120), November 2019 - November 2019.
Preotiuc-Pietro D, Gaman M & Aletras N (2019) Automatically Identifying Complaints in Social Media. 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019) (pp 5008-5019)
Chalkidis I, Androutsopoulos I & Aletras N (2019) Neural Legal Judgment Prediction in English. 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019) (pp 4317-4323)
Tsakalidis A, Aletras N, Cristea AI & Liakata M (2018) Nowcasting the stance of social media users in a sudden vote: The case of the Greek referendum. CIKM '18 Proceedings of the 27th ACM International Conference on Information and Knowledge Management (pp 367-376). Torino, Italy, 22 October 2018 - 22 October 2018. View this article in WRRO
Aletras N & Chamberlain BP (2018) Predicting Twitter user socioeconomic attributes with network and language information. Proceedings of the 29th ACM Conference on Hypertext and Social Media (pp 20-24). Baltimore, MD, USA, 9 July 2018 - 9 July 2018. View this article in WRRO
Tsakalidis A, Aletras N, Cristea AI & Liakata M (2018) Nowcasting the Stance of Social Media Users in a Sudden Vote: The Case of the Greek Referendum. CIKM (pp 367-376)
Sorodoc I, Lau JH, Aletras N & Baldwin T (2017) Multimodal Topic Labelling. Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers (pp 701-706), April 2017 - April 2017.
Aletras N & Mittal A (2017) Labeling topics with images using a neural network. Advances in Information Retrieval : 39th European Conference on IR Research, ECIR 2017, Aberdeen, UK, April 8-13, 2017, Proceedings (pp 500-505). Aberdeen, UK, 8 April 2017 - 13 April 2017.
Lampos V, Aletras N, Geyti JK, Zou B & Cox IJ (2016) Inferring the Socioeconomic Status of Social Media Users Based on Behaviour and Language (pp 689-695)
Aletras N (2015) Session details: Short Papers. Proceedings of the 2015 Workshop on Topic Models: Post-Processing and Applications
Aletras N, Lau JH, Baldwin T & Stevenson M (2015) TM 2015 -- Topic Models. Proceedings of the 24th ACM International on Conference on Information and Knowledge Management (pp 1953-1954)
Preoţiuc-Pietro D, Lampos V & Aletras N (2015) An analysis of the user occupational class through Twitter content. Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), Vol. 1: Long papers (pp 1754-1764), 26 July 2015 - 31 July 2015.
(2015) Proceedings of the 2015 Workshop on Topic Models: Post-Processing and Applications, TM 2015, Melbourne, Australia, October 19, 2015. TM@CIKM
Aletras N & Stevenson M (2015) A Hybrid Distributional and Knowledge-based Model of Lexical Semantics. Proceedings of the Fourth Joint Conference on Lexical and Computational Semantics (pp 20-29), June 2015 - June 2015.
Aletras N, Baldwin T, Lau JH & Stevenson M (2014) Representing topics labels for exploring digital libraries. IEEE/ACM Joint Conference on Digital Libraries, 8 September 2014 - 12 September 2014.
Aletras N & Stevenson M (2014) Labelling Topics using Unsupervised Graph-based Methods. Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL 2014), Vol. 2 (pp 631-636)
Aletras N & Stevenson M (2014) Measuring the Similarity between Automatically Generated Topics. Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, volume 2: Short Papers, April 2014 - April 2014.
Lampos V, Aletras N, Preotiuc-Pietro D & Cohn T (2014) Predicting and Characterising User Impact on Twitter. Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics
Agirre E, Aletras N, Clough P, Fernando S, Goodale P, Hall M, Soroa A & Stevenson M (2013) PATHS: A system for accessing cultural heritage collections. Proceedings of the Annual Meeting of the Association for Computational Linguistics, Vol. 2013-August (pp 151-156)
Agirre E, Aletras N, Gonzalez-Agirre A, Rigau G & Stevenson M (2013) UBC UOS-TYPED: Regression for Typed-similarity. *SEM 2013: 2nd Joint Conference on Lexical and Computational Semantics, Vol. 1 (pp 132-137)
Aletras N & Stevenson M (2013) Evaluating topic coherence using distributional semantics. Proceedings of the 10th International Conference on Computational Semantics (IWCS 2013), Long Papers
Aletras N & Stevenson M (2013) Representing topics using images. NAACL HLT 2013: 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Main Conference (pp 158-167)
Agirre E, Aletras N, Gonzalez-Agirre A, Rigau G & Stevenson M (2013) UBC UOS-TYPED: Regression for Typed-similarity. *SEM 2013: 2nd Joint Conference on Lexical and Computational Semantics, Proceedings of the Main Conference and the Shared Task: Semantic Textual Similarity (pp 132-137)
Goodale P, Clough P, Ford N, Hall M, Stevenson M, Fernando S, Aletras N, Fernie K, Archer P & De Polo A (2012) User-centred design to support exploration and path creation in cultural heritage collections. CEUR Workshop Proceedings, Vol. 909 (pp 75-78)
Hall M, Agirre E, Aletras N, Bergheim R, Chandrinos K, Clough P, Fernando S, Fernie K, Goodale P, Griffiths J, Lopez de Lacalle O et al (2012) PATHS - Exploring Digital Cultural Heritage Spaces. Theory and Practice of Digital Libraries 2012. Cyprus
Wood G & Demirbag M (2012) Introduction
Aletras N & Stevenson M (2012) Computing similarity between cultural heritage items using multimodal features. Proceedings of the 6th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH 2012) at the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2012) (pp 85-93)

Preprints

Villegas DS, Lewis-Lim S, Aletras N & Elliott D (2026) Reasoning Dynamics and the Limits of Monitoring Modality Reliance in Vision-Language Models, arXiv.
Shetty A, Naseem U, Aletras N, Dras M, Ji H & Nakov P (2026) Towards Pluralistic Alignment of LLMs: A Comprehensive Survey, MDPI AG.
Cao M, Tan X, Akhter ME, Valentino M, Liakata M, Wang X & Aletras N (2026) Fundamental Reasoning Paradigms Induce Out-of-Domain Generalization in Language Models, arXiv.
Permadi VA, Tan X, Moosavi NS & Aletras N (2026) No Shortcuts to Culture: Indonesian Multi-hop Question Answering for Complex Cultural Understanding, arXiv.
Karouzos C, Tan X & Aletras N (2026) An Empirical Study on Preference Tuning Generalization and Diversity Under Domain Shift, arXiv.
Lewis-Lim S, Tan X, Zhao Z & Aletras N (2026) Can Confidence Estimates Decide When Chain-of-Thought Is Necessary for LLMs?, arXiv.
Yamaguchi A, Mi M & Aletras N (2026) Enhancing Linguistic Competence of Language Models through Pre-training with Language Learning Tasks, arXiv.
Yamaguchi A, Morishita T, Villavicencio A & Aletras N (2025) Mitigating Catastrophic Forgetting in Target Language Adaptation of LLMs via Source-Shielded Updates, arXiv.
Xue H, Moosavi NS & Aletras N (2025) Deconstructing Attention: Investigating Design Principles for Effective Language Modeling, arXiv.
Hughes A, Duddu V, Asokan N, Aletras N & Ma N (2025) PATCH: Mitigating PII Leakage in Language Models with Privacy-Aware Targeted Circuit PatcHing.
Alajrami A, Tan X & Aletras N (2025) Fine-Tuning on Noisy Instructions: Effects on Generalization and Performance, arXiv.
Yamaguchi A, Morishita T, Villavicencio A & Aletras N (2025) Adapting Chat Language Models Using Only Target Unlabeled Language Data, arXiv.
Tan X, Valentino M, Akhter M, Liakata M & Aletras N (2025) Enhancing Logical Reasoning in Language Models via Symbolically-Guided Monte Carlo Process Supervision, arXiv.
Fang J, Peng Y, Zhang X, Wang Y, Yi X, Zhang G, Xu Y, Wu B, Liu S, Li Z, Ren Z et al (2025) A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems, arXiv.
Lewis-Lim S, Tan X, Zhao Z & Aletras N (2025) Analysing Chain of Thought Dynamics: Active Guidance or Unfaithful Post-hoc Rationalisation?, arXiv.
Cao M, Wang X & Aletras N (2025) Progressive Depth Up-scaling via Optimal Transport, arXiv.
Meng C, Tonolini F, Mo F, Aletras N, Yilmaz E & Kazai G (2025) Bridging the Gap: From Ad-hoc to Proactive Search in Conversations, arXiv.
Chlapanis OS, Galanis D, Aletras N & Androutsopoulos I (2025) GreekBarBench: A Challenging Benchmark for Free-Text Legal Reasoning and Citations, arXiv.
Williams M, Chrysostomou G & Aletras N (2025) Self-calibration for Language Model Quantization and Pruning, arXiv.
Williams M, Chrysostomou G, Jeronymo V & Aletras N (2025) Compressing Language Models for Specialized Domains, arXiv.
Hughes A, Ma N & Aletras N (2024) How Private are Language Models in Abstractive Summarization?, arXiv.
Mu Y, Jin M, Song X & Aletras N (2024) Enhancing Data Quality through Simple De-duplication: Navigating Responsible Computational Social Science Research, arXiv.
Yamaguchi A, Villavicencio A & Aletras N (2024) How Can We Effectively Expand the Vocabulary of LLMs with 0.01GB of Target Language Text?, arXiv.
Jin M, Preoţiuc-Pietro D, Doğruöz AS & Aletras N (2024) Who is bragging more online? A large scale analysis of bragging in social media, arXiv.
Zhao Z & Aletras N (2024) Comparing Explanation Faithfulness between Multilingual and Monolingual Fine-tuned Language Models, arXiv.
Vickers P, Barrault L, Monti E & Aletras N (2024) We Need to Talk About Classification Evaluation Metrics in NLP, arXiv.
Williams M & Aletras N (2023) On the Impact of Calibration Data in Post-training Quantization and Pruning, arXiv.
Chrysostomou G, Zhao Z, Williams M & Aletras N (2023) Investigating Hallucinations in Pruned Large Language Models for Abstractive Summarization, arXiv.
Alajrami A, Margatina K & Aletras N (2023) Understanding the Role of Input Token Characters in Language Models: How Does Information Loss Affect Performance?, arXiv.
Xue H & Aletras N (2023) Pit One Against Many: Leveraging Attention-head Embeddings for Parameter-efficient Multi-head Attention, arXiv.
Goanta C, Aletras N, Chalkidis I, Ranchordas S & Spanakis G (2023) Regulation and NLP (RegNLP): Taming Large Language Models, arXiv.
Mu Y, Song X, Bontcheva K & Aletras N (2023) Examining the Limitations of Computational Rumor Detection Models Trained on Static Datasets, arXiv.
Williams M & Aletras N (2023) Vocabulary-level Memory Efficiency for Language Model Fine-tuning, arXiv.
Villegas DS, Preoţiuc-Pietro D & Aletras N (2023) Improving Multimodal Classification of Social Media Posts by Leveraging Image-Text Auxiliary Tasks, arXiv.
Villegas DS, Goanta C & Aletras N (2023) A Multimodal Analysis of Influencer Content on Twitter, arXiv.
Feng Y, Jiao Y, Prasad A, Aletras N, Yilmaz E & Kazai G (2023) Schema-Guided User Satisfaction Modeling for Task-Oriented Dialogues, arXiv.
Margatina K, Schick T, Aletras N & Dwivedi-Yu J (2023) Active Learning Principles for In-Context Learning with Large Language Models, arXiv.
Mu Y, Wu BP, Thorne W, Robinson A, Aletras N, Scarton C, Bontcheva K & Song X (2023) Navigating Prompt Complexity for Zero-Shot Classification: A Study of Large Language Models in Computational Social Science, arXiv.
Shi Z, Tonolini F, Aletras N, Yilmaz E, Kazai G & Jiao Y (2023) Rethinking Semi-supervised Learning with Language Models, arXiv.
Margatina K & Aletras N (2023) On the Limitations of Simulating Active Learning, arXiv.
Mensah S, Sun K & Aletras N (2023) Trading Syntax Trees for Wordpieces: Target-oriented Opinion Words Extraction with Wordpieces and Aspect Enhancement, arXiv.
Zhao Z & Aletras N (2023) Incorporating Attribution Importance for Improving Faithfulness Metrics, arXiv.
Sun K, Zhang R, Mensah S, Aletras N, Mao Y & Liu X (2023) Self-training through Classifier Disagreement for Cross-Domain Opinion Target Extraction, arXiv.
Mu Y, Bontcheva K & Aletras N (2023) It's about Time: Rethinking Evaluation on Rumor Detection Benchmarks using Chronological Splits, arXiv.
Zhao Z, Chrysostomou G, Bontcheva K & Aletras N (2022) On the Impact of Temporal Concept Drift on Model Explanations, arXiv.
Xue H & Aletras N (2022) HashFormers: Towards Vocabulary-independent Pre-trained Transformers, arXiv.
Li W & Aletras N (2022) Improving Graph-Based Text Representations with Character and Word Level N-grams, arXiv.
Bose T, Aletras N, Illina I & Fohr D (2022) Domain Classification-based Source-specific Term Penalization for Domain Adaptation in Hate-speech Detection, arXiv.
Ao X, Villegas DS, Preoţiuc-Pietro D & Aletras N (2022) Combining Humor and Sarcasm for Improving Political Parody Detection, arXiv.
Mu Y, Niu P & Aletras N (2022) Identifying and Characterizing Active Citizens who Refute Misinformation in Social Media, arXiv.
Li M, Chen J, Mensah S, Aletras N, Yang X & Ye Y (2022) A Hierarchical N-Gram Framework for Zero-Shot Link Prediction, arXiv.
Bose T, Aletras N, Illina I & Fohr D (2022) Dynamically Refined Regularization for Improving Cross-corpora Hate Speech Detection, arXiv.
Alajrami A & Aletras N (2022) How does the pre-training objective affect what large language models learn about linguistic properties?, arXiv.
Jin M, Preoţiuc-Pietro D, Doğruöz AS & Aletras N (2022) Automatic Identification and Classification of Bragging in Social Media, arXiv.
Chrysostomou G & Aletras N (2022) An Empirical Study on Explanations in Out-of-Domain Settings, arXiv.
Chalkidis I, Jana A, Hartung D, Bommarito M, Androutsopoulos I, Katz DM & Aletras N (2021) LexGLUE: A Benchmark Dataset for Legal Language Understanding in English, arXiv.
Mensah S, Sun K & Aletras N (2021) An Empirical Study on Leveraging Position Embeddings for Target-oriented Opinion Words Extraction, arXiv.
Gajbhiye A, Fomicheva M, Alva-Manchego F, Blain F, Obamuyide A, Aletras N & Specia L (2021) Knowledge Distillation for Quality Estimation, arXiv.
Villegas DS, Mokaram S & Aletras N (2021) Analyzing Online Political Advertisements, arXiv.
Tsarapatsanis D & Aletras N (2021) On the Ethical Limits of Natural Language Processing on Legal Text, arXiv.
Chrysostomou G & Aletras N (2021) Improving the Faithfulness of Attention-based Explanations with Task-specific Information for Text Classification, arXiv.
Margatina K, Barrault L & Aletras N (2021) On the Importance of Effectively Adapting Pretrained Language Models for Active Learning, arXiv.
Chrysostomou G & Aletras N (2021) Flexible Instance-Specific Rationalization of NLP Models, arXiv.
Chalkidis I, Fergadiotis M, Tsarapatsanis D, Aletras N, Androutsopoulos I & Malakasiotis P (2021) Paragraph-level Rationale Extraction through Regularization: A case study on European Court of Human Rights Cases, arXiv.
Jin M & Aletras N (2021) Modeling the Severity of Complaints in Social Media, arXiv.
Jin M & Aletras N (2020) Complaint Identification in Social Media with Transformer Networks, arXiv.
Chalkidis I, Fergadiotis M, Malakasiotis P, Aletras N & Androutsopoulos I (2020) LEGAL-BERT: The Muppets straight out of Law School, arXiv.
Chalkidis I, Fergadiotis M, Kotitsas S, Malakasiotis P, Aletras N & Androutsopoulos I (2020) An Empirical Study on Large-Scale Multi-Label Text Classification Including Few and Zero-Shot Labels, arXiv.
Villegas DS, Preoţiuc-Pietro D & Aletras N (2020) Point-of-Interest Type Inference from Social Media Text, arXiv.
Alokaili A, Aletras N & Stevenson M (2020) Automatic Generation of Topic Labels, arXiv.
Fomicheva M, Sun S, Yankovskaya L, Blain F, Guzmán F, Fishel M, Aletras N, Chaudhary V & Specia L (2020) Unsupervised Quality Estimation for Neural Machine Translation, arXiv.
Maronikolakis A, Villegas DS, Preotiuc-Pietro D & Aletras N (2020) Analyzing Political Parody in Social Media, arXiv.
Preotiuc-Pietro D, Gaman M & Aletras N (2019) Automatically Identifying Complaints in Social Media, arXiv.
Chalkidis I, Androutsopoulos I & Aletras N (2019) Neural Legal Judgment Prediction in English, arXiv.
Alokaili A, Aletras N & Stevenson M (2019) Re-Ranking Words to Improve Interpretability of Automatically Generated Topics, arXiv.
Zhang L, Song H, Aletras N & Lu H (2018) Graph Node-Feature Convolution for Representation Learning, arXiv.
Tsakalidis A, Aletras N, Cristea AI & Liakata M (2018) Nowcasting the Stance of Social Media Users in a Sudden Vote: The Case of the Greek Referendum, arXiv.
Aletras N & Chamberlain BP (2018) Predicting Twitter User Socioeconomic Attributes with Network and Language Information, arXiv.
Aletras N & Mittal A (2016) Labeling Topics with Images using Neural Networks, arXiv.

Grants

Learning Logical Structure for a Better Proving Experience, Renaissance Philanthropy, 09/2025 - 09/2027, £328,937, as Co-PI
Quantifying, improving and guiding reasoning / Chain-of-Thought (CoT) approaches, Industrial, 08/2025 - 07/2026, £31,605, as PI
Efficient Deployment of Large Language Models for Industrial Applications, Industrial, 07/2024 - 07/2025, £30,360, as PI
AdSoLve: Addressing socio-technical limitations of Large Language Models (LLMs) for medical and social computing, RAI (EPSRC), 05/2024 - 03/2028, £3,498,789, as Co-PI
ESPERANTO: Exchanges for SPEech ReseArch aNd TechnOlogies, EC H2020, 01/2021 - 12/2025, £13,800, as Co-I
UKRI Centre for Doctoral Training in Speech and Language Technologies and their Applications, EPSRC, 04/2019 - 09/2027, £5,508,850, as Co-PI
SAI: Social Explainable Artificial Intelligence, EPSRC, 02/2021 - 01/2024, £366,348, as PI
Understanding online political advertising: perceptions, uses and regulation, Leverhulme, 01/2021 - 07/2024, £395,011, as Co-PI
Responsible AI for Inclusive, Democratic Societies: A cross-disciplinary approach to detecting and countering abusive language online, ESRC, 02/2020 - 01/2024, £508,135, as Co-PI
Bergamot: Browser-based Multilingual Translation, EC H2020, 01/2019 - 12/2021, £473,113, as Co-PI
Innovating Next Generation Services Through Collaborative Design, ESRC, 12/2018 - 11/2020, £284,926, as Co-PI
Journalist-in-the-Loop Machine Learning as a Service for Rumour Analysis, Industrial, 11/2018 - 12/2019, £44,642, as Co-PI
Alexa Fellowship, Amazon, 08/2018 - 08/2021, £73,000, as PI

Links