Professor Thomas Hain
School of Computer Science
Professor of Speech and Audio Technology
Director of CDT in Speech and Language Technologies
Director of Liveperson Centre
Member of the Speech and Hearing (SpandH) research group


Full contact details
School of Computer Science
Regent Court (DCS)
211 Portobello
Sheffield
S1 4DP
- Profile
-
Thomas Hain obtained the degree 'Dipl.-Ing' in Electrical/Communication Engineering in 1994 from the University of Technology, Vienna. He joined the Speech Technology Group at Philips Speech Processing which he left in a senior position.
In 1997 he joined the Speech, Vision and Robotics Group at the Cambridge University Engineering Department as Research Associate and PhD Student. He took up a Lectureship at the SVR group in 2001.
In 2004 he joined the Speech and Hearing Group to work as Lecturer in Computer Science. He was promoted to Senior Lecturer in 2008 and Reader in 2011.
- Research interests
-
Thomas' research interests cover many areas in natural language processing, speech, audio and multimedia technology, machine learning, and complex system optimisation and design.
His interests include: large vocabulary continuous speech recognition, non-linear methods in speech processing, low bit-rate speech coding, machine learning, multi-modal systems, image classification, microphone arrays, system and resource optimisation.
- Publications
-
Books
Journal articles
- Automatic detection of behavioural codes in team interactions. Computer Speech and Language, 74.
- Att-TasNet: attending to encodings in time-domain audio speech separation of noisy, reverberant speech mixtures. Frontiers in Signal Processing, 2. View this article in WRRO
- H-VECTORS: Improving the robustness in utterance-level speaker embeddings using a hierarchical attention model.. Neural Netw, 142, 329-339.
- Evaluation of the effectiveness and efficiency of state-of-the-art features and models for automatic speech recognition error detection. Journal of Big Data, 8. View this article in WRRO
- System-Independent ASR error detection and classification using Recurrent Neural Network. Computer Speech & Language, 55, 187-199. View this article in WRRO
- Recurrent Neural Network Language Model Adaptation for Multi-Genre Broadcast Speech Recognition and Alignment. IEEE/ACM Transactions on Audio, Speech and Language Processing, 27(3), 572-582. View this article in WRRO
- Lightly supervised alignment of subtitles on multi-genre broadcasts. Multimedia Tools and Applications, 77(23), 30533-30550. View this article in WRRO
- Unsupervised crosslingual adaptation of tokenisers for spoken language recognition. Computer Speech and Language, 46, 327-342. View this article in WRRO
- Acoustic adaptation to dynamic background conditions with asynchronous transformations. Computer Speech and Language, 41, 180-194. View this article in WRRO
- Capitalising on North American speech resources for the development of a South African English large vocabulary speech recognition system. Computer Speech and Language, 28(6), 1255-1268.
- Lightly supervised learning from a damaged natural speech corpus. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 8086-8090.
- Correctness-adjusted unsupervised discriminative acoustic model adaptation. IEEE Transactions on Audio, Speech and Language Processing, PP(99).
- Introduction to the Special Section on New Frontiers in Rich Transcription. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 20(2), 353-355.
- Automatic transcription of academic lectures from diverse disciplines. 2012 IEEE Workshop on Spoken Language Technology, SLT 2012 - Proceedings, 398-403.
- Application of SVM-based correctness predictions to unsupervised discriminative speaker adaptation. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 4341-4344.
- Transcribing meetings with the AMIDA systems. IEEE Transactions on Audio, Speech and Language Processing.
- Error approximation and minimum phone error acoustic model estimation. IEEE Transactions on Audio, Speech and Language Processing, 18(6), 1269-1279.
- Automatic Optimization of Speech Decoder Parameters. IEEE SIGNAL PROC LET, 17(1), 95-98.
- The AMI system for the transcription of speech in meetings. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 4.
- Automatic transcription of conversational telephone speech. IEEE T SPEECH AUDI P, 13(6), 1173-1185.
- Implicit modelling of pronunciation variation in automatic speech recognition. SPEECH COMMUNICATION, 46(2), 171-188.
Chapters
- Use of Speaker Metadata for Improving Automatic Pronunciation Assessment, Statistical Language and Speech Processing (pp. 61-72).
- Speech recognition, Multimodal Signal Processing (pp. 56-83). Cambridge University Press
- Speech Recognition In Clark A, Fox C & Lappin S (Ed.), The Handbook of Computational Linguistics and Natural Language Processing (pp. 299-332). Wiley-Blackwell
- Juicer: A weighted finite-state transducer speech decoder (pp. 285-296).
- The AMI Meeting Corpus: A Pre-announcement, Machine Learning for Multimodal Interaction, Lecture Notes in Computer Science (pp. 28-39). Edinburgh: Springer.
Conference proceedings papers
- MUST: A Multilingual Student-Teacher Learning Approach for Low-Resource Speech Recognition. 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 16 December 2023 - 20 December 2023.
- The Effect of Spoken Language on Speech Enhancement Using Self-Supervised Speech Representation Loss Functions. 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 22 October 2023 - 25 October 2023.
- Exploring speech representations for proficiency assessment in language learning. 9th Workshop on Speech and Language Technology in Education (SLaTE) Proceedings (pp 151-155). Dublin, Ireland, 18 August 2023 - 18 August 2023. View this article in WRRO
- Towards Domain Generalisation in ASR with Elitist Sampling and Ensemble Knowledge Distillation. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4 June 2023 - 10 June 2023.
- Perceive and Predict: Self-Supervised Speech Representation Based Loss Functions for Speech Enhancement. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4 June 2023 - 10 June 2023.
- View this article in WRRO
- Deformable temporal convolutional networks for monaural noisy reverberant speech separation. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Rhodes Island, Greece, 4 June 2023 - 4 June 2023.
- The Effect of Spoken Language on Speech Enhancement Using Self-Supervised Speech Representation Loss Functions. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Vol. 2023-October
- DRX mode implementation based on virtual machine. 2022 29th IEEE International Conference on Electronics, Circuits and Systems (ICECS), 24 October 2022 - 26 October 2022.
- Unsupervised Data Selection for Speech Recognition with Contrastive Loss Ratios. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 23 May 2022 - 27 May 2022.
- A Model for Assessor Bias in Automatic Pronunciation Assessment. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 7267-7271)
- Attention Based Model for Segmental Pronunciation Error Detection. 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 13 December 2021 - 17 December 2021.
- Towards Low-Resource Stargan Voice Conversion Using Weight Adaptive Instance Normalization. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 6 June 2021 - 11 June 2021.
- Multiple-Hypothesis CTC-Based Semi-Supervised Adaptation of End-to-End Speech Recognition. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 6 June 2021 - 11 June 2021.
- Improving audio anomalies recognition using temporal convolutional attention networks. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 6473-6477). Toronto, ON, Canada, 6 June 2021 - 11 June 2021.
- Supervised speaker embedding de-mixing in two-speaker environment. 2021 IEEE Spoken Language Technology Workshop (SLT) (pp 758-765). Shenzhen, China, 19 January 2021 - 22 January 2021.
- Selective Adaptation of End-to-End Speech Recognition using Hybrid CTC/Attention Architecture for Noise Robustness. 2020 28th European Signal Processing Conference (EUSIPCO), 18 January 2021 - 21 January 2021.
- Contextual Joint Factor Acoustic Embeddings, Vol. 00 (pp 750-757)
- WINVC: One-Shot Voice Conversion with Weight Adaptive Instance Normalization (pp 559-573)
- Robust speaker recognition using speech enhancement and attention model. The Speaker and Language Recognition Workshop (Odyssey 2020) (pp 451-458). Tokyo, Japan, 1 November 2020 - 5 November 2020.
- Removing Bias with Residual Mixture of Multi-View Attention for Speech Emotion Recognition. Interspeech 2020 (pp 4084-4088). Shanghai, China, 25 October 2020 - 29 October 2020.
- Weakly supervised training of hierarchical attention networks for speaker identification. Proceedings of Interspeech 2020 (pp 2992-2996). Shanghai, China, 25 October 2020 - 29 October 2020.
- Exploration of audio quality assessment and anomaly localisation using attention models. Proceedings of Interspeech 2020 (pp 4611-4615). Shanghai, China, 25 October 2020 - 29 October 2020.
- Speaker re-identification with speaker dependent speech enhancement. Proceedings of Interspeech 2020 (pp 1530-1534). Shanghai, China, 25 October 2020 - 29 October 2020.
- H-vectors : utterance-level speaker embedding using a hierarchical attention model. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 7579-7583). Barcelona, Spain (virtual), 4 May 2020 - 8 May 2020.
- Unsupervised Adaptation of Acoustic Models for ASR Using Utterance-Level Embeddings from Squeeze and Excitation Networks. 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 14 December 2019 - 18 December 2019.
- Spatio-Temporal Context Modelling for Speech Emotion Classification. 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 14 December 2019 - 18 December 2019.
- A Cross-Corpus Study on Speech Emotion Recognition. 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 14 December 2019 - 18 December 2019.
- Improving ASR Error Detection with RNNLM Adaptation. 2018 IEEE Spoken Language Technology Workshop (SLT), 18 December 2018 - 21 December 2018.
- Exploring the use of group delay for generalised VTS based noise compensation. 2018 IEEE International Conference on Acoustics, Speech and Signal Processing Proceedings, 15 April 2018 - 20 April 2018. View this article in WRRO
- Improved acoustic modelling for automatic literacy assessment of children. Proceedings of Interspeech 2018 (pp 1666-1670), 2 September 2018 - 6 September 2018. View this article in WRRO
- Towards a generic approach for automatic speech recognition error detection and classification. 2018 4th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP), 21 March 2018 - 24 March 2018.
- Exploring the use of acoustic embeddings in neural machine translation. 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 16 December 2017 - 20 December 2017. View this article in WRRO
- Semi-supervised Adaptation of RNNLMs by Fine-tuning with Domain-specific Auxiliary Features. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 2715-2719) View this article in WRRO
- Robust Source-Filter Separation of Speech Signal in the Phase Domain. Proceedings of the Annual Conference of the International Speech Communication Association View this article in WRRO
- DNN approach to speaker diarisation using speaker channels. Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing (pp 4925-4929) View this article in WRRO
- Analysing Acoustic Model Changes for Active Learning in Automatic Speech Recognition. International Conference on Systems, Signals and Image Processing (IWSSIP) View this article in WRRO
- Interspeech 2017. Interspeech 2017
- Shefce: A Cantonese-English bilingual speech corpus for pronunciation assessment. 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 5 March 2017 - 9 March 2017. View this article in WRRO
- Statistical normalisation of phase-based feature representation for robust speech recognition. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (pp 5310-5314) View this article in WRRO
- Automatic speech recognition errors detection using supervised learning techniques. 2016 IEEE/ACS 13th International Conference of Computer Systems and Applications (AICCSA), 29 November 2016 - 2 December 2016. View this article in WRRO
- The 2015 Sheffield system for transcription of Multi-Genre Broadcast media. 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings (pp 624-631) View this article in WRRO
- Error correction in lightly supervised alignment of broadcast subtitles. Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech) View this article in WRRO
- webASR 2 - Improved cloud based speech technology. Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech) (pp 1613-1617) View this article in WRRO
- Using phone features to improve dialogue state tracking
generalisation to unseen states. Proceedings of the 17th Annual Meeting of the Special Interest Group
on Discourse and Dialogue, September 2016 - September 2016. View this article in WRRO
- Use of generalised nonlinearity in Vector Taylor Series noise compensation for robust speech recognition. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 08-12-September-2016 (pp 3798-3802) View this article in WRRO
- Combining weak tokenisers for phonotactic language recognition in a resource-constrained setting. Combining weak tokenisers for phonotactic language recognition in a resource-constrained setting (pp 2939-2943), 9 September 2016 - 12 September 2016. View this article in WRRO
- Emotion Recognition from the Speech Signal by Effective Combination of Generative and Discriminative Models. USES
- The Sheffield language recognition system in NIST LRE 2015. Proceedings of The Speaker and Language Recognition Workshop Odyssey 2016 View this article in WRRO
- View this article in WRRO
- Groupwise learning for ASR k-best list reranking in spoken language translation. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Vol. 2016-May (pp 6120-6124) View this article in WRRO
- Segment-oriented evaluation of speaker diarisation performance. 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 20 March 2016 - 25 March 2016. View this article in WRRO
- Interspeech 2016. Interspeech 2016
- The 2015 sheffield system for longitudinal diarisation of broadcast media. 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 13 December 2015 - 17 December 2015. View this article in WRRO
- View this article in WRRO
- View this article in WRRO
- Latent Dirichlet Allocation based organisation of broadcast media archives for deep neural network adaptation. 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 13 December 2015 - 17 December 2015. View this article in WRRO
- The MGB challenge: Evaluating multi-genre broadcast media recognition. 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 13 December 2015 - 17 December 2015. View this article in WRRO
- Knowledge transfer between speakers for personalised dialogue management. Proceedings of the 16th Annual Meeting of the Special Interest Group on Discourse and Dialogue, September 2015 - September 2015.
- View this article in WRRO
- An investigation into speaker informed DNN front-end for LVCSR. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 19 April 2015 - 24 April 2015. View this article in WRRO
- Automatic assessment of English learner pronunciation using discriminative classifiers. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 19 April 2015 - 24 April 2015. View this article in WRRO
- Speech-enabled environmental control in an AAL setting for people with speech disorders: a case study. IET International Conference on Technologies for Active and Assisted Living (TechAAL)
- Long-Term Statistical Feature Extraction from Speech Signal and Its Application in Emotion Recognition (pp 173-184)
- Quality estimation for asr k-best list rescoring in spoken language translation. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 19 April 2015 - 24 April 2015.
- Semi-supervised DNN training in meeting recognition. 2014 IEEE Spoken Language Technology Workshop (SLT), 7 December 2014 - 10 December 2014. View this article in WRRO
- Using neural network front-ends on far field multiple microphones based speech recognition. 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) View this article in WRRO
- Semi-supervised DNN training in meeting recognition. 2014 IEEE Workshop on Spoken Language Technology, SLT 2014 - Proceedings (pp 141-146)
- Background-tracking acoustic features for genre identification of broadcast shows. 2014 IEEE Spoken Language Technology Workshop (SLT), 7 December 2014 - 10 December 2014.
- Automatic selection of speakers for improved acoustic modelling: recognition of disordered speech with sparse data. 2014 IEEE Spoken Language Technology Workshop (SLT), 7 December 2014 - 10 December 2014.
- View this article in WRRO
- Using contextual information in joint factor eigenspace MLLR for speech recognition in diverse scenarios. 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4 May 2014 - 9 May 2014. View this article in WRRO
- View this article in WRRO
- View this article in WRRO
- View this article in WRRO
- Interpretation of multiparty meetings the AMI and AMIDA projects. 2008 Hands-free Speech Communication and Microphone Arrays, Proceedings, HSCMA 2008 (pp 115-118)
- On the convergence of fractal transforms. ICASSP’94 (pp 561-564)
- LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related Tasks. Interspeech 2024 (pp 2835-2839)
- EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark. Interspeech 2024 (pp 1580-1584)
- 1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem. The Speaker and Language Recognition Workshop (Odyssey 2024)
- View this article in WRRO
- View this article in WRRO
- View this article in WRRO
- View this article in WRRO
- The University of Sheffield CHiME-7 UDASE Challenge Speech Enhancement System. 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023)
- Domain Adaptive Self-supervised Training of Automatic Speech Recognition. INTERSPEECH 2023
- View this article in WRRO
- View this article in WRRO
- Insights on Neural Representations for End-to-End Speech Recognition. Interspeech 2021
- Empirical Interpretation of Speech Emotion Perception with Attention Based Model for Speech Emotion Recognition. Interspeech 2020
- Uncertainty-Aware Machine Support for Paper Reviewing on the Interspeech 2019 Submission Corpus. Interspeech 2020
- Unsupervised Acoustic Unit Representation Learning for Voice Conversion Using WaveNet Auto-Encoders. Interspeech 2020
- Multilingual Speech Recognition Using Language-Specific Phoneme Recognition as Auxiliary Task for Indian Languages. Interspeech 2020
- Detecting Mismatch Between Speech and Transcription Using Cross-Modal Attention. Interspeech 2019
- Latent Dirichlet Allocation Based Acoustic Data Selection for Automatic Speech Recognition. Interspeech 2019
- Learning Temporal Clusters Using Capsule Routing for Speech Emotion Recognition. Interspeech 2019
- On the Usefulness of the Speech Phase Spectrum for Pitch Extraction. Interspeech 2018 View this article in WRRO
- Channel Compensation in the Generalised Vector Taylor Series Approach to Robust ASR. Interspeech 2017 View this article in WRRO
- Combining Feature and Model-Based Adaptation of RNNLMs for Multi-Genre Broadcast Speech Recognition. Interspeech 2016 View this article in WRRO
- Colloquialising Modern Standard Arabic Text for Improved Speech Recognition. Interspeech 2016 View this article in WRRO
- The Sheffield Wargame Corpus — Day Two and Day Three. Interspeech 2016 View this article in WRRO
- Improving Generalisation to New Speakers in Spoken Dialogue State Tracking. Interspeech 2016 View this article in WRRO
- DNN-Based Speaker Clustering for Speaker Diarisation. Interspeech 2016 View this article in WRRO
- Automatic Genre and Show Identification of Broadcast Media. Interspeech 2016 View this article in WRRO
- A Novel Phase-based Feature for Robust Speech Recognition. USES Conference Proceedings
- Improvements in accuracy and speed in the HTK broadcast news transcription system. 6th European Conference on Speech Communication and Technology (pp 1043-1046)
Reports
Theses / Dissertations
Datasets
- The homeService corpus v1.0.
- Experiments results for IEEE/ACM Transaction on Audio, Speech and Language Processing Journal Paper: "Recurrent Neural Network Language Model Adaptation for Multi-Genre Broadcast Speech Recognition and Alignment".
- Interspeech 2016 - Experiment results for paper "Error correction in lightly supervised alignment of broadcast subtitles".
- Interspeech 2016 - Experiment results for paper "webASR 2 - Improved cloud based speech technology".
- Computer, Speech and Language - Experiment results for paper "Acoustic Adaptation to Dynamic Background Conditions with Asynchronous Transformations".
- Multimedia Tools and Applications - Experiments results for paper "Lightly supervised alignment of subtitles on multigenre broadcasts".
- Interspeech 2016 - Experiment results for paper "Combining Feature and Model-Based Adaptation of RNNLMs for Multi-Genre Broadcast Speech Recognition".
- Interspeech 2016 - Experiment results for Sheffield Wargame Corpora (SWC1, SWC2, SWC3).
- ICASSP 2016 - Experiment results for the paper "Groupwise learning for ASR k-best list reranking in spoken language translation".
Other
Preprints
- Improving Accented Speech Recognition using Data Augmentation based on Unsupervised Text-to-Speech Synthesis, arXiv.
- LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related Tasks, arXiv.
- 1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem, arXiv.
- Improving Acoustic Word Embeddings through Correspondence Training of Self-supervised Speech Representations, arXiv.
- SCORE: Self-supervised Correspondence Fine-tuning for Improved Content Representations, arXiv.
- On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic Environments, arXiv.
- The Effect of Spoken Language on Speech Enhancement using Self-Supervised Speech Representation Loss Functions, arXiv.
- Non Intrusive Intelligibility Predictor for Hearing Impaired Individuals using Self Supervised Speech Representations, arXiv.
- Empirical Interpretation of the Relationship Between Speech Acoustic Context and Emotion Recognition, arXiv.
- On Data Sampling Strategies for Training Neural Network Speech Separation Models, arXiv.
- View this article in WRRO
- Learning Cross-lingual Mappings for Data Augmentation to Improve Low-Resource Speech Recognition.
- View this article in WRRO
- Grants
-
Current grants
- UKRI Centre for Doctoral Training in Speech and Language Technologies and their Applications, EPSRC, 04/2019 - 09/2027, £5,508,850, as PI
- VoiceBase Centre, VoiceBase Inc./Liveperson, 04/2018 - 03/2026, £2,488,691, as PI
- WFST-based integration of ASR and MT in Spoken Language Translation, Industrial, 03/2014 - 12/2026, £63,588, as PI
Previous grants
- Automatic voice conversion for transforming professional adult voice actors to artificial child voice actors, Innovate UK, 01/2021 - 01/2023, £173,605, as PI
- MAUDIE: Multimedia Analysis for Unsupervised Dubbing In Entertainment, Innovate UK, 05/2018 - 07/2021, £393,115, as PI
- TUTO II: Reading skills tutoring system, ITSLANGUAGE BV, 08/2017 - 12/2019, £121,439, as PI
- Sound Source Separation Based on Deep Learning, Industrial, 05/2019 - 04/2020, £48,000, as PI
- Acoustic correlates of emotions for automatic recognition, Industrial, 10/2018 - 09/2019, £48,900, as PI
- Bridge Project, VoiceBase Inc., 09/2017 - 03/2018, £61,200, as PI
- STATUS IV: Speech Technology and Translation Universal Survey, Defence Science and Technology Laboratory, 01/2017 - 10/2017, £60,000, as PI
- TUTO: Reading skills tutoring system, ITSLANGUAGE BV, 09/2016 - 08/2017, £61,983, as PI
- STATUS III: Speech Technology and Translation Universal Survey, Defence Science and Technology Laboratory, 01/2015 - 07/2016, £78,684, as PI
- STATUS II: Speech Technology and Translation Universal Survey, Defence Science and Technology Laboratory, 11/2013 - 05/2014, £98,982, as PI
- ItsLanguage, ITSLANGUAGE BV, 11/2012 - 03/2015, £68,333, as PI
- German System Adaptation, ITSLANGUAGE BV, 11/2012 - 03/2015, £42,373, as PI
- DocuMeet: Transcription, summarisation and documentation of meetings using advanced speech technologies, indexing and browsing capabilities, EC FP7, 11/2012 - 10/2014, £368,433, as PI
- STATUS: Speech Technology and Translation Universal Survey, Defence Science and Technology Laboratory, 10/2012 - 08/2013, £73,726, as PI
- A Joint Model of Spoken Language Translation, Google, 09/2011 - 12/2016, £43,014, as PI
- Natural Speech Technology, EPSRC, 05/2011 - 07/2016, £1,798,665, as PI
- Unsupervised Domain Adaptation, CISCO, 11/2010 - 04/2012, £121,745, as PI
- AMIDA: Augmented Multi-party Interaction with Distance Access, EC FP6, 10/2006 - 12/2009, £467,074, as PI
- AMIDA: Augmented Multi-party Interaction with Distance Access, EC FP6, 10/2006 - 12/2009, £345,350, as PI
- Professional activities and memberships
-
- Head of the Speech and Hearing research group
- Editorial Board member, Computer Speech and Language
- Associate Editor, ACM Transactions on Speech and Language Processing
- Organising committee member, ASRU 2013
- Area Chair, Interspeech 2014, Speech Recognition - Signal Processing, Acoustic Modelling, Robustness and Adaptation.
- Area Chair, ICPR 2014, Track 3 Image, Speech. Signal and Video Processing
- Programme Committee, PoITAL 2014