Professor Heidi Christensen
PhD
School of Computer Science
Head of School
Professor of Spoken Language Technology
Member of the Speech and Hearing (SpandH) research group
+44 114 222 1950
Full contact details
School of Computer Science
Regent Court (DCS)
211 Portobello
Sheffield
S1 4DP
- Profile
-
Professor Christensen obtained her M.Sc. and a Ph.D. degrees from Aalborg University, Denmark, in 1996 and 2002, respectively. In 1998, she was a visiting researcher at IDIAP, Switzerland. From 2000, she worked as a research associate on numerous national and international projects in the Speech and Hearing group in Sheffield before taking up her lectureship in 2015. She was subsequently appointed a senior lecturer (2019) and chair (2022) in the department.
She is the current Head of School and has a long-term interest in EDI (Equality, Diversity and Inclusion), serving as the Faculty Director of EDI (2020-2022).
Her main research interests are in the areas of recognition of disordered speech, automatic processing of conversations, and the automatic detection and tracking of paralinguistic information such as emotions and general interactional behaviours.
- Research interests
-
Her research interests are in the use of speech and language processing in the healthcare domain. She has considerable experience in the areas of recognition of disordered speech, automatic processing of conversations and the automatic detection and tracking of paralinguistic information such as emotions and general interactional behaviours. Her research has been supported by EU, UKRI, NIHR, Google and various charities like Rosetrees Trust and the Psychiatry Research Trust. She has considerable experience in leading highly interdisciplinary research projects including being the technical lead in the development of the CognoSpeak system for using AI to detect early signs of dementia.
- Publications
-
Journal articles
- Raw acoustic-articulatory multimodal dysarthric speech recognition. Computer Speech & Language, 95, 101839-101839.
- Analysis of facial cues for cognitive decline detection using in-the-wild data. Applied Sciences, 15(11). View this article in WRRO
- CognoSpeak: an automatic, remote assessment of early cognitive decline in real-world conversational speech. 2025 IEEE Symposium on Computational Intelligence in Health and Medicine (CIHM), 1-7. View this article in WRRO
- Early dementia detection using multiple spontaneous speech prompts: The PROCESS challenge. ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1-2. View this article in WRRO
- A two-step attention-based feature combination cross-attention system for speech-based dementia detection. IEEE Transactions on Audio, Speech and Language Processing, 33, 896-907. View this article in WRRO
- Analysis of Voice Biomarkers for the Detection of Cognitive Impairment. IEEE Access, 12, 122840-122851.
- Spoken language-based automatic cognitive assessment of stroke survivors. Language and Health, 2(1), 32-38.
- Automatic detection of expressed emotion from five-minute speech samples: challenges and opportunities. PLOS ONE, 19(3). View this article in WRRO
- Predicting the cause of seizures using features extracted from interactions with a virtual agent. Seizure: European Journal of Epilepsy, 114, 84-89. View this article in WRRO
- Speech patterns in responses to questions asked by an intelligent virtual agent can help to distinguish between people with early stage neurodegenerative disorders and healthy controls. Clinical Linguistics & Phonetics, 38(9), 880-901. View this article in WRRO
- Feasibility of longitudinal automated cognitive assessment in the stroke pathway. Alzheimer's & Dementia, 19(S14).
- Developing an automated Cognitive assessment based on language; CognoSpeak‐ working with an ethnic minority group. Alzheimer's & Dementia, 19(S20).
- 02 Predicting the cause of TLOC using an automated analysis of interactions with a virtual agent. Journal of Neurology, Neurosurgery & Psychiatry, 94(12), e2.9-e2.9.
- Features of answers to questions about recent events by people with mild cognitive impairment and Alzheimer’s disease, and healthy controls. Journal of Interactional Research in Communication Disorders, 14(3), 408-429. View this article in WRRO
- The Dysarthric Expressed Emotional Database (DEED): an audio-visual database in British English. PLOS ONE, 18(8). View this article in WRRO
- Special issue on applications of speech and language technologies in healthcare. Applied Sciences, 13(11). View this article in WRRO
- Differentiating between epileptic and functional/dissociative seizures using semantic content analysis of transcripts of routine clinic consultations. Epilepsy & Behavior, 143. View this article in WRRO
- Trends and drivers of pharmaceutical expenditures from systemic anti-cancer therapy. The European Journal of Health Economics, 24(6), 853-865.
- Automated detection of the competency of delivering guided self-help for anxiety via speech and language processing. Applied Sciences, 12(17).
- FEASIBILITY OF AN AUTOMATED ASSESSMENT TO MEASURE COGNITION AND MOOD IN THE ACUTE STROKE SETTING. JOURNAL OF NEUROLOGY NEUROSURGERY AND PSYCHIATRY, 93(9).
- Keeping patient and public partnership at the heart of medical technology development during Covid-19 : examples of adaptive practice. Journal of Medical Engineering & Technology, 46(6). View this article in WRRO
- Characterising spoken interactions of healthy ageing adults with CognoSpeak, a web‐based cognitive assessment tool. Alzheimer's & Dementia, 17(S5).
- Feasibility of using an automated analysis of formulation effort in patients’ spoken seizure descriptions in the differential diagnosis of epileptic and nonepileptic seizures. Seizure, 91, 141-145. View this article in WRRO
- #3079 Investigating the feasibility of automating the differential diagnosis of transient loss of consciousness. Journal of Neurology, Neurosurgery & Psychiatry, 92(8), A7.1-A7.
- How can automated linguistic analysis help to discern functional cognitive disorder from healthy controls and mild cognitive impairment?. BJPsych open, 7(Suppl 1), S7-S7.
- Characterising spoken responses to an intelligent virtual agent by persons with mild cognitive impairment. Clinical Linguistics & Phonetics, 35(3), 237-252. View this article in WRRO
- Acoustic differences in emotional speech of people with dysarthria. Speech Communication, 126, 44-60.
- Fully automated cognitive screening tool based on assessment of speech and language. Journal of Neurology, Neurosurgery & Psychiatry, 92(1), 12-15. View this article in WRRO
- 26 Can an automated assessment of language help distinguish between Functional Cognitive Disorder and early neurodegeneration?. Journal of Neurology, Neurosurgery & Psychiatry, 91(8), e18.2-e19.
- Developing an intelligent virtual agent to stratify people with cognitive complaints: A comparison of human-patient and intelligent virtual agent-patient interaction. Dementia, 19(4), 1173-1188. View this article in WRRO
- A fully automated cognitive screening tool based on assessment of speech and language (5548). Neurology, 94(15_supplement).
- 055 The digital doctor : a fully automated stratification and monitoring system for patients with memory complaints. Journal of Neurology Neurosurgery & Psychiatry, 90(12).
- 056 Exploring the feasibility of automating verbal fluency tasks for cognitive assessment: data collection and analysis. Journal of Neurology Neurosurgery & Psychiatry, 90(12).
- A virtual agent to support individuals living with physical and mental comorbidities : co-design and acceptability testing. Journal of Medical Internet Research, 21(5). View this article in WRRO
- A new diagnostic approach for the identification of patients with neurodegenerative cognitive complaints. PLoS ONE, 14(5). View this article in WRRO
- Dementia detection using automatic analysis of conversations. Computer Speech and Language, 53, 65-79. View this article in WRRO
- NeuroSpeech. SoftwareX, 8, 69-70.
- NeuroSpeech: An open-source software for Parkinson's speech analysis. Digital Signal Processing, 77, 207-221. View this article in WRRO
- PO029 An avatar aid in memory clinic. Journal of Neurology, Neurosurgery & Psychiatry, 88(Suppl 1), A19.4-A20.
- Characterisation of voice quality of Parkinson’s disease using differential phonological posterior features. Computer Speech & Language, 46, 196-208. View this article in WRRO
- An Innovative Speech-Based User Interface for Smarthomes and IoT Solutions to Help People with Speech and Motor Disabilities. Studies in Health Technology and Informatics, 242, 306-313.
- Toward the Automation of Diagnostic Conversation Analysis in Patients with Memory Complaints. Journal of Alzheimer's Disease, 58(2), 373-387.
- Perspectives on Speech and Language Interaction for Daily Assistive Technology. ACM Transactions on Accessible Computing, 7(2), 1-8.
- Perspectives on Speech and Language Interaction for Daily Assistive Technology. ACM Transactions on Accessible Computing, 6(4), 1-2.
- Perspectives on Speech and Language Interaction for Daily Assistive Technology. ACM Transactions on Accessible Computing, 6(3), 1-3.
- The PASCAL CHiME speech separation and recognition challenge. Computer Speech and Language.
- A hearing-inspired approach for distant-microphone speech recognition in the presence of multiple sources. Computer Speech and Language.
- Combining speech fragment decoding and adaptive noise floor modeling. IEEE Transactions on Audio Speech and Language Processing, 20(3), 818-827.
- Improving source localisation in multi-source, reverberant conditions: exploiting local spectro-temporal location cues.. Abstract for Acoust. Soc. Am. mtg.
- A cascaded broadcast news highlighter. IEEE Transactions on Audio Speech and Language Processing, 16(1), 151-161.
- Acoustic modelling from raw source and filter components for dysarthric speech recognition. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 30, 2968-2980. View this article in WRRO
- Nationwide Survival Benefit after Implementation of First-Line Immunotherapy for Patients with Advanced NSCLC—Real World Efficacy. Cancers, 13(19), 4846-4846.
- Intelligibility assessment and speech recogniser word accuracy rate prediction for dysarthric speakers in a factor analysis subspace. Submitted to ACM Transactions on Accessible Computing (TACCESS).
Conference proceedings
- Text-to-dysarthric-speech generation for dysarthric automatic speech recognition: is purely synthetic data enough?. Speech and Computer: 27th International Conference, SPECOM 2025, Szeged, Hungary, October 13–15, 2025, Proceedings, Part I(LNAI 16187) (pp 203-216). Szeged, Hungary, 13 October 2025 - 13 October 2025. View this article in WRRO
- A Situational Semantic Projection Model for Ontology Completion in Dysarthric Speech Using Emotion and Dialogue Acts. 2024 6th International Conference on Natural Language Processing (ICNLP) (pp 124-128), 22 March 2024 - 24 March 2024.
- Identifying People with Mild Cognitive Impairment at Risk of Developing Dementia using Speech Analysis. 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (pp 1-6), 16 December 2023 - 20 December 2023.
- Moving towards non-binary gender Identification via analysis of system errors in binary gender classification. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Rhodes Island, Greece, 4 June 2023 - 4 June 2023. View this article in WRRO
- Investigating Visual Features for Cognitive Impairment Detection Using In-the-wild Data. 2023 IEEE 17th International Conference on Automatic Face and Gesture Recognition (FG) (pp 1-8), 5 January 2023 - 8 January 2023.
- Feasibility of automated longitudinal cognitive and mood assessment in the stroke pathway. INTERNATIONAL JOURNAL OF STROKE, Vol. 18(1) (pp 61-62)
- Evaluating the performance of state-of-the-art ASR systems on non-native English using corpora with extensive language background variation. Interspeech 2022: Proceedings of the Annual Conference of the International Speech Communication Association (pp 3958-3962). Incheon, Korea, 18 September 2022 - 18 September 2022. View this article in WRRO
- 372 Exploring the feasibility of an interactive virtual avatar and automated conversation analysis in the differentiation of epileptic and dissociative seizures. Epilepsia, Vol. 63(S2) (pp 271-271). Geneva. Switzerland and online, 9 July 2022 - 9 July 2022. View this article in WRRO
- Multi-Modal Acoustic-Articulatory Feature Fusion For Dysarthric Speech Recognition. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 7372-7376), 23 May 2022 - 27 May 2022.
- Eye blink rate based detection of cognitive impairment using in-the-wild data. 2021 9th International Conference on Affective Computing and Intelligent Interaction (ACII). Nara, Japan (virtual conference), 28 September 2021 - 28 September 2021. View this article in WRRO
- Parental spoken scaffolding and narrative skills in crowd-sourced storytelling samples of young children. Interspeech 2021 (pp 2946-2950). Brno, Czechia, 30 August 2021 - 30 August 2021. View this article in WRRO
- Predicting Levels of Depression and Anxiety in People with Neurodegenerative Memory Complaints Presenting with Confounding Symptoms (pp 58-69)
- Multi-Task Estimation of Age and Cognitive Decline from Speech. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 7258-7262), 6 June 2021 - 11 June 2021.
- How can automated linguistic analysis help to discern functional cognitive disorder from healthy controls and mild cognitive impairment?. BJPsych Open, Vol. 7(S1) (pp S7-S7). Virtual, 21 June 2021 - 24 June 2021.
- Investigating the feasibility of automating the differential diagnosis of Transient Loss of Consciousness. EPILEPSIA, Vol. 62 (pp 354-355)
- Using the Outputs of Different Automatic Speech Recognition Paradigms for Acoustic- and BERT-Based Alzheimer's Dementia Detection Through Spontaneous Speech.. Interspeech (pp 3810-3814)
- Identifying Cognitive Impairment Using Sentence Representation Vectors.. Interspeech (pp 2941-2945)
- A fully automated cognitive screening tool based on assessment of speech and language. Alzheimer's & Dementia, Vol. 16(S6). Chicago, IL, USA (online), 9 November 2020 - 9 November 2020. View this article in WRRO
- A comparison of acoustic and linguistics methodologies for Alzheimer’s dementia recognition. Interspeech 2020 (pp 2182-2186). Shanghai, China, 25 October 2020 - 29 October 2020.
- Recognising emotions in dysarthric speech using typical speech data. Interspeech 2020 (pp 4821-4825). Shanghai, China, 25 October 2020 - 29 October 2020.
- Autoencoder bottleneck features with multi-task optimisation for improved continuous dysarthric speech recognition. Proceedings of Interspeech 2020 (pp 4581-4585). Shanghai, China (Online), 25 October 2020 - 25 October 2020. View this article in WRRO
- Improving detection of Alzheimer’s Disease using automatic speech recognition to identify high-quality segments for more robust feature extraction. Proceedings of Interspeech 2020 (pp 4961-4965). Shanghai, China, 25 October 2020 - 29 October 2020.
- Improving Cognitive Impairment Classification by Generative Neural Network-Based Feature Augmentation. Interspeech 2020, 25 October 2020 - 25 October 2020.
- Acoustic feature extraction with interpretable deep neural network for neurodegenerative related disorder classification. Proceedings of Interspeech 2020 (pp 4806-4810). Shanghai, China, 25 October 2020 - 25 October 2020. View this article in WRRO
- Source domain data selection for improved transfer learning targeting dysarthric speech recognition. Proceedings of the 45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020) (pp 7424-7428). Barcelona, Spain, 4 May 2020 - 4 May 2020. View this article in WRRO
- Exploring appropriate acoustic and language modelling choices for continuous dysarthric speech recognition. Proceedings of the 45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020) (pp 6094-6098). Barcelona, Spain, 4 May 2020 - 4 May 2020. View this article in WRRO
- Deep learning of articulatory-based representations and applications for improving dysarthric speech recognition. Speech Communication 13th ITG Fachtagung Sprachkommunikation (pp 331-335)
- Automatic hierarchical attention neural network for detecting AD. Proceedings of Interspeech 2019 (pp 4105-4109). Graz, Austria, 15 September 2019 - 19 September 2019.
- Phonetic Analysis of Dysarthric Speech Tempo and Applications to Robust Personalised Dysarthric Speech Recognition. ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 5836-5840), 12 May 2019 - 17 May 2019.
- Computational Cognitive Assessment: Investigating the Use of an Intelligent Virtual Agent for the Detection of Early Signs of Dementia. ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 2732-2736), 12 May 2019 - 17 May 2019.
- Examining Temporal Variations in Recognizing Unspoken Words using EEG Signals. 2018 IEEE International Conference on Systems, Man, and Cybernetics (SMC) (pp 976-981). Miyazaki, Japan, 7 October 2018 - 7 October 2018. View this article in WRRO
- Discriminating between imagined speech and non-speech tasks using EEG. 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) (pp 1952-1955). Honolulu, Hawaii, 18 July 2018 - 18 July 2018. View this article in WRRO
- Detecting Signs of Dementia Using Word Vector Representations. Interspeech 2018
- Embedding speech technology into intelligent tutoring systems using the CloudCAST speech technology platform. Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, Vol. 10858 LNCS (pp 421-424)
- Detecting and Predicting Alzheimer's Disease Severity in Longitudinal Acoustic Data. Proceedings of the International Conference on Bioinformatics Research and Applications 2017 (pp 57-61)
- An avatar to screen for cognitive impairment. Journal of the Neurological Sciences, Vol. 381 (pp 319-319)
- An avatar-based system for identifying individuals likely to develop dementia. Interspeech 2017 (pp 3147-3151). Stockholm, Sweden, 20 August 2017 - 20 August 2017. View this article in WRRO
- On the impact of non-modal phonation on phonological features. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (pp 5090-5094). New Orleans, LA, USA, 5 March 2017 - 5 March 2017. View this article in WRRO
- Multi-view representation learning via gcca for multimodal analysis of Parkinson's disease. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (pp 2966-2970). New Orleans, LA, USA, 5 March 2017 - 5 March 2017. View this article in WRRO
- An Innovative Speech-Based Interface to Control AAL and IoT Solutions to Help People with Speech and Motor Disability (pp 269-278)
- Cloud-Based Speech Technology for Assistive Technology Applications (CloudCAST). Studies in Health Technology and Informatics, Vol. 242 (pp 322-329). Netherlands View this article in WRRO
- Brain-computer interface technology for speech recognition: A review. Proceedings of 2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) (pp 1-5). Jeju, South Korea, 13 December 2016 - 13 December 2016. View this article in WRRO
- Diagnosing people with dementia using automatic conversation analysis. Proceedings of Interspeech (pp 1220-1224). San Francisco, CA, 8 September 2016 - 8 September 2016. View this article in WRRO
- A Framework for Collecting Realistic Recordings of Dysarthric Speech - the homeService Corpus. Proceedings of LREC 2016. Portorož, Slovenia, 24 May 2016 - 24 May 2016. View this article in WRRO
- Simple and robust audio-based detection of biomarkers for Alzheimer's disease. SLPAT 2016 Workshop on Speech and Language Processing for Assistive Technologies
- Remote Speech Technology for Speech Professionals - the CloudCAST initiative. Proceedings of SLPAT 2015: 6th Workshop on Speech and Language Processing for Assistive Technologies (pp 97-102), September 2015 - September 2015.
- Knowledge transfer between speakers for personalised dialogue management. Proceedings of the 16th Annual Meeting of the Special Interest Group on Discourse and Dialogue (pp 12-21), September 2015 - September 2015.
- Speech-enabled environmental control in an AAL setting for people with speech disorders: a case study. IET International Conference on Technologies for Active and Assisted Living (TechAAL) (pp 6 .-6 .)
- Introduction. Slpat 2015 6th Workshop on Speech and Language Processing for Assistive Technologies Proceedings (pp III)
- Adaptive speech recognition and dialogue management for users with speech disorders. Proceedings of Interspeech’14
- Automatic selection of speakers for improved acoustic modelling: recognition of disordered speech with sparse data. 2014 IEEE Spoken Language Technology Workshop (SLT) (pp 254-259), 7 December 2014 - 10 December 2014.
- Dysarthria Intelligibility Assessment in a Factor Analysis Total Variability Space. Interspeech’13
- Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech. Interspeech’13
- Learning speaker-specific pronunciations of disordered speech. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 (pp 1158-1162)
- Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 (pp 3609-3612)
- Dysarthria Intelligibility Assessment in a Factor Analysis Total Variability Space. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 (pp 2132-2136)
- homeService: Voice-enabled assistive technology in the home using cloud-based automatic speech recognition. 4th Workshop on Speech and Language Processing for Assistive Technologies (SLPAT)
- Learning speaker-specific pronunciations of disordered speech. Interspeech’13
- A comparative study of adaptive, automatic recognition of disordered speech. 13th Annual Conference of the International Speech Communication Association 2012 Interspeech 2012, Vol. 2 (pp 1774-1777)
- Studio report: Linux audio for multi-speaker natural speech technology.. Proc. Linux Audio Conference
- SPECS - an embedded platform, speech-driven environmental control system evaluated in a virtuous circle framework. In proc. Workshop on Innovation and Applications in Speech Technology
- Binaural cues for fragment-based speech recognition in reverberant multisource environments. Proceedings of the Annual Conference of the International Speech Communication Association Interspeech (pp 1657-1660)
- Recent advances in fragment-based speech recognition in reverberant multisource environments.. Proceedings of ISCA Workshop on Machine Listening in Multisource Environments (pp 68-73)
- Binaural cues for fragment-based speech recognition in reverberant multisource environments. Proceedings of INTERSPEECH 2011 (pp 1657-1660)
- Incorporating localisation cues in a fragment decoding framework for distant binaural speech recognition.. IEEE Joint Workshop on Hands-Free Speech Communication and Microphone Arrays (HSCMA’11) (pp 207-212)
- Speaker turn tracking with mobile microphones: Combining location and pitch information. European Signal Processing Conference (pp 954-958)
- Distant microphone speech recognition in a noisy indoor environment: combining soft missing data and speech fragment decoding.. ISCA Tutorial and Research Workshop on Statistical And Perceptual Audition
- The CHiME corpus: A resource and a challenge for computational hearing in multisource environments. Proceedings of the 11th Annual Conference of the International Speech Communication Association Interspeech 2010 (pp 1918-1921)
- A speech fragment approach to localising multiple speakers in reverberant environments. ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings (pp 4593-4596)
- Using location cues to track speaker changes from mobile, binaural microphones. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 (pp 124-127)
- The CAVA corpus: synchronised stereoscopic and binaural datasets with head movements.. ICMI (pp 109-116)
- Integrating pitch and localisation cues at a speech fragment level. International Speech Communication Association 8th Annual Conference of the International Speech Communication Association Interspeech 2007, Vol. 4 (pp 2752-2755)
- Active binaural distance estimation for dynamic sources. International Speech Communication Association 8th Annual Conference of the International Speech Communication Association Interspeech 2007, Vol. 2 (pp 933-936)
- Multi-stage compaction approach to broadcast news summarisation. 9th European Conference on Speech Communication and Technology (pp 69-72)
- Maximum entropy segmentation of broadcast news. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5 (pp 1029-1032)
- INTRODUCING PHONETICALLY MOTIVATED, HETEROGENEOUS INFORMATION INTO AUTOMATIC SPEECH RECOGNITION. INTEGRATION OF PHONETIC KNOWLEDGE IN SPEECH TECHNOLOGY, Vol. 25 (pp 67-86)
- From text summarisation to style-specific summarisation for broadcast news. Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, Vol. 2997 (pp 223-237)
- Are extractive text summarisation techniques portable to broadcast news?. ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03 (pp 489-494)
- Exploring the style-technique interaction in extractive summarization of broadcast news. ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03 (pp 495-500)
- Punctuation Annotation Using Statistical Prosody Models. Proceedings of the ISCA Workshop on Prosody in Speech Recognition and Understanding (pp 35-40)
- Introducing Phonetically Motivated Information into ASR. Proceedings of Eurospeech 2001
- Employing Heterogeneous Information in a Multi-Stream Framework. Proceedings of ICASSP 2000
- Noise robustness of heterogeneous features employing minimum classification error feature space transformations.. INTERSPEECH (pp 534-537)
- Automatic Detection of Expressed Emotion from Five-Minute Speech Samples: Challenges and Opportunities. Interspeech 2022 (pp 2458-2462)
- Automatic cognitive assessment: Combining sparse datasets with disparate cognitive scores. Interspeech 2022 (pp 2463-2467)
- Dysarthric Speech Recognition From Raw Waveform with Parametric CNNs. Interspeech 2022
- Using the Outputs of Different Automatic Speech Recognition Paradigms for Acoustic- and BERT-Based Alzheimer’s Dementia Detection Through Spontaneous Speech. Interspeech 2021
- Identifying Cognitive Impairment Using Sentence Representation Vectors. Interspeech 2021 (pp 2941-2945)
- Towards the understanding of communicating emotions for people with dysarthria. Proceedings of WSPD 2018. Mysore, India, 8 September 2018 - 8 September 2018.
- CloudCAST — Remote Speech Technology for Speech Professionals. Interspeech 2016 (pp 1608-1612)
- Integrating pitch and localisation cues at a speech fragment level. Interspeech 2007 (pp 2769-2772)
Reports
- Inferring perceiver actions from binaural data
- Synchronized stereoscopic and binaural dataset with head movements : Recording details
Theses
- Speech Recognition using Heterogenous Information Extraction in Multi-Stream Based Systems.
- Speaker Adaptation of Hidden Markov Models using Maximum Likelihood Linear Regression.
Working papers
Datasets
Other
- Simultaneous Tracking of Perceiver Movements and Speaker Changes Using Head-Centered, Binaural Data.
- POPeye: Real-time, binaural sound source localisation on an audio-visual robot-head.
Preprints
- Exploring Gender Disparities in Automatic Speech Recognition Technology, arXiv.
- CognoSpeak: an automatic, remote assessment of early cognitive decline in real-world conversational speech, arXiv.
- Early Dementia Detection Using Multiple Spontaneous Speech Prompts: The PROCESS Challenge, arXiv.
- Automatic Detection of Expressed Emotion from Five-Minute Speech Samples: Challenges and Opportunities, arXiv.
- Data augmentation using generative networks to identify dementia, arXiv.
- Detecting Alzheimer's Disease by estimating attention and elicitation path through the alignment of spoken picture descriptions with the picture prompt, arXiv.
- A Virtual Agent to Support Individuals Living With Physical and Mental Comorbidities: Co-Design and Acceptability Testing (Preprint), JMIR Publications Inc..
- Raw acoustic-articulatory multimodal dysarthric speech recognition. Computer Speech & Language, 95, 101839-101839.
- Grants
-
- CcHAT: CognoSpeak: a Cognitive Health Assessment Tool, NIHR, 02/2022 - 07/2025, £1,357,247, as Co-I
- UKRI Centre for Doctoral Training in Speech and Language Technologies and their Applications, EPSRC, 04/2019 - 09/2027, £5,508,850, as Co-PI
- Deep learning of articulatory-based representations of dysarthric speech, Google, 02/2016 - 12/2026, £46,624, as PI
- Design and development of CognoMND, LifeArc, 01/07/2024 - 30/06/2027, £209,000 as Co-PI
- The automated coding of expressed emotion to enhance clinical and epidemiological mental health research in adolescence, MRC, 11/2022 - 12/2024, £379,598, as Co-PI
- Interdisciplinary project on COMPutational Assessment of Stroke Survivors (COMPASS), Rosetrees Trust, 02/2020 - 08/2024, £124,864, as PI
- CognoSpeak-eQMS, EPSRC, 01/01/2023 - 31/12/2023, £22,904, as PI
- Participatory co-design of a platform for collecting atypical speech data, Research England, 03/2022 - 07/2022, £19,692, as Co-PI
- CognoSpeak UX, Research England, 06/2021 - 02/2022, £27,694, as PI
- COMPASS: COMPutational Assessment of Stroke Survivors, Research England, 11/2019 - 06/2020, £48,886, as PI
- COCOA: COmputerised COgnitive Assessment, MRC, 10/2018 to 09/2019, £49,513, as PI
- TAPAS: Training Network on Automatic Processing of PAthological Speech, EU H2020, 11/2017 - 09/2022, £468,000, as PI
- CloudVent: Cloud-based speech recognition for people with paralysis using ventilators, RCUK, 10/2015 - 04/2016, £360,000, as Co-PI
- Professional activities and memberships
-
- Member of the Speech and Hearing research group
- Member of the ISCA Diversity Committee
- Co-organiser of ASRU 2021
- Co-organiser of Interspeech 2023