Dr Ning Ma

School of Computer Science

Senior Lecturer in Medical Computing

NHS Liaison Link

Member of the Pervasive Computing research group

Member of the Speech and Hearing research group

n.ma@sheffield.ac.uk

Regent Court (CS)

Full contact details

Dr Ning Ma
School of Computer Science
Regent Court (CS)
211 Portobello
Sheffield
S1 4DP

Profile

Ning is a Lecturer in Medical Computing at the School of Computer Science, University of Sheffield, and is also affiliated with the Academic Directorate of Medical Imaging and Medical Physics at the Sheffield Teaching Hospitals NHS Foundation Trust. Before that he was a Research Fellow in Computer Science working on health-related research projects. His first degree was in Computer Science from South China University of Technology, and he has a PhD in hearing-inspired automatic speech processing from the University of Sheffield.

Ning’s research interests lie in speech and hearing technologies, machine learning and healthcare. In particular, his research focuses on the development of AI systems that can interpret sounds and low-cost sensor data and extract useful information for screening health issues, such as sleep-disordered breathing and respiratory diseases. He has been PI and Co-PI of several UKRI and HEIF grants on acoustic monitoring of sleep-disordered breathing and cough sound analysis for tuberculosis screening. He is also interested in music AI technology and its link with mental health.

Ning has published more than 60 refereed journal and conference papers. He is on the Technical Programme Committee for INTERSPEECH 2023 and 2024 as the Lead Area Chair for Speech, voice, and hearing disorders. He regularly reviews manuscripts and grants for a range of journals and funders.

Ning is an Insigneo Institute Research Theme Co-Director for Healthcare data/AI. He is a member of the British Sleep Society, the British Thoracic Society and IEEE.

Research interests

Acoustic monitoring for healthcare, including sleep-disordered breathing and respiratory conditions
Multimodal machine learning for health applications
Speech and hearing technology
Hearing impairment and cochlear implant processing

Publications

Featured publications

Journal articles

Romero HE, Ma N, Brown G & Hill EA (2022) Acoustic screening for obstructive sleep apnea in home environments based on deep neural networks. IEEE Journal of Biomedical and Health Informatics, 26(7), 2941-2950. View this article in WRRO
Ma N, May T & Brown GJ (2017) Exploiting Deep Neural Networks and Head Movements for Robust Binaural Localisation of Multiple Sources in Reverberant Environments. IEEE Transactions on Audio, Speech, and Language Processing, 25(12), 2444-2453. View this article in WRRO
Ma N, Morris S & Kitterick PT (2016) Benefits to Speech Perception in Noise From the Binaural Integration of Electric and Acoustic Signals in Simulated Unilateral Deafness. Ear and Hearing, 37(3), 248-259. View this article in WRRO

Conference proceedings

Romero HE, Ma N, Brown GJ & Johnson S (2023) Obstructive sleep apnea screening with breathing sounds and respiratory effort: a multimodal deep learning approach. Interspeech 2023 Proceedings (pp 5451-5455). Dublin, Ireland, 20 August 2023 - 20 August 2023. View this article in WRRO
Hu Q, Ma N & Brown GJ (2023) Robust binaural sound localisation with temporal attention. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Proceedings. Rhodes Island, Greece, 4 June 2023 - 4 June 2023. View this article in WRRO
Tu Z, Deadman J, Ma N & Barker J (2022) Auditory-Based Data Augmentation for end-to-end Automatic Speech Recognition. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 7447-7451), 23 May 2022 - 27 May 2022.
Tu Z, Ma N & Barker J (2021) Optimising hearing aid fittings for speech in noise with a differentiable hearing loss model. Interspeech 2021 (pp 691-695). Brno, Czechia, 30 August 2021 - 30 August 2021. View this article in WRRO
Ornolfsson I, Dau T, Ma N & May T (2021) Exploiting Non-Negative Matrix Factorization for Binaural Sound Localization in the Presence of Directional Interference. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 221-225), 6 June 2021 - 11 June 2021.
Romero HE, Ma N, Hill EA & Brown GJ (2020) 0573 Screening for obstructive sleep apnea at home based on deep learning features derived from respiration sounds. Sleep, Vol. 43(Supplement_1) (pp a219-a220). Philadelphia, PA, USA (online conference), 27 August 2020 - 27 August 2020. View this article in WRRO
Romero HE, Ma N & Brown GJ (2020) Snorer diarisation based on deep neural network embeddings. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Barcelona, Spain (virtual conference), 4 May 2020 - 4 May 2020. View this article in WRRO
Romero HE, Ma N, Hill EA & Brown GJ (2020) Screening for obstructive sleep apnea at home based on deep learning features derived from respiration sounds. Sleep, Vol. 43 (pp A219-A220)
Tu Z, Ma N & Barker J (2022) Unsupervised Uncertainty Measures of Automatic Speech Recognition for Non-intrusive Speech Intelligibility Prediction. Interspeech 2022 (pp 3493-3497)
Tu Z, Ma N & Barker J (2022) Exploiting Hidden Representations from a DNN-based Speech Recogniser for Speech Intelligibility Prediction in Hearing-impaired Listeners. Interspeech 2022 (pp 3488-3492)

All publications

Journal articles

Alabed S, Anderson A, Maiter A, Hughes A, McAnenly N, Salehi M, Sharkey M, Dwivedi K, Hokmabadi A, Alahdab F, Stevenson M et al (2026) Large language models for simplifying radiology reports: a systematic review and meta-analysis of patient, public, and clinician evaluations. The Lancet Digital Health, 100960-100960.
Hughes A, Ma N & Aletras N (2026) Investigating Privacy Preservation of Language Models in Legal Text Summarization: A Preliminary Study. Journal of Institutional and Theoretical Economics, 182(1), 73-82.
Ma N, Mirheidari B, Brown GJ, Sanjase N, Maimbolwa M, Chifwamba S, Muzazu S, Muyoyeta M & Kagujje M (2025) Deep Learning for Tuberculosis Screening in a High-burden Setting using Cough Analysis and Speech Foundation Models. CoRR, abs/2509.09746.
Xu X, Zhang C & Sankar R (2025) PPEA: Post-Position Encoding Attention for Imbalanced Lung Sound Classification. Annu Int Conf IEEE Eng Med Biol Soc, 2025, 1-6.
Hughes A, Aletras N & Ma N (2024) How Private are Language Models in Abstractive Summarization? CoRR, abs/2412.12040.
Romero HE, Ma N, Brown G & Hill EA (2022) Acoustic screening for obstructive sleep apnea in home environments based on deep neural networks. IEEE Journal of Biomedical and Health Informatics, 26(7), 2941-2950. View this article in WRRO
Tu Z, Deadman J, Ma N & Barker J (2022) Auditory-Based Data Augmentation for End-to-End Automatic Speech Recognition. CoRR, abs/2204.04284.
Vecchiotti P, Ma N, Squartini S & Brown GJ (2019) End-to-end Binaural Sound Localisation from the Raw Waveform. CoRR, abs/1904.01916.
Ma N, Gonzalez J & Brown GJ (2018) Robust binaural localization of a target sound source by combining spectral source models and deep neural networks. IEEE/ACM Transactions on Audio, Speech and Language Processing, 26(11), 2122-2131. View this article in WRRO
Ma N, May T & Brown GJ (2017) Exploiting Deep Neural Networks and Head Movements for Robust Binaural Localisation of Multiple Sources in Reverberant Environments. IEEE Transactions on Audio, Speech, and Language Processing, 25(12), 2444-2453. View this article in WRRO
Gonzalez JA, Gómez AM, Peinado AM, Ma N & Barker J (2017) Spectral Reconstruction and Noise Model Estimation Based on a Masking Model for Noise Robust Speech Recognition. Circuits, Systems, and Signal Processing, 36, 3731-3760. View this article in WRRO
Ma N, Morris S & Kitterick PT (2016) Benefits to Speech Perception in Noise From the Binaural Integration of Electric and Acoustic Signals in Simulated Unilateral Deafness. Ear and Hearing, 37(3), 248-259. View this article in WRRO
Carmona JL, Barker J, Gomez AM & Ma N (2013) Speech spectral envelope enhancement by HMM-based analysis/resynthesis. IEEE Signal Processing Letters, 20(6), 563-566.
González JA, Peinado AM, Ma N, Gómez AM & Barker J (2013) MMSE-based missing-feature reconstruction with temporal modeling for robust speech recognition. IEEE Transactions on Audio Speech and Language Processing, 21(3), 624-635.
Barker J, Vincent E, Ma N, Christensen H & Green P (2012) The PASCAL CHiME speech separation and recognition challenge. Computer Speech and Language.
Ma N, Barker J, Christensen H & Green P (2012) A hearing-inspired approach for distant-microphone speech recognition in the presence of multiple sources. Computer Speech and Language.
Ma N, Barker J, Christensen H & Green P (2012) Combining speech fragment decoding and adaptive noise floor modelling. IEEE Transactions on Audio, Speech and Language Processing, 20, 818-827.
Barker J, Ma N, Coy A & Cooke M (2010) Speech fragment decoding techniques for simultaneous speaker identification and speech recognition. Computer Speech and Language, 24(1), 94-111.
Christensen H, Ma N, Wrigley SN & Barker J (2008) Improving source localisation in multi-source, reverberant conditions: exploiting local spectro-temporal location cues. The Journal of the Acoustical Society of America, 123(5_Supplement), 3294-3294.
Ma N, Green P, Barker J & Coy A (2007) Exploiting correlogram structure for robust speech recognition with multiple speech sources. Speech Communication, 49(12), 874-891.

Conference proceedings

Rowe V, Niu C, Brown GJ, Elphick H, Thomas L, Johnson S & Ma N (2026) P57 Smartphone-based acoustic screening for paediatric sleep disordered breathing in hospital and home settings: data collection and preliminary machine learning results (pp A53-A54)
Niu C, Rowe V, Brown GJ, Elphick H, Kenyon H, Thomas L, Johnson S & Ma N (2026) Transfer Learning for Paediatric Sleep Apnoea Detection using Physiology-Guided Acoustic Models. ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 14957-14961), 3 May 2026 - 8 May 2026.
Hughes A, Duddu V, Asokan N, Aletras N & Ma N (2026) PATCH: Mitigating PII Leakage in Language Models with Privacy-Aware Targeted Circuit PatcHing. Findings of the Association for Computational Linguistics: EACL 2026 (pp 5139-5153), March 2026 - March 2026.
Johnson S, Romero H, Gillespie P, Wiffen R, Manuel A, Palethorpe M, De Meyer M, Ma N & Brown GJ (2025) Night-to-night variability in Apnoea-Hypopnoea Index: Implications for Obstructive Sleep Apnoea diagnosis. Sleep science (pp OA3308-OA3308)
Hughes A, Aletras N & Ma N (2025) How Private are Language Models in Abstractive Summarization? EMNLP (pp 30112-30130)
Romero H, Ma N, Brown G & Johnson S (2024) SLUMBR: SLeep statUs estiMation from aBdominal Respiratory effort. 2024 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC). Orlando, Florida, 15 July 2024 - 15 July 2024. View this article in WRRO
Hughes C, Brown G, Ma N & Dibben N (2024) Acoustic effects of facial feminisation surgery on speech and singing: A case study. Proceedings of Interspeech 2024 (pp 3065-3069). Kos island, Greece, 1 September 2024 - 1 September 2024. View this article in WRRO
Romero HE, Ma N, Brown GJ & Johnson S (2023) Obstructive sleep apnea screening with breathing sounds and respiratory effort: a multimodal deep learning approach. Interspeech 2023 Proceedings (pp 5451-5455). Dublin, Ireland, 20 August 2023 - 20 August 2023. View this article in WRRO
Hu Q, Ma N & Brown GJ (2023) Robust binaural sound localisation with temporal attention. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Proceedings. Rhodes Island, Greece, 4 June 2023 - 4 June 2023. View this article in WRRO
Barker E, Barker J, Gaizauskas R, Ma N & Paramita ML (2022) SNuC: The Sheffield Numbers Spoken Language Corpus. Proceedings of the Thirteenth Language Resources and Evaluation Conference (pp 1978-1984). Marseille, France, 20 June 2022 - 20 June 2022. View this article in WRRO
Tu Z, Deadman J, Ma N & Barker J (2022) Auditory-Based Data Augmentation for end-to-end Automatic Speech Recognition. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 7447-7451), 23 May 2022 - 27 May 2022.
Tu Z, Ma N & Barker J (2021) Optimising hearing aid fittings for speech in noise with a differentiable hearing loss model. Interspeech 2021 (pp 691-695). Brno, Czechia, 30 August 2021 - 30 August 2021. View this article in WRRO
Ornolfsson I, Dau T, Ma N & May T (2021) Exploiting Non-Negative Matrix Factorization for Binaural Sound Localization in the Presence of Directional Interference. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 221-225), 6 June 2021 - 11 June 2021.
Tu Z, Ma N & Barker J (2021) DHASP: Differentiable Hearing Aid Speech Processing. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 296-300), 6 June 2021 - 11 June 2021.
Ma N, Brown GJ & Vecchiotti P (2021) AMI – Creating musical compositions with a coherent long-term structure. AISB Convention 2021: Communication and Conversations
Ma N, Brown GJ & Vecchiotti P (2021) AMI – Creating Coherent Musical Composition with Attention. ICMC 2021: Proceedings of the International Computer Music Conference 2021 (pp 414-418)
Romero HE, Ma N, Hill EA & Brown GJ (2020) 0573 Screening for obstructive sleep apnea at home based on deep learning features derived from respiration sounds. Sleep, Vol. 43(Supplement_1) (pp a219-a220). Philadelphia, PA, USA (online conference), 27 August 2020 - 27 August 2020. View this article in WRRO
Romero HE, Ma N & Brown GJ (2020) Snorer diarisation based on deep neural network embeddings. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Barcelona, Spain (virtual conference), 4 May 2020 - 4 May 2020. View this article in WRRO
Romero HE, Ma N, Hill EA & Brown GJ (2020) Screening for obstructive sleep apnea at home based on deep learning features derived from respiration sounds. Sleep, Vol. 43 (pp A219-A220)
Romero H, Ma N, Brown G, Beeston A & Hasan M (2019) Deep learning features for robust detection of acoustic events in sleep-disordered breathing. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP-2019). Brighton, UK, 12 May 2019 - 12 May 2019. View this article in WRRO
Vecchiotti P, Ma N, Squartini S & Brown G (2019) End-to-end binaural sound localisation from the raw waveform. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP-2019). Brighton, UK, 12 May 2019 - 12 May 2019. View this article in WRRO
Romero HE, Ma N, Brown GJ, Beeston AV & Hasan M (2019) Deep Learning Features for Robust Detection of Acoustic Events in Sleep-disordered Breathing. ICASSP (pp 810-814)
Vecchiotti P, Ma N, Squartini S & Brown GJ (2019) End-to-end Binaural Sound Localisation from the Raw Waveform. ICASSP (pp 451-455)
Meutzner H, Ma N, Nickel R, Schymura C & Kolossa D (2017) Improving audio-visual speech recognition using deep neural networks with dynamic stream reliability estimates. Proceedings of the 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017) (pp 5320-5324). New Orleans, Louisiana, USA, 5 March 2017 - 5 March 2017. View this article in WRRO
Guo Y, Wang X, Wu C, Fu Q, Ma N & Brown G (2016) A robust dual-microphone speech source localization algorithm for reverberant environments. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. San Francisco, USA, 8 September 2016 - 8 September 2016. View this article in WRRO
Ma N & Brown GJ (2016) Speech localisation in a multitalker mixture by humans and machines. Proceedings of INTERSPEECH 2016 (pp 3359-3363). San Francisco, USA, 8 September 2016 - 8 September 2016. View this article in WRRO
Zeiler S, Nickel R, Ma N, Brown GJ & Kolossa D (2016) Robust audiovisual speech recognition using noise-adaptive linear discriminant analysis. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Vol. 2016-May (pp 2797-2801)
Ma N, Marxer R, Barker J & Brown GJ (2015) Exploiting synchrony spectra and deep neural networks for noise-robust automatic speech recognition. ASRU 2015 Proceedings (pp 490-495). Scottsdale, Arizona, USA. View this article in WRRO
Ma N, Brown G & Gonzalez J (2015) Exploiting top-down source models to improve binaural localisation of multiple sources in reverberant environments. Interspeech 2015 (pp 160-164). Dresden, Germany, 6 September 2015 - 6 September 2015. View this article in WRRO
May T, Ma N & Brown GJ (2015) Robust localisation of multiple speakers exploiting head movements and multi-conditional training of binaural cues. 40th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2015 (pp 2679-2683). Brisbane, Australia, 19 April 2015 - 19 April 2015. View this article in WRRO
Ma N, May T, Wierstorf H & Brown GJ (2015) A machine-hearing system exploiting head movements for binaural sound localisation in reverberant conditions. Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on. Brisbane, 19 April 2015 - 19 April 2015. View this article in WRRO
Ma N, May T, Wierstorf H & Brown GJ (2015) A machine-hearing system exploiting head movements for binaural sound localisation in reverberant conditions. 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (pp 2699-2703)
May T, Ma N & Brown GJ (2015) Robust localisation of multiple speakers exploiting head movements and multi-conditional training of binaural cues. 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (pp 2679-2683)
Schymura C, Walther T, Kolossa D, Ma N & Brown GJ (2014) Binaural sound source localisation using a Bayesian-network-based blackboard system and hypothesis-driven feedback. Forum Acusticum. Krakow (Poland), 7 September 2014 - 7 September 2014. View this article in WRRO
Ma N & Barker J (2013) A fragment-decoding plus missing-data imputation system evaluated on the 2nd CHiME challenge. Proceedings of the 2nd CHiME Workshop on Machine Listening in Multisource Environments (pp 53-58)
González JA, Peinado AM, Gómez AM & Ma N (2012) Log-spectral feature reconstruction based on an occlusion model for noise robust speech recognition. 13th Annual Conference of the International Speech Communication Association 2012 Interspeech 2012, Vol. 3 (pp 2629-2632)
Ma N & Barker J (2012) Coupling identification and reconstruction of missing features for noise-robust automatic speech recognition. 13th Annual Conference of the International Speech Communication Association 2012 Interspeech 2012, Vol. 3 (pp 2637-2640)
Gonzalez JA, Peinado AM, Gomez AM, Ma N & Barker J (2012) Combining missing-data reconstruction and uncertainty decoding for robust speech recognition. 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 4693-4696), 25 March 2012 - 30 March 2012.
González JA, Peinado AM, Gómez AM, Ma N & Barker J (2012) Combining missing-data reconstruction and uncertainty decoding for robust speech recognition. Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on (pp 4693-4696). IEEE
Ma N, Barker J, Christensen H & Green P (2011) Binaural cues for fragment-based speech recognition in reverberant multisource environments. 12th Annual Conference of the International Speech Communication Association 2011 (Interspeech 2011), Vols 1-5 (pp 1668-1671)
Ma N, Barker J, Christensen H & Green P (2011) Recent advances in fragment-based speech recognition in reverberant multisource environments. Proceedings of ISCA Workshop on Machine Listening in Multisource Environments (pp 68-73)
Ma N, Barker J, Christensen H & Green P (2011) Binaural cues for fragment-based speech recognition in reverberant multisource environments. Proceedings of INTERSPEECH 2011 (pp 1657-1660)
Morales-Cordovilla JA, Ma N, Sanchez V, Carmona JL, Peinado AM & Barker J (2011) A pitch based noise estimation technique for robust speech recognition with Missing Data. 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 4808-4811), 22 May 2011 - 27 May 2011.
Ma N, Barker J, Christensen H & Green P (2011) Incorporating localisation cues in a fragment decoding framework for distant binaural speech recognition. 2011 Joint Workshop on Hands-free Speech Communication and Microphone Arrays (pp 207-212), 30 May 2011 - 1 June 2011.
Ma N, Barker J, Christensen H & Green P (2010) Distant microphone speech recognition in a noisy indoor environment: combining soft missing data and speech fragment decoding. ISCA Workshop on Statistical and Perceptual Audio Processing (SAPA 2010)
Christensen H, Barker J, Ma N & Green P (2010) The CHiME corpus: A resource and a challenge for computational hearing in multisource environments. Proceedings of the 11th Annual Conference of the International Speech Communication Association Interspeech 2010 (pp 1918-1921)
Ma N, Bartels CD, Bilmes JA & Green PD (2009) Modelling the prepausal lengthening effect for speech recognition: a dynamic Bayesian network approach. 2009 IEEE International Conference on Acoustics, Speech and Signal Processing (pp 4617-4620), 19 April 2009 - 24 April 2009.
Christensen H, Ma N, Wrigley SN & Barker J (2009) A speech fragment approach to localising multiple speakers in reverberant environments. 2009 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vols 1-8, Proceedings (pp 4593-4596)
Ma N, Bartels C, Bilmes J & Green P (2009) Modelling the prepausal lengthening effect for speech recognition: A dynamic Bayesian network approach. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing. Taipei
Ma N & Green P (2008) A 'speechiness' measure to improve speech decoding in the presence of other sound sources. Proceedings of the Annual Conference of the International Speech Communication Association Interspeech (pp 1285-1288)
Ma N & Green P (2008) A 'speechiness' measure to improve speech decoding in the presence of other sound sources. Interspeech 2008: 9th Annual Conference of the International Speech Communication Association 2008, Vols 1-5 (pp 1285-1288)
Christensen H, Ma N, Wrigley SN & Barker J (2007) Integrating pitch and localisation cues at a speech fragment level. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Vols 1-4 (pp 2752-2755)
Ma N, Barker J & Green P (2007) Applying word duration constraints by using unrolled HMMs. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Vols 1-4 (pp 353-356)
Barker J, Coy A, Ma N & Cooke M (2006) Recent advances in speech fragment decoding techniques. Interspeech 2006 and 9th International Conference on Spoken Language Processing, Vols 1-5 (pp 85-88)
Ma N, Green P & Coy A (2006) Exploiting dendritic autocorrelogram structure to identify spectro-temporal regions dominated by a single sound source. Interspeech 2006 and 9th International Conference on Spoken Language Processing, Vols 1-5 (pp 669-672)
Ma N & Green P (2005) Context-dependent word duration modelling for robust speech recognition. 9th European Conference on Speech Communication and Technology (pp 2609-2612)
Xu X, Brown GJ & Ma N (2025) Sound-based sleep staging using pretrained speech foundation models. Proceedings of the 2025 47th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC). Copenhagen, Denmark, 14 July 2025 - 14 July 2025. View this article in WRRO
Tu Z, Ma N & Barker J (2022) Unsupervised Uncertainty Measures of Automatic Speech Recognition for Non-intrusive Speech Intelligibility Prediction. Interspeech 2022 (pp 3493-3497)
Tu Z, Ma N & Barker J (2022) Exploiting Hidden Representations from a DNN-based Speech Recogniser for Speech Intelligibility Prediction in Hearing-impaired Listeners. Interspeech 2022 (pp 3488-3492)
Ma N, Brown G & May T (2015) Exploiting deep neural networks and head movements for binaural localisation of multiple speakers in reverberant conditions. Interspeech, Vol. 2015 (pp 160-164). Dresden, Germany, 6 September 2015 - 6 September 2015. View this article in WRRO
Christensen H, Ma N, Wrigley SN & Barker J (2007) Integrating pitch and localisation cues at a speech fragment level. Interspeech 2007 (pp 2769-2772)
Ma N & Green P (2005) Context-dependent word duration modelling for robust speech recognition. INTERSPEECH. Lisbon

Datasets

Barker E, Barker J, Gaizauskas R, Ma N & Paramita M SNuC: The Sheffield Numbers Spoken Language Corpus.

Preprints

Ma N, Mirheidari B, Brown GJ, Muyoyeta M, Sanjase N, Maimbolwa MM, Chifwamba S, Muzazu S & Kagujje M (2026) Beyond isolated cough events: AI-based tuberculosis screening through temporal analysis of cough sounds, openRxiv.
Hughes A, Duddu V, Asokan N, Aletras N & Ma N (2025) PATCH: Mitigating PII Leakage in Language Models with Privacy-Aware Targeted Circuit PatcHing, arXiv.
Ma N, Mirheidari B, Brown GJ, Sanjase N, Maimbolwa MM, Chifwamba S, Muzazu S, Muyoyeta M & Kagujje M (2025) Deep Learning for Tuberculosis Screening in a High-burden Setting using Cough Analysis and Speech Foundation Models, arXiv.
Niu C, Rowe V, Brown GJ, Elphick H, Kenyon H, Thomas L, Johnson S & Ma N (2025) Transfer Learning for Paediatric Sleep Apnoea Detection Using Physiology-Guided Acoustic Models.
Hughes A, Ma N & Aletras N (2024) How Private are Language Models in Abstractive Summarization?, arXiv.
Tu Z, Deadman J, Ma N & Barker J (2022) Auditory-Based Data Augmentation for End-to-End Automatic Speech Recognition, arXiv.
Romero HE, Ma N, Brown GJ, Beeston AV & Hasan M (2019) Deep Learning Features for Robust Detection of Acoustic Events in Sleep-Disordered Breathing, arXiv.
Ma N, May T & Brown GJ (2019) Exploiting Deep Neural Networks and Head Movements for Robust Binaural Localisation of Multiple Sources in Reverberant Environments, arXiv.
Ma N, Gonzalez JA & Brown GJ (2019) Robust Binaural Localization of a Target Sound Source by Combining Spectral Source Models and Deep Neural Networks, arXiv.
Vecchiotti P, Ma N, Squartini S & Brown GJ (2019) End-to-end Binaural Sound Localisation from the Raw Waveform, arXiv.

Grants

GlucoVox - Exploring Voice as a Non-Invasive Biomarker of Blood Glucose in Diabetes, Department of Health and Social Care, 06/2026 - 06/2027, £500, as PI
Acoustic AI for Home Monitoring of Sleep Disordered Breathing (SDB) in Children with Down Syndrome (DS) and Obesity, Great Ormond Street, 10/2024 - 03/2026, £34,707, as PI
LungSight: Visual and Acoustic Screening for Early Detection of Lung Disease, EPSRC Sandpit, 08/2026 - 07/2029, £310,471, as PI
SafeSleep: Home-Based AI Screening for Obstructive Sleep Apnoea in High-Risk Surgical Patients, National Institute of Academic Anaesthesia, 12/2025 - 05/2027, £33,579, as PI
Advancing lung health in Zambia through increasing access to integrated and comprehensive screening, diagnosis and management of TB and other chronic respiratory diseases at community and primary care levels, Stop TB Partnership, 10/2024 - 05/2026, £30,364, as PI
Home Monitoring of Paediatric Sleep Disordered Breathing with Unobtrusive Sensors, MRC, 05/2024 - 05/2026, £74,822, as PI
Advance Acoustic AI Technology for Low-cost Tuberculosis Screening, EPSRC IAA programme, 04/2024 - 09/2025, £54,803, as PI
Speech and Acoustic Technology for Transgender Voice, Research England, 04/2023 - 06/2023, £5,000, as PI
AI-Enabled Cough Sound Analysis for Tuberculosis Screening, EPSRC IAA programme, 03/2023 - 10/2023, £27,434, as PI
Monitoring sleep disordered breathing of long-Covid patients at home using acoustic AI Technology, Research England, 01/2022 - 07/2022, £71,222, as PI
Artificial Musical Intelligence (AMI): Building Relationships and Identifying Use Cases with Creative Practitioners, Research England, 12/2021 - 06/2023, £19,820, as Co-I
SOMNUS: Sleep disOrder MoNitoring by Unobtrusive Sensors, Innovate UK, 07/2021 - 11/2023, £230,649, as Co-I
Making Elektra, Research England, 02/2021 - 04/2021, £6,236, as Co-I
Brahms: Breathing Resistance Assessment via Home Monitoring of Sleep, Innovate UK, 06/2019 - 02/2021, £109,600, as Co-I
MAI: Musical Artificial Intelligence, HEFCE, 02/2019 - 05/2020, £53,408, as Co-I

Professional activities and memberships

I am on the Technical Programme Committee for INTERSPEECH 2023 and 2024 as the Lead Area Chair for Speech, voice, and hearing disorders. I regularly review manuscripts and grants for a range of journals and funders.
Member of the British Sleep Society
Member of the British Thoracic Society
Insigneo Institute Research Theme Co-Director for Healthcare data/AI
