Dr Ning Ma
School of Computer Science
Lecturer in Medical Computing
Member of the Pervasive Computing research group
Member of the Speech and Hearing research group
+44 114 222 1839
Full contact details
School of Computer Science
Regent Court (DCS)
211 Portobello
Sheffield
S1 4DP
- Profile
-
Ning is a Lecturer in Medical Computing at the Department of Computer Science, University of Sheffield, and also an Academic Directorate of Medical Imaging and Medical Physics at the Sheffield Teaching Hospitals NHS Foundation Trust. Before that he was a Research Fellow in Computer Science working on health-related research projects. His first degree was in Computer Science from South China University of Technology and he has a PhD in hearing inspired automatic speech processing from the University of Sheffield.
Ning’s research interests lie in speech and hearing technologies, machine learning and healthcare. In particular, his research interests focus on development of AI systems that can interpret sounds and low-cost sensor data and extract useful information for screening health issues, such as sleep-disordered breathing and respiratory diseases. He has been PI and Co-PI of several UKRI and HEIF grants on acoustic monitoring of sleep-disordered breathing and cough sound analysis for tuberculosis screening. He is also interested in music AI technology and its link with mental health.
Ning has published 60+ refereed journals and conference papers. He is on the Technical Programme Committee for INTERSPEECH 2023 and 2024 as the Lead Area Chair for Speech, voice, and hearing disorders. He regularly reviews manuscripts and grants for a range of journals and funders.
Ning is a Insigneo Institute Research Theme Co-Director for Healthcare data/AI. He is a member of the British Sleep Society, the British Thoracic Society and IEEE.
- Research interests
-
- Acoustic monitoring for healthcare, including sleep disordered breathing and respiratory conditions
- Multimodal machine learning for health applications
- Speech and hearing technology
- Hearing impairment and cochlear implant processing
- Publications
-
Show: Featured publications All publications
Featured publications
Journal articles
- Acoustic screening for obstructive sleep apnea in home environments based on deep neural networks. IEEE Journal of Biomedical and Health Informatics, 26(7), 2941-2950. View this article in WRRO
- Benefits to Speech Perception in Noise From the Binaural Integration of Electric and Acoustic Signals in Simulated Unilateral Deafness. Ear and Hearing, 37(3), 248-259. View this article in WRRO
- Exploiting Deep Neural Networks and Head Movements for Robust Binaural Localisation of Multiple Sources in Reverberant Environments . IEEE Transactions on Audio, Speech, and Language Processing. View this article in WRRO
Conference proceedings papers
- Robust binaural sound localisation with temporal attention. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Proceedings. Rhodes Island, Greece, 4 June 2023 - 4 June 2023. View this article in WRRO
- Auditory-Based Data Augmentation for end-to-end Automatic Speech Recognition. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 23 May 2022 - 27 May 2022.
- Optimising hearing aid fittings for speech in noise with a differentiable hearing loss model. Interspeech 2021 (pp 691-695). Brno, Czechia, 30 August 2021 - 30 August 2021. View this article in WRRO
- Exploiting Non-Negative Matrix Factorization for Binaural Sound Localization in the Presence of Directional Interference. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 6 June 2021 - 11 June 2021.
- 0573 Screening for obstructive sleep apnea at home based on deep learning features derived from respiration sounds. Sleep, Vol. 43(Supplement_1) (pp a219-a220). Philadelphia, PA, USA (online conference), 27 August 2020 - 27 August 2020. View this article in WRRO
- Snorer diarisation based on deep neural network embeddings. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Barcelona, Spain (virtual conference), 4 May 2020 - 4 May 2020. View this article in WRRO
- SCREENING FOR OBSTRUCTIVE SLEEP APNEA AT HOME BASED ON DEEP LEARNING FEATURES DERIVED FROM RESPIRATION SOUNDS. SLEEP, Vol. 43 (pp A219-A220)
- Obstructive sleep apnea screening with breathing sounds and respiratory effort: a multimodal deep learning approach. INTERSPEECH 2023
- Unsupervised Uncertainty Measures of Automatic Speech Recognition for Non-intrusive Speech Intelligibility Prediction. Interspeech 2022
- Exploiting Hidden Representations from a DNN-based Speech Recogniser for Speech Intelligibility Prediction in Hearing-impaired Listeners. Interspeech 2022
All publications
Journal articles
- Acoustic screening for obstructive sleep apnea in home environments based on deep neural networks. IEEE Journal of Biomedical and Health Informatics, 26(7), 2941-2950. View this article in WRRO
- Auditory-Based Data Augmentation for End-to-End Automatic Speech Recognition.. CoRR, abs/2204.04284.
- End-to-end Binaural Sound Localisation from the Raw Waveform.. CoRR, abs/1904.01916.
- Robust Binaural Localization of a Target Sound Source by Combining Spectral Source Models and Deep Neural Networks. IEEE/ACM Transactions on Audio, Speech and Language Processing, 26(11), 2122-2131. View this article in WRRO
- Spectral Reconstruction and Noise Model Estimation Based on a Masking Model for Noise Robust Speech Recognition. Circuits, Systems, and Signal Processing. View this article in WRRO
- Benefits to Speech Perception in Noise From the Binaural Integration of Electric and Acoustic Signals in Simulated Unilateral Deafness. Ear and Hearing, 37(3), 248-259. View this article in WRRO
- Speech spectral envelope enhancement by HMM-based analysis/resynthesis. IEEE Signal Processing Letters, 20(6), 563-566.
- MMSE-based missing-feature reconstruction with temporal modeling for robust speech recognition. IEEE Transactions on Audio, Speech and Language Processing, 21(3), 624-635.
- A hearing-inspired approach for distant-microphone speech recognition in the presence of multiple sources.. Computer Speech and Language.
- The PASCAL CHiME speech separation and recognition challenge. Computer Speech and Language.
- Combining speech fragment decoding and adaptive noise floor modelling.. IEEE Transactions on Audio, Speech and Language Processing, 20, 818-827.
- Speech fragment decoding techniques for simultaneous speaker identification and speech recognition. COMPUT SPEECH LANG, 24(1), 94-111.
- Improving source localisation in multi‐source, reverberant conditions: exploiting local spectro‐temporal location cues. The Journal of the Acoustical Society of America, 123(5), 3294-3294.
- Exploiting correlogram structure for robust speech recognition with multiple speech sources. SPEECH COMMUN, 49(12), 874-891. View this article in WRRO
- Exploiting Deep Neural Networks and Head Movements for Robust Binaural Localisation of Multiple Sources in Reverberant Environments . IEEE Transactions on Audio, Speech, and Language Processing. View this article in WRRO
Conference proceedings papers
- Robust binaural sound localisation with temporal attention. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Proceedings. Rhodes Island, Greece, 4 June 2023 - 4 June 2023. View this article in WRRO
- Auditory-Based Data Augmentation for end-to-end Automatic Speech Recognition. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 23 May 2022 - 27 May 2022.
- SNuC: The Sheffield Numbers Spoken Language Corpus. 2022 Language Resources and Evaluation Conference, LREC 2022 (pp 1978-1984)
- Optimising hearing aid fittings for speech in noise with a differentiable hearing loss model. Interspeech 2021 (pp 691-695). Brno, Czechia, 30 August 2021 - 30 August 2021. View this article in WRRO
- DHASP: Differentiable Hearing Aid Speech Processing, Vol. 00 (pp 296-300)
- Exploiting Non-Negative Matrix Factorization for Binaural Sound Localization in the Presence of Directional Interference. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 6 June 2021 - 11 June 2021.
- AMI – Creating musical compositions with a coherent long-term structure. AISB Convention 2021: Communication and Conversations
- AMI – Creating Coherent Musical Composition with Attention. ICMC 2021 - Proceedings of the International Computer Music Conference 2021 (pp 414-418)
- 0573 Screening for obstructive sleep apnea at home based on deep learning features derived from respiration sounds. Sleep, Vol. 43(Supplement_1) (pp a219-a220). Philadelphia, PA, USA (online conference), 27 August 2020 - 27 August 2020. View this article in WRRO
- Snorer diarisation based on deep neural network embeddings. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Barcelona, Spain (virtual conference), 4 May 2020 - 4 May 2020. View this article in WRRO
- SCREENING FOR OBSTRUCTIVE SLEEP APNEA AT HOME BASED ON DEEP LEARNING FEATURES DERIVED FROM RESPIRATION SOUNDS. SLEEP, Vol. 43 (pp A219-A220)
- Deep learning features for robust detection of acoustic events in sleep-disordered breathing. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP-2019). Brighton, UK, 12 May 2019 - 17 May 2019. View this article in WRRO
- End-to-end binaural sound localisation from the raw waveform. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP-2019). Brighton, UK, 12 May 2019 - 17 May 2019. View this article in WRRO
- Deep Learning Features for Robust Detection of Acoustic Events in Sleep-disordered Breathing.. ICASSP (pp 810-814)
- End-to-end Binaural Sound Localisation from the Raw Waveform.. ICASSP (pp 451-455)
- Improving audio-visual speech recognition using deep neural networks with dynamic stream reliability estimates. 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 5 March 2017 - 9 March 2017. View this article in WRRO
- A robust dual-microphone speech source localization algorithm for reverberant environments. Proceedings of INTERSPEECH 2016 View this article in WRRO
- Robust audiovisual speech recognition using noise-adaptive linear discriminant analysis. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Vol. 2016-May (pp 2797-2801) View this article in WRRO
- Exploiting synchrony spectra and deep neural networks for noise-robust automatic speech recognition. 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 13 December 2015 - 17 December 2015. View this article in WRRO
- View this article in WRRO Exploiting top-down source models to improve binaural localisation of multiple sources in reverberant environments. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 2015-January (pp 160-164)
- View this article in WRRO Exploiting deep neural networks and head movements for binaural localisation of multiple speakers in reverberant conditions. Proceedings of Interspeech 2015 (pp 160-164). Dresden, Germany, 6 September 2015 - 10 September 2015.
- Robust localisation of multiple speakers exploiting head movements and multi-conditional training of binaural cues. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Brisbane, 19 April 2015 - 24 April 2015. View this article in WRRO
- A machine-hearing system exploiting head movements for binaural sound localisation in reverberant conditions. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 19 April 2015 - 24 April 2015. View this article in WRRO
- A MACHINE-HEARING SYSTEM EXPLOITING HEAD MOVEMENTS FOR BINAURAL SOUND LOCALISATION IN REVERBERANT CONDITIONS. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP) (pp 2699-2703)
- ROBUST LOCALISATION OF MULTIPLE SPEAKERS EXPLOITING HEAD MOVEMENTS AND MULTI-CONDITIONAL TRAINING OF BINAURAL CUES. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP) (pp 2679-2683)
- View this article in WRRO Binaural sound source localisation using a Bayesian-network-based blackboard system and hypothesis-driven feedback. Proceedings of Forum Acusticum, Vol. 2014-January
- A fragment-decoding plus missing-data imputation system evaluated on the 2nd CHiME challenge. Proceedings of the 2nd CHiME Workshop on Machine Listening in Multisource Environments (pp 53-58)
- Log-spectral feature reconstruction based on an occlusion model for noise robust speech recognition. 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, Vol. 3 (pp 2629-2632)
- Combining missing-data reconstruction and uncertainty decoding for robust speech recognition. Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on (pp 4693-4696). IEEE
- Coupling identification and reconstruction of missing features for noise-robust automatic speech recognition. 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, Vol. 3 (pp 2637-2640)
- Combining missing-data reconstruction and uncertainty decoding for robust speech recognition. 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 25 March 2012 - 30 March 2012.
- Recent advances in fragment-based speech recognition in reverberant multisource environments.. Proceedings of ISCA Workshop on Machine Listening in Multisource Environments (pp 68-73)
- Binaural cues for fragment-based speech recognition in reverberant multisource environments. Proceedings of INTERSPEECH 2011 (pp 1657-1660)
- Incorporating localisation cues in a fragment decoding framework for distant binaural speech recognition. 2011 Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 30 May 2011 - 1 June 2011.
- Binaural cues for fragment-based speech recognition in reverberant multisource environments. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 (pp 1668-1671)
- A pitch based noise estimation technique for robust speech recognition with Missing Data. 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 22 May 2011 - 27 May 2011.
- Distant microphone speech recognition in a noisy indoor environment: combining soft missing data and speech fragment decoding.. ISCA Tutorial and Research Workshop on Statistical And Perceptual Audition
- The CHiME corpus: A resource and a challenge for computational hearing in multisource environments. Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010 (pp 1918-1921)
- Modelling the prepausal lengthening effect for speech recognition: A dynamic Bayesian network approach. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing. Taipei
- Modelling the prepausal lengthening effect for speech recognition: a dynamic Bayesian network approach. 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, 19 April 2009 - 24 April 2009.
- A SPEECH FRAGMENT APPROACH TO LOCALISING MULTIPLE SPEAKERS IN REVERBERANT ENVIRONMENTS. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS (pp 4593-4596)
- A 'speechiness' measure to improve speech decoding in the presence of other sound sources. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 1285-1288)
- A 'speechiness' measure to improve speech decoding in the presence of other sound sources. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5 (pp 1285-1288)
- Integrating pitch and localisation cues at a speech fragment level. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4 (pp 2752-2755)
- Applying word duration constraints by using unrolled HMMs. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4 (pp 353-356)
- Recent advances in speech fragment decoding techniques. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 (pp 85-88)
- Exploiting dendritic autocorrelogram structure to identify spectro-temporal regions dominated by a single sound source. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 (pp 669-672)
- Context-dependent word duration modelling for robust speech recognition. 9th European Conference on Speech Communication and Technology (pp 2609-2612)
- View this article in WRRO Acoustic effects of facial feminisation surgery on speech and singing: A case study. Processings of Interspeech 2024. Kos island, Greece, 1 September 2024 - 1 September 2024.
- View this article in WRRO SLUMBR: SLeep statUs estiMation from aBdominal Respiratory effort. Proceedings of the 46th Annual International Conference of the IEEE Engineering in Medicine & Biology Society. Orlando, Florida, 15 July 2024 - 15 July 2024.
- Obstructive sleep apnea screening with breathing sounds and respiratory effort: a multimodal deep learning approach. INTERSPEECH 2023
- Unsupervised Uncertainty Measures of Automatic Speech Recognition for Non-intrusive Speech Intelligibility Prediction. Interspeech 2022
- Exploiting Hidden Representations from a DNN-based Speech Recogniser for Speech Intelligibility Prediction in Hearing-impaired Listeners. Interspeech 2022
- Speech Localisation in a Multitalker Mixture by Humans and Machines. Interspeech 2016 View this article in WRRO
- Integrating pitch and localisation cues at a speech fragment level. Interspeech 2007
- Context-dependent word duration modelling for robust speech recognition. INTERSPEECH. Lisbon
Preprints
- Auditory-Based Data Augmentation for End-to-End Automatic Speech Recognition, arXiv.
- Deep Learning Features for Robust Detection of Acoustic Events in Sleep-Disordered Breathing, arXiv.
- Exploiting Deep Neural Networks and Head Movements for Robust Binaural Localisation of Multiple Sources in Reverberant Environments, arXiv.
- Robust Binaural Localization of a Target Sound Source by Combining Spectral Source Models and Deep Neural Networks, arXiv.
- End-to-end Binaural Sound Localisation from the Raw Waveform, arXiv.
- Grants
-
Research Grants
- Home Monitoring of Paediatric Sleep Disordered Breathing with Unobtrusive Sensors, MRC IAA scheme, 05/2024 - 10/2025, £74,822, as PI
- Advance Acoustic AI Technology for Low-cost Tuberculosis Screening, RCUK, 04/2024 - 06/2025, £113,972, as Co-I
- Speech and Acoustic Technology for Transgender Voice, Research England, 04/2023 - 06/2023, £5,000, as PI
- AI-Enabled Cough Sound Analysis for Tuberculosis Screening, EPSRC IAA programme, 03/2023 - 10/2023 £27,434, as PI
- Monitoring sleep disordered breathing of long-Covid patients at home using acoustic AI Technology, Research England HEIF, 01/2022 - 07/2022, £71,222, as PI
- Artificial Musical Intelligence (AMI): Building Relationships and Identifying Use Cases with Creative Practitioners, HEIF, 12/2021 - 06/2023, £19,820, as Co-I
- SOMNUS: Sleep disOrder MoNitoring by Unobtrusive Sensors, Innovate UK, 07/2021 - 11/2023, £230,649, as Co-I
- Making Elektra, Research England, 02/2021 - 04/2021, £6,236, as Co-I
- Brahms: Breathing Resistance Assessment via Home Monitoring of Sleep, Innovate UK, 06/2019 - 02/2021, £109,600, as Co-I
- MAI: Musical Artificial Intelligence, HEFCE, 02/2019 - 05/2020, £53,408, as Co-I
- Professional activities and memberships
-
- I am on the Technical Programme Committee for INTERSPEECH 2023 and 2024 as the Lead Area Chair for Speech, voice, and hearing disorders. I regularly review manuscripts and grants for a range of journals and funders.
- Member of the British Sleep Society
- Member of the British Thoracic Society
- Insigneo Institute Research Theme Co-Director for Healthcare data/AI