Professor Jon Barker

PhD

School of Computer Science

Personal Chair

School Ethics Lead

Member of the Speech and Hearing (SpandH) research group

j.p.barker@sheffield.ac.uk

Regent Court (CS)

Full contact details

Professor Jon Barker
School of Computer Science
Regent Court (CS)
211 Portobello
Sheffield
S1 4DP

Profile

Professor Jon Barker is a member of the Speech and Hearing Research Group. He has a first degree in Electrical and Information Sciences from Cambridge University, UK. After receiving a PhD from the University of Sheffield in 1999, he worked for some time at GIPSA-lab, Grenoble and IDIAP research institute in Switzerland before returning to Sheffield where he has had a permanent post since 2002.

His research interests lie in noise-robust speech processing. Key application areas include distant-microphone speech recognition, speech intelligibility prediction and improved speech processing for hearing-aid users.

Research interests

Professor Barker’s research interests are focused around machine listening and the computational modelling of human hearing. A recent focus has been on modelling speech intelligibility, ie can we predict whether or not a speech signal will be intelligible to a given listener?

This understanding will help us produce better signal processing for application such as hearing aids and cochlear implants. Another strand of his work is about taking insights gained from human auditory perception and using them to engineer robust automatic speech processing systems.

Publications

Journal articles

Roa-Dabike G, Cox TJ, Barker JP, Fazenda BM, Graetzer S, Vos RR, Akeroyd MA, Firth J, Whitmer WM, Bannister S & Greasley A (2026) The Cadenza lyric intelligibility prediction (CLIP) dataset. Data in Brief, 65, 112466-112466.
Bannister S, Firth J, Roa-Dabike G, Vos R, Whitmer W, Greasley AE, Graetzer S, Fazenda B, Cox T, Barker J & Akeroyd MA (2026) The First Cadenza Challenge: Perceptual Evaluation of Machine Learning Systems to Improve Audio Quality of Popular Music for Those with Hearing Loss. Trends in Hearing, 30.
Yue Z, Loweimi E, Cvetkovic Z, Barker J & Christensen H (2026) Raw acoustic-articulatory multimodal dysarthric speech recognition. Computer Speech & Language, 95, 101839-101839.
SUTHERLAND R, CLARKE J, ELGHAZALY H, KUEBERT T, LUGGER M, PETRAUSCH S, ORTIZ JA, XU B, GOETZE S & BARKER JON (2025) Descriptor: Enhancing Conversations for the Hearing Impaired in the 9th Computational Hearing in Multisource Environments Challenge (CHiME9 ECHI). IEEE Data Descriptions, 1-9.
Roa-Dabike G, Akeroyd MA, Bannister S, Barker JP, Cox TJ, Fazenda B, Firth J, Graetzer S, Greasley A, Vos RR & Whitmer WM (2025) The First Cadenza Challenges: Using Machine Learning Competitions to Improve Music for Listeners With a Hearing Loss. IEEE Open Journal of Signal Processing, 6, 722-734.
Roa G, Bannister S, Firth JL, Graetzer S, Vos R, Akeroyd MA, Barker JP, Cox TJ, Fazenda B, Greasley A & Whitmer WM (2025) The second Cadenza machine learning challenge (CAD2): Improving music for people with hearing loss. The Journal of the Acoustical Society of America, 157(4_Supplement), A321-A321.
Leglaive S, Fraticelli M, ElGhazaly H, Borne L, Sadeghi M, Wisdom S, Pariente M, Hershey JR, Pressnitzer D & Barker JP (2025) Objective and subjective evaluation of speech enhancement methods in the UDASE task of the 7th CHiME challenge. Computer Speech & Language, 89, 101685-101685.
Roa Dabike G, Cox TJ, Miller AJ, Fazenda BM, Graetzer S, Vos RR, Akeroyd MA, Firth J, Whitmer WM, Bannister S , Greasley A et al (2024) The cadenza woodwind dataset: Synthesised quartets for music information retrieval and machine learning. Data in Brief, 57, 111199-111199.
Whitmer WM, McShefferty D, Akeroyd MA, Bannister S, Barker JP, Cox TJ, Roa G, Fazenda B, Firth JL, Graetzer S , Greasley A et al (2024) Lyric intelligibility of musical segments for older individuals with hearing loss. The Journal of the Acoustical Society of America, 156(4_Supplement), A121-A121.
Akeroyd MA, Bannister S, Barker JP, Cox TJ, Roa G, Fazenda B, Firth JL, Graetzer S, Greasley A, Vos R & Whitmer WM (2024) Development of the 2nd Cadenza challenge for improving music listening for people with a hearing loss. The Journal of the Acoustical Society of America, 155(3_Supplement), A277-A277.
Bannister S, Greasley AE, Cox TJ, Akeroyd MA, Barker J, Fazenda B, Firth J, Graetzer SN, Roa Dabike G, Vos RR & Whitmer WM (2024) Muddy, muddled, or muffled? Understanding the perception of audio quality in music by hearing aid users. Frontiers in Psychology, 15. View this article in WRRO
Firth JL, Cox TJ, Greasley A, Barker JP, Whitmer WM, Fazenda B, Bannister S, Graetzer S, Vos R, Roa G & Akeroyd MA (2023) A systematic review of measurements of real-world interior car noise for the “Cadenza” machine-learning project. The Journal of the Acoustical Society of America, 153(3_supplement), A332-A332.
Akeroyd MA, Firth JL, Naylor G, Barker JP, Culling J, Cox TJ, Bailey W, Graetzer S, Viveros Muñoz R, Porter E & Griffiths H (2023) Results of the second “clarity” enhancement challenge for hearing devices. The Journal of the Acoustical Society of America, 153(3_supplement), A48-A48.
Graetzer S, Akeroyd MA, Barker J, Cox TJ, Culling JF, Naylor G, Porter E & Viveros-Muñoz R (2022) Dataset of British English speech recordings for psychoacoustics and speech processing research: The clarity speech corpus. Data in Brief, 41.
Akeroyd MA, Barker JP, Cox TJ, Culling J, Graetzer S, Naylor G, Porter E & Viveros Muñoz R (2020) Launching the first “Clarity” Machine Learning Challenge to revolutionise hearing device processing. The Journal of the Acoustical Society of America, 148(4_Supplement), 2711-2711.
Graetzer S, Akeroyd M, Barker JP, Cox TJ, Culling JF, Naylor G, Porter E & Muñoz RV (2020) Clarity: Machine Learning Challenges to Revolutionise Hearing Device Processing.
Cooke M, Garcia Lecumberri ML, Barker J & Marxer R (2019) Lexical frequency effects in English and Spanish word misperceptions. Journal of the Acoustical Society of America, 145(2), EL136-EL141. View this article in WRRO
Alghamdi N, Maddock S, Marxer R, Barker J & Brown GJ (2018) A corpus of audio-visual Lombard speech with frontal and profile views. Journal of the Acoustical Society of America, 143(6), 523-529. View this article in WRRO
Marxer R, Barker JP, Alghamdi N & Maddock S (2018) The impact of the Lombard effect on audio and visual speech recognition systems. Speech Communication, 100, 58-68. View this article in WRRO
Alghamdi N, Maddock S, Barker J & Brown GJ (2017) The impact of automatic exaggeration of the visual articulatory features of a talker on the intelligibility of spectrally distorted speech. Speech Communication, 95, 127-136. View this article in WRRO
Vincent E, Watanabe S, Nugraha AA, Barker J & Marxer R (2017) An analysis of environment, microphone and data simulation mismatches in robust speech recognition. Computer Speech & Language, 46, 535-557. View this article in WRRO
Barker JP (2017) Evaluation of scene analysis using real and simulated acoustic mixtures: Lessons learnt from the CHiME speech recognition challenges. The Journal of the Acoustical Society of America, 141(5_Supplement), 3693-3693.
Barker J, Marxer R, Vincent E & Watanabe S (2017) Guest Editorial for the special issue on Multi-Microphone Speech Recognition in Everyday Environments. Computer Speech & Language, 46, 386-387. View this article in WRRO
Gonzalez JA, Gómez AM, Peinado AM, Ma N & Barker J (2017) Spectral Reconstruction and Noise Model Estimation Based on a Masking Model for Noise Robust Speech Recognition. Circuits, Systems, and Signal Processing, 36, 3731-3760. View this article in WRRO
Barker J, Marxer R, Vincent E & Watanabe S (2016) The third 'CHiME' speech separation and recognition challenge: Analysis and outcomes. Computer Speech and Language. View this article in WRRO
Marxer R, Barker J, Cooke M & Garcia Lecumberri ML (2016) A corpus of noise-induced word misperceptions for English. Journal of the Acoustical Society of America, 140(5), EL458-EL463.
Vincent E, Barker J, Watanabe S, Le Roux J, Nesta F & Matassoni M (2013) The second 'CHiME' speech separation and recognition challenge: An overview of challenge systems and outcomes. 2013 IEEE Workshop on Automatic Speech Recognition and Understanding Asru 2013 Proceedings, 162-167.
Carmona JL, Barker J, Gomez AM & Ma N (2013) Speech spectral envelope enhancement by HMM-based analysis/resynthesis. IEEE Signal Processing Letters, 20(6), 563-566.
Cooke M, Barker J & Lecumber MLG (2013) An Overview, 137-172.
González JA, Peinado AM, Ma N, Gómez AM & Barker J (2013) MMSE-based missing-feature reconstruction with temporal modeling for robust speech recognition. IEEE Transactions on Audio Speech and Language Processing, 21(3), 624-635.
Barker J & Vincent E (2012) Special issue on speech separation and recognition in multisource environments. Computer Speech and Language.
Barker J, Vincent E, Ma N, Christensen H & Green P (2012) The PASCAL CHiME speech separation and recognition challenge. Computer Speech and Language.
Ma N, Barker J, Christensen H & Green P (2012) A hearing-inspired approach for distant-microphone speech recognition in the presence of multiple sources. Computer Speech and Language.
Ma N, Barker J, Christensen H & Green P (2012) Combining speech fragment decoding and adaptive noise floor modelling.. IEEE Transactions on Audio, Speech and Language Processing, 20, 818-827.
Cooke M, Barker J, Lecumberri MLG & Wasilewski K (2011) Crowdsourcing for word recognition in noise. Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, 3049-3052.
Barker J, Ma N, Coy A & Cooke M (2010) Speech fragment decoding techniques for simultaneous speaker identification and speech recognition. COMPUT SPEECH LANG, 24(1), 94-111.
Barker J & Shao X (2009) Energetic and Informational Masking Effects in an Audiovisual Speech Recognition System. IEEE T AUDIO SPEECH, 17(3), 446-458.
Christensen H, Ma N, Wrigley SN & Barker J (2008) Improving source localisation in multi-source, reverberant conditions: exploiting local spectro-temporal location cues.. Abstract for Acoust. Soc. Am. mtg.
Shao X & Barker J (2008) Stream weight estimation for multistream audio-visual speech recognition in a multispeaker environment. SPEECH COMMUN, 50(4), 337-353.
Cooke M, Garcia Lecumberri ML & Barker J (2008) The foreign language cocktail party problem: Energetic and informational masking effects in non-native speech perception.. J Acoust Soc Am, 123(1), 414-427.
Ma N, Green P, Barker J & Coy A (2007) Exploiting correlogram structure for robust speech recognition with multiple speech sources. SPEECH COMMUN, 49(12), 874-891.
Barker J & Cooke M (2007) Modelling speaker intelligibility in noise. SPEECH COMMUN, 49(5), 402-417.
Coy A & Barker J (2007) An automatic speech recognition system based on the scene analysis account of auditory perception. SPEECH COMMUN, 49(5), 384-401.
Cooke M, Barker J, Cunningham S & Shao X (2006) An audio-visual corpus for speech perception and automatic speech recognition.. J Acoust Soc Am, 120(5 Pt 1), 2421-2424.
Harding S, Barker J & Brown GJ (2006) Mask estimation for missing data speech recognition based on statistics of binaural interaction. IEEE T AUDIO SPEECH, 14(1), 58-67.
Barker JP, Cooke MP & Ellis DPW (2005) Decoding speech in the presence of other sources. SPEECH COMMUN, 45(1), 5-25.
Palomaki KJ, Brown GJ & Barker JP (2004) Techniques for handling convolutional distortion with 'missing data' automatic speech recognition. SPEECH COMMUN, 43(1-2), 123-142.
Ellis D & Barker J (2003) Machine recognition of sounds in mixtures. The Journal of the Acoustical Society of America, 113(4_Supplement), 2230-2230.
Barker J & Cooke M (1999) Is the sine-wave speech cocktail party worth attending?. Speech Communication, 27, 159-174.
Barker J & Cooke M (1999) Is the sine-wave speech cocktail party worth attending?. SPEECH COMMUNICATION, 27(3-4), 159-174.
Barker J & Cooke M (1997) Modelling the recognition of sine-wave sentences. BRITISH JOURNAL OF AUDIOLOGY, 31(2), 112-113.
Barker JP & Cooke MP (1996) Modeling the recognition of sine-wave sentences. Journal of the Acoustical Society of America, 100, 2682-2682.
Yue Z, Loweimi E, Christensen H, Barker J & Cvetkovic Z () Acoustic modelling from raw source and filter components for dysarthric speech recognition. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 30, 2968-2980. View this article in WRRO

Book chapters

Barker JP, Marxer R, Vincent E & Watanabe S (2017) The CHiME Challenges: Robust Speech Recognition in Everyday Environments, New Era for Robust Speech Recognition (pp. 327-344). Springer International Publishing
Mandel MI & Barker JP (2017) Multichannel Spatial Clustering Using Model-Based Source Separation, New Era for Robust Speech Recognition (pp. 51-77). Springer International Publishing
Cooke M, Barker J & Lecumberri MLG (2013) Crowdsourcing in Speech Perception In Eskanazi M, Levow G-A, Meng H, Parent G & Sundermann D (Ed.), Crowdsourcing for Speech Processing (pp. 137-169). John Wiley and Sons
Barker J (2012) Missing Data Techniques: Recognition with Incomplete Spectrograms In Virtanen T, Singh R & Raj B (Ed.), Techniques for Noise Robustness in Automatic Speech Recognition (pp. 371-398). Wiley
Barker J (2006) Robust automatic speech recognition In Wang D-L & Brown GJ (Ed.), Computational Auditory Scene Analysis: Principals, Algorithms and Applications (pp. 297-350). Wiley/IEEE Press

Conference proceedings

Roa-Dabike G, Barker JP, Cox TJ, Akeroyd MA, Bannister S, Fazenda B, Firth J, Graetzer S, Greasley A, Vos RR & Whitmer WM (2026) Overview of the ICASSP 2026 Cadenza Challenge: Predicting Lyric Intelligibility. ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 21757-21759), 3 May 2026 - 8 May 2026.
Sutherland R, Close G, Hain T, Goetze S & Barker J (2024) Using speech foundational models in loss functions for hearing aid speech enhancement. Proceedings of 2024 32nd European Signal Processing Conference (EUSIPCO) (pp 421-425). Lyon, France, 26 August 2024 - 26 August 2024. View this article in WRRO
Barker J, Akeroyd MA, Bailey W, Cox TJ, Culling JF, Firth J, Graetzer S & Naylor G (2024) The 2nd Clarity Prediction Challenge: A Machine Learning Challenge for Hearing Aid Intelligibility Prediction. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 11551-11555), 14 April 2024 - 19 April 2024.
Dabike GR, Akeroyd MA, Bannister S, Barker J, Cox TJ, Fazenda B, Firth J, Graetzer S, Greasley A, Vos RR & Whitmer WM (2024) The ICASSP SP Cadenza Challenge: Music Demixing/Remixing for Hearing Aids. 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW) (pp 93-94), 14 April 2024 - 19 April 2024.
Mogridge R, Close G, Sutherland R, Hain T, Barker J, Goetze S & Ragni A (2024) Non-Intrusive Speech Intelligibility Prediction for Hearing-Impaired Users Using Intermediate ASR Features and Human Memory Models.. ICASSP (pp 306-310)
Akeroyd MA, Bailey W, Barker J, Cox TJ, Culling JF, Graetzer S, Naylor G, Podwińska Z & Tu Z (2023) The 2nd Clarity Enhancement Challenge for Hearing Aid Speech Intelligibility Enhancement: Overview and Outcomes. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 1-5), 4 June 2023 - 10 June 2023.
Cox TJ, Barker J, Bailey W, Graetzer S, Akeroyd MA, Culling JF & Naylor G (2023) Overview of the 2023 ICASSP SP Clarity Challenge: Speech Enhancement for Hearing Aids. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 1-2), 4 June 2023 - 10 June 2023.
Cox T, Akeroyd M, Barker J, Culling J, Firth J, Graetzer S, Griffiths H, Harris L, Viveros Munoz R, Naylor G , Podwinska Z et al (2023) Predicting Speech Intelligibility for People with a Hearing Loss: The Clarity Challenges. INTER-NOISE and NOISE-CON Congress and Conference Proceedings, Vol. 265(3) (pp 4599-4606)
Dabike GR, Bannister S, Firth J, Graetzer S, Vos R, Akeroyd MA, Barker J, Cox TJ, Fazenda B, Greasley A & Whitmer W (2023) The First Cadenza Signal Processing Challenge: Improving Music for Those With a Hearing Loss. Ceur Workshop Proceedings, Vol. 3528
Barker E, Barker J, Gaizauskas R, Ma N & Paramita ML (2022) SNuC: The Sheffield Numbers Spoken Language Corpus. Proceedings of the Thirteenth Language Resources and Evaluation Conference (pp 1978-1984). Marseille, France, 20 June 2022 - 20 June 2022. View this article in WRRO
Deadman J & Barker J (2022) Improved Simulation of Realistically-Spatialised Simultaneous Speech Using Multi-Camera Analysis in The Chime-5 Dataset. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 591-595), 23 May 2022 - 27 May 2022.
Yue Z, Loweimi E, Cvetkovic Z, Christensen H & Barker J (2022) Multi-Modal Acoustic-Articulatory Feature Fusion For Dysarthric Speech Recognition. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 7372-7376), 23 May 2022 - 27 May 2022.
Tu Z, Deadman J, Ma N & Barker J (2022) Auditory-Based Data Augmentation for end-to-end Automatic Speech Recognition. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 7447-7451), 23 May 2022 - 27 May 2022.
Yue Z, Barker J, Christensen H, McKean C, Ashton E, Wren Y, Gadgil S & Bright R (2021) Parental spoken scaffolding and narrative skills in crowd-sourced storytelling samples of young children. Interspeech 2021 (pp 2946-2950). Brno, Czechia, 30 August 2021 - 30 August 2021. View this article in WRRO
Tu Z, Ma N & Barker J (2021) Optimising hearing aid fittings for speech in noise with a differentiable hearing loss model. Interspeech 2021 (pp 691-695). Brno, Czechia, 30 August 2021 - 3 September 2021.
Zhang J, Zorila C, Doddipatla R & Barker J (2021) Time-Domain Speech Extraction with Spatial Information and Multi Speaker Conditioning Mechanism. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 6084-6088), 6 June 2021 - 11 June 2021.
Tu Z, Ma N & Barker J (2021) DHASP: Differentiable Hearing Aid Speech Processing. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 296-300), 6 June 2021 - 11 June 2021.
Dabike GR & Barker J (2021) The use of Voice Source Features for Sung Speech Recognition. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 6513-6517), 6 June 2021 - 11 June 2021.
Falconer L, Coy A & Barker J (2021) Modelling the Effects of Hearing Aid Algorithms on Speech and Speaker Intelligibility as Perceived by Listeners with Simulated Sensorineural Hearing Impairment. SoutheastCon 2021 (pp 1-8), 10 March 2021 - 13 March 2021.
Yue Z, Christensen H & Barker J (2020) Autoencoder bottleneck features with multi-task optimisation for improved continuous dysarthric speech recognition. Proceedings of Interspeech 2020 (pp 4581-4585). Shanghai, China (Online), 25 October 2020 - 25 October 2020. View this article in WRRO
Xiong F, Barker J, Yue Z & Christensen H (2020) Source domain data selection for improved transfer learning targeting dysarthric speech recognition. Proceedings of the 45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020) (pp 7424-7428). Barcelona, Spain, 4 May 2020 - 4 May 2020. View this article in WRRO
Yue Z, Xiong F, Christensen H & Barker J (2020) Exploring appropriate acoustic and language modelling choices for continuous dysarthric speech recognition. Proceedings of the 45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020) (pp 6094-6098). Barcelona, Spain, 4 May 2020 - 4 May 2020. View this article in WRRO
Zhang J, Zorila C, Doddipatla R & Barker J (2020) On End-to-end Multi-channel Time Domain Speech Separation in Reverberant Environments. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4 May 2020 - 8 May 2020.
Xiong F, Barker J & Christensen H (2020) Deep learning of articulatory-based representations and applications for improving dysarthric speech recognition. Speech Communication 13th ITG Fachtagung Sprachkommunikation (pp 331-335)
Dabike GR & Barker J (2019) Automatic lyric transcription from karaoke vocal tracks: resources and a baseline system. Interspeech 2019 Proceedings (pp 579-583). Graz, Austria, 15 September 2019 - 15 September 2019. View this article in WRRO
Xiong F, Barker J & Christensen H (2019) Phonetic Analysis of Dysarthric Speech Tempo and Applications to Robust Personalised Dysarthric Speech Recognition. ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 5836-5840), 12 May 2019 - 17 May 2019.
Al Dabel M & Barker J (2019) AN OPTIMISATION APPROACH FOR ENHANCING SPEECH INTELLIGIBILITY USING TIME -VARYING SPECTRAL SHAPING IN NOISE. PROCEEDINGS OF THE 19TH INTERNATIONAL CONGRESS OF PHONETIC SCIENCES 2019, ICPHS 2019 (pp 2986-2990)
Loweimi E, Barker JP & Hain T (2018) Exploring the use of group delay for generalised VTS based noise compensation. 2018 IEEE International Conference on Acoustics, Speech and Signal Processing Proceedings. Calgary, Alberta, Canada, 15 April 2018 - 15 April 2018. View this article in WRRO
Loweimi E, Barker J & Hain T (2018) On the usefulness of the speech phase spectrum for pitch extraction. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 2018-S (pp 696-700). Hyderabad, India, 2 September 2018 - 2 September 2018. View this article in WRRO
Loweimi E, Barker J & Hain T (2017) Channel Compensation in the Generalised Vector Taylor Series Approach to Robust ASR. Interspeech 2017 (pp 2466-2470). Stockholm, 20 August 2017 - 20 August 2017. View this article in WRRO
Loweimi E, Barker J, Torralba OS & Hain T (2017) Robust Source-Filter Separation of Speech Signal in the Phase Domain. Proceedings of the Annual Conference of the International Speech Communication Association. Stockholm, 20 August 2017 - 20 August 2017. View this article in WRRO
Loweimi E, Barker J & Hain T (2017) Statistical Normalisation of Phase-based Feature Representation For Robust Speech Recognition. 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Vol. 2017. New Orleans View this article in WRRO
Abel A, Marxer R, Barker J, Watt R, Whitmer B, Derleth P & Hussain A (2016) A Data Driven Approach to Audiovisual Speech Mapping (pp 331-342)
Loweimi E, Barker J & Hain T (2016) Use of Generalised Nonlinearity in Vector Taylor Series Noise Compensation for Robust Speech Recognition. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 2016 (pp 3798-3802). San Fransisco, 8 September 2016 - 8 September 2016. View this article in WRRO
Loweimi E, Barker J & Hain T (2015) Source-filter Separation of Speech Signal in the Phase Domain. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5 (pp 598-602). Dresden, Germany, 6 September 2016 - 6 September 2016. View this article in WRRO
Barker J, Marxer R, Vincent E & Watanabe S (2015) The third ‘CHiME’ speech separation and recognition challenge: Dataset, task and baselines. 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) (pp 504-511), 13 December 2015 - 17 December 2015.
Ma N, Marxer R, Barker J & Brown GJ (2015) Exploiting synchrony spectra and deep neural networks for noise-robust automatic speech recognition.. ASRU 2015 Proceedings (pp 490-495). Scottsdale, Arizona, USA View this article in WRRO
Foster P, Sigtia S, Krstulovic S, Barker J & Plumbley MD (2015) Chime-home: A dataset for sound source recognition in a domestic environment. 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) (pp 1-5), 18 October 2015 - 21 October 2015.
Loweimi E, Doulaty M, Barker J & Hain T (2015) Long-Term Statistical Feature Extraction from Speech Signal and Its Application in Emotion Recognition (pp 173-184)
Alghamdi N, Maddock SC, Brown GJ & Barker J (2015) Investigating the Impact of Artificial Enhancement of Lip Visibility on the Intelligibility of Spectrally-Distorted Speech. FAAVSP-2015 (pp 93-98), 11 September 2015 - 13 September 2015.
Marxer R, Cooke M & Barker J (2015) A framework for the evaluation of microscopic intelligibility models. Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, Vol. 2015-January (pp 2558-2562)
Al Dabel M & Barker J (2014) Speech pre-enhancement using a discriminative microscopic intelligibility model. Proceedings of the Annual Conference of the International Speech Communication Association Interspeech (pp 2068-2072)
Vincent E, Barker J, Watanabe S, Roux JL, Nesta F & Matassoni M (2013) The second ‘CHiME’ Speech Separation and Recognition Challenge: Datasets, tasks and baselines. Proceedings of the 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE
Ma N & Barker J (2013) A fragment-decoding plus missing-data imputation system evaluated on the 2nd CHiME challenge. Proceedings of the 2nd CHiME Workshop on Machine Listening in Multisource Environments (pp 53-58)
Ma N & Barker J (2012) Coupling identification and reconstruction of missing features for noise-robust automatic speech recognition. 13th Annual Conference of the International Speech Communication Association 2012 Interspeech 2012, Vol. 3 (pp 2637-2640)
González JA, Peinado AM, Gómez AM, Ma N & Barker J (2012) Combining missing-data reconstruction and uncertainty decoding for robust speech recognition. ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings (pp 4693-4696)
Ma N, Barker J, Christensen H & Green P (2011) Binaural cues for fragment-based speech recognition in reverberant multisource environments. Proceedings of the Annual Conference of the International Speech Communication Association Interspeech (pp 1657-1660)
Ma N, Barker J, Christensen H & Green P (2011) Recent advances in fragment-based speech recognition in reverberant multisource environments.. Proceedings of ISCA Workshop on Machine Listening in Multisource Environments (pp 68-73)
Ma N, Barker J, Christensen H & Green P (2011) Binaural cues for fragment-based speech recognition in reverberant multisource environments. Proceedings of INTERSPEECH 2011 (pp 1657-1660)
Morales-Cordovilla JA, Ma N, Sánchez V, Carmona JL, Peinado AM & Barker J (2011) A pitch based noise estimation technique for robust speech recognition with missing data. ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings (pp 4808-4811)
Ma N, Barker J, Christensen H & Green P (2011) Incorporating localisation cues in a fragment decoding framework for distant binaural speech recognition.. IEEE Joint Workshop on Hands-Free Speech Communication and Microphone Arrays (HSCMA’11) (pp 207-212)
Cooke M, Barker J, Garcia Lecumberri ML & Wasilewski K (2011) Crowdsourcing for word recognition in noise. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 (pp 3056-+)
Kabir A, Barker J & Giurgiu M (2010) Robust Formant Estimation: Increasing the Reliability by Comparison among three Methods. Proceedings of the International Conference on Circuits, Systems and Signals, (Recent Advances in Circuits, Sistems and Signals) (pp 341-344)
Kabir A, Barker J & Giurgiu M (2010) An Approach to Vocal Tract Length Normalization by Robust Formant Estimation. Proceedings of the International Conference on Circuits, Systems and Signals, (Recent Advances in Circuits, Sistems and Signals) (pp 345-348)
Christensen H & Barker J (2010) Speaker turn tracking with mobile microphones: Combining location and pitch information. European Signal Processing Conference (pp 954-958)
Kabir A, Barker J & Giurgiu M (2010) Integrating Hidden Markov Model and PRAAT: A toolbox for robust automatic speech transcription. Proceedings of SPIE the International Society for Optical Engineering, Vol. 7745
Kabir A, Giurgiu M & Barker J (2010) Robust automatic transcription of english speech corpora. 2010 8th International Conference on Communications Comm 2010 (pp 79-82)
Ma N, Barker J, Christensen H & Green P (2010) Distant microphone speech recognition in a noisy indoor environment: combining soft missing data and speech fragment decoding.. ISCA Tutorial and Research Workshop on Statistical And Perceptual Audition
Christensen H, Barker J, Ma N & Green P (2010) The CHiME corpus: A resource and a challenge for computational hearing in multisource environments. Proceedings of the 11th Annual Conference of the International Speech Communication Association Interspeech 2010 (pp 1918-1921)
Christensen H, Ma N, Wrigley SN & Barker J (2009) A SPEECH FRAGMENT APPROACH TO LOCALISING MULTIPLE SPEAKERS IN REVERBERANT ENVIRONMENTS. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS (pp 4593-4596)
Christensen H & Barker J (2009) Using location cues to track speaker changes from mobile, binaural microphones. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 (pp 124-127)
Arnaud E, Christensen H, Lu Y-C, Barker J, Khalidov V, Hansard ME, Holveck B, Mathieu H, Narasimha R, Taillant E , Forbes F et al (2008) The CAVA corpus: synchronised stereoscopic and binaural datasets with head movements.. ICMI (pp 109-116)
Christensen H, Ma N, Wrigley SN & Barker J (2007) Integrating pitch and localisation cues at a speech fragment level. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4 (pp 2752-2755)
Barker J & Shao X (2007) Audio-visual speech fragment decoding. Proceedings of the International Conference on Auditory-Visual Speech Processing (AVSP 2007)
Ma N, Barker J & Green P (2007) Applying word duration constraints by using unrolled HMMs. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4 (pp 353-356)
Brown GJ, Harding S & Barker JP (2006) Speech separation based on the statistics of binaural auditory features. ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, Vol. 5 (pp V949-V952)
Palomäki KJ, Brown GJ & Barker JP (2006) Recognition of reverberant speech using full cepstral features and spectral missing data. ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, Vol. 1 (pp I289-I292)
Shao X & Barker J (2006) Audio-Visual Speech Recognition in the Presence of a Competing Speaker. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 (pp 1292-1295)
Coy A & Barker J (2006) A Multipitch Tracker for Monaural Speech Segmentation. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 (pp 1678-1681)
Barker J, Coy A, Ma N & Cooke M (2006) Recent advances in speech fragment decoding techniques. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 (pp 85-88)
Palomaki KJ, Brown GJ & Barker JP (2006) Recognition of reverberant speech using full cepstral features and spectral missing data. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13 (pp 289-292)
Brown GJ, Harding S & Barker JP (2006) Speech separation based on the statistics of binaural auditory features. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13 (pp 5807-5810)
Palomaki KJ, Brown GJ & Barker JP (2006) Recognition of reverberant speech using full cepstral features and spectral missing data. 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol I, Proceedings (pp 289-292). Toulouse, FRANCE, 14 May 2006 - 19 May 2006.
Brown GJ, Harding S & Barker JP (2006) Speech separation based on the statistics of binaural auditory features. 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol V, Proceedings (pp 949-952)
Coy A & Barker J (2005) Soft harmonic masks for recognising speech in the presence of a competing speaker. 9th European Conference on Speech Communication and Technology (pp 2641-2644)
Harding S, Barker J & Brown GJ (2005) Binaural feature selection for missing data speech recognition. 9th European Conference on Speech Communication and Technology (pp 1269-1272)
Barker J (2005) Tracking facial markers with an adaptive marker collocation model. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5 (pp 665-668)
Coy A & Barker J (2005) Recognising speech in the presence of a competing speaker using a 'speech fragment decoder'. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5 (pp 425-428)
Harding S, Barker J & Brown GJ (2005) Mask estimation based on sound localisation for missing data speech recognition. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5 (pp 537-540)
Brown GJ, Kalle Palomäki K & Barker J (2004) A Missing Data Approach for Robust Automatic Speech Recognition in the Presence of Reverberation. Proceedings of the 18th International Congress on Acoustics (ICA) (pp 449-452)
Barker J, Cooke M & Ellis D (2002) Temporal integration as a consequence of multi-source decoding. Proceedings of the ISCA Workshop on the Temporal Integration in the Perception of Speech (TIPS)
Palomaki KJ, Brown GJ & Barker J (2002) Missing data speech recognition in reverberant conditions. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS (pp 65-68)
Barker J, Cooke M & Ellis D (2001) Combining bottom-up and top-down constraints for robust ASR: The multisource decoder. Proceedings of Workshop on consistent and reliable acoustic cues for sound analysis (CRAC-01)
Morris AC, Barker J & Bourlard H (2001) From Missing Data to Maybe Useful Data: Soft Data Modelling for Noise Robust ASR. Proceedings of the Worshop on Innovation in Speech Processing (WISP 2001)
Green P, Barker J, Cooke M & Josifovski L (2001) Handling Missing and Unreliable Information in Speech Recognition. Proceedings of the 8th International Workshop on Artificial Intelligence and Statistics (AISTATS-2001)
Barker J, Green P & Cooke M (2001) Linking Auditory Scene Analysis and Robust ASR by Missing Data Techniques. Proceedings of the Worshop on Innovation in Speech Processing (WISP 2001)
Barker J, Cooke M & Green PD (2001) Robust ASR based on clean speech models: an evaluation of missing data techniques for connected digit recognition in noise.. INTERSPEECH (pp 213-217)
Brown GJ, Barker J & Wang DL (2001) A neural oscillator sound separator for missing data speech recognition. Proceedings of the International Joint Conference on Neural Networks, Vol. 4 (pp 2907-2912)
Brown GJ, Barker J & Wang DL (2001) A neural oscillator sound separator for missing data speech recognition. IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS (pp 2907-2912)
Barker J, Josifovski L, Cooke M & Green PD (2000) Soft decisions in missing data techniques for robust automatic speech recognition.. INTERSPEECH (pp 373-376)
Barker J, Cooke M & Ellis DPW (2000) Decoding speech in the presence of other sound sources.. INTERSPEECH (pp 270-273)
Barker JP & Berthommier F (1999) Evidence of correlation between acoustic and visual features of speech. Proc. ICPhS ’99
Barker JP & Berthommier F (1999) Estimation of speech acoustics from visual speech features: A comparison of linear and non-linear models. Proceedings of the ISCA Workshop on Auditory-Visual Speech Processing (AVSP) ’99
Barker JP, Berthommier F & Schwartz JL (1998) Is primitive AV coherence an aid to segment the scene?. Proceedings of the ISCA Workshop on Auditory-Visual Speech Processing (AVSP) ’98
Barker J, Williams G & Renals S (1998) Acoustic confidence measures for segmenting broadcast news.. ICSLP
Barker J & Cooke M (1997) Modelling the recognition of spectrally reduced speech.. EUROSPEECH
Zhang J, Zorila C, Doddipatla R & Barker J () On monoaural speech enhancement for automatic recognition of real noisy speech using mixture invariant training. Interspeech 2022
Yue Z, Loweimi E, Christensen H, Barker J & Cvetkovic Z () Dysarthric Speech Recognition From Raw Waveform with Parametric CNNs. Interspeech 2022
Deadman J & Barker J () Modelling Turn-taking in Multispeaker Parties for Realistic Data Simulation. Interspeech 2022 (pp 266-270)
Tu Z, Ma N & Barker J () Unsupervised Uncertainty Measures of Automatic Speech Recognition for Non-intrusive Speech Intelligibility Prediction. Interspeech 2022 (pp 3493-3497)
Tu Z, Ma N & Barker J () Exploiting Hidden Representations from a DNN-based Speech Recogniser for Speech Intelligibility Prediction in Hearing-impaired Listeners. Interspeech 2022 (pp 3488-3492)
Barker J, Akeroyd M, Cox TJ, Culling JF, Firth J, Graetzer S, Griffiths H, Harris L, Naylor G, Podwinska Z , Porter E et al () The 1st Clarity Prediction Challenge: A machine learning challenge for hearing aid intelligibility prediction. Interspeech 2022
Zhang J, Zorilă C, Doddipatla R & Barker J () Teacher-Student MixIT for Unsupervised and Semi-Supervised Speech Separation. Interspeech 2021 (pp 3495-3499)
Graetzer S, Barker J, Cox TJ, Akeroyd M, Culling JF, Naylor G, Porter E & Muñoz RV () Clarity-2021 Challenges: Machine Learning Challenges for Advancing Hearing Aid Processing. Interspeech 2021
Deadman J & Barker J () Simulating Realistically-Spatialised Simultaneous Speech Using Video-Driven Speaker Detection and the CHiME-5 Dataset. Interspeech 2020 (pp 349-353)
Watanabe S, Mandel M, Barker J, Vincent E, Arora A, Chang X, Khudanpur S, Manohar V, Povey D, Raj D , Snyder D et al () CHiME-6 Challenge: Tackling Multispeaker Speech Recognition for Unsegmented Recordings. 6th International Workshop on Speech Processing in Everyday Environments (CHiME 2020)
Barker J, Watanabe S, Vincent E & Trmal J () The Fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, Task and Baselines. Interspeech 2018
Gogate M, Adeel A, Marxer R, Barker J & Hussain A () DNN Driven Speaker Independent Audio-Visual Mask Estimation for Speech Separation. Interspeech 2018 (pp 2723-2727)
Marxer R & Barker J () Binary Mask Estimation Strategies for Constrained Imputation-Based Speech Enhancement. Interspeech 2017 (pp 1988-1992)
Mandel MI & Barker J () Multichannel Spatial Clustering for Robust Far-Field Automatic Speech Recognition in Mismatched Conditions. Interspeech 2016 (pp 1991-1995)
Lecumberri MLG, Barker J, Marxer R & Cooke M () Language Effects in Noise-Induced Word Misperceptions. Interspeech 2016 (pp 640-644)
Tóth AM, Cooke M & Barker J () Misperceptions Arising from Speech-in-Babble Interactions. Interspeech 2016 (pp 630-634)
Lin L, Barker J & Brown GJ () The effect of cochlear implant processing on speaker intelligibility: a perceptual study and computer model. Interspeech 2015 (pp 1566-1570)
Barker J & Coy A () Towards Solving the Cocktail Party Problem through Primitive Grouping and Model Combination. Proceedings of Forum Acusticum
Rajaravivarma V, Lord E & Barker J () Data compression techniques in image compression for multimedia systems. Southcon/96 Conference Record (pp 624-627)

Posters

Alghamdi N, Maddock S, Brown GJ & Barker J (2015) A comparison of audiovisual and auditory-only training on the perception of spectrally-distorted speech. 18th International Congress of Phonetic Sciences.

Theses

Barker J (1998) The relationship between auditory organisation and speech perception: Studies with spectrally reduced speech.

Other

Christensen H, Barker J, Lu Y-C, Xavier J, Caseiro R & Araújo H (2009) POPeye: Real-time, binaural sound source localisation on an audio-visual robot-head.
Christensen H & Barker J (2009) Simultaneous Tracking of Perceiver Movements and Speaker Changes Using Head-Centered, Binaural Data.

Preprints

Roa-Dabike G, Cox TJ, Barker JP, Akeroyd MA, Bannister S, Fazenda B, Firth J, Graetzer S, Greasley A, Vos RR & Whitmer WM (2025) Source Separation of Small Classical Ensembles: Challenges and Opportunities, arXiv.
Dabike GR, Akeroyd MA, Bannister S, Barker JP, Cox TJ, Fazenda B, Firth J, Graetzer S, Greasley A, Vos RR & Whitmer WM (2024) The first Cadenza challenges: using machine learning competitions to improve music for listeners with a hearing loss, arXiv.
Leglaive S, Fraticelli M, ElGhazaly H, Borne L, Sadeghi M, Wisdom S, Pariente M, Hershey JR, Pressnitzer D & Barker JP (2024) Objective and subjective evaluation of speech enhancement methods in the UDASE task of the 7th CHiME challenge, arXiv.
Mogridge R, Close G, Sutherland R, Hain T, Barker J, Goetze S & Ragni A (2024) Non-Intrusive Speech Intelligibility Prediction for Hearing-Impaired Users using Intermediate ASR Features and Human Memory Models, arXiv.
Cox TJ, Barker J, Bailey W, Graetzer S, Akeroyd MA, Culling JF & Naylor G (2023) Overview Of The 2023 Icassp Sp Clarity Challenge: Speech Enhancement For Hearing Aids, arXiv.
Dabike GR, Bannister S, Firth J, Graetzer S, Vos R, Akeroyd MA, Barker J, Cox TJ, Fazenda B, Greasley A & Whitmer W (2023) The First Cadenza Signal Processing Challenge: Improving Music for Those With a Hearing Loss, arXiv.
Dabike GR, Akeroyd MA, Bannister S, Barker J, Cox TJ, Fazenda B, Firth J, Graetzer S, Greasley A, Vos RR & Whitmer WM (2023) The ICASSP SP Cadenza Challenge: Music Demixing/Remixing for Hearing Aids, arXiv.
Tu Z, Deadman J, Ma N & Barker J (2022) Auditory-Based Data Augmentation for End-to-End Automatic Speech Recognition, arXiv.
Watanabe S, Mandel M, Barker J, Vincent E, Arora A, Chang X, Khudanpur S, Manohar V, Povey D, Raj D , Snyder D et al (2020) CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for Unsegmented Recordings, arXiv.
Gogate M, Adeel A, Marxer R, Barker J & Hussain A (2018) DNN driven Speaker Independent Audio-Visual Mask Estimation for Speech Separation, arXiv.
Barker J, Watanabe S, Vincent E & Trmal J (2018) The fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines, arXiv.

Grants

EnhanceMusic: Machine Learning Challenges to Revolutionise Music Listening for People with Hearing Loss, EPSRC, 06/2022 - 11/2026, £377,568, as PI
Challenges to Revolutionise Hearing Device Processing (RHDP), EPSRC, 10/2019 - 10/2025, £362,691, as PI
UKRI Centre for Doctoral Training in Speech and Language Technologies and their Applications, EPSRC, 04/2019 to 09/2027, £5,508,850, as Co-I
TAPAS: Training Network on Automatic Processing of PAthological Speech, EC H2020, 11/2017 to 06/2022, £468,000, as Co-I
Deep Probabilistic Models for Making Sense of Unstructured Data, EPSRC, 03/2016 - 09/2019, £974,161, as Co-I
Deep learning of articulatory-based representations of dysarthric speech, Industrial, 02/2016 to 01/2017, £46,624, as Co-I
Towards visually-driven speech enhancement for cognitively-inspired multi-modal hearing-aid devices (AV-COGHEAR), EPSRC, 10/2015 to 09/2018, £125,493, as PI
INSPIRE: Investigating Speech In Real Environments, EC FP7, 01/2012 to 12/2015, £308,473, as PI
ACAS: Analysis of Complex Acoustic Scenes, EPSRC, 07/2010 to 09/2010, £9,978, as PI
CHIME: Computational Hearing in Multisource Environments, EPSRC, 06/2009 to 05/2012, £326,245, as PI
Audio-Visual Speech Recognition in the Presence of Non-Stationary Noise, EPSRC, 02/2005 to 05/2007, £116,853, as PI

Professional activities and memberships

Member of the Speech and Hearing research group
Co-founder of the CHiME series of International Workshops and Robust Speech Recognition Evaluations, 2011 onwards.
EURASIP Best Paper Award, 2009; for best paper in Speech Communication during 2005.
ISCA Best Paper Award, 2008; for best paper in Speech Communication 2005-2007.

School of Computer Science

School of Computer Science

Professor Jon Barker

Journal articles

Book chapters

Conference proceedings

Posters

Theses

Other

Preprints

Links