Professor Guy Brown
BSc(Hons), PhD, MEd
School of Computer Science
Professor of Computer Science
Member of the Speech and Hearing (SpandH) research group
+44 114 222 1821
Full contact details
School of Computer Science
Regent Court (DCS)
211 Portobello
Sheffield
S1 4DP
Profile
Professor Brown obtained a BSc (Hons) in Applied Science from Sheffield City Polytechnic in 1984 and a PhD in Computer Science from the University of Sheffield in 1992. He was appointed to a lectureship in the Department of Computer Science, University of Sheffield, in 1992.
He also obtained an MEd in Teaching and Learning from the University of Sheffield in 1997. He has held visiting appointments at LIMSI-CNRS (France), Ohio State University (USA), Helsinki University of Technology (Finland) and ATR (Japan).
He was appointed to a Chair of Computer Science in 2013. Professor Brown was Head of the Department of Computer Science from 2015 to 2023.
Research interests
Professor Brown's main research interest is Computational Auditory Scene Analysis (CASA), which aims to build machine systems that mimic the ability of human listeners to segregate complex mixtures of sound.
He also has interests in noise-robust and reverberation-robust automatic speech recognition, models of auditory function in normal and impaired hearing, binaural modelling and the phonetics of overlapping speech. A recent interest is the application of CASA technology in mobile robot platforms.
He is the co-editor (with DeLiang Wang) of Computational Auditory Scene Analysis: Principles, Algorithms, and Applications (IEEE Press/Wiley-Interscience).
Publications
Featured publications
Journal articles
- Acoustic screening for obstructive sleep apnea in home environments based on deep neural networks. IEEE Journal of Biomedical and Health Informatics, 26(7), 2941-2950.
- Robust binaural localization of a target sound source by combining spectral source models and deep neural networks. IEEE/ACM Transactions on Audio, Speech and Language Processing, 26(11), 2122-2131.
- Exploiting Deep Neural Networks and Head Movements for Robust Binaural Localization of Multiple Sources in Reverberant Environments. IEEE Transactions on Audio, Speech, and Language Processing, 25(12), 2444-2453.
- Mask estimation and imputation methods for missing data speech recognition in a multisource reverberant environment. Computer Speech and Language.
- A computational model of binaural speech recognition: Role of across-frequency vs. within-frequency processing and internal noise. Speech Communication, 53(6), 924-940.
- A computer model of auditory efferent suppression: implications for the recognition of speech in noise. J Acoust Soc Am, 127(2), 943-954.
- Resources for turn competition in overlapping talk. Speech Communication, 55(5), 721-743.
Chapters
- Reflexive and Reflective Auditory Feedback, Modern Acoustics and Signal Processing (pp. 3-31). Springer International Publishing
Conference proceedings papers
- Obstructive sleep apnea screening with breathing sounds and respiratory effort: a multimodal deep learning approach. INTERSPEECH 2023, Vol. 2023-August (pp 5451-5455)
- Robust binaural sound localisation with temporal attention. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Proceedings. Rhodes Island, Greece, 4 June 2023 - 4 June 2023.
- Snorer diarisation based on deep neural network embeddings. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Barcelona, Spain, 4 May 2020 - 8 May 2020.
- Improving random GUI testing with image-based widget detection. ISSTA 2019 Proceedings of the 28th ACM SIGSOFT International Symposium on Software Testing and Analysis (pp 307-317). Beijing, China, 15 July 2019 - 15 July 2019.
- Resources for turn competition in overlap in multi-party conversations: speech rate, pausing and duration. INTERSPEECH (pp 2550-2553)
- Acoustic effects of facial feminisation surgery on speech and singing: A case study. Proceedings of Interspeech 2024. Kos Island, Greece, 1 September 2024 - 1 September 2024.
- SLUMBR: SLeep statUs estiMation from aBdominal Respiratory effort. Proceedings of the 46th Annual International Conference of the IEEE Engineering in Medicine & Biology Society. Orlando, Florida, 15 July 2024 - 15 July 2024.
- Perceptual compensation for effects of reverberation in speech identification: A computer model based on auditory efferent processing. Interspeech 2010. Japan, 26 September 2010 - 30 September 2010.
All publications
Books
- Frontmatter. IEEE.
- Computational Auditory Scene Analysis. IEEE.
Journal articles
- Acoustic screening for obstructive sleep apnea in home environments based on deep neural networks. IEEE Journal of Biomedical and Health Informatics, 26(7), 2941-2950.
- Talking in time: the development of a self-administered conversation analysis based training programme for cochlear implant users. Cochlear Implants International, 20(5), 255-265.
- End-to-end Binaural Sound Localisation from the Raw Waveform. CoRR, abs/1904.01916.
- Robust binaural localization of a target sound source by combining spectral source models and deep neural networks. IEEE/ACM Transactions on Audio, Speech and Language Processing, 26(11), 2122-2131.
- A corpus of audio-visual Lombard speech with frontal and profile views. Journal of the Acoustical Society of America, 143(6), 523-529.
- The impact of automatic exaggeration of the visual articulatory features of a talker on the intelligibility of spectrally distorted speech. Speech Communication, 95, 127-136.
- Exploiting Deep Neural Networks and Head Movements for Robust Binaural Localization of Multiple Sources in Reverberant Environments. IEEE Transactions on Audio, Speech, and Language Processing, 25(12), 2444-2453.
- Utilising temporal signal features in adverse noise conditions: Detection, estimation, and the reassigned spectrogram. The Journal of the Acoustical Society of America, 139(2), 904-917.
- Comparing human and automatic speech recognition in a perceptual restoration experiment. Computer Speech & Language, 35, 14-31.
- Feature enhancement of reverberant speech by distribution matching and non-negative matrix factorization. EURASIP Journal on Advances in Signal Processing, 76.
- Perceptual compensation for the effects of reverberation on consonant identification: Evidence from studies with monaural stimuli. The Journal of the Acoustical Society of America, 136(6), 3072-3084.
- The robustness of speech representations obtained from simulated auditory nerve fibers under different noise conditions. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 134(3), EL282-EL288.
- A computer model of the auditory periphery and its application to the study of hearing. Advances in Experimental Medicine and Biology, 787, 11-20.
- A frequency-selective feedback model of auditory efferent suppression and its implications for the recognition of speech in noise. Journal of the Acoustical Society of America, 132(3), 1535-1541.
- Mask estimation and imputation methods for missing data speech recognition in a multisource reverberant environment. Computer Speech and Language.
- Pitch Contour Matching and Interactional Alignment across Turns: An Acoustic Investigation. Language and Speech, 55, 57-76.
- The representation of speech in a nonlinear auditory model: Time-domain analysis of simulated auditory-nerve firing patterns. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2453-2456.
- A computational model of binaural speech recognition: Role of across-frequency vs. within-frequency processing and internal noise. Speech Communication, 53(6), 924-940.
- A computer model of auditory efferent suppression: implications for the recognition of speech in noise. J Acoust Soc Am, 127(2), 943-954.
- A speech-in-noise test based on spoken digits: Comparison of normal and impaired listeners using a computer model. Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010, 2470-2473.
- Computational auditory scene analysis: a representational approach.
- Speech in noise and the medial olivo‐cochlear efferent system. The Journal of the Acoustical Society of America, 123(5), 3051-3051.
- Effect of sound spatialisation on multitasking in remote meetings. The Journal of the Acoustical Society of America, 123(5), 3861-3861.
- Auditory‐motivated techniques for detection and classification of passive sonar signals. The Journal of the Acoustical Society of America, 123(5), 3344-3344.
- A reverberation‐robust automatic speech recognition system based on temporal masking. The Journal of the Acoustical Society of America, 123(5), 2978-2978.
- A computational model of binaural speech intelligibility level difference. The Journal of the Acoustical Society of America, 123(5), 3715-3715.
- Introduction to the special section on blind signal processing for speech and audio applications. IEEE T AUDIO SPEECH, 15(5), 1509-1510.
- Auditory-inspired interval statistic receivers for passive sonar signal detection. OCEANS 2007 - Europe.
- Information systems and creativity: an empirical study. J DOC, 63(4), 443-464.
- Mask estimation for missing data speech recognition based on statistics of binaural interaction. IEEE T AUDIO SPEECH, 14(1), 58-67.
- Using instrument recognition for melody extraction from polyphonic audio. The Journal of the Acoustical Society of America, 118(3), 2032-2032.
- Classification of transient sonar sounds using perceptually motivated features. IEEE J OCEANIC ENG, 30(3), 588-600.
- A computational model of the speech reception threshold for laterally separated speech and noise. 9th European Conference on Speech Communication and Technology, 1753-1756.
- Speech and crosstalk detection in multichannel audio. IEEE T SPEECH AUDI P, 13(1), 84-91.
- A binaural processor for missing data speech recognition in the presence of noise and small-room reverberation. SPEECH COMMUN, 43(4), 361-378.
- A computational model of auditory selective attention. IEEE Trans Neural Netw, 15(5), 1151-1163.
- Techniques for handling convolutional distortion with 'missing data' automatic speech recognition. SPEECH COMMUN, 43(1-2), 123-142.
- A classification-based cocktail-party processor. Advances in Neural Information Processing Systems.
- Instrument recognition in accompanied sonatas and concertos. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 4.
- Speech segregation based on sound localization. J ACOUST SOC AM, 114(4), 2236-2252.
- A multipitch tracking algorithm for noisy speech. IEEE T SPEECH AUDI P, 11(3), 229-241.
- A missing feature approach to instrument identification in polyphonic music. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 5, 553-556.
- Sound propagation in micro-scale urban areas: Simulation and animation. Acta Acustica (Stuttgart), 89(SUPP.).
- A comparison of auditory and blind separation techniques for speech segregation. IEEE Transactions on Speech and Audio Processing, 9(3), 189-195.
- Separation of speech from interfering sounds based on oscillatory correlation. IEEE T NEURAL NETWOR, 10(3), 684-697.
- Speech and hearing demonstrations in matlab. The Journal of the Acoustical Society of America, 105(2), 1213-1214.
- An oscillatory correlation framework for the separation of speech from interfering sounds. The Journal of the Acoustical Society of America, 105(2), 1307-1307.
- Interactive explorations in speech and hearing. Journal of the Acoustical Society of Japan (E), 20(2), 89-97.
- Modelling the perceptual segregation of double vowels with a network of neural oscillators. NEURAL NETWORKS, 10(9), 1547-1558.
- A computational model of auditory organization. I: Context sensitive integration of multiple grouping principles. British Journal of Audiology, 31(2), 116-117.
- A computational model of auditory organization. II: Grouping by emergent properties. British Journal of Audiology, 31(2), 117.
- Visualization of rhythm, time and metre. ARTIF INTELL REV, 10(3-4), 253-273.
- Combining multiple grouping cues in a blackboard model of auditory organisation. British Journal of Audiology, 30(2), 111.
- A neural oscillator model of auditory streaming. British Journal of Audiology, 30(2), 111-112.
- A computational model of speech segmentation. The Journal of the Acoustical Society of America, 96(5_Supplement), 3293-3293.
- COMPUTATIONAL AUDITORY SCENE ANALYSIS. COMPUT SPEECH LANG, 8(4), 297-336.
- PERCEPTUAL GROUPING OF MUSICAL SOUNDS - A COMPUTATIONAL MODEL. J NEW MUSIC RES, 23(2), 107-132.
- A computational model of prosodic perception. The Journal of the Acoustical Society of America, 95(5_Supplement), 2950-2950.
- Computational auditory scene analysis: listening to several things at once. Endeavour, 17(4), 186-190.
- Interactive computational auditory scene analysis: An environment for exploring auditory representations and groups. The Journal of the Acoustical Society of America, 93(4_Supplement), 2308-2308.
- Using the BBC microcomputer to teach the electrocardiogram to biology students. Journal of Biological Education, 24(1), 13-17.
- COMPUTER-SIMULATIONS IN TEACHING NEUROMUSCULAR PHARMACOLOGY - TIME FOR A CHANGE FROM TRADITIONAL METHODS. ATLA-ALTERN LAB ANIM, 16(2), 163-174.
- A COMPUTER-ASSISTED-LEARNING PROGRAM FOR TEACHING THE ELECTROCARDIOGRAM TO SCIENCE UNDERGRADUATES. BRIT J PHARMACOL, 94, P470-P470.
- AUTOMATED-ANALYSIS OF DRUG-INDUCED CONVULSIONS USING BBC MICROCOMPUTER. BRIT J PHARMACOL, 94, P471-P471.
- A COMPUTER-ASSISTED-LEARNING PROGRAM FOR TEACHING NEUROMUSCULAR PHARMACOLOGY TO UNDERGRADUATES. BRIT J PHARMACOL, 94, P472-P472.
- MICROCOMPUTER SIMULATIONS OF LABORATORY EXPERIMENTS IN PHYSIOLOGY. ATLA-ALTERN LAB ANIM, 15(4), 280-289.
- COMPUTER-SIMULATIONS - AN ALTERNATIVE TO THE USE OF ANIMALS IN TEACHING. J BIOL EDUC, 22(1), 19-22.
- Resources for turn competition in overlapping talk. Speech Communication, 55(5), 721-743.
- A frequency-selective feedback model of auditory efferent suppression and its implications for the recognition of speech in noise. Journal of the Acoustical Society of America.
Chapters
- Reflexive and Reflective Auditory Feedback, Modern Acoustics and Signal Processing (pp. 3-31). Springer International Publishing
- Neural and Perceptual Modeling, Computational Auditory Scene Analysis IEEE
- Reverberation, Computational Auditory Scene Analysis IEEE
- Binaural Sound Localization, Computational Auditory Scene Analysis IEEE
- Fundamentals of Computational Auditory Scene Analysis, Computational Auditory Scene Analysis IEEE
- Physiological Models of Auditory Scene Analysis, Computational Models of the Auditory System (pp. 203-236). Springer US
- Fundamental frequency height as a resource for the management of overlap in talk-in-interaction. In Barth-Weingarten D, Dehé N & Wichmann A (Ed.), Where Prosody Meets Pragmatics (pp. 183-204). Emerald Group Publishing Limited
- Auditory Scene Analysis: Computational Models, International Encyclopedia of the Social & Behavioral Sciences (pp. 943-946). Elsevier
- Teaching Professional Ethics to Software Engineers, Projects in the Computing Curriculum (pp. 3-18). Springer London
- Visualization of Rhythm, Time and Metre, Integration of Natural Language and Vision Processing (pp. 253-273). Springer Netherlands
- Listening to Speech Psychology Press
- Separation of Speech by Computational Auditory Scene Analysis, Signals and Communication Technology (pp. 371-402). Springer-Verlag
Conference proceedings papers
- Obstructive sleep apnea screening with breathing sounds and respiratory effort: a multimodal deep learning approach. INTERSPEECH 2023, Vol. 2023-August (pp 5451-5455)
- Robust binaural sound localisation with temporal attention. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Proceedings. Rhodes Island, Greece, 4 June 2023 - 4 June 2023.
- AMI – Creating Coherent Musical Composition with Attention. ICMC 2021 - Proceedings of the International Computer Music Conference 2021 (pp 414-418)
- AMI – Creating musical compositions with a coherent long-term structure. AISB Convention 2021: Communication and Conversations
- 0573 Screening for obstructive sleep apnea at home based on deep learning features derived from respiration sounds. Sleep, Vol. 43(Supplement_1) (pp a219-a220). Philadelphia, PA, USA (online conference), 27 August 2020 - 30 August 2020.
- Snorer diarisation based on deep neural network embeddings. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Barcelona, Spain, 4 May 2020 - 8 May 2020.
- SCREENING FOR OBSTRUCTIVE SLEEP APNEA AT HOME BASED ON DEEP LEARNING FEATURES DERIVED FROM RESPIRATION SOUNDS. SLEEP, Vol. 43 (pp A219-A220)
- Improving random GUI testing with image-based widget detection. ISSTA 2019 Proceedings of the 28th ACM SIGSOFT International Symposium on Software Testing and Analysis (pp 307-317). Beijing, China, 15 July 2019 - 15 July 2019.
- End-to-end binaural sound localisation from the raw waveform. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP-2019). Brighton, UK, 12 May 2019 - 17 May 2019.
- Deep learning features for robust detection of acoustic events in sleep-disordered breathing. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP-2019). Brighton, UK, 12 May 2019 - 17 May 2019.
- Deep Learning Features for Robust Detection of Acoustic Events in Sleep-disordered Breathing. ICASSP (pp 810-814)
- End-to-end Binaural Sound Localisation from the Raw Waveform. ICASSP (pp 451-455)
- Modelling Hand Gestures to Test Leap Motion Controlled Applications. 2018 IEEE International Conference on Software Testing, Verification and Validation Workshops (ICSTW) (pp 204-213), 13 April 2018 - 13 April 2018.
- A robust dual-microphone speech source localization algorithm for reverberant environments. Proceedings of INTERSPEECH 2016
- Robust audiovisual speech recognition using noise-adaptive linear discriminant analysis. 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 20 March 2016 - 25 March 2016.
- Exploiting synchrony spectra and deep neural networks for noise-robust automatic speech recognition. 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 13 December 2015 - 17 December 2015.
- Investigating the Impact of Artificial Enhancement of Lip Visibility on the Intelligibility of Spectrally-Distorted Speech. FAAVSP-2015 (pp 93-98), 11 September 2015 - 13 September 2015.
- Exploiting deep neural networks and head movements for binaural localisation of multiple speakers in reverberant conditions. Proceedings of Interspeech 2015 (pp 160-164). Dresden, Germany, 6 September 2015 - 10 September 2015.
- Exploiting top-down source models to improve binaural localisation of multiple sources in reverberant environments. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 2015-January (pp 160-164)
- Robust localisation of multiple speakers exploiting head movements and multi-conditional training of binaural cues. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Brisbane, 19 April 2015 - 24 April 2015.
- A machine-hearing system exploiting head movements for binaural sound localisation in reverberant conditions. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 19 April 2015 - 24 April 2015.
- ROBUST LOCALISATION OF MULTIPLE SPEAKERS EXPLOITING HEAD MOVEMENTS AND MULTI-CONDITIONAL TRAINING OF BINAURAL CUES. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP) (pp 2679-2683)
- A MACHINE-HEARING SYSTEM EXPLOITING HEAD MOVEMENTS FOR BINAURAL SOUND LOCALISATION IN REVERBERANT CONDITIONS. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP) (pp 2699-2703)
- The effect of cochlear implant processing on speaker intelligibility: A perceptual study and computer model. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 2015-January (pp 1566-1570)
- Binaural sound source localisation using a Bayesian-network-based blackboard system and hypothesis-driven feedback. Proceedings of Forum Acusticum, Vol. 2014-January
- Consonant confusions provide further evidence that time-reversed rooms disturb compensation for reverberation. Proceedings of Forum Acusticum, Vol. 2014-January
- Recognition of reverberant speech by missing data imputation and NMF feature enhancement. IEEE SPS AASP REVERB Challenge Workshop. Florence, 10 May 2014 - 10 May 2014.
- Automatic testing of natural user interfaces. Seventh IEEE International Conference on Software Testing, Verification and Validation, 31 March 2014 - 4 April 2014.
- Perceptual compensation for the effects of reverberation on consonant identification: A comparison of human and machine performance. 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, Vol. 2 (pp 1714-1717)
- A Corpus of Spontaneous Multi-party Conversation in Bosnian Serbo-Croatian and British English. Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC 2012), Istanbul, Turkey
- The representation of speech in a nonlinear auditory model: time-domain analysis of simulated auditory-nerve firing patterns. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 (pp 2464-2467)
- Resources for turn competition in overlap in multi-party conversations: Speech rate, pausing and duration. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4 (pp 2554-+)
- A speech-in-noise test based on spoken digits: Comparison of normal and impaired listeners using a computer model. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4 (pp 2474-+)
- Resources for turn competition in overlap in multi-party conversations: speech rate, pausing and duration. INTERSPEECH (pp 2550-2553)
- Audio spatialisation strategies for multitasking during teleconferences. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 2935-2938)
- Fundamental Frequency Height as a Resource for the Management of Overlap in Talk-in-Interaction (pp 183-203)
- Audio spatialisation strategies for multitasking during teleconferences. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 (pp 2903-2906)
- The influence of audio presentation style on multitasking during teleconferences. INTERSPEECH 2008 - 9th Annual Conference of the International Speech Communication Association (pp 801-804)
- The Influence of Audio Presentation Style on Multitasking During Teleconferences. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5 (pp 801-804)
- Binaural speech separation using recurrent timing neural networks for joint F0-localisation estimation. MACHINE LEARNING FOR MULTIMODAL INTERACTION, Vol. 4892 (pp 271-282)
- Recurrent timing neural networks for joint F0-localisation based speech separation. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Vol. 1
- Recurrent timing neural networks for joint F0-localisation based speech separation. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PTS 1-3, PROCEEDINGS (pp 157-160)
- Auditory-inspired interval statistic receivers for passive sonar signal detection. OCEANS 2007 - EUROPE, VOLS 1-3 (pp 37-42)
- Speech separation based on the statistics of binaural auditory features. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Vol. 5
- Recognition of reverberant speech using full cepstral features and spectral missing data. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Vol. 1
- Recognition of reverberant speech using full cepstral features and spectral missing data. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13 (pp 289-292)
- Speech separation based on the statistics of binaural auditory features. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13 (pp 5807-5810)
- Recognition of reverberant speech using full cepstral features and spectral missing data. 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol I, Proceedings (pp 289-292). Toulouse, FRANCE, 14 May 2006 - 19 May 2006.
- Speech separation based on the statistics of binaural auditory features. 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol V, Proceedings (pp 949-952)
- Binaural feature selection for missing data speech recognition. 9th European Conference on Speech Communication and Technology (pp 1269-1272)
- Physiologically motivated audio-visual localisation and tracking. 9th European Conference on Speech Communication and Technology (pp 773-776)
- Mask estimation based on sound localisation for missing data speech recognition. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5 (pp 537-540)
- Techniques for robust speech recognition in noisy and reverberant conditions. SPEECH SEPARATION BY HUMANS AND MACHINES (pp 213-220)
- Instrument recognition in accompanied sonatas and concertos. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PROCEEDINGS (pp 217-220)
- EXTRACTING MELODY LINES FROM COMPLEX AUDIO. ISMIR 2004 - 5th International Symposium on Music Information Retrieval
- Feature selection for the classification of crosstalk in multi-channel audio. EUROSPEECH 2003 - 8th European Conference on Speech Communication and Technology (pp 469-472)
- A missing feature approach to instrument identification in polyphonic music. 2003 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS PROCEEDINGS (pp 49-49)
- A missing feature approach to instrument identification in polyphonic music. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS (pp 553-556)
- New Strategies for Computer-Assisted Composition Software: A Perspective. International Computer Music Conference, ICMC Proceedings
- Location-based sound segregation. IEEE International Conference on Acoustics Speech and Signal Processing, 13 May 2002 - 17 May 2002.
- Location-based sound segregation. Proceedings of the International Joint Conference on Neural Networks, Vol. 3 (pp 2299-2303)
- A multi-pitch tracking algorithm for noisy speech. IEEE International Conference on Acoustics Speech and Signal Processing, 13 May 2002 - 17 May 2002.
- Missing data speech recognition in reverberant conditions. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS (pp 65-68)
- A neural oscillator model of auditory selective attention. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 14, VOLS 1 AND 2, Vol. 14 (pp 1205-1212)
- A Qualitative Analysis of Composers at Work. International Computer Music Conference, ICMC Proceedings (pp 572-580)
- Neural network ensembles and their application to traffic flow prediction in telecommunications networks. Proceedings of the International Joint Conference on Neural Networks, Vol. 1 (pp 693-698)
- A neural oscillator sound separator for missing data speech recognition. Proceedings of the International Joint Conference on Neural Networks, Vol. 4 (pp 2907-2912)
- A neural oscillator sound separator for missing data speech recognition. IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS (pp 2907-2912)
- A neural oscillator model of auditory attention. ARTIFICIAL NEURAL NETWORKS-ICANN 2001, PROCEEDINGS, Vol. 2130 (pp 1163-1170)
- Identification of concurrent vowels using spectral matching with 'missing data'. BRITISH JOURNAL OF AUDIOLOGY, Vol. 34(2) (pp 108-109)
- Synfire chains as a neural mechanism for auditory grouping. BRITISH JOURNAL OF AUDIOLOGY, Vol. 34(2) (pp 116-117)
- An oscillatory correlation framework for computational auditory scene analysis. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12, Vol. 12 (pp 747-753)
- A blackboard architecture for computational auditory scene analysis. SPEECH COMMUNICATION, Vol. 27(3-4) (pp 351-366)
- The interactive auditory demonstrations project. EUROSPEECH
- A computational model of auditory organization .1. Context sensitive integration of multiple grouping principles. BRITISH JOURNAL OF AUDIOLOGY, Vol. 31(2) (pp 116-117)
- A computational model of auditory organization .2. Grouping by emergent properties. BRITISH JOURNAL OF AUDIOLOGY, Vol. 31(2) (pp 117-117)
- Modelling the perceptual separation of concurrent vowels with a network of neural oscillators. 1997 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS 1-4 (pp 569-574)
- A COMPUTATIONAL MODEL OF PROSODY PERCEPTION. 3rd International Conference on Spoken Language Processing, ICSLP 1994 (pp 127-130)
- COMPUTATIONAL AUDITORY SCENE ANALYSIS - EXPLOITING PRINCIPLES OF PERCEIVED CONTINUITY. SPEECH COMMUNICATION, Vol. 13(3-4) (pp 391-399)
- A Computational Model of Auditory Scene Analysis. 2nd International Conference on Spoken Language Processing, ICSLP 1992 (pp 523-526)
- Acoustic effects of facial feminisation surgery on speech and singing: A case study. Proceedings of Interspeech 2024. Kos Island, Greece, 1 September 2024 - 1 September 2024.
- SLUMBR: SLeep statUs estiMation from aBdominal Respiratory effort. Proceedings of the 46th Annual International Conference of the IEEE Engineering in Medicine & Biology Society. Orlando, Florida, 15 July 2024 - 15 July 2024.
- Speech Localisation in a Multitalker Mixture by Humans and Machines. Interspeech 2016
- Perceptual compensation for the effects of reverberation on consonant identification: a comparison of human and machine performance. Interspeech 2012
- Location-based sound segregation. Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN'02 (Cat. No.02CH37290)
- Pitch tracking based on statistical anticipation. IJCNN'01. International Joint Conference on Neural Networks. Proceedings (Cat. No.01CH37222)
- Speech segregation based on sound localization. IJCNN'01. International Joint Conference on Neural Networks. Proceedings (Cat. No.01CH37222)
- The separation of speech from interfering sounds: an oscillatory correlation approach. IJCNN'99. International Joint Conference on Neural Networks. Proceedings (Cat. No.99CH36339)
- A neural oscillator model of primitive auditory grouping. Proceedings of 1995 Workshop on Applications of Signal Processing to Audio and Acoustics
- Perceptual compensation for effects of reverberation in speech identification: A computer model based on auditory efferent processing. Interspeech 2010. Japan, 26 September 2010 - 30 September 2010.
Posters
- A comparison of audiovisual and auditory-only training on the perception of spectrally-distorted speech. 18th International Congress of Phonetic Sciences.
Grants
Research grants
- Teaching computer science and music through live coding, Research England, 07/2023 - 02/2024, £50,642, as Co-PI
- SOMNUS: Sleep disOrder MoNitoring by Unobtrusive Sensors, Innovate UK, 07/2021 - 11/2023, £120,228, as PI
- Monitoring sleep disordered breathing of long-Covid patients at home using acoustic AI Technology, Research England, 01/2022 - 07/2022, £71,222, as Co-PI
- Making Elektra, Research England, 02/2021 - 04/2021, £6,236, as PI
- Brahms: Breathing Resistance Assessment via Home Monitoring of Sleep, Innovate UK, 06/2019 - 02/2021, £109,600, as PI
- MAI: Musical Artificial Intelligence, HEFCE, 02/2019 - 05/2020, £53,408, as PI
- Institute of Coding, HEFCE, 11/2017 - 03/2020, £957,000, as Co-PI
- Studentship, Passion 4 Life, 10/2017 - 09/2020, as PI
- Passion for Life, Innovate UK, 04/2015 - 06/2017, £149,280, as PI
- Meeting the challenge of simultaneous talk for cochlear implant users, AHRC, 03/2014 - 03/2015, £69,339, as Co-PI
- Two!Ears, EC - FP7, 12/2013 - 11/2016, £267,134, as PI
- Automatic Testing of Natural User Interfaces, Microsoft Research Ltd., 06/2013 - 12/2017, £15,625, as Co-PI
- A computational model of speech recognition in hearing impaired listeners based on missing feature theory, RNID, 10/2007 - 09/2010, £68,951, as PI
- Phonetic design of overlapping speech in talk-in-interaction: A cross-linguistic study, AHRC, 01/2009 - 06/2012, £169,652, as Co-PI
- Perceptual constancy in real-room listening by humans and machines, EPSRC, 10/2008 - 04/2012, £121,515, as PI
- S2S: Sound to Sense, EC - FP6, 05/2007 - 04/2011, £168,923, as PI
- Studentship, QinetiQ, 10/2004 - 09/2007, £38,427, as PI
- Studentship, Defence Science and Technology Laboratory, 10/2000 - 09/2003, £4,200, as PI
Professional activities and memberships
- Member of the Speech and Hearing research group
- Recipient of a University of Sheffield Senate Award for Sustained Excellence in Learning and Teaching, 2014.
- Recipient (with Dr Gordon Fraser) of a Microsoft Software Engineering Innovation Foundation Award in 2013.
- Guest editor of the IEEE Transactions on Audio, Speech and Language Processing special issue on blind signal processing for speech and audio applications, 2007.