Professor Guy Brown
BSc(Hons), PhD, MEd
School of Computer Science
Professor of Computer Science
Member of the Speech and Hearing (SpandH) research group


+44 114 222 1821
Full contact details
School of Computer Science
Regent Court (DCS)
211 Portobello
Sheffield
S1 4DP
- Profile
-
Professor Brown obtained a BSc (Hons) Applied Science from Sheffield City Polytechnic in 1984 and a PhD in Computer Science from the University of Sheffield in 1992. He was appointed to a lectureship in the Department of Computer Science, University of Sheffield in 1992.
He also obtained the MEd in Teaching and Learning from the University of Sheffield in 1997. He has held visiting appointments at LIMSI-CNRS (France), Ohio State University (USA), Helsinki University of Technology (Finland) and ATR (Japan).
He was appointed to a Chair of Computer Science in 2013. Professor Brown was Head of the Department of Computer Science from 2015 to 2023.
- Research interests
-
Professor Brown's main research interest is Computational Auditory Scene Analysis (CASA), which aims to build machine systems that mimic the ability of human listeners to segregate complex mixtures of sound.
He also has interests in noise-robust and reverberation-robust automatic speech recognition, models of auditory function in normal and impaired hearing, binaural modelling and the phonetics of overlapping speech. A recent interest is the application of CASA technology in mobile robot platforms.
He is the co-editor (with DeLiang Wang) of Computational auditory scene analysis: Principles, Algorithms, and Applications (IEEE Press/Wiley-Interscience).
- Publications
-
Show: Featured publications All publications
Featured publications
Journal articles
- Acoustic screening for obstructive sleep apnea in home environments based on deep neural networks. IEEE Journal of Biomedical and Health Informatics, 26(7), 2941-2950.
- Robust binaural localization of a target sound source by combining spectral source models and deep neural networks. IEEE/ACM Transactions on Audio, Speech and Language Processing, 26(11), 2122-2131. View this article in WRRO
- Exploiting Deep Neural Networks and Head Movements for Robust Binaural Localization of Multiple Sources in Reverberant Environments. IEEE Transactions on Audio, Speech, and Language Processing, 25(12), 2444-2453. View this article in WRRO
- Mask estimation and imputation methods for missing data speech recognition in a multisource reverberant environment. Computer Speech and Language.
- A computational model of binaural speech recognition: Role of across-frequency vs. within-frequency processing and internal noise. Speech Communication, 53(6), 924-940. View this article in WRRO
- A computer model of auditory efferent suppression: implications for the recognition of speech in noise.. J Acoust Soc Am, 127(2), 943-954.
- Resources for turn competition in overlapping talk. Speech Communication, 55(5), 721-743.
Chapters
- Reflexive and Reflective Auditory Feedback, Modern Acoustics and Signal Processing (pp. 3-31). Springer International Publishing
Conference proceedings papers
- Obstructive sleep apnea screening with breathing sounds and respiratory effort: a multimodal deep learning approach. INTERSPEECH 2023, Vol. 2023-August (pp 5451-5455)
- Robust binaural sound localisation with temporal attention. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Proceedings. Rhodes Island, Greece, 4 June 2023 - 4 June 2023. View this article in WRRO
- Snorer diarisation based on deep neural network embeddings. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Barcelona, Spain, 4 May 2020 - 8 May 2020.
- Improving random GUI testing with image-based widget detection. ISSTA 2019 Proceedings of the 28th ACM SIGSOFT International Symposium on Software Testing and Analysis (pp 307-317). Beijing, China, 15 July 2019 - 15 July 2019. View this article in WRRO
- View this article in WRRO
- View this article in WRRO
All publications
Books
- Frontmatter. IEEE.
- Computational Auditory Scene Analysis. IEEE.
Journal articles
- Acoustic screening for obstructive sleep apnea in home environments based on deep neural networks. IEEE Journal of Biomedical and Health Informatics, 26(7), 2941-2950.
- Talking in time : the development of a self-administered conversation analysis based training programme for cochlear implant users. Cochlear Implants International, 20(5), 255-265. View this article in WRRO
- Robust binaural localization of a target sound source by combining spectral source models and deep neural networks. IEEE/ACM Transactions on Audio, Speech and Language Processing, 26(11), 2122-2131. View this article in WRRO
- A corpus of audio-visual Lombard speech with frontal and profile views. Journal of the Acoustical Society of America, 143(6), 523-529. View this article in WRRO
- The impact of automatic exaggeration of the visual articulatory features of a talker on the intelligibility of spectrally distorted speech. Speech Communication, 95, 127-136. View this article in WRRO
- Exploiting Deep Neural Networks and Head Movements for Robust Binaural Localization of Multiple Sources in Reverberant Environments. IEEE Transactions on Audio, Speech, and Language Processing, 25(12), 2444-2453. View this article in WRRO
- Utilising temporal signal features in adverse noise conditions: Detection, estimation, and the reassigned spectrogram.. The Journal of the Acoustical Society of America, 139(2), 904-917. View this article in WRRO
- Comparing human and automatic speech recognition in a perceptual restoration experiment. Computer Speech & Language, 35, 14-31. View this article in WRRO
- Feature enhancement of reverberant speech by distribution matching and non-negative matrix factorization. EURASIP Journal on Advances in Signal Processing, 76. View this article in WRRO
- Perceptual compensation for the effects of reverberation on consonant identification: Evidence from studies with monaural stimuli. The Journal of the Acoustical Society of America, 136(6), 3072-3084. View this article in WRRO
- The robustness of speech representations obtained from simulated auditory nerve fibers under different noise conditions. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 134(3), EL282-EL288.
- A computer model of the auditory periphery and its application to the study of hearing. Advances in Experimental Medicine and Biology, 787, 11-20.
- Mask estimation and imputation methods for missing data speech recognition in a multisource reverberant environment. Computer Speech and Language.
- Pitch Contour Matching and Interactional Alignment across Turns: An Acoustic Investigation. Language and Speech, 55, 57-76-57-76.
- A computational model of binaural speech recognition: Role of across-frequency vs. within-frequency processing and internal noise. Speech Communication, 53(6), 924-940. View this article in WRRO
- A computer model of auditory efferent suppression: implications for the recognition of speech in noise.. J Acoust Soc Am, 127(2), 943-954.
- Speech in noise and the medial olivo‐cochlear efferent system. The Journal of the Acoustical Society of America, 123(5), 3051-3051.
- Effect of sound spatialisation on multitasking in remote meetings. The Journal of the Acoustical Society of America, 123(5), 3861-3861.
- Auditory‐motivated techniques for detection and classification of passive sonar signals. The Journal of the Acoustical Society of America, 123(5), 3344-3344.
- A reverberation‐robust automatic speech recognition system based on temporal masking. The Journal of the Acoustical Society of America, 123(5), 2978-2978.
- A computational model of binaural speech intelligibility level difference. The Journal of the Acoustical Society of America, 123(5), 3715-3715.
- Introduction to the special section on blind signal processing for speech and audio applications. IEEE T AUDIO SPEECH, 15(5), 1509-1510.
- Auditory-inspired interval statistic receivers for passive sonar signal detection. OCEANS 2007 - Europe.
- Information systems and creativity: an empirical study. J DOC, 63(4), 443-464.
- Mask estimation for missing data speech recognition based on statistics of binaural interaction. IEEE T AUDIO SPEECH, 14(1), 58-67.
- Using instrument recognition for melody extraction from polyphonic audio. The Journal of the Acoustical Society of America, 118(3), 2032-2032.
- Classification of transient sonar sounds using perceptually motivated features. IEEE J OCEANIC ENG, 30(3), 588-600.
- Speech and crosstalk detection in multichannel audio. IEEE T SPEECH AUDI P, 13(1), 84-91. View this article in WRRO
- A binaural processor for missing data speech recognition in the presence of noise and small-room reverberation. SPEECH COMMUN, 43(4), 361-378.
- A computational model of auditory selective attention.. IEEE Trans Neural Netw, 15(5), 1151-1163.
- Techniques for handling convolutional distortion with 'missing data' automatic speech recognition. SPEECH COMMUN, 43(1-2), 123-142.
- Speech segregation based on sound localization. J ACOUST SOC AM, 114(4), 2236-2252.
- A multipitch tracking algorithm for noisy speech. IEEE T SPEECH AUDI P, 11(3), 229-241.
- A comparison of auditory and blind separation techniques for speech segregation. IEEE Transactions on Speech and Audio Processing, 9(3), 189-195.
- Speech and hearing demonstrations in matlab. The Journal of the Acoustical Society of America, 105(2), 1213-1214.
- An oscillatory correlation framework for the separation of speech from interfering sounds. The Journal of the Acoustical Society of America, 105(2), 1307-1307.
- Interactive explorations in speech and hearing.. Journal of the Acoustical Society of Japan (E), 20(2), 89-97.
- A computational model of speech segmentation. The Journal of the Acoustical Society of America, 96(5_Supplement), 3293-3293.
- A computational model of prosodic perception. The Journal of the Acoustical Society of America, 95(5_Supplement), 2950-2950.
- Computational auditory scene analysis: listening to several things at once.. Endeavour, 17(4), 186-190.
- Interactive computational auditory scene analysis: An environment for exploring auditory representations and groups. The Journal of the Acoustical Society of America, 93(4_Supplement), 2308-2308.
- Using the BBC microcomputer to teach the electrocardiogram to biology students. Journal of Biological Education, 24(1), 13-17.
- Resources for turn competition in overlapping talk. Speech Communication, 55(5), 721-743.
Chapters
- Reflexive and Reflective Auditory Feedback, Modern Acoustics and Signal Processing (pp. 3-31). Springer International Publishing
- Neural and Perceptual Modeling, Computational Auditory Scene Analysis IEEE
- Reverberation, Computational Auditory Scene Analysis IEEE
- Binaural Sound Localization, Computational Auditory Scene Analysis IEEE
- Fundamentals of Computational Auditory Scene Analysis, Computational Auditory Scene Analysis IEEE
- Physiological Models of Auditory Scene Analysis, Computational Models of the Auditory System (pp. 203-236). Springer US
- Auditory Scene Analysis: Computational Models, International Encyclopedia of the Social & Behavioral Sciences (pp. 943-946). Elsevier
- Teaching Professional Ethics to Software Engineers, Projects in the Computing Curriculum (pp. 3-18). Springer London
- Visualization of Rhythm, Time and Metre, Integration of Natural Language and Vision Processing (pp. 253-273). Springer Netherlands
- Listening to Speech Psychology Press
- Separation of Speech by Computational Auditory Scene Analysis, Signals and Communication Technology (pp. 371-402). Springer-Verlag
Conference proceedings papers
- Obstructive sleep apnea screening with breathing sounds and respiratory effort: a multimodal deep learning approach. INTERSPEECH 2023, Vol. 2023-August (pp 5451-5455)
- Robust binaural sound localisation with temporal attention. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Proceedings. Rhodes Island, Greece, 4 June 2023 - 4 June 2023. View this article in WRRO
- 0573 Screening for obstructive sleep apnea at home based on deep learning features derived from respiration sounds. Sleep, Vol. 43(Supplement_1) (pp a219-a220). Philadelphia, PA, USA (online conference), 27 August 2020 - 30 August 2020. View this article in WRRO
- Snorer diarisation based on deep neural network embeddings. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Barcelona, Spain, 4 May 2020 - 8 May 2020.
- Improving random GUI testing with image-based widget detection. ISSTA 2019 Proceedings of the 28th ACM SIGSOFT International Symposium on Software Testing and Analysis (pp 307-317). Beijing, China, 15 July 2019 - 15 July 2019. View this article in WRRO
- End-to-end binaural sound localisation from the raw waveform. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP-2019). Brighton, UK, 12 May 2019 - 17 May 2019. View this article in WRRO
- Deep learning features for robust detection of acoustic events in sleep-disordered breathing. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP-2019). Brighton, UK, 12 May 2019 - 17 May 2019. View this article in WRRO
- Modelling Hand Gestures to Test Leap Motion Controlled Applications. 2018 IEEE International Conference on Software Testing, Verification and Validation Workshops (ICSTW) (pp 204-213), 13 April 2018 - 13 April 2018. View this article in WRRO
- A robust dual-microphone speech source localization algorithm for reverberant environments. Proceedings of INTERSPEECH 2016 View this article in WRRO
- Robust audiovisual speech recognition using noise-adaptive linear discriminant analysis. 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 20 March 2016 - 25 March 2016. View this article in WRRO
- Exploiting synchrony spectra and deep neural networks for noise-robust automatic speech recognition. 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 13 December 2015 - 17 December 2015. View this article in WRRO
- View this article in WRRO
- View this article in WRRO
- Robust localisation of multiple speakers exploiting head movements and multi-conditional training of binaural cues. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Brisbane, 19 April 2015 - 24 April 2015. View this article in WRRO
- A machine-hearing system exploiting head movements for binaural sound localisation in reverberant conditions. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 19 April 2015 - 24 April 2015. View this article in WRRO
- View this article in WRRO
- Fundamental Frequency Height as a Resource for the Management of Overlap in Talk-in-Interaction (pp 183-203) View this article in WRRO
- Recurrent timing neural networks for joint F0-localisation based speech separation. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Vol. 1
- Location-based sound segregation. IEEE International Conference on Acoustics Speech and Signal Processing, 13 May 2002 - 17 May 2002.
- A multi-pitch tracking algorithm for noisy speech. IEEE International Conference on Acoustics Speech and Signal Processing, 13 May 2002 - 17 May 2002.
- View this article in WRRO
- View this article in WRRO
- Speech Localisation in a Multitalker Mixture by Humans and Machines. Interspeech 2016 View this article in WRRO
- Perceptual compensation for the effects of reverberation on consonant identification: a comparison of human and machine performance. Interspeech 2012
- Location-based sound segregation. Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN'02 (Cat. No.02CH37290)
- Pitch tracking based on statistical anticipation. IJCNN'01. International Joint Conference on Neural Networks. Proceedings (Cat. No.01CH37222)
- Speech segregation based on sound localization. IJCNN'01. International Joint Conference on Neural Networks. Proceedings (Cat. No.01CH37222)
- The separation of speech from interfering sounds: an oscillatory correlation approach. IJCNN'99. International Joint Conference on Neural Networks. Proceedings (Cat. No.99CH36339)
- A neural oscillator model of primitive auditory grouping. Proceedings of 1995 Workshop on Applications of Signal Processing to Audio and Accoustics
Exhibitions
Posters
Preprints
- Acoustic screening for obstructive sleep apnea in home environments based on deep neural networks. IEEE Journal of Biomedical and Health Informatics, 26(7), 2941-2950.
- Grants
-
Research grants
- Teaching computer science and music through live coding, Research England, 07/2023 - 02/2024, £50,642, as Co-PI
- SOMNUS: Sleep disOrder MoNitoring by Unobtrusive Sensors, Innovate UK, 07/2021 - 11/2023, £120,228, as PI
- Monitoring sleep disordered breathing of long-Covid patients at home using acoustic AI Technology, Research England, 01/2022 - 07/2022, £71,222, as Co-PI
- Making Elektra, Research England, 02/2021 - 04/2021, £6,236, as PI
- Brahms: Breathing Resistance Assessment via Home Monitoring of Sleep, Innovate UK, 06/2019 - 02/2021, £109,600, as PI
- MAI: Musical Artificial Intelligence, HEFCE, 02/2019 - 05/2020, £53,408, as PI
- Insitute of Coding, HEFCE, 11/2017 - 03/2020, £957,000, as Co-PI
- Studentship, Passion 4 Life, 10/2017 - 09/2020, as PI
- Passion for Life, InnovateUK, 04/2015 - 06/2017, £149,280, as PI
- Meeting the challenge of simultaneous talk for cochlear implant users, AHRC, 03/2014 - 03/2015, £69,339, as Co-PI
- Two!Ears, EC - FP7, 12/2013 - 11/2016, £267,134, as PI
- Automatic Testing of Natural User Interfaces, Microsoft Research Ltd., 06/2013 - 12/2017, £15,625, as Co-PI
- A computational model of speech recognition in hearing impaired listeners based on missing feature theory, RNID, 10/2007 - 09/2010, £68,951, as PI
- Phonetic design of overlapping speech in talk-in-interaction: A cross-linguistic study, AHRC, 01/2009 - 06/2012, £169,652, as Co-PI
- Perceptual constancy in real-room listening by humans and machines, EPSRC, 10/2008 - 04/2012, £121,515, as PI
- S2S: Sound to Sense, EC FP6, 05/2007 - 04/2011, £168,923, as PI
- Studentship, QINETIQ, 10/2004 - 09/2007, £38,427, as PI
- Studentship, Defence and Science Technology Laboratory, 10/2000 - 09/2003, £4,200, as PI
- Professional activities and memberships
-
- Member of the Speech and Hearing research group
- Recipient of a University of Sheffield Senate Award for Sustained Excellence in Learning and Teaching, 2014.
- Recipient (with Dr Gordon Fraser) of a Microsoft Software Engineering Innovation Foundation Award in 2013.
- Guest editor of the IEEE Transactions on Audio, Speech and Language Processing special issue on blind signal processing for speech and audio applications, 2007.