Professor Guy Brown

BSc(Hons), PhD, MEd

School of Computer Science

Professor of Computer Science

Member of the Speech and Hearing (SpandH) research group

g.j.brown@sheffield.ac.uk
+44 114 222 1821

+44 114 222 1825

Regent Court (DCS)

Full contact details

Professor Guy Brown
School of Computer Science
Regent Court (DCS)
211 Portobello
Sheffield
S1 4DP

Profile

Professor Brown obtained a BSc (Hons) in Applied Science from Sheffield City Polytechnic in 1984 and a PhD in Computer Science from the University of Sheffield in 1992. He was appointed to a lectureship in the Department of Computer Science, University of Sheffield, in 1992.

He also obtained an MEd in Teaching and Learning from the University of Sheffield in 1997. He has held visiting appointments at LIMSI-CNRS (France), Ohio State University (USA), Helsinki University of Technology (Finland) and ATR (Japan).

He was appointed to a Chair of Computer Science in 2013. Professor Brown was Head of the Department of Computer Science from 2015 to 2023.

Research interests

Professor Brown's main research interest is Computational Auditory Scene Analysis (CASA), which aims to build machine systems that mimic the ability of human listeners to segregate complex mixtures of sound.

He also has interests in noise-robust and reverberation-robust automatic speech recognition, models of auditory function in normal and impaired hearing, binaural modelling and the phonetics of overlapping speech. A recent interest is the application of CASA technology in mobile robot platforms.

He is the co-editor (with DeLiang Wang) of Computational Auditory Scene Analysis: Principles, Algorithms, and Applications (IEEE Press/Wiley-Interscience).

Publications

Featured publications

Journal articles

Romero HE, Ma N, Brown G & Hill EA (2022) Acoustic screening for obstructive sleep apnea in home environments based on deep neural networks. IEEE Journal of Biomedical and Health Informatics, 26(7), 2941-2950.
Ma N, Gonzalez J & Brown GJ (2018) Robust binaural localization of a target sound source by combining spectral source models and deep neural networks. IEEE/ACM Transactions on Audio, Speech and Language Processing, 26(11), 2122-2131.
Ma N, May T & Brown GJ (2017) Exploiting Deep Neural Networks and Head Movements for Robust Binaural Localization of Multiple Sources in Reverberant Environments. IEEE Transactions on Audio, Speech, and Language Processing, 25(12), 2444-2453.
Keronen S, Kallasjoki H, Remes U, Brown GJ, Gemmeke J & Palomaki K (2012) Mask estimation and imputation methods for missing data speech recognition in a multisource reverberant environment. Computer Speech and Language.
Palomäki KJ & Brown GJ (2011) A computational model of binaural speech recognition: Role of across-frequency vs. within-frequency processing and internal noise. Speech Communication, 53(6), 924-940.
Brown GJ, Ferry RT & Meddis R (2010) A computer model of auditory efferent suppression: implications for the recognition of speech in noise. J Acoust Soc Am, 127(2), 943-954.
Kurtic E, Brown GJ & Wells B () Resources for turn competition in overlapping talk. Speech Communication, 55(5), 721-743.

Chapters

Blauert J & Brown GJ (2020) Reflexive and Reflective Auditory Feedback, Modern Acoustics and Signal Processing (pp. 3-31). Springer International Publishing

Conference proceedings papers

Romero HE, Ma N, Brown GJ & Johnson S (2023) Obstructive sleep apnea screening with breathing sounds and respiratory effort: a multimodal deep learning approach. INTERSPEECH 2023, Vol. 2023-August (pp 5451-5455)
Hu Q, Ma N & Brown GJ (2023) Robust binaural sound localisation with temporal attention. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Proceedings. Rhodes Island, Greece, 4 June 2023 - 4 June 2023.
Romero HE, Ma N & Brown GJ (2020) Snorer diarisation based on deep neural network embeddings. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Barcelona, Spain, 4 May 2020 - 8 May 2020.
White T, Fraser G & Brown G (2019) Improving random GUI testing with image-based widget detection. ISSTA 2019 Proceedings of the 28th ACM SIGSOFT International Symposium on Software Testing and Analysis (pp 307-317). Beijing, China, 15 July 2019 - 15 July 2019.
Kurtic E, Brown GJ & Wells B (2010) Resources for turn competition in overlap in multi-party conversations: speech rate, pausing and duration. INTERSPEECH (pp 2550-2553)
Hughes C, Brown G, Ma N & Dibben N () Acoustic effects of facial feminisation surgery on speech and singing: A case study. Proceedings of Interspeech 2024. Kos Island, Greece, 1 September 2024 - 1 September 2024.
Romero H, Ma N, Brown G & Johnson S () SLUMBR: SLeep statUs estiMation from aBdominal Respiratory effort. Proceedings of the 46th Annual International Conference of the IEEE Engineering in Medicine & Biology Society. Orlando, Florida, 15 July 2024 - 15 July 2024.
Brown GJ & Beeston AV () Perceptual compensation for effects of reverberation in speech identification: A computer model based on auditory efferent processing. Interspeech 2010. Japan, 26 September 2010 - 30 September 2010.

All publications

Books

(2011) Frontmatter. IEEE.
Wang D & Brown GJ (2006) Computational Auditory Scene Analysis. IEEE.

Journal articles

Romero HE, Ma N, Brown G & Hill EA (2022) Acoustic screening for obstructive sleep apnea in home environments based on deep neural networks. IEEE Journal of Biomedical and Health Informatics, 26(7), 2941-2950.
Wells W, Beeston A, Bradley E, Brown G, Crook H & Kurtic E (2019) Talking in time: the development of a self-administered conversation analysis based training programme for cochlear implant users. Cochlear Implants International, 20(5), 255-265.
Vecchiotti P, Ma N, Squartini S & Brown GJ (2019) End-to-end Binaural Sound Localisation from the Raw Waveform. CoRR, abs/1904.01916.
Ma N, Gonzalez J & Brown GJ (2018) Robust binaural localization of a target sound source by combining spectral source models and deep neural networks. IEEE/ACM Transactions on Audio, Speech and Language Processing, 26(11), 2122-2131.
Alghamdi N, Maddock S, Marxer R, Barker J & Brown GJ (2018) A corpus of audio-visual Lombard speech with frontal and profile views. Journal of the Acoustical Society of America, 143(6), 523-529.
Alghamdi N, Maddock S, Barker J & Brown GJ (2017) The impact of automatic exaggeration of the visual articulatory features of a talker on the intelligibility of spectrally distorted speech. Speech Communication, 95, 127-136.
Ma N, May T & Brown GJ (2017) Exploiting Deep Neural Networks and Head Movements for Robust Binaural Localization of Multiple Sources in Reverberant Environments. IEEE Transactions on Audio, Speech, and Language Processing, 25(12), 2444-2453.
Mill RW & Brown GJ (2016) Utilising temporal signal features in adverse noise conditions: Detection, estimation, and the reassigned spectrogram. The Journal of the Acoustical Society of America, 139(2), 904-917.
Remes U, Ramírez López A, Juvela L, Palomäki K, Brown GJ, Alku P & Kurimo M (2016) Comparing human and automatic speech recognition in a perceptual restoration experiment. Computer Speech & Language, 35, 14-31.
Keronen S, Kallasjoki H, Palomaki KJ, Brown GJ & Gemmeke JF (2015) Feature enhancement of reverberant speech by distribution matching and non-negative matrix factorization. EURASIP Journal on Advances in Signal Processing, 76.
Beeston AV, Brown GJ & Watkins AJ (2014) Perceptual compensation for the effects of reverberation on consonant identification: Evidence from studies with monaural stimuli. The Journal of the Acoustical Society of America, 136(6), 3072-3084.
Juergens T, Brand T, Clark NR, Meddis R & Brown GJ (2013) The robustness of speech representations obtained from simulated auditory nerve fibers under different noise conditions. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 134(3), EL282-EL288.
Meddis R, Lecluyse W, Clark NR, Jürgens T, Tan CM, Panda MR & Brown GJ (2013) A computer model of the auditory periphery and its application to the study of hearing. Advances in Experimental Medicine and Biology, 787, 11-20.
Clark NR, Brown GJ, Jurgens T & Meddis R (2012) A frequency-selective feedback model of auditory efferent suppression and its implications for the recognition of speech in noise. Journal of the Acoustical Society of America, 132(3), 1535-1541.
Keronen S, Kallasjoki H, Remes U, Brown GJ, Gemmeke J & Palomaki K (2012) Mask estimation and imputation methods for missing data speech recognition in a multisource reverberant environment. Computer Speech and Language.
Gorisch J, Wells B & Brown GJ (2012) Pitch Contour Matching and Interactional Alignment across Turns: An Acoustic Investigation. Language and Speech, 55, 57-76.
Brown GJ, Jürgens T, Meddis R, Robertson M & Clark NR (2011) The representation of speech in a nonlinear auditory model: Time-domain analysis of simulated auditory-nerve firing patterns. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2453-2456.
Palomäki KJ & Brown GJ (2011) A computational model of binaural speech recognition: Role of across-frequency vs. within-frequency processing and internal noise. Speech Communication, 53(6), 924-940.
Brown GJ, Ferry RT & Meddis R (2010) A computer model of auditory efferent suppression: implications for the recognition of speech in noise. J Acoust Soc Am, 127(2), 943-954.
Robertson M, Brown GJ, Lecluyse W, Panda M & Tan CM (2010) A speech-in-noise test based on spoken digits: Comparison of normal and impaired listeners using a computer model. Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010, 2470-2473.
Brown GJ (2010) Computational auditory scene analysis: a representational approach.
Meddis R, Ferry R & Brown GJ (2008) Speech in noise and the medial olivo‐cochlear efferent system. The Journal of the Acoustical Society of America, 123(5), 3051-3051.
Wrigley SN, Tucker S, Brown GJ & Whittaker S (2008) Effect of sound spatialisation on multitasking in remote meetings. The Journal of the Acoustical Society of America, 123(5), 3861-3861.
Brown GJ, Mill RW & Tucker S (2008) Auditory‐motivated techniques for detection and classification of passive sonar signals. The Journal of the Acoustical Society of America, 123(5), 3344-3344.
Brown GJ & Palomäki KJ (2008) A reverberation‐robust automatic speech recognition system based on temporal masking. The Journal of the Acoustical Society of America, 123(5), 2978-2978.
Palomäki KJ & Brown GJ (2008) A computational model of binaural speech intelligibility level difference. The Journal of the Acoustical Society of America, 123(5), 3715-3715.
Makino S, Lee TW & Brown GJ (2007) Introduction to the special section on blind signal processing for speech and audio applications. IEEE T AUDIO SPEECH, 15(5), 1509-1510.
Mill RW & Brown GJ (2007) Auditory-inspired interval statistic receivers for passive sonar signal detection. OCEANS 2007 - Europe.
Eaglestone B, Ford N, Brown GJ & Moore A (2007) Information systems and creativity: an empirical study. J DOC, 63(4), 443-464.
Harding S, Barker J & Brown GJ (2006) Mask estimation for missing data speech recognition based on statistics of binaural interaction. IEEE T AUDIO SPEECH, 14(1), 58-67.
Eggink J & Brown GJ (2005) Using instrument recognition for melody extraction from polyphonic audio. The Journal of the Acoustical Society of America, 118(3), 2032-2032.
Tucker S & Brown GJ (2005) Classification of transient sonar sounds using perceptually motivated features. IEEE J OCEANIC ENG, 30(3), 588-600.
Brown GJ & Palomäki KJ (2005) A computational model of the speech reception threshold for laterally separated speech and noise. 9th European Conference on Speech Communication and Technology, 1753-1756.
Wrigley SN, Brown GJ, Wan V & Renals S (2005) Speech and crosstalk detection in multichannel audio. IEEE T SPEECH AUDI P, 13(1), 84-91.
Palomaki KJ, Brown GJ & Wang DL (2004) A binaural processor for missing data speech recognition in the presence of noise and small-room reverberation. SPEECH COMMUN, 43(4), 361-378.
Wrigley SN & Brown GJ (2004) A computational model of auditory selective attention. IEEE Trans Neural Netw, 15(5), 1151-1163.
Palomaki KJ, Brown GJ & Barker JP (2004) Techniques for handling convolutional distortion with 'missing data' automatic speech recognition. SPEECH COMMUN, 43(1-2), 123-142.
Roman N, Wang DL & Brown GJ (2004) A classification-based cocktail-party processor. Advances in Neural Information Processing Systems.
Eggink J & Brown GJ (2004) Instrument recognition in accompanied sonatas and concertos. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 4.
Roman N, Wang DL & Brown GJ (2003) Speech segregation based on sound localization. J ACOUST SOC AM, 114(4), 2236-2252.
Wu MY, Wang DL & Brown GJ (2003) A multipitch tracking algorithm for noisy speech. IEEE T SPEECH AUDI P, 11(3), 229-241.
Eggink J & Brown GJ (2003) A missing feature approach to instrument identification in polyphonic music. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 5, 553-556.
Kang J, Meng Y & Brown GJ (2003) Sound propagation in micro-scale urban areas: Simulation and animation. Acta Acustica (Stuttgart), 89(SUPP.).
Van Der Kouwe AJW, Wang DL & Brown GJ (2001) A comparison of auditory and blind separation techniques for speech segregation. IEEE Transactions on Speech and Audio Processing, 9(3), 189-195.
Wang DLL & Brown GJ (1999) Separation of speech from interfering sounds based on oscillatory correlation. IEEE T NEURAL NETWOR, 10(3), 684-697.
Cooke MP & Brown GJ (1999) Speech and hearing demonstrations in matlab. The Journal of the Acoustical Society of America, 105(2), 1213-1214.
Brown GJ & Wang DL (1999) An oscillatory correlation framework for the separation of speech from interfering sounds. The Journal of the Acoustical Society of America, 105(2), 1307-1307.
Cooke M & Brown GJ (1999) Interactive explorations in speech and hearing. Journal of the Acoustical Society of Japan (E), 20(2), 89-97.
Brown GJ & Wang D (1997) Modelling the perceptual segregation of double vowels with a network of neural oscillators. NEURAL NETWORKS, 10(9), 1547-1558.
Godsmark G & Brown GJ (1997) A computational model of auditory organization. I: Context sensitive integration of multiple grouping principles. British Journal of Audiology, 31(2), 116-117.
Godsmark D & Brown GJ (1997) A computational model of auditory organization. II: Grouping by emergent properties. British Journal of Audiology, 31(2), 117.
Todd NPM & Brown GJ (1996) Visualization of rhythm, time and metre. ARTIF INTELL REV, 10(3-4), 253-273.
Godsmark D & Brown GJ (1996) Combining multiple grouping cues in a blackboard model of auditory organisation. British Journal of Audiology, 30(2), 111.
Brown GJ & Cooke MP (1996) A neural oscillator model of auditory streaming. British Journal of Audiology, 30(2), 111-112.
McAngus Todd NP & Brown G (1994) A computational model of speech segmentation. The Journal of the Acoustical Society of America, 96(5_Supplement), 3293-3293.
BROWN GJ & COOKE M (1994) COMPUTATIONAL AUDITORY SCENE ANALYSIS. COMPUT SPEECH LANG, 8(4), 297-336.
BROWN GJ & COOKE M (1994) PERCEPTUAL GROUPING OF MUSICAL SOUNDS - A COMPUTATIONAL MODEL. J NEW MUSIC RES, 23(2), 107-132.
Todd NPM & Brown G (1994) A computational model of prosodic perception. The Journal of the Acoustical Society of America, 95(5_Supplement), 2950-2950.
Cooke M, Brown GJ, Crawford M & Green P (1993) Computational auditory scene analysis: listening to several things at once. Endeavour, 17(4), 186-190.
Crawford M, Cooke M & Brown G (1993) Interactive computational auditory scene analysis: An environment for exploring auditory representations and groups. The Journal of the Acoustical Society of America, 93(4_Supplement), 2308-2308.
Dewhurst DW, Brown GJ, Meehan AS & Meehan MJ (1990) Using the BBC microcomputer to teach the electrocardiogram to biology students. Journal of Biological Education, 24(1), 13-17.
BROWN GJ, COLLINS GGS, DEWHURST DG & HUGHES IE (1988) COMPUTER-SIMULATIONS IN TEACHING NEUROMUSCULAR PHARMACOLOGY - TIME FOR A CHANGE FROM TRADITIONAL METHODS. ATLA-ALTERN LAB ANIM, 16(2), 163-174.
BROWN GJ & DEWHURST DG (1988) A COMPUTER-ASSISTED-LEARNING PROGRAM FOR TEACHING THE ELECTROCARDIOGRAM TO SCIENCE UNDERGRADUATES. BRIT J PHARMACOL, 94, P470-P470.
BASARABHORWATH I, BROWN GJ, DEWHURST DG, HALL J & MEEHAN AS (1988) AUTOMATED-ANALYSIS OF DRUG-INDUCED CONVULSIONS USING BBC MICROCOMPUTER. BRIT J PHARMACOL, 94, P471-P471.
BROWN GJ, COLLINS GGS & DEWHURST DG (1988) A COMPUTER-ASSISTED-LEARNING PROGRAM FOR TEACHING NEUROMUSCULAR PHARMACOLOGY TO UNDERGRADUATES. BRIT J PHARMACOL, 94, P472-P472.
DEWHURST DG, BROWN GJ & MEEHAN AS (1988) MICROCOMPUTER SIMULATIONS OF LABORATORY EXPERIMENTS IN PHYSIOLOGY. ATLA-ALTERN LAB ANIM, 15(4), 280-289.
DEWHURST DG, BROWN GJ & MEEHAN AS (1988) COMPUTER-SIMULATIONS - AN ALTERNATIVE TO THE USE OF ANIMALS IN TEACHING. J BIOL EDUC, 22(1), 19-22.
Kurtic E, Brown GJ & Wells B () Resources for turn competition in overlapping talk. Speech Communication, 55(5), 721-743.
Clark N, Brown GJ, Jurgens T & Meddis R () A frequency-selective feedback model of auditory efferent suppression and its implications for the recognition of speech in noise. Journal of the Acoustical Society of America.

Chapters

Blauert J & Brown GJ (2020) Reflexive and Reflective Auditory Feedback, Modern Acoustics and Signal Processing (pp. 3-31). Springer International Publishing
(2011) Neural and Perceptual Modeling, Computational Auditory Scene Analysis IEEE
(2011) Reverberation, Computational Auditory Scene Analysis IEEE
(2011) Binaural Sound Localization, Computational Auditory Scene Analysis IEEE
(2011) Fundamentals of Computational Auditory Scene Analysis, Computational Auditory Scene Analysis IEEE
Brown GJ (2010) Physiological Models of Auditory Scene Analysis, Computational Models of the Auditory System (pp. 203-236). Springer US
Brown GJ, Wells B & Kurtic E (2009) Fundamental frequency height as a resource for the management of overlap in talk-in-interaction. In Barth-Weingarten D, Dehé N & Wichmann A (Ed.), Where Prosody Meets Pragmatics (pp. 183-204). Emerald Group Publishing Limited
Brown GJ (2001) Auditory Scene Analysis: Computational Models, International Encyclopedia of the Social & Behavioral Sciences (pp. 943-946). Elsevier
Brown GJ (1998) Teaching Professional Ethics to Software Engineers, Projects in the Computing Curriculum (pp. 3-18). Springer London
McAngus Todd NP & Brown GJ (1996) Visualization of Rhythm, Time and Metre, Integration of Natural Language and Vision Processing (pp. 253-273). Springer Netherlands
() Listening to Speech Psychology Press
Brown GJ & Wang D () Separation of Speech by Computational Auditory Scene Analysis, Signals and Communication Technology (pp. 371-402). Springer-Verlag

Conference proceedings papers

Xu X, Brown G & Ma N (2025) Sound-Based Sleep Staging Using Pretrained Speech Foundation Models. Proceedings of the 47th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 14 July 2025 - 14 July 2025.
Romero HE, Ma N, Brown GJ & Johnson S (2023) Obstructive sleep apnea screening with breathing sounds and respiratory effort: a multimodal deep learning approach. INTERSPEECH 2023, Vol. 2023-August (pp 5451-5455)
Hu Q, Ma N & Brown GJ (2023) Robust binaural sound localisation with temporal attention. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Proceedings. Rhodes Island, Greece, 4 June 2023 - 4 June 2023.
Ma N, Brown GJ & Vecchiotti P (2021) AMI – Creating Coherent Musical Composition with Attention. ICMC 2021 - Proceedings of the International Computer Music Conference 2021 (pp 414-418)
Ma N, Brown GJ & Vecchiotti P (2021) AMI – Creating musical compositions with a coherent long-term structure. AISB Convention 2021: Communication and Conversations
Romero HE, Ma N, Hill EA & Brown GJ (2020) 0573 Screening for obstructive sleep apnea at home based on deep learning features derived from respiration sounds. Sleep, Vol. 43(Supplement_1) (pp a219-a220). Philadelphia, PA, USA (online conference), 27 August 2020 - 30 August 2020.
Romero HE, Ma N & Brown GJ (2020) Snorer diarisation based on deep neural network embeddings. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Barcelona, Spain, 4 May 2020 - 8 May 2020.
Romero HE, Ma N, Hill EA & Brown GJ (2020) SCREENING FOR OBSTRUCTIVE SLEEP APNEA AT HOME BASED ON DEEP LEARNING FEATURES DERIVED FROM RESPIRATION SOUNDS. SLEEP, Vol. 43 (pp A219-A220)
White T, Fraser G & Brown G (2019) Improving random GUI testing with image-based widget detection. ISSTA 2019 Proceedings of the 28th ACM SIGSOFT International Symposium on Software Testing and Analysis (pp 307-317). Beijing, China, 15 July 2019 - 15 July 2019.
Vecchiotti P, Ma N, Squartini S & Brown G (2019) End-to-end binaural sound localisation from the raw waveform. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP-2019). Brighton, UK, 12 May 2019 - 17 May 2019.
Romero H, Ma N, Brown G, Beeston A & Hasan M (2019) Deep learning features for robust detection of acoustic events in sleep-disordered breathing. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP-2019). Brighton, UK, 12 May 2019 - 17 May 2019.
Romero HE, Ma N, Brown GJ, Beeston AV & Hasan M (2019) Deep Learning Features for Robust Detection of Acoustic Events in Sleep-disordered Breathing. ICASSP (pp 810-814)
Vecchiotti P, Ma N, Squartini S & Brown GJ (2019) End-to-end Binaural Sound Localisation from the Raw Waveform. ICASSP (pp 451-455)
White T, Fraser G & Brown GJ (2018) Modelling Hand Gestures to Test Leap Motion Controlled Applications. 2018 IEEE International Conference on Software Testing, Verification and Validation Workshops (ICSTW) (pp 204-213), 13 April 2018 - 13 April 2018.
Guo Y, Wang X, Wu C, Fu Q, Ma N & Brown G (2016) A robust dual-microphone speech source localization algorithm for reverberant environments. Proceedings of INTERSPEECH 2016
Zeiler S, Nicheli R, Ma N, Brown GJ & Kolossa D (2016) Robust audiovisual speech recognition using noise-adaptive linear discriminant analysis. 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 20 March 2016 - 25 March 2016.
Ma N, Marxer R, Barker J & Brown GJ (2015) Exploiting synchrony spectra and deep neural networks for noise-robust automatic speech recognition. 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 13 December 2015 - 17 December 2015.
Alghamdi N, Maddock SC, Brown GJ & Barker J (2015) Investigating the Impact of Artificial Enhancement of Lip Visibility on the Intelligibility of Spectrally-Distorted Speech. FAAVSP-2015 (pp 93-98), 11 September 2015 - 13 September 2015.
Ma N, Brown G & May T (2015) Exploiting deep neural networks and head movements for binaural localisation of multiple speakers in reverberant conditions. Proceedings of Interspeech 2015 (pp 160-164). Dresden, Germany, 6 September 2015 - 10 September 2015.
Ma N, Brown GJ & Gonzalez JA (2015) Exploiting top-down source models to improve binaural localisation of multiple sources in reverberant environments. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 2015-January (pp 160-164)
May T, Ma N & Brown GJ (2015) Robust localisation of multiple speakers exploiting head movements and multi-conditional training of binaural cues. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Brisbane, 19 April 2015 - 24 April 2015.
Ma N, May T, Wierstorf H & Brown GJ (2015) A machine-hearing system exploiting head movements for binaural sound localisation in reverberant conditions. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 19 April 2015 - 24 April 2015.
May T, Ma N, Brown GJ & IEEE (2015) ROBUST LOCALISATION OF MULTIPLE SPEAKERS EXPLOITING HEAD MOVEMENTS AND MULTI-CONDITIONAL TRAINING OF BINAURAL CUES. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP) (pp 2679-2683)
Ma N, May T, Wierstorf H, Brown GJ & IEEE (2015) A MACHINE-HEARING SYSTEM EXPLOITING HEAD MOVEMENTS FOR BINAURAL SOUND LOCALISATION IN REVERBERANT CONDITIONS. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP) (pp 2699-2703)
Lin L, Barker J & Brown GJ (2015) The effect of cochlear implant processing on speaker intelligibility: A perceptual study and computer model. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 2015-January (pp 1566-1570)
Schymura C, Walther T, Kolossa D, Ma N & Brown GJ (2014) Binaural sound source localisation using a Bayesian-network-based blackboard system and hypothesis-driven feedback. Proceedings of Forum Acusticum, Vol. 2014-January
Beeston AV & Brown GJ (2014) Consonant confusions provide further evidence that time-reversed rooms disturb compensation for reverberation. Proceedings of Forum Acusticum, Vol. 2014-January
Kallasjoki H, Gemmeke J, Palomaki K, Beeston A & Brown GJ (2014) Recognition of reverberant speech by missing data imputation and NMF feature enhancement. IEEE SPS AASP REVERB Challenge Workshop. Florence, 10 May 2014 - 10 May 2014.
Hunt C, Brown GJ & Fraser G (2014) Automatic testing of natural user interfaces. Seventh IEEE International Conference on Software Testing, Verification and Validation, 31 March 2014 - 4 April 2014.
Brown GJ, Beeston AV & Palomäki KJ (2012) Perceptual compensation for the effects of reverberation on consonant identification: A comparison of human and machine performance. 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, Vol. 2 (pp 1714-1717)
Kurtic E, Wells B, Brown GJ, Kempton T & Aker A (2012) A Corpus of Spontaneous Multi-party Conversation in Bosnian Serbo-Croatian and British English. Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC 2012), Istanbul, Turkey
Brown GJ, Juergens T, Meddis R, Robertson M, Clark NR & Assoc ISC (2011) The representation of speech in a nonlinear auditory model: time-domain analysis of simulated auditory-nerve firing patterns. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 (pp 2464-2467)
Kurtic E, Brown GJ, Wells B & ASSOC ISC (2010) Resources for turn competition in overlap in multi-party conversations: Speech rate, pausing and duration. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4 (pp 2554-+)
Robertson M, Brown GJ, Lecluyse W, Panda M, Tan CM & ASSOC ISC (2010) A speech-in-noise test based on spoken digits: Comparison of normal and impaired listeners using a computer model. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4 (pp 2474-+)
Kurtic E, Brown GJ & Wells B (2010) Resources for turn competition in overlap in multi-party conversations: speech rate, pausing and duration. INTERSPEECH (pp 2550-2553)
Wrigley SN, Tucker S, Brown GJ & Whittaker S (2009) Audio spatialisation strategies for multitasking during teleconferences. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 2935-2938)
Kurtić E, Brown GJ & Wells B (2009) Fundamental Frequency Height as a Resource for the Management of Overlap in Talk-in-Interaction (pp 183-203)
Wrigley SN, Tucker S, Brown GJ & Whittaker S (2009) Audio spatialisation strategies for multitasking during teleconferences. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 (pp 2903-2906)
Wrigley SN, Tucker S, Brown GJ & Whittaker S (2008) The influence of audio presentation style on multitasking during teleconferences. INTERSPEECH 2008 - 9th Annual Conference of the International Speech Communication Association (pp 801-804)
Wrigley SN, Tucker S, Brown GJ & Whittaker S (2008) The Influence of Audio Presentation Style on Multitasking During Teleconferences. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5 (pp 801-804)
Wrigley SN & Brown GJ (2008) Binaural speech separation using recurrent timing neural networks for joint F0-localisation estimation. MACHINE LEARNING FOR MULTIMODAL INTERACTION, Vol. 4892 (pp 271-282)
Wrigley SN & Brown GJ (2007) Recurrent timing neural networks for joint F0-localisation based speech separation. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Vol. 1
Wrigley SN & Brown GJ (2007) Recurrent timing neural networks for joint F0-localisation based speech separation. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PTS 1-3, PROCEEDINGS (pp 157-160)
Mill RW & Brown GJ (2007) Auditory-inspired interval statistic receivers for passive sonar signal detection. OCEANS 2007 - EUROPE, VOLS 1-3 (pp 37-42)
Brown GJ, Harding S & Barker JP (2006) Speech separation based on the statistics of binaural auditory features. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Vol. 5
Palomäki KJ, Brown GJ & Barker JP (2006) Recognition of reverberant speech using full cepstral features and spectral missing data. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Vol. 1
Palomaki KJ, Brown GJ & Barker JP (2006) Recognition of reverberant speech using full cepstral features and spectral missing data. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13 (pp 289-292)
Brown GJ, Harding S & Barker JP (2006) Speech separation based on the statistics of binaural auditory features. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13 (pp 5807-5810)
Palomaki KJ, Brown GJ & Barker JP (2006) Recognition of reverberant speech using full cepstral features and spectral missing data. 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol I, Proceedings (pp 289-292). Toulouse, FRANCE, 14 May 2006 - 19 May 2006.
Brown GJ, Harding S & Barker JP (2006) Speech separation based on the statistics of binaural auditory features. 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol V, Proceedings (pp 949-952)
Harding S, Barker J & Brown GJ (2005) Binaural feature selection for missing data speech recognition. 9th European Conference on Speech Communication and Technology (pp 1269-1272)
Wrigley SN & Brown GJ (2005) Physiologically motivated audio-visual localisation and tracking. 9th European Conference on Speech Communication and Technology (pp 773-776)
Harding S, Barker J & Brown GJ (2005) Mask estimation based on sound localisation for missing data speech recognition. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5 (pp 537-540)
Brown GJ & Palomaki KJ (2005) Techniques for robust speech recognition in noisy and reverberant conditions. SPEECH SEPARATION BY HUMANS AND MACHINES (pp 213-220)
Eggink J & Brown GJ (2004) Instrument recognition in accompanied sonatas and concertos. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PROCEEDINGS (pp 217-220)
Eggink J & Brown GJ (2004) EXTRACTING MELODY LINES FROM COMPLEX AUDIO. ISMIR 2004 - 5th International Symposium on Music Information Retrieval
Wrigley SN, Brown GJ, Wan V & Renals S (2003) Feature selection for the classification of crosstalk in multi-channel audio. EUROSPEECH 2003 - 8th European Conference on Speech Communication and Technology (pp 469-472)
Eggink J & Brown GJ (2003) A missing feature approach to instrument identification in polyphonic music. 2003 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS PROCEEDINGS (pp 49-49)
Eggink J & Brown GJ (2003) A missing feature approach to instrument identification in polyphonic music. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS (pp 553-556)
Dahan K, Brown GJ & Eaglestone B (2003) New Strategies for Computer-Assisted Composition Software: A Perspective. International Computer Music Conference, ICMC Proceedings
Eggink J & Brown GJ (2003) Application Of Missing Feature Theory To The Recognition Of Musical Instruments In Polyphonic Audio. 4th International Symposium on Music Information Retrieval, ISMIR 2003
Roman N, Wang D & Brown GJ (2002) Location-based sound segregation. IEEE International Conference on Acoustics Speech and Signal Processing, 13 May 2002 - 17 May 2002.
Roman N, Wang DL & Brown GJ (2002) Location-based sound segregation. Proceedings of the International Joint Conference on Neural Networks, Vol. 3 (pp 2299-2303)
Wu M, Wang D & Brown GJ (2002) A multi-pitch tracking algorithm for noisy speech. IEEE International Conference on Acoustics Speech and Signal Processing, 13 May 2002 - 17 May 2002.
Palomaki KJ, Brown GJ & Barker J (2002) Missing data speech recognition in reverberant conditions. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS (pp 65-68)
Wrigley SN & Brown GJ (2002) A neural oscillator model of auditory selective attention. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 14, VOLS 1 AND 2, Vol. 14 (pp 1205-1212)
Nuhn R, Eaglestone B, Ford N, Moore A & Brown G (2002) A Qualitative Analysis of Composers at Work. International Computer Music Conference, ICMC Proceedings (pp 572-580)
Yao X, Fischer M & Brown G (2001) Neural network ensembles and their application to traffic flow prediction in telecommunications networks. Proceedings of the International Joint Conference on Neural Networks, Vol. 1 (pp 693-698)
Brown GJ, Barker J & Wang DL (2001) A neural oscillator sound separator for missing data speech recognition. Proceedings of the International Joint Conference on Neural Networks, Vol. 4 (pp 2907-2912)
Brown GJ, Barker J & Wang DL (2001) A neural oscillator sound separator for missing data speech recognition. IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS (pp 2907-2912)
Wrigley SN & Brown GJ (2001) A neural oscillator model of auditory attention. ARTIFICIAL NEURAL NETWORKS-ICANN 2001, PROCEEDINGS, Vol. 2130 (pp 1163-1170)
Makin SJ & Brown GJ (2000) Identification of concurrent vowels using spectral matching with 'missing data'. BRITISH JOURNAL OF AUDIOLOGY, Vol. 34(2) (pp 108-109)
Wrigley SN & Brown GJ (2000) Synfire chains as a neural mechanism for auditory grouping. BRITISH JOURNAL OF AUDIOLOGY, Vol. 34(2) (pp 116-117)
Brown GJ & Wang DL (2000) An oscillatory correlation framework for computational auditory scene analysis. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12, Vol. 12 (pp 747-753)
Godsmark D & Brown GJ (1999) A blackboard architecture for computational auditory scene analysis. SPEECH COMMUNICATION, Vol. 27(3-4) (pp 351-366)
Cooke M, Parker H, Brown GJ & Wrigley SN (1999) The interactive auditory demonstrations project. EUROSPEECH
Godsmark G & Brown GJ (1997) A computational model of auditory organization .1. Context sensitive integration of multiple grouping principles. BRITISH JOURNAL OF AUDIOLOGY, Vol. 31(2) (pp 116-117)
Godsmark D & Brown GJ (1997) A computational model of auditory organization .2. Grouping by emergent properties. BRITISH JOURNAL OF AUDIOLOGY, Vol. 31(2) (pp 117-117)
Brown GJ & Wang DL (1997) Modelling the perceptual separation of concurrent vowels with a network of neural oscillators. 1997 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS 1-4 (pp 569-574)
Todd NPMA & Brown GJ (1994) A COMPUTATIONAL MODEL OF PROSODY PERCEPTION. 3rd International Conference on Spoken Language Processing, ICSLP 1994 (pp 127-130)
COOKE MP & BROWN GJ (1993) COMPUTATIONAL AUDITORY SCENE ANALYSIS - EXPLOITING PRINCIPLES OF PERCEIVED CONTINUITY. SPEECH COMMUNICATION, Vol. 13(3-4) (pp 391-399)
Brown GJ & Cooke MP (1992) A Computational Model of Auditory Scene Analysis. 2nd International Conference on Spoken Language Processing, ICSLP 1992 (pp 523-526)
Hughes C, Brown G, Ma N & Dibben N () Acoustic effects of facial feminisation surgery on speech and singing: A case study. Proceedings of Interspeech 2024. Kos island, Greece, 1 September 2024 - 1 September 2024.
Romero H, Ma N, Brown G & Johnson S () SLUMBR: SLeep statUs estiMation from aBdominal Respiratory effort. Proceedings of the 46th Annual International Conference of the IEEE Engineering in Medicine & Biology Society. Orlando, Florida, 15 July 2024 - 15 July 2024.
Ma N & Brown GJ () Speech Localisation in a Multitalker Mixture by Humans and Machines. Interspeech 2016
Brown GJ, Beeston AV & Palomäki KJ () Perceptual compensation for the effects of reverberation on consonant identification: a comparison of human and machine performance. Interspeech 2012
Roman N, DeLiang Wang & Brown GJ () Location-based sound segregation. Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN'02 (Cat. No.02CH37290)
Mingyang Wu, DeLiang Wang & Brown GJ () Pitch tracking based on statistical anticipation. IJCNN'01. International Joint Conference on Neural Networks. Proceedings (Cat. No.01CH37222)
Roman N, DeLiang Wang & Brown GJ () Speech segregation based on sound localization. IJCNN'01. International Joint Conference on Neural Networks. Proceedings (Cat. No.01CH37222)
Brown GJ & DeLiang Wang () The separation of speech from interfering sounds: an oscillatory correlation approach. IJCNN'99. International Joint Conference on Neural Networks. Proceedings (Cat. No.99CH36339)
Brown GJ & Cooke M () A neural oscillator model of primitive auditory grouping. Proceedings of 1995 Workshop on Applications of Signal Processing to Audio and Acoustics
Brown GJ & Beeston AV () Perceptual compensation for effects of reverberation in speech identification: A computer model based on auditory efferent processing. Interspeech 2010. Japan, 26 September 2010 - 30 September 2010.

Exhibitions

Maddock SC, Brown GJ & Bax N (2014, September 18) Computer Love 2.0. Festival of the Mind 2014, Sheffield.

Posters

Alghamdi N, Maddock S, Brown GJ & Barker J (2015) A comparison of audiovisual and auditory-only training on the perception of spectrally-distorted speech. 18th International Congress of Phonetic Sciences.

Preprints

Ma N, May T & Brown GJ (2019) Exploiting Deep Neural Networks and Head Movements for Robust Binaural Localisation of Multiple Sources in Reverberant Environments, arXiv.
Ma N, Gonzalez JA & Brown GJ (2019) Robust Binaural Localization of a Target Sound Source by Combining Spectral Source Models and Deep Neural Networks, arXiv.
Romero HE, Ma N, Brown GJ, Beeston AV & Hasan M (2019) Deep Learning Features for Robust Detection of Acoustic Events in Sleep-Disordered Breathing, arXiv.
Vecchiotti P, Ma N, Squartini S & Brown GJ (2019) End-to-end Binaural Sound Localisation from the Raw Waveform, arXiv.

Grants

Teaching computer science and music through live coding, Research England, 07/2023 - 02/2024, £50,642, as Co-PI
SOMNUS: Sleep disOrder MoNitoring by Unobtrusive Sensors, Innovate UK, 07/2021 - 11/2023, £120,228, as PI
Monitoring sleep disordered breathing of long-Covid patients at home using acoustic AI Technology, Research England, 01/2022 - 07/2022, £71,222, as Co-PI
Making Elektra, Research England, 02/2021 - 04/2021, £6,236, as PI
Brahms: Breathing Resistance Assessment via Home Monitoring of Sleep, Innovate UK, 06/2019 - 02/2021, £109,600, as PI
MAI: Musical Artificial Intelligence, HEFCE, 02/2019 - 05/2020, £53,408, as PI
Institute of Coding, HEFCE, 11/2017 - 03/2020, £957,000, as Co-PI
Studentship, Passion 4 Life, 10/2017 - 09/2020, as PI
Passion for Life, Innovate UK, 04/2015 - 06/2017, £149,280, as PI
Meeting the challenge of simultaneous talk for cochlear implant users, AHRC, 03/2014 - 03/2015, £69,339, as Co-PI
Two!Ears, EC - FP7, 12/2013 - 11/2016, £267,134, as PI
Automatic Testing of Natural User Interfaces, Microsoft Research Ltd., 06/2013 - 12/2017, £15,625, as Co-PI
A computational model of speech recognition in hearing impaired listeners based on missing feature theory, RNID, 10/2007 - 09/2010, £68,951, as PI
Phonetic design of overlapping speech in talk-in-interaction: A cross-linguistic study, AHRC, 01/2009 - 06/2012, £169,652, as Co-PI
Perceptual constancy in real-room listening by humans and machines, EPSRC, 10/2008 - 04/2012, £121,515, as PI
S2S: Sound to Sense, EC FP6, 05/2007 - 04/2011, £168,923, as PI
Studentship, QINETIQ, 10/2004 - 09/2007, £38,427, as PI
Studentship, Defence Science and Technology Laboratory, 10/2000 - 09/2003, £4,200, as PI

Professional activities and memberships

Member of the Speech and Hearing research group
Recipient of a University of Sheffield Senate Award for Sustained Excellence in Learning and Teaching, 2014.
Recipient (with Dr Gordon Fraser) of a Microsoft Software Engineering Innovation Foundation Award in 2013.
Guest editor of the IEEE Transactions on Audio, Speech and Language Processing special issue on blind signal processing for speech and audio applications, 2007.
