Dr Stefan Goetze
School of Computer Science
Visiting Professor
Member of the Speech and Hearing (SpandH) research group


Full contact details
School of Computer Science
Regent Court (DCS)
211 Portobello
Sheffield
S1 4DP
- Profile
Stefan Goetze was a Senior Lecturer in the School of Computer Science at Sheffield from 2020 to 2025. He obtained the degrees 'Dipl.-Ing.' in 2004 and 'Dr.-Ing.' in 2013 in Electrical/Communication Engineering from the University of Bremen, Germany.
From 2008 to 2020 he was with the Fraunhofer Institute for Digital Media Technology IDMT in Oldenburg, Germany, where he was first Head of "Audio System Technology for Audiology and Assistive Systems" (2010-2017) and later Head of "Automatic Speech Recognition" as well as Dept. Head of the Department "Hearing, Speech and Audio Technology" (2017-2020).
- Research interests
His research interests include machine learning, signal analysis, enhancement and classification, both for large-scale applications and for resource-limited IoT (Internet of Things) and assistive devices.
- Publications
Journal articles
- Att-TasNet: attending to encodings in time-domain audio speech separation of noisy, reverberant speech mixtures. Frontiers in Signal Processing, 2.
- Non-intrusive speech quality prediction using modulation energies and LSTM-network. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 27(7), 1151-1163.
- Joint estimation of reverberation time and early-to-late reverberation ratio from single-channel speech signals. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 27(2), 255-267.
- Intelligente Erkennersysteme für die Pflege. Pflegezeitschrift, 72(1-2), 17-19.
- Exploring auditory-inspired acoustic features for room acoustic parameter estimation from monaural speech. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 26(10), 1809-1820.
- Multi-Channel Speech Enhancement and Amplitude Modulation Analysis for Noise Robust Automatic Speech Recognition. Computer Speech & Language, 46, 558-573.
- Classifier architectures for acoustic scenes and events: implications for DNNs, TDNNs, and perceptual features from DCASE 2016. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 25(6), 1304-1314.
- Instrumental and perceptual evaluation of dereverberation techniques based on robust acoustic multichannel equalization. Journal of the Audio Engineering Society, 65(1/2), 117-129.
- Joint beamforming and spectral enhancement for robust automatic speech recognition in reverberant environments. The Journal of the Acoustical Society of America, 139(4), 2224-2225.
- Combination of MVDR beamforming and single-channel spectral processing for enhancing noisy and reverberant speech. EURASIP Journal on Advances in Signal Processing, 2015(1).
- Spectro-Temporal Gabor Filterbank Features for Acoustic Event Detection. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 23(12), 2198-2208.
- Front-end technologies for robust ASR in reverberant environments—spectral enhancement-based dereverberation and auditory modulation filterbank features. EURASIP Journal on Advances in Signal Processing, 2015(1).
- Reduction of Gaussian, Supergaussian, and Impulsive Noise by Interpolation of the Binary Mask Residual. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 23(10), 1680-1691.
- Joint estimation of pitch and direction of arrival: improving robustness and accuracy for multi-speaker scenarios. EURASIP Journal on Audio, Speech, and Music Processing, 2014(1).
- Information and communication technologies for promoting and sustaining quality of life, health and self-sufficiency in ageing societies – outcomes of the Lower Saxony Research Network Design of Environments for Ageing (GAL). Informatics for Health and Social Care, 39(3-4), 166-187.
- Regularization for Partial Multichannel Equalization for Speech Dereverberation. IEEE Transactions on Audio, Speech, and Language Processing, 21(9), 1879-1890.
- Acoustic Monitoring and Localization for Social Care. Journal of Computing Science and Engineering, 6(1), 40-50.
- Acoustic user interfaces for ambient-assisted living technologies. Informatics for Health and Social Care, 35(3-4), 125-143.
- The Lower Saxony research network design of environments for ageing: towards interdisciplinary research on information and communication technologies in ageing societies. Informatics for Health and Social Care, 35(3-4), 92-103.
- A study on combining acoustic echo cancelers with impulse response shortening. The Journal of the Acoustical Society of America, 120(5), 3258-3258.
- Effectiveness of computer tailored health communication in increasing physical activity in people with or at risk of long-term conditions: systematic review and meta-analysis (Preprint). Journal of Medical Internet Research.
- Speech Quality Assessment for Listening-Room Compensation. Journal of the Audio Engineering Society, 62(6), 386-399.
Chapters
- Computer-Based Adaption of Cooking Recipes Integrated in a Speech Dialogue Assistance System, Ambient Assisted Living (pp. 163-172). Springer International Publishing
- Ambient Voice Control for a Personal Activity and Household Assistant, Ambient Assisted Living (pp. 63-74). Springer Berlin Heidelberg
- Detection and Classification of Acoustic Events for In-Home Care, Ambient Assisted Living (pp. 181-195). Springer Berlin Heidelberg
- Automatic Live Monitoring of Communication Quality for Normal-Hearing and Hearing-Impaired Listeners, Lecture Notes in Computer Science (pp. 568-575). Springer Berlin Heidelberg
Conference proceedings papers
- Using Speech Foundational Models in Loss Functions for Hearing Aid Speech Enhancement. 2024 32nd European Signal Processing Conference (EUSIPCO) (pp 421-425), 26 August 2024 - 30 August 2024.
- Refining Text Input For Augmentative and Alternative Communication (AAC) Devices: Analysing Language Model Layers For Optimisation. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 14 May 2024 - 19 May 2024.
- The Effect of Spoken Language on Speech Enhancement Using Self-Supervised Speech Representation Loss Functions. Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) (pp 1-5). New York, NY, USA, 22 October 2023 - 25 October 2023.
- Perceive and Predict: Self-Supervised Speech Representation Based Loss Functions for Speech Enhancement. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4 June 2023 - 10 June 2023.
- Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4 June 2023 - 10 June 2023.
- ASR-Based, Single-Ended Modeling of Listening Effort - A Tool for TV Sound Engineers. Proceedings of Forum Acusticum (pp 2441-2445). Lyon, France, 7 December 2020 - 11 December 2020.
- Measuring, modelling and predicting perceived reverberation. Proceedings of 42nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017) (pp 381-385). New Orleans, LA, USA, 5 March 2017 - 9 March 2017.
- On DNN posterior probability combination in multi-stream speech recognition for reverberant environments. Proceedings of 42nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017) (pp 5250-5254). New Orleans, LA, USA, 5 March 2017 - 9 March 2017.
- Combination strategy based on relative performance monitoring for multi-stream reverberant speech recognition. Proceedings of 42nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017) (pp 4870-4874). New Orleans, LA, USA, 5 March 2017 - 9 March 2017.
- Performance comparison of real-time single-channel speech dereverberation algorithms. 2017 Hands-free Speech Communications and Microphone Arrays (HSCMA), 1 March 2017 - 3 March 2017.
- Performance comparison of intrusive and non-intrusive instrumental quality measures for enhanced speech. 2016 IEEE International Workshop on Acoustic Signal Enhancement (IWAENC), 13 September 2016 - 16 September 2016.
- Perceptual and instrumental evaluation of the perceived level of reverberation. 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 20 March 2016 - 25 March 2016.
- Classification of human cough signals using spectro-temporal Gabor filterbank features. 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 20 March 2016 - 25 March 2016.
- Late reverberant spectral variance estimation using acoustic channel equalization. 2015 23rd European Signal Processing Conference (EUSIPCO), 31 August 2015 - 4 September 2015.
- A CHiME-3 challenge system: Long-term acoustic features for noise robust automatic speech recognition. 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 13 December 2015 - 17 December 2015.
- A study on joint beamforming and spectral enhancement for robust speech recognition in reverberant environments. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 19 April 2015 - 24 April 2015.
- A study on speech quality and speech intelligibility measures for quality assessment of single-channel dereverberation algorithms. 2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC), 8 September 2014 - 11 September 2014.
- Subjective speech quality and speech intelligibility evaluation of single-channel dereverberation algorithms. 2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC), 8 September 2014 - 11 September 2014.
- Estimating room acoustic parameters for speech recognizer adaptation and combination in reverberant environments. 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4 May 2014 - 9 May 2014.
- On the use of spectro-temporal features for the IEEE AASP challenge ‘detection and classification of acoustic scenes and events’. 2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 20 October 2013 - 23 October 2013.
- Enhancing Wireless Sensor Networks with Acoustic Sensing Technology: Use Cases, Applications & Experiments. 2013 IEEE International Conference on Green Computing and Communications and IEEE Internet of Things and IEEE Cyber, Physical and Social Computing, 20 August 2013 - 23 August 2013.
- A perceptually constrained channel shortening technique for speech dereverberation. 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 26 May 2013 - 31 May 2013.
- Automatic acoustic siren detection in traffic noise by part-based models. 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 26 May 2013 - 31 May 2013.
- Blind estimation of reverberation time based on spectro-temporal modulation filtering. 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 26 May 2013 - 31 May 2013.
- Non-intrusive regularization for least-squares multichannel equalization for speech dereverberation. 2012 IEEE 27th Convention of Electrical and Electronics Engineers in Israel, 14 November 2012 - 17 November 2012.
- System identification for listening-room compensation by means of acoustic echo cancellation and acoustic echo suppression filters. 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 25 March 2012 - 30 March 2012.
- Voice activity detection driven acoustic event classification for monitoring in smart homes. 2010 3rd International Symposium on Applied Sciences in Biomedical and Communication Technologies (ISABEL 2010), 7 November 2010 - 10 November 2010.
- Hands-free telecommunication for elderly persons suffering from hearing deficiencies. The 12th IEEE International Conference on e-Health Networking, Applications and Services, 1 July 2010 - 3 July 2010.
- Quality assessment for listening-room compensation algorithms. 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, 14 March 2010 - 19 March 2010.
- Multi-channel listening-room compensation using a decoupled filtered-X LMS algorithm. 2008 42nd Asilomar Conference on Signals, Systems and Computers, 26 October 2008 - 29 October 2008.
- System Identification for Multi-Channel Listening-Room Compensation Using an Acoustic Echo Canceller. 2008 Hands-Free Speech Communication and Microphone Arrays, 6 May 2008 - 8 May 2008.
- Objective perceptual quality assessment for self-steering binaural hearing aid microphone arrays. 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, 31 March 2008 - 4 April 2008.
- Optimization of Gabor Features for Text-Independent Speaker Identification. 2007 IEEE International Symposium on Circuits and Systems, 27 May 2007 - 30 May 2007.
- Direction of arrival estimation based on the dual delay line approach for binaural hearing aid microphone arrays. 2007 International Symposium on Intelligent Signal Processing and Communication Systems, 28 November 2007 - 1 December 2007.
- Enhanced Partitioned Stereo Residual Echo Estimation. 2006 Fortieth Asilomar Conference on Signals, Systems and Computers, 29 October 2006 - 1 November 2006.
- The University of Sheffield CHiME-7 UDASE Challenge Speech Enhancement System. 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023)
Preprints
- Non-Intrusive Speech Intelligibility Prediction for Hearing-Impaired Users using Intermediate ASR Features and Human Memory Models, arXiv.
- On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic Environments, arXiv.
- The Effect of Spoken Language on Speech Enhancement using Self-Supervised Speech Representation Loss Functions, arXiv.
- Non Intrusive Intelligibility Predictor for Hearing Impaired Individuals using Self Supervised Speech Representations, arXiv.
- On Data Sampling Strategies for Training Neural Network Speech Separation Models, arXiv.
- Effectiveness of computer tailored health communication in increasing physical activity in people with or at risk of long-term conditions: systematic review and meta-analysis (Preprint), JMIR Publications Inc.
- Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation, arXiv.
- Joint Estimation of Reverberation Time and Direct-to-Reverberation Ratio from Speech using Auditory-Inspired Features, arXiv.
- Grants
Research Grants
- Participatory co-design of a platform for collecting atypical speech data, Research England, 03/2022 - 07/2022, £19,692, as PI