Dr Stefan Goetze
School of Computer Science
Senior Lecturer
Course Director for MSc Computer Science with Speech and Language Processing
Member of the Speech and Hearing (SpandH) research group
Full contact details
School of Computer Science
Regent Court (DCS)
211 Portobello
Sheffield
S1 4DP
- Profile
-
Stefan Goetze is Senior Lecturer in the Department of Computer Science. He obtained the degree 'Dipl.-Ing' in 2004 and 'Dr.-Ing.' in 2013 in Electrical/Communication Engineering from the University of Bremen, Germany.
From 2008 to 2020 he was with the Fraunhofer-Institute for Digital Media Technology IDMT in Oldenburg, Germany where he was first Head of "Audio System Technology for Audiology and Assistive Systems" (2010-2017) and later Head of "Automatic Speech Recognition" as well as Dept. Head of the Department "Hearing, Speech and Audio Technology" (2017-2020).
- Research interests
-
His research interests include machine learning, signal analysis, enhancement and classification as well for large scale applications as for resource-limited IoT (Internet of Things) and assistive devices.
- Publications
-
Journal articles
- Att-TasNet: attending to encodings in time-domain audio speech separation of noisy, reverberant speech mixtures. Frontiers in Signal Processing, 2. View this article in WRRO
- Non-intrusive speech quality prediction using modulation energies and LSTM-network. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 27(7), 1151-1163. View this article in WRRO
- Non-Intrusive Speech Quality Prediction Using Modulation Energies and LSTM-Network. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 27, 1151-1163.
- Joint estimation of reverberation time and early-to-late reverberation ratio from single-channel speech signals. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 27(2), 255-267. View this article in WRRO
- Intelligente Erkennersysteme für die Pflege. Pflegezeitschrift, 72(1-2), 17-19. View this article in WRRO
- Exploring auditory-inspired acoustic features for room acoustic parameter estimation from monaural speech. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 26(10), 1809-1820. View this article in WRRO
- Multi-Channel Speech Enhancement and Amplitude Modulation Analysis for Noise Robust Automatic Speech Recognition. Computer Speech & Language, 46, 558-573.
- Classifier architectures for acoustic scenes and events : implications for DNNs, TDNNs, and perceptual features from DCASE 2016. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 25(6), 1304-1314. View this article in WRRO
- Instrumental and perceptual evaluation of dereverberation techniques based on robust acoustic multichannel equalization. Journal of the Audio Engineering Society, 65(1/2), 117-129. View this article in WRRO
- Special Issue on Dereverberation and Reverberation of Audio, Music, and Speech. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 65(1-2), 6-7.
- Joint beamforming and spectral enhancement for robust automatic speech recognition in reverberant environments. The Journal of the Acoustical Society of America, 139(4), 2224-2225.
- Combination of MVDR beamforming and single-channel spectral processing for enhancing noisy and reverberant speech. EURASIP Journal on Advances in Signal Processing, 2015(1).
- Spectro-Temporal Gabor Filterbank Features for Acoustic Event Detection. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 23(12), 2198-2208.
- Front-end technologies for robust ASR in reverberant environments—spectral enhancement-based dereverberation and auditory modulation filterbank features. EURASIP Journal on Advances in Signal Processing, 2015(1).
- Reduction of Gaussian, Supergaussian, and Impulsive Noise by Interpolation of the Binary Mask Residual. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 23(10), 1680-1691.
- Joint estimation of pitch and direction of arrival: improving robustness and accuracy for multi-speaker scenarios. EURASIP Journal on Audio, Speech, and Music Processing, 2014(1).
- Information and communication technologies for promoting and sustaining quality of life, health and self-sufficiency in ageing societies – outcomes of the Lower Saxony Research NetworkDesign of Environments for Ageing(GAL). Informatics for Health and Social Care, 39(3-4), 166-187.
- Regularization for Partial Multichannel Equalization for Speech Dereverberation. IEEE Transactions on Audio, Speech, and Language Processing, 21(9), 1879-1890.
- Notrufsysteme mit automatischer akustischer Gefahrendetektion. Science^2 - Safety and Security, 1, 12-18.
- Acoustic Monitoring and Localization for Social Care. Journal of Computing Science and Engineering, 6(1), 40-50.
- Acoustic User Interfaces for Ambient Assisted Living Technologies. Informatics for Health and Social Care, SI Ageing & Technology, 35, 161-179.
- The Lower Saxony Research Network Design of Environments for Ageing (GAL) - Towards Interdisciplinary Research on ICT in Ageing Societies. Informatics for Health and Social Care, SI Ageing & Technology, 35, 92-103.
- Acoustic user interfaces for ambient-assisted living technologies. Informatics for Health and Social Care, 35(3-4), 125-143.
- The Lower Saxony research networkdesign of environments for ageing: towards interdisciplinary research on information and communication technologies in ageing societies. Informatics for Health and Social Care, 35(3-4), 92-103.
- A study on combining acoustic echo cancelers with impulse response shortening. The Journal of the Acoustical Society of America, 120(5), 3258-3258.
- Effectiveness of computer tailored health communication in increasing physical activity in people with or at risk of long-term conditions: systematic review and meta-analysis (Preprint). Journal of Medical Internet Research.
- Speech Quality Assessment for Listening-Room Compensation. Journal of the Audio Engineering Society, 62(6), 386-399.
Chapters
- Computer-Based Adaption of Cooking Recipes Integrated in a Speech Dialogue Assistance System, Ambient Assisted Living (pp. 163-172). Springer International Publishing
- Innovative Hörunterstützung in Kommunikationssystemen In Schick A, Meis M & Nocke C (Ed.), Beiträge zur psychologischen Akustik, Akustik in Büro und Objekt (pp. in press-in press). Oldenburg: Isensee Verlag.
- Acoustic Applications and Technologies for Ambient Assisted Living Scenarios, Ambient Assisted Living (AAL) Forum (pp. 337-342). Lecce, Italy.
- Ambient Voice Control for a Personal Activity and Household Assistant, Ambient Assisted Living (pp. 63-74). Springer Berlin Heidelberg
- Considering Hearing Deficiencies in Human-Computer Interaction In Ziefle M & Röcker C (Ed.), Human-Centered Design of E-Health Technologies: Concepts, Methods and Applications (pp. 180-207). IGI Global
- Detection and Classification of Acoustic Events for In-Home Care (Best-Paper Award) In Wichert R & Eberhardt B (Ed.), Ambient Assisted Living - Advanced Technologies and Societal Change, Springer Lecture Notes in Computer Science (LNCS) (pp. 181-196). Springer Science
- Detection and Classification of Acoustic Events for In-Home Care, Ambient Assisted Living (pp. 181-195). Springer Berlin Heidelberg
- Intelligente Konferenzsysteme für natürliche Freisprechkommunikation In Schick A, Meis M & Nocke C (Ed.), Beiträge zur psychologischen Akustik, Akustik in Büro und Objekt (pp. 249-266). Oldenburg: Isensee Verlag.
- Automatic Live Monitoring of Communication Quality for Normal-Hearing and Hearing-Impaired Listeners, Lecture Notes in Computer Science (pp. 568-575). Springer Berlin Heidelberg
Conference proceedings papers
- Refining Text Input For Augmentative and Alternative Communication (AAC) Devices: Analysing Language Model Layers For Optimisation. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 14 May 2024 - 19 May 2024.
- Non-Intrusive Speech Intelligibility Prediction for Hearing-Impaired Users Using Intermediate ASR Features and Human Memory Models.. ICASSP (pp 306-310)
- Hallucination in Perceptual Metric-Driven Speech Enhancement Networks. European Signal Processing Conference (pp 21-25)
- Using Speech Foundational Models in Loss Functions for Hearing Aid Speech Enhancement. European Signal Processing Conference (pp 421-425)
- The Effect of Spoken Language on Speech Enhancement Using Self-Supervised Speech Representation Loss Functions. 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 22 October 2023 - 25 October 2023.
- Pre-Trained Intermediate ASR Features and Human Memory Simulation for Non-Intrusive Speech Intelligibility Prediction in the Clarity Prediction Challenge 2. he 4th Clarity Workshop on Machine Learning Challenges for Hearing Aids (Clarity-2023). https://claritychallenge.org/clarity2023-workshop/results.html, 19 August 2023 - 19 August 2023.
- The Effect of Spoken Language on Speech Enhancement Using Self-Supervised Speech Representation Loss Functions. Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) (pp 1-5). New York, NY, USA, 22 October 2023 - 25 October 2023.
- Bridging the Communication Rate Gap: Enhancing Text Input for Augmentative and Alternative Communication (AAC). HCII 2023 Conference Proceedings, Vol. 10, 23 July 2023 - 23 July 2023.
- Perceive and Predict: Self-Supervised Speech Representation Based Loss Functions for Speech Enhancement. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4 June 2023 - 10 June 2023.
- Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4 June 2023 - 10 June 2023.
- Non-intrusive Speech Intelligibility Estimated By Metric Prediction for Hearing Impaired Individuals for the Clarity Prediction Challenge 1. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 18 September 2022 - 22 September 2022.
- MetricGAN+/-: Increasing Robustness of Noise Reduction on Unseen Data. Proc. 30th European Signal Processing Conference, EUSIPCO 2022. Belgrade, Serbia, 29 August 2022 - 2 September 2022.
- ASR-Based, Single-Ended Modeling of Listening Effort - A Tool for TV Sound Engineers. Proceedings of Forum Acusticum (pp 2441-2445). Lyon, France, 7 December 2020 - 11 December 2020.
- Single-ended Prediction of Listening Effort for English Speech. DAGA 2020 - 46. Jahrestagung für Akustik (pp 775-777). Hannover, Germany
- 2D audio-visual localization in home environments using a particle filter. Sprachkommunikation - 10. ITG-Fachtagung (pp 75-78)
- Context and user requirement analyses of a new digital speech therapy system (THERESIAH). Conf. on Implantable Auditory Prosthesis (CIAP). Lake Tahoe, CA, USA
- Hearing support to reduce listening effort at work: an EEG study. DAGA 2019 – Proc. 45th Annual Meeting of the Deutsche Gesellschaft für Akustik e.V.. Rostock, Germany
- Erfassung der Höranstrengung fertiger TV-Mischungen. DAGA 2019 – Proc. 45th Annual Meeting of the Deutsche Gesellschaft für Akustik e.V.. Rostock, Germany
- Automatische Überwachung der Sprachverständlichkeit im Rundfunkmaterial. 30th Tonmeistertagung – VDT International Convention. Düsseldorf, Germany
- Measuring, modelling and predicting perceived reverberation. Proceedings of 42nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017) (pp 381-385). New Orleans, LA, USA, 5 March 2017 - 9 March 2017. View this article in WRRO
- On DNN posterior probability combination in multi-stream speech recognition for reverberant environments. Proceedings of 42nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017) (pp 5250-5254). New Orleans, LA, USA, 5 March 2017 - 9 March 2017. View this article in WRRO
- Combination strategy based on relative performance monitoring for multi-stream reverberant speech recognition. Proceedings of 42nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017) (pp 4870-4874). New Orleans, LA, USA, 5 March 2017 - 9 March 2017. View this article in WRRO
- Performance comparison of real-time single-channel speech dereverberation algorithms. 2017 Hands-free Speech Communications and Microphone Arrays (HSCMA), 1 March 2017 - 3 March 2017.
- Performance comparison of intrusive and non-intrusive instrumental quality measures for enhanced speech. 2016 IEEE International Workshop on Acoustic Signal Enhancement (IWAENC), 13 September 2016 - 16 September 2016.
- Acoustic Scene Classification using Time-Delay Neural Networks and Amplitude Modulation Filter Bank Features. Proceedings of the Detection and Classification of Acoustic Scenes and Events 2016 Workshop (DCASE2016) (pp 70-74). Budapest, Hungary
- Performance comparison of GMM, HMM and DNN based approaches for acoustic event detection within Task 3 of the DCASE 2016 challenge. Proceedings of the Detection and Classification of Acoustic Scenes and Events 2016 Workshop (DCASE2016) (pp 80-84). Budapest, Hungary
- Messung der Höranstrengung älterer Mitarbeiter eines Callcenters mittels neuroergonomischer Messmethoden / Neuroergonomic assessment of listening effort in older call center employees. Proc. Zukunft Lebensräume Kongress 2016 (pp 327-332). Frankfurt, Germany
- Perceptual and instrumental evaluation of the perceived level of reverberation. 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 20 March 2016 - 25 March 2016.
- Classification of human cough signals using spectro-temporal Gabor filterbank features. 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 20 March 2016 - 25 March 2016.
- Concept for automated usability evaluation of graphical user interfaces. Proc. Kognitive Systeme: Mensch, Teams, Systeme und Automaten. Bochum, Germany
- Spectrally and spatially informed noise suppression using beamforming and convolutive NMF. Proc. AES 60th Conference on Dereverberation and Reverberation of Audio, Music, and Speech. Leuven, Belgium
- Predicting the quality of processed speech by combining modulation-based features and model trees. Speech Communication - 12. ITG-Fachtagung Sprachkommunikation (pp 180-184)
- Late reverberant spectral variance estimation using acoustic channel equalization. 2015 23rd European Signal Processing Conference (EUSIPCO), 31 August 2015 - 4 September 2015.
- A CHiME-3 challenge system: Long-term acoustic features for noise robust automatic speech recognition. 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 13 December 2015 - 17 December 2015.
- Joint estimation of reverberation time and direct-to-reverberation ratio from speech using auditory inspired features. Proc. ACE Challenge Workshop, a satellite event of WASPAA. New Paltz, NY, USA
- A study on joint beamforming and spectral enhancement for robust speech recognition in reverberant environments. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 19 April 2015 - 24 April 2015.
- Concept of a Nutrition Consultant Application with Context Based Speech Recognition. 4. Interdisziplinärer Workshop Kognitive Systeme 2015, Mensch, Teams, Systeme und Automaten. Bielefeld, Germany
- CooCo, what can i cook today? Surprise me. CEUR Workshop Proceedings, Vol. 1520 (pp 233-240)
- A study on speech quality and speech intelligibility measures for quality assessment of single-channel dereverberation algorithms. 2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC), 8 September 2014 - 11 September 2014.
- Subjective speech quality and speech intelligibility evaluation of single-channel dereverberation algorithms. 2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC), 8 September 2014 - 11 September 2014.
- Estimating room acoustic parameters for speech recognizer adaptation and combination in reverberant environments. 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4 May 2014 - 9 May 2014.
- Joint Dereverberation and Noise Reduction Using Beamforming and a Single-Channel Speech Enhancement Scheme. Proc. REVERB (REverberant Voice Enhancement and Recognition Benchmark) challenge. Florence, Italy
- Estimating room acoustic parameters for speech recognizer adaptation and combination in reverberant environments. Proc. 39th International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (pp 5559-5563). Florence, Italy
- Robust ASR in reverberant environments using temporal cepstrum smoothing for speech enhancement and an amplitude modulation filterbank for feature extraction. Proc. REVERB (REverberant Voice Enhancement and Recognition Benchmark) challenge. Florence, Italy
- Improving acoustic event detection by localization algorithms. Proc. 40th German Annual Conference on Acoustics (DAGA 14) (pp 523-524). Oldenburg, Germany
- Nutzbarkeit von modellierten Phonemfolgen zur Erkennung von unbekannten Wörtern in phonembasierten Spracherkennern. Proc. 40th German Annual Conference on Acoustics (DAGA 14) (pp 538-539). Oldenburg, Germany
- Influence of a spherical microphone array on a sound source number estimator based upon independent component analysis. Proc. 40th German Annual Conference on Acoustics (DAGA 14). Oldenburg, Germany
- A 2-Stage Approach for Joint Noise Reduction and Dereverberation by means of Multi-Channel Equalization and a Noise Processor. Proc. 40th German Annual Conference on Acoustics (DAGA 14) (pp 186-187). Oldenburg, Germany
- Room Transfer Function Estimation using Cepstral Smoothing. Proc. 40th German Annual Conference on Acoustics (DAGA 14) (pp 493-494). Oldenburg, Germany
- PTP Synchronized Isosynchronous Multi-Channel Audio-Streaming over Gigabit-Ethernet based on FPGAs. Proc. 40th German Annual Conference on Acoustics (DAGA 14) (pp 182-183). Oldenburg, Germany
- Networked embedded acoustic processing system for smart building applications. Conference on Design and Architectures for Signal and Image Processing, DASIP (pp 349-350)
- Acoustic Event Detection Using Signal Enhancement and Spectro-temporal Feature Extraction. IEEE AASP Challenge: Detection and Classification of Acoustic Scenes and Events. New Paltz, NY, USA
- On the use of spectro-temporal features for the IEEE AASP challenge ‘detection and classification of acoustic scenes and events’. 2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 20 October 2013 - 23 October 2013.
- MOBECS - User Requirements for a Mobile Emergency Call System. AAL Forum 2013. Norrköping, Sweden
- Enhancing Wireless Sensor Networks with Acoustic Sensing Technology: Use Cases, Applications & Experiments. 2013 IEEE International Conference on Green Computing and Communications and IEEE Internet of Things and IEEE Cyber, Physical and Social Computing, 20 August 2013 - 23 August 2013.
- Noise Robust Distant Automatic Speech Recognition Utilizing NMF based Source Separation and Auditory Feature Extraction. Proc. 2nd International Workshop on Machine Listening in Multisource Environments (CHiME 2013) (pp 1-6). Vancouver, Canada
- A perceptually constrained channel shortening technique for speech dereverberation. 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 26 May 2013 - 31 May 2013.
- Automatic acoustic siren detection in traffic noise by part-based models. 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 26 May 2013 - 31 May 2013.
- Blind estimation of reverberation time based on spectro-temporal modulation filtering. 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 26 May 2013 - 31 May 2013.
- Anwendungen akustischer Ereigniserkennung im Automobil. Proc. AmE 2013 - Automotive meets Electronics. Dortmund, Germany
- MOBECS - Mobility by Safety: Konzept und Nutzeranforderungen. AAL Kongress 2013 (pp 504-507). Berlin, Germany
- Non-intrusive regularization for least-squares multichannel equalization for speech dereverberation. 2012 IEEE 27th Convention of Electrical and Electronics Engineers in Israel, 14 November 2012 - 17 November 2012.
- The Ambient Adaptable Living Assistant is Meeting its Users. In Proc. AAL Forum 2012 (pp 629-636). Eindhoven, The Netherlands
- Computational Efficient Noise Reduction for Dialogue Systems in Car Environments based on Binary Time-Frequency Masking and Autoregressive Interpolation. Workshop on Dialog systems that think along - Do they really understand me. Saarbrücken, Germany
- Reduction of Non-stationary Noise for a Robotic Living Assistant using Sparse Non-negative Matrix Factorization. Proc. Speech and Multimodal Interaction in Assistive Environments (SMIAE 2012). Jeju Island, Republic of Korea
- Multimodal Human-Machine Interaction for Service Robots in Home-Care Environments. Proc. Speech and Multimodal Interaction in Assistive Environments (SMIAE 2012). Jeju Island, Republic of Korea
- Objective Methods to Asses Speech Signals Processed by Short-Term Spectral Attenuation. Proc. 38th Annual Convention for Acoustics (DAGA). Darmstadt, Germany
- System identification for listening-room compensation by means of acoustic echo cancellation and acoustic echo suppression filters. 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 25 March 2012 - 30 March 2012.
- 2D audio-visual localization in home environments using a particle filter. Proceedings of 10th ITG Symposium on Speech Communication
- Increasing the robustness of acoustic multichannel equalization by means of regularization. International Workshop on Acoustic Signal Enhancement, IWAENC 2012
- A new approach for reduction of supergaussian noise using autoregressive interpolation and time-frequency masking. International Workshop on Acoustic Signal Enhancement, IWAENC 2012
- System identification of equalized room impulse responses by an acoustic echo canceller using proportionate LMS algorithms. 130th Audio Engineering Society Convention 2011, Vol. 2 (pp 1150-1162)
- Room impulse response reshaping by joint optimization of multiple p-norm based criteria. European Signal Processing Conference (pp 1658-1662)
- Speech quality assessment for listening-room compensation. Proceedings of the AES International Conference (pp 11-20)
- Evaluation of joint position-pitch estimation algorithm for localising multiple speakers in adverse acoustical environments. Proc. 37th Annual Convention for Acoustics (DAGA). Düsseldorf, Germany
- Room Impulse Response Reshaping by p-Norm Optimization based on Estimates of Room Impulse Responses. Proc. 37th Annual Convention for Acoustics (DAGA). Düsseldorf, Germany
- Speech / Non-Speech Discrimination for Acoustic Monitoring Considering Privacy Issues. Proc. 37th Annual Convention for Acoustics (DAGA). Düsseldorf, Germany
- Real-time Room Reverberation Estimation for Online Speech Intelligibility Monitoring. Proc. 37th Annual Convention for Acoustics (DAGA). Düsseldorf, Germany
- Speech Activity Detection for Activity Monitoring using an Embedded Platform. Proc. 37th Annual Convention for Acoustics (DAGA). Düsseldorf, Germany
- Hearing-Loss Compensation in a Telephone System. Proc. 37th Annual Convention for Acoustics (DAGA). Düsseldorf, Germany
- Ambiente Sprachsteuerung für einen Pers"’onlichen Aktivitäts- und Haushaltsassistenten. 4. Deutscher AAL-Kongress. Berlin, Germany
- Erkennung und Klassifikation von akustischen Ereignissen zur häuslichen Pflege. 4. Deutscher AAL-Kongress. Berlin, Germany
- Voice activity detection driven acoustic event classification for monitoring in smart homes. 2010 3rd International Symposium on Applied Sciences in Biomedical and Communication Technologies (ISABEL 2010), 7 November 2010 - 10 November 2010.
- Hands-free telecommunication for elderly persons suffering from hearing deficiencies. The 12th IEEE International Conference on e-Health Networking, Applications and Services, 1 July 2010 - 3 July 2010.
- Objective Quality Measures for Dereverberation Methods based on Room Impulse Response Equalization. Proc. German Annual Conference on Acoustics (DAGA). Berlin, Germany
- Quality assessment for listening-room compensation algorithms. 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, 14 March 2010 - 19 March 2010.
- The Lower Saxony Research Network Design of Environments for Ageing (GAL) - Towards Interdisciplinary Research on ICT in Aging Societies. Medizininformatik-Weltkongress Medinfo 2010
- How can audio technology improve working conditions?. Change 2009 –Ambient Assisted Working Accessible and assistive ICT in Enterprise Environments, Emden, Germany
- Estimation of the Optimum System Delay for Speech Dereverberation by Inverse Filtering. International Conference on Acoustics (NAG/DAGA 2009). Rotterdam, The Netherlands
- Direction of Arrival Estimation based on the Dual Delay Line Approach for Binaural Hearing Aid Microphone Arrays. Int. Symposium on Intelligent Signal Processing and Communication Systems (ISPACS) (pp 185-188). Xiamen, China
- Multi-channel listening-room compensation using a decoupled filtered-X LMS algorithm. 2008 42nd Asilomar Conference on Signals, Systems and Computers, 26 October 2008 - 29 October 2008.
- Combined Source Tracking and Noise Reduction for Application in Hearing Aids. 8. ITG-Fachtagung Sprachkommunikation. Aachen, Germany
- A Decoupled Filtered-X LMS Algorithm for Listening-Room Compensation. Proc. Int. Workshop on Acoustic Echo and Noise Control (IWAENC). Seattle, USA
- System Identification for Multi-Channel Listening-Room Compensation Using an Acoustic Echo Canceller. 2008 Hands-Free Speech Communication and Microphone Arrays, 6 May 2008 - 8 May 2008.
- System Identification for Multi-Channel Listening-Room Compensation using an Acoustic Echo Canceller. Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA) (pp 224-227). Trento, Italy
- Room Impulse Response Shaping based on Estimates of Room Impulse Responses. German Annual Conference on Acoustics (DAGA) (pp 829-830). Dresden, Germany
- Objective perceptual quality assessment for self-steering binaural hearing aid microphone arrays. 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, 31 March 2008 - 4 April 2008.
- System identification for multi-channel listening-room compensation using an acoustic echo canceller. 2008 HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS (pp 225-+)
- Optimization of Gabor Features for Text-Independent Speaker Identification. 2007 IEEE International Symposium on Circuits and Systems, 27 May 2007 - 30 May 2007.
- Least Squares Equalizer Design under Consideration of Tail Effects. Proc. German Annual Conference on Acoustics (DAGA) (pp 599-600). Stuttgart, Germany
- Direction of arrival estimation based on the dual delay line approach for binaural hearing aid microphone arrays. 2007 International Symposium on Intelligent Signal Processing and Communication Systems, 28 November 2007 - 1 December 2007.
- Direction of arrival estimation based on the dual delay line approach for binaural hearing aid microphone arrays. 2007 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS, VOLS 1 AND 2 (pp 112-+)
- A psychoacoustic noise reduction approach for stereo hands-free systems. Audio Engineering Society - 120th Convention Spring Preprints 2006, Vol. 4 (pp 1980-1989)
- Multichannel-noise reduction-systems for speaker identification in an automotive environment. Audio Engineering Society - 120th Convention Spring Preprints 2006, Vol. 4 (pp 1941-1952)
- Enhanced Partitioned Stereo Residual Echo Estimation. 2006 Fortieth Asilomar Conference on Signals, Systems and Computers, 29 October 2006 - 1 November 2006.
- View this article in WRRO Transcription-free fine-tuning of speech separation models for noisy and reverberant multi-speaker automatic speech recognition. Proceedings of Interspeech 2024. Kos Island, Greece, 1 September 2024 - 1 September 2024.
- View this article in WRRO Training data augmentation for dysarthric automatic speech recognition by text-to-dysarthric-speech synthesis. Proceedings of Interspeech 2024. Kos island, Greece, 1 September 2024 - 1 September 2024.
- Combining Conformer and Dual-Path-Transformer Networks for Single Channel Noisy Reverberant Speech Separation. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- Active Learning for Sound Event Classification using Bayesian Neural Networks with Gaussian Variational Posterior. Proc. 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP24). Seoul, South Korea, 14 April 2024 - 19 April 2024.
- View this article in WRRO Non-intrusive speech intelligibility prediction for hearing-impaired users using intermediate ASR features and human memory models. 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2024). Seoul, Korea, 14 April 2024 - 14 April 2024.
- View this article in WRRO Multi-CMGAN+/+: leveraging multi-objective speech quality metric prediction for speech enhancement. 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024). Seoul, Korea, 14 April 2024 - 14 April 2024.
- View this article in WRRO Improving audiovisual active speaker detection in egocentric recordings with the data-efficient image transformer. Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2023). Taipei, Taiwan, 16 December 2023 - 16 December 2023.
- View this article in WRRO On time domain conformer models for monaural speech separation in noisy reverberant acoustic environments. Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding. Beitou, Taipei, 16 December 2023 - 16 December 2023.
- The University of Sheffield CHiME-7 UDASE Challenge Speech Enhancement System. 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023)
- On data sampling strategies for training neural network speech separation models. 2023 31st European Signal Processing Conference (EUSIPCO). Helsinki, Finland, 4 September 2023 - 4 September 2023.
- Message Recommendation Strategies for Tailoring Health Information to Promote Physical Activities. Communications in Computer and Information Science (CCIS). Copenhagen, Denmark, 23 July 2023 - 23 July 2023.
- PAMGAN+/-: Improving Phase-Aware Speech Enhancement Performance via Expanded Discriminator Training. AES Convention Europe 2023
- Moving Towards Non-Binary Gender Identification Via Analysis of System Errors in Binary Gender Classification. 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023), 4 June 2023 - 10 June 2023.
- Utterance Weighted Multi-Dilation Temporal Convolutional Networks for Monaural Speech Dereverberation. Utterance Weighted Multi-Dilation Temporal Convolutional Networks for Monaural Speech Dereverberation, 1 September 2022.
- Receptive Field Analysis of Temporal Convolutional Networks for Monaural Speech Dereverberation. IEEE 30th European Signal Processing Conference
- Residual Echo Power Spectral Density Estimation Based on an Optimal Smoothed Misalignment For Acoustic Echo Cancelation. Proc. Int. Workshop on Acoustic Echo and Noise Control (IWAENC-2005) , Eindhoven, The Netherlands (pp 209-212)
- Comparison of Speech Enhancement Systems for Noise Fields in a Car Environment. German 32. Deutsche Jahrestagung für Akustik (DAGA’06) (pp 45-46). Braunschweig, Germany
- Performance of Text-Independent Speaker Identification considering In-Car Acoustics. German 32. Deutsche Jahrestagung für Akustik (DAGA’06) (pp 223-224). Braunschweig, Germany
- Multi-Channel Speech Enhancement using a Psychoacoustic Approach for a Post-Filter. German ITG-Symposium on Speech Communication. Kiel, Germany
- Active Learning for Sound Event Classification using Monte-Carlo Dropout and PANN Embeddings. Proc. DCASE Workshop. Online, 15 November 2021 - 19 November 2021.
Reports
- Clarity Prediction Challenge 1 Entry: Non-intrusive Speech Intelligibility Metric Prediction - Technical Report
Preprints
- Using Speech Foundational Models in Loss Functions for Hearing Aid Speech Enhancement, arXiv.
- Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition, arXiv.
- Training Data Augmentation for Dysarthric Automatic Speech Recognition by Text-to-Dysarthric-Speech Synthesis, arXiv.
- Hallucination in Perceptual Metric-Driven Speech Enhancement Networks, arXiv.
- Non-Intrusive Speech Intelligibility Prediction for Hearing-Impaired Users using Intermediate ASR Features and Human Memory Models, arXiv.
- Multi-CMGAN+/+: Leveraging Multi-Objective Speech Quality Metric Prediction for Speech Enhancement, arXiv.
- On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic Environments, arXiv.
- The Effect of Spoken Language on Speech Enhancement using Self-Supervised Speech Representation Loss Functions, arXiv.
- Non Intrusive Intelligibility Predictor for Hearing Impaired Individuals using Self Supervised Speech Representations, arXiv.
- On Data Sampling Strategies for Training Neural Network Speech Separation Models, arXiv.
- Effectiveness of computer tailored health communication in increasing physical activity in people with or at risk of long-term conditions: systematic review and meta-analysis (Preprint), JMIR Publications Inc..
- Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation, arXiv.
- Utterance Weighted Multi-Dilation Temporal Convolutional Networks for Monaural Speech Dereverberation, arXiv.
- Receptive Field Analysis of Temporal Convolutional Networks for Monaural Speech Dereverberation, arXiv.
- View this article in WRRO MetricGAN+/-: Increasing Robustness of Noise Reduction on Unseen Data.
- Joint Estimation of Reverberation Time and Direct-to-Reverberation Ratio from Speech using Auditory-Inspired Features, arXiv.
- Grants
-
Research Grants
- Participatory co-design of a platform for collecting atypical speech data, Research England, 03/2022 - 07/2022, £19,692, as PI