Professor Roger K. Moore
BA (Hons), MSc, PhD
School of Computer Science
Professor of Spoken Language Processing
Deputy Head of School
Head of the Speech and Hearing (SpandH) research group
+44 114 222 1807
Full contact details
School of Computer Science
Regent Court (DCS)
211 Portobello
Sheffield
S1 4DP
- Profile
-
Prof. Roger K. Moore has over 40 years’ experience in Speech Technology R&D and, although an engineer by training, much of his research has been based on insights from human speech perception and production.
As Head of the UK Government's Speech Research Unit from 1985 to 1999, he was responsible for the development of the Aurix range of speech technology products and the subsequent formation of 20/20 Speech Ltd.
Since 2004 he has been Professor of Spoken Language Processing at the University of Sheffield, and also holds Visiting Chairs at Bristol Robotics Laboratory and University College London Psychology & Language Sciences. He was President of the European/International Speech Communication Association from 1997 to 2001, General Chair for INTERSPEECH-2009 and ISCA Distinguished Lecturer during 2014-15.
In 2017 he organised the first international workshop on ‘Vocal Interactivity in-and-between Humans, Animals and Robots (VIHAR)’. Prof. Moore is the current Editor-in-Chief of Computer Speech & Language and in 2016 he was awarded the LREC Antonio Zampoli Prize for "Outstanding Contributions to the Advancement of Language Resources & Language Technology Evaluation within Human Language Technologies".
- Research interests
-
Prof. Moore is currently working on a unified theory of spoken language processing in the general area of `Cognitive Informatics` called `PRESENCE` (PREdictive SENsorimotor Control and Emulation). PRESENCE weaves together accounts from a wide variety of different disciplines concerned with the behaviour of living systems - many of them outside the normal realms of spoken language - and compiles them into a new framework that is intended to breathe life into a new generation of research into spoken language processing.
Prof. Moore is involved in collaborations aimed at Clinical Applications of Speech Technology (particularly for individuals with speaking difficulties) and he is becoming increasingly involved in Creative Applications of Speech Technology through interactions with colleagues from the performing arts.
- Publications
-
Books
- Biomedical Engineering Systems and Technologies. Springer International Publishing.
- Spoken language system and corpus design.
- Spoken Language Reference Materials. De Gruyter.
- Spoken Language Characterization. De Gruyter.
Journal articles
- Vocal interactivity in-and-between humans, animals and robots. Interaction Studies, 24(1), 1-4.
- Is honesty the best policy for mismatched partners? Aligning multi-modal affordances of a social robot: an opinion paper. Frontiers in Virtual Reality.
- Spoken language interaction with robots: Recommendations for future research. Computer Speech & Language, 71.
- Cross-species parallels in babbling : animals and algorithms. Philosophical Transactions of the Royal Society B: Biological Sciences, 376(1836).
- Acceptability and effectiveness of NHS recommended e-therapies for depression, anxiety and stress: A meta-analysis. Journal of Medical Internet Research. View this article in WRRO
- Usability, acceptability and effectiveness of web-based conversational agents to facilitate problem solving in older adults : controlled study. Journal of Medical Internet Research, 22(5). View this article in WRRO
- E-Therapies in England for Stress, Anxiety or Depression: how are apps developed? A survey of NHS e-therapy developers. BMJ Health & Care Informatics, 26(1). View this article in WRRO
- The effects of robot facial emotional expressions and gender on child-robot interaction in a field study. Connection Science. View this article in WRRO
- Toward a Needs-Based Architecture for `Intelligent' Communicative Agents: Speaking with Intention. Frontiers in Robotics and AI, 4. View this article in WRRO
- Direct Speech Reconstruction From Articulatory Sensor Data by Machine Learning. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 25(12), 2362-2374. View this article in WRRO
- Restoring speech following total removal of the larynx by a learned transformation from sensor data to acoustics. JASA Express Letters, 141(3), EL307-EL307. View this article in WRRO
- E-therapies in England for stress, anxiety or depression: what is being used in the NHS? A survey of mental health services.. BMJ Open, 7(1). View this article in WRRO
- Restoring Speech Following Total Removal of the Larynx. Studies in Health Technology and Informatics, 242, 314-321. View this article in WRRO
- Vocal Interactivity in-and-between Humans, Animals, and Robots. Frontiers in Robotics and AI, 3. View this article in WRRO
- A silent speech system based on permanent magnet articulography and direct synthesis. Computer Speech & Language, 39, 67-87. View this article in WRRO
- Introducing a Pictographic Language for Envisioning a Rich Variety of Enactive Systems with Different Degrees of Complexity. International Journal of Advanced Robotic Systems, 13(2). View this article in WRRO
- Vocal Interactivity in-and-between Humans, Animals and Robots (VIHAR) (Dagstuhl Seminar 16442).. Dagstuhl Reports, 6, 154-194.
- Spoken language processing: Time to look outside?. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 8791, 21-36.
- Discovering the phoneme inventory of an unwritten language: A machine-assisted approach. SPEECH COMMUNICATION, 56, 152-166. View this article in WRRO
- Spoken language processing: Where do we go from here?. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 7407, 119-133.
- Performance of the MVOCA silent speech interface across multiple speakers. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 1140-1143.
- A Bayesian explanation of the 'Uncanny Valley' effect and related psychological phenomena.. Sci Rep, 2, 864. View this article in WRRO
- Small-Vocabulary Speech Recognition Using a Silent Speech Interface Based on Magnetic Sensing. Speech Communication.
- Generating context-sensitive ECA responses to user barge-in interruptions. Journal on Multimodal User Interfaces, 6(1-2), 13-25.
- Generating context-sensitive ECA responses to user barge-in interruptions. Journal on Multimodal User Interfaces, 1-13.
- A prototype for a conversational companion for reminiscing about images. COMPUT SPEECH LANG, 25(2), 140-157.
- Speech synthesis parameter generation for the assistive silent speech interface MVOCA. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 3009-3012.
- Towards the detection of social dominance in dialogue. Speech Communication, 53(9-10), 1104-1114. View this article in WRRO
- Computing phonological generalization over real speech exemplars. J PHONETICS, 38(4), 540-547. View this article in WRRO
- Isolated word recognition of silent speech using magnetic implants and sensors.. Med Eng Phys, 32(10), 1189-1197.
- Discovering an optimal set of minimally contrasting acoustic speech units: A point of focus for whole-word pattern matchinga1. Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010, 310-313.
- Evaluation of a silent speech interface based on magnetic sensing. Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010, 246-249.
- Isolated word recognition of silent speech using magnetic implants and sensors. Medical Engineering and Physics.
- Speech as the Perception of Affordances. ECOL PSYCHOL, 22(4), 327-343.
- An attention-gating recurrent working memory architecture for emergent speech representation. CONNECT SCI, 22(2), 157-175.
- Biomimetic vocal tract modeling: Synthesis of speech articulation.. The Journal of the Acoustical Society of America, 125(4), 2495-2495.
- Towards an investigation of speech energetics using 'AnTon': an animatronic model of a human tongue and vocal tract. CONNECT SCI, 20(4), 319-336.
- PRESENCE: A human-inspired architecture for speech-based human-machine interaction. IEEE T COMPUT, 56(9), 1176-1188. View this article in WRRO
- Spoken language processing: Piecing together the puzzle. SPEECH COMMUN, 49(5), 418-435. View this article in WRRO
- ACORNS - Towards computational modeling of communication and recognition skills. Proceedings of the 6th IEEE International Conference on Cognitive Informatics, ICCI 2007, 349-356.
- Using linguistic cues for the automatic recognition of personality in conversation and text. J ARTIF INTELL RES, 30, 457-500.
- 2006 Workshop on Spoken Language Technology. IEEE Transactions on Audio, Speech, and Language Processing, 14(3), 1094-1094.
- Introduction to the special issue on data mining of speech, audio, and dialog. IEEE T SPEECH AUDI P, 13(5), 633-634.
- Speech communication: Louis pols special issue. Speech Communication, 47(1-2), 3-6.
- Results from a survey of attendees at ASRU 1997 and 2003. 9th European Conference on Speech Communication and Technology, 117-120.
- An investigation into a simulation of episodic memory for automatic speech recognition. 9th European Conference on Speech Communication and Technology, 1245-1248.
- Panel on ubiquitous speech processing. 9th European Conference on Speech Communication and Technology.
- Speech technology for e-inclusion of people with physical disabilities and disordered speech. 9th European Conference on Speech Communication and Technology, 445-448.
- Dictation and Voice Control: Automatic Speech Recognition in the Marketplace. IEE Colloquium (Digest)(499).
- Critique: The potential role of speech production models in automatic speech recognition. The Journal of the Acoustical Society of America, 99(3), 1710-1713.
- Modelling intonation contours at the phrase level using continuous density hidden Markov models. Computer Speech & Language, 8(3), 247-260.
- Editorial. Speech Communication, 9(1), ix-ix.
- Minimally distinct word-pair discrimination using a back-propagation network. Computer Speech & Language, 3(2), 119-131.
- Isolated digit recognition experiments using the multi-layer perceptron. Speech Communication, 7(4), 403-409.
- Speech Recognition Systems and Theories of Speech Perception, 427-441.
- A multilevel approach to pattern processing. Pattern Recognition, 14(1-6), 261-265.
- A Dynamic Programming Algorithm for the Distance Between Two Finite Areas. IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI-1(1), 86-88.
- Evaluating speech recognizers. IEEE Transactions on Acoustics, Speech, and Signal Processing, 25(2), 178-183.
- Freedom comes at a cost?: An exploratory study on affordances’ impact on users’ perception of a social robot. Frontiers in Robotics and AI, 11.
- Using social robots for language learning: are we there yet?. Journal of China Computer-Assisted Language Learning, 0(0).
- Talking with Robots: Opportunities and Challenges.
- View this article in WRRO Vocal Interactivity in Crowds, Flocks and Swarms: Implications for Voice User Interfaces.
- View this article in WRRO A 'Canny' Approach to Spoken Language Interfaces.
Chapters
- PCT and beyond: toward a computational framework for ‘intelligent’ communicative systems In Mansell W (Ed.), The Interdisciplinary Handbook of Perceptual Control Theory (pp. 557-582). Academic Press (Elsevier)
- A Structural Approach to Dealing with High Dimensionality Parameter Search Spaces, Towards Autonomous Robotic Systems (pp. 159-170). Springer International Publishing
- Evaluating ToRCH Structure for Characterizing Robots, Towards Autonomous Robotic Systems (pp. 319-330). Springer International Publishing
- Voice Restoration After Laryngectomy Based on Magnetic Sensing of Articulator Movement and Statistical Articulation-to-Speech Conversion, Biomedical Engineering Systems and Technologies (pp. 295-316). Springer International Publishing View this article in WRRO
- Towards an Intraoral-Based Silent Speech Restoration System for Post-laryngectomy Voice Replacement, Biomedical Engineering Systems and Technologies (pp. 22-38). Springer International Publishing View this article in WRRO
- From talking and listening robots to intelligent communicative machines, ROBOTS THAT TALK AND LISTEN: TECHNOLOGY AND SOCIAL IMPACT (pp. 317-335).
- Spoken Language Processing: Time to Look Outside?, Statistical Language and Speech Processing (pp. 21-36). Springer International Publishing
- Interacting with Purpose (and Feeling!): What Neuropsychology and the Performing Arts Can Tell Us About ’Real’ Spoken Language Behaviour, Proceedings of the Paralinguistic Information and its Integration in Spoken Dialogue Systems Workshop (pp. 5-5). Springer New York
- Cognitive approaches to spoken language technology In Chen F & Jokinen K (Ed.), Speech Technology: Theory and Applications (pp. 89-103). Springer Verlag
- Speech Recognition In Clark A, Fox C & Lappin S (Ed.), The Handbook of Computational Linguistics and Natural Language Processing (pp. 299-332). Wiley-Blackwell
- Spoken Language Processing by Machine In Gaskell G (Ed.), Oxford Handbook of Psycholinguistics (pp. 723-738). New York: Oxford University Press.
- Affective computing and collaborative networks: Towards emotion-aware interaction (pp. 315-322).
- Isolated Digit Recognition Using the Multi-Layer Perceptron, Recent Advances in Speech Understanding and Dialog Systems (pp. 261-265). Springer Berlin Heidelberg
- Part V: Conclusion, Robots that Talk and Listen DE GRUYTER
Conference proceedings papers
- Refining Text Input For Augmentative and Alternative Communication (AAC) Devices: Analysing Language Model Layers For Optimisation. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 14 April 2024 - 19 April 2024.
- Bridging the communication rate gap: enhancing text input for augmentative and alternative communication (AAC). HCI International 2023 – Late Breaking Papers, Vol. 14055. Copenhagen, Denmark, 23 July 2023 - 23 July 2023. View this article in WRRO
- Local Minima Drive Communications in Cooperative Interaction. Proceedings of the AISB Convention 2023 (pp 51-56)
- Incremental Disfluency Detection for Spoken Learner English. BEA 2022 - 17th Workshop on Innovative Use of NLP for Building Educational Applications, Proceedings (pp 272-278)
- Investigating deep neural structures and their interpretability in the domain of voice conversion. Interspeech 2021 (pp 806-810). Brno, Czechia, 30 August 2021 - 3 September 2021.
- Using Sampling Techniques and Machine Learning Algorithms to Improve Big Five Personality Traits Recognition from Non-verbal Cues. Proceedings - 2021 IEEE 4th National Computing Colleges Conference, NCCC 2021
- Spatio-Temporal Context Modelling for Speech Emotion Classification. 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 14 December 2019 - 18 December 2019.
- Learning temporal clusters using capsule routing for speech emotion recognition. Proceedings of Interspeech 2019 (pp 1701-1705). Graz, Austria, 15 September 2019 - 19 September 2019.
- Using Alexa for flashcard-based learning. Proceedings of Interspeech 2019 (pp 1846-1850). Graz, Austria, 15 September 2019 - 19 September 2019.
- On the use/misuse of the term 'phoneme'. Proceedings Interspeech 2019 (pp 2340-2344). Graz, Austria, 15 September 2019 - 19 September 2019. View this article in WRRO
- Mapping Theoretical and Methodological Perspectives for Understanding Speech Interface Interactions. Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems
- Examining Temporal Variations in Recognizing Unspoken Words using EEG Signals. 2018 IEEE International Conference on Systems, Man, and Cybernetics (SMC), 7 October 2018 - 10 October 2018. View this article in WRRO
- View this article in WRRO An End-to-End Deep Neural Network for Facial Emotion Classification. FUSION 2019 - 22nd International Conference on Information Fusion
- View this article in WRRO Dual Stream Spatio-Temporal Motion Fusion with Self-Attention for Action Recognition. FUSION 2019 - 22nd International Conference on Information Fusion
- Discriminating between Imagined Speech and Non-Speech Tasks using EEG. 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) (pp 1952-1955), 18 July 2018 - 21 July 2018. View this article in WRRO
- American Sign Language Posture Understanding with Deep Neural Networks. 2018 21st International Conference on Information Fusion (FUSION) (pp 573-579). UK, 10 July 2018 - 13 July 2018. View this article in WRRO
- Learning Capsules for Vehicle Logo Recognition. 2018 21st International Conference on Information Fusion (FUSION) (pp 565-572). UK, 10 July 2018 - 13 July 2018. View this article in WRRO
- Towards a comprehensive taxonomy for characterizing robots. Conference proceedings TAROS 2018, Vol. 10965 (pp 381-392). Bristol, UK, 25 July 2018 - 27 July 2018.
- A Wearable Silent Speech Interface based on Magnetic Sensors with Motion-Artefact Removal. Proceedings of the 11th International Joint Conference on Biomedical Engineering Systems and Technologies, 19 January 2018 - 21 January 2018.
- Creating a voice for miro, the world's first commercial biomimetic robot. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2017 (pp 3419-3420) View this article in WRRO
- Children's age influences their use of biological and mechanical questions towards a humanoid. Processdings of the 18th Towards Autonomous Robotic Systems (TAROS), Vol. 10454 (pp 290-299), 19 July 2017 - 21 July 2017. View this article in WRRO
- A Biomimetic Vocalisation System for MiRo. Biomimetic and Biohybrid Systems. Living Machines 2017 View this article in WRRO
- You made him be alive: Children’s perceptions of animacy in a humanoid robot. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 10384 (pp 73-85), 25 July 2017 - 28 July 2017. View this article in WRRO
- The Sheffield Search and Rescue corpus. 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 5840-5844), 5 March 2017 - 9 March 2017. View this article in WRRO
- View this article in WRRO PrimEmo: A neural implementation of survival circuits supporting primitive emotions. Proceedings of AISB Annual Convention 2017 (pp 173-180)
- View this article in WRRO A needs-driven cognitive architecture for future 'intelligent' communicative agents. CEUR Workshop Proceedings, Vol. 1855 (pp 50-51)
- Interspeech 2017. Interspeech 2017
- Is spoken language all-or-nothing? Implications for future speech-based human-machine interaction. Lecture Notes in Electrical Engineering, Vol. 427 (pp 281-291) View this article in WRRO
- Brain-computer interface technology for speech recognition: A review. Proceedings of 2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) (pp 1-5). Jeju, South Korea, 13 December 2016 - 16 December 2016.
- Designing Robot Personalities for Human-Robot Symbiotic Interaction in an Educational Context (pp 413-417) View this article in WRRO
- View this article in WRRO Impact of robot responsiveness and adult involvement on children's social behaviours in human-robot interaction. AISB Annual Convention 2016, AISB 2016
- Congratulations, It’s a Boy! Bench-Marking Children’s Perceptions of the Robokind Zeno-R25 (pp 33-39) View this article in WRRO
- The EASEL Project: Towards Educational Human-Robot Symbiotic Interaction (pp 297-306) View this article in WRRO
- Towards a Synthetic Tutor Assistant: The EASEL Project and its Architecture (pp 353-364) View this article in WRRO
- Preliminary Evaluation of a Silent Speech Interface based on Intra-Oral Magnetic Sensing. Proceedings of the 9th International Joint Conference on Biomedical Engineering Systems and Technologies, 21 February 2016 - 23 February 2016.
- Direct Speech Generation for a Silent Speech Interface based on Permanent Magnet Articulography. Proceedings of the 9th International Joint Conference on Biomedical Engineering Systems and Technologies, 21 February 2016 - 23 February 2016.
- View this article in WRRO Speech-based location estimation of first responders in a simulated search and rescue scenario. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 2015-January (pp 2734-2738)
- Integrating User-Centred Design in the Development of a Silent Speech Interface Based on Permanent Magnetic Articulography (pp 324-337)
- Children’s Age Influences Their Perceptions of a Humanoid Robot as Being Like a Person or Machine (pp 348-353) View this article in WRRO
- A User-centric Design of Permanent Magnetic Articulography based Assistive Speech Technology. Proceedings of the International Conference on Bio-inspired Systems and Signal Processing, 12 January 2015 - 15 January 2015.
- View this article in WRRO Presence of life-like robot expressions influences children's enjoyment of human-robot interactions in the field. AISB Convention 2015
- Analysis of phonetic similarity in a silent speech interface based on permanent magnetic articulography. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 1018-1022)
- On the use of the 'pure data' programming language for teaching and public outreach in speech processing. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 1498-1499)
- The Uncanny Valley: A Focus on Misaligned Cues (pp 256-265)
- Optimising Robot Personalities for Symbiotic Interaction (pp 392-395) View this article in WRRO
- A phonetic-contrast motivated adaptation to control the degree-of-articulation on Italian HMM-based synthetic voices. 8th ISCA Workshop on Speech Synthesis. Barcelona, Spain, 31 August 2013 - 2 September 2013.
- Performance of the MVOCA Silent Speech Interface Across Multiple Speakers. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 (pp 1139-1142)
- C2H: A Computational Model of H&H-based Phonetic Contrast in Synthetic Speech. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 (pp 986-989)
- Establishing some principles of human speech production through two-dimensional computational models. SAPA-SCALE Conference 2012 (pp 5-10)
- Cross-language phone recognition when the target language phoneme inventory is not known. Interspeech’11. Florence
- Progress and prospects for speech technology: Results from three sexennial surveys. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 1533-1536)
- Speech Synthesis Parameter Generation for the Assistive Silent Speech Interface MVOCA. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 (pp 3020-+)
- Progress and Prospects for Speech Technology: Results from Three Sexennial Surveys. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 (pp 1544-1547)
- Reactive speech synthesis: actively managing phonetic contrast along an H&H continuum. 17th International Congress of Phonetics Sciences (ICPhS). Hong Kong
- Discovering an Optimal Set of Minimally Contrasting Acoustic Speech Units: A Point of Focus for Whole-Word Pattern Matching. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-4 (pp 310-313)
- Evaluation of a Silent Speech Interface Based on Magnetic Sensing. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-4 (pp 246-249)
- Biomimetic vocal tract modeling: preliminary results of vocalization experiments. 157th Meeting Acoustical Society of America
- Finding allophones: An evaluation on consonants in the TIMIT corpus. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 1651-1654)
- Modelling vocabulary growth from birth to young adulthood. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 1727-1730)
- Discovering keywords from cross-modal input: Ecological vs. engineering methods for enhancing acoustic repetitions. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 1171-1174)
- The case for case-based automatic speech recognition. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 3027-3030)
- Modelling Vocabulary Growth from Birth to Young Adulthood. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 (pp 1695-1698)
- The Case for Case-Based Automatic Speech Recognition. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 (pp 2999-3002)
- Discovering Keywords from Cross-Modal Input: Ecological vs. Engineering Methods for Enhancing Acoustic Repetitions. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 (pp 1151-1154)
- Finding Allophones: an Evaluation on Consonants in the TIMIT Corpus. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 (pp 1631-1634)
- A Computational Model of Language Acquisition: the Emergence of Words. FUNDAMENTA INFORMATICAE, Vol. 90(3) (pp 229-249)
- Evolving Spiking Neural Parameters for Behavioral Sequences. ARTIFICIAL NEURAL NETWORKS - ICANN 2009, PT II, Vol. 5769 (pp 784-793)
- A Computational Model of Preverbal Infant Word Learning. Proceedings of ICCM 2009 - 9th International Conference on Cognitive Modeling (pp 432-433)
- Language identification: Insights from the classification of hand annotated phone transcripts. Odyssey 2008: Speaker and Language Recognition Workshop
- Language identification: Insights from the classification of hand annotated phone transcripts. Odyssey 2008: Speaker and Language Recognition Workshop
- AnTon: An animatronic model of a human tongue and vocal tract. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 2647-2650)
- AnTon: an Animatronic Model of a Human Tongue and Vocal Tract. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5 (pp 2647-2650)
- Animatronic model of a human tongue.. ALIFE (pp 775-775)
- Towards capturing fine phonetic variation in speech using articulatory features. SPEECH COMMUNICATION, Vol. 49(10-11) (pp 811-826)
- Temporal Episodic Memory Model: An Evolution of MINERVA2. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4 (pp 2256-2259)
- Sound localization through evolutionary learning applied to spiking neural networks. 2007 IEEE Symposium on Foundations of Computational Intelligence, Vols 1 and 2 (pp 350-356)
- Towards a unified theory of Spoken Language Processing. ICCI 2005: FOURTH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS - PROCEEDINGS (pp 167-172)
- Modelling data entry rates for asr and alternative input methods. 8th International Conference on Spoken Language Processing, ICSLP 2004 (pp 2285-2288)
- Spoken language output: Realising the vision. EUROSPEECH 2003 - 8th European Conference on Speech Communication and Technology (pp 2909-2912)
- A comparison of the data requirements of automatic speech recognition systems and human listeners. EUROSPEECH 2003 - 8th European Conference on Speech Communication and Technology (pp 2581-2584)
- Message from the ISCA president. EUROSPEECH 2001 - SCANDINAVIA - 7th European Conference on Speech Communication and Technology (pp i)
- Dictation and voice control. IEE Colloquium (Digest), Vol. 499
- Modelling asynchrony in speech using elementary single-signal decomposition. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Vol. 2 (pp 1247-1250)
- Theory of word frequencies and its application to dialogue move recognition. International Conference on Spoken Language Processing, ICSLP, Proceedings, Vol. 3 (pp 1880-1883)
- THE APPLICATION OF DYNAMIC PROGRAMMING TECHNIQUES TO NON-WORD BASED TOPIC SPOTTING. 4th European Conference on Speech Communication and Technology, EUROSPEECH 1995 (pp 1355-1358)
- EAGLES SPOKEN LANGUAGE WORKING GROUP: OVERVIEW AND RESULTS. 4th European Conference on Speech Communication and Technology, EUROSPEECH 1995 (pp 841-844)
- WHITHER A THEORY OF SPEECH PATTERN PROCESSING?. 3rd European Conference on Speech Communication and Technology, EUROSPEECH 1993 (pp 43-47)
- MODELLING OF INTONATION CONTOURS AT THE SENTENCE LEVEL USING CHMMS AND THE 1961 O'CONNOR AND ARNOLD SCHEME. 3rd European Conference on Speech Communication and Technology, EUROSPEECH 1993 (pp 785-788)
- SIMULTANEOUS RECOGNITION OF CONCURRENT SPEECH SIGNALS USING HIDDEN MARKOV MODEL DECOMPOSITION. 2nd European Conference on Speech Communication and Technology, EUROSPEECH 1991 (pp 1175-1178)
- The ARM continuous speech recognition system. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Vol. 1 (pp 69-72)
- Hidden Markov model decomposition of speech and noise. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Vol. 2 (pp 845-848)
- IMPROVED SPEECH RECOGNITION USING A REDUCED AUDITORY REPRESENTATION.. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (pp 75-78)
- NOISE COMPENSATION ALGORITHMS FOR USE WITH HIDDEN MARKOV MODEL BASED SPEECH RECOGNITION.. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (pp 481-484)
- Systems for Isolated and Connected Word Recognition (pp 73-143)
- Explicit modelling of state occupancy in hidden Markov models for automatic speech recognition. ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing, 26 April 1985 - 29 April 1985.
- OVERVIEW OF SPEECH INPUT. (pp 25-38)
- DISCRIMINATIVE NETWORK; A MECHANISM FOR FOCUSING RECOGNITION IN WHOLE-WORD PATTERN MATCHING.. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Vol. 3 (pp 1041-1044)
- SOME TECHNIQUES FOR INCORPORATING LOCAL TIMESCALE VARIABILITY INFORMATION INTO A DYNAMIC TIME-WARPING ALGORITHM FOR AUTOMATIC SPEECH RECOGNITION.. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Vol. 3 (pp 1037-1040)
- TOWARDS AN INTEGRATED DISCRIMINATIVE NETWORK FOR AUTOMATIC SPEECH RECOGNITION.
- AUTOMATIC SPEECH RECOGNITION USING LOCAL TIMESCALE VARIABILITY INFORMATION.
- Progress and Prospects for Spoken Language Technology: Results from Five Sexennial Surveys. INTERSPEECH 2023
- Removing Bias with Residual Mixture of Multi-View Attention for Speech Emotion Recognition. Interspeech 2020
- Evaluation of a Silent Speech Interface Based on Magnetic Sensing and Deep Learning for a Phonetically Rich Vocabulary. Interspeech 2017 View this article in WRRO
- A Real-Time Parametric General-Purpose Mammalian Vocal Synthesiser. Interspeech 2016
- Progress and Prospects for Spoken Language Technology: What Ordinary People Think. Interspeech 2016
- Progress and Prospects for Spoken Language Technology: Results from Four Sexennial Surveys. Interspeech 2016
- C2h: a computational model of H&h-based phonetic contrast in synthetic speech. Interspeech 2012
- Reactive speech synthesis: actively managing phonetic contrast along an H&H continuum
- Understanding speech understanding. Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181)
- A comparison of phoneme decision tree (PDT) and context adaptive phone (CAP) based approaches to vocabulary-independent speech recognition. Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing
- Locally constrained dynamic programming in automatic speech recognition. ICASSP '82. IEEE International Conference on Acoustics, Speech, and Signal Processing
Working papers
- View this article in WRRO Automatic recognition of child speech for robotic applications in noisy environments.
Presentations
- View this article in WRRO Get off on the right foot with whom?: How users’ profiles affect their perception and experience with a social robot. London, UK.
- View this article in WRRO Better curious than smart?: Enhance inclusiveness between mismatched conversational partners: An opinion paper. Hamburg, Germany.
Preprints
- Digital capability, open-source use, and interoperability standards within the NHS in England: A survey of healthcare trusts (Preprint), JMIR Publications Inc..
- Local Minima Drive Communications in Cooperative Interaction.
- Interactivism in Spoken Dialogue Systems.
- Whither the Priors for (Vocal) Interactivity?.
- Investigating Deep Neural Structures and their Interpretability in the Domain of Voice Conversion, arXiv.
- Vocal interactivity in crowds, flocks and swarms: implications for voice user interfaces, PeerJ Preprints.
- On the Use/Misuse of the Term 'Phoneme', arXiv.
- A Biomimetic Vocalisation System for MiRo, arXiv.
- Automatic recognition of child speech for robotic applications in noisy environments, arXiv.
- Impact of robot responsiveness and adult involvement on children's social behaviours in human-robot interaction, arXiv.
- Acceptability and Effectiveness of NHS-Recommended e-Therapies for Depression, Anxiety, and Stress: Meta-Analysis (Preprint).
- Usability, Acceptability, and Effectiveness of Web-Based Conversational Agents to Facilitate Problem Solving in Older Adults: Controlled Study (Preprint).
- Vocal interactivity in crowds, flocks and swarms: implications for voice user interfaces, PeerJ.
- Professional activities and memberships
-
- Chair of Spoken Language Processing in the ‘Speech and Hearing’ research group, Dept. Computer Science, University of Sheffield.
- Editor-in-Chief of ‘Computer Speech & Language’.
- Editorial Board Member for ‘Speech Communication’, ‘Languages’ and the ‘International Journal of Cognitive Informatics and Natural Intelligence’.
- Associate Editor for the ‘Advances in Cognitive Informatics and Natural Intelligence’ (ACINI) Book Series.
- Visiting Professor, Bristol Robotics Laboratory.
- Visiting Professor, Psychology and Language Sciences, University College London.
- 2014-15 Distinguished Lecturer International Speech Communication Association
- Fellow of the International Speech Communication Association since 2008.
- General Chair for INTERSPEECH, Brighton (6th-10th September 2009).
- Chief Scientific Officer of ‘20/20 Speech Ltd.’ (now ‘Aurix Ltd.’) from 1999 to 2004.
- Head of the UK Government’s ‘Speech Research Unit’ (SRU) from 1985 until its privatisation in 1999.
- President of the ‘International Speech Communication Association’ (ISCA) from 1997 to 2001.
- President of the ‘Permanent Council of the International Conferences on Spoken Language Processing’ (PC-ICSLP) from 1996 to 2000.
- Author and co-author of over 150 scientific publications in Speech Technology algorithms, applications and assessment and related areas (h-index = 23).
- Recipient of the 1999 NATO RTO Scientific Achievement Award for “repeated contribution in scientific and technological cooperation”.
- Recipient of the 1994 UK Institute of Acoustics Tyndall Medal for “distinguished work in the field of speech research and technology”.
- Founder Member of the European Speech Communication Association.