Dr Yoshi Gotoh
PhD
School of Computer Science
Lecturer
Student Projects Officer
Member of the Speech and Hearing (SpandH) research group


y.gotoh@sheffield.ac.uk
Regent Court (DCS)
Full contact details
Dr Yoshi Gotoh
School of Computer Science
Regent Court (DCS)
211 Portobello
Sheffield
S1 4DP
School of Computer Science
Regent Court (DCS)
211 Portobello
Sheffield
S1 4DP
- Profile
-
Yoshi is a lecturer in the Department of Computer Science. He has a first degree in Engineering form the University of Tokyo and a PhD from Brown University.
- Research interests
-
Yoshi has been working in the field of speech and spoken language processing for years. His current interests include audio visual processing, in particular, video analysis and video information retrieval.
- Publications
-
Journal articles
- Graph-based topic models for trajectory clustering in crowd videos. Machine Vision and Applications, 31. View this article in WRRO
- Generating natural language tags for video information management. Machine Vision and Applications, 28(3-4), 243-265. View this article in WRRO
- View this article in WRRO
- A framework for creating natural language descriptions of video streams. Information Sciences, 303, 61-82. View this article in WRRO
- A unified spatio-temporal human body region tracking approach to action recognition. Neurocomputing, 161, 56-64. View this article in WRRO
- Spoken document retrieval based on confusion network with syllable fragments. International Journal of Advanced Robotic Systems, 9.
- On the subjectivity of human-authored summaries. NAT LANG ENG, 15, 193-213.
- A cascaded broadcast news highlighter. IEEE T AUDIO SPEECH, 16(1), 151-161.
- View this article in WRRO
- View this article in WRRO
- Taggers for parsers. Artificial Intelligence, 85(1-2), 45-57.
- Taggers for parsers. Artificial Intelligence, 84(1-2), 357-357.
- Analysis of LPC/DFT features for an HMM-based alphadigit recognizer. IEEE Signal Processing Letters, 3(4), 103-106.
Conference proceedings papers
- 3D visual speech animation using 2D videos. ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 2367-2371). Brighton, 12 May 2019 - 17 May 2019. View this article in WRRO
- Graph-based correlated topic model for trajectory clustering in crowded videos. IEEE Winter Conference on Applications of Computer Vision (pp 1029-1037), 12 March 2018 - 14 March 2018. View this article in WRRO
- Medical image colorization for better visualization and segmentation. Medical Image Understanding and Analysis, Vol. 723 (pp 571-580) View this article in WRRO
- View this article in WRRO
- Manifold matching with application to instance search based on video queries. ICISP. Cherbourg, 30 June 2014.
- View this article in WRRO
- Speaker role based structural classification of broadcast news stories. Interspeech 2007
- Relative evaluation of informativeness in machine generated summaries. Interspeech 2007
Working papers
- Graph-based topic models for trajectory clustering in crowd videos. Machine Vision and Applications, 31. View this article in WRRO
- Grants
-
Research Grants
-
Visual Understanding for Fake Imagery Detect, Innovate UK, 09/2021 - 03/2024, £218,226, as Co-PI
- Multimedia Analysis for Unsupervised Dubbing In Entertainment (MAUDIE), Innovate UK, 04/2018 - 03/2021, £393,115, as Co-PI
- S3L: Statistical Summarization of Spoken Language, EPSRC, 12/2001 - 09/2005, £284,248, as Co-PI
-
- Professional activities and memberships
-
Member of the Speech and Hearing research group