Shinji Watanabe

Cited by

	All	Since 2020
Citations	34462	28026
h-index	87	77
i10-index	387	341

8000

4000

2000

6000

201120122013201420152016201720182019202020212022202320242025104 142 240 241 383 476 920 1475 2119 2952 4332 4956 6462 7588 1704

Public access

View all

96 articles

7 articles

available

not available

Based on funding mandates

Co-authors

Takaaki HoriAppleVerified email at apple.com
John HersheyGoogle (formerly MERL, IBM, MSR, UCSD)Verified email at google.com
Jonathan Le RouxMERLVerified email at merl.com
Xuankai ChangApple AI/MLVerified email at apple.com
Jiatong Shi (史嘉彤)Carnegie Mellon UniversityVerified email at andrew.cmu.edu
Tomoki HayashiHuman Dataware Lab. Co., Ltd., Nagoya UniversityVerified email at g.sp.m.is.nagoya-u.ac.jp
Atsushi NakamuraGraduate School of Natural Sciences, Nagoya City UniversityVerified email at ieee.org
Wangyou ZhangAssistant Professor, School of Artificial Intelligence, Shanghai Jiao Tong UniversityVerified email at sjtu.edu.cn
Yifan PengCarnegie Mellon UniversityVerified email at andrew.cmu.edu
Brian YanCarnegie Mellon UniversityVerified email at cs.cmu.edu
Hakan ErdoganGoogleVerified email at google.com
Yusuke FujitaLY Corp.Verified email at linecorp.com
Sanjeev KhudanpurThe Johns Hopkins UniversityVerified email at jhu.edu
Shota HoriguchiNTT CorporationVerified email at ntt.com
Hirofumi InagumaFundamental AI Research (FAIR) at MetaVerified email at meta.com
Hung-yi LeeNational Taiwan UniversityVerified email at ntu.edu.tw
Tomohiro NakataniNTT Communication Science LaboratoriesVerified email at ieee.org
Marc DelcroixNTT Communication Science LaboratoriesVerified email at ieee.org
Siddhant AroraGraduate Student, Carnegie Mellon UniversityVerified email at andrew.cmu.edu
Zhuo ChenBytedance (formerly Microsoft, Columbia University)Verified email at columbia.edu

Shinji Watanabe

Carnegie Mellon University

Verified email at cmu.edu - Homepage

Speech recognition Speech processing Speech enhancement Speech translation


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Improving transformer-based end-to-end speech recognition with connectionist temporal classification and language model integration S Karita, NEY Soplin, S Watanabe, M Delcroix, A Ogawa, T Nakatani Proc. Interspeech 2019, 2019	288	2019
The second ‘CHiME’speech separation and recognition challenge: Datasets, tasks and baselines E Vincent, J Barker, S Watanabe, J Le Roux, F Nesta, M Matassoni 2013 IEEE international conference on acoustics, speech and signal …, 2013	277	2013
Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge. G Sell, D Snyder, A McCree, D Garcia-Romero, J Villalba, M Maciejewski, ... Interspeech, 2808-2812, 2018	267	2018
Gigaspeech: An evolving, multi-domain asr corpus with 10,000 hours of transcribed audio G Chen, S Chai, G Wang, J Du, WQ Zhang, C Weng, D Su, D Povey, ... arXiv preprint arXiv:2106.06909, 2021	260	2021
ESPnet-TTS: Unified, reproducible, and integratable open source end-to-end text-to-speech toolkit T Hayashi, R Yamamoto, K Inoue, T Yoshimura, S Watanabe, T Toda, ... ICASSP 2020-2020 IEEE international conference on acoustics, speech and …, 2020	255	2020
Deep beamforming networks for multi-channel speech recognition X Xiao, S Watanabe, H Erdogan, L Lu, J Hershey, ML Seltzer, G Chen, ... 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016	235	2016
Torchaudio: Building blocks for audio and speech processing YY Yang, M Hira, Z Ni, A Astafurov, C Chen, C Puhrsch, D Pollack, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	220	2022
End-to-end speaker diarization for an unknown number of speakers with encoder-decoder based attractors S Horiguchi, Y Fujita, S Watanabe, Y Xue, K Nagamatsu arXiv preprint arXiv:2005.09921, 2020	220	2020
Topic tracking model for analyzing consumer purchase behavior. T Iwata, S Watanabe, T Yamada, N Ueda IJCAI 9, 1427-1432, 2009	216	2009
Audiogpt: Understanding and generating speech, music, sound, and talking head R Huang, M Li, D Yang, J Shi, X Chang, Z Ye, Y Wu, Z Hong, J Huang, ... Proceedings of the AAAI Conference on Artificial Intelligence 38 (21), 23802 …, 2024	209	2024
End-to-end speech recognition: A survey R Prabhavalkar, T Hori, TN Sainath, R Schlüter, S Watanabe IEEE/ACM Transactions on Audio, Speech, and Language Processing 32, 325-351, 2023	208	2023
Conditional diffusion probabilistic model for speech enhancement YJ Lu, ZQ Wang, S Watanabe, A Richard, C Yu, Y Tsao ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	208	2022
Speech processing for digital home assistants: Combining signal processing with deep-learning techniques R Haeb-Umbach, S Watanabe, T Nakatani, M Bacchiani, B Hoffmeister, ... IEEE Signal processing magazine 36 (6), 111-124, 2019	203	2019
Statistical voice dialog system and method S Watanabe, JR Hershey US Patent 9,837,075, 2017	197	2017
Speech enhancement and recognition using multi-task learning of long short-term memory recurrent neural networks. Z Chen, S Watanabe, H Erdogan, JR Hershey Interspeech, 3274-3278, 2015	193	2015
Branchformer: Parallel mlp-attention architectures to capture local and global context for speech recognition and understanding Y Peng, S Dalmia, I Lane, S Watanabe International Conference on Machine Learning, 17627-17643, 2022	192	2022
Language independent end-to-end architecture for joint language identification and speech recognition S Watanabe, T Hori, JR Hershey 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017	192	2017
ESPnet-ST: All-in-one speech translation toolkit H Inaguma, S Kiyono, K Duh, S Karita, NEY Soplin, T Hayashi, ... arXiv preprint arXiv:2004.10234, 2020	184	2020
Intermediate loss regularization for ctc-based speech recognition J Lee, S Watanabe ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	162	2021
Recurrent deep neural networks for robust speech recognition C Weng, D Yu, S Watanabe, BHF Juang 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014	162	2014

The system can't perform the operation now. Try again later.

Articles 21–40

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors

zproxy.org