Follow
Shinji Watanabe
Title
Cited by
Cited by
Year
Improving transformer-based end-to-end speech recognition with connectionist temporal classification and language model integration
S Karita, NEY Soplin, S Watanabe, M Delcroix, A Ogawa, T Nakatani
Proc. Interspeech 2019, 2019
2882019
The second ‘CHiME’speech separation and recognition challenge: Datasets, tasks and baselines
E Vincent, J Barker, S Watanabe, J Le Roux, F Nesta, M Matassoni
2013 IEEE international conference on acoustics, speech and signal …, 2013
2772013
Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge.
G Sell, D Snyder, A McCree, D Garcia-Romero, J Villalba, M Maciejewski, ...
Interspeech, 2808-2812, 2018
2672018
Gigaspeech: An evolving, multi-domain asr corpus with 10,000 hours of transcribed audio
G Chen, S Chai, G Wang, J Du, WQ Zhang, C Weng, D Su, D Povey, ...
arXiv preprint arXiv:2106.06909, 2021
2602021
ESPnet-TTS: Unified, reproducible, and integratable open source end-to-end text-to-speech toolkit
T Hayashi, R Yamamoto, K Inoue, T Yoshimura, S Watanabe, T Toda, ...
ICASSP 2020-2020 IEEE international conference on acoustics, speech and …, 2020
2552020
Deep beamforming networks for multi-channel speech recognition
X Xiao, S Watanabe, H Erdogan, L Lu, J Hershey, ML Seltzer, G Chen, ...
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
2352016
Torchaudio: Building blocks for audio and speech processing
YY Yang, M Hira, Z Ni, A Astafurov, C Chen, C Puhrsch, D Pollack, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
2202022
End-to-end speaker diarization for an unknown number of speakers with encoder-decoder based attractors
S Horiguchi, Y Fujita, S Watanabe, Y Xue, K Nagamatsu
arXiv preprint arXiv:2005.09921, 2020
2202020
Topic tracking model for analyzing consumer purchase behavior.
T Iwata, S Watanabe, T Yamada, N Ueda
IJCAI 9, 1427-1432, 2009
2162009
Audiogpt: Understanding and generating speech, music, sound, and talking head
R Huang, M Li, D Yang, J Shi, X Chang, Z Ye, Y Wu, Z Hong, J Huang, ...
Proceedings of the AAAI Conference on Artificial Intelligence 38 (21), 23802 …, 2024
2092024
End-to-end speech recognition: A survey
R Prabhavalkar, T Hori, TN Sainath, R Schlüter, S Watanabe
IEEE/ACM Transactions on Audio, Speech, and Language Processing 32, 325-351, 2023
2082023
Conditional diffusion probabilistic model for speech enhancement
YJ Lu, ZQ Wang, S Watanabe, A Richard, C Yu, Y Tsao
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
2082022
Speech processing for digital home assistants: Combining signal processing with deep-learning techniques
R Haeb-Umbach, S Watanabe, T Nakatani, M Bacchiani, B Hoffmeister, ...
IEEE Signal processing magazine 36 (6), 111-124, 2019
2032019
Statistical voice dialog system and method
S Watanabe, JR Hershey
US Patent 9,837,075, 2017
1972017
Speech enhancement and recognition using multi-task learning of long short-term memory recurrent neural networks.
Z Chen, S Watanabe, H Erdogan, JR Hershey
Interspeech, 3274-3278, 2015
1932015
Branchformer: Parallel mlp-attention architectures to capture local and global context for speech recognition and understanding
Y Peng, S Dalmia, I Lane, S Watanabe
International Conference on Machine Learning, 17627-17643, 2022
1922022
Language independent end-to-end architecture for joint language identification and speech recognition
S Watanabe, T Hori, JR Hershey
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017
1922017
ESPnet-ST: All-in-one speech translation toolkit
H Inaguma, S Kiyono, K Duh, S Karita, NEY Soplin, T Hayashi, ...
arXiv preprint arXiv:2004.10234, 2020
1842020
Intermediate loss regularization for ctc-based speech recognition
J Lee, S Watanabe
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
1622021
Recurrent deep neural networks for robust speech recognition
C Weng, D Yu, S Watanabe, BHF Juang
2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014
1622014
The system can't perform the operation now. Try again later.
Articles 21–40