Follow
Hayato Futami
Hayato Futami
Sony Group Corporation
Verified email at sony.com
Title
Cited by
Cited by
Year
Distilling the knowledge of BERT for sequence-to-sequence ASR
H Futami, H Inaguma, S Ueno, M Mimura, S Sakai, T Kawahara
arXiv preprint arXiv:2008.03822, 2020
652020
Asr rescoring and confidence estimation with electra
H Futami, H Inaguma, M Mimura, S Sakai, T Kawahara
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
272021
Non-autoregressive error correction for CTC-based ASR with phone-conditioned masked LM
H Futami, H Inaguma, S Ueno, M Mimura, S Sakai, T Kawahara
arXiv preprint arXiv:2209.04062, 2022
132022
Universlu: Universal spoken language understanding for diverse classification and sequence generation tasks with a single network
S Arora, H Futami, J Jung, Y Peng, R Sharma, Y Kashiwagi, E Tsunoo, ...
arXiv preprint arXiv:2310.02973, 2023
102023
Distilling the Knowledge of BERT for CTC-based ASR
H Futami, H Inaguma, M Mimura, S Sakai, T Kawahara
arXiv preprint arXiv:2209.02030, 2022
102022
Decoder-only architecture for speech recognition with ctc prompts and text data augmentation
E Tsunoo, H Futami, Y Kashiwagi, S Arora, S Watanabe
arXiv preprint arXiv:2309.08876, 2023
92023
Decoder-only architecture for streaming end-to-end speech recognition
E Tsunoo, H Futami, Y Kashiwagi, S Arora, S Watanabe
arXiv preprint arXiv:2406.16107, 2024
82024
Phoneme-aware encoding for prefix-tree-based contextual ASR
H Futami, E Tsunoo, Y Kashiwagi, H Ogawa, S Arora, S Watanabe
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
72024
UniverSLU: Universal spoken language understanding for diverse tasks with natural language instructions
S Arora, H Futami, J Jung, Y Peng, R Sharma, Y Kashiwagi, E Tsunoo, ...
arXiv preprint arXiv:2310.02973, 2023
72023
A study on the integration of pipeline and e2e slu systems for spoken semantic parsing toward stop quality challenge
S Arora, H Futami, SL Wu, J Huynh, Y Peng, Y Kashiwagi, E Tsunoo, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
72023
Streaming joint speech recognition and disfluency detection
H Futami, E Tsunoo, K Shibata, Y Kashiwagi, T Okuda, S Arora, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
72023
Integrating pretrained asr and lm to perform sequence generation for spoken language understanding
S Arora, H Futami, Y Kashiwagi, E Tsunoo, B Yan, S Watanabe
arXiv preprint arXiv:2307.11005, 2023
62023
Joint modelling of spoken language understanding tasks with integrated dialog history
S Arora, H Futami, E Tsunoo, B Yan, S Watanabe
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
62023
Integration of frame-and label-synchronous beam search for streaming encoder-decoder speech recognition
E Tsunoo, H Futami, Y Kashiwagi, S Arora, S Watanabe
arXiv preprint arXiv:2307.12767, 2023
42023
The pipeline system of asr and nlu with mlm-based data augmentation toward stop low-resource challenge
H Futami, J Huynh, S Arora, SL Wu, Y Kashiwagi, Y Peng, B Yan, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
42023
Rapid language adaptation for multilingual e2e speech recognition using encoder prompting
Y Kashiwagi, H Futami, E Tsunoo, S Arora, S Watanabe
arXiv preprint arXiv:2406.12611, 2024
32024
Tensor decomposition for minimization of E2E SLU model toward on-device processing
Y Kashiwagi, S Arora, H Futami, J Huynh, SL Wu, Y Peng, B Yan, ...
arXiv preprint arXiv:2306.01247, 2023
32023
Task Arithmetic for Language Expansion in Speech Translation
YF Cheng, H Futami, Y Kashiwagi, E Tsunoo, WS Teo, S Arora, ...
arXiv preprint arXiv:2409.11274, 2024
12024
E-branchformer-based e2e slu toward stop on-device challenge
Y Kashiwagi, S Arora, H Futami, J Huynh, SL Wu, Y Peng, B Yan, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
12023
Hypothesis Clustering and Merging: Novel MultiTalker Speech Recognition with Speaker Tokens
Y Kashiwagi, H Futami, E Tsunoo, S Arora, S Watanabe
ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and …, 2025
2025
The system can't perform the operation now. Try again later.
Articles 1–20