Distilling the knowledge of BERT for sequence-to-sequence ASR H Futami, H Inaguma, S Ueno, M Mimura, S Sakai, T Kawahara arXiv preprint arXiv:2008.03822, 2020 | 53 | 2020 |
Asr rescoring and confidence estimation with electra H Futami, H Inaguma, M Mimura, S Sakai, T Kawahara 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 19 | 2021 |
Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM H Futami, H Inaguma, S Ueno, M Mimura, S Sakai, T Kawahara arXiv preprint arXiv:2209.04062, 2022 | 8 | 2022 |
Distilling the Knowledge of BERT for CTC-based ASR H Futami, H Inaguma, M Mimura, S Sakai, T Kawahara arXiv preprint arXiv:2209.02030, 2022 | 6 | 2022 |
A study on the integration of pipeline and e2e slu systems for spoken semantic parsing toward stop quality challenge S Arora, H Futami, SL Wu, J Huynh, Y Peng, Y Kashiwagi, E Tsunoo, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 4 | 2023 |
Streaming joint speech recognition and disfluency detection H Futami, E Tsunoo, K Shibata, Y Kashiwagi, T Okuda, S Arora, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 4 | 2023 |
Joint modelling of spoken language understanding tasks with integrated dialog history S Arora, H Futami, E Tsunoo, B Yan, S Watanabe ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 3 | 2023 |
Universlu: Universal spoken language understanding for diverse classification and sequence generation tasks with a single network S Arora, H Futami, J Jung, Y Peng, R Sharma, Y Kashiwagi, E Tsunoo, ... arXiv preprint arXiv:2310.02973, 2023 | 2 | 2023 |
Decoder-only architecture for speech recognition with ctc prompts and text data augmentation E Tsunoo, H Futami, Y Kashiwagi, S Arora, S Watanabe arXiv preprint arXiv:2309.08876, 2023 | 2 | 2023 |
Integrating pretrained ASR and LM to perform sequence generation for spoken language understanding S Arora, H Futami, Y Kashiwagi, E Tsunoo, B Yan, S Watanabe arXiv preprint arXiv:2307.11005, 2023 | 2 | 2023 |
The Pipeline System of ASR and NLU with MLM-based data Augmentation Toward Stop Low-Resource Challenge H Futami, J Huynh, S Arora, SL Wu, Y Kashiwagi, Y Peng, B Yan, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 2 | 2023 |
Tensor decomposition for minimization of E2E SLU model toward on-device processing Y Kashiwagi, S Arora, H Futami, J Huynh, SL Wu, Y Peng, B Yan, ... arXiv preprint arXiv:2306.01247, 2023 | 1 | 2023 |
Phoneme-aware Encoding for Prefix-tree-based Contextual ASR H Futami, E Tsunoo, Y Kashiwagi, H Ogawa, S Arora, S Watanabe ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |
Integration of Frame-and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition E Tsunoo, H Futami, Y Kashiwagi, S Arora, S Watanabe arXiv preprint arXiv:2307.12767, 2023 | | 2023 |
E-Branchformer-Based E2E SLU Toward Stop on-Device Challenge Y Kashiwagi, S Arora, H Futami, J Huynh, SL Wu, Y Peng, B Yan, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | | 2023 |