How bad are artifacts?: Analyzing the impact of speech enhancement errors on ASR K Iwamoto, T Ochiai, M Delcroix, R Ikeshita, H Sato, S Araki, S Katagiri arXiv preprint arXiv:2201.06685, 2022 | 52 | 2022 |
Learning to enhance or not: Neural network-based switching of enhanced and observed signals for overlapping speech recognition H Sato, T Ochiai, M Delcroix, K Kinoshita, N Kamo, T Moriya ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 24 | 2022 |
Self-Distillation for Improving CTC-Transformer-Based ASR Systems. T Moriya, T Ochiai, S Karita, H Sato, T Tanaka, T Ashihara, R Masumura, ... INTERSPEECH, 546-550, 2020 | 23 | 2020 |
Multimodal attention fusion for target speaker extraction H Sato, T Ochiai, K Kinoshita, M Delcroix, T Nakatani, S Araki 2021 IEEE Spoken Language Technology Workshop (SLT), 778-784, 2021 | 22 | 2021 |
Should we always separate?: Switching between enhanced and observed signals for overlapping speech recognition H Sato, T Ochiai, M Delcroix, K Kinoshita, T Moriya, N Kamo arXiv preprint arXiv:2106.00949, 2021 | 20 | 2021 |
Streaming target-speaker ASR with neural transducer T Moriya, H Sato, T Ochiai, M Delcroix, T Shinozaki arXiv preprint arXiv:2209.04175, 2022 | 13 | 2022 |
Distilling attention weights for CTC-based ASR systems T Moriya, H Sato, T Tanaka, T Ashihara, R Masumura, Y Shinohara ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 13 | 2020 |
Listen only to me! How well can target speech extraction handle false alarms? M Delcroix, K Kinoshita, T Ochiai, K Zmolikova, H Sato, T Nakatani arXiv preprint arXiv:2204.04811, 2022 | 11 | 2022 |
A case report of comorbid eating disorder and factitious disorder I Mizuta, T Fukunaga, H Sato, M Ogasawara, M Takeda, Y Inoue Psychiatry and clinical neurosciences 54 (5), 603-606, 2000 | 11 | 2000 |
Neural Whispered Speech Detection with Imbalanced Learning. T Ashihara, Y Shinohara, H Sato, T Moriya, K Matsui, T Fukutomi, ... INTERSPEECH, 3352-3356, 2019 | 10 | 2019 |
Speech emotion recognition based on listener adaptive models A Ando, R Masumura, H Sato, T Moriya, T Ashihara, Y Ijima, T Toda ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 9 | 2021 |
SimpleFlat: A simple whole-network pre-training approach for RNN transducer-based end-to-end speech recognition T Moriya, T Ashihara, T Tanaka, T Ochiai, H Sato, A Ando, Y Ijima, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 8 | 2021 |
Strategies to improve robustness of target speech extraction to enrollment variations H Sato, T Ochiai, M Delcroix, K Kinoshita, T Moriya, N Makishima, M Ihori, ... arXiv preprint arXiv:2206.08174, 2022 | 7 | 2022 |
Streaming End-to-End Speech Recognition for Hybrid RNN-T/Attention Architecture. T Moriya, T Tanaka, T Ashihara, T Ochiai, H Sato, A Ando, R Masumura, ... Interspeech, 1787-1791, 2021 | 7 | 2021 |
Hybrid RNN-T/Attention-based streaming ASR with triggered chunkwise attention and dual internal language model integration T Moriya, T Ashihara, A Ando, H Sato, T Tanaka, K Matsuura, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 6 | 2022 |
World's first bio-degradable actuator for removal-free implantable MEMS H Sato, Y Inoue, M Ikeuchi, K Ikuta 2017 IEEE 30th International Conference on Micro Electro Mechanical Systems …, 2017 | 4 | 2017 |
End-to-End Automatic Speech Recognition with a Reconstruction Criterion Using Speech-to-Text and Text-to-Speech Encoder-Decoders. R Masumura, H Sato, T Tanaka, T Moriya, Y Ijima, T Oba INTERSPEECH, 1606-1610, 2019 | 3 | 2019 |
Improving scheduled sampling for neural transducer-based ASR T Moriya, T Ashihara, H Sato, K Matsuura, T Tanaka, R Masumura ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 2 | 2023 |
Downstream task agnostic speech enhancement with self-supervised representation loss H Sato, R Masumura, T Ochiai, M Delcroix, T Moriya, T Ashihara, ... arXiv preprint arXiv:2305.14723, 2023 | 2 | 2023 |
Transcribing speech as spoken and written dual text using an autoregressive model M Ihori, H Sato, T Tanaka, R Masumura, S Mizuno, N Hojo Proc. INTERSPEECH 2023, 461-465, 2023 | 2 | 2023 |