Can you tell me how to get past sesame street? sentence-level pretraining beyond language modeling A Wang, J Hula, P Xia, R Pappagari, RT McCoy, R Patel, N Kim, I Tenney, ... Proceedings of ACL, 4465-4476, 2019 | 143* | 2019 |
Leveraging Unpaired Text Data for Training End-To-End Speech-to-Intent Systems Y Huang, HK Kuo, S Thomas, Z Kons, K Audhkhasi, B Kingsbury, R Hoory, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 65 | 2020 |
jiant 1.2: A software toolkit for research on general-purpose text understanding models A Wang, IF Tenney, Y Pruksachatkun, K Yu, J Hula, P Xia, R Pappagari, ... Note: http://jiant. info/Cited by: footnote 4, 2019 | 51 | 2019 |
Implementing a whole sentence recurrent neural network language model for natural language processing Y Huang, A Sethy, K Audhkhasi, B Ramabhadran US Patent 10,431,210, 2019 | 31 | 2019 |
End-to-End Spoken Language Understanding Without Full Transcripts HKJ Kuo, Z Tüske, S Thomas, Y Huang, K Audhkhasi, B Kingsbury, ... INTERSPEECH, 906-910, 2020 | 28 | 2020 |
Modular Hybrid Autoregressive Transducer Z Meng, T Chen, R Prabhavalkar, Y Zhang, G Wang, K Audhkhasi, ... 2022 IEEE Spoken Language Technology Workshop (SLT), 197-204, 2023 | 19 | 2023 |
English Broadcast News Speech Recognition by Humans and Machines S Thomas, M Suzuki, Y Huang, G Kurata, Z Tuske, G Saon, B Kingsbury, ... ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 19 | 2019 |
Whole sentence neural language models Y Huang, A Sethy, K Audhkhasi, B Ramabhadran 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 15 | 2018 |
Semi-Supervision in ASR: Sequential MixMatch and Factorized TTS-Based Augmentation. Z Chen, A Rosenberg, Y Zhang, H Zen, M Ghodsi, Y Huang, J Emond, ... Interspeech, 736-740, 2021 | 13 | 2021 |
Fast Neural Network Language Model Lookups at N-Gram Speeds. Y Huang, A Sethy, B Ramabhadran INTERSPEECH, 274-278, 2017 | 13 | 2017 |
Detection and Recovery of OOVs for Improved English Broadcast News Captioning. S Thomas, K Audhkhasi, Z Tüske, Y Huang, M Picheny INTERSPEECH, 2973-2977, 2019 | 11 | 2019 |
Large-Scale Language Model Rescoring on Long-Form Data T Chen, C Allauzen, Y Huang, D Park, D Rybach, WR Huang, R Cabrera, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 8 | 2023 |
Leveraging unpaired text data for training end-to-end spoken language understanding systems HKJ Kuo, Y Huang, S Thomas, K Audhkhasi, MA Picheny US Patent 11,587,551, 2023 | 8 | 2023 |
Convolutional Dropout and Wordpiece Augmentation for End-to-End Speech Recognition H Xu, Y Huang, Y Zhu, K Audhkhasi, B Ramabhadran ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 5 | 2021 |
Semi-Supervised Training and Data Augmentation for Adaptation of Automatic Broadcast News Captioning Systems Y Huang, S Thomas, M Suzuki, Z Tüske, L Sansone, M Picheny 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 5 | 2019 |
Analysis of Self-Attention Head Diversity for Conformer-based Automatic Speech Recognition K Audhkhasi, Y Huang, B Ramabhadran, PJ Moreno arXiv preprint arXiv:2209.06096, 2022 | 3 | 2022 |
Detecting and recovering out-of-vocabulary words in voice-to-text transcription systems S Thomas, K Audhkhasi, Z Tueske, Y Huang, MA Picheny US Patent 11,183,194, 2021 | 3 | 2021 |
Non-Parallel Voice Conversion for ASR Augmentation G Wang, A Rosenberg, B Ramabhadran, F Biadsy, Y Huang, J Emond, ... arXiv preprint arXiv:2209.06987, 2022 | 2 | 2022 |
End-to-end spoken language understanding without full transcripts HKJ Kuo, Z Tueske, S Thomas, Y Huang, BED Kingsbury, K Audhkhasi US Patent 11,929,062, 2024 | 1 | 2024 |
Regularizing Word Segmentation by Creating Misspellings H Xu, K Audhkhasi, Y Huang, J Emond, B Ramabhadran Proc. Interspeech 2021, 2561-2565, 2021 | 1 | 2021 |