Wenetspeech: A 10000+ hours multi-domain mandarin corpus for speech recognition B Zhang, H Lv, P Guo, Q Shao, C Yang, L Xie, X Xu, H Bu, X Chen, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 120 | 2022 |
Espresso: A fast end-to-end neural speech recognition toolkit Y Wang, T Chen, H Xu, S Ding, H Lv, Y Shao, N Peng, L Xie, S Watanabe, ... 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 81 | 2019 |
Wenet 2.0: More productive end-to-end speech recognition toolkit B Zhang, D Wu, Z Peng, X Song, Z Yao, H Lv, L Xie, C Yang, F Pan, J Niu arXiv preprint arXiv:2203.15455, 2022 | 59 | 2022 |
Wake word detection with streaming transformers Y Wang, H Lv, D Povey, L Xie, S Khudanpur ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 35 | 2021 |
The NNI Query-by-Example System for MediaEval 2014. P Yang, H Xu, X Xiao, L Xie, CC Leung, H Chen, J Yu, H Lv, L Wang, ... MediaEval, 2014 | 24 | 2014 |
Wake word detection with alignment-free lattice-free MMI Y Wang, H Lv, D Povey, L Xie, S Khudanpur arXiv preprint arXiv:2005.08347, 2020 | 22 | 2020 |
The NNI Query-by-Example System for MediaEval 2015. J Hou, CCL Van Tung Pham, CC Leung, L Wang, H Xu, H Lv, L Xie, Z Fu, ... MediaEval, 2015 | 22 | 2015 |
Language independent query-by-example spoken term detection using n-best phone sequences and partial matching H Xu, P Yang, X Xiao, L Xie, CC Leung, H Chen, J Yu, H Lv, L Wang, ... 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015 | 20 | 2015 |
Toward High-Performance Language-Independent Query-by-Example Spoken Term Detection for MediaEval 2015: Post-Evaluation Analysis. CC Leung, L Wang, H Xu, J Hou, Van Tung Pham, H Lv, L Xie, X Xiao, ... INTERSPEECH, 3703-3707, 2016 | 19 | 2016 |
Approximate search of audio queries by using DTW with phone time boundary and data augmentation H Xu, J Hou, X Xiao, CC Leung, L Wang, H Lv, L Xie, B Ma, ES Chng, H Li 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 16 | 2016 |
Acoustic Modeling from Frequency Domain Representations of Speech. P Ghahremani, H Hadian, H Lv, D Povey, S Khudanpur Interspeech, 1596-1600, 2018 | 13 | 2018 |
W-Infer-polation: Approximate reasoning via integrating weighted fuzzy rule inference and interpolation H Lv, F Li, C Shang, Q Shen Knowledge-Based Systems 258, 109995, 2022 | 6 | 2022 |
Context-aware RNNLM rescoring for conversational speech recognition K Wei, P Guo, H Lv, Z Tu, L Xie 2021 12th International Symposium on Chinese Spoken Language Processing …, 2021 | 5 | 2021 |
LET-Decoder: A WFST-Based Lazy-Evaluation Token-Group Decoder With Exact Lattice Generation H Lv, D Povey, M Yarmohammadi, K Li, Y Wang, L Xie, S Khudanpur IEEE Signal Processing Letters 28, 703-707, 2021 | 2 | 2021 |
Minimizing sequential confusion error in speech command recognition Z Yang, H Lv, X Wang, A Zhang, L Xie arXiv preprint arXiv:2207.01261, 2022 | 1 | 2022 |
An asynchronous WFST-based decoder for automatic speech recognition H Lv, Z Chen, H Xu, D Povey, L Xie, S Khudanpur ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 1 | 2021 |
Incremental Lattice Determinization for WFST Decoders Z Chen, M Yarmohammadi, H Xu, H Lv, L Xie, D Povey, S Khudanpur 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2019 | 1 | 2019 |
Experimental study on dereverberation and noise reduction for distant speech recognition ZH Fu, L Xie, H Lv The 9th International Symposium on Chinese Spoken Language Processing, 393-397, 2014 | | 2014 |