Bowen Shi

Cited by

	All	Since 2019
Citations	1127	1104
h-index	17	17
i10-index	22	22

480

240

120

360

2016201720182019202020212022202320245 12 6 30 48 109 170 474 269

Public access

View all

4 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Wei-Ning HsuFacebook AI Research (FAIR)Verified email at csail.mit.edu
Karen LivescuTTI-ChicagoVerified email at ttic.edu
Greg ShakhnarovichProfessor, TTI-ChicagoVerified email at ttic.edu
Diane BrentariMary K. Werkman Professor of Linguistics, University of ChicagoVerified email at uchicago.edu
Ming SunMetaVerified email at fb.com
Spyros MatsoukasAmazon.comVerified email at amazon.com
Abdelrahman MohamedResearch scientist, Facebook AI ResearchVerified email at fb.com
Apoorv VyasFAIR Labs MetaVerified email at meta.com
Andros TjandraFacebook AI (research scientist)Verified email at fb.com
Michael AuliMeta, FAIRVerified email at meta.com
Vineel PratapFacebook AI ResearchVerified email at fb.com

Bowen Shi

Facebook AI Research

Verified email at meta.com

speech and audio sign language


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Learning audio-visual speech representation by masked multimodal cluster prediction B Shi, WN Hsu, K Lakhotia, A Mohamed arXiv preprint arXiv:2201.02184, 2022	200	2022
Scaling speech technology to 1,000+ languages V Pratap, A Tjandra, B Shi, P Tomasello, A Babu, S Kundu, A Elkahky, ... Journal of Machine Learning Research 25 (97), 1-52, 2024	106	2024
Robust self-supervised audio-visual speech recognition B Shi, WN Hsu, A Mohamed arXiv preprint arXiv:2201.01763, 2022	89	2022
Offloading guidelines for augmented reality applications on wearable devices B Shi, J Yang, Z Huang, P Hui Proceedings of the 23rd ACM international conference on Multimedia, 1271-1274, 2015	80	2015
American sign language fingerspelling recognition in the wild B Shi, AM Del Rio, J Keane, J Michaux, D Brentari, G Shakhnarovich, ... 2018 IEEE Spoken Language Technology Workshop (SLT), 145-152, 2018	77	2018
Voicebox: Text-guided multilingual universal speech generation at scale M Le, A Vyas, B Shi, B Karrer, L Sari, R Moritz, M Williamson, V Manohar, ... Advances in neural information processing systems 36, 2024	71	2024
Few-shot acoustic event detection via meta learning B Shi, M Sun, KC Puvvada, CC Kao, S Matsoukas, C Wang ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	64	2020
Fingerspelling recognition in the wild with iterative visual attention B Shi, AMD Rio, J Keane, D Brentari, G Shakhnarovich, K Livescu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019	64	2019
Scaling autoregressive multi-modal models: Pretraining and instruction tuning L Yu, B Shi, R Pasunuru, B Muller, O Golovneva, T Wang, A Babu, B Tang, ... arXiv preprint arXiv:2309.02591 2 (3), 2023	57*	2023
Comparative layer-wise analysis of self-supervised speech models A Pasad, B Shi, K Livescu ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	42	2023
A cross-task analysis of text span representations S Toshniwal, H Shi, B Shi, L Gao, K Livescu, K Gimpel arXiv preprint arXiv:2006.03866, 2020	37	2020
Fingerspelling detection in american sign language B Shi, D Brentari, G Shakhnarovich, K Livescu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021	27	2021
u-hubert: Unified mixed-modal speech pretraining and zero-shot transfer to unlabeled modality WN Hsu, B Shi Advances in Neural Information Processing Systems 35, 21157-21170, 2022	21	2022
Open-domain sign language translation learned from online video B Shi, D Brentari, G Shakhnarovich, K Livescu arXiv preprint arXiv:2205.12870, 2022	20	2022
Semi-supervised acoustic event detection based on tri-training B Shi, M Sun, CC Kao, V Rozgic, S Matsoukas, C Wang ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	20	2019
Multitask training with unlabeled data for end-to-end sign language fingerspelling recognition B Shi, K Livescu 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017	20	2017
Compression of acoustic event detection models with low-rank matrix factorization and quantization training B Shi, M Sun, CC Kao, V Rozgic, S Matsoukas, C Wang arXiv preprint arXiv:1905.00855, 2019	17	2019
Expresso: A benchmark and analysis of discrete expressive speech resynthesis TA Nguyen, WN Hsu, A d'Avirro, B Shi, I Gat, M Fazel-Zarani, T Remez, ... arXiv preprint arXiv:2308.05725, 2023	13	2023
Muavic: A multilingual audio-visual corpus for robust speech recognition and robust speech-to-text translation M Anwar, B Shi, V Goswami, WN Hsu, J Pino, C Wang arXiv preprint arXiv:2303.00628, 2023	13	2023
Compression of acoustic event detection models with quantized distillation B Shi, M Sun, CC Kao, V Rozgic, S Matsoukas, C Wang arXiv preprint arXiv:1907.00873, 2019	12	2019

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors