Bowen Shi

Citée par

	Toutes	Depuis 2019
Citations	1440	1416
indice h	19	19
indice i10	26	26

600

300

150

450

2016201720182019202020212022202320245 13 6 30 48 109 172 455 596

Accès public

Tout afficher

4 articles

0 article

disponibles

non disponibles

Sur la base des exigences liées au financement

Coauteurs

Wei-Ning HsuFacebook AI Research (FAIR)Adresse e-mail validée de csail.mit.edu
Karen LivescuTTI-ChicagoAdresse e-mail validée de ttic.edu
Greg ShakhnarovichProfessor, TTI-ChicagoAdresse e-mail validée de ttic.edu
Diane BrentariMary K. Werkman Professor of Linguistics, University of ChicagoAdresse e-mail validée de uchicago.edu
Apoorv VyasFAIR Labs MetaAdresse e-mail validée de meta.com
Ming SunMetaAdresse e-mail validée de fb.com
Andros TjandraFacebook AI (research scientist)Adresse e-mail validée de fb.com
Spyros MatsoukasAmazon.comAdresse e-mail validée de amazon.com
Abdelrahman MohamedResearch scientist, Facebook AI ResearchAdresse e-mail validée de fb.com
Michael AuliMeta, FAIRAdresse e-mail validée de meta.com
Vineel PratapFacebook AI ResearchAdresse e-mail validée de fb.com

Suivre

Bowen Shi

Facebook AI Research

Adresse e-mail validée de meta.com

speech and audio sign language


Titre Trier par citations Trier par année Trier par titre	Citée par Citée par	Année
Learning audio-visual speech representation by masked multimodal cluster prediction B Shi, WN Hsu, K Lakhotia, A Mohamed arXiv preprint arXiv:2201.02184, 2022	237	2022
Scaling speech technology to 1,000+ languages V Pratap, A Tjandra, B Shi, P Tomasello, A Babu, S Kundu, A Elkahky, ... Journal of Machine Learning Research 25 (97), 1-52, 2024	167	2024
Voicebox: Text-guided multilingual universal speech generation at scale M Le, A Vyas, B Shi, B Karrer, L Sari, R Moritz, M Williamson, V Manohar, ... Advances in neural information processing systems 36, 2024	116	2024
Robust self-supervised audio-visual speech recognition B Shi, WN Hsu, A Mohamed arXiv preprint arXiv:2201.01763, 2022	99	2022
Scaling autoregressive multi-modal models: Pretraining and instruction tuning L Yu, B Shi, R Pasunuru, B Muller, O Golovneva, T Wang, A Babu, B Tang, ... arXiv preprint arXiv:2309.02591 2 (3), 2023	83	2023
American sign language fingerspelling recognition in the wild B Shi, AM Del Rio, J Keane, J Michaux, D Brentari, G Shakhnarovich, ... 2018 IEEE Spoken Language Technology Workshop (SLT), 145-152, 2018	83	2018
Offloading guidelines for augmented reality applications on wearable devices B Shi, J Yang, Z Huang, P Hui Proceedings of the 23rd ACM international conference on Multimedia, 1271-1274, 2015	83	2015
Comparative layer-wise analysis of self-supervised speech models A Pasad, B Shi, K Livescu ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	73	2023
Fingerspelling recognition in the wild with iterative visual attention B Shi, AMD Rio, J Keane, D Brentari, G Shakhnarovich, K Livescu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019	71	2019
Few-shot acoustic event detection via meta learning B Shi, M Sun, KC Puvvada, CC Kao, S Matsoukas, C Wang ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	67	2020
A cross-task analysis of text span representations S Toshniwal, H Shi, B Shi, L Gao, K Livescu, K Gimpel arXiv preprint arXiv:2006.03866, 2020	41	2020
Audiobox: Unified audio generation with natural language prompts A Vyas, B Shi, M Le, A Tjandra, YC Wu, B Guo, J Zhang, X Zhang, ... arXiv preprint arXiv:2312.15821, 2023	33	2023
Open-domain sign language translation learned from online video B Shi, D Brentari, G Shakhnarovich, K Livescu arXiv preprint arXiv:2205.12870, 2022	32	2022
Fingerspelling detection in american sign language B Shi, D Brentari, G Shakhnarovich, K Livescu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021	28	2021
u-hubert: Unified mixed-modal speech pretraining and zero-shot transfer to unlabeled modality WN Hsu, B Shi Advances in Neural Information Processing Systems 35, 21157-21170, 2022	25	2022
Expresso: A benchmark and analysis of discrete expressive speech resynthesis TA Nguyen, WN Hsu, A d'Avirro, B Shi, I Gat, M Fazel-Zarani, T Remez, ... arXiv preprint arXiv:2308.05725, 2023	21	2023
Muavic: A multilingual audio-visual corpus for robust speech recognition and robust speech-to-text translation M Anwar, B Shi, V Goswami, WN Hsu, J Pino, C Wang arXiv preprint arXiv:2303.00628, 2023	21	2023
Semi-supervised acoustic event detection based on tri-training B Shi, M Sun, CC Kao, V Rozgic, S Matsoukas, C Wang ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	20	2019
Multitask training with unlabeled data for end-to-end sign language fingerspelling recognition B Shi, K Livescu 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017	20	2017
Compression of acoustic event detection models with low-rank matrix factorization and quantization training B Shi, M Sun, CC Kao, V Rozgic, S Matsoukas, C Wang arXiv preprint arXiv:1905.00855, 2019	17	2019

Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.

Articles 1–20

Nombre de citations par an

Citations en double

Citations fusionnées

Ajouter les coauteursCoauteurs

Suivre

Citée par

Coauteurs