Ye Jia

Citée par

	Toutes	Depuis 2019
Citations	5376	5313
indice h	21	21
indice i10	26	26

1600

800

400

1200

201820192020202120222023202445 301 645 1025 1355 1566 409

Accès public

Tout afficher

1 article

0 article

disponibles

non disponibles

Sur la base des exigences liées au financement

Coauteurs

Yonghui WuGoogle BrainAdresse e-mail validée de google.com
Yu ZhangOpenAIAdresse e-mail validée de csail.mit.edu
Ron J WeissGoogleAdresse e-mail validée de google.com
Jonathan ShenGoogleAdresse e-mail validée de google.com
Heiga ZenPrincipal Scientist (Director), Google DeepMindAdresse e-mail validée de google.com
Zhifeng ChenGoogle Inc.Adresse e-mail validée de google.com
Quan WangSenior Staff Software Engineer @ Google; Instructor @ Udemy; Textbook Author; IEEE Senior MemberAdresse e-mail validée de google.com
Ignacio Lopez MorenoGoogle IncAdresse e-mail validée de google.com
Patrick NguyenResearch Scientist, Google, Inc.Adresse e-mail validée de google.com
Melvin JohnsonResearcher, GoogleAdresse e-mail validée de stanford.edu
RJ Skerry-RyanGoogle, Inc.Adresse e-mail validée de alum.mit.edu
Rob ClarkGoogleAdresse e-mail validée de google.com
Yuxuan WangByteDanceAdresse e-mail validée de cse.ohio-state.edu
Bhuvana RamabhadranManager, GoogleAdresse e-mail validée de google.com
Andrew RosenbergGoogleAdresse e-mail validée de google.com
Yuan CaoGoogle DeepMindAdresse e-mail validée de google.com
Ankur BapnaSoftware Engineer, Google DeepmindAdresse e-mail validée de google.com
Chung-Cheng ChiuAppleAdresse e-mail validée de apple.com
Michelle Tadmor (Ramanovich)GoogleAdresse e-mail validée de google.com
Wolfgang MachereyGoogle ResearchAdresse e-mail validée de google.com

Suivre

Ye Jia

Meta

Adresse e-mail validée de google.com - Page d'accueil

Speech synthesis Speech translation


Titre Trier par citations Trier par année Trier par titre	Citée par Citée par	Année
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis Y Jia, Y Zhang, RJ Weiss, Q Wang, J Shen, F Ren, Z Chen, P Nguyen, ... Advances in Neural Information Processing Systems, 2018	883	2018
Style tokens: Unsupervised style modeling, control and transfer in end-to-end speech synthesis Y Wang, D Stanton, Y Zhang, RJS Ryan, E Battenberg, J Shor, Y Xiao, ... International conference on machine learning, 5180-5189, 2018	882	2018
Libritts: A corpus derived from librispeech for text-to-speech H Zen, V Dang, R Clark, Y Zhang, RJ Weiss, Y Jia, Z Chen, Y Wu arXiv preprint arXiv:1904.02882, 2019	701	2019
Voicefilter: Targeted voice separation by speaker-conditioned spectrogram masking Q Wang, H Muckenhirn, K Wilson, P Sridhar, Z Wu, J Hershey, ... arXiv preprint arXiv:1810.04826, 2018	392	2018
ASVspoof 2019: a large-scale public database of synthetized, converted and replayed speech X Wang, J Yamagishi, M Todisco, H Delgado, A Nautsch, N Evans, ... Computer Speech & Language, 101114, 2020	303	2020
Hierarchical generative modeling for controllable speech synthesis WN Hsu, Y Zhang, RJ Weiss, H Zen, Y Wu, Y Wang, Y Cao, Y Jia, Z Chen, ... arXiv preprint arXiv:1810.07217, 2018	266	2018
Improved noisy student training for automatic speech recognition DS Park, Y Zhang, Y Jia, W Han, CC Chiu, B Li, Y Wu, QV Le arXiv preprint arXiv:2005.09629, 2020	235	2020
Direct speech-to-speech translation with a sequence-to-sequence model Y Jia, RJ Weiss, F Biadsy, W Macherey, M Johnson, Z Chen, Y Wu Proc. Interspeech 2019, 1123--1127, 2019	210	2019
Lingvo: a modular and scalable framework for sequence-to-sequence modeling J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ... arXiv preprint arXiv:1902.08295, 2019	199	2019
Learning to speak fluently in a foreign language: Multilingual speech synthesis and cross-language voice cloning Y Zhang, RJ Weiss, H Zen, Y Wu, Z Chen, RJ Skerry-Ryan, Y Jia, ... arXiv preprint arXiv:1907.04448, 2019	178	2019
Leveraging weakly supervised data to improve end-to-end speech-to-text translation Y Jia, M Johnson, W Macherey, RJ Weiss, Y Cao, CC Chiu, N Ari, ... ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	171	2019
Parrotron: An end-to-end speech-to-speech conversion model and its applications to hearing-impaired speech and speech separation F Biadsy, RJ Weiss, PJ Moreno, D Kanevsky, Y Jia arXiv preprint arXiv:1904.04169, 2019	121	2019
Speech recognition with augmented synthesized speech A Rosenberg, Y Zhang, B Ramabhadran, Y Jia, P Moreno, Y Wu, Z Wu 2019 IEEE automatic speech recognition and understanding workshop (ASRU …, 2019	118	2019
Parallel tacotron: Non-autoregressive and controllable tts I Elias, H Zen, J Shen, Y Zhang, Y Jia, RJ Weiss, Y Wu ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	113	2021
mslam: Massively multilingual joint pre-training for speech and text A Bapna, C Cherry, Y Zhang, Y Jia, M Johnson, Y Cheng, S Khanuja, ... arXiv preprint arXiv:2202.01374, 2022	90	2022
Non-attentive tacotron: Robust and controllable neural tts synthesis including unsupervised duration modeling J Shen, Y Jia, M Chrzanowski, Y Zhang, I Elias, H Zen, Y Wu arXiv preprint arXiv:2010.04301, 2020	88	2020
PnG BERT: Augmented BERT on phonemes and graphemes for neural TTS Y Jia, H Zen, J Shen, Y Zhang, Y Wu Proc. Interspeech 2021, 151--155, 2021	77	2021
Translatotron 2: High-quality direct speech-to-speech translation with voice preservation Y Jia, MT Ramanovich, T Remez, R Pomerantz International Conference on Machine Learning, 10120-10134, 2022	71*	2022
SLAM: A unified encoder for speech and language modeling via speech-text joint pre-training A Bapna, Y Chung, N Wu, A Gulati, Y Jia, JH Clark, M Johnson, J Riesa, ... arXiv preprint arXiv:2110.10329, 2021	71	2021
Parallel Tacotron 2: A non-autoregressive neural TTS model with differentiable duration modeling I Elias, H Zen, J Shen, Y Zhang, Y Jia, RJ Skerry-Ryan, Y Wu arXiv preprint arXiv:2103.14574, 2021	58	2021

Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.

Articles 1–20

Nombre de citations par an

Citations en double

Citations fusionnées

Ajouter les coauteursCoauteurs

Suivre

Citée par

Coauteurs