Suivre
Ye Jia
Ye Jia
Meta
Adresse e-mail validée de google.com - Page d'accueil
Titre
Citée par
Citée par
Année
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
Y Jia, Y Zhang, RJ Weiss, Q Wang, J Shen, F Ren, Z Chen, P Nguyen, ...
Advances in Neural Information Processing Systems, 2018
9682018
Style tokens: Unsupervised style modeling, control and transfer in end-to-end speech synthesis
Y Wang, D Stanton, Y Zhang, RJS Ryan, E Battenberg, J Shor, Y Xiao, ...
International conference on machine learning, 5180-5189, 2018
9622018
Libritts: A corpus derived from librispeech for text-to-speech
H Zen, V Dang, R Clark, Y Zhang, RJ Weiss, Y Jia, Z Chen, Y Wu
arXiv preprint arXiv:1904.02882, 2019
8862019
Voicefilter: Targeted voice separation by speaker-conditioned spectrogram masking
Q Wang, H Muckenhirn, K Wilson, P Sridhar, Z Wu, J Hershey, ...
arXiv preprint arXiv:1810.04826, 2018
4312018
ASVspoof 2019: a large-scale public database of synthetized, converted and replayed speech
X Wang, J Yamagishi, M Todisco, H Delgado, A Nautsch, N Evans, ...
Computer Speech & Language, 101114, 2020
3822020
Hierarchical generative modeling for controllable speech synthesis
WN Hsu, Y Zhang, RJ Weiss, H Zen, Y Wu, Y Wang, Y Cao, Y Jia, Z Chen, ...
arXiv preprint arXiv:1810.07217, 2018
2762018
Improved noisy student training for automatic speech recognition
DS Park, Y Zhang, Y Jia, W Han, CC Chiu, B Li, Y Wu, QV Le
arXiv preprint arXiv:2005.09629, 2020
2652020
Direct speech-to-speech translation with a sequence-to-sequence model
Y Jia, RJ Weiss, F Biadsy, W Macherey, M Johnson, Z Chen, Y Wu
Proc. Interspeech 2019, 1123--1127, 2019
2372019
The llama 3 herd of models
A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, A Letman, A Mathur, ...
arXiv preprint arXiv:2407.21783, 2024
2212024
Lingvo: a modular and scalable framework for sequence-to-sequence modeling
J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ...
arXiv preprint arXiv:1902.08295, 2019
2092019
Learning to speak fluently in a foreign language: Multilingual speech synthesis and cross-language voice cloning
Y Zhang, RJ Weiss, H Zen, Y Wu, Z Chen, RJ Skerry-Ryan, Y Jia, ...
arXiv preprint arXiv:1907.04448, 2019
1912019
Leveraging weakly supervised data to improve end-to-end speech-to-text translation
Y Jia, M Johnson, W Macherey, RJ Weiss, Y Cao, CC Chiu, N Ari, ...
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
1752019
Speech recognition with augmented synthesized speech
A Rosenberg, Y Zhang, B Ramabhadran, Y Jia, P Moreno, Y Wu, Z Wu
2019 IEEE automatic speech recognition and understanding workshop (ASRU …, 2019
1392019
Parrotron: An end-to-end speech-to-speech conversion model and its applications to hearing-impaired speech and speech separation
F Biadsy, RJ Weiss, PJ Moreno, D Kanevsky, Y Jia
arXiv preprint arXiv:1904.04169, 2019
1342019
Parallel tacotron: Non-autoregressive and controllable tts
I Elias, H Zen, J Shen, Y Zhang, Y Jia, RJ Weiss, Y Wu
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
1262021
mslam: Massively multilingual joint pre-training for speech and text
A Bapna, C Cherry, Y Zhang, Y Jia, M Johnson, Y Cheng, S Khanuja, ...
arXiv preprint arXiv:2202.01374, 2022
1102022
Non-attentive tacotron: Robust and controllable neural tts synthesis including unsupervised duration modeling
J Shen, Y Jia, M Chrzanowski, Y Zhang, I Elias, H Zen, Y Wu
arXiv preprint arXiv:2010.04301, 2020
992020
Translatotron 2: High-quality direct speech-to-speech translation with voice preservation
Y Jia, MT Ramanovich, T Remez, R Pomerantz
International Conference on Machine Learning, 10120-10134, 2022
88*2022
PnG BERT: Augmented BERT on phonemes and graphemes for neural TTS
Y Jia, H Zen, J Shen, Y Zhang, Y Wu
Proc. Interspeech 2021, 151--155, 2021
852021
SLAM: A unified encoder for speech and language modeling via speech-text joint pre-training
A Bapna, Y Chung, N Wu, A Gulati, Y Jia, JH Clark, M Johnson, J Riesa, ...
arXiv preprint arXiv:2110.10329, 2021
842021
Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.
Articles 1–20