Yu Zhang

Citée par

	Toutes	Depuis 2019
Citations	24494	22735
indice h	58	56
indice i10	115	105

7000

3500

1750

5250

2015201620172018201920202021202220232024101 264 419 865 1432 2330 4021 5133 6436 3328

Accès public

Tout afficher

7 articles

0 article

disponibles

non disponibles

Sur la base des exigences liées au financement

Coauteurs

Yonghui WuGoogle BrainAdresse e-mail validée de google.com
Chung-Cheng ChiuAppleAdresse e-mail validée de apple.com
Wei HanOpenAIAdresse e-mail validée de illinois.edu
Ye JiaMetaAdresse e-mail validée de google.com
Ron J WeissGoogleAdresse e-mail validée de google.com
William ChanIdeogramAdresse e-mail validée de ideogram.ai
Heiga ZenPrincipal Scientist (Director), Google DeepMindAdresse e-mail validée de google.com
Ruoming Pang (庞若鸣)Apple AI/MLAdresse e-mail validée de apple.com
James GlassMIT Computer Science and Artificial Intelligence LaboratoryAdresse e-mail validée de mit.edu
James QinGoogleAdresse e-mail validée de google.com
Bo LiGoogleAdresse e-mail validée de google.com
Jonathan ShenGoogleAdresse e-mail validée de google.com
Dong Yu (俞栋)Distinguished Scientist @ Tencent AI Lab, ACM/IEEE/ISCA FellowAdresse e-mail validée de global.tencent.com
Quoc V. LeResearch Scientist, GoogleAdresse e-mail validée de stanford.edu
Daniel S. ParkGoogle BrainAdresse e-mail validée de google.com
Tara SainathPrincipal Research Scientist, GoogleAdresse e-mail validée de google.com
Zhifeng ChenGoogle Inc.Adresse e-mail validée de google.com
Wei-Ning HsuFacebook AI Research (FAIR)Adresse e-mail validée de csail.mit.edu
Anmol GulatiResearcher, Google DeepmindAdresse e-mail validée de google.com
Yuxuan WangByteDanceAdresse e-mail validée de cse.ohio-state.edu

Suivre

Yu Zhang

OpenAI

Adresse e-mail validée de csail.mit.edu - Page d'accueil

Speech Recognition Speech Synthesis


Titre Trier par citations Trier par année Trier par titre	Citée par Citée par	Année
Specaugment: A simple data augmentation method for automatic speech recognition DS Park, W Chan, Y Zhang, CC Chiu, B Zoph, ED Cubuk, QV Le arXiv preprint arXiv:1904.08779, 2019	3901	2019
Natural tts synthesis by conditioning wavenet on mel spectrogram predictions J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, Y Zhang, ... 2018 IEEE international conference on acoustics, speech and signal …, 2018	3093	2018
Conformer: Convolution-augmented transformer for speech recognition A Gulati, J Qin, CC Chiu, N Parmar, Y Zhang, J Yu, W Han, S Wang, ... arXiv preprint arXiv:2005.08100, 2020	2849	2020
Transfer learning from speaker verification to multispeaker text-to-speech synthesis Y Jia, Y Zhang, R Weiss, Q Wang, J Shen, F Ren, P Nguyen, R Pang, ... Advances in neural information processing systems 31, 2018	923	2018
Style tokens: Unsupervised style modeling, control and transfer in end-to-end speech synthesis Y Wang, D Stanton, Y Zhang, RJS Ryan, E Battenberg, J Shor, Y Xiao, ... International conference on machine learning, 5180-5189, 2018	918	2018
Libritts: A corpus derived from librispeech for text-to-speech H Zen, V Dang, R Clark, Y Zhang, RJ Weiss, Y Jia, Z Chen, Y Wu arXiv preprint arXiv:1904.02882, 2019	787	2019
Wavegrad: Estimating gradients for waveform generation N Chen, Y Zhang, H Zen, RJ Weiss, M Norouzi, W Chan arXiv preprint arXiv:2009.00713, 2020	678	2020
Very deep convolutional networks for end-to-end speech recognition Y Zhang, W Chan, N Jaitly 2017 IEEE international conference on acoustics, speech and signal …, 2017	557	2017
An introduction to computational networks and the computational network toolkit MS Dong Yu, Adam Eversole, Mike Seltzer, Kaisheng Yao, Zhiheng Huang, Brian ... Tech. Rep. MSR, Microsoft Research, 2014, http://codebox/cntk, 2014	472*	2014
Unsupervised learning of disentangled and interpretable representations from sequential data WN Hsu, Y Zhang, J Glass Advances in neural information processing systems 30, 2017	406	2017
Spoken language understanding using long short-term memory neural networks K Yao, B Peng, Y Zhang, D Yu, G Zweig, Y Shi IEEE SLT, 2014	402	2014
Highway long short-term memory rnns for distant speech recognition Y Zhang, G Chen, D Yu, K Yao, S Khudanpur, J Glass 2016 IEEE international conference on acoustics, speech and signal …, 2016	362	2016
Advances in joint CTC-attention based end-to-end speech recognition with a deep CNN encoder and RNN-LM T Hori, S Watanabe, Y Zhang, W Chan arXiv preprint arXiv:1706.02737, 2017	349	2017
W2v-bert: Combining contrastive learning and masked language modeling for self-supervised speech pre-training YA Chung, Y Zhang, W Han, CC Chiu, J Qin, R Pang, Y Wu 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021	344	2021
Pushing the limits of semi-supervised learning for automatic speech recognition Y Zhang, J Qin, DS Park, W Han, CC Chiu, R Pang, QV Le, Y Wu arXiv preprint arXiv:2010.10504, 2020	337	2020
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech X Wang, J Yamagishi, M Todisco, H Delgado, A Nautsch, N Evans, ... Computer Speech & Language 64, 101114, 2020	327	2020
Simple recurrent units for highly parallelizable recurrence T Lei, Y Zhang, SI Wang, H Dai, Y Artzi arXiv preprint arXiv:1709.02755, 2017	324	2017
Contextnet: Improving convolutional neural networks for automatic speech recognition with global context W Han, Z Zhang, Y Zhang, J Yu, CC Chiu, J Qin, A Gulati, R Pang, Y Wu arXiv preprint arXiv:2005.03191, 2020	293	2020
Hierarchical generative modeling for controllable speech synthesis WN Hsu, Y Zhang, RJ Weiss, H Zen, Y Wu, Y Wang, Y Cao, Y Jia, Z Chen, ... arXiv preprint arXiv:1810.07217, 2018	270	2018
Improved noisy student training for automatic speech recognition DS Park, Y Zhang, Y Jia, W Han, CC Chiu, B Li, Y Wu, QV Le arXiv preprint arXiv:2005.09629, 2020	248	2020

Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.

Articles 1–20

Nombre de citations par an

Citations en double

Citations fusionnées

Ajouter les coauteursCoauteurs

Suivre

Citée par

Coauteurs