Shinji Watanabe

Cited by

	All	Since 2019
Citations	28889	24598
h-index	80	74
i10-index	339	300

Citations per year
Year	Citations
2011	103
2012	142
2013	239
2014	240
2015	390
2016	478
2017	917
2018	1482
2019	2102
2020	2934
2021	4310
2022	4883
2023	6453
2024	3878

Public access (based on funding mandates)

75 articles available
4 articles not available

Co-authors

Takaaki Hori	Apple	Verified email at apple.com
John Hershey	Google (formerly MERL, IBM, MSR, UCSD)	Verified email at google.com
Jonathan Le Roux	MERL	Verified email at merl.com
Xuankai Chang	Apple - Carnegie Mellon University	Verified email at apple.com
Tomoki Hayashi	Human Dataware Lab. Co., Ltd., Nagoya University	Verified email at g.sp.m.is.nagoya-u.ac.jp
Jiatong Shi (史嘉彤)	Carnegie Mellon University	Verified email at andrew.cmu.edu
Atsushi Nakamura	Graduate School of Natural Sciences, Nagoya City University	Verified email at ieee.org
Hakan Erdogan	Google	Verified email at google.com
Wangyou Zhang	Ph.D. candidate, Department of Computer Science and Engineering, Shanghai Jiao Tong University	Verified email at sjtu.edu.cn
Tomohiro Nakatani	NTT Communication Science Laboratories	Verified email at ieee.org
Hirofumi Inaguma	Fundamental AI Research (FAIR) at Meta	Verified email at meta.com
Sanjeev Khudanpur	The Johns Hopkins University	Verified email at jhu.edu
Shota Horiguchi	NTT Corporation	Verified email at ntt.com
Yusuke Fujita	LY Corp.	Verified email at linecorp.com
Brian Yan	Carnegie Mellon University	Verified email at cs.cmu.edu
Marc Delcroix	NTT Communication Science Laboratories	Verified email at ieee.org
Zhuo Chen	Bytedance (formerly Microsoft, Columbia University)	Verified email at columbia.edu
Hung-yi Lee	National Taiwan University	Verified email at ntu.edu.tw
Leibny Paola Garcia	Johns Hopkins University	Verified email at jhu.edu
Aswin Shanmugam Subramanian	Microsoft	Verified email at microsoft.com

Shinji Watanabe

Carnegie Mellon University

Verified email at cmu.edu

Speech recognition Speech processing Speech enhancement Speech translation


Title	Authors	Venue	Cited by	Year
ESPnet: End-to-end speech processing toolkit	S Watanabe, T Hori, S Karita, T Hayashi, J Nishitoba, Y Unno, NEY Soplin, ...	arXiv preprint arXiv:1804.00015, 2018	1548	2018
Deep clustering: Discriminative embeddings for segmentation and separation	JR Hershey, Z Chen, J Le Roux, S Watanabe	2016 IEEE international conference on acoustics, speech and signal …, 2016	1515	2016
Joint CTC-attention based end-to-end speech recognition using multi-task learning	S Kim, T Hori, S Watanabe	2017 IEEE international conference on acoustics, speech and signal …, 2017	1045	2017
Hybrid CTC/attention architecture for end-to-end speech recognition	S Watanabe, T Hori, S Kim, JR Hershey, T Hayashi	IEEE Journal of Selected Topics in Signal Processing 11 (8), 1240-1253, 2017	877	2017
A comparative study on transformer vs rnn in speech applications	S Karita, N Chen, T Hayashi, T Hori, H Inaguma, Z Jiang, M Someki, ...	2019 IEEE automatic speech recognition and understanding workshop (ASRU …, 2019	802	2019
Superb: Speech processing universal performance benchmark	S Yang, PH Chi, YS Chuang, CIJ Lai, K Lakhotia, YY Lin, AT Liu, J Shi, ...	arXiv preprint arXiv:2105.01051, 2021	753	2021
The third ‘CHiME’ speech separation and recognition challenge: Dataset, task and baselines	J Barker, R Marxer, E Vincent, S Watanabe	2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015	749	2015
Phase-sensitive and recognition-boosted speech separation using deep recurrent neural networks	H Erdogan, JR Hershey, S Watanabe, J Le Roux	2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015	745	2015
Speech enhancement with LSTM recurrent neural networks and its application to noise-robust ASR	F Weninger, H Erdogan, S Watanabe, E Vincent, J Le Roux, JR Hershey, ...	Latent Variable Analysis and Signal Separation: 12th International …, 2015	677	2015
Single-channel multi-speaker separation using deep clustering	Y Isik, JL Roux, Z Chen, S Watanabe, JR Hershey	arXiv preprint arXiv:1607.02173, 2016	481	2016
An analysis of environment, microphone and data simulation mismatches in robust speech recognition	E Vincent, S Watanabe, AA Nugraha, J Barker, R Marxer	Computer Speech & Language 46, 535-557, 2017	412	2017
The fifth 'CHiME' speech separation and recognition challenge: dataset, task and baselines	J Barker, S Watanabe, E Vincent, J Trmal	arXiv preprint arXiv:1803.10609, 2018	407	2018
Improved MVDR beamforming using single-channel mask prediction networks	H Erdogan, JR Hershey, S Watanabe, MI Mandel, J Le Roux	Interspeech, 1981-1985, 2016	356	2016
Advances in joint CTC-attention based end-to-end speech recognition with a deep CNN encoder and RNN-LM	T Hori, S Watanabe, Y Zhang, W Chan	arXiv preprint arXiv:1706.02737, 2017	349	2017
A review of speaker diarization: Recent advances with deep learning	TJ Park, N Kanda, D Dimitriadis, KJ Han, S Watanabe, S Narayanan	Computer Speech & Language 72, 101317, 2022	331	2022
CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings	S Watanabe, M Mandel, J Barker, E Vincent, A Arora, X Chang, ...	arXiv preprint arXiv:2004.09249, 2020	307	2020
Self-supervised speech representation learning: A review	A Mohamed, H Lee, L Borgholt, JD Havtorn, J Edin, C Igel, K Kirchhoff, ...	IEEE Journal of Selected Topics in Signal Processing 16 (6), 1179-1210, 2022	285	2022
Recent developments on espnet toolkit boosted by conformer	P Guo, F Boyer, X Chang, T Hayashi, Y Higuchi, H Inaguma, N Kamo, C Li, ...	ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	278	2021
The second ‘CHiME’ speech separation and recognition challenge: Datasets, tasks and baselines	E Vincent, J Barker, S Watanabe, J Le Roux, F Nesta, M Matassoni	2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013	266	2013
Improving transformer-based end-to-end speech recognition with connectionist temporal classification and language model integration	S Karita, NEY Soplin, S Watanabe, M Delcroix, A Ogawa, T Nakatani	Proc. Interspeech 2019, 2019	254	2019
