Follow
Shinji Watanabe
Title
Cited by
Cited by
Year
Espnet: End-to-end speech processing toolkit
S Watanabe, T Hori, S Karita, T Hayashi, J Nishitoba, Y Unno, NEY Soplin, ...
arXiv preprint arXiv:1804.00015, 2018
14932018
Deep clustering: Discriminative embeddings for segmentation and separation
JR Hershey, Z Chen, J Le Roux, S Watanabe
2016 IEEE international conference on acoustics, speech and signal …, 2016
14852016
Joint CTC-attention based end-to-end speech recognition using multi-task learning
S Kim, T Hori, S Watanabe
2017 IEEE international conference on acoustics, speech and signal …, 2017
10292017
Hybrid CTC/attention architecture for end-to-end speech recognition
S Watanabe, T Hori, S Kim, JR Hershey, T Hayashi
IEEE Journal of Selected Topics in Signal Processing 11 (8), 1240-1253, 2017
8462017
A comparative study on transformer vs rnn in speech applications
S Karita, N Chen, T Hayashi, T Hori, H Inaguma, Z Jiang, M Someki, ...
2019 IEEE automatic speech recognition and understanding workshop (ASRU …, 2019
7822019
The third ‘CHiME’speech separation and recognition challenge: Dataset, task and baselines
J Barker, R Marxer, E Vincent, S Watanabe
2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015
7422015
Phase-sensitive and recognition-boosted speech separation using deep recurrent neural networks
H Erdogan, JR Hershey, S Watanabe, J Le Roux
2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015
7412015
Superb: Speech processing universal performance benchmark
S Yang, PH Chi, YS Chuang, CIJ Lai, K Lakhotia, YY Lin, AT Liu, J Shi, ...
arXiv preprint arXiv:2105.01051, 2021
7062021
Speech enhancement with LSTM recurrent neural networks and its application to noise-robust ASR
F Weninger, H Erdogan, S Watanabe, E Vincent, J Le Roux, JR Hershey, ...
Latent Variable Analysis and Signal Separation: 12th International …, 2015
6772015
Single-channel multi-speaker separation using deep clustering
Y Isik, JL Roux, Z Chen, S Watanabe, JR Hershey
arXiv preprint arXiv:1607.02173, 2016
4832016
An analysis of environment, microphone and data simulation mismatches in robust speech recognition
E Vincent, S Watanabe, AA Nugraha, J Barker, R Marxer
Computer Speech & Language 46, 535-557, 2017
4072017
The fifth'CHiME'speech separation and recognition challenge: dataset, task and baselines
J Barker, S Watanabe, E Vincent, J Trmal
arXiv preprint arXiv:1803.10609, 2018
4022018
Improved mvdr beamforming using single-channel mask prediction networks.
H Erdogan, JR Hershey, S Watanabe, MI Mandel, J Le Roux
Interspeech, 1981-1985, 2016
3522016
Advances in joint CTC-attention based end-to-end speech recognition with a deep CNN encoder and RNN-LM
T Hori, S Watanabe, Y Zhang, W Chan
arXiv preprint arXiv:1706.02737, 2017
3462017
A review of speaker diarization: Recent advances with deep learning
TJ Park, N Kanda, D Dimitriadis, KJ Han, S Watanabe, S Narayanan
Computer Speech & Language 72, 101317, 2022
3022022
CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings
S Watanabe, M Mandel, J Barker, E Vincent, A Arora, X Chang, ...
arXiv preprint arXiv:2004.09249, 2020
2972020
Recent developments on espnet toolkit boosted by conformer
P Guo, F Boyer, X Chang, T Hayashi, Y Higuchi, H Inaguma, N Kamo, C Li, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
2692021
The second ‘CHiME’speech separation and recognition challenge: Datasets, tasks and baselines
E Vincent, J Barker, S Watanabe, J Le Roux, F Nesta, M Matassoni
2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013
2672013
Self-supervised speech representation learning: A review
A Mohamed, H Lee, L Borgholt, JD Havtorn, J Edin, C Igel, K Kirchhoff, ...
IEEE Journal of Selected Topics in Signal Processing 16 (6), 1179-1210, 2022
2532022
Improving transformer-based end-to-end speech recognition with connectionist temporal classification and language model integration
S Karita, NEY Soplin, S Watanabe, M Delcroix, A Ogawa, T Nakatani
Proc. Interspeech 2019, 2019
2412019
The system can't perform the operation now. Try again later.
Articles 1–20