Shinji Watanabe

Cited by

	All	Since 2019
Citations	27097	22777
h-index	77	71
i10-index	328	286

7000

3500

1750

5250

20112012201320142015201620172018201920202021202220232024104 142 241 241 395 478 917 1485 2088 2932 4325 4892 6510 1998

Public access

View all

56 articles

1 article

available

not available

Based on funding mandates

Co-authors

Takaaki HoriAppleVerified email at apple.com
John HersheyGoogle (formerly MERL, IBM, MSR, UCSD)Verified email at google.com
Jonathan Le RouxMERLVerified email at merl.com
Xuankai ChangCarnegie Mellon University, StudentVerified email at andrew.cmu.edu
Tomoki HayashiHuman Dataware Lab. Co., Ltd., Nagoya UniversityVerified email at g.sp.m.is.nagoya-u.ac.jp
Atsushi NakamuraGraduate School of Natural Sciences, Nagoya City UniversityVerified email at ieee.org
Jiatong Shi (史嘉彤)Carnegie Mellon UniversityVerified email at andrew.cmu.edu
Hakan ErdoganGoogleVerified email at google.com
Tomohiro NakataniNTT Communication Science LaboratoriesVerified email at ieee.org
Sanjeev KhudanpurThe Johns Hopkins UniversityVerified email at jhu.edu
Shota HoriguchiNTT CorporationVerified email at ntt.com
Hirofumi InagumaFundamental AI Research (FAIR) at MetaVerified email at meta.com
Yusuke FujitaLY Corp.Verified email at linecorp.com
Wangyou ZhangPh.D. candidate, Department of Computer Science and Engineering, Shanghai Jiao Tong UniversityVerified email at sjtu.edu.cn
Marc DelcroixNTT Communication Science LaboratoriesVerified email at ieee.org
Zhuo ChenBytedance (formerly Microsoft, Columbia University)Verified email at columbia.edu
Brian YanCarnegie Mellon UniversityVerified email at cs.cmu.edu
Emmanuel VincentSenior Research Scientist, InriaVerified email at inria.fr
Aswin Shanmugam SubramanianMicrosoftVerified email at microsoft.com
Leibny Paola GarciaJohns Hopkins UniversityVerified email at jhu.edu

Shinji Watanabe

Carnegie Mellon University

Verified email at cmu.edu - Homepage

Speech recognition Speech processing Speech enhancement Speech translation


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Deep clustering: Discriminative embeddings for segmentation and separation JR Hershey, Z Chen, J Le Roux, S Watanabe 2016 IEEE international conference on acoustics, speech and signal …, 2016	1463	2016
Espnet: End-to-end speech processing toolkit S Watanabe, T Hori, S Karita, T Hayashi, J Nishitoba, Y Unno, NEY Soplin, ... arXiv preprint arXiv:1804.00015, 2018	1455	2018
Joint CTC-attention based end-to-end speech recognition using multi-task learning S Kim, T Hori, S Watanabe 2017 IEEE international conference on acoustics, speech and signal …, 2017	1014	2017
Hybrid CTC/attention architecture for end-to-end speech recognition S Watanabe, T Hori, S Kim, JR Hershey, T Hayashi IEEE Journal of Selected Topics in Signal Processing 11 (8), 1240-1253, 2017	819	2017
A comparative study on transformer vs rnn in speech applications S Karita, N Chen, T Hayashi, T Hori, H Inaguma, Z Jiang, M Someki, ... 2019 IEEE automatic speech recognition and understanding workshop (ASRU …, 2019	750	2019
The third ‘CHiME’speech separation and recognition challenge: Dataset, task and baselines J Barker, R Marxer, E Vincent, S Watanabe 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015	736	2015
Phase-sensitive and recognition-boosted speech separation using deep recurrent neural networks H Erdogan, JR Hershey, S Watanabe, J Le Roux 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015	733	2015
Speech enhancement with LSTM recurrent neural networks and its application to noise-robust ASR F Weninger, H Erdogan, S Watanabe, E Vincent, J Le Roux, JR Hershey, ... Latent Variable Analysis and Signal Separation: 12th International …, 2015	667	2015
Superb: Speech processing universal performance benchmark S Yang, PH Chi, YS Chuang, CIJ Lai, K Lakhotia, YY Lin, AT Liu, J Shi, ... arXiv preprint arXiv:2105.01051, 2021	665	2021
Single-channel multi-speaker separation using deep clustering Y Isik, JL Roux, Z Chen, S Watanabe, JR Hershey arXiv preprint arXiv:1607.02173, 2016	476	2016
An analysis of environment, microphone and data simulation mismatches in robust speech recognition E Vincent, S Watanabe, AA Nugraha, J Barker, R Marxer Computer Speech & Language 46, 535-557, 2017	401	2017
The fifth'CHiME'speech separation and recognition challenge: dataset, task and baselines J Barker, S Watanabe, E Vincent, J Trmal arXiv preprint arXiv:1803.10609, 2018	397	2018
Improved mvdr beamforming using single-channel mask prediction networks. H Erdogan, JR Hershey, S Watanabe, MI Mandel, J Le Roux Interspeech, 1981-1985, 2016	347	2016
Advances in joint CTC-attention based end-to-end speech recognition with a deep CNN encoder and RNN-LM T Hori, S Watanabe, Y Zhang, W Chan arXiv preprint arXiv:1706.02737, 2017	344	2017
A review of speaker diarization: Recent advances with deep learning TJ Park, N Kanda, D Dimitriadis, KJ Han, S Watanabe, S Narayanan Computer Speech & Language 72, 101317, 2022	287	2022
CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings S Watanabe, M Mandel, J Barker, E Vincent, A Arora, X Chang, ... arXiv preprint arXiv:2004.09249, 2020	287	2020
The second ‘CHiME’speech separation and recognition challenge: Datasets, tasks and baselines E Vincent, J Barker, S Watanabe, J Le Roux, F Nesta, M Matassoni 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013	267	2013
Recent developments on espnet toolkit boosted by conformer P Guo, F Boyer, X Chang, T Hayashi, Y Higuchi, H Inaguma, N Kamo, C Li, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	262	2021
Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge. G Sell, D Snyder, A McCree, D Garcia-Romero, J Villalba, M Maciejewski, ... Interspeech, 2808-2812, 2018	236	2018
Improving transformer-based end-to-end speech recognition with connectionist temporal classification and language model integration S Karita, NEY Soplin, S Watanabe, M Delcroix, A Ogawa, T Nakatani Proc. Interspeech 2019, 2019	233	2019

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors