Shinji Watanabe
Shinji Watanabe
Johns Hopkins University
Verified email at ieee.org - Homepage
Title
Cited by
Cited by
Year
Deep clustering: Discriminative embeddings for segmentation and separation
JR Hershey, Z Chen, J Le Roux, S Watanabe
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
5822016
The third ‘CHiME’speech separation and recognition challenge: Dataset, task and baselines
J Barker, R Marxer, E Vincent, S Watanabe
2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015
4542015
Phase-sensitive and recognition-boosted speech separation using deep recurrent neural networks
H Erdogan, JR Hershey, S Watanabe, J Le Roux
2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015
3652015
Speech enhancement with LSTM recurrent neural networks and its application to noise-robust ASR
F Weninger, H Erdogan, S Watanabe, E Vincent, J Le Roux, JR Hershey, ...
International Conference on Latent Variable Analysis and Signal Separation …, 2015
3342015
Joint CTC-attention based end-to-end speech recognition using multi-task learning
S Kim, T Hori, S Watanabe
2017 IEEE international conference on acoustics, speech and signal …, 2017
3132017
Single-channel multi-speaker separation using deep clustering
Y Isik, JL Roux, Z Chen, S Watanabe, JR Hershey
arXiv preprint arXiv:1607.02173, 2016
2442016
Espnet: End-to-end speech processing toolkit
S Watanabe, T Hori, S Karita, T Hayashi, J Nishitoba, Y Unno, NEY Soplin, ...
arXiv preprint arXiv:1804.00015, 2018
2282018
An analysis of environment, microphone and data simulation mismatches in robust speech recognition
E Vincent, S Watanabe, AA Nugraha, J Barker, R Marxer
Computer Speech & Language 46, 535-557, 2017
2262017
The second ‘CHiME’speech separation and recognition challenge: Datasets, tasks and baselines
E Vincent, J Barker, S Watanabe, J Le Roux, F Nesta, M Matassoni
2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013
2152013
Hybrid CTC/attention architecture for end-to-end speech recognition
S Watanabe, T Hori, S Kim, JR Hershey, T Hayashi
IEEE Journal of Selected Topics in Signal Processing 11 (8), 1240-1253, 2017
1672017
Advances in joint CTC-attention based end-to-end speech recognition with a deep CNN encoder and RNN-LM
T Hori, S Watanabe, Y Zhang, W Chan
arXiv preprint arXiv:1706.02737, 2017
1562017
Improved mvdr beamforming using single-channel mask prediction networks.
H Erdogan, JR Hershey, S Watanabe, MI Mandel, J Le Roux
Interspeech, 1981-1985, 2016
1532016
Topic tracking model for analyzing consumer purchase behavior
T Iwata, S Watanabe, T Yamada, N Ueda
Twenty-First International Joint Conference on Artificial Intelligence, 2009
1512009
The fifth'CHiME'speech separation and recognition challenge: dataset, task and baselines
J Barker, S Watanabe, E Vincent, J Trmal
arXiv preprint arXiv:1803.10609, 2018
1482018
Recurrent deep neural networks for robust speech recognition
C Weng, D Yu, S Watanabe, BHF Juang
2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014
1282014
Deep beamforming networks for multi-channel speech recognition
X Xiao, S Watanabe, H Erdogan, L Lu, J Hershey, ML Seltzer, G Chen, ...
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
1092016
Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge.
G Sell, D Snyder, A McCree, D Garcia-Romero, J Villalba, M Maciejewski, ...
Interspeech, 2808-2812, 2018
1042018
Speech enhancement and recognition using multi-task learning of long short-term memory recurrent neural networks
Z Chen, S Watanabe, H Erdogan, JR Hershey
Sixteenth Annual Conference of the International Speech Communication …, 2015
1022015
Variational Bayesian estimation and clustering for speech recognition
S Watanabe, Y Minami, A Nakamura, N Ueda
IEEE Transactions on Speech and Audio Processing 12 (4), 365-381, 2004
1022004
A comparative study on transformer vs RNN in speech applications
S Karita, N Chen, T Hayashi, T Hori, H Inaguma, Z Jiang, M Someki, ...
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
992019
The system can't perform the operation now. Try again later.
Articles 1–20