John Hershey

Cited by

	All	Since 2019
Citations	17405	12743
h-index	58	48
i10-index	144	107

Citations per year
Year	Citations
2006	56
2007	100
2008	86
2009	144
2010	215
2011	186
2012	271
2013	269
2014	289
2015	365
2016	413
2017	806
2018	1247
2019	1770
2020	2252
2021	2491
2022	2428
2023	2503
2024	1296

Public access (based on funding mandates)
7 articles available
0 articles not available

Co-authors

Jonathan Le Roux	MERL	Verified email at merl.com
Shinji Watanabe	Carnegie Mellon University	Verified email at cmu.edu
Hakan Erdogan	Google	Verified email at google.com
Scott Wisdom	Google Research	Verified email at google.com
Takaaki Hori	Apple	Verified email at apple.com
Peder A Olsen	Microsoft Research (formerly IBM Research)	Verified email at microsoft.com
Zhuo Chen	Bytedance (formerly Microsoft, Columbia University)	Verified email at columbia.edu
Steven J. Rennie	Pryon Inc. (Formerly Fusemachines Inc, IBM Research, University of Toronto)	Verified email at pryoninc.com
Felix Weninger	Microsoft	Verified email at microsoft.com
Kevin Wilson	Google	Verified email at google.com
Trausti T Kristjansson	Amazon Lab126, Adjoint Professor at University of Reykjavik (formerly Google, IBM, MSR)	Verified email at amazon.com
Javier Movellan	Research Professor, University of California San Diego	Verified email at mplab.ucsd.edu
Chiori Hori	MERL	Verified email at merl.com
Tim K. Marks	Principal Research Scientist, Mitsubishi Electric Research Labs (MERL)	Verified email at merl.com
Efthymios Tzinis	Research Scientist at Google | Ex. UIUC, MERL, Meta	Verified email at google.com
Zhong-Qiu Wang	Postdoc, Carnegie Mellon University	Verified email at andrew.cmu.edu
Ron J Weiss	Google	Verified email at google.com
Yuuki Tachioka	Denso IT Laboratory	Verified email at d-itlab.co.jp
Björn Schuller	Professor, Technische Universität München (TUM) / Imperial College London & CSO, audEERING	Verified email at tum.de
Joshua M Susskind	Apple AI Research	Verified email at apple.com

John Hershey

Google (formerly MERL, IBM, MSR, UCSD)

Verified email at google.com

machine learning, sound separation, speech recognition, audio-visual perception


Title	Authors	Venue	Cited by	Year
Deep clustering: Discriminative embeddings for segmentation and separation	JR Hershey, Z Chen, J Le Roux, S Watanabe	2016 IEEE international conference on acoustics, speech and signal …, 2016	1515	2016
Approximating the Kullback Leibler divergence between Gaussian mixture models	JR Hershey, PA Olsen	2007 IEEE International Conference on Acoustics, Speech and Signal …, 2007	1393	2007
SDR–half-baked or well done?	J Le Roux, S Wisdom, H Erdogan, JR Hershey	ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	1134	2019
Hybrid CTC/attention architecture for end-to-end speech recognition	S Watanabe, T Hori, S Kim, JR Hershey, T Hayashi	IEEE Journal of Selected Topics in Signal Processing 11 (8), 1240-1253, 2017	877	2017
Phase-sensitive and recognition-boosted speech separation using deep recurrent neural networks	H Erdogan, JR Hershey, S Watanabe, J Le Roux	2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015	745	2015
Speech enhancement with LSTM recurrent neural networks and its application to noise-robust ASR	F Weninger, H Erdogan, S Watanabe, E Vincent, J Le Roux, JR Hershey, ...	Latent Variable Analysis and Signal Separation: 12th International …, 2015	677	2015
Deep unfolding: Model-based inspiration of novel deep architectures	JR Hershey, JL Roux, F Weninger	arXiv preprint arXiv:1409.2574, 2014	496	2014
Single-channel multi-speaker separation using deep clustering	Y Isik, JL Roux, Z Chen, S Watanabe, JR Hershey	arXiv preprint arXiv:1607.02173, 2016	481	2016
Attention-based multimodal fusion for video description	C Hori, T Hori, TY Lee, Z Zhang, B Harsham, JR Hershey, TK Marks, ...	Proceedings of the IEEE international conference on computer vision, 4193-4202, 2017	412	2017
Voicefilter: Targeted voice separation by speaker-conditioned spectrogram masking	Q Wang, H Muckenhirn, K Wilson, P Sridhar, Z Wu, J Hershey, ...	arXiv preprint arXiv:1810.04826, 2018	404	2018
Audio vision: Using audio-visual synchrony to locate sounds	J Hershey, J Movellan	Advances in neural information processing systems 12, 1999	378	1999
Discriminatively trained recurrent neural networks for single-channel speech separation	F Weninger, JR Hershey, J Le Roux, B Schuller	2014 IEEE global conference on signal and information processing (GlobalSIP …, 2014	357	2014
Improved MVDR beamforming using single-channel mask prediction networks.	H Erdogan, JR Hershey, S Watanabe, MI Mandel, J Le Roux	Interspeech, 1981-1985, 2016	356	2016
Full-capacity unitary recurrent neural networks	S Wisdom, T Powers, J Hershey, J Le Roux, L Atlas	Advances in Neural Information Processing Systems, 4880-4888, 2016	351	2016
Multi-channel deep clustering: Discriminative spectral and spatial embeddings for speaker-independent speech separation	ZQ Wang, J Le Roux, JR Hershey	2018 IEEE International conference on acoustics, speech and signal …, 2018	259	2018
Monaural speech separation and recognition challenge	M Cooke, JR Hershey, SJ Rennie	Computer Speech & Language 24 (1), 1-15, 2010	247	2010
Deep beamforming networks for multi-channel speech recognition	X Xiao, S Watanabe, H Erdogan, L Lu, J Hershey, ML Seltzer, G Chen, ...	2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016	216	2016
Alternative objective functions for deep clustering	ZQ Wang, J Le Roux, JR Hershey	2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	215	2018
Super-human multi-talker speech recognition: A graphical modeling approach	JR Hershey, SJ Rennie, PA Olsen, TT Kristjansson	Computer Speech & Language 24 (1), 45-66, 2010	212	2010
Universal sound separation	I Kavalerov, S Wisdom, H Erdogan, B Patton, K Wilson, J Le Roux, ...	2019 IEEE Workshop on Applications of Signal Processing to Audio and …, 2019	210	2019

Articles 1–20
