A study on data augmentation of reverberant speech for robust speech recognition T Ko, V Peddinti, D Povey, ML Seltzer, S Khudanpur 2017 IEEE international conference on acoustics, speech and signal …, 2017 | 1118 | 2017 |
Recent advances in deep learning for speech research at Microsoft L Deng, J Li, JT Huang, K Yao, D Yu, F Seide, M Seltzer, G Zweig, X He, ... 2013 IEEE international conference on acoustics, speech and signal …, 2013 | 1055 | 2013 |
The Microsoft 2017 conversational speech recognition system W Xiong, L Wu, F Alleva, J Droppo, X Huang, A Stolcke 2018 IEEE international conference on acoustics, speech and signal …, 2018 | 964 | 2018 |
An investigation of deep neural networks for noise robust speech recognition ML Seltzer, D Yu, Y Wang 2013 IEEE international conference on acoustics, speech and signal …, 2013 | 806 | 2013 |
Achieving human parity in conversational speech recognition W Xiong, J Droppo, X Huang, F Seide, M Seltzer, A Stolcke, D Yu, ... arXiv preprint arXiv:1610.05256, 2016 | 719 | 2016 |
Binary coding of speech spectrograms using a deep auto-encoder L Deng, ML Seltzer, D Yu, A Acero, A Mohamed, G Hinton Eleventh annual conference of the international speech communication association, 2010 | 498 | 2010 |
An introduction to computational networks and the computational network toolkit D Yu, A Eversole, M Seltzer, K Yao, Z Huang, B Guenter, O Kuchaiev, ... Microsoft Technical Report MSR-TR-2014–112, 2014 | 475 | 2014 |
The llama 3 herd of models A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, A Letman, A Mathur, ... arXiv preprint arXiv:2407.21783, 2024 | 396 | 2024 |
Improved bottleneck features using pretrained deep neural networks D Yu, ML Seltzer Twelfth annual conference of the international speech communication association, 2011 | 390 | 2011 |
Feature learning in deep neural networks-studies on speech recognition tasks D Yu, ML Seltzer, J Li, JT Huang, F Seide arXiv preprint arXiv:1301.3605, 2013 | 321 | 2013 |
Multi-task learning in deep neural networks for improved phoneme recognition ML Seltzer, J Droppo 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 291 | 2013 |
Crowdmos: An approach for crowdsourcing mean opinion score studies F Ribeiro, D Florêncio, C Zhang, M Seltzer 2011 IEEE international conference on acoustics, speech and signal …, 2011 | 286 | 2011 |
Reconstruction of missing features for robust speech recognition B Raj, ML Seltzer, RM Stern Speech communication 43 (4), 275-296, 2004 | 282 | 2004 |
Augmenting speech recognition with depth imaging J Kapur, I Tashev, M Seltzer, SE Hodges US Patent App. 13/662,293, 2014 | 281 | 2014 |
Transformer-based acoustic modeling for hybrid speech recognition Y Wang, A Mohamed, D Le, C Liu, A Xiao, J Mahadeokar, H Huang, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 268 | 2020 |
Toward human parity in conversational speech recognition W Xiong, J Droppo, X Huang, F Seide, ML Seltzer, A Stolcke, D Yu, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (12 …, 2017 | 256 | 2017 |
Deep beamforming networks for multi-channel speech recognition X Xiao, S Watanabe, H Erdogan, L Lu, J Hershey, ML Seltzer, G Chen, ... 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 223 | 2016 |
A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition ML Seltzer, B Raj, RM Stern Speech Communication 43 (4), 379-393, 2004 | 215 | 2004 |
Singular value decomposition based low-footprint speaker adaptation and personalization for deep neural network J Xue, J Li, D Yu, M Seltzer, Y Gong 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 200 | 2014 |
Likelihood-maximizing beamforming for robust hands-free speech recognition ML Seltzer, B Raj, RM Stern IEEE Transactions on speech and audio processing 12 (5), 489-498, 2004 | 190 | 2004 |