Ankur Bapna
Google Research
Verified email at google.com
Title
Cited by
Year
GPipe: Efficient training of giant neural networks using pipeline parallelism
Y Huang, Y Cheng, A Bapna, O Firat, D Chen, M Chen, HJ Lee, J Ngiam, ...
Advances in neural information processing systems, 103-112, 2019
Cited by: 261; Year: 2019
The best of both worlds: Combining recent advances in neural machine translation
MX Chen, O Firat, A Bapna, M Johnson, W Macherey, G Foster, L Jones, ...
arXiv preprint arXiv:1804.09849, 2018
Cited by: 222; Year: 2018
Building a conversational agent overnight with dialogue self-play
P Shah, D Hakkani-Tür, G Tür, A Rastogi, A Bapna, N Nayak, L Heck
arXiv preprint arXiv:1801.04871, 2018
Cited by: 61; Year: 2018
Towards zero-shot frame semantic parsing for domain scaling
A Bapna, G Tur, D Hakkani-Tur, L Heck
arXiv preprint arXiv:1707.02363, 2017
Cited by: 60; Year: 2017
Lingvo: a modular and scalable framework for sequence-to-sequence modeling
J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ...
arXiv preprint arXiv:1902.08295, 2019
Cited by: 55; Year: 2019
Massively multilingual neural machine translation in the wild: Findings and challenges
N Arivazhagan, A Bapna, O Firat, D Lepikhin, M Johnson, M Krikun, ...
arXiv preprint arXiv:1907.05019, 2019
Cited by: 45; Year: 2019
Revisiting character-based neural machine translation with capacity and compression
C Cherry, G Foster, A Bapna, O Firat, W Macherey
arXiv preprint arXiv:1808.09943, 2018
Cited by: 45; Year: 2018
Training deeper neural machine translation models with transparent attention
A Bapna, MX Chen, O Firat, Y Cao, Y Wu
arXiv preprint arXiv:1808.07561, 2018
Cited by: 38; Year: 2018
Sequential dialogue context modeling for spoken language understanding
A Bapna, G Tur, D Hakkani-Tur, L Heck
arXiv preprint arXiv:1705.03455, 2017
Cited by: 34*; Year: 2017
Simple, scalable adaptation for neural machine translation
A Bapna, N Arivazhagan, O Firat
arXiv preprint arXiv:1909.08478, 2019
Cited by: 24; Year: 2019
Investigating multilingual NMT representations at scale
SR Kudugunta, A Bapna, I Caswell, N Arivazhagan, O Firat
arXiv preprint arXiv:1909.02197, 2019
Cited by: 21; Year: 2019
The missing ingredient in zero-shot neural machine translation
N Arivazhagan, A Bapna, O Firat, R Aharoni, M Johnson, W Macherey
arXiv preprint arXiv:1903.07091, 2019
Cited by: 21; Year: 2019
Large-scale multilingual speech recognition with a streaming end-to-end model
A Kannan, A Datta, TN Sainath, E Weinstein, B Ramabhadran, Y Wu, ...
arXiv preprint arXiv:1909.05330, 2019
Cited by: 20; Year: 2019
Non-parametric adaptation for neural machine translation
A Bapna, O Firat
arXiv preprint arXiv:1903.00058, 2019
Cited by: 14; Year: 2019
Evaluating the Cross-Lingual Effectiveness of Massively Multilingual Neural Machine Translation
A Siddhant, M Johnson, H Tsai, N Ari, J Riesa, A Bapna, O Firat, K Raman
AAAI, 8854-8861, 2020
Cited by: 12; Year: 2020
The missing ingredient in zero-shot neural machine translation
N Arivazhagan, A Bapna, O Firat, R Aharoni, M Johnson, W Macherey
Cited by: 6; Year: 2018
Controlling computation versus quality for neural sequence models
A Bapna, N Arivazhagan, O Firat
arXiv preprint arXiv:2002.07106, 2020
Cited by: 3; Year: 2020
Faster Transformer Decoding: N-gram Masked Self-Attention
C Chelba, M Chen, A Bapna, N Shazeer
arXiv preprint arXiv:2001.04589, 2020
Cited by: 3; Year: 2020
Leveraging Monolingual Data with Self-Supervision for Multilingual Neural Machine Translation
A Siddhant, A Bapna, Y Cao, O Firat, M Chen, S Kudugunta, ...
arXiv preprint arXiv:2005.04816, 2020
Cited by: 2; Year: 2020
Machine translation using neural network models
Z Chen, MR Hughes, Y Wu, M Schuster, X Chen, LO Jones, NJ Parmar, ...
US Patent App. 16/521,780, 2020
Cited by: 1; Year: 2020