Rohan Anil

Cited by

	All	Since 2019
Citations	7089	6741
h-index	19	19
i10-index	23	23

Citations per year

Year	Citations
2017	99
2018	222
2019	526
2020	769
2021	984
2022	1174
2023	1907
2024	1376

Public access

2 articles available
0 articles not available

Based on funding mandates

Co-authors

Ehsan Amid	Senior Research Scientist at Google DeepMind	Verified email at google.com
Tomer Koren	Assistant Professor at Tel Aviv University	Verified email at tauex.tau.ac.il
Vineet Gupta	Google Inc	Verified email at google.com
George E. Dahl	Google Inc.	Verified email at google.com
Christopher Fifty	Stanford University	Verified email at cornell.edu
Naman Agarwal	Senior Research Scientist, Google AI Princeton	Verified email at google.com
Chelsea Finn	Stanford University, Google	Verified email at cs.stanford.edu
Robert Ormandi	Google	Verified email at google.com
Alexandre Passos	OpenAI	Verified email at cs.umass.edu
Geoffrey Hinton	Emeritus Prof. Computer Science, University of Toronto	Verified email at cs.toronto.edu
Cyril Zhang	Microsoft Research NYC	Verified email at microsoft.com
Elad Hazan	Professor at Princeton University and Director Google AI Princeton	Verified email at princeton.edu
Kunal Talwar	Apple Inc	Verified email at apple.com
Patrick Nguyen	Research Scientist, Google, Inc.	Verified email at google.com
Jonathan Shen	Google	Verified email at google.com
Mia Xu Chen	Google Brain	Verified email at google.com

Rohan Anil

Principal Engineer, Google Brain

Verified email at google.com

machine learning, neural networks, large scale training, optimization algorithms


Title	Cited by	Year
Wide & deep learning for recommender systems; HT Cheng, L Koc, J Harmsen, T Shaked, T Chandra, H Aradhye, ...; Proceedings of the 1st workshop on deep learning for recommender systems, 7-10, 2016	3826	2016
PaLM 2 technical report; R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ...; arXiv preprint arXiv:2305.10403, 2023	804	2023
Gemini: a family of highly capable multimodal models; G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...; arXiv preprint arXiv:2312.11805, 2023	511	2023
Large scale distributed neural network training through online distillation; R Anil, G Pereyra, AT Passos, R Ormandi, G Dahl, G Hinton; Sixth International Conference on Learning Representations, 2018	468	2018
Knowledge distillation: A good teacher is patient and consistent; L Beyer, X Zhai, A Royer, L Markeeva, R Anil, A Kolesnikov; Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022	222	2022
Efficiently Identifying Task Groupings for Multi-Task Learning; C Fifty, E Amid, Z Zhao, T Yu, R Anil, C Finn; 2021 Conference on Neural Information Processing Systems, Spotlight, 2021	210	2021
Lingvo: a modular and scalable framework for sequence-to-sequence modeling; J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ...; arXiv preprint arXiv:1902.08295, 2019	199	2019
TF-Ranking: Scalable TensorFlow library for learning-to-rank; RK Pasumarthi, S Bruch, X Wang, C Li, M Bendersky, M Najork, J Pfeifer, ...; Proceedings of the 25th ACM SIGKDD International Conference on Knowledge …, 2019	147	2019
Robust bi-tempered logistic loss based on Bregman divergences; E Amid, MK Warmuth, R Anil, T Koren; 2019 Conference on Neural Information Processing Systems, 2019	128	2019
Scalable Second Order Optimization for Deep Learning; R Anil, V Gupta, T Koren, K Regan, Y Singer; arXiv preprint arXiv:2002.09018, 2020	102*	2020
Large-Scale Differentially Private BERT; R Anil, B Ghazi, V Gupta, R Kumar, P Manurangsi; Privacy Preserving Machine Learning, 2021	94	2021
Sunipa Dev; R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ...; Jacob Devlin, Mark Díaz, Nan Du, Ethan Dyer, Vladimir Feinberg, Fangxiaoyu …, 2023	53	2023
Memory-efficient adaptive optimization for large-scale learning; R Anil, V Gupta, T Koren, Y Singer; 2019 Conference on Neural Information Processing Systems, 2019	50*	2019
Gemma: Open models based on Gemini research and technology; G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ...; arXiv preprint arXiv:2403.08295, 2024	40	2024
A large batch optimizer reality check: Traditional, generic optimizers suffice across batch sizes; Z Nado, JM Gilmer, CJ Shallue, R Anil, GE Dahl; arXiv preprint arXiv:2102.06356, 2021	37	2021
Disentangling adaptive gradient methods from learning rates; N Agarwal, R Anil, E Hazan, T Koren, C Zhang; arXiv preprint arXiv:2002.11803, 2020	37	2020
Wide and deep machine learning models; T Shaked, R Anil, HB Aradhye, G Anderson, W Chai, ML Koc, J Harmsen, ...; US Patent 10,762,422, 2020	32	2020
On the factory floor: ML engineering for industrial-scale ads recommendation models; R Anil, S Gadanho, D Huang, N Jacob, Z Li, D Lin, T Phillips, C Pop, ...; arXiv preprint arXiv:2209.05310, 2022	23	2022
LocoProp: Enhancing backprop via local loss optimization; E Amid, R Anil, MK Warmuth; The 25th International Conference on Artificial Intelligence and Statistics …, 2021	23	2021
Memory-efficient adaptive optimization for large-scale learning; R Anil, V Gupta, T Koren, Y Singer; arXiv preprint arXiv:1901.11150 4, 2019	17	2019
