Follow
Zafarali Ahmed
Zafarali Ahmed
Google DeepMind
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ...
arXiv preprint arXiv:2312.11805, 2023
21462023
Gemma: Open models based on gemini research and technology
G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ...
arXiv preprint arXiv:2403.08295, 2024
7112024
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ...
arXiv preprint arXiv:2403.05530, 2024
671*2024
Understanding the impact of entropy on policy optimization
Z Ahmed, N Le Roux, M Norouzi, D Schuurmans
International Conference on Machine Learning (ICML) 2019, 151-160, 2019
2672019
InfoBot: Transfer and Exploration via the Information Bottleneck
A Goyal, R Islam, D Strouse, Z Ahmed, M Botvinick, H Larochelle, ...
International Conference on Learning Representations (ICLR) 2019, 2019
1782019
What can I do here? A Theory of Affordances in Reinforcement Learning
K Khetarpal, Z Ahmed, G Comanici, D Abel, D Precup
International Conference on Machine Learning (ICML) 2020, 5479--5488, 2020
782020
Androidenv: A reinforcement learning platform for android
D Toyama, P Hamel, A Gergely, G Comanici, A Glaese, Z Ahmed, ...
arXiv preprint arXiv:2105.13231, 2021
642021
RE-EVALUATE: Reproducibility in Evaluating Reinforcement Learning Algorithms
K Khetarpal, Z Ahmed, A Cianflone, R Islam, J Pineau
2nd Reproducibility in Machine Learning Workshop at ICML 2018, 2018
232018
Intratumor Heterogeneity and Circulating Tumor Cell Clusters
Z Ahmed, S Gravel
Molecular Biology and Evolution, 2017
232017
Learning to prove from synthetic theorems
E Aygün, Z Ahmed, A Anand, V Firoiu, X Glorot, L Orseau, D Precup, ...
arXiv preprint arXiv:2006.11259, 2020
212020
Training a first-order theorem prover from synthetic data
V Firoiu, E Aygun, A Anand, Z Ahmed, X Glorot, L Orseau, L Zhang, ...
arXiv preprint arXiv:2103.03798, 2021
162021
Marginalized state distribution entropy regularization in policy optimization
R Islam, Z Ahmed, D Precup
arXiv preprint arXiv:1912.05128, 2019
162019
Temporally abstract partial models
K Khetarpal, Z Ahmed, G Comanici, D Precup
Advances in Neural Information Processing Systems 34, 1979-1991, 2021
142021
Vfunc: a deep generative model for functions
P Bachman, R Islam, A Sordoni, Z Ahmed
Workshop on Prediction and Generative Modeling in Reinforcement Learning at …, 2018
82018
Generalized policy updates for policy optimization
S Kumar, R Dadashi, Z Ahmed, D Schuurmans, MG Bellemare
NeurIPS 2019 Optimization Foundations for Reinforcement Learning Workshop, 2019
22019
Discrete off-policy policy gradient using continuous relaxations
A Cianflone, Z Ahmed, R Islam, AJ Bose, WL Hamilton
Unpublished. https://joeybose. github. io/assets/Gradient_estimator. pdf, 2019
22019
Learning how to Interact with a Complex Interface using Hierarchical Reinforcement Learning
G Comanici, A Glaese, A Gergely, D Toyama, Z Ahmed, T Jackson, ...
arXiv preprint arXiv:2204.10374, 2022
12022
Learning proposals for sequential importance samplers using reinforced variational inference
Z Ahmed, A Karuvally, D Precup, S Gravel
Deep RL Meets Structured Prediction Workshop at ICLR, 2019
12019
Controlling computing devices using hierarchical agents
GT Comanici, AMC Glaese, A Gergely, Z Ahmed, T Jackson, D Precup
US Patent App. 18/105,180, 2024
2024
Unifying Variational Inference and Policy Optimization
Z Ahmed
McGill University, 2019
2019
The system can't perform the operation now. Try again later.
Articles 1–20