Iurii Kemaev

Citations per year: 2019 (1), 2020 (4), 2021 (46), 2022 (95), 2023 (129), 2024 (41)

Public access (based on funding mandates): 1 article available, 0 articles not available

David Budden (DeepMind). Verified email at csail.mit.edu
Matteo Hessel (Research Engineer, Google DeepMind). Verified email at google.com
Hado van Hasselt (Research Scientist, DeepMind; Honorary Professor, UCL). Verified email at google.com
Tom Zahavy (Staff Research Scientist, Google DeepMind). Verified email at deepmind.com
Fabio Viola (DeepMind). Verified email at google.com
Satinder Singh (Google DeepMind / U. of Michigan). Verified email at umich.edu
Zhongwen Xu (Tencent). Verified email at tencent.com
David Silver (DeepMind, UCL). Verified email at google.com
Vivek Veeriah (Google DeepMind). Verified email at google.com
Junhyuk Oh (Research Scientist, DeepMind). Verified email at google.com
Tom Schaul (Senior Staff Scientist, DeepMind). Verified email at nyu.edu
Dmitry Vetrov (Professor of Computer Science at Constructor University, Bremen). Verified email at constructor.university
Daniil Polykovskiy (Sr. Director of Technology, Insilico Medicine). Verified email at insilico.com
Georg Ostrovski (Google DeepMind)

Iurii Kemaev

DeepMind

Verified email at deepmind.com


Title	Cited by	Year
The DeepMind JAX Ecosystem, 2020; I Babuschkin, K Baumli, A Bell, S Bhupatiraju, J Bruce, P Buchlovsky, ...; URL http://github.com/deepmind, 0	221*
Discovery of options via meta-learned subgoals; V Veeriah, T Zahavy, M Hessel, Z Xu, J Oh, I Kemaev, HP van Hasselt, ...; Advances in Neural Information Processing Systems 34, 29861-29873, 2021	34	2021
Podracer architectures for scalable Reinforcement Learning; M Hessel, M Kroiss, A Clark, I Kemaev, J Quan, T Keck, F Viola, ...; arXiv preprint arXiv:2104.06272, 2021	23	2021
Discovering a set of policies for the worst case reward; T Zahavy, A Barreto, DJ Mankowitz, S Hou, B O'Donoghue, I Kemaev, ...; arXiv preprint arXiv:2102.04323, 2021	22	2021
Return-based scaling: Yet another normalisation trick for deep RL; T Schaul, G Ostrovski, I Kemaev, D Borsa; arXiv preprint arXiv:2105.05347, 2021	13	2021
ReSet: Learning recurrent dynamic routing in ResNet-like neural networks; I Kemaev, D Polykovskiy, D Vetrov; Asian Conference on Machine Learning, 422-437, 2018	4	2018
Learning options for action selection with meta-gradients in multi-task reinforcement learning; VVJ Veeraiah, TBZ Zahavy, M Hessel, Z Xu, J Oh, I Kemaev, ...; US Patent App. 17/918,365, 2023	1	2023


