Iurii Kemaev

120

2019202020212022202320241 4 44 95 120 61

Öffentlicher Zugriff

1 Artikel

0 Artikel

verfügbar

nicht verfügbar

Basierend auf Fördermandaten

David BuddenDeepMindBestätigte E-Mail-Adresse bei csail.mit.edu
Matteo HesselResearch Engineer, Google DeepMindBestätigte E-Mail-Adresse bei google.com
Tom ZahavyStaff Research Scientist, Google DeepMindBestätigte E-Mail-Adresse bei deepmind.com
Hado van HasseltResearch Scientist, DeepMind; Honorary Professor, UCLBestätigte E-Mail-Adresse bei google.com
Fabio ViolaDeepMindBestätigte E-Mail-Adresse bei google.com
Satinder SinghGoogle DeepMind / U. of MichiganBestätigte E-Mail-Adresse bei umich.edu
Zhongwen XuTencentBestätigte E-Mail-Adresse bei tencent.com
David SilverDeepMind, UCLBestätigte E-Mail-Adresse bei google.com
Vivek VeeriahGoogle DeepMindBestätigte E-Mail-Adresse bei google.com
Junhyuk OhResearch Scientist, DeepMindBestätigte E-Mail-Adresse bei google.com
Tom SchaulSenior Staff Scientist, DeepMindBestätigte E-Mail-Adresse bei nyu.edu
Dmitry VetrovProfessor of Computer Science at Constructor University, BremenBestätigte E-Mail-Adresse bei constructor.university
Daniil PolykovskiySr. Director of Technology, Insilico MedicineBestätigte E-Mail-Adresse bei insilico.com
Georg OstrovskiGoogle DeepMind

Iurii Kemaev

DeepMind

Bestätigte E-Mail-Adresse bei deepmind.com


Titel Nach Zitationen sortieren Nach Jahr sortieren Nach Titel sortieren	Zitiert von Zitiert von	Jahr
The DeepMind JAX Ecosystem, 2020 I Babuschkin, K Baumli, A Bell, S Bhupatiraju, J Bruce, P Buchlovsky, ... URL http://github. com/deepmind, 0	233*
Discovery of options via meta-learned subgoals V Veeriah, T Zahavy, M Hessel, Z Xu, J Oh, I Kemaev, HP van Hasselt, ... Advances in Neural Information Processing Systems 34, 29861-29873, 2021	34	2021
Discovering a set of policies for the worst case reward T Zahavy, A Barreto, DJ Mankowitz, S Hou, B O'Donoghue, I Kemaev, ... arXiv preprint arXiv:2102.04323, 2021	22	2021
Podracer architectures for scalable Reinforcement Learning M Hessel, M Kroiss, A Clark, I Kemaev, J Quan, T Keck, F Viola, ... arXiv preprint arXiv:2104.06272, 2021	20	2021
Return-based scaling: Yet another normalisation trick for deep rl T Schaul, G Ostrovski, I Kemaev, D Borsa arXiv preprint arXiv:2105.05347, 2021	13	2021
Reset: learning recurrent dynamic routing in resnet-like neural networks I Kemaev, D Polykovskiy, D Vetrov Asian Conference on Machine Learning, 422-437, 2018	4	2018
Learning options for action selection with meta-gradients in multi-task reinforcement learning VVJ Veeraiah, TBZ Zahavy, M Hessel, Z Xu, J Oh, I Kemaev, ... US Patent App. 17/918,365, 2023	1	2023

Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.

Artikel 1–7

Zitate pro Jahr