Folgen
Iurii Kemaev
Titel
Zitiert von
Zitiert von
Jahr
The DeepMind JAX Ecosystem, 2020
I Babuschkin, K Baumli, A Bell, S Bhupatiraju, J Bruce, P Buchlovsky, ...
URL http://github. com/deepmind, 0
232*
Discovery of options via meta-learned subgoals
V Veeriah, T Zahavy, M Hessel, Z Xu, J Oh, I Kemaev, HP van Hasselt, ...
Advances in Neural Information Processing Systems 34, 29861-29873, 2021
352021
Podracer architectures for scalable Reinforcement Learning
M Hessel, M Kroiss, A Clark, I Kemaev, J Quan, T Keck, F Viola, ...
arXiv preprint arXiv:2104.06272, 2021
242021
Discovering a set of policies for the worst case reward
T Zahavy, A Barreto, DJ Mankowitz, S Hou, B O'Donoghue, I Kemaev, ...
arXiv preprint arXiv:2102.04323, 2021
222021
Return-based scaling: Yet another normalisation trick for deep rl
T Schaul, G Ostrovski, I Kemaev, D Borsa
arXiv preprint arXiv:2105.05347, 2021
142021
Reset: learning recurrent dynamic routing in resnet-like neural networks
I Kemaev, D Polykovskiy, D Vetrov
Asian Conference on Machine Learning, 422-437, 2018
42018
Learning options for action selection with meta-gradients in multi-task reinforcement learning
VVJ Veeraiah, TBZ Zahavy, M Hessel, Z Xu, J Oh, I Kemaev, ...
US Patent App. 17/918,365, 2023
12023
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–7