Robert Kirk
Verified email at ucl.ac.uk
Title
Cited by
Year
A survey of zero-shot generalisation in deep reinforcement learning
R Kirk, A Zhang, E Grefenstette, T Rocktäschel
Journal of Artificial Intelligence Research 76, 201-264, 2023
Cited by: 251*; Year: 2023
MiniHack the planet: A sandbox for open-ended reinforcement learning research
M Samvelyan, R Kirk, V Kurin, J Parker-Holder, M Jiang, E Hambro, ...
arXiv preprint arXiv:2109.13202, 2021
Cited by: 68; Year: 2021
Insights from the NeurIPS 2021 NetHack challenge
E Hambro, S Mohanty, D Babaev, M Byeon, D Chakraborty, ...
NeurIPS 2021 Competitions and Demonstrations Track, 41-52, 2022
Cited by: 16; Year: 2022
Reward model ensembles help mitigate overoptimization
T Coste, U Anwar, R Kirk, D Krueger
arXiv preprint arXiv:2310.02743, 2023
Cited by: 11; Year: 2023
Understanding the effects of RLHF on LLM generalisation and diversity
R Kirk, I Mediratta, C Nalmpantis, J Luketina, E Hambro, E Grefenstette, ...
arXiv preprint arXiv:2310.06452, 2023
Cited by: 10; Year: 2023
Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks
S Jain, R Kirk, ES Lubana, RP Dick, H Tanaka, E Grefenstette, ...
arXiv preprint arXiv:2311.12786, 2023
Cited by: 8; Year: 2023
Graph backup: Data efficient backup exploiting Markovian transitions
Z Jiang, T Zhang, R Kirk, T Rocktäschel, E Grefenstette
arXiv preprint arXiv:2205.15824, 2022
Cited by: 4*; Year: 2022
A Study of Off-Policy Learning in Environments with Procedural Content Generation
A Ehrenberg, R Kirk, M Jiang, E Grefenstette, T Rocktäschel
ICLR Workshop on Agent Learning in Open-Endedness, 2022
Cited by: 4; Year: 2022
Generalization to new sequential decision making tasks with in-context learning
SC Raparthy, E Hambro, R Kirk, M Henaff, R Raileanu
arXiv preprint arXiv:2312.03801, 2023
Cited by: 2; Year: 2023
Leading the Pack: N-player Opponent Shaping
A Souly, T Willi, A Khan, R Kirk, C Lu, E Grefenstette, T Rocktäschel
arXiv preprint arXiv:2312.12564, 2023
Cited by: 1; Year: 2023
Domain Generalization for Robust Model-Based Offline Reinforcement Learning
A Clark, SA Siddiqui, R Kirk, U Anwar, S Chung, D Krueger
arXiv preprint arXiv:2211.14827, 2022
Cited by: 1; Year: 2022
What Mechanisms Does Knowledge Distillation Distill?
C Wu, ES Lubana, BK Mlodozeniec, R Kirk, D Krueger
UniReps: the First Workshop on Unifying Representations in Neural Models, 2023
Year: 2023
NeurIPS 2021 Competition and Demonstration Track Revised Selected Papers
D Kiela, M Ciccone, B Caputo, A Kanervisto, S Milani, K Ramanauskas, ...
NeurIPS 2021 Competitions and Demonstrations Track, i-ii, 2022
Year: 2022