Robert Kirk

Zitiert von

	Alle	Seit 2019
Zitate	541	541
h-index	6	6
i10-index	6	6

220

110

165

20212022202320245 120 193 218

Koautoren

Edward GrefenstetteDirector of Research, Google DeepMind | Honorary Professor, UCLBestätigte E-Mail-Adresse bei google.com
Tim RocktäschelProfessor of Artificial Intelligence at UCL, Open-Endedness Team Lead at Google DeepMindBestätigte E-Mail-Adresse bei cs.ucl.ac.uk
Eric HambroAnthropicBestätigte E-Mail-Adresse bei anthropic.com
David Scott KruegerUniversity Assistant Professor, University of CambridgeBestätigte E-Mail-Adresse bei cam.ac.uk
Amy ZhangAssistant Professor of Electrical and Computer Engineering at University of Texas at AustinBestätigte E-Mail-Adresse bei austin.utexas.edu
Minqi JiangResearch Scientist at Google DeepMindBestätigte E-Mail-Adresse bei ucl.ac.uk
Roberta RaileanuResearch Scientist, MetaBestätigte E-Mail-Adresse bei fb.com
Usman AnwarUniversity of CambridgeBestätigte E-Mail-Adresse bei cam.ac.uk
Vitaly KurinResearch Scientist at Isomorphic LabsBestätigte E-Mail-Adresse bei isomorphiclabs.com
Mikayel SamvelyanMeta AI, UCLBestätigte E-Mail-Adresse bei meta.com
Fabio PetroniSamaya AIBestätigte E-Mail-Adresse bei samaya.ai
Heinrich KüttlerxAIBestätigte E-Mail-Adresse bei math.lmu.de
Jack Parker-HolderGoogle DeepMind, UCLBestätigte E-Mail-Adresse bei google.com
Hidenori TanakaGroup Leader, CBS-NTT Program in "Physics of Intelligence", Harvard UniversityBestätigte E-Mail-Adresse bei fas.harvard.edu
Robert DickUniversity of Michigan, StrydBestätigte E-Mail-Adresse bei rpdmail.dyndns.org
Ekdeep Singh LubanaUniversity of MichiganBestätigte E-Mail-Adresse bei umich.edu
Samyak JainUndergrad at Indian Institute of Technology(BHU),VaranasiBestätigte E-Mail-Adresse bei itbhu.ac.in
Thomas CosteNoah's Ark Lab & University of CambridgeBestätigte E-Mail-Adresse bei cam.ac.uk
Christoforos NalmpantisPostdoctoral Researcher, Fundamental AI Research at MetaBestätigte E-Mail-Adresse bei fb.com
Jelena LuketinaOxford UniversityBestätigte E-Mail-Adresse bei cs.ox.ac.uk

Folgen

Robert Kirk

PhD Student, University College London

Bestätigte E-Mail-Adresse bei ucl.ac.uk - Startseite

AI Alignment AI Safety Language Models Fine-tuning Generalisation


Titel Nach Zitationen sortieren Nach Jahr sortieren Nach Titel sortieren	Zitiert von Zitiert von	Jahr
A survey of zero-shot generalisation in deep reinforcement learning R Kirk, A Zhang, E Grefenstette, T Rocktäschel Journal of Artificial Intelligence Research 76, 201-264, 2023	322	2023
Minihack the planet: A sandbox for open-ended reinforcement learning research M Samvelyan, R Kirk, V Kurin, J Parker-Holder, M Jiang, E Hambro, ... arXiv preprint arXiv:2109.13202, 2021	77	2021
Reward model ensembles help mitigate overoptimization T Coste, U Anwar, R Kirk, D Krueger arXiv preprint arXiv:2310.02743, 2023	42	2023
Understanding the Effects of RLHF on LLM Generalisation and Diversity R Kirk, I Mediratta, C Nalmpantis, J Luketina, E Hambro, E Grefenstette, ... ICLR 2024, 2023	40	2023
Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks S Jain, R Kirk, ES Lubana, RP Dick, H Tanaka, E Grefenstette, ... arXiv preprint arXiv:2311.12786, 2023	26	2023
Insights from the neurips 2021 nethack challenge E Hambro, S Mohanty, D Babaev, M Byeon, D Chakraborty, ... NeurIPS 2021 Competitions and Demonstrations Track, 41-52, 2022	18	2022
Generalization to new sequential decision making tasks with in-context learning SC Raparthy, E Hambro, R Kirk, M Henaff, R Raileanu arXiv preprint arXiv:2312.03801, 2023	5	2023
A study of off-policy learning in environments with procedural content generation A Ehrenberg, R Kirk, M Jiang, E Grefenstette, T Rocktäschel ICLR Workshop on Agent Learning in Open-Endedness, 2022	5	2022
Graph backup: Data efficient backup exploiting markovian transitions Z Jiang, T Zhang, R Kirk, T Rocktäschel, E Grefenstette arXiv preprint arXiv:2205.15824, 2022	4*	2022
Leading the Pack: N-player Opponent Shaping A Souly, T Willi, A Khan, R Kirk, C Lu, E Grefenstette, T Rocktäschel arXiv preprint arXiv:2312.12564, 2023	1	2023
Domain Generalization for Robust Model-Based Offline Reinforcement Learning A Clark, SA Siddiqui, R Kirk, U Anwar, S Chung, D Krueger arXiv preprint arXiv:2211.14827, 2022	1	2022
Analyzing the Generalization and Reliability of Steering Vectors--ICML 2024 D Tan, D Chanin, A Lynch, D Kanoulas, B Paige, A Garriga-Alonso, R Kirk arXiv preprint arXiv:2407.12404, 2024		2024
Analyzing the Generalization and Reliability of Steering Vectors DCH Tan, D Chanin, A Lynch, A Garriga-Alonso, D Kanoulas, B Paige, ... ICML 2024 Workshop on Mechanistic Interpretability, 2024		2024
What Mechanisms Does Knowledge Distillation Distill? C Wu, ES Lubana, BK Mlodozeniec, R Kirk, D Krueger Proceedings of UniReps: the First Workshop on Unifying Representations in …, 2024		2024

Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.

Artikel 1–14

Zitate pro Jahr

Doppelte Zitate

Zusammengeführte Zitate

Koautor hinzufügenKoautoren

Folgen

Zitiert von

Koautoren