Recurrent experience replay in distributed reinforcement learning S Kapturowski, G Ostrovski, J Quan, R Munos, W Dabney International conference on learning representations, 2018 | 132 | 2018 |
Agent57: Outperforming the atari human benchmark AP Badia, B Piot, S Kapturowski, P Sprechmann, A Vitvitskyi, ZD Guo, ... International Conference on Machine Learning, 507-517, 2020 | 78 | 2020 |
Never give up: Learning directed exploration strategies AP Badia, P Sprechmann, A Vitvitskyi, D Guo, B Piot, S Kapturowski, ... arXiv preprint arXiv:2002.06038, 2020 | 24 | 2020 |
Making efficient use of demonstrations to solve hard exploration problems TL Paine, C Gulcehre, B Shahriari, M Denil, M Hoffman, H Soyer, ... arXiv preprint arXiv:1909.01387, 2019 | 15 | 2019 |
Never give up: Learning directed exploration strategies A Puigdomènech Badia, P Sprechmann, A Vitvitskyi, D Guo, B Piot, ... arXiv e-prints, arXiv: 2002.06038, 2020 | 6 | 2020 |
Agent57: Outperforming the Atari Human Benchmark A Puigdomènech Badia, B Piot, S Kapturowski, P Sprechmann, ... arXiv e-prints, arXiv: 2003.13350, 2020 | 3 | 2020 |
Value-driven hindsight modelling A Guez, F Viola, T Weber, L Buesing, S Kapturowski, D Precup, D Silver, ... arXiv preprint arXiv:2002.08329, 2020 | 2 | 2020 |
Temporal Difference Uncertainties as a Signal for Exploration S Flennerhag, JX Wang, P Sprechmann, F Visin, A Galashov, ... arXiv preprint arXiv:2010.02255, 2020 | 1 | 2020 |
Revisiting Peng's Q() for Modern Reinforcement Learning T Kozuno, Y Tang, M Rowland, R Munos, S Kapturowski, W Dabney, ... arXiv preprint arXiv:2103.00107, 2021 | | 2021 |
Coverage as a Principle for Discovering Transferable Behavior in Reinforcement Learning V Campos, P Sprechmann, S Hansen, A Barreto, S Kapturowski, ... arXiv preprint arXiv:2102.13515, 2021 | | 2021 |
Jointly learning exploratory and non-exploratory action selection policies AP Badia, P Sprechmann, A Vitvitskyi, Z Guo, B Piot, SJ Kapturowski, ... US Patent App. 16/881,180, 2020 | | 2020 |