Human-level control through deep reinforcement learning V Mnih, K Kavukcuoglu, D Silver, AA Rusu, J Veness, MG Bellemare, ... nature 518 (7540), 529-533, 2015 | 13059 | 2015 |
Hybrid computing using a neural network with dynamic external memory A Graves, G Wayne, M Reynolds, T Harley, I Danihelka, ... Nature 538 (7626), 471-476, 2016 | 1070 | 2016 |
Rainbow: Combining improvements in deep reinforcement learning M Hessel, J Modayil, H Van Hasselt, T Schaul, G Ostrovski, W Dabney, ... Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018 | 807 | 2018 |
Unifying count-based exploration and intrinsic motivation M Bellemare, S Srinivasan, G Ostrovski, T Schaul, D Saxton, R Munos Advances in neural information processing systems, 1471-1479, 2016 | 683 | 2016 |
Count-based exploration with neural density models G Ostrovski, MG Bellemare, A Oord, R Munos arXiv preprint arXiv:1703.01310, 2017 | 277 | 2017 |
Implicit quantile networks for distributional reinforcement learning W Dabney, G Ostrovski, D Silver, R Munos arXiv preprint arXiv:1806.06923, 2018 | 125 | 2018 |
Recurrent experience replay in distributed reinforcement learning S Kapturowski, G Ostrovski, J Quan, R Munos, W Dabney International conference on learning representations, 2018 | 112 | 2018 |
Increasing the action gap: New operators for reinforcement learning MG Bellemare, G Ostrovski, A Guez, P Thomas, R Munos Proceedings of the AAAI Conference on Artificial Intelligence 30 (1), 2016 | 95 | 2016 |
Autoregressive quantile networks for generative modeling G Ostrovski, W Dabney, R Munos arXiv preprint arXiv:1806.05575, 2018 | 35 | 2018 |
Symmetric decomposition of asymmetric games K Tuyls, J Perolat, M Lanctot, G Ostrovski, R Savani, JZ Leibo, T Ord, ... Scientific Reports 8 (1), 1-20, 2018 | 21 | 2018 |
Piecewise linear hamiltonian flows associated to zero-sum games: Transition combinatorics and questions on ergodicity G Ostrovski, S van Strien Regular and Chaotic Dynamics 16 (1-2), 128-153, 2011 | 13 | 2011 |
Payoff performance of fictitious play G Ostrovski, S van Strien Journal of Dynamics and Games 1 (4), 621-638, 2014 | 12 | 2014 |
Payoff performance of fictitious play G Ostrovski, S van Strien arXiv preprint arXiv:1308.4049, 2013 | 12 | 2013 |
Bellemare Marc G, Alex Graves, Martin Riedmiller, Andreas K V Mnih, K Kavukcuoglu, D Silver, AA Rusu, J Veness Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik …, 2015 | 5 | 2015 |
Adapting behaviour for learning progress T Schaul, D Borsa, D Ding, D Szepesvari, G Ostrovski, W Dabney, ... arXiv preprint arXiv:1912.06910, 2019 | 4 | 2019 |
Temporally-Extended {\epsilon}-Greedy Exploration W Dabney, G Ostrovski, A Barreto arXiv preprint arXiv:2006.01782, 2020 | 2 | 2020 |
Topics arising from fictitious play dynamics G Ostrovski University of Warwick, 2013 | 2 | 2013 |
Dynamics of a continuous piecewise affine map of the square G Ostrovski Physica D: Nonlinear Phenomena 271, 1-9, 2014 | 1 | 2014 |
Fixed point theorem for non-self maps of regions in the plane G Ostrovski Topology and its Applications 160 (7), 915-923, 2013 | 1 | 2013 |
Distributional reinforcement learning using quantile function neural networks G Ostrovski, WC Dabney US Patent App. 16/767,046, 2020 | | 2020 |