Human-level performance in 3D multiplayer games with population-based reinforcement learning M Jaderberg, WM Czarnecki, I Dunning, L Marris, G Lever, AG Castaneda, ... Science 364 (6443), 859-865, 2019 | 919 | 2019 |
Neural scene representation and rendering SMA Eslami, D Jimenez Rezende, F Besse, F Viola, AS Morcos, ... Science 360 (6394), 1204-1210, 2018 | 700 | 2018 |
Model-free episodic control C Blundell, B Uria, A Pritzel, Y Li, A Ruderman, JZ Leibo, J Rae, ... arXiv preprint arXiv:1606.04460, 2016 | 261 | 2016 |
Rigorous agent evaluation: An adversarial approach to uncover catastrophic failures J Uesato, A Kumar, C Szepesvari, T Erez, A Ruderman, K Anderson, ... arXiv preprint arXiv:1812.01647, 2018 | 76 | 2018 |
Tighter variational representations of f-divergences via restriction to probability measures A Ruderman, M Reid, D García-García, J Petterson arXiv preprint arXiv:1206.4664, 2012 | 55 | 2012 |
Pooling is neither necessary nor sufficient for appropriate deformation stability in CNNs A Ruderman, NC Rabinowitz, AS Morcos, D Zoran arXiv preprint arXiv:1804.04438, 2018 | 54 | 2018 |
Model-free episodic control. arXiv preprint 1606.04460 C Blundell, B Uria, A Pritzel, Y Li, A Ruderman, JZ Leibo, J Rae, ... | 29 | 2016 |
Uncovering surprising behaviors in reinforcement learning via worst-case analysis A Ruderman, R Everett, B Sikder, H Soyer, J Uesato, A Kumar, C Beattie, ... | 13 | 2018 |
Learned deformation stability in convolutional neural networks A Ruderman, NC Rabinowitz, AS Morcos, D Zoran CoRR, abs/1804.04438, 2018 | 10 | 2018 |
On the behaviour of tests based on sample spacings for moderatesamples S Penev, A Ruderman Journal of Statistical Planning and Inference 141 (3), 1240-1249, 2011 | 2 | 2011 |
PSI Draft Specification M Reid, J Montgomery, B Drake, A Ruderman arXiv preprint arXiv:2205.09488, 2022 | | 2022 |