Abbas Abdolmaleki
DeepMind
Verified email at google.com
Title
Cited by
Year
Maximum a posteriori policy optimisation
A Abdolmaleki, JT Springenberg, Y Tassa, R Munos, N Heess, ...
arXiv preprint arXiv:1806.06920, 2018
Cited by: 293, Year: 2018
Deepmind control suite
Y Tassa, Y Doron, A Muldal, T Erez, Y Li, DL Casas, D Budden, ...
arXiv preprint arXiv:1801.00690, 2018
Cited by: 279, Year: 2018
Keep doing what worked: Behavioral modelling priors for offline reinforcement learning
NY Siegel, JT Springenberg, F Berkenkamp, A Abdolmaleki, M Neunert, ...
arXiv preprint arXiv:2002.08396, 2020
Cited by: 138, Year: 2020
Acme: A research framework for distributed reinforcement learning
M Hoffman, B Shahriari, J Aslanides, G Barth-Maron, F Behbahani, ...
arXiv preprint arXiv:2006.00979, 2020
Cited by: 113, Year: 2020
Model-based relative entropy stochastic search
A Abdolmaleki, R Lioutikov, JR Peters, N Lau, LP Reis, G Neumann
Advances in Neural Information Processing Systems 28, 2015
Cited by: 70, Year: 2015
Robust reinforcement learning for continuous control with model misspecification
DJ Mankowitz, N Levine, R Jeong, Y Shi, J Kay, A Abdolmaleki, ...
arXiv preprint arXiv:1906.07516, 2019
Cited by: 60, Year: 2019
V-MPO: On-policy maximum a posteriori policy optimization for discrete and continuous control
HF Song, A Abdolmaleki, JT Springenberg, A Clark, H Soyer, JW Rae, ...
arXiv preprint arXiv:1909.12238, 2019
Cited by: 59, Year: 2019
Magnetic control of tokamak plasmas through deep reinforcement learning
J Degrave, F Felici, J Buchli, M Neunert, B Tracey, F Carpanese, T Ewalds, ...
Nature 602 (7897), 414-419, 2022
Cited by: 58, Year: 2022
Relative entropy regularized policy iteration
A Abdolmaleki, JT Springenberg, J Degrave, S Bohez, Y Tassa, D Belov, ...
arXiv preprint arXiv:1812.02256, 2018
Cited by: 45, Year: 2018
Model-free trajectory optimization for reinforcement learning
R Akrour, G Neumann, H Abdulsamad, A Abdolmaleki
International Conference on Machine Learning, 2961-2970, 2016
Cited by: 42, Year: 2016
An optimized gait generator based on Fourier series towards fast and robust biped locomotion involving arms swing
N Shafii, A Khorsandian, A Abdolmaleki, B Jozi
2009 IEEE International Conference on Automation and Logistics, 2018-2023, 2009
Cited by: 39, Year: 2009
Value constrained model-free continuous control
S Bohez, A Abdolmaleki, M Neunert, J Buchli, N Heess, R Hadsell
arXiv preprint arXiv:1902.04623, 2019
Cited by: 38, Year: 2019
Continuous-discrete reinforcement learning for hybrid control in robotics
M Neunert, A Abdolmaleki, M Wulfmeier, T Lampe, T Springenberg, ...
Conference on Robot Learning, 735-751, 2020
Cited by: 37, Year: 2020
Deriving and improving CMA-ES with information geometric trust regions
A Abdolmaleki, B Price, N Lau, LP Reis, G Neumann
Proceedings of the Genetic and Evolutionary Computation Conference, 657-664, 2017
Cited by: 32, Year: 2017
Omnidirectional walking and active balance for soccer humanoid robot
N Shafii, A Abdolmaleki, R Ferreira, N Lau, LP Reis
Portuguese Conference on Artificial Intelligence, 283-294, 2013
Cited by: 31, Year: 2013
Imagined value gradients: Model-based policy optimization with transferable latent dynamics models
A Byravan, JT Springenberg, A Abdolmaleki, R Hafner, M Neunert, ...
Conference on Robot Learning, 566-589, 2020
Cited by: 29, Year: 2020
Learning a humanoid kick with controlled distance
A Abdolmaleki, D Simões, N Lau, LP Reis, G Neumann
Robot World Cup, 45-57, 2016
Cited by: 28, Year: 2016
A distributional view on multi-objective policy optimization
A Abdolmaleki, S Huang, L Hasenclever, M Neunert, F Song, M Zambelli, ...
International Conference on Machine Learning, 11-22, 2020
Cited by: 27, Year: 2020
Simultaneously learning vision and feature-based control policies for real-world ball-in-a-cup
D Schwab, T Springenberg, MF Martins, T Lampe, M Neunert, ...
arXiv preprint arXiv:1902.04706, 2019
Cited by: 24, Year: 2019
Guide actor-critic for continuous control
V Tangkaratt, A Abdolmaleki, M Sugiyama
arXiv preprint arXiv:1705.07606, 2017
Cited by: 21, Year: 2017