Intelligent Autonomous Systems Group, Technische Universität Darmstadt
Intelligent Autonomous Systems Group, Technische Universität Darmstadt
Verified email at - Homepage
Cited by
Cited by
Reinforcement learning in robotics: A survey
J Kober, JA Bagnell, J Peters
The International Journal of Robotics Research 32 (11), 1238-1274, 2013
PILCO: A model-based and data-efficient approach to policy search
M Deisenroth, CE Rasmussen
Proceedings of the 28th International Conference on machine learning (ICML …, 2011
Static and dynamic characteristics of McKibben pneumatic artificial muscles
CP Chou, B Hannaford
Proceedings of the 1994 IEEE international conference on robotics and …, 1994
Reinforcement learning of motor skills with policy gradients
J Peters, S Schaal
Neural networks 21 (4), 682-697, 2008
Policy search for motor primitives in robotics
J Kober, JR Peters
Advances in neural information processing systems, 849-856, 2009
Episodic future thinking reduces reward delay discounting through an enhancement of prefrontal-mediotemporal interactions
J Peters, C Büchel
Neuron 66 (1), 138-148, 2010
Natural actor-critic
J Peters, S Schaal
Neurocomputing 71 (7-9), 1180-1190, 2008
A survey on policy search for robotics
MP Deisenroth, G Neumann, J Peters
now publishers, 2013
Policy gradient methods for robotics
J Peters, S Schaal
2006 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2006
Nonlinear causal discovery with additive noise models
PO Hoyer, D Janzing, JM Mooij, J Peters, B Schölkopf
Advances in neural information processing systems, 689-696, 2009
Relative entropy policy search.
J Peters, K Mülling, Y Altun
AAAI 10, 1607-1612, 2010
Deep reinforcement learning: A brief survey
K Arulkumaran, MP Deisenroth, M Brundage, AA Bharath
IEEE Signal Processing Magazine 34 (6), 26-38, 2017
The neural mechanisms of inter-temporal decision-making: understanding variability
J Peters, C Büchel
Trends in cognitive sciences 15 (5), 227-239, 2011
Reinforcement learning for humanoid robotics
J Peters, S Vijayakumar, S Schaal
Proceedings of the third IEEE-RAS international conference on humanoid …, 2003
Gaussian processes for data-efficient learning in robotics and control
MP Deisenroth, D Fox, CE Rasmussen
IEEE transactions on pattern analysis and machine intelligence 37 (2), 408-423, 2013
Model learning for robot control: a survey
D Nguyen-Tuong, J Peters
Cognitive processing 12 (4), 319-340, 2011
Learning movement primitives
S Schaal, J Peters, J Nakanishi, A Ijspeert
Robotics research. the eleventh international symposium, 561-572, 2005
A brief survey of deep reinforcement learning
K Arulkumaran, MP Deisenroth, M Brundage, AA Bharath
arXiv preprint arXiv:1708.05866, 2017
Overlapping and distinct neural systems code for subjective value during intertemporal and risky decision making
J Peters, C Büchel
Journal of Neuroscience 29 (50), 15727-15734, 2009
Neural representations of subjective reward value
J Peters, C Büchel
Behavioural brain research 213 (2), 135-141, 2010
The system can't perform the operation now. Try again later.
Articles 1–20