Boris Belousov
Cited by
Cited by
Catching heuristics are optimal control policies
B Belousov, G Neumann, CA Rothkopf, J Peters
NIPS, 1426-1434, 2016
Self-paced contextual reinforcement learning
P Klink, H Abdulsamad, B Belousov, J Peters
Conference on Robot Learning, 513-529, 2020
Entropic Regularization of Markov Decision Processes
B Belousov, J Peters
Entropy 21 (7), 2019
HJB optimal feedback control with deep differential value functions and action constraints
M Lutter, B Belousov, K Listmann, D Clever, J Peters
Conference on Robot Learning, 640-650, 2020
Building a Library of Tactile Skills Based on FingerVision
B Belousov, A Sadybakasov, B Wibranek, F Veiga, O Tessmann, J Peters
2019 IEEE-RAS 19th International Conference on Humanoid Robots (Humanoids), 2019
f-Divergence constrained policy improvement
B Belousov, J Peters
arXiv preprint arXiv:1801.00056, 2017
Entropic risk measure in policy search
D Nass, B Belousov, J Peters
2019 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2019
Reinforcement Learning Algorithms: Analysis and Applications
B Belousov, H Abdulsamad, P Klink, S Parisi, J Peters
Springer Nature, 2021
Receding horizon curiosity
M Schultheis, B Belousov, H Abdulsamad, J Peters
Conference on Robot Learning, 1278-1288, 2020
Interactive Structure: Robotic Repositioning of Vertical Elements in Man-Machine Collaborative Assembly through Vision-Based Tactile Sensing
B Wibranek, B Belousov, A Sadybakasov, J Peters, O Tessmann
37th eCAADe and 23rd SIGraDi Conference 2, 705-713, 2019
Evaluating the Robustness of HJB Optimal Feedback Control
M Lutter, D Clever, B Belousov, K Listmann, J Peters
ISR 2020; 52th International Symposium on Robotics, 1-8, 2020
Underactuated Waypoint Trajectory Optimization for Light Painting Photography
C Eilers, J Eschmann, R Menzenbach, B Belousov, F Muratore, J Peters
2020 IEEE International Conference on Robotics and Automation (ICRA), 1505-1510, 2020
Belief space model predictive control for approximately optimal system identification
B Belousov, H Abdulsamad, M Schultheis, J Peters
4th Multidisciplinary Conference on Reinforcement Learning and Decision Making, 2019
Mean squared advantage minimization as a consequence of entropic policy improvement regularization
B Belousov, J Peters
14th European Workshop on Reinforcement Learning, 2018
Continuous-Time Fitted Value Iteration for Robust Policies
M Lutter, B Belousov, S Mannor, D Fox, A Garg, J Peters
arXiv preprint arXiv:2110.01954, 2021
Learn2Assemble with Structured Representations and Search for Robotic Architectural Construction
N Funk, G Chalvatzaki, B Belousov, J Peters
5th Annual Conference on Robot Learning, 2021
Neural Posterior Domain Randomization
F Muratore, T Gruner, F Wiese, B Belousov, M Gienger, J Peters
5th Annual Conference on Robot Learning, 2021
Distributionally Robust Trajectory Optimization Under Uncertain Dynamics via Relative-Entropy Trust Regions
H Abdulsamad, T Dorau, B Belousov, JJ Zhu, J Peters
arXiv preprint arXiv:2103.15388, 2021
A Probabilistic Interpretation of Self-Paced Learning with Applications to Reinforcement Learning
P Klink, H Abdulsamad, B Belousov, C D'Eramo, J Peters, J Pajarinen
arXiv preprint arXiv:2102.13176, 2021
Reinforcement Learning for Sequential Assembly of SL-Blocks: Self-Interlocking Combinatorial Design Based on Machine Learning
B Wibranek, Y Liu, N Funk, B Belousov, J Peters, O Tessmann
Proceedings of the 39th eCAADe Conference 1, 27-36, 2021
The system can't perform the operation now. Try again later.
Articles 1–20