Boris Belousov
Senior Researcher at German Research Centre for Artificial Intelligence (DFKI GmbH)
Self-paced contextual reinforcement learning
P Klink, H Abdulsamad, B Belousov, J Peters
Conference on Robot Learning, 513-529, 2020
Entropic risk measure in policy search
D Nass, B Belousov, J Peters
2019 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2019
Catching heuristics are optimal control policies
B Belousov, G Neumann, CA Rothkopf, J Peters
NIPS, 1426-1434, 2016
Learn2assemble with structured representations and search for robotic architectural construction
N Funk, G Chalvatzaki, B Belousov, J Peters
Conference on Robot Learning, 1401-1411, 2022
Robotic architectural assembly with tactile skills: Simulation and optimization
B Belousov, B Wibranek, J Schneider, T Schneider, G Chalvatzaki, ...
Automation in Construction 133, 104006, 2022
HJB optimal feedback control with deep differential value functions and action constraints
M Lutter, B Belousov, K Listmann, D Clever, J Peters
Conference on Robot Learning, 640-650, 2020
Neural posterior domain randomization
F Muratore, T Gruner, F Wiese, B Belousov, M Gienger, J Peters
Conference on Robot Learning, 1532-1542, 2022
Entropic Regularization of Markov Decision Processes
B Belousov, J Peters
Entropy 21 (7), 2019
Reinforcement Learning Algorithms: Analysis and Applications
B Belousov, H Abdulsamad, P Klink, S Parisi, J Peters
Springer Nature, 2021
Receding horizon curiosity
M Schultheis, B Belousov, H Abdulsamad, J Peters
Conference on Robot Learning, 1278-1288, 2020
A probabilistic interpretation of self-paced learning with applications to reinforcement learning
P Klink, H Abdulsamad, B Belousov, C D'Eramo, J Peters, J Pajarinen
The Journal of Machine Learning Research 22 (1), 8201-8252, 2021
Building a Library of Tactile Skills Based on FingerVision
B Belousov, A Sadybakasov, B Wibranek, F Veiga, O Tessmann, J Peters
2019 IEEE-RAS 19th International Conference on Humanoid Robots (Humanoids), 2019
f-Divergence constrained policy improvement
B Belousov, J Peters
arXiv preprint arXiv:1801.00056, 2017
Reinforcement Learning for Sequential Assembly of SL-Blocks: Self-Interlocking Combinatorial Design Based on Machine Learning
B Wibranek, Y Liu, N Funk, B Belousov, J Peters, O Tessmann
Proceedings of the 39th eCAADe Conference 1, 27-36, 2021
How Crucial is Transformer in Decision Transformer?
M Siebenborn, B Belousov, J Huang, J Peters
arXiv preprint arXiv:2211.14655, 2022
Active inference for robotic manipulation
T Schneider, B Belousov, H Abdulsamad, J Peters
arXiv preprint arXiv:2206.10313, 2022
Distributionally Robust Trajectory Optimization Under Uncertain Dynamics via Relative Entropy Trust-Regions
H Abdulsamad, T Dorau, B Belousov, JJ Zhu, J Peters
arXiv preprint arXiv:2103.15388, 2021
Continuous-time fitted value iteration for robust policies
M Lutter, B Belousov, S Mannor, D Fox, A Garg, J Peters
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022
Interactive Structure: Robotic Repositioning of Vertical Elements in Man-Machine Collaborative Assembly through Vision-Based Tactile Sensing
B Wibranek, B Belousov, A Sadybakasov, J Peters, O Tessmann
37th eCAADe and 23rd SIGraDi Conference 2, 705-713, 2019
Evaluating the Robustness of HJB Optimal Feedback Control
M Lutter, D Clever, B Belousov, K Listmann, J Peters
ISR 2020; 52th International Symposium on Robotics, 1-8, 2020