Michel Tokic
Michel Tokic
Siemens AG, Munich
Bestätigte E-Mail-Adresse bei tokic.com - Startseite
TitelZitiert vonJahr
Adaptive ε-greedy exploration in reinforcement learning based on value differences
M Tokic
Annual Conference on Artificial Intelligence, 203-210, 2010
1232010
Value-difference based exploration: adaptive control between epsilon-greedy and softmax
M Tokic, G Palm
KI 2011: Advances in Artificial Intelligence, 335-346, 2011
982011
The Crawler, A Class Room Demonstrator for Reinforcement Learning.
M Tokic, W Ertel, J Fessler
FLAIRS Conference, 2471-2482, 2009
192009
The teaching-box: A universal robot learning framework
W Ertel, M Schneider, R Cubek, M Tokicy
Advanced Robotics, 2009. ICAR 2009. International Conference on, 1-6, 2009
162009
A benchmark environment motivated by industrial control problems
D Hein, S Depeweg, M Tokic, S Udluft, A Hentschel, TA Runkler, ...
Computational Intelligence (SSCI), 2017 IEEE Symposium Series on, 1-8, 2017
102017
Entwicklung eines lernenden laufroboters
M Tokic
Diplomarbeit, Hochschule Ravensburg-Weingarten, Doggenriedstrasse, 88250 …, 2006
72006
Adaptive exploration using stochastic neurons
M Tokic, G Palm
International Conference on Artificial Neural Networks, 42-49, 2012
62012
Batch reinforcement learning on the industrial benchmark: First experiences
D Hein, S Udluft, M Tokic, A Hentschel, TA Runkler, V Sterzing
arXiv preprint arXiv:1705.07262, 2017
52017
Teaching Reinforcement Learning Using a Physical Robot
M Tokic, H Bou Ammar
Proceedings of the Workshop on Teaching Machine Learning at the 29th …, 2012
52012
Robust Exploration/Exploitation trade-offs in safety-critical applications
M Tokic, P Ertle, G Palm, D Söffker, H Voos
IFAC Proceedings Volumes 45 (20), 660-665, 2012
52012
Towards Learning of Safety Knowledge from Human Demonstrations
P Ertle, M Tokic, R Cubek, H Voos, D Söffker
International Conference on Intelligent Robots and Systems (IROS), 1-6, 2012
52012
Meta-learning of exploration and exploitation parameters with replacing eligibility traces
M Tokic, F Schwenker, G Palm
IAPR International Workshop on Partially Supervised Learning, 68-79, 2013
42013
Gradient algorithms for exploration/exploitation trade-offs: Global and local variants
M Tokic, G Palm
IAPR Workshop on Artificial Neural Networks in Pattern Recognition, 60-71, 2012
42012
Reinforcement learning on a simple real walking robot
M Tokic, W Ertel, HP Radtke, J Akmal, W Krökel
Proceedings of the 29th Annual German Conference on Artificial Intelligence …, 2006
32006
Entwicklung eines lernfähigen Laufroboters
M Tokic
Diplomarbeit Hochschule Ravensburg-Weingarten, 2006. Inklusive …, 2006
32006
Introduction to the" Industrial Benchmark"
D Hein, A Hentschel, V Sterzing, M Tokic, S Udluft
arXiv preprint arXiv:1610.03793, 2016
22016
Reinforcement Learning mit adaptiver Steuerung von Exploration und Exploitation
M Tokic
Universität Ulm, 2013
22013
On an educational approach to behavior learning for robots
M Tokic, A Usadel, J Fessler, W Ertel
International Conference on Robotics in Education (RIE'2010), 2012
22012
Reinforcement Learning an Robotern mit neuronalen Netzen
M Tokic
Hochschule Ravensburg-Weingarten, Masterthesis, GERMANY, 2008
22008
Work In Progress: Programming in a Confined Space–A Case Study in Porting Modern Robot Software to an Antique Platform
SL Montresor, JS Kay, M Tokic, JM Summerton
Frontieres in Education 2011, 2011
12011
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–20