Michel Tokic
Siemens AG, Munich
Verified email at tokic.com
Title · Cited by · Year
Adaptive ε-greedy exploration in reinforcement learning based on value differences
M Tokic
Annual Conference on Artificial Intelligence, 203-210, 2010
Cited by 158 · 2010
Value-difference based exploration: adaptive control between epsilon-greedy and softmax
M Tokic, G Palm
KI 2011: Advances in Artificial Intelligence, 335-346, 2011
Cited by 115 · 2011
The crawler, a class room demonstrator for reinforcement learning
M Tokic, W Ertel, J Fessler
Twenty-Second International FLAIRS Conference, 2009
Cited by 20 · 2009
The teaching-box: A universal robot learning framework
W Ertel, M Schneider, R Cubek, M Tokic
International Conference on Advanced Robotics (ICAR 2009), 1-6, 2009
Cited by 18 · 2009
A benchmark environment motivated by industrial control problems
D Hein, S Depeweg, M Tokic, S Udluft, A Hentschel, TA Runkler, ...
2017 IEEE Symposium Series on Computational Intelligence (SSCI), 1-8, 2017
Cited by 14 · 2017
Entwicklung eines lernenden Laufroboters
M Tokic
Diploma thesis, Hochschule Ravensburg-Weingarten, Doggenriedstrasse, 88250 …, 2006
Cited by 7 · 2006
Batch reinforcement learning on the industrial benchmark: First experiences
D Hein, S Udluft, M Tokic, A Hentschel, TA Runkler, V Sterzing
2017 International Joint Conference on Neural Networks (IJCNN), 4214-4221, 2017
Cited by 5 · 2017
Adaptive exploration using stochastic neurons
M Tokic, G Palm
International Conference on Artificial Neural Networks, 42-49, 2012
Cited by 5 · 2012
Teaching Reinforcement Learning Using a Physical Robot
M Tokic, H Bou Ammar
Proceedings of the Workshop on Teaching Machine Learning at the 29th …, 2012
Cited by 5 · 2012
Robust Exploration/Exploitation trade-offs in safety-critical applications
M Tokic, P Ertle, G Palm, D Söffker, H Voos
IFAC Proceedings Volumes 45 (20), 660-665, 2012
Cited by 5 · 2012
Towards Learning of Safety Knowledge from Human Demonstrations
P Ertle, M Tokic, R Cubek, H Voos, D Söffker
International Conference on Intelligent Robots and Systems (IROS), 1-6, 2012
Cited by 5 · 2012
Entwicklung eines lernfähigen Laufroboters
M Tokic
Diploma thesis, Hochschule Ravensburg-Weingarten, 2006. Including …, 2006
Cited by 5 · 2006
Meta-learning of exploration and exploitation parameters with replacing eligibility traces
M Tokic, F Schwenker, G Palm
IAPR International Workshop on Partially Supervised Learning, 68-79, 2013
Cited by 4 · 2013
Gradient algorithms for exploration/exploitation trade-offs: Global and local variants
M Tokic, G Palm
IAPR Workshop on Artificial Neural Networks in Pattern Recognition, 60-71, 2012
Cited by 4 · 2012
Reinforcement learning on a simple real walking robot
M Tokic, W Ertel, HP Radtke, J Akmal, W Krökel
Proceedings of the 29th Annual German Conference on Artificial Intelligence …, 2006
Cited by 3 · 2006
Introduction to the "Industrial Benchmark"
D Hein, A Hentschel, V Sterzing, M Tokic, S Udluft
arXiv preprint arXiv:1610.03793, 2016
Cited by 2 · 2016
Reinforcement Learning mit adaptiver Steuerung von Exploration und Exploitation
M Tokic
Universität Ulm, 2013
Cited by 2 · 2013
Reinforcement Learning: Psychologische und neurobiologische Aspekte
M Tokic
Künstliche Intelligenz 27 (3), 213-219, 2013
Cited by 2 · 2013
On an educational approach to behavior learning for robots
M Tokic, A Usadel, J Fessler, W Ertel
International Conference on Robotics in Education (RIE'2010), 2012
Cited by 2 · 2012
Reinforcement Learning an Robotern mit neuronalen Netzen
M Tokic
Publisher not identified, 2008
Cited by 2 · 2008