Follow
Matthieu Geist
Matthieu Geist
Google Research, Brain Team (on leave of Professor, Université de Lorraine)
Verified email at univ-lorraine.fr
Title
Cited by
Cited by
Year
A theory of regularized markov decision processes
M Geist, B Scherrer, O Pietquin
International Conference on Machine Learning, 2160-2169, 2019
1442019
Human activity recognition using recurrent neural networks
D Singh, E Merdivan, I Psychoula, J Kropf, S Hanke, M Geist, A Holzinger
International cross-domain conference for machine learning and knowledge …, 2017
1422017
What matters for on-policy deep actor-critic methods? a large-scale study
M Andrychowicz, A Raichuk, P Stańczyk, M Orsini, S Girgin, R Marinier, ...
International conference on learning representations, 2020
123*2020
Kalman temporal differences
M Geist, O Pietquin
Journal of artificial intelligence research 39, 483-532, 2010
1132010
Sample-efficient batch reinforcement learning for dialogue management optimization
O Pietquin, M Geist, S Chandramohan, H Frezza-Buet
ACM Transactions on Speech and Language Processing (TSLP) 7 (3), 1-21, 2011
1102011
Approximate modified policy iteration and its application to the game of Tetris.
B Scherrer, M Ghavamzadeh, V Gabillon, B Lesner, M Geist
J. Mach. Learn. Res. 16 (49), 1629-1676, 2015
1062015
Algorithmic survey of parametric value function approximation
M Geist, O Pietquin
IEEE Transactions on Neural Networks and Learning Systems 24 (6), 845-867, 2013
106*2013
User simulation in dialogue systems using inverse reinforcement learning
S Chandramohan, M Geist, F Lefevre, O Pietquin
Twelfth annual conference of the international speech communication association, 2011
1032011
Inverse reinforcement learning through structured classification
E Klein, M Geist, B Piot, O Pietquin
Advances in neural information processing systems 25, 2012
972012
Off-policy learning with eligibility traces: a survey.
M Geist, B Scherrer
J. Mach. Learn. Res. 15 (1), 289-333, 2014
892014
Bridging the gap between imitation learning and inverse reinforcement learning
B Piot, M Geist, O Pietquin
IEEE transactions on neural networks and learning systems 28 (8), 1814-1826, 2016
702016
Laugh-aware virtual agent and its impact on user amusement
R Niewiadomski, J Hofmann, J Urbain, T Platt, J Wagner, P Bilal, T Ito, ...
University of Zurich, 2013
652013
Boosted bellman residual minimization handling expert demonstrations
B Piot, M Geist, O Pietquin
Joint European Conference on machine learning and knowledge discovery in …, 2014
622014
A comprehensive reinforcement learning framework for dialogue management optimization
L Daubigney, M Geist, S Chandramohan, O Pietquin
IEEE Journal of Selected Topics in Signal Processing 6 (8), 891-902, 2012
602012
A cascaded supervised learning approach to inverse reinforcement learning
E Klein, B Piot, M Geist, O Pietquin
Joint European conference on machine learning and knowledge discovery in …, 2013
542013
Leverage the average: an analysis of KL regularization in reinforcement learning
N Vieillard, T Kozuno, B Scherrer, O Pietquin, R Munos, M Geist
Advances in Neural Information Processing Systems 33, 12163-12174, 2020
53*2020
Approximate modified policy iteration
B Scherrer, V Gabillon, M Ghavamzadeh, M Geist
arXiv preprint arXiv:1205.3054, 2012
532012
Convolutional and recurrent neural networks for activity recognition in smart environment
D Singh, E Merdivan, S Hanke, J Kropf, M Geist, A Holzinger
Towards integrative machine learning and knowledge extraction, 194-205, 2017
502017
Sample efficient on-line learning of optimal dialogue policies with kalman temporal differences
O Pietquin, M Geist, S Chandramohan
Twenty-Second International Joint Conference on Artificial Intelligence, 2011
472011
Managing uncertainty within the ktd framework
M Geist, O Pietquin
Proceedings of the Workshop on Active Learning and Experimental Design (AL&E …, 2010
42*2010
The system can't perform the operation now. Try again later.
Articles 1–20