Follow
Odalric-Ambrym Maillard
Odalric-Ambrym Maillard
Inria Lille - Nord Europe
Verified email at inria.fr - Homepage
Title
Cited by
Cited by
Year
Kullback-Leibler upper confidence bounds for optimal sequential allocation
O Cappé, A Garivier, OA Maillard, R Munos, G Stoltz
The Annals of Statistics, 1516-1541, 2013
4112013
Concentration inequalities for sampling without replacement
R Bardenet, OA Maillard
1762015
CATS, a low pressure multiwire proportionnal chamber for secondary beam tracking at GANIL
S Ottini-Hustache, C Mazur, F Auger, A Musumarra, N Alamanos, ...
Nuclear Instruments and Methods in Physics Research Section A: Accelerators …, 1999
1631999
A finite-time analysis of multi-armed bandits problems with kullback-leibler divergences
OA Maillard, R Munos, G Stoltz
Proceedings of the 24th annual Conference On Learning Theory, 497-514, 2011
1582011
Compressed least-squares regression
OA Maillard, R Munos
Advances in Neural Information Processing Systems, 2009
1352009
Latent Bandits.
OA Maillard, S Mannor
International Conference on Machine Learning, 136-144, 2014
982014
The non-stationary stochastic multi-armed bandit problem
R Allesiardo, R Féraud, OA Maillard
International Journal of Data Science and Analytics 3, 267-283, 2017
752017
Robust risk-averse stochastic multi-armed bandits
OA Maillard
Algorithmic Learning Theory: 24th International Conference, ALT 2013 …, 2013
742013
LSTD with random projections
M Ghavamzadeh, A Lazaric, OA Maillard, R Munos
Advances in Neural Information Processing Systems 23, 721--729, 2010
742010
Variance-aware regret bounds for undiscounted reinforcement learning in mdps
MS Talebi, OA Maillard
Algorithmic Learning Theory, 770-805, 2018
672018
Sub-sampling for multi-armed bandits
A Baransi, OA Maillard, S Mannor
Machine Learning and Knowledge Discovery in Databases: European Conference …, 2014
602014
Linear regression with random projections
O Maillard, R Munos
Journal of Machine Learning Research 13 (1), 2735-2772, 2012
602012
How hard is my MDP?" The distribution-norm to the rescue"
OA Maillard, TA Mann, S Mannor
Advances in Neural Information Processing Systems 27, 2014
572014
PICOSEC: Charged particle timing at sub-25 picosecond precision with a Micromegas based detector
J Bortfeldt, F Brunbauer, C David, D Desforge, G Fanourakis, J Franchi, ...
Nuclear Instruments and Methods in Physics Research Section A: Accelerators …, 2018
532018
Online learning in adversarial lipschitz environments
OA Maillard, R Munos
Joint european conference on machine learning and knowledge discovery in …, 2010
462010
Selecting the state-representation in reinforcement learning
OA Maillard, D Ryabko, R Munos
Advances in Neural Information Processing Systems 24, 2011
452011
Finite-sample analysis of Bellman residual minimization
OA Maillard, R Munos, A Lazaric, M Ghavamzadeh
Proceedings of 2nd Asian Conference on Machine Learning, 299-314, 2010
452010
Streaming kernel regression with provably adaptive mean, variance, and regularization
A Durand, OA Maillard, J Pineau
The Journal of Machine Learning Research 19 (1), 650-683, 2018
432018
Adaptive Bandits: Towards the best history-dependent strategy
OA Maillard, R Munos
Proceedings of the Fourteenth International Conference on Artificial …, 2011
41*2011
Sequential change-point detection: Laplace concentration of scan statistics and non-asymptotic delay bounds
OA Maillard
Algorithmic Learning Theory, 610-632, 2019
342019
The system can't perform the operation now. Try again later.
Articles 1–20