Odalric-Ambrym Maillard

Cited by

	All	Since 2019
Citations	2813	1868
h-index	26	23
i10-index	54	50

440

220

110

330

200020012002200320042005200620072008200920102011201220132014201520162017201820192020202120222023202411 10 6 8 3 18 11 8 7 5 19 45 62 73 97 106 110 146 186 242 269 363 396 430 165

Public access

View all

46 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Rémi MunosDeepMindVerified email at inria.fr
Philippe PreuxProfessor of computer science, Université de Lille, LIFL, SequeL, INRIAVerified email at univ-lille.fr
Shie MannorProfessor of Electrical Engineering @ Technion & Researcher @ Nvidia ResearchVerified email at technion.ac.il
Olivier CappéCNRSVerified email at cnrs.fr
Alessandro LazaricResearch Scientist, Facebook Artificial Intelligence ResearchVerified email at inria.fr
Daniil RyabkoVerified email at ryabko.net
Rémi BardenetCNRS, CRIStAL, Ecole Centrale Lille, Univ. Lille, FranceVerified email at ec-lille.fr
Timothy A MannMetaVerified email at fb.com
Akram BaransiVerified email at tx.technion.ac.il
Nicolas VayatisFull Professor, Centre Borelli, Department of Mathematics, ENS Paris-SaclayVerified email at ens-paris-saclay.fr
Rémi CoulomUniversité Lille 3Verified email at univ-lille3.fr

Odalric-Ambrym Maillard

Inria Lille - Nord Europe

Verified email at inria.fr - Homepage

Multi-armed Bandits Stochastic Dynamical Systems Statistical Learning Reinforcement Learning Random matrices


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Kullback-Leibler upper confidence bounds for optimal sequential allocation O Cappé, A Garivier, OA Maillard, R Munos, G Stoltz The Annals of Statistics, 1516-1541, 2013	417	2013
Concentration inequalities for sampling without replacement R Bardenet, OA Maillard	188	2015
CATS, a low pressure multiwire proportionnal chamber for secondary beam tracking at GANIL S Ottini-Hustache, C Mazur, F Auger, A Musumarra, N Alamanos, ... Nuclear Instruments and Methods in Physics Research Section A: Accelerators …, 1999	167	1999
A finite-time analysis of multi-armed bandits problems with kullback-leibler divergences OA Maillard, R Munos, G Stoltz Proceedings of the 24th annual Conference On Learning Theory, 497-514, 2011	159	2011
Compressed least-squares regression OA Maillard, R Munos Advances in Neural Information Processing Systems, 2009	139	2009
Latent Bandits. OA Maillard, S Mannor International Conference on Machine Learning, 136-144, 2014	101	2014
The non-stationary stochastic multi-armed bandit problem R Allesiardo, R Féraud, OA Maillard International Journal of Data Science and Analytics 3, 267-283, 2017	84	2017
Robust risk-averse stochastic multi-armed bandits OA Maillard Algorithmic Learning Theory: 24th International Conference, ALT 2013 …, 2013	75	2013
LSTD with random projections M Ghavamzadeh, A Lazaric, OA Maillard, R Munos Advances in Neural Information Processing Systems 23, 721--729, 2010	72	2010
Variance-aware regret bounds for undiscounted reinforcement learning in mdps MS Talebi, OA Maillard Algorithmic Learning Theory, 770-805, 2018	70	2018
Sub-sampling for multi-armed bandits A Baransi, OA Maillard, S Mannor Machine Learning and Knowledge Discovery in Databases: European Conference …, 2014	64	2014
PICOSEC: Charged particle timing at sub-25 picosecond precision with a Micromegas based detector J Bortfeldt, F Brunbauer, C David, D Desforge, G Fanourakis, J Franchi, ... Nuclear Instruments and Methods in Physics Research Section A: Accelerators …, 2018	60	2018
Linear regression with random projections O Maillard, R Munos Journal of Machine Learning Research 13 (1), 2735-2772, 2012	60	2012
How hard is my MDP?" The distribution-norm to the rescue" OA Maillard, TA Mann, S Mannor Advances in Neural Information Processing Systems 27, 2014	58	2014
Online learning in adversarial lipschitz environments OA Maillard, R Munos Joint european conference on machine learning and knowledge discovery in …, 2010	52	2010
Finite-sample analysis of Bellman residual minimization OA Maillard, R Munos, A Lazaric, M Ghavamzadeh Proceedings of 2nd Asian Conference on Machine Learning, 299-314, 2010	46	2010
Selecting the state-representation in reinforcement learning OA Maillard, D Ryabko, R Munos Advances in Neural Information Processing Systems 24, 2011	45	2011
Adaptive Bandits: Towards the best history-dependent strategy OA Maillard, R Munos Proceedings of the Fourteenth International Conference on Artificial …, 2011	41*	2011
Optimal thompson sampling strategies for support-aware cvar bandits D Baudry, R Gautron, E Kaufmann, O Maillard International Conference on Machine Learning, 716-726, 2021	36	2021
Sequential change-point detection: Laplace concentration of scan statistics and non-asymptotic delay bounds OA Maillard Algorithmic Learning Theory, 610-632, 2019	35	2019

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors