Marc Lanctot

Cited by

	All	Since 2019
Citations	38558	31719
h-index	39	36
i10-index	66	59

Citations per year: 2014: 116; 2015: 122; 2016: 910; 2017: 1812; 2018: 3160; 2019: 4384; 2020: 5336; 2021: 6186; 2022: 6579; 2023: 7196; 2024: 2028

Public access

5 articles available, 0 articles not available (based on funding mandates)

Co-authors

Thore Graepel	Global Lead Computational Science, AI & ML at Altos Labs and Chair of Machine Learning, UCL	Verified email at ucl.ac.uk
Karl Tuyls	Research Scientist, Google DeepMind and Professor of computer science, University of Liverpool	Verified email at google.com
David Silver	DeepMind, UCL	Verified email at google.com
Michael Bowling	University of Alberta	Verified email at ualberta.ca
Laurent Sifre	Google DeepMind	Verified email at polytechnique.edu
Arthur Guez	Google DeepMind	Verified email at google.com
Julian Schrittwieser	DeepMind	Verified email at furidamu.org
Joel Z Leibo	Research scientist	Verified email at google.com
Julien Perolat	DeepMind	Verified email at google.com
Timothy P. Lillicrap	Director of Research, Google DeepMind	Verified email at google.com
Audrūnas Gruslys		Verified email at gruslys.com
Chris J. Maddison	University of Toronto	Verified email at cs.toronto.edu
Aja Huang	DeepMind	Verified email at google.com
George van den Driessche	DeepMind	Verified email at deepmind.com
Neil Burch	Sony AI & Alberta Machine Intelligence Institute, University of Alberta	Verified email at ualberta.ca
Vinicius Zambaldi	Google DeepMind	Verified email at google.com
Thomas Hubert	Google DeepMind	Verified email at google.com
Rémi Munos	DeepMind	Verified email at inria.fr
Nal Kalchbrenner	Google DeepMind	Verified email at google.com
Koray Kavukcuoglu	DeepMind	Verified email at kavukcuoglu.org

Marc Lanctot

Research Scientist, Google DeepMind

Verified email at google.com

Artificial Intelligence, Game Theory, Search, Multiagent Systems, Reinforcement Learning


Title	Authors	Venue	Cited by	Year
Mastering the game of Go with deep neural networks and tree search	D Silver, A Huang, CJ Maddison, A Guez, L Sifre, G Van Den Driessche, ...	Nature 529 (7587), 484-489, 2016	18643	2016
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play	D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ...	Science 362 (6419), 1140-1144, 2018	6200*	2018
Dueling Network Architectures for Deep Reinforcement Learning	Z Wang, T Schaul, M Hessel, H van Hasselt, M Lanctot, N de Freitas	arXiv preprint arXiv:1511.06581, 2016	4754	2016
Value-decomposition networks for cooperative multi-agent learning based on team reward	P Sunehag, G Lever, A Gruslys, WM Czarnecki, V Zambaldi, M Jaderberg, ...	Proceedings of the 17th international conference on autonomous agents and …, 2018	1586*	2018
Deep Q-learning from Demonstrations	T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ...	Association for the Advancement of Artificial Intelligence (AAAI), 2018	1172	2018
Multi-agent Reinforcement Learning in Sequential Social Dilemmas	JZ Leibo, V Zambaldi, M Lanctot, J Marecki, T Graepel	AAMAS, 2017	895	2017
A unified game-theoretic approach to multiagent reinforcement learning	M Lanctot, V Zambaldi, A Gruslys, A Lazaridou, K Tuyls, J Pérolat, D Silver, ...	arXiv preprint arXiv:1711.00832, 2017	709	2017
The Hanabi challenge: A new frontier for AI research	N Bard, JN Foerster, S Chandar, N Burch, M Lanctot, HF Song, E Parisotto, ...	Artificial Intelligence 280, 103216, 2020	373	2020
Fictitious Self-Play in Extensive-Form Games	J Heinrich, M Lanctot, D Silver	International Conference on Machine Learning, 2015	365	2015
Monte Carlo sampling for regret minimization in extensive games	M Lanctot, K Waugh, M Zinkevich, M Bowling	Advances in Neural Information Processing Systems 22, 1078-1086, 2009	360	2009
Memory-efficient backpropagation through time	A Gruslys, R Munos, I Danihelka, M Lanctot, A Graves	Advances in Neural Information Processing Systems, 4125-4133, 2016	246*	2016
OpenSpiel: A Framework for Reinforcement Learning in Games	M Lanctot, E Lockhart, JB Lespiau, V Zambaldi, S Upadhyay, J Pérolat, ...	arXiv preprint arXiv:1908.09453, 2019	235	2019
Emergent Communication through Negotiation	K Cao, A Lazaridou, M Lanctot, JZ Leibo, K Tuyls, S Clark	arXiv preprint arXiv:1804.03980, 2018	177	2018
Actor-critic policy optimization in partially observable multiagent environments	S Srinivasan, M Lanctot, V Zambaldi, J Pérolat, K Tuyls, R Munos, ...	Advances in Neural Information Processing Systems, 3422-3435, 2018	158	2018
Mastering the game of Stratego with model-free multiagent reinforcement learning	J Perolat, B De Vylder, D Hennes, E Tarassov, F Strub, V de Boer, ...	Science 378 (6623), 990-996, 2022	145	2022
Convolution by evolution: Differentiable pattern producing networks	C Fernando, D Banarse, M Reynolds, F Besse, D Pfau, M Jaderberg, ...	Proceedings of the Genetic and Evolutionary Computation Conference 2016, 109-116, 2016	135	2016
α-Rank: Multi-Agent Evaluation by Evolution	S Omidshafiei, C Papadimitriou, G Piliouras, K Tuyls, M Rowland, ...	Scientific Reports 9 (1), 9937, 2019	121	2019
Autocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for Multi-Agent Intelligence Research	JZ Leibo, E Hughes, M Lanctot, T Graepel	arXiv preprint arXiv:1903.00742, 2019	113	2019
Real-Time Monte-Carlo Tree Search in Ms Pac-Man	T Pepels, MHM Winands, M Lanctot	Transactions on Computational Intelligence and AI in Games, 2014	113	2014
Efficient Nash equilibrium approximation through Monte Carlo counterfactual regret minimization	M Johanson, N Bard, M Lanctot, RG Gibson, M Bowling	AAMAS, 837-846, 2012	108	2012
