Marc Lanctot

Cited by

	All	Since 2019
Citations	40064	33213
h-index	42	37
i10-index	67	60

8000

4000

2000

6000

20142015201620172018201920202021202220232024116 122 907 1806 3138 4362 5303 6139 6536 7164 3693

Public access

View all

5 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Thore GraepelGlobal Lead Computational Science, AI & ML at Altos Labs and Chair of Machine Learning, UCLVerified email at ucl.ac.uk
Karl TuylsFounder at H company, ex-Google DeepMind, Prof at University of LiverpoolVerified email at hcompany.ai
David SilverDeepMind, UCLVerified email at google.com
Michael BowlingAmii, University of AlbertaVerified email at ualberta.ca
Julian SchrittwieserDeepMindVerified email at furidamu.org
Laurent SifreGoogle DeepMindVerified email at polytechnique.edu
Arthur GuezGoogle DeepMindVerified email at google.com
Joel Z LeiboResearch scientistVerified email at google.com
julien perolatDeepMindVerified email at google.com
Timothy P. LillicrapDirector of Research, Google DeepMindVerified email at google.com
Audrūnas GruslysVerified email at gruslys.com
Chris J. MaddisonUniversity of TorontoVerified email at cs.toronto.edu
Aja HuangDeepMindVerified email at google.com
George van den DriesscheDeepMindVerified email at deepmind.com
Neil BurchSony AI & Alberta Machine Intelligence Institute, University of AlbertaVerified email at ualberta.ca
Vinicius ZambaldiGoogle DeepmindVerified email at google.com
Thomas HubertGoogle DeepmindVerified email at google.com
Rémi MunosGoogle DeepMindVerified email at inria.fr
Nal KalchbrennerGoogle DeepMindVerified email at google.com
koray kavukcuogluDeepMindVerified email at kavukcuoglu.org

Marc Lanctot

Research Scientist, Google DeepMind

Verified email at google.com - Homepage

Artificial Intelligence Game Theory Search Multiagent Systems Reinforcement Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Mastering the game of Go with deep neural networks and tree search D Silver, A Huang, CJ Maddison, A Guez, L Sifre, G Van Den Driessche, ... Nature 529 (7587), 484-489, 2016	19198	2016
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ... Science 362 (6419), 1140-1144, 2018	6518*	2018
Dueling Network Architectures for Deep Reinforcement Learning Z Wang, T Schaul, M Hessel, H van Hasselt, M Lanctot, N de Freitas arXiv preprint arXiv:1511.06581, 2016	4940	2016
Value-decomposition networks for cooperative multi-agent learning based on team reward P Sunehag, G Lever, A Gruslys, WM Czarnecki, V Zambaldi, M Jaderberg, ... Proceedings of the 17th international conference on autonomous agents and …, 2018	1703*	2018
Deep Q-learning from Demonstrations T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ... Association for the Advancement of Artificial Intelligence (AAAI), 2018	1227	2018
Multi-agent Reinforcement Learning in Sequential Social Dilemmas JZ Leibo, V Zambaldi, M Lanctot, J Marecki, T Graepel AAMAS, 2017	883	2017
A unified game-theoretic approach to multiagent reinforcement learning M Lanctot, V Zambaldi, A Gruslys, A Lazaridou, K Tuyls, J Pérolat, D Silver, ... arXiv preprint arXiv:1711.00832, 2017	733	2017
The hanabi challenge: A new frontier for ai research N Bard, JN Foerster, S Chandar, N Burch, M Lanctot, HF Song, E Parisotto, ... Artificial Intelligence 280, 103216, 2020	398	2020
Fictitious Self-Play in Extensive-Form Games J Heinrich, M Lanctot, D Silver International Conference on Machine Learning, 2015	377	2015
Monte Carlo sampling for regret minimization in extensive games M Lanctot, K Waugh, M Zinkevich, M Bowling Advances in neural information processing systems 22, 1078-1086, 2009	371	2009
OpenSpiel: A Framework for Reinforcement Learning in Games M Lanctot, E Lockhart, JB Lespiau, V Zambaldi, S Upadhyay, J Pérolat, ... arXiv preprint arXiv:1908.09453, 2019	258	2019
Memory-efficient backpropagation through time A Gruslys, R Munos, I Danihelka, M Lanctot, A Graves Advances In Neural Information Processing Systems, 4125-4133, 2016	257*	2016
Emergent Communication through Negotiation K Cao, A Lazaridou, M Lanctot, JZ Leibo, K Tuyls, S Clark arXiv preprint arXiv:1804.03980, 2018	184	2018
Mastering the game of Stratego with model-free multiagent reinforcement learning J Perolat, B De Vylder, D Hennes, E Tarassov, F Strub, V de Boer, ... Science 378 (6623), 990-996, 2022	172	2022
Actor-critic policy optimization in partially observable multiagent environments S Srinivasan, M Lanctot, V Zambaldi, J Pérolat, K Tuyls, R Munos, ... Advances in Neural Information Processing Systems, 3422-3435, 2018	162	2018
Convolution by evolution: Differentiable pattern producing networks C Fernando, D Banarse, M Reynolds, F Besse, D Pfau, M Jaderberg, ... Proceedings of the Genetic and Evolutionary Computation Conference 2016, 109-116, 2016	134	2016
α-Rank: Multi-Agent Evaluation by Evolution S Omidshafiei, C Papadimitriou, G Piliouras, K Tuyls, M Rowland, ... Scientific reports 9 (1), 9937, 2019	132	2019
Autocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for Multi-Agent Intelligence Research JZ Leibo, E Hughes, M Lanctot, T Graepel arXiv preprint arXiv:1903.00742, 2019	120	2019
Real-Time Monte-Carlo Tree Search in Ms Pac-Man T Pepels, MHM Winands, M Lanctot Transactions on Computation Intelligence and AI in Games, 2014	115	2014
Efficient Nash equilibrium approximation through Monte Carlo counterfactual regret minimization. M Johanson, N Bard, M Lanctot, RG Gibson, M Bowling AAMAS, 837-846, 2012	109	2012

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors