Gabriel Dulac-Arnold

Cited by

	All	Since 2019
Citations	5101	4725
h-index	21	19
i10-index	26	24

Citations per year
Year	2015	2016	2017	2018	2019	2020	2021	2022	2023	2024
Citations	14	22	74	212	306	593	809	972	1100	939

Public access

2 articles available
0 articles not available

Based on funding mandates

Co-authors

Todd Hester	Waymo	Verified email at waymo.com
Daniel J. Mankowitz	Google Deepmind	Verified email at google.com
Olivier Pietquin	Cohere | ex Google DeepMind (On leave - Professor at University of Lille)	Verified email at univ-lille.fr
Tom Schaul	Senior Staff Scientist, DeepMind	Verified email at nyu.edu
Cosmin Paduraru	DeepMind	Verified email at google.com
Ludovic Denoyer	Ubisoft, ex Facebook - FAIR -- Full Professor at Sorbonne Universités on Sabbatical	Verified email at ubisoft.com
Patrick Gallinari	Professor Sorbonne University / Criteo AI Lab	Verified email at sorbonne-universite.fr
Ian Osband	OpenAI	Verified email at openai.com
Mel Vecerik	DeepMind, University College London	Verified email at ucl.ac.uk
Thomas Degris	DeepMind	Verified email at google.com
Hado van Hasselt	Research Scientist, DeepMind; Honorary Professor, UCL	Verified email at google.com
Richard Evans	Research Fellow, University of Wolverhampton	Verified email at wlv.ac.uk
Peter Sunehag	Google - DeepMind	Verified email at google.com
Ben Coppin	Google DeepMind	Verified email at deepmind.com
Nir Levine	Research Engineer at DeepMind	Verified email at google.com
Sven Gowal	DeepMind	Verified email at deepmind.com
Marc Lanctot	Research Scientist, Google DeepMind	Verified email at google.com
Philippe Preux	Professor of computer science, Université de Lille, LIFL, SequeL, INRIA	Verified email at univ-lille.fr
Jean-Philippe Vert	Owkin / PSL University	Verified email at owkin.com
Richard Evans	Research Scientist, DeepMind	Verified email at google.com

Gabriel Dulac-Arnold

Google DeepMind

Verified email at squirrelsoup.net

Reinforcement Learning, Planning


Title	Authors	Venue	Cited by	Year
Learning from demonstrations for real world reinforcement learning	T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, A Sendonaris, ...	arXiv preprint arXiv:1704.03732, 2017	1460*	2017
Deep reinforcement learning in large discrete action spaces	G Dulac-Arnold, R Evans, H van Hasselt, P Sunehag, T Lillicrap, J Hunt, ...	arXiv preprint arXiv:1512.07679, 2015	758	2015
Challenges of real-world reinforcement learning	G Dulac-Arnold, D Mankowitz, T Hester	ICML Workshop on Real-Life RL, 2019	680	2019
Challenges of real-world reinforcement learning: definitions, benchmarks and analysis	G Dulac-Arnold, N Levine, DJ Mankowitz, J Li, C Paduraru, S Gowal, ...	Machine Learning 110 (9), 2419-2468, 2021	621*	2021
The predictron: End-to-end learning and planning	D Silver, H Hasselt, M Hessel, T Schaul, A Guez, T Harley, ...	International Conference on Machine Learning, 3191-3199, 2017	310	2017
Acme: A research framework for distributed reinforcement learning	MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ...	arXiv preprint arXiv:2006.00979, 2020	258	2020
Rl unplugged: A suite of benchmarks for offline reinforcement learning	C Gulcehre, Z Wang, A Novikov, T Paine, S Gómez, K Zolna, R Agarwal, ...	Advances in Neural Information Processing Systems 33, 7248-7259, 2020	209*	2020
Model-based offline planning	A Argenson, G Dulac-Arnold	ICLR 2021, 2020	149	2020
Learning to run a power network challenge: a retrospective analysis	A Marot, B Donnot, G Dulac-Arnold, A Kelly, A O’Sullivan, J Viebahn, ...	NeurIPS 2020 Competition and Demonstration Track, 112-132, 2021	73	2021
Datum-wise classification: a sequential approach to sparsity	G Dulac-Arnold, L Denoyer, P Preux, P Gallinari	Machine Learning and Knowledge Discovery in Databases: European Conference …, 2011	70	2011
AI-based mobile application to fight antibiotic resistance	M Pascucci, G Royer, J Adamek, MA Asmar, D Aristizabal, L Blanche, ...	Nature communications 12 (1), 1173, 2021	60	2021
Deep reinforcement learning with attention for slate markov decision processes with high-dimensional states and actions	P Sunehag, R Evans, G Dulac-Arnold, Y Zwols, D Visentin, B Coppin	arXiv preprint arXiv:1512.01124, 2015	59	2015
Differentiable deep clustering with cluster size constraints	A Genevay, G Dulac-Arnold, JP Vert	arXiv preprint arXiv:1910.09036, 2019	46	2019
Barkour: Benchmarking animal-level agility with quadruped robots	K Caluwaerts, A Iscen, JC Kew, W Yu, T Zhang, D Freeman, KH Lee, ...	arXiv preprint arXiv:2305.14654, 2023	40	2023
Text classification: A sequential reading approach	G Dulac-Arnold, L Denoyer, P Gallinari	European Conference on Information Retrieval, 411-423, 2011	40	2011
Deep multi-class learning from label proportions	G Dulac-Arnold, N Zeghidour, M Cuturi, L Beyer, JP Vert	arXiv preprint arXiv:1905.12909, 2019	39	2019
Fast reinforcement learning with large action sets using error-correcting output codes for mdp factorization	G Dulac-Arnold, L Denoyer, P Preux, P Gallinari	Joint European Conference on Machine Learning and Knowledge Discovery in …, 2012	28	2012
Robovqa: Multimodal long-horizon reasoning for robotics	P Sermanet, T Ding, J Zhao, F Xia, D Dwibedi, K Gopalakrishnan, C Chan, ...	2024 IEEE International Conference on Robotics and Automation (ICRA), 645-652, 2024	27	2024
Learning dynamics models for model predictive agents	M Lutter, L Hasenclever, A Byravan, G Dulac-Arnold, P Trochim, N Heess, ...	arXiv preprint arXiv:2109.14311, 2021	24	2021
Residual reinforcement learning from demonstrations	M Alakuijala, G Dulac-Arnold, J Mairal, J Ponce, C Schmid	arXiv preprint arXiv:2106.08050, 2021	24	2021
