Gabriel Dulac-Arnold
Google Research
Verified email at blacksheep.ai
Title
Cited by
Year
Learning from demonstrations for real world reinforcement learning
T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, A Sendonaris, ...
Cited by 554*, 2017
Deep reinforcement learning in large discrete action spaces
G Dulac-Arnold, R Evans, H van Hasselt, P Sunehag, T Lillicrap, J Hunt, ...
arXiv preprint arXiv:1512.07679, 2015
Cited by 259*, 2015
The predictron: End-to-end learning and planning
D Silver, H Hasselt, M Hessel, T Schaul, A Guez, T Harley, ...
International Conference on Machine Learning, 3191-3199, 2017
Cited by 193, 2017
Challenges of real-world reinforcement learning
G Dulac-Arnold, D Mankowitz, T Hester
ICML Workshop on Real-Life RL, 2019
Cited by 153, 2019
Datum-wise classification: a sequential approach to sparsity
G Dulac-Arnold, L Denoyer, P Preux, P Gallinari
Joint European Conference on Machine Learning and Knowledge Discovery in …, 2011
Cited by 39, 2011
Text classification: A sequential reading approach
G Dulac-Arnold, L Denoyer, P Gallinari
European Conference on Information Retrieval, 411-423, 2011
Cited by 35, 2011
Sequentially Generated Instance-Dependent Image Representations for Classification
M Cord, N Thome, L Denoyer, G Dulac-Arnold
ICLR, 2013
Cited by 21*, 2013
Challenges of Real-World Reinforcement Learning: Definitions, Benchmarks & Analysis
C Paduraru, DJ Mankowitz, G Dulac-Arnold, J Li, N Levine, S Gowal, ...
Machine Learning Journal, 2021
Cited by 20*, 2021
Sequential approaches for learning datum-wise sparse representations
G Dulac-Arnold, L Denoyer, P Preux, P Gallinari
Machine learning 89 (1-2), 87-122, 2012
Cited by 19, 2012
Deep reinforcement learning with attention for slate markov decision processes with high-dimensional states and actions
P Sunehag, R Evans, G Dulac-Arnold, Y Zwols, D Visentin, B Coppin
arXiv preprint arXiv:1512.01124, 2015
Cited by 18, 2015
Fast reinforcement learning with large action sets using error-correcting output codes for mdp factorization
G Dulac-Arnold, L Denoyer, P Preux, P Gallinari
Joint European Conference on Machine Learning and Knowledge Discovery in …, 2012
Cited by 16, 2012
RL Unplugged: A Collection of Benchmarks for Offline Reinforcement Learning
C Gulcehre, Z Wang, A Novikov, T Paine, S Gómez, K Zolna, R Agarwal, ...
Advances in Neural Information Processing Systems 33, 2020
Cited by 11*, 2020
Deep multi-class learning from label proportions
G Dulac-Arnold, N Zeghidour, M Cuturi, L Beyer, JP Vert
arXiv preprint arXiv:1905.12909, 2019
Cited by 9, 2019
Model-based offline planning
A Argenson, G Dulac-Arnold
ICLR 2021, 2020
Cited by 8, 2020
Differentiable deep clustering with cluster size constraints
A Genevay, G Dulac-Arnold, JP Vert
arXiv preprint arXiv:1910.09036, 2019
Cited by 8, 2019
Optimizing data center controls using neural networks
RA Evans, J Gao, MC Ryan, G Dulac-Arnold, JK Scholz, TA Hester
US Patent 10,643,121, 2020
Cited by 2, 2020
ICUBAM: ICU Bed Availability Monitoring and analysis in the Grand Est region of France during the COVID-19 epidemic.
L Bonnasse-Gahot, M Dénès, G Dulac-Arnold, S Girgin, F Husson, ...
Cited by 2, 2020
A General Sequential Model for Constrained Classification
G Dulac-Arnold
Sorbonne Université, 2014
Cited by 1, 2014
A Metric Space Perspective on Self-Supervised Policy Adaptation
C Bodnar, K Hausman, G Dulac-Arnold, R Jonschkowski
IEEE Robotics and Automation Letters, 2021
2021
Learning to run a Power Network Challenge: a Retrospective Analysis
A Marot, B Donnot, G Dulac-Arnold, A Kelly, A O'Sullivan, J Viebahn, ...
arXiv preprint arXiv:2103.03104, 2021
2021