Rémi Leblond
Research Scientist, DeepMind
Verified email at google.com - Homepage
Title
Cited by
Year
Grandmaster level in StarCraft II using multi-agent reinforcement learning
O Vinyals, I Babuschkin, WM Czarnecki, M Mathieu, A Dudzik, J Chung, ...
Nature 575 (7782), 350-354, 2019
Cited by: 1872; Year: 2019
ASAGA: asynchronous parallel SAGA
R Leblond, F Pedregosa, S Lacoste-Julien
AISTATS 2017, 2016
Cited by: 120; Year: 2016
Improved asynchronous parallel optimization analysis for stochastic incremental methods
R Leblond, F Pedregosa, S Lacoste-Julien
JMLR, 2018
Cited by: 50; Year: 2018
SEARNN: Training RNNs with global-local losses
R Leblond, JB Alayrac, A Osokin, S Lacoste-Julien
ICLR 2018, 2017
Cited by: 39; Year: 2017
Breaking the nonsmooth barrier: A scalable parallel method for composite optimization
F Pedregosa, R Leblond, S Lacoste-Julien
Advances in Neural Information Processing Systems 30, 2017
Cited by: 39; Year: 2017
OPtions as REsponses: Grounding Behavioural Hierarchies in Multi-Agent Reinforcement Learning
A Vezhnevets, Y Wu, M Eckstein, R Leblond, JZ Leibo
ICML, 2020
Cited by: 22*; Year: 2020
Machine Translation Decoding beyond Beam Search
R Leblond, JB Alayrac, L Sifre, M Pislar, JB Lespiau, I Antonoglou, ...
EMNLP 2021, 2021
Cited by: 14; Year: 2021
Cutoff phenomenon for the simple exclusion process on the complete graph
H Lacoin, R Leblond
ALEA 2011, 2010
Cited by: 13; Year: 2010
Competition-level code generation with AlphaCode
Y Li, D Choi, J Chung, N Kushman, J Schrittwieser, R Leblond, T Eccles, ...
arXiv preprint arXiv:2203.07814, 2022
Cited by: 11; Year: 2022
Asynchronous optimization for machine learning
R Leblond
Université Paris sciences et lettres, 2018
Cited by: 1; Year: 2018