Siqi Liu
Siqi Liu
DeepMind, University College of London
Adresse e-mail validée de - Page d'accueil
Citée par
Citée par
Improved image captioning via policy gradient optimization of spider
S Liu, Z Zhu, N Ye, S Guadarrama, K Murphy
Proceedings of the IEEE international conference on computer vision, 873-881, 2017
Emergent Coordination Through Competition
S Liu, G Lever, J Merel, S Tunyasuvunakool, N Heess, T Graepel
International Conference on Learning Representations (ICLR 2019), 2019
Hierarchical visuomotor control of humanoids
J Merel, A Ahuja, V Pham, S Tunyasuvunakool, S Liu, D Tirumala, ...
International Conference on Learning Representations (ICLR 2019), 2018
Observational learning by reinforcement learning
D Borsa, B Piot, R Munos, O Pietquin
arXiv preprint arXiv:1706.06617, 2017
V-MPO: On-policy maximum a posteriori policy optimization for discrete and continuous control
HF Song, A Abdolmaleki, JT Springenberg, A Clark, H Soyer, JW Rae, ...
arXiv preprint arXiv:1909.12238, 2019
dm_control: Software and tasks for continuous control
S Tunyasuvunakool, A Muldal, Y Doron, S Liu, S Bohez, J Merel, T Erez, ...
Software Impacts 6, 100022, 2020
A generalized training approach for multiagent learning
P Muller, S Omidshafiei, M Rowland, K Tuyls, J Perolat, S Liu, D Hennes, ...
arXiv preprint arXiv:1909.12823, 2019
Reinforcement learning agents acquire flocking and symbiotic behaviour in simulated ecosystems
P Sunehag, G Lever, S Liu, J Merel, N Heess, JZ Leibo, E Hughes, ...
ALIFE 2019: The 2019 Conference on Artificial Life, 103-110, 2019
The body is not a given: Joint agent policy learning and morphology evolution
D Banarse, Y Bachrach, S Liu, C Fernando, N Heess, P Kohli, G Lever, ...
Transferring task goals via hierarchical reinforcement learning
S Xie, A Galashov, S Liu, S Hou, R Pascanu, N Heess, YW Teh
From Motor Control to Team Play in Simulated Humanoid Football
S Liu, G Lever, Z Wang, J Merel, SM Eslami, D Hennes, WM Czarnecki, ...
arXiv preprint arXiv:2105.12196, 2021
Pick Your Battles: Interaction Graphs as Population-Level Objectives for Strategic Diversity
M Garnelo, WM Czarnecki, S Liu, D Tirumala, J Oh, G Gidel, ...
Proceedings of the 20th International Conference on Autonomous Agents and …, 2021
Launchpad: A Programming Model for Distributed Machine Learning Research
F Yang, G Barth-Maron, P Stańczyk, M Hoffman, S Liu, M Kroiss, A Pope, ...
arXiv preprint arXiv:2106.04516, 2021
Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.
Articles 1–13