Sergio Gómez Colmenarejo

Cited by

	All	Since 2019
Citations	7287	6125
h-index	19	19
i10-index	19	19

Citations per year
Year	Citations
2016	88
2017	355
2018	666
2019	851
2020	946
2021	1132
2022	1193
2023	1519
2024	478

Public access
Based on funding mandates: 3 articles available, 0 articles not available.

Co-authors

Nando de Freitas	CIFAR & DeepMind	Verified email at google.com
Misha Denil	DeepMind	Verified email at google.com
Matthew W. Hoffman	Google DeepMind	Verified email at google.com
Caglar Gulcehre	Prof at EPFL, Consultant@Google DeepMind, ex-Staff Research Scientist@Google DeepMind, PhD@MILA	Verified email at google.com
Marcin Andrychowicz	Google Brain	Verified email at openai.com
Tom Schaul	Senior Staff Scientist, DeepMind	Verified email at nyu.edu
Koray Kavukcuoglu	DeepMind	Verified email at kavukcuoglu.org
Raia Hadsell	Google DeepMind	Verified email at google.com
Edward Grefenstette	Director of Research, Google DeepMind | Honorary Professor, UCL	Verified email at google.com
Karl Moritz Hermann	Reliant AI	Verified email at google.com
Phil Blunsom	Cohere & Oxford University	Verified email at cs.ox.ac.uk
Guillaume Desjardins	DeepMind	Verified email at google.com
Volodymyr Mnih	DeepMind	Verified email at cs.toronto.edu
Julien Cornebise	Hon. Associate Professor, University College London	Verified email at ucl.ac.uk
Joel Z Leibo	Research scientist	Verified email at google.com
Demis Hassabis	DeepMind

Sergio Gómez Colmenarejo

Research Engineer, DeepMind

Verified email at google.com

Artificial Intelligence


Title	Authors	Venue	Cited by	Year
Learning to learn by gradient descent by gradient descent	M Andrychowicz, M Denil, S Gomez, MW Hoffman, D Pfau, T Schaul, ...	Advances in Neural Information Processing Systems 29, 2016	2248	2016
Hybrid computing using a neural network with dynamic external memory	A Graves, G Wayne, M Reynolds, T Harley, I Danihelka, ...	Nature 538 (7626), 471-476, 2016	1842	2016
Policy distillation	AA Rusu, SG Colmenarejo, C Gulcehre, G Desjardins, J Kirkpatrick, ...	arXiv preprint arXiv:1511.06295, 2015	743	2015
A generalist agent	S Reed, K Zolna, E Parisotto, SG Colmenarejo, A Novikov, G Barth-Maron, ...	arXiv preprint arXiv:2205.06175, 2022	677	2022
Learning to learn without gradient descent by gradient descent	Y Chen, MW Hoffman, SG Colmenarejo, M Denil, TP Lillicrap, M Botvinick, ...	International Conference on Machine Learning, 748-756, 2017	316*	2017
Learned optimizers that scale and generalize	O Wichrowska, N Maheswaranathan, MW Hoffman, SG Colmenarejo, ...	International Conference on Machine Learning, 3751-3760, 2017	298	2017
Parallel multiscale autoregressive density estimation	S Reed, A Oord, N Kalchbrenner, SG Colmenarejo, Z Wang, Y Chen, ...	International Conference on Machine Learning, 2912-2921, 2017	232	2017
Acme: A research framework for distributed reinforcement learning	MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ...	arXiv preprint arXiv:2006.00979, 2020	231	2020
RL Unplugged: A suite of benchmarks for offline reinforcement learning	C Gulcehre, Z Wang, A Novikov, T Paine, S Gómez, K Zolna, R Agarwal, ...	Advances in Neural Information Processing Systems 33, 7248-7259, 2020	184*	2020
Scaling data-driven robotics with reward sketching and batch reinforcement learning	S Cabi, SG Colmenarejo, A Novikov, K Konyushkova, S Reed, R Jeong, ...	arXiv preprint arXiv:1909.12200, 2019	153*	2019
Programmable agents	M Denil, SG Colmenarejo, S Cabi, D Saxton, N De Freitas	arXiv preprint arXiv:1706.06383, 2017	68	2017
Task-relevant adversarial imitation learning	K Zolna, S Reed, A Novikov, SG Colmenarejo, D Budden, S Cabi, M Denil, ...	Conference on Robot Learning, 247-263, 2021	56	2021
Learning awareness models	B Amos, L Dinh, S Cabi, T Rothörl, SG Colmenarejo, A Muldal, T Erez, ...	arXiv preprint arXiv:1804.06318, 2018	52	2018
RoboCat: A Self-Improving Generalist Agent for Robotic Manipulation	K Bousmalis, G Vezzani, D Rao, CM Devin, AX Lee, MB Villalonga, ...	Transactions on Machine Learning Research, 2023	40*	2023
The intentional unintentional agent: Learning to solve many continuous control tasks simultaneously	S Cabi, SG Colmenarejo, MW Hoffman, M Denil, Z Wang, N Freitas	Conference on Robot Learning, 207-216, 2017	39	2017
Regularized behavior value estimation	C Gulcehre, SG Colmenarejo, Z Wang, J Sygnowski, T Paine, K Zolna, ...	arXiv preprint arXiv:2103.09575, 2021	26	2021
One-shot high-fidelity imitation: Training large-scale deep nets with RL	TL Paine, SG Colmenarejo, Z Wang, S Reed, Y Aytar, T Pfaff, ...	arXiv preprint arXiv:1810.05017, 2018	25	2018
TF-Replicator: Distributed machine learning for researchers	P Buchlovsky, D Budden, D Grewe, C Jones, J Aslanides, F Besse, ...	arXiv preprint arXiv:1902.00465, 2019	24	2019
StarCraft II unplugged: Large scale offline reinforcement learning	M Mathieu, S Ozair, S Srinivasan, C Gulcehre, S Zhang, R Jiang, ...	Deep RL Workshop NeurIPS 2021, 2021	19*	2021
Iterative multiscale image generation using neural networks	NE Kalchbrenner, D Belov, SG Colmenarejo, AGA van den Oord, Z Wang, ...	US Patent 11,361,403, 2022	5	2022
