Siddhant M. Jayakumar
DeepMind
Verified email at google.com
Title
Cited by
Year
Memory-based parameter adaptation
P Sprechmann, SM Jayakumar, JW Rae, A Pritzel, AP Badia, B Uria, ...
ICLR 2018, 2018
Cited by: 32; Year: 2018
Adapting auxiliary losses using gradient similarity
Y Du, WM Czarnecki, SM Jayakumar, R Pascanu, B Lakshminarayanan
arXiv preprint arXiv:1812.02224, 2018
Cited by: 20; Year: 2018
Mix & Match - agent curricula for reinforcement learning
WM Czarnecki, SM Jayakumar, M Jaderberg, L Hasenclever, YW Teh, ...
ICML 2018, 2018
Cited by: 20; Year: 2018
Been there, done that: Meta-learning with episodic recall
S Ritter, JX Wang, Z Kurth-Nelson, SM Jayakumar, C Blundell, R Pascanu, ...
ICML 2018, 2018
Cited by: 19; Year: 2018
Information asymmetry in KL-regularized RL
A Galashov, SM Jayakumar, L Hasenclever, D Tirumala, J Schwarz, ...
ICLR 2019, 2019
Cited by: 12; Year: 2019
Meta-learning of sequential strategies
PA Ortega, JX Wang, M Rowland, T Genewein, Z Kurth-Nelson, ...
arXiv preprint arXiv:1905.03030, 2019
Cited by: 7; Year: 2019
Distilling Policy Distillation
WM Czarnecki, R Pascanu, S Osindero, SM Jayakumar, G Swirszcz, ...
AISTATS 2019, 2019
Cited by: 7; Year: 2019
Compressive Transformers for Long-Range Sequence Modelling
JW Rae, A Potapenko, SM Jayakumar, TP Lillicrap
arXiv preprint arXiv:1911.05507, 2019
Cited by: 3; Year: 2019
Perception-Prediction-Reaction Agents for Deep Reinforcement Learning
A Stooke, V Dalibard, SM Jayakumar, WM Czarnecki, M Jaderberg
Workshop on “Structure & Priors in Reinforcement Learning” at ICLR 2019, 2019
Cited by: 1; Year: 2019
Low-pass Recurrent Neural Networks - A memory architecture for longer-term correlation discovery
T Stepleton, R Pascanu, W Dabney, SM Jayakumar, H Soyer, R Munos
arXiv preprint arXiv:1805.04955, 2018
Cited by: 1; Year: 2018
Reinforcement learning using agent curricula
W Czarnecki, S Jayakumar
US Patent App. 16/417,522, 2019
2019
Stabilizing Transformers for Reinforcement Learning
E Parisotto, HF Song, JW Rae, R Pascanu, C Gulcehre, SM Jayakumar, ...
arXiv preprint arXiv:1910.06764, 2019
2019
Information asymmetry in KL-regularized RL
WM Czarnecki, L Hasenclever, YW Teh, G Desjardins, A Galashov, ...
International Conference on Learning Representations, 2018
2018