Thomas Degris
Thomas Degris
DeepMind
Verified email at google.com
TitleCited byYear
Deterministic policy gradient algorithms
D Silver, G Lever, N Heess, T Degris, D Wierstra, M Riedmiller
7932014
Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction
RS Sutton, J Modayil, M Delp, T Degris, PM Pilarski, A White, D Precup
The 10th International Conference on Autonomous Agents and Multiagent …, 2011
2232011
Learning the structure of factored markov decision processes in reinforcement learning problems
T Degris, O Sigaud, PH Wuillemin
Proceedings of the 23rd international conference on Machine learning, 257-264, 2006
1262006
Off-policy actor-critic
T Degris, M White, RS Sutton
arXiv preprint arXiv:1205.4839, 2012
1202012
Vector-based navigation using grid-like representations in artificial agents
A Banino, C Barry, B Uria, C Blundell, T Lillicrap, P Mirowski, A Pritzel, ...
Nature 557 (7705), 429, 2018
1112018
The predictron: End-to-end learning and planning
D Silver, H van Hasselt, M Hessel, T Schaul, A Guez, T Harley, ...
Proceedings of the 34th International Conference on Machine Learning-Volume …, 2017
1012017
Model-free reinforcement learning with continuous action in practice
T Degris, PM Pilarski, RS Sutton
2012 American Control Conference (ACC), 2177-2182, 2012
982012
Online human training of a myoelectric prosthesis controller via actor-critic reinforcement learning
PM Pilarski, MR Dawson, T Degris, F Fahimi, JP Carey, RS Sutton
2011 IEEE International Conference on Rehabilitation Robotics, 1-7, 2011
962011
Deep reinforcement learning in large discrete action spaces
G Dulac-Arnold, R Evans, H van Hasselt, P Sunehag, T Lillicrap, J Hunt, ...
arXiv preprint arXiv:1512.07679, 2015
852015
Adaptive artificial limbs: A real-time approach to prediction and anticipation
PM Pilarski, MR Dawson, T Degris, JP Carey, KM Chan, JS Hebert, ...
IEEE Robotics & Automation Magazine 20 (1), 53-64, 2013
522013
Dynamic switching and real-time machine learning for improved human control of assistive biomedical robots
PM Pilarski, MR Dawson, T Degris, JP Carey, RS Sutton
2012 4th IEEE RAS & EMBS International Conference on Biomedical Robotics and …, 2012
432012
Tuning-free step-size adaptation
AR Mahmood, RS Sutton, T Degris, PM Pilarski
2012 IEEE International Conference on Acoustics, Speech and Signal …, 2012
342012
A spiking neuron model of head-direction cells for robot orientation
T Degris, L Lacheze, C Boucheny, A Arleo
Proceedings of the eighth int. conf. on the simulation of adaptive behavior …, 2004
262004
Linear off-policy actor-critic
T Degris, M White, RS Sutton
In International Conference on Machine Learning, 2012
222012
Chi-square tests driven method for learning the structure of factored MDPs
T Degris, O Sigaud, PH Wuillemin
arXiv preprint arXiv:1206.6842, 2012
202012
Rapid response of head direction cells to reorienting visual cues: a computational model
T Degris, O Sigaud, SI Wiener, A Arleo
Neurocomputing 58, 675-682, 2004
182004
Factored markov decision processes
T Degris, O Sigaud
Markov Decision Processes in Artificial Intelligence, 99-126, 2013
172013
Apprentissage par renforcement dans les processus de décision markoviens factorisés
T Degris
Paris 6, 2007
122007
Scaling-up knowledge for a cognizant robot
T Degris, J Modayil
2012 AAAI Spring Symposium Series, 2012
102012
Exploiting additive structure in factored mdps for reinforcement learning
T Degris, O Sigaud, PH Wuillemin
European Workshop on Reinforcement Learning, 15-26, 2008
72008
The system can't perform the operation now. Try again later.
Articles 1–20