Zhaohan Daniel Guo
DeepMind
Verified email at google.com
Title
Cited by
Year
Bootstrap your own latent: a new approach to self-supervised learning
JB Grill, F Strub, F Altché, C Tallec, P Richemond, E Buchatskaya, ...
Advances in Neural Information Processing Systems 33, 21271-21284, 2020
Cited by: 1359; Year: 2020
Agent57: Outperforming the Atari human benchmark
AP Badia, B Piot, S Kapturowski, P Sprechmann, A Vitvitskyi, ZD Guo, ...
International Conference on Machine Learning, 507-517, 2020
Cited by: 241; Year: 2020
Joint semantic utterance classification and slot filling with recursive neural networks
D Guo, G Tur, W Yih, G Zweig
2014 IEEE Spoken Language Technology Workshop (SLT), 554-559, 2014
Cited by: 168; Year: 2014
Never give up: Learning directed exploration strategies
AP Badia, P Sprechmann, A Vitvitskyi, D Guo, B Piot, S Kapturowski, ...
arXiv preprint arXiv:2002.06038, 2020
Cited by: 102; Year: 2020
Bootstrap latent-predictive representations for multitask reinforcement learning
ZD Guo, BA Pires, B Piot, JB Grill, F Altché, R Munos, MG Azar
International Conference on Machine Learning, 3875-3886, 2020
Cited by: 56; Year: 2020
Neural predictive belief representations
ZD Guo, MG Azar, B Piot, BA Pires, R Munos
arXiv preprint arXiv:1811.06407, 2018
Cited by: 44; Year: 2018
Using options and covariance testing for long horizon off-policy policy evaluation
Z Guo, PS Thomas, E Brunskill
Advances in Neural Information Processing Systems 30, 2017
Cited by: 40; Year: 2017
A PAC RL algorithm for episodic POMDPs
ZD Guo, S Doroudi, E Brunskill
Artificial Intelligence and Statistics, 510-518, 2016
Cited by: 23; Year: 2016
Concurrent PAC RL
Z Guo, E Brunskill
Proceedings of the AAAI Conference on Artificial Intelligence 29 (1), 2015
Cited by: 21; Year: 2015
kavukcuoglu, k.; Munos, R.; and Valko, M. 2020. Bootstrap Your Own Latent-A New Approach to Self-Supervised Learning
JB Grill, F Strub, F Altché, C Tallec, P Richemond, E Buchatskaya, ...
Larochelle, H.; Ranzato, M.; Hadsell, R.; Balcan, MF, 21271-21284, 0
Cited by: 19
PAC continuous state online multitask reinforcement learning with identification
Y Liu, Z Guo, E Brunskill
Proceedings of the 2016 International Conference on Autonomous Agents …, 2016
Cited by: 13; Year: 2016
Agent57: Outperforming the Atari human benchmark
A Puigdomènech Badia, B Piot, S Kapturowski, P Sprechmann, ...
arXiv e-prints, arXiv: 2003.13350, 2020
Cited by: 12; Year: 2020
Bootstrap your own latent: A new approach to self-supervised learning. arXiv 2020
JB Grill, F Strub, F Altché, C Tallec, PH Richemond, E Buchatskaya, ...
arXiv preprint arXiv:2006.07733, 0
Cited by: 11
Bootstrap your own latent: A new approach to self-supervised learning. arXiv
JB Grill, F Strub, F Altché, C Tallec, PH Richemond, E Buchatskaya, ...
arXiv preprint arXiv:2006.07733, 2020
Cited by: 8; Year: 2020
Geometric entropic exploration
ZD Guo, MG Azar, A Saade, S Thakoor, B Piot, BA Pires, M Valko, ...
arXiv preprint arXiv:2101.02055, 2021
Cited by: 7; Year: 2021
Never give up: Learning directed exploration strategies
A Puigdomènech Badia, P Sprechmann, A Vitvitskyi, D Guo, B Piot, ...
arXiv e-prints, arXiv: 2002.06038, 2020
Cited by: 6; Year: 2020
Sample efficient feature selection for factored MDPs
ZD Guo, E Brunskill
arXiv preprint arXiv:1703.03454, 2017
Cited by: 6; Year: 2017
Bootstrap your own latent: a new approach to self-supervised Learning 2020
JB Grill, F Strub, F Altché, C Tallec, PH Richemond, E Buchatskaya, ...
arXiv preprint arXiv:2006.07733, 2021
Cited by: 5; Year: 2021
Directed exploration for reinforcement learning
ZD Guo, E Brunskill
arXiv preprint arXiv:1906.07805, 2019
Cited by: 5; Year: 2019
Sample efficient learning with feature selection for factored MDPs
ZD Guo, E Brunskill
Proceedings of the 14th European Workshop on Reinforcement Learning, 2018
Cited by: 5; Year: 2018