Zhaohan Daniel Guo

Cited by

	All	Since 2019
Citations	7759	7678
h-index	16	16
i10-index	19	18

Citations per year
Year	Citations
2018	46
2019	72
2020	248
2021	1117
2022	2241
2023	2955
2024	1035

Public access

3 articles available, 0 articles not available (based on funding mandates)

Coauthors

Emma Brunskill, Associate Professor of Computer Science, Stanford University. Verified email at cs.stanford.edu
Philip Thomas, University of Massachusetts Amherst. Verified email at cs.umass.edu
Shayan Doroudi, Assistant Professor at the University of California, Irvine. Verified email at uci.edu
Yao Liu, Amazon. Verified email at stanford.edu

Zhaohan Daniel Guo

DeepMind

Verified email at google.com

Reinforcement learning


Title	Cited by	Year
Bootstrap your own latent: a new approach to self-supervised learning. JB Grill, F Strub, F Altché, C Tallec, P Richemond, E Buchatskaya, ... Advances in Neural Information Processing Systems 33, 21271-21284, 2020	5874	2020
Agent57: Outperforming the Atari human benchmark. AP Badia, B Piot, S Kapturowski, P Sprechmann, A Vitvitskyi, ZD Guo, ... International Conference on Machine Learning, 507-517, 2020	600	2020
Never give up: Learning directed exploration strategies. AP Badia, P Sprechmann, A Vitvitskyi, D Guo, B Piot, S Kapturowski, ... arXiv preprint arXiv:2002.06038, 2020	312	2020
Joint semantic utterance classification and slot filling with recursive neural networks. D Guo, G Tur, W Yih, G Zweig. 2014 IEEE Spoken Language Technology Workshop (SLT), 554-559, 2014	241	2014
Bootstrap latent-predictive representations for multitask reinforcement learning. ZD Guo, BA Pires, B Piot, JB Grill, F Altché, R Munos, MG Azar. International Conference on Machine Learning, 3875-3886, 2020	138	2020
Neural predictive belief representations. ZD Guo, MG Azar, B Piot, BA Pires, R Munos. arXiv preprint arXiv:1811.06407, 2018	83	2018
k. kavukcuoglu, R JB Grill, F Strub, F Altché, C Tallec, P Richemond, E Buchatskaya, ... Munos, and M. Valko,“Bootstrap your own latent-a new approach to self …, 2020	63	2020
A PAC RL algorithm for episodic POMDPs. ZD Guo, S Doroudi, E Brunskill. Artificial Intelligence and Statistics, 510-518, 2016	62	2016
A general theoretical paradigm to understand learning from human preferences. MG Azar, ZD Guo, B Piot, R Munos, M Rowland, M Valko, D Calandriello. International Conference on Artificial Intelligence and Statistics, 4447-4455, 2024	53	2024
BYOL-Explore: Exploration by bootstrapped prediction. Z Guo, S Thakoor, M Pîslar, B Avila Pires, F Altché, C Tallec, A Saade, ... Advances in Neural Information Processing Systems 35, 31855-31870, 2022	53	2022
Using options and covariance testing for long horizon off-policy policy evaluation. Z Guo, PS Thomas, E Brunskill. Advances in Neural Information Processing Systems 30, 2017	47	2017
Geometric entropic exploration. ZD Guo, MG Azar, A Saade, S Thakoor, B Piot, BA Pires, M Valko, ... arXiv preprint arXiv:2101.02055, 2021	38	2021
Bootstrap your own latent: A new approach to self-supervised learning. JB Grill, F Strub, F Altché, C Tallec, PH Richemond, E Buchatskaya, ... arXiv preprint arXiv:2006.07733, 2020	36	2020
Concurrent PAC RL. Z Guo, E Brunskill. Proceedings of the AAAI Conference on Artificial Intelligence 29 (1), 2015	29	2015
Understanding self-predictive learning for reinforcement learning. Y Tang, ZD Guo, PH Richemond, BA Pires, Y Chandak, R Munos, ... International Conference on Machine Learning, 33632-33656, 2023	21	2023
PAC continuous state online multitask reinforcement learning with identification. Y Liu, Z Guo, E Brunskill. Proceedings of the 2016 International Conference on Autonomous Agents …, 2016	19	2016
Nash learning from human feedback. R Munos, M Valko, D Calandriello, MG Azar, M Rowland, ZD Guo, Y Tang, ... arXiv preprint arXiv:2312.00886, 2023	14	2023
Directed exploration for reinforcement learning. ZD Guo, E Brunskill. arXiv preprint arXiv:1906.07805, 2019	11	2019
Sample efficient feature selection for factored MDPs. ZD Guo, E Brunskill. arXiv preprint arXiv:1703.03454, 2017	11	2017
Never give up: Learning directed exploration strategies. AP Badia, P Sprechmann, A Vitvitskyi, D Guo, B Piot, S Kapturowski, ... arXiv preprint arXiv:2002.06038, 2020	8	2020
