Shangtong Zhang

Cited by

	All	Since 2019
Citations	1145	1108
h-index	16	16
i10-index	22	20

Citations per year: 2017: 5; 2018: 26; 2019: 63; 2020: 158; 2021: 231; 2022: 276; 2023: 297; 2024: 80

Public access

10 articles available, 0 articles not available (based on funding mandates)

Co-authors

Shimon Whiteson: Professor of Computer Science, University of Oxford / Senior Staff Research Scientist, Waymo (verified email at cs.ox.ac.uk)
Hengshuai Yao: Sony AI (verified email at ualberta.ca)
Richard S. Sutton: Keen, Amii, and University of Alberta (verified email at richsutton.com)
Bo Liu: Ex-Associate Professor, AAAI SM, IEEE SM (verified email at cs.umass.edu)
Linglong Kong: Professor, Canada Research Chair in Statistical Learning, UAlberta, and Canada CIFAR AI Chair, Amii (verified email at ualberta.ca)
Wendelin Böhmer: Sequential Decision Making Group, Delft University of Technology (verified email at tudelft.nl)
Ray Jiang: Research Scientist, DeepMind (verified email at google.com)
Remi Tachet des Combes (verified email at alpacaml.com)
Romain Laroche: Microsoft Research (verified email at polytechnique.org)
Marcus Edel: Computer Science, Free University of Berlin (verified email at fu-berlin.de)
Ryan R. Curtin: Free agent (verified email at ratml.org)
Nando de Freitas: CIFAR & DeepMind (verified email at google.com)
Tom Le Paine: Staff Research Scientist at Google DeepMind (verified email at google.com)
Julian Schrittwieser: DeepMind (verified email at furidamu.org)
Roman Ring: DeepMind (verified email at deepmind.com)
Petko Georgiev: Google DeepMind, University of Cambridge (verified email at cam.ac.uk)
Michael Mathieu: DeepMind (verified email at google.com)
Aäron van den Oord: Google DeepMind (verified email at google.com)
Sergio Gómez Colmenarejo: Research Engineer, DeepMind (verified email at google.com)
Caglar Gulcehre: Prof at EPFL, Consultant@Google DeepMind, ex-Staff Research Scientist@Google DeepMind, PhD@MILA (verified email at google.com)

Shangtong Zhang

University of Virginia

Verified email at virginia.edu

reinforcement learning


Title	Cited by	Year
A Deeper Look at Experience Replay. S Zhang, RS Sutton. Deep Reinforcement Learning Symposium, NIPS 2017, 2017	331	2017
GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values. S Zhang, B Liu, S Whiteson. ICML 2020, 2020	94	2020
Distributional Reinforcement Learning for Efficient Exploration. B Mavrin, S Zhang, H Yao, L Kong, K Wu, Y Yu. ICML 2019, 2019	87	2019
mlpack 3: a fast, flexible machine learning library. R Curtin, M Edel, M Lozhnikov, Y Mentekidis, S Ghaisas, S Zhang. Journal of Open Source Software 3 (26), 726, 2018	85	2018
DAC: The Double Actor-Critic Architecture for Learning Options. S Zhang, S Whiteson. NeurIPS 2019, 2019	76	2019
Provably Convergent Two-Timescale Off-Policy Actor-Critic with Function Approximation. S Zhang, B Liu, H Yao, S Whiteson. ICML 2020, 2019	52	2019
Generalized Off-Policy Actor-Critic. S Zhang, W Boehmer, S Whiteson. NeurIPS 2019, 2019	51	2019
Breaking the Deadly Triad with a Target Network. S Zhang, H Yao, S Whiteson. ICML 2021, 2021	40	2021
Mean-variance policy iteration for risk-averse reinforcement learning. S Zhang, B Liu, S Whiteson. Proceedings of the AAAI Conference on Artificial Intelligence 35 (12), 10905 …, 2021	36	2021
QUOTA: The Quantile Option Architecture for Reinforcement Learning. S Zhang, B Mavrin, L Kong, B Liu, H Yao. AAAI 2019, 2018	32	2018
Average-Reward Off-Policy Policy Evaluation with Function Approximation. S Zhang, Y Wan, RS Sutton, S Whiteson. ICML 2021, 2021	31	2021
ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search. S Zhang, H Chen, H Yao. AAAI 2019, 2018	31	2018
Modularized Implementation of Deep RL Algorithms in PyTorch. S Zhang	29*	2018
A deep neural network for modeling music. P Zhang, X Zheng, W Zhang, S Li, S Qian, W He, S Zhang, Z Wang. Proceedings of the 5th ACM on International Conference on Multimedia …, 2015	27	2015
Deep Residual Reinforcement Learning. S Zhang, W Boehmer, S Whiteson. AAMAS 2020, 2019	22	2019
AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning. M Mathieu, S Ozair, S Srinivasan, C Gulcehre, S Zhang, R Jiang, ... arXiv preprint arXiv:2308.03526, 2023	19*	2023
Learning expected emphatic traces for deep RL. R Jiang, S Zhang, V Chelu, A White, H van Hasselt. Proceedings of the AAAI Conference on Artificial Intelligence 36 (6), 7015-7023, 2022	13	2022
Learning Retrospective Knowledge with Reverse Reinforcement Learning. S Zhang, V Veeriah, S Whiteson. NeurIPS 2020, 2020	13	2020
Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards. Y Song, J Wang, T Lukasiewicz, Z Xu, S Zhang, M Xu. AAAI 2020, 2019	13	2019
Comparing Deep Reinforcement Learning and Evolutionary Methods in Continuous Control. S Zhang, OR Zaiane. Deep Reinforcement Learning Symposium, NIPS 2017, 2017	12	2017

Articles 1–20
