Tengyu Xu
Meta Platforms, Inc.
Verified email at meta.com - Homepage

Title · Cited by · Year
Finite-sample analysis for SARSA with linear function approximation
S Zou, T Xu, Y Liang
Advances in neural information processing systems 32, 2019
Cited by 176 · 2019
Improving sample complexity bounds for (natural) actor-critic algorithms
T Xu, Z Wang, Y Liang
Advances in Neural Information Processing Systems 33, 4358-4369, 2020
Cited by 128* · 2020
CRPO: A new approach for safe reinforcement learning with convergence guarantee
T Xu, Y Liang, G Lan
International Conference on Machine Learning, 11480-11491, 2021
Cited by 122* · 2021
Two time-scale off-policy TD learning: Non-asymptotic analysis over Markovian samples
T Xu, S Zou, Y Liang
Advances in neural information processing systems 32, 2019
Cited by 83 · 2019
Reanalysis of variance reduced temporal difference learning
T Xu, Z Wang, Y Zhou, Y Liang
arXiv preprint arXiv:2001.01898, 2020
Cited by 45 · 2020
Algorithms for the estimation of transient surface heat flux during ultra-fast surface cooling
ZF Zhou, TY Xu, B Chen
International Journal of Heat and Mass Transfer 100, 1-10, 2016
Cited by 42 · 2016
Enhanced first and zeroth order variance reduced algorithms for min-max optimization
T Xu, Z Wang, Y Liang, HV Poor
Cited by 41* · 2020
Proximal gradient descent-ascent: Variable convergence under KŁ geometry
Z Chen, Y Zhou, T Xu, Y Liang
arXiv preprint arXiv:2102.04653, 2021
Cited by 29 · 2021
Non-asymptotic convergence of Adam-type reinforcement learning algorithms under Markovian sampling
H Xiong, T Xu, Y Liang, W Zhang
Proceedings of the AAAI Conference on Artificial Intelligence 35 (12), 10460 …, 2021
Cited by 28 · 2021
Sample complexity bounds for two timescale value-based reinforcement learning algorithms
T Xu, Y Liang
International Conference on Artificial Intelligence and Statistics, 811-819, 2021
Cited by 27 · 2021
Faster algorithm and sharper analysis for constrained Markov decision process
T Li, Z Guan, S Zou, T Xu, Y Liang, G Lan
Operations Research Letters 54, 107107, 2024
Cited by 24 · 2024
When will generative adversarial imitation learning algorithms attain global convergence
Z Guan, T Xu, Y Liang
International Conference on Artificial Intelligence and Statistics, 1117-1125, 2021
Cited by 24 · 2021
Doubly robust off-policy actor-critic: Convergence and optimality
T Xu, Z Yang, Z Wang, Y Liang
International Conference on Machine Learning, 11581-11591, 2021
Cited by 23 · 2021
When Will Gradient Methods Converge to Max-margin Classifier under ReLU Models?
T Xu, Y Zhou, K Ji, Y Liang
arXiv preprint arXiv:1806.04339, 2018
Cited by 23* · 2018
Model-based offline meta-reinforcement learning with regularization
S Lin, J Wan, T Xu, Y Liang, J Zhang
arXiv preprint arXiv:2202.02929, 2022
Cited by 15 · 2022
Provably efficient offline reinforcement learning with trajectory-wise reward
T Xu, Y Wang, S Zou, Y Liang
arXiv preprint arXiv:2206.06426, 2022
Cited by 11 · 2022
PER-ETD: A polynomially efficient emphatic temporal difference learning method
Z Guan, T Xu, Y Liang
arXiv preprint arXiv:2110.06906, 2021
Cited by 8 · 2021
Deterministic policy gradient: Convergence analysis
H Xiong, T Xu, L Zhao, Y Liang, W Zhang
Uncertainty in Artificial Intelligence, 2159-2169, 2022
Cited by 6 · 2022
A Unifying Framework of Off-Policy General Value Function Evaluation
T Xu, Z Yang, Z Wang, Y Liang
Advances in Neural Information Processing Systems 35, 13570-13583, 2022
Cited by 4* · 2022
Constraint-based multi-agent reinforcement learning for collaborative tasks
X Shang, T Xu, I Karamouzas, M Kallmann
Computer Animation and Virtual Worlds 34 (3-4), e2182, 2023
Cited by 2 · 2023
Articles 1–20