Suivre
Zhengyuan Zhou
Zhengyuan Zhou
Dept of Technology, Operations and Statistics at NYU Stern
Adresse e-mail validée de stern.nyu.edu - Page d'accueil
Titre
Citée par
Citée par
Année
MentorNet: Learning Data-Driven Curriculum for Very Deep Neural Networks on Corrupted Labels
L Jiang, Z Zhou, T Leung, LJ Li, L Fei-Fei
International Conference on Machine Learning, 2309-2318, 2018
17912018
Learning in games with continuous action sets and unknown payoff functions
P Mertikopoulos, Z Zhou
Mathematical Programming 173 (1-2), 465-507, 2019
2902019
Estimation considerations in contextual bandits
M Dimakopoulou, Z Zhou, S Athey, G Imbens
arXiv preprint arXiv:1711.07077, 2017
2392017
Balanced linear contextual bandits
M Dimakopoulou, Z Zhou, S Athey, G Imbens
Proceedings of the AAAI Conference on Artificial Intelligence 33, 3445-3453, 2019
2172019
Robust low-rank tensor recovery with rectification and alignment
X Zhang, D Wang, Z Zhou, Y Ma
IEEE transactions on pattern analysis and machine intelligence 43 (1), 238-255, 2019
1972019
Offline multi-action policy learning: Generalization and optimization
Z Zhou, S Athey, S Wager
Operations Research, 2022
1962022
Cooperative pursuit with Voronoi partitions
Z Zhou, W Zhang, J Ding, H Huang, DM Stipanović, CJ Tomlin
Automatica 72, 64-72, 2016
192*2016
Multiplayer reach-avoid games via pairwise outcomes
M Chen, Z Zhou, CJ Tomlin
IEEE Transactions on Automatic Control 62 (3), 1451-1457, 2016
1662016
Distributional soft actor critic for risk sensitive learning
X Ma, Q Zhang, L Xia, Z Zhou, J Yang, Q Zhao
arXiv preprint arXiv:2004.14547, 2020
1122020
Learning in generalized linear contextual bandits with stochastic delays
Z Zhou, R Xu, J Blanchet
Advances in Neural Information Processing Systems, 5197-5208, 2019
1052019
Stochastic mirror descent in variationally coherent optimization problems
Z Zhou, P Mertikopoulos, N Bambos, S Boyd, PW Glynn
Advances in Neural Information Processing Systems 30, 2017
1032017
A general, open-loop formulation for reach-avoid games
Z Zhou, R Takei, H Huang, CJ Tomlin
Decision and Control (CDC), 2012 IEEE 51st Annual Conference on, 6501-6506, 2012
882012
Finite-Sample Regret Bound for Distributionally Robust Offline Tabular Reinforcement Learning
Z Zhou, Z Zhou, Q Bai, L Qiu, J Blanchet, P Glynn
International Conference on Artificial Intelligence and Statistics, 3331-3339, 2021
862021
Infinite time horizon maximum causal entropy inverse reinforcement learning
Z Zhou, M Bloem, N Bambos
IEEE Transactions on Automatic Control 63 (9), 2787-2802, 2018
852018
Efficient path planning algorithms in reach-avoid problems
Z Zhou, J Ding, H Huang, R Takei, C Tomlin
Automatica 89, 28-36, 2018
842018
Simple Agent, Complex Environment: Efficient Reinforcement Learning with Agent State
S Dong, B Van Roy, Z Zhou
arXiv preprint arXiv:2102.05261, 2021
77*2021
Multiplayer reach-avoid games via low dimensional solutions and maximum matching
M Chen, Z Zhou, CJ Tomlin
2014 American control conference, 1444-1449, 2014
752014
Distributed Asynchronous Optimization with Unbounded Delays: How Slow Can You Go?
Z Zhou, P Mertikopoulos, N Bambos, PW Glynn, Y Ye, LJ Li, FF Li
ICML 2018-35th International Conference on Machine Learning, 1-10, 2018
712018
Evasion as a team against a faster pursuer
SY Liu, Z Zhou, C Tomlin, K Hedrick
2013 American Control Conference, 5368-5373, 2013
682013
Distributionally robust policy evaluation and learning in offline contextual bandits
N Si, F Zhang, Z Zhou, J Blanchet
International Conference on Machine Learning, 8884-8894, 2020
652020
Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.
Articles 1–20