Follow
Yuheng Zhang
Title
Cited by
Cited by
Year
The secret revealer: Generative model-inversion attacks against deep neural networks
Y Zhang*, R Jia*, H Pei, W Wang, B Li, D Song
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
4712020
Improving robustness to model inversion attacks via mutual information regularization
T Wang, Y Zhang, R Jia
Proceedings of the AAAI Conference on Artificial Intelligence 35 (13), 11666 …, 2021
632021
Ordinal distribution regression for gait-based age estimation
H Zhu, Y Zhang, G Li, J Zhang, H Shan
Science China Information Sciences 63, 1-14, 2020
342020
Convolutional ordinal regression forest for image ordinal estimation
H Zhu, H Shan, Y Zhang, L Che, X Xu, J Zhang, J Shi, FY Wang
IEEE Transactions on Neural Networks and Learning Systems 33 (8), 4084-4095, 2021
29*2021
A Theoretical Analysis of Nash Learning from Human Feedback under General KL-Regularized Preference
C Ye*, W Xiong*, Y Zhang*, N Jiang, T Zhang
arXiv preprint arXiv:2402.07314, 2024
162024
Batch Active Learning with Graph Neural Networks via Multi-Agent Deep Reinforcement Learning
Y Zhang, H Tong, Y Xia, Y Zhu, Y Chi, L Ying
Proceedings of the AAAI Conference on Artificial Intelligence 36 (8), 9118-9126, 2022
152022
Improved Algorithms for Neural Active Learning
Y Ban*, Y Zhang*, H Tong, A Banerjee, J He
Advances in Neural Information Processing Systems 35, 27497-27509, 2022
82022
Offline Learning in Markov Games with General Function Approximation
Y Zhang, Y Bai, N Jiang
Proceedings of the 40th International Conference on Machine Learning 202 …, 2023
72023
Improved High-Probability Regret for Adversarial Bandits with Time-Varying Feedback Graphs
H Luo*, H Tong*, M Zhang*, Y Zhang*
Proceedings of The 34th International Conference on Algorithmic Learning …, 2023
52023
Practical Contextual Bandits with Feedback Graphs
M Zhang*, Y Zhang*, O Vrousgou, H Luo, P Mineiro
Advances in Neural Information Processing Systems 36, 30592-30617, 2023
32023
On the Curses of Future and History in Future-dependent Value Functions for Off-policy Evaluation
Y Zhang, N Jiang
arXiv preprint arXiv:2402.14703, 2024
22024
Efficient contextual bandits with uninformed feedback graphs
M Zhang, Y Zhang, H Luo, P Mineiro
Proceedings of the 41st International Conference on Machine Learning, 2024
22024
Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning
Y Zhang, D Yu, B Peng, L Song, Y Tian, M Huo, N Jiang, H Mi, D Yu
arXiv preprint arXiv:2407.00617, 2024
12024
Group Fairness via Group Consensus
E Chan, Z Liu, R Qiu, Y Zhang, R Maciejewski, H Tong
The 2024 ACM Conference on Fairness, Accountability, and Transparency, 1788-1808, 2024
12024
Active Heterogeneous Graph Neural Networks with Per-step Meta-Q-Learning
Y Zhang, Y Xia, Y Zhu, Y Chi, L Ying, H Tong
2022 IEEE International Conference on Data Mining (ICDM), 1329-1334, 2022
12022
Provably Efficient Interactive-Grounded Learning with Personalized Reward
M Zhang*, Y Zhang*, H Luo, P Mineiro
arXiv preprint arXiv:2405.20677, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–16