Xiuyuan Lu
Google DeepMind
Verified email at google.com
Title
Cited by
Year
Ensemble sampling
X Lu, B Van Roy
Advances in Neural Information Processing Systems 30, 2017
Cited by: 151; Year: 2017
Epistemic neural networks
I Osband, Z Wen, SM Asghari, V Dwaracherla, M Ibrahimi, X Lu, ...
Advances in Neural Information Processing Systems 36, 2023
Cited by: 101; Year: 2023
Reinforcement learning, bit by bit
X Lu, B Van Roy, V Dwaracherla, M Ibrahimi, I Osband, Z Wen
Foundations and Trends® in Machine Learning 16 (6), 733-865, 2023
Cited by: 75; Year: 2023
Information-theoretic confidence bounds for reinforcement learning
X Lu, B Van Roy
Advances in Neural Information Processing Systems 32, 2019
Cited by: 60; Year: 2019
Hypermodels for exploration
V Dwaracherla, X Lu, M Ibrahimi, I Osband, Z Wen, B Van Roy
International Conference on Learning Representations, 2020
Cited by: 47; Year: 2020
Efficient online recommendation via low-rank ensemble sampling
X Lu, Z Wen, B Kveton
Proceedings of the 12th ACM Conference on Recommender Systems, 460-464, 2018
Cited by: 23; Year: 2018
An analysis of ensemble sampling
C Qin, Z Wen, X Lu, B Van Roy
Advances in Neural Information Processing Systems 35, 2022
Cited by: 22; Year: 2022
The neural testbed: Evaluating joint predictions
I Osband, Z Wen, SM Asghari, V Dwaracherla, X Lu, M Ibrahimi, ...
Advances in Neural Information Processing Systems 35, 12554-12565, 2022
Cited by: 18; Year: 2022
Approximate Thompson Sampling via Epistemic Neural Networks
I Osband, Z Wen, SM Asghari, V Dwaracherla, M Ibrahimi, X Lu, ...
Proceedings of the 39th Conference on Uncertainty in Artificial Intelligence, 2023
Cited by: 15; Year: 2023
Ensembles for uncertainty estimation: Benefits of prior functions and bootstrapping
V Dwaracherla, Z Wen, I Osband, X Lu, SM Asghari, B Van Roy
Transactions on Machine Learning Research, 2023
Cited by: 13; Year: 2023
From predictions to decisions: The importance of joint predictive distributions
Z Wen, I Osband, C Qin, X Lu, M Ibrahimi, V Dwaracherla, M Asghari, ...
arXiv preprint arXiv:2107.09224, 2021
Cited by: 11; Year: 2021
Evaluating High-Order Predictive Distributions in Deep Learning
I Osband, Z Wen, SM Asghari, V Dwaracherla, X Lu, B Van Roy
Proceedings of the 38th Conference on Uncertainty in Artificial Intelligence, 2022
Cited by: 7; Year: 2022
Information-directed sampling for reinforcement learning
X Lu
Stanford University, 2020
Cited by: 4; Year: 2020
Exploration using hyper-models
B Van Roy, X Lu, VR Dwaracherla, Z Wen, M Ibrahimi, IDM Osband
US Patent App. 17/639,504, 2022
Cited by: 1; Year: 2022
Robustness of epinets against distributional shifts
X Lu, I Osband, SM Asghari, S Gowal, V Dwaracherla, Z Wen, B Van Roy
arXiv preprint arXiv:2207.00137, 2022
Cited by: 1; Year: 2022
RLHF and IIA: Perverse Incentives
W Xu, S Dong, X Lu, G Lam, Z Wen, B Van Roy
ICML 2024 Workshop on Models of Human Feedback for AI Alignment, 2023
2023