Paul Weng
Title
Cited by
Year
Dual graph attention networks for deep latent representation of multifaceted social effects in recommender systems
Q Wu, H Zhang, X Gao, P He, P Weng, H Gao, G Chen
The world wide web conference, 2091-2102, 2019
Cited by: 351; Year: 2019
Analytics and machine learning in vehicle routing research
R Bai, X Chen, ZL Chen, T Cui, S Gong, W He, X Jiang, H Jin, J Jin, ...
International Journal of Production Research 61 (1), 4-30, 2023
Cited by: 105; Year: 2023
Top-k selection based on adaptive sampling of noisy preferences
R Busa-Fekete, B Szörényi, P Weng, W Cheng, E Hüllermeier
International Conference on Machine Learning, 1094-1102, 2013
Cited by: 96; Year: 2013
Learning fair policies in multi-objective (deep) reinforcement learning with average and discounted rewards
U Siddique, P Weng, M Zimmer
International Conference on Machine Learning, 8905-8915, 2020
Cited by: 90; Year: 2020
Teacher-student framework: a reinforcement learning approach
M Zimmer, P Viappiani, P Weng
AAMAS Workshop autonomous robots and multirobot systems, 2014
Cited by: 83; Year: 2014
A survey on interpretable reinforcement learning
C Glanois, P Weng, M Zimmer, D Li, T Yang, J Hao, W Liu
Machine Learning, 1-44, 2024
Cited by: 80; Year: 2024
Preference-based reinforcement learning: evolutionary direct policy search using a preference-based racing algorithm
R Busa-Fekete, B Szörényi, P Weng, W Cheng, E Hüllermeier
Machine Learning 97, 327-351, 2014
Cited by: 73; Year: 2014
A survey of reinforcement learning from human feedback
T Kaufmann, P Weng, V Bengs, E Hüllermeier
arXiv preprint arXiv:2312.14925, 2023
Cited by: 68; Year: 2023
On finding compromise solutions in multiobjective Markov decision processes
P Perny, P Weng
ECAI 2010, 969-970, 2010
Cited by: 65; Year: 2010
Dual sequential prediction models linking sequential recommendation and information dissemination
Q Wu, Y Gao, X Gao, P Weng, G Chen
Proceedings of the 25th ACM SIGKDD international conference on knowledge …, 2019
Cited by: 64; Year: 2019
Optimization of probabilistic argumentation with Markov decision models
E Hadoux, A Beynier, N Maudet, P Weng, A Hunter
International Joint Conference on Artificial Intelligence, 2015
Cited by: 57; Year: 2015
Qualitative multi-armed bandits: A quantile-based approach
B Szörényi, R Busa-Fekete, P Weng, E Hüllermeier
International Conference on Machine Learning, 1660-1668, 2015
Cited by: 57; Year: 2015
Learning fair policies in decentralized cooperative multi-agent reinforcement learning
M Zimmer, C Glanois, U Siddique, P Weng
International Conference on Machine Learning, 12967-12978, 2021
Cited by: 56; Year: 2021
Multi-objective bandits: Optimizing the generalized gini index
R Busa-Fekete, B Szörényi, P Weng, S Mannor
International Conference on Machine Learning, 625-634, 2017
Cited by: 47; Year: 2017
Invariant transform experience replay: Data augmentation for deep reinforcement learning
Y Lin, J Huang, M Zimmer, Y Guan, J Rojas, P Weng
IEEE Robotics and Automation Letters 5 (4), 6615-6622, 2020
Cited by: 43; Year: 2020
Decomposition methods for distributed optimal power flow: panorama and case studies of the DC model
MH Amini, S Bahrami, F Kamyab, S Mishra, R Jaddivada, K Boroojeni, ...
Classical and recent aspects of power system optimization, 137-155, 2018
Cited by: 43; Year: 2018
Interactive value iteration for Markov decision processes with unknown rewards
P Weng, B Zanuttini
IJCAI'13-Twenty-Third international joint conference on Artificial …, 2013
Cited by: 43; Year: 2013
Algebraic Markov decision processes
P Perny, O Spanjaard, P Weng
19th International Joint Conference on Artificial Intelligence, 1372-1377, 2005
Cited by: 43; Year: 2005
Sequential decision-making under non-stationary environments via sequential change-point detection
E Hadoux, A Beynier, P Weng
Learning over multiple contexts (LMCE), 2014
Cited by: 40; Year: 2014
Hierarchical electric vehicle charging aggregator strategy using Dantzig-Wolfe decomposition
MH Amini, P McNamara, P Weng, O Karabasoglu, Y Xu
IEEE Design & Test 35 (6), 25-36, 2017
Cited by: 38; Year: 2017