Title | Authors | Venue | Cited by | Year |
A Methodology for the Development of RL-Based Adaptive Traffic Signal Controllers | GS Varela, PP Santos, A Sardinha, FS Melo | arXiv preprint arXiv:2101.09614, 2021 | 3 | 2021 |
The impact of data distribution on Q-learning with function approximation | PP Santos, DS Carvalho, A Sardinha, FS Melo | Machine Learning, 1-23, 2024 | 2* | 2024 |
Centralized Training with Hybrid Execution in Multi-Agent Reinforcement Learning | PP Santos, DS Carvalho, M Vasco, A Sardinha, PA Santos, A Paiva, ... | arXiv preprint arXiv:2210.06274, 2022 | 2 | 2022 |
Generalizing Objective-Specification in Markov Decision Processes | PP Santos | Proceedings of the 23rd International Conference on Autonomous Agents and …, 2024 | 1 | 2024 |
The Number of Trials Matters in Infinite-Horizon General-Utility Markov Decision Processes | PP Santos, A Sardinha, FS Melo | arXiv preprint arXiv:2409.15128, 2024 | | 2024 |
Traffic Light Control using Deep Reinforcement Learning | PP Santos | Instituto Superior Técnico, 2021 | | 2021 |