Max K-Armed Bandit: On the ExtremeHunter Algorithm and Beyond M Achab, S Clémençon, A Garivier, A Sabourin, C Vernade Joint European Conference on Machine Learning and Knowledge Discovery in …, 2017 | 10 | 2017 |
Profitable bandits M Achab, S Clémençon, A Garivier Asian Conference on Machine Learning, 694-709, 2018 | 8 | 2018 |
Ranking data with continuous labels through oriented recursive partitions S Clémençon, M Achab Advances in Neural Information Processing Systems, 4600-4608, 2017 | 8 | 2017 |
Weighted Empirical Risk Minimization: Sample Selection Bias Correction based on Importance Sampling R Vogel, M Achab, S Clémençon, C Tillier arXiv preprint arXiv:2002.05145, 2020 | 5 | 2020 |
Ranking and risk-aware reinforcement learning M Achab Institut polytechnique de Paris, 2020 | 3 | 2020 |
Robustness and risk management via distributional dynamic programming M Achab, G Neu arXiv preprint arXiv:2112.15430, 2021 | 2 | 2021 |
Dimensionality Reduction and (Bucket) Ranking: a Mass Transportation Approach M Achab, A Korba, S Clémençon Algorithmic Learning Theory, 64-93, 2019 | 1 | 2019 |
Checkered Regression M Achab TechRxiv preprint, 2022 | | 2022 |
Distributional deep Q-learning with CVaR regression M Achab, R ALAMI, YAD DJILALI, K Fedyanin, E Moulines, M Panov Deep Reinforcement Learning Workshop NeurIPS 2022, 0 | | |