Suivre
Mastane Achab
Mastane Achab
Technology Innovation Institute (TII)
Adresse e-mail validée de tii.ae - Page d'accueil
Titre
Citée par
Citée par
Année
Weighted empirical risk minimization: Sample selection bias correction based on importance sampling
M Achab, S Cl{\'e}men{\c{c}}on, C Tillier, R Vogel
Proceedings of the International Conference on Machine Learning, Artificial …, 2020
17*2020
Max K-Armed Bandit: On the ExtremeHunter Algorithm and Beyond
M Achab, S Clémençon, A Garivier, A Sabourin, C Vernade
Machine Learning and Knowledge Discovery in Databases: European Conference …, 2017
162017
Profitable bandits
M Achab, S Clémençon, A Garivier
Asian Conference on Machine Learning, 694-709, 2018
112018
Ranking data with continuous labels through oriented recursive partitions
S Clémençon, M Achab
Advances in Neural Information Processing Systems, 4600-4608, 2017
102017
Ranking and risk-aware reinforcement learning| Theses. fr
M Achab
Institut polytechnique de Paris, 2020
6*2020
One-Step Distributional Reinforcement Learning
M Achab, R Alami, YA Dahou Djilali, K Fedyanin, E Moulines
Transactions on Machine Learning Research, 2023
32023
Distributional deep Q-learning with CVaR regression
M Achab, R Alami, YAD Djilali, K Fedyanin, E Moulines, M Panov
Deep Reinforcement Learning Workshop NeurIPS 2022, 2022
32022
Robustness and risk management via distributional dynamic programming
M Achab, G Neu
arXiv preprint arXiv:2112.15430, 2021
32021
Dimensionality Reduction and (Bucket) Ranking: a Mass Transportation Approach
M Achab, A Korba, S Clémençon
Algorithmic Learning Theory, 64-93, 2019
32019
Investigating Regularization of Self-Play Language Models
R Alami, A Abubaker, M Achab, MEA Seddik, S Lahlou
arXiv preprint arXiv:2404.04291, 2024
12024
A Nested Matrix-Tensor Model for Noisy Multi-view Clustering
MEA Seddik, M Achab, H Goulart, M Debbah
arXiv preprint arXiv:2305.19992, 2023
12023
A Risk-Averse Framework for Non-Stationary Stochastic Multi-Armed Bandits
R Alami, M Mahfoud, M Achab
MAB-KD Workshop ICDM 2023, 2023
2023
Deep Reinforcement Learning Algorithms for Hybrid V2X Communication: A Benchmarking Study
F Boukhalfa, R Alami, M Achab, E Moulines, M Bennis
https://arxiv.org/abs/2310.03767, 2023
2023
Beyond Log-Concavity: Theory and Algorithm for Sum-Log-Concave Optimization
M Achab
https://arxiv.org/abs/2309.15298, 2023
2023
Checkered Regression
M Achab
TechRxiv preprint, 2022
2022
Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.
Articles 1–15