Teacher-student framework: a reinforcement learning approach M Zimmer, P Viappiani, P Weng AAMAS Workshop Autonomous Robots and Multirobot Systems, 2014 | 73 | 2014 |
Learning fair policies in multi-objective (deep) reinforcement learning with average and discounted rewards U Siddique, P Weng, M Zimmer International Conference on Machine Learning, 8905-8915, 2020 | 53 | 2020 |
Learning fair policies in decentralized cooperative multi-agent reinforcement learning M Zimmer, C Glanois, U Siddique, P Weng International Conference on Machine Learning, 12967-12978, 2021 | 31 | 2021 |
A survey on interpretable reinforcement learning C Glanois, P Weng, M Zimmer, D Li, T Yang, J Hao, W Liu arXiv preprint arXiv:2112.13112, 2021 | 30 | 2021 |
Invariant transform experience replay: Data augmentation for deep reinforcement learning Y Lin, J Huang, M Zimmer, Y Guan, J Rojas, P Weng IEEE Robotics and Automation Letters 5 (4), 6615-6622, 2020 | 26 | 2020 |
Bootstrapping Q-Learning for Robotics from Neuro-Evolution Results M Zimmer, S Doncieux IEEE Transactions on Cognitive and Developmental Systems, 2017 | 24 | 2017 |
Differentiable logic machines M Zimmer, X Feng, C Glanois, Z Jiang, J Zhang, P Weng, L Dong, ... arXiv preprint arXiv:2102.11529, 2021 | 15 | 2021 |
Neural Fitted Actor-Critic M Zimmer, Y Boniface, A Dutech ESANN - European Symposium on Artificial Neural Networks, Computational …, 2016 | 15 | 2016 |
Exploiting the sign of the advantage function to learn deterministic policies in continuous domains M Zimmer, P Weng International Joint Conference on Artificial Intelligence, 2019 | 11 | 2019 |
Developmental reinforcement learning through sensorimotor space enlargement M Zimmer, Y Boniface, A Dutech International Conference on Development and Learning and on Epigenetic Robotics, 2018 | 11 | 2018 |
Neuro-symbolic hierarchical rule induction C Glanois, Z Jiang, X Feng, P Weng, M Zimmer, D Li, W Liu, J Hao International Conference on Machine Learning, 7583-7615, 2022 | 10 | 2022 |
Apprentissage par renforcement développemental M Zimmer Université de Lorraine, 2018 | 7 | 2018 |
Hyperparameter auto-tuning in self-supervised robotic learning J Huang, J Rojas, M Zimmer, H Wu, Y Guan, P Weng IEEE Robotics and Automation Letters 6 (2), 3537-3544, 2021 | 4 | 2021 |
Towards More Sample Efficiency in Reinforcement Learning with Data Augmentation Y Lin, J Huang, M Zimmer, J Rojas, P Weng Robot Learning Workshop, NeurIPS 2019, 2019 | 3 | 2019 |
Off-Policy Neural Fitted Actor-Critic M Zimmer, Y Boniface, A Dutech Deep Reinforcement Learning Workshop, NIPS 2016, 2016 | 2 | 2016 |
Invariant transform experience replay Y Lin, J Huang, M Zimmer, J Rojas, P Weng arXiv preprint arXiv:1909.10707, 2019 | 1 | 2019 |
Toward a data efficient neural actor-critic M Zimmer, Y Boniface, A Dutech European Workshop on Reinforcement Learning, 2016 | 1 | 2016 |
Lightweight Structural Choices Operator for Technology Mapping A Grosnit, M Zimmer, R Tutunov, X Li, L Chen, F Yang, M Yuan, ... 2023 60th ACM/IEEE Design Automation Conference (DAC), 1-6, 2023 | | 2023 |
End-to-End Meta-Bayesian Optimisation with Transformer Neural Processes A Maraval, M Zimmer, A Grosnit, HB Ammar arXiv preprint arXiv:2305.15930, 2023 | | 2023 |
Sample-Efficient Optimisation with Probabilistic Transformer Surrogates A Maraval, M Zimmer, A Grosnit, R Tutunov, J Wang, HB Ammar arXiv preprint arXiv:2205.13902, 2022 | | 2022 |