Matthieu Zimmer

Cited by

	All	Since 2019
Citations	486	452
h-index	12	10
i10-index	12	10

Citations per year
Year	2016	2017	2018	2019	2020	2021	2022	2023	2024
Citations	7	9	13	20	29	54	82	149	118

Public access

4 articles available, 0 articles not available

Based on funding mandates

Co-authors

Paul Weng, Duke Kunshan University. Verified email at duke.edu
Juan Rojas, Lipscomb University. Verified email at lipscomb.edu
Paolo Viappiani, CNRS researcher, LAMSADE, Université Paris Dauphine PSL. Verified email at dauphine.psl.eu
Stephane Doncieux, Professor of Computer Science, ISIR, Sorbonne University-CNRS. Verified email at isir.upmc.fr

Matthieu Zimmer

RL Research Scientist @ Huawei Noah’s Ark Lab

Verified email at matthieu-zimmer.net - Homepage

artificial intelligence: learning, developmental learning, reinforcement learning, neural networks


Title	Cited by	Year
Learning fair policies in multi-objective (deep) reinforcement learning with average and discounted rewards. U Siddique, P Weng, M Zimmer. International Conference on Machine Learning, 8905-8915, 2020	86	2020
Teacher-student framework: a reinforcement learning approach. M Zimmer, P Viappiani, P Weng. AAMAS Workshop Autonomous Robots and Multirobot Systems, 2014	81	2014
A survey on interpretable reinforcement learning. C Glanois, P Weng, M Zimmer, D Li, T Yang, J Hao, W Liu. Machine Learning, 1-44, 2024	67	2024
Learning fair policies in decentralized cooperative multi-agent reinforcement learning. M Zimmer, C Glanois, U Siddique, P Weng. International Conference on Machine Learning, 12967-12978, 2021	49	2021
Invariant transform experience replay: Data augmentation for deep reinforcement learning. Y Lin, J Huang, M Zimmer, Y Guan, J Rojas, P Weng. IEEE Robotics and Automation Letters 5 (4), 6615-6622, 2020	41	2020
Bootstrapping Q-Learning for Robotics from Neuro-Evolution Results. M Zimmer, S Doncieux. IEEE Transactions on Cognitive and Developmental Systems, 2017	29	2017
Neuro-symbolic hierarchical rule induction. C Glanois, Z Jiang, X Feng, P Weng, M Zimmer, D Li, W Liu, J Hao. International Conference on Machine Learning, 7583-7615, 2022	25	2022
Differentiable logic machines. M Zimmer, X Feng, C Glanois, Z Jiang, J Zhang, P Weng, D Li, J Hao, ... arXiv preprint arXiv:2102.11529, 2021	23	2021
Neural Fitted Actor-Critic. M Zimmer, Y Boniface, A Dutech. ESANN - European Symposium on Artificial Neural Networks, Computational …, 2016	15	2016
Developmental reinforcement learning through sensorimotor space enlargement. M Zimmer, Y Boniface, A Dutech. International Conference on Development and Learning and on Epigenetic Robotics, 2018	13	2018
Exploiting the sign of the advantage function to learn deterministic policies in continuous domains. M Zimmer, P Weng. International Joint Conference on Artificial Intelligence, 2019	12	2019
Apprentissage par renforcement développemental. M Zimmer. Université de Lorraine, 2018	12	2018
End-to-end meta-bayesian optimisation with transformer neural processes. A Maraval, M Zimmer, A Grosnit, H Bou Ammar. Advances in Neural Information Processing Systems 36, 2024	7	2024
Pangu-agent: A fine-tunable generalist agent with structured reasoning. F Christianos, G Papoudakis, M Zimmer, T Coste, Z Wu, J Chen, ... arXiv preprint arXiv:2312.14878, 2023	7	2023
Hyperparameter auto-tuning in self-supervised robotic learning. J Huang, J Rojas, M Zimmer, H Wu, Y Guan, P Weng. IEEE Robotics and Automation Letters 6 (2), 3537-3544, 2021	7	2021
Towards More Sample Efficiency in Reinforcement Learning with Data Augmentation. Y Lin, J Huang, M Zimmer, J Rojas, P Weng. Robot Learning Workshop, NeurIPS 2019, 2019	5	2019
Off-Policy Neural Fitted Actor-Critic. M Zimmer, Y Boniface, A Dutech. Deep Reinforcement Learning Workshop, NIPS 2016, 2016	2	2016
ROS-LLM: A ROS framework for embodied AI with task feedback and structured reasoning. CE Mower, Y Wan, H Yu, A Grosnit, J Gonzalez-Billandon, M Zimmer, ... arXiv preprint arXiv:2406.19741, 2024	1	2024
Automatic Unit Test Data Generation and Actor-Critic Reinforcement Learning for Code Synthesis. PJ Gorinski, M Zimmer, G Lampouras, DGX Deik, I Iacobacci. arXiv preprint arXiv:2310.13669, 2023	1	2023
Sample-efficient optimisation with probabilistic transformer surrogates. A Maraval, M Zimmer, A Grosnit, R Tutunov, J Wang, HB Ammar. arXiv preprint arXiv:2205.13902, 2022	1	2022
