Independent reinforcement learners in cooperative Markov games: a survey regarding coordination problems L Matignon, GJ Laurent, N Le Fort-Piat The Knowledge Engineering Review 27 (01), 1-31, 2012 | 220 | 2012 |
Hysteretic q-learning: an algorithm for decentralized reinforcement learning in cooperative multi-agent teams L Matignon, GJ Laurent, L Fort-Piat Intelligent Robots and Systems, 2007. IROS 2007. IEEE/RSJ International …, 2007 | 144 | 2007 |
Coordinated Multi-Robot Exploration Under Communication Constraints Using Decentralized Markov Decision Processes. L Matignon, L Jeanpierre, AI Mouaddib AAAI, 2012 | 89 | 2012 |
Reward function and initial values: better choices for accelerated goal-directed reinforcement learning L Matignon, GJ Laurent, N Le Fort-Piat Artificial Neural Networks–ICANN 2006, 840-849, 2006 | 65 | 2006 |
The world of independent learners is not markovian GJ Laurent, L Matignon, NL Fort-Piat International Journal of Knowledge-based and Intelligent Engineering Systems …, 2011 | 53 | 2011 |
Distributed control architecture for smart surfaces K Boutoustous, GJ Laurent, E Dedu, L Matignon, J Bourgeois, L Fort-Piat Intelligent Robots and Systems (IROS), 2010 IEEE/RSJ International …, 2010 | 47 | 2010 |
Distributed value functions for multi-robot exploration L Matignon, L Jeanpierre, AI Mouaddib Robotics and Automation (ICRA), 2012 IEEE International Conference on, 1544-1550, 2012 | 39 | 2012 |
A survey on intrinsic motivation in reinforcement learning A Aubret, L Matignon, S Hassas arXiv preprint arXiv:1908.06976, 2019 | 30 | 2019 |
Designing decentralized controllers for distributed-air-jet mems-based micromanipulators by reinforcement learning L Matignon, GJ Laurent, N Le Fort-Piat, YA Chapuis Journal of intelligent & robotic systems 59 (2), 145-166, 2010 | 23 | 2010 |
A study of FMQ heuristic in cooperative multi-agent games. L Matignon, GJ Laurent, N Le Fort-Piat The 7th International Conference on Autonomous Agents and Multiagent Systems …, 2008 | 17 | 2008 |
Improving reinforcement learning speed for robot control L Matignon, GJ Laurent, L Fort-Piat Intelligent Robots and Systems, 2006 IEEE/RSJ International Conference on …, 2006 | 15 | 2006 |
A Model for Verbal and Non-Verbal Human-Robot Collaboration. L Matignon, AB Karami, AI Mouaddib AAAI Fall Symposium: Dialog with Robots, 2010 | 14 | 2010 |
Multi-robot simultaneous coverage and mapping of complex scene-comparison of different strategies L Matignon, O Simonin Proceedings of the 17th International Conference on Autonomous Agents and …, 2018 | 13 | 2018 |
Multi-Robot Simultaneous Coverage and Mapping of Complex Scene L Matignon, O Simonin Proceedings of the 17th International Conference on Autonomous Agents and …, 2018 | 13* | 2018 |
Design of semi-decentralized control laws for distributed-air-jet micromanipulators by reinforcement learning L Matignon, GJ Laurent, L Fort-Piat Intelligent Robots and Systems, 2009. IROS 2009. IEEE/RSJ International …, 2009 | 10 | 2009 |
Coordination of independent learners in cooperative Markov games. L Matignon, GJ Laurent, N Le Fort-Piat | 10 | 2009 |
Distributed value functions for multi-robot exploration: a position paper L Matignon, L Jeanpierre, AI Mouaddib Multi-Agent Sequential Decision Making in Uncertain Multi-Agent Domain (MSDM …, 2012 | 6 | 2012 |
How explainable plans can make planning faster A Grea, L Matignon, S Aknine Workshop on Explainable Artificial Intelligence, 58, 2018 | 5 | 2018 |
Modeling Biological Agents Beyond the Reinforcement-learning Paradigm OL Georgeon, RC Casado, LA Matignon Procedia Computer Science 71, 17-22, 2015 | 5 | 2015 |
Cooperative multi-agent policy gradient G Bono, JS Dibangoye, L Matignon, F Pereyron, O Simonin Joint European Conference on Machine Learning and Knowledge Discovery in …, 2018 | 4 | 2018 |