Grounding large language models in interactive environments with online reinforcement learning T Carta, C Romac, T Wolf, S Lamprier, O Sigaud, PY Oudeyer International Conference on Machine Learning, 3676-3713, 2023 | 161 | 2023 |
Eager: Asking and answering questions for automatic reward shaping in language-guided rl T Carta, PY Oudeyer, O Sigaud, S Lamprier Advances in Neural Information Processing Systems 35, 12478-12490, 2022 | 22 | 2022 |
Visualhints: A visual-lingual environment for multimodal reinforcement learning T Carta, S Chaudhury, K Talamadupula, M Tatsubori arXiv preprint arXiv:2010.13839, 2020 | 3 | 2020 |
Autotelic LLM-based exploration for goal-conditioned RL G Pourcel, T Carta, G Kovač, PY Oudeyer Intrinsically Motivated Open-ended Learning Workshop at NeurIPS 2024, 2024 | 1 | 2024 |
Codeplay: Autotelic Learning through Collaborative Self-Play in Programming Environments L Teodorescu, C Colas, M Bowers, T Carta, PY Oudeyer IMOL 2023-Intrinsically Motivated Open-ended Learning workshop at NeurIPS 2023, 2023 | 1 | 2023 |
Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting MS Aissi, C Romac, T Carta, S Lamprier, PY Oudeyer, O Sigaud, L Soulier, ... arXiv preprint arXiv:2410.19920, 2024 | | 2024 |
SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling L Gaven, C Romac, T Carta, S Lamprier, O Sigaud, PY Oudeyer arXiv preprint arXiv:2410.12481, 2024 | | 2024 |
Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting M Salim Aissi, C Romac, T Carta, S Lamprier, PY Oudeyer, O Sigaud, ... arXiv e-prints, arXiv: 2410.19920, 2024 | | 2024 |
Leveraging Visual Handicaps for Text-Based Reinforcement Learning S Chaudhury, K Murugesan, T Carta, K Talamadupula, M Tatsubori ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |
Les algorithmes des IA peuvent-ils comprendre notre monde? C Romac, T Carta, PY Oudeyer Pour la science, 2024 | | 2024 |
Les IA face au réel C Romac, T Carta, PY Oudeyer Pour la Science 557 (3), 24-31, 2024 | | 2024 |
VISUALHANDICAPS: Systematic Handicaps for Text-based Games S Chaudhury, K Murugesan, T CARTA, K Talamadupula, M Tatsubori | | |