‪Jean-Baptiste Gaya‬ - ‪Google Scholar‬

Get my own profile

Cited by

	All	Since 2019
Citations	114	114
h-index	5	5
i10-index	4	4

0

70

35

20212022202320241 11 32 70

Public access

1 article

0 articles

available

not available

Based on funding mandates

Jean-Baptiste Gaya

Jean-Baptiste Gaya

Sorbonne Université

Verified email at isir.upmc.fr

Deep Reinforcement Learning Continual Learning Large Language Models


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Rewarded soups: towards pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards A Rame, G Couairon, C Dancette, JB Gaya, M Shukor, L Soulier, M Cord Advances in Neural Information Processing Systems 36, 2024	53	2024
Building a subspace of policies for scalable continual learning JB Gaya, T Doan, L Caccia, L Soulier, L Denoyer, R Raileanu 11th International Conference on Learning Representations (Spotlight), 2022	23	2022
Learning a subspace of policies for online adaptation in reinforcement learning JB Gaya, L Soulier, L Denoyer 10th International Conference on Learning Representations, 2021	19	2021
Salina: Sequential learning of agents L Denoyer, A De la Fuente, S Duong, JB Gaya, PA Kamienny, ... arXiv preprint arXiv:2110.07910, 2021	13	2021
Worldsense: A synthetic benchmark for grounded reasoning in large language models Y Benchekroun, M Dervishi, M Ibrahim, JB Gaya, X Martinet, G Mialon, ... arXiv preprint arXiv:2311.15930, 2023	6	2023
Subspaces of Policies for Deep Reinforcement Learning JB Gaya Sorbonne Université, 2024		2024

The system can't perform the operation now. Try again later.

Articles 1–6