Suivre
Edward Beeching
Edward Beeching
PhD Student, INSA Lyon, France
Adresse e-mail validée de insa-lyon.fr
Titre
Citée par
Citée par
Année
Zephyr: Direct distillation of lm alignment
L Tunstall, E Beeching, N Lambert, N Rajani, K Rasul, Y Belkada, ...
arXiv preprint arXiv:2310.16944, 2023
1152023
Open llm leaderboard
E Beeching, C Fourrier, N Habib, S Han, N Lambert, N Rajani, ...
Hugging Face, 2023
902023
Trl: Transformer reinforcement learning
L von Werra, Y Belkada, L Tunstall, E Beeching, T Thrush, N Lambert, ...
GitHub. Available online at: https://github. com/lvwerra/trl, 2020
552020
Learning to plan with uncertain topological maps
E Beeching, J Dibangoye, O Simonin, C Wolf
European Conference on Computer Vision, 473-490, 2020
412020
Deep reinforcement learning on a budget: 3d control and reasoning without a supercomputer
E Beeching, J Debangoye, O Simonin, C Wolf
2020 25th International Conference on Pattern Recognition (ICPR), 158-165, 2021
282021
Egomap: Projective mapping and structured egocentric memory for deep RL
E Beeching, J Dibangoye, O Simonin, C Wolf
Joint European conference on machine learning and knowledge discovery in …, 2020
252020
Stackllama: An rl fine-tuned llama model for stack exchange question and answering, 2023
E Beeching, Y Belkada, K Rasul, L Tunstall, L von Werra, N Rajani, ...
URL https://huggingface. co/blog/stackllama 1, 2023
142023
Graph augmented deep reinforcement learning in the gamerland3d environment
E Beeching, M Peter, P Marcotte, J Debangoye, O Simonin, J Romoff, ...
arXiv preprint arXiv:2112.11731, 2021
92021
Godot reinforcement learning agents
E Beeching, J Debangoye, O Simonin, C Wolf
arXiv preprint arXiv:2112.03636, 2021
92021
The alignment handbook
L Tunstall, E Beeching, N Lambert, N Rajani, AM Rush, T Wolf
GitHub repository, 2023
72023
Creating a Coding Assistant with StarCoder. Hugging Face Blog
L Tunstall, N Lambert, N Rajani, E Beeching, T Le Scao, L von Werra, ...
62023
Creating a Coding Assistant with StarCoder
L Tunstall, N Lambert, N Rajani, E Beeching, T Le Scao, L von Werra, ...
Hugging Face Blog, 2023
52023
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent
Q Gallouédec, E Beeching, C Romac, E Dellandréa
arXiv preprint arXiv:2402.09844, 2024
2024
Large-scale automatic learning of autonomous agent behavior with structured deep reinforcement learning
E Beeching
Université de Lyon, 2022
2022
Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.
Articles 1–14