Suivre
Sobhan Miryoosefi
Titre
Citée par
Citée par
Année
Bellman Eluder dimension: New rich classes of RL problems, and sample-efficient algorithms
C Jin, Q Liu, S Miryoosefi
Advances in Neural Information Processing Systems 34, 13406-13418, 2021
2152021
Reinforcement learning with convex constraints
S Miryoosefi, K Brantley, H Daumé III, M Dudík, R Schapire
Advances in Neural Information Processing Systems 32, 14093-14102, 2019
942019
Constrained episodic reinforcement learning in concave-convex and knapsack settings
K Brantley, M Dudik, T Lykouris, S Miryoosefi, M Simchowitz, A Slivkins, ...
Advances in Neural Information Processing Systems 33, 16315-16326, 2020
492020
Provable reinforcement learning with a short-term memory
Y Efroni, C Jin, A Krishnamurthy, S Miryoosefi
International Conference on Machine Learning, 5832-5850, 2022
302022
A simple reward-free approach to constrained reinforcement learning
S Miryoosefi, C Jin
International Conference on Machine Learning, 15666-15698, 2022
292022
Rest meets react: Self-improvement for multi-step reasoning llm agent
R Aksitov, S Miryoosefi, Z Li, D Li, S Babayan, K Kopparapu, Z Fisher, ...
arXiv preprint arXiv:2312.10003, 2023
72023
Efficient training of language models using few-shot learning
SJ Reddi, S Miryoosefi, S Karp, S Krishnan, S Kale, S Kim, S Kumar
International Conference on Machine Learning, 14553-14568, 2023
22023
Efficient Stagewise Pretraining via Progressive Subnetworks
A Panigrahi, N Saunshi, K Lyu, S Miryoosefi, S Reddi, S Kale, S Kumar
arXiv preprint arXiv:2402.05913, 2024
2024
Provable Reinforcement Learning with Constraints and Function Approximation
SSM Yoosefi
Princeton University, 2022
2022
Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.
Articles 1–9