Cosmin Paduraru

Cited by

	All	Since 2019
Citations	2977	2891
h-index	17	16
i10-index	20	17

1500

750

375

1125

2017201820192020202120222023202412 18 41 95 250 405 637 1455

Public access

View all

7 articles

0 articles

available

not available

Based on funding mandates

Cosmin Paduraru

DeepMind

Verified email at google.com


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	1042	2023
Challenges of real-world reinforcement learning: definitions, benchmarks and analysis G Dulac-Arnold, N Levine, DJ Mankowitz, J Li, C Paduraru, S Gowal, ... Machine Learning 110 (9), 2419-2468, 2021	533*	2021
Safe exploration in continuous action spaces G Dalal, K Dvijotham, M Vecerik, T Hester, C Paduraru, Y Tassa arXiv preprint arXiv:1801.08757, 2018	488	2018
Rl unplugged: A suite of benchmarks for offline reinforcement learning C Gulcehre, Z Wang, A Novikov, T Paine, S Gómez, K Zolna, R Agarwal, ... Advances in Neural Information Processing Systems 33, 7248-7259, 2020	186	2020
Hyperparameter selection for offline reinforcement learning TL Paine, C Paduraru, A Michi, C Gulcehre, K Zolna, A Novikov, Z Wang, ... arXiv preprint arXiv:2007.09055, 2020	150	2020
Faster sorting algorithms discovered using deep reinforcement learning DJ Mankowitz, A Michi, A Zhernov, M Gelmi, M Selvi, C Paduraru, ... Nature 618 (7964), 257-263, 2023	123	2023
Benchmarks for deep off-policy evaluation J Fu, M Norouzi, O Nachum, G Tucker, Z Wang, A Novikov, M Yang, ... arXiv preprint arXiv:2103.16596, 2021	79	2021
Autoregressive dynamics models for offline policy evaluation and optimization MR Zhang, TL Paine, O Nachum, C Paduraru, G Tucker, Z Wang, ... arXiv preprint arXiv:2104.13877, 2021	44	2021
Off-policy evaluation in Markov decision processes C Paduraru McGill University, 2013	39	2013
Coptidice: Offline constrained reinforcement learning via stationary distribution correction estimation J Lee, C Paduraru, DJ Mankowitz, N Heess, D Precup, KE Kim, A Guez arXiv preprint arXiv:2204.08957, 2022	32	2022
Responding to new information in a mining complex: Fast mechanisms using machine learning C Paduraru, R Dimitrakopoulos Mining Technology, 2019	28	2019
Off-policy learning with options and recognizers D Precup, C Paduraru, A Koop, RS Sutton, S Singh Advances in Neural Information Processing Systems 18, 2005	28	2005
Adaptive policies for short-term material flow optimization in a mining complex C Paduraru, R Dimitrakopoulos Mining Technology 127 (1), 56-63, 2018	26	2018
Controlling commercial cooling systems using reinforcement learning J Luo, C Paduraru, O Voicu, Y Chervonyi, S Munns, J Li, C Qian, P Dutta, ... arXiv preprint arXiv:2211.07357, 2022	24	2022
Active offline policy selection K Konyushova, Y Chen, T Paine, C Gulcehre, C Paduraru, DJ Mankowitz, ... Advances in Neural Information Processing Systems 34, 24631-24644, 2021	24	2021
Transformers meet directed graphs S Geisler, Y Li, DJ Mankowitz, AT Cemgil, S Günnemann, C Paduraru International Conference on Machine Learning, 11144-11172, 2023	22	2023
Development and validation of a supervised machine learning radar Doppler spectra peak-finding algorithm H Kalesse, T Vogl, C Paduraru, E Luke Atmospheric Measurement Techniques 12 (8), 4591-4617, 2019	19	2019
Grounding Abstractions in Predictive State Representations. B Tanner, V Bulitko, A Koop, C Paduraru IJCAI, 1077-1082, 2007	14	2007
Robust constrained reinforcement learning for continuous control with model misspecification DJ Mankowitz, DA Calian, R Jeong, C Paduraru, N Heess, S Dathathri, ... arXiv preprint arXiv:2010.10644, 2020	11	2020
An empirical analysis of off-policy learning in discrete mdps C Păduraru, D Precup, J Pineau, G Comănici European Workshop on Reinforcement Learning, 89-102, 2013	10	2013

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by