Understanding the impact of entropy on policy optimization Z Ahmed, N Le Roux, M Norouzi, D Schuurmans International Conference on Machine Learning (ICML) 2019, 151-160, 2019 | 123 | 2019 |
InfoBot: Transfer and Exploration via the Information Bottleneck A Goyal, R Islam, D Strouse, Z Ahmed, M Botvinick, H Larochelle, ... International Conference on Learning Representations (ICLR) 2019, 2019 | 99 | 2019 |
What can I do here? A Theory of Affordances in Reinforcement Learning K Khetarpal, Z Ahmed, G Comanici, D Abel, D Precup International Conference on Machine Learning (ICML) 2020, 5479--5488, 2020 | 24 | 2020 |
Intratumor Heterogeneity and Circulating Tumor Cell Clusters Z Ahmed, S Gravel Molecular Biology and Evolution, 2017 | 15 | 2017 |
RE-EVALUATE: Reproducibility in Evaluating Reinforcement Learning Algorithms K Khetarpal, Z Ahmed, A Cianflone, R Islam, J Pineau 2nd Reproducibility in Machine Learning Workshop at ICML 2018, 2018 | 12 | 2018 |
Learning to prove from synthetic theorems E Aygün, Z Ahmed, A Anand, V Firoiu, X Glorot, L Orseau, D Precup, ... arXiv preprint arXiv:2006.11259, 2020 | 10 | 2020 |
Vfunc: a deep generative model for functions P Bachman, R Islam, A Sordoni, Z Ahmed Workshop on Prediction and Generative Modeling in Reinforcement Learning at …, 2018 | 8 | 2018 |
Marginalized state distribution entropy regularization in policy optimization R Islam, Z Ahmed, D Precup arXiv preprint arXiv:1912.05128, 2019 | 6 | 2019 |
AndroidEnv: A Reinforcement Learning Platform for Android D Toyama, P Hamel, A Gergely, G Comanici, A Glaese, Z Ahmed, ... arXiv preprint arXiv:2105.13231, 2021 | 5 | 2021 |
Training a First-Order Theorem Prover from Synthetic Data V Firoiu, E Aygun, A Anand, Z Ahmed, X Glorot, L Orseau, L Zhang, ... arXiv preprint arXiv:2103.03798, 2021 | 3 | 2021 |
Temporally abstract partial models K Khetarpal, Z Ahmed, G Comanici, D Precup Advances in Neural Information Processing Systems 34, 2021 | 2 | 2021 |
Generalized Policy Updates for Policy Optimization S Kumar, Z Ahmed, R Dadashi, D Schuurmans, MG Bellemare NeurIPS 2019 Optimization Foundations for Reinforcement Learning Workshop, 2019 | 2 | 2019 |
Discrete off-policy policy gradient using continuous relaxations A Cianflone, Z Ahmed, R Islam, AJ Bose, WL Hamilton Unpublished. https://joeybose. github. io/assets/Gradient_estimator. pdf, 2019 | 2 | 2019 |
Learning proposals for sequential importance samplers using reinforced variational inference Z Ahmed, A Karuvally, D Precup, S Gravel Deep RL Meets Structured Prediction Workshop at ICLR, 2019 | 1 | 2019 |
Learning how to Interact with a Complex Interface using Hierarchical Reinforcement Learning G Comanici, A Glaese, A Gergely, D Toyama, Z Ahmed, T Jackson, ... arXiv preprint arXiv:2204.10374, 2022 | | 2022 |
3-07 Reproducibility of Functional Modes Identified by Non-Negative Matrix Factorization of FMRI Depends On Pre-Processing Strategy Z Ahmed, D Kang, N Meyer, M In, Y Shu MEDICAL PHYSICS 48 (6), 2021 | | 2021 |
Unifying Variational Inference and Policy Optimization Z Ahmed McGill University, 2019 | | 2019 |