Title | Authors | Venue | Cited by | Year |
Self-modification of policy and utility function in rational agents | T Everitt, D Filan, M Daswani, M Hutter | Artificial General Intelligence: 9th International Conference, AGI 2016, New …, 2016 | 30 | 2016 |
Q-learning for history-based reinforcement learning | M Daswani, P Sunehag, M Hutter | Asian Conference on Machine Learning, 213-228, 2013 | 22 | 2013 |
COVID-19 Open-Data: curating a fine-grained, global-scale data repository for SARS-CoV-2 | O Wahltinez, K Murphy, M Brenner, M Lee, A Erlinger, M Daswani, ... | Work in progress, 2021 | 17 | 2021 |
Feature reinforcement learning: state of the art | M Daswani, P Sunehag, M Hutter | Sequential decision-making with big data: papers from the AAAI-14 workshop, 2014 | 16 | 2014 |
A definition of happiness for reinforcement learning agents | M Daswani, J Leike | Artificial General Intelligence: 8th International Conference, AGI 2015, AGI …, 2015 | 15 | 2015 |
COVID-19 Open-Data: curating a fine-grained, global-scale data repository for SARS-CoV-2. 2020 | O Wahltinez, K Murphy, M Brenner | Work in progress, 2020 | 11 | 2020 |
COVID-19 Open-Data a global-scale spatially granular meta-dataset for coronavirus disease | O Wahltinez, A Cheung, R Alcantara, D Cheung, M Daswani, A Erlinger, ... | Scientific data 9 (1), 162, 2022 | 7 | 2022 |
Reinforcement learning with value advice | M Daswani, P Sunehag, M Hutter | Asian Conference on Machine Learning, 299-314, 2015 | 7 | 2015 |
Network partition handling in fault-tolerant key management system | J Leiseboer, M Daswani, T Bradbury, F Poppa, K Chong, J Green, ... | US Patent 10,671,643, 2020 | 6 | 2020 |
Feature Reinforcement Learning using Looping Suffix Trees | M Daswani, P Sunehag, M Hutter | JMLR Workshop and Conference Proceedings: EWRL 2012 24, 11-24, 2012 | 6 | 2012 |
Enhancing the reliability and accuracy of AI-enabled diagnosis via complementarity-driven deferral to clinicians | K Dvijotham, J Winkens, M Barsbey, S Ghaisas, R Stanforth, N Pawlowski, ... | Nature Medicine 29 (7), 1814-1820, 2023 | 4 | 2023 |
Fault-tolerant key management system | J Leiseboer, M Daswani, T Bradbury, F Poppa, K Chong, J Green, ... | US Patent 10,606,864, 2020 | 2 | 2020 |
Generic Reinforcement Learning Beyond Small MDPs | M Daswani | The Australian National University, 2015 | 2 | 2015 |
Enhancing the reliability and accuracy of AI-enabled diagnosis via complementarity-driven deferral to clinicians (CoDoC) | K Dvijotham, J Winkens, M Barsbey, S Ghaisas, N Pawlowski, R Stanforth, ... | | 1 | 2022 |
Fault-tolerant key management system | J Leiseboer, M Daswani, T Bradbury, F Poppa, K Chong, J Green, ... | US Patent 11,354,336, 2022 | | 2022 |