Rlaif: Scaling reinforcement learning from human feedback with ai feedback H Lee, S Phatale, H Mansoor, K Lu, T Mesnard, C Bishop, V Carbune, ... arXiv preprint arXiv:2309.00267, 2023 | 161 | 2023 |
Prose for a painting P Kashyap, S Phatale, I Drori arXiv preprint arXiv:1910.03634, 2019 | 3 | 2019 |
SAFE: Software-defined authentication framework AV Kamath, S S, K Kataoka, N Vijayvergiya, GB Reddy, S Phatale Proceedings of the 12th Asian Internet Engineering Conference, 57-63, 2016 | 1 | 2016 |
PERL: Parameter Efficient Reinforcement Learning from Human Feedback H Sidahmed, S Phatale, A Hutcheson, Z Lin, Z Chen, Z Yu, J Jin, ... arXiv preprint arXiv:2403.10704, 2024 | | 2024 |
Conversational Recommendation as Retrieval: A Simple, Strong Baseline R Gupta, R Aksitov, S Phatale, S Chaudhary, H Lee, A Rastogi arXiv preprint arXiv:2305.13725, 2023 | | 2023 |