PaLM: Scaling Language Modeling with Pathways A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ... arXiv preprint arXiv:2204.02311, 2022 | 1517 | 2022 |
Chain of thought prompting elicits reasoning in large language models J Wei, X Wang, D Schuurmans, M Bosma, E Chi, Q Le, D Zhou NeurIPS 2022, 2022 | 1430* | 2022 |
Finetuned language models are zero-shot learners J Wei, M Bosma, VY Zhao, K Guu, AW Yu, B Lester, N Du, AM Dai, QV Le ICLR 2022, 2021 | 875 | 2021 |
LaMDA: Language models for dialog applications R Thoppilan, D De Freitas, J Hall, N Shazeer, A Kulshreshtha, HT Cheng, ... arXiv preprint arXiv:2201.08239, 2022 | 653* | 2022 |
Emergent abilities of large language models J Wei, Y Tay, R Bommasani, C Raffel, B Zoph, S Borgeaud, D Yogatama, ... Transactions on Machine Learning Research, 2022 | 622 | 2022 |
Program synthesis with large language models J Austin, A Odena, M Nye, M Bosma, H Michalewski, D Dohan, E Jiang, ... arXiv preprint arXiv:2108.07732, 2021 | 328 | 2021 |
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ... arXiv preprint arXiv:2206.04615, 2022 | 322 | 2022 |
GLaM: Efficient scaling of language models with mixture-of-experts N Du, Y Huang, AM Dai, S Tong, D Lepikhin, Y Xu, M Krikun, Y Zhou, ... International Conference on Machine Learning, 5547-5569, 2022 | 236* | 2022 |
Show your work: Scratchpads for intermediate computation with language models M Nye, AJ Andreassen, G Gur-Ari, H Michalewski, J Austin, D Bieber, ... ICLR 2022 Workshop DL4C, 2021 | 214 | 2021 |
Scaling Up Models and Data with t5x and seqio A Roberts, HW Chung, A Levskaya, G Mishra, J Bradbury, D Andor, ... arXiv preprint arXiv:2203.17189, 2022 | 64 | 2022 |
A framework for unsupervised spam detection in social networking sites M Bosma, E Meij, W Weerkamp European Conference on Information Retrieval, 364-375, 2012 | 52 | 2012 |
System and method for automatically selecting images to accompany text M Heyward, M Bosma, S Brotherton, C DePue III, MEG Contreras, ... US Patent 9,075,812, 2015 | 6 | 2015 |
Using Chains of Thought to Prompt Machine-Learned Models Pre-Trained on Diversified Objectives JW Wei, D Zhou, X Wang, DE Schuurmans, QV Le, MP Bosma, EHH Chi, ... US Patent App. 18/160,776, 2023 | | 2023 |
Performing machine learning tasks using instruction-tuned neural networks JW Wei, MP Bosma, Y Zhao, K Gu, QV Le US Patent App. 17/561,581, 2023 | | 2023 |
Inflection-1 Inflection https://inflection.ai/assets/Inflection-1.pdf, 2023 | | 2023 |