Natural Language Descriptions of Deep Visual Features E Hernandez, S Schwettmann, D Bau, T Bagashvili, A Torralba, J Andreas International Conference on Learning Representations, 2021 | 111 | 2021 |
Do we still need clinical language models? E Lehman, E Hernandez, D Mahajan, J Wulff, MJ Smith, Z Ziegler, ... Conference on health, inference, and learning, 578-597, 2023 | 105 | 2023 |
Inspecting and editing knowledge representations in language models E Hernandez, BZ Li, J Andreas arXiv preprint arXiv:2304.00740, 2023 | 44 | 2023 |
Linearity of relation decoding in transformer language models E Hernandez, AS Sharma, T Haklay, K Meng, M Wattenberg, J Andreas, ... arXiv preprint arXiv:2308.09124, 2023 | 40 | 2023 |
Measuring and manipulating knowledge representations in language models E Hernandez, BZ Li, J Andreas arXiv preprint arXiv:2304.00740, 2023 | 40 | 2023 |
The low-dimensional linear geometry of contextualized word representations E Hernandez, J Andreas arXiv preprint arXiv:2105.07109, 2021 | 34 | 2021 |
Toward a visual concept vocabulary for gan latent space S Schwettmann, E Hernandez, D Bau, S Klein, J Andreas, A Torralba Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 15 | 2021 |
A multimodal automated interpretability agent TR Shaham, S Schwettmann, F Wang, A Rajaram, E Hernandez, ... Forty-first International Conference on Machine Learning, 2024 | 6 | 2024 |
Do we still need clinical language models? 2023 E Lehman, E Hernandez, D Mahajan, J Wulff, MJ Smith, Z Ziegler, ... URL https://arxiv. org/abs/2302.08091, 0 | 5 | |
Program Synthesis from Visual Specification E Hernandez, A Vartanian, X Zhu arXiv preprint arXiv:1806.00938, 2018 | 1 | 2018 |
A Multimodal Automated Interpretability Agent T Rott Shaham, S Schwettmann, F Wang, A Rajaram, E Hernandez, ... arXiv e-prints, arXiv: 2404.14394, 2024 | | 2024 |
Automated Interpretation of Machine Learning Models E Hernandez Massachusetts Institute of Technology, 2024 | | 2024 |