Jiasen Lu

Cited by

	All	Since 2019
Citations	16905	14975
h-index	25	25
i10-index	30	28

4000

2000

1000

3000

201520162017201820192020202120222023202453 200 535 1024 1534 2046 2846 3429 3917 1196

Public access

View all

10 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Devi ParikhPreviously: FAIR and GenAI @ Meta. Georgia TechVerified email at gatech.edu
Dhruv BatraFAIR (Meta AI) | Georgia TechVerified email at gatech.edu
Stefan LeeAssistant Professor, Oregon State UniversityVerified email at oregonstate.edu
Jianwei YangPrincipal Researcher, Microsoft Research, RedmondVerified email at microsoft.com
Caiming XiongSalesforce ResearchVerified email at salesforce.com
Stanislaw AntolAutonomous Vehicles Software Engineer, Mercedes-Benz R&DVerified email at vt.edu
Richard Socheryou.comVerified email at stanford.edu
Aniruddha KembhaviSenior Director of Computer Vision, Allen Institute of Artificial IntelligenceVerified email at allenai.org
Roozbeh MottaghiFAIR, MetaVerified email at cs.stanford.edu
Rowan ZellersOpenAIVerified email at cs.washington.edu
Marcus RohrbachProfessor for Multimodal Reliable AI, TU Darmstadt, GermanyVerified email at tu-darmstadt.de
Vedanuj GoswamiResearch Engineer, Meta AIVerified email at meta.com
Adam FischPh.D. student, Massachusetts Institute of TechnologyVerified email at mit.edu
Antoine BordesHelsingVerified email at helsing.ai
Chih-Yao MaStaff Research Scientist @ GenAI, MetaVerified email at meta.com
Zuxuan WuFudan UniversityVerified email at fudan.edu.cn
Christopher ClarkAllen Institute for AIVerified email at allenai.org
Peng GaoShanghai AI LabVerified email at pjlab.org.cn
Yejin ChoiUniversity of Washington / Allen Institute for Artificial IntelligenceVerified email at cs.washington.edu
Jack Hesselsamaya.aiVerified email at samaya.ai

Jiasen Lu

Senior Research Scientist, Allen Institute of Artificial Intelligence

Verified email at allenai.org - Homepage

Computer Vision Natural Language Processing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Vqa: Visual question answering A Agrawal, J Lu, S Antol*, M Mitchell, CL Zitnick, D Parikh, D Batra International Journal of Computer Vision 123 (1), 4-31, 2017	5710*	2017
Vqa: Visual question answering S Antol, A Agrawal, J Lu, M Mitchell, D Batra, C Lawrence Zitnick, ... Proceedings of the IEEE International Conference on Computer Vision, 2425-2433, 2015	5703	2015
Vilbert: Pretraining task-agnostic visiolinguistic representations for vision-and-language tasks J Lu, D Batra, D Parikh, S Lee Advances in neural information processing systems, 2019	3326	2019
Hierarchical question-image co-attention for visual question answering J Lu, J Yang, D Batra, D Parikh Advances in neural information processing systems 29, 2016	1907	2016
Knowing when to look: Adaptive attention via a visual sentinel for image captioning J Lu, C Xiong, D Parikh, R Socher Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2017	1722	2017
Graph R-CNN for Scene Graph Generation J Yang, J Lu, S Lee, D Batra, D Parikh arXiv preprint arXiv:1808.00191, 2018	911	2018
Neural Baby Talk J Lu, J Yang, D Batra, D Parikh In Proceedings of the IEEE conference on computer vision and pattern …, 2018	533	2018
12-in-1: Multi-Task Vision and Language Representation Learning J Lu, V Goswami, M Rohrbach, D Parikh, S Lee Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2019	500	2019
Parlai: A dialog research software platform AH Miller, W Feng, A Fisch, J Lu, D Batra, A Bordes, D Parikh, J Weston arXiv preprint arXiv:1705.06476, 2017	424	2017
Self-monitoring navigation agent via auxiliary progress estimation CY Ma, J Lu, Z Wu, G AlRegib, Z Kira, R Socher, C Xiong arXiv preprint arXiv:1901.03035, 2019	271	2019
Unified-IO: A unified model for vision, language, and multi-modal tasks J Lu, C Clark, R Zellers, R Mottaghi, A Kembhavi arXiv preprint arXiv:2206.08916, 2022	261	2022
Merlot reserve: Neural script knowledge through vision and language and sound R Zellers, J Lu, X Lu, Y Yu, Y Zhao, M Salehi, A Kusupati, J Hessel, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	195	2022
Best of both worlds: Transferring knowledge from discriminative learning to a generative visual dialog model J Lu, A Kannan, J Yang, D Parikh, D Batra Advances in Neural Information Processing Systems 30, 2017	142	2017
Sentinel gate for modulating auxiliary information in a long short-term memory (lstm) neural network LU Jiasen, C Xiong, R Socher US Patent 10,565,306, 2020	136	2020
A Faster Pytorch Implementation of Faster R-CNN J Yang, J Lu, D Batra, D Parikh https://github.com/jwyang/faster-rcnn.pytorch, 2018	107	2018
Multi-modal answer validation for knowledge-based vqa J Wu, J Lu, A Sabharwal, R Mottaghi Proceedings of the AAAI conference on artificial intelligence 36 (3), 2712-2721, 2022	102	2022
X-lxmert: Paint, caption and answer questions with multi-modal transformers J Cho, J Lu, D Schwenk, H Hajishirzi, A Kembhavi arXiv preprint arXiv:2009.11278, 2020	102	2020
Spatially aware multimodal transformers for textvqa Y Kant, D Batra, P Anderson, A Schwing, D Parikh, J Lu, H Agrawal Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020	88	2020
Deeper lstm and normalized cnn visual question answering model J Lu, X Lin, D Batra, D Parikh GitHub repository 6, 2015	80	2015
Human action segmentation with hierarchical supervoxel consistency J Lu, R Xu, JJ Corso Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2015	71	2015

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors