VATT: Transformers for multimodal self-supervised learning from raw video, audio and text H Akbari, L Yuan, R Qian, WH Chuang, SF Chang, Y Cui, B Gong Advances in Neural Information Processing Systems 34, 24206-24221, 2021 | 654 | 2021 |
Unsupervised event-based learning of optical flow, depth, and egomotion AZ Zhu, L Yuan, K Chaney, K Daniilidis Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019 | 573 | 2019 |
EV-FlowNet: Self-supervised optical flow estimation for event-based cameras AZ Zhu, L Yuan, K Chaney, K Daniilidis arXiv preprint arXiv:1802.06898, 2018 | 499 | 2018 |
MoViNets: Mobile video networks for efficient video recognition D Kondratyuk, L Yuan, Y Li, L Zhang, M Tan, M Brown, B Gong Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 280 | 2021 |
Surrogate gap minimization improves sharpness-aware training J Zhuang, B Gong, L Yuan, Y Cui, H Adam, N Dvornek, S Tatikonda, ... arXiv preprint arXiv:2203.08065, 2022 | 155 | 2022 |
Unsupervised event-based optical flow using motion compensation AZ Zhu, L Yuan, K Chaney, K Daniilidis Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2018 | 90 | 2018 |
DeepLab2: A TensorFlow library for deep labeling M Weber, H Wang, S Qiao, J Xie, MD Collins, Y Zhu, L Yuan, D Kim, Q Yu, ... arXiv preprint arXiv:2106.09748, 2021 | 63 | 2021 |
Human gaze-driven spatial tasking of an autonomous MAV L Yuan, C Reardon, G Warnell, G Loianno IEEE Robotics and Automation Letters 4 (2), 1343-1350, 2019 | 51 | 2019 |
Learning view-disentangled human pose representation by contrastive cross-view mutual information maximization L Zhao, Y Wang, J Zhao, L Yuan, JJ Sun, F Schroff, H Adam, X Peng, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 36 | 2021 |
Zoom-in-to-check: Boosting video interpolation via instance-level discrimination L Yuan, Y Chen, H Liu, T Kong, J Shi Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019 | 32 | 2019 |
Contextualized spatio-temporal contrastive learning with self-supervision L Yuan, R Qian, Y Cui, B Gong, F Schroff, MH Yang, H Adam, T Liu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 26 | 2022 |
View-invariant, occlusion-robust probabilistic embedding for human pose T Liu, JJ Sun, L Zhao, J Zhao, L Yuan, Y Wang, LC Chen, F Schroff, ... International Journal of Computer Vision 130 (1), 111-135, 2022 | 19 | 2022 |
VideoPrism: A foundational visual encoder for video understanding L Zhao, NB Gundavarapu, L Yuan, H Zhou, S Yan, JJ Sun, L Friedman, ... arXiv preprint arXiv:2402.13217, 2024 | 15 | 2024 |
Unified visual relationship detection with vision and language models L Zhao, L Yuan, B Gong, Y Cui, F Schroff, MH Yang, H Adam, T Liu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 14 | 2023 |
VideoGLUE: Video general understanding evaluation of foundation models L Yuan, NB Gundavarapu, L Zhao, H Zhou, Y Cui, L Jiang, X Yang, M Jia, ... arXiv preprint arXiv:2307.03166, 2023 | 12 | 2023 |
Distilling vision-language models on millions of videos Y Zhao, L Zhao, X Zhou, J Wu, CT Chu, H Miao, F Schroff, H Adam, T Liu, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 11 | 2024 |
PolyMaX: General dense prediction with mask transformer X Yang, L Yuan, K Wilber, A Sharma, X Gu, S Qiao, S Debats, H Wang, ... Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024 | 9 | 2024 |
Learning from semantic alignment between unpaired multiviews for egocentric video recognition Q Wang, L Zhao, L Yuan, T Liu, X Peng Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 9 | 2023 |
Exploring temporal granularity in self-supervised video representation learning R Qian, Y Li, L Yuan, B Gong, T Liu, M Brown, S Belongie, MH Yang, ... arXiv preprint arXiv:2112.04480, 2021 | 8 | 2021 |
Spatiotemporally discriminative video-language pre-training with text grounding Y Xiong, L Zhao, B Gong, MH Yang, F Schroff, T Liu, CJ Hsieh, L Yuan arXiv preprint arXiv:2303.16341, 2023 | 4 | 2023 |