Pixel-bert: Aligning image pixels with text by deep multi-modal transformers Z Huang, Z Zeng, B Liu, D Fu, J Fu arXiv preprint arXiv:2004.00849, 2020 | 291 | 2020 |
Seeing out of the box: End-to-end pre-training for vision-language representation learning Z Huang, Z Zeng, Y Huang, B Liu, D Fu, J Fu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 158 | 2021 |
Wsod2: Learning bottom-up and top-down objectness distillation for weakly-supervised object detection Z Zeng, B Liu, J Fu, H Chao, L Zhang Proceedings of the IEEE/CVF international conference on computer vision …, 2019 | 126 | 2019 |
Beyond narrative description: Generating poetry from images by multi-adversarial training B Liu, J Fu, MP Kato, M Yoshikawa Proceedings of the 26th ACM international conference on Multimedia, 783-791, 2018 | 72 | 2018 |
M3p: Learning universal representations via multitask multilingual multimodal pre-training M Ni, H Huang, L Su, E Cui, T Bharti, L Wang, D Zhang, N Duan Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021 | 70 | 2021 |
Probing inter-modality: Visual parsing with self-attention for vision-and-language pre-training H Xue, Y Huang, B Liu, H Peng, J Fu, H Li, J Luo Advances in Neural Information Processing Systems 34, 4514-4528, 2021 | 57 | 2021 |
Unifying multimodal transformer for bi-directional image and text generation Y Huang, H Xue, B Liu, Y Lu Proceedings of the 29th ACM International Conference on Multimedia, 1138-1147, 2021 | 34 | 2021 |
Advancing high-resolution video-language representation with large-scale video transcriptions H Xue, T Hang, Y Zeng, Y Sun, B Liu, H Yang, J Fu, B Guo Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 31 | 2022 |
CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Representation Alignment H Xue, Y Sun, B Liu, J Fu, R Song, H Li, J Luo arXiv preprint arXiv:2209.06430, 2022 | 26 | 2022 |
Neural storyboard artist: Visualizing stories with coherent image sequences S Chen, B Liu, J Fu, R Song, Q Jin, P Lin, X Qi, C Wang, J Zhou Proceedings of the 27th ACM International Conference on Multimedia, 2236-2244, 2019 | 22 | 2019 |
Aesthetic-aware image style transfer Z Hu, J Jia, B Liu, Y Bu, J Fu Proceedings of the 28th ACM International Conference on Multimedia, 3320-3329, 2020 | 21 | 2020 |
Heat transfer related to gas hydrate formation/dissociation B Liu, W Pang, B Peng, C Sun, G Chen Developments in Heat Transfer, 477-502, 2011 | 19 | 2011 |
Searching the search space of vision transformer M Chen, K Wu, B Ni, H Peng, B Liu, J Fu, H Chao, H Ling Advances in Neural Information Processing Systems 34, 8714-8726, 2021 | 18 | 2021 |
Smp challenge: An overview of social media prediction challenge 2019 B Wu, WH Cheng, P Liu, B Liu, Z Zeng, J Luo Proceedings of the 27th ACM International Conference on Multimedia, 2667-2671, 2019 | 18 | 2019 |
Reference-based defect detection network Z Zeng, B Liu, J Fu, H Chao IEEE Transactions on Image Processing 30, 6637-6647, 2021 | 17 | 2021 |
Emotion reinforced visual storytelling N Li, B Liu, Z Han, YS Liu, J Fu Proceedings of the 2019 on International Conference on Multimedia Retrieval …, 2019 | 15 | 2019 |
Activitynet 2019 task 3: Exploring contexts for dense captioning events in videos S Chen, Y Song, Y Zhao, Q Jin, Z Zeng, B Liu, J Fu, A Hauptmann arXiv preprint arXiv:1907.05092, 2019 | 11 | 2019 |
Learning rich image region representation for visual question answering B Liu, Z Huang, Z Zeng, Z Chen, J Fu arXiv preprint arXiv:1910.13077, 2019 | 9 | 2019 |
Long-Form Video-Language Pre-Training with Multimodal Temporal Contrastive Learning Y Sun, H Xue, R Song, B Liu, H Yang, J Fu arXiv preprint arXiv:2210.06031, 2022 | 8 | 2022 |
Learning fine-grained motion embedding for landscape animation H Xue, B Liu, H Yang, J Fu, H Li, J Luo Proceedings of the 29th ACM International Conference on Multimedia, 291-299, 2021 | 4 | 2021 |