Mm21 pre-training for video understanding challenge: Video captioning with pretraining techniques S Chen, X Zhu, D Hao, W Liu, J Liu, Z Zhao, L Guo, J Liu Proceedings of the 29th ACM International Conference on Multimedia, 4853-4857, 2021 | 5 | 2021 |
ED-T2V: An Efficient Training Framework for Diffusion-based Text-to-Video Generation J Liu, W Wang, W Liu, Q He, J Liu 2023 International Joint Conference on Neural Networks (IJCNN), 1-8, 2023 | 4 | 2023 |
Sounding video generator: A unified framework for text-guided sounding video generation J Liu, W Wang, S Chen, X Zhu, J Liu IEEE Transactions on Multimedia, 2023 | 3 | 2023 |
MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation W Wang, J Liu, Z Lin, J Yan, S Chen, C Low, T Hoang, J Wu, JH Liew, ... arXiv preprint arXiv:2401.04468, 2024 | 2 | 2024 |
DreamTuner: Single Image is Enough for Subject-Driven Generation M Hua, J Liu, F Ding, W Liu, J Wu, Q He arXiv preprint arXiv:2312.13691, 2023 | 2 | 2023 |
WL-MSR: Watch and Listen for Multimodal Subtitle Recognition J Liu, H Wang, W Wang, X He, J Liu ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 1 | 2023 |
DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations T Qi, S Fang, Y Wu, H Xie, J Liu, L Chen, Q He, Y Zhang arXiv preprint arXiv:2403.06951, 2024 | | 2024 |