Zongjian Li's Space

[Google Scholar] [Github] [Email]

My Projects

A progressive adversarial distillation method based on Wan-T2V-14B, achieving high-quality video diffusion generation in 4 steps. Utilized techniques such as context parallel and FSDP2 to achieve adversarial distillation of a 14B model at 720P resolution and 81 frames.
A distributed computing course from MIT, covering distributed systems, distributed algorithms, and distributed computing.

My Papers

* means equal contribution.
[CVPR 2025] WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model
Zongjian Li*, Bin Lin*, Yang Ye, Liuhan Chen, Xinhua Cheng, Shenghai Yuan, Li Yuan
[AAAI 2025] AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in Text-to-Image Generation
Jingkun An*, Yinghao Zhu*, Zongjian Li* , Enshen Zhou, Haoran Feng, Xijie Huang, Bohua Chen, Yemin Shi, Chengwei Pan
[ICME 2025] OD-VAE: An Omni-dimensional Video Compressor for Improving Latent Video Diffusion Model
Liuhan Chen*, Zongjian Li*, Bin Lin, Bin Zhu, Qian Wang, Shenghai Yuan, Xing Zhou, Xinhua Cheng, Li Yuan
UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation
Bin Lin, Zongjian Li, Xinhua Cheng, Yuwei Niu, Yang Ye, Xianyi He, Shenghai Yuan, Wangbo Yu, Shaodong Wang, Yunyang Ge, Yatian Pang, Li Yuan
ImgEdit: A Unified Image Editing Dataset and Benchmark
Yang Ye*, Xianyi He*, Zongjian Li*, Bin Lin*, Shenghai Yuan*, Zhiyuan Yan*, Bohan Hou, Li Yuan
Open-Sora Plan: Open-Source Large Video Generation Model
Bin Lin, Yunyang Ge, Xinhua Cheng, Zongjian Li, Bin Zhu, Shaodong Wang, Xianyi He, Yang Ye, Shenghai Yuan, Liuhan Chen, Tanghui Jia, Junwu Zhang, Zhenyu Tang, Yatian Pang, Bin She, Cen Yan, Zhiheng Hu, Xiaoyi Dong, Lin Chen, Zhang Pan, Xing Zhou, Shaoling Dong, Yonghong Tian, Li Yuan
Chatlaw: A Multi-Agent Collaborative Legal Assistant with Knowledge Graph Enhanced Mixture-of-Experts Large Language Model
Jiaxi Cui*, Munan Ning*, Zongjian Li*, Bohua Chen, Yang Yan, Hao Li, Bin Ling, Yonghong Tian, Li Yuan

Articles

【矩阵】奇异值分解 07月24日13时
【概率】概率密度传输 06月01日00时
【概率】Z=XY的PDF 05月31日01时
【k8s】安装的困难汇总和解决方法 05月24日17时
【Transformer#1】各种位置编码 05月17日14时
【短记】大模型中Normalize方式的影响 12月24日14时
【短记】随着batch size增大如何调整学习率 11月27日22时