Lumos-Nexus: Efficient Frequency Bridging with Homogeneous Latent Space for Video Unified Models
Lumos-Nexus: 面向视频统一模型的高效频率桥接与同质潜在空间
发表机构 * Zhejiang University(浙江大学) ; DAMO Academy, Alibaba Group(阿里云达摩院) ; Hupan Lab(虎扑实验室) ; National University of Singapore(新加坡国立大学) ; Hong Kong University of Science and Technology(香港科技大学) ; Fudan University(复旦大学) ; Tsinghua University(清华大学)
AI总结 提出Lumos-Nexus框架,通过两阶段训练和渐进频率桥接,在保持推理能力的同时显著提升视频生成保真度。
Comments Project page (https://jiazheng-xing.github.io/nexus-lumos-home/) and Code (https://github.com/alibaba-damo-academy/Lumos-Custom/) are available