2606.07015
2026-06-08
cs.SD
cs.AI
新提交
Towards Unified Song Generation and Singing Voice Conversion with Accompaniment Co-Generation
面向统一歌曲生成与带伴奏共生成的歌声转换
Ziyu Zhang, Chunyu Qiang, Xiaopeng Wang, Yuxin Guo, Kang Yin, Wenjie Tian, Jingbin Hu, Tianlun Zuo, Zhao Guo, Teng Ma, Yuzhe Liang, Chen Zhang, Lei Xie
发表机构
*
Northwestern Polytechnical University(西北工业大学)
;
Kuaishou Technology(快手科技)
;
Beijing Institute of Technology(北京理工大学)
;
Institute of Automation, Chinese Academy of Sciences(中国科学院自动化研究所)
;
University of Science and Technology of China(中国科学技术大学)
;
Shanghai Jiao Tong University(上海交通大学)
AI总结
提出UniSinger框架,基于多模态扩散Transformer统一零样本歌曲生成与伴奏共生成歌声转换,通过共享说话人嵌入和课程学习策略实现跨任务音色控制与多任务优化。