2606.09295
2026-06-09
cs.CL
新提交
NüshuVoice: Reviving the Voice of Endangered Nüshu with Pitch-Aware Text-to-Speech
NüshuVoice:利用音高感知文本到语音技术复兴濒危女书的声音
Hongkun Yang, Xinhui Yi, Xiyan Zhao, Yibo Meng, Lionel Z. Wang, Lixu Wang, Yaqi Zhang, Ruiqi Chen, Xuanyue Zhao, Lanxin Zhang, Yu Zeng, Weijia Chu, Yiming Ma, Chenyu Liu, Jianghao Lin, Xin Xu
发表机构
*
Ocean University of China(中国海洋大学)
;
The Hong Kong Polytechnic University(香港理工大学)
;
Cornell University(康奈尔大学)
;
Nanyang Technological University(南洋理工大学)
;
Shanghai Jiao Tong University(上海交通大学)
;
University of Michigan–Ann Arbor(密歇根大学安娜堡分校)
;
University of Science and Technology of China(中国科学技术大学)
;
Harbin Institute of Technology(哈尔滨工业大学)
AI总结
针对女书语音数据稀缺问题,提出NüshuVoice基准和F0条件VITS框架Nüshu-PitchVITS,利用五级音高标注作为韵律先验,在频谱保真度、音高重建和可懂度上优于强基线。