Latency-Configurable Streaming Speech Enhancement via Asymmetric Temporal Padding
通过非对称时间填充实现延迟可配置的流式语音增强
发表机构 * Department of Electrical Engineering, Pohang University of Science and Technology (POSTECH)(电气工程系,浦项科技大学) ; Intus Co. Ltd.(Intus有限公司)
AI总结 提出LaCo-SENet,通过非对称时间填充和双缓冲流式机制,在单一超参数下实现延迟与质量的灵活权衡,在VoiceBank+DEMAND上以1.37M参数获得12.5-75.0ms延迟范围,PESQ从3.35到3.43。
Comments 5 pages, 3 figures. Accepted for presentation at Interspeech 2026