An Empirical Analysis of Task-Induced Encoder Bias in Fréchet Audio Distance
Affiliation * Dept. of Computer Science and Engineering, Sogang University, South Korea
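AI summary By decomposing the evaluation metric into recall, precision, and alignment (along semantic and structural dimensions), the paper analyzes task-induced bias in FAD across six encoders. It finds that encoders trained for reconstruction, ASR, and classification each have distinct strengths and weaknesses, and argues that evaluation-native encoders need to be developed.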
Comments Accepted to Interspeech 2026. Source code and evaluation pipeline are available at: https://github.com/wonwoo-jeong/fad-encoder-bias