Listening with Attention: Entropy-Guided Explainability for Transformer-Based Audio Models
基于注意力的听觉:面向Transformer音频模型的熵引导可解释性
发表机构 * Florida International University(佛罗里达国际大学) ; University of South Florida(南佛罗里达大学)
AI总结 提出LEAF-X框架,通过熵引导注意力加权、多层注意力展开和因果消融,为Transformer语音识别模型生成稀疏的帧级归因,提升忠实度32%、局部性/稀疏性35-39%。
Comments 17 pages, 3 figures, and 9 tables. Accepted in Interspeech 2026 conference