2510.07355
2026-05-29
cs.MM
cs.SD
版本更新
AV-EMO-Reasoning: Benchmarking Emotional Reasoning Capabilities in Omni-modal LLMS with Audio-visual Cues
AV-EMO-Reasoning: 在具有视听线索的全模态大语言模型中基准测试情感推理能力
Dingkun Zhou, Krish Patel, Ajay Kankipati, Akshaj Gupta, Zeyi Austin Li, Mohul Shukla, Vibhor Narang, Sara Kofman, Zongli Ye, Grace Wang, Xiaoyu Shi, Tingle Li, Guan-Ting Lin, Kan Jen Cheng, Huang-Cheng Chou, Jiachen Lian, Gopala Anumanchipalli
发表机构
*
UC Berkeley(加州大学伯克利分校)
;
South China University of Technology(华南理工大学)
;
Zhejiang University(浙江大学)
;
National Taiwan University(台湾大学)
;
University of Southern California(美国南加州大学)
AI总结
提出AV-EMO-Reasoning基准,通过合成和真实世界的视听对话数据集及情感感知与交互推理指标,系统评估全模态大语言模型的情感推理能力。