Robust Low-Rank Sparse Framework for Video-Based Affective Computing
基于视频的情感计算的鲁棒低秩稀疏框架
Feng-Qi Cui, Jinyang Huang, Sirui Zhao, Xinyu Li, Xin Yan, Ziyu Jia, Xiaokang Zhou
AI总结 提出低秩稀疏情感理解框架(LSEF),通过层次化低秩稀疏分解将情感动态分解为情感基和瞬态波动,并采用秩感知优化策略提升鲁棒性和动态判别能力。
详情
基于视频的情感计算(VAC)对于情感分析和人机交互至关重要,但由于复杂的情感动态,存在模型不稳定和表示退化的问题。由于不同情感波动的含义在不同情感背景下可能不同,核心限制在于缺乏一种层次结构机制来分离不同的情感成分,即情感基(长期情感基调)和瞬态波动(短期情感波动)。为解决这一问题,我们提出了低秩稀疏情感理解框架(LSEF),这是一个基于低秩稀疏原理的统一模型,从理论上将情感动态重新定义为层次化的低秩稀疏组合过程。LSEF采用三个即插即用模块:稳定性编码模块(SEM)捕获低秩情感基;动态解耦模块(DDM)分离稀疏瞬态信号;一致性整合模块(CIM)重构多尺度稳定性和反应性一致性。该框架通过秩感知优化(RAO)策略进行优化,该策略自适应地平衡梯度平滑性和敏感性。跨多个数据集的大量实验证实,LSEF显著增强了鲁棒性和动态判别能力,进一步验证了层次化低秩稀疏建模对于理解情感动态的有效性和通用性。
Video-based Affective Computing (VAC), vital for emotion analysis and human-computer interaction, suffers from model instability and representational degradation due to complex emotional dynamics. Since the meaning of different emotional fluctuations may differ under different emotional contexts, the core limitation is the lack of a hierarchical structural mechanism to disentangle distinct affective components, i.e., emotional bases (the long-term emotional tone), and transient fluctuations (the short-term emotional fluctuations). To address this, we propose the Low-Rank Sparse Emotion Understanding Framework (LSEF), a unified model grounded in the Low-Rank Sparse Principle, which theoretically reframes affective dynamics as a hierarchical low-rank sparse compositional process. LSEF employs three plug-and-play modules, i.e., the Stability Encoding Module (SEM) captures low-rank emotional bases; the Dynamic Decoupling Module (DDM) isolates sparse transient signals; and the Consistency Integration Module (CIM) reconstructs multi-scale stability and reactivity coherence. This framework is optimized by a Rank Aware Optimization (RAO) strategy that adaptively balances gradient smoothness and sensitivity. Extensive experiments across multiple datasets confirm that LSEF significantly enhances robustness and dynamic discrimination, which further validates the effectiveness and generality of hierarchical low-rank sparse modeling for understanding affective dynamics.