arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 1955
2605.22462 2026-05-22 cs.CL cs.AI

From Correlation to Cause: A Five-Stage Methodology for Feature Analysis in Transformer Language Models

从相关性到因果:一种五阶段方法用于Transformer语言模型中的特征分析

Caleb Munigety

AI总结 本文提出了一种五阶段方法用于Transformer语言模型中的因果特征分析,并在GPT-2小型模型上端到端地展示了其在间接宾语识别任务中的应用,通过激活补丁恢复经典IOI电路,稀疏自编码器恢复特定名称的特征,因果验证发现这些特征具有特定但部分因果性,鲁棒性测试揭示了检测鲁棒性与因果鲁棒性之间的差距,部署评估显示了最优监控配置带来的成本节省。

详情
AI中文摘要

我们提出了一种五阶段方法用于Transformer语言模型中的因果特征分析(探针设计、特征提取、因果验证、鲁棒性测试和部署集成),并在GPT-2小型模型上端到端地执行了间接宾语识别(IOI)任务。激活补丁恢复了经典的IOI电路(第9层头9单独恢复+1.02)。稀疏自编码器恢复了每名称选择性特征,其效果大小为30到50个激活单元。因果验证发现这些特征具有特定但部分因果性:删除十五个特征后,模型在98%的提示上仍保持准确。两种受NLA启发的评估强化了这一观点:十五个选择性特征仅解释了激活方差的31%,而SAE的解释为99.7%,选择性比率与因果力呈负相关(r = -0.56)。三种分布偏移下的鲁棒性测试发现,电路能够顺利转移,但特征消融效果显著下降,揭示了检测鲁棒性与因果鲁棒性之间的差距。基于成本的部署评估(假设$50/FN,$0.42/FP,2%错误率)发现最优监控配置可使每1000次查询的成本降至$8.96,相比$1000的基准,节省了99.1%。最优组合策略随成本比和基础率变化。各阶段的结合产生了单一阶段无法产生的发现。

英文摘要

We propose a five-stage methodology for causal feature analysis in transformer language models (probe design, feature extraction, causal validation, robustness testing, and deployment integration) and demonstrate it end-to-end on GPT-2 small performing the Indirect Object Identification (IOI) task. Activation patching recovers the canonical IOI circuit (layer-9 head 9 alone gives recovery +1.02). A sparse autoencoder recovers per-name selective features with effect sizes of 30 to 50 activation units. Causal validation finds these features specifically but only partially causal: ablating fifteen of them leaves the model accurate on 98% of prompts. Two NLA-inspired evaluations strengthen this picture: the fifteen selective features explain only 31% of activation variance versus the SAE's 99.7%, and selectivity ratio anticorrelates with causal force (r = -0.56). Robustness testing under three distribution shifts finds that the circuit transfers cleanly but feature ablation effects degrade substantially, exposing a gap between detection robustness and causal robustness. A cost-based deployment evaluation (assumed $50/FN, $0.42/FP, 2% error rate) finds an optimal monitor configuration yielding $8.96 per 1000 queries against a $1000 baseline, a 99.1% saving. Optimal composition strategy varies with cost ratio and base rate. The conjunction of stages produces findings no single stage would.

2605.22457 2026-05-22 cs.AI cs.SY eess.SY

KAPPS: A knowledge-based CPPS Architecture for the Circular Factory

KAPPS:一种基于知识的闭环工厂CPPS架构

Etienne Hoffmann, Jan-Felix Klein, Sören Weindel, Max Goebels, Sebastian Behrendt, Daniel Hernández, Ratan Bahadur Thapa, Jürgen Fleischer, Kai Furmans, Steffen Staab

AI总结 本文提出KAPPS,一种基于知识的闭环工厂CPPS架构,旨在解决闭环制造中产品状态变化、动态重构过程和人机知识整合的需求,通过知识图谱和语义接口层实现数据集成与推理,提升制造系统的灵活性和适应性。

详情
Comments
Submitted to Journal of Manufacturing Systems (JMS)
AI中文摘要

尽管线性制造依赖于同质材料和预定义的过程序列,但闭环制造重新引入了具有异质和不确定条件的使用产品。这种转变要求制造系统能够处理可变的产品状态、动态可重构的过程以及人机知识的整合。传统制造IT架构,设计用于稳定结构和确定性执行,无法满足这些需求,因为它们无法充分表示和管理运行时单个组件的唯一性。遵循设计科学方法,为闭环制造设计CPPS,我们从五个互补的视角中推导出14个需求。基于这些需求,我们设计了KAPPS,一种基于知识的架构,利用以本体为基础的知识图谱作为统一的数据骨干,结合语义接口层,实现跨异构系统和服务的一致数据和信息集成、推理和通信,使知识图谱从集成层转变为工厂的权威写时状态。KAPPS集成了约束执行和事件驱动规划模块,使在不确定性和人机知识交换下执行计划能够逐步适应。通过两个实施用例验证了KAPPS的适用性:(i) 通过知识图谱中介服务进行异常检测和学习;(ii) 在模块化输送系统中运行时约束执行。随后,该架构被评估以满足14个需求(摘要已缩短)

英文摘要

While linear manufacturing relies on homogeneous materials and predefined process sequences, circular manufacturing reintroduces used products with heterogeneous and uncertain conditions. This shift demands manufacturing systems capable of handling variable product states, dynamically reconfigurable processes, and the integration of human and machine knowledge. Conventional manufacturing IT architectures, designed for stable structures and deterministic execution, are unable to meet these requirements, as they cannot adequately represent and manage the uniqueness of individual components at runtime. Following a design science methodology for developing a Cyber Physical Production System for circular manufacturing, we derive 14 requirements from five complementary perspectives. Based on these requirements, we design KAPPS, a knowledge-based architecture that uses an ontology-grounded knowledge graph as a unifying data backbone, combined with a semantic interface layer to enable consistent data and information integration, reasoning, and communication across heterogeneous systems and services, turning the knowledge graph from an integration layer into the factories authoritative write-time state. KAPPS incorporates modules for constraint enforcement and event-driven planning, enabling incremental adaptation of execution plans under uncertainty and human-machine knowledge exchange. The applicability of KAPPS is demonstrated through two implemented use cases: (i) Anomaly detection and learning through knowledge graph mediated services and (ii) runtime constraint enforcement in a modular conveyor system. Subsequently, the architecture is evaluated against the 14 requirements (ed. abstract shortened)

2605.22456 2026-05-22 cs.RO cs.AI

Steins;Gate Drive: Semantic Safety Arbitration over Structured Futures for Latency-Decoupled LLM Planning

Steins;Gate Drive: 基于结构化未来语义安全仲裁的延迟解耦LLM规划

Anjie Qiu, Hans D. Schotten

AI总结 本文提出SteinsGateDrive架构,通过延迟解耦规划与运行时架构,在保持安全边界的同时,将有效延迟从+3.07秒减少到-0.01秒,提升了自动驾驶的规划效率。

详情
Comments
10 pages, 2 figures, 5 tables, submitted to IEEE transaction of intelligent vehicles
AI中文摘要

云托管的LLM驱动代理提供有用的语义判断,但其推理延迟超过了分步车辆控制窗口。学习的世界模型预测未来,但通常将未来生成和动作选择保留在大型耦合循环中。我们提出了SteinsGateDrive,一种延迟解耦的规划-运行时架构,其中世界线隐喻来自同名故事,指出了干预的一个可能后果:LLM在最终控制时刻之前选择反事实驾驶未来,而运行时仅在安全合同有效时重用所选预测。生成器构建了三种世界线角色:alpha名义性自我条件未来、beta交互反事实(围绕附近车辆)以及gamma危险压力未来(如刹车、变道或被阻塞的走廊)。所选分支成为具有时间范围、有效/中止条件、回退和授权的类型化战略预测。在10个种子和20步的内受试匹配种子正常-高速公路协议中,GPT-5.4 mini在1秒时间范围将有效延迟从+3.07秒减少到4秒时间范围的-0.01秒,同时保持测量的无碰撞安全边界。该架构的安全贡献来自原子谓词运行时检查,而不是漂移分数,后者作为刷新频率的调节器。

英文摘要

Cloud-hosted LLM driver agents provide useful semantic judgments, but their inference latency exceeds stepwise vehicle-control windows. Learned world models predict futures, but they usually keep future generation and action selection inside large coupled loops. We present SteinsGateDrive, a latency-decoupled planner-runtime architecture in which the worldline metaphor from the eponymous story names one plausible consequence of an intervention: the LLM selects counterfactual driving futures before the final control instant, and a runtime reuses the selected forecast only while safety contracts remain valid. The generator builds three world-line roles: alpha nominal ego-conditioned futures, beta interaction counterfactuals around nearby vehicles, and gamma hazard-stress futures such as braking, cut-ins, or blocked corridors. The selected branch becomes a typed StrategicForecast with horizon, validity/abort conditions, fallback, and authority. On a within-subject, matched-seed normal-highway protocol with 10 seeds and 20 steps, GPT-5.4 mini reduces effective lag from +3.07 s at 1-second horizon to -0.01 s at 4-second horizon while preserving the measured no-collision safety boundary. The architecture's safety contribution comes from the atom-predicate runtime check, not from the drift score, which functions as a refresh-frequency knob.

2605.22455 2026-05-22 cs.CV cs.AI cs.LG physics.optics

Making the Discrete Continuous: Synthetic RAW Augmentations for Fine-Grained Evaluation of Person Detection Performance in Low Light

使离散的成为连续的:合成RAW增强用于细粒度评估人检测性能在低光环境

Valeria Pais, Malena Mendilaharzu, Daniele Faccio, Luis Oala, Christoph Clausen, Bruno Sanguinetti

AI总结 本文提出了一种合成RAW增强方法,用于在低光条件下更准确地评估人检测模型的性能,通过生成与相机传感器噪声模型匹配的低光样本,以改善基准测试的数据覆盖。

详情
Comments
Accepted non-archival paper at the CVPR 2026 AUTOPILOT Workshop (Autonomous Understanding Through Open-world Perception and Integrated Language Models for On-road Tasks)
AI中文摘要

人工智能视觉模型的实际应用既受到可用训练和测试数据的推动,也受到其限制。真实数据集稀疏且不均匀:长尾或不平衡分布会阻碍泛化,而低密度区域中的样本数量少使得评估困难。合成数据可以填补这些空白,提供更连续地采样输入空间的方法,提高基准测试的数据覆盖。专注于自动驾驶安全关键场景中的夜间行人检测,我们展示如何利用合成低光样本更好地表征状态-of-the-art目标检测模型的性能,作为场景光照函数的函数。我们使用合成RAW图像增强技术生成低光样本,以匹配相机传感器的噪声模型。在真实和合成低光数据上的性能指标相似,表明AI模型难以区分它们。

英文摘要

Real-world deployment of AI vision models is both fueled and limited by the data available for training and testing. Real datasets are sparse and uneven: long-tailed or unbalanced distributions hinder generalization, and the low number of samples in low density regions makes it hard to run evaluations. Synthetic data can fill these gaps, providing us with a way to sample the input space more continuously and improve data coverage for benchmarks. Focusing on the autonomous driving safety-critical case of pedestrian detection in the dark, we show how synthetic low-light samples can be used to better characterize the performance of a state-of-the-art object detection model as a function of the scene illumination. We use a synthetic RAW image augmentation technique to generate low-light samples that match the noise model of the camera sensor. Performance metrics on real and synthetic low-light data are similar, indicating that the AI model finds it hard to distinguish between them.

2605.22454 2026-05-22 cs.LG cs.AI

Don't Forget the Critic: Value-Based Data Rehearsal for Multi-Cyclic Continual Reinforcement Learning

不要忘记批评者:基于价值的多循环持续强化学习中的数据复习

Benjamin Poole, Andrew Quinn, Li Yang, Minwoo Lee

AI总结 本文提出了一种基于价值的数据复习方法,用于多循环持续强化学习,通过引入Qreg+NWLU方法改进学习效率、遗忘缓解和知识转移。

详情
AI中文摘要

数据复习已成为缓解持续强化学习(CRL)中灾难性遗忘的领先方法。然而,现有工作仍局限于策略梯度框架,仅正则化执行者,由于批评者正则化导致的性能下降。这种以执行者为中心的方法忽略了数据复习在价值函数近似中的潜力。此外,现有CRL评估很少考虑多循环环境,其中任务序列重复,这是关键的现实场景,加剧了遗忘和可塑性。我们研究了使用Q值正则化的深度Q网络在多循环设置中的数据复习,并提出Qreg+NWLU,引入了两个简单的修改:(1)连续数据复习,动态收集和更新存储的Q值在整个训练过程中;(2)“无等待”正则化,立即应用而不是在第一个任务之后。这些修改在价值函数近似设置中提高了学习效率、遗忘缓解和知识转移,优于Qreg和传统CRL方法。

英文摘要

Data rehearsal has emerged as a leading approach for mitigating catastrophic forgetting in Continual Reinforcement Learning (CRL). However, existing work remains confined to policy gradient frameworks, regularizing only actors due to the performance degradation incurred by critic regularization. This actor-centric approach overlooks the potential of data rehearsal for value function approximation. Moreover, existing evaluations in CRL rarely consider multi-cyclic environments where task sequences repeat, a critical real-world scenario that exacerbates forgetting and plasticity. We investigate data rehearsal for Deep Q-Networks using Q-value regularization in multi-cyclic settings and propose Qreg+NWLU which introduces two simple modifications: (1) continuous data rehearsal that dynamically collects and updates stored Q-values throughout training, and (2) "No-Wait" regularization that applies immediately rather than after the first task. Together, these modifications yield improvements in learning efficiency, forgetting mitigation, and knowledge transfer over Qreg and conventional CRL methods within value function approximation settings.

2605.22448 2026-05-22 cs.AI

S2ED: From Story to Executable Descriptions for Consistency-Aware Story Illustration

S2ED:从故事到可执行描述以实现一致性感知的故事插图

Sijing Yin, Jiamou Liu, Xiao Tang, Yaser Shakib, Qian Liu

AI总结 本文提出S2ED框架,通过将完整故事转换为可编辑的可执行描述,提升故事插图的一致性和角色真实性,适用于多帧故事插图的长时程一致性需求。

详情
Comments
6 pages, 5 figures. Accepted by IEEE ICME 2026
AI中文摘要

多帧故事插图需要超越单图像文本到图像生成的长时程一致性,包括叙事分解和持续的角色身份、布局和情感跨帧。我们提出了故事到可执行描述(S2ED),一种无需训练、模型无关、提示层框架,将完整故事转换为一系列显式、可编辑的可执行描述,以实现更一致的渲染。S2ED协调三个代理来分割叙事、确定标准角色属性并丰富空间和情感线索,使可解释的提示携带状态传播和局部编辑以修复漂移而无需重新训练生成器。在Flintstones和Shakoo Maku上的实验表明,S2ED在序列一致性、角色保真度方面优于强大的提示、大模型规划和参考训练方法,在自动指标和人类判断下均表现优异。我们还部署S2ED在一个端到端的故事到故事书系统中,为儿童插图故事提供补充视频。

英文摘要

Multi-frame story illustration requires long-horizon coherence beyond single-image text-to-image generation, including narrative decomposition and persistent character identity, layout, and affect across frames. We propose Story-to-Executable Descriptions (S2ED), a training-free, model-agnostic, prompt-layer framework that converts a full story into a sequence of explicit, editable executable descriptions for more consistent rendering. S2ED coordinates three agents to segment the narrative, ground canonical character attributes, and enrich spatial and affective cues, enabling interpretable prompt-carried state propagation and local edits to repair drift without retraining the generator. Experiments on Flintstones and Shakoo Maku show that S2ED improves sequence-level consistency and character fidelity over strong prompting, large-model planning, and a reference training-based method, under both automatic metrics and human judgments. We also deploy S2ED in an end-to-end story-to-storybook system for children's illustrated stories, with a supplementary video.

2605.22447 2026-05-22 cs.CL

Cohesion-6K: An Arabic Dataset for Analyzing Social Cohesion and Conflict in Online Discourse

Cohesion-6K: 一个用于分析在线讨论中社会凝聚力与冲突的阿拉伯语数据集

Aisha Ali Al-Athba, Wajdi Zaghouani

AI总结 本研究通过Cohesion-6K数据集探讨在线讨论中的社会凝聚力与冲突,采用五类话语分类揭示冲突与凝聚力的动态平衡,并通过公开资源支持未来计算社会科学、数字通信和阿拉伯语自然语言处理的研究。

详情
AI中文摘要

在线讨论研究已成为理解社会极化的核心。尽管许多研究聚焦于检测显性毒性,但社会凝聚力的微妙动态,即分裂与团结叙述之间的互动,仍 computationally underexplored (Bail, 2021; Gonzalez-Bailon and Lelkes, 2023). 本文提出了Cohesion-6K,一个包含六千条阿拉伯语公共Facebook帖子的数据集,这些帖子涉及巴勒斯坦被占领土问题。每条帖子被分配到五个话语类别之一,代表从冲突到凝聚力的连续体:冲突、解决、社区参与、支持性互动和共同价值观。注释过程结合了专家人类判断与模型辅助预标注,经培训注释者验证,实现了显著的注释者间一致性 (Cohens kappa = 0.85). 定量分析揭示了持续的参与差距,冲突导向的帖子获得的用户互动比解决导向的帖子多两到四倍 (p < 0.01). 这种模式展示了分裂性讨论如何在阿拉伯语社交媒体空间中获得不成比例的可见性。Cohesion-6K提供了一个透明且可重复的资源,用于研究在线凝聚力和极化。该数据集、注释指南和预处理代码将通过开放许可发布,以支持未来计算社会科学、数字通信和阿拉伯语自然语言处理的研究。

英文摘要

The study of online discourse has become central to understanding societal polarization. While much research has focused on detecting overt toxicity, the subtle dynamics of social cohesion, meaning the interaction between divisive and unifying narratives, remain computationally underexplored (Bail, 2021; Gonzalez-Bailon and Lelkes, 2023). This paper presents Cohesion-6K, a manually and ChatGPT-assisted annotated dataset of six thousand Arabic public Facebook posts related to the Israeli Occupation of Palestine. Each post is assigned to one of five discourse categories that represent a continuum from conflict to cohesion: Conflict, Resolution, Community Engagement, Supportive Interactions, and Shared Values. The annotation process combines expert human judgment with model-assisted pre-labeling verified by trained annotators, achieving substantial inter-annotator agreement (Cohens kappa = 0.85). Quantitative analysis reveals a consistent engagement gap, where conflict-oriented posts receive between two and four times more user interaction than resolution-oriented ones (p < 0.01). This pattern illustrates how divisive discourse tends to attract disproportionate visibility in Arabic social media spaces. Cohesion-6K provides a transparent and reproducible resource for the study of online cohesion and polarization. The dataset, annotation guidelines, and preprocessing code will be released for research use under an open license, supporting future work in computational social science, digital communication, and Arabic natural language processing.

2605.22446 2026-05-22 cs.CV cs.AI cs.RO

Pre-VLA: Preemptive Runtime Verification for Reliable Vision-Language-Action and World-Model Rollouts

Pre-VLA: 预防性运行时验证用于可靠视觉-语言-动作和世界模型展开

Zhen Sun, Yongjian Guo, Haoran Sun, Luqiao Wang, Wei Lu, Jiachi Ji, Shengzhe Ji, Junwu Xiong, Zhijun Meng

AI总结 本文提出Pre-VLA,一种统一的运行时验证架构,用于在物理执行或世界模型想象之前评估动作的有效性,以提高视觉-语言-动作和世界模型展开的可靠性。

详情
AI中文摘要

尽管大型视觉-语言-动作(VLA)模型和生成世界模型(WM)在长周期具身智能方面取得了进展,但其实际部署仍受到基于学习的动作生成不确定性的挑战。低质量的动作可能导致执行中的物理故障或导致冗余的渲染成本的误导性世界模型展开。为了解决这个问题,我们提出了Pre-VLA,一种统一的运行时验证架构,能够在物理执行或世界模型想象之前进行预防性动作有效性评估。Pre-VLA利用一个高效的多模态主干,具有模态感知的池化和轻量级双分支头,以预测候选动作片段的安全性信心和批评派生的优势分数。为处理严重的类别不平衡和不稳定边界决策,我们使用结合焦点分类、优势回归和软阈值校准的多任务目标来训练Pre-VLA。在部署期间,双模式预防性重采样调度器过滤低质量的动作,并在有限计算预算下触发自适应重采样。在LIBERO基准测试中,Pre-VLA将四个套件的平均闭环成功率从30.79%提高到37.62%,减少任务执行步骤,实现每个动作片段平均183.9毫秒的前向验证时间,并减轻世界模型展开中的误差累积。

英文摘要

While large vision-language-action (VLA) models and generative world models (WM) have advanced long-horizon embodied intelligence, their practical deployment remains challenged by uncertainty in learning-based action generation. Low-quality actions may cause physical failures during execution or lead to misleading world-model rollouts with redundant rendering costs. To address this issue, we propose Pre-VLA, a unified runtime verification architecture that performs preemptive action validity assessment before physical execution or world-model imagination. Pre-VLA leverages an efficient multimodal backbone with modality-aware pooling and a lightweight dual-branch head to predict both safety confidence and critic-derived advantage scores for candidate action chunks. To handle severe class imbalance and unstable boundary decisions, we train Pre-VLA with a multi-task objective combining Focal classification, advantage regression, and soft-threshold calibration. During deployment, a dual-mode preemptive resampling scheduler filters low-quality actions and triggers adaptive resampling under a limited computation budget. Experiments on the LIBERO benchmark show that Pre-VLA improves the average closed-loop success rate across four suites from 30.79\% to 37.62\% over RynnVLA-002, reduces task execution steps, achieves 183.9 ms average forward verification time per action chunk, and mitigates error accumulation in world-model rollouts.

2605.22443 2026-05-22 cs.RO

Terminal Constraint Model Predictive Control for Image-Based Visual Servoing of UAVs with Kalman Filter-Based Moment Loss Compensation

终端约束模型预测控制用于基于图像的视觉伺服控制无人机的卡尔曼滤波基于矩损失补偿

X. Wang, Y. Cao, W. L. W. Leong, Y. R. Tan, S. Huang, S. H. R. Teo, C. Xiang

AI总结 本文提出了一种终端约束模型预测控制(TC-MPC)框架,结合卡尔曼滤波机制,用于解决基于图像的视觉伺服控制中因输入和状态约束导致的闭环稳定性丧失和因运动剧烈导致的矩特征丢失问题。

详情
AI中文摘要

基于图像的视觉伺服控制(IBVS)通过直接调节图像空间误差为无人机(UAVs)提供高效的视觉引导控制范式。然而,传统IBVS控制器面临两个关键问题:由于输入和状态约束导致接近目标时闭环稳定性丧失,以及在剧烈运动下因矩基视觉特征间歇性丢失导致的控制失效。为了解决这些挑战,本文提出了一种用于IBVS的终端约束模型预测控制(TC-MPC)框架,集成了基于卡尔曼滤波(KF)的状态预测机制。TC-MPC明确将终端状态约束和终端成本纳入IBVS误差动力学中,确保在控制和状态约束下递归可行性、改进的收敛行为和闭环稳定性。同时,卡尔曼滤波预测短时间视觉退化期间图像矩的时序演变,使控制器在矩测量部分不可用时能够保持控制连续性。所提出的方法通过实时无人机视觉伺服控制实验进行了验证。

英文摘要

Image-Based Visual Servoing (IBVS) provides an efficient vision-guided control paradigm for unmanned aerial vehicles (UAVs) by directly regulating image-space errors. However, conventional IBVS controllers are vulnerable to two critical issues: loss of closed-loop stability near the target due to input and state constraints, and control failure caused by intermittent loss of moment-based visual features under aggressive motion. To address these challenges, this paper proposes a terminal-constraint model predictive control (TC-MPC) framework for IBVS, integrated with a Kalman filter (KF)-based state-prediction mechanism. The TC-MPC explicitly incorporates terminal-state constraints and a terminal cost into the IBVS error dynamics, ensuring recursive feasibility, improved convergence behavior, and closed-loop stability under control and state constraints. In parallel, the Kalman filter predicts the temporal evolution of image moments during short-term visual degradation, enabling the controller to preserve control continuity when moment measurements are partially unavailable. The proposed approach is validated through real-time UAV visual servoing experiments.

2605.22435 2026-05-22 cs.CL

Assisted Counterspeech Writing at the Crossroads of Hate Speech and Misinformation

在仇恨言论与虚假信息交汇处的辅助反诽谤写作

Genoveffa Martone, Helena Bonaldi, Marco Guerini

AI总结 本文研究了在仇恨言论和虚假信息共存的背景下,利用大型语言模型辅助专家反诽谤写作的方法,通过三种知识驱动的生成策略,结合事实核查和非政府组织的指南,提高了反诽谤文本的质量和有效性。

详情
AI中文摘要

仇恨言论和虚假信息经常在线上同时出现,加剧偏见和极化。鉴于其规模,使用大型语言模型(LLMs)来帮助专家反诽谤(CS)写作引起了关注,但以往的研究将这些现象分别处理。我们通过研究在仇恨言论和虚假信息共存的背景下生成反诽谤文本,填补了这一空白。我们测试了三种知识驱动的生成策略:首先,我们提示LLM使用事实核查员的指南和事实核查文章;其次,使用非政府组织的指南和报告;第三,我们创建了一种混合策略,结合来自两方面的指南和文档。23名专家修改生成的反诽谤文本,这些文本通过人工和自动指标进行评估。尽管LLM在40%的情况下生成了足够的反诽谤文本,但专家修改显著提高了自然性、全面性和对指南的遵守程度。基于修改后的反诽谤文本,混合策略在众包评估中证明是最有效的,结合了强大的事实纠正、刻板印象缓解和同理心参与。我们发布了包含仇恨言论和误导性声明的数据库,并附有专家验证的反诽谤文本和相关知识。

英文摘要

Hate speech and misinformation frequently co-occur online, amplifying prejudice and polarization. Given their scale, using Large Language Models (LLMs) to assist expert counterspeech (CS) writing has gained interest, yet prior work has addressed these phenomena separately. We bridge this gap by studying CS generation in contexts where both hate and misinformation co-occur. We test three knowledge-driven generation strategies: first we prompt an LLM with fact-checkers' guidelines and fact-checking articles; secondly, with NGOs' guidelines and reports; thirdly, we create a mixed strategy that combines guidelines and documents from both. 23 experts revise the generated CS, which are assessed via human and automatic metrics. While LLMs produce adequate CS in 40% of cases, expert edits substantially improve naturalness, exhaustiveness, and adherence to guidelines. Based on the post-edited CS, the mixed strategy proves to be the most effective in crowdsourcing evaluation, pairing strong factual correction with stereotype mitigation and empathetic engagement. We release a dataset of hateful and misinformed claims with expert-verified CS and supporting knowledge.

2605.22432 2026-05-22 cs.LG

AMUSE: Anytime Muon with Stable Gradient Evaluation

AMUSE: 任何时刻的Muon with Stable Gradient Evaluation

Jueun Kim, Baekrok Shin, Jihun Yun, Beomhan Baek, Minhak Song, Chulhee Yun

AI总结 本文研究了Muon算法的机制,提出了一种名为AMUSE的算法,通过结合Muon的快速批量进步和Schedule-Free平均的稳定效果,实现了无需学习率调度的任何时刻训练,并在视觉任务和大语言模型预训练中提升了性能-迭代帕累托前沿。

详情
Comments
41 pages, 25 figures
AI中文摘要

现代深度学习通常依赖于AdamW和预设的学习率调度,但最近的研究挑战了这两个组件:Schedule-Free优化通过迭代平均去除显式调度,而Muon通过正交化动量来改进矩阵参数的更新几何。尽管Muon在经验上表现强劲,但其底层机制仍部分不明确。我们通过河谷损失景观研究Muon,其中有用的训练进展发生在平坦、低曲率的 bulk 子空间(河流)中,而高曲率主导方向形成陡峭的河谷墙壁,导致振荡。我们实证显示,Muon的正交化通过增加bulk成分加速河流进展,但也放大了主导方向的噪声,导致振荡轨迹。基于此,我们提出Anytime MUon with Stable gradient Evaluation (AMUSE),它结合Muon的快速bulk进展与Schedule-Free平均的稳定效果。AMUSE使用一个随时间变化的插值系数,最初评估接近快速Muon序列的梯度以实现快速适应,然后逐渐转向稳定的平均序列以抑制河谷墙壁的振荡。结果,AMUSE不需要学习率调度并支持任何时刻训练。在视觉任务和大语言模型预训练中,AMUSE在性能-迭代帕累托前沿上一致优于(Schedule-Free) AdamW和Muon。

英文摘要

Modern deep learning commonly relies on AdamW with prescribed learning rate schedules, but recent works challenge both components: Schedule-Free optimization removes explicit schedules via iterate averaging, and Muon improves the update geometry by orthogonalizing momentum for matrix parameters. Despite Muon's strong empirical performance, its underlying mechanism remains partially understood. We study Muon through the river-valley loss landscape, where useful training progress occurs along a flat, low-curvature bulk subspace (the river), while high-curvature dominant directions form steep valley walls that induce oscillations. We empirically show that while Muon's orthogonalization accelerates river progress by increasing the bulk component, it also amplifies dominant-direction noise, causing oscillatory trajectories. Building on this, we propose Anytime MUon with Stable gradient Evaluation (AMUSE), which integrates Muon's rapid bulk progress with the stabilizing effect of Schedule-Free averaging. AMUSE uses a time-varying interpolation coefficient that initially evaluates gradients near the fast Muon sequence for rapid adaptation, then gradually shifts toward the stable averaged sequence to suppress valley-wall oscillations. As a result, AMUSE requires no learning rate schedules and supports anytime training. Across vision tasks and large language model pretraining, AMUSE consistently improves the performance-iteration Pareto frontier over (Schedule-Free) AdamW and Muon.

2605.22431 2026-05-22 cs.RO

Real-Time Auto-Optimization in Unknown Environments via Structure-Exploiting Dual Control for Exploration and Exploitation

通过利用结构的双控方法实现未知环境中的实时自优化

Shiying Dong, Haoyang Yang, Qiwei Liu, Wen-Hua Chen

AI总结 本文提出了一种快速数值双控方法,用于解决未知环境中的自优化问题,通过利用双控方法的结构特性,提高了探索与利用的效率和计算速度。

详情
AI中文摘要

本文开发了一种快速数值双控方法,用于解决未知环境中的自优化问题。在自优化问题中,最优运行条件事先未知且可能随环境变化而变化。与经典双控技术类似,计算负担仍然是双控方法中的主要问题。现有的双控方法提供了一个原理性的探索-利用目标,但主要通过标准优化包或显式梯度型更新法则实现,其中双控方法的数值结构未被充分利用。本文表明,双控方法中的奖励函数具有内在的凸-非线性结构,其中探索和利用项形成一个统一的非线性残差图,配备了凸外损失。得益于这种结构,通过仅线性化非线性残差图而保留凸外损失,开发出一种结构利用的数值方法。因此,每个子问题被转换为结构化的凸形式,可以可靠地求解。所得到的广义高斯-牛顿Hessian近似是正半定的,并且仅依赖于一阶导数,从而支持快速的在线计算。所提出的方法在车辆巡航自优化问题上进行了评估,并与现有方法进行了比较。仿真和硬件在环实验结果表明,所提出的方法提高了控制性能,并实现了约一个数量级的速度提升,最大计算时间仅为83微秒,仅在典型车辆嵌入式CPU上。

英文摘要

This paper develops a fast numerical dual control for exploration and exploitation (DCEE) method to address auto-optimization problems in unknown environments. In auto-optimization problems, the optimal operating condition is unknown a priori and may vary with the environment. As in classical dual control techniques, computational burden remains a major concern in DCEE for active learning. Existing DCEE methods provide a principled exploration-exploitation objective, but mainly realized through standard optimization packages or explicit gradient-type update laws, where the numerical structure of the DCEE has not been fully exploited. This paper shows that the reward function in DCEE has an inherent convex-over-nonlinear structure, where the exploitation and exploration terms form a unified nonlinear residual map equipped with a convex outer loss. Benefiting from this structure, a structure-exploiting numerical method is developed by linearizing only the nonlinear residual map while preserving the convex outer loss. Thus, each subproblem is transformed into a structured convex form that can be solved reliably. The resulting generalized Gauss-Newton Hessian approximation is positive semidefinite and depends only on first-order derivatives, thereby supporting fast online computation. The proposed method is evaluated on a vehicle cruising auto-optimization problem and compared with existing methods. Simulation and hardware-in-the-loop experimental results show that the proposed method improves control performance and achieves a speedup of approximately one order of magnitude, with a microsecond-level maximum computation time of only 83 μs on a typical vehicle embedded CPU.

2605.22422 2026-05-22 cs.CV cs.AI

FastTab: A Fast Table Recognizer with a Tiny Recursive Module and 1D Transformers

FastTab: 一种快速表格识别器,结合了微小递归模块和1D变换器

Laziz Hamdi, Amine Tamasna, Pascal Boisson, Thierry Paquet

AI总结 本文提出FastTab,一种基于网格的表格结构识别模型,通过轻量级的Tiny Recursive Module和轴向1D Transformer编码器,实现了高效的表格结构恢复,同时在多个基准测试中表现出低延迟和良好的鲁棒性。

详情
AI中文摘要

表格结构识别(TSR)需要在表级一致性(行/列数量、表头、跨单元格)和精确的分隔符定位之间取得平衡。我们介绍了FastTab,一种以网格为中心的TSR模型,通过结合(i)轻量级的Tiny Recursive Module(TRM)进行全局推理和(ii)轴向1D Transformer编码器,捕捉行和列上的长距离依赖关系,避免了自动回归的HTML解码。该模型预测行/列数量、表头行和分隔符以构建网格,然后利用ROI对齐的单元格特征推断行跨度/列跨度。在四个基准测试(PubTabNet、FinTabNet、PubTables-1M和SciTSR)中,FastTab在结构恢复性能方面表现优异,同时在低延迟推理中运行良好。我们进一步研究了在像素级匿名化下的鲁棒性,并展示了对相机捕获文档中弯曲分隔符的扩展。源代码将在https://github.com/hamdilaziz/FastTab上公开发布。

英文摘要

Table structure recognition (TSR) requires both table-level coherence (row/column counts, headers, spanning cells) and precise separator localization. We introduce FastTab, a grid-centric TSR model that avoids autoregressive HTML decoding by combining (i) a lightweight Tiny Recursive Module (TRM) for global reasoning and (ii) axial 1D Transformer encoders that capture long-range dependencies along rows and columns. The model predicts row/column counts, header rows, and separators to construct a grid, then infers rowspan/colspan using ROI-aligned cell features. Across four benchmarks (PubTabNet, FinTabNet, PubTables-1M, and SciTSR), FastTab achieves competitive structure recovery performance while operating at low-latency inference. We further study robustness under pixel-level anonymisation and show an extension to curved separators for camera-captured documents. The source code will be made publicly available at https://github.com/hamdilaziz/FastTab .

2605.22420 2026-05-22 cs.CV cs.AI cs.RO

Diffusion-guided Generalizable Enhancer for Urban Scene Reconstruction

基于扩散的通用增强器用于城市场景重建

Henry Che, Jingkang Wang, Yun Chen, Ze Yang, Sivabalan Manivasagam, Raquel Urtasun

AI总结 本文提出GenRe,一种基于扩散的通用增强器,用于城市场景重建,通过学习不同场景中的生成先验,高效地生成稳健且高保真的表示,能够可靠地泛化到挑战性的未见过的视角,从而在自动驾驶中实现鲁棒和可扩展的传感器模拟。

详情
Comments
ICRA 2026. Project page: https://waabi.ai/genre
AI中文摘要

从真实世界观测重建城市场景已成为自动驾驶开发和测试的强大工具。尽管当前的神经渲染方法在记录轨迹上实现了高质量的渲染,但其在大视角变化下质量显著下降,限制了闭环模拟的应用。最近的研究表明,使用扩散模型在这些具有挑战性的视角上增强质量并将其改进回3D表示具有前景。然而,它们通常需要昂贵的每场景优化,且提炼的表示仍然脆弱,无法超越有限的合成视角泛化。为了解决这些限制,我们提出了GenRe,一种新的基于扩散的通用增强器用于城市场景重建。GenRe输入任何预训练的3D高斯表示,并在几分钟内修复其中的缺陷。通过学习在多样化场景中提炼生成先验,GenRe高效地生成稳健且高质量的表示,能够可靠地泛化到具有挑战性的未见过的视角(例如,变道)。实验表明,GenRe在质量和效率上均优于现有方法,并且受益于各种下游任务,使自动驾驶中的传感器模拟更加稳健和可扩展。

英文摘要

Urban scene reconstruction from real-world observations has emerged as a powerful tool for self-driving development and testing. While current neural rendering approaches achieve high-fidelity rendering along the recorded trajectories, their quality degrades significantly under large viewpoint shifts, limiting the applicability for closed-loop simulation. Recent works have shown promising results in using diffusion models to enhance quality at these challenging viewpoints and distill improvements back into 3D representations. However, they often require costly per-scene optimization, and the distilled representations remain fragile and fail to generalize beyond limited synthesized views. To address these limitations, we propose GenRe, a novel diffusion-guided generalizable enhancer for urban scene reconstruction. GenRe takes as input any pretrained 3D Gaussian representation and fixes the deficiencies within a few minutes. By learning to distill generative priors across diverse scenes, GenRe produces robust and high-fidelity representation efficiently that generalizes reliably to challenging unseen viewpoints (e.g., lane change). Experiments show that GenRe outperforms existing methods in both quality and efficiency and benefits various downstream tasks, enabling robust and scalable sensor simulation for autonomous driving.

2605.22416 2026-05-22 cs.LG cs.DC cs.PF

Asymmetric Virtual Memory Paging for Hybrid Mamba-Transformer Inference

异构虚拟内存分页用于混合Mamba-Transformer推理

An Xuan Nguyen

AI总结 本文提出了一种异构虚拟内存分页方法,用于解决混合Mamba-Transformer模型推理中不同内存缓存类型的内存管理问题,通过分离两种缓存类型到物理上不同的池中,并在需要时迁移容量以提高内存利用率和推理吞吐量。

详情
Comments
11 pages, 8 figures, 6 tables. Code and reproducibility artifacts at https://github.com/codepawl/cachepawl
AI中文摘要

混合语言模型如Jamba将注意层与状态空间模型(SSMs)相结合,创建了两种具有相反特征的内存缓存类型:键值(KV)缓存随着序列长度线性增长,而SSM状态则每层保持固定。当前的推理引擎对此处理不佳。统一池将SSM状态填充到注意页面大小,浪费了高达7.3倍的容量。静态双池在请求之间提示分布变化时无法适应。我们提出了异构虚拟内存分页(AVMP)。分配器将这两种缓存类型分离到物理上不同的池中,背后有一个统一的虚拟地址空间,并在其中一个池用尽时迁移容量。迁移仅在分配失败时触发,保持行为确定性。我们评估了AVMP在270个合成单元和60个ShareGPT回放单元上的RTX 3060 12GB上。内存不足事件减少了7.6%,请求吞吐量在合成工作负载上提高了1.83倍至13.3倍,在ShareGPT上提高了2.36倍。所有收益在配对-Bootstrap 95%置信区间内均成立。一个相时间分解揭示了两种不同的机制:在容量压力工作负载上较短的内存不足恢复时间,以及在KV密集工作负载上更快的分配调用速度。实现是纯Python;Triton集成是未来的工作。

英文摘要

Hybrid language models like Jamba mix attention layers with State Space Models (SSMs), creating two memory cache types with opposite profiles: Key-Value (KV) caches grow linearly with sequence length, while SSM states stay fixed per layer. Current inference engines handle this poorly. Unified pools pad SSM states to attention page sizes, wasting up to 7.3x capacity. Static dual pools cannot adapt when prompt distributions shift between requests. We present Asymmetric Virtual Memory Paging (AVMP). The allocator separates the two cache types into physically distinct pools behind a unified virtual address space, and migrates capacity between pools when one runs out. Migration triggers only on allocation failure, keeping behavior deterministic. We evaluate AVMP across 270 synthetic cells plus 60 cells of ShareGPT trace replay on an RTX 3060 12GB. Out-of-Memory events drop 7.6% and request throughput improves 1.83x to 13.3x across synthetic workloads and 2.36x on ShareGPT. All gains hold under paired-bootstrap 95% confidence intervals. A phase-time breakdown reveals two distinct mechanisms: shorter OOM recovery on capacity-pressured workloads, and faster allocation calls on KV-heavy workloads. Implementation is pure Python; Triton integration is future work.

2605.22414 2026-05-22 cs.CV cs.AI

Towards Clinically Interpretable Ophthalmic VQA via Spatially-Grounded Lesion Evidence

迈向具有空间定位病变证据的临床可解释性眼科VQA

Xingyue Wang, Bo Liu, Meng Wang, Zhixuan Zhang, Chengcheng Zhu, Huazhu Fu, Jiang Liu

AI总结 本文提出FundusGround基准,通过空间定位病变证据提升眼科VQA的临床可解释性,通过三阶段流程收集标注病变的视网膜影像,并评估多种视觉语言模型在答案准确性和病变层面推理上的表现。

详情
AI中文摘要

视觉问答(VQA)在临床支持中具有巨大潜力,特别是在眼科领域,视网膜彩色照相是诊断的关键。然而,眼科VQA基准主要强调答案准确性,忽视了临床可解释性所需的显式视觉证据。本文引入FundusGround,一个新的具有空间定位病变证据的临床可解释性眼科VQA基准。具体而言,我们提出一个三阶段流程,收集了10,719张带有15,595个图像级精细标注病变的视网膜影像。为确保解剖一致性和临床有效性,所有病变均通过早期治疗糖尿病视网膜病变研究(ETDRS)网格进行空间定位,从而标准化映射到九个具有临床意义的视网膜区域。基于此结构化的病变证据,生成了72,706个问题,涵盖四种格式:开放式、封闭式、单选和多选。我们进一步使用双指标(答案准确性和病变层面推理)评估多种通用和医学大型视觉语言模型。实验表明,整合病变层面的视觉证据能持续提高模型性能和透明度,突显了显式空间定位对于可靠和可解释性眼科VQA的必要性。

英文摘要

Visual Question Answering (VQA) holds great promise for clinical support, particularly in ophthalmology, where retinal fundus photography is essential for diagnosis. However, ophthalmic VQA benchmarks primarily emphasize answer accuracy, neglecting the explicit visual evidence necessary for clinical interpretability. In this work, we introduce FundusGround, a new benchmark for clinically interpretable ophthalmic VQA with spatially-grounded lesion evidence. Specifically, we propose a three-stage pipeline that collects 10,719 fundus images with 15,595 image-level meticulously annotated lesions. To ensure anatomical consistency and clinical validity, all lesions are spatially localized using the Early Treatment Diabetic Retinopathy Study (ETDRS) grid, enabling standardized mapping to nine clinically meaningful retinal regions. Built upon this structured lesion evidence, 72,706 questions are then generated spanning four formats: open-ended, closed-ended, single-choice, and multiple-choice. We further benchmark multiple general- and medical- large vision-language models using dual metrics for answer accuracy and lesion-level reasoning. The experiments demonstrate that incorporating lesion-level visual evidence consistently improves model performance and transparency, highlighting the necessity of explicit spatial grounding for reliable and explainable ophthalmic VQA.

2605.22413 2026-05-22 cs.CV

From Recognition to Reasoning: Benchmarking and Enhancing MLLMs on Real-World Receipt Document Understanding

从识别到推理:在现实世界收据文档理解上对齐和增强MLLMs

Yandi Wang, Libin Zhan, Ziwei Huang, Tiancheng Luo, Yuxuan Jiang, Wang Dong, Leilei Gan, Jun Chen

AI总结 本文提出ReceiptBench基准,通过四个层次化子任务提升收据信息提取的结构一致性,并提出两阶段训练框架GRPO,通过强化学习信号提升模型性能,实验证明其在复杂推理任务上的优越性。

详情
AI中文摘要

从视觉文档中提取结构化信息(视觉信息提取,VIE)是业务自动化的核心。尽管最近的多模态大语言模型(MLLMs)展示了有前途的能力,但现有基准在规模和现实性方面存在关键限制,缺乏语义粒度,并未能覆盖多样化的文档类型。为弥合这一差距,我们引入ReceiptBench,一个大规模、人工标注的基准,包含10,000种多样化的收据,将信息提取组织成四个层次化子任务:(1)基础感知用于原始文本定位,(2)格式标准化用于严格遵循标准化指令,(3)语义推理用于从上下文中推断隐含属性,(4)结构解析用于处理嵌套的行项。此外,我们提出了一种两阶段训练框架,结合Metric-Aware Group Relative Policy Optimization(GRPO),将严格评估约束转化为强化学习信号以增强结构一致性。广泛的实验表明,我们的方法在复杂推理任务上实现了最先进的性能,超越了领先的专有模型。我们在此发布我们的数据集和代码:https://github.com/wwwT0ri/ReceiptBench。

英文摘要

Extracting structured information from visual documents (Visual Information Extraction, VIE) is a cornerstone of business automation. While recent Multimodal Large Language Models (MLLMs) have shown promising capabilities, existing benchmarks suffer from critical limitations in scale and realism, lack semantic granularity, and fail to cover diverse document types. To bridge this gap, we introduce ReceiptBench, a large-scale, human-annotated benchmark consisting of 10k diverse receipts, organizing information extraction into four hierarchical sub-tasks: (1) Basic Perception for raw text spotting, (2) Format Normalization for strictly following standardization instructions, (3) Semantic Reasoning for inferring implicit attributes from context, and (4) Structure Parsing for handling nested line items. Furthermore, we propose a two-stage training framework incorporating Metric-Aware Group Relative Policy Optimization (GRPO), which translates rigorous evaluation constraints into reinforcement learning signals to enhance structural consistency. Extensive experiments demonstrate that our method yields state-of-the-art performance, surpassing leading proprietary models on complex reasoning tasks. We release our datasets and code at https://github.com/wwwT0ri/ReceiptBench.

2605.22411 2026-05-22 cs.CL cs.AI cs.LG

DeferMem: Query-Time Evidence Distillation via Reinforcement Learning for Long-Term Memory QA

DeferMem: 通过强化学习进行长时记忆问答的查询时证据蒸馏

Jianing Yin, Tan Tang

AI总结 本文提出DeferMem,一种长时记忆框架,通过分离问题为高召回候选检索和查询条件证据蒸馏,以提升长时记忆问答的准确性和效率。

详情
Comments
31 pages, 3 figures
AI中文摘要

大型语言模型(LLM)代理在长时记忆问答任务中仍面临挑战,因为答案支持的证据通常分散在长对话历史中并被大量无关内容掩盖。现有记忆系统通常在未来的查询确定之前处理记忆,然后根据相似性而非其对回答查询的效用来检索结果单元。这种工作流程使下游回答者不得不对检索的候选进行去噪并重建查询特定的证据。我们提出了DeferMem,一种长时记忆框架,将该问题分解为高召回候选检索和查询条件证据蒸馏。DeferMem使用轻量级的段链接结构来组织原始历史并在查询时检索广泛的候选。然后,它应用一个通过DistillPO训练的内存蒸馏器,DistillPO是我们用于将高召回但高度嘈杂的候选蒸馏成一组忠实、自包含且查询条件的证据的强化学习算法。DistillPO将检索后的证据蒸馏制定为一个结构化的动作,包括信息选择和证据重写。它通过分解和门控奖励管道和结构对齐优势分配来优化此动作,门控奖励组件从有效性到质量检查,同时在早期暴露任务级别的正确性反馈,并将每个奖励分配给其负责的输出片段。在LoCoMo和LongMemEval-S上,DeferMem在问答准确性和记忆系统效率上超过了强大的基线,在达到最高问答准确度的同时实现了最快的运行时间和零商业API令牌成本的记忆操作。

英文摘要

Large language model (LLM) agents still struggle with long-term memory question answering, where answer-supporting evidence is often scattered across long conversational histories and buried in substantial irrelevant content. Existing memory systems typically process memory before future queries are known, then retrieve the resulting units based on similarity rather than their utility for answering the query. This workflow leaves downstream answerers to denoise retrieved candidates and reconstruct query-specific evidence. We present DeferMem, a long-term memory framework that decouples this problem into high-recall candidate retrieval and query-conditioned evidence distillation. DeferMem uses a lightweight segment-link structure to organize raw history and retrieve broad candidates at query time. It then applies a memory distiller trained with DistillPO, our reinforcement learning algorithm for distilling the high-recall but highly noisy candidates into a set of faithful, self-contained, and query-conditioned evidence. DistillPO formulates post-retrieval evidence distillation as a structured action comprising message selection and evidence rewriting. It optimizes this action with a decomposed-and-gated reward pipeline and structure-aligned advantage assignment, gating reward components from validity to quality checks while exposing task-level correctness feedback early and assigning each reward to its responsible output span. On LoCoMo and LongMemEval-S, DeferMem surpasses strong baselines in QA accuracy and memory-system efficiency, achieving the highest QA accuracy with the fastest runtime and zero commercial-API token cost for memory operations.

2605.22410 2026-05-22 cs.LG

Minimum Description Length based Granular-Ball Tree Regularization for Spectral Clustering

基于最小描述长度的粒状球树正则化谱聚类

Zeqiang Xian, Caihui Liu, Yong Zhang, Wenjing Qiu

AI总结 本文提出一种基于最小描述长度的粒状球树正则化谱聚类方法,通过局部MDL模型选择构建粒状球树,利用反向邻域连续性抑制破坏可靠局部连接的分裂,利用稳定的叶球提供编码尺度信息正则化样本级亲和图,并引入共享邻居桥码调整弱局部桥接关系,从而在统一的谱聚类框架中连接可解释的局部表示学习与亲和图构建。

详情
Comments
28 pages, 5 figures, 6 tables
AI中文摘要

谱聚类很大程度上依赖于亲和图,但构造一个能保持可靠局部连接并适应异构数据结构的图仍然具有挑战性。现有的基于粒状球的谱聚类方法通常通过使用粗粒度代表来减少图的复杂性。然而,学习到的局部区域通常被当作图节点或锚点,其结构信息未被充分用于正则化原始样本级图。为了解决这个问题,本文提出了一种基于最小描述长度的粒状球树正则化谱聚类方法,称为MDL-GBTRSC。所提出的方法通过局部MDL模型选择构建粒状球树,利用反向邻域连续性来抑制破坏可靠局部连接的分裂。从树中获得的稳定的叶球提供了用于正则化样本级亲和图的编码尺度信息。此外,引入了共享邻居桥码来调整弱局部桥接关系,而无需额外用户指定的阈值。这样,MDL-GBTRSC在统一的谱聚类框架中连接了可解释的局部表示学习与亲和图构建。在真实和合成数据集上的实验表明,与经典谱聚类基线和代表性的粒状球、微簇、锚点方法相比,MDL-GBTRSC在所采用的固定配置协议下实现了最佳的平均AR I和NMI。

英文摘要

Spectral clustering largely depends on the affinity graph, yet constructing a graph that preserves reliable local connectivity while adapting to heterogeneous data structures remains challenging. Existing granular-ball-based spectral clustering methods usually reduce graph complexity by using coarse-grained representatives. However, the learned local regions are often treated as graph nodes or anchors, and their structural information is not sufficiently used to regularize the original sample-level graph. To address this issue, this paper proposes a Minimum Description Length based Granular-Ball Tree-Regularized Spectral Clustering method, termed MDL-GBTRSC. The proposed method constructs a granular-ball tree through local MDL model selection, with reciprocal neighborhood continuity used to discourage splits that break reliable local connections. The stable leaf balls obtained from the tree provide coding-scale information for regularizing the sample-level affinity graph. In addition, a shared-neighbor bridge code is introduced to adjust weak local bridge relations without requiring an additional user-specified threshold. In this way, MDL-GBTRSC connects interpretable local representation learning with affinity graph construction in a unified spectral clustering framework. Experiments on real and synthetic datasets show that MDL-GBTRSC achieves the best average ARI and NMI under the adopted fixed-configuration protocol compared with classical spectral clustering baselines and representative granular-ball, micro-cluster, and anchor-based methods.

2605.22403 2026-05-22 cs.CV

Translating Signals to Languages for sEMG-Based Activity Recognition

将信号转换为语言以实现基于sEMG的活动识别

Ming Wang, Haoxuan Qu, Qiuhong Ke, Wei Zhou, Hossein Rahmani, Jun Liu

AI总结 本文提出了一种基于大语言模型的sEMG活动识别框架,通过将连续sEMG信号转换为语言形式,提升活动识别的准确性。

详情
AI中文摘要

基于sEMG信号的活动识别近年来受到了越来越多的研究关注。为了开发准确的sEMG信号活动识别器,已经提出了许多方法。一些研究专注于设计更大的、更具表达能力的模型架构以增强sEMG信号的表示能力,而另一些研究则通过大规模预训练来丰富模型先验知识,从而提高识别性能。最近,大语言模型(LLMs)在自然语言处理中展示了显著的泛化和推理能力,其隐含的知识,从大量的动作语言描述中学习而来,为解释sEMG信号和推断活动意图提供了新的可能性。受此启发,我们提出了LLM-sEMG,一种新的框架,利用LLMs作为sEMG活动识别器。在该框架中,我们设计了一种面向语言的映射机制,将连续的sEMG序列转换为sEMG语言,结合多种策略进一步促进信号到语言的映射过程。广泛的实验表明,所提出的框架能够利用大语言模型实现高精度的sEMG信号活动识别。

英文摘要

Surface electromyography (sEMG) signal-based activity recognition has attracted increasing research attention in recent years. To develop accurate sEMG signal-based activity recognizers, numerous approaches have been proposed. Some studies focus on designing larger and more expressive model architectures to enhance the representational capacity of sEMG signals, while others aim to enrich model priors through large-scale pretraining, thereby improving recognition performance. Recently, large language models (LLMs) have shown remarkable generalization and reasoning capabilities in natural language processing, whose implicit knowledge, learned from extensive linguistic descriptions of actions, opens new possibilities for interpreting sEMG signals and inferring activity intentions. Motivated by this, we propose LLM-sEMG, a novel framework that leverages LLMs as sEMG activity recognizers. Within this framework, we design a language-oriented mapping mechanism that converts continuous sEMG sequences into sEMG language, integrating several strategies to further facilitate the signal-to-language mapping process. Extensive experiments demonstrate that the proposed framework achieves highly accurate sEMG signal-based activity recognition using large language models.

2605.22401 2026-05-22 cs.LG cs.NE q-bio.NC

Cross-Species RSA Reveals Conserved Early Visual Alignment but Divergent Higher-Area Rankings Across Human fMRI and Macaque Electrophysiology

跨物种RSA揭示人类fMRI和猴子电生理学中早期视觉对齐的保守性,但更高区域的排名却呈现分歧

Nils Leutenegger

AI总结 该研究通过跨物种比较,发现早期视觉对齐在人类和猴子之间具有保守性,但更高区域的对齐性受模型容量和刺激域影响。

详情
Comments
9 pages, 6 figures
AI中文摘要

学习规则与大脑对齐之间的关系是否在物种间通用?我们扩展了之前的发现,即未经训练的CNN在人类V1中与反向传播匹配,通过将相同的五个学习规则应用于猴子电生理学进行测试。这些规则包括反向传播(BP)、反馈对齐(FA)、预测编码(PC)、脉冲时间依赖性可塑性(STDP)以及一个未经训练的随机权重基线。猴子数据来自两个数据集:MajajHong2015(V4/IT,3,200次刺激呈现,88/168个神经元)和FreemanZiemba2013(V1/V2,135个刺激,102/103个神经元)。使用与人类研究中相同的模型权重进行RSA分析,我们发现:(1)所有模型在猴子早期视觉皮层(V1/V2)的对齐度(rho = 0.15-0.30)高于人类fMRI(rho = 0.01-0.08),这与电生理学更高的信噪比一致;(2)STDP和PC在猴子V1/V2的对齐度最高(rho ~ 0.30和0.28),这与它们在人类V1中训练规则中的领先位置一致;(3)在IT区域,学习规则的跨物种排名无显著相关性(Kendall's tau = 0.00,p = 1.00),尽管这一结果预期,因为n = 5只在tau = ±1.0时有统计效力,且进一步受到刺激集差异的影响;(4)预训练的ResNet-50(ImageNet)在猴子IT区域达到rho = 0.25,显著高于所有自定义CNN条件(rho = 0.07-0.14),表明IT区域的对齐受限于模型容量和训练数据,而非学习规则。信噪比、多种子变异性(5个种子)和刺激控制分析被报告。这些结果表明,早期视觉对齐在物种间具有鲁棒性,而更高区域的对齐受模型容量和刺激域影响。

英文摘要

Does the relationship between learning rules and brain alignment generalize across species? We extend our prior finding that untrained CNNs match backpropagation at human V1 by testing the same five learning rules against macaque electrophysiology. The rules are backpropagation (BP), feedback alignment (FA), predictive coding (PC), spike-timing-dependent plasticity (STDP), and an untrained random-weights baseline. The macaque data come from two datasets: MajajHong2015 (V4/IT, 3,200 stimulus presentations, 88/168 neurons) and FreemanZiemba2013 (V1/V2, 135 stimuli, 102/103 neurons). Using RSA with identical model weights from our human study, we find: (1) all models achieve higher alignment with macaque early visual cortex (rho = 0.15-0.30 at V1/V2) than with human fMRI (rho = 0.01-0.08), consistent with the higher signal-to-noise ratio of electrophysiology; (2) STDP and PC produce the highest macaque V1/V2 alignment (rho ~ 0.30 and 0.28), consistent with their leading position among trained rules in human V1; (3) at IT, learning rule rankings show no detectable correlation across species (Kendall's tau = 0.00, p = 1.00), though this null result is expected given that n = 5 provides power only at tau = +/-1.0, and is further confounded by stimulus set differences; (4) a pretrained ResNet-50 (ImageNet) achieves rho = 0.25 at macaque IT, substantially above all custom CNN conditions (rho = 0.07-0.14), suggesting IT alignment is limited by model capacity and training data rather than by the learning rule. Noise ceilings, multi-seed variability (5 seeds), and a stimulus-control analysis are reported. These results demonstrate that early visual alignment is robust across species, while higher-area alignment is modulated by model capacity and stimulus domain.

2605.22391 2026-05-22 cs.AI cs.CL cs.CY

Epicure: Navigating the Emergent Geometry of Food Ingredient Embeddings

Epicure:探索食品成分嵌入的涌现几何

Jakub Radzikowski, Josef Chen

AI总结 本文提出Epicure,一种基于三兄弟skip-gram模型重新训练的食品成分嵌入方法,通过多语言食谱语料库构建了包含1790个标准成分的嵌入模型,并通过三种不同的随机游走方案生成了不同侧重的模型。

详情
AI中文摘要

我们提出了Epicure,一种由三个兄弟skip-gram成分嵌入模型组成的家族,这些模型是从多语言食谱语料库中从头开始重新训练的。我们汇总了来自11个来源的414万条食谱,涵盖七种语言:英语、中文、俄语、越南语、西班牙语、土耳其语、印度尼西亚语、德语和印度英语,并通过一个增强语言模型的流程将原始成分字符串标准化为1790个标准条目。一个包含203,508条边的成分-成分NPMI图和一个包含80,019条边的带类型FlavorDB成分-化合物图,以及2,247个带类型化合物节点跨越15个类别,为三种共享架构和超参数但仅在随机游走方案上不同的Metapath2Vec变体提供了基础:Cooc仅在共现图上行走,Chem仅在带类型化合物元路径上行走,Core则通过注入的成分-成分行走进行混合,在可控混合下,将每个模型置于化学与食谱上下文的谱线上不同的位置。

英文摘要

We present Epicure, a family of three sibling skip-gram ingredient embeddings retrained from scratch on a multilingual recipe corpus. We aggregate 4.14M recipes from 11 sources spanning seven languages, English, Chinese, Russian, Vietnamese, Spanish, Turkish, Indonesian, German, and Indian-English, and normalise the raw ingredient strings to 1,790 canonical entries via an LLM-augmented pipeline. A 203,508-edge ingredient-ingredient NPMI graph and an 80,019-edge typed FlavorDB ingredient-compound graph, 2,247 typed compound nodes across 15 categories, seed three Metapath2Vec variants that share architecture and hyperparameters and differ only in the random-walk schema: Cooc walks the co-occurrence graph only, Chem walks the typed compound metapaths only, and Core blends both via injected ingredient-ingredient walks at controlled mixing, placing each model at a distinct point on the chemistry-vs-recipe-context spectrum.

2605.22390 2026-05-22 cs.LG

A Posterior-Predictive Variance Decomposition for Epistemic and Aleatoric Uncertainty in Wind Power Forecasting

后验预测方差分解用于风力发电中的epistemic和aleatoric不确定性

Yinsong Chen, Samson S. Yu, Kashem M. Muttaqi

AI总结 本文提出了一种后验预测方差分解方法,用于分离风力发电预测中的epistemic和aleatoric不确定性,通过总不确定性分解为aleatoric和epistemic组件,并提出特定于风力发电的评估框架来验证分解的有效性。

详情
AI中文摘要

准确的风力发电预测需要可靠的不确定性量化,但现有大多数方法报告单一的预测不确定性,将epistemic和aleatoric来源混淆了。本文应用总方差定律到异方差神经网络回归和贝叶斯后验近似联合设置中,推导出总不确定性(TU)的显式分解,将其分为aleatoric(AU)和epistemic(EU)组件。所得估计器与标准后验近似方法和β-NLL训练兼容,用于调节均值-方差学习的权衡。提出了一种特定于风力发电的评估框架,用于在没有地面真实不确定性标签的情况下验证分离性,包括三个模块:受控合成实验以验证对异方差噪声和分布偏移的响应;数据属性驱动验证在真实世界风力涡轮机SCADA数据集上;以及数据集大小缩放实验以检验EU的预测渐近行为。在合成和真实世界实验中,分解的AU和EU组件在噪声结构、分布偏移和训练规模变化方面表现出理论一致的方向,支持所提出分解和评估协议的理论一致性和操作实用性。

英文摘要

Accurate wind power forecasting requires reliable uncertainty quantification, yet most existing methods report a single predictive uncertainty that conflates epistemic and aleatoric sources. This paper applies the law of total variance to the joint setting of heteroscedastic neural network regression and Bayesian posterior approximation, deriving an explicit decomposition of total uncertainty (TU) into aleatoric (AU) and epistemic (EU) components. The resulting estimators are compatible with standard posterior-approximation methods and with $β$-NLL training to regulate the mean--variance learning trade-off. A wind power--specific evaluation framework is proposed to validate disentanglement without access to ground-truth uncertainty labels, comprising three modules: controlled synthetic experiments to verify responses to heteroscedastic noise and distribution shift; data-property--driven validation on a real-world wind turbine SCADA dataset; and dataset-size scaling experiments to examine the predicted asymptotic behavior of EU. Across synthetic and real-world experiments, the decomposed AU and EU components respond in theoretically consistent directions to noise structure, distributional shift, and training-scale variation, supporting the theoretical consistency and operational utility of the proposed decomposition and evaluation protocol.

2605.22389 2026-05-22 cs.CL

Unified Data Selection for LLM Reasoning

统一的数据选择用于LLM推理

Xiaoyuan Li, Yubo Ma, Chengpeng Li, Fengbin Zhu, Yiyao Yu, Keqin Bao, Wenjie Wang, Fuli Feng, Dayiheng Liu

AI总结 本文提出了一种无需训练的高熵和(HES)指标,用于评估和选择高质量的推理样本,通过在三种主流训练范式(监督微调、拒绝微调和强化学习)中验证,证明了其在提高LLM推理性能和减少计算开销方面的有效性。

详情
Comments
Under Review
AI中文摘要

有效训练大型语言模型(LLMs)进行复杂、长链推理通常受到需要大量高质量推理数据的限制。现有方法要么计算成本高,要么无法可靠地区分高质量和低质量的推理样本。为此,我们提出高熵和(HES),一种无需训练的度量标准,通过仅对每个推理样本中熵最高的(例如0.5%)高熵标记进行求和来量化推理质量。我们验证了HES在三种主流训练范式:监督微调(SFT)、拒绝微调(RFT)和强化学习(RL)中的效果,结果表明其效果一致且显著减少了计算开销。在SFT中,使用顶部20%的HES排名数据与完整数据集性能相当,而使用最低HES数据会使其下降。在RFT中,我们的HES基于训练方法显著优于基线方法。在RL中,HES选择的成功轨迹使模型能够学习到强大的推理模式,显著超越其他比较方法。我们的发现确立了HES作为一种稳健、无需训练的度量标准,使开发高级LLM推理能力成为统一、有效和高效的方法。

英文摘要

Effectively training Large Language Models (LLMs) for complex, long-CoT reasoning is often bottlenecked by the need for massive high-quality reasoning data. Existing methods are either computationally expensive or fail to reliably distinguish high- from low-quality reasoning samples. To address this, we propose High-Entropy Sum (HES), a training-free metric that quantifies reasoning quality by summing only the entropy of the top (e.g., 0.5\%) highest-entropy tokens in each reasoning sample. We validate HES across three mainstream training paradigms: Supervised Fine-tuning (SFT), Rejection Fine-tuning (RFT), and Reinforcement Learning (RL), with extensive results demonstrating its consistent effectiveness and significantly reduced computational overhead. In SFT, training on the top 20\% HES-ranked data matches full-dataset performance, while using the lowest-HES data degrades it. In RFT, our HES-based training approach significantly outperforms baseline methods. In RL, HES-selected successful trajectories enable the model to learn strong reasoning patterns, significantly surpassing other compared methods. Our findings establish HES as a robust, training-free metric that enables a unified, effective, and efficient method for developing advanced reasoning in LLMs.

2605.22387 2026-05-22 cs.LG cs.CE

Hybrid Kolmogorov-Arnold Network and XGBoost Framework for Week-Ahead Price Forecasting in Australia's National Electricity Market

混合 Kolmogorov-Arnold 网络与 XGBoost 框架用于澳大利亚国家电力市场的周 ahead 电价预测

Houxuan Zhou, Sriram Prasad, Chenghao Huang, Jiajie Feng, Hao Wang

AI总结 本文提出了一种混合 KAN+XGBoost 框架,用于预测澳大利亚国家电力市场的周 ahead 电价,该框架结合了 Kolmogorov-Arnold 网络的全局非线性表示能力和 XGBoost 的局部鲁棒性,以捕捉长期依赖和短期价格波动,实验表明该模型在 MAE 上比 XGBoost 和 naive 基线模型分别减少了 12% 和 50% 以上。

详情
Comments
The 24th IEEE International Conference on Industrial Informatics, 2026
AI中文摘要

准确的电力价格预测(EPF)对于市场参与者支持运营计划和风险管理至关重要,但因强波动性、非线性动态和频繁的极端价格尖峰而具有挑战性。这些挑战在澳大利亚国家电力市场(NEM)中尤为突出,其中高可再生能源渗透率进一步增加了不确定性。本文研究了周 ahead 电力价格预测,并提出了一种混合 KAN+XGBoost 框架,该框架结合了 Kolmogorov-Arnold 网络(KAN)与基于树的学习方法。所提出的方法结合了 KAN 的全局非线性表示能力与 XGBoost 的局部鲁棒性,以捕捉长期依赖和短期价格波动。实验在真实 NEM 数据上使用扩展窗口评估策略进行。结果表明,所提出的模型在基准方法(包括 SARIMAX、长短期记忆(LSTM)、独立 KAN 和 XGBoost)上表现更优,与 XGBoost 相比将 MAE 减少了约 12%,与 naive 基线相比减少了超过 50%。结果表明,混合学习策略为高动态电力市场中的电价预测提供了一种有效且稳健的解决方案。

英文摘要

Accurate electricity price forecasting (EPF) is essential for market participants to support operational planning and risk management, yet remains challenging due to strong volatility, nonlinear dynamics, and frequent extreme price spikes. These challenges are particularly pronounced in the Australian National Electricity Market (NEM), where high renewable penetration further increases uncertainty. This paper investigates week-ahead electricity price forecasting and proposes a hybrid KAN+XGBoost framework that integrates Kolmogorov-Arnold Networks (KAN) with tree-based learning. The proposed approach combines the global nonlinear representation capability of KAN with the local robustness of XGBoost to capture both long-term dependencies and short-term price fluctuations. Experiments are conducted on real-world NEM data using an expanding window evaluation strategy. The results demonstrate that the proposed model outperforms benchmark methods, including SARIMAX, Long Short-Term Memory (LSTM), standalone KAN, and XGBoost, reducing MAE by approximately 12% compared to XGBoost and by over 50% compared to a naive baseline. The results suggest that hybrid learning strategies provide an effective and robust solution for electricity price forecasting in highly dynamic electricity markets.

2605.22385 2026-05-22 cs.LG

Efficient Higher-order Subgraph Attribution via Message Passing

通过消息传递实现高效的高阶子图归因

Ping Xiong, Thomas Schnake, Grégoire Montavon, Klaus-Robert Müller, Shinichi Nakajima

AI总结 本文提出了一种基于消息传递的高效算法,能够在线性时间内通过GNN-LRP对子图进行归因,并扩展了子图归因方法以考虑邻接图特征,实验表明该方法具有显著加速和高实用性。

详情
Journal ref
Proceedings of the 39th International Conference on Machine Learning, PMLR 162:24478-24495, 2022
Comments
Published in ICML 2022
AI中文摘要

解释图神经网络(GNNs)近年来变得越来越重要。高阶解释方案,如GNN-LRP(针对GNN的分层相关性传播),已成为解开不同特征如何相互作用并解释GNNs的强大工具。GNN-LRP在每一层为节点之间的行走提供相关性归因,而子图归因则表示为指数级许多此类行走的总和。在本工作中,我们证明这种指数复杂性可以避免。特别是,我们提出了新的算法,能够在GNN-LRP中以线性时间(相对于网络深度)对子图进行归因。我们的算法通过利用分配属性的消息传递技术,直接计算高阶解释的量。我们进一步将高效的算法适应于计算一种扩展的子图归因方法,该方法还考虑了邻接图特征。实验结果表明,所提算法有显著的加速效果,并展示了我们新颖的扩展子图归因方法的高实用性和可扩展性。

英文摘要

Explaining graph neural networks (GNNs) has become more and more important recently. Higher-order interpretation schemes, such as GNN-LRP (layer-wise relevance propagation for GNN), emerged as powerful tools for unraveling how different features interact thereby contributing to explaining GNNs. GNN-LRP gives a relevance attribution of walks between nodes at each layer, and the subgraph attribution is expressed as a sum over exponentially many such walks. In this work, we demonstrate that such exponential complexity can be avoided. In particular, we propose novel algorithms that enable to attribute subgraphs with GNN-LRP in linear-time (w.r.t. the network depth). Our algorithms are derived via message passing techniques that make use of the distributive property, thereby directly computing quantities for higher-order explanations. We further adapt our efficient algorithms to compute a generalization of subgraph attributions that also takes into account the neighboring graph features. Experimental results show the significant acceleration of the proposed algorithms and demonstrate the high usefulness and scalability of our novel generalized subgraph attribution method.

2605.22380 2026-05-22 cs.CL cs.LG

Multi-Stage Training for Abusive Comment Detection in Indic Languages

印地语中辱骂评论检测的多阶段训练

Pranshu Rastogi, Madhav Mathur, Ramaneswaran S, Kshitij Mohan

AI总结 本文提出了一种多阶段训练方法,通过语言预处理和多个模型的集成,提高印地语中辱骂评论检测的准确性,减少误报率以保护言论自由。

详情
Comments
4 pages, EAM2021 selected
AI中文摘要

近年来,社交媒体已成为人们交流思想、分享观点和交换信息的重要工具。鉴于其普及性和广泛影响,社交媒体必须保持安全空间。生成在社交媒体上的内容可能具有攻击性,因此检测此类内容变得越来越重要。本文利用基于语言的预处理和多个模型的集成,分析其在辱骂评论检测中的性能。通过广泛实验,我们提出了一条管道,以最小化误报率(将非攻击性内容标记为攻击性),从而使这些系统能够在不损害言论自由的前提下检测攻击性评论。

英文摘要

In recent years social media has become an increasingly popular tool for communication. People use it to share their ideas, exchange information, and discuss thoughts. Given its prevalence and widespread reach, social media must remain a safe space for people. Content generated on social media can be abusive and it has become increasingly important to detect such content. In this paper, we use a language-based preprocessing and an ensemble of several models and analyze their performance of abusive comment detection. Through extensive experimentation, we propose a pipeline that minimizes the false-positive rate (marking non-abusive as abusive) so that these systems can detect abusive comments without undermining the freedom of expression.

2605.22377 2026-05-22 cs.LG

Towards Explainability of SLMs by investigating Token Level Activation

通过研究token层面激活实现SLMs的可解释性

Sayantani Ghosh, Rajashik Datta, Amit Kumar Das, Amlan Chakrabarti

AI总结 本文提出了一种轻量且通用的框架,通过BERT第8层隐藏状态的激活强度量化token层面的表示重要性,揭示了语义信息在激活强度上的集中分布,为将BERT从黑箱模型转变为更透明的玻璃箱模型提供了可解释且计算高效的替代方法。

详情
AI中文摘要

基于Transformer的语言模型,如具有1.1亿个参数的BERT,已彻底改变了自然语言理解,但其内部机制仍 largely opaque to 研究人员和从业者。传统的基于注意力的可解释性方法往往强调结构上重要但语义上弱的token,如标点符号,而不是有意义的语义关系。本文介绍了一种轻量且通用的框架,用于通过BERT第8层隐藏状态的激活强度量化token层面的表示重要性。所提出的激活流网络(AFN)框架通过第8层隐藏表示的L2范数计算token激活强度,从而能够直接对语义显著的token进行排序。进一步,本文引入了基于阈值的激活桶公式,通过经验上四分位数激活边界将token分为高激活和低激活组。实验观察表明,语义上有意义的内容词始终占据高激活桶,并主导表示激活的变化,而结构支持的token贡献相对较少。结果表明,第8层充当一个关键的语义整合区域,平衡了结构和语义信息处理。通过揭示激活强度集中在语义信息丰富的token周围,本文为将BERT从黑箱模型转变为更透明的玻璃箱模型提供了可解释且计算高效的替代方法。

英文摘要

Transformer-based language models such as BERT having 110M+ parameters have revolutionized natural language understanding, yet their internal mechanisms remain largely opaque to researchers and practitioners. Traditional attention-based interpretability methods often emphasize structurally important but semantically weak tokens such as punctuation marks rather than meaningful semantic relationships. This work introduces a lightweight and model-agnostic framework for quantifying token-level representational importance using hidden-state activation strengths at Layer 8 of BERT. The proposed Activation Flow Network (AFN) framework computes Token Activation Strength using the L2 norm of Layer-8 hidden representations, enabling direct ranking of semantically salient tokens. The study further introduces a threshold-based activation bucket formulation that partitions tokens into HIGH-activation and LOW-activation groups using an empirical upper-quartile activation boundary. Experimental observations demonstrate that semantically meaningful content words consistently occupy the HIGH-activation bucket and dominate representational activation shifts, while structurally supportive tokens contribute comparatively less. The results suggest that Layer 8 acts as a critical semantic consolidation zone balancing structural and semantic information processing. By revealing how activation magnitudes concentrate around semantically informative tokens, this work provides an interpretable and computationally efficient alternative to attentioncentric analysis, contributing toward transforming BERT from a "black box" into a more transparent "glass box" model for natural language understanding.

2605.22376 2026-05-22 cs.LG

Target-Aligned Bellman Backup for Cross-domain Offline Reinforcement Learning

目标对齐的贝尔曼备份用于跨域离线强化学习

Wei Liu, Ting Long

AI总结 本文提出了一种基于目标域贝尔曼目标对齐的跨域离线强化学习方法,旨在通过评估源域过渡与目标域贝尔曼目标的一致性来提升策略学习性能。

详情
AI中文摘要

跨域离线强化学习(CDRL)旨在通过利用源域收集的数据来改进目标域的策略学习。现有方法通常通过测量源域数据与目标域转换的相似性来评估数据的可迁移性,并隐式地进行转换级选择。被判定为相似的转换会被赋予更高的权重或奖励,而不相似的则被降权。然而,转换级的相似性并不一定保证长期回报的一致性。即使视觉或动态上相似的转换在目标域中也可能导致显著不同的结果,这可能会误导策略学习并降低性能。为了解决这个问题,我们重新审视了策略学习的根本目标。由于策略优化最终依赖于贝尔曼目标来评估决策的质量,我们提出基于源域转换与目标域贝尔曼目标的一致性来评估源域转换的可迁移性,而不是表面的转换相似性。基于这一见解,我们提出了一种名为目标对齐的贝尔曼备份(TABB)的方法,通过测量源域数据对目标域中准确贝尔曼目标估计的贡献来选择性地利用源域数据。我们在广泛的跨域离线RL设置中评估了TABB,尤其是在目标域数据高度有限的情况下。实验结果表明,TABB在各种情况下都实现了强大的性能。

英文摘要

Cross-domain offline reinforcement learning (CDRL) aims to improve policy learning in a target domain by leveraging data collected from a source domain. Existing works typically assess the transferability of source-domain data by measuring its similarity to target-domain transitions, and implicitly perform transition-level selection. Transitions that are considered similar are assigned higher weights or rewards, while dissimilar ones are down-weighted. However, transition-level similarity does not necessarily imply consistency in long-term returns. Even visually or dynamically similar transitions may lead to significantly different outcomes in the target domain, which can mislead policy learning and degrade performance. To address this issue, we revisit the fundamental objective of policy learning. Since policy optimization ultimately relies on Bellman targets to evaluate the quality of decisions, we propose to assess the transferability of source-domain transitions based on their alignment with target-domain Bellman targets, rather than superficial transition similarity. Based on this insight, we propose a method termed Target-Aligned Bellman Backup (TABB), which selectively leverages source-domain data by measuring their contribution to accurate Bellman target estimation in the target domain. We evaluate TABB across a broad range of cross-domain offline RL settings with highly limited target-domain data. Experimental results show that TABB consistently achieves strong performance.

2605.22372 2026-05-22 cs.LG

ASAP: Attention Sink Anchored Pruning

ASAP: 以注意力汇点为中心的剪枝

Jaehyuk Lee, Hanyoung Kim, Yanggee Kim, Donghun Lee

AI总结 本文提出ASAP方法,通过将注意力汇点作为特征,利用懒惰随机游走建模视觉Transformer的信息流,实现单次剪枝过程中的token分区和背景冗余压缩,从而在保持或超越基线精度的同时,提升吞吐量达48%。

详情
AI中文摘要

视觉Transformer(ViTs)在高分辨率下由于自注意力的二次复杂度面临严重的计算瓶颈。现有token减少方法依赖局部指标-如单层注意力分数-这些指标本质上容易受到注意力汇点现象的影响,即无信息token paradoxically被保留下来。我们提出ASAP(Attention Sink Anchored Pruning),一种无需训练的框架,将此汇点作为特征。通过将ViT信息流建模为懒惰随机游走,ASAP将汇点识别为概率质量的主要累积器。通过计算累积转移矩阵中到汇点的扩散距离,ASAP利用径向扩散聚类对token进行分区,并通过转移权重池化压缩背景冗余。在图像、视频和视觉-语言任务中的广泛实验表明,ASAP在保持或超越基线精度的同时,加速吞吐量高达48%。

英文摘要

Vision Transformers (ViTs) face severe computational bottlenecks due to the quadratic complexity of self-attention at high resolutions. Existing token reduction methods rely on local metrics - such as single-layer attention scores - that are inherently vulnerable to the attention sink phenomenon, where uninformative tokens are paradoxically preserved over salient foreground objects. We propose ASAP (Attention Sink Anchored Pruning), a training-free framework that recasts this sink as a feature. Modeling ViT information flow as a Lazy Random Walk, ASAP identifies the sink as a dominant accumulator of probability mass. By computing the diffusion distance to the sink within the cumulative transition matrix, ASAP partitions tokens via Radial Diffusion Clustering and compresses background redundancy through Transition Weight Pooling in a single shot. Extensive experiments across image, video, and vision-language tasks demonstrate ASAP outperforms state-of-the-art methods, accelerating throughput by up to 48% while maintaining - or even exceeding - baseline accuracy.