arXivDaily arXiv每日学术速递 周一至周五更新
重置
2606.03808 2026-06-03 cs.LG cs.AI cs.CR

PURGE: Projected Unlearning via Retain-Guided Erasure

PURGE: 通过保留引导擦除的投影遗忘

Vedant Jawandhia, Daksh Ahuja, Ghufran Alam Siddiqui, Prashant Trivedi, Yash Sinha, Pratik Narang

AI总结 提出一种基于持续学习与机器遗忘对偶性的遗忘算法PURGE,利用梯度投影约束保留损失,并通过多层表示擦除和保留混淆目标实现隐私与效用的平衡。

详情
Comments
13 pages, 10 figures, 6 tables
AI中文摘要

我们提出PURGE,一种基于简单但未被充分利用的观察构建的机器遗忘算法:持续学习(CL)和机器遗忘(MU)本质上是二元问题。CL试图在不遗忘旧任务的情况下学习新任务;MU试图在不损害保留性能的情况下擦除特定数据,代表了相同基本张力在相反方向上的体现。PURGE通过调整A-GEM(Chaudhry等人,2019)的梯度投影来利用这种对偶性,使得每个遗忘步骤都受到约束,不会增加保留集损失。在此基础上,它执行多层表示擦除,将中间层中遗忘集的激活推向保留分布,以从隐藏表示中移除信息,而不仅仅是在输出层抑制信息。一个关键的设计选择是保留混淆目标:不是将遗忘输出推向均匀分布(我们发现这很容易被成员推断攻击检测到),而是将目标设定为模型在保留数据上的自然混淆模式。这使得遗忘模型难以与从头重新训练的模型区分。两个自调节停止标准(保留损失预算和遗忘准确率目标)让算法自行决定何时停止,无需手动调整训练轮数。在五个数据集(CIFAR-10、MNIST、SVHN、STL10、PathMNIST)上的22个类别级遗忘任务实验中,PURGE始终将保留准确率保持在96%以上,同时实现接近0.5(理想值)的MIA AUROC,在隐私-效用前沿上优于梯度上升、KL均匀分布以及多个已发表的基线方法。

英文摘要

We propose PURGE, a machine unlearning algorithm built on a simple but an under-exploited observation: continual learning (CL) and machine unlearning (MU) which are fundamentally dual problems. CL tries to learn new tasks without forgetting old ones; MU tries to erase specific data without hurting retained performance representing the same underlying tension in opposite directions. PURGE leverages this duality by adapting gradient projection from A-GEM (Chaudhry et al., 2019) so that every unlearning step is constrained to not increase the retain-set loss. On top of this, it performs multi-layer representation erasure, pushing forget-set activations in intermediate layers towards the retain distribution to remove information from hidden representations rather than just suppressing it at the output. A key design choice is the retain-confusion target: rather than pushing forget outputs toward the uniform distribution, which we found to be surprisingly easy for membership inference attacks to detect, we instead target the model's natural confusion pattern on retain data. This makes the unlearned model hard to distinguish from one retrained from scratch. Two self-regulating stopping criteria (a retain-loss budget and a forget-accuracy target) let the algorithm decide on its own when to stop, removing the need for manual epoch tuning. In experiments on five datasets (CIFAR-10, MNIST, SVHN, STL10, PathMNIST) across 22 class-level forgetting tasks, PURGE consistently keeps retain accuracy above 96% while achieving MIA AUROC close to 0.5 (the ideal), outperforming gradient ascent, KL-uniform, and several published baselines on the privacy-utility frontier.

2606.03806 2026-06-03 cs.CV

TeX-1500: A Paired Real-World LWIR Hyperspectral Dataset and Benchmark for Temperature-Emissivity-Texture Decomposition

TeX-1500:用于温度-发射率-纹理分解的配对真实世界长波红外高光谱数据集与基准

Cheng Dai, Jiale Lin, Hongyi Xu, Bingxuan Song, Ziyang Xie, Fanglin Bao

AI总结 针对长波红外高光谱成像中温度-发射率-纹理分解缺乏配对监督数据的问题,构建了包含1522对真实场景的TeX-1500数据集,并提出波长感知基线模型TeX-UNet,实现了可量化的数据驱动热感知基准。

详情
AI中文摘要

温度-发射率-纹理(TeX)分解旨在从长波红外高光谱成像(LWIR HSI)中恢复物体热状态、材料光谱响应和可见光般的几何纹理。现有的TeX流程主要是场景特定的逆求解器,缺乏配对的LWIR HSI-TeX监督限制了基于学习的分解。为解决这一空白,我们引入了TeX-1500,一个大规模配对LWIR HSI-TeX数据集和基准,用于监督式HSI到TeX分解。TeX-1500包含来自DARPA隐形前照灯(DARPA IH)推扫式成像和我们FTIR采集的1,522个校准真实场景对,覆盖五个地点、四个季节、不同的采集时间、异构波长布局和两个传感器系列。每个样本存储一个校准的有效波段辐射立方体、校准的波长位置,以及通过一致的恢复和TeX构建协议构建的对齐温度、发射率和纹理监督。我们进一步提供了TeX-UNet,一个简单的波长感知基线,将校准的HSI波段和波长位置映射到TeX场。在保留的DARPA IH推扫场景和零样本/少样本迁移到FTIR场景上的实验表明,TeX-1500为数据驱动的以物理属性为中心的热感知提供了可用的配对监督和可测量的基准。

英文摘要

Temperature-emissivity-texture (TeX) decomposition seeks to recover object heat state, material spectral response, and visible-like geometric texture from long-wave infrared hyperspectral imaging (LWIR HSI). Existing TeX pipelines are mainly scene-specific inverse solvers, and the lack of paired LWIR HSI-TeX supervision has limited learning-based decomposition. To address this gap, we introduce TeX-1500, a large-scale paired LWIR HSI-TeX dataset and benchmark for supervised HSI-to-TeX decomposition. TeX-1500 contains 1,522 calibrated real-scene pairs from DARPA Invisible Headlights (DARPA IH) pushbroom imagery and our FTIR acquisitions, covering five locations, four seasons, diverse acquisition times, heterogeneous wavelength layouts, and two sensor families. Each sample stores a calibrated valid-band radiance cube, calibrated wavelength positions, and aligned temperature, emissivity, and texture supervision constructed through a consistent restoration and TeX-construction protocol. We further provide TeX-UNet, a simple wavelength-aware baseline that maps calibrated HSI bands and wavelength positions to TeX fields. Experiments on the held-out DARPA IH pushbroom scenes and zero-/few-shot transfer to FTIR scenes show that TeX-1500 provides usable paired supervision and a measurable benchmark for data-driven physical-property-centered thermal perception.

2606.03804 2026-06-03 cs.LG

Easy-to-Use Shielding for Reinforcement Learning

易于使用的强化学习屏蔽技术

Stefan Pranger, Bettina Könighofer

AI总结 提出tempestpy库,将形式化屏蔽合成集成到Gymnasium API中,降低强化学习安全探索的门槛,并扩展了随机多人博弈的屏蔽算法。

详情
AI中文摘要

安全探索是强化学习中的一个关键挑战,旨在防止智能体在探索环境时做出有害决策。屏蔽是一种利用环境模型形式的领域知识来决定动作安全性的技术。尽管已经成熟,但由于缺乏将形式化屏蔽合成与标准强化学习框架连接起来的可访问端到端基础设施,屏蔽在强化学习中的应用有限。应用屏蔽通常需要形式化方法的专业知识和大量的工程工作,使其脱离典型的强化学习工作流程。我们通过将屏蔽合成工具Tempest扩展为安全强化学习的实用后端来解决这一问题。我们的核心贡献是tempestpy,一个Python库,它将基于Tempest的屏蔽合成直接集成到Gymnasium API中,使得屏蔽可以在现有的强化学习管道中合成和部署。这降低了屏蔽的入门门槛,将形式化安全探索方法转化为强化学习实践者可用的组件。我们还扩展了Tempest的算法支持,以计算随机多人博弈的可靠屏蔽,保留了形式化安全保证。我们端到端地展示了最终的工作流程,并在多个环境中评估了有屏蔽和无屏蔽的强化学习。为了便于建模,我们为MiniGrid提供了符号模型,并引入了MiniGridSafe,这是一个游乐场环境集合,旨在使屏蔽易于访问且实验透明。MiniGridSafe通过具有概率转换和额外智能体的安全导向场景扩展了MiniGrid,使得在简单直观的设置中研究具有挑战性的安全方面成为可能。

英文摘要

Safe exploration is a key challenge in Reinforcement Learning (RL) that aims to prevent agents from making harmful decisions while exploring their environment. Safe exploration is a key challenge in Reinforcement Learning (RL) that aims to prevent agents from making harmful decisions while exploring their environment. Shielding is one such technique that assumes domain knowledge in the form of an environment model to decide upon action safety. Although well-established, shielding has seen limited adoption in RL due to the lack of accessible end-to-end infrastructure connecting formal shield synthesis with standard RL frameworks. Applying shielding typically requires expertise in formal methods and substantial engineering effort, keeping it outside the typical RL workflow. We address this by extending our shield synthesis tool Tempest into a practical backend for safe RL. Our core contribution is tempestpy, a Python library that integrates Tempest-based shield synthesis directly into the Gymnasium API, allowing shields to be synthesized and deployed within existing RL pipelines. This lowers the barrier to entry for shielding and turns formal safe-exploration methods into a usable component for RL practitioners. We also extend Tempest's algorithmic support to compute sound shields for stochastic multiplayer games, preserving formal safety guarantees. We demonstrate the resulting workflow end to end and evaluate shielded and unshielded RL across multiple environments. To facilitate modeling, we provide symbolic models for MiniGrid and introduce MiniGridSafe, a collection of playground environments designed to make shielding easily accessible and experimentally transparent. MiniGridSafe extends MiniGrid with safety-oriented scenarios featuring probabilistic transitions and additional agents, enabling the study of challenging safety aspects in a simple and intuitive setting.

2606.03802 2026-06-03 cs.CV

Template Collapse and Information-Theoretic Limits in Camera rPPG Pulse Morphology Restoration

模板坍塌与相机rPPG脉搏形态恢复中的信息论极限

Achraf Ben Ahmed

AI总结 本研究通过评估16种架构在153名受试者上的表现,引入跨受试者Pearson r来区分个体特异性恢复与模板坍塌,发现消费者摄像头无法编码个体动脉形态,且无架构能恢复个体特异性脉搏形态。

详情
AI中文摘要

目的:消费者面部相机远程光电容积描记法(rPPG)可实现被动心血管监测,但单周期波形形态(编码动脉硬化生物标志物)是否可从该测量中恢复尚未明确。方法:我们在三个数据集的153名受试者上评估了涵盖六个家族的16种架构,引入跨受试者Pearson r以区分个体特异性恢复与模板坍塌。结果:无架构恢复个体特异性形态(跨受试者r范围0.773--0.9999;真实上限0.601)。监督对比学习(SupCon)收敛至log N = 4.844,构成现有最强经验证据,表明测试的编码器家族无法从单周期rPPG中提取可判别形态结构。VAE解码器恢复了rPPG输入中缺失的群体级谐波内容(H2/H1:输出0.310 vs. 输入0.275),零样本泛化至UBFC(r = +0.708);方向性幻觉差距(p = 0.150)提示部分信号读取。当输入不携带可判别结构时,抗坍塌目标失效。意义:消费者摄像头无法编码个体动脉形态;跨受试者r是波形重建基准中必要的坍塌诊断指标。

英文摘要

Objective: Consumer face camera remote photoplethysmography (rPPG) enables passive cardiovascular monitoring, but whether single-cycle waveform morphology encoding arterial stiffness biomarkers is recoverable from this measurement has not been characterised. Methods: We evaluated 16 architectures spanning six families on 153 subjects across three datasets, introducing cross-subject Pearson r to distinguish subject-specific recovery from template collapse. Results: No architecture recovered subject-specific morphology (cross-subject r range 0.773--0.9999; ground-truth ceiling 0.601). Supervised Contrastive (SupCon) converged to log N = 4.844, constituting the strongest available empirical evidence that no discriminative morphological structure is extractable from single-cycle rPPG by the encoder families tested. The VAE decoder restores population-level harmonic content absent from the rPPG input (H2/H1: 0.310 output vs. 0.275 input), generalising zero-shot to UBFC (r = +0.708); a directional hallucination gap (p = 0.150) suggests partial signal reading. Anti-collapse objectives fail when input carries no discriminative structure. Significance: Consumer cameras cannot encode individual arterial morphology; cross-subject r is a necessary collapse diagnostic for waveform reconstruction benchmarks.

2606.03800 2026-06-03 cs.LG cs.AI

Trading Human Curation for Synthetic Augmentation in RLVR

在RLVR中用合成增强替代人工策展

Akshansh, Leonardo Rosa Rodrigues, Michael Korostelev, Youssef Hassan, Mark E. Whiting

AI总结 研究通过预指定、门控过滤的增强任务替代人工策展任务,在RLVR中实现成本效益权衡,并保持泛化性能。

详情
Comments
21 pages, 5 main-text figures, 4 appendix figures. Preprint
AI中文摘要

高质量训练任务的供应是基于可验证奖励的强化学习(RLVR)在智能体语言模型上的核心瓶颈。每个任务需要一个沙盒环境、一个提示和一个手工编写的奖励函数,只有通过质量标准的任务才能产生有用的训练信号。达到这一质量标准的人工策展在有效RL训练所需的任务数量上无法经济地扩展,而自动生成的任务变体与人工编写任务之间的替代率尚未确定。我们研究在RLVR期间,使用预指定、门控过滤的增强(augmentations)作为额外人工策展的替代品。我们形式化了增强任务与人工任务之间的成本调整权衡率 $\rho_{\text{cost}}$,通过在不同增强比例的训练语料库上进行受控消融实验来测量它,并描述了增强管道的端到端经济学。用增强内容替代额外的人工编写任务,在涵盖代码、指令遵循、推理和多轮智能体函数调用的十个基准测试套件上保持了聚合的留出泛化能力。在合理的 $c_{\text{human}}/c_{\text{aug}}$ 范围内,门控合成与人工RLVR任务之间的成本调整权衡率 $\rho_{\text{cost}}$ 保持在 $[1.4\times, 11.6\times]$ 之间。

英文摘要

The supply of high-quality training tasks is a central bottleneck for reinforcement learning from verifiable rewards (RLVR) on agentic language models. Each task requires a sandboxed setup, a prompt, and a hand-authored reward function, and only tasks that pass a quality bar produce useful training signal. Hand-curation at this quality bar does not scale economically to the task counts effective RL training requires, and the substitution rate between automatically generated task variants and human-authored ones is not yet established. We investigate using pre-specified, gate-filtered augmentations of a small hand-authored base as a substitute for additional human curation during RLVR. We formalize the cost-adjusted trade rate $ρ_{\text{cost}}$ between augmented and human-authored tasks, measure it through a controlled ablation across training corpora with varying augmentation share, and characterize the end-to-end economics of the augmentation pipeline. Substituting augmented content for additional human-authored tasks retains aggregate held-out generalization on a ten-benchmark suite spanning code, instruction following, reasoning, and multi-turn agentic function-calling. The cost-adjusted trade rate $ρ_{\text{cost}}$ between gated synthetic and human-authored RLVR tasks stays in $[1.4\times, 11.6\times]$ across the plausible $c_{\text{human}}/c_{\text{aug}}$ range.

2606.03798 2026-06-03 cs.RO

Optimal Design and Analytical Modeling of a Soft Fin-Ray Effect Gripper Finger Using the Finite Rigid Elements Method

基于有限刚性单元法的软体鳍射线效应夹爪手指的优化设计与解析建模

Sara Adeli, Hassan Sayyaadi

AI总结 提出采用有限刚性单元法(FREM)对软体鳍射线效应(FRE)夹爪手指进行建模与优化,实现精准力控,以轻柔抓取易损农产品。

详情
AI中文摘要

受鳍射线启发的软体夹爪为轻柔处理易损、不规则物体(尤其在农业中)提供了有前景的解决方案。本研究旨在设计、制造和建模一种鳍射线效应(FRE)软体夹爪手指,以实现未来应用中的精确力控制。该设计旨在轻柔抓取需要适应性和精确力施加的易损农产品,如番茄。为解决软体机器人固有的挑战,包括非线性行为、无限自由度和可变材料属性,采用有限刚性单元法(FREM)进行建模。该方法在保持解析精度的同时,为后续阶段力控制器的开发提供了可靠基础。使用ANSYS创建了详细的有限元模型(FEM),并通过仿真和实验测试验证了解析结果。基于四个关键标准优化了夹爪手指:尖端位移、总变形、应力分布和接触力。最优手指配置包括长度30毫米、肋间距10毫米、七根肋条角度-15度、肋条厚度1毫米。使用FREM的理论建模预测手指变形误差为3%,而ANSYS数值模型误差为2%。

英文摘要

Fin Ray-inspired soft grippers offer a promising solution for gently handling delicate, irregular objects, especially in agriculture. The objective of this research is to design, fabricate, and model a Fin Ray Effect (FRE) soft gripper finger to enable precise force control in future applications. This design aims to gently grasp delicate agricultural products, such as tomatoes, that require both adaptability and accurate force application. To address the inherent challenges of soft robotics, including nonlinear behavior, infinite degrees of freedom, and variable material properties, the Finite Rigid Elements Method (FREM) was employed for modeling. This method preserves analytical accuracy while providing a reliable foundation for the development of a force controller in later stages. A detailed Finite Element Model (FEM) was created using ANSYS, and the analytical results were validated through simulation and experimental testing. The gripper's fingers were optimized based on four key criteria: tip displacement, total deflection, stress distribution, and contact force. The optimal finger configuration includes a length of 30 mm, rib spacing of 10 mm, seven ribs angled at -15 deg, and a rib thickness of 1 mm. Theoretical modeling using the FREM predicted finger deformation with a 3% error, while the ANSYS numerical model achieved 2% error.

2606.03796 2026-06-03 cs.NE cs.AI

Signed Spiking Neuron Enabled by an Orthogonal-Easy-Axis Magnetic Tunnel Junction

基于正交易轴磁隧道结的有符号脉冲神经元

Huannan Zheng, Jingli Liu, Kezhou Yang

AI总结 提出一种基于正交易轴磁隧道结的紧凑型有符号脉冲神经元,通过自由层和钉扎层的正交易轴实现双极性脉冲生成,并映射磁矩动力学到有符号LIF膜电位演化,在CIFAR-10和CIFAR10-DVS上分别达到91.06%和77.40%的准确率。

详情
AI中文摘要

有符号脉冲神经元携带比标准脉冲神经元更丰富的信息。本文提出一种基于磁隧道结(MTJ)的紧凑型神经元,用于有符号泄漏积分点火(LIF)操作。通过自由层和钉扎层中的正交易轴,该器件能够实现双极性脉冲生成,并将磁矩动力学映射到有符号LIF膜电位演化。Landau-Lifshitz-Gilbert模拟表明,适当的自由层尺寸使器件响应能够遵循有符号LIF方程。一个代表性设计为10 nm x 45 nm x 50 nm,对应纵横比约为2:9:10。使用拟合的器件神经元模型进行网络评估,在CIFAR-10上达到91.06%,在CIFAR10-DVS上达到77.40%,保留了理想有符号LIF神经元的大部分准确率。

英文摘要

Signed spiking neurons carry richer information than standard spiking neurons. This work proposes a compact magnetic tunnel junction (MTJ)-based neuron for signed leaky integrate-and-fire (LIF) operation. With orthogonal easy axes in the free and pinned layers, the device enables bipolar spike generation and maps magnetic-moment dynamics to signed LIF membrane-potential evolution. Landau--Lifshitz--Gilbert simulations show that proper free-layer dimensions allow the device response to follow a signed LIF equation. A representative design of 10 nm x 45 nm x 50 nm corresponds to an aspect ratio of about 2:9:10. Network evaluations using the fitted device-neuron model achieve 91.06% on CIFAR-10 and 77.40% on CIFAR10-DVS, retaining most of the accuracy of ideal signed LIF neurons.

2606.03795 2026-06-03 cs.CV

Beyond Compression: Quantifying Spectral Accessibility in Vision Representations

超越压缩:量化视觉表示中的频谱可访问性

Akayou A. Kitessa, Yijun Zhao

AI总结 通过残差频谱损失(RSL)测量线性可恢复的带限傅里叶能量,研究视觉语言模型中投影层对表示频谱结构的影响,发现CLIP和DINOv2中频谱可访问性随深度非单调变化,中间层峰值后下降,且CLIP的投影是频谱中性的,而DINOv2的[CLS]池化导致频谱结构损失。

详情
AI中文摘要

视觉语言模型通过学习的投影层将视觉特征映射到共享嵌入空间,但目前尚不清楚这些变换如何改变视觉信息的结构。本研究通过空间频率可访问性(以从模型表示中线性恢复带限傅里叶能量的能力衡量)来考察表示的变化。为隔离降维之外的影响,我们引入了残差频谱损失(RSL),该损失相对于维度匹配的随机投影基线评估变化。为减少优化带来的混杂效应,分析使用所有参数冻结的预训练模型。实验结果显示,在ImageNet和MS-COCO数据集上,CLIP和DINOv2中可访问性随频率一致变化。频谱可访问性随深度呈非单调轨迹,在中间层达到峰值,然后向输出表示下降。最终变换因架构而异:CLIP的学习投影是频谱中性的,变化可由压缩解释,而DINOv2的[CLS]池化导致整个频谱的结构性损失。这些发现表明中间层和池化机制是现代视觉编码器中频谱变换的主要驱动因素。

英文摘要

Vision-language models map visual features into a shared embedding space through learned projection layers, yet it remains unclear how these transformations alter the structure of visual information. This study examines changes in representation through spatial-frequency accessibility, measured by the linear recoverability of band-limited Fourier energy from model representations. To isolate effects beyond dimensionality reduction, we introduce Residual Spectral Loss (RSL), which evaluates changes relative to a dimension-matched random projection baseline. To reduce confounding effects from optimization, the analysis uses pretrained models with all parameters frozen. The experimental results show consistent frequency-dependent changes in accessibility across CLIP and DINOv2 on ImageNet and MS-COCO datasets. Spectral accessibility follows a non-monotonic trajectory across depth, peaking at intermediate layers before decreasing toward the output representation. The final transformation differs across architectures: CLIP's learned projection is spectrally neutral, with changes explained by compression, whereas DINOv2's [CLS] pooling induces a structured loss across the spectrum. These findings identify intermediate layers and pooling mechanisms as primary drivers of spectral transformation in modern vision encoders.

2606.03794 2026-06-03 cs.LG eess.SP

Limit Analysis of Graph Neural Networks with Wireless Conflict Graphs

基于无线冲突图的图神经网络极限分析

Romina Garcia Camargo, Zhiyang Wang, Alejandro Ribeiro

AI总结 针对稀疏随机几何图上的图神经网络,通过分析其与确定性网格图的接近性,建立了跨尺度迁移性的理论界限,并在链路调度问题中验证了学习策略的优越性。

详情
AI中文摘要

图神经网络(GNN)已成为一种利用通信网络底层图结构进行无线资源分配的强大工具。其可迁移性使得在小规模图上训练的模型能够推广到大规模部署,且性能下降很小,这对于当前不断增长的网络而言是一个理想特性。无线网络是稀疏的,单个节点只与少量其他用户相连。本文建立了基于稀疏随机几何图(RGG)的图神经网络可迁移性的理论结果。特别地,我们关注用于建模链路间干扰的RGG冲突图。我们的方法考虑了RGG与确定性网格图(DGG)之间的接近性,以建立模型跨尺度迁移时性能损失的界限。我们通过链路调度问题验证了理论发现,表明学习策略在规模上始终优于现有基准。最后,我们考察了理论假设对经验性能的影响。

英文摘要

Graph Neural Networks (GNNs) have emerged as a powerful tool for wireless resource allocation that leverages the underlying graph structure of communication networks. Their transferability property enables models trained on small-scale graphs to generalize to large-scale deployments with little performance deterioration, a desirable property for currently growing networks. Wireless networks are sparse regimes, where a single node is connected to a small number of other users. This work establishes theoretical results for transferability of GNNs over graphs derived from sparse Random Geometric Graphs (RGGs). In particular, we focus on conflict graphs of RGGs used to model interference among links. Our approach considers the closeness between RGGs and Deterministic Grid Graphs (DGG) to establish bounds in the performance loss when a model is transferred across scales. We validate our theoretical findings through the problem of link scheduling, demonstrating that our learned policies consistently outperform existing benchmarks at scale. Finally, we examine the impact of our theoretical assumptions on empirical performance.

2606.03793 2026-06-03 cs.CL cs.CV

Exploring Adversarial Robustness and Safety Alignment in Multilingual Multi-Modal Large Language Models

探索多语言多模态大语言模型的对抗鲁棒性与安全对齐

Hashmat Shadab Malik, Muzammal Naseer, Salman Khan

AI总结 本研究通过梯度攻击和跨语言评估,发现多语言多模态大语言模型存在可迁移的对抗脆弱性,并揭示低资源语言因理解失败而呈现的虚假安全现象,提出深层训练整合才能实现真正的多语言安全对齐。

详情
AI中文摘要

多模态大语言模型将视觉感知整合到语言推理中,引入了一个连续的攻击面,容易受到对抗攻击。先前关于MLLM鲁棒性的工作主要关注以英语为中心的任务,多语言行为尚未被探索。我们通过对12种不同语言的对抗鲁棒性和多模态安全性进行系统研究来填补这一空白,评估通过指令调优获得多语言能力的开源MLLM。基于梯度的攻击揭示了一种可迁移的多语言脆弱性:在一种语言中优化的对抗图像会继续在其他语言中引发失败,表现出强大的跨语言可迁移性。多语言安全性进一步取决于模型检索或解释有害指令的有效性。当有害意图通过文本发出时,语言基础更强的语言更常引发允许滥用的响应,而较弱的语言产生较少的不安全输出。当嵌入图像作为排版内容时,英文脚本被可靠地识别和遵循,而非英文脚本很少被视觉编码器解析。因此,低资源语言可能看起来更安全,但这是理解和视觉基础失败的人为产物,而非真正的对齐,我们将这种现象称为“失败导致的安全”。相比之下,在整个训练阶段(而不仅仅在指令调优阶段)构建多语言能力的MLLM,如Qwen3-VL,表现出真正的跨语言安全性,跨语言保持主动拒绝,而不是掩盖理解失败。浅层多语言适应(例如在翻译的指令数据上进行微调)可能产生表面理解,在低资源语言中造成虚幻的安全感;跨训练阶段的更深层整合才能实现真正的多语言安全对齐。

英文摘要

Multimodal Large Language Models integrate visual perception into language reasoning, introducing a continuous attack surface susceptible to adversarial attacks. Prior work on MLLM robustness has focused largely on English-centric tasks, leaving multilingual behaviour unexplored. We address this gap through a systematic study of adversarial robustness and multimodal safety across 12 diverse languages, evaluating open-source MLLMs that acquire multilingual capability through instruction tuning. Gradient-based attacks reveal a transferable multilingual vulnerability: adversarial images optimized in one language continue to induce failure in others, demonstrating strong cross-lingual transferability. Multilingual safety further varies with how effectively a model retrieves or interprets harmful instructions. When harmful intent is issued through text, languages with stronger linguistic grounding more often elicit misuse-enabling responses, while weaker languages produce fewer unsafe outputs. When embedded in the image as typographic content, English scripts are reliably recognised and followed, whereas non-English scripts are rarely parsed by the vision encoder. Lower-resource languages may therefore appear safer, but this is an artefact of comprehension and visual-grounding failures rather than genuine alignment, a phenomenon we term safety-by-failure. In contrast, MLLMs that build multilingual capability throughout their training stages rather than only at instruction tuning, such as Qwen3-VL, exhibit genuine cross-lingual safety, maintaining active refusal across languages rather than masking comprehension failure. Shallow multilingual adaptation, such as fine-tuning on translated instruction data, may produce surface-level understanding that creates illusory safety in low-resource languages; deeper integration across training stages leads to genuine multilingual safety alignment.

2606.03792 2026-06-03 cs.CV cs.LG

Training-Free Multi-Concept LoRA Composition with Prompt-Aware Weighting

免训练的多概念LoRA组合与提示感知加权

Georgios Tsoumplekas, Stella Bounareli, Vasileios Argyriou

AI总结 提出一种免训练的提示感知加权策略,通过优化组合多个LoRA模块的输出实现多概念定制,提升图像质量和概念保真度。

详情
Comments
Accepted at IEEE FG 2026
AI中文摘要

低秩适应(LoRA)通过将预训练扩散模型适应到特定视觉概念和风格,成功实现了文本到图像生成中的个性化。然而,将此类模型扩展到多概念定制仍然具有挑战性。简单组合多个LoRA权重或其输出通常会导致概念间的干扰,从而降低视觉质量并减少对单个概念参考图像的保真度。本文提出了一种简单而有效的多概念定制方法,通过最优组合多个LoRA模块的输出。我们利用生成过程中每个概念的相对重要性(从其对应的提示标记推断),并引入了两种方法:W-Switch和W-Composite,它们采用提示感知的重要性加权策略,其中每个LoRA根据其触发词在目标提示中的语义影响进行加权。此外,我们通过提出一种新的基于图像的相似性评估框架来扩展现有的定量评估指标,该框架通过比较真实世界参考图像和从生成图像中自动分割的概念区域来评估图像保真度和身份保持。我们在ComposLoRA测试平台上评估了我们的方法,并在视觉质量、身份保持和组合性方面展示了相对于现有最先进方法的一致改进。定性评估,包括基于大语言模型(LLM)的评估和用户研究,进一步验证了所提出方法的有效性,并与新引入的基于图像的定量指标一致。我们的代码可在该https URL获取。

英文摘要

Low-Rank Adaptation (LoRA) successfully enables personalization in text-to-image generation by adapting pre-trained diffusion models to specific visual concepts and styles. However, extending such models to multi-concept customization remains challenging. Naively combining multiple LoRA weights or their outputs often leads to interference among concepts, resulting in degraded visual quality and reduced fidelity to the reference images of individual concepts. This paper proposes a simple yet effective approach for multi-concept customization by optimally combining the outputs of multiple LoRA modules. We leverage the relative importance of each concept during generation, as inferred from its corresponding prompt tokens and introduce two methods, W-Switch and W-Composite, that employ a prompt-aware importance weighting strategy in which each LoRA is weighted according to the semantic influence of its trigger words in the target prompt. In addition, we extend existing quantitative evaluation metrics by proposing a new image-based similarity evaluation framework that assesses image fidelity and identity preservation through comparisons between real-world reference images and automatically segmented concept regions from generated images. We evaluate our approach on the ComposLoRA testbed and demonstrate consistent improvements over existing state-of-the-art methods in terms of visual quality, identity preservation and compositionality. Qualitative evaluations, including a Large Language Model (LLM) based assessment and a user study, further validate the effectiveness of the proposed methods and align with the newly introduced quantitative image-based metrics. Our code is available at https://github.com/GeorgeTsoumplekas/Prompt-Aware-Multi-LoRA-Composition.

2606.03788 2026-06-03 cs.CV

SLU-2K: A Question-Based Benchmark for Semantic Evaluation of Sign Language Translation

SLU-2K:基于问题的手语翻译语义评估基准

Zeno Testa, Antonino Furnari, Lorenzo Baraldi, Natalia Díaz-Rodríguez

AI总结 提出SLU-2K基准,通过2350个视频问答对评估手语翻译的语义理解,揭示当前系统在语义正确性上的不足。

详情
AI中文摘要

手语翻译(SLT)通常使用表面形式指标(如BLEU和ROUGE)进行评估,这些指标奖励词汇重叠,但不直接衡量翻译是否保留了源手语序列的含义。这与将SLT集成到辅助技术中的最终目标相悖。在这项工作中,我们将重点从手语翻译(SLT)转向手语理解(SLU),特别强调语义理解。具体来说,我们根据系统从输入视频中正确恢复原始句子关键语义方面的能力来评估系统,例如发生的动作以及关于人和物体的事实。为了系统地实现这种评估,我们提出了SLU-2K,这是一个基于流行的PHOENIX-2014T和CSL-Daily数据集的2350个封闭式视频问答对的数据集。为了获得SLU-2K,我们提出并广泛评估了一个自动数据生成流水线,该流水线生成7个类别的问题,即动作、位置、数字、物体、人物、时间和天气条件。我们通过评估流行的多模态大语言模型(MLLM)和两个代表性的最先进系统MMSTL和SpaMo,展示了SLU-2K的潜力。我们的结果表明,MLLM达到了接近随机的性能,突显了当前AI系统中需要更系统地集成SLU。此外,在领域内数据上精心微调的最先进翻译系统仍然存在显著的语义差距,结果范围从56.7%到75.2%。这些发现表明,当前的SLT评估协议高估了真正的理解,未来的进展不仅应通过流畅性和n-gram重叠来衡量,还应通过语义正确性来衡量。代码、提示和基准文件可在此https URL获取。

英文摘要

Sign Language Translation (SLT) is typically evaluated with surface-form metrics such as BLEU and ROUGE, which reward lexical overlap but do not directly measure whether a translation preserves the meaning of the source sign sequence. This is in contrast with the final objective of integrating SLT in assistive technology. In this work, we shift the focus from Sign Language Translation (SLT) to Sign Language Understanding (SLU), with particular emphasis on semantic understanding. Specifically, we evaluate systems based on their ability to correctly recover, from the input video, key semantic aspects of the original sentence, such as actions taking place and facts about people and objects. To enable this evaluation systematically, we propose SLU-2K, a dataset of 2,350 closed-ended video question-answer pairs based on the popular PHOENIX-2014T and CSL-Daily datasets. To obtain SLU-2K, we propose and extensively evaluate an automated data generation pipeline which produces questions across 7 categories, namely actions, locations, numbers, objects, people, time, and weather conditions. We show the potential of SLU-2K by evaluating popular Multimodal Large Language Models (MLLMs) and two representative state-of-the-art systems, MMSTL and SpaMo. Our results show that MLLMs reach near-random performance, highlighting the need for a more systematic integration of SLU in current AI systems. Furthermore, state-of-the-art translation systems carefully fine-tuned on in-domain data still exhibit a substantial semantic gap, with results ranging from 56.7% to 75.2%. These findings suggest that current SLT evaluation protocols overestimate true understanding and that future progress should be measured not only by fluency and n-gram overlap, but also by semantic correctness. Code, prompts, and benchmark files are available at https://github.com/ZenoTsT/SLU-2K

2606.03782 2026-06-03 cs.CL

Reasoning over Grammar: Can Synthetic Linguistic Reasoning Traces Enhance Low-Resource Machine Translation?

基于语法的推理:合成语言推理轨迹能否增强低资源机器翻译?

Renhao Pei, Yihong Liu, Sampo Pyysalo, Hinrich Schütze, Shaoxiong Ji

AI总结 本文提出自动生成语言推理轨迹的方法,通过上下文学习、监督微调和强化微调评估其对低资源机器翻译的影响,发现推理轨迹在推理时指导效果显著,但作为训练数据收益有限。

详情
AI中文摘要

大型语言模型(LLMs)通过上下文学习整合语言资源,为极低资源语言的机器翻译(MT)提供了一种有前景的方法。然而,LLMs在翻译过程中往往难以有效应用语法信息。受思维链推理最新进展的启发,我们研究了低资源MT是否能从结构化的中间语言分析和语法推理步骤中受益。我们提出了一种从通用依存树库、词典和语法规则库自动生成逐步语言推理轨迹的流程。我们以锡伯语和Chintang语为测试案例,在三种设置下评估这些轨迹:上下文学习(ICL)、监督微调(SFT)和强化微调(RFT)。我们的结果表明,语言推理轨迹作为推理时指导最为有效:在ICL中,可靠的句子特定轨迹在大多数模型、语言和指标上显著提升了翻译性能。相比之下,将语言推理轨迹作为训练数据使用带来的收益较小且不一致,因为模型学习了轨迹格式但往往生成错误内容。这些发现表明,当提供可靠的语言分析时,LLMs能够利用语法信息进行低资源MT,而学习生成此类分析仍然是主要瓶颈。

英文摘要

Large language models (LLMs) offer a promising approach to machine translation (MT) for extremely low-resource languages by incorporating linguistic resources through in-context learning. However, LLMs often struggle to apply grammatical information effectively during translation. Inspired by recent progress in chain-of-thought reasoning, we investigate whether low-resource MT can benefit from structured intermediate steps of linguistic analysis and grammatical reasoning. We propose a pipeline for automatically generating step-by-step linguistic reasoning traces from Universal Dependencies treebanks, dictionaries, and grammar-rule banks. We evaluate these traces in three settings: in-context learning (ICL), supervised fine-tuning (SFT), and reinforcement fine-tuning (RFT), on Xibe and Chintang as test cases. Our results show that linguistic reasoning traces are most effective as inference-time guidance: in ICL, reliable sentence-specific traces substantially improve translation performance across most models, languages, and metrics. In contrast, using the linguistic reasoning traces as training data yields smaller and less consistent gains, as models learn the trace format but often generate erroneous content. These findings suggest that LLMs can leverage grammatical information for low-resource MT when given reliable linguistic analyses, while learning to generate such analyses remains a major bottleneck.

2606.03780 2026-06-03 cs.CL cs.LG

Expert-Aware Causal Tracing of Factual Recall in Sparse MoE Language Models

专家感知的稀疏MoE语言模型中事实回忆的因果追踪

Yuetian Lu, Ali Modarressi, Yihong Liu, Hinrich Schütze

AI总结 针对稀疏混合专家语言模型,提出专家感知的因果追踪方法,通过干预专家级更新定位事实回忆的关键专家,发现专家级定位依赖于模型和协议。

详情
Comments
Preprint
AI中文摘要

事实回忆的因果追踪主要在密集Transformer语言模型中进行研究,其中干预将信息流定位到层或前馈模块。稀疏混合专家(MoE)语言模型引入了一个更尖锐的问题:当事实预测由路由的MoE块中介时,哪些路由的专家贡献起作用?我们为稀疏MoE语言模型制定了专家感知的因果追踪。使用CounterFact事实,我们首先通过向主题词嵌入添加噪声来破坏模型的事实偏好,然后测试干净的MoE块输出或干净的专家级更新是否恢复了真实与虚假logit对比。对于Qwen3-30B-A3B-Base,层扫描选择并验证了第44层,专家级追踪识别出L44E069作为在干净运行中反复选择的专家,其保留的补丁优于其他活跃的同一层专家补丁。对于Mixtral-8x7B-v0.1,层级追踪验证了中层信号,但该信号并未定位到选定的单个专家;相反,联盟检查通过路由的多专家更新恢复了它。这些结果表明,MoE事实追踪可以做到专家感知,同时也表明专家级定位是依赖于模型和协议的,而非普遍适用。

英文摘要

Causal tracing of factual recall has been studied predominantly in dense transformer language models, where interventions localize information flow to layers or feed-forward modules. Sparse mixture-of-experts (MoE) language models introduce a sharper question: when a factual prediction is mediated by a routed MoE block, which routed expert contributions matter? We formulate expert-aware causal tracing for sparse MoE language models. Using CounterFact facts, we first corrupt the model's factual preference by adding noise to subject-token embeddings, and then test whether clean MoE-block outputs or clean expert-level updates restore the true-vs-foil logit contrast. For Qwen3-30B-A3B-Base, a layer sweep selects and validates layer 44, and expert-level tracing identifies L44E069 as an expert repeatedly selected in the clean run whose held-out patch outperforms other active same-layer expert patches. For Mixtral-8x7B-v0.1, layer-level tracing validates a mid-layer signal, but the signal is not localized to the selected singleton expert; a coalition check instead recovers it with routed multi-expert updates. These results suggest that MoE factual tracing can be made expert-aware, while also showing that expert-level localization is model- and protocol-dependent rather than universal.

2606.03777 2026-06-03 cs.AI cs.CR q-fin.RM

From Control Boundary to Insurance Claim: Reconstructing AI-Mediated Losses Through the CER Framework

从控制边界到保险索赔:通过CER框架重构AI中介损失

Alex Leung, Rex Zhang, Kentaroh Toyoda, SiewMei Loh

AI总结 本文提出CER框架(控制边界、证据重构、保险响应),用于诊断和重构由生成式或代理式AI系统导致的损失,以支持保险索赔。

详情
AI中文摘要

通过受保组织的生成式或代理式AI系统产生的AI损失需要状态重构,而不仅仅是事件重构,因为相关状态会随着系统推理、检索、调用工具和行动而改变。相关的问题不仅是发生了什么损失,还包括系统被允许做什么、实际做了什么,以及重构的损失能否支持保险索赔。本文处理受保人的AI系统处于因果链中的损失,包括外部触发的故障,如提示注入、检索增强生成(RAG)投毒、恶意工具输出、凭证滥用和数据投毒。具体而言,本文介绍了CER,一种用于AI残余风险转移的用例级诊断。C(控制边界)询问系统是否具有可执行的操作范围。E(证据重构)询问是否可以从保留的工件中重构系统状态和因果链。R(保险响应)询问重构的损失是否被保险:保险覆盖是否在市场上可用并为受保人投保,以及支持保险索赔所需的证据。本文做出三项贡献:定义了AI特定的重构问题,通过CER操作化该问题,并指定了AI重构的索赔级证据。公开示例包括报道的PocketOS和Replit代理数据库删除事件,以及作为已裁决的输出/依赖案例的Moffatt诉加拿大航空案。关键词:AI系统;CER框架;残余风险转移;代理式AI;生成式AI;AI保险;证据重构。

英文摘要

AI losses that arise through an insured organization's generative or agentic AI system require state reconstruction, not merely event reconstruction, because the relevant state changes as the system reasons, retrieves, calls tools, and acts. The relevant question is not only what loss occurred, but what the system was allowed to do, what it actually did, and whether that reconstructed loss can support insurance claim recovery. This paper addresses losses in which the insured's AI system is in the causal chain, including externally triggered failures such as prompt injection, retrieval-augmented generation (RAG) poisoning, malicious tool output, credential misuse, and data poisoning. Specifically, this paper introduces CER, a use-case-level diagnostic for AI residual risk transfer. C (control boundary) asks whether the system had an enforceable operating envelope. E (evidence reconstruction) asks whether the system state and causal chain can be reconstructed from retained artifacts. R (insurance response) asks whether the reconstructed loss is insured: whether insurance coverage is available in the market and placed for the insured, together with the proof needed to support insurance claim recovery. The paper makes three contributions: it defines the AI-specific reconstruction problem, operationalizes that problem through CER, and specifies claim-grade evidence for AI reconstruction. Public examples include the reported PocketOS and Replit agentic database-deletion incidents and Moffatt v. Air Canada as an adjudicated output/reliance case. Keywords: AI systems; CER framework; residual risk transfer; agentic AI; generative AI; AI insurance; evidence reconstruction.

2606.03774 2026-06-03 cs.CV

AmbientEye: A Dataset for Pupil Segmentation under Natural Ambient Infrared Illumination

AmbientEye: 自然环境红外光照下的瞳孔分割数据集

Mingyu Han, Hyunyoung Han, Nitheekulawatn Thommakoon, Gangtae Park, Jieun Han, Xucong Zhang, Ian Oakley

AI总结 本文提出AmbientEye数据集,探索在无主动红外光源、仅依靠环境阳光的户外场景中,利用被动红外相机实现可靠瞳孔检测,并评估现有算法性能下降。

详情
Comments
12 pages, 7 figures
AI中文摘要

眼动追踪对于智能眼镜至关重要,因为它能为环境智能应用提供用户注意力的洞察。然而,大多数现有的眼动追踪系统依赖主动红外(IR)照明,由于功耗问题,在全天户外使用中造成了实际障碍。本文研究了在无任何主动红外光源、仅依靠环境阳光作为唯一照明源的户外环境中,单独使用被动红外相机能否实现可靠的瞳孔检测。为支持这一研究,我们引入了AmbientEye,这是一个包含来自19个国家35名参与者的2,606,225张眼部图像的大规模数据集。该数据集在户外自然阳光下,使用两种离轴相机配置和两种太阳方向条件采集。我们通过SAM2自动分割提供高质量的瞳孔标注,随后由人工标注员进行细化。我们在数据集上评估了一种最先进的瞳孔分割算法,并将其性能与在受控红外照明下的现有数据集上的性能进行了比较。结果显示,瞳孔分割性能从受控红外数据集上的0.928大幅下降到AmbientEye上的0.767。这一性能差距凸显了环境光设置的挑战。这使得AmbientEye成为未探索且高度实用的眼动追踪场景的第一个基准。

英文摘要

Eye tracking is essential for smart glasses, as it provides insight into user attention for ambient intelligence applications. However, most existing eye-tracking systems rely on active infrared (IR) illumination, creating practical barriers to all-day outdoor use due to power consumption. In this paper, we investigate whether passive IR cameras alone, without any active IR light source, can enable reliable pupil detection in unconstrained outdoor environments, where ambient sunlight serves as the sole illumination source. To support this investigation, we introduce AmbientEye, a large-scale dataset of 2,606,225 eye images collected from 35 participants from 19 countries. It is captured outdoors under natural sunlight with two off-axis camera configurations and two sun-orientation conditions. We provide high-quality pupil annotation through SAM2 automatic segmentation, followed by refinement by human annotators. We benchmark a state-of-the-art pupil segmentation algorithm on our dataset and compare its performance with that on existing datasets under controlled IR illumination. Results reveal a substantial drop in pupil segmentation performance from 0.928 on controlled IR datasets to 0.767 on AmbientEye. This performance gap highlights the challenge of the ambient-light setting. This positions AmbientEye as a first benchmark for an unexplored and highly practical eye-tracking scenario.

2606.03773 2026-06-03 cs.CL

KletterMix: Climbing Toward High-Quality German Pretraining Data

KletterMix: 攀登高质量德语预训练数据

Maurice Kraus, Ruben Härle, Sebastian Sztwiertnia, Abbas Goher Khan, Mehdi Ali, Michael Fromm, Kristian Kersting

AI总结 通过翻译高质量英语语料库构建德语预训练语料库KletterMix,并验证其在德语下游任务中的有效性。

详情
AI中文摘要

高质量的预训练数据是现代语言模型的核心要素,但德语资源的开发远不如英语资源:它们通常规模更小、筛选不仔细、文档不完善,并且很少通过受控训练实验进行验证。我们引入了KletterMix,一个用于语言模型预训练和退火的高质量德语语料库,旨在为自然语言处理和建模社区提供可重复使用的数据集产品。KletterMix通过将最先进的英语预训练语料库翻译成德语构建,同时保留文档边界、元数据、源结构和主题多样性。这种构建方式产生了一个具有现代预训练数据集规模和多样性的德语语料库,同时允许与其英语源进行直接比较。我们通过广泛的语料库级别分析来记录数据集,包括翻译质量、文档长度分布、主题覆盖、源组成和地理元数据。使用COMETKiwi,我们表明翻译后的文档在不同领域实现了高质量,表明仔细的翻译可以保留原始语料库的许多语义和风格丰富性。除了数据集构建,我们还评估了KletterMix作为训练数据。通过针对现有德语语料库的受控预训练和退火消融实验,我们表明在KletterMix上训练的模型在德语下游评估中取得了可衡量的改进。这些结果表明,精心策划的翻译数据可以显著增强德语预训练数据生态系统。

英文摘要

High-quality pretraining data is a central ingredient in modern language models, but German-language resources remain far less developed than their English counterparts: they are often smaller, less carefully curated, weakly documented, and rarely validated through controlled training experiments. We introduce KletterMix, a high-quality German corpus for language model pretraining and annealing, designed as a reusable dataset artifact for the natural language processing and modeling community. KletterMix is built by translating a state-of-the-art English pretraining corpus into German while preserving document boundaries, metadata, source structure, and topical diversity. This construction yields a German corpus with the scale and diversity of a modern pretraining dataset, while enabling direct comparison to its English source. We document the dataset through a broad set of corpus-level analyses, including translation quality, document length distributions, topic coverage, source composition, and geographic metadata. Using COMETKiwi, we show that the translated documents achieve strong quality across diverse domains, suggesting that careful translation can preserve much of the semantic and stylistic richness of the original corpus. Beyond dataset construction, we evaluate KletterMix as training data. Through controlled pretraining and annealing ablations against established German corpora, we show that models trained on KletterMix achieve measurable improvements on German-language downstream evaluations. These results demonstrate that carefully curated translated data can substantially strengthen the German pretraining data ecosystem.

2606.03770 2026-06-03 cs.DC cs.AI

E2LLM: Towards Efficient LLM Serving in Heterogeneous Edge/Fog Environments

E2LLM:异构边缘/雾环境中高效LLM服务

Truong-Thanh Le, Amir Taherkordi, Hoang-Loc La, Frank Eliassen, Phuong Hoai Ha, Peiyuan Guan

AI总结 提出E2LLM框架,通过复制模型到多设备组并采用模型并行,结合遗传算法聚类和动态规划分区,在资源受限的异构边缘/雾环境中实现高效LLM部署,相比Splitwise基线在高需求下平均等待时间降低50%以上。

详情
AI中文摘要

大型语言模型(LLM)已成为现代应用不可或缺的一部分,但其部署仍具挑战性。除了执行模型本身,实际部署必须解决成本效率、低延迟和最优资源利用问题。传统方法通常假设整个模型可以托管在单个设备上,这在许多现实场景中不成立,尤其是在设备资源受限的边缘和雾环境中。本文介绍了E2LLM,一个旨在在此类资源有限环境中实现高效LLM部署的框架。E2LLM并非简单地将单个模型分区到所有可用设备,而是将完整模型复制到多个设备组(副本),并在每个副本内应用模型并行。每个副本根据其处理输入和输出令牌的效率被分配专门角色PREFILL或DECODER。这种分离利用了LLM推理这两个阶段之间的固有差异。为了有效组织设备,我们利用遗传算法形成最大化系统性能的集群。在每个集群内,我们应用动态规划确定最优分区策略,以最小化模型并行执行中的瓶颈。实验结果表明,我们的方法能够稳健地适应不同工作负载,包括输入和输出令牌长度显著变化的场景。与Splitwise基线相比,E2LLM在高需求条件下将平均等待时间降低了50%以上。

英文摘要

Large Language Models (LLMs) have become integral to modern applications, yet their deployment remains challenging. Beyond executing the models themselves, practical deployment must address cost efficiency, low latency, and optimal resource utilization. Conventional approaches typically assume that an entire model can be hosted on a single device, which does not hold in many real-world scenarios, particularly in Edge and Fog environments where device resources are constrained. In this paper, we introduce E2LLM, a framework designed to enable efficient LLM deployment in such resource limited settings. Rather than simply partitioning a single model across all available devices, E2LLM replicates the full model across multiple groups of devices (replicas) and applies model parallelism within each replica. Each replica is assigned a specialized role PREFILL or DECODER based on its efficiency in handling input and output tokens. This separation leverages the inherent differences between these two phases of LLM inference. To effectively organize devices, we utilize a Genetic Algorithm to form clusters that maximize system performance. Within each cluster, we apply Dynamic Programming to determine an optimal partitioning strategy that minimizes bottlenecks in model-parallel execution. Experimental results demonstrate that our approach adapts robustly to varying workloads, including scenarios with significant variation in input and output token lengths. Compared to the Splitwise baseline, E2LLM reduces average waiting time by over 50% under high-demand conditions

2606.03768 2026-06-03 cs.CL

HybridThinker: Efficient Chain-of-Thought Reasoning via Compressed Memory and Transient Thought Steps

HybridThinker: 通过压缩记忆和瞬态思考步骤实现高效的思维链推理

Xin Liu, Runsong Zhao, Xinyu Liu, Junhao Ruan, Pengcheng Huang, Shichao Dong, Chunyang Xiao, Chenglong Wang, Changliang Li, Jingbo Zhu, Tong Xiao

AI总结 提出HybridThinker方法,通过保留压缩记忆和临时保留思考步骤,并采用混合训练方案,在保持推理准确性的同时显著压缩思维链长度。

详情
Comments
23 pages, 9 figures
AI中文摘要

扩展的思维链(CoT)轨迹提升了LLM的推理能力,但带来了大量的计算和内存成本。现有的CoT压缩方法通过记忆令牌将思考步骤压缩为紧凑表示,并在推理时仅保留这些表示,但细粒度信息的丢失使得后续步骤更容易出错。为了缓解这一问题,我们提出了 extbf{HybridThinker},除了保留这些表示外,思考步骤也会被临时保留以提供细粒度细节。然而,我们观察到在训练期间天真地让后续步骤可以访问思考步骤,会使模型绕过记忆令牌直接从这些步骤中检索信息,导致模型通过记忆令牌压缩和检索信息的能力训练不足。因此,我们引入了一种混合训练方案,其中只有部分思考步骤可以通过注意力直接访问后续步骤,而其他思考步骤被屏蔽,迫使模型使用记忆令牌进行压缩和检索。在4个推理基准测试中,HybridThinker与未压缩的基线性能相当,在相似的推理时间下,平均准确率比现有的CoT压缩方法提高了5.8个百分点。消融研究证实,临时思考步骤保留和混合训练方案都对这一提升有贡献。

英文摘要

Extended chain-of-thought (CoT) traces improve LLM reasoning but incur substantial computational and memory costs. While existing CoT compression methods mitigate this by condensing thought steps into compact representations via memory tokens and retaining only these representations at inference time, the loss of fine-grained information makes subsequent steps more error-prone. To alleviate this, we propose \textbf{HybridThinker}, where in addition to preserved these representations, thought steps are also temporarily retained to provide fine-grained details. However, we observe that naively keeping thought steps accessible to subsequent steps \emph{during training} lets the model bypass memory tokens by retrieving information directly from these steps, leaving the model's ability to compress and retrieve information through memory tokens insufficiently trained. We therefore introduce a hybrid training scheme, in which only some thought steps are directly accessible through attention to subsequent steps, while the other thought steps are masked, forcing the model to use memory tokens for compression and retrieval. Across 4 reasoning benchmarks, HybridThinker matches the uncompressed baseline, advancing the state of the art in CoT compression by 5.8 points on average accuracy with similar inference time. Ablation studies confirm that both temporary thought-step retention and the hybrid training scheme contribute to these gains.

2606.03762 2026-06-03 cs.LG cs.AI

Tool-Aware Optimization with Entropy Guidance for Efficient Agentic Reinforcement Learning

基于熵引导的工具感知优化用于高效智能体强化学习

Hongye Cao, Nuo Yan, Haoyuan Deng, Ziwei Wang, Tianpei Yang, Jing Huo, Yuyao Zhang, Yang Gao

AI总结 提出TAO-RL框架,通过工具感知轨迹过滤和熵引导探索解决智能体强化学习中工具使用导致的训练不稳定问题,在7个推理基准上优于现有方法。

详情
AI中文摘要

智能体强化学习(RL)使大型语言模型(LLMs)具备工具使用能力,从而显著提升复杂任务的推理性能。然而,整合外部工具常常导致训练不稳定:过度依赖工具会引发输入分布偏移,而过于保守的工具使用则限制了有效探索。为解决这一问题,我们提出统一框架TAO-RL,将工具感知轨迹过滤与熵引导探索相结合,以实现高效策略优化。具体而言,在数据层面,TAO-RL根据两个标准过滤轨迹:丢弃所有工具调用均执行失败的轨迹,以及移除所有轨迹全部正确或全部错误的轨迹,因为这两种情况都会产生退化的优势估计,无法提供有区分度的学习信号。这种联合过滤保留了既具备工具能力又包含信息量的数据,建立了高质量的训练分布。在算法层面,我们引入工具感知的熵引导奖励,重塑工具调用后token的优势函数,鼓励策略在关键决策点探索更多样化的推理路径。这两个组成部分相互增强:轨迹过滤建立了干净且信息丰富的训练基础,而熵引导探索则在关键工具交互节点驱动更强的推理行为。在3种模型规模下的7个具有挑战性的推理基准上的大量实验表明,TAO-RL优于现有方法。

英文摘要

Agentic reinforcement learning (RL) equips large language models (LLMs) with tool-use capabilities that substantially improve reasoning on complex tasks. However, integrating external tools often destabilizes training: over-reliance on tools can induce input distribution shift, while overly conservative tool use limits effective exploration. To address this issue, we propose a unified framework TAO-RL that couples tool-aware trajectory filtering with entropy-guided exploration for efficient policy optimization. Specifically, at the data level, TAO-RL filters rollout trajectories along two criteria: discarding those where all tool invocations fail to execute, and removing those where all rollouts are either correct or incorrect, as both cases yield degenerate advantage estimates that contribute no discriminative learning signal. This joint filtering retains data that are both tool-capable and informative, establishing a high-quality training distribution. At the algorithmic level, we introduce a tool-aware entropy-guided bonus that reshapes the advantage function at post-tool-call tokens, encouraging the policy to explore more diverse reasoning paths at critical decision points. These two components are mutually reinforcing: trajectory filtering establishes a clean and informative training foundation, while entropy-guided exploration drives stronger reasoning behaviors at critical tool-interaction junctures. Extensive experiments on 7 challenging reasoning benchmarks across 3 model scales demonstrate the superiority of TAO-RL over existing methods.

2606.03761 2026-06-03 cs.CL

Framing Migration News with LLMs: Structured CoT as a Support for Human Interpretation

使用LLM构建移民新闻框架:结构化思维链作为人类解释的支持

David Alonso del Barrio, Jing Wen, Daniel Gatica-Perez

AI总结 本研究提出结构化思维链(SCoT)提示方法,利用本地可部署的开源LLM(Llama3-8B)辅助移民新闻的框架分析,在单GPU上提升分类性能,并通过人类评估验证其逻辑性和对解释的影响。

详情
AI中文摘要

移民新闻的框架分析是一项具有社会重要性的任务:研究移民如何被叙述的媒体学者和研究人员需要不仅准确,而且透明、可审计,并在学术研究小组典型的资源限制下可访问的工具。现有的基于LLM的方法依赖于专有API和大模型,这引发了媒体研究人员对数据隐私、可重复性和公平访问的担忧。本研究探讨了本地可部署的开源LLM如何作为辅助工具支持可解释的框架分析。我们引入了一种使用Llama3-8B的结构化思维链(SCoT)提示方法,实现了基于预定义框架类别的逐步推理。这种结构化设计允许用户审计模型输出,并在本质上主观的任务中检查替代解释。我们在一个与移民相关的新闻数据集上评估了我们的方法,结果表明SCoT在单GPU上可行,同时比零样本和少样本基线提高了分类性能。然后,我们进行了一项以人为中心的评估,注释者评估“模型推理”的一致性和影响力。结果表明,SCoT解释通常被认为是逻辑的(平均得分4.1/5,尽管不同文本间存在显著差异),并且可以引发对初始解释的反思,即使存在分歧。我们的发现强调了LLM辅助框架分析的潜力和风险。虽然结构化推理可以增加模型输出的可追溯性并支持批判性解释,但它也可能以微妙的方式影响人类判断。通过支持本地部署并强调人在环中的交互,这项工作为研究具有社会影响力的媒体叙事的负责任和可访问的计算工具做出了贡献。

英文摘要

Frame analysis of migration news is a socially consequential task: media scholars and researchers who study how migration is narrated need tools that are not only accurate, but transparent, auditable, and accessible within the resource constraints typical of academic research groups. Existing LLM-based approaches rely on proprietary APIs and large models that raise concerns about data privacy, reproducibility and equitable access among media researchers. This work studies how a locally deployable open-source LLM can support interpretable frame analysis as an assistive tool. We introduce a Structured Chain-of-Thought (SCoT) prompting approach using Llama3-8B, enabling step-by-step justifications grounded in predefined framing categories. This structured design allows users to audit model outputs and examine alternative interpretations in a task that is inherently subjective. We evaluate our approach on a dataset of migration-related news and show that SCoT improves classification performance over zero-shot and few-shot baselines while remaining feasible on a single GPU. Then, we conduct a human-centered evaluation in which annotators assess the coherence and influence of "the model's reasoning". Results indicate that SCoT explanations are generally perceived as logical (mean score 4.1/5, though with notable variation across texts) and can prompt reflection on initial interpretations, even when disagreement persists. Our findings highlight both the potential and risks of LLM-assisted frame analysis. While structured reasoning can increase the traceability of model outputs and support critical interpretation, it can also influence human judgment in subtle ways. By enabling local deployment and emphasizing human-in-the-loop interaction, this work contributes to discussions on responsible and accessible computational tools for the study of socially impactful media narratives.

2606.03756 2026-06-03 cs.RO cs.LG

Neural Navigation Functions for Zero-Shot Generalizable Motion Planning

神经导航函数用于零样本泛化运动规划

Benjamin D. Shaffer, Pei-An Hsieh, Brooks Kinch, Nathaniel Trask, M. Ani Hsieh

AI总结 提出神经导航函数(Neural-NF),通过将数据驱动适应嵌入结构化椭圆规划器,实现跨未见环境几何的零样本迁移,并保证无碰撞、单调下降和全局最小值。

详情
Comments
17 pages, 10 figures
AI中文摘要

我们引入了神经导航函数(Neural-NF),一种学习到的反应式导航函数,能够跨未见环境几何进行零样本迁移。Neural-NF将数据驱动适应置于结构化椭圆规划器中,其中导航目标被学习,而规划器结构通过构造得以保留。具体来说,内在的拉普拉斯派生特征被映射到局部PDE系数,求解得到的边值问题在每个目标域上产生全局一致的值函数。对于每个可接受的学习模型,所得策略无碰撞,提供单调下降,并通过构造在目标处具有全局最小值。这为任何参数设置提供了线性可解的最优控制解释。实验上,Neural-NF在多样几何上实现了强大的零样本迁移,并比直接预测值函数的学习规划器性能提升高达5倍。

英文摘要

We introduce Neural Navigation Functions (Neural-NF), a learned reactive navigation function capable of zero-shot transfer across unseen environment geometries. Neural-NF places data-driven adaptation within a structured elliptic planner, where the navigation objective is learned while planner structure is preserved by construction. Specifically, intrinsic Laplacian-derived features are mapped to local PDE coefficients, and solving the resulting boundary value problem produces a globally consistent value function on each target domain. For every admissible learned model, the resulting policy is collision-free, provides monotonic descent and a global minimum at the goal by construction. This admits a linearly-solvable optimal-control interpretation for any parameter setting. Empirically, Neural-NF achieves strong zero-shot transfer across diverse geometries and outperforms learned planners that directly predict the value function by up to a $5\times$ improvement.

2606.03755 2026-06-03 cs.AI

LAP: An Agent-to-Instrument Protocol for Autonomous Science

LAP:面向自主科学的智能体-仪器协议

Linwu Zhu, Liqiang Gao, Yan Chen, Dan Zhu, Jian Huang

AI总结 针对自主科学中智能体与物理仪器连接碎片化的问题,提出LAP协议,通过InstrumentCard、预留机制、安全围栏握手和测量结果模式,实现有状态、安全关键的操作,并与现有生态系统兼容。

详情
Comments
31 pages
AI中文摘要

自主科学正从演示走向基础设施。大型语言模型智能体现在规划实验,而自动驾驶实验室执行实验。然而,每个此类系统都从头重建推理智能体与物理仪器之间的连接,面对的是为确定性软件客户端而非概率性、目标导向的智能体构建的碎片化供应商SDK和标准。最近的智能体互操作性协议明确了智能体生态系统的三个边缘中的两个(Anthropic的模型上下文协议(MCP)标准化了智能体-工具边缘,Google的Agent2Agent(A2A)标准化了智能体-智能体边缘),但两者都没有建模智能体-仪器边缘,其中操作是有状态的、安全关键的、独占的、物理体现的,并产生带有单位、校准和不确定性的测量结果。我们提出了实验室智能体协议(LAP),一种填补这一空白的协议设计。LAP保留了A2A的点对点、发现优先、任务生命周期结构,并增加了四个物理世界原语:(i)InstrumentCard,一种签名的能力和物理限制描述;(ii)用于独占仪器和样品锁定的第一类预留;(iii)安全围栏握手,带有与特定任务及其参数加密绑定的操作员确认令牌,用于控制危险和不可逆操作;(iv)MeasurementResult模式,使每个结果在物理上类型化(QUDT/UCUM)、校准锚定、带有不确定性且可重现。我们规定了角色、六层架构、JSON-RPC方法集、任务和安全状态机、错误模型以及跨实验室联合,并通过协议端到端地走通一个闭环自主实验活动。LAP在传输上与A2A/MCP生态系统兼容,并封装而非取代现有设备标准如SiLA 2和OPC-UA。

英文摘要

Autonomous science is moving from demonstration to infrastructure. Large language model agents now plan experiments, and self-driving laboratories execute them. Yet every such system rebuilds the link between the reasoning agent and the physical instrument from scratch, against fragmented vendor SDKs and standards built for deterministic software clients rather than probabilistic, goal-directed agents. Recent agent-interoperability protocols clarify two of the three edges of an agentic ecosystem (Anthropic's Model Context Protocol (MCP) standardizes the agent-to-tool edge, and Google's Agent2Agent (A2A) the agent-to-agent edge), but neither models the agent-to-instrument edge, where operations are stateful, safety-critical, exclusively owned, physically embodied, and produce measurements with units, calibration, and uncertainty. We present the Lab Agent Protocol (LAP), a protocol design that fills this gap. LAP retains A2A's peer-to-peer, discovery-first, task-lifecycle structure and adds four physical-world primitives: (i) the InstrumentCard, a signed capability and physical-limit description; (ii) first-class reservation for exclusive instrument and sample locking; (iii) a safety-fence handshake with operator-confirmation tokens cryptographically bound to a specific task and its parameters, gating hazardous and irreversible operations; and (iv) a MeasurementResult schema that makes every result physically typed (QUDT/UCUM), calibration-anchored, uncertainty-bearing, and reproducible by construction. We specify roles, a six-layer architecture, the JSON-RPC method set, the task and safety state machines, the error model, and cross-laboratory federation, and walk a closed-loop autonomous campaign through the protocol end-to-end. LAP is transport-compatible with the A2A/MCP ecosystem and encapsulates rather than replaces existing device standards such as SiLA 2 and OPC-UA.

2606.03748 2026-06-03 cs.CV cs.AI

Ultralytics YOLO26: Unified Real-Time End-to-End Vision Models

Ultralytics YOLO26: 统一的实时端到端视觉模型

Glenn Jocher, Jing Qiu, Mengyu Liu, Shuai Lyu, Fatih Cagatay Akyon, Muhammet Esat Kalfaoglu

AI总结 本文提出YOLO26,通过双头设计、MuSGD优化器、渐进损失和STAL标签分配策略,实现无NMS的端到端实时检测,并在实例分割、姿态估计等任务上取得一致提升。

详情
Comments
31 pages, 8 figures
AI中文摘要

实时视觉需要准确、高效且易于在不同硬件上部署的模型。YOLO系列因此被广泛部署,但大多数YOLO检测器在推理时仍依赖非极大值抑制,由于分布聚焦损失而携带沉重的检测头,需要长时间的训练计划,并且可能使最小的物体没有正标签分配。我们提出Ultralytics YOLO26,一个统一的实时视觉模型系列,通过协调的架构和训练进展解决了这些限制。YOLO26采用双头设计实现原生无NMS的端到端推理,并完全移除DFL,产生具有无约束回归范围的更轻量头。其训练流程结合了MuSGD(一种从大语言模型训练改编的混合Muon-SGD优化器)、渐进损失(将监督转向推理时头)和STAL(一种保证小物体正覆盖的标签分配策略)。除了检测,YOLO26还为实例分割、姿态估计和旋转检测引入了特定任务的头和损失设计,在任务和尺度上产生一致的增益。该系列涵盖五个尺度(n/s/m/l/x),并在单一流程中支持检测、实例分割、姿态估计、分类和旋转检测,还有一个开放词汇扩展YOLOE-26,用于文本、视觉和提示无关的推理。在所有尺度上,YOLO26在COCO上以1.7-11.8 ms T4 TensorRT延迟实现40.9-57.5 mAP,在精度-延迟帕累托前沿上超越了先前的实时检测器,而YOLOE-26x在文本提示下于LVIS minival上达到40.6 AP。代码和模型可在https://this URL获取。

英文摘要

Real-time vision demands models that are accurate, efficient, and simple to deploy across diverse hardware. The YOLO family has become widely deployed for this reason, yet most YOLO detectors still rely on non-maximum suppression at inference, carry heavy detection heads due to Distribution Focal Loss, require long training schedules, and can leave the smallest objects without positive label assignments. We present Ultralytics YOLO26, a unified real-time vision model family that addresses these limitations through coordinated architecture and training advances. YOLO26 uses a dual-head design for native NMS-free end-to-end inference and removes DFL entirely, yielding a lighter head with unconstrained regression range. Its training pipeline combines MuSGD, a hybrid Muon-SGD optimizer adapted from large language model training; Progressive Loss, which shifts supervision toward the inference-time head; and STAL, a label assignment strategy that guarantees positive coverage for small objects. Beyond detection, YOLO26 introduces task-specific head and loss designs for instance segmentation, pose estimation, and oriented detection, producing consistent gains across tasks and scales. The family spans five scales (n/s/m/l/x) and supports detection, instance segmentation, pose estimation, classification, and oriented detection in a single pipeline, with an open-vocabulary extension, YOLOE-26, for text-, visual-, and prompt-free inference. Across all scales, YOLO26 achieves 40.9-57.5 mAP on COCO at 1.7-11.8 ms T4 TensorRT latency, advancing the accuracy-latency Pareto front over prior real-time detectors, while YOLOE-26x reaches 40.6 AP on LVIS minival under text prompting. Code and models are available at https://github.com/ultralytics/ultralytics.

2606.03743 2026-06-03 cs.AI

Proof-Refactor: Refactoring Generated Formal Proofs into Modular Artifacts

Proof-Refactor: 将生成的形式化证明重构为模块化工件

Yiming Fu, Peixuan Liu, Zichen Wang, Kun yuan

AI总结 提出一个名为 Proof-Refactor 的智能体框架,通过四阶段流程(提取候选证明片段、设计辅助声明、形式化证明提取和设计的组件、使用验证组件修复原始证明)将大语言模型生成的形式化证明重构为更模块化、可读、可维护和可重用的工件,在 PutnamBench 和 Putnam2025 上的 Lean 证明重构中优于基线。

详情
Comments
21 pages, 3 figures, 3 tables
AI中文摘要

虽然大语言模型在生成形式化证明方面表现出色,但其输出通常比成熟形式化数学库中的证明更不易读、模块化、可维护和可重用。我们认为这一差距部分源于大多数证明生成流程中隐含的“先编译”目标,这鼓励了整体式或特设的证明脚本,而非库质量的工件。现有的证明质量改进方法通常依赖于显式的、可计算的优化目标。然而在实践中,最易处理且经过实验验证的目标主要是基于长度的,而更高级的质量(如可读性、模块化、可维护性和可重用性)很难简化为可靠的自动度量。我们没有针对单一代理指标优化证明改进,而是采用受人类证明重构工作流程启发的过程引导方法。我们提出了一个智能体框架 $ extbf{Proof-Refactor}$,将证明重构分解为四个阶段:提取候选证明片段、设计辅助声明、形式化证明提取和设计的组件,以及使用验证组件修复原始证明。在来自 PutnamBench 和 Putnam2025 的生成 Lean 证明上,Proof-Refactor 在基于评分标准的重构得分上优于强大的 Claude Code 重构基线,在签名质量和人类可读性方面提升最大。这些结果表明,过程引导的重构可以在不将证明长度作为主要目标的情况下改进证明结构。

英文摘要

While Large Language Models (LLMs) have shown strong performance in generating formal proofs, their outputs often remain less readable, modular, maintainable, and reusable than proofs in mature formal mathematics libraries. We argue that this gap stems in part from the compile-first objective implicit in most proof-generation pipelines, which encourages monolithic or ad hoc proof scripts rather than library-quality artifacts. Existing approaches to proof-quality improvement often rely on explicit, computable optimization objectives. In practice, however, the most tractable and experimentally validated objectives are largely length-based, while higher-level qualities such as readability, modularity, maintainability, and reusability are difficult to reduce to reliable automatic metrics. Instead of optimizing proof improvement against a single proxy metric, we take a process-guided approach inspired by human proof-refactoring workflows. We propose an agentic framework $\textbf{Proof-Refactor}$ that decomposes proof refactoring into four phases: extracting candidate proof fragments, designing helper declarations, formally proving the extracted and designed components, and repairing the original proof using the verified components. On generated Lean proofs from PutnamBench and Putnam2025, Proof-Refactor improves rubric-based refactoring scores over a strong Claude Code refactoring baseline, with the largest gains in signature quality and human readability. These results suggest that process-guided refactoring can improve proof structure without treating proof length as the primary objective.

2606.03741 2026-06-03 cs.AI

When to Re-Plan: Subgoal Persistence in Hierarchical Latent Reasoning

何时重新规划:分层潜在推理中的子目标持久性

Ayushi Chadha

AI总结 研究分层潜在推理中稳定性与适应性的权衡,通过封建式管理-工人接口控制子目标持久周期,发现中等周期(3-6步)最优,内在对齐权重存在窄最优值(约0.05)。

详情
Comments
Accepted at the Workshop on Compositional Learning: Safety, Interpretability, and Agents (CompLearn), ICML 2026. 10 pages, 2 figures
AI中文摘要

长程推理要求系统承诺中期意图而不变得僵化:过于频繁地重新规划会导致计算无法凝聚成多步结构;承诺时间过长则计划会过时。我们在潜在推理设置中研究这种稳定性-适应性权衡,其中多步计算发生在隐藏状态内部而非外部化的token轨迹。我们扩展了分层推理模型(HRM),采用封建式的管理-工人接口:一个缓慢的高层模块周期性地发出一个归一化的方向性子目标,该子目标持续P个低层步骤,影响工人的隐藏状态更新并提供内在的余弦对齐损失。在ARC和ConceptARC上,我们发现子目标持久性——而非仅仅子目标注入——是关键旋钮:中等周期P∈[3,6]始终优于非常频繁(P=1)和非常长的周期,在P=3时LM损失明显最小(1.544对比P=1时的1.674,基线1.640;在5个种子上重复,均值1.595,标准差0.045)。内在对齐权重λ显示出一个互补的窄最优值(λ≈0.05)。在过去的甜点λ处的受控消融实验隔离出学习到的方向结构——而非架构容量或辅助损失单独——作为当对齐信号超过其最优值时的干扰来源。这些发现共同暗示了潜在推理系统中组合规划的设计原则:中期意图必须在足够多的计算步骤上保持一致,以便形成组合结构。

英文摘要

Long-horizon reasoning requires a system to commit to medium-horizon intent without becoming rigid: re-plan too often and computation never coheres into multi-step structure; commit too long and the plan goes stale. We study this stability-adaptivity tradeoff in the latent reasoning setting, where multi-step computation occurs inside hidden state rather than externalized token traces. We extend the Hierarchical Reasoning Model (HRM) with a feudal-style manager-worker interface: a slow high-level module periodically emits a normalized directional subgoal that persists for P low-level steps, biasing the worker's hidden-state updates and supplying an intrinsic cosine alignment loss. On ARC and ConceptARC, we find that subgoal persistence -- not subgoal injection alone -- is the central knob: moderate periods P in [3, 6] consistently outperform both very frequent (P=1) and very long horizons, with a clear minimum LM loss at P=3 (1.544 vs. 1.674 at P=1, 1.640 baseline; replicated over 5 seeds at mean 1.595, std 0.045). The intrinsic alignment weight lambda shows a complementary narrow optimum (lambda approximately 0.05). A controlled ablation at past-sweet-spot lambda isolates learned directional structure -- not architectural capacity or auxiliary loss alone -- as the source of interference when the alignment signal exceeds its optimum. Together these findings implicate a design principle for compositional planning in latent reasoning systems: medium-horizon intent must be coherent across enough computational steps for compositional structure to form.

2606.03739 2026-06-03 cs.CL cs.IT math.IT

Entropy Gate: Entropy Quenching for Near-Lossless Token Compression in LLM Pipelines

熵门:LLM流水线中用于近无损令牌压缩的熵淬火

Justice Owusu Agyemang, Jerry John Kponyo, Kwame Opuni-Boachie Obour Agyekum, Francisca Adoma Acheampong, Kwame Agyeman-Prempeh Agyekum, James Dzisi Gadze

AI总结 提出Entropy Gate框架,通过熵淬火(一种热力学过程)逐步冻结低信息令牌,实现40-60%压缩率且语义保真度高于0.80,并证明其接近信息论极限。

详情
AI中文摘要

LLM流水线在低信息内容上浪费大量令牌预算:重复上下文、冗长响应和冗余模板。我们引入Entropy Gate,一种令牌压缩框架,应用熵淬火——一种热力学过程,逐步冻结低能量令牌同时保持语义保真度。每个令牌获得一个多因素信息能量$E(t)$,结合统计、结构和位置分量。自适应淬火计划$T(\tau) = T_0 / (1 + \alpha \tau)$移除玻尔兹曼生存概率$p_i = \exp(-E_i / kT)$低于阈值的令牌,并有一个保真度门在能量加权相似度低于$\theta$时停止压缩。我们证明按$E(t)$降序选择令牌最大化预期语义保留,淬火产生嵌套生存集,且可达压缩率接近信息论极限$\text{CR} \to 1 - I(P; T)/H(P)$。第一阶段启发式方法在五种提示类别上实现40-60%压缩,同时保持$S_E > 0.80$,能量平方放大$E \to E^2$额外增加10-25个百分点。上下文去重在重复块上节省50-70%。输出端淬火受简洁性提高准确性的发现启发,进一步减少响应开销。结合外部记忆,压缩在代理工作负载上复合乘数达到88-96%。该框架无状态、模型无关,并作为兼容OpenAI的HTTP代理部署。

英文摘要

LLM pipelines waste substantial token budgets on low-information content: repeated context, verbose responses, and redundant boilerplate. We introduce Entropy Gate, a token compression framework applying entropy quenching $-$ a thermodynamic process that progressively freezes out low-energy tokens while preserving semantic fidelity. Each token receives a multi-factor information energy $E(t)$ combining statistical, structural, and positional components. An adaptive quenching schedule $T(τ) = T_0 / (1 + ατ)$ removes tokens whose Boltzmann survival probability $p_i = \exp(-E_i / kT)$ falls below threshold, with a fidelity gate halting compression when energy-weighted similarity drops below $θ$. We prove token selection by descending $E(t)$ maximizes expected semantic preservation, that quenching produces nested survival sets, and that achievable compression approaches the information-theoretic limit $\text{CR} \to 1 - I(P; T)/H(P)$. A Phase 1 heuristic achieves 40-60% compression across five prompt categories while maintaining $S_E > 0.80$, with energy-squared amplification $E \to E^2$ adding 10-25 percentage points. Context deduplication adds 50-70% savings on repeated blocks. Output-side quenching, motivated by findings that brevity improves accuracy, further reduces response overhead. Combined with external memory, reduction composes multiplicatively to 88-96% for agentic workloads. The framework is stateless, model-agnostic, and deploys as an OpenAI-compatible HTTP proxy.

2606.03731 2026-06-03 cs.LG stat.ML

Conformal Language Modeling via Posterior Sampling

通过后验采样的共形语言建模

Nicolas Emmenegger, Theo X. Olausson, Armando Solar-Lezama, Chara Podimata

AI总结 提出通过近似LLM后验采样(条件为校准的高分区域)来替代事后过滤,实现目标风险控制并提高下游效用。

详情
AI中文摘要

大型语言模型仍然受到幻觉的困扰。最近的工作试图使用基于共形预测的统计技术来抑制其普遍性,取得了理论和实证上的成功。然而,这些方法以事后方式运作,将采样过程本身视为原子操作,然后通过外科手术式地修改样本来移除幻觉声明。这种过滤与生成之间的脱节可能导致样本不连贯、不一致,或者仅仅在模型本身下不太可能。此外,事后手术无法将概率质量转移到更有用和更有帮助的响应上。为了解决这些问题,我们提出从LLM后验的近似中采样,其中条件事件对应于一个校准的高分区域。我们开发了一种针对条件序列生成场景的校准程序,该程序能有效识别该区域并实现目标风险控制。在实证中,我们将我们的方法应用于以开放式的传记生成和数学问题解决为重点的案例研究;与先前的工作相比,我们获得了相同的统计保证,且下游效用更高。

英文摘要

Large Language Models remain plagued by hallucinations. Recent work has sought to tame their prevalence using statistical techniques based on conformal prediction, with both theoretical and empirical success. However, these methods operate in a post-hoc fashion, treating the sampling procedure itself as atomic and then surgically altering samples to remove hallucinated claims. This disconnect between filtering and generation can result in samples that are incoherent, inconsistent, or simply unlikely under the model itself. Moreover, post-hoc surgery is unable to shift probability mass towards more useful and helpful responses. To address these issues, we propose to instead sample from approximations to an LLM posterior, where the conditioning event corresponds to a calibrated, high-scoring region. We develop a calibration procedure tailored to the setting of conditional sequential generation that effectively identifies this region and achieves target risk control. Empirically, we apply our method to case studies focused on open-ended biography generation and mathematical problem solving; compared to prior work, we obtain the same statistical guarantees, with higher downstream utility.

2606.03728 2026-06-03 cs.CL cs.IR

Re-Ranking Through an Attribution Lens for Citation Quality in Legal QA

通过归因视角对法律问答中的引用质量进行重排序

Mohamed Hesham Elganayni, Selim Saleh

AI总结 针对法律问答中检索增强生成系统的引用质量问题,提出基于扰动归因分数训练轻量级交叉编码器对候选段落重排序,显著提升引用忠实度并与专家答案对齐。

详情
Comments
11 pages, 4 tables, 1 figure. Published at ASAIL 2026 (8th Workshop on Automated Semantic Analysis of Information in Legal Text), co-located with ICAIL 2026, Singapore
AI中文摘要

用于法律问答的检索增强生成系统通常基于语义相似度检索段落,并将其提供给语言模型,然后生成带引用的答案。先前的工作假设高排名的段落最有可能被模型有效引用。基于扰动的归因方法(如C-LIME)仅用于事后解释。然而,在AQuAECHR基准测试中,语义相似度与段落归因并不相关。在检索器的候选池中,基于相似度的排序在呈现黄金引用段落方面表现不如随机选择。为了解决这一局限性,我们训练了一个轻量级交叉编码器,基于连续的扰动归因分数在生成前对段落进行重排序。该方法在AQuAECHR基准测试上使用两个语言模型和五折交叉验证进行评估。重排序器显著提高了引用的忠实度以及与专家黄金答案的对齐程度。值得注意的是,在不同模型上独立训练的两个重排序器收敛程度超过了它们的原始归因一致性。这一发现表明,交叉编码器减少了模型特定的噪声,并产生了一个可部分跨模型传递的共享相关性信号,尽管同模型重排序仍然更有效。这些结果表明,基于扰动的归因为引用感知检索提供了一种实用的、模型无关的训练信号。

英文摘要

Retrieval-augmented generation systems for legal question answering typically retrieve passages based on semantic similarity and provide them to a language model, which then generates cited answers. Prior work assumes that highly ranked passages are most likely to be usefully cited by the model. Perturbation-based attribution methods, such as C-LIME, have been used exclusively for post-hoc explanation. However, on the AQuAECHR benchmark, semantic similarity does not correlate with passage attribution. Within a retriever's candidate pool, similarity-based ranking performs worse than random selection at surfacing gold citation paragraphs. To address this limitation, a lightweight cross-encoder is trained on continuous perturbation-based attribution scores to re-rank passages prior to generation. This approach is evaluated on the AQuAECHR benchmark, using two language models and five-fold cross-validation. The re-ranker substantially improves citation faithfulness and alignment with gold expert answers. Notably, two re-rankers trained independently on different models converge beyond their raw attribution agreement. This finding indicates that the cross-encoder reduces model-specific noise and produces a shared relevance signal that partially transfers across models, although same-model re-ranking remains more effective. These results demonstrate that perturbation-based attribution provides a practical, model-agnostic training signal for citation-aware retrieval.

2606.03723 2026-06-03 cs.LG

Compress then Merge: From Multiple LoRAs into One Low-Rank Adapter

先压缩后合并:从多个LoRA到一个低秩适配器

Zhengbao He, Ruiqi Ding, Zhehao Huang, Ruikai Yang, Tao Li, Xiaolin Huang

AI总结 针对多LoRA合并时全参数合并破坏低秩结构的问题,提出先压缩后合并(CtM)方法,通过共享子空间投影保证输出严格秩r,性能优于现有单LoRA基线。

详情
Comments
Accepted to ICML 2026. Code: https://github.com/ZhengbaoHe/compress-then-merge
AI中文摘要

低秩适配(LoRA)实现了基础模型的参数高效特化,但任务特定适配器的激增将能力分散到多个适配器中,使复用和部署复杂化。我们研究将$T$个LoRA合并为单个秩-$r$ LoRA的问题,从而保留低秩结构的优势。现有的先合并后压缩流水线将秩约束视为事后考虑:它们在完整参数空间中合并适配器,然后通过截断SVD将合并结果压缩到秩$r$。然而,全参数合并可能破坏低秩结构,使得后续压缩难以恢复有效的秩-$r$ LoRA。我们提出先压缩后合并(CtM),一种反向流水线,在合并前强制秩-$r$瓶颈:CtM仅使用LoRA权重计算共享的$r$维子空间以捕获跨适配器的公共结构,将每个适配器投影到共享子空间以获得$r\times r$坐标,然后在此缩减空间中应用标准合并规则。CtM通过构造保证秩-$r$ LoRA,避免了事后截断,并在由拼接的LoRA因子张成的核心空间中实现高效计算。跨多个模型和任务的实验表明,CtM持续优于现有的单LoRA输出基线,同时缩小了与全参数合并方法的性能差距。

英文摘要

Low-rank adaptation (LoRA) enables parameter-efficient specialization of foundation models, but the proliferation of task-specific adapters fragments capabilities across many adapters, complicating reuse and deployment. We study the problem of merging $T$ LoRAs into a single rank-$r$ LoRA, thereby preserving the benefits of low-rank structure. Existing Merge-then-Compress pipelines treat the rank constraint as an afterthought: they merge adapters in the full parameter space, then compress the merged result to rank $r$ via truncated SVD. However, full-parameter merging may destroy the low-rank structure, making it difficult for subsequent compression to recover an effective rank-$r$ LoRA. We propose Compress-then-Merge (CtM), a reversed pipeline that enforces the rank-$r$ bottleneck before merging: CtM computes shared $r$-dimensional subspaces using only the LoRA weights to capture cross-adapter common structure, projects each adapter into the shared subspaces to obtain $r\times r$ coordinates, and then applies standard merging rules in this reduced space. CtM guarantees a rank-$r$ LoRA by construction, avoiding post-hoc truncation, and enables efficient computation in the core space spanned by concatenated LoRA factors. Experiments across multiple models and tasks show that CtM consistently outperforms existing single-LoRA-output baselines while narrowing the performance gap to full-parameter merging methods.