arXivDaily arXiv每日学术速递 周一至周五更新
重置
2606.13397 2026-06-12 cs.HC cs.AI cs.CY 新提交

Mod-Guide: An LLM-based Content Moderation Feedback System to Address Insensitive Speech toward Indigenous Ethnic and Religious Minority Communities

Mod-Guide:一种基于LLM的内容审核反馈系统,用于解决针对原住民及少数族裔宗教群体的不敏感言论

Dipto Das, Achhiya Sultana, Ankit Singh Chauhan, Saadia Binte Alam, Mohammad Shidujaman, Shion Guha, Sunandan Chakraborty, Syed Ishtiaque Ahmed

AI总结 本文研究LLM审核系统对孟加拉国印度教和查克玛社区不敏感言论的认知局限,通过共同构建文化语料库和检索增强生成(RAG)方法开发Mod-Guide工具,提升模型对少数群体观点的敏感性。

详情
AI中文摘要

语言既是边缘化的机制,也是抵抗的机制,尤其是对于在网络上面对不敏感和有害言论的少数群体。随着内容审核越来越依赖大型语言模型(LLMs),人们开始担忧这些系统能否识别文化不敏感言论——即通过隐含的抹除、歪曲或规范性框架(而非公开敌意)忽视或边缘化历史上代表性不足社区的文化和宗教观点的言论。本文聚焦孟加拉国的印度教和查克玛社区——该国最大的宗教少数群体和原住民少数民族,研究了基于LLM的审核系统的认知局限,并探索融入少数群体视角的方法。我们与社区成员共同创建了一个文化敏感言论语料库,并使用检索增强生成(RAG)将他们的叙事整合到审核流程中。我们的工具Mod-Guide通过利用源自生活经验的上下文线索,提升了LLM对少数群体观点的敏感性。通过涉及少数群体和多数群体参与者的混合方法评估,我们证明RAG增强的审核响应在上下文上更准确,且不同族群对其感知存在差异。这项工作通过在前台化内容审核系统设计中的修复正义和诠释学包容,推进了人机交互、AI伦理和社会计算领域的研究。

英文摘要

Language operates as a mechanism of both marginalization and resistance, especially for minority communities navigating insensitive and harmful speech online. As content moderation increasingly depends on large language models (LLMs), concerns arise about whether these systems can recognize culturally insensitive speech-language that disregards or marginalizes the cultural and religious perspectives of historically underrepresented communities, often through implicit erasure, misrepresentation, or normative framing, rather than overt hostility. Focusing on Bangladesh's Hindu and Chakma communities -- the country's largest religious and Indigenous ethnic minorities, respectively -- this paper investigates the epistemic limits of LLM-based moderation systems and explores methods for incorporating minority perspectives. We co-created a culturally grounded corpus of insensitive speech with community members and integrated their narratives into moderation pipelines using retrieval augmented generation (RAG). Our tool, Mod-Guide, improves LLM sensitivity to minority viewpoints by leveraging contextual cues derived from lived experience. Through mixed-method evaluations involving both minority and majority participants, we demonstrate that RAG-enhanced moderation responses are more contextually accurate and perceived differently across ethnic lines. This work advances research in human-computer interaction, AI ethics, and social computing by foregrounding restorative justice and hermeneutical inclusion in the design of content moderation systems.

2606.13385 2026-06-12 cs.CR cs.AI cs.CY cs.HC cs.MM 新提交

Who Pays the Price? Stakeholder-Centric Prompt Injection Benchmarking for Real-world Web Agents

谁买单?面向真实世界网络代理的以利益相关者为中心的提示注入基准测试

Zihao Wang, Yiming Li, Yutong Wu, Zheyu Liu, Kangjie Chen, Fok Kar Wai, Pin-Yu Chen, Vrizlynn L. L. Thing, Bo Li, Dacheng Tao, Tianwei Zhang

AI总结 提出以利益相关者为中心的基准测试框架,系统分类和归因真实世界网络代理系统中的提示注入危害,揭示当前代理无法可靠抵抗任何攻击目标,且失败模式多样。

详情
Comments
32 pages
AI中文摘要

由大型语言模型驱动的网络代理越来越多地部署在真实环境中,它们在不受信任的网络内容上操作并执行具有直接后果的动作。这使得它们容易受到提示注入攻击,其中看似良性的内容嵌入了操纵代理行为的对抗性指令。现有的安全基准采用以攻击为中心的视角,关注注入的技术可行性,而忽略了由此产生的危害的细微分布。然而,在实践中,提示注入风险是受害者依赖的:单一漏洞可能对不同利益相关者产生不对称后果,同一攻击模式可能因目标不同而表现出显著不同的有效性。为了捕捉这些特性,我们引入了\sysname,一个以利益相关者为中心的基准,用于系统分类和归因真实世界网络代理系统中的危害。它区分受影响的实体(如用户、卖家、平台),将攻击分解为具体目标,并使用互补的结果和过程级指标评估每个案例。我们的结果揭示了显著且异质的漏洞:当前代理无法可靠抵抗任何单一攻击目标,失败分布在从“隐蔽寄生”(攻击成功而不干扰用户委托任务)到“错位破坏”(任务被破坏而攻击未成功)以及“复合失败”(对抗目标和任务完整性同时被违反)等不同模式。这些模式被传统评估所忽略,突显了在真实部署中对基于LLM的代理进行利益相关者感知评估的必要性。基准可在该https URL获取。

英文摘要

Web agents driven by large language models (LLMs) are increasingly deployed in real-world environments, where they operate over untrusted web content and execute actions with direct consequences. This makes them vulnerable to prompt-injection attacks, in which seemingly benign content embeds adversarial instructions that manipulate agent behaviour. Existing security benchmarks adopt an \textit{attack-centric} perspective, focusing on the technical feasibility of injections while overlooking the nuanced distribution of resulting harms. In practice, however, prompt-injection risk is victim-dependent: a single exploit can produce asymmetric consequences for different stakeholders, and the same attack pattern may exhibit substantially different effectiveness depending on whom it targets. To capture these properties, we introduce \textbf{\sysname}, a \textit{stakeholder-centric} benchmark to systematically categorize and attribute harm in real-world web agent systems. It distinguishes between affected entities (e.g., user, seller, platform), decomposes the attacks into concrete objectives, and evaluates each case with complementary outcome- and process-level metrics. Our results reveal substantial and heterogeneous vulnerabilities: not a single attack objective is reliably resisted by current agents, and failures distribute across qualitatively distinct modes ranging from \emph{stealthy parasitism} (attack succeeds without disrupting the user's delegated task) to \emph{misaligned disruption} (task disrupted without attack success) and \emph{compounded failure} (both adversarial objective and task integrity simultaneously violated). These patterns are missed by conventional evaluation, highlighting the need for stakeholder-aware assessment of LLM-based agents in real-world deployments. Benchmark is available at this https URL.

2606.13381 2026-06-12 cs.LG 新提交

Hölder++: Improving the Quality-Coherence Trade-off in Multimodal VAEs

Hölder++:改进多模态VAE中的质量-一致性权衡

Huyen Vo, María Martínez-García, Isabel Valera

AI总结 针对多模态VAE生成质量与语义一致性之间的权衡问题,提出Hölder++,通过精确Hölder池化、扩展架构和层次推理,在提升一致性的同时保持生成质量。

详情
Comments
Accepted at ICML 2026. Camera-ready version
AI中文摘要

现有的多模态变分自编码器(VAE)方法面临生成质量与一致性之间的权衡——即它们难以生成既真实多样又在各模态间语义一致的样本。最近的一项工作表明,使用Hölder池化的简单近似作为聚合方法,尽管假设所有模态共享单一表示,但能提高一致性超过SOTA MMVAE+。然而,它略微牺牲了样本多样性。受此启发,我们提出Hölder++,一种新颖的多模态VAE,通过以下方式改进生成质量-一致性权衡:(i) 首次实现无近似的Hölder池化用于多模态VAE;(ii) 扩展架构,建模不同的共享和私有(即模态特定)表示(Hölder+);(iii) 层次推理,进一步增强共享和私有表示之间的解耦(Hölder++)。我们的实验证实,Hölder++持续改进生成质量-一致性权衡,产生更结构化的潜在空间,并学习对下游任务信息丰富的共享表示。

英文摘要

Existing approaches for multimodal variational autoencoders (VAEs) face a trade-off between generative quality and coherence-i.e., they struggle to generate realistic and diverse samples that, at the same time, are semantically consistent across modalities. A recent work shows that using a simple approximation to Hölder pooling as an aggregation method improves coherence over the SOTA MMVAE+, despite assuming a single shared representation across all modalities. Yet, it slightly compromises sample diversity. Inspired by this insight, we propose Hölder++, a novel multimodal VAE that improves the generative quality-coherence trade-off through: (i) the first implementation of Hölder pooling without any approximation for multimodal VAEs; (ii) an extended architecture that models distinct shared and private (i.e., modality-specific) representations (Hölder+); and (iii) hierarchical inference that further enhances the disentanglement between the shared and private representations (Hölder++). Our experiments corroborate that Hölder++ consistently improves the generative quality-coherence trade-off, yields more structured latent spaces, and learns shared representations that are informative for downstream tasks.

2606.13376 2026-06-12 cs.CV 新提交

MoVerse: Real-Time Video World Modeling with Panoramic Gaussian Scaffold

MoVerse: 基于全景高斯支架的实时视频世界建模

Yang Zhou, Ziheng Wang, Yuqin Lu, Haofeng Liu, Jun Liang, Shengfeng He, Jing Li

AI总结 提出MoVerse,从单张窄视场图像实时构建可交互漫游的360度全景世界,通过拓扑感知扩散补全视场、全景几何残差预测生成3D高斯支架,并结合双向扩散教师蒸馏为因果自回归学生实现低延迟视频渲染。

详情
AI中文摘要

我们提出MoVerse,一个实时视频世界模型,能够从单张窄视场图像创建可交互导航的场景。该设置具有挑战性,因为输入仅观察到环境的一小部分,而交互式漫游需要完整的周围世界、持久的几何结构、可控的相机运动以及时间上一致的高保真观测。MoVerse通过将世界构建与观测渲染分离来解决这个问题。它首先使用拓扑感知扩散将输入扩展为重力对齐的360°全景图,在3D推理之前闭合缺失的视场。然后,利用全景几何感知残差预测将全景图提升为持久的3D高斯支架,形成密集且可直接渲染的空间记忆。最后,一个高斯条件视频渲染器将沿用户指定相机轨迹的支架渲染结果转换为逼真的视频。为了使该渲染器适用于交互,我们训练了一个双向扩散教师用于高质量条件渲染,并将其蒸馏为一个因果自回归学生以实现有界延迟流式传输。这种设计结合了显式3D表示的可控性和长程一致性以及生成视频模型的感知质量。MoVerse在单个NVIDIA RTX 4090 GPU上支持8 FPS的实时场景漫游,展示了通往具有交互式视频输出的单图像世界创建的实用路径。

英文摘要

We present MoVerse, a real-time video world model that creates an interactively navigable scene from a single narrow-field-of-view image. This setting is challenging because the input observes only a small fraction of the environment, while interactive roaming requires a complete surrounding world, persistent geometry, controllable camera motion, and temporally coherent high-fidelity observations. MoVerse addresses this problem by separating world construction from observation rendering. It first expands the input into a gravity-aligned 360$^\circ$ panorama with topology-aware diffusion, closing the missing field of view before 3D reasoning. It then lifts the panorama into a persistent 3D Gaussian scaffold using panoramic geometry-aware residual prediction, yielding a dense and directly renderable spatial memory. Finally, a Gaussian-conditioned video renderer translates scaffold renderings along user-specified camera trajectories into photorealistic video. To make this renderer practical for interaction, we train a bidirectional diffusion teacher for high-quality conditional rendering and distill it into a causal autoregressive student for bounded-latency streaming. This design combines the controllability and long-range consistency of explicit 3D representations with the perceptual quality of generative video models. MoVerse supports real-time scene roaming at 8~FPS on a single NVIDIA RTX~4090 GPU, demonstrating a practical path toward single-image world creation with interactive video output.

2606.13368 2026-06-12 cs.AI cs.CV 新提交

IterCAD: An Iterative Multimodal Agent for Visually-Grounded CAD Generation and Editing

IterCAD:一种用于视觉引导的CAD生成与编辑的迭代多模态智能体

Tao Hu, Jiaxin Ai, Licheng Wen, Xueheng Li, Shu Zou, Siqi Li, Nianchen Deng, Xinyu Cai, Hongbin Zhou, Pinlong Cai, Daocheng Fu, Yu Yang, Hairong Zhang, Botian Shi, Xuemeng Yang

AI总结 提出IterCAD,一种闭环交互式CAD生成与编辑的多模态智能体框架,通过渐进式SFT和几何感知强化学习优化,在代码可执行性和几何精度上显著超越现有方法。

详情
AI中文摘要

计算机辅助设计在现代制造业中至关重要,然而现有的自动化方法主要依赖于开环、一次性生成,与迭代的实际实践不匹配。在本文中,我们提出了IterCAD,一个统一的闭环交互式CAD生成与编辑的多模态智能体框架。我们将任务形式化为多模态智能体与可执行CAD沙箱之间的多轮交互,涵盖三个任务:绘图到代码、文本到代码和交互式编辑。为此,我们开发了一个数据合成流水线,结合先进的工业制造特征,生成符合标准的多视图工程图纸、复杂的代码编辑任务和高保真交互轨迹。我们通过渐进式SFT,然后结合几何感知强化学习和可行前缀掩码来优化智能体,以增强代码可执行性和几何保真度。最后,我们引入了IterCAD-Bench评估套件,并提出了Chamfer距离容忍度-召回率(CD-TR)曲线及其AUC-TR指标,建立了一个无幸存者偏差的标准,统一了代码有效性和几何精度。大量实验表明,IterCAD在多个基准测试中取得了极具竞争力的性能,在代码可执行性和几何精度上显著优于现有方法,并在闭环迭代优化中展现出卓越的能力。

英文摘要

Computer-Aided Design is pivotal in modern manufacturing, yet existing automated methods predominantly rely on open-loop, one-shot generation, creating a mismatch with iterative real-world practices. In this paper, we present IterCAD, a unified multimodal agent framework for closed-loop, interactive CAD generation and editing. We formulate the task as a multi-turn interaction between a multimodal agent and an executable CAD sandbox, covering three tasks: Drawing-to-Code, Text-to-Code, and Interactive Editing. To support this, we develop a data synthesis pipeline incorporating advanced industrial manufacturing features to generate standard-compliant multi-view engineering drawings, complex code-editing tasks, and high-fidelity interaction trajectories. We optimize the agent via progressive SFT followed by geometry-aware reinforcement learning with viable-prefix masking to enhance code executability and geometric fidelity. Finally, we introduce the IterCAD-Bench evaluation suite and propose the Chamfer Distance Tolerance-Recall (CD-TR) curve alongside its AUC-TR metric, establishing a survivor-bias-free standard that unifies code validity and geometric precision. Extensive experiments demonstrate that IterCAD achieves highly competitive performance across multiple benchmarks, significantly outperforming existing approaches in both code executability and geometric precision, while exhibiting superior capabilities in closed-loop iterative refinement.

2606.13347 2026-06-12 cs.LG 新提交

Enhanced Low-Density Region Exploration in Classifier-Guided Diffusion Models Through Modified Reverse Diffusion Sampling

改进反向扩散采样在分类器引导扩散模型中的低密度区域探索

Jagriti Singh, Shekhar Verma, Muneendra Ojha

AI总结 提出一种无需额外训练的采样时间密度感知方法,通过修改分类器梯度引导轨迹朝向低置信区域并引导采样朝向预测真实图像,以增强扩散模型对低密度区域的探索。

详情
AI中文摘要

扩散模型已成为高保真图像合成的最先进生成模型,特别是在无分类器引导和分类器引导形式中。然而,标准分类器引导将概率质量集中在高密度类均值周围,导致对类条件分布尾部罕见样本的覆盖不足。最近关于基于扩散的尾部采样的工作通过训练一个额外的低密度寻求分类器(使用合成与真实判别器)来缓解这一问题,但代价是额外的网络和训练。与此同时,许多采样器和蒸馏技术加速或改进扩散采样,但并未明确解决长尾覆盖问题。我们提出一种纯采样时间、密度感知的分类器引导条件扩散模型扩展,针对低密度区域且无需任何额外训练。我们像大多数扩散模型一样,对噪声图像应用引导而非预测噪声。从预训练的ImageNet条件扩散模型和分类器开始,我们通过修改分类器梯度将轨迹引导向低置信区域,并在每个时间步引导采样过程朝向预测的真实图像,从而修改引导反向动力学。第一个引导有助于探索低概率样本,第二个引导有助于生成接近真实数据流形的样本。所提出的采样器在64x64分辨率下一致提高了ADM模型的召回率,同时保持可比的FID,并且使用256x256 ADM模型,我们展示了两种引导不同组合的视觉结果。我们还表明,标准ADM分类器引导结合预测真实图像引导,有助于在ImageNet上使用256x256 ADM模型生成高感知质量的样本。

英文摘要

Diffusion models have emerged as state-of-the-art generative models for high-fidelity image synthesis, particularly in their classifier-free guided and classifier-guided forms. However, standard classifier guidance concentrates probability mass around high-density class mean, leading to poor coverage of rare samples in the tails of the class-conditional distributions. Recent work on diffusion-based tail sampling mitigates this by training an additional low-density-seeking classifier with a synthetic-vs-real discriminator, at the cost of additional networks and training. In parallel, a number of samplers and distillation techniques accelerate or refine diffusion sampling, but do not explicitly address long-tail coverage. We propose a purely sampling-time, density-aware extension of classifier-guided conditional diffusion model that targets low-density regions without any additional training. We have applied guidance at noisy images not on predicted noise like most diffusion models. Starting from a pretrained conditional diffusion model and classifier on ImageNet, we modify the guided reverse dynamics by steering trajectories toward low-confidence regions via the modified classifier gradient, and at each time step, we also guide the sampling process toward the predicted real image. 1st guidance helps explore low-probability samples, and 2nd guidance helps to generate samples to be close to the real data manifold. The proposed sampler consistently improves ADM model recall at 64x64 resolution while maintaining a comparable FID, and with a 256x256 ADM model, we showed the results visually with different combinations of both guidance. We also showed that standard ADM classifier guidance, combined with predicted real image guidance, helps generate high perceptual quality samples with a 256x256 ADM model on ImageNet.

2606.13300 2026-06-12 cs.LG 新提交

Quantizing Time-Series Models As Dynamical Systems: Trajectory-Based Quantization Sensitivity Score

将时间序列模型量化为动力系统:基于轨迹的量化敏感度评分

Mariya Pavlova, Harrison Bo Hua Zhu, Elizsveta Semenova, Yingzhen Li

AI总结 提出基于轨迹的量化敏感度评分(TQS),从动力系统稳定性角度分析量化误差传播,实现无需校准数据的混合精度量化。

详情
Comments
ICML 2026, Workshop on Forecasting as a New Frontier of Intelligence
AI中文摘要

我们引入了基于轨迹的量化敏感度评分(TQS),这是一种通过动力系统稳定性视角重新定义训练后量化(PTQ)的指标。通过将网络的展开建模为离散时间动力系统,TQS 描述了量化引起的误差如何在展开时间范围内传播和放大。与传统的 PTQ 方法不同,传统方法中敏感度分析通常与量化过程耦合,而 TQS 实现了先验的敏感度估计,与量化器选择和位宽分配解耦。这种分离允许即使在具有融合算子的黑盒或编译网络中进行量化预算规划。在此基础上,我们提出了 TQS-PTQ,一个灵活的混合精度框架,不需要校准数据或昂贵的二阶近似。我们的实验表明,动力系统视角为资源受限环境下的低精度部署提供了一条稳健且高性能的路径。

英文摘要

We introduce the Trajectory-based Quantization Sensitivity Score (TQS), a metric that reframes post-training quantization (PTQ) through the lens of dynamical-systems stability. By modeling the network's rollout as a discrete-time dynamical system, TQS characterizes how quantization-induced errors propagate and amplify over the rollout horizon. Unlike conventional PTQ methods, where sensitivity analysis is often coupled to the quantization procedure, TQS enables a priori sensitivity estimation decoupled from quantizer selection and bit-width assignment. This separation allows for quantization budget planning even for black-box or compiled networks with fused operators. Building on this, we present TQS-PTQ, a flexible mixed-precision framework that requires no calibration data or costly second-order approximations. Our experiments show that a dynamical-systems perspective provides a robust, high-performing pathway for low-precision deployment in resource-constrained settings.

2606.13298 2026-06-12 cs.SE cs.AI 新提交

Mining Architectural Quality Under Agentic AI Adoption: A Causal Study of Java Repositories

在智能体AI采用下的架构质量挖掘:Java仓库的因果研究

Oliver Aleksander Larsen, Mahyar T. Moghaddam

AI总结 通过差分差分设计和Borusyak插值估计器,研究智能体AI工具采用对Java仓库架构气味密度(ASD)的因果影响,发现ASD下降6.7%源于代码量增长,而非架构改进。

详情
Comments
16 pages. Accepted for presentation at the 52nd Euromicro Conference on Software Engineering and Advanced Applications (SEAA) 2026, Krakow, Poland, 2-4 September 2026, and for publication in the Springer LNCS proceedings. This is the author's accepted manuscript
AI中文摘要

AI编码工具现已被大多数开发者使用,这些工具的智能体化使用普及了俗称“氛围编码”的实践。然而,关于其对软件架构影响的因果证据却很少。先前的因果工作衡量了代码层面的结果(复杂度、静态分析警告);这种退化是否会传播到架构层面仍未知。我们挖掘了151个开源Java仓库,其中74个检测到智能体AI采用(通过配置文件和Co-Authored-By提交尾注识别),以及77个倾向得分匹配的对照仓库,每个仓库跨越13个月,生成1,811个月度Arcan快照。我们采用交错差分差分设计和Borusyak插值估计器,估计采用对架构气味密度(ASD)的因果效应,将近期用于代码层面指标的因果设计应用于架构层面。总气味计数基本不变(+1.1%,p=0.82),而代码行数增长12.8%(p=0.003);因此,ASD下降6.7%(p=0.004)是分母效应而非架构改进。按类型估计和稳健性检验(wild cluster bootstrap、Lee bounds、陈旧观测敏感性)证实了这一模式;预处理趋势平坦(Wald p=0.90),与平行趋势一致。当处理影响系统规模时,密度归一化结果可能产生误导:对AI工具采用的因果挖掘研究需要原始计数和显式分解。完整的复现包,包括精心整理的151个仓库月度面板,已公开提供。

英文摘要

AI coding tools are now used by a majority of developers, and agentic use of these tools has popularized the practice colloquially called "vibe coding". Yet causal evidence on their effect on software architecture is scarce. Prior causal work has measured code-level outcomes (complexity, static analysis warnings); whether such degradation propagates to architecture-level outcomes remains unknown. We mine 151 open-source Java repositories, 74 with detectable agentic AI adoption (identified via configuration files and Co-Authored-By commit trailers) and 77 propensity-matched controls, across a 13-month per-repository window yielding 1,811 monthly Arcan snapshots. We estimate the causal effect of adoption on architectural smell density (ASD) with a staggered difference-in-differences design and the Borusyak imputation estimator, applying a causal design recently used for code-level metrics to the architecture level. Total smell counts are essentially unchanged (+1.1%, p = 0.82) while lines of code grow +12.8% (p = 0.003); the resulting 6.7% ASD decline (p = 0.004) is therefore a denominator effect rather than an architectural improvement. Per-type estimates and robustness checks (wild cluster bootstrap, Lee bounds, stale-observation sensitivity) corroborate the pattern; pre-trends are flat (Wald p = 0.90), consistent with parallel trends. Density-normalized outcomes can mislead when treatment affects system size: raw counts and explicit decomposition are required for causal mining studies of AI tool adoption. The complete replication package, including the curated 151-repository monthly panel, is publicly available.

2606.13282 2026-06-12 cs.AI 新提交

ERTS: Adversarial Robustness Testing of Ethical AI via Semantic Perturbation in a Bounded Consequence Space

ERTS: 通过有界后果空间中的语义扰动进行伦理AI的对抗鲁棒性测试

Pratyush Chaudhari

AI总结 提出伦理鲁棒性测试系统(ERTS),通过有界伦理后果空间、语义扰动和领域自适应评估,测试AI在伦理推理中的对抗鲁棒性,实验表明仅33%模型通过测试。

详情
Comments
8 pages, 10 tables
AI中文摘要

随着AI系统在医疗分诊、自动驾驶和就业筛选等高风险的伦理场景中部署,评估其对伦理推理的对抗性操纵鲁棒性的形式化方法仍不成熟。本文介绍了伦理鲁棒性测试系统(ERTS),一个闭环管道框架,它:(1) 将伦理困境编码为基于既定伦理理论的22维伦理后果空间(ECS);(2) 应用17种语义扰动函数,受6种有效性约束类别(包括一种新颖的语义一致性约束)约束;(3) 通过4分量伦理不稳定性指数(EII)测量决策偏差;(4) 生成领域自适应的部署前鲁棒性评估判定。我们评估了4个结构化基线模型和2个生产级LLM(Gemini 2.0 Flash和Llama 3.2),涵盖8个部署领域的50个伦理场景,生成了1500个对抗测试用例。结果表明,仅33%的模型通过评估审核,其中本地Llama-3.2模型特别容易受到公平性破坏和信息退化攻击(ERS = 0.737)。据我们所知,现有框架中没有将有限伦理后果空间、语义一致性约束和领域自适应评估结合在单个对抗测试管道中的。

英文摘要

As AI systems are deployed in high-stakes ethical contexts such as healthcare triage, autonomous vehicle control, and employment screening, formal methods for evaluating their robustness against adversarial manipulation of ethical reasoning remain underdeveloped. This paper introduces the Ethical Robustness Testing System (ERTS), a closed-pipeline framework that: (1) encodes ethical dilemmas into a 22-dimensional Ethical Consequence Space (ECS) grounded in established ethical theory; (2) applies 17 semantic perturbation functions subject to 6 validity constraint classes including a novel semantic coherence constraint; (3) measures decision deviation via a 4-component Ethical Instability Index (EII); and (4) produces domain-adaptive pre-deployment robustness assessment verdicts. We evaluate 4 structured baseline models and 2 production LLMs (Gemini 2.0 Flash and Llama 3.2) across 50 ethical scenarios spanning 8 deployment domains, generating 1,500 adversarial test cases. Results demonstrate that only 33% of models achieve assessment clearance, with the local Llama-3.2 model proving particularly vulnerable to fairness corruption and information degradation attacks (ERS = 0.737). To the best of our knowledge, no existing framework combines a bounded ethical consequence space, semantic coherence constraints, and domain-adaptive assessment in a single adversarial testing pipeline.

2606.13279 2026-06-12 cs.RO 新提交

See Selectively, Act Adaptively: Dual-Level Structural Decomposition for Bimanual Robot Manipulation

选择性观察,适应性行动:双水平结构分解用于双臂机器人操作

Yoon-Ji Choi, Young-Chae Son, Soo-Chul Lim

AI总结 提出基于双水平结构分解的双臂操作VLA框架,通过视觉选择路由和动作专家混合机制分别处理视觉相关性和双臂交互模式,在模拟和真实任务中成功率分别提升27.7%和43.3%。

详情
AI中文摘要

在双臂机器人操作中,任务相关的视觉信息随任务阶段和上下文变化,而两臂的交互在独立和协调模式之间切换,使得策略学习具有挑战性。然而,现有的整体式视觉-语言-动作(VLA)策略通过单一共享表示和动作生成路径处理多样的视觉输入和交互模式,往往无法分别考虑视觉相关性和双臂交互结构。为了解决这个问题,我们提出了一个基于双水平结构分解的双臂操作VLA框架。视图选择视觉路由器动态调整腕部视角的贡献以强调相关视觉线索,而交互感知动作专家混合(MoE)将动作生成分解为协调和单臂路径,以适应不同的双臂交互模式。我们在RoboTwin 2.0中的六个模拟双臂操作任务和三个长时域真实世界任务上评估了所提方法。我们的模型在模拟和真实世界评估中,整体平均成功率分别比整体式基线提高了27.7%和43.3%,并且在两种设置下始终优于单模块变体。这些结果表明,联合考虑选择性视觉处理和双臂交互结构的显式分解为鲁棒的双臂操作提供了有效的归纳偏置。

英文摘要

In bimanual robotic manipulation, task-relevant visual information varies with the task stage and context, while the interaction of the two arms shifts between independent and coordinated modes, making policy learning challenging. However, existing monolithic Vision-Language-Action (VLA) policies process diverse visual inputs and interaction patterns through a single shared representation and action generation pathway, often failing to separately account for visual relevance and bimanual interaction structure. To address this issue, we propose a bimanual manipulation VLA framework based on Dual-Level Structural Decomposition. The View-Selective Visual Router dynamically adjusts wrist-view contributions to emphasize relevant visual cues, while the Interaction-Aware Action Mixture-of-Experts (MoE) decomposes action generation into coordinated and arm-wise pathways to adapt to varying bimanual interaction modes. We evaluate the proposed method on six simulated bimanual manipulation tasks in RoboTwin 2.0 and three long-horizon real-world tasks. Our model improves the overall average success rate over a monolithic baseline by 27.7% in simulation and 43.3% in real-world evaluation, while consistently outperforming single-module variants across both settings. These results demonstrate that jointly considering selective visual processing and explicit decomposition of bimanual interaction structures provides an effective inductive bias for robust bimanual manipulation.

2606.13275 2026-06-12 cs.CV 新提交

Zero-Shot Captioning for Cultural Heritage: Automated Image Analysis of Traditional Indonesian Clothing

文化遗产的零样本描述:印度尼西亚传统服装的自动化图像分析

Anugrah Aidin Yotolembah, Novanto Yudistira, Gembong Edhi Setyawan

AI总结 提出Custom ZeroCLIP框架,利用检索增强的视觉-语言模型,在零样本设置下为印度尼西亚传统服装生成描述,在8个未见省份上取得优于基线的性能。

详情
Comments
accepted to ICME workshop on AIART 2026
AI中文摘要

本文提出了Custom ZeroCLIP,一个用于印度尼西亚传统服装零样本描述的检索增强视觉-语言框架。数据集包含来自印度尼西亚所有38个省份的3,800张专家标注图像。采用省份级归纳零样本协议,模型在24个可见省份上训练,在6个可见省份上验证,并在8个未见省份上评估。该框架结合了冻结的CLIP ViT-B/32图像编码器、CLIP文本编码器、BERT文本编码器和LSTM描述解码器。在推理过程中,未见省份的标签和描述不可用,检索仅使用训练省份的描述。训练、验证或检索库构建过程中未使用任何未见省份的图像、标签或描述。Custom ZeroCLIP实现了0.8536的CLIPScore、0.3342的BLEU-4和0.4859的METEOR,优于现有基线。消融实验表明,检索提高了文化词汇的恢复能力,METEOR提升了19.3%,而人工评估证实了更强的文化准确性和流畅性。结果证明了检索增强的领域自适应在低资源文化遗产环境下生成文化基础描述的有效性。数据集可在以下网址公开获取:https://this https URL。

英文摘要

This paper presents Custom ZeroCLIP, a retrieval-augmented vision-language framework for zero-shot captioning of Indonesian traditional garments. The dataset contains 3,800 expert-annotated images from all 38 Indonesian provinces. Using a province-level inductive zero-shot protocol, the model is trained on 24 seen provinces, validated on 6 seen provinces, and evaluated on 8 unseen provinces. The framework combines a frozen CLIP ViT-B/32 image encoder, a CLIP text encoder, a BERT text encoder, and an LSTM caption decoder. During inference, unseen-province labels and captions are unavailable, and retrieval uses only captions from training provinces. No unseen-province image, label, or caption is used during training, validation, or retrieval-bank construction. Custom ZeroCLIP achieves a CLIPScore of 0.8536, BLEU-4 of 0.3342, and METEOR of 0.4859, outperforming existing baselines. Ablation results show that retrieval improves cultural vocabulary recovery with a 19.3\% METEOR gain, while human evaluation confirms stronger cultural accuracy and fluency. The results demonstrate the effectiveness of retrieval-augmented domain adaptation for culturally grounded caption generation in low-resource heritage settings. The dataset is publicly available at this https URL.

2606.13267 2026-06-12 cs.CV cs.CL cs.IR 新提交

TimeLens: On-Device Artifact Recognition with Retrieval-Augmented Question Answering for the Grand Egyptian Museum

TimeLens: 面向大埃及博物馆的基于检索增强问答的设备端文物识别

Rawan Hesham, Ali Ashraf, Amr Ahmed, Malak Alaa, Omar Ahmed, Omar Wagih

AI总结 针对博物馆场景中的细粒度视觉相似性、训练数据与手持相机差距以及AI幻觉问题,提出设备端文物检测器与双语检索增强生成(RAG)问答系统,实现实时识别与可靠问答。

详情
Comments
6 pages, 4 figures, 5 tables. Submitted to AIVRCH 2026
AI中文摘要

TimeLens 是一款面向大埃及博物馆(GEM)的 AI 驱动双语移动导览应用。游客将手机对准展品时,可实时识别文物,并针对后续问题获得英语或阿拉伯语回答。本工作解决了馆内部署特有的三个问题:51 件编目文物(许多近乎相同的拉美西斯雕像)间的细粒度视觉相似性、策展训练数据与手持相机条件之间的差距,以及 AI 导览陈述未经证实的历史事实的风险。报告了两项工程贡献。首先,通过数据质量驱动的迭代研究——从基础模型自动标注(YOLO-World),经过空间标签清理规则,到完全人工标注的数据集——开发了设备端文物检测器,将标签质量确定为决定性因素:最终的 YOLOv8n 模型解决了所有先前失败的类别,同时保持为 5.97 MB 的 TensorFlow Lite 资产,可在中端手机上实时运行(mAP@0.5 = 0.995,mAP@0.5:0.95 = 0.924)。其次,基于 108 条记录的 ChromaDB 知识库的双语检索增强生成(RAG)导览,在七个候选语言模型上进行了基准测试,选定了 Gemma 4 E2B(Q4 K M);十项针对性优化将端到端延迟从超过 30 秒降低到约 10 秒。两个子系统集成在一个生产级 Flutter 应用中,具有双语界面、博物馆位置门控和文本转语音支持。

英文摘要

TimeLens is an AI-powered bilingual mobile guide for the Grand Egyptian Museum (GEM). Pointing a phone at an exhibit, a visitor sees the artifact recognized in real time and can ask follow-up questions answered in English or Arabic. The work addresses three problems specific to in-gallery deployment: fine-grained visual similarity among 51 catalogued artifacts (many near-identical Ramesside statues), the gap between curated training data and handheld camera conditions, and the risk of an AI guide stating unsupported historical facts. Two engineering contributions are reported. First, an on-device artifact detector was developed through a data-quality-driven iteration study -- from foundation-model auto-annotation (YOLO-World), through spatial label-cleaning rules, to a fully hand-annotated dataset -- isolating label quality as the decisive factor: the final YOLOv8n model resolves every previously failing class while remaining a 5.97 MB TensorFlow Lite asset that runs in real time on a mid-range phone (mAP@0.5 = 0.995, mAP@0.5:0.95 = 0.924). Second, a bilingual Retrieval-Augmented Generation (RAG) guide, grounded in a 108-record ChromaDB knowledge base, was benchmarked across seven candidate language models, with Gemma 4 E2B (Q4 K M) selected; ten targeted optimizations reduce end-to-end latency from over 30 s to approximately 10 s. Both subsystems are integrated in a production Flutter application with bilingual interface, museum location gating, and text-to-speech support.

2606.13256 2026-06-12 cs.RO cs.AI 新提交

Humor Style Drives Laughter, Topic Shapes Acceptability: Evaluating Bilingual Personal and Political Robot-Delivered AI Jokes

幽默风格驱动笑声,话题塑造可接受性:评估双语个人与政治机器人交付的AI笑话

Anna-Maria Velentza, Anne-Gwenn Bosser

AI总结 本研究通过混合因素设计,评估机器人用双语讲AI生成笑话时,幽默类型(亲和、自我增强、攻击、自贬)和内容(个人vs政治)对趣味性和适当性的影响,发现幽默类型显著影响趣味性,内容影响适当性,语言偏好受内容及参与者流利度影响。

详情
Comments
Accepted in the 35th IEEE International Conference on Robot and Human Interactive Communication (RO-MAN 2026), Kitakyushu, Fukuoka, Japan
AI中文摘要

幽默在人类社交关系中扮演核心角色,计算幽默的最新进展为将幽默融入人机交互(HRI)创造了新机会。虽然大型语言模型(LLMs)能生成多种形式的幽默,但在群体环境中,幽默风格、笑话内容和语言偏好如何影响对机器人传递幽默的感知仍不清楚。在这项探索性研究中,我们采用混合因素设计,让参与者在大学教室中评估由机器人传递的AI生成笑话。我们考察了幽默类型(亲和型、自我增强型、攻击型、自贬型)和笑话内容(个人相关vs政治)对感知趣味性和适当性的影响,以及语言偏好。结果表明,幽默类型显著影响趣味性,攻击型和亲和型幽默评分更高;而笑话内容主要影响适当性,个人相关笑话优于政治笑话。语言偏好受笑话内容和参与者自我报告的流利度及幽默实践的影响。

英文摘要

Humor plays a central role in human social relationships, and recent advances in computational humor create new opportunities for integrating humor into human-robot interaction (HRI). While large language models (LLMs) can generate diverse forms of humor, it remains unclear how humor style, joke content, and language preference shape perceptions of robot-delivered humor in group settings. In this exploratory study, we employed a mixed factorial design in which participants evaluated AI-generated jokes delivered by a robot in a university classroom. We examined the effects of humor type (Affiliative, Self-Enhancing, Aggressive, Self-Defeating) and joke content (person-related vs. political) on perceived funniness and appropriateness, as well as preferred language. Results show that humor type significantly influences funniness, with Aggressive and Affiliative humor rated higher, while joke content primarily affects appropriateness, with person-related jokes preferred over political ones. Language preference was shaped by both joke content and participants' self-reported fluency and humor practices.

2606.13254 2026-06-12 cs.CL 新提交

Evaluating Pluralism in LLMs through Latent Perspectives

通过潜在视角评估LLM中的多元主义

Laura Majer, Jan Šnajder, Martin Tutek

AI总结 提出一种领域无关的多层无监督框架,从LLM生成文本中提取潜在视角,评估多元主义差距,发现稀有视角仍被不成比例地低估。

详情
Comments
Pluralistic Alignment Workshop @ ICML 2026
AI中文摘要

对代表多样化视角的需求日益增长,增加了对多元主义LLM生成的兴趣。尽管难以操作化,但识别文本中表达的视角将为多元主义对齐提供明确指导,并更清晰地阐明LLM生成中的多元主义差距。虽然模型已被证明会减少训练数据的多样性并生成同质化内容,但这主要是在多项选择问卷或使用自由文本的高层特征上得到证明。在本文中,我们介绍并实现了一个领域无关的多层无监督框架,用于提取适合识别LLM生成文本中多元主义差距的视角。我们在书评(一个高度意见化、代表多样化视角的数据集)上评估了该框架,并比较了各种提示和模型。我们的结果表明,虽然一些模型和提示技术接近覆盖广泛的视角,但稀有视角仍然不成比例地被低估,导致分布偏离人类文本。

英文摘要

The growing need to represent diverse perspectives has increased interest in pluralistic LLM generation. Although difficult to operationalize, identifying perspectives expressed in text would provide clear guidance on pluralistic alignment and more clearly articulate the pluralistic gap in LLM generation. While models have been shown to reduce the diversity of training data and generate homogeneously, this has been demonstrated primarily on multiple-choice questionnaires or using high-level characteristics of free-form text. In this paper, we introduce and implement a domain-agnostic multi-layered framework for unsupervised extraction of perspectives suitable for identifying the pluralistic gap in LLM-generated text. We evaluate our framework on book reviews, a highly opinionated dataset representing diverse perspectives, and compare various prompts and models. Our results show that while some models and prompting techniques come close to covering a broad spectrum of perspectives, rarer perspectives remain disproportionately underrepresented, resulting in distributions that diverge from human text.

2606.13239 2026-06-12 cs.SE cs.AI cs.CL cs.CV 新提交

ComAct: Reframing Professional Software Manipulation via COM-as-Action Paradigm

ComAct: 通过COM即行动范式重构专业软件操作

Jiaxin Ai, Tao Hu, Xuemeng Yang, Shu Zou, Hairong Zhang, Daocheng Fu, Yu Yang, Hongbin Zhou, Nianchen Deng, Pinlong Cai, Zhongyuan Wang, Botian Shi, Kaipeng Zhang, Licheng Wen

AI总结 提出COM即行动范式,将专业软件交互转化为确定性程序合成,解决GUI代理的脆弱性和API代理的异构性问题;构建ComCADBench基准和ComActor自校正代理,在工业CAD软件上实现SOTA性能。

详情
AI中文摘要

现有的计算机使用代理在专业软件操作上仍然存在根本性限制:基于GUI的代理受困于脆弱的视觉基础和长程错误累积,而基于API的方法则难以应对异构协议和不可访问的商业接口。在这项工作中,我们将组件对象模型(COM)识别为统一的、可执行的抽象,提出了COM即行动:一种新的范式,将专业软件交互重新定义为确定性程序合成,而非顺序视觉控制。为了在最苛刻的环境中验证这一范式,我们引入了ComCADBench,这是首个针对操作真实工业CAD软件的代理的基准测试。我们的实验揭示了显著的范式差距:前沿的专有模型在基于GUI的交互下几乎无法成功,而基于COM的执行则带来了实质性的即时收益。为了弥合语法正确性与几何精度之间的剩余差距,我们开发了ComActor,一个通过渐进式三阶段框架训练的自校正代理,以及ComForge,一个用于在Windows容器中进行大规模训练的可扩展平台。大量实验表明,ComActor在ComCADBench上达到了最先进的性能,在基线崩溃的长程任务中表现出强大的韧性,并泛化到外部CAD基准测试。

英文摘要

Existing computer-use agents remain fundamentally limited in professional software manipulation: GUI-based agents suffer from fragile visual grounding and long-horizon error accumulation, while API-basedapproaches struggle with heterogeneous protocols and inaccessible commercial interfaces. In this work,we identify the Component Object Model (COM) as a unified executable abstraction, proposing COM-as-Action: a new paradigm that reframes professional software interaction as deterministic program synthesisrather than sequential visual control. To validate this paradigm in the most demanding environments, weintroduce ComCADBench, the first benchmark for agents operating real industrial CAD software. Ourexperiments reveal a substantial paradigm gap: frontier proprietary models achieve near-zero successunder GUI-based interaction, whereas COM-based execution yields substantial immediate gains. Tobridge the remaining gap between syntactic correctness and geometric accuracy, we develop ComActor, aself-correcting agent trained through a progressive three-stage framework, alongside ComForge, a scalableplatform for large-scale training in Windows containers. Extensive experiments show that ComActorachieves state-of-the-art performance on ComCADBench, with strong resilience in long-horizon taskswhere baselines collapse, and generalizes to external CAD benchmark.

2606.13232 2026-06-12 cs.RO 新提交

WT-UMI: Tactile-based Whole-Body Manipulation via Force-Supervised Contact-Aware Planning

WT-UMI: 基于触觉的全身操控通过力监督的接触感知规划

Jaehwi Jang, Zhaoyuan Gu, Alfred Cueva, Zimeng Chai, Junjie Sheng, Thong Nguyen, Himank Galundia, Yifan Wu, Huishu Xue, Isaac Legene, Ojas Mediratta, Davin Doan, Andrew Collins, Sarah Sadegh, KyoungMok Kim, Rishita Dhalbisoi, Zun Chen, Ye Zhao

AI总结 提出WT-UMI系统,结合人体演示与遥操作数据,通过力监督规划器预测末端执行器位姿和接触力轨迹,并利用触觉导纳控制器提升全身操控性能。

详情
Comments
18 pages, 8 figures
AI中文摘要

全身人形操控笨重、可变形和共享负载物体需要分布式接触感知和显式力调节,然而大多数模仿策略仅隐式处理接触力。另一方面,不同的演示来源提供了具有固有权衡的互补模态:人体演示捕捉自然接触力但不包含机器人可执行动作,而遥操作直接记录机器人动作但力调节不够自然。本文提出\textbf{WT-UMI},一种可穿戴全身触觉接口,可由人类操作员佩戴或安装在人形机器人上,在人体演示和人形遥操作模式下提供触觉图像、接触力和末端执行器位姿的精确观测。我们引入一个力条件目标位姿校正模块,通过从遥操作数据中学习校正,将测量的人体位姿转换为接触感知的机器人目标。为了利用人体数据中的自然力交互,我们提出一个力监督规划器,预测末端执行器位姿块和接触力轨迹。预测的接触力作为基于触觉的导纳控制器的参考。在五个接触密集型任务中,涵盖可变形物体、笨重刚体物体和人-人形协作,WT-UMI在成功率上优于四个策略基线,并降低了接触位置跟踪误差。我们的项目页面可在此https URL访问。

英文摘要

Whole-body humanoid manipulation of bulky, deformable, and shared-load objects requires distributed contact sensing and explicit force regulation, yet most imitation policies treat contact force only implicitly. On the other hand, different demonstration sources provide complementary modalities with inherent trade-offs: human demonstrations capture natural contact forces but not robot-executable actions, while teleoperation directly records robot actions but with less natural force regulation. This paper presents \textbf{WT-UMI}, a wearable whole-body tactile interface worn by human operators or mounted on humanoids, providing accurate observations of tactile images, contact forces, and end-effector poses across both human demonstration and humanoid teleoperation modes. We introduce a force-conditioned target-pose correction module that converts measured human poses into contact-aware robot targets by learning corrections from teleoperation data. To leverage the natural force interaction in human data, we propose a force-supervised planner that predicts end-effector pose chunks and contact-force trajectories. The predicted contact force serves as the reference for a tactile-based admittance controller. Across five contact-rich tasks spanning deformable objects, bulky rigid objects, and human--humanoid collaboration, WT-UMI improves success rate and reduces contact-position tracking error over four policy baselines. Our project page is available at this https URL.

2606.13223 2026-06-12 cs.LG cs.CV 新提交

Distributional Loss for Robust Classification

分布损失用于鲁棒分类

Kathleen Anderson, Thomas Martinetz

AI总结 提出一种基于双峰高斯分布的分布损失概念,通过软化目标隐式捕捉类别模糊性,缓解过拟合,提升决策边界鲁棒性,尤其在低数据场景下效果显著。

详情
Comments
ICANN 2026
AI中文摘要

本文提出了一种用于监督分类任务的新型损失概念。我们不是强制每个输入样本直接映射到单个分配标签,而是将分类器输出的优化目标定义为双峰高斯分布。这种更柔和的目标公式隐式地捕捉了类别模糊性,减轻了过拟合,并鼓励学习更鲁棒的决策边界,所有这些都不需要额外的标签信息。实验结果表明,鲁棒性持续提升,在低数据场景下尤其明显,同时仅需对标准训练流程进行最小修改。

英文摘要

This paper proposes a novel loss concept for supervised classification tasks. Rather than enforcing a direct mapping from each input sample to a single assigned label, we define an optimization objective over all classifier outputs as a bimodal Gaussian distribution. This softer target formulation implicitly captures class ambiguity, mitigates overfitting, and encourages the learning of more robust decision boundaries, all without requiring additional label information. Experimental results demonstrate consistent improvements in robustness, with particularly pronounced gains in low-data regimes, while requiring only minimal modifications to standard training pipelines.

2606.13218 2026-06-12 cs.CL 新提交

When Similar Means Different: Evaluating LLMs on Arabic--Hebrew Cognates

当相似意味着不同:评估大语言模型在阿拉伯语-希伯来语同源词上的表现

Junhong Liang, Noor Abo Mokh, Bashar Alhafni

AI总结 针对阿拉伯语和希伯来语同源词、假朋友和借词,构建SemCog Bench基准(1858对词对),评估LLM跨语言语义理解,发现模型依赖表面形式相似性,在假朋友和借词上表现差,上下文帮助有限。

详情
AI中文摘要

阿拉伯语和希伯来语作为密切相关的闪米特语言,共享大量真正的同源词、误导性的假朋友和现代借词。这种重叠对大语言模型(LLM)的跨语言语义理解构成了挑战。为了评估这一能力,我们引入了SemCog Bench,这是一个精心策划的基准,包含1,858个阿拉伯语-希伯来语词对,并带有用于同源词识别和语义消歧的句子级注释。我们评估了开源和商业LLM在多种输入表示(原始、带变音符号、罗马化和音标)下的表现,揭示了跨语言推理中的关键差距。虽然模型在真正的同源词上达到了高准确率,但在假朋友和借词上性能急剧下降,反映出对表面形式相似性的强烈依赖。此外,句子级上下文仅带来微小的改进,表明仅靠上下文线索不足以克服误导性的形式信号。这些发现揭示了当前LLM在解决跨语言形式-意义冲突方面的根本局限性,并将SemCog Bench确立为多语言语义推理的严格基准。我们的代码和数据已公开。

英文摘要

Arabic and Hebrew, as closely related Semitic languages, share a substantial lexicon of true cognates, misleading false friends, and modern loanwords. This overlap poses a challenge for cross-lingual semantic understanding in large language models (LLMs). To evaluate this capability, we introduce SemCog Bench, a curated benchmark of 1,858 Arabic--Hebrew word pairs with sentence-level annotations for cognate identification and semantic disambiguation. We evaluate open-source and commercial LLMs across multiple input representations (raw, diacritized, Romanized, and phonetic) and reveal a critical gap in cross-lingual reasoning. While models achieve high accuracy on true cognates, performance drops sharply on false friends and loanwords, reflecting a strong reliance on surface-form similarity. Furthermore, sentence-level context yields only modest improvements, suggesting that contextual cues alone are insufficient to overcome misleading form-based signals. These findings reveal a fundamental limitation of current LLMs in resolving cross-lingual form--meaning conflicts and establish SemCog Bench as a rigorous benchmark for multilingual semantic reasoning. Our code and data are publicly available.

2606.13190 2026-06-12 cs.RO cs.HC 新提交

Multi-Modal Multi-Agent Robotic Cognitive Alignment enabled by Non-Invasive Consumer Brain Computer Interfaces: A Proof of Concept Exploration

基于非侵入式消费级脑机接口的多模态多智能体机器人认知对齐:概念验证探索

Nataliya Kosmyna, Liz Jenkins, Anoop K. Sinha

AI总结 提出一种框架,利用消费级脑机接口监测脑电信号,在高认知负荷时延迟智能体通信,实现认知对齐的多智能体交互,初步验证了实时信号处理、大语言模型与机器人结合的可行性。

详情
Comments
19 pages, 9 figures, for associated video, see this https URL
AI中文摘要

尽管非语言行为和表达性动作对于自然的人机交互至关重要,但现有方法常常忽略一个关键要素:人类的内在认知状态。主动式多智能体系统经常在不合时宜的时刻打断人类,导致认知过载和任务性能下降。本文引入了一个生成“认知对齐”多智能体交互的框架,增强了机器人系统在人类高心理工作负荷和高投入度时刻,能够上下文相关地延迟向智能体系统用户发送通信的能力。我们介绍了一种闭环架构的设计与实现,该架构探索了自主任务执行与实时神经生理学专注度之间的相互作用。使用消费级脑机接口(BCI),我们的方法在人类执行投入度诱导任务时持续监测脑电图(EEG)频谱带功率。我们提出了一种基于投入度的流水线,其中基于HTTP的信令机制在检测到高投入度时将主智能体的感官输入和音频输出置于保持状态,从而允许次级智能体在后台无缝处理复杂的委托任务。一旦人类的认知状态恢复到较低的认知负荷基线,主智能体释放排队的智能体消息。我们的初步结果证明了利用实时信号处理、大语言模型(LLMs)和物理机器人实体创建认知感知、非侵入式多智能体系统的可行性。

英文摘要

While non-verbal behaviors and expressive movements are essential for natural human-robot interaction, existing methods often overlook a crucial element: the human's internal cognitive state. Frequently, proactive multi-agent systems can interrupt humans at inopportune moments, leading to cognitive overload and decreased task performance. This paper introduces a framework for generating "cognitively aligned" multi-agent interactions, enhancing the ability of robotic systems to contextually defer communications to the user of an agent system during moments of high human mental workload and engagement. We present the design and implementation of a closed-loop architecture that explores the interplay between autonomous task execution and real-time neurophysiological focus. Using a consumer-grade Brain-Computer Interface (BCI), our approach continuously monitors Electroencephalography (EEG) spectral band powers while a human performs an engagement-inducing task. We propose an engagement-driven pipeline where an HTTP-based signaling mechanism places a primary agent's sensory inputs and audio outputs into a holding state upon detecting high engagement. This allows secondary agents to seamlessly process complex, delegated tasks in the background. Once the human's cognitive state returns to a lower cognitive load baseline, the primary agent releases the queued agent message. Our preliminary results demonstrate the feasibility of leveraging real-time signal processing, Large Language Models (LLMs), and physical robotic embodiments to create cognitively-aware, non-intrusive multi-agent systems.

2606.13179 2026-06-12 cs.ET cs.AI cs.AR cs.NE 新提交

Modern analog computing for solving differential and matrix equations

现代模拟计算用于求解微分方程和矩阵方程

Zhong Sun, Piergiulio Mannocci, Manuel Le Gallo, Abu Sebastian

AI总结 本文综述现代模拟计算在求解微分方程和矩阵方程中的核心原语、硬件实现及最新进展,强调电阻式存储器阵列的优势,并讨论精度、可扩展性及与内存计算的关系。

详情
AI中文摘要

近年来,受人工智能和科学计算等数据密集型应用的计算需求驱动,模拟计算重新获得关注。鉴于计算任务的多样性以及模拟CMOS电路和电阻式存储器技术的最新进展,我们将这一不断发展的领域称为现代模拟计算。在此背景下,我们识别出三个核心计算原语:求解微分方程、求解矩阵方程以及执行矩阵-向量乘法,并探讨它们之间的联系。我们还研究了这些模拟计算算子的各种硬件实现,包括基于分立元件、集成电路和电阻式存储器设备的实现。其中,电阻式存储器阵列因其实现效率而显得尤为有前景。本文随后调查了利用现代模拟计算(使用先进的模拟CMOS电路和电阻式存储器阵列)求解微分方程和矩阵方程的最新进展。最后,我们讨论了这些电路的应用、精度和可扩展性问题及其潜在解决方案、与内存计算的关系,以及模拟计算的独特计算复杂性。本文提供了关于模拟计算的统一视角,强调了其优势、当前发展和挑战,并将其定位为下一代计算前沿的关键推动者。

英文摘要

In recent years, driven by the computational demands of data-intensive applications such as artificial intelligence and scientific computing, analog computing has gained renewed interest. Given the diversity of computational tasks and recent advancements in analog CMOS circuits and resistive memory technologies, we refer to the evolving landscape as modern analog computing. In this context, we identify three core computational primitives: solving differential equations, solving matrix equations, and performing matrix-vector multiplications, and we explore the connections among them. We also examine various hardware implementations of these analog computing operators, including those built with discrete components, integrated circuits, and resistive memory devices. Among these, resistive memory arrays emerge as particularly promising due to their implementation efficiency. The paper then surveys recent progress in leveraging modern analog computing to solve differential and matrix equations using both advanced analog CMOS circuits and resistive memory arrays. Finally, we discuss the applications of these circuits, the precision and scalability issues and their potential solutions, the relationship with in-memory computing, and the unique computational complexity of analog computing. This paper provides a unified perspective on analog computing, highlighting its strengths, current developments, and challenges, and positioning it as a pivotal enabler of next-generation computational frontiers.

2606.13177 2026-06-12 cs.CL cs.AI cs.LG 新提交

MemRefine: LLM-Guided Compression for Long-Term Agent Memory

MemRefine: 基于LLM引导的压缩用于长期智能体记忆

Minjae Kim, Jinheon Baek, Soyeong Jeong, Sung Ju Hwang

AI总结 提出MemRefine框架,利用LLM判断事实内容,通过删除、合并和保留操作将记忆库压缩到固定预算内,在多个基准上保持下游性能并优于基于规则的基线。

详情
AI中文摘要

大型语言模型(LLM)智能体越来越需要在长期交互中运行,其中过去对话中的信息必须被保留和回忆以支持未来任务。然而,随着交互的积累,记忆存储无限制增长,并充满冗余条目,这些条目增加了存储成本,并通过排挤最有用的证据而降低了检索质量。此外,在具有硬性内存预算的资源受限平台上,这尤其受限,促使我们制定了有存储预算的记忆管理任务,即在固定预算内保持已构建的记忆库,同时保留对未来交互有用的信息。为此,我们提出了MemRefine,一个基于LLM引导的框架,由于表面相似性不能很好地反映事实价值,该框架仅使用相似性来提出候选对,并将删除、合并和保留决策推迟给基于事实内容的LLM判断,迭代直到满足预算。在多个记忆框架和长期对话基准上,MemRefine始终满足目标预算,同时保持下游性能,并在紧预算下优于基于规则的基线。

英文摘要

Large language model (LLM) agents are increasingly expected to operate over long-term interactions, where information from past dialogues must be preserved and recalled to support future tasks. However, as interactions accumulate, the memory store grows without bound and fills with redundant entries that inflate storage cost and degrade retrieval by crowding out the most useful evidence. Furthermore, this is especially limiting on resource-constrained platforms with hard memory budgets, motivating us to formulate storage-budgeted memory management, the task of keeping an already constructed memory store within a fixed budget while preserving information useful for future interactions. To this end, we then propose MemRefine, an LLM-guided framework that, since surface similarity poorly reflects factual value, uses similarity only to propose candidate pairs and defers delete, merge, and preserve decisions to an LLM judge based on factual content, iterating until the budget is met. Across multiple memory frameworks and long-term conversation benchmarks, MemRefine consistently meets target budgets while preserving downstream performance and outperforming rule-based baselines under tight budgets.

2606.13133 2026-06-12 cs.DS cs.LG 新提交

Learning-Augmented Approximation for Unrelated-Machines Makespan Scheduling

学习增强的无关联机器调度近似算法

Kaito Baba, Evripidis Bampis, Giorgos Mitropoulos

AI总结 针对无关联机器调度问题,提出学习增强算法,利用重作业分配预测实现精确预测时(1+ε)-近似,误差增大时退化为2-近似。

详情
Comments
22 pages, 3 figures
AI中文摘要

最近,Antoniadis等人(ICLR 2025)提出了一个框架,通过引入预测来近似NP-hard选择问题。尽管该方法简单,但它紧密匹配理论下界,因此其推广极具吸引力。我们解决了Antoniadis等人工作中提出的一个开放问题,即如何将该方法扩展到选择问题类之外的其他重要问题,例如调度问题。我们为无关联机器上的最小化完工时间问题(记为$R\\|C_{\max}$)开发了一种学习增强算法。通过使用重作业分配的预测,我们在预测准确时实现了多项式时间的$(1+\varepsilon)$-近似,并且随着误差增加,该近似平滑地退化为最坏情况下的2-近似。我们通过实证分析总结了我们的工作。

英文摘要

Recently, Antoniadis et al. (ICLR 2025) proposed a framework for incorporating predictions to approximate NP-hard selection problems. Despite its simplicity, this approach tightly matches theoretical lower bounds, making its generalization highly compelling. We address an open question raised in the work of Antoniadis et al., concerning the extension of this approach to other important problems outside the class of selection problems, such as scheduling. We develop a learning-augmented algorithm for the makespan minimization problem on unrelated machines, denoted by $R\|C_{\max}$. By using predictions of heavy job assignments, we achieve a polynomial-time $(1+\varepsilon)$-approximation for accurate predictions that smoothly degrades to a worst-case 2-approximation as the error increases. We conclude our work with an empirical analysis of our method.

2606.13113 2026-06-12 eess.SY cs.RO 新提交

MPC for underactuated spacecraft control with a Lyapunov supervised physics-informed neural network correction layer

基于李雅普诺夫监督的物理信息神经网络校正层的欠驱动航天器MPC控制

Amirhossein Ayanmanesh Motlaghmofrad, Carlo Cena, Mauro Martini, Marcello Chiaberge

AI总结 针对欠驱动航天器姿态控制,提出一种分层架构,结合非线性模型预测控制、物理信息神经网络和李雅普诺夫监督机制,在不确定性下降低稳态误差并保持鲁棒性。

详情
Comments
Accepted at SPAICE (AI in and for Space) 2026
AI中文摘要

欠驱动航天器面临可控性限制和对环境干扰的高度敏感性,使得姿态机动和稳定复杂化。由于沿欠驱动轴缺乏控制能力,传统控制器无法直接稳定所有姿态分量,因此需要参考规划策略。此外,MPC方法对惯性不确定性和未建模动态耦合仍然敏感,导致在失配下跟踪性能下降。为解决这些问题,我们考虑一种集成三层的分层架构:(i) 非线性模型预测控制器(NMPC),用于约束和欠驱动感知的机动规划以及在执行器限制下的标称闭环稳定性;(ii) 物理信息神经网络(PINN),在仿真数据上离线训练以估计残余干扰力矩,其损失项强制执行与刚体旋转动力学的一致性;(iii) 基于李雅普诺夫的监督安全机制,在线评估学习到的校正并限制或抑制其影响,以保持基线控制器的稳定性特性。该架构在模拟反作用轮动力学、执行器饱和及环境干扰的高保真仿真环境中进行评估。蒙特卡洛研究表明,与独立NMPC相比,稳态姿态误差有统计显著的降低,同时在不确定性下保持鲁棒行为。监督层确保当基于学习的增强不可靠时,能够优雅地退化到纯模型控制。

英文摘要

Underactuated spacecraft faces controllability limitations and heightened sensitivity to environmental disturbances, complicating attitude maneuvering and stabilization. Due to the lack of control authority along the underactuated axis, conventional controllers cannot directly stabilize all attitude components and therefore require reference planning strategies. Furthermore, MPC approaches remain sensitive to inertia uncertainty and unmodeled dynamic couplings, resulting in degraded tracking performance under mismatch. To address these issues, we consider a hierarchical architecture integrating three layers: (i) a nonlinear model predictive controller (NMPC) for constraint and underactuation-aware maneuver planning and nominal closed-loop stability under actuator limits; (ii) a physics-informed neural network (PINN) trained offline on simulation data to estimate residual disturbance torques, with loss terms that enforce consistency with rigid-body rotational dynamics; (iii) a Lyapunov-based supervisory safety mechanism that evaluates the learned correction online and bounds or suppresses its influence to preserve the stability properties of the baseline controller. The architecture is evaluated in a high-fidelity simulation environment modelling reaction wheel dynamics, actuator saturation, and environmental disturbances. Monte Carlo studies show statistically significant reductions in steady-state attitude error relative to standalone NMPC while maintaining robust behavior under uncertainty. The supervisory layer ensures graceful degradation to purely model-based control when the learning-based augmentation is unreliable.

2606.13104 2026-06-12 cs.LG 新提交

Authority, Truth, and Citation Bias: A Large-Scale Multi-Domain Benchmark for Studying Epistemic Susceptibility in Large Language Models

权威、真实性与引文偏差:研究大语言模型认知易感性的大规模多领域基准

Aryan Khurana, Aravind Ramana RN, Dhruv Kumar

AI总结 提出AuthorityBench基准,通过2x2因子设计隔离引文权威信号对LLM认知行为的影响,发现引文存在(无论真假)均提高幻觉率,真声明搭配假引文时幻觉率上升3-22个百分点。

详情
Comments
10 pages, 5 figures. Accepted to AI4GOOD and EIML at ICML 2026
AI中文摘要

大型语言模型越来越多地部署在引文增强的环境中,但引文存在对模型行为的影响(独立于事实内容)仍知之甚少。我们引入了AuthorityBench,一个包含220,564个提示的多领域基准,用于隔离基于引文的权威信号如何影响LLM的认知行为。该基准采用完全平衡的2x2因子设计,交叉声明真实性(claim veracity)与引文真实性(citation veracity),这是首个这样做的基准,涵盖四个领域(常识、科学、法律和医学),并在40个提示模板、四个场所声望等级和一个国家编码的作者姓名数据集上进行受控变化。评估七个模型在12个结构化研究问题上的表现,我们发现引文的存在(无论是真实的还是捏造的)相对于无引文基线一致地提高了幻觉率。当捏造的引文伴随真实声明时,这种效应最强,使幻觉率提高3到22个百分点,在常识领域达到35%到77%,而法律声明相对稳健,场所声望和作者人口统计学影响可忽略不计。所有数据集和评估代码均可在以下网址获取:this https URL

英文摘要

Large language models are increasingly deployed in citation-augmented settings, yet the effect of citation presence on model behavior independent of factual content remains poorly understood. We introduce AuthorityBench, a 220,564-prompt multi-domain benchmark that isolates how citation-based authority signals influence epistemic behavior in LLMs. The benchmark uses a fully balanced 2x2 factorial design crossing claim veracity with citation veracity, the first to do so, across four domains (general knowledge, science, law, and medicine), with controlled variation over 40 prompt templates, four venue prestige tiers, and a country-coded author name dataset. Evaluating seven models on 12 structured research questions, we find that citation presence, whether real or fabricated, consistently increases hallucination rates relative to a no-citation baseline. The effect is strongest when fabricated citations accompany true claims, raising hallucination rates by 3 to 22 percentage points and reaching 35 to 77% in the general knowledge domain, while legal claims are comparatively robust and venue prestige and author demographics show negligible impact. All datasets and evaluation code are available at: this https URL

2606.13097 2026-06-12 cs.PL cs.AI 新提交

Functional Cache Grafting: Robust and Rapid Code-Policy Synthesis for Embodied Agents

功能缓存嫁接:具身智能体的鲁棒且快速代码策略合成

Saehun Chun, Wonje Choi, Sera Choi, Sanghyun Ahn, Honguk Woo

AI总结 提出FCGraft框架,通过维护函数级验证代码骨架及其键值缓存,对新任务进行缓存嫁接(拼接和修补),减少预填充计算并复用验证结构,实现更鲁棒和快速的策略合成。

详情
Comments
Accepted at ICML 2026
AI中文摘要

编写代码的大型语言模型(CodeLLMs)通过将自然语言目标和环境约束转化为结构化控制程序,为具身智能体生成可执行的代码策略。然而,在开放域具身环境中,策略生成存在两个基本限制:(i) 由于长提示上的重复预填充计算导致的延迟解码,以及(ii) 由于完全生成式解码导致的鲁棒性有限,这常常产生API不匹配、缺少安全防护和不稳定的控制逻辑。为了解决这些限制,我们提出了FCGraft,一种功能缓存嫁接框架。FCGraft维护一个函数级验证代码骨架库及其相关的提示级Transformer键值(KV)缓存,并在提供新任务时通过检索相关函数并嫁接其KV缓存来合成新策略。给定检索到的函数缓存,FCGraft通过拼接(将缓存的函数片段组合成复合策略)和修补(仅局部调整必要的代码区域以满足任务特定参数和约束,且只需最少的额外解码)进行缓存嫁接。通过消除冗余的预填充计算,该方法减少了生成延迟,同时重用经过验证的控制结构提高了鲁棒性,相比提示级缓存方法RAGCache,任务成功率提高了18.31%,策略合成速度提高了2.3倍。

英文摘要

Code-writing large language models (CodeLLMs) generate executable code policies for embodied agents by translating natural language goals and environmental constraints into structured control programs. However, policy generation in open-domain embodied environments suffers from two fundamental limitations: (i) delayed decoding caused by repetitive prefill computation over long prompts, and (ii) limited robustness due to fully generative decoding, which often produces API mismatches, missing safety guards, and unstable control logic. To address these limitations, we present FCGraft, a Functional Cache Grafting framework. FCGraft maintains a library of function-level validated code skeletons and their associated prompt-level Transformer key-value (KV) caches, and synthesizes new policies by retrieving relevant functions and grafting their KV caches when a new task is provided. Given retrieved function caches, FCGraft performs cache grafting via stitching, which composes cached function segments into a composite policy, and patching, which locally adapts only the necessary code regions to satisfy task-specific parameters and constraints with minimal additional decoding. By eliminating redundant prefill computation, this approach reduces generation latency, while reusing validated control structures improves robustness over prompt-level caching methods RAGCache, achieving 18.31% higher task success rate and 2.3x faster policy synthesis.

2606.13079 2026-06-12 cs.CR cs.AI 新提交

The Emergence of Autonomous Penetration Capabilities in Large Language Model-Powered AI Systems

大型语言模型驱动的AI系统中自主渗透能力的涌现

Jiaqi Luo, Jiarun Dai, Zhile Chen, Jia Xu, Weibing Wang, Yawen Duan, Brian Tse, Geng Hong, Xudong Pan, Yuan Zhang, Min Yang

AI总结 针对现有评估方法不透明、场景简化等问题,构建包含两级目标服务器和通用代理框架的自主渗透评估体系,测试19个LLM发现成功率10.7%-69.3%,且能力随模型整体能力提升。

详情
AI中文摘要

如今,能够造成重大现实世界危害的网络攻击的自主执行被广泛视为前沿AI系统不得跨越的关键红线之一。在这个更广泛的红线场景中,自主渗透代表了一项核心使能能力和子任务:LLM驱动的AI系统在无需人工干预的情况下,独立对目标服务器进行对抗操作,识别和利用漏洞,并获得未授权访问或控制的能力。越来越多的研究试图评估AI系统的自主渗透能力。然而,现有评估通常采用不透明的方法,依赖不切实际或过度简化的渗透测试场景,或为LLM提供过多的先验知识和任务特定指导,无法准确捕捉现代AI系统在更广泛的高影响网络攻击场景中自主执行这一核心能力的程度。为解决这些局限性,我们构建了一个新的自主渗透评估框架,包含两个组成部分:目标服务器和代理脚手架。具体而言,在目标服务器端,我们基于与易受攻击服务一起部署的无已知漏洞安全服务的数量,设计了两个级别的目标环境:一级(一个安全服务)和二级(三个安全服务),共产生300个目标服务器。同时,代理脚手架采用通用代理架构,配备一组通用网络安全工具,没有任何目标特定的先验知识。我们评估了19个开源和专有LLM,发现当前模型的渗透成功率在10.7%到69.3%之间。此外,我们观察到自主渗透能力随着整体模型能力的提升而持续改进。

英文摘要

Nowadays, the autonomous execution of cyberattacks capable of causing substantial real-world harm is widely regarded as one of the critical red lines that frontier AI systems must not cross. Within this broader red-line scenario, autonomous penetration represents a core enabling capability and subtask: the ability of LLM-powered AI systems to independently conduct adversarial operations against a target server without human intervention, identify and exploit vulnerabilities, and obtain unauthorized access or control. A growing body of work has sought to assess the autonomous penetration capabilities of AI systems. However, existing evaluations often employ opaque methodologies, rely on unrealistic or overly simplified penetration-testing scenarios, or provide LLMs with excessive prior knowledge and task-specific guidance, and cannot accurately capture the extent to which modern AI systems can autonomously perform this core capability within broader high-impact cyberattack scenarios. To address these limitations, we construct a new autonomous penetration evaluation framework consisting of two components: target servers and agent scaffolding. Specifically, on the target-server side, we design two levels of target environments based on the number of secure services without known vulnerabilities deployed alongside a vulnerable service: Tier~1 (one secure service) and Tier~2 (three secure services), resulting in a total of 300 target servers. Meanwhile, the agent scaffolding adopts a general-purpose agent architecture equipped with a set of general-purpose cybersecurity tools, without any target-specific prior knowledge. We evaluate 19 open-weight and proprietary LLMs, and find that current models achieve penetration success rates ranging from 10.7% to 69.3%. Moreover, we observe that autonomous penetration capability continues to improve alongside advances in overall model capability.

2606.13076 2026-06-12 cs.MA cs.GT cs.LG 新提交

$α$-fair heterogeneous agent reinforcement learning

$\alpha$-公平异质智能体强化学习

Yao-hua Franck Xu, Tayeb Lemlouma, Jean-Marie Bonnin, Arnaud Braud

AI总结 提出一种结合$\alpha$-公平性与异质智能体信任区域学习(HATRL)的框架,通过公平优势函数动态加权智能体效用,实现单调改进并收敛至纳什均衡,在顺序社会困境中优于HATRL算法。

详情
AI中文摘要

多智能体系统中的合作通常通过功利主义目标进行优化,这些目标最大化整体效率但未能考虑奖励分配,常常导致不公平的“领导者-跟随者”动态。虽然基于公平的方法鼓励每个智能体从合作中受益的亲社会行为,但许多当前算法——包括那些利用奖励塑造的算法——破坏了马尔可夫博弈的平稳性或缺乏严格的理论保证。这在公平目标方法和理论上安全的学习框架之间造成了关键差距。我们提出了一种新颖的框架,将$\alpha$-公平性与异质智能体信任区域学习(HATRL)相结合,确保单调改进并收敛至纳什均衡。我们的方法利用一种公平优势函数,该函数根据智能体的期望回报动态加权其效用,使得全局目标能够根据参数$\alpha$从纯粹的功利主义效率过渡到$\alpha$-公平福利。我们引入了两种实用算法,$\alpha$-公平HATRPO和$\alpha$-公平HAPPO,并通过在CleanUp和CommonHarvest等顺序社会困境中的实验证明,从功利主义角度看,它们比HATRL算法表现更好,同时实现了更高的社会结果。

英文摘要

Cooperation in multi-agent systems is typically optimized through utilitarian objectives that maximize overall efficiency but fail to account for reward distribution, often resulting in inequitable "leader-follower" dynamics. While fairness-based approaches encourage pro-social behaviors where every agent benefits from cooperation, many current algorithms - including those utilizing reward shaping - break the stationarity of Markov Games or lack rigorous theoretical guarantees. This creates a critical gap between fair objective methods and theoretically safe learning frameworks. We propose a novel framework that bridges $\alpha$-fairness with Heterogeneous-Agent Trust Region Learning (HATRL), ensuring monotonic improvement and convergence toward Nash Equilibria. Our approach leverages a fair advantage function that dynamically weights agent utilities based on their expected returns, allowing the global objective to transition from purely utilitarian efficiency to $\alpha$-fairness welfare based on the parameter $\alpha$. We introduce two practical algorithms, $\alpha$-fair HATRPO and $\alpha$-fair HAPPO, and demonstrate through experiments in sequential social dilemmas like CleanUp and CommonHarvest that they perform better than HATRL's algorithms from a utilitarian point of view while achieving socially higher outcomes.

2606.13071 2026-06-12 cs.CY cs.AI cs.HC 新提交

"Is This Not Enough?": Asymmetries in Institutional Accountability and Collective Sensemaking in the Case of Canada's Algorithmic Visa Triage System

“这还不够吗?”:加拿大算法签证分类系统中的机构问责与集体意义建构的不对称性

Dipto Das, Matthew Tamura, Syed Ishtiaque Ahmed, Shion Guha

AI总结 研究加拿大签证系统中算法问责的机构表述与申请者体验,发现机构强调透明度与程序保障,而申请者通过集体意义建构应对不透明决策,揭示认知、管辖和时空关系三方面不对称。

详情
AI中文摘要

本文研究了加拿大签证系统中算法问责如何在机构层面被表述,以及跨境申请者如何体验这种问责。我们使用为公共部门调整的算法决策(ADMAPS)框架,分析了加拿大移民、难民和公民部(IRCC)针对临时居民签证(TRV)分类系统的算法影响评估(AIA),并采用混合方法分析了Reddit上申请者之间的讨论。我们表明,虽然机构工件强调透明度、程序保障和有限影响,但申请者进行集体意义建构以解读不透明决策,常常在不确定性中依赖同行知识。我们识别了机构问责结构与人们感知过程之间的三种不对称:获取决策逻辑的认知不对称、由地缘政治定位塑造的管辖不对称,以及等待和不确定性体验中的时间-关系不对称。我们强调了将注意力从机构设计转向公共部门算法治理中体验的不均匀分布的重要性。这些贡献共同展示了跨国移民背景下的算法治理系统如何产生机构披露框架未能捕捉的结构性不对称,以及扩展ADMAPS如何能够解释这些不平等的问责转化。

英文摘要

This paper examines how algorithmic accountability in Canada's visa system is articulated institutionally and experienced by applicants across borders. We analyzed Immigration, Refugees and Citizenship Canada (IRCC)'s Algorithmic Impact Assessment (AIA) for the temporary resident visa (TRV) triage system using the algorithmic decision-making adapted for the public sector (ADMAPS) framework and analyzed Reddit discussions among applicants using a mixed-methods approach. We show that while institutional artifacts emphasize transparency, procedural safeguards, and bounded impacts, applicants engage in collective sensemaking to interpret opaque decisions, often relying on peer knowledge amid uncertainty. We identify three asymmetries between how institutional accountability is structured and how people perceive the process: epistemic asymmetry in access to decision logic, jurisdictional asymmetry in exposure shaped by geopolitical positioning, and temporal--relational asymmetry in how waiting and uncertainty are experienced. We emphasize why it is important to shift attention from institutional design to the uneven distribution of experiences with public-sector algorithmic governance. Together, these contributions demonstrate how algorithmic governance systems in the context of transnational migration produce structured asymmetries not captured by institutional disclosure frameworks, and how extending ADMAPS can account for those uneven translations of accountability.

2606.13068 2026-06-12 cs.MA cs.RO 新提交

Effects of Social Interactions in Self-Organising Railway Traffic Management

自组织铁路交通管理中社交互动的影响

Fabio Oddi, Federico Naldini, Leo D'Amato, Grégory Marlière, Paola Pellegrini, Vito Trianni

AI总结 研究自组织铁路交通管理中预测邻域范围(horizon)对分布式协调过程的影响,发现短时间范围足够,长范围会损害局部可解性和计算响应性而无全局收益。

详情
AI中文摘要

最近的研究正在探索自组织交通管理作为扩展到复杂现实网络的一种解决方案。在这样的系统中,列车预测其邻域,生成交通计划假设,并通过与邻居的共识达成未来要实施的交通计划。本文研究了该流程中的一个结构参数:预测邻域范围。列车使用该范围来识别与邻居的未来潜在冲突,并建立局部交互拓扑,即要与之协商的列车子集。作为主要设计变量,范围直接决定了社交互动图的大小和密度,而其对局部子问题复杂性和分布式共识动态的影响则代表了需要探索的权衡。通过闭环仿真框架,研究评估了范围变化如何影响整个分散协调过程,从初始冲突检测到分布式调度共识。分析重点在于研究范围选择引入的潜在权衡:平衡局部可解性和计算响应性与安全关键环境中全局调度一致性和可行性的需求。与直觉相反,我们的实证结果表明,短时间范围就足够了,而长时间范围会损害局部可解性和计算响应性,且不会带来全局调度最优性的提升。

英文摘要

Recent research is exploring self-organised traffic management as a solution for scaling to complex real-world networks. In such a system, trains predict their neighbourhood, produce traffic plan hypotheses, and agree via consensus with neighbours on a future traffic plan to be implemented. This paper investigates a structural parameter within this pipeline: the predictive neighbourhood horizon. The horizon is used by trains to identify future potential conflicts with neighbours, and to establish the local interaction topology, that is, the subset of trains to negotiate with. As the primary design variable, the horizon directly determines the size and density of the social interaction graph, whereas its impact on the complexity of local sub-problems and the distributed consensus dynamics represents a trade-off to be explored. Through a closed-loop simulation framework the study evaluates how variations of the horizon impact the overall decentralised coordination process, from initial conflict detection to distributed schedule consensus. The analysis focuses on investigating the potential trade-off introduced by the horizon choice: balancing local tractability and computational responsiveness with the need for global schedule coherence and feasibility in safety-critical environments. Contrary to intuition, our empirical results indicate that the short time horizons suffice, while long values compromise local tractability and computational responsiveness with no gain in global schedule optimality.

2606.13054 2026-06-12 cs.LG cs.AI 新提交

TWLA: Achieving Ternary Weights and Low-Bit Activations for LLMs via Post-Training Quantization

TWLA:通过训练后量化实现大语言模型的三值权重和低位激活

Zhixiong Zhao, Zukang Xu, Zhixuan Chen, Xing Hu, Zhe Jiang, Dawei Yang

AI总结 提出TWLA框架,通过后训练量化实现1.58位权重和4位激活,解决激活分布长尾问题,加速推理。

详情
Comments
Accepted by ICML 2026
AI中文摘要

大型语言模型(LLMs)展现出卓越的通用语言处理能力,但其内存和计算成本阻碍了部署。三值化已成为一种有前景的压缩技术,可显著降低模型大小和推理复杂度。然而,现有方法难以处理重尾激活分布,因此将激活保持在高精度,从根本上限制了端到端推理加速。为克服这一限制,我们提出TWLA,一种后训练量化(PTQ)框架,在保持高精度的同时实现1.58位权重压缩和4位激活量化。TWLA包含三个组件:(1)欧几里得到流形非对称三值量化器(E2M-ATQ),通过从欧几里得初始化到流形重定位的两阶段优化,最小化权重三值化下的层输出误差;(2)Kronecker正交三模态整形(KOTMS),应用Kronecker结构正交旋转将权重重塑为三值友好的三模态分布,同时共享旋转统计上抑制激活异常值;(3)层间感知激活混合精度(ILA-AMP),在位分配中显式引入相邻层二阶交互成本,并联合优化由共享正交变换引起的激活量化增益的层间差异,防止少数弱层触发级联效应。大量实验表明,TWLA在W1.58A4下保持高精度,同时实现显著的推理加速。代码见<此https URL>。

英文摘要

Large language models (LLMs) exhibit exceptional general language processing capabilities, but their memory and compute costs hinder deployment. Ternarization has emerged as a promising compression technique, offering significant reductions in model size and inference complexity. However, existing methods struggle with heavy-tailed activation distributions and therefore keep activations in high precision, fundamentally limiting end-to-end inference acceleration. To overcome this limitation, we propose TWLA, a post-training quantization (PTQ) framework that achieves 1.58-bit weight compression and 4-bit activation quantization while maintaining high accuracy. TWLA comprises three components: (1) Euclidean-to-Manifold Asymmetric Ternary Quantizer (E2M-ATQ) minimizes layer-output error under weight ternarization via a two-stage optimization from Euclidean initialization to manifold relocation; (2) Kronecker Orthogonal Tri-Modal Shaping (KOTMS) applies a Kronecker-structured orthogonal rotation to reshape weights into ternary-friendly tri-modal distributions, while the shared rotation statistically suppresses activation outliers; and (3) Inter-Layer Aware Activation Mixed Precision (ILA-AMP) explicitly introduces adjacent-layer second-order interaction costs in bit allocation and jointly optimizes for the layer-wise disparity of activation quantization gains induced by the shared orthogonal transform, preventing cascades triggered by a few weak layers. Extensive experiments demonstrate that TWLA maintains high accuracy under W1.58A4, while delivering significant inference acceleration. The code is available at < this https URL.