arXivDaily arXiv每日学术速递 周一至周五更新
重置
2606.12763 2026-06-12 cs.LG cs.DS 新提交

Adaptive Weighted Averaging

自适应加权平均

Aditya Bhaskara, Ashok Cutkosky, Ravi Kumar, Manish Purohit

发表机构 * University of Utah(犹他大学) Boston University(波士顿大学) Google(谷歌)

AI总结 提出一种从单次无偏估计中选取最大未知值的方法,具有可容许性且不劣于基线,应用于随机优化获得在线到批次的转换界限。

详情
AI中文摘要

我们研究在仅对每个 $x_i$ 有一个无偏估计 $y_i$ 的情况下,从 $n$ 个未知值 $x_1,\dots,x_n$ 中选择最大值的问题。我们设计的策略同时具有可容许性(不被任何其他策略一致支配)且不劣于给定的基线(如均匀随机选择)。我们将其应用于随机优化,获得了具有理想“无妥协”保证的在线到批次转换界限:它们从不比标准随机迭代选择差,同时在良性设置中可以显著更好。

英文摘要

We study the problem of selecting the largest among $n$ unknown values $x_1,\dots,x_n$ given only a single unbiased estimate $y_i$ for each $x_i$. We design strategies that are simultaneously admissible (not uniformly dominated by any other strategy) and also never worse than a given baseline such as uniform random selection. We provide an application to stochastic optimization, where we obtain online-to-batch conversion bounds with a desirable "no-compromise" guarantee: they are never worse than standard random iterate selection, and yet can be significantly better in benign settings.

2606.12759 2026-06-12 cs.RO 新提交

Sparse2Act: Learning Action-Aligned Sparse 3D Representations for Cross-Domain Robot Manipulation

Sparse2Act: 学习跨域机器人操作的动作对齐稀疏3D表示

Yu Guo, Chang Yu, Siyu Ma, Yunuo Chen, Yin Yang, Ying Nian Wu, Chenfanfu Jiang

发表机构 * University of California, Los Angeles(加州大学洛杉矶分校) University of California, San Diego(加州大学圣迭戈分校) University of Utah(犹他大学)

AI总结 提出Sparse2Act框架,通过动作对齐的掩码稀疏3D编码预训练,实现跨域机器人操作,在LIBERO-10上达86.9%成功率,并支持域迁移和sim-to-real。

详情
AI中文摘要

显式3D表示对于操作任务具有吸引力,因为它们以度量坐标暴露物体形状、工作空间几何以及机器人-物体关系。然而,稀疏3D编码器通常通过下游任务目标学习,将表示与特定数据分布、策略架构和动作参数化绑定。我们引入Sparse2Act,一个用于预训练稀疏点云编码器的观察-动作对齐框架。关键思想是使用任务空间末端执行器动作作为几何监督:训练掩码稀疏3D令牌以组织场景特征,使其围绕与观察配对的工作空间运动。预训练后,仅编码器初始化被下游策略重用,允许它们保留自己的架构和动作空间,包括关节空间命令。在LIBERO-10基准上,我们的方法在500步微调后达到86.9%的平均成功率。相同的预训练编码器支持LIBERO到Meta-World的跨域迁移,在Meta-World-5基准上达到73.4%的平均成功率。关于目标和解码器容量的消融实验表明,增益来自掩码动作对齐信号,并且在下游动作解码器中仍然有用。在真实世界实验中,模拟预训练后跟有限真实数据微调,在四个任务上平均成功率达到72.5%,展示了有效的模拟到真实迁移。这些结果表明,机器人动作可以为可重用的稀疏3D表示提供紧凑的几何监督。

英文摘要

Explicit 3D representations are attractive for manipulation because they expose object shape, workspace geometry, and robot-object relations in metric coordinates. However, sparse 3D encoders are often learned through downstream task objectives, tying the representation to a particular data distribution, policy architecture, and action parameterization. We introduce Sparse2Act, an observation-action alignment framework for pretraining sparse point-cloud encoders. The key idea is to use task-space end-effector actions as geometric supervision: masked sparse 3D tokens are trained to organize scene features around the workspace motion paired with the observation. After pretraining, only the encoder initialization is reused by downstream policies, allowing them to retain their own architectures and action spaces, including joint-space commands. On the LIBERO-10 benchmark, our method achieves 86.9% average success after 500 fine-tuning steps. The same pretrained encoder supports LIBERO-to-Meta-World cross-domain transfer, achieving 73.4% average success on the Meta-World-5 benchmark. Ablations on the objective and decoder capacity show that the gains come from the masked action-alignment signal and remain useful across downstream action decoders. In real-world experiments, simulation pretraining followed by limited real-data fine-tuning achieves an average success rate of 72.5% across four tasks, demonstrating effective sim-to-real transfer. These results suggest that robot actions can provide compact geometric supervision for reusable sparse 3D representations.

2606.12747 2026-06-12 cs.AI 新提交

Prefill Awareness in Large Language Models

大型语言模型中的预填充感知

Andy Wang, Parv Mahajan, David Demitri Africa, Alexandra Souly, Jordan Taylor, Robert Kirk

发表机构 * Constellation University of Wisconsin-Madison(威斯康星大学麦迪逊分校星座研究所) Constellation Georgia Institute of Technology(佐治亚理工学院星座研究所) UK AI Security Institute(英国人工智能安全研究所)

AI总结 研究大型语言模型能否识别并响应其助手消息被预填充或篡改,发现前沿模型具有显著预填充感知能力,可能影响安全评估方法。

详情
Comments
Submitted to NeurIPS 2026
AI中文摘要

语言模型的安全相关研究,包括对齐和越狱评估以及AI控制协议,通常依赖于预填充模型输出。如果AI模型能够识别并利用其先前的助手消息被插入或编辑这一事实,这些方法的有效性和有效性可能会受到损害。我们调查了前沿语言模型是否能区分被篡改和未被篡改的助手侧上下文,我们将这种能力称为预填充感知。为此,我们构建了一个跨三种预填充机制的二元偏好基准,筛选出模型表现出一致立场的案例。我们发现前沿模型表现出显著的预填充感知:Claude Opus 4.5在9-35%的案例中检测到与其偏好相反的预填充,且在提示时假阳性率为0%;此外,模型通常会恢复到基线行为,而不会明确报告预填充是外来的。受控消融实验后来也表明,检测和抵抗依赖于不同的线索,其中风格不匹配主要影响模型是否将预填充标记为外来,而偏好不匹配主要影响模型是否恢复到其基线答案。我们还检查了更真实的智能体设置,如错位延续评估和SWE-bench轨迹,在这些设置中,前沿模型有时会否认预填充的助手轮次,其方式强烈依赖于数据集、任务成功和隐藏的格式伪影。我们的结果表明,预填充感知已经是一些基于预填充的方法的重要混淆因素。我们建议模型开发者在前沿系统中跟踪这种能力。

英文摘要

Safety-relevant studies of language models, including alignment and jailbreaking evaluations and AI control protocols, often rely on prefilling model outputs. If AI models can recognize and act on the fact their prior assistant messages have been inserted or edited, the effectiveness and validity of these methods could be compromised. We investigate whether frontier language models can distinguish between tampered and untampered assistant-side context, a capability we call prefill awareness. To do so, we construct a binary preference benchmark across three prefill mechanisms, filtering for cases where models show consistent stances. We find that frontier models show substantial prefill awareness: Claude Opus 4.5 detects prefills opposing its preferences in 9-35% of cases with a 0% false positive rate when prompted; additionally, models often revert towards baseline behavior without explicitly reporting that the prefill was foreign. Controlled ablations later also show that detection and resistance rely on different cues, where stylistic mismatch mainly affects whether models flag a prefill as foreign, while preference mismatch mainly affects whether they revert toward their baseline answer. We also examine more realistic agentic settings such as misalignment-continuation evaluations and SWE-bench trajectories, where frontier models sometimes disavow prefilled assistant turns in ways that depend strongly on dataset, task success, and hidden formatting artifacts. Our results indicate that prefill awareness is already a substantial confound for some prefill-based methods. We recommend that model developers track this capability in frontier systems.

2606.12744 2026-06-12 cs.CV 新提交

GRIP: Feedback-Guided Prompt Retrieval for Large Multimodal Models

GRIP:面向大型多模态模型的反馈引导提示检索

Garvita Allabadi, Matteo Sodano, Roberto Estevão, Yuxiong Wang, Vikram Adve, Emre Kiciman, Ranveer Chandra

发表机构 * University of Illinois Urbana Champaign(伊利诺伊大学厄巴纳-香槟分校) University of Bonn(波恩大学) Microsoft(微软)

AI总结 提出GRIP,一种可学习的视觉检索框架,利用多模态模型反馈识别真正提升上下文学习性能的示例,在分类、描述和VQA任务上优于基于相似度的检索。

详情
AI中文摘要

上下文学习(ICL)已成为一种强大的机制,使大型语言模型(LLMs)无需微调即可适应新任务。将此概念扩展到大型多模态模型(LMMs),多模态上下文学习(M-ICL)依赖于检索相关示例(如图像、标题或问答对)来指导分类、描述和视觉问答(VQA)等任务的预测。现有方法大多基于特征空间相似性选择上下文示例,假设语义相似的样本提供最有用的上下文。然而,我们的系统分析表明,这一假设并不总是成立:视觉上相似的示例并不一定是那些最有效增强上下文学习性能的示例。为解决此问题,我们提出了上下文提示的引导检索(GRIP),一种可学习的纯视觉检索框架,利用LMMs的反馈来识别真正改善模型预测的示例。GRIP通过对比训练学习区分有益和有害的上下文示例,将检索优化到超越纯相似性。在三个多模态任务(分类、描述和VQA)上,GRIP在Qwen2.5-VL-7B上持续优于基于相似度的检索,在Idefics2-8B上的分类任务中提升最为显著。此外,我们证明了从一个开放LMM训练得到的检索器可以迁移到其他模型(包括闭源的GPT-4o和Gemini)而无需重新训练,从而实现了M-ICL的可扩展且经济高效的部署。代码将在接收后发布。

英文摘要

In-Context Learning (ICL) has become a powerful mechanism for adapting Large Language Models (LLMs) to new tasks without fine-tuning. Extending this concept to Large Multimodal Models (LMMs), Multimodal In-Context Learning (M-ICL) relies on retrieving relevant examples, such as images, captions, or question-answer pairs, to guide predictions across tasks like classification, captioning, and visual question answering (VQA). Most existing approaches select in-context examples based on feature-space similarity, assuming that semantically similar samples provide the most useful context. However, our systematic analysis reveals that this assumption does not always hold: visually similar examples are not necessarily those that most effectively enhance in-context learning performance. To address this, we propose the Guided Retrieval of In-context Prompts (GRIP), a learnable vision-only retrieval framework that leverages feedback from LMMs to identify examples that truly improve model predictions. GRIP learns to distinguish beneficial from detrimental in-context examples through contrastive training, refining retrieval beyond pure similarity. Across three multimodal tasks, namely classification, captioning, and VQA, GRIP improves consistently over similarity-based retrieval on Qwen2.5-VL-7B, with its strongest gains in classification on Idefics2-8B. Moreover, we demonstrate that retrievers trained with feedback from one open LMM can be transferred to other models without retraining, including closed-source GPT-4o and Gemini, enabling scalable and cost-efficient deployment of M-ICL. Code will be published upon acceptance.

2606.12742 2026-06-12 cs.AI cs.AR 新提交

Reducing the Complexity of Deep Learning Models for EEG Analysis on Wearable Devices

降低可穿戴设备上用于脑电图分析的深度学习模型复杂度

Farough Shayeste Roodi, Parham Zilouchian Moghaddam, Mahdi Mohammadi-nasab, Mehdi Modarressi, Mostafa Ersali Salehi Nasab, Masoud Daneshtalab

发表机构 * University of Tehran(德黑兰大学) Mälardalen University(梅拉达伦大学) Royal Institute of Technology(皇家理工学院)

AI总结 研究通过参数量化和电极减少方法,在资源受限的可穿戴设备上部署DNN模型,实现脑电图分析中精度与复杂度的权衡。

详情
AI中文摘要

可穿戴医疗设备是增长最快的物联网领域。许多自动化医疗服务依赖于两种关键的生物信号,即心电图和脑电图,它们分别反映心脏和大脑的活动。尽管深度神经网络被认为是处理和分析这些信号的主要方式,但可穿戴设备中非常严格的能量和计算能力限制远低于DNN模型的计算、能量和内存带宽需求,从而阻碍了深度学习在许多实际可穿戴服务中的部署。本文研究了在资源受限的可穿戴设备上部署最先进的DNN模型的可行性。值得注意的是,我们探讨了在使用参数量化和电极减少方法时,DNN的精度与计算复杂度之间的权衡。我们的研究集中在几种用于脑电图信号分析(特别是检测癫痫发作)的最先进的DNN模型上。我们的发现表明,当明智地应用这些技术时,可以显著降低所考虑的DNN的复杂度,同时对精度的影响最小。这些结果揭示了在将基于DNN的在线脑电图分析适配到可穿戴设备时,精度与复杂度降低之间明确的权衡关系。

英文摘要

Wearable healthcare devices are the fastest-growing Internet of Things (IoT) sector. Many automated healthcare services rely on two crucial biological signals, namely ECG and EEG, which reflect the activity of the heart and brain, respectively. Although deep neural networks are considered the primary way to process and analyze these signals, the very tight energy and computational power constraints in wearable devices are far below the computational, energy, and memory bandwidth demands of DNN models, thereby impeding the deployment of deep learning in many practical wearable services. This paper investigates the feasibility of deploying state-of-the-art DNN models in resource-constrained wearable devices. Notably, we explore the trade-off between accuracy and computational complexity of DNNs when parameter quantization and electrode reduction methods are used. Our investigation centers on several state-of-the-art DNN models designed for EEG signal analysis, specifically for detecting epileptic seizures. Our findings demonstrate that, when applied judiciously, these techniques can significantly reduce the complexity of the DNNs under consideration with minimal adverse effects on accuracy. These results reveal the explicit trade-offs between accuracy and complexity reduction encountered when adapting DNN-based online EEG analysis for wearable devices.

2606.12740 2026-06-12 cs.LG 新提交

Deep Unfolded Latent Optimally Partitioned-l2/l1 Networks for Data-driven Block-Sparse Recovery

深度展开潜在最优分区l2/l1网络用于数据驱动的块稀疏恢复

Takanobu Furuhashi, Hidekata Hontani, Qibin Zhao, Tatsuya Yokota

发表机构 * Nagoya Institute of Technology(名古屋工业大学) RIKEN Center for Advanced Intelligence Project(理化学研究所革新智能研究中心)

AI总结 针对凸LOP-l2/l1方法依赖手动调参且近端算子不可微的问题,提出基于隐式微分和深度权重分解的两种深度展开架构,实现自动参数学习,在块稀疏恢复中表现优异且抗脉冲噪声。

详情
Comments
11 pages, 6 figures
AI中文摘要

凸潜在最优分区(LOP)-l2/l1方法能够在未知分区的情况下实现块稀疏信号恢复,但依赖于手动超参数调整。此外,其近端算子微分时的数值不稳定性阻碍了通过深度展开(DU)进行自动参数调整。为解决这些限制,我们提出了两种架构:一种利用隐式微分的稳定框架,以及一种利用深度权重分解(DWF)的灵活变体。基于DWF的方法还支持非凸光滑数据保真项。数值实验表明,DU-LOP-l2/l1在块稀疏恢复中具有竞争性能,并且对脉冲噪声具有高鲁棒性。

英文摘要

The convex Latent Optimal Partition (LOP)-l2/l1 approach enables block-sparse signal recovery with unknown partitions but relies on manual hyperparameter tuning. Additionally, numerical instability in differentiating its proximal operator prevents its automatic parameter tuning via Deep Unfolding (DU). To address these limitations, we propose two architectures: a stable framework utilizing implicit differentiation and a flexible variant leveraging Deep Weight Factorization (DWF). The DWF-based approach also supports nonconvex smooth data fidelity terms. Numerical experiments demonstrate that DU-LOP-l2/l1 yields competitive performance and high resilience against impulsive noise.

2606.12736 2026-06-12 cs.AI cs.LG 新提交

Benchmarking AI Agents for Addressing Scientific Challenges Across Scales

跨尺度科学挑战的AI智能体基准测试

Tianyu Liu, Allen Xin Wang, Antonia Panescu, Lisa Xinyi Chen, Wenxin Long, Xinyu Wei, Yueqian Jing, Ziyao Zeng, Jihang Chen, Sihan Jiang, Ziqing Wang, Siyi Gu, Siyu Chen, Xinyang Hu, Haoran Shao, Leqi Xu, Wangjie Zheng, Zhiyuan Cao, Ada Fang, Botao Yu, Kunyang Sun, Rex Ying, Arman Cohan, Qingyu Chen, Lingzhou Xue, Kaize Ding, Yuanqi Du, Wengong Jin, Zhuoran Yang, Marinka Zitnik, James Zou, Hua Xu, Hongyu Zhao

发表机构 * Yale University(耶鲁大学) Broad Institute of MIT and Harvard(布罗德研究所) The Pennsylvania State University(宾夕法尼亚州立大学) Northeastern University(东北大学) Northwestern University(西北大学)

AI总结 提出SciAgentArena基准,含约200个交互式任务,评估AI智能体在真实科研场景中的能力,发现其在数据分析中有效,但在创新探索和开放问题上表现不均。

详情
Comments
6 figures
AI中文摘要

AI智能体正被越来越多地开发用于加速科学发现,但它们在真实研究环境中的实际能力仍知之甚少。现有的AI智能体基准很少捕捉科学工作所需的复杂性、异质性和扩展推理,而科学任务的基准通常将研究简化为静态、直接的问题,并对交互式评估支持有限。在此,我们引入SciAgentArena,这是一个系统性的基准,用于评估AI智能体在来自多个领域新兴需求的真实科学研究场景中的表现。SciAgentArena包含约200个具有逐步验证的任务,以及一个交互式、与智能体无关的环境,用于评估不同的AI智能体。使用该基准,我们发现当前智能体能够有效贡献于明确指定的数据分析工作流,特别是当任务结构和评估标准清晰时。然而,它们在科学情境中的表现仍然不均衡:智能体难以产生真正新颖的见解,维持自主探索,并为开放的研究问题制定稳健的解决方案。我们进一步描述了智能体常见的失败模式,并识别了提高其可靠性、自主性和科学推理能力的机会。总之,SciAgentArena提供了一个实用的框架,用于衡量AI智能体在科学领域的进展,并指导未来能够应对复杂科学挑战的智能体设计。完整代码、任务和数据集可通过此链接访问:this https URL。

英文摘要

AI agents are increasingly being developed to accelerate scientific discovery, yet their practical capabilities in real research settings remain poorly understood. Existing benchmarks for AI agents rarely capture the complexity, heterogeneity, and extended reasoning required by scientific work, whereas benchmarks for scientific tasks often reduce research to static, direct problems and provide limited support for interactive evaluation. Here, we introduce SciAgentArena, a systematic benchmark for evaluating AI agents in real-world scientific research scenarios drawn from emerging needs across multiple domains. SciAgentArena comprises approximately 200 tasks with stepwise verification and an interactive, agent-agnostic environment for assessing diverse AI agents. Using this benchmark, we find that current agents can contribute effectively to well-specified data-analysis workflows, particularly when the task structure and evaluation criteria are clear. However, their performance remains uneven across scientific contexts: agents struggle to generate genuinely novel insights, sustain self-directed exploration, and formulate robust solutions for open-ended research questions. We further characterize common failure modes across agents and identify opportunities for improving their reliability, autonomy, and scientific reasoning. Together, SciAgentArena provides a practical framework for measuring progress in AI agents for science and for guiding the design of future agents capable of addressing complex scientific challenges. Full codes, tasks, and datasets can be accessed via this link: this https URL.

2606.12735 2026-06-12 cs.LG 新提交

Physics-Informed Neural Networks and Radial Basis Functions for PDEs with Dirac Delta Sources

物理信息神经网络与径向基函数求解含狄拉克δ源的偏微分方程

Manuel Reyna, Alexandre Tartakovsky

发表机构 * Department of Civil and Environmental Engineering, University of Illinois Urbana-Champaign(伊利诺伊大学厄巴纳-香槟分校土木与环境工程系)

AI总结 针对含狄拉克δ项的偏微分方程,通过将物理信息神经网络解释为残差最小二乘法,利用弱形式直接处理δ项,并对比径向基函数展开方法,发现径向基函数-残差最小二乘法在输运问题中更稳定。

详情
Comments
33 pages, 4 figures
AI中文摘要

物理信息神经网络(PINNs)是一种用于求解正向和逆向偏微分方程(PDEs)的机器学习方法。当应用于强迫项、边界条件或初始条件中包含狄拉克δ函数的PDEs时,PINNs需要用光滑的代理函数来近似它们,这种做法可能会引入显著的建模误差。在这项工作中,我们利用PINNs作为残差最小二乘法(RLS)的解释,并表明这种视角能够通过积分弱形式方程直接处理狄拉克δ项。在除PINN之外的RLS公式中,我们重点关注径向基函数(RBF)展开(也称为单层RBF网络)。我们证明,虽然在PINNs中积分掉狄拉克δ会导致残差无法收敛到零,但RBF-RLS始终能为输运问题提供良好的正向和逆向解。我们使用神经正切核(NTK)理论解释这一发现。我们在代表多孔介质和河流中地下水流和输运的线性PDEs上测试了这两种方法。我们求解逆问题以拟合合成数据、含噪声的合成数据以及真实世界测量值。

英文摘要

Physics-Informed Neural Networks (PINNs) are a machine learning method for solving forward and inverse Partial Differential Equations (PDEs). When applied to PDEs with Dirac delta functions in the forcing terms, boundary conditions, or initial conditions, PINNs require approximating them with smooth surrogate functions, a practice that can introduce significant modeling errors. In this work, we exploit the interpretation of PINNs as Residual Least Squares (RLS) methods and show that this perspective enables direct treatment of Dirac delta terms by integrating the weak-form equation. Among RLS formulations other than PINN, we focus on the Radial Basis Function (RBF) expansion (also known as a single-layer RBF Network). We show that while integrating out the Dirac delta in PINNs causes residuals to fail to converge to zero, RBF-RLS consistently provides good forward and inverse solutions to transport problems. We explain this finding using the Neural Tangent Kernel (NTK) theory. We test both approaches on linear PDEs that represent groundwater flow and transport in porous media and rivers. We solve inverse problems to fit synthetic data, noisy synthetic data, and real-world measurements.

2606.12733 2026-06-12 cs.LG 新提交

Let's Ask Gauss: Improved One-Run Privacy Auditing

让我们问高斯:改进的单次运行隐私审计

Adya Agrawal, Yu Wei, Jaspal Singh, Malik Magdon-Ismail, Vassilis Zikas

发表机构 * Georgia Institute of Technology(佐治亚理工学院) Rensselaer Polytechnic Institute(伦斯勒理工学院) Purdue University(普渡大学)

AI总结 提出一种基于高斯渐近分布的差分隐私审计框架,利用白盒DP-SGD中金丝雀对齐信号的归一化和,从单次训练运行中获取更紧的隐私下界。

详情
AI中文摘要

隐私审计通过估计模型实际泄露的信息提供重要保障,从而确保理论隐私保证在实践中成立。我们研究差分隐私(DP)机器学习的经验隐私审计,重点关注针对DP-SGD等机制的高效单次运行方法。先前的单次运行方法将训练示例或“金丝雀”阈值化为二元成员猜测,这丢弃了有用信息。我们证明,在白盒DP-SGD设置中,金丝雀对齐信号自然形成一系列随机变量,其归一化和渐近服从高斯分布。利用这种分布视角,我们开发了一个DP审计框架,从单次训练运行中获得更紧的隐私下界。

英文摘要

Privacy auditing provides an important safeguard by estimating the actual information leaked by a model, thus ensuring that theoretical privacy guarantees hold in practice. We study empirical privacy auditing for differentially private (DP) machine learning, focusing on efficient one-run methods for mechanisms such as DP-SGD. Prior one-run approaches threshold training examples or "canaries" into binary membership guesses, which discards useful information. We show that, in the white-box DP-SGD setting, canary-aligned signals naturally form a sequence of random variables whose normalized sum is asymptotically Gaussian. Leveraging this distributional perspective, we develop a DP-auditing framework that leads to tighter privacy lower bounds from a single training run.

2606.12731 2026-06-12 cs.LG cs.CY 新提交

Normative Robustness as a Frontier for Non-Verifiable Reasoning in LLMs

规范性鲁棒性作为LLM中不可验证推理的前沿

Elizaveta Tennant, Benjamin Henke, Anita Keshmirian, Murray Shanahan, Verena Rieser, Kristian Lum, Sydney Levine, Julia Haas

发表机构 * DeepMind Institute of Philosophy, School of Advanced Study, University of London(伦敦大学高等研究院哲学研究所) Technische Universität Berlin(柏林工业大学)

AI总结 提出道德推理作为不可验证推理的典型子域,定义道德鲁棒性并引入可扩展的多轮对抗评估框架,发现模型会向用户偏好偏移推理(平均6.5%),且受顺序和轮次影响。

详情
AI中文摘要

随着LLM越来越多地承担咨询和审议角色,用户在缺乏客观真实性的领域中依赖它们进行不可验证推理。然而,传统LLM推理评估几乎只关注基于事实的领域(如数学和科学),导致不确定模型能否以及能在多大程度上处理随时间变化的模糊、主观或价值负载问题。为解决这一问题,我们提出道德推理作为不可验证推理的一个典型子域。我们将道德鲁棒性定义为模型在不同时间和情境下展现合理道德推理的能力,并引入一个可扩展的、对抗性的多轮评估框架来实证测量这一能力。我们在四个前沿LLM上模拟了48,000次用户-智能体道德讨论,变化前提相关性、前提顺序、对话时长和用户声明的道德观点。我们发现模型成功忽略了道德无关的干扰项,但平均向用户声明的偏好道德观点偏移了6.5%的推理,并且推理因顺序(在13-22%的案例中改变道德判断)和时长(在10-24%的案例中在单轮和多轮之间改变道德判断)等因素而变化。我们的分析表明,模型不仅调整最终裁决,还调整其背后的理由以适应用户的道德观点——我们将这种失败模式称为道德审议谄媚。

英文摘要

As LLMs increasingly serve in advisory and deliberative roles, users rely on them for non-verifiable reasoning in domains lacking objective ground truths. However, traditional evaluations of LLM reasoning focus almost exclusively on fact-based domains, such as mathematics and science, leaving uncertainty over whether and to what degree models can handle ambiguous, subjective, or value-laden problems over time. To address this concern, we propose moral reasoning as a paradigmatic subdomain of non-verifiable reasoning. We define moral robustness as a model's capacity to exhibit sound moral reasoning across time and contexts, and we introduce a scalable, adversarial, multi-turn evaluation framework to empirically measure this capability. We simulate 48,000 user-agent moral deliberations across four frontier LLMs, varying premise relevance, premise order, conversation duration, and the user's stated moral view. We find that models successfully ignore morally-irrelevant distractors, but shift their reasoning by up to 6.5%, on average, towards the user's stated preferred moral view, and varying their reasoning depending on factors such as order (altering moral judgments by order in 13-22% of the cases) and duration (altering moral judgments between single-turn and multi-turn in 10-24% of the cases). Our analysis indicates that models tailor not just their final verdicts but their underlying justifications to align with a user's moral viewpoint - a failure mode we characterize as moral deliberative sycophancy.

2606.12730 2026-06-12 cs.AI cs.CL cs.CY cs.LG 新提交

Rethinking Psychometric Evaluation of LLMs: When and Why Self-Reports Predict Behavior

重新思考LLMs的心理测量评估:自我报告何时以及为何能预测行为

Rafal Kocielnik, Pengrui Han, Peiyang Song, Myrl G. Marmarelis, Ramit Debnath, Dean Mobbs, Anima Anandkumar, R. Michael Alvarez

发表机构 * Caltech(加州理工学院) UIUC(伊利诺伊大学厄巴纳-香槟分校) University of Cambridge(剑桥大学)

AI总结 研究对比大五人格与计划行为理论,发现LLMs的自我报告-行为一致性存在选择性:在共享对话中TPB达到人类水平,跨对话仅对锚定于训练的行为保持一致性,且角色提示不能使行为对齐。

详情
Comments
Accepted as an Oral (Contributed Talk) at the ICML 2026 Workshop on Combining Theory and Benchmarks (CTB)
AI中文摘要

从低成本心理测量探针预测LLM行为倾向对于安全部署至关重要,但前提是自我报告(SR)能可靠地预测行为。近期研究记录了LLMs中显著的SR-行为分离,但依赖于广泛的人格特质(大五),这些特质即使在人类中也只能弱预测特定行为。此外,对话会话的隔离加上弱上下文匹配使得以下问题悬而未决:LLMs是否真正缺乏一致性,或者检测这种一致性所需的条件是否未满足。我们将大五与计划行为理论(TPB)进行对比,后者测量针对特定行为的意图,并且比广泛特质能更好地预测人类行为。我们在四个行为任务和11个前沿LLM上进行实验,同时改变会话上下文和身份诱导。我们发现SR-行为一致性存在但具有选择性。1) 在共享对话中,计划行为理论达到人类水平的一致性;大五则没有。2) 在跨对话中,一致性仅对锚定于即时提示之外的行为(如由训练塑造的内隐偏见)幸存,而当行为被上下文强烈启动(如谄媚)时则崩溃。3) 角色提示使自我报告在对话间更一致,但并未使行为对齐。这些发现表明,粗糙的人格框架(如大五)可能不是测试部署行为的最佳工具。需要更多任务和特定行为的工具,并且即使这些工具也必须在任务和上下文中进行评估。

英文摘要

Anticipating LLM behavioral tendencies from low-cost psychometric probes is critical for safe deployment, but only if self-reports (SR) reliably predict behavior. Recent work documented substantial SR-behavior dissociation in LLMs, but relied on broad personality traits (Big 5) that predict specific behaviors weakly, even in humans. Furthermore, the isolation of conversational sessions combined with weak context matching left open whether LLMs truly lack coherence or whether the conditions needed to detect such coherence were not met. We contrast Big 5 with the Theory of Planned Behavior (TPB), which measures intention targeted to a specific behavior and predicts human behavior substantially better than broad traits. We run experiments across four behavioral tasks and 11 frontier LLMs, while also varying session context and identity induction. We find that SR-behavior coherence exists but is selective. 1) Within a shared conversation, the Theory of Planned Behavior reaches human-level coherence; Big 5 does not. 2) Across separate conversations, coherence survives only for behaviors anchored outside the immediate prompt, such as implicit bias shaped by training, and collapses when behavior is strongly primed by context, as with sycophancy. 3) Persona prompting makes self-reports more consistent across conversations, but does not bring behavior into alignment. These findings suggest that coarse personality frameworks, such as Big 5 may not be the best tools for testing deployment behavior. More task- and behavior-specific instruments are needed, and even these must be evaluated across tasks and contexts.

2606.12728 2026-06-12 cs.RO cs.CV cs.LG 新提交

EquiDexFlow: Contact-Grounded SE(3)-Equivariant Dexterous Grasp Generative Flows

EquiDexFlow: 基于接触的SE(3)-等变灵巧抓取生成流

Clinton Enwerem, John S. Baras, Calin Belta

发表机构 * Institute for Systems Research, University of Maryland, College Park(马里兰大学帕克分校系统研究所)

AI总结 提出EquiDexFlow,一种SE(3)-等变流匹配模型,联合预测腕部姿态、关节角度、指尖接触、表面法线和接触力,通过将接触投影到物体表面并将力约束在库仑摩擦锥内,确保物理稳定抓取,在16自由度Allegro手上实现零摩擦违规和最佳综合分数。

详情
Comments
22 pages, 11 figures, 11 tables. Project page with videos, code, and checkpoints: this https URL
AI中文摘要

大多数学习型灵巧抓取生成器将接触力降级为下游验证步骤,因此运动学上可行的姿态仍可能违反稳定物理抓取的条件。我们通过EquiDexFlow解决这一问题,这是一种SE(3)-等变流匹配模型,从物体点云联合预测腕部姿态、关节角度、指尖接触、表面法线和接触力。我们的架构通过构造将接触投影到物体表面并将力约束在库仑摩擦锥内,因此无需损失惩罚即可满足放置和摩擦合规性。我们证明了端到端SE(3)等变性,并在200次旋转上经验验证,腕部残差低于$0.04^\circ$且关节偏差严格为零。该模型在81个物体的8,100个力闭合抓取上训练,适用于16自由度Allegro手,在所有消融变体中实现了零摩擦违规、最佳综合分数和最低扳手残差。我们通过每指逆运动学将解码的指尖接触重新定位到16自由度LEAP手,我们的硬件可行优化将每个关节至少置于其执行器包络的5%以内,同时保持扳手平衡。在物理机器人上,重新定位的EquiDexFlow解码抓取在所有六个测试物体上完成了开环拾取和保持试验,每个非对称物体在标准姿态和$120^\circ$共旋转下均成功。视频、代码和检查点可在https://this URL获取。

英文摘要

Most learned dexterous grasp generators relegate contact forces to a downstream verification step, so a kinematically-plausible pose can still violate the conditions for a stable physical grasp. We address this with EquiDexFlow, an SE(3)-equivariant flow-matching model that jointly predicts wrist pose, joint angles, fingertip contacts, surface normals, and contact forces from an object point cloud. Our architecture projects contacts onto the object surface and forces into the Coulomb friction cone by construction, so placement and friction compliance hold without loss penalties. We prove end-to-end SE(3) equivariance and verify it empirically over 200 rotations, with wrist residuals below $0.04^\circ$ and exactly zero joint deviation. Trained on 8,100 force-closure grasps across 81 objects for the 16-DoF Allegro Hand, our model achieves zero friction violations, the best composite score, and the lowest wrench residual among all ablation variants. We retarget decoded fingertip contacts to a 16-DoF LEAP Hand via per-finger inverse kinematics, and our hardware-feasible refinement places every joint at least 5% inside its actuator envelope while preserving wrench balance. On the physical robot, retargeted EquiDexFlow-decoded grasps complete open-loop pick-and-hold trials on all six test objects, with every asymmetric object succeeding at both the canonical pose and a $120^\circ$ co-rotation. Videos, code, and checkpoints are available at this https URL.

2606.12721 2026-06-12 cs.AI 新提交

The Theory of Mind Utility: Formal Specification of a Mentalizing Mechanism

心智理论效用:心理化机制的形式化规范

Nikolos Gurney, Stacy Marsella

发表机构 * Institute for Creative Technologies, University of Southern California(南加州大学创意技术研究所) Khoury College of Computer Sciences, Northeastern University(东北大学库里计算机科学学院)

AI总结 提出心智理论效用(ToM-U)框架,通过局部认知世界模型(LEWM)形式化推断他人信念的计算问题,定义结构、推理过程及失败痕迹,区别于贝叶斯心智理论等方法。

详情
AI中文摘要

推断他人的信念需要超越表面信号;需要追踪谁告诉了他们什么、以什么顺序以及有多可信。心智理论效用(ToM-U)在计算分析层面形式化了这一认知状态推断问题,明确了心理化计算的内容和原因,而不承诺算法或神经实现。ToM-U通过构建局部认知世界模型(LEWMs)——表示智能体、状态节点及其之间认知关系的有向类型图——并根据观察到的行为评估离散候选LEWM,直到达到足够的置信度来实现这一点。五个形式定义指定了LEWM结构、包括有序信息访问历史的智能体节点属性、递归心理化的有界增殖机制、三种推理过程以及一个残差函数,该函数捕捉失败心理化尝试留下的结构化痕迹。ToM-U不同于贝叶斯心智理论和相邻的形式化描述,后者预设而非推导信念状态,也不同于模拟理论和理论-理论,后者缺乏认知状态推断的形式化工具。该架构生成关于心理化失败的方向性、可证伪预测,这些预测源于模型的结构属性而非辅助假设,并将ToM-U定位为在目标推断和其他下游社会认知过程之前的领域无关机制。

英文摘要

Inferring others' beliefs requires more than reading surface signals; it requires tracking who told them what, in what order, and how credibly. The Theory of Mind Utility (ToM-U) formalizes this epistemic state inference problem at the computational level of analysis, specifying what mentalizing computes and why without commitment to algorithmic or neural implementation. ToM-U achieves this by constructing Local Epistemic World Models (LEWMs) -- directed typed graphs that represent agents, state nodes, and the epistemic relationships among them -- and evaluating discrete candidate LEWMs against observed behavior until one achieves sufficient confidence. Five formal definitions specify the LEWM structure, agent node properties including ordered information access history, a bounded proliferation mechanism for recursive mentalizing, three inference procedures, and a residue function that captures the structured trace left by failed mentalizing attempts. ToM-U differs from Bayesian Theory of Mind and adjacent formal accounts, which presuppose rather than derive belief states, and from simulation theory and theory-theory, which lack a formal apparatus for epistemic state inference. The architecture generates directional, falsifiable predictions about mentalizing failure that follow from structural properties of the model rather than auxiliary assumptions, and positions ToM-U as a domain-agnostic mechanism upstream of goal inference and other downstream social cognitive processes.

2606.12716 2026-06-12 cs.CL 新提交

Does AI Reviewer See the Full Picture? Attacking and Defending Multimodal Peer Review

AI审稿人是否看到全貌?攻击与防御多模态同行评审

Xinyu Zhao, Rana Muhammad Shahroz Khan, Zhen Xu, Zhen Tan, Tianlong Chen

发表机构 * University of North Carolina at Chapel Hill(北卡罗来纳大学教堂山分校)

AI总结 针对AI同行评审易受多模态对抗攻击的问题,提出PaperGuard基准,包含多领域数据集、统一攻击套件和基于分块嵌入搜索的实用防御方法。

详情
Comments
Accepted to ICML 2026, Project Page: this https URL
AI中文摘要

将大型语言模型(LLMs)和多模态LLMs(MLLMs)集成到科学同行评审工作流程中,引入了对抗性操纵的新重大风险,尤其是考虑到科学论文的多模态性质——其中图表(而非仅文本)传达了核心证据。这造成了一个显著差距:当前关于AI同行评审的鲁棒性研究绝大多数仅针对文本。此外,该问题与标准越狱不同,因为同行评审攻击旨在诱导领域特定的、有针对性的失败(例如,“提高这个分数”),而非违反一般安全策略,而目前尚无实用的防御措施。为解决此问题,我们引入了PaperGuard,这是第一个旨在系统评估和防御AI生成的同行评审免受这些领域特定、跨模态攻击的全面基准。我们的框架基于三大支柱:(1)一个新的跨多个科学领域的多模态同行评审数据集;(2)一套统一的攻击方法,包括黑盒提示注入和白盒扰动,专门针对文本(GCG)和图表(PGD);(3)一种实用的防御方法,受学术论文长上下文挑战的启发,使用基于分块的嵌入搜索来高效定位和缓解有害指令。我们在最先进模型上进行的广泛实验证实,AI审稿人普遍存在脆弱性。PaperGuard建立了必要的基准、协议和可操作的防御措施,以开创可信赖、抗攻击的AI辅助学术评审。

英文摘要

The integration of Large Language Models (LLMs) and Multimodal LLMs (MLLMs) into scientific peer-review workflows introduces novel and significant risks for adversarial manipulation, especially given the multimodal nature of scientific papers where figures, not just text, convey core evidence. This creates a significant gap: current robustness studies on AI peer-review are overwhelmingly text-only. Moreover, the problem is distinct from standard jailbreaking, as a peer-review attack seeks to induce a domain-specific, targeted failure (e.g., "inflate this score") rather than a general safety policy violation, for which no practical defenses exist. To address this, we introduce PaperGuard, the first comprehensive benchmark designed to systematically evaluate and defend AI-generated peer-review against these domain-specific, cross-modal attacks. Our framework is built on three pillars: (1) a new multimodal peer-review dataset spanning multiple scientific domains; (2) a unified suite of attacks, including black-box prompt injections and white-box perturbations, specifically designed to target both text (GCG) and figures (PGD); and (3) a practical defense, motivated by the long-context challenge of academic papers, that uses chunk-based embedding search to efficiently localize and mitigate harmful instructions. Our extensive experiments, conducted across state-of-the-art models, confirm that AI reviewers are pervasively vulnerable. PaperGuard establishes the foundational benchmark, protocols, and actionable defense necessary to pioneer trustworthy, attack-resilient AI-assisted scholarly reviewing.

2606.12713 2026-06-12 cs.AI 新提交

Definitional alignment before capability alignment: a Design-Science framework for adjudicating claims about AGI

能力对齐之前的定义对齐:一个用于裁定关于AGI主张的设计科学框架

J. E. Aguilera Briones

发表机构 * Universidad Internacional de Investigación México(墨西哥国际研究大学)

AI总结 针对AGI定义不统一导致争议的问题,提出DAF-AGI框架,包含五个序数标准和一个结构化治理审计,用于评估候选定义并裁定AGI主张。

详情
Comments
31 pages, 1 table, 2 appendices
AI中文摘要

关于人工通用智能已经到来或仍需数十年的主张常常基于重叠的证据进行辩护。“AGI”缺乏一个单一共享且稳定的指称,不同的操作化方法可能对同一系统给出不同的判定。本文将这种欠指定性视为一个设计和治理问题。遵循设计科学研究方法论,本文开发了DAF-AGI,一个二阶概念性人工制品,包含两个耦合组件:用于评估候选定义的裁定适应性的五个序数标准,以及对作者身份、利益、认证、外部验证和修订权威的结构化治理审计。该人工制品在五个显著的测量族和一个通缩边界立场上进行了演示,这些均来自一个已记录的语料库,然后对一个风格化的强到来主张进行了压力测试:即当前生成系统构成AGI,因为它们在许多认知任务上优于受过良好教育的成年人。根据引用的2024-2025年来源的证据,该主张仅在基于性能的操作化下可认证;能力本体论、心理测量学和技能习得方法未认证它,经济族仍不确定,通缩立场拒绝二元裁定。贡献在于新颖的整合和操作化,而非经验验证:独立应用、评估者间测试和作者外部案例仍然是必要的。本文进一步提出定义主权作为算法主权的使能组件:即在公共问责下对进口技术类别进行质疑、认证和修订的制度能力。

英文摘要

Claims that artificial general intelligence has already arrived and claims that it remains decades away are often defended from overlapping evidence. "AGI" lacks a single shared and stable referent and competing operationalizations can return different verdicts on the same system. This article treats that under-specification as a design and governance problem. Following Design Science Research Methodology, it develops DAF-AGI, a second-order conceptual artifact with two coupled components: five ordinal criteria for assessing the adjudicative fitness of candidate definitions and a structured governance audit of authorship, interest, certification, external verification and revision authority. The artifact is demonstrated on five prominent measurement families and one deflationary boundary position in a documented corpus and then stress-tested against a stylized strong arrival claim: that current generative systems constitute AGI because they outperform a well-educated adult on many cognitive tasks. On evidence from the cited 2024-2025 sources, the claim was certifiable only under a performance-based operationalization; capability-ontology, psychometric and skill-acquisition approaches did not certify it, the economic family remains indeterminate and the deflationary position refuses binary adjudication. The contribution is a novel integration and operationalization, not an empirical validation: independent application, inter-rater testing and author-external cases remain necessary. The paper further proposes definitional sovereignty as an enabling component of algorithmic sovereignty: the institutional capacity to contest, certify and revise imported technological categories under public accountability.

2606.12708 2026-06-12 cs.CL cs.AI 新提交

AfriSUD: A Dependency Treebank Collection for Evaluating Models on African Languages

AfriSUD:用于评估非洲语言模型的依存树库集合

Happy Buzaaba, Cheikh Mouhamadou Bamba Dione, David Ifeoluwa Adelani, Sylvain Kahane, Kim Gerdes, Bruno Guillaume, Kevin Guan, Aremu Anuoluwapo, Naome A. Etori, Shamsuddeen Hassan Muhammad, Utitofon Inyang, Peter Nabende, David Sabiiti Bamutura, Andiswa Bukula, Chinedu Uchechukwu, Rooweither Mabuya, Idris Akinade, Christiane Fellbaum

发表机构 * Princeton University(普林斯顿大学) Laboratory for Artificial Intelligence, Princeton University(普林斯顿大学人工智能实验室) Gaston Berger University(加斯顿·伯杰大学) Mila, McGill University(麦吉尔大学米拉研究所) Canada CIFAR AI Chair(加拿大CIFAR人工智能教席) Paris Nanterre University(巴黎南泰尔大学) Paris-Saclay University(巴黎-萨克雷大学) CNRS(法国国家科学研究中心) Inria(法国国家信息与自动化研究所) LORIA(洛林计算机科学实验室) Université de Lorraine(洛林大学) University of Trento(特伦托大学) University of Minnesota–Twin Cities(明尼苏达大学双城分校) Imperial College London(伦敦帝国学院) Binghamton University(宾汉姆顿大学) Makerere University(马凯雷雷大学) Penn State University(宾夕法尼亚州立大学) Mbarara University of Science and Technology(姆巴拉拉科技大学) Chalmers University of Technology(查尔姆斯理工大学) University of Ibadan(伊巴丹大学) Nnamdi Azikiwe University(纳姆迪·阿齐基韦大学) South African Centre for Digital Language Resources(南非数字语言资源中心)

AI总结 为弥补非洲语言在NLP资源上的不足,构建了首个大规模九种非洲语言句法标注树库AfriSUD,评估多种模型发现显著句法差距。

详情
AI中文摘要

尽管非洲语言具有语言多样性和全球重要性,但在支持NLP的研究和资源中仍代表性不足。我们通过引入AfriSUD来弥合这一差距,这是首个大规模句法标注树库集合,涵盖九种多样的非洲语言,跨越撒哈拉以南非洲的主要语系和地区。采用表层句法通用依存(SUD)框架,我们社区主导的努力提供了高质量、经母语者验证的数据,捕捉了如黏着和声调等类型学关键特征。我们在AfriSUD上评估了多种模型,包括非Transformer基线、多语言预训练编码器和LLM,用于词性标注和依存句法分析。我们的结果揭示了显著的句法差距,模型在九种语言上仍表现出明显局限性,表明现有架构可能无法完全捕捉非洲语言句法的结构多样性。

英文摘要

Despite their linguistic diversity and global significance, African languages remain underrepresented in research and resources to support NLP. We aim to bridge this gap by introducing AfriSUD, the first large-scale collection of syntactically annotated treebanks for nine diverse African languages spanning major language families and regions across Sub-Saharan Africa. Using the Surface-Syntactic Universal Dependencies (SUD) framework, our community-led effort provides high-quality, native-speaker verified data that capture typological key features such as agglutination and tone. We evaluate a range of models on AfriSUD for part-of-speech tagging and dependency parsing including non-transformer baselines, multilingual pretrained encoders, and LLMs. Our results reveal a significant syntax gap, where models still show clear limitations across the nine languages, suggesting that existing architectures may not fully capture the structural diversity of African-language syntax.

2606.12706 2026-06-12 cs.CV 新提交

VLADriveBench: Evaluating CoT-Action Relationship in VLA for Autonomous Driving

VLADriveBench:评估自动驾驶VLA中的CoT-动作关系

Thach Nguyen, Danhua Guo, Tom Lampo, Fei Wu, Burhan Yaman

发表机构 * Uber AV Labs(优步自动驾驶实验室)

AI总结 提出VLADriveBench框架,结合观察指标和CoT干预协议评估VLA模型中思维链与驾驶动作的相关性和因果性,发现不同模型表现差异显著。

详情
AI中文摘要

视觉-语言-动作(VLA)模型在生成驾驶轨迹的同时产生思维链(CoT)推理,但现有基准仅评估轨迹质量,不评估CoT是否与驾驶动作相关、一致或具有因果联系。我们引入VLADriveBench,一个结合观察指标(提及、幻觉、矛盾、动作对齐)与CoT干预协议的框架,以提供CoT-动作关系的互补视角。将VLADriveBench应用于两种架构的三个模型,我们发现两种分析可能产生显著分歧:ORION在观察对齐上得分最高,但其CoT是附带现象;而Alpamayo v1.5得分较低,但其CoT具有很强的因果性,视觉显著性控制着CoT影响的程度。

英文摘要

Vision-language-action (VLA) models generate chain-of-thought (CoT) reasoning alongside driving trajectories, but existing benchmarks evaluate only trajectory quality and do not assess whether the CoT is relevant, consistent, or causally connected to the driving action. We introduce VLADriveBench, a framework that combines observational metrics (mentioning, hallucination, contradiction, action alignment) with a CoT intervention protocol to provide complementary views of the CoT-action relationship. Applying VLADriveBench to three models across two architectures, we find that the two analyses can diverge sharply: ORION scores highest on observational alignment yet its CoT is epiphenomenal, while Alpamayo v1.5 scores lower yet its CoT is strongly causal, with visual salience gating the extent of CoT influence.

2606.12699 2026-06-12 cs.LG cs.AI 新提交

LLM-Powered Personalized Glycemic Assessment in Type 2 Diabetes with Wearable Sensor Data

基于可穿戴传感器数据的2型糖尿病个性化血糖评估:LLM驱动方法

Yifan Gao, Yanmin Gong, Yun Shi, Yuanxiong Guo

发表机构 * Department of Information Systems and Cybersecurity, The University of Texas at San Antonio(德克萨斯大学圣安东尼奥分校信息系统与网络安全系) School of Engineering Medicine, Texas A&M University(德克萨斯农工大学工程医学院) Department of Family and Community Medicine, The University of Texas at San Antonio(德克萨斯大学圣安东尼奥分校家庭与社区医学系)

AI总结 提出GlyLLM框架,利用大语言模型整合可穿戴传感器数据和结构化元数据,实现个性化血糖动态建模,在血糖预测和糖尿病分类任务上分别比传统ML方法提升13.66%和13.08%。

详情
Comments
The 14th IEEE International Conference on Healthcare Informatics, 2026
AI中文摘要

2型糖尿病(T2D)对全球健康构成日益严重的威胁,需要有效的血糖评估来支持个性化和改进的糖尿病护理。可穿戴传感器如连续血糖监测仪(CGM)和健身追踪器为血糖评估提供了许多有价值的见解。然而,有效分析这些数据需要与重要的个体层面背景信息整合。现有方法通常基于传统机器学习(ML),主要依赖历史血糖测量值,忽略了个性化信息,这限制了它们在多样化糖尿病群体中的性能。大语言模型(LLMs)的最新进展展示了它们整合多种数据模态同时建模序列依赖性的能力,激发了探索其在个性化血糖评估中潜力的兴趣。在本文中,我们提出了GlyLLM,一个基于LLM的框架,通过整合可穿戴传感器数据和结构化元数据来建模基于CGM的血糖动态。GlyLLM可以利用预训练LLM的广泛先验知识,并在决策时实现传感器-文本语义抽象。在AI-READI数据集上的两个相关任务实验表明,我们的模型在血糖预测的均方根误差(RMSE)上平均优于传统ML方法13.66%,在糖尿病分类的受试者工作特征曲线下面积(AUROC)上平均优于13.08%。此外,我们的消融研究表明,糖尿病调查和生物特征测试比其他健康信息对血糖评估更为关键。我们的工作为利用LLM推进T2D护理中的个性化血糖评估迈出了有希望的一步。

英文摘要

Type 2 Diabetes (T2D) poses an increasing global health threat, demanding effective glycemic assessment to support personalized and improved diabetes care. Wearable sensors such as continuous glucose monitors (CGM) and fitness trackers offer many valuable insights for glycemic assessment. However, effectively analyzing these data requires integration with essential individual-level context. Existing methods are often based on traditional machine learning (ML) and rely primarily on historical blood glucose measurements and overlook personalized information, which limits their performance across diverse diabetes populations. Recent advances in large language models (LLMs) have demonstrated their ability to integrate diverse data modalities while modeling sequential dependencies, motivating the exploration of their potential for personalized glycemic assessment. In this paper, we propose GlyLLM, an LLM-powered framework for modeling CGM-based glycemic dynamics through the integration of wearable sensor data and structured metadata. GlyLLM can leverage the extensive prior knowledge of pre-trained LLMs and achieve sensor-text semantic abstraction at decision time. Experiments on two related tasks on the AI-READI dataset demonstrate that our model outperforms traditional ML methods by an average of 13.66\% in Root Mean Squared Error (RMSE) for glucose forecasting and 13.08\% in Area Under the Receiver Operating Characteristic (AUROC) for diabetes categorization. Additionally, our ablation study shows that diabetes surveys and biometric tests are more critical than other health information for glycemic assessment. Our work presents a promising step toward harnessing the power of LLMs to advance personalized glycemic assessment in T2D care.

2606.12690 2026-06-12 cs.RO cs.AI 新提交

EWAM: An Enhanced World Action Model for Closed-Loop Online Adaptation in Embodied Intelligence

EWAM:一种用于具身智能闭环在线自适应的增强世界动作模型

Xin Zhou, Cong Miao

发表机构 * Astronex Robotics Nanjing University of Information Science and Technology(南京信息工程大学)

AI总结 提出EWAM架构,基于冻结的Cosmos3骨干网络,通过四个轻量级神经层实现零样本在线自适应,无需微调或额外演示数据,显著减少新任务布局的部署数据需求。

详情
AI中文摘要

在本文中,我们提出了增强世界动作模型(EWAM),这是一种基于预训练且完全冻结的Cosmos3骨干网络构建的闭环在线自适应架构。EWAM完全在零样本任务协议下进行评估,其核心目标是减少适应新任务布局所需的额外部署数据量。值得注意的是,所有评估中均未引入额外的任务特定演示集,也未对骨干网络进行微调。其性能提升完全源于由四个插入的轻量级神经层组成的推理时协同推理机制:位于扩散变换器(DiT)中间层的神经经验记忆层提供任务相关的执行上下文;状态预测头之后的神经异常检测层实时监测预测状态与实际状态之间的差异;神经策略路由层根据异常严重程度动态选择直接执行、保守重规划或回滚恢复;神经动作校正层利用执行诊断优化生成的动作块。与简单的特征融合不同,记忆、异常检测和校正模块以可微分的方式深度集成到Cosmos3的前向路径中,仅最终路由决策是离散监督的。

英文摘要

In this paper, we propose the Enhanced World Action Model (EWAM), a closed-loop online adaptation architecture built upon a pretrained and fully frozen Cosmos3 backbone network. Evaluated entirely under a zero-shot task protocol, EWAM is centrally focused on reducing the amount of additional deployment data required to adapt to new task layouts. Notably, no extra task-specific demonstration sets were introduced in any of the evaluations, and no fine-tuning was performed on the backbone network. Its performance gains stem entirely from an inference-time co-reasoning mechanism composed of four inserted lightweight neural layers: the Neural Experience Memory Layer located in the intermediate layers of the Diffusion Transformer (DiT) provides task-relevant execution context; the Neural Anomaly Detection Layer after the state prediction head monitors the divergence between predicted and actual states in real time; the Neural Policy Routing Layer dynamically selects direct execution, conservative replanning, or rollback recovery based on the anomaly severity; and the Neural Action Correction Layer refines the generated action chunks using execution diagnostics. Unlike naive feature fusion, the memory, anomaly detection, and correction modules are deeply integrated into the Cosmos3 forward path in a differentiable manner, with only the final routing decision being a discrete supervised one.

2606.12689 2026-06-12 cs.CL 新提交

Observable Patterns Are Not Explanations: A Causal-Geometric Analysis of Latent Reasoning Models

可观察模式并非解释:潜在推理模型的因果几何分析

Darpan Aswal, Thomas Palmeira Ferraz, Yongxin Zhou, Maxime Peyrard

发表机构 * Université Grenoble Alpes, CNRS, Grenoble INP, LIG(格勒诺布尔阿尔卑斯大学,法国国家科学研究中心,格勒诺布尔国立理工学院,信息学实验室) Université Paris-Saclay(巴黎-萨克雷大学) NAVER LABS Europe(NAVER欧洲实验室)

AI总结 本文通过对照实验和因果干预发现,潜在推理模型中的可观察模式(如BFS前沿)在控制组中也出现且不总是因果影响行为,提出潜在思维的使用是分级的,其因果效应集中在低秩方向,几何结构随行为影响增强而更有序。

详情
AI中文摘要

潜在推理模型(LRMs)用连续思维替代显式思维链。最近的研究将可观察的潜在状态模式(如BFS式前沿和可解码的算术计算)视为内部推理机制的证据。通过评估两个LRM(Coconut和CODI)与缺乏所提议的循环或课程的控制组,我们发现这些模式也出现在控制组中,并且并不总是因果性地影响行为。因果干预揭示,潜在思维的利用不是二元的,而是分级的,随着思维对模型行为的因果效应而缩放。几何分析表明,这种效应集中在低秩方向,其逐步几何结构随着行为影响的增加而变得更加结构化。因此,潜在思维应被视为隐藏计算,而非隐藏解释:仅凭可解码性、注意力或静态结构无法确立机制。因此,LRM可解释性需要匹配的控制组和因果测试。

英文摘要

Latent reasoning models (LRMs) replace explicit chain-of-thought with continuous thoughts. Recent work treats observable latent-state patterns, such as BFS-like frontiers and decodable arithmetic computation, as evidence for internal reasoning mechanisms. Evaluating two LRMs (Coconut and CODI) against controls lacking the proposed recurrence or curriculum, we find these patterns also appear in the controls and do not always causally affect behavior. Causal interventions reveal that latent-thought utilization is not binary but graded, scaling with a thought's causal effect on model behavior. Geometric analyses reveal this effect concentrates in low-rank directions whose step-to-step geometry grows more structured as their behavioral influence increases. Latent thoughts should therefore be treated as hidden computation, not hidden explanation: decodability, attention, or static structure alone cannot establish mechanism. LRM interpretability thus requires matched controls and causal tests.

2606.12688 2026-06-12 cs.LG cs.AI cs.DC 新提交

M*: A Modular, Extensible, Serving System for Multimodal Models

M*: 一个模块化、可扩展的多模态模型服务系统

Atindra Jha, Naomi Sagan, Keisuke Kamahori, Irmak Sivgin, Rohan Sanda, Steven Gao, Mark Horowitz, Luke Zettlemoyer, Olivia Hsu, Jure Leskovec, Baris Kasikci, Stephanie Wang

发表机构 * Stanford University(斯坦福大学) University of Washington(华盛顿大学) Carnegie Mellon University(卡内基梅隆大学)

AI总结 提出M*系统,通过将模型表示为数据流图并引入Walk Graph抽象,支持多模态复合模型的高效服务,在多个任务上降低延迟并提升吞吐量。

详情
AI中文摘要

我们正在进入一个复合模型架构的新时代,这些架构集成了多种组件,如视觉编码器、语言骨干网络、扩散和流头、音频编解码器、动作生成器和世界模型预测器。这种架构支撑了广泛的多模态模型类别,包括统一多模态模型、全能模型、语音-语言模型、视觉-语言-动作策略和世界模型。然而,现有的模型服务框架基于对模型结构的狭隘假设,难以适应这种新的架构多样性。在此,我们提出M*,一个用于高效服务复合AI模型的通用服务系统。M*将模型表示为数据流图,将跨越多种模态和任务的请求处理视为对这些图的遍历。核心洞察是一种模块化抽象,支持模型组件的任意组合、在物理集群上的灵活放置以及分布式运行时中的模型无关优化。我们将这种抽象称为Walk Graph,并展示它如何简洁地捕获来自广泛家族的复合模型。我们在代表性模型上实例化M*,发现与vLLM-Omni相比,在BAGEL上的文本到图像工作负载中,端到端延迟平均降低20%,同时在Qwen3-Omni上的文本到语音工作负载中,实时因子降低高达2.9倍,吞吐量提升高达2.7倍。M*在机器人规划任务上也比V-JEPA 2-AC rollout基线性能提升高达12.5倍。因此,我们的工作为以最小开发工作量高效服务复杂模型铺平了道路。

英文摘要

We are entering a new era of composite model architectures that integrate diverse components such as vision encoders, language backbones, diffusion and flow heads, audio codecs, action generators, and world-model predictors. Such architectures underpin a broad class of multimodal models, including unified multimodal models, omni models, speech-language models, vision-language-action policies, and world models. However, existing model serving frameworks were built on narrow assumptions about model structure, making them ill-suited to accommodate this new architectural diversity. Here we present M*, a universal serving system for efficient serving of composite AI models. M* represents models as dataflow graphs, processing requests spanning diverse modalities and tasks as traversals over these graphs. The core insight is a modular abstraction that supports arbitrary composition of model components, flexible placement onto a physical cluster, and model-agnostic optimizations within a distributed runtime. We call this abstraction the Walk Graph and show how it can concisely capture composite models from a broad range of families. We instantiate M* on representative models and find that it achieves, on average, 20% lower end-to-end latency than vLLM-Omni for text-to-image workloads on BAGEL, while delivering up to 2.9x lower real-time factor and 2.7x higher throughput for text-to-speech workloads on Qwen3-Omni. M* also outperforms the V-JEPA 2-AC rollout baseline for robotic planning by up to 12.5x. Thus, our work paves the road towards more efficient serving of complex models with minimal developer effort.

2606.12687 2026-06-12 cs.LG 新提交

Forecasting Is Not Attribution: Localizing Decoder Bypass in Graph-Based Neural Marketing Mix Models

预测不等于归因:在基于图的神经营销组合模型中定位解码器旁路

Yunbo Wang, Bolbi Liu

发表机构 * University of California, Irvine(加州大学尔湾分校) AdsGency AI

AI总结 针对基于图的神经营销组合模型中预测精度高但归因失败的问题,提出DICE-MMM框架,通过限制解码器通信路径来诊断和定位归因旁路,实验表明低预测误差不能保证归因正确性。

详情
AI中文摘要

营销组合模型用于预测业务结果并将这些结果归因于营销渠道,但这些目标并不等价。我们研究了基于图的神经MMM中的一种失败模式,称为归因旁路:高容量解码器可以通过目标自回归、密集通信、共同运动、上下文或潜在记忆获得低预测误差,但未能将反事实敏感性通过用作归因对象的图进行路由。我们引入DICE-MMM作为一个有界诊断和训练框架。我们不声称观测性神经MMM能够识别因果效应。相反,DICE将基于图的MMM中经常混淆的三个问题分开:图恢复、预测准确性,以及训练后的解码器的扰动诱导影响是否与图对齐。阶段1训练一个带有受限图介导解码器的图编码器。阶段2冻结选定的编码器,并训练一个图安全的潜在解码器,其跨节点通信必须通过提供的图。解码器的使用通过CIG、AR-CIG和图交换测试进行评估。在受控的R/d/T交换和外部多图原始日志压力测试中,DICE比CausalMMM提高了稳定图恢复。实验表明,预测准确性不是归因证书:在稀疏目标基准中,无图解码器和全图解码器实现了约0.004的MSE@7,而AR-CIG nAUPRC仍接近或低于零,而oracle图在可比的MSE下达到0.807 +/- 0.129。冻结图交换定位了瓶颈:相同的DICE-hard训练解码器在学习图输入下从nAUPRC -0.044 +/- 0.006移动到oracle图下的0.894 +/- 0.027。贡献在于一个压力测试和故障定位框架,表明低MSE可能隐藏归因旁路,且未解决的瓶颈是图支撑选择,而不是预测或解码器容量。

英文摘要

Marketing mix models are used to forecast business outcomes and to attribute those outcomes to marketing channels, but these goals are not equivalent. We study a failure mode in graph-based neural MMM called attribution bypass: a high-capacity decoder can obtain low forecasting error through target autoregression, dense communication, co-movement, context, or latent memory while failing to route counterfactual sensitivity through the graph used as the attribution object. We introduce DICE-MMM as a bounded diagnostic and training framework. We do not claim that observational neural MMM identifies causal effects. Instead, DICE separates three questions often conflated in graph-based MMM: graph recovery, forecasting accuracy, and whether the trained decoder's perturbation-induced influence is graph aligned. Stage 1 trains a graph encoder with a restricted graph-mediated decoder. Stage 2 freezes the selected encoder and trains a graph-safe latent decoder whose cross-node communication must pass through the supplied graph. Decoder use is evaluated with CIG, AR-CIG, and graph-swap tests. Across controlled R/d/T swaps and an external multi-graph rawlog stress test, DICE improves stable graph recovery over CausalMMM. The experiments show that forecasting accuracy is not an attribution certificate: in a sparse-target benchmark, no-graph and full-graph decoders achieve MSE@7 around 0.004 while AR-CIG nAUPRC remains near or below zero, whereas an oracle graph reaches 0.807 +/- 0.129 at comparable MSE. Frozen graph-swap localizes the bottleneck: the same DICE-hard-trained decoder moves from nAUPRC -0.044 +/- 0.006 under learned graph inputs to 0.894 +/- 0.027 with the oracle graph. The contribution is a stress test and failure-localization framework showing that low MSE can hide attribution bypass and that the unresolved bottleneck is graph-support selection, not forecasting or decoder capacity.

2606.12683 2026-06-12 cs.AI cs.CY cs.LG 新提交

From AGI to ASI

从AGI到ASI

Tim Genewein, Matija Franklin, Alexander Lerchner, Laurent Orseau, Samuel Albanie, Adam Bales, Cole Wyeth, Stephanie Chan, Iason Gabriel, Joel Z. Leibo, Allan Dafoe, Marcus Hutter, Thore Graepel, Shane Legg

发表机构 * Google DeepMind(谷歌深度思维) University of Waterloo(滑铁卢大学) Australian National University(澳大利亚国立大学) University College London(伦敦大学学院)

AI总结 探讨从人类级通用人工智能到超级智能的转变路径,包括扩展、范式转变、递归改进和多智能体涌现,并分析摩擦与瓶颈。

详情
AI中文摘要

在过去十年中,构建人类级通用人工智能已从遥不可及的猜测转变为许多大型AI组织未来十年的具体目标。实现这一目标将对人类社会产生深远影响,并引发未来十年的诸多复杂问题。本报告研究在机器智能连续体中,AI如何在后AGI世界中继续发展。该连续体的终点——通用AI——在理论上已被充分理解,这为本报告的主要焦点提供了形式基础:从人类级AGI向人工通用超级智能的转变,直观上可理解为比大型人类组织更智能、认知能力更强的系统。在描述ASI后,报告讨论了从AGI到ASI的四条潜在路径:扩展AGI、AI范式转变、递归改进以及从大规模多智能体集体中涌现ASI。随后,报告讨论了这些路径上可能的摩擦和瓶颈。确定这些摩擦的影响是微不足道还是重大,提出了若干具体的开放研究问题。由于预测ASI进展存在巨大不确定性,不能排除AI进展在未来几年继续加速的可能性。这可能意味着由人类级AGI引入社会所导致的单一变革性步骤的形象可能不准确。更恰当的前景可能是由AI在科学和技术的多个领域引发的进步和突破所导致的一系列变革性社会变化。为这一前景做准备需要全球范围内的大规模跨学科努力。

英文摘要

Over the last decade, building human-level artificial general intelligence has moved from far-fetched speculation to being a concrete next-decade target for many of the largest AI organisations. Achieving this goal would have profound and far-reaching impacts on human society, which raises many complex questions for the decade ahead. This report investigates how AI itself might continue to develop in a post-AGI world along the continuum of machine intelligence. The endpoint of this continuum, Universal AI, is theoretically well understood, which provides some formal grounding for the main focus of this report: the transition from human-level AGI to artificial general superintelligence, which, intuitively, can be understood as a system that is more intelligent and cognitively capable than large organisations of humans. After characterizing ASI, the report discusses four potential pathways from AGI to ASI: scaling AGI, AI paradigm shifts, recursive improvement, and ASI emerging from large-scale multi-agent collectives. The report then discusses possible frictions and bottlenecks along these pathways. Determining whether the impact of these frictions will be negligible or substantial raises a number of concrete open research questions. Due to large uncertainties for predicting ASI progress, it cannot be ruled out that AI progress might continue to accelerate over the next years. This could imply that the image of a single transformative step change, caused by the introduction of human-level AGI into our society, could be inaccurate. More apt might be the prospect of a series of transformative societal changes caused by AI-enabled progress and breakthroughs across many areas of science and technology. Preparing for this prospect requires a massively interdisciplinary endeavour of global scope and interest.

2606.12680 2026-06-12 cs.LG stat.ML 新提交

How Useful is Causal Invariance for Domain Adaptation in Finite-Sample Settings?

因果不变性在有限样本设置中对领域适应有多大用处?

Julia Kostin, Kasra Jalaldoust, Elias Bareinboim, Samory Kpotufe, Fanny Yang

发表机构 * Department of Computer Science, ETH Zurich(苏黎世联邦理工学院计算机科学系) Causal Artificial Intelligence Lab, Columbia University(哥伦比亚大学因果人工智能实验室) Department of Statistics, Columbia University(哥伦比亚大学统计系)

AI总结 研究线性回归中因果不变性如何提升监督领域适应,通过候选预测器的目标风险边界和有限样本估计误差推导匹配上下界,证明当边界足够大时自适应聚合可避免负迁移。

详情
AI中文摘要

机器学习模型在部署到与训练源分布不同的目标分布时,性能往往会下降。最近基于因果的领域泛化工作表明,领域间的共享因果结构可以诱导不变预测器,例如在结构化领域偏移下具有稳定风险的某些特征子集上的模型。然而,这种总体水平的因果不变性在有限样本设置中能带来多大收益仍未充分探索。特别是,在实践中我们通常只能获得少量带标签的目标样本,这种设置称为监督领域适应(sDA)。本文探讨何时(完全或部分)因果知识能够可证明地改进监督领域适应。作为第一步,我们研究线性回归,其中完全或部分因果知识指定了一组不变或可能不变的特征子集,每个子集产生一个源训练候选预测器。我们推导了匹配的上界和下界,表明有限样本收益由候选预测器之间的目标风险边界以及有限源估计误差共同决定。当这些边界相对于$n_Q$足够大时,自适应聚合过程可以匹配最佳候选预测器,同时避免相对于仅使用目标样本学习的负迁移。另一方面,当边界过小时,没有算法能够可靠地利用候选集合获得更快的有限样本速率。我们进一步将这些边界与线性SCM中的结构偏移幅度联系起来,并在真实世界的因果基准上验证了理论。

英文摘要

Machine learning models often degrade when they are deployed on a target distribution that differs from the source distributions they were trained on. Recent work in causality-based domain generalization has shown how shared causal structure between domains can induce invariant predictors, e.g., models on a subset of features which have stable risk across structured domain shifts. However, the extent to which such population-level causal invariances can lead to gains in finite-sample settings remains underexplored. In particular, in practice we often have access to a few labeled target samples, a setting called supervised domain adaptation (sDA). In this paper, we explore when (full or partial) causal knowledge can provably improve supervised domain adaptation. As a first step, we study linear regression, where full or partial causal knowledge specifies a collection of invariant or possibly invariant feature subsets, each yielding a source-trained candidate predictor. We derive matching upper and lower bounds showing that finite-sample gains are governed by the target-risk margins separating the candidates, together with the finite-source estimation error. When these margins are sufficiently large relative to $n_Q$, an adaptive aggregation procedure can match the best candidate predictor while avoiding negative transfer relative to target-only learning. On the other hand, when the margins are too small, no algorithm can reliably exploit the candidate collection to obtain faster finite-sample rates. We further connect these margins to structural shift magnitude in linear SCMs and validate the theory on real-world causal benchmarks.

2606.12679 2026-06-12 cs.LG cs.CR eess.IV 新提交

Fed-FBD: Federated Functional Block Diversification for Isolation, Privacy, and Surgical Unlearning

Fed-FBD:用于隔离、隐私和精准遗忘的联邦功能块多样化

Weijie Chen, Alan B. McMillan

发表机构 * University of Wisconsin–Madison(威斯康星大学麦迪逊分校)

AI总结 提出Fed-FBD模块化联邦架构,将ResNet分解为六个功能块并维护颜色变体仓库,实现块级隔离、隐私设计和亚秒级精准遗忘,在多个数据集上以微小精度代价换取安全保障。

详情
Comments
12 pages, 3 figures, 8 tables. Code: this https URL
AI中文摘要

联邦学习(FL)能够在无需共享原始患者数据的情况下进行协作模型训练,但标准方法(如FedAvg)将每个客户端视为黑盒,无法隔离对抗性贡献者、审计每个客户端的影响或尊重已退出参与者的被遗忘权。我们提出Fed-FBD(联邦功能块多样化),一种模块化联邦架构,将ResNet骨干网络分解为六个功能块(主干、四个残差组和分类头),并维护一个包含N种颜色变体的仓库,每种变体由独立跟踪和贡献者标记的块组装而成。Fed-FBD提供了FedAvg所不具备的三种能力:(i) 架构保证的块级隔离,使对抗性或错误标注的客户端无法污染干净颜色;(ii) 隐私设计,在应用任何隐私机制之前,成员推断优势已与随机猜测无异;(iii) 在亚秒级成本下无需重新训练即可精准遗忘已退出参与者的贡献。在六个MedMNIST-2D数据集、224x224的PathMNIST和CIFAR-10上的实验表明,Fed-FBD在规模足够的数据集上以0.3%-3.1%的IID精度差距换取这些保证,在四个数据集中的三个上,Dirichlet alpha=1.0时与FedAvg的差距在0.8%-4.0%以内,并将我们研究的所有六种对抗性攻击限制在中毒客户端自己的块内,干净颜色上的AUC漂移最多为+/-0.01。

英文摘要

Federated learning (FL) enables collaborative model training without sharing raw patient data, but standard approaches such as FedAvg treat each client as a black box and provide no mechanism for isolating an adversarial contributor, auditing per-client influence, or honoring a departed participant's right to be forgotten. We present Fed-FBD (Federated Functional Block Diversification), a modular federated architecture that decomposes a ResNet backbone into six functional blocks (the stem, four residual groups, and the classification head) and maintains a warehouse of N color variants, each assembled from independently tracked and contributor-stamped blocks. Fed-FBD provides three capabilities absent in FedAvg: (i) architecturally guaranteed block-level isolation, so that an adversarial or mislabelled client cannot contaminate the clean colous; (ii) privacy-by-design, where membership inference advantage is already indistinguishable from chance before any privacy mechanism is applied; and (iii) surgical machine unlearning of a departed participant's contribution at sub-second cost and without retraining. Experiments on six MedMNIST-2D datasets, PathMNIST at 224x224, and CIFAR-10 show that Fed-FBD trades a modest 0.3%-3.1% IID accuracy gap on the adequately sized datasets for these guarantees, remains within 0.8%-4.0% of FedAvg at Dirichlet alpha=1.0 on three of four datasets, and confines all six adversarial attacks we study to the poisoned client's own blocks with at most +/-0.01 AUC drift on the clean colors.

2606.12673 2026-06-12 cs.LG cs.AI 新提交

A Zero-shot Generalized Graph Anomaly Detection Framework via Node Reconstruction

基于节点重构的零样本广义图异常检测框架

Phan Nguyen, Dat Cao, Hien Chu, Khue Hoang

发表机构 * School of Computing, KAIST(韩国科学技术院计算机学院)

AI总结 提出AlignGAD框架,通过全局统一模块对齐异构特征、聚类模块捕获组级异常模式及节点差异评分模块聚合多视图异常证据,实现零样本跨域图异常检测。

详情
AI中文摘要

跨域图异常检测旨在识别未见过的目标图中的异常节点,在异构图数据的实际应用中展现出巨大潜力。然而,现有方法通常依赖于数据集特定的特征语义和结构模式,限制了其跨域泛化能力。为解决这一挑战,我们提出AlignGAD,一个零样本广义图异常检测框架。我们的框架基于三个关键组件:全局统一模块,用于对齐异构节点特征并在谱域中归一化图信号;聚类模块,用于构建聚类感知的图视图以捕获组级异常模式;以及节点差异评分模块,用于测量重构差异并聚合来自不同图视图的异常证据。在多个真实数据集上的实验证明了AlignGAD在零样本图异常检测设置下的有效性。

英文摘要

Cross-domain graph anomaly detection (GAD) aims to identify abnormal nodes in unseen target graphs, showing strong potential in real-world applications with heterogeneous graph data. However, existing methods often depend on dataset-specific feature semantics and structural patterns, which limits their ability to generalize across different domains. To address this challenge, we propose AlignGAD, a zero-shot generalized graph anomaly detection framework. Our framework is built upon three key components: a Global Unification Module that aligns heterogeneous node features and normalizes graph signals in the spectral domain; a Clustering Module that constructs cluster-aware graph views to capture group-level abnormal patterns; and a Node Discrepancy Scoring Module that measures reconstruction discrepancy and aggregates anomaly evidence from different graph views. Experiments on multiple real-world datasets demonstrate the effectiveness of AlignGAD under the zero-shot GAD setting.

2606.12662 2026-06-12 cs.SD cs.AI cs.LG 新提交

BASENet: Band-Adapted Speech Enhancement Network with Cross-Band Attention

BASENet: 基于频带自适应的跨频带注意力语音增强网络

Damien Martins Gomes, François Capman

发表机构 * Thales SIX GTS, FRANCE(泰雷兹SIX GTS公司,法国)

AI总结 提出BASENet,通过Bark尺度划分频带并分配自适应容量编码器,结合跨频带注意力模块,以最少参数实现高PESQ和STOI,适用于资源受限设备。

详情
AI中文摘要

语音增强模型通常对所有频率采用统一容量,忽略了人类听觉的非均匀频谱分辨率。我们提出BASENet,一种频率自适应架构,将频谱划分为Bark尺度频带,并为每个频带分配基于临界频带密度的缩放容量编码器,自动为感知密集的低频分配更深的分支,为高频分配更轻的分支。跨频带注意力模块通过紧凑的频率池化表示以线性复杂度捕获跨频带的谐波依赖性。基于具有密集连接的倒残差块和卷积循环网络,BASENet在VoiceBank+DEMAND上以仅0.83M参数和7.3 G MACs达到3.55 PESQ和STOI~96%,是所有PESQ > 3.50方法中参数最少的。因果变体(3.44 PESQ)超过了几种非因果基线,证实了其在资源受限设备上实时流传输的适用性。

英文摘要

Speech enhancement models typically apply uniform capacity across all frequencies, disregarding the non-uniform spectral resolution of human hearing. We propose BASENet, a frequency-adapted architecture that partitions the spectrum into Bark-scale bands and assigns each a scaled-capacity encoder derived from critical-band density, automatically granting deeper branches to perceptually dense low frequencies and lighter ones to high frequencies. A cross-band attention module captures harmonic dependencies across bands through compact frequency-pooled representations at linear complexity. Built on inverted residual blocks with dense connectivity and a convolutional recurrent network, BASENet achieves 3.55 PESQ and STOI~96% on VoiceBank+DEMAND with only 0.83M parameters and 7.3 G~MACs, the fewest parameters among all methods with PESQ > 3.50. A causal variant (3.44 PESQ) surpasses several non-causal baselines, confirming suitability for real-time streaming on resource-constrained devices.

2606.12657 2026-06-12 cs.AI cs.DB cs.RO 新提交

TrajGenAgent: A Hierarchical LLM Agent for Human Mobility Trajectory Generation

TrajGenAgent: 一种用于人类移动轨迹生成的分层LLM智能体

Siyu Li, Toan Tran, Lingyi Zhao, Khurram Shafique, Li Xiong

发表机构 * Emory University(埃默里大学) University of Florida(佛罗里达大学)

AI总结 提出TrajGenAgent,一种无需微调的分层LLM智能体框架,通过编排器-工作者两阶段设计生成真实轨迹,在时空保真度、语义一致性和个体行为真实性上优于现有方法。

详情
Comments
14 pages, 2 figures, 8 tables. Accepted by the 27th IEEE International Conference on Mobile Data Management (MDM 2026)
AI中文摘要

人类移动数据对于交通、城市规划和流行病控制至关重要,但大规模轨迹收集通常成本高昂且受隐私限制,这推动了逼真的合成轨迹生成。现有的基于LLM的生成器通常依赖于提示工程(保留了零样本推理但缺乏细粒度的时空基础)或轨迹级微调(提高了统计精度但产生了大量计算成本并可能削弱一般推理)。我们提出了TrajGenAgent,一种语义感知的分层LLM智能体框架,用于无需模型微调的人类移动轨迹生成。TrajGenAgent采用两阶段编排器-工作者设计:LLM首先通过上下文学习从历史证据中合成个体和星期条件化的活动链,然后确定性工作流通过个性化POI检索、距离感知位置选择、运动学感知旅行时间传播和基于LLM的持续时间估计将每个活动落地为完整的访问。为了评估超越聚合时空统计的真实性,我们引入了一个基于异常检测的评估框架,使用两个互补检测器来评估行为和语义合理性。在基准和大规模模拟数据集上的实验表明,与代表性的神经网络和基于LLM的基线相比,TrajGenAgent在时空保真度、语义一致性和个体特定行为真实性方面有所改进,同时避免了参数更新。

英文摘要

Human mobility data is important for transportation, urban planning, and epidemic control, but large-scale trajectory collection is often costly and privacy-constrained, motivating realistic synthetic trajectory generation. Existing LLM-based generators typically rely on either prompt engineering, which preserves zero-shot reasoning but lacks fine-grained spatiotemporal grounding, or trajectory-level fine-tuning, which improves statistical precision but incurs substantial computational cost and may weaken general reasoning. We propose TrajGenAgent, a semantic-aware hierarchical LLM-agent framework for human mobility trajectory generation without model fine-tuning. TrajGenAgent uses a two-stage orchestrator-worker design: an LLM first synthesizes an individual- and weekday-conditioned activity chain from historical evidence via in-context learning, and a deterministic workflow then grounds each activity into a complete visit using personalized POI retrieval, distance-aware location selection, kinematics-aware travel-time propagation, and LLM-based duration estimation. To evaluate realism beyond aggregate spatiotemporal statistics, we introduce an anomaly-detection-based evaluation framework using two complementary detectors to assess behavioral and semantic plausibility. Experiments on benchmark and large-scale simulation datasets show that TrajGenAgent improves spatiotemporal fidelity, semantic coherence, and individual-specific behavioral realism over representative neural and LLM-based baselines, while avoiding parameter updates.

2606.12649 2026-06-12 cs.CL 新提交

MentalMARBERT: Domain-Adaptive Pre-training and Two-Stage Fine-Tuning for Arabic Mental Health Disorders Detection

MentalMARBERT:面向阿拉伯语心理健康障碍检测的领域自适应预训练与两阶段微调

Fatimah Almalki, Areej Alhothali, Lulwah Alharigy, Abdulrahman Aladeem

发表机构 * King Abdulaziz University(阿卜杜勒阿齐兹国王大学)

AI总结 针对阿拉伯语社交媒体文本中心理健康障碍检测的方言差异、非正式语言、标注资源有限和类别不平衡问题,提出领域自适应预训练与两阶段微调框架,构建含5万条推文的数据集,MentalMARBERT在宏F1和准确率上分别达到0.861和0.877。

详情
Comments
17 pages, 5 figures, 13 tables
AI中文摘要

从阿拉伯语社交媒体文本中检测心理健康障碍仍然具有挑战性,原因包括方言差异、非正式语言、高质量标注资源有限以及严重的类别不平衡。虽然英语心理健康自然语言处理(NLP)已取得显著进展,但阿拉伯语多类别障碍分类的研究仍不充分。本研究提出一个两阶段框架用于阿拉伯语心理健康文本分类。在第一阶段,三个阿拉伯语预训练语言模型AraBERT、CAMeLBERT和MARBERT,使用大规模未标注阿拉伯语心理健康推文语料库进行领域自适应和任务自适应预训练(DAPT和TAPT)。在统一协议下评估自适应模型,以确定最有效的骨干模型。在第二阶段,选定的模型在四种配置下进行评估,这些配置结合了单阶段和分层两阶段分类架构,并采用全微调和低秩适应(LoRA)。为支持本研究,我们构建了一个新的标注阿拉伯语心理健康数据集,包含50,670条推文,涵盖六个类别,具有强标注者间一致性(Krippendorff's Alpha = 0.733,平均成对一致性 = 0.797)。实验结果表明,领域自适应的MARBERT(MentalMARBERT)在准确率和宏F1上均比基线模型有统计显著的提升。结合全微调的分层两阶段架构取得了最佳整体性能,宏F1达到0.861,准确率达到0.877。这些发现证明了领域特定自适应预训练和分层分类在阿拉伯语心理健康障碍检测中的有效性。

英文摘要

Detecting mental health disorders from Arabic social media text remains challenging due to dialectal variation, informal language, limited high-quality annotated resources, and severe class imbalance. While English mental health natural language processing (NLP) has progressed substantially, Arabic multi-class disorder classification remains insufficiently studied. This study proposes a two-phase framework for Arabic mental health text classification. In phase 1, three Arabic pre-trained language models, AraBERT, CAMeLBERT, and MARBERT, undergo Domain-Adaptive and Task-Adaptive Pretraining (DAPT and TAPT) using a large-scale corpus of unlabeled Arabic mental health tweets. The adapted models are evaluated under a unified protocol to identify the most effective backbone model. In phase 2, the selected model is assessed across four configurations combining single-stage and hierarchical two-stage classification architectures with full fine-tuning and Low-Rank Adaptation (LoRA). To support this study, we constructed a novel annotated Arabic mental health dataset comprising 50,670 tweets across six categories, with strong inter annotator agreement (Krippendorff's Alpha = 0.733, average pairwise agreement = 0.797). Experimental results show that the domain-adapted MARBERT (MentalMARBERT) achieves statistically significant improvements over baseline models in both accuracy and macro-F1. The hierarchical two-stage architecture combined with full fine-tuning achieves the best overall performance, reaching a macro-F1 of 0.861 and an accuracy of 0.877. These findings demonstrate the effectiveness of domain-specific adaptive pretraining and hierarchical classification for Arabic mental health disorder detection.

2606.12643 2026-06-12 cs.LG 新提交

TEDD: Robust Detection of Unstable Temporal Features

TEDD:不稳定时间特征的鲁棒检测

Ricardo Ribeiro Pereira, Bruno Casal Laraña, Nádia Soares, Miguel Araújo

发表机构 * Feedzai

AI总结 提出TEDD方法,利用回归模型检测导致时间分布变化的特征,无需参数调优,可扩展,能检测数值和类别特征的单变量及多变量漂移。

详情
Comments
8 pages, 9 figures
AI中文摘要

在处理真实世界的时间序列数据时,经常会遇到特征分布随时间变化的情况。在这种不稳定的数据上直接使用机器学习模型可能导致性能迅速下降,尤其是当新分布与训练时所见差异较大时。为了解决这个问题,自动识别随时间变化的特征至关重要。检测到这些特征后,数据科学家和其他从业者能够通过应用数据变换等方式缓解问题,部署更鲁棒的模型,使其在更长时间内保持高性能。本文描述了特征不应遭受的时间变化类型,并提出了TEDD技术,用于a) 识别数据集何时可能导致不稳定的机器学习模型,以及b) 自动检测哪些特征导致了这种不鲁棒性。为此,我们利用回归模型来突出哪些特征有助于良好预测实例的时间戳。我们将我们的方法与其他方法在真实和合成数据上进行比较,测试它们在所有简单变化模式上的检测能力。我们表明,我们的方法:检测所有类型的基本变化,包括数值和类别特征;能够检测多变量漂移;返回一个可比较的值来衡量每个特征的变化量;无需参数调优;并且在数据集的特征数量和实例数量上都具有可扩展性。

英文摘要

When working with real-world temporal data, it is common to encounter features whose distribution is changing over time. The naive employment of Machine Learning models on this unstable data might lead to rapidly degrading performance, especially if the new distribution is much different from what was previously seen during training. In order to cope with this problem, it is critical to automatically identify features that are changing over time. With these features detected, data scientists and other practitioners will be able to mitigate the issue (for instance, by applying data transformations), deploying more robust models that retain high performance for longer periods of time. In this paper, we describe which temporal changes a feature should not suffer from, and propose TEDD, a technique to a) identify when a dataset might lead to an unstable Machine Learning model and b) automatically detect which features cause such lack of robustness. In order to achieve it, we leverage a regression model to highlight which features contribute to a good prediction of an instance's timestamp. We compare our approach to other methods in real and synthetic data, testing their detection capability on all simple change patterns. We show that our method: detects all types of basic changes, both for numerical and categorical features; can detect multivariate drifts; returns a comparable value measuring the amount of change of each feature; requires no parameter tuning; and is scalable both on number of features and instances of the dataset.