GroupToM-Bench: Benchmarking Group Theory of Mind and Nonlinear Social Emergence in MLLMs
GroupToM-Bench: 多模态大语言模型中群体心智理论和非线性社会涌现的基准测试
AI总结 针对多模态大语言模型在群体心智理论推理上的不足,提出GroupToM-Bench基准,通过七级认知审计框架评估模型从微观BDI状态到宏观结果预测的因果链,揭示模型在处理社会结构和非线性集体动态上的缺陷。
GroupToM-Bench: 多模态大语言模型中群体心智理论和非线性社会涌现的基准测试
Weidong Tang, Jierui Li, Yueling Hou, Zihan Mei, Can Zhang, Xinyan Wan, Zhiyuan Liang, Pengfei Zhou, Yang You, Wangbo Zhao
AI总结 针对多模态大语言模型在群体心智理论推理上的不足,提出GroupToM-Bench基准,通过七级认知审计框架评估模型从微观BDI状态到宏观结果预测的因果链,揭示模型在处理社会结构和非线性集体动态上的缺陷。
真正的通用智能不仅需要物理世界模型,还需要社会世界模型:即推断个体心理状态如何相互作用并结晶为群体层面结果的能力。尽管在个体层面的心智理论推理方面取得了显著进展,现有的多模态大语言模型在这一更广泛的任务上仍然失败。集体行为从社会张力、从众动态和结构约束中非线性地涌现,这意味着它不能通过简单地对个体意图求和来恢复。我们提出了GroupToM-Bench,第一个针对群体层面心智理论的多模态基准,围绕一个跨越微观层面BDI状态(信念、欲望、意图)、中观层面群体张力和结构约束以及宏观层面结果预测和机制归因的因果链构建。为了探测这一完整弧线,我们开发了一个七级认知审计框架。实验揭示了当前模型与人类基线之间的差距,突出了模型在处理社会结构和非线性集体动态方面的失败。
True general intelligence requires not only a model of the physical world but also a social world model: the capacity to infer how individual mental states interact and crystallize into group-level outcomes. Despite notable progress in individual-level Theory of Mind (ToM) reasoning, existing multimodal large language models fail at this broader task. Collective behavior emerges non-linearly from social tensions, conformity dynamics, and structural constraints, meaning it cannot be recovered by merely summing individual intentions. We present GroupToM-Bench, the first multimodal benchmark for group-level ToM, built around a causal chain spanning micro-level BDI states (belief, desire, intention), meso-level group tension and structural constraints, and macro-level outcome prediction and mechanistic attribution. To probe this full arc, we develop a seven-level cognitive audit framework. Experiments reveal a gap between current models and human baselines, highlighting a failure to process social structures and non-linear collective dynamics.
强化学习中的精确遗忘
Thanh Nguyen-Tang, Raman Arora
AI总结 本文提出强化学习中的精确遗忘问题,通过ρ-TV稳定算法实现数据删除后输出与从未学习该数据时不可区分,并给出近乎最优的遗憾界。
我们提出了强化学习中的精确遗忘问题,目标是设计一个高效框架,使得在收到删除请求后能够移除任何用户的数据,即遗忘后在线学习者的输出与从未与学习者交互过的用户所产生的结果不可区分。对于任意 $ρ>0$,我们证明存在一个 $ρ$-TV 稳定的强化学习算法,支持精确遗忘过程,其期望计算成本仅为从头重新训练计算成本的 $ρ\sqrt{\ln T}$ 分之一。我们为表格型马尔可夫决策过程构造了这样一个 $ρ$-TV 稳定的强化学习算法,其遗憾界为 $\mathcal{O}(H^2 \sqrt{SAT} + H^3 S^2 A + {H^{2.5} S^2 A}/ρ)$,其中 $S, A, H, T$ 分别表示状态数、动作数、回合长度和回合数。我们还为 $ρ$-TV 稳定的强化学习算法建立了 $\Omega(H\sqrt{\!SAT}\! +\! {SAH}/ρ)$ 的下界,表明我们的算法几乎是极小化最优的。
We formulate the problem of \emph{exact unlearning} in reinforcement learning, where the goal is to design an efficient framework that enables the removal of any user's data upon deletion request, i.e., the online learner's output after unlearning is \emph{indistinguishable} from what would have been produced had the deleted user never interacted with the learner. For any $ρ>0$, we show that there exists a reinforcement learning (RL) algorithm that is $ρ$-TV-stable and supports an exact unlearning procedure whose expected computational cost is only a $ρ\sqrt{\ln T}$ fraction of the computational cost of retraining from scratch. We construct such a $ρ$-TV-stable RL algorithm for tabular Markov decision processes (MDPs), which achieves a regret bound of $\mathcal{O}(H^2 \sqrt{SAT} + H^3 S^2 A + {H^{2.5} S^2 A}/ρ)$, where $S, A, H$, and $T$ denote the number of states, the number of actions, the episode horizon, and the number of episodes, respectively. We also establish a lower bound of $Ω(H\sqrt{\!SAT}\! +\! {SAH}/ρ)$ for $ρ$-TV-stable RL algorithms, showing that our algorithm is nearly minimax optimal.
KODA: 视觉-语言基础模型的对比表示比较与对齐
Youqi Wu, Mohammad Jalali, Farzan Farnia
AI总结 提出KODA框架,通过核优化方法对比分析视觉-语言基础模型的表示差异,并识别弱聚类与强聚类的样本子集,实现表示对齐。
视觉-语言基础模型(如CLIP和SigLIP)为多模态学习系统提供了广泛使用的表示。虽然这些模型通常通过下游性能进行比较,但这种评估往往不能解释它们的表示在结构上如何不同。在本文中,我们通过对比嵌入聚类任务研究这一问题:识别在一个表示下弱聚类但在另一个表示下强聚类的样本子集。我们提出了\emph{核优化差异分析(KODA)},一个基于核的对比表示比较与对齐框架。KODA通过模态核组合构建统一的多模态核,并将差异发现形式化为一个约束优化问题,该问题在一个表示中搜索一致结构,同时抑制参考表示中的一致性。这产生了与特定样本子集和模态交互相关的可解释差异方向。为了将KODA扩展到大型视觉-语言数据集,我们开发了使用随机投影的联合核随机低维近似,包括用于平移不变核的随机傅里叶特征。实验上,KODA在视觉-语言表示中识别出一致且可解释的差异结构,并为表示对齐提供了样本子集。代码可在https://github.com/yokiwuuu/KODA获取。
Vision-language foundation models such as CLIP and SigLIP provide widely used representations for multimodal learning systems. While these models are typically compared through downstream performance, such evaluations often do not explain how their representations differ structurally. In this work, we study this problem through the task of Contrastive Embedding Clustering: identifying sample subsets that are weakly clustered under one representation but strongly clustered under another. We propose \emph{Kernel Optimization for Discrepancy Analysis (KODA)}, a kernel-based framework for contrastive representation comparison and alignment. KODA constructs unified multimodal kernels through modality-wise kernel composition and formulates discrepancy discovery as a constrained optimization problem that searches for coherent structures in one representation while suppressing coherence in a reference representation. This yields interpretable discrepancy directions associated with specific sample subsets and modality interactions. To scale KODA to large vision-language datasets, we develop randomized low-dimensional approximations of joint kernels using random projections, including Random Fourier Features for shift-invariant kernels. Empirically, KODA identifies consistent and interpretable discrepancy structures across vision-language representations and provides sample subsets for representation alignment. The code is available at https://github.com/yokiwuuu/KODA.
跨领域与模型的人工智能生成文本检测中语言特征的系统分析
Yassir El Attar, Esra Dönmez, Maximilian Maurer, Agnieszka Falenska
AI总结 通过大规模实证研究,系统评估284个可解释语言特征在27个LLM和10个文本领域中的鲁棒性,发现词汇丰富度是跨模型和领域的最可靠信号。
可解释的语言特征为解释给定文本为何看似机器生成提供了一种有前景的方法,尤其对于非专业用户。然而,关于哪些特征可靠地指示LLM生成文本的现有发现仍然分散在不同的特征集、模型和文本领域中。为解决这一差距,我们进行了一项大规模实证研究,评估语言信号在表征AI生成文本方面的鲁棒性。我们的分析涵盖了来自27个LLM和十个文本领域的输出中的284个可解释语言特征,并在跨模型和跨领域泛化设置下进行。我们表明,仅基于语言特征的分类器可以可靠地区分AI生成文本和人类撰写文本。然而,许多先前提出的指标被证明高度依赖上下文,但词汇丰富度指标除外,这些指标在模型家族和文本领域中保持鲁棒信号。这些结果展示了哪些语言信号在上下文中泛化,并为更可靠、可解释的AI生成语言分析提供了基础。
Interpretable linguistic features offer a promising approach for explaining why a given text appears machine-generated, particularly for non-expert users. However, existing findings on which features reliably indicate LLM-generated text remain fragmented across feature sets, models, and text domains. To address this gap, we conduct a large-scale empirical study assessing the robustness of linguistic signals for characterizing AI-generated text. Our analysis covers 284 interpretable linguistic features across outputs from 27 LLMs and ten text domains under cross-model and cross-domain generalization settings. We show that classifiers based solely on linguistic features can reliably distinguish AI-generated from human-written text. However, many previously proposed indicators prove strongly context-dependent, with the exception of measures of lexical richness, which remain robust signals across model families and text domains. These results demonstrate which linguistic signals generalize across contexts and provide a foundation for more reliable, interpretable analyses of AI-generated language.
低秩分布矩阵补全
Jiayi Wang, Raymond K. W. Wong
AI总结 针对每个条目为概率分布的矩阵,提出基于核均值嵌入和Tucker秩的低秩结构,通过函数展开算子连接无限维与有限维,实现分布矩阵补全并给出非渐近误差界。
我们研究了矩阵补全问题的分布推广,其中目标矩阵的每个条目是概率分布而非标量。在此设置中,仅观察到矩阵条目的一个子集,即使对于观察到的条目,底层分布也无法直接获取;相反,我们观察到从这些分布中抽取的有限样本。为了表示分布条目,我们采用核均值嵌入,并引入分布值矩阵的Tucker秩概念以捕捉其低秩结构。核嵌入的无限维性质带来了重大的方法论挑战。为解决此问题,我们引入了函数展开算子,将所提出的分布低秩结构与有限维张量的经典Tucker秩联系起来。基于此框架,我们提出了一种用于分布矩阵补全的新估计器。我们建立了非渐近误差界,刻画了估计器的统计性能。在合成数据和真实世界应用上的大量实验证明了所提方法的有效性。
We study a distributional generalization of the matrix completion problem in which each entry of the target matrix is a probability distribution rather than a scalar. In this setting, only a subset of matrix entries is observed, and even for observed entries, the underlying distributions are not directly accessible; instead, we observe finitely many samples drawn from them. To represent distributional entries, we employ kernel mean embeddings and introduce a notion of Tucker rank for distribution-valued matrices to capture their low-rank structure. The infinite-dimensional nature of kernel embeddings poses significant methodological challenges. To address this, we introduce functional unfolding operators that link the proposed distributional low-rank structure to the classical Tucker rank for finite-dimensional tensors. Based on this framework, we propose a novel estimator for distributional matrix completion. We establish non-asymptotic error bounds that characterize the statistical performance of the estimator. Extensive experiments on synthetic data and a real-world application demonstrate the effectiveness of the proposed method.
Affordance2Action: 任务条件下的场景级功能区域定位用于实时操作
Litao Liu, Yifan Han, Pengfei Yi, Wenbo Yu, Hanqing Wang, Haoran Du, Enze Yuan, Zilin Yuan, Ruiding Feng, Michael Liu, Qi Zhang, Jingjin Yu
AI总结 提出Affordance2Action框架,通过构建A2A-Bench基准和A2A-AffordGen标注流程,解决场景级任务条件功能区域定位中的多区域对应问题,并支持实时操作。
任务条件操作需要将指令定位到与任务相关的功能部件,而非物体类别。这种设置依赖于场景,并且在杂乱场景中通常是一对多的:同一物体在不同任务中可能提供不同的交互,而单个任务可能对应一个功能区域或多个有效功能区域,具体取决于场景布局。现有的功能区域数据集和基准与此设置不一致,因为它们通常侧重于抓取或物体级功能区域,依赖合成场景,或假设单一的指令-区域对应关系。我们提出了Affordance2Action (A2A),一个以基准为中心的学习框架,用于场景级、任务条件的功能区域定位。其核心是A2A-Bench,一个面向操作的基准,涵盖了日常场景中的单区域和多区域指令对应关系,其中多区域对应关系突显了现实多物体环境中功能区域定位的模糊性和多样性。为了大规模构建该基准,我们构建了A2A-AffordGen,一个代理辅助的标注流程,结合了语言模型过滤、交互式部件分割、实例级遮罩细化、任务推理指令生成和人工验证。A2A-Bench的监督进一步支持多种下游应用,其中实时功能区域定位和功能区域条件操作策略是两个代表性示例。实验表明,A2A暴露了通用分割、基于VLM的定位和功能区域蒸馏基线中的显著差距,同时改进了任务级定位并为下游操作提供了有用的空间先验。所有数据集和代码将公开发布,以促进开放研究。
Task-conditioned manipulation requires grounding instructions to task-relevant functional parts rather than object categories. This setting is scene-dependent and often one-to-many in cluttered scenes: the same object may afford different interactions across tasks, while a single task may correspond to either one functional region or multiple valid functional regions, depending on the scene layout. Existing affordance datasets and benchmarks remain misaligned with this setting, as they typically focus on grasping or object-level affordances, rely on synthetic scenes, or assume a single instruction-region correspondence. We present Affordance2Action (A2A), a benchmark-centered learning framework for scene-level, task-conditioned part affordance grounding. At its core is A2A-Bench, a manipulation-oriented benchmark that covers both single-region and multi-region instruction correspondences in everyday scenes, with the latter highlighting the ambiguity and diversity of affordance grounding in realistic multi-object environments. To construct it at scale, we build A2A-AffordGen, an agent-assisted annotation pipeline that combines language-model filtering, interactive part segmentation, instance-level mask-out refinement, task-reasoning instruction generation, and human verification. A2A-Bench's supervision further supports diverse downstream applications, with real-time affordance grounding and affordance-conditioned manipulation policies as two representative examples. Experiments show that A2A exposes substantial gaps in generic segmentation, VLM-based grounding, and affordance distillation baselines, while improving task-level localization and providing useful spatial priors for downstream manipulation. All datasets and code will be publicly released to promote open research.
MimeLens: 二进制片段的位置无关内容类型检测
Michael J. Bommarito
AI总结 针对现有文件类型分类系统(如Magika)无法处理无头片段、随机磁盘块等非完整文件输入的问题,提出MimeLens,一种基于BERT的小型编码器家族,通过随机偏移采样训练实现位置无关的二进制内容分类,在libmagic标记数据上top-1准确率比Magika v1.1高10.7个百分点,并能从单个UDP数据包或随机磁盘块中分类。
文件类型分类是恶意软件分类、取证雕刻、数据包检查和存储索引等工作流程的基础。像Google的Magika这样的学习系统假设在已知偏移处访问整个文件,因此它们无法处理这些任务实际产生的许多输入,例如单个数据包负载、无头的雕刻片段、随机磁盘块或分块上传。我们引入了MimeLens,这是一个小型BERT风格编码器家族,在从每个文件内均匀随机偏移处采样的窗口的二进制内容上进行预训练,没有特权文件头位置,有标准上下文和短上下文变体。一个字节块来自文件中的任何位置,无需头部且无固定大小;输出是libmagic的125个MIME标签之一。在完整文件的干净头部上,MimeLens在libmagic标记数据上的top-1准确率比Magika v1.1高10.7个百分点,并且在Magika无法分类的地方(例如单个中间流UDP数据包)仍然能分类,在随机中间文件磁盘块上的准确率是libmagic和Magika的两倍以上。代价是延迟:在CPU上,MimeLens每个样本的运行速度大约比Magika慢一到两个数量级,但在消费级GPU或批处理中与之相当。所有训练好的检查点已在Hugging Face上发布(mjbommar/mimelens-001-*)。
File-type classification underlies many workflows like malware triage, forensic carving, packet inspection, and storage indexing. Learned systems such as Google's Magika assume whole-file access at a known offset, so they break on the inputs many of these tasks actually produce, like a single packet payload, a header-less carved fragment, a random disk block, or a chunked upload. We introduce MimeLens, a family of small BERT-style encoders pretrained on binary content from windows sampled at a uniformly random offset within each file, with no privileged head-of-file position, in standard- and short-context variants. A byte chunk goes in from anywhere in a file, no header needed and no fixed size; out comes one of libmagic's 125 MIME labels. On the clean head of complete files, MimeLens beats Magika v1.1 by +10.7 pp top-1 on libmagic-labeled data, and it keeps classifying where Magika cannot: from a single mid-stream UDP packet, and more than twice as accurately as libmagic and Magika on random mid-file disk blocks. The cost is latency: MimeLens runs roughly one to two orders of magnitude slower per sample on CPU than Magika, though it matches on consumer GPUs or in batch. All trained checkpoints are released on Hugging Face (mjbommar/mimelens-001-*).
当自回归一致性损害安全对齐
Bochen Lyu, Yiyang Jia, Xiaohao Cai, Zhanxing Zhu
AI总结 本文通过分析自回归一致性机制,揭示了大语言模型安全对齐的浅层性,并提出随机插入攻击和对抗性安全对齐方法。
大语言模型(LLMs)的安全对齐是脆弱的,部分原因在于它通常是浅层的:微调主要重塑模型在最初几个输出标记附近的行为。我们认为,这种现象可以通过自回归一致性来理解,即下一个标记预测倾向于一致地保持和扩展当前响应轨迹。通过分析安全对齐的学习动态,我们表明自回归一致性可以将对齐更新集中在早期标记上,为浅层安全对齐提供机制解释。同样的机制还预测了一类更广泛的LLM攻击:在输出轨迹的任意位置诱导有害延续状态的攻击。作为一个具体例子,我们引入了随机插入攻击,该攻击将一个短的有害片段插入原本安全的拒绝轨迹中,并利用自回归一致性维持由此产生的有害分支,从而绕过安全对齐。值得注意的是,即使在一个长的拒绝前缀之后,一个短的有害片段也能将生成重定向为有害,这突显了自回归一致性作为一个潜在的更广泛失败机制。这表明安全对齐还应该在整个输出轨迹中打破有害的自回归一致性。因此,我们提出了对抗性安全对齐,一个基于最坏情况有害延续状态的初始框架,并通过随机最坏插入训练实例化它。总体而言,我们的结果表明,自回归一致性应被视为安全对齐和攻击设计中的核心考虑因素。
Safety alignment in large language models (LLMs) is fragile in part because it is often shallow: fine-tuning mainly reshapes the model's behavior near the first few output tokens. We argue that this phenomenon can be understood through autoregressive consistency, the tendency of next-token prediction to preserve and extend the current response trajectory consistently. By analyzing the learning dynamics of safety alignment, we show that autoregressive consistency can concentrate alignment updates on early tokens, offering a mechanistic explanation for shallow safety alignment. The same mechanism also predicts a broader class of attacks on LLMs: attacks that induce harmful continuation states at arbitrary positions in the output trajectory. As a concrete example, we introduce random insertion attack, which inserts a short harmful span into an otherwise safe refusal trajectory and exploits autoregressive consistency to sustain the resulting harmful branch, thereby bypassing safety alignment. Notably, a short harmful span can redirect the generation to be harmful even after a long refusal prefix, highlighting autoregressive consistency as a potential broader failure mechanism. This suggests that safety alignment should also break harmful autoregressive consistency throughout the output trajectory. We therefore propose adversarial safety alignment, an initial framework based on worst-case harmful continuation states, and instantiate it with random worst-insertion training. Overall, our results suggest that autoregressive consistency should be treated as a central consideration in both safety alignment and attack design.
无神经元的智能交通——基于表格强化学习的公平地铁网络扩展
Dimitris Michailidis, Sennay Ghebreab, Fernando P. Santos
AI总结 针对地铁网络扩展问题,提出将非马尔可夫奖励决策过程与表格强化学习相结合的方法,在保证性能的同时大幅降低训练轮次和碳排放,并融入社会公平性指标。
我们解决了地铁网络扩展问题(MNEP),这是交通网络设计问题(TNDP)的一个子集,专注于扩展地铁系统以满足出行需求。传统方法依赖于精确和启发式方法,需要专家定义的约束来缩小搜索空间。最近,深度强化学习(Deep RL)因其在复杂序列决策过程中的有效性而出现,但它仍然计算成本高、环境成本高,并且需要额外的工程来解释。我们表明,MNEP问题规模足够小,不需要深度强化学习方法。将MNEP重新表述为非马尔可夫奖励决策过程(NMRDP),我们使用表格强化学习以显著更少的训练轮次实现类似的性能,此外还提供了更高的可解释性。此外,我们将社会公平标准纳入奖励函数,侧重于效率和公平性,突出了我们方法的多功能性。在现实场景中——西安和阿姆斯特丹——我们的方法平均将总轮次减少了18倍,总碳排放减少了12倍,同时与深度强化学习保持竞争力。这种方法提供了一种可复制、模块化、可解释且资源高效的解决方案,并具有应用于其他组合优化问题的潜力。
We tackle the Metro Network Expansion Problem (MNEP), a subset of the Transport Network Design Problem (TNDP), which focuses on expanding metro systems to satisfy travel demand. Traditional methods rely on exact and heuristic approaches that require expert-defined constraints to reduce the search space. Recently, deep reinforcement learning (Deep RL) has emerged due to its effectiveness in complex sequential decision-making processes-it remains, however, computationally expensive, environmentally costly, and requires additional engineering to interpret. We show that MNEP problems are small enough to not require Deep RL methods. Reformulating the MNEP as a Non-Markovian Rewards Decision Process (NMRDP), we use tabular RL to achieve similar performance with significantly fewer training episodes, additionally offering greater interpretability. Additionally, we incorporate social equity criteria into the reward functions, focusing on efficiency and fairness, highlighting the versatility of our method. Evaluated in real-world settings-Xi'an and Amsterdam-our method reduces total episodes by a factor of 18 and total carbon emissions by a factor of 12 on average, while remaining competitive with Deep RL. This approach offers a replicable, modular, interpretable, and resource-efficient solution with potential applications to other combinatorial optimization problems.
端到端文本行检测与排序
Benjamin Kiessling
AI总结 提出Orli模型,将文本行检测与阅读顺序排序统一为图像到序列问题,通过自回归生成基线实现端到端处理,在多种历史文档上达到先进性能。
实际的历史文档文本识别流程通常将布局分析分解为行检测和单独的阅读顺序步骤,后者通常由手工编码的几何启发式方法处理,但难以应对旁注、多列、表格和特定来源的编辑惯例。本文介绍了Orli(行的有序回归),一个端到端模型,将两个子任务视为单一的图像到序列问题:从页面图像中,Orli以自回归方式直接按阅读顺序生成文本行基线。基线采用弦框架参数化表示,该参数化锚定行的位置、方向和范围,同时通过垂直偏移编码局部几何;迭代细化头和局部视觉细化器生成最终曲线。在涵盖十种书写系统的196,691页异构语料库上训练,Orli在没有数据集特定训练的情况下,略微超过了之前报道的cBAD行检测的最先进水平,在多个阅读顺序基准测试中零样本达到近乎完美的覆盖率和排序,并通过有限的微调适应更专业的域外布局。该方法的源代码和模型权重在开放许可下可从https://github.com/mittagessen/orli获取。
Practical text-recognition pipelines for historical documents typically decompose layout analysis into line detection followed by a separate reading-order step, with the latter most often handled by a hand-coded geometric heuristic that struggles with marginalia, multiple columns, tables, and source-specific editorial conventions. This article introduces Orli (Ordered Regression of Lines), an end-to-end model that casts both sub-tasks as a single image-to-sequence problem: from a page image, Orli autoregressively generates text-line baselines directly in reading order. Baselines are represented in a chord-frame parameterization that anchors a line's position, orientation, and extent while encoding local geometry through perpendicular offsets; an iterative refinement head and a local visual refiner produce the final curve. Trained on a heterogeneous corpus of 196,691 pages spanning ten writing systems, Orli marginally exceeds the previously reported state of the art for cBAD line detection without dataset-specific training, reaches near perfect coverage and ordering on multiple reading-order benchmarks zero-shot, and adapts to more specialized out-of-domain layouts with limited fine-tuning. The method's source code and model weights are available under an open license at https://github.com/mittagessen/orli.
CaloTrilogy:迈向现代量热器一步式端到端物理引导簇射生成的突破
Cheng Jiang, Sitian Qian, Kevin Pedro, Oz Amram, Huilin Qu, Maggie Voetberg
AI总结 提出一种结合平均速度场积分器、学习生成先验和物理引导损失项的框架,实现一步或少量评估步骤的高质量簇射生成,性能与最先进的流和扩散模型相当。
当前和未来对撞机的高精度量热器模拟对计算资源的需求快速增长,促使开发机器学习替代传统蒙特卡洛工具(如Geant4)。流匹配和基于扩散的生成模型因其样本质量而成为高维快速模拟的主流方法,但通常在推理时需要${\cal O}(100)$次函数评估,并常依赖辅助网络约束全局可观测量,损害了简化的端到端生成。我们引入了一个统一框架,改进了速度、簇射质量和物理保真度之间的平衡。该方法结合了:(i)平均速度场积分器,实现一步或少量评估的采样;(ii)从数据而非随机噪声构建的簇射空间学习生成先验;(iii)训练期间对关键可观测量施加归纳偏置的物理引导损失项。这些元素是训练时的正则化器,保持了端到端推理且无额外成本。仅需一步或少量评估步骤,该模型在多个公开的高粒度量热器数据集上达到了与最先进的流和扩散模型竞争的簇射质量。结果表明层间簇射结构与底层物理一致,为未来的快速模拟工作流提供了有力候选。
High-precision calorimeter simulation at current and future colliders imposes rapidly growing computational demands, motivating the development of machine-learning surrogates for traditional Monte Carlo tools such as Geant4. Flow matching and diffusion-based generative models have become leading approaches for high-dimensional fast simulation because of their sample quality, but typically require ${\cal O}(100)$ function evaluations at inference and often rely on auxiliary networks to constrain global observables, compromising streamlined end-to-end generation. We introduce a unified framework that improves the balance between speed, shower quality, and physics fidelity. The method combines: (i) an average velocity field integrator that enables sampling in one or a few evaluations; (ii) a learned generative prior in shower space, constructed from data rather than random noise; and (iii) physics-guided loss terms that impose inductive biases on key observables during training. These elements are training time regularizers, preserving end-to-end inference with no additional cost. With only one or a few evaluation steps, the model achieves shower quality competitive with state-of-the-art flow and diffusion approaches, tested on several public high granularity calorimeter datasets. The results demonstrate inter-layer shower structure consistent with the underlying physics, providing a strong candidate for future fast simulation workflows.
ADAPTOOD:面向分布外心电图时间序列模型的不确定性感知微调
Sotirios Vavaroutas, Yu Yvonne Wu, Ali Etemad, Cecilia Mascolo
AI总结 提出ADAPTOOD框架,利用数据不确定性量化分布偏移严重性,结合低秩更新和自适应超参数优化,在分布外心电图时间序列任务上提升准确率高达7%和精确率12.9%。
用于训练的数据样本通常与微调和部署期间遇到的数据不同,尽管机器学习模型显示出潜力,但在只有少量标注数据集可用时,其性能仍然有限。在由不同传感器、人群和应用设置引起的分布偏移下,性能通常会下降。尽管预训练有所帮助,但模型在现实环境中经常遇到分布外(OOD)数据,导致鲁棒性降低。现有的自适应方法通常假设固定的分布偏移,并在出现多种类型或严重性时难以应对。特别是,它们忽略了偏移的严重性,例如将适应大型熟悉数据集与适应带有新任务的小型数据集同等对待,这限制了泛化能力。为了解决这个问题,我们提出了ADAPTOOD,这是一个新颖的框架,利用数据不确定性来量化分布偏移的严重性并指导时间序列的微调。这种不确定性衡量目标部署分布中的样本与预训练分布偏离的程度,提供了OOD严重性的直接信号。我们的框架将这种不确定性与低秩模型更新和自适应超参数优化相结合,以改进自适应。我们表明,在OOD任务中,ADAPTOOD比现有方法实现了高达7%的准确率和12.9%的精确率提升,在分布偏移严重性增加时仍保持强劲性能。
Data samples used for training often differ from those encountered during fine-tuning and deployment, and while ML models show promise, their performance remains limited when only small annotated datasets are available. Performance often degrades under distribution shifts caused by diverse sensors, populations, and application settings. Although pre-training helps, models frequently encounter out-of-distribution (OOD) data in real-world settings, leading to reduced robustness. Existing adaptation methods usually assume fixed distribution shifts and struggle when multiple types or severities occur. In particular, they overlook shift severity, for example treating adaptation to a large familiar dataset the same as adaptation to a small dataset with a new task, which limits generalisation. To address this, we propose ADAPTOOD, a novel framework that leverages data uncertainty to quantify distribution shift severity and guide fine-tuning for time series. This uncertainty measures how strongly samples from the target deployment distribution deviate from the pre-training distribution, providing a direct signal of OOD severity. Our framework combines this uncertainty with low-rank model updates and adaptive hyperparameter optimisation to improve adaptation. We show that ADAPTOOD achieves up to 7% higher accuracy and 12.9% higher precision than existing methods in OOD tasks, maintaining strong performance as distribution shift severity increases.
当离线选择器无法超越最佳单一模型:基于edX辍学预测的诊断研究
Tyler Crosse, Alan Nadelsticher Ruvalcaba, Dustin Khang LeDuc, Thomas Trask, Nicholas Lytle, David Joyner
AI总结 针对离线选择器在实践中的表现常不如最佳单一模型的问题,提出三阶段诊断方法,通过k-NN标签一致性、离线学习器性能比较和状态特征消融实验,识别瓶颈为局部表示模糊性,建议改进状态或收集新数据而非调优学习器。
不同的预测器通常在不同输入上表现优异,因此每实例选择最佳预测器有望比固定单一模型获得更高准确率。在实践中,从日志数据训练的选择器经常无法击败最强的单一预测器。在进一步调优之前,三个原因通常未被区分:不匹配的学习器、无法预测哪个模型获胜的状态、或从缓存到部署的标签偏移。 一个三阶段诊断在共享缓存上排除这些原因。第一阶段通过k-NN标签一致性估计oracle恢复的局部上限。第二阶段询问配对BC和离线RL学习器(BC、DQN和CQL,跨惩罚权重)是否达到该上限。第三阶段消融选择器状态,测试更丰富的特征是否会提高上限。综合结论指向最有希望的下一步:调优学习器、重新设计状态或收集新数据。 我们将其应用于在edX点击流数据上选择五个辍学预测模型。在16个时间窗口上,oracle平均比最强单一基模型高出9.7个准确率点,但BC、DQN和CQL均落在其下方的相同测试准确率带内(对十倍缓存扫描和N=2,000个保留样本鲁棒)。瓶颈是局部表示模糊性:CQL缩小了模仿差距但无部署增益(非保守性),遗憾在学习器间紧密聚集(非打破平局),三个学习器在测试准确率上收敛(非偏移)。下一次迭代应改变状态或收集新数据,而非进一步调优离线学习器。
Different predictors often excel on different inputs, so picking the best one per instance promises higher accuracy than committing to a single model. In practice, selectors trained from logged data routinely fail to beat the strongest single predictor. Three causes typically go unseparated before more tuning is applied: a mismatched learner, a state that does not predict which model wins, or buffer-to-deployment label shift. A three-stage diagnostic rules them out on a shared buffer. Stage~1 estimates a local ceiling on oracle recovery from $k$-NN label consistency. Stage~2 asks whether paired BC and offline-RL learners (BC, DQN, and CQL across penalty weights) reach that ceiling. Stage~3 ablates the selector state to test whether richer features would raise it. The combined verdict points to the most promising next step: tuning the learner, redesigning the state, or collecting new data. We apply it to selecting among five dropout-prediction models on edX clickstream data. Across 16 windows, the oracle beats the strongest single base model by 9.7 accuracy points on average, yet BC, DQN, and CQL land in the same test-accuracy band below it (robust to a tenfold buffer sweep and $N{=}2{,}000$ held-out examples). The bottleneck is local representational ambiguity: CQL closes the imitation gap without a deployment gain (not conservatism), regret clusters tightly across learners (not tie-breaking), and the three learners converge on test accuracy (not shift). The next iteration should change the state or collect new data, not tune the offline learner further.
专家感知的拒绝引导
Anna C. Marbut, Daniel R. Olson, Travis J. Wheeler
AI总结 研究在混合专家(MoE)大语言模型中,通过专家感知的引导向量抑制拒绝行为,发现单个专家输出即可有效引导,且注意力机制在MoE拒绝行为中起重要作用。
指令调优的大语言模型(LLM)的安全对齐依赖于模型可靠地拒绝回答有害或不允许请求的能力。最近的研究表明,在推理过程中对密集LLM应用引导向量可以有效抑制拒绝行为,诱导模型响应有害请求。我们将这种拒绝引导方法扩展到三个开源混合专家(MoE)LLM,并发现引导性能不受MoE架构固有的复杂路由模式影响。然后,我们提出了两种专家感知的拒绝引导方法,利用拒绝特定的专家路由模式和专家特定的引导方向来抑制正常的拒绝行为。我们发现,基于单个专家的输出即可有效引导拒绝行为。我们的结果表明,引导方法捕获的拒绝信号与专家路由行为不同,这表明注意力在MoE拒绝行为中扮演重要角色。
Safety alignment in instruction-tuned large language models (LLMs) depends on a model's ability to reliably refuse to respond to harmful or disallowed requests. Recent work has shown that a steering vector can be applied to a dense LLM during inference to effectively suppress refusal behavior, inducing response to harmful requests. We extend this refusal steering method to three open-source Mixture-of-Experts (MoE) LLMs and find that steering performance is uninhibited by the complex routing patterns inherent to the MoE architecture. We then propose two expert-aware refusal steering methods that leverage refusal-specific expert routing patterns and expert-specific steering directions to suppress normal refusal behavior. We find that refusal behavior can be effectively steered based on the output of a single expert. Our results show that refusal signals captured by steering methods differ from expert routing behavior, suggesting a substantial role for attention in MoE refusal behavior.
多智能体风险规避规划中的下一最佳视角优化
Amirhossein Mollaei Khass, Vivek Pandey, Guangyi Liu, Athanasios Cosse, Emrah Bayrak, Nader Motee
AI总结 提出一种分布式、风险感知的多智能体下一最佳视角框架,通过共识ADMM优化信息增益并建模碰撞风险,在降低通信开销的同时接近集中式方法的映射质量和轨迹安全性。
在不确定和未知环境中,多智能体下一最佳视角选择用于安全路径规划需要信息丰富、安全感知且高效的协调。集中式方法依赖于共享原始传感器数据或大量通信开销,导致可扩展性有限。我们提出一种分布式、风险感知的多智能体NBV框架,其中每个机器人维护一个私有的局部3D高斯溅射地图,团队共同最大化沿规划轨迹的掩蔽区域内的期望信息增益。通过通信图上的共识ADMM求解分布式目标,每个机器人仅交换候选视角、规划轨迹描述符和标量EIG贡献。通过局部3DGS地图上的平均风险价值对每条轨迹的碰撞风险进行建模,并用于塑造掩蔽半径和评分规划路径。在多个团队规模的Gibson环境中的实验表明,分布式公式在映射质量和轨迹安全性方面接近集中式基线,同时将通信量降低数个数量级。
Multi-agent Next-Best-View (NBV) selection for safe path planning in uncertain and unknown environments requires informative, safety-aware, and efficient coordination. Centralized approaches rely on sharing raw sensor data or significant communication overhead, resulting in limited scalability. We propose a distributed, risk-aware multi-agent NBV framework in which each robot maintains a private local 3D Gaussian Splatting map and the team jointly maximizes expected information gain (EIG) restricted to masked zones along planned trajectories. The resulting distributed objective is solved by Consensus ADMM (C-ADMM) over a communication graph, with each robot exchanging only candidate viewpoints, planned trajectory descriptors, and scalar EIG contributions. Collision risk along each trajectory is modeled via Average Value-at-Risk (AV@R) over the local 3DGS map and used both to shape the masking radius and to score planned paths. Experiments in Gibson environments at multiple team sizes show that the distributed formulation approaches the centralized baseline in mapping quality and trajectory safety while reducing communication by orders of magnitude.
遥操作中触觉引导模型的选择:来自比较用户研究的指南
Alexis Boulay, Margot Vulliez, David Daney
AI总结 通过用户研究比较弹簧-阻尼器、势场和引导管三种触觉引导模型,提出基于环境特征和实时评估指标的模型选择指南。
遥操作中的触觉引导通过力反馈增强操作员性能。本文提出了考虑任务、环境和操作员的最合适模型选择指南。我们定义了一个统一公式,将最常见的模型(弹簧-阻尼器、势场和引导管)表示为具有特定模型引导函数的刚度-阻尼系统的变体。我们进行了一项用户研究,在垂直农业任务中比较了三种经典模型在六种不同环境条件下的场景。结果显示没有普遍优越的模型:弹簧-阻尼器在杂乱环境中表现优异,势场在自由空间中表现良好(但在障碍物附近存在风险),而引导管提供了平衡的折衷。我们提出了新颖的客观指标来评估交互,并表明引导力大小与舒适度和信任度评分相关。这些发现通过环境特征和实时评估指标提供了实用的模型选择指南。
Haptic guidance in teleoperation enhances operator performance through force feedback. This paper presents guidelines to select the most appropriate model considering the task, the environment and the operator. We define a unified formulation expressing most common models (spring-damper, potential field, and guiding tube) as variations of a stiffness-damping system with model-specific guiding functions. We conducted a user study comparing the three classical models across six scenarios with varying environmental conditions in a vertical farming task. Results show no universally superior model: spring-damper excels in cluttered environments, potential field in free spaces (but it shows risks near obstacles), and guiding tube offers a balanced compromise. We propose novel objective metrics to evaluate the interaction, and show that guiding force magnitude correlates with comfort and trust scores. These findings provide practical model selection guidelines through environmental characteristics and real-time evaluation metrics.
SocialCoach: 基于强化学习的智能辅导与练习的个性化社交技能学习
Tianfu Wang, Max Xiong, Jianxun Lian, Hongyuan Zhu, Zhengyu Hu, Yuxuan Lei, Linxiao Gong, Xiaofang Li, Peiting Tsai, Nicholas Jing Yuan, Qi Zhang
AI总结 提出SocialCoach系统,利用多智能体管道构建知识语料库、强化学习优化自适应练习调度,并结合沉浸式实践与反思辅导,以解决社交技能学习中专家辅导稀缺和知行差距问题。
社交技能如谈判和领导力在当今互联世界中对于个人和职业成功至关重要。然而,由于专家辅导的稀缺,可扩展且有效的培训仍然是一个重大挑战。在本文中,我们介绍了SocialCoach,一个全面的LLM驱动的智能辅导系统,用于大规模个性化社交技能发展。首先,SocialCoach利用多智能体管道,从多样化的专家来源自动构建一个基于教学法的理论到实践知识语料库。其次,为了个性化学习旅程,它采用了一个自适应练习调度模块,遵循处方-检索-适应过程。为了在克服冷启动问题的同时最大化长期学习体验,该策略通过强化学习在学习者模拟环境中进行优化。最后,SocialCoach整合了沉浸式目标驱动练习、因果驱动能力评估和基于知识的反思辅导,以帮助解决知行差距。我们在产品EQoach中部署了该系统,并进行了广泛实验。结果表明,SocialCoach在模拟路径质量和评委评估的辅导质量上优于基线方法,而早期用户反馈表明其具有强烈的感知参与度和有用性。这些发现为个性化、游戏化的软技能学习教学平台提供了一种实用架构。
Social skills such as negotiation and leadership are crucial for personal and professional success in today's interconnected world. However, scalable and effective training remains a significant challenge due to the scarcity of expert coaching. In this paper, we introduce SocialCoach, a holistic LLM-powered agentic tutoring system for personalized social skill development at scale. First, SocialCoach automatically constructs a pedagogically-grounded, theory-to-practice knowledge corpus from diverse expert sources, leveraging a multi-agent pipeline. Second, to personalize the learning journey, it employs an adaptive practice scheduling module that follows a prescription-retrieval-adaptation process. To maximize the long-term learning experience while overcoming the cold-start problem, this policy is optimized within a learner simulation environment through reinforcement learning. Finally, SocialCoach integrates immersive, goal-driven practice, causality-driven proficiency assessment and knowledge-grounded, reflective tutoring to help address the knowing-doing gap. We deploy it in our product, EQoach, and conduct extensive experiments. The results show that SocialCoach improves simulated pathway quality and judge-rated tutoring quality over baseline approaches, while early user feedback indicates strong perceived engagement and usefulness. These findings suggest a practical architecture for personalized and gamified pedagogical platforms on soft skill learning.
EpiFormer: 通过几何深度学习学习抗原-抗体相互作用进行表位预测
Mansoor Ahmed, Huirong Chai, Haoxin Wang, Hemanth Venkateswara, Murray Patterson
AI总结 提出EpiFormer编码器-解码器框架,通过GNN层间交叉注意力实现抗原-抗体双向信息流,结合稀疏感知目标,在表位预测任务上F1分数提升超40%。
抗体通过结合称为表位的特定表面区域来中和外来抗原。计算表位预测对于理解免疫识别和指导抗体工程至关重要。然而,现有方法面临三个基本挑战:抗体感知模型独立编码每条链并在后期才进行组合,无法捕捉定义结合界面的共依赖结构特征;而严重的类别不平衡和已知抗体-抗原复合物的稀缺使得标准训练目标无效。我们提出EpiFormer,一个通用的编码器-解码器框架,联合解决这些挑战。我们的关键设计原则是在GNN编码层内进行交错交叉注意力,使得抗原-抗体信息流贯穿整个表示学习过程,而不仅仅在输出时。这种早期融合原则与主干无关,从简单的GCN到等变模型,在各种GNN架构上都能提供一致的改进。我们进一步表明,当与早期融合架构配对时,稀疏感知目标对于表位预测任务是有效的。EpiFormer在标准基准上的F1分数比之前的最佳方法提高了40%以上,展示了泛化能力和跨数据集迁移性。值得注意的是,EpiFormer发现已知的生物学原理作为端到端训练的涌现行为,其中学习到的交叉注意力门控倾向于抗原到抗体的信息流,与两条链在结合界面的不对称角色一致,并且模型对几何特征而非进化特征的偏好与已建立的发现(表位残基并非进化保守)一致。源代码可在https://github.com/mansoor181/epiformer.git获取。
Antibodies neutralize foreign antigens by binding to specific surface regions called epitopes. Computational epitope prediction is critical for understanding immune recognition and guiding antibody engineering. However, existing methods face three fundamental challenges: antibody-aware models encode each chain independently and combine them only at a late stage, failing to capture co-dependent structural features that define binding interfaces, whereas severe class imbalance and scarcity of known antibody-antigen complexes render standard training objectives ineffective. We propose EpiFormer, a general encoder-decoder framework that addresses these challenges jointly. Our key design principle is interleaved cross-attention within GNN encoding layers, enabling bidirectional antigen-antibody information flow throughout representation learning rather than only at the output. This early-fusion principle is backbone-agnostic, providing consistent gains across GNN architectures from simple GCNs to equivariant models. We further show that sparsity-aware objectives are effective when paired with early-fusion architectures for the epitope prediction task. EpiFormer improves over the previous best method by over 40% in F1 score on standard benchmarks, demonstrating generalizability and cross-dataset transferability. Notably, EpiFormer discovers known biological principles as emergent behaviors of end-to-end training, where the learned cross-attention gates favor antigen-to-antibody information flow, consistent with the asymmetric roles of the two chains at the binding interface, and the model's preference for geometric over evolutionary features aligns with the established finding that epitope residues are not evolutionarily conserved. The source code is available at: https://github.com/mansoor181/epiformer.git
通过符号思考:PEEL作为认知可问责的AI赋能研究的符号脚手架
Clarisse de Souza, Gabriel Barbosa, Simone Diniz Junqueira Barbosa, Bárbara Betts, Renato Cerqueira, Juliana Jansen Ferreira
AI总结 本文提出PEEL框架,结合Voyant Tools的确定性远读与Claude的LLM解释,基于皮尔斯符号学和溯因推理,揭示AI生成摘要中的系统性扭曲,并得出三项设计启示。
大型语言模型正在重塑研究实践,同时悄然侵蚀研究者的认知可问责性。本文评论介绍了PEEL——AI中认知参与素养的协议,这是一个工作脚手架,它结合了通过Voyant Tools进行的确定性远读和通过Claude进行的LLM解释,基于皮尔斯符号学和溯因推理。应用于三个源文本的AI生成浓缩版本,PEEL揭示了在没有非AI测量的情况下不可见的数量、词频和认知声音的系统性扭曲,并产生了三项设计启示:确定性工具必须伴随AI工具;流畅性不等于保真度;认知权威必须被设计进来,而不是被假定。
Large language models are reshaping research practice while quietly eroding researchers epistemic accountability. This commentary introduces PEEL - Protocols for Epistemically Engaged Literacy in AI, a working scaffolding that combines deterministic distant reading via Voyant Tools with LLM interpretation via Claude, grounded in Peircean semiotics and abductive reasoning. Applied to AI-generated condensations of three source texts, PEEL reveals systematic distortions in quantity, term frequency, and epistemic voice that are invisible without non-AI measurement -- and yields three design implications: deterministic instruments must accompany AI tools; fluency is not fidelity; epistemic authority must be designed in, not assumed.
偶然陷入AI情感依赖:日常AI互动如何重塑人际关系
Yaoxi Shi, Cathy Mengying Fang, Pattie Maez, Amit Goldenberg
AI总结 本文通过实证研究,揭示AI情感支持通常在日常任务导向的互动中偶然产生,且这种路径依赖会改变人们对AI情感能力的信念,导致对AI的偏好增加、对人类的偏好减少。
公共讨论和新兴政策通常假设AI情感支持是一种有意的行为:孤独的用户有意识地寻求专用伴侣聊天机器人的安慰。在本文中,我们基于新兴的实证证据,认为这种描述在两个层面上不准确,既涉及AI情感支持的产生方式,也涉及它如何塑造未来行为。首先,AI情感支持通常是在通用平台上的任务导向互动中偶然产生的,就像工作场所的友谊通过合作加深一样。其次,这些偶然遭遇是路径依赖的:对AI情感支持的积极体验会更新人们对AI情感能力的信念,并改变他们未来寻求情感支持的选择,增加对AI的偏好,减少对人类的偏好。我们回顾了最近的证据,包括与OpenAI合作进行的一项大规模纵向研究,该研究显示,每天与AI进行五分钟关于个人问题的对话,持续28天,导致寻求人类支持的偏好下降10.3%,对AI的偏好上升11.6%。这些发现表明,当前专注于伴侣应用和孤立互动的政策无法充分保护人际关系。相反,有效的监管应扩展到通用AI系统,并解决人们寻求支持方式的累积性、轨迹层面的变化。认识到人们如何偶然陷入AI情感支持,以及这些遭遇如何随时间重塑人际关系,对于保障人类福祉至关重要。
Public discourse and emerging policy typically assume that AI emotional support is a deliberate act: a lonely user consciously seeking comfort from a dedicated companion chatbot. In this paper, we draw on emerging empirical evidence and argue that this picture is inaccurate on two accounts, both in how AI emotional support arises and how it shapes future behavior. First, AI emotional support commonly emerges incidentally within task-oriented interactions on general-purpose platforms, much as workplace friendships deepen through collaboration. Second, these incidental encounters are path-dependent: positive experiences of AI emotional support update people's beliefs about AI's emotional capabilities and redirect their choices for future emotional support, increasing preference for AI and decreasing preference for humans. We review recent evidence, including a large-scale longitudinal study conducted in collaboration with OpenAI, showing that daily five-minute conversations with an AI about personal issues over 28 days led to a 10.3% decrease in the preference for seeking support from humans and an 11.6% increase in the preference for AI. These findings suggest that current policy, focused on companion apps and isolated interactions, cannot adequately protect human connection. Instead, effective regulations should extend to general-purpose AI systems and address cumulative, trajectory-level changes in how people seek support. Recognizing how people stumble into AI emotional support and how those encounters redirect human connections over time is essential to safeguarding human well-being.
CoPark:通过自我对弈学习反应式泊车
Jiarong Wei, Yanxing Chen, Sinuo Song, Yin Wu, Anna Rehr, Abhinav Valada
AI总结 提出CoPark,一种基于残差策略的多智能体自我对弈强化学习方法,通过固定先验与残差头结合,在反应式泊车中实现高精度与安全交互的平衡,显著优于基线方法。
学习一个能够以高几何精度达到目标同时与附近智能体安全交互的单一策略面临相互冲突的目标。精度有利于固定几何计划的执行,而交互则要求在另一智能体侵入时立即偏离,导致针对一个目标优化的策略往往在另一个目标上失败。我们在反应式自主泊车的背景下研究这一问题,其中多辆车必须达到指定车位,终端精度达到亚米级,同时在整个操作过程中对邻近车辆保持响应。我们提出CoPark,一种基于残差策略架构的多智能体自我对弈RL方法。预计算的离线计划提供固定的动作先验,而残差头学习反应式修正。残差策略在自我对弈下学习行为,弥补数据和脚本的不足,而固定先验保持纯策略难以可靠达到的车位框架几何。关键设计是一种合作伙伴威胁调制的通道非对称先验释放。连续威胁信号将纵向通道的权限转移给残差头以实现让行,而横向通道仍锚定在预计算参考上以保持亚米级车位对齐。闭环细化层修正动作网格离散化带来的残差终端误差。我们在六个停车场训练策略,并在我们的新反应式泊车基准(包括Dragon Lake Parking (DLP)和DeepScenario Open 3D (DSC3D))上进行零样本评估。CoPark实现了约70-85%的成功率,碰撞率仅为3-6%,显著优于经典、模仿学习和大规模RL基线。重要的是,结果展示了涌现的交互行为,如倒车让行、中途让行、狭窄通道通行和排队。
Learning a single policy that reaches a goal with high geometric precision while interacting safely with nearby agents poses conflicting objectives. Precision favors commitment to a fixed geometric plan, whereas interaction requires immediate deviation when another agent intrudes, causing policies optimized for one objective to often fail at the other. We study this problem in the context of reactive autonomous parking, where multiple vehicles must reach assigned slots with sub-meter terminal accuracy while remaining responsive to neighboring vehicles throughout the maneuver. We propose CoPark, a multi-agent self-play RL approach built on a residual-policy architecture. A precomputed offline plan provides a fixed action prior, while a residual head learns the reactive corrections. The residual policy learns behaviors under self-play, where data and scripting fall short, while the fixed prior holds the slot-frame geometry that pure policies struggle to reach reliably. The key design is a partner-threat-modulated, channel-asymmetric release of the prior. A continuous threat signal shifts authority of the longitudinal channel to the residual head to enable yielding, while the lateral channel remains anchored to the precomputed reference to preserve sub-meter slot alignment. A closed-loop refinement layer corrects residual terminal error from action-grid discretization. We train our policy on six parking lots and evaluate zero-shot on our new reactive-parking benchmark spanning Dragon Lake Parking (DLP) and DeepScenario Open 3D (DSC3D). CoPark achieves ~70-85% success with only 3-6% collision rate, substantially outperforming classical, imitation-learning, and large-scale RL baselines. Importantly, the results demonstrate emergent interaction behaviors such as reverse-yielding, mid-maneuver yielding, tight-corridor passing, and queuing.
物理信息机器学习用于短期洪水预测
Tewodros Syum Gebre, Jagrati Talreja, Leila Hashemi-Beni
AI总结 提出一种物理信息机器学习框架,通过将水文知识作为趋势对齐约束嵌入LSTM损失函数,在数据稀缺和极端天气下提升洪水预测的物理一致性和可靠性。
准确的洪水预测对于减轻灾害风险和保护社区至关重要。然而,纯数据驱动的机器学习模型在数据稀缺环境中常常表现不佳,并可能违反基本的水文原理。标准长短期记忆(LSTM)网络可能产生物理上不一致的预测,特别是在外推到极端天气条件时。为了解决这些限制,我们提出了一种物理信息机器学习(PIML)框架,将水文知识直接纳入LSTM模型的损失函数中。具体来说,趋势对齐约束惩罚降水与流量趋势之间的方向不一致性,从而在不需复杂水动力学方程的情况下提高模型鲁棒性。这种正则化鼓励模型学习物理上合理的水文过程线行为,即使在训练数据有限的情况下,也能增强峰值洪水事件期间的可靠性。实验结果表明,所提出的物理信息模型在数据稀缺环境下优于标准LSTM基线,当仅使用5%的可用数据训练时,纳什-萨特克利夫效率(NSE)从0.20提高到0.23。在模拟极端气候情景下的额外压力测试表明,基线模型表现出不稳定的行为,而物理信息模型保持了方向一致性和物理合理性。尽管在数据有限的情况下准确预测极端峰值幅度仍然具有挑战性,但所提出的方法显著减少了纯数据驱动模型中常见的非物理波动。这些发现表明,简单的物理约束可以显著提高深度学习模型在实时洪水预测中的可靠性,为无测站流域和不断变化的气候条件提供了实用解决方案。
Accurate flood forecasting is essential for mitigating disaster risks and protecting communities. However, purely data-driven machine learning models often struggle in data-scarce environments and may violate fundamental hydrological principles. Standard Long Short-Term Memory (LSTM) networks can generate physically inconsistent predictions, particularly when extrapolating to extreme weather conditions. To address these limitations, we propose a Physics-Informed Machine Learning (PIML) framework that incorporates hydrological knowledge directly into the loss function of an LSTM model. Specifically, a Trend Alignment constraint penalizes directional inconsistencies between precipitation and discharge trends, improving model robustness without requiring complex hydrodynamic equations. This regularization encourages the model to learn physically plausible hydrograph behavior, even with limited training data, while enhancing reliability during peak flood events. Experimental results show that the proposed physics-informed model outperforms a standard LSTM baseline in data-scarce settings, increasing the Nash-Sutcliffe Efficiency (NSE) from 0.20 to 0.23 when trained on only 5% of the available data. Additional stress tests under simulated extreme climate scenarios demonstrate that the baseline model exhibits unstable behavior, whereas the physics-informed model maintains directional consistency and physical plausibility. Although accurately predicting extreme peak magnitudes remains challenging with limited data, the proposed approach substantially reduces unphysical fluctuations common in purely data-driven models. These findings demonstrate that simple physical constraints can significantly improve the reliability of deep learning models for real-time flood forecasting, offering a practical solution for ungauged basins and evolving climate conditions.
当场抓获(激活):面向LLM智能体的凭证泄露预输出和多轮检测
Kargi Chauhan, Pratibha Revankar
AI总结 研究通过激活探针、蜜令令牌和累积信息流追踪三种互补防御方法,在预输出和多轮对话中检测LLM智能体的凭证泄露。
LLM智能体通常将敏感凭证与不可信检索内容置于同一上下文窗口中,为间接提示注入诱导凭证泄露提供了直接途径。我们通过三种互补防御研究这种失效模式。首先,我们探究激活探针能否在输出令牌发出前检测凭证访问。其次,我们从格式特定的字符模型构建蜜令令牌,并使用分裂共形预测校准检测。第三,我们将多轮泄露视为累积信息流问题,并跨对话轮次追踪估计的泄露预算。在开放权重模型的受控实验中,激活特征能够高精度区分良性提示和凭证窃取提示,包括在保留编码变换下。在一个小型合成多轮测试集中,累积会计检测到了每轮检测器遗漏的攻击。这些结果是初步的:多轮基准测试为内部小型数据集,激活方法需要白盒访问,信息估计器提供的是实用信号而非正式上界。尽管如此,结果表明凭证泄露防御应结合预输出监控、校准的金丝雀检测和时间泄露会计,而非仅依赖文本级输出过滤器。
LLM agents often place sensitive credentials in the same context window as untrusted retrieved content, creating a direct path for indirect prompt injection to induce credential exfiltration. We study this failure mode through three complementary defenses. First, we ask whether activation probes can detect credential access before output tokens are emitted. Second, we construct honeytokens from format-specific character models and calibrate detection with split conformal prediction. Third, we treat multi-turn exfiltration as a cumulative information-flow problem and track an estimated leakage budget across conversation turns. In controlled experiments on open-weight models, activation features separate benign and credential-seeking prompts with high accuracy, including under held-out encoding transformations. In a small synthetic multi-turn suite, cumulative accounting detects attacks that per-turn detectors miss. These results are preliminary: the multi-turn benchmark is in-house and small, the activation method requires white-box access, and the information estimator provides a practical signal rather than a formal upper bound. Still, the results suggest that credential-exfiltration defenses should combine pre-output monitoring, calibrated canary detection, and temporal leakage accounting rather than relying only on text-level output filters.
平稳性感知的检索增强时间序列预测
Shiqiao Zhou, Holger Schöner, Zipeng Wu, Edouard Fouché, IAG Wilson, Shuo Wang
AI总结 提出SARAF框架,通过自适应平衡检索相关性与多样性,并利用平稳性感知聚合,提升非平稳时间序列预测的准确性和鲁棒性。
时间序列预测依赖于历史模式,但真实世界序列通常表现出非平稳性和制度转换,这对全参数预测器构成挑战。受检索增强生成(RAG)启发,最近的工作通过检索相关历史片段并在推理时将其作为外部证据来增强预测器。然而,由于真实世界时间序列的内在非平稳性,高度相似的过去片段并不一定意味着相似的未来,这使得仅基于相似性的检索脆弱且容易冗余。我们提出平稳性感知的检索增强时间序列预测(SARAF),这是一个自适应平衡检索中相关性和多样性的框架。SARAF首先通过时间对齐增强的时间相似性形成候选池,然后应用多样性感知选择策略覆盖异质历史制度,其中多样化强度由数据集级别的平稳性自动调节。此外,SARAF使用平稳性感知聚合来融合检索到的未来。在八个真实世界数据集上的大量实验表明,SARAF实现了有竞争力的预测性能,并在强基线上提高了平均准确性和鲁棒性,在具有挑战性的非平稳设置下尤其明显。代码:https://github.com/ShiqiaoZhou/SARAF。
Time series forecasting relies on historical patterns, but real-world series often exhibit non-stationarity and regime shifts that challenge fully parametric forecasters. Inspired by Retrieval-Augmented Generation (RAG), recent work augments forecasters by retrieving relevant historical segments and using them as external evidence at inference time. However, due to the intrinsic non-stationarity of real-world time series, a highly similar past segment does not necessarily imply a similar future, rendering similarity-only retrieval brittle and prone to redundancy. We propose Stationarity-Aware Retrieval-Augmented Time Series Forecasting (SARAF), a framework that adaptively balances relevance and diversity in retrieval. SARAF first forms a candidate pool via temporal similarity with time-aligned enhancement, then applies a diversity-aware selection strategy to cover heterogeneous historical regimes, with the diversification strength automatically modulated by dataset-level stationarity. Moreover, SARAF uses stationarity-aware aggregation to fuse the retrieved futures. Extensive experiments on eight real-world datasets show that SARAF achieves competitive forecasting performance and improves average accuracy and robustness over strong baselines, with particularly clear benefits under challenging non-stationary settings. Code: https://github.com/ShiqiaoZhou/SARAF.
Pinpoint: 基于跨源检索与重排序的全球图像地理定位
Nika Chuzhoy, Brian Hu, Amit A. Arora, Jae Ro, Sarthak S. Sahu
AI总结 提出一种检索-重排序架构Pinpoint,通过对比学习融合Flickr照片和街景图像,结合注意力重排序器利用跨源证据实现全球图像地理定位,在多个基准上达到最优。
图像地理定位旨在根据视觉内容估计照片拍摄地点。在全球范围内,由于视觉证据往往模糊、多样且分布不均,这仍然具有挑战性。先前的工作通常将普通互联网照片和街景图像的地理定位视为独立任务,尽管它们具有互补优势:互联网照片更匹配用户拍摄查询的外观分布,而街景图像提供更密集、地理覆盖更广的参考。我们提出Pinpoint,一种检索-重排序架构,以由粗到细的流程结合两种数据源。对比图像-GPS嵌入器在用户上传的Flickr照片和街景图像上训练,学习共享的图像-GPS嵌入空间,用于检索候选位置。然后,基于注意力的重排序器通过结合候选级别的视觉和GPS特征以及来自附近位置的跨源证据,对检索到的候选进行重新评分,以确定预测。与最近的先前工作不同,Pinpoint不依赖多模态大语言模型,使得推理更快且更具可重复性。Pinpoint在互联网照片(IM2GPS3k和YFCC4k)和街景图像(OSV-5M)的标准基准上,在所有指标上均达到最先进的结果。
Image geolocation aims to estimate where a photograph was taken from its visual content. At worldwide scale, this remains challenging because visual evidence is often ambiguous, diverse, and unevenly distributed. Prior work has typically treated geolocation of ordinary internet photos and street-view imagery as separate tasks, despite their complementary strengths: internet photos better match the appearance distribution of user-captured queries, while street-view imagery provides denser, geographically grounded coverage. We present Pinpoint, a retrieve-and-rerank architecture that combines both sources in a coarse-to-fine pipeline. A contrastive image-GPS embedder is trained on both user-uploaded Flickr photos and street-view imagery, learning a shared image-GPS embedding space that is used to retrieve candidate locations. An attention-based reranker then rescores retrieved candidates by combining candidate-level visual and GPS features with cross-source evidence from nearby locations to ground the prediction. Unlike recent prior work, Pinpoint does not rely on multimodal large-language models, making inference faster and more reproducible. Pinpoint achieves state-of-the-art results across all metrics on standard benchmarks for internet photos (IM2GPS3k and YFCC4k) and street-view imagery (OSV-5M).
CLAW: 通过对抗潜在正则化学习连续潜在动作世界模型
Tewodros Ayalew, Matthew Jeung, Samuel Wheeler, Xiao Zhang, Andre de la Cruz Arce, Kaylene Stocking, Michael Maire, Matthew R. Walter
AI总结 提出CLAW框架,利用对抗潜在正则化和扩散视频生成,从无动作视频中端到端学习世界模型与连续潜在动作表示,支持观察模仿学习和目标导向规划。
我们引入了CLAW,一个完全端到端的自监督框架,用于直接从无动作视频中联合学习世界模型和连续潜在动作表示。我们的方法利用对抗潜在正则化和基于扩散的视频生成来捕获结构化和语义上有意义的动作表示,同时建模丰富的、可预测的环境动态,而不依赖于任何动作标签或注释。通过同时训练潜在动作模型和世界模型,CLAW学会仅从视觉观察中推理推断的动作如何引起环境转变。我们展示了由此产生的潜在动作世界模型支持从观察中模仿学习和目标导向规划。在模仿学习中,从原始视频中提取的潜在动作实现了行为克隆。对于规划,CLAW生成潜在动作序列并将其映射到可执行动作以达到期望目标。跨多种任务和实体的广泛实验表明,CLAW产生了语义上有意义的潜在动作表示,支持有效的动作迁移,并实现了规划和从观察中模仿,优于现有方法。
We introduce CLAW, a fully end-to-end self-supervised framework for learning a world model jointly with continuous latent action representations directly from action-free videos. Our approach leverages adversarial latent regularization and diffusion-based video generation to capture structured and semantically meaningful action representations while modeling rich, predictive environment dynamics, without relying on any action labels or annotations. By simultaneously training the Latent Action Model and world model, CLAW learns to reason about how inferred actions induce environment transitions from visual observations alone. We show that the resulting latent action world model supports both imitation learning from observation and goal-directed planning. In imitation learning, latent actions extracted from raw videos enable behavior cloning. For planning, CLAW generates sequences of latent actions and maps them to executable actions to reach desired goals. Extensive experiments across diverse tasks and embodiments demonstrate that CLAW produces semantically meaningful latent action representations, supports effective action transfer, and enables planning and imitation from observation, outperforming existing methods.
当检索无济于事:生物医学RAG的大规模研究
Erfan Nourbakhsh, Rocky Slavin, Ke Yang, Anthony Rios
AI总结 本研究通过大规模实验发现,检索增强生成(RAG)在生物医学问答中仅带来微小且不一致的提升(1-2%),主要瓶颈在于模型有效利用检索证据的能力不足。
医学问答是一个高风险场景,事实错误可能导致严重后果。检索增强生成(RAG)被广泛视为一种有前景的解决方案,先前的研究报告称大型医学问答模型有显著提升。我们在一系列7B到72B参数的开源指令调优模型上重新审视了这一假设。在五个模型、十个生物医学QA数据集、四种检索方法和四个检索语料库上,我们发现与无检索基线相比,检索仅带来微小且不一致的改进,通常在1-2个百分点内。相比之下,骨干模型的选择比检索器或语料库的选择影响大得多,并且在大多数设置中,专家和外行检索源的表现相似。这些结果表明,主要瓶颈不仅仅是检索质量,而是模型有效利用检索证据的能力有限。
Medical question answering is a high-stakes setting where factual errors can have serious consequences. Retrieval-augmented generation (RAG) is widely viewed as a promising solution, and prior work has reported substantial gains for large medical QA models. We revisit this assumption across a broad range of open-weight instruction-tuned models spanning 7B to 72B parameters. Across five models, ten biomedical QA datasets, four retrieval methods, and four retrieval corpora, we find that retrieval yields only small and inconsistent improvements over a no-retrieval baseline, typically within 1-2 points. In contrast, the choice of backbone model has a much larger effect than the choice of retriever or corpus, and expert and layman retrieval sources perform similarly in most settings. These results suggest that the main bottleneck is not retrieval quality alone, but the model's limited ability to use retrieved evidence effectively.
HighTide:一个由智能体策划的开源VLSI基准测试套件
Benjamin Goldblatt, Paolo Pedroso, Farhad Modaresi, Ethan Sifferman, Matthew R. Guthaus
AI总结 提出HighTide,一个由AI辅助策划的开源VLSI基准测试套件,通过12种智能体技能覆盖设计生命周期,并集成Bazel增量编译和远程缓存。
我们介绍HighTide,一个不断演进的AI辅助基准测试套件。具体贡献包括:(i) 一个涵盖多种设计语言和技术节点的多样化开源套件,(ii) 基于Bazel的增量RTL到GDS编译,支持远程缓存,(iii) 通过十二种智能体技能进行AI辅助设计策划,覆盖设计生命周期、流程优化、工具参考和元维护,并配有每个设计的决策日志,作为跨套件调优理由的长期记忆,以及(iv) 一个包含RTL编译验证的基础设施,用于稳定发布。该套件公开可用,并旨在与开源硬件生态系统共同成长。
We introduce HighTide, an evolving AI-assisted benchmark suite. Specifically, the contributions are: (i) a diverse open-source suite spanning multiple design languages and technology nodes, (ii) Bazel-based incremental RTL-to-GDS compilation with remote caching, (iii) AI-assisted design curation through twelve agent skills covering the design lifecycle, flow optimization, tool reference, and meta-maintenance, backed by per-design decision logs that serve as long-term memory of tuning rationale across the suite, and (iv) an infrastructure with RTL compilation verification for stable releases. The suite is publicly available and designed to grow with the open-source hardware ecosystem.
基于大语言模型的语义约束综合用于自适应轨迹优化
Eleanor Brosius, Yuji Takubo, Daniele Gammelli, Simone D'Amico, Marco Pavone
AI总结 提出利用大语言模型将自然语言描述的任务需求转化为可执行的轨迹优化代码和数学公式,在航天器交会场景中实现了从语义需求重构凸轨迹优化问题的高成功率。
轨迹优化是实现太空探索中安全可靠自主操作的关键组成部分。随着太空任务在频率、复杂性和范围上的增加,迫切需要快速制定数学上合理的轨迹优化问题,以准确反映任务目标和操作约束。然而,将任务意图转化为易于处理的轨迹优化分析公式需要大量的领域专业知识。本文提出一个框架,利用大语言模型(LLMs)将任务需求和约束的自然语言描述转化为可执行的轨迹优化代码及相应的数学公式。在航天器交会场景中的实验表明,从语义任务需求重构凸轨迹优化问题具有高成功率。最终,这项工作凸显了LLMs在连接高层意图与形式化优化模型方面的潜力,从而实现更灵活高效的航天器轨迹设计。
Trajectory optimization is a critical component for enabling safe and reliable autonomous operations in space exploration. As space missions increase in frequency, complexity, and scope, there is a growing need to rapidly formulate mathematically sound trajectory optimization problems that accurately reflect mission objectives and operational constraints. However, translating mission intent into tractable analytical formulations for trajectory optimization requires substantial domain expertise. This paper presents a framework that leverages large language models (LLMs) to translate natural language descriptions of mission requirements and constraints into executable trajectory optimization code and corresponding mathematical formulations. Experiments in spacecraft rendezvous scenarios demonstrate a high success rate in reconditioning a convex trajectory optimization problem from semantic mission requirements. Ultimately, this work highlights the potential of LLMs to bridge high-level intent and formal optimization models, enabling more flexible and efficient trajectory design of spacecraft.
veriFIRE:基于DNN的野火检测系统一致性属性验证的工业案例研究
Idan Refaeli, Maya Swisa, Itay Buchnik, Alon Zada, Guy Amir, Elad Mandelbaum, Ziv Freund, Guy Katz
AI总结 本文提出一种端到端方法,通过将应用需求编码为求解器兼容查询,利用现有神经网络验证器验证野火检测系统中的单调性和有界响应等一致性属性,并在真实背景样本上评估,展示了工业系统可获得有意义的领域特定保证。
我们介绍了veriFIRE项目的当前工作:一个工业界与学术界的合作项目,旨在应用验证来提高一个真实世界安全关键系统的可靠性。具体来说,我们针对一个用于野火检测的机载平台,该平台包含两个深度神经网络。我们提出了一种端到端的方法来验证该系统中的 extit{一致性属性}。我们的方法将基于应用的需求编码为现有神经网络验证器可求解的查询。我们研究了关键操作场景下的感兴趣属性:(i) 检测器置信度随目标强度增加而单调递增;(ii) 在传感器物理上合理的模糊下,检测器响应有界。我们使用最先进的神经网络验证后端实例化这些编码,并在真实背景样本上大规模评估。对于第一个属性,所有验证查询在五分钟内解决。对于第二个属性,验证难度显著增加,突出了更丰富、更高维规格的关键可扩展性挑战。总体而言,结果表明可以为工业系统获得有意义的、领域特定的保证。
We present our ongoing work on the veriFIRE project: a collaboration between industry and academia, aimed at applying verification to increase the reliability of a real-world, safety-critical system. Specifically, we target an airborne platform for wildfire detection, which incorporates two deep neural networks. We present an end-to-end methodology for verifying \textit{consistency properties} in this system. Our approach encodes application-grounded requirements into solver-compatible queries for existing neural network verifiers. We study properties of interest over critical operational scenarios: (i) monotonicity of detector confidence as target intensity increases; and (ii) bounded detector response under physically plausible blur over the sensor. We instantiate these encodings using state-of-the-art neural network verification backends and evaluate them at scale on real background samples. For the first property, all verification queries are solved in under five minutes. For the second property, verification is substantially harder, highlighting key scalability challenges for richer, higher-dimensional specifications. Overall, the results demonstrate that meaningful, domain-specific guarantees can be obtained for industrial systems.