arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 2083
2605.12919 2026-05-14 cs.CV

GuardMarkGS: Unified Ownership Tracing and Edit Deterrence for 3D Gaussian Splatting

Utae Jeong, Jaewan Choi, Junseok Lee, Jongheon Jeong, Sang Ho Yoon, ByoungSoo Koh, Sangpil Kim

AI总结 本文提出了一种名为 GuardMarkGS 的统一保护框架,旨在解决 3D Gaussian Splatting(3DGS)资产在版权归属追踪与防止未经授权编辑之间的双重风险。该方法结合了全局水印优化与对抗性编辑抑制策略,通过分离潜在特征、扰动编辑轨迹以及选择性增强对抗更新,实现了版权归属可追溯与编辑行为有效遏制的双重目标。实验表明,该框架在保持渲染质量的同时,有效平衡了水印准确性与编辑抑制效果。

Comments Preprint

详情
英文摘要

3D Gaussian Splatting (3DGS) is becoming a practical representation for novel view synthesis, but its growing adoption, together with rapid advances in instruction-driven 3DGS editing, also exposes a dual copyright risk: once a 3DGS-based asset is released, it can be used without permission and manipulated through 3D editing. Existing protection methods address only one side of this problem. Watermarking can trace ownership after unauthorized use, but it cannot prevent malicious editing. Adversarial edit-deterrence methods can disrupt editing, but they do not provide evidence of ownership. To the best of our knowledge, we present the first unified protection framework for 3DGS that jointly optimizes ownership tracing and unauthorized editing deterrence. Our framework combines a scene-wide watermarking objective over all Gaussians with an adversarial objective for edit deterrence. The adversarial branch combines latent-anchor separation, denoising-trajectory diversion, and cross-attention diversion to divert the editing trajectory, while an update-saliency-motivated Gaussian selection strategy assigns stronger adversarial updates to mask-selected Gaussians, improving the balance among watermark recovery, edit deterrence, and rendering fidelity. Experiments on scenes from Mip-NeRF 360 and Instruct-NeRF2NeRF demonstrate that the proposed framework achieves a favorable balance among bit accuracy, edit deterrence, and rendering quality. These results suggest that practical copyright protection of 3DGS-based assets can be more effectively addressed by integrating ownership tracing and unauthorized editing deterrence into a single optimization framework.

2605.12918 2026-05-14 cs.CL

CommonWhy: A Dataset for Evaluating Entity-Based Causal Commonsense Reasoning in Large Language Models

Armin Toroghi, Faeze Moradi Kalarde, Scott Sanner

AI总结 为了有效与现实世界交互,大型语言模型(LLMs)需要具备基于实体的常识推理能力,这要求模型将具体实体的事实知识与常识推理相结合。本文提出CommonWhy数据集,包含15,000个“为什么”问题,用于评估模型在因果关系上的常识推理能力,并作为知识图谱问答(KGQA)的基准,所有问题答案均可在Wikidata中找到。与现有KGQA数据集不同,CommonWhy重点考察因果推理而非单纯的事实检索,实验表明当前先进模型在该任务上仍存在事实幻觉和因果推理失败等问题。

详情
英文摘要

To effectively interact with the real world, Large Language Models (LLMs) require entity-based commonsense reasoning, a challenging task that necessitates integrating factual knowledge about specific entities with commonsense inference. Existing datasets for evaluating LLM entity-based commonsense reasoning have largely focused on True/False or multiple-choice questions, leaving the explicit assessment of the model's ability in abductive reasoning about causes and effects and generating explanations largely unexamined. In this work, we introduce CommonWhy, a dataset of 15,000 why questions designed to evaluate entity-based commonsense reasoning about causal relationships in LLMs. CommonWhy also serves as a Knowledge Graph Question Answering (KGQA) benchmark, as all supporting knowledge required to answer its queries is available in the Wikidata knowledge graph. Unlike existing KGQA datasets, which primarily test fact retrieval, CommonWhy targets causal commonsense reasoning, establishing a new paradigm for KGQA evaluation. Experiments with state-of-the-art LLMs and LLM-based KGQA methods reveal their significant shortcomings, including frequent factual hallucinations and failures in causal reasoning.

2605.12917 2026-05-14 cs.CV cs.LG

Adaptive Conformal Prediction for Reliable and Explainable Medical Image Classification

One Octadion, Novanto Yudistira, Lailil Muflikhah

AI总结 该研究针对医学图像分类中深度学习模型过度自信的问题,提出了一种自适应的置信度预测方法,以提高诊断的可靠性和可解释性。通过改进RAPS方法,引入自适应Lambda准则,有效控制预测集的覆盖偏差,确保在不同输入难度下均保持较高的覆盖性能。实验表明,该方法在多个医学图像数据集上实现了高覆盖率与小预测集大小的平衡,且具有良好的跨领域泛化能力,适用于对安全性要求高的医疗AI应用。

Comments To appear in IEA/AIE 2026 (Springer LNAI)

详情
英文摘要

Deep learning models for medical imaging often exhibit overconfidence, creating safety risks in ambiguous diagnostic scenarios. While Conformal Prediction (CP) provides distribution-free statistical guarantees, standard methods such as Regularized Adaptive Prediction Sets (RAPS) optimize for average efficiency and can mask severe failures on difficult inputs. We propose an Adaptive Lambda Criterion for RAPS that minimizes the worst-case coverage violation across prediction set size strata. On OrganAMNIST (58,850 abdominal CT images, 11 classes), standard size-optimized RAPS converges to near-deterministic behavior with stratified undercoverage on uncertain samples, while our method achieves 95.72 percent global coverage with average set size 1.09 and at least 90 percent coverage across all strata. Cross-domain validation on PathMNIST (107,180 pathology images, 9 classes) confirms generalizability. Quantitative Grad-CAM analysis (rho = -0.30, p < 1e-22) shows that multi-label predictions correspond to focused attention on anatomically ambiguous regions. These results demonstrate that the proposed method improves reliability while maintaining efficiency, making it suitable for safety-critical medical AI applications.

2605.12913 2026-05-14 cs.LG

Revisiting DAgger in the Era of LLM-Agents

Changhao Li, Rushi Qiang, Jiawei Huang, Chenxiao Gao, Chao Zhang, Niao He, Bo Dai

AI总结 本文研究了在大语言模型代理(LLM-Agents)时代下如何改进长期任务的学习方法,针对现有监督微调和强化学习方法的不足,重新引入并改进了数据聚合(DAgger)算法。该方法通过在每一步骤中融合学生策略与教师策略生成轨迹,并利用教师提供的监督标签进行训练,从而有效缓解协变量偏移问题并提供丰富的反馈。实验表明,该方法在软件工程任务中显著提升了模型性能,优于现有主流方法。

详情
英文摘要

Long-horizon LM agents learn from multi-turn interaction, where a single early mistake can alter the subsequent state distribution and derail the whole trajectory. Existing recipes fall short in complementary ways: supervised fine-tuning provides dense teacher supervision but suffers from covariate shift because it is trained on off-policy teacher trajectories; while reinforcement learning with verifiable rewards avoids this off-policy mismatch by learning from on-policy rollouts but with only sparse outcome feedback. We address this dilemma by revisiting Dataset Aggregation (DAgger) for multi-turn LM agents: the algorithm collects trajectories through a turn-level interpolation of student and teacher policies, and the student is then trained on these trajectories using supervised labels provided by the teacher. By directly interacting with environments, we expose the model to realistic states likely to be encountered during deployment, thereby effectively mitigating covariate shift. Besides, since the student is learned by mimicking the teacher's behavior, it receives rich feedback during learning. To demonstrate DAgger enjoys the benefits of both worlds, we tested the algorithm to train a software-engineering agent with 4B- and 8B-scale student models. On SWE-bench Verified, our DAgger-style training improves over the strongest post-training baseline by +3.9 points at 4B and +3.6 points at 8B. The resulting 4B agent reaches 27.3%, outperforming representative published 8B SWE-agent systems, while the 8B agent achieves 29.8%, surpassing SWE-Gym-32B and coming within 5 points of stronger 32B-scale agents. Together with consistent gains on the held-out SWE-Gym split, these results suggest the effectiveness of DAgger for modern long-horizon LM agents.

2605.12904 2026-05-14 cs.LG

VIP-COP: Context Optimization for Tabular Foundation Models

Yilong Chen, Xueying Ding, Leman Akoglu

AI总结 表格基础模型(TFMs)在结构化数据的上下文学习中表现出色,但其性能受到上下文长度限制的制约,难以处理超出预训练规模的数据。本文提出VIP-COP方法,通过评估训练样本和特征对预测的重要性,实现对上下文的优化选择,有效抑制噪声并聚焦关键信息。该方法具备高效、预算感知、模型无关、可解释且鲁棒等优势,在多个大规模高维任务中显著优于现有方法,为表格基础模型的测试时上下文优化树立了新的标杆。

详情
英文摘要

Tabular foundation models (TFMs) have emerged as a powerful paradigm for in-context learning on structured data, enabling direct prediction on new tabular tasks without task-specific training. However, their effectiveness is constrained by context length limits, restricting application to medium-scale data and degrading performance when inference-time data exceed pretraining size distributions. Our work introduces VIP-COP, estimating the Value of Importance for Prediction of training examples and features for hard Context OPtimization for TFMs. Its explicit selection mechanism suppresses noise and isolates influential data, enabling the model to also benefit from data augmentation by prioritizing high-value augmented samples and features. VIP-COP is (i) fast, boosting performance often within minutes of optimization, based on an online KernelSHAP-based regression with iterative refinement, value-guided context sampling, and multi-fidelity pruning; (ii) budget-aware and any-time, improving with additional test-time compute unlike heuristics that produce fixed contexts; (iii) model-aware yet fully black-box, requiring no access to model internals, making it compatible with both proprietary and open-source TFMs; (iv) interpretable, identifying discrete ``Very Important Predictors'' (samples and features) that maximize signal-to-noise, which makes it (v) robust, isolating high-value data from noise. In contrast, soft-prompt optimization requires model gradients, produces abstract latent tokens, and lacks explicit signal discrimination. Extensive experiments show that VIP-COP consistently outperforms heuristic and optimized baselines across large-scale high-dimensional testbeds, including data augmentation and data-noise settings, establishing a new state of the art in test-time context refinement for TFMs.

2605.12897 2026-05-14 cs.RO

DynoJEPP: Joint Estimation, Prediction and Planning in Dynamic Environments

Mikolaj Kliniewski, Jesse Morris, Yiduo Wang, Ian R. Manchester, Viorela Ila

AI总结 DynoJEPP 是一个基于因子图的框架,旨在动态环境中联合优化状态估计、预测与路径规划。为了解决传统方法中预测和规划信息反馈导致估计污染和不安全行为的问题,DynoJEPP 引入了一种新型有向因子,以确保信息在因子图中的单向流动。实验表明,该方法对安全导航至关重要,而合作版 DynoJEPP 进一步支持机器人在预测和规划中融入协作对象的行为,提升了整体系统的鲁棒性与安全性。

详情
英文摘要

DynoJEPP is a factor-graph-based framework that jointly formulates and simultaneously optimizes estimation, prediction, and planning in dynamic environments. In conventional factor-graph-based approaches that jointly formulate estimation, prediction, and planning, information from prediction and planning feeds back into state estimation, yielding corrupted estimates, undesired behaviors, and unsafe plans. To address this, DynoJEPP introduces a novel directed factor that enforces directional information flow within the factor graph, preventing prediction and planning from corrupting state estimation. We evaluate the impact of directed factors on inter-module interactions during navigation in both static and dynamic environments. Our results demonstrate that these factors are critical for safe operation, as without them, the robot collides in the majority of experiments. Building on this, we further introduce Cooperative DynoJEPP, which enables the ego robot to incorporate cooperative object behavior into its prediction and trajectory planning.

2605.12894 2026-05-14 cs.AI cs.CL

Beyond Cooperative Simulators: Generating Realistic User Personas for Robust Evaluation of LLM Agents

Harshita Chopra, Kshitish Ghate, Aylin Caliskan, Tadayoshi Kohno, Chirag Shah, Natasha Jaques

AI总结 该研究旨在解决大型语言模型(LLM)代理在面对真实用户多样化行为时表现不佳的问题,提出了一种名为Persona Policies(PPol)的可插拔控制层,用于生成具有真实行为特征的用户角色,从而提升代理的鲁棒性。通过将角色生成建模为基于LLM的进化程序搜索,该方法优化Python生成器以发现符合任务目标的行为模式,并生成多样化的用户角色。实验表明,PPol显著提升了用户模拟的真实性与代理任务成功率,为基于模拟器的评估和训练提供了新的有效方法。

Comments Preprint under review

详情
英文摘要

Large Language Model (LLM) agents are increasingly deployed in settings where they interact with a wide variety of people, including users who are unclear, impatient, or reluctant to share information. However, collecting real interaction data at scale remains expensive. The field has turned to LLM-based user simulators as stand-ins, but these simulators inherit the behavior of their underlying models: cooperative and homogeneous. As a result, agents that appear strong in simulation often fail under the unseen, diverse communication patterns of real users. To narrow this gap, we introduce Persona Policies (PPol), a plug-and-play control layer that induces realistic behavioral variation in user simulators while preserving the original task goals. Rather than hand-crafting personas, we cast persona generation as an LLM-driven evolutionary program search that optimizes a Python generator to discover behaviors and translate them into task-preserving roleplay policies. Candidate generators are guided by a multi-objective fitness score combining human-likeness with broad coverage of human behavioral patterns. Once optimized, the generator produces a diverse population of human-like personas for any task in the domain. Across tau^2-bench retail and airline domains, evolved PPol programs yield 33-62% absolute gains in fitness score over the baseline simulator. In a blinded evaluation, annotators rated PPol-conditioned users as human 80.4% of the time, close to real human traces and nearly twice as frequently as baseline simulators. Agents trained with PPol are more robust to challenging, out-of-distribution behaviors, improving task success by +17% relative to training only on existing simulated interactions. This offers a novel approach to strengthen simulator-based evaluation and training without changing tasks or rewards.

2605.12882 2026-05-14 cs.CL cs.CV

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

Dongsheng Ma, Jiayu Li, Zhengren Wang, Yijie Wang, Jiahao Kong, Weijun Zeng, Jutao Xiao, Jie Yang, Wentao Zhang, Bin Wang, Conghui He

AI总结 CiteVQA 是一个用于评估可信文档智能的新型基准,旨在解决当前文档问答系统中忽视证据溯源的问题。该基准要求模型在回答问题的同时提供具体的引用区域,从而同时评估答案的正确性和引用的准确性。通过引入严格归因准确率(SAA)指标,CiteVQA 揭示了现有大型语言模型在答案正确但引用错误方面的普遍问题,为提升文档理解系统的可靠性提供了新的评估工具。

详情
英文摘要

Multimodal Large Language Models (MLLMs) have significantly advanced document understanding, yet current Doc-VQA evaluations score only the final answer and leave the supporting evidence unchecked. This answer-only approach masks a critical failure mode: a model can land on the correct answer while grounding it in the wrong passage -- a critical risk in high-stakes domains like law, finance, and medicine, where every conclusion must be traceable to a specific source region. To address this, we introduce CiteVQA, a benchmark that requires models to return element-level bounding-box citations alongside each answer, evaluating both jointly. CiteVQA comprises 1,897 questions across 711 PDFs spanning seven domains and two languages, averaging 40.6 pages per document. To ensure fidelity and scalability, the ground-truth citations are generated by an automated pipeline-which identifies crucial evidence via masking ablation-and are subsequently validated through expert review. At the core of our evaluation is Strict Attributed Accuracy (SAA), which credits a prediction only when the answer and the cited region are both correct. Auditing 20 MLLMs reveals a pervasive Attribution Hallucination: models frequently produce the right answer while citing the wrong region. The strongest system (Gemini-3.1-Pro-Preview) achieves an SAA of only 76.0, and the strongest open-source MLLM reaches just 22.5. Ultimately, towards trustworthy document intelligence, CiteVQA exposes a reliability gap that answer-only evaluations overlook, providing the instrumentation needed to close it. Our repository is available at https://github.com/opendatalab/CiteVQA.

2605.12879 2026-05-14 cs.LG

ASAP: Amortized Doubly-Stochastic Attention via Sliced Dual Projection

Huy Tran, Max Milkert, David Hyde

AI总结 本文提出了一种名为ASAP的新方法,用于高效实现双重随机注意力机制。该方法结合了Sinkhorn缩放的训练优势和切片双投影的推理优化,通过在训练阶段学习参数映射,在推理阶段用固定操作替代迭代缩放,从而显著提升计算效率。实验表明,ASAP在保持低成本训练的同时,在语言和视觉任务中表现出与现有方法相当甚至更优的性能。

详情
英文摘要

Doubly-stochastic attention has emerged as a transport-based alternative to row-softmax attention, with recent Transformer variants using it to reduce attention sinks and rank collapse while improving performance. In this family, the standard approach is Sinkhorn scaling, which trains more efficiently but still repeats matrix scaling in every inference forward pass. Sliced-transport attention removes the online iteration, but its soft sorting approximation materializes dense tensors for each slice, requiring substantially more training resources than Sinkhorn attention. We introduce ASAP: Amortized Doubly-Stochastic Attention via Sliced Dual Projection, a train-then-compile method that trains the doubly-stochastic layer with Sinkhorn, then replaces the iterative scaling loop at inference with a fixed sliced-dual operator. It learns a lightweight parametric map from exact one-dimensional Kantorovich potentials to the Sinkhorn query-side dual, then reconstructs the attention plan with a two-sided entropic c-transform. Across language and vision benchmarks, ASAP keeps the cheaper training setup and remains highly competitive with recent baselines. In the main frozen-layer benchmark, ASAP is 5.3 faster than the trained Sinkhorn teacher while matching its accuracy; in downstream replacements, ASAP recovers most of the teacher performance without any retraining.

2605.12876 2026-05-14 cs.LG

Certified Robustness under Heterogeneous Perturbations via Hybrid Randomized Smoothing

Blaise Delattre, Hengyu Wu, Paul Caillon, Wei Yang Bryan Lim, Yang Cao

AI总结 该论文研究了在异构扰动下如何为多模态模型提供认证鲁棒性的问题,提出了一种统一的随机平滑框架,能够处理离散和连续混合输入的联合扰动。通过分析离散与连续噪声的联合似然排序,该方法得到了一个严格推广图像和文本单独扰动认证的闭式一维鲁棒性证书。该框架在多模态安全过滤任务中得到了验证,提供了首个针对文本-图像交互依赖场景下联合离散和连续扰动的模型无关的Neyman-Pearson认证。

Comments ICML 2026. Code: https://github.com/tdsai-lab/hybrid-randomized-smoothing

详情
英文摘要

Randomized smoothing provides strong, model-agnostic robustness certificates, but existing guarantees are limited to single modalities, treating continuous and discrete inputs in isolation. This limitation becomes critical in multimodal models, where decisions depend on cross-modal semantics and adversaries can jointly perturb heterogeneous inputs, rendering unimodal certificates insufficient. We introduce a unified randomized smoothing framework for mixed discrete--continuous inputs based on an analytically tractable Neyman--Pearson formulation of the joint worst-case problem. By analyzing the joint likelihood ordering induced by factorized discrete and continuous noise, our approach yields a closed-form, one-dimensional certificate that strictly generalizes both Gaussian (image-only) and discrete (text-only) randomized smoothing. We validate the framework on multimodal safety filtering, providing, to our knowledge, the first model-agnostic Neyman--Pearson certificate for joint discrete-token and continuous-image perturbations in interaction-dependent text--image safety filtering.

2605.12874 2026-05-14 cs.LG

Descriptive Collision in Sparse Autoencoder Auto-Interpretability: When One Explanation Describes Many Features

Jordan F. McCann

AI总结 本文研究了稀疏自编码器(SAE)在语言模型解释性任务中的一种新问题——描述性碰撞,即多个不同的特征被赋予相同的自然语言解释。作者通过分析大量人工标注的SAE特征数据,发现同一解释常被重复使用,导致特征区分度下降。为此,他们提出了两个新的评估指标,用于修正现有方法对特征解释性的高估问题,从而提升自动解释性的准确性与可靠性。

Comments 11 pages, 2 figures, 3 tables

详情
英文摘要

Sparse autoencoders (SAEs) are now standard tools for decomposing language model activations into interpretable features, and automated interpretability pipelines routinely assign each feature a short natural-language explanation. Existing critiques of this practice focus on polysemanticity -- one feature with many meanings -- or on whether explanations predict activations. We identify a complementary, structurally distinct problem we call descriptive collision: many distinct SAE features admit the same explanation. Reanalyzing the largest publicly-available dataset of human-annotated SAE features (Marks et al., 2025), comprising 722 annotated features across Gemma 2 2B and Pythia 70M, we find that the mean annotation string is reused across 3.07 features; 82.1% of features share their annotation with at least one other feature; and the single most common annotation string ("plural nouns") labels 101 distinct features spanning 18 layers and four model components. Information-theoretically, the average annotation resolves only 70% of feature identity. We formalize a property called discrimination, prove that current detection-style auto-interpretability scoring is invariant to collision, and propose two complementary corrective metrics -- collision-adjusted detection and discrimination scoring -- that explicitly penalize explanations that fail to distinguish a feature from its neighbors. The collision problem is independent of, and additive with, previously identified failure modes of auto-interpretability; ignoring it inflates reported feature interpretability by a quantity equal to roughly one-third of the bits required to identify a feature.

2605.12872 2026-05-14 cs.LG

SMA: Submodular Modality Aligner For Data Efficient Multimodal Learning

Truong Pham, Anay Majee, Rishabh Iyer

AI总结 尽管多模态基础模型在近期取得了显著进展,但它们依赖大量配对数据,限制了其在数据稀缺场景下的应用。本文提出了一种基于子模态互信息的组合式对齐方法——SMA,通过将多组增强和描述视为集合,捕捉更丰富的跨模态结构,从而在有限数据下实现更有效的多模态对齐。实验表明,SMA在少样本分类和检索任务中表现出色,仅需数万样本即可达到强多模态泛化能力,显著优于传统方法。

详情
英文摘要

Despite the recent success of Multimodal Foundation Models (FMs), their reliance on massive paired datasets limits their applicability in low-data and rare-scenario settings where aligned data is scarce and expensive. A key bottleneck is the adoption of an instance-level formulation, which learns alignment by maximizing correlation between individual image-text pairs while neglecting the underlying geometric structure across modalities resulting in a modality gap across input modalities. In this paper, we propose a combinatorial paradigm for multimodal alignment that moves beyond pairwise learning and introduce the \emph{Submodular Modality Aligner (SMA)}, which treats multiple augmentations and descriptions of an entity as a set, leveraging multiple descriptions of the data to capture richer cross-modal structure. We instantiate SMA using a principled objective based on Submodular Mutual Information (SMI), which jointly maximizes inter-modality mutual information while reducing cross-modal divergence. This formulation enables the model to effectively utilize multiple positive associations and extract significantly more information from limited data. We evaluate SMA on 14 zero-shot classification and retrieval tasks from the CLIP benchmark and demonstrate consistent gains in the low-data regime. Notably, SMA achieves strong multimodal generalization using only tens of thousands of samples. This is orders of magnitude fewer than standard approaches. Our results highlight the importance of set-based formulations and submodular objectives for data-efficient multimodal learning.

2605.12855 2026-05-14 cs.CV

Prediction of Rectal Cancer Regrowth from Longitudinal Endoscopy

Jorge Tapias Gomez, Despoina Kanata, Aneesh Rangnekar, Christina Lee, Hannah Williams, Hannah Thompson, J. Joshua Smith, Francisco Sanchez-Vega, Mert R. Sabuncu, Julio Garcia-Aguilar, Harini Veeraraghavan

AI总结 该研究提出了一种基于纵向内镜图像的深度学习方法TREX,用于预测接受“观察等待”治疗的直肠癌患者肿瘤的复发情况。TREX通过结合治疗后复查和随访期间的图像,利用双交叉注意力机制和预训练的Swin Transformer模型,在无需图像配准的情况下提取并融合特征,从而区分完全缓解与局部复发。实验表明,TREX在复发检测和早期预警方面均优于现有方法,并在临床验证中表现出与专业医生相当的诊断准确性。

Comments 14 Pages, 9 figures, 2 tables

详情
英文摘要

Clinical trial studies indicate benefit of watch-and-wait (WW) surveillance for patients with rectal cancer showing a complete or near clinical response (CR) directly after treatment (restaging). However, there are no objectively accurate methods to early detect local tumor regrowth (LR) in patients undergoing WW from follow-up exams. Hence, we developed Temporal Rectal Endoscopy Cross-attention (TREX), a longitudinal deep learning approach that combines pairs of images acquired at restaging and follow-up to distinguish CR from LR. TREX uses pretrained Swin Transformers in a siamese setting to extract features from longitudinal images and dual cross-attention to combine the features without spatial co-registration between image pairs. TREX and Swin-based baselines were trained under two settings: (a) detecting LR or CR at the last available follow-up and (b) early detection of LR at 3--6, 6--12, and 12--24 months before clinical confirmation. TREX achieved the highest accuracy in detecting LR with a high sensitivity of 97% $\pm$ 6% and a balanced accuracy of 90% $\pm$ 3%, and outperformed all baselines in early detection at both 3--6 (74% $\pm$ 1%) and 6--12 months (62% $\pm$ 4%) prior to clinical detection. Clinical validation via a surgeon survey showed that TREX matched attending-level overall accuracy (TREX: 86.21% vs.\ Clinicians: 87.84% $\pm$ 1.28%). Finally, we explored TREX's ability to predict treatment response by combining pre-treatment (pre-TNT) and restaging endoscopies, achieving a balanced accuracy of 73% $\pm$ 12%. These results show that longitudinal deep learning analysis of endoscopy may improve surveillance and enable earlier identification of rectal cancer regrowth.

2605.12852 2026-05-14 cs.LG q-bio.QM

Multitask Multimodal Fusion with Tabular Foundation Models for Peak and Durability Prediction of Pertussis Booster Response

Divya Sitani

AI总结 该研究旨在同时预测百日咳加强疫苗接种后的免疫反应峰值和持续时间,这两个过程由不同的生物学机制驱动。研究提出了一种多任务多模态融合模型,结合冻结的TabPFN-v2编码器、双标签对比损失、缺失校准的模态丢弃和注意力融合机制,以应对数据模态异质性、缺失值和任务间关联弱的挑战。实验表明,该模型在两个预测任务上均优于传统方法,且结果与免疫学机制一致,揭示了不同模态对峰值和持续时间预测的特异性贡献。

Comments 22 pages, 8 figures, 4 tables. Code available at https://github.com/Divya1205/cmi-pb-multitask

详情
英文摘要

Pertussis booster vaccination produces immune responses that vary widely across individuals in both peak magnitude and long-term durability. These two phases are governed by partly distinct biological compartments:peak reflects acute B-cell activation and antibody secretion, while durability reflects the establishment of long-term humoral memory. Yet most computational models target only one, missing the full boost-and-wane trajectory. Jointly predicting both is non-trivial because the two endpoints are biologically dissociated rather than redundant; samples are small, modalities are heterogeneous with structured missingness, and the two tasks rely on different measurement windows. We propose a multi-task contrastive multimodal fusion architecture combining frozen TabPFN-v2 per-modality encoders, a dual-label supervised contrastive loss that treats two subjects as a positive pair if they agree on the Task 1 label or the Task 2 label, modality dropout calibrated to empirical missingness, and missingness-masked attention fusion. Applied to a curated subset of the CMI-PB pertussis booster dataset (n = 158 subjects, four modalities, 44.9% with at least one modality missing; Spearman r = -0.58 between peak and durability, n = 96), the model achieves test AUROC 0.797 (95% CI [0.621, 0.948]) for peak response and 0.755 (95% CI [0.519, 0.945]) for durability, with both significant under joint label permutation (N = 1000; p = 0.002 and p = 0.045). Across logistic regression, XGBoost, and MLP baselines on raw features and on TabPFN embeddings, the proposed model is the only one whose 95% CIs lie above chance on both tasks simultaneously. Per-modality contribution analyses recover task-specific modality contributions consistent with the underlying immunology: peak prediction is carried by cytokine signatures, while durability is carried by baseline antibody features.

2605.12851 2026-05-14 cs.CV cs.AI

PRISM: Perinuclear Ring-based Image Segmentation Method for Acute Lymphoblastic Leukemia Classification

Larissa Ferreira Rodrigues Moreira, Leonardo Gabriel Ferreira Rodrigues, Rodrigo Moreira, André Ricardo Backes

AI总结 该研究针对急性淋巴细胞白血病(ALL)分类中外周血涂片图像分析的挑战,提出了一种基于核周环的图像分割方法PRISM。该方法通过围绕细胞核构建自适应同心区域,替代传统的细胞质轮廓分割,从而在无需精确细胞边界检测的情况下提取鲁棒的细胞质特征。实验表明,该方法结合传统分类器的校准集成,在分类准确率和AUC指标上均表现出色,分别达到98.46%和0.9937。

Comments Paper accepted for publication at the XXVI Simpósio Brasileiro de Computação Aplicada à Saúde (SBCAS 2026), Ouro Preto, MG, Brazil

详情
英文摘要

Automated analysis of peripheral blood smears for Acute Lymphoblastic Leukemia (ALL) is hindered by low contrast and substantial variability in cytoplasmic appearance, which complicate conventional membrane-based segmentation. We found that many recent approaches rely on heavy neural architectures and extensive training, but still struggle to generalize across staining and acquisition variability. To address these limitations, we propose the Perinuclear Ring-based Image Segmentation Method (PRISM), which replaces explicit cytoplasmic delineation with adaptive concentric zones constructed around the nucleus. These perinuclear regions enable the extraction of robust cytoplasmic descriptors by integrating color information with texture statistics derived from grey-level co-occurrence patterns, without requiring accurate cell-boundary detection. A calibrated stacking ensemble of traditional classifiers leverages these descriptors to achieve a high performance, with an accuracy of 98.46% and a precision-recall AUC of 0.9937.

2605.12845 2026-05-14 cs.CV cs.AI

AssemblyBench: Physics-Aware Assembly of Complex Industrial Objects

Danrui Li, Jiahao Zhang, Bernhard Egger, Moitreya Chatterjee, Suhas Lohit, Tim K. Marks, Anoop Cherian

AI总结 本文提出AssemblyBench,一个包含2,789个工业对象的合成数据集,包含多模态装配说明、对应的3D部件模型及装配轨迹,旨在解决工业装配中复杂形状和装配路径的问题。研究还提出基于Transformer的模型AssemblyDyno,能够联合预测装配顺序和部件轨迹,相比现有方法在装配姿态估计和轨迹可行性方面表现更优,其中轨迹可行性通过物理仿真进行评估。

Comments Accepted at CVPR 2026

详情
英文摘要

Assembling objects from parts requires understanding multimodal instructions, linking them to 3D components, and predicting physically plausible 6-DoF motions for each assembly step. Existing datasets focus on simplified scenarios, overlooking shape complexities and assembly trajectories in industrial assemblies. We introduce AssemblyBench, a synthetic dataset of 2,789 industrial objects with multimodal instruction manuals, corresponding 3D part models, and part assembly trajectories. We also propose a transformer-based model, AssemblyDyno, which uses the instructional manual and the 3D shape of each part to jointly predict assembly order and part assembly trajectories. AssemblyDyno outperforms prior works in both assembly pose estimation and trajectory feasibility, where the latter is evaluated by our physics-based simulations.

2605.12843 2026-05-14 cs.LG cs.AI

Bayesian Model Merging

Kaiyang Li, Shaobo Han, Qing Su, Shihao Ji

AI总结 本文提出了一种名为Bayesian Model Merging(BMM)的模型合并方法,旨在在无需联合重训练的情况下将多个任务专家模型合并为一个统一模型。该方法采用了一种双层优化框架,内层基于锚定模型的强先验进行激活驱动的贝叶斯回归,得到高效的闭式解;外层则通过贝叶斯优化全局搜索各模块的超参数。此外,BMM还揭示了激活统计量与任务向量之间的关键对齐关系,从而实现了无需辅助数据的无数据变体。实验表明,BMM在多个基准测试中均优于现有方法,尤其在多任务视觉与语言任务中表现出色。

详情
英文摘要

Model merging aims to combine multiple task-specific expert models into a single model without joint retraining, offering a practical alternative to multi-task learning when data access or computational budget is limited. Existing methods, however, face two key limitations: (1) they overlook the valuable inductive bias of strong anchor models and estimate the merged weights from scratch, and (2) they rely on a shared hyperparameter setting across different modules of the network, lacking a global optimization strategy. This paper introduces Bayesian Model Merging (BMM), a plug-and-play bi-level optimization framework, where the inner level formulates the model merging as an activation-based Bayesian regression under a strong prior induced by an anchor model, yielding an efficient closed-form solution; and the outer level leverages a Bayesian optimization procedure to search module-specific hyperparameters globally based on a small validation set. Furthermore, we reveal a key alignment between activation statistics and task vectors, enabling us to derive a data-free variant of BMM that estimates the Gram matrix for regression without any auxiliary data. Across extensive benchmarks, including up to 20-task merging in vision and 5-task merging in language, BMM consistently outperforms all plug-and-play anchor baselines (e.g., TA, WUDI-Merging, and TSV). In particular, on the ViT-L/14 benchmark for 8-task merging, a single merged model reaches 95.1, closely matching the average performance of eight task-specific experts (95.8).

2605.12838 2026-05-14 cs.AI

Multimodal Hidden Markov Models for Persistent Emotional State Tracking

Anamika Ragu, Aneesh Jonelagadda

AI总结 本文提出了一种基于多模态情感表示的轻量级隐马尔可夫模型框架,用于追踪对话中持续的情感状态变化。该方法利用粘性因子HDP-HMM对来自视频、音频和文本的多模态情感特征进行建模,能够更准确地捕捉对话中长期的情感阶段。实验表明,该模型在计算成本远低于基于大语言模型的方法的前提下,能够生成更具可解释性的情感序列,并在临床数据集上验证了其在情感阶段恢复和提升对话质量方面的有效性。

Comments 8 pages, 2 figures

详情
英文摘要

Tracking an interpretable emotional arc of a conversation via the sentiment of individual utterances processed as a whole is central to both understanding and guiding communication in applied, especially clinical, conversational contexts. Existing approaches to emotion recognition operate at the utterance level, obscuring the persistent phases that characterize real conversational dynamics. We propose a lightweight framework that models conversational emotion as a sequence of latent emotional regimes using sticky factorial HDP-HMMs over multimodal valence-arousal representations derived from simultaneous video, audio and textual input. We evaluate the quality of regime prediction using LLM-as-a-Judge, geometric, and temporal consistency metrics, demonstrating that the sticky HDP-HMM produces more interpretable regime sequences than the baseline Gaussian HMM at a fraction of the computational cost of LLM-based dialogue state tracking methods. In addition, Question-Answer experiments in a clinical dataset suggest that meaningful emotional phases can reliably be recovered from multimodal valence-arousal trajectories and used to improve the quality of LLM responses in unstable affective regimes via context augmentation. This framework thus opens a path toward interpretable, lightweight, and actionable analysis of conversational emotion dynamics at scale.

2605.12835 2026-05-14 cs.AI

PROMETHEUS: Automating Deep Causal Research Integrating Text, Data and Models

Sridhar Mahadevan

AI总结 PROMETHEUS 是一个将文本、数据和模型整合为因果地图的框架,旨在自动化深度因果研究。该方法通过构建局部因果预测状态模型的集合,形成可导航的因果图谱,支持对不同区域的因果声明进行比较与整合。研究展示了该框架在多个实际案例中的应用,包括从文献中提取因果关系以及基于原始数据进行反事实验证,显著提升了因果推理的系统性和可解释性。

Comments 27 pages

详情
英文摘要

Large language models can extract local causal claims from text, but those claims become more useful when organized as persistent, navigable world models rather than as flat summaries. We introduce PROMETHEUS, a framework that turns retrieved literature, filings, reviews, reports, agent traces, source data, code, simulations, and scientific models into causal atlases: sheaf-like families of local causal predictive-state models over an explicit cover of a research substrate. Each local region contains causal episodes, structured claim tables, predictive tests, support statistics, and provenance; restriction maps compare overlapping regions; gluing diagnostics expose agreement, drift, contradiction, and underdetermination. The resulting Topos World Model is not a single universal graph. It is a research instrument for navigating what a corpus says, where it says it, how strongly it is supported, and where local claims fail to assemble into a coherent global view. Three literature-atlas case studies -- ocean-temperature impacts on marine populations, GLP-1 weight-loss evidence, and resveratrol/red-wine health-benefit claims -- illustrate deep causal research from text with explicit locality, evidence, persistent state, and gluing tension. Four grounded-counterfactual case studies -- a Nature Climate Change microplastics forcing paper, an Indus Valley hydrology paper with VIC-derived figure data and model code, the canonical Sachs protein-signaling study with single-cell perturbation data, and a Nature singing-mouse study with MAPseq projection matrices -- show a stronger mode: when a paper ships source data, simulation outputs, or code, PROMETHEUS can evaluate a counterfactual against that scientific substrate and then rebuild the sheaf world model around the

2605.12831 2026-05-14 cs.LG

Quantifying Potential Observation Missingness in Inverse Reinforcement Learning

Leo Benac, Abhishek Sharma, Alihan Huyuk, Finale Doshi-Velez

AI总结 逆强化学习(IRL)通过示范数据推断奖励函数,是建模和理解决策行为的重要工具。然而,现实中的行为数据可能存在未被记录的观测信息,导致专家行为看似次优,从而影响奖励函数的学习。本文提出了一种方法,用于量化专家行为在缺失观测情况下的潜在最优性,并开发了相应的算法,通过多个实验验证其在导航任务、癌症治疗模拟和ICU治疗数据中的有效性。

详情
英文摘要

Inverse reinforcement learning (IRL), which infers reward functions from demonstrations, is a valuable tool for modeling and understanding decision-making behavior. Many variants of IRL have been developed to capture complexities of human decision-making, such as subjective beliefs, imperfect planning, and dynamic goals. However, an often-overlooked issue in real-world behavioral datasets is that the recorded data may be missing observations that were available to the original decision-maker. In use-inspired settings such as healthcare, this can make expert actions appear suboptimal, even when they were near-optimal given the information available at the time. As a result, the rewards learned by standard IRL may be misleading. In this paper, we identify the minimal perturbations to the recorded observations needed for the expert's actions to appear optimal. We develop a practical algorithm for this problem and demonstrate its utility for quantifying the possible extent of missing observations in behavioral datasets through extensive experiments on synthetic navigation tasks, a cancer treatment simulator, and ICU treatment data.

2605.12826 2026-05-14 cs.CV cs.AI

FRAME: Forensic Routing and Adaptive Multi-path Evidence Fusion for Image Manipulation Detection

Kaixiang Zhao, Tianrun Yu, Aoxu Zhang, Junhao Su, Porter Jenkins, Amanda Hughes

AI总结 随着图像编辑工具和生成式人工智能的普及,数字图像的真实性验证变得愈发困难。为了解决现有方法在鲁棒性、证据碎片化和泛化能力方面的不足,本文提出了一种名为FRAME的新方法,通过多路径分析空间组织多种取证算法,自适应选择适合的取证路径并融合互补证据,从而提升检测与定位性能。FRAME在保持多源取证线索可解释性的基础上,提供了更稳健且灵活的图像取证方案,并在多种篡改场景中展现出良好的效果。

Comments Accepted to CVPR 2026 SAFE Workshop

详情
英文摘要

The proliferation of sophisticated image editing tools and generative artificial intelligence models has made verifying the authenticity of digital images increasingly challenging, with important implications for journalism, forensic analysis, and public trust. Although numerous forensic algorithms, ranging from handcrafted methods to deep learning-based detectors, have been developed for manipulation detection, individual methods often suffer from limited robustness, fragmented evidence, or weak generalization across manipulation types and image conditions. To address these limitations, we present \textbf{FRAME}, a method for \textbf{F}orensic \textbf{R}outing and \textbf{A}daptive \textbf{M}ulti-path \textbf{E}vidence fusion for image manipulation detection. FRAME organizes diverse forensic algorithms into a multi-path analysis space, adaptively selects informative forensic paths for each input image, and fuses complementary evidence to improve detection and localization performance. By moving beyond single-method analysis and fixed fusion strategies, FRAME provides a more robust and flexible approach to image forensic reasoning while preserving interpretable forensic cues from multiple evidence sources. Experimental results demonstrate the effectiveness of FRAME across diverse manipulation scenarios. Code is available at \href{https://github.com/kzhao5/FRAME}{https://github.com/kzhao5/FRAME}.

2605.12823 2026-05-14 cs.LG physics.chem-ph physics.comp-ph q-bio.BM

Hessian Matching for Machine-Learned Coarse-Grained Molecular Dynamics

Sanya Murdeshwar, Sanjit Shashi, Kevin Bachelor, William Noid, Ashwin Lokapally, Razvan Marinescu

AI总结 该研究提出了一种基于Hessian向量积匹配的机器学习粗粒化分子动力学方法,旨在提升粗粒化势能函数对自由能曲率的建模能力。通过引入随机探针向量,该方法在不显式构造Hessian矩阵的情况下,将二阶曲率信息融入粗粒化势能函数中,从而提高了模拟的准确性。实验表明,该方法在多个蛋白质体系中显著优于传统的梯度匹配方法,尤其在慢模动力学指标上表现出更优的性能。

Comments 15 pages, 4 figures, 1 table

详情
英文摘要

Coarse-grained (CG) molecular dynamics enables simulations of atomic systems such as biomolecules at timescales inaccessible to all-atom (AA) methods, but existing CG neural potentials trained via force matching capture only the gradient of the free-energy surface, leaving its curvature unconstrained. We introduce a framework that augments force matching with stochastic Hessian-vector product (HVP) matching, instilling second-order curvature information into CG potentials without constructing the full Hessian. We derive a decomposition of the target CG Hessian into a model-independent projected AA Hessian, precomputed once before training, and a model-dependent covariance correction computed online at negligible cost. We construct an unbiased stochastic estimator of the Hessian-matching objective by using random probe vectors. We evaluate our method by comparing against force matching on a benchmark of nine fast-folding proteins unseen during training. HVP matching outperforms plain force matching on 8 of 9 proteins on slow-mode metrics, with reductions of up to 85% in the Kullback--Leibler divergence between the CG and reference distributions along the slowest collective mode of the largest protein. Our results demonstrate that higher-order physical supervision is a practical path to more accurate and transferable CG potentials for biomolecular simulation.

2605.12817 2026-05-14 cs.LG cs.AI cs.CL

Training Large Language Models to Predict Clinical Events

Benjamin Turtel, Paul Wilczewski, Kris Skotheim

AI总结 该研究旨在利用纵向临床记录训练大型语言模型以预测临床事件。通过将时间顺序的MIMIC-III病历转化为包含过去病史、未来事件问题及后续记录标签的预测示例,构建了涵盖药物、手术、器官支持、微生物学和死亡率等多方面的预测数据集。研究采用LoRA微调方法显著提升了模型的预测性能,并在无需人工设计结构特征或专用分类器的情况下实现了对临床预测的可复用监督学习。

详情
英文摘要

Longitudinal clinical notes contain rich evidence of how patients evolve over time, but converting this signal into training supervision for clinical prediction remains challenging. We extend Foresight Learning to clinical prediction by converting time-ordered MIMIC-III notes into examples consisting of past patient context, a natural-language question about a possible future event, and a label resolved from later documentation. This process yields 6,900 prediction examples from 702 admissions across medications, procedures, organ support, microbiology, and mortality. A small LoRA adapter trained on these examples improves over the prompted base model, reducing expected calibration error from 0.1269 to 0.0398 and Brier score from 0.199 to 0.145, while slightly outperforming GPT-5 point estimates on held-out questions. The approach enables reusable clinical prediction supervision from longitudinal notes without hand-engineered structured features or endpoint-specific classifiers.

2605.12816 2026-05-14 cs.LG

AGOP as Explanation: From Feature Learning to Per-Sample Attribution in Image Classifiers

Raj Kiran Gupta Katakam

AI总结 本文研究了平均梯度外积(AGOP)在神经网络特征学习中的作用,并探索其作为图像分类器中单样本解释方法的潜力。提出了一种新的归因方法AGOP-Weighted,结合了训练分布先验以提升像素重要性识别的准确性,并引入了两种变体AGOP-Local和AGOP-Global。实验表明,该方法在多个基准上显著优于现有归因方法,尤其在计算效率和小分辨率图像处理方面表现突出。

Comments 8 pages. Accepted at the 4th World Conference on eXplainable Artificial Intelligence (XAI 2026), Late-Breaking Work track, Fortaleza, Brazil, July 1-3, 2026

详情
英文摘要

The Average Gradient Outer Product (AGOP) governs feature learning in neural networks: the Neural Feature Ansatz states that weight Gram matrices at each layer align with the corresponding AGOP matrices computed over the training distribution. We ask a complementary question: can this same quantity serve as a post-hoc attribution method for explaining individual predictions? We introduce AGOP-Weighted: a novel attribution method that multiplies the per-sample gradient by sqrt(diag(M) / max diag(M)), a training-distribution prior that suppresses gradient noise and amplifies consistently important pixels -- a combination not present in any prior attribution method. We formalise two companion variants -- AGOP-Local (per-sample gradient, equivalent to VanillaGrad) and AGOP-Global (diag(M) directly as a zero-cost saliency map) -- and implement an efficient training-time accumulation hook; AGOP-Global then requires zero inference cost (disk lookup) while AGOP-Weighted requires only a single gradient pass. We conduct the first rigorous comparison of AGOP attribution against Integrated Gradients (IG), SmoothGrad, GradCAM, and VanillaGrad across two benchmarks with pixel-level ground truth: (i) the synthetic XAI-TRIS benchmark (four classification scenarios, 8x8 images, CNN8by8) and (ii) the photorealistic CLEVR-XAI benchmark (ResNet-18 fine-tuned from ImageNet). AGOP-Weighted achieves 44% higher mIoU than IG on linear tasks; AGOP-Global achieves 7x higher mIoU than IG on multiplicative tasks (where IG falls below random) at zero inference cost. Both findings generalise to ResNet-18 on CLEVR-XAI (+18% and +37% respectively). We further show that GradCAM fails on small-resolution images due to spatial resolution collapse, and that diag(M) quality improves monotonically throughout training even after classification accuracy has plateaued.

2605.12809 2026-05-14 cs.LG cs.AI

Correcting Influence: Unboxing LLM Outputs with Orthogonal Latent Spaces

Shixing Yu, Promit Ghosal, Kyra Gan

AI总结 该研究旨在提高大语言模型在医疗等关键领域中的可靠性,通过识别模型预测所依赖的训练数据中的具体 token。为解决现有方法在 token 独立性假设和分解性上的局限,作者提出了一种基于正交潜在空间的框架,利用稀疏自编码器学习近似独立的潜在特征,并通过雅可比向量积和逆 Hessian 近似实现 token 级别的影响分析。实验表明,该方法能有效识别出稀疏且可解释的 token 集合,有助于增强模型可信度和决策透明性。

详情
英文摘要

A critical step for reliable large language models (LLMs) use in healthcare is to attribute predictions to their training data, akin to a medical case study. This requires token-level precision: pinpointing not just which training examples influence a decision, but which tokens within them are responsible. While influence functions offer a principled framework for this, prior work is restricted to autoregressive settings and relies on an implicit assumption of token independence, rendering their identified influences unreliable. We introduce a flexible framework that infers token-level influence through a latent mediation approach for general prediction tasks. Our method attaches sparse autoencoders to any layer of a pretrained LLM to learn a basis of approximately independent latent features. Unlike prior methods where influence decomposes additively across tokens, influence computed over latent features is inherently non-decomposable. To address this, we introduce a novel method using Jacobian-vector products. Token-level influence is obtained by propagating latent attributions back to the input space via token activation patterns. We scale our approach using efficient inverse-Hessian approximations. Experiments on medical benchmarks show our approach identifies sparse, interpretable sets of tokens that jointly influence predictions. Our framework enhances trust and enables model auditing, generalizing to high-stakes domain requiring transparent and accountable decisions.

2605.12805 2026-05-14 cs.LG cs.AI

Discrete MeanFlow: One-Step Generation via Conditional Transition Kernels

Fairoz Nower Khan, Nabuat Zaman Nahim, Md Sajid Ahmed, Ruiquan Huang, Peizhong Ju

AI总结 该论文提出了一种名为 Discrete MeanFlow 的新方法,用于在离散状态空间中实现一步生成。与连续空间中的 MeanFlow 不同,它通过连续时间马尔可夫链的条件转移核来建模概率质量的转移,并定义了一个平均离散速率来衡量转移概率在时间区间内的变化。该方法通过边界构建设计直接参数化转移核,确保生成过程无需迭代去噪或微分方程求解,只需一次前向传播和分类采样即可完成生成,实验表明其在有限状态马尔可夫链和合成序列生成任务中具有高精度。

详情
英文摘要

MeanFlow enables one-step generation in continuous spaces by learning an average velocity over a time interval rather than the instantaneous velocity field of flow matching. However, discrete state spaces do not have smooth trajectories or spatial derivatives, so the continuous formulation does not directly apply. We introduce Discrete MeanFlow, which replaces the motion of a point with the transport of probability mass over finite states. Our key object is the conditional transition kernel of a continuous-time Markov chain (CTMC), from which we define a mean discrete rate that measures the average change in transition probability over a time interval. We prove a Discrete MeanFlow identity that relates this finite-interval rate to the instantaneous CTMC generator at the endpoint, with the Kolmogorov forward equation replacing the spatial chain rule of continuous MeanFlow. Based on this identity, we parameterize the transition kernel directly using a boundary-by-construction design that guarantees valid probability outputs and exact boundary conditions without auxiliary losses. Since the learned kernel is itself a probability distribution, generation reduces to a single forward pass followed by one categorical draw meaning no iterative denoising, ODE integration, or multi-step refinement is required. We validate the framework on exact finite-state Markov chains, where the learned kernel recovers the analytical ground truth to high precision, and on factorized synthetic sequence generation tasks with varying alphabet sizes and sequence lengths.

2605.12803 2026-05-14 cs.LG

Pitfalls of Unlabeled Disagreement-Based Drift Detection in Streaming Tree Ensembles

Lara Sá Neves, Afonso Lourenço, Lizy K. John, Goreti Marreiros

AI总结 本文研究了在未标记数据流中基于分歧的漂移检测方法在增量决策树集成中的应用问题。作者通过构造批次特定的分歧度量并进行实验,发现该方法在多层感知机集成中表现良好,但在增量决策树集成中却显著劣于基于损失的检测方法。研究认为,这是由于增量决策树结构扩张为主的特性限制了模型的适应性,使得分歧无法准确反映其学习潜力。文章指出,利用增量决策树的规则分解特性进行重构,可能为提升其适应性提供新方向。

Comments Published as a conference paper at CAO Workshop at ICLR 2026

详情
英文摘要

Detecting concept drift in high-speed data streams remains challenging, particularly when models must operate on unlabeled data and avoid false alarms caused by benign shifts. While disagreement-based uncertainty has shown promise in neural networks, its adaptation to ensembles of incremental decision trees (IDTs) remains largely unexplored. We investigate this approach by constructing batch-specific disagreement measures via label flipping in ensemble members and evaluating their effectiveness for drift detection in tabular data streams. Our experiments show that, although this method performs well in ensembles of multi-layer perceptrons (MLPs), it consistently underperforms loss-based detectors when applied to IDTs. We attribute this behavior to the intrinsic rigidity of IDTs: learning primarily through structural expansion, with limited parameter adaptation, restricts model plasticity and prevents disagreement from reliably reflecting learning potential. Recent work on restructuring IDTs using their intrinsic decomposition into non-overlapping rules offers a promising direction for improving adaptability.

2605.12798 2026-05-14 cs.LG cs.AI cs.CL

Emergent and Subliminal Misalignment Through the Lens of Data-Mediated Transfer

Baris Askin, Muhammed Ustaomeroglu, Anupam Nayak, Gauri Joshi, Guannan Qu, Carlee Joe-Wong

AI总结 该研究探讨了在有限有害数据集上微调大语言模型时可能引发的“涌现性对齐偏差”(EM)和“潜意识学习”(SL)现象。研究认为,这类偏差并非由单一有害示例引起,而是数据结构、任务难度与模型能力之间相互作用的结果。通过实验发现,当微调与评估提示具有相似功能结构、存在更多连贯有害补全空间,或目标行为已被模型可靠学习时,偏差更容易出现。研究还首次对比了在策略外与策略内蒸馏下偏差的传递机制,强调应从数据和训练流程的整体视角理解对齐偏差的成因。

详情
英文摘要

Fine-tuning LLMs on narrow harmful datasets can induce Emergent Misalignment (EM), where models exhibit misaligned behavior far beyond the fine-tuning distribution. We argue that emergent misalignment can be better understood as a data-mediated transfer phenomenon: harmful fine-tuning examples do not induce uniform behavioral spillover, but interact with the structural properties of the dataset and the difficulty of the tasks relative to the model. Across our experiments, we find that misalignment appears more readily when fine-tuning and evaluation prompts share similar underlying functional structure, when prompts leave more room for coherent harmful completions, and when the target behavior has been more reliably learned by the model. The training pipeline itself also matters: pretraining composition shapes later misalignment. We further study Subliminal Learning (SL), where misalignment is transmitted by fine-tuning on seemingly benign data generated by a harmful teacher. Moving beyond the standard SFT setting, we for the first time compare this transfer under off-policy and on-policy distillation as well, allowing us to separate the roles of the teacher guidance and the training data distribution in transmitting misalignment. Together, these results argue for a data-centric view: Emergent/subliminal misalignment should not be treated as a simple consequence of isolated harmful fine-tuning examples, but as the result of interactions between fine-tuning data structure, pretraining distributions, and training channels.

2605.12792 2026-05-14 cs.LG

SoK: A Comprehensive Analysis of the Current Status of Neural Tangent Generalization Attacks with Research Directions

Thushari Hapuarachchi, Kaiqi Xiong

AI总结 本文对神经切线泛化攻击(NTGA)的现状进行了全面分析,并指出了其优缺点及改进方向。NTGA是首个在黑盒环境下实现的干净标签泛化攻击,用于应对深度神经网络训练中使用未经授权数据的问题。研究通过实验验证了NTGA在对抗训练和图像变换下的脆弱性,并发现近期提出的其他干净标签攻击在数据保护效果上已超越NTGA,从而揭示了进一步研究NTGA的必要性。

详情
英文摘要

There is recently a serious issue that Deep Neural Networks (DNNs) training uses more and more unauthorized data. A clean-label generalization attack, one type of data poisoning attacks, has been suggested to address this issue. The Neural Tangent Generalization Attack (NTGA) is considered as the first well-known clean-label generalization attack under the black-box settings, which provided an unprecedented step in data protection approaches. In this paper, we conduct a comprehensive analysis on the state-of-the-art of NTGA; to the best of our knowledge, this is the first thorough analysis regarding NTGA. First, we provide a classification of attacks against DNNs with their explanations and relations to NTGA. Then, this paper presents a taxonomy of black-box attacks and demonstrate that the NTGA is the first clean-label generalization attack under the black-box setting. We further analyze the existing studies of NTGA and give a comprehensive comparisons of their findings by conducting our own experiments to verify these findings. Moreover, our extensive experiments show that NTGA is vulnerable to adversarial training and image transformations, and applying linear separability to NTGA-generated images makes them more susceptible to such vulnerablities. We present the pros and cons of NTGA and suggest ways to improve NTGA robustness based on our analysis. Our further experiments indicate that several recently proposed clean-label generalization attacks outperform NTGA on data protection. Finally, we unveil the necessity of further research with future research insights on NTGA.

2605.12790 2026-05-14 cs.RO

Few-Shot Physics-Informed Neural Network for Shape Reconstruction of Concentric-Tube Robots

Navid Feizi, Filipe C. Pedrosa, Rajni V. Patel, Jagadeesan Jayender

AI总结 本文提出了一种基于物理信息的神经网络(PINN),用于具有三个预弯曲管的六自由度同心管机器人(CTR)的运动学建模。该方法将科瑟拉杆的微分方程嵌入神经网络,并通过少量观测数据进行训练,实现了对机器人形状、扭转角、扭矩、弯曲力矩和姿态的完整状态估计。实验表明,该模型在形状误差方面优于纯物理模型,且计算效率高,适用于实时控制。

Comments to be published in 2026 IEEE International Conference on Robotics & Automation proceedings

详情
英文摘要

Modeling concentric tube robots (CTRs) involves complex nonlinear continuum mechanics, and despite recent progress, physics-based models often lack an accurate representation of the experimental setups. To overcome these limitations, deep neural network-based models have been explored as alternatives with superior accuracy; however, they often overlook known mechanics, require large training datasets, and typically discard shape estimation of the robot. We present a physics-informed neural network (PINN) for kinematic modeling of a 6-DoF CTR with three pre-curved tubes that embeds the Cosserat rod differential equations and learns from few-shot observational data, balancing physics priors with data-driven fitting. PINN enables full-state estimation of shape, twist angle, torsional strain, bending moment, and orientation. Benchmark tests show a mean shape error below 1% of the robot length and accurately recovered other kinematic states, outperforming a purely physics-based Cosserat rod model baseline while using a minimal training set. The resulting model is also computationally efficient and robust, making it well-suited for real-time control applications.