arXivDaily arXiv每日学术速递 周一至周五更新
全部学科分类 1670
专题追踪
2605.14260 2026-05-18 stat.ML cs.LG

On the Burden of Achieving Fairness in Conformal Prediction

在符合预测中实现公平性的负担

Ziang Gao, Pengqi Liu, Archer Yi Yang, Mouloud Belbahri, Jesse C. Cresswell, Masoud Asgharian

发表机构 * McGill University(麦吉尔大学) TD Insurance(TD保险) Layer 6 AI

AI总结 研究揭示了单一阈值校准在符合预测中隐藏的跨组异质性,证明了公平性定义之间的根本矛盾,并量化了不同校准策略的成本。

详情
AI中文摘要

符合预测通常使用单一池化阈值进行校准,但这种方法可能隐藏分数分布中的跨组异质性并扭曲各组的覆盖范围。我们通过分割符合校准下的总体分数分布研究了这一现象。首先,我们推导出一个守恒定律和下限,表明池化校准在跨组分位数异质性的尺度上不可避免地导致各组覆盖范围的扭曲。其次,我们证明了符合预测中两种主要公平性定义,即等覆盖和等集合大小,本质上存在根本矛盾。第三,我们量化了在不同策略之间转换的成本,这些策略分别处理各组或池化各组。在合成和真实数据上的实验验证了有限样本校准后的相同双向权衡。我们的结果表明,对于所研究的校准家族,校准选择不会消除跨组异质性;它决定了由此产生的扭曲出现在覆盖或大小维度中,为实际公平导向的校准选择提供了原理性的分析视角。

英文摘要

Conformal prediction is often calibrated with a single pooled threshold, but this can hide cross-group heterogeneity in score distributions and distort group-wise coverage. We study this phenomenon through the population score distributions underlying split conformal calibration. First, we derive a conservation law and lower bound showing that pooled calibration incurs irreducible group-wise coverage distortion at a scale set by cross-group quantile heterogeneity. Second, we demonstrate that the two leading fairness definitions for conformal prediction, Equalized Coverage and Equalized Set Size, are fundamentally in tension. Third, we quantify the cost of moving between policies which treat groups separately or pool them. Experiments on synthetic and real data confirm the same bidirectional trade-off after finite-sample calibration. Our results show that, for the policy families studied here, calibration choice does not remove cross-group heterogeneity; it determines whether the resulting distortion appears in the coverage or size dimension, providing a principled lens for analyzing fairness-oriented calibration choices in practice.

2605.12581 2026-05-18 cs.LO cs.AI cs.FL math.OC

Ensuring Logic in the Fog: Sound POMDP Synthesis with LTL Objectives

确保雾中的逻辑:带有LTL目标的可靠POMDP综合

Can Zhou, Yulong Gao, Pian Yu

发表机构 * Imperial College London(伦敦帝国理工学院) University College London(伦敦大学学院)

AI总结 本文提出一种新的可靠奖励塑造机制,用于在部分可观测马尔可夫决策过程中实现LTL目标的合成,通过增强的蒙特卡洛规划框架提升在部分可观测环境中的导航能力。

Comments Accepted by IJCAI-ECAI 2026, the 35th International Joint Conference on Artificial Intelligence

详情
AI中文摘要

合成能够导航不确定环境并遵守复杂时间约束的自主代理仍然是基本挑战。虽然线性时序逻辑(LTL)提供了一种严格指定此类任务的语言,但部分可观测马尔可夫决策过程(POMDP)中验证LTL满足的固有不可判定性使得定量合成困难,尤其是在为近似求解器设计可靠奖励信号时。本文通过一种新颖且可靠的奖励塑造机制填补了这一空白,该机制动态生成基于信念的奖励,这些奖励基于已认证的LTL满足。通过将此机制整合到增强的蒙特卡洛规划框架中,我们使代理能够通过专注于最大化可验证成功的搜索过程来导航部分可观测性中的'雾'。实验表明,该方法不仅在现有求解器失败的场景中表现出色,而且在多样化的基准领域中保持了有效性和可扩展性。

英文摘要

Synthesising autonomous agents that can navigate uncertain environments while adhering to complex temporal constraints remains a fundamental challenge. While Linear Temporal Logic (LTL) provides a rigorous language for specifying such tasks, the inherent undecidability of qualitatively verifying LTL satisfaction in partially observable Markov decision processes renders quantitative synthesis difficult, especially when designing reliable reward signals for approximate solvers. In this paper, we bridge this gap with a novel, sound reward-shaping mechanism that dynamically generates belief-dependent rewards grounded in certified LTL satisfaction. By integrating this mechanism into an enhanced Monte Carlo Planning framework, we empower agents to navigate the `fog' of partial observability with a search process focused on maximising verifiable success. Our experiments demonstrate that this approach not only thrives in scenarios where existing solvers fail but also maintains effectiveness and scalability across diverse benchmark domains.

2604.26578 2026-05-18 cs.SE cs.AI

Graph Construction and Matching for Imperative Programs using Neural and Structural Methods

基于神经方法和结构方法的命令式程序图构建与匹配

Arshad Beg, Diarmuid O'Donoghue, Rosemary Monahan

发表机构 * Maynooth University(梅诺思大学)

AI总结 本文提出通过神经和结构方法构建命令式程序图,实现跨语言和注释风格的图表示一致性,为语义丰富和近似图匹配提供基础。

Comments 20 Pages. Technical Report. Maynooth University, Ireland. Submitted on 29 April 2026

详情
AI中文摘要

重用验证制品需要识别程序及其规范的结构和语义相似性。本文聚焦图构建作为实现这一目标的基础步骤。我们提出一个管道,将命令式程序及其注释转换为带类型和属性的图。实验涵盖包含C与ACSL、Java与JML以及Dafny for C#的数据集。该管道整合了抽象语法树解析与从SentenceTransformer和CodeBERT等模型中获得的语义嵌入。这使生成的图表示能够捕捉结构关系和语义上下文。我们的结果表明,可以在不同语言和注释风格下构建一致的图表示。本文为未来语义丰富和近似图匹配的可扩展验证制品重用提供了实用基础。

英文摘要

Reusing verification artefacts requires identifying structural and semantic similarities across programs and their specifications. In this paper, we focus on graph construction as a foundational step toward this goal. We present a pipeline that converts imperative programs and their annotations into typed, attributed graphs. Our experiments cover datasets including C with ACSL, Java with JML, and Dafny for C\#. The pipeline integrates abstract syntax tree parsing with semantic embeddings derived from models such as SentenceTransformer and CodeBERT. This enables the generation of graph representations that capture both structural relationships and semantic context. Our results show that consistent graph representations can be constructed across different languages and annotation styles. This work provides a practical basis for future steps in semantic enrichment and approximate graph matching for scalable verification artefact reuse.

2604.09631 2026-05-18 cs.DC cs.AI

Hardware Utilization and Inference Performance of Edge Object Detection Under Fault Injection

边缘目标检测在故障注入下的硬件利用与推断性能

Faezeh Pasandideh, Mehdi Azarafza, Achim Rettberg

发表机构 * Hamm-Lippstadt University of Applied Sciences (HSHL)(哈姆-利普施塔特应用科学大学(HSHL))

AI总结 研究通过故障注入测试评估了TensorRT优化的YOLO模型在边缘平台上的硬件行为,发现其在资源降级下保持稳定性能,为边缘推断可靠性提供硬件层面的视角。

详情
AI中文摘要

随着深度学习模型部署在资源受限的边缘平台,了解硬件在资源降级下的行为变得至关重要。本文系统地表征了在大规模故障注入测试下,TensorRT优化的YOLOv10s、YOLOv11s和YOLO2026n管道在NVIDIA Jetson Nano上的CPU负载、GPU利用率、RAM消耗、功耗、吞吐量和热行为。故障通过解耦框架合成,利用大型语言模型和潜在扩散模型。结果表明,两种任务和两种模型的推断引擎在资源降级下保持GPU占用稳定,温度上升受控,功耗在安全范围内,内存使用在初始暖机阶段后趋于一致释放模式。目标检测在内存和热行为上略有波动,但两者均得出结论:TensorRT管道在输入数据严重降级时仍表现良好。这些发现提供了模型可靠性的硬件层面视角,与边缘推断性能研究形成补充。

英文摘要

As deep learning models are deployed on resource constrained edge platforms in autonomous driving systems, reli able knowledge of hardware behavior under resource degradation becomes an essential requirement. Therefore, we introduce a systematic characterization of CPU load, GPU utilization, RAM consumption, power draw, throughput, and thermal behaviour of TensorRT-optimized YOLOv10s, YOLOv11s and YOLO2026n pipelines running on NVIDIA Jetson Nano under a large-scale fault injection campaign targeting both lane-following and ob ject detection tasks. Faults are synthesized using a decoupled framework that leverages large language models (LLMs) and latent diffusion models (LDMs), based on original data from our JetBot platform data collection. Results show that across both tasks and both models the inference engines keep GPU occupancy stable, temperature rise under control, and power consumption within safe limits, while memory usage settles into a consistent release pattern after the initial warm-up phase. Object detection tends to show somewhat more variability in memory and thermal behavior, yet both tasks point to the same conclusion: the TensorRT pipelines hold up well even when the input data is heavily degraded. These findings offer a hardware-level view of model reliability that sits alongside, rather than against, the broader body of work focused on inference performance at the edge.

2603.29617 2026-05-18 q-bio.NC cs.AI cs.CL

Convergent Representations of Linguistic Constructions in Human and Artificial Neural Systems

人类和人工神经系统的语言构造收敛表征

Pegah Ramezani, Thomas Kinfe, Andreas Maier, Achim Schilling, Patrick Krauss

发表机构 * Department of English and American Studies, University Erlangen-Nuremberg(英语与美国研究系,埃尔朗根-纽伦堡大学) Pattern Recognition Lab, University Erlangen-Nuremberg(模式识别实验室,埃尔朗根-纽伦堡大学) Neuromodulation and Neuroprosthetics, University Hospital Mannheim, University Heidelberg(神经调控与神经假体,曼海姆大学医院,海德堡大学) BGU Ludwigshafen, Germany(吕贝克大学吕贝克分校,德国) Neuroscience Lab, University Hospital Erlangen(神经科学实验室,埃尔朗根大学医院)

AI总结 研究通过EEG验证人类神经活动对语言构造的表征,发现句末alpha波段出现构造特异性神经签名,与人工语言模型的构造表征模式相似,支持语言构造作为形式-意义映射的神经编码。

详情
AI中文摘要

理解大脑如何处理语言构造是认知神经科学和语言学的核心挑战。最近的计算研究表明,人工神经语言模型会自发发展出对论元结构构造(ASCs)的差异化表征,生成关于构造层面信息在处理过程中何时何地出现的预测。本研究通过脑电图(EEG)在人类神经活动中测试这些预测。十名母语英语者在听200个合成生成的句子时,这些句子涵盖四种构造类型(单及物、双及物、因果运动、结果性)。利用时频方法、特征提取和机器学习分类分析,发现构造特异性神经签名主要出现在句末位置,即论元结构完全歧义化的位置,并且最显著地出现在alpha波段。成对分类显示可靠区分,尤其是双及物和结果性构造之间,而其他对则有重叠。关键的是,这些效应的出现时间和相似性结构与基于循环和变压器的语言模型中的构造表征模式相似,其中构造性表征在整合处理阶段出现。这些发现支持语言构造作为神经编码的独立形式-意义映射的观点,与构造语法一致,并表明生物和人工系统在相似的表征解决方案上趋于一致。更广泛地说,这种趋同与学习系统在基础表征景观中发现稳定区域(最近称为柏拉图表征空间)的想法一致,该景观约束了高效语言抽象的出现。

英文摘要

Understanding how the brain processes linguistic constructions is a central challenge in cognitive neuroscience and linguistics. Recent computational studies show that artificial neural language models spontaneously develop differentiated representations of Argument Structure Constructions (ASCs), generating predictions about when and how construction-level information emerges during processing. The present study tests these predictions in human neural activity using electroencephalography (EEG). Ten native English speakers listened to 200 synthetically generated sentences across four construction types (transitive, ditransitive, caused-motion, resultative) while neural responses were recorded. Analyses using time-frequency methods, feature extraction, and machine learning classification revealed construction-specific neural signatures emerging primarily at sentence-final positions, where argument structure becomes fully disambiguated, and most prominently in the alpha band. Pairwise classification showed reliable differentiation, especially between ditransitive and resultative constructions, while other pairs overlapped. Crucially, the temporal emergence and similarity structure of these effects mirror patterns in recurrent and transformer-based language models, where constructional representations arise during integrative processing stages. These findings support the view that linguistic constructions are neurally encoded as distinct form-meaning mappings, in line with Construction Grammar, and suggest convergence between biological and artificial systems on similar representational solutions. More broadly, this convergence is consistent with the idea that learning systems discover stable regions within an underlying representational landscape - recently termed a Platonic representational space - that constrains the emergence of efficient linguistic abstractions.

2603.25099 2026-05-18 cs.CE cs.AI

Large Language Models as Optimization Controllers: Adaptive Continuation for SIMP Topology Optimization

大语言模型作为优化控制器:SIMP拓扑优化的自适应延续

Shaoliang Yang, Jun Wang, Yunsheng Wang

发表机构 * Department of Mechanical Engineering, Santa Clara University(圣克拉拉大学机械工程系)

AI总结 本文提出利用大语言模型作为SIMP拓扑优化的在线自适应控制器,通过实时状态条件参数决策替代传统固定调度延续方法,提升优化效果。

Comments 32 pages, 11 figures

详情
AI中文摘要

我们提出一个框架,其中大语言模型(LLM)作为SIMP拓扑优化的在线自适应控制器,取代传统固定调度延续方法。在每次第k次迭代中,LLM接收结构化观察(当前合规性、灰度指数、停滞计数器、棋盘度量、体积分数和预算消耗),并通过直接数字控制接口输出惩罚指数p、投影锐度β、滤波半径r_min和移动限制δ的数值。硬灰度门防止过早二元化,元优化循环使用第二个LLM迭代来调整代理的调用频率和门阈值。我们对四个基线(固定、标准三场延续、专家启发法、仅调度消融)在三个二维问题(悬臂、MBB梁、L型支架)和两个三维问题(悬臂、MBB梁)上进行基准测试,所有问题均运行300次迭代。标准化的40次锐化尾部从最佳有效快照应用,使得合规性差异仅反映探索阶段。LLM代理在每个基准测试中均达到最低最终合规性:相对于固定基线,-5.7%至-18.1%,所有解决方案均为完全二进制。仅调度消融在三个问题中的两个上表现低于固定基线,确认LLM的实时干预(而非调度几何)驱动了增益。代码和再生产脚本将在发表时发布。

英文摘要

We present a framework in which a large language model (LLM) acts as an online adaptive controller for SIMP topology optimization, replacing conventional fixed-schedule continuation with real-time, state-conditioned parameter decisions. At every $k$-th iteration, the LLM receives a structured observation$-$current compliance, grayness index, stagnation counter, checkerboard measure, volume fraction, and budget consumption$-$and outputs numerical values for the penalization exponent $p$, projection sharpness $β$, filter radius $r_{\min}$, and move limit $δ$ via a Direct Numeric Control interface. A hard grayness gate prevents premature binarization, and a meta-optimization loop uses a second LLM pass to tune the agent's call frequency and gate threshold across runs. We benchmark the agent against four baselines$-$fixed (no-continuation), standard three-field continuation, an expert heuristic, and a schedule-only ablation$-$on three 2-D problems (cantilever, MBB beam, L-bracket) at $120\!\times\!60$ resolution and two 3-D problems (cantilever, MBB beam) at $40\!\times\!20\!\times\!10$ resolution, all run for 300 iterations. A standardized 40-iteration sharpening tail is applied from the best valid snapshot so that compliance differences reflect only the exploration phase. The LLM agent achieves the lowest final compliance on every benchmark: $-5.7\%$ to $-18.1\%$ relative to the fixed baseline, with all solutions fully binary. The schedule-only ablation underperforms the fixed baseline on two of three problems, confirming that the LLM's real-time intervention$-$not the schedule geometry$-$drives the gain. Code and reproduction scripts will be released upon publication.

2511.19931 2026-05-18 cs.IR cs.AI

LLM-EDT: Large Language Model Enhanced Cross-domain Sequential Recommendation with Dual-phase Training

LLM-EDT: 基于大语言模型的跨领域序列推荐增强方法与双阶段训练

Ziwei Liu, Qidong Liu, Wanyu Wang, Yejing Wang, Pengyue Jia, Tong Xu, Wei Huang, Chong Chen, Xiangyu Zhao

发表机构 * City University of Hong Kong Hong Kong China Xi'an Jiaotong University \& City University of Hong Kong Xi'an China University of Science Independent Researcher Beijing China Tsinghua University Beijing China City University of Hong Kong Xi'an Jiaotong University \& City University of Hong Kong Independent Researcher Tsinghua University

AI总结 本文提出LLM-EDT,通过双阶段训练策略解决跨领域序列推荐中的领域不平衡和过渡问题,引入可转移物品增强器和领域感知配置模块,提升推荐效果。

详情
AI中文摘要

跨领域序列推荐(CDSR)旨在通过整合多领域信息丰富用户-物品交互。尽管已有进展,领域不平衡和过渡问题阻碍了进一步发展。前者导致某一领域交互主导整体行为,难以捕捉其他领域特征;后者导致混合交互序列中难以捕捉用户跨领域偏好,影响特定领域下一项预测性能。大语言模型(LLMs)通过生成和编码能力部分缓解这些问题,但现有LLM增强的CDSR方法仍需改进。为此,我们提出LLM-EDT,通过可转移物品增强器减少无关噪声,双阶段训练策略增强领域特定线程的领域共享背景,以及领域感知配置模块总结用户偏好并自适应聚合生成综合用户画像。实验验证了LLM-EDT的有效性。

英文摘要

Cross-domain Sequential Recommendation (CDSR) has been proposed to enrich user-item interactions by incorporating information from various domains. Despite current progress, the imbalance issue and transition issue hinder further development of CDSR. The former one presents a phenomenon that the interactions in one domain dominate the entire behavior, leading to difficulty in capturing the domain-specific features in the other domain. The latter points to the difficulty in capturing users' cross-domain preferences within the mixed interaction sequence, resulting in poor next-item prediction performance for specific domains. With world knowledge and powerful reasoning ability, Large Language Models (LLMs) partially alleviate the above issues by performing as a generator and an encoder. However, current LLMs-enhanced CDSR methods are still under exploration, which fail to recognize the irrelevant noise and rough profiling problems. Thus, to make peace with the aforementioned challenges, we proposed an LLMs Enhanced Cross-domain Sequential Recommendation with Dual-phase Training ({LLM-EDT}). To address the imbalance issue while introducing less irrelevant noise, we first propose the transferable item augmenter to adaptively generate possible cross-domain behaviors for users. Then, to alleviate the transition issue, we introduce a dual-phase training strategy to empower the domain-specific thread with a domain-shared background. As for the rough profiling problem, we devise a domain-aware profiling module to summarize the user's preference in each domain and adaptively aggregate them to generate comprehensive user profiles. The experiments on three public datasets validate the effectiveness of our proposed LLM-EDT. To ease reproducibility, we have released the detailed code online at {https://anonymous.4open.science/r/LLM-EDT-583F}.

2511.15623 2026-05-18 cs.DB cs.AI cs.LO

Sufficient Explanations in Databases and their Connections to Database Repairs

数据库中的充分解释及其与数据库修复的关系

Leopoldo Bertossi, Nina Pardal

发表机构 * Carleton University, Canada \& IMFD, Chile. University of Edinburgh, UK.

AI总结 研究数据库中充分解释的概念及其与数据库修复的联系,提出基于答案集程序计算充分解释和度量的方法。

详情
AI中文摘要

我们研究了充分解释的概念,以及用于查询回答的数据库元组的充分性度数作为归因分数。我们还探讨了充分解释与用于处理不一致数据库的数据库修复之间的联系,并与基于因果的必要解释相结合,获得新的计算结果。我们展示了如何使用答案集程序来指定充分解释并计算充分性度数。

英文摘要

We investigate the notion of sufficient explanation, and a sufficiency-degree as attribution score for database tuples in relation to query answering. We also investigate and exploit connections with database repairs as used for dealing with inconsistent databases; and with causality-based necessary explanations, obtaining new computational results. We show how to use answer-set programs to specify sufficient explanations and compute sufficiency-degrees.

2511.14482 2026-05-18 cs.DB cs.LG

Gradient-Based Join Ordering

基于梯度的连接顺序

Tim Schwabe, Maribel Acosta

发表机构 * Technical University of Munich(慕尼黑技术大学)

AI总结 本文提出基于梯度的连接顺序方法,通过连续松弛和可微约束,在放松空间中寻找低成本计划,优于传统离散搜索方法。

详情
AI中文摘要

连接顺序是NP难问题,涉及选择数据库查询中最有效的连接顺序。传统方法将问题视为二叉树的离散组合搜索,但存在效果与效率的权衡。本文展示当成本模型可微时,查询计划可连续松弛为软邻接矩阵,结合可微约束确保计划有效性,从而在放松空间中通过梯度搜索低成本计划。使用图神经网络作为成本模型,证明该方法在两个不同图数据集上能获得与传统离散搜索相当甚至更低的成本,并且运行时间优于离散搜索算法。

英文摘要

Join ordering is the NP-hard problem of selecting the most efficient order in which to evaluate joins (conjunctive, binary operators) in a database query. Because query execution performance critically depends on this choice, join ordering lies at the core of query optimization. Traditional approaches cast this problem as a discrete combinatorial search over binary trees guided by a cost model, but they have trade-offs between effectiveness and efficiency. We show that when the cost model is differentiable, query plans can be continuously relaxed into a soft adjacency matrix that represents a superposition of plans. This continuous relaxation, combined with differentiable constraints that enforce plan validity, enables a gradient-based search for low-cost plans within this relaxed space. Using a Graph Neural Network as the cost model, we demonstrate that this gradient-based approach can find comparable and even lower-cost plans compared to traditional discrete search methods on two different graph datasets. Furthermore, we empirically show that the runtime of this approach scales better than discrete search algorithms. We believe this first step towards gradient-based join ordering can lead to more effective and efficient query optimizers in the future.

2510.06194 2026-05-18 hep-ex astro-ph.IM cs.CV

Overlap-aware segmentation for topological reconstruction of obscured objects

关注重叠的分割以重建被遮挡物体的拓扑结构

J. Schueler, H. M. Araújo, S. N. Balashov, J. E. Borg, C. Brew, F. M. Brunbauer, C. Cazzaniga, A. Cottle, D. Edgeman, C. D. Frost, F. Garcia, D. Hunt, M. Kastriotou, P. Knights, H. Kraus, A. Lindote, M. Lisowska, D. Loomba, E. Lopez Asamar, P. A. Majewski, T. Marley, C. McCabe, L. Millins, R. Nandakumar, T. Neep, F. Neves, K. Nikolopoulos, E. Oliveri, A. Roy, T. J. Sumner, E. Tilly, W. Thompson, M. A. Vogiatzi

发表机构 * Department of Physics and Astronomy, University of New Mexico(新墨西哥大学物理与天文学系) Department of Physics, Blackett Laboratory, Imperial College London(伦敦帝国理工学院物理系) Particle Physics Department, STFC Rutherford Appleton Laboratory(英国科学与技术设施委员会拉瑟福德-苹果顿实验室粒子物理部) Luleå University of Technology(卢勒阿高校) CERN(欧洲核子研究中心) ISIS Neutron and Muon Source, STFC Rutherford Appleton Laboratory(英国科学与技术设施委员会拉瑟福德-苹果顿实验室ISIS中子与穆子源) University College London (UCL), Department of Physics and Astronomy(伦敦大学学院(UCL)物理与天文学系) Department of Physics, Keble Road, University of Oxford(牛津大学物理系) Helsinki Institute of Physics, University of Helsinki(赫尔辛基大学物理研究所) School of Physics and Astronomy, University of Birmingham(伯明翰大学物理与天文学学院) LIP – Laboratório de Instrumentação e Física Experimental de Partículas, University of Coimbra(科英布拉大学粒子物理实验仪器实验室) Departamento de Fisica Teorica, Universidad Autonoma de Madrid(马德里自治大学理论物理系) Department of Physics, King’s College London(伦敦国王学院物理系) University of Hamburg(汉堡大学)

AI总结 本文提出OASIS框架,通过加权损失函数优先处理重叠区域,提升被遮挡物体的像素强度和拓扑特征重建。在MIGDAL实验中,OASIS显著改善了低能电子轨迹的重建效果。

详情
AI中文摘要

重叠物体的分离在科学成像中是一个重大挑战。尽管深度学习分割-回归算法能预测像素强度,但通常平等对待所有区域,而非优先处理重叠区域。最近的实例分割进展表明,训练中加权重叠像素区域可改善重叠区域的分割边界预测,但此方法尚未扩展到分割回归。本文提出OASIS:一种新的分割-回归框架,其加权损失函数旨在训练期间优先处理物体重叠区域,从而从严重遮挡的物体中提取像素强度和拓扑特征。在MIGDAL实验中,OASIS被用于直接成像Migdal效应——一种罕见过程,其中电子发射由核散射诱导——在低气压光学时间投影室中。此设置是一个极端测试案例,因为重建目标是微弱的电子 recoil 轨迹,通常被数量级更亮的核 recoil 轨迹严重遮挡。与无权分割回归相比,我们证明OASIS的新型重叠区域目标损失函数权重是提高低能电子轨迹强度和拓扑重建的最重要训练权重。在八次训练活动中平均,我们进一步显示添加重叠目标权重可将这些低能电子的中位强度重建误差从-41.1%提高到-13.3%。这些性能提升证明OASIS是一种通用的方法,可用于恢复重叠主导区域的被遮挡信号。

英文摘要

The separation of overlapping objects presents a significant challenge in scientific imaging. While deep learning segmentation-regression algorithms can predict pixel-wise intensities, they typically treat all regions equally rather than prioritizing overlap regions where attribution is most ambiguous. Recent advances in instance segmentation show that weighting regions of pixel overlap in training can improve segmentation boundary predictions in regions of overlap, but this idea has not yet been extended to segmentation regression. We address this with Overlap-Aware Segmentation of ImageS (OASIS): a new segmentation-regression framework with a weighted loss function designed to prioritize regions of object-overlap during training, enabling extraction of pixel intensities and topological features from heavily obscured objects. We demonstrate OASIS in the context of the MIGDAL experiment, which aims to directly image the Migdal effect--a rare process where electron emission is induced by nuclear scattering--in a low-pressure optical time projection chamber. This setting poses an extreme test case, as the target for reconstruction is a faint electron recoil track which is often heavily-buried within the order(s)-of-magnitude brighter nuclear recoil track. Compared to unweighted segmentation regression, we demonstrate OASIS's novel overlap region-targeted loss function weight to be the single most important training weight for improving intensity and topological reconstructions of the low-energy electron tracks that tend to be most dominated by pixel overlap. Averaging over eight training campaigns, we further show the addition of overlap-targeted weights to improve median intensity reconstruction errors from -41.1% to -13.3% for these low-energy electrons. These performance gains demonstrate OASIS as a generalizable methodology for recovering obscured signals in overlap-dominated regions.

2510.01632 2026-05-18 q-bio.BM cs.AI

BioBlobs: Unsupervised Discovery of Functional Substructures for Protein Function Prediction

BioBlobs:无监督发现蛋白质功能预测的的功能子结构

Xin Wang, Kaiwen Shi, Carlos Oliver

发表机构 * Vanderbilt University(范德比大学) Yale University(耶鲁大学)

AI总结 BioBlobs通过无监督方法发现蛋白质的功能子结构,利用端到端可微分框架压缩蛋白质为少量连贯子结构并预测功能,实现了对功能区域的候选识别。

详情
AI中文摘要

蛋白质功能由如催化三元组、结合口袋和结构模体等紧密子结构驱动,这些子结构仅占据蛋白质残基的小部分。然而,现有基于蛋白质编码器的流程并未在子结构层面建模,未能回答核心生物学问题:蛋白质的哪一部分负责其功能?我们引入了BioBlobs,一种编码器无关、端到端可微分的框架,能够将蛋白质压缩为少量连贯的子结构(blobs),并仅基于这些blobs预测功能,使得每个blob对应一个候选功能区域。在多样化的蛋白质功能预测任务和多种基于序列和结构的编码器上,BioBlobs在仅使用少量残基的情况下,匹配或超过了强大的基线模型。发现的blobs会根据任务调整其空间尺度,从局部催化位点到整个结构域。仅在蛋白质层面标签上训练,BioBlobs能够恢复M-CSA数据库中实验注释的催化位点,证明了无监督的功能子结构发现,并为未注释的整个蛋白质组的规模化功能位点发现开辟了道路。

英文摘要

Protein function is driven by cohesive substructures, such as catalytic triads, binding pockets, and structural motifs, that occupy only a small fraction of a protein's residues. Yet existing pipelines built on protein encoders do not model proteins at the substructure level, leaving the central biological question unanswered: which substructure of a protein is responsible for its function? We introduce BioBlobs, an encoder-agnostic, end-to-end differentiable framework that compresses a protein into a small set of cohesive substructures (blobs) and predicts function from these blobs alone, so that each blob corresponds to a candidate functional region. Across diverse protein function prediction tasks and multiple sequence- and structure-based encoders, BioBlobs matches or exceeds strong baselines while operating on only a small fraction of residues. The discovered blobs adapt their spatial scale to the task, ranging from local catalytic sites to entire structural domains. Trained only on protein-level labels, BioBlobs recovers experimentally annotated catalytic sites in the M-CSA database, demonstrating unsupervised functional substructure discovery and opening a path to large-scale functional site discovery across the unannotated proteome.

2509.07404 2026-05-18 math.OC cs.LG

Reinforcement learning for adaptive interior point methods in convex quadratic programming

强化学习用于凸二次规划中自适应内点方法

Jeremy Bertoncini, Alberto De Marchi, Matthias Gerdts, Simon Gottschalk

发表机构 * Department of Aerospace Engineering, Institute of Applied Mathematics and Scientific Computing, University of the Bundeswehr Munich(航空航天工程系、应用数学与科学计算研究所、联邦国防军大学穆尔尼奇分校)

AI总结 本文提出利用强化学习优化内点法求解凸二次规划问题,通过调整双循环流程和控制参数提升求解效率,实验表明轻量训练后策略能有效泛化至不同问题类。

Comments 20 pages, 9 figures, 4 tables

详情
AI中文摘要

二次规划是现代非线性优化、控制和数据科学中的重要工具。尽管正则化方法在最少量假设下提供收敛保证,但它们通常表现出一阶方案典型的慢尾收敛特性,需要许多迭代才能获得高精度解。此外,超参数调优显著影响求解器性能,但如何找到合适的参数配置仍是一个悬而未决的研究问题。为解决这些问题,我们探索数据驱动方法如何加速求解过程。针对高精度解,我们专注于正则化内点求解器,并仔细处理其双循环流程和控制参数。我们将展示强化学习如何在促进求解器调优和加速优化过程方面做出重要贡献。数值实验表明,在轻量训练后,学习到的策略能有效泛化至不同问题类,具有不同维度。

英文摘要

Quadratic programming is a workhorse of modern nonlinear optimization, control, and data science. Although regularized methods offer convergence guarantees under minimal assumptions on the problem data, they can exhibit the slow tail-convergence typical of first-order schemes, thus requiring many iterations to achieve high-accuracy solutions. Moreover, hyperparameter tuning significantly impacts the solver performance but how to find an appropriate parameter configuration remains an elusive research question. To address these issues, we explore how data-driven approaches can accelerate the solution process. Aiming at high-accuracy solutions, we focus on a regularized interior-point solver and carefully handle its two-loop flow and control parameters. We will show that reinforcement learning can make a significant contribution to facilitating the solver tuning and to speeding up the optimization process. Numerical experiments demonstrate that, after a lightweight training, the learned policy generalizes well to different problem classes with varying dimensions.

2508.08431 2026-05-18 eess.IV cs.CV eess.SP

Preprocessing Algorithm Leveraging Geometric Modeling for Scale Correction in Hyperspectral Images for Improved Unmixing Performance

基于几何建模的预处理算法用于超光谱图像的尺度校正以提升解混性能

Praveen Sumanasekara, Athulya Ratnayake, Buddhi Wijenayake, Keshawa Ratnayake, Roshan Godaliyadda, Parakrama Ekanayake, Vijitha Herath

发表机构 * Department of Electrical and Electronic Engineering, University of Peradeniya(珀斯德尼亚大学电子与电气工程系) School of Electrical and Computer Engineering, Purdue University(普渡大学电子与计算机工程学院)

AI总结 本文提出一种预处理算法,通过校正像素签名的尺度变化,提升超光谱解混性能,实验验证其在多种解混方法上的有效性,实现约50%的误差降低。

Comments 20 pages, 14 figures

详情
AI中文摘要

光谱变化显著影响超光谱解混算法的准确性和收敛性。许多方法处理复杂光谱变化,但因地形、光照和阴影导致的像素签名大规模畸变仍是主要挑战。这些变化通常会降低解混性能并使模型拟合复杂化。因此,校正这些变化可为实际GIS应用提供显著优势。本文提出了一种新的预处理算法,在解混前校正由尺度引起的光谱变化。通过估计并校正像素签名的尺度畸变,该算法生成具有最小尺度畸变的像素签名。由于这些尺度畸变(阻碍许多解混方法性能)在所提出方法的输出中被大大减少,解混算法的丰度估计显著提高。我们提供了一个严谨的数学框架来描述和校正尺度变化,并对所提算法进行了广泛的实验验证。此外,该算法的影响在多种最先进的解混方法上评估了两个合成和两个真实超光谱数据集。所提出的预处理步骤在这些方法上一致提高了性能,即使对于专门处理光谱变化的算法,也实现了约50%的误差降低。这表明尺度校正作为一种补充步骤,有助于更准确的解混,利用现有方法。该算法的通用性、一致影响和显著影响突显了其在实际超光谱解混管道中的潜力。实现代码将在发表时公开。

英文摘要

Spectral variability significantly impacts the accuracy and convergence of hyperspectral unmixing algorithms. Many methods address complex spectral variability; yet large-scale distortions to the scale of the observed pixel signatures due to topography, illumination, and shadowing remain a major challenge. These variations often degrade unmixing performance and complicate model fitting. Because of this, correcting these variations can offer significant advantages in real-world GIS applications. In this paper, we propose a novel preprocessing algorithm that corrects scale-induced spectral variability prior to unmixing. By estimating and correcting these distortions to the scale of the pixel signatures, the algorithm produces pixel signatures with minimal distortions in scale. Since these distortions in scale (which hinder the performance of many unmixing methods) are greatly minimized in the output provided by the proposed method, the abundance estimation of the unmixing algorithms is significantly improved. We present a rigorous mathematical framework to describe and correct for scale variability and provide extensive experimental validation of the proposed algorithm. Furthermore, the algorithm's impact is evaluated across a wide range of state-of-the-art unmixing methods on two synthetic and two real hyperspectral datasets. The proposed preprocessing step consistently improves the performance of these algorithms, achieving error reductions of around 50%, even for algorithms specifically designed to handle spectral variability. This demonstrates that scale correction acts as a complementary step, facilitating more accurate unmixing with existing methods. The algorithm's generality, consistent impact, and significant influence highlight its potential as a key component in practical hyperspectral unmixing pipelines. The implementation code will be made publicly available upon publication.

2506.22440 2026-05-18 cs.CY cs.LG cs.MA econ.GN q-fin.EC

From Model Design to Organizational Design: Complexity Redistribution and Trade-Offs in Generative AI

从模型设计到组织设计:生成AI中的复杂性再分配与权衡

Sharique Hasan, Alexander Oettl, Sampsa Samila

发表机构 * Duke University(杜克大学) Georgia Institute of Technology(佐治亚理工学院) IESE Business School(IESE商学院)

AI总结 本文提出GAS框架,分析大语言模型如何重塑组织与竞争策略,揭示生成AI中通用性、准确性与简洁性之间的权衡及复杂性再分配对管理挑战的影响。

详情
AI中文摘要

本文引入通用性-准确性-简洁性(GAS)框架,分析大语言模型如何重塑组织和竞争策略。我们认为,将AI视为简单输入成本降低忽略了两个关键动态:(a)通用性、准确性和简洁性之间的固有权衡;(b)复杂性在利益相关者间的再分配。尽管LLMs通过简单接口提供高通用性和准确性,这种用户端的简洁性掩盖了复杂性向基础设施、合规性和专业人员的转移。因此,GAS权衡并未消失,而是从用户转移到组织,带来新的管理挑战,尤其是在高风险应用中的准确性问题。我们主张,竞争优势不再来自单纯的AI采用,而是来自通过抽象层设计、流程对齐和互补专业知识掌握再分配的复杂性。本研究通过阐明可扩展认知如何再分配复杂性并重新定义技术整合的条件,推动了AI战略的发展。

英文摘要

This paper introduces the Generality-Accuracy-Simplicity (GAS) framework to analyze how large language models (LLMs) are reshaping organizations and competitive strategy. We argue that viewing AI as a simple reduction in input costs overlooks two critical dynamics: (a) the inherent trade-offs among generality, accuracy, and simplicity, and (b) the redistribution of complexity across stakeholders. While LLMs appear to defy the traditional trade-off by offering high generality and accuracy through simple interfaces, this user-facing simplicity masks a significant shift of complexity to infrastructure, compliance, and specialized personnel. The GAS trade-off, therefore, does not disappear but is relocated from the user to the organization, creating new managerial challenges, particularly around accuracy in high-stakes applications. We contend that competitive advantage no longer stems from mere AI adoption, but from mastering this redistributed complexity through the design of abstraction layers, workflow alignment, and complementary expertise. This study advances AI strategy by clarifying how scalable cognition relocates complexity and redefines the conditions for technology integration.

2504.18522 2026-05-18 stat.ML cs.LG

Extrapolation Guarantees for Perturbation Modeling Under the Additive Latent Shift Assumption

在加性潜在位移假设下对扰动建模的外推保证

Julius von Kügelgen, Jakob Ketterer, Michael Vollenweider, Michael Scholkemper, Xinwei Shen, Nicolai Meinshausen, Jonas Peters

发表机构 * Seminar for Statistics, ETH Zurich(统计系,苏黎世联邦理工学院) ETH Zurich(苏黎世联邦理工学院) DZNE, Bonn, Germany(波恩德国DZNE) Department of Statistics, University of Washington, Seattle, USA(华盛顿大学统计系,美国西雅图)

AI总结 本文研究了在加性潜在位移假设下,通过扰动建模预测新扰动组合的分布,提出PDAE模型并证明了外推保证。

Comments Updated preprint with new material and empirical results; previous version presented at the ICLR'25 Workshop on Learning Meaningful Representations of Life

详情
AI中文摘要

我们考虑了建模如基因敲除等扰动对测量(如单细胞RNA计数)的影响问题。给定某些扰动的数据,我们旨在预测新扰动组合的测量分布。为此,我们假设扰动在合适但未知的嵌入空间中是加性的。我们将数据生成过程建模为潜在变量模型,其中扰动相当于潜在空间中的均值位移,并且可以加性组合。我们证明,在训练扰动足够多样时,表示和扰动效应可识别到正交变换为止,并利用此推导出对未见扰动的外推保证,这些未见扰动可表示为已见扰动的线性组合。为了从数据中估计模型,我们提出扰动分布自编码器(PDAE),该模型通过最大化真实与模拟扰动分布之间的分布相似性进行训练。训练后的模型可用于预测之前未见的扰动分布。为了支持我们的理论结果,我们通过模拟展示了PDAE能够准确预测未见但可识别的扰动效应,并在组合基因扰动数据上展示了该方法。

英文摘要

We consider the problem of modeling the effects of perturbations like gene knockouts on measurements such as single-cell RNA counts. Given data for some perturbations, we aim to predict the distribution of measurements for new combinations of perturbations. To address this challenging extrapolation task, we posit that perturbations act additively in a suitable, unknown embedding space. We formulate the data-generating process as a latent variable model, in which perturbations amount to mean shifts in latent space and can be combined additively. We then prove that, given sufficiently diverse training perturbations, the representation and perturbation effects are identifiable up to orthogonal transformation and use this to derive extrapolation guarantees for unseen perturbations that can be expressed as linear combinations of seen ones. To estimate the model from data, we propose the perturbation distribution autoencoder (PDAE), which is trained by maximizing the distributional similarity between true and simulated perturbation distributions. The trained model can then be used to predict previously unseen perturbation distributions. In support of our theoretical results, we demonstrate through simulations that PDAE can accurately predict the effects of unseen but identifiable perturbations, and showcase the method on combinatorial gene perturbation data.

2504.09006 2026-05-18 cs.GT cs.LG

Learning in Structured Stackelberg Games

在结构化Stackelberg游戏中学习

Maria-Florina Balcan, Kiriaki Fragkia, Keegan Harris

发表机构 * Carnegie Mellon University(卡内基梅隆大学) University of California(加州大学)

AI总结 本文研究了结构化Stackelberg游戏,提出Stackelberg-Littlestone维度以优化在线学习算法,并在分布设定中提供样本复杂度的上下界。

Comments Accepted as a spotlight paper to ICML 2026

详情
AI中文摘要

我们首次研究了结构化Stackelberg游戏,这是一种领导者与跟随者之间的新型战略互动形式,其中上下文信息可以预测跟随者的(未知)类型。受安全游戏和AI安全应用的启发,我们展示了这种额外结构如何帮助领导者在在线和分布设定中学习最优效用策略。在在线设定中,我们证明标准学习理论复杂性度量不刻画领导者学习任务的难度。值得注意的是,我们发现存在一种类似于在线分类中Littlestone维度的学习理论复杂性度量,能够紧密刻画领导者实例最优遗憾。我们将其称为Stackelberg-Littlestone维度,并利用它提供可证明最优的在线学习算法。在分布设定中,我们通过展示两个新维度控制样本复杂度的上界和下界,提供了类比结果。

英文摘要

We initiate the study of structured Stackelberg games, a novel form of strategic interaction between a leader and a follower where contextual information can be predictive of the follower's (unknown) type. Motivated by applications such as security games and AI safety, we show how this additional structure can help the leader learn a utility-maximizing policy in both the online and distributional settings. In the online setting, we first prove that standard learning-theoretic measures of complexity do not characterize the difficulty of the leader's learning task. Notably, we find that there exists a learning-theoretic measure of complexity, analogous to the Littlestone dimension in online classification, that tightly characterizes the leader's instance-optimal regret. We term this the Stackelberg-Littlestone dimension, and leverage it to provide a provably optimal online learning algorithm. In the distributional setting, we provide analogous results by showing that two new dimensions control the sample complexity upper- and lower-bound.

2503.23927 2026-05-18 stat.ML cs.LG

Detecting Localized Density Anomalies in Multivariate Data via Coin-Flip Statistics

通过硬币翻转统计检测多变量数据中的局部密度异常

Sebastian Springer, Andre Scaffidi, Maximilian Autenrieth, Gabriella Contardo, Alessandro Laio, Roberto Trotta, Heikki Haario

发表机构 * Scuola Internazionale Superiore di Studi Avanzati (SISSA)(国际先进研究高等学院) University of Cambridge(剑桥大学) Imperial College London(伦敦帝国理工学院) University of Nova Gorica(诺瓦戈里察大学) LUT University(卢托拉大学)

AI总结 本文提出EagleEye方法,通过编码k近邻列表为二进制序列,检测多变量数据中的局部过密度和欠密度异常,并在三种场景中验证其有效性。

Comments Code Availability: The code used to generate the results of this study is available at GitHub via the link: https://github.com/sspring137/EagleEye

详情
AI中文摘要

检测两个样本之间的局部差异是科学数据分析的核心任务,用于识别信号事件、制度变化或模型不匹配。我们引入EagleEye方法,通过将有序k近邻列表编码为二进制成员序列,并测试该序列中累积成功次数是否与二项式(硬币翻转)空模型一致,来定位多变量特征空间中的局部过密度和欠密度。在存在真实局部异常时,邻居将优先属于其中一个数据集,导致相对于二项式空模型的“成功”次数过多。这些局部点检测通过确定性细化程序整合为可解释的异常集,同时可以估计不可约背景和局部密度异常纯度。我们通过三种场景展示了EagleEye的有效性:首先考虑具有已知局部过密度和欠密度的人工数据示例;其次展示EagleEye在粒子对撞机实验中检测新物理现象时在系统背景建模差异下的应用;最后进行气候分析研究,揭示了时空温度模式重复中的局部变化。

英文摘要

Detecting localized differences between two samples is a central task in scientific data analysis, required for the identification of signal events, regime changes, or model mismatch. We introduce EagleEye, a method that pinpoints local over- and under-densities in multivariate feature spaces. EagleEye assigns each point an anomaly score by encoding its ordered k-nearest-neighbour list as a binary membership sequence and testing whether the cumulative number of successes in this sequence is consistent with a binomial (coin-flipping) null model. In the presence of a genuine local anomaly, neighbours will preferentially belong to one of the two datasts, yielding an excess of ``successes'' relative to the binomial null model. These local, pointwise detections are consolidated into interpretable anomaly sets through a deterministic refinement procedure that can also estimate the irreducible background and local density anomaly purity. We demonstrate EagleEye's efficacy in three scenarios. We first consider an artificial data example with known localized over- and under-densities. Second, we demonstrate how EagleEye may be used for new physics searches at particle collider experiments in the presence of systematic background modelling differences. Finally, we conduct a climate analysis study that reveals localized changes in spatiotemporal temperature-pattern recurrence.

2502.04271 2026-05-18 quant-ph cs.LG

Variational decision diagrams for quantum-inspired machine learning applications

变分决策图用于量子启发式机器学习应用

Vladimir Vargas-Calderón, Santiago Acevedo-Mancera, Herbert Vinck-Posada

发表机构 * D-Wave Systems(D-Wave系统) Zapata Computing Inc.(Zapata计算公司) Grupo de Superconductividad y Nanotecnología, Departamento de Física, Universidad Nacional de Colombia(超导与纳米技术小组,物理系,哥伦比亚国家大学)

AI总结 本文提出变分决策图,结合决策图结构优势与变分方法适应性,用于高效表示量子态,解决Ising和Heisenberg哈密顿量的基态估计问题,证明训练可行性。

Comments 11 pages, 3 figures, presented at Quantum Information in Spain (ICE-9)

详情
AI中文摘要

决策图(DDs)因能利用量子态和操作中的数据冗余,已成为模拟量子电路的有效工具。然而其在量子机器学习(QML)中的应用尚未被探索。本文引入变分决策图(VDDs),一种结合DD结构优势与变分方法适应性的新型图结构。我们通过将VDDs应用于横向场Ising和Heisenberg哈密顿量的基态估计问题,研究其可训练性。梯度方差分析表明,VDDs的训练是可能的,未观察到消失梯度(即 barren plateaus)的现象。本文为在QML中使用决策图作为变分方案设计和训练的替代方法提供了新见解。

英文摘要

Decision diagrams (DDs) have emerged as an efficient tool for simulating quantum circuits due to their capacity to exploit data redundancies in quantum states and quantum operations, enabling the efficient computation of probability amplitudes. However, their application in quantum machine learning (QML) has remained unexplored. This paper introduces variational decision diagrams (VDDs), a novel graph structure that combines the structural benefits of DDs with the adaptability of variational methods for efficiently representing quantum states. We investigate the trainability of VDDs by applying them to the ground state estimation problem for transverse-field Ising and Heisenberg Hamiltonians. Analysis of gradient variance suggests that training VDDs is possible, as no signs of vanishing gradients--also known as barren plateaus--are observed. This work provides new insights into the use of decision diagrams in QML as an alternative to design and train variational ansätze.

2412.11308 2026-05-18 stat.ML cs.LG

From XAI to MLOps: Explainable Concept Drift Detection with Profile Drift Detection

从XAI到MLOps:基于轮廓漂移检测的可解释概念漂移检测

Ugur Dar, Mustafa Cavus

发表机构 * Eskisehir Technical University(埃斯克谢希尔技术大学)

AI总结 本文提出轮廓漂移检测方法,利用可解释AI工具部分依赖性轮廓图,通过新的漂移度量标准检测概念漂移并理解其原因,实验表明其在保持预测性能的同时有效平衡了漂移信号的敏感性和稳定性。

Comments 15 pages, 6 figures

Journal ref Future Generation Computer Systems (2026)

详情
AI中文摘要

预测模型的性能往往因数据分布的变化而下降,这种现象称为数据漂移。其中,概念漂移(解释变量与响应变量之间的关系变化)尤其难以检测和适应。传统漂移检测方法通常依赖准确率或边缘变量分布等指标,可能无法捕捉到微妙但重要的概念变化。本文提出了一种新方法,轮廓漂移检测(PDD),通过利用可解释AI工具部分依赖性轮廓图(PDPs),实现了对概念漂移的检测和对其潜在原因的深入理解。PDD通过新的漂移度量标准量化PDPs的变化,这些度量标准对数据流中的变化敏感,同时保持计算效率。该方法与MLOps实践一致,强调在动态环境中持续的模型监控和适应性重训练。在合成和实际数据集上的实验表明,PDD在保持高预测性能的同时,有效平衡了漂移信号的敏感性和稳定性。结果突显了其在实时应用中的适用性,本文最后讨论了该方法的优势、限制以及向更广泛应用场景扩展的潜力。

英文摘要

Predictive models often degrade in performance due to evolving data distributions, a phenomenon known as data drift. Among its forms, concept drift, where the relationship between explanatory variables and the response variable changes, is particularly challenging to detect and adapt to. Traditional drift detection methods often rely on metrics such as accuracy or marginal variable distributions, which may fail to capture subtle but important conceptual changes. This paper proposes a novel method, Profile Drift Detection (PDD), which enables both the detection of concept drift and an enhanced understanding of its underlying causes by leveraging an explainable AI tool: Partial Dependence Profiles (PDPs). PDD quantifies changes in PDPs through new drift metrics that are sensitive to shifts in the data stream while remaining computationally efficient. This approach is aligned with MLOps practices, emphasizing continuous model monitoring and adaptive retraining in dynamic environments. Experiments on synthetic and real-world datasets demonstrate that PDD outperforms existing methods by maintaining high predictive performance while effectively balancing sensitivity and stability in drift signals. The results highlight its suitability for real-time applications, and the paper concludes by discussing the method's advantages, limitations, and potential extensions to broader use cases.

2407.20240 2026-05-18 cs.CY cs.AI

Social and Ethical Risks Posed by General-Purpose LLMs for Settling Newcomers in Canada

通用大型语言模型对加拿大新移民融入社会的潜在风险

Isar Nejadgholi, Maryam Molamohammadi, Samir Bakhtawar

发表机构 * National Research Council Canada(加拿大国家研究委员会) Mila - Quebec Artificial Intelligence Institute(魁北克人工智能研究所)

AI总结 研究探讨通用大语言模型在移民安置领域可能带来的风险,强调需开发定制化AI工具以确保人类监督与责任。

Comments 26 pages, 8 figures

详情
AI中文摘要

加拿大非营利安置部门支持新移民实现成功融入。该部门面临日益增长的操作压力,凸显了提高效率和创新的必要性,可能通过可靠的AI解决方案实现。随意使用通用生成式AI,如ChatGPT,可能成为移民和服务机构的常见做法,但这些工具未针对安置领域进行优化,可能对移民和难民产生有害影响。本文探讨这些工具可能对新移民造成的风险,警告避免未经监管的生成式AI使用,并鼓励进一步研究开发AI素养课程及定制化LLM,使其符合受影响社区的偏好。关键在于此类技术应无缝集成到安置部门现有流程中,确保人类监督、可信度和问责制。

英文摘要

The non-profit settlement sector in Canada supports newcomers in achieving successful integration. This sector faces increasing operational pressures amidst rising immigration targets, which highlights a need for enhanced efficiency and innovation, potentially through reliable AI solutions. The ad-hoc use of general-purpose generative AI, such as ChatGPT, might become a common practice among newcomers and service providers to address this need. However, these tools are not tailored for the settlement domain and can have detrimental implications for immigrants and refugees. We explore the risks that these tools might pose on newcomers to first, warn against the unguarded use of generative AI, and second, to incentivize further research and development in creating AI literacy programs as well as customized LLMs that are aligned with the preferences of the impacted communities. Crucially, such technologies should be designed to integrate seamlessly into the existing workflow of the settlement sector, ensuring human oversight, trustworthiness, and accountability.

2605.15765 2026-05-18 cs.CG cs.DS cs.RO math.OC

Optimizing Line Segment Inspection with Limited-Range Drones

利用有限范围无人机优化线段检测

José-Miguel Díaz-Báñez, José-Manuel Higes, Alina Kasiuk, Inmaculada Ventura

发表机构 * Department of Applied Mathematics, University of Seville, Spain(应用数学系,塞维利亚大学,西班牙)

AI总结 本文研究如何利用无人机高效检测线段,提出近似算法解决NP难问题,证明在单行线段和两架无人机情况下问题的复杂性,实验显示算法在多种场景下表现接近最优。

Comments 28 pages, 14 figures

详情
AI中文摘要

无人机优化问题在民用任务中广泛研究,主要因其能穿越崎岖地形并携带摄像头等传感器进行监视任务。这些空中机器人有限的电池寿命给运筹学研究带来挑战。本文解决以下优化问题:给定一组线段(如太阳能电站中的管道)进行检测,目标是利用人工智能检测损坏的管道,路径规划必须高效进行。一方面,电池容量有限需要定期访问固定基站,但希望为每架无人机分配一组行程以确保快速覆盖线段,旨在最小化makespan,即任何无人机的最大时间。我们能证明该优化问题即使在线段位于一条线上且仅涉及两架无人机的情况下也是强NP难的。然后提出了近似算法。我们的计算实验表明,所提出的算法在多种运营场景下实现了接近最优的性能。

英文摘要

Optimization problems with drones are widely studied in a variety of civilian tasks, mainly due to their ability to traverse rough terrains and to carry cameras and other sensors for surveillance tasks. The limited battery life of these aerial robots poses challenges in operational research. In this paper, we address the following optimization problem. We are given a set of line segments (e.g. tubes in a solar plant) to inspect by drones. The objective is to detect broken pipes using artificial intelligence and path planning must be carried out efficiently. On the one hand, the limited capacity of the batteries necessitates periodic visits (tours) to a fixed base station. However, it is desirable to allocate a set of tours for each drone to ensure that the segments are covered as quickly as possible, aiming to minimize the makespan, which is the maximum time spent by any drone. We are able to prove that this optimization problem is strongly NP-hard even when the segments are positioned on a line and the scenario involves only two drones. Then, approximation algorithms are proposed. Our computational experiments demonstrate that the proposed algorithm achieves near-optimal performance across diverse operational scenarios.

2605.15733 2026-05-18 cs.NE cs.AI cs.CV

Structure Abstraction and Generalization in a Hippocampal-Entorhinal Inspired World Model

在启发式世界模型中的结构抽象与泛化

Tianqiu Zhang, Muyang Lyu, Xiao Liu, Si Wu

发表机构 * Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, IDG/McGovern Institute for Brain Research, Center of Quantitative Biology, School of Psychological and Cognitive Sciences, Key Laboratory of Machine Perception (Ministry of Education), Peking University(北京大学-清华大学生命科学中心,先进跨学科研究院,IDG/麦克戈文脑科学研究院,定量生物学中心,心理与认知科学学院,机器感知重点实验室(教育部),北京大学)

AI总结 本文提出了一种脑启发的分层模型,通过逆向模型提取潜在转换并构建预测视觉世界模型,展示了在连续高维动态中同时提取抽象结构的能力,实现了结构泛化。

Comments Project page: https://hpc-mec-worldmodel.github.io/

详情
AI中文摘要

人类将经验抽象为结构化表示以促进模式推断和知识转移。尽管海马-内侧颞叶(HPC-MEC)回路已知能表示空间和概念空间,但如何同时从连续、高维动态中提取抽象结构的机制仍不明确。我们提出了一种脑启发的分层模型,同时推断潜在转换并构建预测视觉世界模型。该架构采用逆向模型进行结构提取,同时结合HPC-MEC耦合模型,将关系结构(MEC)与整合的事件场景(HPC)分离。通过使用原始变换动态作为基准,我们展示了该模型在结构抽象方面的能力。通过利用速度驱动的路径整合,该框架能够在不同情境中实现稳健的预测和结构重用,从而实现结构泛化。本文提供了一个新的计算框架,用于理解如何通过脑启发的自监督学习世界模型,促进可重用的抽象知识的获取。

英文摘要

Humans abstract experiences into structured representations to facilitate pattern inference and knowledge transfer. While the hippocampal-entorhinal (HPC-MEC) circuit is known to represent both spatial and conceptual spaces, the mechanisms for concurrently extracting abstract structures from continuous, high-dimensional dynamics remain poorly understood. We propose a brain-inspired hierarchical model that simultaneously infers latent transitions and constructs a predictive visual world model. Our architecture employs an inverse model for structural extraction alongside an HPC-MEC coupling model that dissociates relational structures (MEC) from integrated episodic scenes (HPC). Using primitive transformation dynamics as a benchmark, we demonstrate the model's capacity for structural abstraction. By leveraging velocity-driven path integration, the framework enables robust prediction and structural reuse across diverse contexts, thereby achieving structural generalization. This work provides a novel computational framework for understanding how brain-inspired, self-supervised learning of world models facilitates the acquisition of reusable abstract knowledge.

2605.15714 2026-05-18 cs.SE cs.AI

Position: Early-Stage Quality Assurance in Annotation Pipelines Is More Cost-Effective Than Late-Stage Validation

位置:标注流程早期阶段的质量保证比后期验证更具成本效益

Sunil Kothari, Sumukha Sharma Thoppanahalli Chandramouli, Naman Khandelwal, Parth Kulshreshtha, Ashi Jain, Kriti Banka, Tanuja Chintada, Venkata Triveni, Gulipalli Praveen Kumar, Manish Mehta, Tao Liu

发表机构 * Centific AI Research(科学人工智能研究)

AI总结 本文指出标注流程早期质量保证比后期验证更有效,强调时间因素对误差率和成本的影响,提出三种质量保证触发点并建议改进研究和实践方法。

Comments 8 pages

详情
AI中文摘要

本文主张机器学习社区应优先考虑标注流程早期阶段的质量保证,而非传统的后期验证。数据质量瓶颈日益限制基础模型的改进,然而质量保证研究几乎只关注验证方法而非验证时机。当验证发生时,不仅所采用的方法,根本上决定了误差率和标注成本。这种对时间的忽视令人费解,鉴于软件工程中已确立的“左移”原则,实证研究显示缺陷在后期发现时成本乘数为4-100倍(Boehm, 1981; Shull et al., 2002)。标注流程展现出类似动态:在标注开始前发现的错误成本仅为审查周期结束后发现的分数之一。我们提出三种质量保证触发点,即标注前(T0)、标注后(T1)和审查后(T2),将标注工作流分解为离散的验证机会。一个参数化的误差传播模型正式化了何时时间影响最终误差率 versus 仅经济因素,使时间成为可测量的设计变量而非配置后的考虑。对47篇近期论文的调查发现,仅有4%报告了验证发生的时间,这在相邻领域中显示出时间的影响,令人惊讶。如果没有对质量保证时间的明确关注,社区将有风险在优化验证方法的同时忽略可能最相关的结构性变量。采取这一立场需要三个步骤:研究人员应在报告质量保证时间配置的同时报告验证方法;标注平台应将时间作为首要参数暴露;并且社区应运行受控实验,直接测量各阶段的检测率。

英文摘要

This position paper argues that the machine learning community should prioritize early-stage quality assurance in annotation pipelines over the prevailing practice of late-stage validation. Data quality bottlenecks increasingly limit foundation model improvement, yet quality assurance research focuses almost exclusively on validation methods rather than validation timing. When validation occurs, not merely what methods are employed, fundamentally determines both error rates and annotation costs. This temporal neglect is puzzling given the well-established "shift-left" principle from software engineering, where empirical studies demonstrate 4--100x cost multipliers for defects detected in later stages (Boehm, 1981; Shull et al., 2002). Annotation pipelines exhibit analogous dynamics: errors caught before annotation begins cost a fraction of those discovered after review cycles complete. We propose a taxonomy of three QA trigger points, namely pre-annotation (T0), post-annotation (T1), and post-review (T2), that decompose annotation workflows into discrete validation opportunities. A parametric error-propagation model formalizes when timing affects final error rates versus only economics, making timing a measurable design variable rather than a configuration afterthought. A survey of 47 recent papers reveals that only 4% report when validation occurs, a striking gap given timing's demonstrated impact in adjacent fields. Without explicit attention to QA timing, the community risks optimizing validation methods while ignoring the structural variable that may matter most. Acting on this position requires three steps: researchers should report QA timing configurations alongside validation methods; annotation platforms should expose timing as a first-class parameter; and the community should run controlled experiments that measure stage-specific detection rates directly.

2605.15707 2026-05-18 eess.IV cs.CV

Evaluation of Anatomical Shape Priors in Deep Learning-Based Cardiac Multi-Compartment Segmentation

基于深度学习的心脏多腔分割中解剖形状先验的评估

Michael Hudler, Franz Thaler, Martin Urschler

发表机构 * Institute for Medical Informatics, Statistics and Documentation(医学信息学、统计学与文档研究所)

AI总结 本文评估了轻量级显式形状先验在心脏多腔CT分割中的效果,发现标准3D U-Net仍为强大基线,手工先验效果有限,未来需更 expressive 的学习先验。

Comments Published in the Proceedings of the Third Austrian Symposium on AI, Robotics, and Vision (AIRoV 2026), pp. 23-27

详情
AI中文摘要

全心多腔CT分割在临床中具有重要意义,但标准CNN未显式强制解剖合理性。基于训练数据统计,我们评估了轻量级显式形状先验,以形状感知损失和空间标签分布热图引导的U-Net变体改进3D心脏分割。在所有实验中,标准3D U-Net意外保持了非常强的基线,手工先验仅带来微小且不一致的变化,有时甚至退化性能。这些结果表明,基线已捕捉了显著的隐式解剖规律,未来改进可能需要更 expressive 的学习先验,而非简单的手工解剖形状约束。

英文摘要

Whole-heart multi-compartment CT segmentation is clinically important, but standard CNNs do not explicitly enforce anatomical plausibility. Based on statistics derived from the training data, we evaluate whether lightweight explicit shape priors, implemented as shape-aware losses and spatial label distribution heatmap-guided U-Net variants, improve 3D cardiac segmentation on MM-WHS CT and WHS++. Across all experiments, a standard 3D U-Net surprisingly remained a very strong baseline, with handcrafted priors yielding at best marginal and inconsistent changes and often degrading performance. These results suggest that the baseline already captures substantial implicit anatomical regularities and that future gains will likely require more expressive learned priors rather than simple handcrafted anatomical shape constraints.

2605.15688 2026-05-18 stat.ML cs.AI cs.LG math.PR

$α$-TCAV: A Unified Framework for Testing with Concept Activation Vectors

$α$-TCAV:基于概念激活向量的测试统一框架

Ekkehard Schnoor, Jawher Said, Malik Tiomoko, Wojciech Samek, Alexander Jung

发表机构 * Department of Computer Science(计算机科学系) Department of Artificial Intelligence(人工智能系) Aalto University(阿alto大学) Fraunhofer Heinrich Hertz Institute(弗劳恩霍夫海因里希·赫兹研究所) Department of Artificial Intelligence, Fraunhofer HHI(人工智能系,弗劳恩霍夫HHI研究所) Huawei Noah’s Ark Lab(华为诺亚实验室) Department of EECS, Technische Universität Berlin(电子工程与计算机科学系,柏林技术大学)

AI总结 本文提出$α$-TCAV框架,解决传统TCAV方法中因指示函数不连续导致的方差问题,通过参数化平滑函数统一概率表述,并提供参数调优指导,挑战现有实践惯例。

Comments 44 pages, 12 figures

详情
AI中文摘要

概念激活向量(CAVs)是深度学习中基于概念的可解释性基础工具,但其实际应用受限于统计不稳定性。本文分析了CAVs和TCAV方法的随机性质,推导了主要CAV类别的分布,包括PatternCAV、FastCAV和基于岭回归的CAV。识别了标准TCAV得分的根本缺陷:其依赖不连续指示函数导致关键区域方差不衰减。为此,引入$α$-TCAV,一种通用框架,用参数化平滑函数替代指示函数,得到统一的概率表述,涵盖TCAV和Multi-TCAV。刻画了灵敏度得分和不同TCAV变体的诱导分布,显示现有最先进的选择缺乏理论依据。提供原理指导,调优$α$-TCAV参数:要么以较低计算成本模仿Multi-TCAV,要么获得校准的贝叶斯最优概率度量。最终分析产生实用建议,挑战现有惯例:最显著的是将全部采样预算分配给单一CAV而非多个。

英文摘要

Concept Activation Vectors (CAVs) are a fundamental tool for concept-based explainability in deep learning, yet their practical utility is limited by statistical instability. We analyze the stochastic nature of CAVs and the Testing with CAVs (TCAV) method, deriving the distributions of major CAV classes including PatternCAV, FastCAV, and ridge regression-based CAVs. We then identify a fundamental flaw in the standard TCAV score: its reliance on a discontinuous indicator function induces non-decaying variance in critical regimes. To address this, we introduce $α$-TCAV, a generalized framework that replaces the indicator with a parameterized smooth function, yielding a unified probabilistic formulation that subsumes both TCAV and Multi-TCAV. We characterize the induced distributions of sensitivity scores and different TCAV variants, showing that established state-of-the-art choices lack theoretical justification. We provide principled guidance on tuning the parameter in $α$-TCAV -- either to imitate Multi-TCAV at substantially lower computational cost, or to obtain a calibrated Bayes-optimal probabilistic measure of a concept's influence. Finally, our analysis yields practical recommendations that challenge established routines: most notably, allocating the full sampling budget to a single CAV rather than splitting it across several.

2605.15681 2026-05-18 cs.GR cs.CV

DealMaTe: Multi-Dimensional Material Transfer via Diffusion Transformer

DealMaTe: 通过扩散变换器实现多维材料传输

Nisha Huang, Yizhou Lin, Jie Guo, Xiu Li, Tong-Yee Lee, Zitong Yu

发表机构 * Tsinghua University(清华大学) Pengcheng Laboratory(鹏城实验室) National Cheng-Kung University(国立成功大学) Great Bay University(大湾大学) Dongguan Key Laboratory for Intelligence and Information Technology(东莞智能与信息技术重点实验室)

AI总结 DealMaTe通过深度、规范和光照图像实现材料传输,采用简化扩散框架,消除文本引导和参考网络,设计轻量3D信息注入方法,优化注意力机制,实现高效高质量的材料传输。

详情
AI中文摘要

最近,基于扩散的材料传输方法依赖于图像微调或复杂的架构和辅助网络,面临文本依赖、额外计算成本和特征对齐等挑战。为了解决这些限制,我们提出了DealMaTe,使用深度、规范和光照图像进行材料传输。DealMaTe是一种简化扩散框架,消除了文本引导和参考网络。我们设计了一种轻量的3D信息注入方法,多维3D着色LoRA,无需修改基础模型权重,实现了兼容的控制条件,并获得了和谐稳定的结果。此外,我们通过着色因果互注意力机制优化注意力机制,并使用键值(KV)缓存来减少由多个条件引起的推理延迟,提高计算效率,并在低架构复杂度下实现高质量的材料传输结果。广泛的实验涵盖了各种物体和光照条件,一致地证明DealMaTe在任意输入材料下实现了显著的高保真材料传输。代码可在https://github.com/haha-lisa/DealMaTe上获得。

英文摘要

Recently, diffusion-based material transfer methods rely on image fine-tuning or complex architectures with auxiliary networks but face challenges such as text dependency, additional computational costs, and feature misalignment. To address these limitations, we propose \textbf{DealMaTe}, using \underline{\textbf{de}}pth, norm\underline{\textbf{a}}l, and \underline{\textbf{l}}ighting images for \underline{\textbf{ma}}terial \underline{\textbf{t}}ransf\underline{\textbf{e}}r. DealMaTe is a simplified diffusion framework that eliminates text guidance and reference networks. We design a lightweight 3D information injection method, Multi-Dim 3D Shader LoRA, which, without modifying the base model weights, enables compatible control conditions and achieves harmonious and stable results. Additionally, we optimize the attention mechanism with Shader Causal Mutual Attention and key-value (KV) caching to reduce inference latency caused by multiple conditions, improve computational efficiency, and achieve high-quality material transfer results with low architectural complexity. Extensive experiments covering a wide variety of objects and lighting conditions consistently demonstrate that DealMaTe achieves remarkable high-fidelity material transfer under arbitrary input materials. The code is available at https://github.com/haha-lisa/DealMaTe.

2605.15673 2026-05-18 eess.IV cs.CV cs.LG

Highly Detailed and Generalizable Broadleaf Tree Crown Instance Segmentation from UAV Imagery

基于无人机影像的高精度通用性阔叶林树冠实例分割

Mitsutaka Nakada, Takahiko Ikebata, Kengo Ikebata, Yuji Mizuno, Yusuke Onoda, Ryuichi Takeshige, Kyaw Kyaw Htoo, Kanehiro Kitayama, Robert Ong, Masanori Onishi

发表机构 * DeepForest Technologies Co., Ltd.(深森林技术有限公司) YM Lab.(YM实验室) Graduate School of Agriculture, Kyoto University(京都大学农业研究院) Graduate School of Science, Osaka Metropolitan University(大阪 metropolitan 大学理学研究院) Faculty of Tropical Forestry, Universiti Malaysia Sabah(马来西亚沙巴大学热带林业学院) Forest Research Centre, Sabah Forestry Department(沙巴林业部门森林研究中心)

AI总结 本文提出一种高精度树冠实例分割模型,通过无人机影像实现阔叶林中单个树冠的精确定界,利用大规模高质量标注数据集提升分割性能,适用于复杂结构和不同地理生物的森林环境。

Comments 12 pages, 5 figures, 3 Tables

详情
AI中文摘要

我们提出了一种高精度的实例分割模型,用于利用无人机获取的高空影像确定自然阔叶林中单个树冠。阔叶林中的树冠界定比其他森林类型更具挑战性,因为树冠形状多样且缺乏明显树顶。为解决这一问题,我们开发了一个基于深度学习的树冠分割模型,该模型在高质量标注的树冠轮廓上进行训练。我们通过熟练标注员手动定义了18,507个树冠多边形,从日本七个森林收集的正射影像中,并基于Mask2Former开发了多个主干架构的模型。最佳模型仅使用RGB影像即可在结构复杂的阔叶林中实现高分割性能。当应用于日本不同地理区域的森林以及婆罗洲生物不同的热带雨林时,性能仍然保持。这些结果表明,使用大量高质量标注数据集对于实现跨多样森林生态系统精确且通用的树冠分割至关重要。所开发的模型已整合到DF Scanner Pro软件中,该软件支持使用无人机进行实际森林监测,这种实现预计能够使广泛用户从无人机分析阔叶林的树级信息。

英文摘要

We present a highly detailed instance segmentation model for delineating individual tree crowns in natural broadleaf forests using aerial imagery acquired by unmanned aerial vehicles (UAVs). Tree crown delineation in broadleaf forests is more challenging than in other forest types due to diversity of crown shapes and the lack of clearly defined treetops. To address this issue, we developed a deep-learning-based crown segmentation model trained on high-quality annotated crown outlines. We manually delineated 18,507 crown polygons from orthomosaic images collected across seven forests in Japan by skilled annotators, and developed a model based on Mask2Former with multiple backbone architectures. The best model achieved high segmentation performance in structurally complex broadleaf forests using only RGB imagery. This performance was maintained when applied to geographically distinct forests within Japan, as well as to biologically distinct tropical rainforests in Borneo. These results demonstrate that using a large number of high-quality annotated datasets is critical for achieving detailed and generalizable crown segmentation across diverse forest ecosystems. The developed model has been integrated into DF Scanner Pro, a software that supports practical forest monitoring using UAVs, and this implementation is expected to enable a wide range of users to analyze tree-level information in broadleaf forest from UAVs.

2605.15671 2026-05-18 eess.IV cs.CV

Degradation-Aware Blur-Segmentation of Brain Tumor

考虑退化因素的脑肿瘤模糊分割

Yuchun Wang, Xiaosong Li, Gefei Liang, Yang Liu

发表机构 * School of Physics and Optoelectronic Engineering, Foshan University, China(物理与光电工程学院,佛山大学,中国)

AI总结 本文提出DABSeg网络,通过同步去模糊和精确分割,提升多模态3D脑肿瘤分割在退化条件下的鲁棒性与临床实用性。

详情
AI中文摘要

多模态3D MRI脑肿瘤分割是放疗目标勾画、手术规划和治疗后评估的关键步骤。现有方法通常假设MRI图像无伪影,但扫描过程中不可避免的患者运动引入伪影和模糊,导致边界和纹理特征退化,影响分割性能。为此,我们引入考虑退化因素的模糊分割网络(DABSeg),一种同步去模糊的3D多模态MRI分割网络,统一了模糊去除和准确分割。具体而言,我们提出一个特征域运动去模糊茎以补偿模糊并平衡强度。同时,骨干网络嵌入了一个模糊感知的跨模态交叉注意力模块和多尺度残差聚合,以实现有效的模态互补性。值得注意的是,我们优化了一个联合损失,结合加权Dice与清晰参考重建项,其中不平衡的权重应用于小目标以增强学习强度和预测稳定性,以小病变和边界区域。系统比较和消融实验在BraTS2020数据集上,无论是清晰还是退化条件均一致表明,DABSeg在肿瘤Dice分数和边界精度上优于现有最先进方法。这些结果验证了考虑退化因素的跨任务协作学习在提升多模态3D脑肿瘤分割在现实退化条件下的鲁棒性和临床实用性方面的有效性。源代码可在https://github.com/YuchunWang24/DABSeg_ICPR获取。

英文摘要

Multimodal 3D MRI brain tumor segmentation is a pivotal step in radiotherapy target delineation, surgical planning and post-treatment assessment. Existing methods often assume artifact-free MRI images. However, inevitable patient motion during scanning introduces artifacts and blur that degrade boundary and texture features, leading to poor segmentation performance. To bridge this gap, we introduce Degradation-Aware Blur-Segmentation Net (DABSeg), a synchronous deblurring 3D multimodal MRI segmentation network that unifies blur removal and accurate segmentation. Specifically, we propose a feature-domain motion-deblurring stem to compensate for blur and rebalance intensity. Concurrently, the backbone network embeds a blur-aware cross-modal cross-attention module and multi-scale residual aggregation to yield effective modality complementarity. Notably, we optimize a joint loss that combines weighted Dice with a clear-reference reconstruction term, where imbalanced weights are applied to small targets to boost learning intensity and predictive stability for small lesions and border regions. Systematic comparisons and ablation experiments on the BraTS2020 dataset under both clear and degenerative conditions consistently demonstrate that DABSeg surpasses state-of-the-art methods in tumor Dice score and boundary precision. These results validate the effectiveness of degenerative-aware cross-task collaborative learning in improving the robustness and clinical utility of multi-modal 3D brain tumor segmentation under realistic degenerative conditions. The source code is available at https://github.com/YuchunWang24/DABSeg_ICPR

2605.15656 2026-05-18 eess.SP cs.AI

TFZ-Tree: An Ultra-Lightweight Waveform Classification Framework for Resource-Constrained Devices

TFZ-Tree:一种面向资源受限设备的超轻量波形分类框架

Hao Wang, Kuang Zhang, Yonggang Chi, Tianqi Zhao, Yanbo Fu, Jiaxing Guo

发表机构 * x86 platform(x86平台) Einstein-sworder

AI总结 本文提出TFZ-Tree框架,通过时间频率多维特征和优化的Z检验树实现超轻量波形分类,实现在资源受限设备上实时识别十种物联网波形类型,测试精度达99.5%。

详情
AI中文摘要

在6G物联网多波形共存趋势下,智能接收器必须首先识别物理层波形类型才能正确解调和资源调度。然而,现有信号识别研究主要聚焦于符号级调制分类,直接针对物理层波形类型(如OFDM、OTFS、LoRa)的研究极为稀缺,且依赖深度神经网络和复杂时频变换,难以部署在资源受限终端。符号调制分类方法本身也无法规避“波形识别先于解调”的前提。为解决这一双重缺口,本文提出一种基于时频多维特征的超轻量波形分类框架,采用低复杂度时域特征提取,分类后端采用优化的Z检验树,利用假设检验置信度自动控制决策树分裂和大小,确保在资源有限处理器上高效执行。在包含OFDM、OTFS、DSSS、LoRa和NB-IoT在内的十种6G候选波形上测试,方法在AWGN信道下平均精度达99.5%,在TDL-C多径信道下为87.4%,主要混淆OTFS与LoRa。在x86平台用C语言实现,单次推理延迟低于4ms。据所知,这是首次实现十种物联网波形类型实时识别的工作。未来工作将针对嵌入式MCU上的部署加速。代码和数据集已开源:https://github.com/Einstein-sworder/IoT-wave.

英文摘要

Under the trend of multi-waveform coexistence in 6G IoT, intelligent receivers must first identify physical-layer waveform types before performing correct demodulation and resource scheduling. However, existing signal identification research largely focuses on symbol-level modulation classification. Research directly targeting physical-layer waveform types (e.g., OFDM, OTFS, LoRa) is not only extremely scarce but also heavily reliant on deep neural networks and complex time-frequency transforms, making deployment on resource-constrained terminals difficult. Symbol modulation classification methods themselves cannot circumvent the prerequisite of ``waveform identification first.'' To address this dual gap, we propose an ultra-lightweight waveform classification framework based on time-frequency multidimensional features with a cooperative Z-test tree (ZTree). The framework employs low-complexity time-domain feature extraction, and the classification backend adopts a ZTree optimized by Z-statistical testing, which uses hypothesis testing confidence to automatically control decision tree splitting and size, ensuring efficient execution on resource-limited processors. Tested on ten 6G candidate waveforms including OFDM, OTFS, DSSS, LoRa, and NB-IoT, the method achieves 99.5\% average accuracy under AWGN and 87.4\% under TDL-C multipath channels, with main confusion between OTFS and LoRa. Implemented in C on an x86 platform, single inference latency is under 4~ms. To the best of our knowledge, this is the first work achieving real-time recognition of ten IoT waveform types. Future work will target deployment acceleration on embedded MCUs. Code and dataset are open-sourced at: https://github.com/Einstein-sworder/IoT-wave.

2605.15630 2026-05-18 physics.chem-ph cond-mat.stat-mech cs.LG

Reweighting free energy profiles between universal machine learning interatomic potentials for fast consensus building

在通用机器学习原子势能函数之间重新加权自由能轮廓以实现快速共识构建

Sauradeep Majumdar, Miguel Steiner, Johannes C. B. Dietschreit, Swagata Roy, Daniel Willimetz, Lukaš Grajciar, Rafael Gómez-Bombarelli

发表机构 * Department of Material Science and Engineering, Massachusetts Institute of Technology(材料科学与工程系,麻省理工学院) Institute of Theoretical Chemistry, University of Vienna(理论化学研究所,维也纳大学) Department of Physical and Macromolecular Chemistry, Charles University(物理与大分子化学系,查尔斯大学)

AI总结 本文提出一种系统且可扩展的框架,通过重新加权潜在平均力,实现不同机器学习势能函数之间的自由能轮廓匹配,从而在低计算成本下获得高精度的热力学性质。

Comments 19 pages, 4 figures, 1 table, SI appended

详情
AI中文摘要

自由能轮廓在微观原子波动与宏观热力学可观测量之间起到桥梁作用。沿反应坐标估计自由能轮廓(即平均势能轮廓,PMF)的密度泛函理论(DFT)精度计算成本很高。通用机器学习原子势能函数(MLIPs)大幅降低了这一成本,但其精度取决于训练数据,因此对于特定系统可能不确定。本文提出一种系统且可扩展的框架,用于重新加权PMF,初始由单一‘源’MLIP采样,然后扩展到代表性目标MLIP集合。由于传统直接指数重新加权在大系统中因相空间重叠低而失效,我们部署了稳健的分析修正。应用于复杂601原子系统中的Li+在纳米受限电解质中的传输,证明平均能量间隙近似有效避免了统计崩溃,产生高度稳定的PMF匹配目标PMF。使用此方法,我们可以在多个DFT参考水平(PBE+D3、PBE-sol、r²SCAN、r²SCAN-D4)下以远低于完整模拟的计算成本恢复高保真的目标热力学性质。进一步的热力学分析表明,所研究的MLIPs根据其训练数据分为两个不同的簇。我们的重新加权框架即使在相空间重叠极低时也能恢复目标热力学性质——特别是反应和激活自由能。最终,此方法建立了一种关键的诊断协议,以在不冗余且资源密集的模拟下实现材料化学性质的跨模型共识。

英文摘要

Free energy profiles serve as a fundamental bridge between microscopic atomic fluctuations and macroscopic thermodynamic observables. Estimating the free energy profile along a reaction coordinate, referred to as the potential of mean force (PMF), with density functional theory (DFT) accuracy is computationally expensive. Universal machine learning interatomic potentials (MLIPs) drastically reduce this cost, but their accuracy is strongly determined by their training data and hence can be uncertain for a given system. In this work, we present a systematic and scalable framework for reweighting PMFs, initially sampled with a single 'source' MLIP, across a representative suite of target MLIPs. Because traditional direct exponential reweighting fails for large system sizes due to low phase-space overlap between potentials, we deploy robust analytical corrections. Applying this to a complex 601-atom system of Li$^+$ transport in a nanoconfined electrolyte, we demonstrate that a mean energy-gap approximation effectively bypasses statistical collapse, producing a highly stable PMF matching the target PMF. Using this approach, we recover high-fidelity target thermodynamics across multiple DFT reference levels (PBE+D3, PBE-sol, r$^2$SCAN,r$^2$SCAN-D4) at a fraction of the computational cost of full simulations. Furthermore, thermodynamic analysis reveals that the studied MLIPs partition into two distinct clusters driven by their training data. Our reweighting framework successfully recovers target thermodynamic properties--specifically, reaction and activation free energies--even when the phase-space overlap between potentials is critically low. Ultimately, this approach establishes a vital diagnostic protocol to achieve affordable cross-model consensus on materials chemistry properties without redundant, resource-intensive simulations.