arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2510.21546 2026-06-19 eess.SY cs.SY 版本更新

Auction-Based Responsibility Allocation for Scalable Decentralized Safety Filters in Cooperative Multi-Agent Collision Avoidance

基于拍卖的责任分配用于可扩展的去中心化安全滤波器在多智能体协同避碰中

Johannes Autenrieb, Mark Spiller

AI总结提出基于高阶控制屏障函数和拍卖责任分配的可扩展去中心化安全滤波器，通过非对称分配约束减少计算负荷，实现多智能体协同避碰。

Comments 6 pages, 3 figures, accepted for presentation at the IFAC World Congress 2026

2510.00831 2026-06-19 cs.AI cs.LG eess.SP 版本更新

Controlled Comparison of Machine Learning Models for Fault Classification and Localization in Power System Protection

电力系统保护中故障分类与定位的机器学习模型受控比较

Julian Oelhaf, Georg Kordowich, Changhun Kim, Paula Andrea Pérez-Toro, Christian Bergler, Andreas Maier, Johann Jäger, Siming Bayer

发表机构 * Department of Electrical Engineering, Media and Computer Science, Ostbayerische Technische Hochschule Amberg-Weiden（奥贝格-魏登应用技术大学电气工程、媒体与计算机科学系）

AI总结在统一电磁暂态数据集和10-50ms决策窗口下，对比机器学习模型在故障分类与定位中的性能，发现分类在10ms时F1>0.98，定位误差稳定在约10%线路长度。

Comments Accepted at IEEE PES Innovative Smart Grid Technologies Europe 2026 (ISGT Europe 2026). Pre-camera-ready author version; final proceedings version may differ

详情

AI中文摘要

现代电力系统因逆变器基和分布式能源的集成而日益复杂，挑战了传统保护方案的可靠性，并推动了机器学习在保护任务中的应用。然而，由于不同研究中的数据集、传感假设和决策时域各异，已发表的结果往往难以比较。本文在相同的传感、时序和验证条件下，基于公共电磁暂态数据集，使用10-50ms的决策窗口以反映保护相关时间尺度，对故障分类（FC）和故障定位（FL）的机器学习模型进行了受控比较。对于FC，性能最佳的非线性模型在10ms时F1分数已超过0.98，而低容量模型在较短时域下性能下降，但随窗口延长而改善，表明相关故障类型信息在最早暂态中已存在。对于FL，顶级模型在所有评估时域下达到约10%归一化线路长度的稳定定位误差，而较弱模型形成明显分离的第二性能层级。线路解析分析显示，定位精度随电网段变化，表明存在拓扑依赖的难度而非仅时间上下文不足。这些发现为比较两个信息需求根本不同的保护任务中的机器学习模型提供了受控参考。

英文摘要

The increasing complexity of modern power systems, driven by the integration of inverter-based and distributed energy resources, challenges the reliability of conventional protection schemes and motivates the use of machine learning for protection tasks. However, published results are often difficult to compare because datasets, sensing assumptions, and decision horizons vary across studies. This paper presents a controlled comparison of machine learning models for fault classification (FC) and fault localization (FL) under identical sensing, timing, and validation conditions on a common electromagnetic transient dataset, using decision windows of 10-50 ms to reflect protection-relevant time scales. For FC, the best-performing nonlinear models achieve F1 scores above 0.98 already at 10 ms, while lower-capacity models degrade at shorter horizons but improve with longer windows, indicating that relevant fault-type information is already present in the earliest transient. For FL, the top-performing models reach a stable localization error of about 10 % of normalized line length across all evaluated horizons, while weaker models form a clearly separated second performance tier. Line-resolved analysis shows that localization accuracy varies across grid segments, indicating topology-dependent difficulty rather than insufficient temporal context alone. These findings provide a controlled reference for comparing machine learning models across two protection tasks with fundamentally different information requirements.

URL PDF HTML ☆

赞 0 踩 0

2507.14952 2026-06-19 eess.SY cs.SY 版本更新

An approach to the LQG/LTR design problem with specifications for finite-dimensional SISO control systems

有限维SISO控制系统LQG/LTR设计问题的规格化方法

Mahyar Mahinzaeim, Kamyar Mehran

AI总结提出一种基于加权增广的LQG/LTR设计方法，将低高频设计规格纳入LTR框架，通过灵敏度函数塑造闭环性能与鲁棒性，并用齿轮直流电机扭矩控制实例验证。

Comments typos corrected; references added; additional computational details added

详情

AI中文摘要

这是一篇说明性论文，讨论了有限维单变量（单输入/单输出，SISO）控制系统的线性二次高斯/回路传递恢复（LQG/LTR）设计问题的一种方法。该方法基于利用加权增广，将设计规格纳入LTR技术框架中用于LQG补偿器设计。LQG补偿器需同时满足给定的分析性低频和高频设计规格，这些规格以期望的灵敏度和控制器噪声灵敏度函数表示。本文面向非专业人士，特别是有限维LQG理论中的实践者，他们关注在实际情况下为SISO控制系统的闭环性能和鲁棒性塑造设计反馈补偿器。通过一个详细的设计实例——带弹性安装输出轴的齿轮直流电机的扭矩控制——说明了所提出的方法。

英文摘要

This is an expository paper which discusses an approach to the linear quadratic Gaussian/loop transfer recovery (LQG/LTR) design problem for finite-dimensional single-variable (single-input/single-output, SISO) control systems. The approach is based on the utilisation of weighting augmentation for incorporating design specifications into the framework of the LTR technique for LQG compensator design. The LQG compensator is to simultaneously meet given analytical low- and high-frequency design specifications expressed in terms of desirable sensitivity and controller noise sensitivity functions. The paper is aimed at non-specialists and, in particular, practitioners in finite-dimensional LQG theory interested in the design of feedback compensators for closed-loop performance and robustness shaping of SISO control systems in realistic situations. The proposed approach is illustrated by a detailed design example: the torque control of a geared DC motor with an elastically mounted output shaft.

URL PDF HTML ☆

赞 0 踩 0

2509.03488 2026-06-19 eess.SP 版本更新

Efficient DoA Estimation for Linear and Rectangular Arrays with Hybrid Architectures Using Compact DFT Codebooks

基于紧凑DFT码本的线性和矩形阵列混合架构高效DoA估计

Miguel Rivas-Costa, Carlos Mosquera

AI总结针对混合架构中维度压缩导致空间协方差矩阵自由度不足的问题，提出利用DFT波束成形后的柯西型位移结构的广义最小二乘框架，实现线性阵列的协方差矩阵高效恢复，复杂度为O(N_RF^2 N_x)，逼近CRB并优于现有方法。

详情

AI中文摘要

混合模拟数字（HAD）架构显著降低了硬件开销，但引入了严重的维度压缩，这剥夺了空间协方差矩阵（SCM）进行高分辨率波达方向（DoA）估计所需的自由度。离散傅里叶变换（DFT）模拟波束成形的无源巴特勒矩阵实现避免了有源移相器和放大器，进一步加剧了这一挑战。在本文中，我们提出了一个广义最小二乘（GLS）框架，该框架利用了DFT波束成形后出现的柯西型位移结构。通过利用这种结构，我们开发了一种高效的数值技术来恢复均匀线性阵列的SCM，复杂度为$\mathcal{O}(N_{\text{RF}}^2 N_x)$，其中$N_x$是天线数量，$N_{\text{RF}}$是射频链数量。仿真表明，我们的估计器逼近克拉美-罗界（CRB），同时优于最先进的方法。

英文摘要

Hybrid Analog and Digital (HAD) architectures significantly reduce hardware overhead but introduce severe dimensionality compression, which strips the Spatial Covariance Matrix (SCM) of the degrees of freedom required for high-resolution Direction-of-Arrival (DoA) estimation. This challenge is further compounded by passive Butler-matrix implementations of Discrete Fourier Transform (DFT) analog beamforming, which avoid active phase shifters and amplifiers. In this paper, we propose a Generalized Least Squares (GLS) framework that exploits the Cauchy-like displacement structure that arises after DFT beamforming. By leveraging this structure, we develop a highly efficient numerical technique to recover the SCM for uniform linear arrays with a complexity of $\mathcal{O}(N_{\text{RF}}^2 N_x)$, where $N_x$ is the number of antennas and $N_{\text{RF}}$ the number of RF-chains. Simulations demonstrate that our estimator approaches the Cramér-Rao Bound (CRB) while outperforming state-of-the-art methods.

URL PDF HTML ☆

赞 0 踩 0

2508.01819 2026-06-19 eess.IV 版本更新

Decoding the Alzheimer's Continuum: Interpretable Multi-Gate Routing for Diagnosis and Transition Prediction

解码阿尔茨海默病连续谱：可解释的多门路由用于诊断与转换预测

Yufeng Jiang, Hexiao Ding, Hongzhao Chen, Jing Lan, Xinzhi Teng, Gerald W. Y. Cheng, Yunlin Mao, Zongxi Li, Haoran Xie, Jung Sun Yoo, Jing Cai

AI总结提出M$^3$AD统一框架，利用可解释多门专家混合架构，基于T1加权sMRI同时实现三分类诊断和阶段转换预测，准确率达95.13%。

Comments Accepted by MICCAI2026

详情

AI中文摘要

阿尔茨海默病（AD）表现为从正常认知（NC）经轻度认知障碍（MCI）到痴呆的连续进展。然而，大多数深度学习方法将此连续谱简化为不连续的分类任务，很大程度上忽略了动态阶段转换。为了解码这一复杂进展，我们提出M$^3$AD，一个统一框架，仅使用T1加权sMRI联合处理三分类诊断和诊断阶段转换预测。M$^3$AD利用可解释的多门专家混合架构，采用专门的路由机制动态捕获诊断特定的病理模式和跨连续谱的共享结构特征。它进一步通过自适应注意力融合整合临床先验（年龄、性别、eTIV）以增强泛化能力。M$^3$AD在原始实验设置下达到95.13%的准确率（MCLNC报告为90.44%），转换预测准确率为94.87%。关键的是，分析多门路由揭示了区分稳定性和进展性MCI的独特专家激活特征，为个体水平的进展风险分层提供了机制基础。代码见：此 https URL。

英文摘要

Alzheimer's disease (AD) manifests as a continuous progression from normal cognition (NC) through mild cognitive impairment (MCI) to dementia. However, most deep learning approaches reduce this continuum to disjointed classification tasks, largely ignoring dynamic stage transitions. To decode this complex progression, we propose M$^3$AD, a unified framework that jointly addresses three-class diagnosis classification and diagnosis stage transition prediction using only T1-weighted sMRI. M$^3$AD leverages an interpretable multi-gate mixture of experts architecture, employing specialized routing mechanisms to dynamically capture both diagnosis-specific pathological patterns and shared structural features across the continuum. It further integrates clinical priors (age, sex, eTIV) via adaptive attention fusion to enhance generalization. M$^3$AD achieves 95.13% accuracy, compared to 90.44% reported by MCLNC under its original experimental setting, and 94.87% for transition prediction. Crucially, analyzing the multi-gate routing reveals distinct expert activation signatures distinguishing stable from progressive MCI, providing a mechanistic basis for individual-level progression risk stratification. Code is available at https://github.com/csyfjiang/M3AD.

URL PDF HTML ☆

赞 0 踩 0

2507.19137 2026-06-19 eess.AS cs.AI cs.SD 版本更新

Assessment of Personality Dimensions Across Situations in Dyadic Role-Play Scenarios

二元角色扮演场景中跨情境的人格维度评估

Alice Zhang, Skanda Muralidhar, Daniel Gatica-Perez, Mathew Magimai-Doss

发表机构 * Idiap Research Institute（日内瓦研究所）； The University of Texas at Austin（德克萨斯大学奥斯汀分校）

AI总结研究通过对话语音分析，发现感知人格在不同工作情境下显著变化，并识别出与各人格特质相关的声学特征。

Comments Accepted to IEEE Transactions on Affective Computing

详情

AI中文摘要

先前研究表明，用户偏好与其人格相匹配的辅助技术。这引发了对自动人格感知（APP）的兴趣，旨在预测个体感知到的人格特质。以往的APP研究将人格视为静态特质，独立于情境。然而，心理学研究表明，感知人格会随情境和场景而变化。在本研究中，我们调查了参与两种工作情境（中性面试和压力客户互动）的参与者对话语音与感知人格之间的关系。我们的主要发现是：1）感知人格在不同互动中显著不同；2）响度、声压级和频谱通量特征在中性互动中指示感知的外向性、宜人性、尽责性和开放性，而在压力情境中，神经质与这些特征相关；3）手工声学特征和非语言特征在感知人格推断中优于说话人嵌入；4）压力互动更能预测神经质，这与现有心理学研究一致。

英文摘要

Prior research indicates that users prefer assistive technologies whose personalities align with their own. This has sparked interest in automatic personality perception (APP), which aims to predict an individual's perceived personality traits. Previous studies in APP have treated personalities as static traits, independent of context. However, perceived personalities can vary by context and situation as shown in psychological research. In this study, we investigate the relationship between conversational speech and perceived personality for participants engaged in two work situations (a neutral interview and a stressful client interaction). Our key findings are: 1) perceived personalities differ significantly across interactions, 2) loudness, sound level, and spectral flux features are indicative of perceived extraversion, agreeableness, conscientiousness, and openness in neutral interactions, while neuroticism correlates with these features in stressful contexts, 3) handcrafted acoustic features and non-verbal features outperform speaker embeddings in inference of perceived personality, and 4) stressful interactions are more predictive of neuroticism, aligning with existing psychological research.

URL PDF HTML ☆

赞 0 踩 0

2505.18726 2026-06-19 cs.SD cs.LG eess.AS 版本更新

Bioacoustic Geolocation: Species Sounds as Geographic Signals

生物声学地理定位：物种声音作为地理信号

Mustafa Chasmai, Wuao Liu, Subhransu Maji, Grant Van Horn

发表机构 * University of Massachusetts, Amherst（马萨诸塞大学阿姆赫斯特分校）

AI总结本文研究仅通过声音进行全球尺度地理定位，利用生物声学信号中的物种地理分布线索，提出结合物种范围预测与检索的地理定位方法，并验证多模态融合的潜力。

Comments Accepted to ICML 26

详情

AI中文摘要

我们能否仅通过听到的声音确定某人的地理位置？声学信号是否足以定位到国家、州甚至城市？在这项工作中，我们应对全球尺度音频地理定位的挑战，特别关注野生动物和自然声音。我们假设生物声学信号包含信息丰富的地理定位线索，因为物种具有明确的地理分布范围。为了验证这一假设，我们对图像地理定位和声景映射方法进行基准测试，设计预言机和以物种为中心的基线，并提出一种结合物种范围预测与基于检索的地理定位的混合方法。我们进一步探究地理定位是否随着物种多样性记录和跨邻近样本的时空聚合而改善。最后，我们将研究扩展到多模态地理定位，通过结合音频和视觉内容的电影案例研究。我们的结果突出了将生物声学信号纳入地理空间任务的潜力，为物种识别和音频地理定位的未来工作提供了动力。

英文摘要

Can we determine someone's geographic location solely from the sounds they hear? Are acoustic signals enough to localize within a country, state, or even city? In this work, we tackle the challenge of global-scale audio geolocation, with a particular focus on wildlife and natural sounds. We posit that bioacoustic signals contain informative geolocation cues because of well-defined geographic ranges of species. To test this hypothesis, we benchmark image geolocation and soundscape mapping methods, design oracles and species-centric baselines, and propose a hybrid approach that combines species range prediction with retrieval-based geolocation. We further ask whether geolocation improves with species-diverse recordings and spatiotemporal aggregation across neighboring samples. Finally, we extend our study to multimodal geolocation with case studies from movies that combine both audio and visual content. Our results highlight the potential of incorporating bioacoustic signals into geospatial tasks, motivating future work on species recognition and audio geolocation.

URL PDF HTML ☆

赞 0 踩 0

2503.23179 2026-06-19 eess.IV cs.CV 版本更新

OncoReg: Medical Image Registration for Oncological Challenges

OncoReg：面向肿瘤学挑战的医学图像配准

Wiebke Heyer, Yannic Elser, Lennart Berkel, Xinrui Song, Xuanang Xu, Pingkun Yan, Xi Jia, Jinming Duan, Zi Li, Tony C. W. Mok, BoWen LI, Tim Hable, Christian Staackmann, Christoph Großbröhmer, Lasse Hansen, Alessa Hering, Malte M. Sieren, Mattias P. Heinrich

发表机构 * Institute of Medical Informatics, University of Lübeck（吕贝克大学医学信息学研究所）； Institute of Radiology and Nuclear Medicine, University Hospital Schleswig-Holstein（石勒斯维希-霍尔斯坦大学医院放射科和核医学研究所）； Department of Biomedical Engineering and Center for Biotechnology and Interdisciplinary Studies, Rensselaer Polytechnic Institute（伦塞拉塞尔理工学院生物医学工程系和生物技术与跨学科研究中心）； School of Computer Science, University of Birmingham（伯明翰大学计算机科学学院）； Division of Informatics, Imaging and Data Sciences, University of Manchester（曼彻斯特大学信息学、成像和数据科学系）； DAMO Academy, Alibaba Group（阿里集团DAMO学院）； Hangzhou Shengshi Technology Co., Ltd（杭州盛世科技有限公司）； Department of Radiation Oncology, University Hospital Schleswig-Holstein（石勒斯维希-霍尔斯坦大学医院放射肿瘤科）； EchoScout GmbH ； Radboud University Medical Center, Nijmegen（奈密根大学医学中心）； Institute of Interventional Radiology, University Hospital Schleswig-Holstein（石勒斯维希-霍尔斯坦大学医院介入放射科）

AI总结提出OncoReg挑战，通过两阶段框架在保护患者隐私的同时开发可泛化的图像配准方法，用于放射治疗中锥束CT与扇束CT的配准，发现特征提取是关键，深度学习和经典方法结合最有效。

Comments 21 pages, 13 figures

详情

AI中文摘要

在现代癌症研究中，由于患者隐私相关的挑战，产生的大量医学数据往往未被充分利用。OncoReg挑战通过一个两阶段框架解决了这一问题，该框架使研究人员能够在确保患者隐私的同时开发和验证图像配准方法，并促进更可泛化的AI模型的发展。第一阶段涉及使用公开可用的数据集，第二阶段则专注于在安全的医院网络内对私有数据集进行模型训练。OncoReg建立在Learn2Reg挑战的基础上，纳入了放射治疗中介入性锥束计算机断层扫描与标准计划扇束CT图像的配准。准确的图像配准在肿瘤学中至关重要，特别是在图像引导放射治疗的动态治疗调整中，需要精确对齐以最小化对健康组织的辐射暴露，同时有效靶向肿瘤。本文详细介绍了OncoReg挑战的方法和数据，并对竞赛参赛作品和结果进行了全面分析。研究发现，特征提取在此配准任务中起着关键作用。从该挑战中涌现的一种新方法展示了其多功能性，而现有方法的表现与新技术相当。深度学习和经典方法在图像配准中仍扮演重要角色，尤其是方法的组合，特别是在特征提取方面，被证明最为有效。

英文摘要

In modern cancer research, the vast volume of medical data generated is often underutilised due to challenges related to patient privacy. The OncoReg Challenge addresses this issue by enabling researchers to develop and validate image registration methods through a two-phase framework that ensures patient privacy while fostering the development of more generalisable AI models. Phase one involves working with a publicly available dataset, while phase two focuses on training models on a private dataset within secure hospital networks. OncoReg builds upon the foundation established by the Learn2Reg Challenge by incorporating the registration of interventional cone-beam computed tomography with standard planning fan-beam CT images in radiotherapy. Accurate image registration is crucial in oncology, particularly for dynamic treatment adjustments in image-guided radiotherapy, where precise alignment is necessary to minimise radiation exposure to healthy tissues while effectively targeting tumours. This work details the methodology and data behind the OncoReg Challenge and provides a comprehensive analysis of the competition entries and results. Findings reveal that feature extraction plays a pivotal role in this registration task. A new method emerging from this challenge demonstrated its versatility, while established approaches continue to perform comparably to newer techniques. Both deep learning and classical approaches still play significant roles in image registration, with the combination of methods, particularly in feature extraction, proving most effective.

URL PDF HTML ☆

赞 0 踩 0

2507.12052 2026-06-19 eess.SY cs.SY 版本更新

Distributed Resilient State Estimation and Control with Strategically Implemented Security Measures

具有战略性安全措施的分布式弹性状态估计与控制

Takumi Shinohara, Karl H. Johansson, Henrik Sandberg

AI总结针对恶意虚假数据注入传感器攻击和有界噪声，提出一种通过战略性部署网络安全措施来最大化系统弹性并平衡成本的方法，并开发了相应的分布式弹性状态估计与控制方案。

Comments Accepted for publication in the IEEE Transactions on Automatic Control

详情

DOI: 10.1109/TAC.2026.3705299

AI中文摘要

本文研究了线性时不变系统在恶意虚假数据注入传感器攻击和有界噪声下的分布式弹性状态估计与控制问题。我们考虑一个能够部署网络安全措施来对抗传感器妥协的系统操作员（防御者）。尽管此类措施增强了对对抗性攻击的弹性，但它们可能产生大量成本；因此，战略性地选择对策以平衡弹性增益和成本效率至关重要。我们首先证明，通过适当实施安全措施，系统对攻击的弹性最大化，这意味着没有攻击者能够执行不可检测的传感器攻击。基于此分析，我们提出了一种识别最优安全措施的算法。虽然确定该措施通常是NP难的，但我们也推导了有效计算可行的充分条件。此外，我们开发了一种基于最优安全措施的分布式弹性状态估计与控制方案，并建立了保证有界估计和控制误差的条件。最后，通过车辆队列场景的数值模拟验证了方法的有效性。

英文摘要

This paper addresses the problem of distributed resilient state estimation and control for linear time-invariant systems in the presence of malicious false data injection sensor attacks and bounded noise. We consider a system operator (defender) capable of deploying cybersecurity measures to counteract the sensor compromises. Although such measures enhance resilience against adversarial attacks, they may incur substantial costs; hence, it is crucial to select countermeasures to balance resilience gains and cost efficiency strategically. We first demonstrate that the system's resilience against attacks is maximized through the appropriate implementation of security measures, implying that no attacker can execute undetectable sensor attacks. Building on this analysis, we propose an algorithm that identifies the optimal security measure. While determining this measure is NP-hard in general, we also derive sufficient conditions under which efficient computation is feasible. Furthermore, we develop a distributed resilient state estimation and control scheme informed by the optimal security measure and establish conditions that guarantee bounded estimation and control errors. Finally, we validate the efficacy of our approach via numerical simulations of a vehicle platooning scenario.

URL PDF HTML ☆

赞 0 踩 0

2503.20646 2026-06-19 cs.HC cs.RO cs.SY eess.SY 版本更新

Immersive and Wearable Thermal Rendering for Augmented Reality

增强现实的沉浸式可穿戴热渲染

Alexandra Watkins, Ritam Ghosh, Evan Chow, Nilanjan Sarkar

发表机构 * Vanderbilt University（范德比大学）

AI总结提出一种掌戴式热反馈原型，通过间接反馈、主动热透传和时空变化渲染策略，在增强现实中实现沉浸式热触觉体验，实验验证了其可行性与权衡。

详情

AI中文摘要

我们提出了一种概念验证的掌戴式热反馈原型，针对增强现实（AR）中的热渲染挑战，用户必须在其物理工作空间中与真实和虚拟物体交互。与为虚拟现实开发的热反馈系统相比，AR热反馈必须保持手部灵活性、维持对真实世界热线索的访问，并在不阻碍自然物体交互的情况下提供连贯的虚拟温度感知。我们提出了三个AR特定的设计考虑，并由我们的原型实现：间接反馈以保持指尖灵活性、主动热透传以感知和渲染接触物理表面的温度，以及手掌上的空间和时间变化热渲染。人体实验评估了AR交互过程中的感知灵敏度、间接反馈、主动热透传、空间模式识别和移动热渲染。结果表明，尽管间接反馈在指尖视觉接触时降低了感知真实感，但并未降低沉浸感或舒适度；主动热透传支持真实与渲染表面之间的温度辨别；时空渲染相比静态热刺激显著提高了沉浸感和真实感。这些发现表明，我们的设计考虑是AR热触觉的可行设计策略，同时澄清了需要精确真实感与更广泛沉浸式热体验的应用之间的权衡。

英文摘要

We present a proof-of-concept palm-mounted thermal feedback prototype addressing thermal rendering challenges specific to augmented reality (AR), where users must interact with both real and virtual objects in their physical workspace. In contrast to thermal feedback systems developed for virtual reality, AR thermal feedback must preserve manual dexterity, maintain access to real-world thermal cues, and provide coherent virtual temperature sensations without obstructing natural object interaction. We propose three AR-specific design considerations, which our prototype implements: indirect feedback to preserve fingertip dexterity, active thermal passthrough to sense and render the temperature of contacted physical surfaces, and spatially and temporally varying thermal rendering across the palm. Human-subject experiments evaluated perceptual sensitivity, indirect feedback, active thermal passthrough, spatial pattern recognition, and moving thermal rendering during AR interaction. Results showed that although indirect feedback reduced perceived realism during visual contact at the fingertips, it did not reduce immersion or comfort; active thermal passthrough supported temperature discrimination between real and rendered surfaces; and spatiotemporal rendering significantly improved immersion and realism compared with static thermal stimulation. These findings suggest that our design considerations are viable design strategies for AR thermal haptics, while also clarifying tradeoffs for applications that require precise realism versus broader immersive thermal experience.

URL PDF HTML ☆

赞 0 踩 0

2503.17386 2026-06-19 eess.SY cs.LG cs.SY 版本更新

A graph neural network surrogate model for mesh-based crashworthiness prediction of vehicle panel components

基于图神经网络的网格级车辆面板部件耐撞性预测代理模型

Haoran Li, Yingxue Zhao, Haosu Zhou, Tobias Pfaff, Nan Li

发表机构 * Dyson School of Design Engineering, Imperial College London（迪森设计工程学院，帝国理工学院伦敦分校）； NVIDIA

AI总结提出递归图U-Net (ReGUNet) 代理模型，通过图表示有限元网格，结合层次架构和递归机制，高效准确预测车辆B柱等面板部件的动态变形和耐撞性指标。

Comments Accepted manuscript version. Final published version available in Results in Engineering via DOI: 10.1016/j.rineng.2026.110925

Journal ref Results in Engineering 30 (2026) 110925

详情

DOI: 10.1016/j.rineng.2026.110925

AI中文摘要

耐撞性是安全关键车辆面板部件（如B柱）设计中的关键性能指标。有限元（FE）模拟广泛用于评估碰撞响应，但对于大规模非线性碰撞场景，特别是当集成到迭代设计和优化过程中时，计算成本仍然很高。尽管基于机器学习的代理模型已被开发用于快速耐撞性分析，但它们在对复杂三维部件的详细表示方面存在局限性。图神经网络（GNN）已成为处理复杂结构数据的有前景的解决方案。然而，现有的GNN模型通常缺乏足够的精度和计算效率以满足工业需求。本文提出了递归图U-Net（ReGUNet），一种用于车辆面板部件耐撞性分析的基于图的代理模型。通过将有限元网格表示为图形式，该模型自然地适应复杂的非规则结构几何。其层次架构提高了计算效率和精度，而递归的引入增强了多时间步长上时间预测的稳定性。使用不同几何形状的热冲压钢B柱的侧面碰撞案例研究来生成训练数据集。训练后的模型在预测未见过的部件设计的动态变形行为和耐撞性指标方面表现出高精度。与基线方法相比，ReGUNet在平均变形预测误差上实现了超过52%的降低，同时计算效率显著提高。ReGUNet提供了快速可靠的耐撞性评估，从而加速了车辆面板部件的设计周期。

英文摘要

Crashworthiness is a key performance measure in the design of safety-critical vehicle panel components such as B-pillars. Finite element (FE) simulations are widely used to evaluate crash responses but remain computationally expensive for large-scale, nonlinear impact scenarios, particularly when integrated into iterative design and optimisation processes. Although machine learning-based surrogate models have been developed for rapid crashworthiness analysis, they exhibit limitations in detailed representation of complex 3-dimensional components. Graph Neural Networks (GNNs) have emerged as a promising solution for processing data with complex structures. However, existing GNN models often lack sufficient accuracy and computational efficiency to meet industrial demands. This paper proposes Recurrent Graph U-Net (ReGUNet), a graph-based surrogate model for crashworthiness analysis of vehicle panel components. By representing FE meshes in graph form, the model naturally accommodates complex irregular structural geometries. Its hierarchical architecture improves computational efficiency and accuracy, while the introduction of recurrence enhances stability of temporal predictions over multiple time steps. A side-impact case study of hot-stamped steel B-pillars with varying geometries is used to generate training dataset. The trained model demonstrates high accuracy in predicting the dynamic deformation behaviour and crashworthiness indicators of previously unseen component designs. ReGUNet achieves over a 52% reduction in the average deformation prediction error relative to baseline methods, together with markedly improved computational efficiency. ReGUNet provides rapid and reliable crashworthiness assessments, which in turn accelerates the design cycle of vehicle panel components.

URL PDF HTML ☆

赞 0 踩 0

2501.06670 2026-06-19 eess.SY cs.SY 版本更新

A Geometric Analysis-Based Safety Assessment Framework for Marine Vehicle Route Decision-Making

基于几何分析的船舶航线决策安全评估框架

Zilong Xu, Zihao Wang, He Li, Dingli Yu, Zaili Yang, Jin Wang

AI总结提出基于几何分析的航线安全评估框架（GARSA），利用线和点几何元素定义航道边界，构建动态宽度表征函数量化受限水域空间安全性，并建立考虑整体及局部约束的航行安全指数，为船舶选择最安全航道提供定量依据。

Comments Accepted for publication in Reliability Engineering & System Safety (RESS)

详情

AI中文摘要

本文开发了一种基于几何分析的航线安全评估（GARSA）框架，以增强船舶在受限水域航行的安全性。利用线和点几何元素定义航道边界，该框架能够构建动态宽度表征函数，以量化复杂受限航行空间内的空间安全性。开发了一种迭代方法计算动态宽度表征函数，实现了航道空间属性的抽象表示。在此基础上，创建了考虑整体航线安全性以及局部航段空间约束的航行安全指数，从而能够选择最安全的航道通过。在汉堡港和假设航道的案例研究表明，GARSA能够识别不同航线之间的空间安全差异。例如，GARSA分别对两条长度分别为7,258米和7,845米的港内转运航线给出了0.6和135的安全评估值，其中较高的值表示更安全的航线。总体而言，GARSA为受限水域中船舶面向安全的航线决策提供了定量基础。

英文摘要

This paper develops a Geometric Analysis-based Route Safety Assessment (GARSA) framework to enhance the safety of marine vehicles navigating in restricted waters. Utilizing line and point geometric elements to define waterway boundaries, the framework enables the construction of a dynamic width characterization function to quantify spatial safety within complex restricted navigation spaces. An iterative method is developed to calculate the dynamic width characterization function, enabling an abstracted spatial property representation of waterways. Based on this, a navigational safety index that accounts for the overall route safety as well as the spatial constraints of local segments is created, enabling the selection of the safest waterway to pass through. Case studies in Hamburg Port and hypothetical waterways show that GARSA identifies spatial safety differences among routes. For example, GARSA assigned safety assessment values of 0.6 and 135 to two intra-port transit routes of 7,258 m and 7,845 m, respectively, with the higher value indicating a safer route. Overall, GARSA provides a quantitative basis for safety-oriented route decision-making of marine vehicles in restricted waters.

URL PDF HTML ☆

赞 0 踩 0

2405.10705 2026-06-19 eess.IV cs.CV 版本更新

3D Vessel Reconstruction from Sparse-View Dynamic DSA Images via Vessel Probability Guided Attenuation Learning

基于血管概率引导衰减学习的稀疏视角动态DSA图像三维血管重建

Zhentao Liu, Huangxuan Zhao, Wenhui Qin, Zhenghong Zhou, Xinggang Wang, Wenping Wang, Xiaochun Lai, Chuansheng Zheng, Dinggang Shen, Zhiming Cui

发表机构 * School of Biomedical Engineering \& State Key Laboratory of Advanced Medical Materials ； Devices, ShanghaiTech University, Shanghai, China ； National Engineering Research Center for Multimedia Software, School of Computer Science, Wuhan University, Wuhan, China ； School of Electronic Information ； Communications, Huazhong University of Science ； Department of Computer Science \& Engineering, Texas A\&M University, USA

AI总结提出血管概率引导衰减学习框架，通过静态与动态衰减场互补加权实现稀疏视角DSA重建，降低辐射剂量，并采用渐进训练和时间扰动损失提升质量。

Comments Accepted by Medical Image Analysis (MedIA), 2026

详情

DOI: 10.1016/j.media.2026.104088

AI中文摘要

数字减影血管造影（DSA）是血管疾病诊断的金标准之一。借助造影剂，时间分辨的二维DSA图像提供全面的血流信息，可用于重建三维血管结构以进行医学评估。当前的商用DSA系统通常需要数百个扫描视角进行重建，导致大量辐射暴露。在本研究中，我们提出了一种基于神经渲染的优化框架，专门用于高质量稀疏视角DSA重建，以减少辐射剂量。我们的方法称为血管概率引导衰减学习，将DSA成像表示为静态和动态衰减场的互补加权组合，权重来自时间无关的血管概率场。作为前景掩膜，血管概率为静态和动态场提供适应不同场景类型的适当梯度。该机制实现了静态背景与动态造影剂流的自监督分解，并显著提高了重建质量。我们的模型通过最小化合成投影与真实DSA图像之间的差异进行训练。我们进一步采用两种训练策略来提高重建质量：（1）由粗到细的渐进训练以改善几何结构，以及（2）时间扰动渲染损失以保持时间一致性。实验结果表明了高质量的三维血管重建和二维DSA图像合成。

英文摘要

Digital Subtraction Angiography (DSA) is one of the gold standards for vascular disease diagnosis. With the help of a contrast agent, time-resolved 2D DSA images deliver comprehensive blood flow information and can be utilized to reconstruct 3D vessel structures for medical assessment. Current commercial DSA systems typically require hundreds of scanning views to perform reconstruction, resulting in substantial radiation exposure. In this study, we propose a neural rendering-based optimization framework tailored for high-quality sparse-view DSA reconstruction to reduce radiation dosage. Our approach, termed vessel probability guided attenuation learning, represents DSA imaging as a complementary weighted combination of static and dynamic attenuation fields, with the weights derived from the time-independent vessel probability field. Functioning as a foreground mask, vessel probability provides proper gradients for both static and dynamic fields adaptive to different scene types. This mechanism enables self-supervised decomposition between static backgrounds and dynamic contrast agent flow, and significantly improves reconstruction quality. Our model is trained by minimizing the discrepancy between synthesized projections and real captured DSA images. We further employ two training strategies to improve reconstruction quality: (1) coarse-to-fine progressive training for better geometry and (2) temporal perturbed rendering loss for temporal consistency. Experimental results have demonstrated high-quality 3D vessel reconstruction and 2D DSA image synthesis.

URL PDF HTML ☆

赞 0 踩 0

2507.15475 2026-06-19 eess.SP math.PR stat.AP

On the Distribution of a Two-Dimensional Random Walk with Restricted Angles

二维受限角度随机游走的分布

Karl-Ludwig Besser

AI总结研究受限角度二维随机游走的分布，推导两步联合与边缘分布，提供一般步数的数值解及大步数近似，明确支持集的精确描述。

Comments 14 pages, 14 figures

Journal ref IEEE Transactions on Signal Processing, vol. 74, pp. 2316-2330, 2026

2605.10078 2026-06-19 eess.SY cs.SY

Scalable Design of Attack-Resilient Controllers for Positive Systems

可扩展的抗攻击控制器设计方法用于正系统

Alba Gurpegui, Sribalaji C. Anand, André M. H. Teixeira

AI总结本文提出了一种针对正系统在面对网络攻击时的安全和鲁棒控制器设计框架，通过最小化最大损失分析攻击影响，并展示线性策略可优化攻击策略。

Comments 3 figures, submitted to L-CSS and CDC 2026

详情

DOI: 10.1109/LCSYS.2026.3704261

AI中文摘要

本文提出了一种用于正系统在面对网络攻击时的 secure 和 resilient 控制器设计框架。我们考虑了一个网络控制系统，其中攻击者通过注入虚假数据到执行器通道来增加控制成本（性能度量），同时惩罚攻击努力并受状态依赖约束。使用最小最大公式，我们分析了此类攻击导致的最坏情况性能损失，其由差分方程和当时间范围无限时的代数方程给出。我们证明，在可能的非线性策略中，最优攻击策略是线性的。尽管没有显式的隐蔽约束，我们还展示当测量输出具有一个不稳定的零点，但该零点不是性能度量的不稳定零点时，攻击可以导致性能退化无界。所提出的框架还扩展到具有模型不确定性的系统。数值示例展示了结果，并展示了如何利用正系统和线性调节器理论的工具来以低计算成本缓解网络攻击。

英文摘要

This paper proposes a framework for secure and resilient controller design for positive systems against cyber-attacks. In particular, we consider a network-controlled system where an adversary injects false data into the actuator channels to increase the control cost (performance measure) while penalizing the attack effort and subject to state-dependent constraints. Using a minimax formulation, we analyze the worst-case performance loss caused by such adversaries, which is given by the solution of a difference equation, and an algebraic equation when the time horizon is infinite. We show that the optimal attack policy, among possible nonlinear policies, is linear. Despite the lack of explicit stealthiness constraints, we also show that when the measured output has an unstable zero which is not an unstable zero of the performance measure, the attacks can induce unbounded performance degradation. The proposed framework is also extended to systems with model uncertainty. Numerical examples illustrate the results and demonstrate how tools from positive systems and linear regulator theory can be used to mitigate cyber-attacks with low computational effort.

URL PDF HTML ☆

赞 0 踩 0

2601.12433 2026-06-19 eess.SP cs.LG

Temporal Data and Short-Time Averages Improve Multiphase Mass Flow Metering

Amanda Nyholm, Yessica Arellano, Jinyu Liu, Damian Krakowiak, Pierluigi Salvo Rossi

发表机构 * Dept. Electronic Systems, Norwegian University of Science and Technology（电子系统系，挪威科学与技术大学）； Dept. Gas Technology, SINTEF Energy Research（气体技术系，SINTEF能源研究）； Dept. Research and Development, KROHNE Ltd.（研发部，KROHNE有限公司）

Comments 9 pages, 6 figures

Journal ref IEEE Sensors Journal, vol. 26, no. 11, pp. 17252-17261, 1 June 2026

2602.20953 2026-06-19 eess.SP

Timing Recovery and Sequence Detection for Integrate-and-Fire Time Encoding Receivers

Neil Irwin Bernardo

Comments 6 pages, 3 figures, accepted in 2026 IEEE Wireless Communications and Networking Conference (WCNC 2026)

2601.15119 2026-06-19 eess.IV cs.CV

Vision Models for Medical Imaging: A Hybrid Approach for PCOS Detection from Ultrasound Scans

Md Mahmudul Hoque, Md Mehedi Hassain, Muntakimur Rahaman, Md. Towhidul Islam, Shaista Rani, Md Sharif Mollah

发表机构 * Department of CSE, CCN University of Science & Technology（计算机科学与工程系，CCN科学与技术大学）； Department of EEE,International Islamic University Chittagong（电子工程系，国际伊斯兰大学恰tagong分校）； Faculty of Engineering, Multimedia University（工程学院，多媒体大学）； Department of CSE, Stamford University of Bangladesh（计算机科学与工程系，斯塔福德大学孟加拉国分校）； Department of Biology, Lucknow University（生物学系，拉胡尔大学）； Department of CSE, Bangladesh Army International University of Science & Technology（计算机科学与工程系，孟加拉国军队国际科学与技术大学）

2509.04390 2026-06-19 eess.AS cs.SD

Accelerated Interactive Auralization of Highly Reverberant Spaces using Graphics Hardware

Hannes Rosseel, Toon van Waterschoot

发表机构 * KU Leuven, Dept. of Electrical Engineering (ESAT), STADIUS Center for Dynamical Systems, Signal Processing

Comments 9 pages, 6 figures, submitted to Journal of the Audio Engineering Society

1406.0214 2026-06-19 eess.SY cs.SY math.AT stat.ML

Topological and Statistical Behavior Classifiers for Tracking Applications

拓扑与统计行为分类器用于跟踪应用

Paul Bendich, Sang Chin, Jesse Clarke, Jonathan deSena, John Harer, Elizabeth Munch, Andrew Newman, David Porter, David Rouse, Nate Strawn, Adam Watkins

AI总结本文提出基于多假设跟踪、拓扑数据分析和机器学习的统一理论，通过拓扑特征编码行为信息，利用统计模型拟合拓扑特征分布，并结合目标类型分类方法提升跟踪性能。