arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2606.12406 2026-06-11 cs.RO cs.AI cs.LG eess.SY 新提交

FACTR 2: Learning External Force Sensing for Commodity Robot Arms Improves Policy Learning

FACTR 2: 学习商用机器人手臂的外部力感知提升策略学习

Steven Oh, Jason Jingzhou Liu, Tony Tao, Philip Han, Kenneth Shaw, Satoshi Funabashi, Ruslan Salakhutdinov, Deepak Pathak

发表机构 * Carnegie Mellon University（卡内基梅隆大学）； Waseda University（早稻田大学）

AI总结提出无需专用力传感器的数据驱动方法NEXT，可在1分钟内从10分钟自由运动数据中训练，实现与专用关节力矩传感器相当的估计，并结合FIRST采样策略提升策略学习性能。

详情

Comments: Website at this https URL

AI中文摘要

接触丰富的操作需要力敏感性，但由于成本高昂，许多机器人手臂缺乏专用的力传感器。我们提出了神经外部力矩估计（NEXT），一种无需任何专用力传感器即可估计外部关节力矩的数据驱动方法。NEXT 仅需 10 分钟的自由运动数据即可在 1 分钟内完成训练，却能实现与专用关节力矩传感器相当的估计。NEXT 能够在低成本手臂上实现力反馈遥操作，并通过力信息重采样训练（FIRST）改进策略学习，该训练在行为克隆过程中对预接触和接触段进行上采样。在五个长时域任务中，FIRST 在任务进展上比先前的力感知策略提高了超过 17%。NEXT 和 FIRST 共同将力感知遥操作和策略学习引入现成的机器人，无需额外的传感硬件。视频结果和代码可在 https://this URL 获取。

英文摘要

Contact-rich manipulation requires force sensitivity, but many robot arms lack dedicated force sensors due to their high cost. We present Neural External Torque Estimation (NEXT), a data-driven method that estimates external joint torques without needing any dedicated force sensors. NEXT trains in 1 minute from only 10 minutes of free-motion data, yet achieves estimates comparable to dedicated joint-torque sensors. NEXT enables force-feedback teleoperation on low-cost arms and improves policy learning through Force-Informed Re-Sampling Training (FIRST), which up-samples pre-contact and contact segments during behavior cloning. Across five long-horizon tasks, FIRST outperforms prior force-aware policies by over 17% in task progress. Together, NEXT and FIRST bring force-aware teleoperation and policy learning to off-the-shelf robots without additional sensing hardware. Video results and code are available at this https URL

URL PDF HTML ☆

赞 0 踩 1

2606.12349 2026-06-11 cs.RO eess.SY 新提交

Traceable Virtual Sea Trials in the Marine Robotics Unity Simulator for Manoeuvring Assessment of Unmanned Surface Vehicles

面向无人水面艇操纵性评估的海洋机器人Unity仿真器中可追溯虚拟海试

Paria Rezayan

发表机构 * School of Engineering and Built Environment, Sheffield Hallam University（谢菲尔德哈勒姆大学工程与建筑环境学院）

AI总结针对USV水动力导数辨识数据获取难的问题，在MARUS仿真器中建立标准化虚拟海试框架，通过TC/ZZ机动自动化执行、数据采集与后处理管道，生成符合IMO/ITTC指标的可重复数据集，案例验证了框架的有效性。

详情

AI中文摘要

精确识别水动力导数对于无人水面艇（USV）的控制与导航至关重要，但物理海试的高保真操纵数据受成本和安全性限制。回转试验（TC）和Z形试验（ZZ）仍是IMO和ITTC评估程序的基础。本文扩展了海洋机器人Unity仿真器（MARUS），引入标准化虚拟海试框架，用于TC/ZZ机动的自动化执行和数据生成，包括可追溯的命令-执行日志记录、面向系统辨识（SI）的数据调理以及自动提取符合IMO/ITTC的操纵性指标。一个关键贡献是专用的TC/ZZ数据采集和后处理管道，提高了基于仿真的机动的可重复性和可审计性，同时生成适用于水动力导数辨识和数字孪生工作流的SI就绪数据集。另一个特点是差动推力转向的显式命令-执行分离，其中输入记录为有序的等效舵命令，而实际执行则记录为基于施加推力的执行级代理。案例研究结果表明了可重复且合规的机动行为。对于TC试验，左舷和右舷之间的归一化进距差异约为3.9%，战术直径差异约为4.6%至4.7%。对于ZZ试验，±10度和±20度机动下的第一和第二超越角超调量均保持在1度以下，满足IMO标准，而峰值偏航速率约为4.1至5.8度/秒。总体而言，该框架提供了一种可重复且可审计的虚拟海试工作流，用于生成符合IMO/ITTC的数据集，并支持系统辨识、水动力导数估计和数字孪生校准。

英文摘要

Accurate identification of hydrodynamic derivatives is essential for control and navigation of Unmanned Surface Vehicles (USVs), but high-fidelity manoeuvring data from physical sea trials are constrained by cost and safety. Turning Circle (TC) and Zig-Zag (ZZ) trials remain fundamental to IMO and ITTC assessment procedures. This paper extends the Marine Robotics Unity Simulator (MARUS) by introducing a standardised Virtual Sea Trial framework for automated execution and data generation of TC/ZZ manoeuvres, with traceable command-actuation logging, system-identification (SI)-focused data conditioning, and automated extraction of IMO/ITTC-aligned manoeuvring metrics. A key contribution is a dedicated TC/ZZ data acquisition and post-processing pipeline, improving the repeatability and auditability of simulator-based manoeuvres while producing SI-ready datasets for hydrodynamic-derivative identification and digital-twin workflows. Another feature is explicit command-execution separation for differential-thrust steering, where inputs are recorded as ordered rudder-equivalent commands and realised actuation is logged as an execution-level proxy derived from applied thrust. Case-study results demonstrate repeatable and compliant manoeuvre behaviour. For TC tests, the normalised advance differs by approximately 3.9 percent between port and starboard sides, while the tactical diameter differs by approximately 4.6 to 4.7 percent. For ZZ tests, first and second overshoot excesses remain below 1 degree for both +/- 10 degree and +/- 20 degree manoeuvres, satisfying IMO criteria, while peak yaw rates range from approximately 4.1 to 5.8 deg/s. Overall, the framework provides a repeatable and auditable virtual sea-trial workflow for generating IMO/ITTC-aligned datasets and supporting system identification, hydrodynamic-derivative estimation, and digital-twin calibration.

URL PDF HTML ☆

赞 0 踩 0

2606.12336 2026-06-11 eess.SY math.OC 新提交

Analysis of a Distributed Optimization-Based Control Architecture for Inverter-Interfaced Virtual Power Plants

基于分布式优化的逆变器接口虚拟电厂控制架构分析

Vivek Khatana, Soham Chakraborty, Murti V. Salapaka

AI总结针对虚拟电厂中逆变器接口分布式能源，提出一种基于采样数据优化二次控制的大信号稳定性分析方法。

2606.12328 2026-06-11 eess.AS 新提交

HALO: Half-Frame-Rate Adaptive Learnable Operator for Lightweight STFT-Based Speech Enhancement

HALO：半帧率自适应可学习算子用于轻量级基于STFT的语音增强

Jiadong Zhao, Dahan Wang, Yu Sun, Leyan Yang, Xiaobin Rong, Shiruo Sun, Yuxiang Hu, Jing Lu

AI总结提出HALO模块，通过半帧率处理减少STFT重叠帧冗余，降低轻量模型计算成本，在DNS3数据集上验证了有效性。

详情

Comments: Accepted by Interspeech 2026

AI中文摘要

基于STFT的语音增强通常采用重叠分析帧。虽然重叠对于稳定的STFT处理至关重要，但它使相邻帧高度相关，导致轻量模型中的冗余计算。我们提出了半帧率自适应可学习算子（HALO），这是一个因果插件模块，在不改变STFT过程的情况下将内部帧率减半。HALO广泛适用于许多轻量模型，在骨干网络之前应用自适应速率降低，之后进行恢复，在原始STFT网格上重建全速率频谱。降低和恢复均通过轻量动态卷积实现。通过将处理帧率减半，HALO在不增加算法延迟的情况下降低了骨干网络的计算成本，为通道扩展释放了预算。在DNS3数据集上的实验表明，在匹配复杂度下，各种轻量模型均获得一致提升，证明了减少重叠引起的冗余的有效性。

英文摘要

STFT-based speech enhancement typically adopts overlapping analysis frames. While overlap is essential for stable STFT processing, it makes adjacent frames highly correlated, causing redundant computation in lightweight models. We propose Half-frame-rate Adaptive Learnable Operator (HALO), a causal plug-in module that halves the internal frame rate without altering the STFT procedure. Broadly applicable to many lightweight models, HALO applies adaptive rate reduction before the backbone and restoration afterward, reconstructing the full-rate spectrum on the original STFT grid. Both reduction and restoration are implemented with lightweight dynamic convolutions. By halving the processed frame rate, HALO reduces backbone compute cost with no added algorithmic latency, freeing budget for channel widening. Experiments on the DNS3 dataset show consistent gains across diverse lightweight models under matched complexity, demonstrating the effectiveness of reducing overlap-induced redundancy.

URL PDF HTML ☆

赞 0 踩 0

2606.12327 2026-06-11 eess.SY math.OC 新提交

From the Linear Quadratic Regulator (LQR) to the (Deterministic) Kalman Filter in Two Easy Steps

从线性二次型调节器（LQR）到（确定性）卡尔曼滤波器的两步简易推导

Bassam Bamieh

AI总结本文通过两步推导，将确定性卡尔曼滤波器转化为LQR问题，利用齐次坐标和矩阵微分Riccati方程求解，并给出最优动态观测器。

详情

AI中文摘要

本文是关于确定性卡尔曼滤波器（状态估计器）的教程，其表述为：寻找与系统方程一致的状态轨迹，使得$L^2$过程和测量不确定性最小。如所述，这是一个输入信号设计问题，具有线性动力学和关于状态与输入仿射二次的目标函数。第一步是通过使用“齐次坐标”嵌入到更大的系统中，将该问题转化为纯二次目标的问题。这将问题转化为纯二次（即LQR）问题，但具有非标准的初始或最终状态约束。然后可以使用更大LQR问题的矩阵微分Riccati方程（DRE）版本求解后一个问题。第二步是对这个更大问题进行划分，从而得到最优动态观测器和传统卡尔曼滤波器的DRE。作为比较，还使用类似构造处理了传统LQ跟踪（伺服机构）问题的解。

英文摘要

This note is a tutorial on the deterministic version of the Kalman filter (state estimator), which is formulated as finding the state trajectory consistent with the system's equations with the minimal amount of $L^2$ process and measurement uncertainty. As stated, this is an input signal design problem with linear dynamics and an objective that is affine-quadratic in the state and inputs. The first step is to convert this problem to one with a purely quadratic objective by embedding in a larger system using ``homogeneous coordinates''. This converts the problem to a purely quadratic (i.e. an LQR) problem, but with non-standard initial or final state constraints. This latter problem can then be solved using a version of the matrix Differential Riccati Equation (DRE) for the larger LQR problem. The second step is a partitioning of this larger problem, which then yields the optimal dynamic observer and the DRE of the traditional Kalman filter. For comparison, the solution of the traditional LQ-tracking (Servomechanism) problem is also treated using a similar construction.

URL PDF HTML ☆

赞 0 踩 0

2606.12321 2026-06-11 eess.SP 新提交

Bending the Rules of Propagation: Caustic Beamforming for Next-Generation Wireless Systems

弯曲传播规则：面向下一代无线系统的焦散波束成形

Shicong Liu, Xianghao Yu, Robert Schober

AI总结本文提出焦散波束成形作为无线波束控制的新范式，利用自弯曲、自修复和近场无衍射特性，在6G网络中提升物理层安全和服务稳定性，并讨论硬件架构与开放挑战。

详情

Comments: 7 pages, 5 figures

AI中文摘要

传统的波束成形技术主要沿期望方向引导能量或将其聚焦在特定位置。这些技术在面对频繁阻塞和高度动态的传播环境时变得脆弱。在本文中，我们提出焦散波束成形作为无线波束控制的新范式。首先，我们根据其数学起源对代表性焦散波束进行分类，并呈现三个独特性质，即自弯曲、自修复和近场无衍射。基于这些传播特性，我们随后提出第六代（6G）网络中的几个应用场景。我们进行了两个案例研究，重点关注物理层安全和服务稳定性，突出焦散波束绕过潜在窃听者、提供更均匀覆盖以及维持抗阻塞链路的能力。我们进一步讨论了促进实际部署的使能硬件架构，并最后概述了焦散波束需要进一步研究的关键开放挑战。

英文摘要

Conventional beamforming techniques primarily steer energy along desired directions or focus it at specific locations. These techniques become fragile when facing frequent blockage and highly dynamic propagation environments. In this article, we present caustic beamforming as a new paradigm for wireless beam control. First, we classify representative caustic beams according to their underlying mathematical origins and present three unique properties, namely self-bending, self-healing, and near-field non-diffracting. Building on these propagation properties, we then propose several application scenarios in sixth-generation (6G) networks. We undertake two case studies focused on physical layer security and service stability that highlight the capability of caustic beams to bypass potential eavesdroppers, deliver more uniform coverage, and sustain blockage-resilient links. We further discuss the enabling hardware architectures that facilitate practical deployments, and finally outline key open challenges regarding caustic beams that require further research.

URL PDF HTML ☆

赞 0 踩 0

2606.12314 2026-06-11 eess.SP 新提交

Near Field Multi-Band Localization: CRB, Efficient Estimator, and Threshold SNR

近场多频带定位：CRB、高效估计器和阈值信噪比

Roberto Bomfin, Marco Mezzavilla, Sundeep Rangan, Marwa Chafii

AI总结针对单路径SIMO系统，推导了均匀线阵下AoA和距离的闭式CRB，提出了基于Levenberg-Marquardt的单/多频带ML估计器，并解析表征了阈值信噪比（TSNR），多频带处理可同时提高精度和降低SNR需求。

详情

AI中文摘要

本文提出了单路径单输入多输出（SIMO）系统多频带定位的理论框架。我们推导了均匀线性阵列（ULA）下到达角（AoA）和距离的闭式Cramer-Rao界（CRB），以及任意阵列形状的中间矩阵形式公式。我们还开发了基于结构化Levenberg-Marquardt（LM）细化过程的基准单频带和多频带最大似然（ML）估计器，用于AoA-距离联合估计。一个关键贡献是对所提估计器的阈值信噪比（TSNR）的解析表征。这是估计器从“偏离图表”过渡到接近CRB性能的SNR阈值，适用于TDoA和距离估计。数值模拟证实，所提出的单频带和多频带估计器在高于预测TSNR的SNR下达到CRB，并且多频带处理同时提高了估计精度并降低了SNR要求。由此产生的框架为下一代多频带定位提供了严格的基础，并可轻松扩展到仰角估计、分布式阵列和多径环境。

英文摘要

This paper presents a theoretical framework for multi-band localization for a single-path single-input multiple-output (SIMO) system. We derive closed-form Cramer-Rao bounds (CRBs) for angle-of-arrival (AoA) and distance for uniform linear arrays (ULAs), and an intermediate matrix-form formulation for arbitrary array shapes. We also develop benchmark single- and multi-band maximum-likelihood (ML) estimators for AoA-Distance, leveraging a structured Levenberg-Marquardt (LM) refinement procedure. A key contribution is an analytical characterization of the threshold SNR (TSNR) for the proposed estimators. This is the SNR threshold at which the estimator transitions from "off the chart" to CRB-approaching performance, for both TDoA and distance estimation. Numerical simulations confirm that the proposed single- and multi-band estimators achieve the CRB at SNRs above the predicted TSNR, and that multi-band processing simultaneously improves estimation accuracy and reduces SNR requirements. The resulting framework provides a rigorous foundation for next-generation multi-band localization and can be readily extended to elevation estimation, distributed arrays, and multi-path environments.

URL PDF HTML ☆

赞 0 踩 0

2606.12294 2026-06-11 cs.CV eess.IV 新提交

Bridging the Modality Gap in Forensic Image Retrieval

弥合法医图像检索中的模态差距

Ricardo González-Gazapo, Annette Morales-González, Yoanna Martínez-Díaz, Heydi Méndez-Vázquez, Milton García-Borroto

发表机构 * Advanced Technologies Application Center (CENATAV)（先进技术应用中心（CENATAV））； Centro de Sistemas Complejos, Facultad de Física, Universidad de La Habana（哈瓦那大学物理学院复杂系统中心）

AI总结提出统一检索框架，利用多模态大语言模型生成文本描述并结合视觉与文本特征融合，提升纹身、人脸素描等法医任务的检索精度与鲁棒性。

详情

Comments: 23 pages, 5 figures, paper submitted to Elsevier journal

AI中文摘要

自动图像检索在现代法医分析中扮演着越来越关键的角色，支持依赖于视觉证据高效比较的调查工作流程。虽然先前的工作主要集中在开发和优化多模态检索系统，但很少关注评估这些技术在多样化真实场景中的法医适用性。在本研究中，我们提出了一个统一的检索框架，适用于四个关键的法医任务：（1）给定纹身查询图像的纹身图像检索；（2）由人类专家文本描述引导的纹身检索，模拟目击者口头描述纹身的常见情况；（3）从手绘草图中检索纹身；（4）从法医面部素描中检索人脸。我们的系统利用多模态大语言模型（MLLM）自动为所有查询和图库图像生成结构化文本描述，然后使用句子变换器嵌入进行基于文本的比较。我们使用仅视觉嵌入、仅文本嵌入以及一种多模态融合策略来评估检索性能，该策略结合了来自与每个任务相关的最先进视觉特征提取器的文本和图像相似性分数。模态融合一致地提高了检索精度和鲁棒性，特别是在视觉信息有限或嘈杂的场景中（例如，素描、部分纹身或零碎的目击者陈述）。这项工作突显了统一多模态检索流程的法医价值，并展示了现代MLLM如何能够操作化传统上依赖人工专家分析的具有挑战性的法医任务。我们的结果将多模态检索定位为支持涉及纹身、面部合成和目击者描述的调查工作流程的有前途工具。

英文摘要

Automated image retrieval plays an increasingly critical role in modern forensic analysis, supporting investigative workflows that rely on efficient comparison of visual evidence. While prior work has focused primarily on developing and optimizing multimodal retrieval systems, limited attention has been paid to evaluating the forensic applicability of these technologies across diverse real-world scenarios. In this study, we present a unified retrieval framework adapted to four key forensic tasks: (1) tattoo image retrieval given a tattoo query image; (2) tattoo retrieval guided by human-expert textual descriptions, modelling the common situation where a witness verbally describes a tattoo; (3) tattoo retrieval from hand-drawn sketches; and (4) face retrieval from forensic face sketches. Our system leverages a multimodal large language model (MLLM) to automatically generate structured textual descriptions for all queries and gallery images, followed by sentence-transformer embedding for text-based comparison. We evaluate retrieval using visual-only embeddings, text-only embeddings and a multimodal fusion strategy that combines text- and image-based similarity scores derived from state-of-the-art visual feature extractors relevant to each task. The fusion of modalities consistently improves retrieval precision and robustness, especially in scenarios where visual information is limited or noisy (e.g., sketches, partial tattoos, or fragmented witness statements). This work highlights the forensic value of a unified multimodal retrieval pipeline and demonstrates how modern MLLMs can operationalize challenging forensic tasks that traditionally rely on manual expert analysis. Our results position multimodal retrieval as a promising tool for supporting investigative workflows involving tattoos, facial composites, and witness descriptions.

URL PDF HTML ☆

赞 0 踩 0

2606.12293 2026-06-11 eess.SP 新提交

LLM-Based Digital Twin Intelligence for Application-Aware Network Selection in 6G Heterogeneous Wireless Networks

基于大语言模型的数字孪生智能：面向6G异构无线网络中应用感知的网络选择

Brahim Mefgouda, Anis Bara, Lina Bariah, Hang Zou, Yuzhi Yang, Merouane Debbah

AI总结提出一种基于大语言模型的数字孪生框架，通过融合物理传播、分组级QoS仿真和决策记忆，实现候选集演化下的稳定应用感知RAT选择，显著降低秩反转和切换次数。

详情

Comments: Submitted to an IEEE venue

AI中文摘要

未来的6G异构无线网络（HWNs）预计将支持多种无线接入技术（RATs）、动态无线环境以及具有多样化服务质量（QoS）需求的应用。在这样的环境中，网络选择（NS）不能仅依赖于瞬时无线测量或静态排序规则。相反，接入决策必须考虑不断演变的无线状态、服务意图、分组级QoS行为以及候选RAT的动态变化。本文提出了一种基于大语言模型（LLM）的数字孪生（DT）框架，用于在候选集演化下进行稳定的、应用感知的RAT选择。主要思想是将NS从瞬时决策矩阵操作转变为基于演变的无线DT状态的决策过程。构建的DT结合了站点特定几何、基于Sionna RT的传播描述符、ns-3分组级QoS仿真、服务上下文、候选RAT信息和决策记忆。LLM并非作为6G网络的通用控制器，而是用于此特定NS任务中基于DT的决策智能。在此DT之上，一个统一的意图代理将用户和服务需求转化为两个互补NS分支的结构化决策优先级：LLM辅助的多属性决策分支（MADM--LLM--NS）和直接基于LLM的排序分支（LLM--NS）。为了提高决策稳定性，该框架进一步引入了历史感知自适应归一化（HAAN）和DT记忆驱动的检索增强上下文学习（RA--ICL）。数值结果表明，与代表性的基于MADM的NS基线相比，所提出的框架减少了秩反转问题和不必要的切换事件，同时提高了服务感知的QoS满意度。

英文摘要

Future 6G heterogeneous wireless networks (HWNs) are expected to support multiple radio access technologies (RATs), dynamic wireless environments, and applications with diverse quality-of-service (QoS) requirements. In such environments, network selection (NS) cannot rely only on instantaneous radio measurements or static ranking rules. Instead, access decisions must account for the evolving wireless state, service intent, packet-level QoS behavior, and candidate-RAT dynamics. This paper proposes a large language model (LLM)-based digital twin (DT) framework for stable, application-aware RAT selection under candidate-set evolution. The main idea is to shift NS from an instantaneous decision-matrix operation to a decision process over an evolving wireless DT state. The constructed DT combines site-specific geometry, Sionna RT-based propagation descriptors, ns-3 packet-level QoS emulation, service context, candidate-RAT information, and decision memory. Rather than acting as a general-purpose controller for 6G networks, the LLM is used for DT-grounded decision intelligence in this specific NS task. On top of this DT, a unified intent agent translates user and service requirements into structured decision priorities for two complementary NS branches: an LLM-assisted multi-attribute decision-making branch (MADM--LLM--NS) and a direct LLM-based ranking branch (LLM--NS). To improve decision stability, the framework further introduces history-aware adaptive normalization (HAAN) and DT-memory-driven retrieval-augmented in-context learning (RA--ICL). Numerical results show that the proposed framework reduces rank-reversal problem and unnecessary handover events, while improving service-aware QoS satisfaction compared with representative MADM-based NS baselines.

URL PDF HTML ☆

赞 0 踩 0

2606.12226 2026-06-11 cs.CV eess.IV 新提交

An Electric Potential-Augmented Benchmark Dataset for Physics-Guided Image Reconstruction of Electrical Capacitance Tomography

一种电势增强的基准数据集，用于电容层析成像的物理引导图像重建

Xinqi Zhang, Qiming Ma, Lihui Peng

发表机构 * Department of Automation, Tsinghua University（清华大学自动化系）

AI总结针对电容层析成像（ECT）数据驱动方法忽略电势场的问题，提出一个包含电势图的基准数据集，通过COMSOL-MATLAB管道生成20,000个样本，并验证其提升建模精度和鲁棒性。

详情

AI中文摘要

虽然深度学习显著推进了电容层析成像（ECT）的图像重建，但大多数数据驱动方法直接映射电容和介电常数分布，将传感器视为黑箱。这忽略了电势场——控制非线性和病态“软场”效应的基本物理联系。为解决此问题，我们提出一个电势增强的ECT基准数据集，旨在将ECT背后的潜在物理显式集成到学习过程中。通过COMSOL-MATLAB管道为八电极传感器生成示例，数据集包含20,000个随机样本，涵盖四种典型流型。关键的是，除了传统的电容向量和以图像形式描绘的介电常数分布外，每个样本还保留了八个激励方向的全场电势图。除了数据发布，我们还提供了ECT正问题和逆问题的说明性评估协议。通过在分布内（IID）和分布外（OOD）场景下的全面测试，我们系统地展示了包含电势图如何增强建模精度和鲁棒性。从根本上说，潜在场信息的显式包含显著降低了将物理定律集成到ECT建模中的障碍，从而为未来ECT图像重建的物理引导机器学习建立了标准化基础。

英文摘要

While deep learning has significantly advanced image reconstruction of Electrical Capacitance Tomography (ECT), most data-driven methods map directly between capacitance and permittivity distribution, treating the sensor as a black box. This overlooks the electric potential field -- the fundamental physical link governing the nonlinear and ill-posed ``soft-field'' effect. To address this, we propose an electric potential-augmented ECT benchmark dataset designed to explicitly integrate latent physics behind ECT into the learning process. Generated via a COMSOL-MATLAB pipeline for an eight-electrode sensor as an example, the dataset comprises 20,000 randomized samples across four typical flow patterns. Crucially, alongside the conventional capacitance vectors and permittivity distributions depicted as images, each sample preserves eight excitation-wise full-field potential maps. Beyond data release, we provide illustrative evaluation protocols for both forward and inverse problems of ECT. Through comprehensive testing on both in-distribution (IID) and out-of-distribution (OOD) scenarios, we systematically demonstrate how the inclusion of electric potential maps enhances modeling accuracy and robustness. Fundamentally, the explicit inclusion of latent field information significantly lowers the barrier to integrating physical laws into ECT modeling, thereby establishing a standardized foundation for future physics-guided machine learning of ECT image reconstruction.

URL PDF HTML ☆

赞 0 踩 0

2606.12223 2026-06-11 eess.SP 新提交

Characterization of Speech Imagery in Scalp EEG and Comparison with Motor Imagery

头皮脑电中言语想象的特性及其与运动想象的比较

Bob Van Dyck, Liuyin Yang, Qiang Sun, Ang Li, Marc M. Van Hulle

AI总结本研究通过头皮脑电分析言语想象的时空特征，并与手指运动想象对比，发现言语想象呈现更弱、更分散的α波增强，分类准确率较低，表明其主导模式不同于运动想象，更接近非发音任务相关活动。

详情

AI中文摘要

言语想象作为一种内源且本质语言性的脑机接口范式，在通信领域具有吸引力。然而，尽管兴趣日益增长，其主导的头皮脑电时空特征仍未得到充分表征。在此，我们探究了言语想象在头皮脑电中的表现，并将其与手指运动想象进行比较。利用一个包含言语想象、手指运动想象和无任务试次的受试者内数据集（所有试次采用相同的实验结构），我们分析了跨通道和时间的频带功率动态。手指运动想象在感觉运动区域显示出预期的对侧mu/alpha和低beta去同步化，而言语想象则显示出更弱、更分散的alpha主导增强。在归一化到各自条件自身的试后期后，言语相关的alpha增加在提示开始后仅发生适度变化，表明言语与无任务差异的大部分在指令期间已经存在。区分想象与无任务的分类器对言语想象的平均平衡准确率为0.563 ± 0.072，对运动想象为0.718 ± 0.127，运动想象对alpha/beta的依赖性比言语想象更强。总之，这些结果提供了言语想象在头皮脑电中更清晰的群体级特征，并表明其主导时空模式不同于手指运动想象，且更符合大量非发音任务相关贡献，而非清晰的发音运动类似物。

英文摘要

SSpeech imagery is attractive as a brain-computer interface paradigm for communication because it is endogenous and intrinsically linguistic. Yet despite growing interest, its dominant scalp-EEG spatiotemporal characteristics remain poorly characterized. Here, we asked how speech imagery appears in scalp EEG and compared it against finger motor imagery. Using a within-subject dataset containing speech imagery, finger motor imagery, and no-task trials recorded under the same trial structure, we analyzed band-power dynamics across channels and time. Finger motor imagery showed the expected contralateral mu/alpha and low-beta desynchronization over sensorimotor areas, whereas speech imagery showed a weaker, more distributed alpha-dominant increase. After normalization to each condition's own post-trial interval, the speech-related alpha increase changed only modestly after cue onset, indicating that much of the speech-versus-no-task difference was already present during the instruction period. A classifier discriminating imagery from no-task reached mean balanced accuracies of 0.563 $\pm$ 0.072 for speech imagery and 0.718 $\pm$ 0.127 for motor imagery, with a stronger alpha/beta dependence for motor imagery than for speech imagery. Together, these results provide a clearer group-level characterization of speech imagery in scalp EEG and indicate that its dominant spatiotemporal pattern differs from that of finger motor imagery and is more consistent with substantial non-articulatory task-related contributions than with a clear articulatory-motor analogue.

URL PDF HTML ☆

赞 0 踩 0

2606.12199 2026-06-11 eess.AS cs.CL cs.SD 新提交

Which Speech Representation Better Matches Text-Native Reasoning? A Study of Speech-Text Alignment on Frame Rate and Representation

哪种语音表示更匹配文本原生推理？帧率和表示对语音-文本对齐的研究

Zhen Ye, Xu Tan, Yiming Li, Guangyan Zhang, Chimin Chan, Haohe Liu, Zhengxi Liu, Hongzhan Lin, Zheqi Dai, Xinshen Zhang, Peiwen Sun, Qiuqiang Kong, Wei Xue

AI总结研究语音与文本模态差异中的时间粒度不匹配问题，提出因子化FSQ和轻量非自回归音频LM头以降低帧率，发现4.17Hz帧率结合中间层表示对齐在语音问答中表现最佳。

详情

Comments: Accepted by Interspeech 2026 long paper

AI中文摘要

口语对话模型通常以文本LLM骨干网络为基础，但在以语音而非文本为条件时，推理能力往往会下降。我们将这种模态差异部分归因于时间粒度不匹配：在语义匹配的情况下，语音标记在时间上是冗余的，且远长于文本，这稀释了每个标记的语义密度，削弱了文本原生的推理动态。我们将语音标记设计视为一个表示选择问题，并在固定信息速率下，在冻结的LLM骨干网络中扫描帧率。为了实现低帧率，我们引入了因子化FSQ和一个轻量级的非自回归音频LM头，在不牺牲高效预测的情况下将容量扩展到近300比特/帧。在消除瓶颈后，我们扫描帧率（50→2.08 Hz）和对齐深度，并观察到在4.17 Hz帧率下，结合中间层表示对齐，语音问答存在一致的最佳区域。

英文摘要

Spoken dialogue models typically start from text LLM backbones, yet reasoning often degrades when conditioning on speech instead of text. We attribute part of this modality gap to a temporal-granularity mismatch: speech tokens are temporally redundant and far longer than text under matched semantics, diluting per-token semantic density and weakening text-native reasoning dynamics. We study speech token design as a representation selection problem and sweep frame rates under a frozen LLM backbone with a fixed information rate. To make low frame rates feasible, we introduce factorized FSQ and a lightweight non-autoregressive audio LM head, scaling capacity to nearly 300\,bits/frame without sacrificing efficient prediction. With the bottleneck removed, we sweep frame rates (50$\rightarrow$2.08\,Hz) and alignment depth, and observe a consistent best regime for speech QA at 4.17\,Hz with intermediate-layer representation alignment.

URL PDF HTML ☆

赞 0 踩 0

2606.12151 2026-06-11 eess.SY 新提交

Lexicographic optimization for real-time CNC feedrate planning with coupled orientation handling

面向耦合姿态处理的实时CNC进给率规划的字典优化

Haijia Xu, Alexander Verl

AI总结提出一种无调参的字典进给率优化方法，通过稀疏性利用和顺序窗口策略实现实时执行，并统一处理刀具位置和姿态，在五轴自由曲面测试中相比工业CNC内核缩短加工时间超过15%。

详情

AI中文摘要

基于优化的进给率规划有潜力显著提高加工生产率，但其工业应用受到高计算成本和大量调参工作的限制。本文提出一种字典进给率优化原则，以无调参方式自适应平衡加工时间和运动平滑性。为进一步提高计算效率，通过结合稀疏性利用的公式和顺序窗口策略扩展优化方案，实现实时执行能力。此外，在优化框架内引入统一的刀具路径参数化方案，以同步处理刀具位置和姿态。对于五轴自由曲面测试轮廓，所提方法在Intel i5-3470 CPU上对具有100,000个约束检查点的长刀具路径优化进给率曲线耗时14秒，在高性能AMD 9950X CPU上处理一百万个检查点耗时52秒。与工业CNC内核相比，最终加工时间减少了超过15%。

英文摘要

Optimization-based feedrate planning offers the potential to significantly increase machining productivity, but its industrial adoption has been limited by high computational cost and extensive tuning effort. This paper proposes a lexicographic feedrate optimization principle that adaptively balances finishing time and motion smoothness in a tuning-free manner. To further improve computational efficiency, the optimization scheme is extended by a sparsity-exploiting formulation combined with a sequential windowing strategy, enabling real-time capable execution. In addition, a unified toolpath parameterization scheme is incorporated to synchronously handle tool position and orientation within the optimization framework. For a five-axis freeform test contour, the proposed method takes 14 s on an Intel i5-3470 CPU to optimize feedrate profiles for long toolpaths with 100,000 constraint checkpoints, and 52 s on a high-performance AMD 9950X CPU to handle one million checkpoints. Compared to an industrial CNC kernel, the resulting finishing time is reduced by more than 15 %.

URL PDF HTML ☆

赞 0 踩 0

2606.12123 2026-06-11 eess.IV 新提交

An Indoor Localization Technique Utilizing Passive Tags and 3-D Microwave Passive Radar Imaging

利用无源标签和三维微波无源雷达成像的室内定位技术

Quanfeng Wang, Alexander H. Paulus, Mei Song Tong, Thomas F. Eibert

AI总结提出一种利用三维近场无源雷达成像的隐私合规室内定位方法，通过无源标签增强散射场强度实现精确定位，并支持非理想成像场景。

详情

Comments: This paper is published in Progress In Electromagnetics Research (PIER), Vol.181, pp.89--98, 2024. This is the author's version which has not been fully edited and content may change prior to final publication. This repository copy is provided to comply with open-access requirements

AI中文摘要

提出一种利用三维近场（NF）无源雷达成像技术的隐私合规室内定位方法。该技术利用普遍辐射的电磁场进行成像，引入无源标签以增强散射场强度，从而在成像层面实现精确定位。该方法还支持非理想成像场景中的定位，例如有限带宽或高反射环境。基于几何特性，简单且低成本的无源标签能够直观地区分个体或物体。讨论了相关的隐私保护机制，其中无源标签的频率变化特性在隐私和伦理考量下提供了额外的灵活性和潜在应用。提出了多种形式的无源标签，仿真和实验结果均验证了所提出的无源标签设计的有效性。

英文摘要

A privacy-compliant indoor localization approach utilizing a 3-D near-field (NF) passive radar imaging technique is presented. This technique leverages ubiquitously radiated electromagnetic fields for imaging, with passive tags introduced to enhance the strength of scattering fields, thereby enabling precise localization at the imaging level. The method also supports localization in non-ideal imaging scenarios, such as for limited bandwidth or in highly-reflective environments. Based on their geometrical properties the simple and low-cost passive tags enable intuitive differentiation between individuals or objects. Associated privacy protection mechanisms are discussed, where the frequency-varying properties of the passive tags provide additional flexibility and potential applications under privacy and ethical considerations. Several forms of passive tags are presented, where both simulation and experimental results validate the effectiveness of the proposed passive tag designs.

URL PDF HTML ☆

赞 0 踩 0

2606.12118 2026-06-11 eess.SY 新提交

On the Dynamics and State Dependent Multiple Equilibria of a Post-Buckled Ultra-Flexible Inverted Pendulum on a Rotating Hub

旋转轮毂上后屈曲超柔性倒立摆的动力学与状态依赖的多重平衡

Prasanna S Gandhi, Dhruvi Joshi, Vivek Natarajan, Ravit Anand

AI总结针对旋转轮毂驱动的超柔性倒立摆，采用假设模态法推导大变形动力学方程，揭示后屈曲状态下平衡点随轮毂角变化的连续状态依赖特性，并通过实验验证。

详情

Comments: 9 pages, 8 figures, Appendix

AI中文摘要

具有超大变形的柔性元件系统展现出丰富的非线性动力学，并带来具有挑战性的控制问题，解决这些问题可以增强软体机器人、MEMS和生物医学应用等机电一体化系统的性能。本文考虑由旋转轮毂驱动的超柔性倒立摆的后屈曲动力学分析。我们首先使用假设模态法框架，考虑超大变形，推导出捕捉系统动力学的完整方程组，这对控制开发至关重要。采用约束拉格朗日公式进行推导。在轮毂角为零的完美倒立配置下，屈曲梁会呈现两个对称的稳定平衡和一个不稳定平衡。然而，随着轮毂角向两侧变化，平衡位置发生移动，最终其中两个消失，只剩下一个稳定平衡。我们利用动力学方程来表征这一有趣现象，展示了多重平衡的连续状态依赖性。此外，精心获取并讨论了平衡结果的实验对应物。此外，仿真结果捕捉了该系统的非线性动力学。总体而言，这项工作为未来超柔性机电一体化系统建立了一个具有控制适用模型的坚实数学基础。

英文摘要

Compliant element systems with ultra-large deformation display rich nonlinear dynamics and pose challenging control problems, which, when solved, could enable enhancements in several mechatronics applications, such as soft robotics, MEMS, and biomedical applications. This paper considers post-buckled dynamic analysis of an inverted ultra-flexible pendulum actuated by a rotary hub. We first derive a complete set of equations capturing the dynamics of the system, essential for control development, using the assumed modes method framework, considering ultra-large deformations. Constrained Lagrange formulation is used for the same. In the perfect inverted configuration with zero hub angle, the buckled beam would display two symmetric stable equilibria and one unstable. However, as the hub angle changes on either side, the equilibrium positions shift, and eventually two of them vanish, and we are left with only one stable equilibrium. We use the dynamic equations to characterize this interesting phenomenon, demonstrating the continuous state dependence of multiple equilibria. Furthermore, experimental counterparts of the equilibrium results are meticulously obtained and discussed. Moreover, simulation results capture the nonlinear dynamics of this system. Overall, the work establishes a solid mathematical foundation with a control-amenable model for futuristic ultra-compliant mechatronic systems.

URL PDF HTML ☆

赞 0 踩 0

2606.12085 2026-06-11 eess.SY 新提交

Zero Knowledge Verification of Transaction Guides for P2P Energy Trading in Distribution Networks

配电网中P2P能源交易指南的零知识验证

Hyunjoong Kim

AI总结针对P2P能源交易中灵敏度信息泄露与验证的矛盾，提出基于零知识证明的方法，在不暴露网络敏感数据的前提下验证交易指南的计算完整性，并通过区块链实现防篡改审计。

详情

Comments: 10 pages

AI中文摘要

点对点（P2P）能源交易需要网络感知协调，因为交易通过配电网物理实现。然而，基于灵敏度的协调导致机密性与可验证性之间的权衡，因为网络灵敏度可能揭示脆弱组件，而未公开的灵敏度则阻止参与者验证公用事业提供的交易指南。本文提出一种基于零知识证明的方法，用于验证网络约束交易指南相对于承诺的私有网络数据的计算完整性，而不暴露网络灵敏度信息。该指南定义了从符号分解灵敏度矩阵导出的允许注入和提取量，同时满足平衡、电压、线路潮流和最优性条件。这些条件被编码为算术电路，表示为R1CS约束和二次算术程序，并通过双线性配对进行验证。区块链承诺将批准的电路、公共输入、语句标识符、证明和验证结果绑定，以实现防篡改审计。所提出的证明证明了从承诺的网络数据正确计算指南；承诺网络数据的真实性通过明确的注册和认证假设处理。在修改后的IEEE 33节点系统上的案例研究表明，清算后满足网络约束，拒绝公共输入和证人不一致攻击，并且链上开销实用，证明大小为806字节。

英文摘要

Peer-to-peer (P2P) energy trading requires network-aware coordination because transactions are physically realized through distribution networks. However, sensitivity-based coordination causes a confidentiality-verifiability tradeoff, as network sensitivities may reveal vulnerable components while undisclosed sensitivities prevent participants from verifying utility-provided transaction guides. This paper proposes a zero-knowledge-proof-based method for verifying the computational integrity of network-constrained transaction guides with respect to committed private network data, without exposing network-sensitivity information. The guide defines admissible injection and withdrawal volumes derived from sign-decomposed sensitivity matrices while satisfying balance, voltage, line-flow, and optimality conditions. These conditions are encoded in an arithmetic circuit, represented as R1CS constraints and a quadratic arithmetic program, and verified using a bilinear pairing. Blockchain commitments bind the approved circuit, public inputs, statement identifiers, proof, and verification result for tamper-evident auditability. The proposed proof certifies correct guide computation from committed network data; the authenticity of the committed network data is handled through an explicit registration and attestation assumption. Case studies on a modified IEEE 33-bus system show satisfaction of network constraints after clearing, rejection of public-input and witness-inconsistency attacks, and practical on-chain overhead, with an 806-byte proof.

URL PDF HTML ☆

赞 0 踩 0

2606.12078 2026-06-11 eess.SP eess.SY 新提交

Deep Reinforcement Learning for Adaptive Power Allocation in ISAC Systems with Mobile Target

面向移动目标的ISAC系统中自适应功率分配的深度强化学习

Zhilin Fu, Sangmin Kim, Sangwon Hwang, Jihwan Moon, Jeongwon Kim, Jaewan Kim, Inkyu Lee

AI总结针对跟踪移动目标的集成感知与通信系统，提出基于软演员-评论家的深度强化学习方法，结合狄利克雷策略设计奖励函数，实现动态功率分配以提升跟踪性能并维持通信性能。

2606.12074 2026-06-11 cs.CV cs.AI eess.IV 新提交

Non-frontal face recognition using GANs and memristor-based classifiers

基于GAN和忆阻器分类器的非正面人脸识别

Semih Vazgecen, Cristian Sestito, Spyros Stathopoulos, Themis Prodromakis

发表机构 * Centre for Electronics Frontiers, Institute for Integrated Micro and Nano Systems, School of Engineering, The University of Edinburgh（爱丁堡大学工程学院集成微纳系统研究所电子前沿中心）

AI总结提出将轻量级GAN正面化与忆阻器神经形态识别结合，解决非正面人脸识别，在数据集上达96%准确率。

详情

Comments: 12 pages, 4 figures, 1 Supplementary (22 pages, 16 figures, 6 tables, 4 supplementary notes)

AI中文摘要

人脸识别系统通过深度学习技术取得了显著进展，在复杂场景中实现了高性能和鲁棒性。然而，这些方法带来了巨大的计算开销，限制了它们在资源受限平台（如无人机）上的原位适用性，而这些平台需要应对非正面人脸图像等挑战。基于忆阻器的神经形态系统已成为边缘AI应用的一种引人注目的方法，它将生物启发式处理与高效可扩展的计算相结合。在这项工作中，我们提出了一种人脸识别框架，通过集成基于轻量级生成对抗网络（GAN）的正面化处理和基于忆阻器的神经形态识别，来解决非正面姿态变化问题。在两个数据集上的实验结果表明，将对抗学习与忆阻技术相结合的有效性，实现了高达96%的识别准确率。所提出的方法缓解了传统AI的计算瓶颈，并为动态真实环境中的人脸识别提供了一种可扩展、高效的解决方案。

英文摘要

Face recognition systems have advanced significantly through deep learning techniques, delivering high performance and robustness in complex scenarios. However, these approaches incur substantial computational overhead, limiting their in situ applicability in resource-constrained platforms such as drones, where they can address challenges including non-frontal facial imagery. Memristor-based neuromorphic systems have emerged as a compelling approach for edge AI applications, combining biologically inspired processing with efficient and scalable computation. In this work, we propose a facial recognition framework that addresses non-frontal pose variations by integrating lightweight generative adversarial network (GAN)-based pose frontalisation with memristor-based neuromorphic recognition. The experimental results on two datasets demonstrate the effectiveness of combining adversarial learning with memristive technology, achieving up to 96% identification accuracy. The proposed approach alleviates the computational bottlenecks of conventional AI and offers a scalable, efficient solution for face recognition in dynamic real-world environments.

URL PDF HTML ☆

赞 0 踩 0

2606.12024 2026-06-11 eess.SP 新提交

Unlocking the Potential of Movable Antennas: General and Practical Antenna Position Optimization

解锁可移动天线的潜力：通用且实用的天线位置优化

Weidong Mei, Changhao Liu, Dong Wang, Xin Wei, Yiming Wu, Boyu Ning, Zhi Chen, Jun Fang, Rui Zhang

AI总结针对可移动天线位置优化缺乏可处理信道模型的问题，提出连续和离散两类通用优化算法，分别用于大规模阵列信号处理和小规模多径信道重构，并引入基于学习的低开销方案。

详情

AI中文摘要

近年来，可移动天线（MA）因其在受限区域内通过局部移动增强无线通信性能的潜力而受到广泛关注。然而，由于缺乏关于天线位置的可处理、解析且精确的信道模型，天线位置优化（APO）已成为MA的主要挑战。尽管现有工作已开发出多种APO算法，但大多数基于简化的理论信道模型，限制了其通用性。为解决这一挑战，本文针对不同目的提出了更通用且有效的APO算法，分别归类为连续APO和离散APO。连续APO主要应用于灵活阵列信号处理以提升大规模通信性能，而离散APO则应用于小规模多径信道重构。具体而言，离散APO将天线移动区域离散化为多个采样点，并基于逐点信道状态信息（CSI）采用离散算法确定最优MA位置，无需解析信道模型。为降低CSI获取开销，我们还提出了更高效的基于学习的APO算法，无需完整逐点CSI。最后，我们比较了所提算法的应用场景，并通过数值结果验证了其有效性。

英文摘要

Recently, movable antenna (MA) has attracted wide attention in wireless communications due to its potential in enhancing wireless communication performance via local movement within a confined region. However, antenna position optimization (APO) has emerged as a major challenge for MAs, due to the lack of a tractable, analytical, and accurate channel model in terms of antenna positions. Although existing works have developed various algorithms for APO, most of them are based on simplified theoretical channel models, which limit their generality. To address this challenge, in this article, we present more general and effective APO algorithms for different purposes, categorized as continuous APO and discrete APO, respectively. Continuous APO is mainly applied for flexible array signal processing to boost large-scale communication performance, while discrete APO is applied for small-scale multi-path channel reshaping. Specifically, the discrete APO discretizes the antenna movement region into multiple sampling points and employs discrete algorithms to determine the optimal MA positions based on the point-wise channel state information (CSI), without the need for an analytical channel model. To reduce the overhead for CSI acquisition, we also present more efficient learning-based APO algorithms that operate without requiring full point-wise CSI. Finally, we compare the application scenarios of the proposed algorithms and validate their effectiveness with numerical results.

URL PDF HTML ☆

赞 0 踩 0

2606.12000 2026-06-11 eess.SY 新提交

Physics-guided residual Kalman learning for state-of-charge estimation of lithium iron phosphate batteries

物理引导的残差卡尔曼学习用于磷酸铁锂电池荷电状态估计

Feng Guo, Luis D. Couto, Khiem Trad, Ru Hong, Guangdi Hu, Mohammadhosein Safari

AI总结针对磷酸铁锂电池SOC估计难题，提出物理引导的残差卡尔曼学习框架，结合扩展卡尔曼滤波与门控循环单元残差学习器，在公开数据集上实现1.19%的全局平均RMSE，较纯物理方法降低77%。

详情

Comments: 36 pages, 4 figures. Author accepted manuscript. Accepted for publication in Journal of Energy Chemistry, published by Elsevier. Final version of record available at DOI: https://doi.org/10.1016/j.jechem.2026.05.040

AI中文摘要

由于磷酸铁锂电池具有平坦的开路电压-荷电状态特性、温度依赖动态以及对初始化误差的敏感性，其准确的荷电状态估计仍然具有挑战性。本文提出了一种物理引导的残差卡尔曼学习框架，用于基于电化学模型的SOC估计。PRKL结合了面向控制的基于单粒子模型的扩展卡尔曼滤波（提供递归物理状态传播）和门控循环单元残差学习器（利用电化学状态和测量特征补偿结构化的EKF误差）。该框架在公开的石墨/LFP数据集上进行了评估，该数据集涵盖三种动态驾驶循环、从-10到50摄氏度的八个温度以及高达20%的初始化偏移。使用动态应力测试和联邦城市驾驶调度循环进行训练，并在同一电池数据集内使用补充联邦测试程序（US06）循环进行跨剖面测试，PRKL实现了1.19%的全局平均均方根误差，相对于纯物理EKF降低了77%。这些结果表明，电化学状态信息可以指导残差学习并改进LFP电池的递归SOC估计。本验证支持所研究数据集内的跨剖面鲁棒性，并为未来的跨电池、老化感知和嵌入式平台验证提供了基础。

英文摘要

Accurate state of charge (SOC) estimation of lithium iron phosphate (LFP) batteries remains challenging because of their flat open-circuit-voltage (OCV)-SOC characteristics, temperature-dependent dynamics, and sensitivity to initialization errors. Here, we propose a physics-guided residual Kalman learning (PRKL) framework for electrochemical-model-based SOC estimation. PRKL combines a control-oriented single-particle-model-based extended Kalman filter (EKF), which provides recursive physical state propagation, with a gated recurrent unit (GRU) residual learner that compensates structured EKF errors using electrochemical states and measurement features. The framework is evaluated on a public graphite/LFP dataset covering three dynamic drive cycles, eight temperatures from -10 to 50 degrees C, and initialization offsets up to 20 percent. Using dynamic stress test (DST) and federal urban driving schedule (FUDS) cycles for training and the supplemental federal test procedure (US06) cycle for cross-profile testing within the same cell dataset, PRKL achieves a global average root mean square error (RMSE) of 1.19 percent, corresponding to a 77 percent reduction relative to the physics-only EKF. These results show that electrochemical state information can guide residual learning and improve recursive SOC estimation for LFP batteries. The present validation supports cross-profile robustness within the studied dataset and provides a basis for future cross-cell, ageing-aware, and embedded-platform validation.

URL PDF HTML ☆

赞 0 踩 0

2606.11999 2026-06-11 eess.SY 新提交

Robust Zonotopic Control

鲁棒区域控制

Fouzi Tabouri, Kim Guldstrand Larsen, Christian Schilling

AI总结提出一种区域框架，通过单一凸优化问题合成一个鲁棒状态反馈控制器，保证在矩阵区域描述的参数不确定性下稳定所有系统，降低计算复杂度并减少保守性。

详情

Comments: accepted at CCTA 2026

AI中文摘要

我们提出了一种区域框架，用于合成一个单一的鲁棒状态反馈控制器，该控制器被证明能够稳定矩阵区域内的每一个系统，该区域描述了线性变化参数或参数不确定性。常见的鲁棒设计策略依赖于检查许多顶点模型或复杂的增益调度，导致离线计算和实现复杂度高。我们的方法找到一个单一的增益，在整个参数域内被证明有效，实现更简单，并且可以通过利用区域的结构来减少保守性。我们将鲁棒综合问题表述为一个针对区域表示的单一凸规划，并将实际性能要求（执行器约束、衰减率、干扰抑制）纳入同一综合阶段。在一个代表性的4状态示例的数值实验中，我们的控制器在参数域内提供了更大的稳定覆盖范围，达到了与更复杂设计相当的瞬态性能和控制努力，并且与公共顶点增益、$H_{\infty}$和$\mu$综合基线相比，显著减少了其他鲁棒方法所需的离线综合问题的数量和规模。

英文摘要

We propose a zonotopic framework for synthesizing a single robust state feedback controller that is certified to stabilize every plant inside a matrix zonotope, describing linearly varying parameters or parametric uncertainty. Common robust design strategies rely on checking many vertex models or on complex gain-scheduling, leading to high offline computation and implementation complexity. Our approach finds a single gain that is provably valid across the entire parameter domain, which is simpler to implement and can reduce conservatism by exploiting the structure of the zonotope. We formulate the robust synthesis as a single convex program tailored to the zonotope representation and incorporate practical performance requirements (actuator constraints, decay rate, disturbance attenuation) into the same synthesis stage. In numerical experiments on a representative 4-state example, our controller provides larger stability coverage across the parameter domain, attains comparable transient performance and control effort to more complex designs, and significantly reduces the number and scale of offline synthesis problems required by other robust approaches, compared to common-vertex gain, $H_{\infty}$, and $\mu$-synthesis baselines.

URL PDF HTML ☆

赞 0 踩 0

2606.11971 2026-06-11 eess.SY math.OC 新提交

Cooperative Switched Formation Control of Autonomous Vehicles: An Event-triggered Approach to Input Saturation and Time-delay Challenges

自主车辆协同切换编队控制：一种应对输入饱和与时延挑战的事件触发方法

Ziming Wang, Guanxuan Jiang, Yihuai Zhang, Karl H. Johansson, Apostolos I. Rikos

AI总结提出一种协同自适应编队控制框架，通过输入饱和补偿、时延补偿辅助系统及动态阈值事件触发控制，解决自主车辆在系统不确定性、物理约束和通信时延下的编队问题，并通过数值仿真和3D可视化验证有效性。

详情

AI中文摘要

本文提出了一种自主车辆（AV）的协同自适应编队控制框架，明确处理系统不确定性、输入饱和和通信时延。为克服转向和制动执行器固有的物理扭矩限制，引入输入饱和补偿机制，使非线性问题易于处理并提高控制可靠性。此外，设计了时延补偿辅助系统以减轻通信时延的影响并减少跟踪误差。我们的框架结合了动态阈值事件触发控制（ETC）策略以优化资源使用。同时，开发了不确定性观测器和对称障碍李雅普诺夫函数以确保鲁棒且安全的编队机动。最后，通过车辆编队的数值仿真以及展示动态车队重构过程的3D可视化视频，验证了所提方法的有效性。

英文摘要

This paper presents a collaborative adaptive formation control framework for autonomous vehicles (AVs), that explicitly handles system uncertainties, input saturation, and communication delays. To overcome the inherent physical torque limits of steering and braking actuators, an input saturation compensation mechanism is introduced to render nonlinearities tractable and improve control reliability. Additionally, a delay-compensating auxiliary system is designed to mitigate the effects of communication delays and reduce tracking errors. Our framework incorporates a dynamic-threshold event-triggered control (ETC) strategy to optimize resource usage. Additionally, uncertainty observers and symmetric barrier Lyapunov functions are developed to ensure robust and safe formation maneuvers. Finally, the effectiveness of the proposed approach is validated through numerical simulations of vehicle formations, complemented by a 3D visualization video demonstrating the dynamic fleet reconfiguration process.

URL PDF HTML ☆

赞 0 踩 0

2606.11970 2026-06-11 eess.SP 新提交

Low-Density EEG for Seizure Detection: Evaluating CNN-RNN Architectures on a Behind-the-Ear Montage Setup

低密度脑电图用于癫痫检测：评估耳后导联设置下的CNN-RNN架构

Annika Stiehl, Patrick Wingert, Nicolas Weeger, Nicole Ille, Christian Uhl, Stefan Geißelsöder

AI总结本研究使用CNN-RNN混合深度学习模型，在模拟耳后导联的低密度脑电图上实现癫痫自动检测，提出CNN-Merged模型，在TUSZ数据集上达到85.89%的ROC AUC和79.11%的平衡准确率，验证了低密度配置下鲁棒检测的可行性。

详情

Comments: Accepted to the 34th European Signal Processing Conference (EUSIPCO 2026)

AI中文摘要

癫痫影响全球超过5000万人，凸显了自动化癫痫检测系统的需求，以减轻临床医生负担并提高患者癫痫日记的准确性。然而，在可穿戴脑电图应用中，由于低密度电极配置的空间分辨率有限、信噪比降低以及缺乏多样化的公开训练数据集，可靠检测仍然具有挑战性。本研究利用从坦普尔大学癫痫语料库（TUSZ，v2.0.3）导出的模拟耳后导联，探讨了混合深度学习架构在自动癫痫检测中的有效性。我们对几种CNN-RNN模型（包括基于LSTM和GRU的变体）在多个脑电图导联上进行了系统比较，以评估它们补偿因电极配置减少而固有的空间信息损失的能力。所提出的CNN-Merged模型整合了时间和频谱特征表示，在保留测试集上表现出优越性能，实现了85.89%的ROC AUC和79.11%的平衡准确率。此外，该模型在不同参考导联上表现出强大的鲁棒性，有效弥合了传统全头皮记录与资源受限可穿戴系统之间的性能差距。这些发现证实了混合深度学习模型作为低密度脑电图应用中稳健、患者独立癫痫检测的有前景途径的潜力。

英文摘要

Epilepsy affects over 50 million individuals globally, underscoring the need for automated seizure detection systems that can alleviate clinicians workload and enhance the accuracy of patient seizure diaries. In wearable EEG applications, however, reliable detection remains challenging due to the limited spatial resolution of low-density electrode configurations, reduced signal-to-noise ratios, and the scarcity of diverse, publicly available training datasets. This study investigates the efficacy of hybrid deep learning architectures for automated seizure detection using a simulated behind-the-ear montage derived from the Temple University Seizure Corpus (TUSZ, v2.0.3). We conduct a systematic comparison of several CNN-RNN models, including LSTM- and GRU-based variants, across multiple EEG montages to evaluate their capacity to compensate for the loss of spatial information inherent to reduced electrode configurations. The proposed CNN-Merged model, which integrates temporal and spectral feature representations, demonstrates superior performance, achieving a ROC AUC of 85.89% and a balanced accuracy of 79.11% on the held-out test set. Furthermore, the model exhibits strong robustness across different reference montages, effectively bridging the performance gap between conventional full-scalp recordings and resource-constrained wearable systems. These findings substantiate the potential of hybrid deep learning models as a promising avenue toward robust, patient-independent seizure detection in low-density EEG applications.

URL PDF HTML ☆

赞 0 踩 0

2606.11948 2026-06-11 eess.SY 新提交

Robust Tuning of Model Predictive Control for MMC-Based High-Voltage Power Systems

基于MMC的高压电力系统模型预测控制的鲁棒整定

Victor Daniel Reyes Dreke, Rahul Rane, Aleksandra Lekić

AI总结针对MMC-HVDC系统中模型不确定性导致的稳定性问题，提出一种通过凸线性优化求解最优加权矩阵的MPC鲁棒整定方法，在不增加在线计算负担下增强鲁棒性，经RTDS验证优于传统LQR方法。

详情

Comments: Submitted to IEEE Transactions on Power Delivery

AI中文摘要

基于模块化多电平换流器（MMC）的高压直流（HVDC）输电系统已成为现代电力系统的关键拓扑。MMC的动态特性表现出强多变量耦合、约束和不确定性，促使使用模型预测控制（MPC）来增强电流调节性能。然而，MPC整定并非易事，且不能固有地保证稳定性或鲁棒性，特别是在存在模型不确定性的情况下。本文提出一种MPC整定方法，确保在有界模型不确定性下的鲁棒性能。该方法通过求解凸线性优化问题来计算最优加权矩阵Q、R和P，保证最优性和可重复性。因此，在不增加在线计算负担的情况下增强了鲁棒性。通过在点对点HVDC系统的实时数字仿真器（RTDS）模型上进行测试，验证了该方法的有效性。结果表明，与基于LQR的传统MPC整定相比，性能得到了改善。

英文摘要

High-voltage direct current (HDVC) transmission systems based on modular multilevel converters (MMCs) have become a key topology in modern power systems. The dynamics of MMCs exhibit strong multivariable coupling, constraints, and uncertainties, motivating the use of model predictive control (MPC) to enhance current regulation performance. However, MPC tuning is nontrivial and does not inherently guarantee stability or robustness, particularly in the presence of model uncertainties. This paper proposes a MPC tuning method that ensures robust performance under bounded model uncertainties. This method solves a convex linear optimization problem to compute the optimal weighting matrices Q, R, and P ensuring optimality and reproducibility. As a result, robustness is enhanced without increasing the online computation burden. The effectiveness of the method is validated through testing on a real-time digital simulator (RTDS) model of a point-to-point HVDC system. Results demonstrate improved performance compared to conventional LQR-based MPC tuning.

URL PDF HTML ☆

赞 0 踩 0

2606.11914 2026-06-11 eess.SP cs.LG 新提交

NARRAS: Edge-Triggered Distributed Inference for CSI-Based Localization in Vehicular IoT Networks

NARRAS：车载物联网中基于CSI的定位的边缘触发分布式推理

Rodrigo Oliver, Ricardo Vazquez Alvarez, Alejandro Lancho, Stefano Rini

AI总结针对分布式天线阵列CSI定位中资源受限问题，提出NARRAS边缘触发分布式推理策略，各阵列本地决策是否上报观测，通过可微活动惩罚和通道图正则化实现预算控制，在低活动率下提升定位精度。

详情

Comments: 10 pages, 5 figures, 5 tables. Under review at the IEEE Internet of Things Journal

AI中文摘要

基于CSI的定位与空间分布式天线阵列存在基本的资源权衡。每个阵列可以提供丰富的信道视图，但当只有少数阵列携带有用信息时，将所有阵列的观测结果转发到融合中心是浪费的，且共享上行链路仅支持有限数量的同时传输。我们让每个阵列本地决定其当前观测是否值得报告，受限于平均活跃发射机数量的预算。我们将这种抽象称为边缘触发分布式推理（ETDI）。它捕获了一类更广泛的任务导向通信问题，其中资源受限设备共享接入信道以完成共同推理任务。我们将ETDI实例化用于基于CSI的定位，这是车载物联网中的常见场景。空间分布的远程天线阵列（RAA）将来自用户设备（UE）传输的本地信道状态信息（CSI）编码为潜在特征，融合中心根据报告的特征子集估计UE位置。我们提出NARRAS，一种去中心化的报告策略，其中每个RAA将其最近观测的循环摘要与其最后传输的潜在记忆相结合。训练通过可微活动惩罚和验证校准的确定性阈值来控制显式活动预算，并使用通道图正则化来塑造潜在几何结构。实验表明，在可比的上行链路活动下，NARRAS比学习型和启发式稀疏报告策略提高了定位精度，而密集全报告模型仍然作为有用的无预算参考。在低活动率下，图正则化进一步减少了高百分位定位误差，表明几何感知的潜在表示在稀疏报告下更加鲁棒。

英文摘要

CSI-based localization with spatially distributed antenna arrays exposes a basic resource trade-off. Each array can provide a rich view of the channel, but forwarding observations from all arrays to a fusion center is wasteful when only a few carry useful information, and the shared uplink supports only a limited number of simultaneous transmissions. We let each array decide locally whether its current observation is worth reporting, subject to a budget on the average number of active transmitters. We refer to this abstraction as Edge-Triggered Distributed Inference (ETDI). It captures a broader class of task-oriented communication problems where resource-constrained devices share an access channel for a common inference task. We instantiate ETDI for CSI-based localization, a common scenario in vehicular IoT networks. Spatially distributed remote antenna arrays (RAAs) encode local channel state information (CSI) from user equipment (UE) transmissions into latent features, and the fusion center estimates the UE position from the subset of reported features. We propose NARRAS, a decentralized reporting policy in which each RAA combines a recurrent summary of its recent observations with a memory of the last latent it transmitted. Training controls an explicit activity budget through differentiable activity penalties and validation-calibrated deterministic thresholds, and uses channel-chart regularization to shape the latent geometry. Experiments show that, at comparable uplink activity, NARRAS improves localization accuracy over learned and heuristic sparse-reporting strategies, while dense full-report models remain useful budget-free references. In low-activity regimes, chart regularization further reduces high-percentile localization errors, suggesting that geometry-aware latent representations are more robust under sparse reporting.

URL PDF HTML ☆

赞 0 踩 0

2606.11905 2026-06-11 eess.SY 新提交

Risk-Aware AoII-Based Scheduling with Hybrid Transmission for a Semi-Markov Source

基于混合传输的半马尔可夫源的风险感知AoII调度

Saeid Sadeghi Vilni, Risto Wichman

AI总结针对半马尔可夫源的多接收器状态更新系统，提出风险感知调度策略，通过模型基和无模型方法最小化平均错误信息年龄、风险比和传输成本的加权和，数值结果验证其优于基线并利用单播与广播混合传输。

2606.11890 2026-06-11 eess.SP 新提交

Efficiency Meets Reliability: Enhanced Generalized Interleaved Transform for Random Multiplexing

效率与可靠性兼具：面向随机复用的增强型广义交织变换

Ming Wang, Shufeng Li, Lei Liu, Yao Ge, Yuhao Chi

AI总结针对6G高移动场景，提出一种存储高效且高可靠的随机复用通信系统RM-MAMP，通过混沌映射交织器和双级高阶置换多项式交织器将存储从O(N)降至O(1)，并设计交织变换框架提升等效信道矩阵的非相干性和分集增益，在严重时变信道下获得超过4dB增益。

详情

Comments: This paper has been accepted for publication in Chinese Journal of Electronics, 2026

AI中文摘要

为满足6G无线系统在高移动场景下的需求，本文提出了一种存储高效且高可靠的随机复用（RM）通信系统设计。原则上，采用跨域记忆近似消息传递（CD-MAMP）的RM可以通过构建全密集等效信道矩阵实现复制最大后验（MAP）最优性能。然而，其实际实现受到传统交织器的大存储开销以及严重病态信道下性能下降的阻碍，现有相关工作（聚焦于交织和变换设计）无法同时解决这些问题。为克服这些难题，我们开发了一种存储高效且高可靠的系统，将RM与CD-MAMP集成，称为RM-MAMP。具体而言，我们提出了一种具有定量参数选择准则的Logistic混沌映射交织器，以及一种双级高阶置换多项式交织器，两者在实现与完全随机交织器几乎相同的误码率（BER）的同时，将交织器存储从O(N)降至O(1)，并显著降低交织器信令开销。我们进一步提出了一种高可靠的交织变换框架，包括交织相位扰动变换和多层交织耦合变换，以增强等效信道矩阵的非相干性和分集度。仿真结果表明，所提出的存储高效交织器保持了与完全随机交织器相当的BER性能，而高可靠变换在严重时变信道下提供了超过4dB的增益，证实了增强型RM-MAMP系统在降低存储开销和提升鲁棒性方面的双重优势。

英文摘要

To meet the demands of 6G wireless systems operating in high-mobility scenarios, this paper presents a design of a random multiplexing (RM) communication system that is both storage-efficient and highly reliable. In principle, RM with cross-domain memory approximate message passing (CD-MAMP) can achieve replica maximum a posteriori (MAP)-optimal performance by constructing a fully dense equivalent channel matrix. However, its practical implementation is hindered by the large storage overhead of conventional interleavers and by performance degradation in severely ill-conditioned channels, which existing related work (focusing on interleaving and transform designs) fails to address simultaneously. To overcome these issues, we develop a storage-efficient and highly reliable system that integrates RM with CD-MAMP, referred to as RM-MAMP. Specifically, we propose a Logistic chaotic mapping interleaver with a quantitative parameter-selection criterion, and a dual-stage high-order permutation polynomial interleaver, both of which achieve nearly identical bit-error-rate (BER) as fully random interleavers while reducing the interleaver storage from O(N) to O(1) and significantly lowering interleaver signaling overhead. We further propose a highly reliable interleaved transform framework, comprising an interleaved phase perturbation transform and a multi-layer interleaved coupled transform, to enhance the incoherence and diversity of the equivalent channel matrix. Simulation results show that the proposed storage-efficient interleavers maintain BER performance comparable to fully random interleavers, while the highly reliable transforms provide over 4 dB gain in severely time-varying channels, confirming the dual benefits of reduced storage overhead and improved robustness for the enhanced RM-MAMP system.

URL PDF HTML ☆

赞 0 踩 0

2606.11883 2026-06-11 eess.SY 新提交

CBF-based Driving Assistance for Traffic Flow Stabilization

基于CBF的交通流稳定驾驶辅助

Hayate Irie, Masaki Inoue, Banri Okita, Akira Yamaguchi, Tomohiro Taki, Takashi Hatano

AI总结提出一种分层控制系统，下层利用控制屏障函数确保跟车安全间距，上层基于数据驱动激活下层控制器，以抑制交通拥堵，并通过真实数据仿真验证有效性。

2606.11879 2026-06-11 eess.SP 新提交

On the Robustness of AFBM Sensing to Power Amplifier Nonlinearities

关于AFBM感知对功率放大器非线性的鲁棒性

Eya Gourar, Henrique L. Senger, Gustavo P. Gonçalves, Kuranage R. R. Ranasinghe, Hyeon Seok Rou, Bruno S. Chang, Yahia Medjahdi, Giuseppe T. F. de Abreu, Didier Le Ruyet

AI总结研究功率放大器非线性对仿射滤波器组调制（AFBM）感知性能的影响，发现AFBM的模糊函数和整体感知性能对非线性具有显著不敏感性，使其成为硬件受限的集成感知与通信（ISAC）系统的可行候选。

详情

Comments: Submitted to to the 2026 Asilomar Conference on Signals, Systems, and Computers

AI中文摘要

我们研究了功率放大器（PA）非线性对仿射滤波器组调制（AFBM）感知性能的影响。虽然AFBM为集成感知与通信（ISAC）提供了几个有利特性——包括降低带外发射（OOBE）、低峰均功率比（PAPR）以及对双弥散（DD）信道效应的自然鲁棒性——但减轻波形失真通常需要高度线性的PA。这与要求高发射功率以实现可靠感知的ISAC应用产生了根本矛盾。我们的分析结果表明，有效AFBM调制矩阵的结构决定了失真如何在模糊函数（AF）中传播。此外，仿真表明，AFBM的AF和整体感知性能对这种非线性仍然非常不敏感。这些发现凸显了AFBM的鲁棒性，使其成为受硬件损伤限制的实际ISAC部署的高度可行候选。

英文摘要

We investigate the impact of power amplifier (PA) nonlinearities on the sensing performance of affine filter bank modulation (AFBM). While AFBM offers several advantageous properties for integrated sensing and communications (ISAC) - including reduced out-of-band emission (OOBE), low peak-to-average power ratio (PAPR), and natural robustness to doubly-dispersive (DD) channel effects - mitigating waveform distortion typically requires highly linear PAs. This creates a fundamental contradiction with ISAC applications, which demand high transmit power for reliable sensing. Our analytical results reveal that the structure of the effective AFBM modulation matrix dictates how distortion propagates within the ambiguity function (AF). Furthermore, simulations demonstrate that both the AF and the overall sensing performance of AFBM remain remarkably insensitive to such nonlinearities. These findings highlight the robustness of AFBM, making it a highly viable candidate for practical ISAC deployments constrained by hardware impairments.

URL PDF HTML ☆

赞 0 踩 0

2606.11858 2026-06-11 eess.SY 新提交

Koopman-based NMPC for Virtually Coupled Train Control System

基于Koopman的非线性模型预测控制在虚拟耦合列车控制系统中的应用

Yiwen Zhang, Lorenzo Calogero, Shukai Li, Alessandro Rizzo, Anton V. Proskurnikov

AI总结提出基于Koopman的非线性模型预测控制（K-NMPC）方法，通过闭式可观测函数将列车动力学提升至有限维Koopman空间，将在线最优控制问题转化为二次规划，显著降低计算时间，实现与离散NMPC相当的控制性能。

详情

Comments: to be presented at IFAC World Congress 2026

AI中文摘要

本文研究了一种基于Koopman的分析型非线性模型预测控制（K-NMPC）方法，用于虚拟耦合列车系统的跟踪控制。通过闭式可观测函数，将包含列车动力学、速度和控制输入限制、乘客舒适度约束以及碰撞避免的非线性列车运动模型系统地提升到有限维Koopman空间。在沿移位预测轨迹冻结仿射参数变化提升预测器后，在线最优控制问题被求解为一个可以高效求解的二次规划。所提出的K-NMPC与时间离散NMPC方案进行了基准测试，显示出相当的控制性能，同时显著减少了在线计算时间，并在实际虚拟耦合列车控制系统中具有强大的实时实现潜力。

英文摘要

This paper investigates an analytical Koopman-based nonlinear model predictive control (K-NMPC) approach for tracking control of virtually coupled train systems. A nonlinear train movement model incorporating train dynamics, speed and control input limits, passenger comfort constraints, and collision avoidance is systematically lifted into a finite-dimensional Koopman space through closed-form observable functions. After freezing the affine parameter-varying lifted predictor along the shifted predicted trajectory, the online optimal control problem is solved as a quadratic program that can be solved efficiently. The proposed KNMPC is benchmarked against a time-discrete NMPC scheme, demonstrating comparable control performance with significantly reduced online computation time and strong potential for real-time implementation in practical virtually coupled train control systems.

URL PDF HTML ☆

赞 0 踩 0