arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2601.12210 2026-06-11 eess.SY math.OC 版本更新

Solvability of the Output Corridor Control Problem by Pulse-Modulated Feedback

脉冲调制反馈下输出走廊控制问题的可解性

Alexander Medvedev, Anton V. Proskurnikov

AI总结针对具有特定结构的三阶正系统，证明脉冲调制反馈在稳态下总能解决输出走廊控制问题，并用于评估药代动力学-药效学模型的患者安全性可行性。

详情

Comments: shortened version will be presented at IFAC World Congress 2026, Busan, Korea

AI中文摘要

本文处理了将正时不变单输入单输出系统的输出维持在预定义走廊内的问题。对于具有特定结构的三阶系统，证明了在稳态条件下通过脉冲调制反馈该问题总是可解的。所得结果用于评估患者特异性药代动力学-药效学模型在患者安全性方面的可行性。研究了一组捕捉神经肌肉阻滞剂动态的Wiener模型，以探讨是否可以通过临床可接受的序贯药物剂量（推注）将其驱动到期望的输出走廊。结果表明，非线性药效学部分的一个参数的低值是导致检测到的模型不可行性的原因。

英文摘要

The problem of maintaining the output of a positive time-invariant single-input single-output system within a predefined corridor of values is treated. For third-order plants possessing a certain structure, it is proven that the problem is always solvable under stationary conditions by means of pulse-modulated feedback. The obtained result is utilized to assess the feasibility of patient-specific pharmacokinetic-pharmacodynamic models with respect to patient safety. A population of Wiener models capturing the dynamics of a neuromuscular blockade agent is studied to investigate whether or not they can be driven into the desired output corridor by clinically acceptable sequential drug doses (boluses). It is demonstrated that low values of a parameter in the nonlinear pharmacodynamic part lie behind the detected model infeasibility.

URL PDF HTML ☆

赞 0 踩 0

2601.08136 2026-06-11 cs.LG eess.SY 版本更新

Reverse Flow Matching: A Unified Framework for Online Reinforcement Learning with Diffusion and Flow Policies

反向流匹配：基于扩散与流策略的在线强化学习统一框架

Zeyang Li, Sunbochen Tang, Navid Azizan

AI总结针对在线强化学习中扩散与流策略缺乏目标样本的问题，提出反向流匹配框架，通过后验均值估计和Langevin Stein算子构造控制变量，统一了噪声期望与梯度期望两类方法，并扩展到流策略，提升训练效率与稳定性。

详情

Comments: ICML 2026 (Spotlight); Code: this https URL

AI中文摘要

扩散和流策略因其强大的表达能力在在线强化学习（RL）中日益重要，但高效训练它们仍是一个关键挑战。在线RL与标准生成建模的一个根本区别在于缺乏来自Q函数定义的目标玻尔兹曼分布的直接样本。为此，针对扩散策略提出了两类看似不同的方法：噪声期望族，使用噪声的加权平均作为训练目标；梯度期望族，使用Q函数梯度的加权平均。然而，这些目标如何正式相关，或者它们能否被综合成一个更通用的公式，目前尚不清楚。在本文中，我们提出了一个统一框架——反向流匹配（RFM），该框架严格解决了在没有直接目标样本的情况下训练扩散和流模型的问题。通过采用反向推理视角，我们将训练目标表述为给定中间噪声样本的后验均值估计问题。关键地，我们引入Langevin Stein算子来构造零均值控制变量，推导出一类具有相同期望的通用估计器。我们表明，现有的噪声期望和梯度期望方法只是这个更广泛类别中的两个具体实例。这种统一观点带来了两个关键进展：它将针对玻尔兹曼分布的能力从扩散策略扩展到流策略，并使得能够原则性地结合Q值和Q梯度信息形成有效估计器，从而提高训练效率和稳定性。我们将RFM实例化以在在线RL中训练流策略，并在连续控制基准测试中展示了相比扩散策略基线的改进性能。

英文摘要

Diffusion and flow policies are gaining prominence in online reinforcement learning (RL) due to their expressive power, yet training them efficiently remains a critical challenge. A fundamental difficulty that distinguishes online RL from standard generative modeling is the lack of direct samples from the target Boltzmann distribution defined by the Q-function. To address this, two seemingly distinct families of methods have been proposed for diffusion policies: a noise-expectation family, which uses a weighted average of noise as the training target, and a gradient-expectation family, which employs a weighted average of Q-function gradients. However, it remains unclear how these objectives are formally related, or whether they can be synthesized into a more general formulation. In this paper, we propose a unified framework, reverse flow matching (RFM), which rigorously addresses the problem of training diffusion and flow models without direct target samples. By adopting a reverse inferential perspective, we formulate the training target as a posterior mean estimation problem given an intermediate noisy sample. Crucially, we introduce Langevin Stein operators to construct zero-mean control variates, deriving a general class of estimators that share the same expectation. We show that existing noise-expectation and gradient-expectation methods are simply two specific instances within this broader class. This unified view yields two key advancements: it extends the capability of targeting Boltzmann distributions from diffusion to flow policies, and it enables the principled combination of Q-value and Q-gradient information to form an effective estimator, thereby improving training efficiency and stability. We instantiate RFM to train a flow policy in online RL and demonstrate improved performance on continuous-control benchmarks compared to diffusion policy baselines.

URL PDF HTML ☆

赞 0 踩 0

2512.19245 2026-06-11 eess.SY cs.RO 版本更新

Vision-Aided Relative State Estimation for Approach and Landing on a Moving Platform with Inertial Measurements

基于视觉辅助的相对状态估计用于移动平台进近与着陆的惯性测量

Tarek Bouazza, Alessandro Melis, Soulaimane Berkane, Robert Mahony, Tarek Hamel

AI总结提出一种级联观测器，结合SO(3)互补滤波和线性Riccati观测器，利用IMU和单目相机估计无人机与移动平台的相对位姿和速度，在持续激励条件下实现几乎全局渐近稳定。

详情

Comments: 13 pages, 4 figures. To appear in proceedings of IFAC World Congress 2026

AI中文摘要

本文解决了在进近和着陆过程中，无人机与经历任意三维运动的平面平台之间的相对位置、姿态和速度的估计问题。该估计依赖于安装在两个系统上的惯性测量单元（IMU）的测量值，假设存在合适的通信信道来交换数据，以及由机载单目相机提供的视觉信息，从中提取平台中心的方位（视线方向）和其平面表面的法向量。我们提出了一种级联观测器，在$\mathbf{SO}(3)$上采用互补滤波器来重构相对姿态，随后使用线性Riccati观测器进行相对位置和速度估计。在持续激励条件下建立了两个观测器的收敛性，并证明了级联是几乎全局渐近和局部指数稳定的。我们进一步将设计扩展到平台旋转限制在其法向轴的情况，并表明可以利用其测量的线性加速度来恢复剩余不可观测的旋转角。提供了该情况下局部指数收敛的充分条件。通过大量仿真验证了所提出的观测器。

英文摘要

This paper tackles the problem of estimating the relative position, orientation, and velocity between a UAV and a planar platform undergoing arbitrary 3D motion during approach and landing. The estimation relies on measurements from Inertial Measurement Units (IMUs) mounted on both systems, assuming there is a suitable communication channel to exchange data, together with visual information provided by an onboard monocular camera, from which the bearing (line-of-sight direction) to the platform's center and the normal vector of its planar surface are extracted. We propose a cascade observer with a complementary filter on $\mathbf{SO}(3)$ to reconstruct the relative attitude, followed by a linear Riccati observer for relative position and velocity estimation. Convergence of both observers is established under persistently exciting conditions, and the cascade is shown to be almost globally asymptotically and locally exponentially stable. We further extend the design to the case where the platform's rotation is restricted to its normal axis and show that its measured linear acceleration can be exploited to recover the remaining unobservable rotation angle. A sufficient condition for local exponential convergence in this setting is provided. The proposed observers are validated through extensive simulations.

URL PDF HTML ☆

赞 0 踩 0

2512.13765 2026-06-11 eess.IV cs.AI cs.LG 版本更新

Towards Deep Learning Surrogate for the Forward Problem in Electrocardiology: A Scalable Alternative to Physics-Based Models

面向心电学正问题的深度学习代理模型：一种可扩展的物理模型替代方案

Shaheim Ogbomo-Harmitt, Cesare Magnetti, Chiara Spota, Jakub Grzelak, Oleg Aslanidi

AI总结提出基于注意力机制的序列到序列深度学习框架，作为心电学正问题的代理模型，从心脏电压传播图预测心电图信号，在2D组织模拟中达到高精度（平均R²=0.99±0.01），为物理模型提供可扩展、低成本的替代方案。

详情

Comments: Accepted to CinC conference 2025

AI中文摘要

心电学中的正问题，即从心脏电活动计算体表电位，传统上使用基于物理的模型（如双域或单域方程）求解。虽然准确，但这些方法计算成本高，限制了其在实时和大规模临床中的应用。我们提出一个概念验证的深度学习（DL）框架，作为正问题求解器的高效代理。该模型采用基于时间依赖注意力机制的序列到序列架构，从心脏电压传播图预测心电图（ECG）信号。引入了一种混合损失函数，结合Huber损失和谱熵项，以保持时域和频域的保真度。使用包含健康、纤维化和缝隙连接重塑条件的2D组织模拟，模型实现了高精度（平均$R^2 = 0.99 \pm 0.01$）。消融研究证实了卷积编码器、时间感知注意力和谱熵损失的贡献。这些发现突显了DL作为物理求解器的可扩展、低成本替代方案的潜力，适用于临床和数字孪生应用。

英文摘要

The forward problem in electrocardiology, computing body surface potentials from cardiac electrical activity, is traditionally solved using physics-based models such as the bidomain or monodomain equations. While accurate, these approaches are computationally expensive, limiting their use in real-time and large-scale clinical applications. We propose a proof-of-concept deep learning (DL) framework as an efficient surrogate for forward solvers. The model adopts a time-dependent, attention-based sequence-to-sequence architecture to predict electrocardiogram (ECG) signals from cardiac voltage propagation maps. A hybrid loss combining Huber loss with a spectral entropy term was introduced to preserve both temporal and frequency-domain fidelity. Using 2D tissue simulations incorporating healthy, fibrotic, and gap junction-remodelled conditions, the model achieved high accuracy (mean $R^2 = 0.99 \pm 0.01$). Ablation studies confirmed the contributions of convolutional encoders, time-aware attention, and spectral entropy loss. These findings highlight DL as a scalable, cost-effective alternative to physics-based solvers, with potential for clinical and digital twin applications.

URL PDF HTML ☆

赞 0 踩 0

2512.12580 2026-06-11 cs.CR eess.SP hep-th math-ph math.RA 版本更新

Cryptographic transformations over polyadic rings

基于多元环的密码学变换

Steven Duplij, Qiang Guo, Na Fu

AI总结提出基于多元环的密码学范式，利用参数到元数的映射Φ(a,b)构建非单射、非满射且多值的复杂关系，设计两种加密过程，通过多元量化模拟信号传输信息，增强安全性。

详情

Comments: 21 pages, revtex 4.2

AI中文摘要

本文介绍了一种基于非派生多元代数结构的新型密码学范式。传统密码系统依赖于群、环或域内的二元运算，其良好理解的特性可被密码分析利用。为克服这些漏洞，我们提出转向多元环，它通过允许更高元数的运算来推广经典环：一个$m$元加法和一个$n$元乘法。我们方法的基础是多元整数的构造——普通整数的同余类，赋予这样的$m$元和$n$元运算。一个关键创新是参数到元数的映射$\Phi(a,b)=(m,n)$，它将定义同余类的参数$(a,b)$与代数闭包所需的特定元数联系起来。该映射在数学上是复杂的：它是非单射、非满射且多值的。这种复杂、非唯一的关系构成了所提密码系统安全性的核心。我们提出了两种具体的加密过程，利用这种结构，将明文编码在多元环的参数中，并通过多元量化的模拟信号传输信息。在一种方法中，明文与加法元数$m_{i}$相关联，并通过此类信号的求和来保护；在另一种方法中，明文与环参数$a_{i}$相关联，并通过它们的乘法来保护。在这两种情况下，多元运算的“量化”性质生成方程组，对于拥有正确密钥的合法接收者来说直接明了，但对于没有密钥的攻击者来说极其困难。由此产生的框架有望大幅提高密码安全性。这项工作为这类新型加密方案奠定了理论基础，并突显了它们在构建鲁棒的下一代密码协议方面的潜力。

英文摘要

This article introduces a novel cryptographic paradigm based on nonderived polyadic algebraic structures. Traditional cryptosystems rely on binary operations within groups, rings, or fields, whose well-understood properties can be exploited in cryptanalysis. To overcome these vulnerabilities, we propose a shift to polyadic rings, which generalize classical rings by allowing operations of higher arity: an $m$-ary addition and an $n$-ary multiplication. The foundation of our approach is the construction of polyadic integers -- congruence classes of ordinary integers endowed with such $m$-ary and $n$-ary operations. A key innovation is the parameter-to-arity mapping $\Phi(a,b)=(m,n)$, which links the parameters $(a,b)$ defining a congruence class to the specific arities required for algebraic closure. This mapping is mathematically intricate: it is non-injective, non-surjective, and multivalued. This complex, non-unique relationship forms the core of the proposed cryptosystem's security. We present two concrete encryption procedures that leverage this structure by encoding plaintext within the parameters of polyadic rings and transmitting information via polyadically quantized analog signals. In one method, plaintext is linked to the additive arity $m_{i}$ and secured using the summation of such signals; in the other, it is linked to a ring parameter $a_{i}$ and secured using their multiplication. In both cases, the "quantized" nature of polyadic operations generates systems of equations that are straightforward for a legitimate recipient with the correct key but exceptionally difficult for an attacker without it. The resulting framework promises a substantial increase in cryptographic security. This work establishes the theoretical foundation for this new class of encryption schemes and highlights their potential for constructing robust, next-generation cryptographic protocols.

URL PDF HTML ☆

赞 0 踩 0

2507.11919 2026-06-11 eess.SP 版本更新

Time-Frequency Mode Decomposition for Wind Turbine Vibration Monitoring under Variable Speed Operation

变速运行下风力发电机振动监测的时频模态分解

Wei Zhou, Wei-Jian Li, Desen Zhu, Hongbin Xu, Wei-Xin Ren

AI总结提出时频模态分解（TFMD）方法，通过分割短时傅里叶变换支撑区域并重构模态，无需预设模态数即可分离变速运行下的非平稳转子阶次分量，在合成和实验数据中验证了低重构误差和有效谐波提取能力。

详情

AI中文摘要

变速运行下的风力发电机振动监测需要分离非平稳的转子阶次分量，这些分量的频率和运行区间取决于运行状态。这些分量在短时傅里叶变换（STFT）平面中占据局部支撑区域，而非固定的频谱带或连续脊线。本研究提出时频模态分解（TFMD），一种基于分割的方法，用于估计连通的STFT支撑区域并从每个区域重构一个模态。TFMD选择高幅值的STFT系数，通过连通分量标记进行分组，滤除小区域，通过掩膜扩张和冲突解决扩展保留的支撑区域，并通过逆STFT重构模态。在包含六个运行状态的合成响应中，TFMD分离了每个状态的分量，并在无需预先指定分量数量的情况下产生低重构误差。在受控的风力发电机叶片应变实验中，首次分解重构了九个模态，其峰值频率接近标称的每转一次频率，且能量集中在相应的运行区间。残差分解进一步揭示了较弱的谐波结构。这些结果支持TFMD作为变速运行下振动分析的实用候选方法，而海上现场应用需要在环境载荷和实测运行参考下进行验证。

英文摘要

Wind turbine vibration monitoring under variable speed operation requires separating nonstationary rotor-order components whose frequencies and operating intervals depend on operating state. These components can occupy local support regions in the short-time Fourier transform (STFT) plane rather than fixed spectral bands or continuous ridges. This study presents time-frequency mode decomposition (TFMD), a segmentation-based method that estimates connected STFT support regions and reconstructs one mode from each region. TFMD selects STFT coefficients with high magnitude, groups them by connected component labeling, filters small regions, expands retained support regions with mask dilation and conflict resolution, and reconstructs modes by inverse STFT. In a synthetic response with six operating states, TFMD separates the components of each state and produces low reconstruction error without specifying the number of components in advance. In a controlled wind turbine blade strain experiment, the first decomposition reconstructs nine modes whose peak frequencies lie near the nominal once per revolution frequencies and whose energies are concentrated in the corresponding operating intervals. Residual decomposition further reveals weaker harmonic structure. These results support TFMD as a practical candidate for vibration analysis under variable speed operation, while offshore field use requires validation under environmental loading and with measured operating references.

URL PDF HTML ☆

赞 0 踩 0

2510.23320 2026-06-11 eess.AS cs.CL cs.SD 版本更新

LibriConvo: Simulating Conversations from Read Literature for ASR and Diarization

LibriConvo：从阅读文献模拟对话用于ASR和说话人日志

Máté Gedeon, Péter Mihajlik

AI总结提出LibriConvo合成对话语音语料库，基于说话人感知模拟对话框架构建，用于说话人日志和ASR基准测试，包含240.1小时音频，基线实验显示Sortformer在日志中优于pyannote，Fast Conformer-CTC在ASR中优于Whisper。

详情

Comments: Accepted by TSD 2026

AI中文摘要

我们介绍了LibriConvo，一个用于说话人日志和自动语音识别（ASR）的合成对话语音语料库，通过在数据集和基准测试设置中实例化先前提出的说话人感知模拟对话（SASC）框架构建而成。本文的主要贡献是基于该框架的语料库构建流程和基准测试。为了使数据更适合下游ASR和说话人日志，我们使用外部语音活动检测从英语CallHome估计对话时间统计信息，压缩长停顿，按书籍分组LibriTTS话语以改善局部语义连续性，并通过空间合理性启发式选择房间脉冲响应。生成的语料库包含240.1小时的音频，涉及830个说话人的1496个对话，划分为说话人不重叠的训练、验证和测试集。我们报告了说话人日志和ASR的基线结果。在测试集上，Sortformer在说话人日志中优于pyannote流水线（DER 11.1%对比24.4%）。对于ASR，使用序列化输出训练微调的Fast Conformer-CTC XLarge模型实现了7.29%的WER和6.97%的cpWER，优于零样本Whisper-large-v3。这些结果使LibriConvo成为研究合成对话语音和评估多说话人语音处理系统的实用基准。

英文摘要

We introduce LibriConvo, a synthetic conversational speech corpus for speaker diarization and automatic speech recognition (ASR), built by instantiating the previously proposed Speaker-Aware Simulated Conversation (SASC) framework in a dataset and benchmarking setting. The main contribution of this paper is a corpus construction pipeline and benchmark derived from that framework. To make the data more suitable for downstream ASR and diarization, conversational timing statistics are estimated from English CallHome using external voice activity detection, long pauses are compressed, LibriTTS utterances are grouped by book to improve local semantic continuity, and room impulse responses are selected with a spatial-plausibility heuristic. The resulting corpus contains 240.1 hours of audio across 1,496 dialogues involving 830 speakers, partitioned into speaker-disjoint train, validation, and test splits. We report baseline results for both diarization and ASR. On the test split, Sortformer outperforms the pyannote pipeline in diarization (11.1\% vs.~24.4\% DER). For ASR, a Fast Conformer-CTC XLarge model fine-tuned with Serialized Output Training achieves 7.29\% WER and 6.97\% cpWER, outperforming zero-shot Whisper-large-v3. These results position LibriConvo as a practical benchmark for studying synthetic conversational speech and for evaluating multi-speaker speech processing systems.

URL PDF HTML ☆

赞 0 踩 0

2510.17816 2026-06-11 eess.SP cs.CV 版本更新

Cross-Domain Multi-Person Human Activity Recognition via Near-Field Wi-Fi Sensing

基于近场Wi-Fi感知的跨域多人人体活动识别

Xin Li, Jingzhi Hu, Yinghui He, Hongbo Wang, Jin Gan, Jun Luo

AI总结针对Wi-Fi多人活动识别中跨域适应难题，提出WiAnchor框架，通过预训练扩大类间特征间隔、微调阶段引入锚点匹配机制过滤个体干扰，实现缺失类别下的高效跨域识别，准确率超90%。

详情

AI中文摘要

基于Wi-Fi的人体活动识别（HAR）提供了极大的便利，并已成为一个蓬勃发展的研究领域，然而Wi-Fi固有的粗空间分辨率严重阻碍了其区分多个目标的能力。通过利用近场主导效应，为每个目标通过其个人Wi-Fi设备建立专用传感链路，为原生流量下的多人HAR提供了一种有前景的解决方案。然而，由于近场信号的目标特定特性和不规则模式，HAR神经网络模型需要微调（FT）以实现跨域适应，这在某些类别不可用时变得特别具有挑战性。在本文中，我们提出WiAnchor，一种新颖的训练框架，用于在活动类别不完整的情况下实现高效的跨域适应。该框架通过三个步骤处理嵌入不规则时间信息的Wi-Fi信号：在预训练期间，我们扩大类间特征间隔以增强活动的可分离性；在微调阶段，我们创新性地引入一种锚点匹配机制用于跨域适应，根据不完整的活动类别过滤目标特定干扰，而不是试图从中提取完整特征；最后，基于输入样本与锚点的特征级相似性进一步改进识别。我们构建了一个全面的数据集来彻底评估WiAnchor，在缺失活动类别的情况下实现了超过90%的跨域准确率。

英文摘要

Wi-Fi-based human activity recognition (HAR) provides substantial convenience and has emerged as a thriving research field, yet the coarse spatial resolution inherent to Wi-Fi significantly hinders its ability to distinguish multiple subjects. By exploiting the near-field domination effect, establishing a dedicated sensing link for each subject through their personal Wi-Fi device offers a promising solution for multi-person HAR under native traffic. However, due to the subject-specific characteristics and irregular patterns of near-field signals, HAR neural network models require fine-tuning (FT) for cross-domain adaptation, which becomes particularly challenging with certain categories unavailable. In this paper, we propose WiAnchor, a novel training framework for efficient cross-domain adaptation in the presence of incomplete activity categories. This framework processes Wi-Fi signals embedded with irregular time information in three steps: during pre-training, we enlarge inter-class feature margins to enhance the separability of activities; in the FT stage, we innovate an anchor matching mechanism for cross-domain adaptation, filtering subject-specific interference informed by incomplete activity categories, rather than attempting to extract complete features from them; finally, the recognition of input samples is further improved based on their feature-level similarity with anchors. We construct a comprehensive dataset to thoroughly evaluate WiAnchor, achieving over 90% cross-domain accuracy with absent activity categories.

URL PDF HTML ☆

赞 0 踩 0

2510.03520 2026-06-11 cs.LG cs.AI eess.SY 版本更新

Certifiable Safe RLHF: Semantic Grounding and Fixed Penalty Constraint Optimization for Safer LLM Alignment

可认证安全RLHF：基于语义基础与固定惩罚约束优化的更安全大语言模型对齐

Kartik Pandit, Sourav Ganguly, Arnesh Banerjee, Shaahin Angizi, Arnob Ghosh

AI总结针对现有RLHF方法依赖奖励/成本函数和双变量调优导致性能敏感且缺乏可证明安全保证的问题，提出CS-RLHF，通过语义基础成本模型和固定惩罚约束优化，实现可认证安全对齐，效率提升至少5倍。

详情

AI中文摘要

确保安全是大语言模型（LLMs）的基本要求。在增强模型输出效用与减轻其潜在危害之间取得适当平衡是一个复杂且持续的挑战。当代方法通常将这个问题形式化为约束马尔可夫决策过程（CMDP）框架，并采用成熟的CMDP优化技术。然而，这些方法表现出两个显著的限制。首先，它们对奖励和成本函数的依赖使得性能对底层评分机制高度敏感，而该机制必须捕捉语义含义，而不是被表面关键词触发。其次，基于CMDP的训练需要调整双变量，这一过程计算成本高昂，并且对于可能通过对抗性越狱利用的固定双变量，不提供任何可证明的安全保证。为了克服这些限制，我们引入了可认证安全RLHF（CS-RLHF），它引入了一个在大规模语料库上训练的成本模型，以分配基于语义的安全分数。与基于拉格朗日的方法相比，CS-RLHF采用了一种修正的基于惩罚的公式。该设计借鉴了约束优化中精确惩罚函数理论，其中约束满足直接通过适当选择的惩罚项来强制执行。通过适当缩放的惩罚，可以在优化器处保证安全约束的可行性，从而消除了双变量更新的需要。实证评估表明，CS-RLHF优于最先进的LLM模型响应，对正常和越狱提示的效率至少提高5倍。

英文摘要

Ensuring safety is a foundational requirement for large language models (LLMs). Achieving an appropriate balance between enhancing the utility of model outputs and mitigating their potential for harm is a complex and persistent challenge. Contemporary approaches frequently formalize this problem within the framework of Constrained Markov Decision Processes (CMDPs) and employ established CMDP optimization techniques. However, these methods exhibit two notable limitations. First, their reliance on reward and cost functions renders performance highly sensitive to the underlying scoring mechanism, which must capture semantic meaning rather than being triggered by superficial keywords. Second, CMDP-based training entails tuning dual-variable, a process that is both computationally expensive and does not provide any provable safety guarantee for a fixed dual variable that can be exploitable through adversarial jailbreaks. To overcome these limitations, we introduce Certifiable Safe-RLHF (CS-RLHF) that introduces a cost model trained on a large-scale corpus to assign semantically grounded safety scores. In contrast to the lagrangian-based approach, CS-RLHF adopts a rectified penalty-based formulation. This design draws on the theory of exact penalty functions in constrained optimization, wherein constraint satisfaction is enforced directly through a suitably chosen penalty term. With an appropriately scaled penalty, feasibility of the safety constraints can be guaranteed at the optimizer, eliminating the need for dual-variable updates. Empirical evaluation demonstrates that CS-RLHF outperforms state-of-the-art LLM model responses rendering at-least 5 times efficient against nominal and jail-breaking prompts

URL PDF HTML ☆

赞 0 踩 0

2509.01459 2026-06-11 eess.SY cs.SE 版本更新

Semantic Technologies in Practical Demand Response: An Informational Requirement-based Roadmap

实际需求响应中的语义技术：基于信息需求的路标图

Ozan Baris Mulayim, Anand Krishnan Prakash, Yuvraj Agarwal, Mario Bergés, Marco Pritoni, Derek Supple, Steve Schaefer, Mitali Shah

AI总结本文针对商业建筑激励型需求响应，通过形式化本体评估方法定义信息需求，评估现有本体（Brick、DELTA、EFOnt、CIM）的不足，并提出扩展与整合路标图以增强语义互操作性。

详情

Comments: Accepted at ACM eEnergy 2026. Not yet published/

AI中文摘要

向现代高效未来电网的转型依赖于分布式能源资源和需求响应（DR）等应用的无缝协调。虽然这种转变带来了更大的灵活性，但也增加了电网的复杂性和去中心化程度，需要有效协调数百万硬件资产和软件代理。实现这一愿景需要互操作性方面的进步，以确保这些异构系统能够在不产生过高定制成本的情况下进行通信。语义互操作性旨在通过利用本体来保证交换数据的无歧义解释。然而，当前商业建筑和DR领域的本体面临两个关键限制。首先，现有本体通常在没有反映实际DR需求的正式框架下开发。其次，通用本体与DR专用本体的集成方案大多停留在概念层面，缺乏形式化或实证验证。在本文中，我们开始通过应用形式化本体评估/开发方法来定义语义互操作性所需的信息需求（IRs），以美国商业建筑中基于激励的DR项目为起点，来填补这些空白。我们识别了与基于激励的DR每个阶段相关的IRs。利用这些IRs，我们评估了现有本体（特别是Brick、DELTA、EFOnt和CIM）对DR参与操作需求的支持程度。我们的发现揭示了当前本体与实际DR需求之间的显著差距，并提出了这些本体必要扩展和整合的路标图。这项工作最终旨在增强当今和未来智能电网的互操作性，从而促进DR系统可扩展地集成到电网复杂的运行框架中。

英文摘要

The transition to a modern and efficient future grid relies on the seamless coordination of distributed energy resources and applications such as Demand Response (DR). While this transformation enables greater flexibility, it increases grid complexity and decentralization, requiring the effective coordination of millions of hardware assets and software agents. Realizing this vision demands advances in interoperability to ensure these heterogeneous systems can communicate without prohibitive customization costs. Semantic interoperability aims to address this by leveraging ontologies to guarantee the unambiguous interpretation of exchanged data. However, current ontologies in the commercial building and DR domains face two critical limitations. First, existing ontologies are often developed without a formal framework that reflects real-world DR requirements. Second, proposals for integrating general and DR-specific ontologies remain mostly conceptual, lacking formalization or empirical validation. In this paper, we begin to address these gaps by applying a formal ontology evaluation/development approach to define the information requirements (IRs) necessary for semantic interoperability, focusing on incentive-based DR programs for commercial buildings in the United States as a starting point. We identify the IRs associated with each stage of the incentive-based DR. Using these IRs, we evaluate how well existing ontologies, specifically Brick, DELTA, EFOnt, and CIM support the operational needs of DR participation. Our findings reveal substantial gaps between current ontologies and practical DR requirements and we propose a roadmap of necessary extensions and integrations for these ontologies. This work ultimately aims to enhance the interoperability of today's and future smart grid, thereby facilitating scalable integration of DR systems into the grid's complex operational framework.

URL PDF HTML ☆

赞 0 踩 0

2507.21164 2026-06-11 cs.LG cs.AI eess.IV stat.ML 版本更新

OCSVM-Guided Representation Learning for Unsupervised Anomaly Detection

OCSVM引导的无监督异常检测表示学习

Nicolas Pinon (MYRIAD), Robin Trombetta (MYRIAD), Carole Lartizien (MYRIAD)

AI总结提出一种将表示学习与可解析求解的一类SVM耦合的方法，通过定制损失函数直接对齐潜在特征与决策边界，在MNIST-C和脑MRI病变检测任务上展现了鲁棒性和性能。

详情

AI中文摘要

无监督异常检测（UAD）旨在无需标签数据检测异常，这在许多机器学习应用中是必要的，因为异常样本稀少或不可用。大多数最先进的方法分为两类：基于重构的方法（通常重构异常过于完美）和与密度估计器解耦的表示学习（可能遭受次优特征空间）。虽然一些近期方法尝试耦合特征学习和异常检测，但它们通常依赖替代目标、限制核选择或引入近似，从而限制了表达能力和鲁棒性。为解决这一挑战，我们提出了一种新颖方法，通过自定义损失公式将表示学习与可解析求解的一类SVM（OCSVM）耦合，该损失直接使潜在特征与OCSVM决策边界对齐。该模型在两个任务上评估：基于MNIST-C的新基准，以及具有挑战性的脑MRI细微病变检测任务。与大多数关注图像级别大而高信号病变的方法不同，我们的方法成功针对小而非高信号的病变，同时我们评估体素级别的指标，处理了更具临床相关性的场景。两个实验评估了对领域偏移的鲁棒性形式，包括MNIST-C中的损坏类型以及MRI中的纹理或人群年龄变化。结果展示了我们提出模型的性能和鲁棒性，突显了其在通用UAD和现实医学成像应用中的潜力。源代码可在此https URL获取。

英文摘要

Unsupervised anomaly detection (UAD) aims to detect anomalies without labeled data, a necessity in many machine learning applications where anomalous samples are rare or not available. Most state-of-the-art methods fall into two categories: reconstruction-based approaches, which often reconstruct anomalies too well, and decoupled representation learning with density estimators, which can suffer from suboptimal feature spaces. While some recent methods attempt to couple feature learning and anomaly detection, they often rely on surrogate objectives, restrict kernel choices, or introduce approximations that limit their expressiveness and robustness. To address this challenge, we propose a novel method that couples representation learning with an analytically solvable One-Class SVM (OCSVM), through a custom loss formulation that directly aligns latent features with the OCSVM decision boundary. The model is evaluated on two tasks: a \deleted{new} benchmark based on MNIST-C, and a challenging brain MRI \deleted{subtle} lesion detection task. Unlike most methods that focus on large, hyperintense lesions at the image level, our approach succeeds to target small, non-hyperintense lesions, while we evaluate voxel-wise metrics, addressing a more clinically relevant scenario. Both experiments evaluate a form of robustness to domain shifts, including corruption types in MNIST-C and texture or population age variations in MRI. Results demonstrate performance and robustness of our proposed model, highlighting its potential for general UAD and real-world medical imaging applications. The source code is available at this https URL.

URL PDF HTML ☆

赞 0 踩 0

2405.06995 2026-06-11 cs.SD cs.CV cs.MM eess.AS 版本更新

Benchmarking Cross-Domain Audio-Visual Deception Detection

跨域音视频欺骗检测基准测试

Xiaobao Guo, Zitong Yu, Nithish Muthuchamy Selvaraj, Bingquan Shen, Adams Wai-Kin Kong, Alex C. Kot

AI总结提出首个跨域音视频欺骗检测基准，评估不同场景下的泛化能力，并设计MM-IDGM算法和Attention-Mixer融合方法提升性能。

详情

Comments: 17 pages

AI中文摘要

自动欺骗检测对于帮助人类准确评估真实性和识别欺骗行为至关重要。传统的接触式技术，如测谎仪，依赖生理信号来确定个体陈述的真实性。然而，自动欺骗检测的最新进展表明，从音频和视频模态中提取的多模态特征在公开数据集上可能优于人类观察者。尽管有这些积极发现，现有音视频欺骗检测方法在不同场景下的泛化能力仍 largely unexplored。为弥补这一空白，我们提出了首个跨域音视频欺骗检测基准，使我们能够评估这些方法在现实场景中的泛化能力。我们使用了广泛采用的音频和视觉特征以及不同的架构进行基准测试，比较了单到单和多到单域泛化性能。为了进一步利用来自多个源域的数据进行训练的影响，我们研究了三种域采样策略，包括域同步、域交替和逐域采样，用于多到单域泛化评估。我们还提出了一种通过最大化模态编码器之间的梯度内积来增强泛化性能的算法，称为“MM-IDGM”。此外，我们提出了Attention-Mixer融合方法来提高性能，并相信这一新的跨域基准将促进未来音视频欺骗检测的研究。

英文摘要

Automated deception detection is crucial for assisting humans in accurately assessing truthfulness and identifying deceptive behavior. Conventional contact-based techniques, like polygraph devices, rely on physiological signals to determine the authenticity of an individual's statements. Nevertheless, recent developments in automated deception detection have demonstrated that multimodal features derived from both audio and video modalities may outperform human observers on publicly available datasets. Despite these positive findings, the generalizability of existing audio-visual deception detection approaches across different scenarios remains largely unexplored. To close this gap, we present the first cross-domain audio-visual deception detection benchmark, that enables us to assess how well these methods generalize for use in real-world scenarios. We used widely adopted audio and visual features and different architectures for benchmarking, comparing single-to-single and multi-to-single domain generalization performance. To further exploit the impacts using data from multiple source domains for training, we investigate three types of domain sampling strategies, including domain-simultaneous, domain-alternating, and domain-by-domain for multi-to-single domain generalization evaluation. We also propose an algorithm to enhance the generalization performance by maximizing the gradient inner products between modality encoders, named ``MM-IDGM". Furthermore, we proposed the Attention-Mixer fusion method to improve performance, and we believe that this new cross-domain benchmark will facilitate future research in audio-visual deception detection.

URL PDF HTML ☆

赞 0 踩 0

2505.05305 2026-06-11 eess.SY 版本更新

Optimal Microgrid Sizing of Offshore Renewable Energy Sources for Offshore Platforms and Coastal Communities

面向海上平台和沿海社区的离岸可再生能源微电网优化规模

Ann Mary Toms, Xingpeng Li, Kaushik Rajashekara

AI总结提出一种集成深度神经网络电池退化模块的微电网优化器(REMO-DNN-BD)，用于确定离岸微电网中可再生能源发电和储能的最优容量，最小化全生命周期成本并保证高可靠性。

详情

AI中文摘要

全球能源格局正朝着可再生能源和先进储能解决方案发生变革性转变，这是由对可持续和弹性电力系统的迫切需求所驱动的。孤立的离岸社区，如岛屿和海上平台，传统上依赖大陆电网或柴油发电机，将从可再生能源整合中显著受益。有前景的离岸可再生能源技术包括风力涡轮机、波浪能和潮汐能转换器以及浮动光伏系统，并配以电池储能系统等存储解决方案。本文介绍了一种可再生能源微电网优化器（REMO），这是一种旨在确定离岸微电网中可再生能源发电和储能资源最优规模的工具。此类模型的一个关键挑战是准确核算电池退化成本。为解决这一问题，REMO模型集成了一个基于深度神经网络的电池退化（DNN-BD）模块，该模块考虑了环境温度、充放电速率、荷电状态、放电深度和电池健康等变量。在六个测试区域上的仿真表明，REMO-DNN-BD方法在保持高可靠性和可持续性的同时最小化了全生命周期能源成本，使其成为离岸微电网系统的可行设计解决方案。

英文摘要

The global energy landscape is undergoing a transformative shift towards renewable energy and advanced storage solutions, driven by the urgent need for sustainable and resilient power systems. Isolated offshore communities, such as islands and offshore platforms, which traditionally rely on mainland grids or diesel generators, stand to gain significantly from renewable energy integration. Promising offshore renewable technologies include wind turbines, wave and tidal energy converters, and floating photovoltaic systems, paired with a storage solution like battery energy storage systems. This paper introduces a renewable energy microgrid optimizer (REMO), a tool designed to identify the optimal sizes of renewable generation and storage resources for offshore microgrids. A key challenge in such models is accurately accounting for battery degradation costs. To address this, the REMO model integrates a deep neural network-based battery degradation (DNN-BD) module, which factors in variables like ambient temperature, charge/discharge rates, state of charge, depth of discharge and battery health. Simulations on six test regions demonstrate that the REMO-DNN-BD approach minimizes lifetime energy costs while maintaining high reliability and sustainability, making it a viable design solution for offshore microgrid systems.

URL PDF HTML ☆

赞 0 踩 0

2503.06578 2026-06-11 cs.RO eess.SY 版本更新

Non-Equilibrium MAV-Capture-MAV via Time-Optimal Planning and Reinforcement Learning

非平衡MAV捕获MAV：基于时间最优规划和强化学习

Canlun Zheng, Zhanyu Guo, Zikang Yin, Chunyu Wang, Zhikun Wang, Shiyu Zhao

AI总结针对高机动性目标捕获难题，本文设计紧凑型捕获MAV，结合时间最优规划与强化学习方法，在非稳定状态下实现目标捕获。

详情

AI中文摘要

由于飞行MAV（微型飞行器）的捕获具有挑战性和广阔应用前景，近年来受到越来越多的研究关注。尽管已有进展，现有工作的一个关键限制是捕获策略通常相对简单且受平台性能约束。本文研究能够捕获高机动性目标的控制策略。在非稳定条件下实现目标捕获这一独特挑战使其区别于传统的追逃和制导问题。在本研究中，我们从较大的MAV平台过渡到一种专门设计的、配备定制发射装置的紧凑型捕获MAV，同时保持高机动性。我们探索了时间最优规划（TOP）和强化学习（RL）方法。仿真表明，TOP提供高机动性和更短的轨迹，而RL在实时适应性和稳定性方面表现优异。此外，RL方法已在真实场景中测试，成功实现了即使在非稳定状态下的目标捕获。

英文摘要

The capture of flying MAVs (micro aerial vehicles) has garnered increasing research attention due to its intriguing challenges and promising applications. Despite recent advancements, a key limitation of existing work is that capture strategies are often relatively simple and constrained by platform performance. This paper addresses control strategies capable of capturing high-maneuverability targets. The unique challenge of achieving target capture under unstable conditions distinguishes this task from traditional pursuit-evasion and guidance problems. In this study, we transition from larger MAV platforms to a specially designed, compact capture MAV equipped with a custom launching device while maintaining high maneuverability. We explore both time-optimal planning (TOP) and reinforcement learning (RL) methods. Simulations demonstrate that TOP offers highly maneuverable and shorter trajectories, while RL excels in real-time adaptability and stability. Moreover, the RL method has been tested in real-world scenarios, successfully achieving target capture even in unstable states.

URL PDF HTML ☆

赞 0 踩 0

2604.15028 2026-06-11 eess.SY astro-ph.EP astro-ph.IM cs.SY

Nonlinear backstepping with saturation for low-thrust station-keeping of libration point orbits

António Nunes, Sérgio Brás, Pedro Batista

2604.11252 2026-06-11 eess.SP

A Unified Approach to Human-Scale Blockage and Scattering Analysis in Sub-THz Propagation With Application to RF Sensing

Stefano Savazzi, Fabio Paonessa, Sanaz Kianoush, Alessandro Nordio, Giuseppe Virone

详情

DOI: 10.1109/TAP.2026.3690188
Comments: under review for possible publication in IEEE Transactions on Antennas and Propagation

英文摘要

RF sensing exploits phase-sensitive measurements of stray electromagnetic (EM) fields from wireless devices across various frequency bands to detect EM blockage and to reconstruct and map the surrounding environment in 2D/3D. Although blockage effects caused by objects or human motion are well-studied in ISM bands and frequencies up to 60~GHz, there is a significant lack of research for frequencies above 100~GHz. The paper proposes a unified signal processing framework for RF sensing in the sub-THz D-band (105--175~GHz), explicitly integrating EM blockage and scattering as a single process through the birth-death dynamics of multipath components (MPCs). The framework extracts, associates, and classifies MPCs from angle-delay measurements using statistically grounded detection and classification, enabling human-scale sensing from a single radio link. The modeling and classification of MPCs, along with large-scale EM parameters, are demonstrated through an indoor measurement campaign using multiple test targets. Experimental results show that newly formed, attenuated, and suppressed MPCs can be reliably identified with millimeter-scale delay resolution. Static object localization achieves average positioning errors of $8-20$~cm depending on range and material, while passive human localization yields errors of 12-17cm at 0.5m and 26-30cm at 2m, respectively. The proposed framework demonstrates that accurate sensing and localization are feasible at sub-THz frequencies using a single link.

URL PDF HTML ☆

赞 0 踩 0

2601.07436 2026-06-11 eess.SP cs.LG physics.optics

PIDT: Physics-Informed Digital Twin for Optical Fiber Parameter Estimation

Zicong Jiang, Magnus Karlsson, Erik Agrell, Christian Häger

2512.00898 2026-06-11 eess.SP

Covariance-Guided DFT Beam Selection for Beamspace ESPRIT in Hybrid mmWave Sensor Arrays

协方差引导的DFT波束选择用于混合毫米波传感器阵列中的波束空间ESPRIT

Rıfat Volkan Şenyuva

AI总结针对混合模拟-数字毫米波传感器阵列中射频链和训练波束有限的问题，提出一种协方差引导的离散傅里叶变换（DFT）波束选择框架，通过短混合训练阶段重建去噪全孔径协方差矩阵，并在每个粗扇区内选择连续DFT波束块，以在严格波束预算下集中信号能量并保持有效孔径，从而提升波束空间ESPRIT的角度估计精度、降低失败率并实现良好的精度-时间权衡。

详情

DOI: 10.1109/JSEN.2026.3700043
Journal ref: IEEE Sensors Journal, Early Access, Jun. 2026
Comments: 13 pages, 9 figures, 2 tables. Accepted for publication in IEEE Sensors Journal

AI中文摘要

对于混合模拟-数字毫米波传感器阵列，精确的到达方向估计对于定位、环境感知和传感应用的测量波束控制至关重要。然而，实际硬件中射频链和训练波束数量有限，使得接近全数字阵列的角度分辨率变得困难。本文针对混合毫米波接收机中的波束空间ESPRIT，开发了一种协方差引导的离散傅里叶变换（DFT）波束选择框架。一个短的混合训练阶段实现虚拟中心对称子阵列，并得到样本协方差，该协方差通过前向-后向平均、非负最小二乘功率和噪声拟合以及Toeplitz正半定投影进行处理，以重建去噪的全孔径协方差矩阵。然后利用该协方差在每个粗扇区内评分并选择小的连续DFT波束块，这些波束块在严格的波束预算下集中信号能量并保持有效孔径。所选波束馈入一个稀疏波束空间ESPRIT阶段，该阶段仅对实际可用的相邻波束对进行操作，因此整体复杂度由单个低维ESPRIT调用主导。对于具有三条路径的32单元均匀线性阵列的蒙特卡洛模拟表明，在所考虑的场景中，与基于相同码本和估计器的扇区化基线相比，所提方法可以缩小与克拉美-罗界的差距，降低失败率，并提供有利的精度-运行时间权衡。对于本文研究的酉DFT码本，细级波束选择器简化为协方差引导的连续能量窗口规则；更广泛的评分公式也适用于由硬件非理想性引起的非酉有效波束字典。

英文摘要

Accurate direction-of-arrival estimation with hybrid analog--digital millimeter-wave sensor arrays is important for localization, environment sensing, and measurement beam control for sensing applications. However, the limited number of radio-frequency chains and training beams in practical hardware makes it difficult to approach the angular resolution of fully digital arrays. This paper develops a covariance-guided discrete Fourier transform (DFT) beam selection framework tailored to beamspace ESPRIT for hybrid millimeter-wave receivers. A short hybrid training phase realizes a virtual centro-symmetric subarray and yields a sample covariance that is processed by forward--backward averaging, nonnegative least-squares power and noise fitting, and a Toeplitz positive-semidefinite projection to reconstruct a denoised full-aperture covariance matrix. This covariance is then used to score and select, within each coarse sector, small contiguous blocks of DFT beams that concentrate signal energy and preserve effective aperture under a strict beam budget. The selected beams feed a sparse beamspace ESPRIT stage that operates only on actually available adjacent beam pairs, so that the overall complexity is dominated by a single low-dimensional ESPRIT call. Monte Carlo simulations for a thirty-two-element uniform linear array with three paths indicate that, in the considered scenarios, the proposed method can reduce the gap to the Cramér--Rao bound, lower the failure rate, and provide favorable accuracy--runtime trade-offs compared with a sectorization-based baseline built from the same codebook and estimator. For the unitary DFT codebook studied here, the fine-stage beam selector reduces to a covariance-guided contiguous energy-window rule; the broader score formulation also accommodates non-unitary effective beam dictionaries arising from hardware non-idealities.

URL PDF HTML ☆

赞 0 踩 0

2406.07909 2026-06-11 eess.AS cs.CL cs.SD stat.ML

Guiding Frame-Level CTC Alignments Using Self-knowledge Distillation

Eungbeom Kim, Hantae Kim, Kyogu Lee

2305.13108 2026-06-11 eess.AS cs.CL cs.LG cs.SD

Debiased Automatic Speech Recognition for Dysarthric Speech via Sample Reweighting with Sample Affinity Test

Eungbeom Kim, Yunkee Chae, Jaeheon Sim, Kyogu Lee

2107.00693 2026-06-11 eess.SP cs.LG

Inter-Beat Interval Estimation with Tiramisu Model: A Novel Approach with Reduced Error

Asiful Arefeen, Ali Akbari, Seyed Iman Mirzadeh, Roozbeh Jafari, Behrooz A. Shirazi, Hassan Ghasemzadeh