arXivDaily arXiv每日学术速递 周一至周五更新
重置
EESS电气与系统 105
2605.22746 2026-05-22 cs.LG eess.AS stat.ML

Plug-in Losses for Evidential Deep Learning: A Simplified Framework for Uncertainty Estimation that Includes the Softmax Classifier

插件损失用于证据深度学习:一个简化框架用于不确定性估计,其中包括softmax分类器

Berk Hayta, Hannah Laus, Simon Mittermaier, Felix Krahmer

AI总结 本文提出了一种简化框架,用于通过插件损失近似证据深度学习中的不确定性估计,证明了在特定证据到狄利克雷分布映射下,该框架包含标准的softmax分类器,并在Google语音命令数据集上验证了其有效性。

详情
AI中文摘要

现实中的基于传感器的学习系统需要可靠且计算高效的不确定性估计。证据深度学习(EDL)通过狄利克雷分布建模类概率,从而实现单次通过的不确定性估计,其中狄利克雷参数由一个学习的神经网络映射预测。然而,这种方法可能导致计算挑战,因为狄利克雷期望目标比标准监督学习损失更复杂,增加了分析和实现的难度。我们通过近似由EDL诱导的一阶经验风险最小化问题的目标,使用在狄利克雷均值上评估的插件损失,证明在温和假设下,对于广泛的一类损失函数,包括均方误差和交叉熵损失,近似误差随着证据的增长而减小。作为特殊情况,我们的分析为在不确定性估计中使用softmax提供了正当性,因为在特定的证据到狄利克雷分布映射下,我们的框架包含标准的softmax分类器。我们在Google语音命令数据集上验证了所提出的简化目标,并展示了其在预测准确性和选择性预测性能上与经典EDL相当,同时使用标准深度学习损失和训练流程实现起来更简单。到目前为止,本文的实证分析是首次通过EDL获得语音识别任务中的覆盖-准确性权衡。

英文摘要

Real-world sensor-based learning systems require uncertainty estimation that is both reliable and computationally efficient. Evidential Deep Learning (EDL) provides single-pass uncertainty estimation by modeling the class probabilities via Dirichlet distributions, where the Dirichlet parameters are predicted by a learned neural network mapping. However, this approach can lead to computational challenges, as Dirichlet expected objectives are more complex than standard supervised learning losses, complicating their analysis and implementation. We address this issue by approximating the objective of the first-order empirical risk minimization problem induced by EDL with a plug-in loss evaluated at the Dirichlet mean and show that, under mild assumptions, the approximation error decays with growing evidence for a broad class of loss functions, including mean-squared error and cross-entropy loss. As a special case, our analysis provides justification for the use of softmax in the context of uncertainty estimation, since under a particular evidence-to-Dirichlet mapping, our framework includes the standard softmax classifier. We validate the proposed simplified objectives on the Google Speech Commands dataset and show that they achieve predictive accuracy and selective prediction performance comparable to classical EDL, while being simpler to implement using standard deep learning losses and training pipelines. To the best of our knowledge, this empirical analysis is the first to obtain coverage-accuracy trade-offs for speech recognition tasks through EDL.

2605.22732 2026-05-22 cs.AI cs.CL cs.HC cs.SD eess.AS

Beyond Acoustic Emotion Recognition: Multimodal Pathos Analysis in Political Speech Using LLM-Based and Acoustic Emotion Models

超越语音情感识别:利用基于LLM和语音情感模型的政治演讲多模态Pathos分析

Juergen Dietrich

AI总结 本文研究了语音情感识别模型是否能作为政治演讲分析中Pathos维度的代理,通过TRUST多智能体大语言模型(LLM)管道进行操作。使用德国议会全体会议中Felix Banaszak的演讲作为案例研究,比较了三种分析模式:(1) emotion2vec_plus_large,一个通过后验Russell Circumplex投影得到连续唤醒度和估值的语音情感识别(SER)模型;(2) Gemini 2.5 Flash,一个分析完整演讲音频及其转录文本的LLM,以开放和上下文感知的方式进行;(3) TRUST-Pathos分数,来自三个倡导者LLM监督集合。斯皮尔曼等级相关性显示,Gemini估值与TRUST-Pathos高度相关(rho = +0.664,p < 0.001),而emotion2vec估值不相关(rho = +0.097,p = 0.499)。我们进一步通过系统评估柏林情感语音数据库(EMO-DB)使用Gemini在开放注释范式下,证明标准SER基准语料库存在表演性演讲、文化偏见和类别不兼容性。我们的结果表明,基于LLM的多模态分析在捕捉语义定义的政治情感方面比单独的语音模型更有效,而语音特征仍对低层次唤醒度估计有帮助。未来的工作将扩展这种方法到视频分析中,结合面部表情和眼神。

详情
Comments
13 pages, 1 figure
AI中文摘要

我们研究语音情感识别模型是否能作为政治演讲分析中Pathos维度的代理,如由TRUST多智能体大语言模型(LLM)管道定义的那样。使用Felix Banaszak在德国议会全体会议中的演讲(51个片段,245秒)作为案例研究,我们比较了三种分析模式:(1) emotion2vec_plus_large,一个通过后验Russell Circumplex投影得到连续唤醒度和估值的语音情感识别(SER)模型;(2) Gemini 2.5 Flash,一个分析完整演讲音频及其转录文本的LLM,以开放和上下文感知的方式进行;(3) TRUST-Pathos分数,来自三个倡导者LLM监督集合。斯皮尔曼等级相关性显示,Gemini估值与TRUST-Pathos高度相关(rho = +0.664,p < 0.001),而emotion2vec估值不相关(rho = +0.097,p = 0.499)。我们进一步通过系统评估柏林情感语音数据库(EMO-DB)使用Gemini在开放注释范式下,证明标准SER基准语料库存在表演性演讲、文化偏见和类别不兼容性。我们的结果表明,基于LLM的多模态分析在捕捉语义定义的政治情感方面比单独的语音模型更有效,而语音特征仍对低层次唤醒度估计有帮助。未来的工作将扩展这种方法到视频分析中,结合面部表情和眼神。

英文摘要

We investigate whether acoustic emotion recognition models can serve as proxies for the Pathos dimension in political speech analysis, as operationalised by the TRUST multi-agent large language model (LLM) pipeline. Using a Bundestag plenary speech by Felix Banaszak (51 segments, 245 s) as a case study, we compare three analysis modalities: (1) emotion2vec_plus_large, an acoustic speech emotion recognition (SER) model whose continuous Arousal and Valence values are derived via post-hoc Russell Circumplex projection; (2) Gemini 2.5 Flash, an LLM analysing the full speech audio together with its transcript in an open-ended, context-aware fashion; and (3) TRUST-Pathos scores from a three-advocate LLM supervisor ensemble. Spearman rank correlations reveal that Gemini Valence correlates strongly with TRUST-Pathos (rho = +0.664, p < 0.001), whereas emotion2vec Valence does not (rho = +0.097, p = 0.499). We further demonstrate, via a systematic quality evaluation of the Berlin Database of Emotional Speech (EMO-DB) using Gemini in an open-ended annotation paradigm, that standard SER benchmark corpora suffer from acted speech, cultural bias, and category incompatibility. Our results suggest that LLM-based multimodal analysis captures semantically defined political emotion substantially better than acoustic models alone, while acoustic features remain informative for low-level Arousal estimation. Future work will extend this approach to video-based analysis incorporating facial expression and gaze.

2605.22726 2026-05-22 eess.SY cs.SY

Dynamic Lane Allocation in UAM Corridors for Efficient Multimodal Door-to-Door Mobility

动态车道分配用于高效多模式门到门移动

Jung Ho Park, Jordan Kam, Vishwanath Bulusu, Alexandre Bayen, Raja Sengupta

AI总结 本文提出了一种动态方向车道分配方法,用于城市空中交通(UAM)走廊,通过离散时间混合整数线性规划(MILP)模型来动态激活、停用和反转车道方向,以应对双向空域需求的变化。研究通过分解每个行程为多模式序列,并通过垂直机场侧调度模型路由UAM服务的中段,利用旧金山湾区作为案例研究,发现动态策略可减少未使用空域容量5倍,提高车道利用率至67%,并减少通勤人口的平均出行时间高达21.6%。

详情
Comments
Submitted to AIAA Aviation Forum
AI中文摘要

本文提出了一种动态方向车道分配方法,用于城市空中交通(UAM)走廊,通过离散时间混合整数线性规划(MILP)模型来动态激活、停用和反转车道方向,以应对双向空域需求的变化。我们通过将离散地面旅行数据分解为多模式序列,包括首段、中段和末段,来建模需求,并通过垂直机场侧调度模型路由UAM服务的中段。我们以旧金山湾区为案例研究,通过在康特拉科斯塔县和硅谷之间设置一个多区域覆盖走廊。我们发现,动态策略可减少未使用空域容量5倍,将平均车道利用率从36-48%提高到67%,在相同服务水平下,同时将通勤人口的平均出行时间减少高达21.6%。这些结果表明,动态配置空域容量显著缓解了基于车道的UAM空域设计和UAM概念运作中的利用率问题。此外,这种动态分配还提供了一种安全、结构化的方法来提高吞吐量,使UAM成为多模式门到门移动系统更具可行性的补充。

英文摘要

This article presents dynamic directional lane allocation in urban air mobility (UAM) corridors as a discrete-time mixed-integer linear program (MILP). This formulation activates, deactivates, and reverses lane direction as bi-directional airspace demand evolves. We model demand from disaggregate ground travel data by decomposing each trip into a multi-modal sequence with first-, middle-, and last-mile legs and routing the UAM-served middle-mile segment through a vertiport-side dispatch model. We use the San Francisco Bay Area as a case study by placing a multi-region spanning corridor between Contra Costa county and Silicon Valley. We find that the dynamic policy cuts unused airspace capacity by 5x, increases mean lane utilization from 36-48% to 67% at the same service level relative to baselines, and reduces commuting-population mean travel time by up to 21.6%. These results show that dynamic configuration of airspace capacity alleviates a significant percentage of the under-utilization issue of lane-based UAM airspace design and UAM concept of operations. This dynamic allocation also provides a safe, structural way to increase throughput, making UAM a more viable complement to multimodal door-to-door mobility systems.

2605.22722 2026-05-22 cs.RO cs.SY eess.SY

N3P: Accelerated Automated Parking via a Learning-Based Naturalistic Three-Stage Scheme

N3P:通过基于学习的自然三阶段方案实现加速的自动泊车

Yifan Xue, Toktam Mohammadnejad, Faizan M Tariq, Sangjae Bae, David Isele, Yosuke Sakamoto, Nadia Figueroa, Jovin D'sa

AI总结 本文提出N3P,一种基于学习的三阶段框架,用于自动泊车,通过引入中间预备姿态和学习模块预测该姿态,将泊车操作分解为更简单的子问题,从而降低计算复杂度并加速路径生成,实验表明其在垂直和平行泊车场景中显著提升了规划速度,并在成功率和轨迹质量上优于强化学习基线。

详情
Comments
Accepted at IEEE Intelligent Transportation Systems Conference (ITSC 2026)
AI中文摘要

自动驾驶泊车需要高效的路径规划,以确保运动学可行性并在受限环境中实现碰撞避免。混合A*被广泛使用,但计算成本高,而强化学习(RL)方法缺乏可靠性,往往在长时间几何约束下表现不佳,导致轨迹次优。我们提出了N3P,一种快速基于学习的三阶段框架用于自动泊车。通过引入中间预备姿态并使用学习模块预测该姿态,N3P将操作分解为更简单的子问题,从而降低计算复杂度并加速路径生成。我们通过将其与混合A*算法结合来验证该框架。在垂直和平行泊车场景中的实验表明,N3P增强的混合A*将规划速度提高了超过80%。它在成功率和轨迹质量上优于RL基线,产生更短的轨迹和更少的换挡,同时在大多数情况下实现可比或更低的规划时间。

英文摘要

Autonomous parking requires efficient path planning that ensures kinematic feasibility and collision avoidance in constrained environments. Hybrid A* is widely used but computationally expensive, while reinforcement learning (RL) methods lack reliability and often struggle with long-horizon geometric constraints, leading to suboptimal trajectories. We present N3P, a fast learning-based three-stage framework for automated parking. By introducing an intermediate preparatory pose and using a learning module to predict it, N3P decomposes the maneuver into simpler subproblems, thereby reducing computational complexity and accelerating path generation. We validate the framework by integrating it with Hybrid A* algorithms. Experiments in perpendicular and parallel parking scenarios show that N3P-enhanced Hybrid A* speeds up planning by more than 80%. It also outperforms RL baselines in success rate and trajectory quality, producing shorter trajectories with fewer gear changes, while achieving comparable or lower planning time in most cases.

2605.22709 2026-05-22 cs.CR cs.ET cs.RO cs.SY eess.SY

TriSweep: A Four-Drone Swarm Framework for Electromagnetic Side-Channel Analysis

TriSweep: 一种四无人机群框架用于电磁侧信道分析

Eric Yocam, Varghese Vaidyan

AI总结 本文提出TriSweep框架,通过四无人机群实现自主远距离电磁侧信道分析,针对嵌入式微控制器在0.25-1.5米范围内进行攻击,通过空间专业化收集无人机和固定积累无人机的协同工作,实现信号增强和掩码消除,验证了无人机群在对抗环境中的有效性。

详情
Comments
Simulation framework + systems design for a four-drone swarm performing standoff electromagnetic side-channel analysis. No hardware fabricated yet
AI中文摘要

电磁(EM)侧信道分析传统上假设存在一个静止且近距离的探测器,这种威胁模型低估了空中对手的威胁。TriSweep是一种模拟框架,设计并评估了一种四无人机群架构,用于自主远距离电磁侧信道分析(EM-SCA)嵌入式微控制器,距离为0.25-1.5米。三个空间专业化收集无人机——锚点(全频谱)、掩码探测器(掩码寄存器加载泄漏)和密码探测器(掩码SubBytes输出泄漏)——将信号馈入一个固定积累无人机,该无人机通过两个空间分离泄漏流的中心乘积进行相干结合(+4.8 dB信噪比增益)和二次掩码消除。在三个真实的ANSSI ASCAD数据集(ATmega8515掩码AES-128和50/100样本非同步变体)上评估该框架,其在0.25米范围内主要掩码数据集上实现了模拟密钥排名为18±1.7(五种子)。通过轮廓跟踪轨迹交叉相关对齐,单无人机排名从89降低到21,在100样本抖动变体上展示了对无人机悬停振动的补偿。积累无人机中的两个通道CNN收敛到损失为0.454(与随机基线5.545相比)并在非同步数据集上提高了排名。尚未制造物理硬件;原型构建是下一步计划。

英文摘要

Electromagnetic (EM) side-channel analysis traditionally assumes a stationary, close-proximity probe - a threat model that underestimates aerial adversaries. TriSweep is a simulation framework that designs and evaluates a four-drone swarm architecture for autonomous standoff EM-SCA of embedded microcontrollers at 0.25-1.5 m. Three spatially specialized collector drones - Anchor (full-spectrum), Mask Probe (mask-register loading leakage), and Cipher Probe (masked SubBytes output leakage) - feed a stationary Accumulator drone that performs coherent combining (+4.8 dB SNR gain) and second-order mask cancellation via a centered product of the two spatially separated leakage streams. Evaluated against three real ANSSI ASCAD datasets (ATmega8515 masked AES-128 and 50/100-sample desynchronized variants), the framework achieves a simulated key rank of 18 +/- 1.7 (five-seed) at 0.25 m on the primary masked dataset. Profiling-trace cross-correlation alignment reduces single-drone rank from 89 to 21 on the 100-sample-jitter variant, demonstrating compensation for drone hover vibration. A two-channel CNN in the Accumulator converges to a loss of 0.454 (vs. random baseline 5.545) and improves rank on desynchronized datasets. No physical hardware has been fabricated; prototype construction is the planned next step.

2605.22659 2026-05-22 eess.SP

A Metalens-based Bicycle Safety Reflector for Autonomous Vehicle Radars

基于金属 lens 的自行车安全反光器用于自动驾驶车辆雷达

Sepideh Ghasemi, Jimmy Hester, Aline Eid

AI总结 本文提出了一种基于金属 lens 的新型自行车安全反光器,旨在提高骑行者在恶劣天气条件下的可见性。该反光器利用平面金属 lens 和微带天线像素层,实现了回向发射操作,从而在毫米波汽车频段内实现了可靠检测和雷达散射截面的显著提升。

详情
AI中文摘要

随着自动驾驶或传感器辅助车辆之间相互作用的增加,尤其是在恶劣天气条件下,对新型自行车安全反光器的需求和机会也随之增加。为此,提出了一种首个基于平面金属 lens 的标签,其在毫米波汽车频段内运行。该紧凑、轻量(0.61克)的设计由两层组成:金属 lens 层和微带天线像素层。金属 lens 将来自不同入射角度的平面波聚焦到第二层的相应微带天线像素上,这些像素重新辐射信号通过金属 lens,从而实现回向发射操作。所提出的标签经过彻底评估,证明其在70米以外的可靠检测能力,以及峰值单站雷达散射截面(RCS)为3.54 dBsm,稳定回向发射性在±40度范围内,相对于无透镜参考,提供了7.58 dB的平均增益改进和15.16 dB的RCS增强。在金属自行车上的现实部署场景中,其在正前方方向的可检测性提高了110倍。这些结果突显了所提出被动标签作为低成本、轻重量、易于集成的自行车安全反光器在下一代自动驾驶车辆雷达系统中的潜力。

英文摘要

With the rising number of interactions between autonomous or sensor-assisted vehicles -- especially in poor weather conditions -- come the need and opportunity for a new class of bicycle safety reflectors designed to enhance cyclist visibility to radars. To this effect, the first retrodirective planar metalens-based tag operating in the millimeter-wave automotive frequency range is proposed. The compact, lightweight ($0.61~\mathrm{g}$) design consists of two layers: a metalens layer and a patch antenna pixel layer. The metalens focuses incoming plane waves from different incidence angles onto corresponding patch antenna pixels on the second layer, which re-radiate the signal back through the metalens, enabling retrodirective operation. The proposed tag was thoroughly evaluated, demonstrating reliable detection beyond 70 m and a peak monostatic radar cross section (RCS) of $3.54~\mathrm{dBsm}$ with stable retrodirectivity over $\pm 40^\circ$, providing an average gain improvement of $7.58~\mathrm{dB}$ and an RCS enhancement of $15.16~\mathrm{dB}$ relative to a lens-less reference. A realistic deployment scenario on a metallic bicycle demonstrated up to a 110x improvement in its detectability at broadside. These results highlight the potential of the proposed passive tag to operate as a low-cost, lightweight, and easily integrable bicycle safety reflector for next-generation autonomous vehicle radar systems.

2605.22658 2026-05-22 cs.CV cs.LG cs.MM eess.IV

SegCompass: Exploring Interpretable Alignment with Sparse Autoencoders for Enhanced Reasoning Segmentation

SegCompass: 探索通过稀疏自编码器实现可解释对齐以增强推理分割

Zhenyu Lu, Liupeng Li, Jinpeng Wang, Haoqian Kang, Yan Feng, Ke Chen, Yaowei Wang

AI总结 本文提出SegCompass,一种通过稀疏自编码器实现可解释对齐的端到端模型,以提升推理分割的性能和可解释性。

详情
Comments
Accepted by CVPR 2026. 15 pages, 9 figures, 6 tables
AI中文摘要

尽管大语言模型提供了强大的组合推理能力,但现有推理分割流程未能清晰地将这种推理与视觉感知连接起来。当前方法,如潜在查询对齐,虽然端到端但却是不透明的“黑箱”。相反,文本定位读出仅可读但不真正可解释,通常作为无约束的后处理步骤。为弥合这一可解释性差距,我们提出了SegCompass,一种端到端模型,利用稀疏自编码器(SAE)建立一个显式、可解释且可微的对齐路径。给定一个图像-指令对,SegCompass首先生成一个思维链(CoT)轨迹。该方法的核心是一个将CoT和视觉标记映射到共享高维稀疏概念空间的SAE。一个查询代码本从该空间中选择显著概念,然后通过槽映射器在空间上定位到多槽热图,引导最终的掩码解码器。整个模型联合训练,将强化学习用于推理路径与标准分割监督相结合。这种由SAE驱动的接口提供了显著比潜在查询更可追溯的“白盒”连接,比文本读出更连贯。在五个具有挑战性的基准测试中,SegCompass匹配或超越了最先进的性能。关键的是,我们的视觉和定量分析显示,所学稀疏概念的质量与最终掩码准确性之间存在强相关性,证实了SegCompass通过其增强且可检查的对齐实现了优越的结果。代码可在https://github.com/ZhenyuLU-Heliodore/SegCompass获取。

英文摘要

While large language models provide strong compositional reasoning, existing reasoning segmentation pipelines fail to transparently connect this reasoning to visual perception. Current methods, such as latent query alignment, are end-to-end yet opaque "black boxes". Conversely, textual localization readout is merely readable, not truly interpretable, often functioning as an unconstrained post-hoc step. To bridge this interpretability gap, we propose SegCompass, an end-to-end model that leverages a Sparse Autoencoder (SAE) to forge an explicit, interpretable, and differentiable alignment pathway. Given an image-instruction pair, SegCompass first generates a chain-of-thought (CoT) trace. The core of our method is an SAE that maps both the CoT and visual tokens into a shared, high-dimensional sparse concept space. A query codebook selects salient concepts from this space, which are then spatially grounded by a slot mapper into a multi-slot heatmap that guides the final mask decoder. The entire model is trained jointly, unifying reinforcement learning for the reasoning path with standard segmentation supervision. This SAE-driven interface provides a "white-box" connection that is significantly more traceable than latent queries and more coherent than textual readouts. Extensive experiments on five challenging benchmarks demonstrate that SegCompass matches or surpasses state-of-the-art performance. Crucially, our visual and quantitative analyses show a strong correlation between the quality of the learned sparse concepts and final mask accuracy, confirming that SegCompass achieves superior results through its enhanced and inspectable alignment. Code is available at https://github.com/ZhenyuLU-Heliodore/SegCompass.

2605.22562 2026-05-22 math.OC cs.SY eess.SY

Output regulation via input-output data

通过输入输出数据实现输出调节

Andrea Bisoffi, Wenjie Liu, Zhongjie Hu, Claudio De Persis

AI总结 本文研究了如何利用输入输出数据设计反馈控制器,以消除系统输出对未知外信号的影响,核心方法是通过半正定规划设计控制器,并通过辅助系统与原系统的分析关系实现迁移。

详情
AI中文摘要

从一个多输入多输出(MIMO)离散时间线性系统中,我们收集受噪声影响的输入输出数据,形式为未知的外信号。从这些数据点(不了解系统模型)中,我们设计一个反馈控制器,使其渐近地消除该外信号对输出的影响。这相当于仅从输入输出数据中解决MIMO线性系统的输出调节问题。控制器的设计对应于一个半正定规划,并在合适的辅助系统上进行。此类设计通过严格分析两个系统解之间的关系,从辅助系统转移到原系统。

英文摘要

From a multi-input-multi-output (MIMO) discrete-time linear system, we collect input-output data affected by noise in the form of an unknown exosignal and, from these data points (without knowledge of the system model), we design a feedback controller that asymptotically annihilates the effect of that exosignal on the output. This amounts to solving an output regulation problem purely from input-output data, for MIMO linear systems. The design of the controller corresponds to a semidefinite program and is pursued on a suitable auxiliary system. Such design carries over from the auxiliary system to the original one by a rigorous examination of the relation between the solutions of the two systems.

2605.22490 2026-05-22 eess.SP

UAV-based Energy-Efficient Data Collection in Smart Grids with ISAC QoS Guarantees

基于无人机的智能电网节能数据采集与ISAC服务质量保障

Yibin Xie, Jin Zhao, Indrakshi Dey, Nicola Marchetti

AI总结 本文提出一种集成传感与通信的无人机数据采集框架,通过联合能量最小化问题优化无人机轨迹和采集调度,在满足DLR分钟级时间尺度要求的同时,将能耗降低34.6%。

详情
Comments
6 pages, 4 figures, submitted to IEEE GLOBECOM 2026
AI中文摘要

动态线路评级(DLR)是一种需要及时监测数据以确定电力线路实时载流量的方法。然而,DLR监测设备(MD)易受连接中断影响,导致数据缺失或延迟。尽管无人机(UAV)可以实现从MD的稳健数据采集,但其有限的机载能量难以在长距离传输走廊上实现及时监测,且存在飞行危险。本文提出一种集成传感与通信(ISAC)的无人机数据采集框架,以支持及时的DLR更新。在此框架中,ISAC用于维持安全和协作的无人机数据采集所需的传感和通信质量。因此,提出了一个在ISAC约束下优化无人机轨迹和采集调度的联合能量最小化问题。为了解决这个问题,提出了一种结合深度强化学习(DRL)和半定松弛(SDR)的混合算法,其中DRL优化轨迹和采集调度,而SDR用于处理非凸的ISAC约束。仿真结果表明,所提出的方案在满足DLR分钟级时间尺度要求的同时,相比离线基准降低了34.6%的能耗,并比独立的传感-通信基线降低了约2.2%的能耗。

英文摘要

Dynamic line rating (DLR) is a methodology that requires timely monitoring data to determine the real-time ampacity of power lines. However, DLR monitoring devices (MD) are vulnerable to connectivity disruptions, leading to missing or delayed data. Although unmanned aerial vehicles (UAV) can enable resilient data collection from MD, their limited onboard energy challenges timely monitoring over extended transmission corridors with flight hazards. This paper proposes a cooperative UAV-based data collection framework with integrated sensing and communication (ISAC) to support timely DLR updates. In this framework, ISAC is employed to maintain the sensing and communication quality required for safe and cooperative UAV data collection. Accordingly, a joint energy minimization problem is formulated over UAV trajectories and collection scheduling under ISAC constraints. To solve it, a hybrid algorithm combining deep reinforcement learning (DRL) and semidefinite relaxation (SDR) is proposed, where DRL optimizes the trajectory and collection scheduling, while SDR is used to handle the non-convex ISAC constraints. Simulation results show that the proposed scheme reduces energy consumption by up to 34.6% compared with offline benchmarks and by about 2.2% compared with the separated sensing-and-communication baseline, while satisfying the minute-level timescale requirement of DLR.

2605.22457 2026-05-22 cs.AI cs.SY eess.SY

KAPPS: A knowledge-based CPPS Architecture for the Circular Factory

KAPPS:一种基于知识的闭环工厂CPPS架构

Etienne Hoffmann, Jan-Felix Klein, Sören Weindel, Max Goebels, Sebastian Behrendt, Daniel Hernández, Ratan Bahadur Thapa, Jürgen Fleischer, Kai Furmans, Steffen Staab

AI总结 本文提出KAPPS,一种基于知识的闭环工厂CPPS架构,旨在解决闭环制造中产品状态变化、动态重构过程和人机知识整合的需求,通过知识图谱和语义接口层实现数据集成与推理,提升制造系统的灵活性和适应性。

详情
Comments
Submitted to Journal of Manufacturing Systems (JMS)
AI中文摘要

尽管线性制造依赖于同质材料和预定义的过程序列,但闭环制造重新引入了具有异质和不确定条件的使用产品。这种转变要求制造系统能够处理可变的产品状态、动态可重构的过程以及人机知识的整合。传统制造IT架构,设计用于稳定结构和确定性执行,无法满足这些需求,因为它们无法充分表示和管理运行时单个组件的唯一性。遵循设计科学方法,为闭环制造设计CPPS,我们从五个互补的视角中推导出14个需求。基于这些需求,我们设计了KAPPS,一种基于知识的架构,利用以本体为基础的知识图谱作为统一的数据骨干,结合语义接口层,实现跨异构系统和服务的一致数据和信息集成、推理和通信,使知识图谱从集成层转变为工厂的权威写时状态。KAPPS集成了约束执行和事件驱动规划模块,使在不确定性和人机知识交换下执行计划能够逐步适应。通过两个实施用例验证了KAPPS的适用性:(i) 通过知识图谱中介服务进行异常检测和学习;(ii) 在模块化输送系统中运行时约束执行。随后,该架构被评估以满足14个需求(摘要已缩短)

英文摘要

While linear manufacturing relies on homogeneous materials and predefined process sequences, circular manufacturing reintroduces used products with heterogeneous and uncertain conditions. This shift demands manufacturing systems capable of handling variable product states, dynamically reconfigurable processes, and the integration of human and machine knowledge. Conventional manufacturing IT architectures, designed for stable structures and deterministic execution, are unable to meet these requirements, as they cannot adequately represent and manage the uniqueness of individual components at runtime. Following a design science methodology for developing a Cyber Physical Production System for circular manufacturing, we derive 14 requirements from five complementary perspectives. Based on these requirements, we design KAPPS, a knowledge-based architecture that uses an ontology-grounded knowledge graph as a unifying data backbone, combined with a semantic interface layer to enable consistent data and information integration, reasoning, and communication across heterogeneous systems and services, turning the knowledge graph from an integration layer into the factories authoritative write-time state. KAPPS incorporates modules for constraint enforcement and event-driven planning, enabling incremental adaptation of execution plans under uncertainty and human-machine knowledge exchange. The applicability of KAPPS is demonstrated through two implemented use cases: (i) Anomaly detection and learning through knowledge graph mediated services and (ii) runtime constraint enforcement in a modular conveyor system. Subsequently, the architecture is evaluated against the 14 requirements (ed. abstract shortened)

2605.22433 2026-05-22 quant-ph cs.PL cs.SY eess.SY

QuCtrl-BELL: A Compiler-Driven Sub-Microsecond Feedback Control Stack for Scalable Trapped-Ion Quantum Experiments

QuCtrl-BELL:一种用于可扩展囚离子量子实验的编译器驱动子微秒反馈控制堆栈

Junpeng She, Ruoyu Yan, Zhizhen Qin, Zhanyu Li, Zhongtao Shen, Zichao Zhou, Binxiang Qi, Luming Duan

AI总结 本文提出QuCtrl-BELL,一种编译器驱动的子微秒反馈控制堆栈,用于可扩展的囚离子量子实验,通过分离控制流与硬件状态数据,解决经典控制系统的根本权衡问题,展示编译器基础设施在可扩展囚离子量子控制中的可编程性、确定性定时和模块化。

详情
Comments
7 pages, 6 figures
AI中文摘要

随着囚离子量子计算扩展到更大的量子比特寄存器和更复杂的控制协议,经典控制系统面临一个根本性的权衡:子微秒级板级反馈需要紧密的硬件耦合,而可维护性和可扩展性则需要干净、模块化的软件抽象。本文提出了QuCtrl-BELL(Bell),一种用于囚离子量子控制的编译器驱动软件堆栈。该设计通过将控制流(包括循环、分支和同步)与硬件状态数据分离,解决了这一权衡问题。通过一个六阶段的转译管道,包括控制流图(CFG)构造、静态单一赋值(SSA)转换、活跃性分析和图着色寄存器分配,将Python嵌入的领域特定语言(DSL)降低。编译器生成确定性的分布式板级程序和紧凑的步骤表数据。跨板同步协议支持在无主机干预的情况下,反馈循环的延迟低于700~ns。Bell在QuCtrl-BELL平台上部署和评估,证明了基于编译器的基础设施可以为可扩展的囚离子量子控制提供可编程性、确定性定时和模块化。

英文摘要

As trapped-ion quantum computing scales to larger qubit registers and more complex control protocols, classical control systems face a fundamental tradeoff: sub-microsecond board-level feedback requires tight hardware coupling, whereas maintainability and extensibility require clean, modular software abstractions. This paper presents QuCtrl-BELL (Bell), a compiler-driven software stack for trapped-ion quantum control. The design resolves this tradeoff by decoupling control flow -- including loops, branches, and synchronization -- from hardware state data. A Python-embedded domain-specific language (DSL) is lowered through a six-stage transpilation pipeline covering control flow graph (CFG) construction, static single-assignment (SSA) conversion, liveness analysis, and graph-coloring register allocation. The compiler generates deterministic distributed board-level programs and compact step-table data. A cross-board synchronization protocol supports feedback loops with latency below 700~ns without host intervention. Bell is deployed and evaluated on the QuCtrl-BELL platform (RISC-V + PXIe), demonstrating that a compiler-based infrastructure can provide programmability, deterministic timing, and modularity for scalable trapped-ion quantum control.

2605.22425 2026-05-22 eess.IV cs.CV

Time-varying rPPG signal separation via block-sparse signal model

基于块稀疏信号模型的时变rPPG信号分离

Kosuke Kurihara, Yoshihiro Maeda, Daisuke Sugimura, Takayuki Hamamoto

AI总结 本文提出了一种利用rPPG信号近似周期特性进行信号提取的方法,通过构建时变信号分离框架,在光照变化下实现适应性信号分离,实验验证了方法的有效性。

详情
Comments
Accepted by IEEE International Conference on Image Processing (ICIP 2026)
AI中文摘要

远程光脉冲波形图(rPPG)通过分析面部视频中细微的颜色变化来实现非接触式心脏脉搏信号测量。然而,由于rPPG信号极弱且易受光照噪声影响,提取rPPG信号仍然具有挑战性。本文提出了一种rPPG信号提取方法,利用rPPG信号的近似周期特性,将其近似周期性建模为时频域中的块稀疏结构。为了整合块稀疏模型并实现光照波动下的自适应信号分离,我们构建了时变信号分离框架。使用公共数据集的实验验证了该方法的有效性。

英文摘要

Remote photoplethysmography (rPPG) enables non-contact measurement of cardiac pulse signals by analyzing subtle color changes in facial videos. Nevertheless, extracting rPPG signals remains challenging because of their extremely weak signal strength and susceptibility to illumination noise. In this paper, we propose an rPPG signal extraction method that exploits the quasi-periodic characteristics of rPPG signals. Our approach models quasi-periodicity of the rPPG signal, which arises from the stable cardiac cycle, as a block-sparse structure in the time-frequency domain. To incorporate a block-sparse model and enable adaptive signal separation under illumination fluctuations, we construct a time-varying signal separation framework. Experiments using a public dataset demonstrate the effectiveness of our method.

2605.22384 2026-05-22 eess.SP

Experimental Comparison of Local and Over-the-Air Phase Calibration for MIMO Arrays

MIMO阵列中局部和空中相位校准的实验比较

Carl Collmann, Ahmad Nimr, Gerhard Fettweis

AI总结 本文通过实验比较了MIMO阵列中局部和空中相位校准方法,研究了硬件失真对通信性能和信道估计精度的影响,发现局部校准在相位稳定性方面更优,而空中校准在不需要额外硬件的情况下对多径效应更敏感。

详情
Comments
6 pages, 6 figures, 2 tables
AI中文摘要

通信性能和MIMO系统中的信道估计精度已知受到硬件失真的限制。具体来说,相位失真,如相位噪声,使实时相干传输成为一项具有挑战性的任务。虽然相位失真补偿通常在接收端进行,但实现发射端相干传输的实用方法仍不明确。已建立的MIMO系统空中校准方法面临诸多限制,例如对相位稳定性假设和准确信道知识的假设。在本工作中,通过在完全数字USRP X310软件定义收发器上对实时局部相位校准方法与空中校准进行实验比较。使用RMS周期到周期抖动作为指标,表明对于低和高同步信号带宽,两种方法都能有效消除相位漂移并白化相位噪声。局部校准实现了更高的相位稳定性,并且与信道无关,而空中校准不需要额外硬件,但对多径效应和信道引起的失真更敏感。基于测量结果讨论了实际部署的权衡。

英文摘要

Communication performance and channel estimation accuracy in MIMO systems are known to be limited by hardware impairments. Specifically, the presence of phase impairments, such as phase noise, makes real-time coherent transmission a challenging task. While phase impairment compensation is typically performed at the receiver, practical methods for enabling coherent transmission at the transmitter side remain underexplored. Established methods for OTA calibration of MIMO systems face several limitations such as assumptions of phase stationarity and accurate channel knowledge. In this work, a real-time local phase calibration method is experimentally compared with OTA calibration on a fully digital array of USRP X310 software-defined radios. Using RMS cycle-to-cycle jitter as a metric, it is shown that for low and high synchronization signal bandwidths, both approaches effectively eliminate phase drift and whiten the phase noise. Local calibration achieves higher phase stability and is channel-independent, whereas OTA calibration requires no additional hardware but is sensitive to multipath effects and channel-induced impairments. Practical deployment trade-offs are discussed based on the measurement results.

2605.22361 2026-05-22 eess.SP

Propagation-Consistent Wireless Environment Digital Twin Construction Under Sparse Measurements

基于稀疏测量的无线环境数字孪生构建:传播一致性

Junjie Ai, Shurui Xu, Yanqing Ren, Zhuoyu Liu, Weicong Chen, Wankai Tang, Xiao Li, Chao-Kai Wen, Shi Jin

AI总结 本文提出了一种无线环境数字孪生(WEDT)构建方法,通过校准场景级电磁特性场,将重建的几何数字孪生转化为传播一致的无线环境表示,实现了从稀疏测量到密集概率监督的转换,并展示了其在不同场景下的有效性。

详情
Comments
This work has been submitted to the IEEE for possible publication
AI中文摘要

数字孪生(DTs)在无线部署、优化和数据生成中具有潜力,但从稀疏真实测量中构建传播忠实的孪生仍然具有挑战性。本文提出了一种无线环境数字孪生(WEDT)构建范式,通过校准场景级电磁(EM)属性场,将重建的几何DT转化为传播一致的无线环境表示。该方法首先构建了一个基于几何先验的贝叶斯信道图(BCM),将稀疏位置标记的信道状态信息(CSI)转换为密集的概率监督并估计不确定性。然后,将可学习的EM属性场嵌入到可微射线追踪(RT)基于的信道计算中,从而通过显式的传播链进行校准。在公共和现实场景中的实验表明,WEDT实现了准确的信道预测,能够泛化到未见的收发器拓扑,并在不同采样条件下保持有效性。WEDT还提供了用于材料相关环境感知、更可靠的物理层规划以及更高质量的无线AI合成数据生成的用途。这些结果展示了所提出范式在构建传播一致的WEDT及其相关无线应用中的价值。

英文摘要

Digital twins (DTs) are promising for wireless deployment, optimization, and data generation, but building a propagation-faithful twin from sparse real measurements remains difficult. This paper proposes a wireless environment digital twin (WEDT) construction paradigm that evolves a reconstructed geometric DT into a propagation-consistent wireless environment representation through calibration of a scene-level electromagnetic (EM) property field. Instead of directly fitting link-specific channel responses, the proposed paradigm first constructs a geometry-prior Bayesian channel map (BCM) to convert sparse position-labeled channel state information (CSI) into dense probabilistic supervision with uncertainty estimates. It then embeds the learnable EM property field into differentiable ray tracing (RT) based channel computation, thereby enabling calibration through an explicit propagation chain. Experiments in both public and real-world scenes show that WEDT achieves accurate channel prediction, generalizes to unseen transceiver topologies, and remains effective across different sampling conditions. WEDT also offers utility for material-related environment sensing, more reliable physical-layer planning, and higher-quality synthetic data generation for wireless AI. These results demonstrate the value of the proposed paradigm for propagation-consistent WEDT construction and related wireless applications.

2605.22360 2026-05-22 eess.SP cs.SY eess.SY

Low-Complexity Tensor Beamforming for RIS-Aided Multiuser Multistream MIMO Systems

低复杂度张量波束成形用于RIS辅助多用户多流MIMO系统

Bruno Sokal, André L. F. de Almeida, Martin Haardt

AI总结 本文提出了一种低复杂度的张量波束成形方法,用于RIS辅助的多用户多流MIMO系统,通过张量投影优化接收组合器、用户预编码器和RIS相位向量,以降低计算复杂度并提高大规模RIS的性能。

详情
Comments
5 pages, 3 figures,
AI中文摘要

我们针对具有联合检测的上行RIS辅助多用户多流MIMO系统,解决联合主动和被动波束成形问题。通过三阶复合通道张量,将接收组合器、块对角用户预编码器和RIS相位向量的耦合设计进行建模。利用这种多线性结构,我们提出了一种多流张量交替优化方法,通过低维张量投影来更新组合器、用户预编码器和RIS系数。仿真结果表明,所提方法在降低计算复杂度并提高大规模RIS扩展性能方面优于多启动交替优化基准。

英文摘要

We address joint active and passive beamforming for uplink RIS-assisted multi-user multi-stream MIMO systems with joint detection. The coupled design of the receive combiner, block-diagonal user precoders, and RIS phase vector is formulated through a third-order composite channel tensor. Exploiting this multilinear structure, we propose a multi-stream tensor alternating optimization method that updates the combiner, user precoders, and RIS coefficients via low-dimensional tensor projections. Simulations show that the proposed method approaches a multi-start alternating-optimization benchmark while reducing computational complexity and improving large-RIS scaling.

2605.22354 2026-05-22 stat.ME eess.SP

From Volterra Series to Kunchenko Stochastic Polynomials: Half a Century of Non-Gaussian Estimation Methodology

从Volterra级数到Kunchenko随机多项式:半个世纪的非高斯估计方法学

Serhii Zabolotnii

AI总结 本文回顾了由Yuriy P. Kunchenko(1939-2006)创立的科学学派半个世纪的发展历程,探讨了非高斯估计的半参数方法。从Kunchenko 1972/1973年的博士论文开始,应用Volterra级数估计随机过程参数,直到2006-2026年。Kunchenko随机多项式被呈现为一组一致的矩-混积程序:参数估计的多项式最大化方法(PMM)、假设检验的多项式准则以及在生成元空间中的分解。文章详细描述了学派的结构:15篇通过的博士论文的验证家谱、波兰、斯洛伐克和德国的合作,以及R包EstemPMM。分析了2026年一篇基于Volterra的信号处理论文,展示了Kunchenko的非线性公式在应用无线电工程中的再现。建立了有限Volterra模型与广义Kunchenko多项式之间的正式桥梁,同时将MMSE/L2准则与PMM分开:前者是核适应的协方差投影,而PMM是参数依赖的矩程序。PMM效率声明是条件性的:收益要求矩存在,中心相关矩阵非退化,且方差缩减系数低于一。结论研究计划将历史重建转化为可测试的统计和信号处理任务。

详情
Comments
Bilingual submission: English followed by Ukrainian translation
AI中文摘要

本文重建了由Yuriy P. Kunchenko(1939-2006)创立的科学学派半个世纪的发展历程,探讨了非高斯估计的半参数方法。从Kunchenko 1972/1973年的博士论文开始,应用Volterra级数估计随机过程参数,直到2006-2026年。Kunchenko随机多项式被呈现为一组一致的矩-混积程序:参数估计的多项式最大化方法(PMM)、假设检验的多项式准则以及在生成元空间中的分解。文章详细描述了学派的结构:15篇通过的博士论文的验证家谱、波兰、斯洛伐克和德国的合作,以及R包EstemPMM。分析了2026年一篇基于Volterra的信号处理论文,展示了Kunchenko的非线性公式在应用无线电工程中的再现。建立了有限Volterra模型与广义Kunchenko多项式之间的正式桥梁,同时将MMSE/L2准则与PMM分开:前者是核适应的协方差投影,而PMM是参数依赖的矩程序。PMM效率声明是条件性的:收益要求矩存在,中心相关矩阵非退化,且方差缩减系数低于一。结论研究计划将历史重建转化为可测试的统计和信号处理任务。

英文摘要

This paper reconstructs the half-century evolution of the scientific school founded by Yuriy P. Kunchenko (1939--2006) as the development of a semiparametric methodology for non-Gaussian estimation. Starting with Kunchenko's 1972/1973 dissertation applying Volterra series to estimate parameters of random processes, the trajectory is followed through 2006--2026. Kunchenko stochastic polynomials are presented as a coherent family of moment-cumulant procedures: the polynomial maximization method (PMM) for parameter estimation, polynomial criteria for hypothesis testing, and decomposition in spaces with a generating element. The paper details the school's structure: a verified genealogy of 15 defended dissertations, collaborations in Poland, Slovakia, and Germany, and the R package EstemPMM. A recent 2026 paper on Volterra-based signal processing is analyzed, showing how Kunchenko's nonlinear formulation reappears in applied radio engineering. We build a formal bridge between finite Volterra models and generalized Kunchenko polynomials, while separating the MMSE/L2 criterion from PMM: the former is a covariance projection for kernel adaptation, whereas PMM is a parameter-dependent moment procedure. PMM efficiency claims are stated conditionally: gains require that moments exist, the centered correlant matrix is nondegenerate, and the variance reduction coefficient is below one. The concluding research program operationalizes the historical reconstruction into testable statistical and signal-processing tasks.

2605.22288 2026-05-22 cs.IT eess.SP math.IT

Multi-Cell 6DMA: Cooperative Interference Management and Antenna Rotation Optimization

多小区6DMA:协作干扰管理和天线旋转优化

Qijun Jiang, Xiaodan Shao, Rui Zhang

AI总结 本文研究了一种多小区六维可移动天线(6DMA)网络,旨在通过协作干扰管理提升下行链路通信性能。通过联合优化短时下行预编码和长时6DMA旋转,提出了一种平均加权总速率最大化问题,以解决多小区系统中的天线旋转设计和传输预编码固有的耦合问题。

详情
Comments
14 pages, 12 figures; submitted to IEEE for possible publication
AI中文摘要

本文研究了一种多小区六维可移动天线(6DMA)网络,旨在通过协作干扰管理提升下行链路通信性能。每个基站(BS)配备多个6DMA表面,6DMA旋转影响小区内用户所需信号增强和对邻小区的干扰泄漏,这使得天线旋转设计和传输预编码在BS之间固有耦合。为了解决这个问题,我们通过联合优化短时下行预编码和长时6DMA旋转,提出了一种多小区系统的平均加权总速率最大化问题,以在实际天线几何约束下进行优化。为解决由此产生的非凸问题,我们提出了一种基于邻近BS之间干扰功率约束(IPC)协调的分布式双时间尺度设计,在此之下,每个BS根据即时信道状态信息(CSI)进行本地短时预编码优化,并根据统计CSI进行长时6DMA旋转更新,同时限制了BS间的信息交换。特别地,开发了一种基于两阶段一维网格搜索和随机最大匹配的边缘-wise IPC协调机制,以实现可扩展的分布式实现。还提供了一个集中式的离线基准用于性能比较。数值结果表明,所提出的分布式设计在不同干扰条件下能够实现接近集中式基准的性能,同时随着网络规模的增加,保持了良好的可扩展性。

英文摘要

In this paper, we investigate a multi-cell six-dimensional movable antenna (6DMA) network for enhancing downlink communication performance under inter-cell interference (ICI). Each base station (BS) is equipped with multiple 6DMA surfaces, and the 6DMA rotations affect both the desired-signal enhancement for in-cell users and the interference leakage toward neighboring cells, which makes the antenna-rotation design and transmit precoding intrinsically coupled across BSs. To address this issue, we formulate an average weighted sum-rate maximization problem for the multi-cell system by jointly optimizing the short-term downlink precoders and long-term 6DMA rotations under practical antenna geometric constraints. To tackle the resulting nonconvex problem, we propose a distributed two-timescale design based on inter-cell interference power constraint (IPC) coordination among neighboring BSs, under which each BS performs local short-term precoder optimization based on instantaneous channel state information (CSI) and long-term 6DMA rotation update according to statistical CSI with limited inter-BS information exchange. In particular, an edge-wise IPC coordination mechanism based on two-stage one-dimensional grid search and random maximal matching is developed to enable scalable distributed implementation. A centralized offline benchmark is also provided for performance comparison. Numerical results show that the proposed distributed design achieves performance close to the centralized benchmark under different interference conditions, while maintaining favorable scalability as the network size increases.

2605.22262 2026-05-22 cs.SD cs.LG eess.AS

Automatic Contextual Audio Denoising

自动上下文音频去噪

Diep Luong, Konstantinos Drossos, Mikko Heikkinen, Tuomas Virtanen

AI总结 本文提出了一种自动上下文音频去噪方法,通过推断音频场景类别来区分有用和无关声音成分,从而提高去噪效果。

详情
AI中文摘要

音频上下文决定了哪些声音成分和来源是相关的,哪些可以被听众感知为无关(噪声)。例如,在城市监控中交通噪声是有信息的,而在同一地点的电话通话中则为噪声。大多数当前的音频去噪系统使用固定的目标-噪声定义,往往在一种上下文中去除有用成分而在另一种上下文中无法抑制无关成分。为此,我们引入了自动上下文音频去噪(ACAD)的概念,该概念基于推断的上下文定义目标和噪声。在本工作中,我们将上下文限制为与声学场景类别相关联。我们将场景类别外的事件分布之外的声音事件(噪声)标记为离上下文(OC),而典型于该场景的事件标记为在上下文中(IC)。我们实现了一种深度学习方法,该方法能够自动推断音频信号的上下文并去除OC成分,并将其与无上下文推断、有 oracle 上下文和单独提供无信息上下文的变体进行比较。在跨多样上下文的配对干净/噪声数据上,其中一种上下文中的OC成分可能在另一种上下文中是IC,我们的方法在标准客观指标上优于其他方法,表明模型能够推断上下文,并且上下文依赖的处理可以增强去噪。

英文摘要

Audio context determines which sound components and sources are relevant and which can be perceived as irrelevant (noise) by listeners. For example, traffic noise is informative in urban surveillance but noise for a phone call at the same location. Most current audio denoising systems apply fixed target-noise definitions, often removing useful components in one context while failing to suppress irrelevant components. To address this, we introduce the concept automatic contextual audio denoising (ACAD) which defines target and noise based on the inferred context. In this work, we restrict context to be associated with an acoustic scene class. We label sound events outside the event distribution of a scene class (noise) as out-of-context (OC) and events typical for that scene as in-context (IC). We implement a deep learning method that automatically infers the context of the audio signal and removes OC components, and benchmark it against variants: without context inference, with oracle context, and with separately provided uninformative context. On paired clean/noisy data across diverse contexts, where OC components in one context may be IC in another, our proposed method outperforms other approaches across standard objective metrics, indicating that the model can infer context and context-dependent processing can enhance denoising.

2605.22251 2026-05-22 math.OC cs.SY eess.SY

Online Optimization with Unknown Time-Varying Parameters from Noisy Gradient Measurements

具有未知时间变化参数的在线优化(基于噪声梯度测量)

Shivanshu Tripathi, Maziar Raissi

AI总结 研究在线优化问题,其中成本函数依赖于不可测的时间变化参数,通过控制理论工具从梯度观测中重建潜在参数,使用工具变量估计参数动态,并预测参数以计算未来最小值,提供预期跟踪误差的界。

详情
AI中文摘要

我们研究在线优化问题,其中成本函数依赖于不可测的时间变化参数,这些参数由未知动态控制。具体而言,我们考虑一个强凸成本函数,其线性项按照未知的线性随机动态演变,而算法只能获取有限的噪声梯度测量。我们提出了一种解决方案,利用控制理论工具从梯度观测中使用高斯-马尔可夫估计器重建潜在参数,然后使用工具变量估计器识别参数动态,并最终预测参数以计算未来最小值。我们提供了预期跟踪误差的界。我们通过一系列数值示例展示了算法的有效性。

英文摘要

We study online optimization problems in which the cost function depends on latent, time-varying parameters that are unmeasurable and governed by unknown dynamics. Specifically, we consider a strongly convex cost function whose linear term evolves according to unknown linear stochastic dynamics, while the algorithm has access only to finite noisy gradient measurements. We propose a solution that uses control theoretic tools to reconstruct the latent parameters from gradient observations using a Gauss-Markov estimator, then identifies the parameter dynamics using an instrumental-variable estimator, and finally forecasts the parameters to compute the future minimizer. We provide a bound on the expected tracking error. We illustrate the effectiveness of our algorithm on a series of numerical examples.

2605.22207 2026-05-22 eess.SY cs.LG cs.SY

Kernel-Based Safe Exploration in Deep Reinforcement Learning

基于核的深度强化学习安全探索

Rupak Majumdar, Nikhil Singh, Sadegh Soudjani

AI总结 本文提出了一种基于核的方法,用于在深度强化学习中安全探索,通过学习屏障函数来保证策略不会进入危险区域,同时在探索过程中同时学习最优策略和屏障函数,提供更可靠的概率安全保证。

详情
Comments
Accepted at L4DC Conference (22 Jan 2026)
AI中文摘要

安全性在将深度强化学习算法部署到现实世界时是一个主要关注点。一种有前景的方向是学习一个屏障函数,以确保学习的策略不会访问危险区域。屏障函数是从状态到实数的函数,它将初始状态赋予低值,将危险状态赋予高值,并在每次转移中减少期望值;这样的函数可用于限制到达危险状态的概率。以前的研究直接从探索数据中学习屏障函数,但需要大量数据或对系统动力学的限制。在本文中,我们展示了如何利用核嵌入来学习深度强化学习中随机系统的屏障函数。我们的算法,称为基于核的安全探索(KBSE),在探索过程中同时学习最优策略和屏障函数。屏障函数是通过迭代计算得到的,并以条件均值嵌入表示,随着探索的增加,它们提供更好的概率安全保证。探索算法使用学习到的屏障函数来识别安全违规。在发生违规时,它会干预,将危险动作改为安全动作,从而确保探索仅限于限制到达危险状态概率的动作。我们评估了KBSE在多个复杂的连续控制基准上的性能。实验结果表明,我们的新算法适用于合成概率安全的控制策略,而不会影响奖励的累积。

英文摘要

Safety has been a major concern when deploying deep reinforcement learning algorithms in the real world. A promising direction that ensures that the learned policy does not visit unsafe regions is to learn a \emph{barrier function} along with the policy. A barrier is a function from states to reals that assigns low values to the initial states, high values to the unsafe states, and decreases in expectation on each transition; such a function can be used to bound the probability of reaching unsafe states. Previous attempts learned a barrier function directly from exploration data, but this required either large amounts of data or restrictions on the system dynamics. In this paper, we show how kernel embeddings can be used to learn barrier functions during deep reinforcement learning for stochastic systems with unknown dynamics. Our algorithm, \emph{kernel-based safe exploration (KBSE)}, learns an optimal policy and a barrier simultaneously during exploration. The barriers are computed iteratively, represented as conditional mean embeddings, and provide better probabilistic safety guarantees with more exploration. The exploration algorithm uses the learned barrier functions to identify safety violations. In the case of violation, it intervenes to modify the unsafe action to a safe action, thereby ensuring that the exploration is restricted to actions that bound the probability of reaching unsafe states. We evaluate KBSE on several complex continuous control benchmarks. Experimental results establish our new algorithm to be suitable for synthesizing control policies that are probabilistically safe without degradation in reward accumulation.

2605.22171 2026-05-22 eess.SY cs.SY

Equilibrium-Free Contraction Stability Analysis for Grid-Forming Converter-Based Microgrids

基于无均衡收缩理论的电网形成逆变器微电网平衡稳定性分析

Shijie Peng, Xiuqiang He, Xi Ru, Hua Geng

AI总结 本文提出了一种基于半收缩理论的无均衡收缩稳定性方法,用于分析由电网形成(GFM)逆变器主导的可再生能源微电网在持续功率波动下的稳定性,通过构建对称感知的投影状态空间消除内在旋转模式,并利用块状雅可比分解来表征有功和无功功率动态,从而得到可计算的区域收缩条件,进一步转换为前向不变稳定性证书,为轨迹级性能提供保障。

详情
AI中文摘要

由电网形成(GFM)逆变器主导的可再生能源微电网面临持续功率波动,使得基于均衡的稳定性评估变得受限。本文开发了一种基于半收缩理论的无均衡收缩稳定性方法。通过构建对称感知的投影状态空间,消除由均匀角度偏移引起的内在旋转模式。引入块状雅可比分解来表征耦合的有功和无功功率动态,从而得到可计算的区域收缩条件。该条件随后转换为前向不变稳定性证书,提供轨迹级性能保证。对于无干扰的自主运行,该方法提供无均衡非线性稳定性表征以及吸引区域(ROA)的估计。对于存在干扰的非自主运行,它推导出在缓慢变化注入下的准稳态跟踪的显式界限,以及在快速或复合干扰下的鲁棒性界限。对9节点系统的案例研究验证了所提方法。

英文摘要

Renewable-driven microgrids dominated by grid-forming (GFM) converters are subject to persistent power fluctuations, making equilibrium-known stability assessments restrictive. This paper develops an equilibrium-free contraction stability method based on semi-contraction theory. By formulating the system in a symmetry-aware projected state space, the intrinsic rotational mode induced by uniform angle shifts is removed. A blockwise Jacobian decomposition is introduced to characterize the coupled active and reactive power dynamics, yielding a computable regional contraction condition. This condition is then converted into forward-invariant stability certificates that provide trajectory-level performance guarantees. For autonomous operation without disturbances, the method provides an equilibrium-free nonlinear stability characterization together with an estimation of the region of attraction (ROA). For non-autonomous operation under disturbances, it derives explicit bounds for quasi-steady tracking under slowly varying injections and for robustness under fast or composite disturbances. Case studies on a 9-bus system validate the proposed method.

2605.22120 2026-05-22 eess.AS cs.SD

Effective User-defined Keyword Spotting with Dual-stage Matching, Multi-modal Enrollment, and Continual Adaptation

高效的用户定义关键词侦测:双阶段匹配、多模态注册与持续适应

Zhiqi Ai, Han Cheng, Shiyi Mu, Xinnuo Li, Yongjin Zhou, Shugong Xu

AI总结 本文提出DMA-KWS框架,通过双阶段匹配、多模态注册和持续适应方法,解决用户定义关键词侦测中的混淆词区分、说话人发音不一致和高数据成本问题,实验表明其在LibriPhrase Hard子集上达到97.85%的AUC和6.13%的EER,性能领先。

详情
Comments
14 pages, 13 figures, 12 tables. Accepted by TASLP
AI中文摘要

用户定义关键词侦测(KWS)对于个性化语音交互至关重要,但现有方法面临几个挑战:(1)混淆词之间的区分度不足,(2)在发音不同的说话人之间性能不一致,(3)高数据成本以确保可靠的唤醒词性能。本文介绍DMA-KWS,一种高效的、稳健的用户定义关键词侦测框架。首先,它采用双阶段匹配流程:CTC解码结合流式音素搜索来定位候选段,随后使用QbyT结合音素匹配器进行精细验证,使其能够更好地区分混淆词。接下来,多模态注册融合用户特定的语音与文本嵌入,进一步提高已注册用户的准确性。最后,参数高效的持续适应机制通过合成和真实数据进行轻量级更新。广泛的实验表明DMA-KWS的优越性能。在LibriPhrase Hard子集上,它实现了97.85%的AUC和6.13%的EER,达到最先进的性能。在说话人依赖设置中,DMA-KWS始终优于文本-only注册,显示出显著的性能提升。此外,所提出的参数高效的微调机制仅需187k个更新参数即可适应DMA-KWS,进一步提高KWS性能,同时确保适用于设备部署。

英文摘要

User-defined keyword spotting (KWS) is crucial for personalized voice interaction, yet existing methods face several challenges: (1) insufficient discriminability among confusable words, (2) performance inconsistency across speakers with varying pronunciations, and (3) high data cost to ensure reliable wake-word performance. In this paper, we introduce DMA-KWS, an efficient and robust framework for user-defined keyword spotting. First, it adopts a dual-stage matching pipeline: CTC decoding with streaming phoneme search to locate candidate segments, followed by QbyT with a phoneme matcher for fine-grained verification, enabling it to better distinguish confusable words. Next, multi-modal enrollment fuses user-specific speech with text embeddings to further improve accuracy for registered users. Finally, a parameter-efficient continual adaptation mechanism performs lightweight updates using synthetic and real data. Extensive experiments demonstrate the superior performance of DMA-KWS. On the LibriPhrase Hard subset, it achieves 97.85% AUC and 6.13% EER, reaching state-of-the-art performance. In speaker-dependent settings, DMA-KWS consistently outperforms text-only enrollment, demonstrating significant performance gains. Moreover, the proposed parameter-efficient fine-tuning mechanism adapts DMA-KWS with only 187k updated parameters, further enhancing KWS performance while ensuring suitability for on-device deployment.

2605.22117 2026-05-22 eess.SP

Beyond Spherical Wavefront: Near-Field Channel Estimation Under Wavefront Anisotropy

超越球面波前:考虑波前各向异性下的近场信道估计

Heling Zhang, Xiujun Zhang, Xiaofeng Zhong, Shidong Zhou

AI总结 本文针对存在近场曲面反射面时波前各向异性问题,提出了一种参数化模型和基于物理参数恢复的信道估计算法,验证了该算法在各向异性波前场景中的有效性。

详情
AI中文摘要

极大规模孔径阵列(ELAAs)和毫米波(mmWave)技术是实现未来无线通信系统高数据速率的关键。为了实现精确波束成形,这些系统需要准确的信道估计,其中必须考虑近场波前曲率效应。现有的信道估计方法依赖于球面波前信道(SWC)模型,该模型适用于具有点源、散射体和反射面的近场传播。然而,当存在近场曲面反射面时,反射波的波前变为各向异性而非球面,导致SWC模型不再准确。为了解决这个问题,本文提出了一种参数化模型用于各向异性波前信道(AWC)。基于该模型,我们提出了一种基于物理参数恢复的AWC信道估计算法。仿真结果表明,AWC在角度-距离域中不再保持稀疏性。此外,结果还展示了不同传播场景的物理特性如何影响波前各向异性程度,并验证了所提出算法在AWC场景中的有效性。

英文摘要

Extremely large aperture arrays (ELAAs) and millimeter-wave (mmWave) technologies are essential for achieving high data rates in future wireless communication systems. To perform precise beamforming, these systems require accurate channel estimation, in which the near-field wavefront curvature effect must be taken into account. Existing channel estimation methods rely on the spherical wavefront channel (SWC) model, which is suitable for near-field propagation with point sources, scatterers, and reflection planes. However, when a near-field curved reflecting surface exists, the wavefront of the reflected wave becomes anisotropic rather than spherical, causing the SWC model to become inaccurate. To address this problem, in this paper, we formulate a parameterized model for the anisotropic wavefront channel (AWC). Using this model, we propose a channel estimation algorithm based on physical parameter recovery for the AWC. Simulation results reveal that the AWC no longer retains sparsity in the angle-distance domain. Furthermore, the results demonstrate how different physical characteristics of the propagation scenario affect the degree of wavefront anisotropy, and confirm the effectiveness of our proposed algorithm in AWC scenarios.

2605.19965 2026-05-22 cs.LG eess.SP

Normative Networks for Source Separation via Local Plasticity and Dendritic Computation

通过局部可塑性和树突计算进行源分离的规范网络

Bariscan Bozkurt, Efe Ali Gorguner, Francesco Innocenti, Rafal Bogacz

AI总结 本文提出了一种基于局部可塑性和树突计算的预测熵最大化方法,用于源分离,该方法在结构化源域上最大化正则化的二阶熵,实现了在增加的源相关性和观测噪声下的鲁棒性,并在生物合理算法和精确基线中表现优异。

详情
AI中文摘要

盲源分离(BSS)是研究如何从感觉混合中恢复潜在原因的自然框架,但推导出针对结构化(即受限于已知领域)且可能相关源的在线和生物合理算法仍然具有挑战性。最近的工作从最大化熵度量出发推导出BSS的神经网络,但其在线实现涉及复杂且非局部的递归动力学。受此视角启发,我们提出了预测熵最大化方法,仅使用局部权重更新即可实现BSS的竞争力。该方法采用熵度量的近似,产生一个具有易于解释组件的目标函数。最小化该目标导致预测神经架构,其中前馈突触遵循误差驱动规则(可通过树突机制实现),横向抑制连接通过局部海马体可塑性学习,源域约束通过简单的输出非线性性强制执行。我们推导了对偶误差的显式频谱界限,表征了何时近似是准确的。经验上,预测熵最大化在增加的源相关性和观测噪声下保持稳健,优于依赖更强独立性或去相关假设的生物合理算法,并在精确行列式和相关信息基线中表现竞争。这些结果展示了如何通过最大化结构化源域上的正则化二阶熵,使局部可塑性和适应性横向抑制得以出现。我们的实现代码可在https://github.com/BariscanBozkurt/Predictive-Entropy-Maximization上获得。

英文摘要

Blind source separation (BSS) is a natural framework for studying how latent causes may be recovered from sensory mixtures, but deriving online and biologically plausible algorithms for structured (i.e., constrained to known domains) and potentially correlated sources remains challenging. Recent work has derived neural networks for BSS from maximization of an entropy measure, yet its online implementations involve complex and nonlocal recurrent dynamics. Motivated by this perspective, we propose Predictive Entropy Maximization, which achieves competitive performance in BSS, using only local weight updates. The method employs a close approximation of an entropy measure, yielding an objective function with easily interpretable components. Minimizing this objective leads to a predictive neural architecture in which feedforward synapses follow an error-driven rule (that can be realized through dendritic mechanisms), lateral inhibitory connections are learned with local Hebbian plasticity, and source-domain constraints are enforced through simple output nonlinearities. We derive explicit spectral bounds on the surrogate error, characterizing when the approximation is accurate. Empirically, Predictive Entropy Maximization remains robust under increasing source correlation and observation noise, outperforms biologically plausible algorithms that rely on stronger independence or decorrelation assumptions, and remains competitive with exact determinant- and correlative-information-based baselines. These results show how local plasticity and adaptive lateral inhibition can emerge from maximizing a regularized second-order entropy over structured source domains. Our implementation code is available at https://github.com/BariscanBozkurt/Predictive-Entropy-Maximization.

2605.19354 2026-05-22 eess.IV cs.CV

Next-Acceleration-Scale Prediction for Autoregressive MRI Reconstruction

用于自回归MRI重建的下一步加速尺度预测

Yilmaz Korkmaz, Vishal M. Patel

AI总结 本文提出了一种基于离散多尺度潜在空间的自回归下一步加速尺度预测方法,通过引入特权信息蒸馏技术,提升了在极端欠采样下的MRI重建性能。

详情
AI中文摘要

MRI重建本质上是一个病态的逆问题,因为不完整的测量允许许多可能的解决方案。在高加速情况下,这种不确定性变得更加严重,像素域连续预测器倾向于在可行的重建之间平均并抑制高频解剖结构。我们通过将重建移动到离散多尺度潜在空间,并将其作为自回归下一步加速尺度预测来解决这一限制。利用在视觉自回归建模中证明有效的离散先验,我们的方法将解限制在紧凑的代码本令牌序列中,即使从极稀疏的测量中也能实现锐利的重建。这种离散自回归公式也自然与现代大型语言模型后训练技术对齐。基于这一观察,我们引入了视觉自回归建模中的在线策略特权信息蒸馏,其中教师仅在训练时使用不可用的特权上下文进行训练,在本案例中是完全采样获取,监督学生在自己的滚动生成中进行训练,从而实现一致的重建增益。通过在fastMRI基准上的广泛实验,我们展示了我们的方法在各种采样模式下在极端欠采样下提供了改进的重建性能。项目网站是https://yilmazkorkmaz1.github.io/discrete-mri-reconstruction-opd/。

英文摘要

MRI reconstruction is an inherently ill-posed inverse problem, since incomplete measurements admit many plausible solutions. This ambiguity becomes more severe under high acceleration, where pixel-domain continuous predictors tend to average over feasible reconstructions and suppress high-frequency anatomy. We address this limitation by moving reconstruction to discrete multi-scale latent space and posing it as autoregressive next-acceleration-scale prediction. Leveraging discrete priors proven effective in visual autoregressive modeling, our method restricts the solution to compact sequences of codebook tokens, enabling sharp reconstructions even from extremely sparse measurements. This discrete autoregressive formulation also aligns naturally with modern large language model post-training techniques. Building on this observation, we introduce on-policy privileged information distillation for visual autoregressive modeling, where a teacher is provided training only privileged context that is unavailable at inference, in our case fully sampled acquisitions, and supervises a student trained on its own rollouts, leading to consistent reconstruction gains. Through extensive experiments on the fastMRI benchmark, we show that our approach delivers improved reconstruction performance across diverse sampling patterns under extreme undersampling. Project website is \href{https://yilmazkorkmaz1.github.io/discrete-mri-reconstruction-opd/}{here}.

2605.19081 2026-05-22 eess.SP

Automotive Radar Performance in Environments with Multiple Interference Sources

汽车雷达在多重干扰源环境中的性能

Oren Longman, Guy Mardiks, Tomer Maayan, Gaston Solodky

AI总结 本文研究了高密度干扰环境下汽车雷达的性能,提出了一种端到端的仿真框架,评估了多种干扰场景对雷达性能的影响,并验证了传统干扰抑制技术的局限性,强调了未来需要协调和可扩展的干扰管理策略。

详情
AI中文摘要

汽车雷达正越来越容易受到邻近雷达系统的相互干扰,这可能导致虚假目标检测和有效目标的掩盖。尽管当前的干扰水平仍可管理,由于雷达车辆的渗透率相对较低,但这一假设预计随着雷达的普及和每辆车雷达密度的增加而崩溃。本文对高密度干扰环境下的汽车雷达性能进行了全面分析。在中间频段(IF)级别开发了一个现实的端到端仿真框架,结合了分析性干扰建模和详细的雷达信号处理。研究评估了干扰在一系列未来场景中的影响,这些场景以增加的雷达密度和每辆车的多雷达配置为特征。传统干扰抑制技术被系统地评估以验证仿真结果,通过使用暴露于多达30个干扰雷达的主机雷达,在消音和真实环境进行了受控实验。结果表明,在高干扰条件下性能显著下降,检测概率和有效范围有显著减少。在评估的技术中,时频编码始终提供最稳健的性能,即使在雷达渗透率较高时仍能保持较高的检测概率。这些发现突显了当前抑制方法的局限性,并强调了未来汽车雷达系统中协调和可扩展的干扰管理策略的重要性。

英文摘要

Automotive radars are increasingly susceptible to mutual interference from neighboring radar systems, which can lead to false target detections and the masking of valid targets. While current interference levels remain manageable due to the relatively low penetration of radar-equipped vehicles, this assumption is expected to break down as radar adoption and per-vehicle radar density continue to increase. This paper presents a comprehensive analysis of automotive radar performance in high-density interference environments. A realistic end-to-end simulation framework is developed at the intermediate frequency (IF) level, incorporating analytical interference modeling and detailed radar signal processing. The study evaluates the impact of interference across a range of future scenarios characterized by increased radar density and multiple radar configurations per vehicle. Conventional interference mitigation techniques are systematically assessed to validate the simulation results, controlled experiments were conducted using a host radar exposed to up to 30 interfering radars in both anechoic and real-world environments. The results demonstrate significant performance degradation under high interference conditions, with substantial reductions in detection probability and effective range. Among the evaluated techniques, time-frequency coding consistently provides the most robust performance, maintaining high detection probability even at elevated radar penetration rates. These findings highlight the limitations of current mitigation approaches and emphasize the need for coordinated and scalable interference management strategies in future automotive radar systems.

2605.17950 2026-05-22 cs.RO cs.SY eess.SY

Active Defense Against False Data Injection Attacks in Robotic Manipulators

对抗机器人机械臂中虚假数据注入攻击的主动防御

Gabriele Gualandi, Carl Mikael Larsson, Alessandro V. Papadopoulos

AI总结 本文提出两种防御方法,即异常感知虚拟阻尼和操作性降低,以提高机器人机械臂在有限时间范围内抵御虚假数据注入攻击的能力,并通过仿真验证其有效性。

详情
Comments
Extended 8-page version containing full proofs. An abridged 6-page version has been accepted for publication in the Proceedings of the 23rd IFAC World Congress (2026). v3: Minor typographical fixes and updated reference formatting
AI中文摘要

机器人系统容易受到虚假数据注入攻击(FDIAs)的影响,其中攻击者通过篡改传感器信号来获得恶意控制。反馈线性化使机器人系统暴露于积分器漏洞,使其容易受到隐蔽攻击,这些攻击可能导致末端执行器行为出现显著偏差而不会引发警报。本文通过形式化两种防御方法,即异常感知虚拟阻尼和操作性降低,以提高机械臂在有限时间范围内抵御FDIAs的韧性,并在名义任务执行中提供概率保证。在7自由度冗余机械臂上的仿真显示,所提出的防御方法在与仅使用阈值基ADS如卡方检测相比时,显著减少了FDIA的影响,同时在无攻击情况下保持了名义任务性能。

英文摘要

Robotic systems are vulnerable to False Data Injection Attacks (FDIAs), where adversaries corrupt sensor signals to gain malicious control. Feedback linearization exposes robotic systems to integrator vulnerability, making them susceptible to stealthy attacks that can cause significant deviations in end-effector behavior without raising alarms. This paper addresses the resilience of manipulators against finite-horizon FDIAs by formalizing two defense methods, namely anomaly-aware virtual damping and manipulability reduction, with probabilistic guarantees on nominal task execution. Simulations on a 7-DOF redundant manipulator show that the proposed defenses substantially reduce the impact of FDIA compared to using solely a threshold-based ADS like the Chi-squared, while preserving nominal task performance in the absence of attack.

2605.01369 2026-05-22 eess.SP cs.AI cs.LG

MU-SHOT-Fi: Self-Supervised Multi-User Wi-Fi Sensing with Source-free Unsupervised Domain Adaptation

MU-SHOT-Fi: 基于源无关无监督域适应的多用户Wi-Fi感知

Ahmed Y. Radwan, Hina Tabassum

AI总结 本文提出MU-SHOT-Fi框架,通过源无关无监督域适应方法,在单用户和多用户Wi-Fi感知中实现准确的活动分类和占用估计,同时防止模型崩溃。

详情
Journal ref
IEEE Internet of Things Journal, Early Access, 2026
AI中文摘要

深度学习已被广泛应用于基于Wi-Fi CSI的人体活动识别(HAR),因为它能够以隐私保护和成本效益的方式学习时空特征。然而,基于深度学习的模型在跨环境泛化能力差,特别是在多用户设置中,重叠活动导致CSI纠缠和域偏移。实际部署通常由于隐私限制限制访问标记源数据,这促使使用仅未标记目标域CSI和预训练源模型进行源无关适应。在本文中,我们提出了MU-SHOT-Fi,一种用于单用户和多用户Wi-Fi感知的源无关无监督域适应框架。MU-SHOT-Fi在源训练期间采用排列不变的集合预测与匈牙利匹配,随后在目标域中采用冻结分类器骨干适应。为了实现无标签的稳定适应,我们引入了占用加权信息最大化,通过将多样性正则化集中在可能占用的槽位上,同时排除主导类别的边际熵。此外,我们采用二进制旋转预测作为空间自监督,利用CSI频率-时间结构学习域不变特征。对于单用户场景,我们引入SU-SHOT-Fi,通过将占用加权替换为标准信息最大化,并结合对比预测编码以利用时间一致性。在WiMANS和Widar 3.0数据集上进行了广泛的实验,涵盖了跨环境、跨频率、跨方向和组合域偏移,证明MU-SHOT-Fi在大域偏移下有效恢复多用户精确活动分类性能,同时保持准确的占用估计并防止向主导类崩溃。

英文摘要

Deep learning has been widely adopted for WiFi CSI-based human activity recognition (HAR) due to its ability to learn spatio-temporal features in a privacy-preserving and cost-effective manner. However, DL-based models generalize poorly across environments, a challenge amplified in multi-user settings where overlapping activities cause CSI entanglement and domain shifts. Practical deployments often limit access to labeled source data due to privacy constraints, motivating source-free adaptation using only unlabeled target-domain CSI and a pre-trained source model. In this paper, we propose MU-SHOT-Fi, a source-free unsupervised domain adaptation framework for single- and multi-user Wi-Fi sensing. MU-SHOT-Fi employs permutation-invariant set prediction with Hungarian matching during source training, followed by frozen-classifier backbone adaptation in the target domain. To enable stable adaptation without labels, we introduce occupancy-weighted information maximization that prevents model collapse by focusing diversity regularization on likely-occupied slots while excluding the dominant class from marginal entropy. Additionally, we employ binary rotation prediction as spatial self-supervision that exploits CSI frequency-time structure to learn domain-invariant features. For single-user scenarios, we introduce SU-SHOT-Fi by replacing occupancy weighting with standard information maximization and incorporating contrastive predictive coding to exploit temporal consistency. Extensive experiments on the WiMANS and Widar 3.0 datasets across cross-environment, cross-frequency, cross-orientation, and combined domain shifts demonstrate that MU-SHOT-Fi effectively recovers multi-user exact-activity classification performance under large domain shifts while maintaining accurate occupancy estimation and preventing collapse toward dominant classes.

2605.00024 2026-05-22 q-bio.NC eess.SP

Self-organized criticality enables conscious integration through brain-body resonance

自组织临界性通过脑-体共振实现意识整合

Ahmed Gamal Eldin

AI总结 该研究通过脑-体共振维持的自组织临界性揭示了意识整合的机制,发现传统预处理方法会破坏整合动态,而原始数据中的临界动态支持大规模神经协调与事件相关处理的耦合。

详情
AI中文摘要

自组织临界性通过脑-体共振维持,使意识整合成为可能。我们使用64通道EEG数据表明,传统预处理方法无意中消除了其试图测量的整合动态。移除通常被视为'伪影'的生理信号会显著降低全局相位同步与刺激诱发振幅之间的共享方差,这种效应高度特异于生理成分。我们追溯到78毫秒的基本脑-体共振,其通过强大的双向因果关系建立零延迟同步。关键的是,原始数据表现出重尾幂律动态,表明接近临界状态,而传统清洗数据明确拒绝幂律分布,表明人为转向亚临界状态。最后,我们展示这些临界动态能够实现全息信息编码,证据是共振后显著出现的空间干涉图案。这些发现表明,生理信号积极且选择性地支持大规模神经协调与事件相关处理之间的耦合。

英文摘要

The "binding problem" of how distributed neural activity unifies into conscious experience has remained an open challenge since its articulation in 1890. We present evidence that conscious integration relies on self-organized criticality maintained by brain-body resonance, placing human cognition within the universality class of critical systems. Using 64-channel EEG data, we demonstrate that conventional preprocessing inadvertently eliminates the very integrative dynamics it seeks to measure. Removing physiological signals conventionally treated as "artifacts" drastically reduces the shared variance between global phase synchronization and stimulus-evoked amplitude, an effect highly specific to physiological components. We trace this to a fundamental brain-body resonance at 78 milliseconds that establishes zero-lag synchronization driven by robust bidirectional causality. Crucially, raw data exhibits heavy-tailed avalanche dynamics indicative of a near-critical regime, whereas conventionally cleaned data definitively rejects power-law distributions, signaling an artificial shift to subcriticality. Finally, we show these critical dynamics enable holographic information encoding, evidenced by a significant emergence of spatial interference patterns post-resonance. Together, these findings indicate that physiological signals actively and selectively support the coupling between large-scale neural coordination and event-related processing.

2604.26836 2026-05-22 cs.LG cs.SY eess.SY

Uncertainty-Aware Predictive Safety Filters for Probabilistic Neural Network Dynamics

具有不确定性的预测安全过滤器用于概率神经网络动态

Bernd Frauenknecht, Lukas Kesper, Daniel Mayfrank, Henrik Hose, Sebastian Trimpe

AI总结 本文提出了一种具有不确定性的预测安全过滤器(UPSi),通过将未来结果建模为可达集,利用概率集合(PE)神经网络动态模型提供严格的安全预测,从而在模型基于强化学习(MBRL)中提升探索安全性,同时保持与标准MBRL相当的性能。

详情
AI中文摘要

预测安全过滤器(PSFs)利用模型预测控制在深度强化学习(RL)探索期间强制约束满足,但其对第一原理模型或高斯过程的依赖限制了可扩展性和更广泛的应用。同时,基于模型的RL(MBRL)方法通常使用概率集合(PE)神经网络来从数据中捕捉复杂的、高维动态,且在最少的先验知识下。然而,现有将PE整合到PSFs中的尝试缺乏严格的不确定性量化。我们引入了具有不确定性的预测安全过滤器(UPSi),一种通过将未来结果建模为可达集来提供严格安全预测的PSF,利用PE动态模型。UPSi引入了显式的确定性约束,防止模型被利用,并无缝集成到常见的MBRL框架中。我们评估了UPSi在Dyna-style MBRL中的标准安全RL基准上,并报告了在先前神经网络PSFs上显著改进的探索安全性,同时保持与标准MBRL相当的性能。UPSi弥合了现代MBRL的可扩展性和通用性与预测安全过滤器的安全保证之间的差距。

英文摘要

Predictive safety filters (PSFs) leverage model predictive control to enforce constraint satisfaction during deep reinforcement learning (RL) exploration, yet their reliance on first-principles models or Gaussian processes limits scalability and broader applicability. Meanwhile, model-based RL (MBRL) methods routinely employ probabilistic ensemble (PE) neural networks to capture complex, high-dimensional dynamics from data with minimal prior knowledge. However, existing attempts to integrate PEs into PSFs lack rigorous uncertainty quantification. We introduce the Uncertainty-Aware Predictive Safety Filter (UPSi), a PSF that provides rigorous safety predictions using PE dynamics models by formulating future outcomes as reachable sets. UPSi introduces an explicit certainty constraint that prevents model exploitation and integrates seamlessly into common MBRL frameworks. We evaluate UPSi within Dyna-style MBRL on standard safe RL benchmarks and report substantial improvements in exploration safety over prior neural network PSFs while maintaining performance on par with standard MBRL. UPSi bridges the gap between the scalability and generality of modern MBRL and the safety guarantees of predictive safety filters.