arXivDaily arXiv每日学术速递 周一至周五更新
重置
2605.13836 2026-05-14 eess.SY cs.SY

Reachable-Set Decomposition for Real-Time Aggregation of Multi-Zone HVAC Fleets

Jingguan Liu, Xiaomeng Ai, Cong Chen, Shaoze Li, Shichang Cui, Jiakun Fang, Jinyu Wen

AI总结 本文研究了多区域暖通空调(HVAC)系统实时聚合中的灵活性刻画问题,面对区域间强耦合和实时信息逐步揭示带来的挑战,提出了一种可达集分解框架。该方法通过离线阶段构建后向可达集,将剩余时段的可行性转化为每时段的状态约束,结合定制的内近似方法实现高效计算;在实时阶段,通过并行线性规划和功率区间闵科夫斯基求和,快速计算聚合灵活性并保证调度信号的递归可行性。实验验证了该方法在灵活性刻画、分解可行性及计算可扩展性方面的有效性。

Comments 10 pages, 9 figures

详情
英文摘要

Aggregating building heating, ventilation, and air-conditioning (HVAC) fleets provides substantial real-time flexibility to power system operations. However, real-time aggregation of multi-zone HVAC fleets faces two key challenges: (i) strong coupling across zones and time makes flexibility characterization high-dimensional and computationally demanding, and (ii) the sequential revelation of temperature states and exogenous conditions requires that decisions made at each period preserve feasibility over the remaining horizon using only currently realized information. To address these challenges, this paper proposes a reachable-set decomposition framework comprising an offline decomposition stage and a real-time policy. In the offline stage, backward reachable sets are formulated to encode remaining-horizon feasibility into per-period state constraints, so that any state within the current reachable set is guaranteed to sustain feasible operation over the entire remaining horizon. A tailored inner approximation is then developed for tractable calculation in multi-zone-coupled HVAC settings. In the real-time stage, aggregate flexibility is computed efficiently via building-level parallel linear programs followed by closed-form Minkowski summation of power intervals, and any regulation signal within the reported flexibility interval admits a recursively feasible disaggregation. Case studies demonstrate the effectiveness of the proposed framework in aggregate flexibility characterization, disaggregation feasibility, and scalable computation.

2605.13822 2026-05-14 cs.RO cs.SY eess.SY

Loiter UAV Reinsertion Guidance for Fixed-wing UAV Corridors

Pradeep J, Kedarisetty Siddhardha, Ashwini Ratnoo

AI总结 本文研究固定翼无人机走廊中的滞留无人机重新插入主航道的问题,该走廊包括主航道、用于缓解交通拥堵的环形滞留航道以及连接两者的过渡航道。为确保安全无冲突地将滞留无人机重新插入主航道,提出了一种基于虚拟插槽和速度约束的引导算法。该方法通过数值仿真验证了其有效性,为无人机交通管理提供了可行的自动化策略。

详情
Journal ref
AIAA SCITECH 2026
英文摘要

This paper considers fixed-wing unmanned aerial vehicle (UAV) corridors comprising a main lane, a circular loiter lane for managing traffic congestion, and transit lanes connecting the two. In particular, we address the problem of conflict-free reinsertion of UAVs from the loiter lane back into the main lane. The loiter lane contains a fixed number of equidistant virtual slots that UAVs can occupy. Reinsertion of loiter UAVs into the main lane becomes essential either due to reduced traffic in the main lane or due to a loiter UAV needing to reach its destination urgently. Given the total number of loiter slots, UAV speed limits, and the minimum safety distance, a guidance algorithm is developed to compute the required speed of a loiter UAV in the transit lane to ensure safe reinsertion. The proposed guidance and automation strategies are validated through numerical simulations.

2605.13751 2026-05-14 cs.RO cs.SE cs.SY eess.SY

Learning Responsibility-Attributed Adversarial Scenarios for Testing Autonomous Vehicles

Yizhuo Xiao, Haotian Yan, Ying Wang, Zhongpan Zhu, Yuxin Zhang, Xintao Yan, Mustafa Suphi Erden, Cheng Wang

AI总结 该研究旨在为自动驾驶系统(ADS)建立可信的安全保障,通过区分系统缺陷与不可避免的交通冲突,生成具有责任归属的对抗场景。提出的方法CARS结合上下文感知的对抗体选择与闭环模拟优化的生成对抗策略,能够生成物理可行且责任可追溯的碰撞场景。该框架在多国交通环境下表现出色,能够有效发现符合法规要求的高责任归属碰撞场景,为自动驾驶系统的可解释性验证提供了新的方向。

详情
英文摘要

Establishing trustworthy safety assurance for autonomous driving systems (ADSs) requires evidence that failures arise from avoidable system deficiencies rather than unavoidable traffic conflicts. Current adversarial simulation methods can efficiently expose collisions, but generally lack mechanisms to distinguish these fundamentally different failure modes. Here we present CARS (Context-Aware, Responsibility-attributed Scenario generation), a framework that integrates responsibility attribution directly into adversarial scenario generation. CARS combines context-aware adversary selection with a generative adversarial policy optimized in closed-loop simulation to construct collision scenarios that are both physically feasible and diagnostically attributable. Across benchmark datasets spanning heterogeneous national traffic environments, CARS consistently discovers feasible collision scenarios with high attribution rates under multiple regulation-prescribed careful and competent driver models. By coupling adversarial generation with normative responsibility assessment, CARS moves simulation testing beyond collision discovery toward the construction of interpretable, regulation-aligned safety evidence for scalable ADS validation.

2605.13720 2026-05-14 eess.IV

An Underwater Dehazing Network with Implicit Transmission Estimation

Sahana Ray, Sanjay Ghosh

AI总结 该论文提出了一种用于水下图像去雾的深度网络UDehaze-iT,旨在解决水下成像中因光的吸收和散射导致的视觉质量下降问题。该方法通过隐式估计场景深度,并结合比尔-朗伯定律和可学习的衰减系数,对每个颜色通道的透射率进行建模,同时采用半经典方法估计大气光,并利用残差细化模块消除去雾后的残余伪影。实验表明,该网络在UIEB和UFO-120数据集上取得了具有竞争力的性能,参数量约为0.9M。

Comments 5 pages, 2 figures

详情
英文摘要

Underwater images suffer from wavelength-dependent light absorption and scattering, which reduces visual quality. This phenomenon could limit the operational reliability of autonomous underwater vehicles, marine surveys, and offshore inspection systems. Purely classical methods often achieve suboptimal performance in real-world datasets, while purely data-driven methods lack physical interpretability. In this letter, we propose UDehaze-iT, a deep network for underwater image enhancement that estimates scene depth implicitly and derives per-channel transmission through the Beer-Lambert law with learnable attenuation coefficients. We estimate atmospheric light as a semi-classical per-channel scalar, and a zero-initialized residual refiner corrects remaining artefacts after dehazing. To effectively train our method, we apply a composite loss function consisting of five key terms: a L1 loss, a multi-scale patchwise DCT loss, a forward model reconstruction loss, and two regularization terms. With ~0.9M parameters, UDehaze-iT achieves competitive performance on UIEB and UFO-120 datasets.

2605.13713 2026-05-14 cs.CV eess.IV

Learning to Optimize Radiotherapy Plans via Fluence Maps Diffusion Model Generation and LSTM-based Optimization

Isabella Poles, Simon Arberet, Riqiang Gao, Martin Kraus, Marco D. Santambrogio, Florin C. Ghesu, Ali Kamen, Dorin Comaniciu

AI总结 本文提出了一种基于扩散模型和LSTM的端到端优化方法,用于放射治疗计划的生成。该方法通过分布匹配的扩散模型生成临床可行的射线强度图,并利用LSTM模块学习梯度更新动态,从而快速优化剂量分布。实验表明,该方法在提升计划效率、灵活性和机器可执行性方面优于现有方法。

Comments Early Accept at MICCAI 2026

详情
英文摘要

Volumetric Modulated Arc Therapy (VMAT) is a cornerstone of modern radiation therapy, enabling highly conformal tumor irradiation and healthy-tissue sparing. Yet, its planning solves inverse and nested optimization for multi-leaf collimators, monitor units and dose parameters, while enforcing their consistency to ensure mechanical deliverability. Nevertheless, this process often requires repeated re-optimization when treatment configurations change, resulting in substantial planning time per patient. To address these problems, we present a diffusion-driven Learning-to-Optimize (L2O) method for end-to-end VMAT planning. A distribution-matching distilled diffusion model learns a clinically feasible manifold of fluence maps, enabling their one-shot generation. On top of this, an LSTM-based L2O module learns gradient update dynamics to swiftly refine fluence maps toward prescribed dose objectives during inference. Experimental results on clinical and public prostate cancer cohorts demonstrate improved planning efficiency, flexibility, and machine deliverability over currently available end-to-end VMAT planners.

2605.13669 2026-05-14 eess.SY cs.RO cs.SY math.DS

Bounded-Input True Proportional Navigation for Impact-Time Control

Lohitvel Gopikannan, Shashi Ranjan Kumar, Abhinav Sinha

AI总结 本文提出了一种非线性制导策略,能够在严格满足控制输入(指令加速度)约束的前提下,拦截匀速且不机动的目标。该方法以真比例导航(TPNG)为基础,采用精确的飞行时间公式,适用于更广泛的目标运动情况,并通过滑模控制技术设计了一种考虑输入约束的制导律,实现了时间约束下的有效拦截。该策略在多种交战场景中进行了性能验证,展示了其优越性。

Comments Preprint; Accepted for presentation at the 15th Asian Control Conference, June 17th-21st, 2026, Indonesia

详情
英文摘要

This paper proposes a nonlinear guidance strategy capable of intercepting a constant-velocity, non-maneuvering target while strictly satisfying the prescribed bounds on the control input (commanded acceleration). Unlike conventional strategies that estimate time-to-go using linearization or small-angle approximations, the proposed strategy employs true proportional-navigation guidance (TPNG) as a baseline, which utilizes an exact time-to-go formulation and is applicable over a wide range of target motions. In contrast to most existing strategies, which do not incorporate control input bounds into the guidance design, the proposed approach explicitly accounts for these limits by modeling the interceptor acceleration as a dynamic variable. Based on the sliding mode control technique, an effective guidance law that achieves time-constrained interception while accounting for bounded input is then derived. The performance of the proposed strategy is evaluated for various engagement scenarios.

2605.13661 2026-05-14 eess.SP

Air-Sea Surface Modeling and Operating Link Range Evaluation for AUV-to-UAV Optical Wireless Communication Links

Ikenna Chinazaekpere Ijeh, Mohammad Ali Khalighi, Wasiu O. Popoola

AI总结 本文研究了水下自主水下机器人(AUV)与空中无人机(UAV)之间光无线通信(OWC)链路中,海面粗糙度对通信性能的影响。作者采用经典的Cox-Munk模型和ECKV模型进行分析,并推导了ECKV模型的可解析表达式,验证了其与实测数据的一致性。通过解析和蒙特卡洛方法,评估了链路的平均容量,重点关注操作范围、指向误差、接收视场角和太阳噪声水平,为实际系统设计提供了重要参考。

详情
英文摘要

Air-sea surface interactions play a critical role in underwater-to-air optical wireless communication (OWC) links, particularly in vertical autonomous underwater vehicle (AUV) to unmanned aerial vehicle (UAV) scenarios, where the stochastic nature of the sea surface introduces optical distortions that impair link reliability. This work investigates the impact of air-sea surface roughness on AUV-to-UAV OWC systems using two experimentally validated models: the classical Cox-Munk and the Elfouhaily-Chapron-Katsaros-Vandemark (ECKV). A tractable analytical representation of the ECKV model is derived and validated against measured sea-state data. Using both analytical and Monte Carlo approaches, the link ergodic capacity is evaluated with particular emphasis on operating range, pointing errors, receiver field-of-view, and solar noise level, providing practical system design insights.

2605.13580 2026-05-14 eess.SP

Joint Segment Activation and Antenna Placement for Uplink SWAN Systems

Songnan Gu, Zhenqiao Cheng, Hao Jiang, Chongjun Ouyang, Yuanwei Liu, Arumugam Nallanathan

AI总结 本文研究了多用户上行链路分割波导增强型压接天线系统(SWAN)的可实现总速率,推导了总速率的上界,并据此理论证明了存在最优的分段激活水平。基于这一结果,提出了混合分段选择与聚合(HSS/A)方案,联合优化分段激活与压接天线布置,并设计了低复杂度的贪心算法。数值结果验证了理论分析,并表明所提HSS/A方案优于传统的全分段聚合方法。

Comments 5 pages

详情
英文摘要

This article analyzes the achievable sum-rate of multiuser uplink segmented waveguide-enabled pinching-antenna systems (SWANs). To unveil system-design insights, an upper bound on the achievable sum-rate is derived, based on which the existence of an optimal segment activation level is theoretically established. Motivated by this result, hybrid segment selection and aggregation (HSS/A) schemes are proposed to jointly optimize segment activation and pinching-antenna (PA) placement. Correspondingly, low-complexity greedy algorithms are developed for the considered optimization problem. Numerical results validate the theoretical analysis and demonstrate that the proposed HSS/A schemes outperform conventional full-segment aggregation.

2605.13529 2026-05-14 eess.SY cs.SY

Decentralized Frequency-Domain Conditions for D-Stability with Application to DC Microgrids

Zelin Sun, Shanshan Jiang, Xiaoyu Peng, Xiang Zhu, Xiuqiang He, Hua Geng

AI总结 本文提出了一种用于网络化系统区域极点配置(D-稳定性)的去中心化方法。为解决现有基于LMI的方法在子系统模型保密性和通信基础设施缺失方面的限制,研究将目标区域映射到辅助左半平面,并引入正函数处理复系数动态特性,从而在无需共享模型或跨子系统通信的情况下,通过局部频域条件保证D-稳定性。该方法被应用于直流微电网,通过环路变换实现稳定性认证负担的重新分配,推导出可用于去中心化参数合成的可广播电网标准,数值实验验证了方法的有效性。

详情
英文摘要

This paper proposes a decentralized method for regional pole placement, or $\mathcal{D}$-stability, in linearized networked systems. Existing LMI-based methods are hindered by confidentiality concerns regarding proprietary subsystem models and the absence of communication infrastructures. To overcome these barriers, we map the target region $\mathcal{D}$ of pole placement to an auxiliary left-half plane and introduce positive functions to handle the resulting complex-coefficient dynamics. We prove that $\mathcal{D}$-stability is guaranteed via local frequency-domain criteria without requiring shared subsystem models or inter-subsystem communication. This method is then tailored to DC microgrids, where a loop transformation is utilized to reallocate the burden of stability certification, deriving a broadcastable grid code for decentralized parameter synthesis. Numerical examples verify the efficacy of the proposed method.

2605.13524 2026-05-14 eess.SP

Manifold-Aware Information Gain and Lower Bounds for Gaussian-Process Bandits on Riemannian Quotient Spaces

Yuriy Dorn, Changsheng Chen, Ning Xie

AI总结 本文研究了在黎曼商空间上基于高斯过程的带宽算法的最小遗憾下界,揭示了臂空间几何结构对常数因子的影响。作者提出了一个与流形体积相关的显式几何常数,并证明了与已有上界一致的遗憾指数。此外,文章还拓展了分析方法,包括改进下界证明、商空间算法的上界分析、常数的显式表达以及曲率依赖性的提取,为流形上的高斯过程优化提供了更深入的理论支持。

Comments It will be submitted to IEEE Transactions on Information Theory

详情
英文摘要

We prove a regret lower bound for Gaussian-process bandits on a smooth compact Riemannian manifold $\M$ of dimension $d$ with intrinsic Matérn-$ν$ kernel ($ν>d/2$) that exposes how the geometry of the arm space enters the constant. For any algorithm and time horizon $T$ exceeding an explicit threshold, the worst-case expected regret over the RKHS-ball $\|f\|_{\Hil_{k_ν}}\!\le\!B$ satisfies \begin{multline*} \E[R_T(f)]\;\ge\;c_*(d,ν)\,B^{d/(2ν+d)}\,σ_n^{2ν/(2ν+d)} \\ \cdot\,\vol_g(\M)^{ν/(2ν+d)}\,T^{(ν+d)/(2ν+d)}(\log T)^{ν/(2ν+d)}. \end{multline*} The exponent matches the Vakili--Khezeli--Picheny upper bound \cite{vakili2021information}; the $\vol_g(\M)^{ν/(2ν+d)}$ factor is, to our knowledge, the first explicit volume-dependent geometric constant in a manifold GP-bandit lower bound. We extend the analysis in five directions: (i)~a companion Assouad-style proof gives a different lower bound with a strictly smaller $T$-exponent $(2ν+3d)/(4(ν+d))$ but with a polylog factor of the form $1/(\log\log T)^{(2ν+d)/(4(ν+d))}$, sharpening the $(\log T)^{ν/(2ν+d)}$ Fano polylog of Theorem~\ref{thm:main}; (ii)~we prove a $|G|^{1/2}$ upper bound on the regret of an extrinsic-kernel GP-UCB algorithm on a quotient space $\M=\Mt/G$, plus a bracketing theorem (Theorem~\ref{thm:gauge-bracket}); the precise constant is conjectured to take the modulated form $(1+(|G|-1)h(\rinj/κ))^{1/2}$ (Conjecture~\ref{conj:gauge-modulated}), validated numerically on $\SO(3)$; (iii)~we write the leading constant $c_*(d,ν)$ out fully; (iv)~we extract a curvature dependence $1+O(K\eps_T^2)$ via Bishop--Gromov; (v)~we transfer the bound to the Bayesian regret framework via the Yang--Barron / Castillo et al.\ Bayesian-Fano transfer.

2605.13516 2026-05-14 eess.SP

Sensing-Assisted LoS/NLoS Identification in Dynamic UAV Positioning Systems

Huijuan Qiao, Lu Bai, Mingran Sun, Mengyuan Lu, Jiajing Chen, Xiang Cheng

AI总结 本文首次提出了一种用于动态无人机定位系统的感知辅助视距/非视距(LoS/NLoS)识别方法。研究构建了一个多模态感知通信融合数据集,涵盖两种典型城市场景和广泛飞行高度,并设计了一种双输入特征融合网络,有效融合RGB图像与信道冲激响应(CIR)数据,提升识别性能。实验表明,该方法在识别准确率上达到97.69%,相比传统仅使用CIR或RGB的方法有显著提升,并在少量样本下表现出强大的泛化能力,验证了其在实际应用中的有效性。

详情
英文摘要

In this paper, a sensing-assisted non-line-of-sight (NLoS) identification method for dynamic uncrewed aerial vehicle (UAV) positioning is proposed for the first time. For urban UAV-to-ground scenarios, a new multi-modal sensing-communication integrated dataset is constructed to support line-of-sight (LoS)/NLoS identification, covering two typical urban scenarios and a wide range of flight altitudes. Based on the constructed dataset, a novel dual-input feature fusion network is proposed, which addresses the challenge of heterogeneous representations between RGB images and channel impulse response (CIR) data to enable the joint extraction and fusion of sensing and communication features for LoS/NLoS identification. Simulation results show that the identification accuracy can reach up to 97.69%, while achieving an improvement of at least 3.59% compared to traditional CIR-only and RGB-only methods. Moreover, strong few-shot generalization is observed, as the proposed method stabilizes and approaches full-sample performance with fewer than 200 target samples and exceeds traditional CIR-only and RGB-only methods with fewer than 100 target samples in all cross-scenario and cross-altitude experiments. Even under Gaussian noise with a variance of 0.35 applied to RGB images, the accuracy degradation remains approximately 0.5%. By utilizing the proposed LoS/NLoS identification method, the error of trilateration positioning can be reduced by approximately 70% in a crossroad scenario, verifying the utility of the proposed method.

2605.13502 2026-05-14 eess.SP

A Multi-Modal Intelligent U2V Channel Model for 6G Sensing-Communication Integration

Shuo Wang, Zengrui Han, Lu Bai, Xiang Cheng

AI总结 本文提出了一种基于三维散射体预测的新型无人机到车辆(U2V)信道模型,用于第六代(6G)智能感知与通信融合系统。研究构建了一个高保真的混合感知-通信集成仿真数据集,并设计了3D-SPADE算法,利用激光雷达点云准确预测散射体的空间分布,有效提升了动态U2V场景下的建模精度与计算效率。实验结果表明,该方法在不同车流密度和无人机高度的宽车道场景中表现出优异的散射体检测性能,验证了其在信道建模中的有效性与可靠性。

详情
英文摘要

This paper proposes a novel UAV-to-Vehicle (U2V) channel model for sixth-generation (6G) intelligent sensing-communication integration, based on three-dimensional (3D) scatterer prediction. To explore the mapping relationship between physical environment and electromagnetic space, a new high-fidelity mixed sensing-communication integration U2V simulation dataset under wide-lane scenarios with different vehicular traffic densities (VTDs) and UAV heights is constructed. Based on the constructed dataset, a novel 3D Scatterer Prediction and Distribution Estimation (3D-SPADE) algorithm is proposed, which leverages LiDAR point clouds to accurately predict the spatial distribution of scatterers. Furthermore, the clustering of scatterers and the subsequent classification into dynamic and static types are meticulously designed for highly dynamic U2V scenarios, while reducing computational complexity and improving modeling accuracy. As LiDAR point clouds vary over time, dynamic and static clusters evolve via 3D-SPADE, enabling precise modeling of channel non-stationarity and consistency. Simulation results demonstrate that, in the wide-lane scenario with varying VTDs and UAV heights, the proposed 3D-SPADE consistently achieves high scatterer occupancy detection performance within the voxel grid. In particular, under favorable configurations, recall reaches 93.26%, and precision reaches 95.74%, highlighting the reliability of 3D-SPADE. Key channel statistical characteristics are simulated and analyzed. These characteristics from the simulation experiments are highly consistent with ray-tracing results and exhibit better agreement than with the standardized model and inconsistent model, validating the necessity of exploring the mapping relationship and the effectiveness of the proposed model.

2605.13453 2026-05-14 eess.SY cs.SY

Learning a Contracting KKL-observer with Local Optimal Guarantees

Clara Lucía Galimberti, Johan Peralez, Daniele Astolfi, Vincent Andrieu, Madiha Nadri

AI总结 本文研究了如何学习一种具有全局稳定性保证和局部最优性能的Kazantzis-Kravaris-Luenberger(KKL)观测器。作者提出了一种方法,通过设计满足特定条件的隐含动态,使观测器在局部上模拟最小能量估计器的行为,并利用深度学习近似KKL变换和隐含动态,结构上保证收缩性。实验表明,该方法在存在状态和测量噪声的非线性系统中表现出良好的估计性能。

Comments Accepted to the 23rd IFAC World Congress 2026

详情
英文摘要

The Kazantzis-Kravaris-Luenberger (KKL) observer provides a general framework for nonlinear state estimation by immersing the system dynamics into a stable linear or nonlinear latent dynamics. However, the performance of KKL observers relies heavily on the specific choice of these latent dynamics, which is often heuristic. This paper proposes a methodology to learn a KKL observer that combines global stability guarantees with local optimality. We derive a condition on the latent dynamics such that the observer locally mimics the behavior of a Minimum Energy Estimator (Mortensen observer). We then employ Deep Learning to approximate the KKL transformation and the latent dynamics, using neural network architectures that structurally enforce the contraction property. The proposed strategy is validated through numerical simulations on nonlinear benchmarks, demonstrating a good performance in the presence of state and measurement noise.

2605.13394 2026-05-14 eess.SP

Decoupled Azimuth Elevation AoA Estimation Exploiting Kronecker Separable Steering Matrices

Faizan A. Khattak, Ian K. Proudler, Stephan Weiss, Fazal-E Asim

AI总结 本文研究了利用克罗内克可分导向矩阵结构进行二维到达角(AoA)估计的问题,提出了一种经济高效的子空间解耦框架。该方法通过从空间协方差矩阵中提取联合信号子空间,并利用低复杂度解耦方案恢复方位和仰角导向矩阵的列空间,从而将二维估计问题分解为两个独立的一维问题,再通过二维谱函数进行配对。仿真结果表明,该方法在中大型阵列中具有更高的估计精度,且所需快照数更少,提升了频谱效率。

详情
英文摘要

Uniform rectangular arrays (URA), structured non-uniform rectangular arrays (NURA), and parallelogram shaped (UPgA and NUPgA) arrays admit steering vectors that can be expressed as the Kronecker product of azimuth and elevation steering vectors. Accordingly, the full steering matrix can be represented as the Khatri Rao product of the corresponding azimuth and elevation steering matrices. This paper exploits this structure to develop an economical subspace decoupling framework for two dimensional angle of arrival (AoA) estimation. The proposed method first extracts the joint signal subspace from the spatial covariance matrix. Then it applies a low complexity decoupling scheme to recover the column spaces of the azimuth and elevation steering matrices. With the estimated decoupled subspaces, conventional one dimensional algorithms such as MUSIC, root MUSIC, and ESPRIT can be applied independently along each dimension, followed by pairing through a two dimensional spectral function. Monte Carlo simulations show that the proposed approach achieves higher accuracy than state of the art methods, i.e., two dimensional MUSIC, reduced-dimension MUSIC, and two-dimensional ESPRIT, for medium- and large scale arrays while requiring fewer snapshots, consequently with improved spectral efficiency.

2605.13390 2026-05-14 eess.SY cs.SY

Sensitivity Quantification for Distribution System State Estimation

Betül Mamudi, Jochen Stiasny, Jochen Cremer

AI总结 本文研究了基于加权最小二乘(WLS)的配电网状态估计(DSSE)中,不确定性边界对伪测量分布假设的敏感性问题,并提出利用费舍尔信息矩阵(FIM)进行量化分析。通过构建一种诊断框架,对比真实克拉默-拉奥界(CRB)与WLS假设下的CRB,实验表明,重尾和偏态分布会导致WLS高估不确定性,且不同母线和运行场景下的误差程度存在差异,揭示了基于方差的不确定性诊断方法在均值偏移问题上的局限性。研究结果强调了伪测量分布选择对置信限的影响,为改进不确定性感知的DSSE方法提供了理论依据。

Comments Submitted for peer review

详情
英文摘要

Pseudo-measurements are the dominant source of uncertainty in distribution system state estimation (DSSE), yet their distributional assumptions are treated as fixed inputs by existing uncertainty quantification methods. This paper investigates whether the uncertainty bounds assumed by weighted least squares (WLS)-based DSSE are sensitive to these distributional assumptions, and whether this sensitivity is quantifiable using the Fisher Information Matrix (FIM). We propose a diagnostic framework that compares the true Cramér-Rao Bound (CRB) against the WLS-assumed CRB via a per-bus, per-scenario ratio, computed directly from the converged WLS solution. Pseudo-measurement distributions are varied across five types in 22 variants matched at equal spread to isolate shape effects from variance. Experiments on the CIGRE MV network across 100 operating scenarios yield three findings. First, heavy-tailed and skewed distributions show consistently that WLS systematically overstates its uncertainty bounds. Second, the degree of miscalibration varies across buses and operating scenarios, confirming that distributional sensitivity is not uniform. Third, the CRB ratio is structurally blind to mean-shift bias, exposing a fundamental limitation of variance-based uncertainty diagnostics. Together, these results confirm the hypothesis and show that the choice of pseudo-measurement distribution directly distorts the confidence limits under WLS-based assumptions, which must be explicitly accounted for in any uncertainty-aware DSSE method.

2605.13355 2026-05-14 eess.SY cs.SY

Impedance-Based VSC Unit Commitment with STATCOM Support under High IBG Penetration

Aoun Abbas, Zhongda Chu, Charalambos Konstantinou

AI总结 随着大量同步机组被基于逆变器的发电(IBG)替代,电压和频率稳定性面临严峻挑战。本文提出一种基于阻抗的联合优化单元组合(UC)方法,结合合成惯性(SI)调度和IBG节点的SOC电压稳定边界,并将STATCOM作为无功功率决策变量纳入混合整数二阶锥规划(MISOCP)框架。研究通过改进的IEEE 30节点系统评估了三种调度策略,结果表明该方法在提高电压安全性、保证频率最低值合规性以及降低运行成本方面表现优异,同时STATCOM的引入进一步提升了高IBG渗透下的调度可行性。

Comments Electric Power Systems Research

详情
英文摘要

The large-scale replacement of synchronous machines with inverter-based generation (IBG) introduces critical challenges to both voltage and frequency stability. This work builds on a mixed-integer second-order cone programming (MISOCP) framework that co-optimizes unit commitment (UC) model which embeds frequency-nadir constraints through synthetic inertia (SI) dispatch and an SOC voltage stability boundary for IBG buses. The formulation extends by modeling a STATCOM as a reactive-power decision variable in the same MISOCP model. A modified IEEE 30-bus system is used to assess three scheduling strategies: (i) baseline UC with SI only, (ii) voltage-stability-constrained (VSC) UC with SI, and (iii) the joint UC with SI and reactive power support from IBGs. The impact of incorporating a 30~MVAr STATCOM at a weak grid location near the IBG buses is investigated. Simulation results show that the proposed framework enhances voltage security, maintains frequency-nadir compliance, and reduces operating cost, while STATCOM integration further improves dispatch feasibility under high IBG.

2605.12244 2026-05-14 eess.SY cs.SY math.RA

Estimation Problems and the Modulating Function Method: The Algebra of Modulating Functions

Davi G. Accioli, Jerome Jouffroy

AI总结 本文研究了控制系统中状态估计、参数估计和故障检测等关键问题,并提出了一种统一的调制函数方法。该方法通过具有特定边界导数性质的调制函数,实现了对集中参数系统、分布参数系统及分数阶系统的估计。文章系统分析了调制函数的代数性质,提出了构造新调制函数的算法,并利用其向量空间和代数结构构建正交调制函数,应用于船舶横摇动力学参数估计,有效避免了矩阵求逆问题。

Comments 13 pages, 6 figures

详情
英文摘要

State and parameter estimation, along with fault detection, are three crucial estimation problems within the control systems community. Although different approaches have been proposed for each type of problem, the modulating function method proposes a more unified approach to all three problem classes, being used for state and parameter estimation of lumped systems, fault detection, and estimation of distributed and fractional systems. At the core of the method is the modulating function: a function that evaluates to 0 at the left or right boundaries up to a certain order of derivatives. By selecting the modulating functions, one directly determines the filter characteristics, and, for that reason, different function families have been proposed over the years. Nevertheless, many families of modulating functions are given in a rather similar mathematical structure. In light of these structures, this paper formally discusses the algebraic properties of modulating functions, and, after formalizing the closedness and group properties of modulating functions, a simple algorithm to construct new modulating functions is proposed, discussed, and illustrated with the construction of the newly introduced logarithmic modulating function families and 3 non-analytic modulating function families. Moreover, the fact that total modulating functions form a vector space and an algebra is exploited to construct orthonormal modulating functions, which are then used for the parameter estimation of a boat's roll dynamics, effectively avoiding matrix inversion issues.

2605.07653 2026-05-14 cs.CV eess.IV

Aquatic Neuromorphic Optical Flow

Pei Zhang, Yunkai Liang, Kaiqiang Wang

AI总结 本文研究了水下环境中基于神经形态视觉的光流估计问题,提出了一种基于脉冲神经网络的自监督框架,能够从异步事件流中高效估计逐像素光流,有效克服了水下数据稀缺的瓶颈。该方法在保证视觉和定量性能的同时,显著提升了计算效率,为资源受限的水下边缘平台提供了轻量、实时且低成本的感知解决方案。

Comments This work is under review. Project page: https://github.com/pz-even/event_underwater_optical_flow

详情
英文摘要

Underwater environments impose severe constraints on conventional imaging systems and demand solutions that balance high-quality sensing with strict resource efficiency. While emerging event cameras offer a promising alternative, their potential in aquatic scenarios remains largely unexplored. Through the lens of neuromorphic vision, this work pioneers the investigation of motion fields that serve as key media for agile underwater perception. Built upon spiking neural networks, we introduce a self-supervised framework to estimate per-pixel optical flow from asynchronous event streams, elegantly bypassing the long-standing bottleneck of underwater data scarcity. Extensive evaluations demonstrate that our method achieves competitive visual and quantitative results against leading techniques while operating with superior computational efficiency. By bridging neuromorphic sensing and aquatic intelligence, this work opens new frontiers for lightweight, real-time, and low-cost perception on resource-constrained underwater edge platforms.

2604.24610 2026-05-14 eess.SP

Matching-free Acquisition of Channels with Anisotropic Wavefronts

Heling Zhang, Shidong Zhou

AI总结 未来无线通信对数据速率的高需求促使了超大孔径天线阵列(ELAA)的应用,而准确获取信道状态信息对于实现有效的预编码至关重要。现有信道估计方法多基于各向同性散射场景下的球面波前模型(SWC),难以应对实际中由曲面反射引起的各向异性波前(AWC)问题。本文提出了一种无需字典匹配的各向异性波前信道估计算法(MACAW),通过基于快速傅里叶变换的频率分析直接恢复信道参数,在保证估计精度的同时显著降低了计算复杂度。仿真结果验证了该方法在各向异性信道下的有效性。

详情
英文摘要

The escalating data rate demands of future wireless communications necessitate the deployment of extremely large aperture arrays (ELAAs) in communication systems. Acquiring accurate channel state information is crucial to execute effective precoding for such systems, in which the near-field curvature effects on the channel must be considered. Current channel estimation algorithms are generally restricted to the spherical wavefront channel (SWC), which is appropriate for isotropic scatterers, point sources, and planar reflecting surfaces. However, in practical scenarios involving curved reflecting surfaces, the reflected waves exhibit anisotropic rather than spherical wavefronts, significantly degrading the accuracy of conventional SWC-based algorithms. To tackle this challenge, we first derive a parameterized model for the anisotropic wavefront channel (AWC). Based on this model, we then propose the matching-free acquisition of channels with anisotropic wavefronts (MACAW) algorithm. Unlike conventional dictionary-based matching pursuit techniques, MACAW recovers channel parameters through fast-Fourier-transform-based frequency analysis. This approach enables precise channel estimation in AWC scenarios while maintaining a significantly lower computational complexity than existing methods. Simulation results illustrate how physical characteristics of the propagation environment influence the degree of wavefront anisotropy, and demonstrate the effectiveness of the proposed algorithm.

2604.18263 2026-05-14 eess.SP

Passive RIS Is Not Silent: Revisiting Performance Limits Under Thermal Noise

Farjam Karim, Deepak Kumar, Prathapasinghe Dharmawansa, Nurul Huda Mahmood, Arthur Sousa de Sena, Matti-Latva-aho

AI总结 本文重新审视了被动可重构智能表面(RIS)在无线通信中的性能限制,指出尽管被动RIS通常被认为几乎没有噪声,但其元件仍会产生热噪声,从而显著影响系统性能。研究提出了一种可分析的框架,将RIS热噪声纳入系统模型,并推导了包括中断概率和吞吐量在内的关键性能指标的闭式表达式。仿真结果验证了该分析的有效性,并突显了忽略RIS热噪声可能导致的性能偏差,为6G通信系统设计提供了重要参考。

详情
英文摘要

Reconfigurable intelligent surfaces (RISs) have emerged as a promising solution for enabling energy-efficient and flexible spectrum usage in wireless communication, particularly in the context of sixth-generation (6G) networks. While passive RIS architectures are widely regarded as virtually noiseless due to the lack of active components, this idealized assumption can lead to misleading performance evaluations. In this paper, we revisit this assumption and demonstrate that the thermal noise generated by passive RIS elements, though often neglected, can significantly affect system performance. We propose a tractable approximated analytical framework that incorporates RIS-induced thermal noise into the system and derive closed-form expressions for key performance metrics, such as outage probability and throughput. Simulation results validate our approximated analysis and highlight the substantial performance discrepancies that arise when RIS thermal noise is ignored. Our results offer valuable insights into the trade-offs between receiver and RIS noise, guiding the development of robust and efficient 6G communication systems.

2604.14842 2026-05-14 eess.SY cs.SY math.OC

Simplification Ad Absurdum? Revisiting Gas Flow Modeling for Integrated Energy System Planning

Thomas Klatzer, Yannick Werner, Sonja Wogrin

AI总结 本文研究了简化天然气流动模型对综合能源系统规划的影响,指出在综合电力-氢能扩展规划中,简化压力-流量关系和气体动力学可能导致在更真实的动态模型下产生显著的规划后悔,表现为系统扩展和运行不优以及氢能供应不足。数值实验表明,基于高度简化的输送和管存模型进行规划可能带来数千百分比的后悔,并且缺乏对不同需求水平的鲁棒性;尽管稳态规划可部分缓解此问题,但仍未能充分挖掘动态规划中管存灵活性带来的成本优化潜力。开发高效的动态模型求解算法是未来研究的重要方向。

详情
英文摘要

This paper analyzes the implications of simplified pipeline gas flow models for integrated energy system planning. A case study of an integrated power-hydrogen expansion planning problem shows that simplifying pressure-flow relationships and gas dynamics can lead to expansion plans that incur substantial regret when evaluated under a more realistic dynamic gas flow model -- due to suboptimal system expansion, operation, and non-supplied hydrogen. Numerical experiments show that planning under the highly simplified transport and transport-linepack models -- commonly used in expansion studies -- can result in regret exceeding several thousand percent and yield expansion plans that lack robustness across demand levels. Planning under steady-state conditions partially mitigates these effects, but still leaves significant cost-reduction potential untapped compared to dynamic planning due to neglected linepack flexibility. Developing efficient solution algorithms for the dynamic model is a promising direction for future research.

2604.13505 2026-05-14 eess.SY cs.SY

Cascaded TD3-PID Hybrid Controller for Quadrotor Trajectory Tracking in Wind Disturbance Environments

Yukang Zhang, Shuqi Chai, Yuhang Zhang, Danlan Huang, Quanbo Ge

AI总结 本文提出了一种用于四旋翼飞行器轨迹跟踪的级联混合控制框架,旨在应对非线性动力学和外部干扰的影响。该方法结合了传统的PID控制器用于高度和姿态控制,并引入改进的TD3算法增强水平位置控制,以应对强干扰环境。此外,通过嵌入混合扰动观测器进一步提升了系统的抗干扰能力,实验表明该方法在风扰环境下相比基准方法具有更高的轨迹跟踪精度和鲁棒性。

详情
英文摘要

This work presents a cascaded hybrid control framework for quadrotor trajectory tracking under nonlinear dynamics and external disturbances. In quadrotor systems, the altitude and attitude channels exhibit fast, structured dynamics that are well suited to reliable regulation, whereas horizontal-position control is more strongly affected by coupling effects, uncertainty, and disturbances, so that neither pure feedback control nor purely learning-based control alone is equally well suited to all channels. Accordingly, the proposed framework augments conventional proportional-integral-derivative (PID) stabilization for altitude and attitude control with an enhanced Twin Delayed Deep Deterministic Policy Gradient (TD3) agent incorporating a multi-Q-network structure, thereby improving horizontal-position control under severe disturbances. To further strengthen disturbance rejection in altitude and attitude control, a hybrid disturbance observer (HDOB) using low-pass and exponential moving average filtering is embedded in the control loops. The proposed TD3 enhancements are verified through ablation studies, and both numerical simulations and real-world flight tests on the quadrotor platform demonstrate that the proposed method achieves more accurate and robust trajectory tracking under wind disturbances than baseline approaches.

2603.23777 2026-05-14 cs.RO cs.AI cs.SY eess.SY

Human-in-the-Loop Pareto Optimization: Trade-off Characterization for Assist-as-Needed Training and Performance Evaluation

Harun Tolasa, Volkan Patoglu

AI总结 在人类运动技能训练和康复过程中,任务难度与用户表现之间存在内在权衡关系,准确刻画这一权衡对评估表现、设计按需辅助(AAN)方案至关重要。本文提出了一种基于人机闭环的帕累托优化方法,结合定量性能指标和定性挑战度指标,系统高效地刻画任务表现与感知挑战水平之间的权衡关系。通过用户实验和三个应用场景验证,该方法不仅可用于设计和评估AAN训练方案,还能在不同辅助水平下公平评估个体训练进展和用户间表现差异。

Comments Under review for publication in IEEE Transactions on Haptics

详情
英文摘要

During human motor skill training and physical rehabilitation, there is an inherent trade-off between task difficulty and user performance. Characterizing this trade-off is crucial for evaluating user performance, designing assist-as-needed (AAN) protocols, and assessing the efficacy of training protocols. In this study, we propose a novel human-in-the-loop (HiL) Pareto optimization approach to characterize the trade-off between task performance and the perceived challenge level of motor learning or rehabilitation tasks. We adapt Bayesian multi-criteria optimization to systematically and efficiently perform HiL Pareto characterizations. Our HiL optimization employs a hybrid model that measures performance with a quantitative metric, while the perceived challenge level is captured with a qualitative metric. We demonstrate the feasibility of the proposed HiL Pareto characterization through a user study. Furthermore, we present the utility of the framework through three use cases in the context of a manual skill training task with haptic feedback. First, we demonstrate how the characterized trade-off can be used to design a sample AAN training protocol for a motor learning task and to evaluate the group-level efficacy of the proposed AAN protocol relative to a baseline adaptive assistance protocol. Second, we demonstrate that individual-level comparisons of the trade-offs characterized before and after the training session enable fair evaluation of training progress under different assistance levels. This evaluation method is more general than standard performance evaluations, as it can provide insights even when users cannot perform the task without assistance. Third, we show that the characterized trade-offs also enable fair performance comparisons among different users, as they capture the best possible performance of each user under all feasible assistance levels.

2603.22267 2026-05-14 cs.CL cs.AI eess.AS

TiCo: Time-Controllable Spoken Dialogue Model

Kai-Wei Chang, Wei-Chih Chen, En-Pei Hu, Hung-yi Lee, James Glass

AI总结 本文提出 TiCo,一种可控制时间的语音对话模型,能够根据时间约束指令(如“生成约15秒的回应”)生成时长可控的语音响应。为解决现有模型缺乏时间感知能力的问题,研究引入了 TiCo-Bench 作为首个评估时间可控性的基准,并通过语音时间标记(STM)帮助模型在生成过程中估计已用时间并调整内容以满足目标时长。实验表明,TiCo 在不依赖问答对数据的情况下,通过自生成和可验证奖励的强化学习进行高效微调,显著提升了时长控制精度,同时保持了响应质量。

详情
英文摘要

We introduce TiCo, a time-controllable spoken dialogue model (SDM) that follows time-constrained instructions (e.g., "Please generate a response lasting about 15 seconds") and generates spoken responses with controllable duration. This capability is valuable for real-world spoken language systems such as voice assistants and interactive agents, where controlling response duration can improve interaction quality. However, despite their strong ability to generate natural spoken responses, existing models lack time awareness and struggle to follow duration-related instructions. To systematically evaluate this, we introduce TiCo-Bench, the first benchmark for time-controllable instruction following in SDMs, on which existing open-source and commercial models frequently fail to satisfy explicit time constraints. TiCo addresses this limitation by enabling an SDM to estimate elapsed speaking time during generation through Spoken Time Markers (STM) (e.g., <10.6 seconds>). These markers help the model maintain awareness of time and adjust the remaining content to meet the target duration. TiCo is post-trained efficiently without question-answer paired data, relying on self-generation and reinforcement learning with verifiable reward. Experimental results show that TiCo reduces duration error by 2.7x over its backbone and 1.6x over the strongest baseline, while preserving response quality.

2603.20146 2026-05-14 eess.SY cs.SY

A Controller Synthesis Framework for Weakly-Hard Control Systems

Marc Seidel, Martina Maggio, Frank Allgöwer

AI总结 本文提出了一种针对弱硬实时控制系统的设计框架,旨在解决控制器在偶尔错过截止期限时仍能保持系统稳定与性能的问题。该框架在控制器合成过程中显式整合弱硬约束,支持多种超期处理策略,并保证在弱硬条件下的系统稳定性与性能。实验在代表性的Furuta摆系统上验证了所提出方法的有效性,结果表明,考虑约束的控制器显著优于传统设计。

Comments accepted for publication at RTAS 2026

详情
Journal ref
in Proc. 32nd IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS), 2026, pp. 41-54
英文摘要

Deadline misses are more common in real-world systems than one may expect. The weakly-hard task model has become a standard abstraction to describe and analyze how often these misses occur, and has been especially used in control applications. Most existing control approaches check whether a controller manages to stabilize the system it controls when its implementation occasionally misses deadlines. However, they usually do not incorporate deadline-overrun knowledge during the controller synthesis process. In this paper, we present a framework that explicitly integrates weakly-hard constraints into the control design. Our method supports various overrun handling strategies and guarantees stability and performance under weakly-hard constraints. We validate the synthesized controllers on a Furuta pendulum, a representative control benchmark. The results show that constraint-aware controllers significantly outperform traditional designs, demonstrating the benefits of proactive and informed synthesis for overrun-aware real-time control.

2603.02245 2026-05-14 eess.AS cs.LG cs.SD

LMU-Based Sequential Learning and Posterior Ensemble Fusion for Cross-Domain Infant Cry Classification

Niloofar Jazaeri, Hilmi R. Dajani, Marco Janeczek, Martin Bouchard

AI总结 本文研究了跨领域婴儿哭声分类问题,针对信号非平稳、标注有限及领域差异大的挑战,提出了一种融合MFCC、STFT和基频特征的紧凑声学框架,并采用增强的Legendre记忆单元(LMU)建模时序动态。通过引入校准的后验集成融合方法,有效提升了模型在不同数据集上的泛化能力,实验表明该方法在跨域评估中取得了更好的宏F1分数,并具备实时部署的可行性。

Comments 7 pages, to appear in Proc. Int. Conf. IEEE Engineering in Medicine and Biology Society (EMBC 2026), Toronto, Canada, July 26-30 2026

详情
英文摘要

Decoding infant cry causes remains challenging for healthcare monitoring due to short nonstationary signals, limited annotations, and strong domain shifts across infants and datasets. We propose a compact acoustic framework that fuses mel-frequency cepstral coefficients (MFCCs), short-time Fourier transform (STFT) features, and fundamental-frequency (F0) contours within a multi-branch convolutional neural network (CNN) encoder, and models temporal dynamics using an enhanced Legendre Memory Unit (LMU). Compared to LSTMs, the LMU backbone provides stable sequence modeling with substantially fewer recurrent parameters, supporting efficient deployment. To improve cross-dataset generalization, we introduce calibrated posterior ensemble fusion with entropy-gated weighting to preserve domain-specific expertise while mitigating dataset bias. Experiments on Baby2020 and Baby Crying demonstrate improved macro-F1 under cross-domain evaluation, along with leakage aware splits and real-time feasibility for on-device monitoring.

2602.16253 2026-05-14 eess.AS cs.SD

How Much Does Machine Identity Matter in Anomalous Sound Detection at Test Time?

Kevin Wilkinghoff, Keisuke Imoto, Zheng-Hua Tan

AI总结 本文研究了在测试阶段缺乏机器身份信息时,对异常声音检测(ASD)性能的影响。作者提出了一种修改后的评估方法,将多台机器的测试录音合并处理,不依赖机器身份进行推理,仅在事后评估中使用身份标签。实验表明,这种方法揭示了传统评估下隐藏的性能下降和方法鲁棒性差异,并发现这些下降与模型隐含的机器识别准确性密切相关。

详情
英文摘要

Anomalous sound detection (ASD) benchmarks typically assume that the identity of the monitored machine is known at test time and that recordings are evaluated in a machine-wise manner. However, in realistic monitoring scenarios with multiple known machines operating concurrently, test recordings may not be reliably attributable to a specific machine, and requiring machine identity imposes deployment constraints such as dedicated sensors per machine. To reveal performance degradations and method-specific differences in robustness that are hidden under standard machine-wise evaluation, we consider a minimal modification of the ASD evaluation protocol in which test recordings from multiple machines are merged and evaluated jointly without access to machine identity at inference time. Training data and evaluation metrics remain unchanged, and machine identity labels are used only for post hoc evaluation. Experiments with representative ASD methods show that relaxing this assumption reveals performance degradations and method-specific differences in robustness that are hidden under standard machine-wise evaluation, and that these degradations are strongly related to implicit machine identification accuracy.

2602.07029 2026-05-14 eess.IV cs.CV

Guidestar-Free Adaptive Optics with Asymmetric Apertures

Weiyun Jiang, Haiyun Guo, Christopher A. Metzler, Ashok Veeraraghavan

AI总结 本文提出了一种无需引导星或波前传感器的闭环自适应光学系统,能够实时校正光学像差。该方法基于非对称孔径和机器学习,结合波前感知、点扩散函数估计与光学校正,实现了高效、低计算量的波前校正。实验表明,该方法在复杂自然场景中表现优于现有无引导星波前调控技术,测量次数和计算量分别减少了十倍和千倍。

Comments Accepted to ACM Transactions on Graphics (TOG)

详情
英文摘要

This work introduces the first closed-loop adaptive optics (AO) system capable of optically correcting aberrations in real-time without a guidestar or a wavefront sensor. Nearly 40 years ago, Cederquist et al. demonstrated that asymmetric apertures enable phase retrieval (PR) algorithms to perform fully computational wavefront sensing, albeit at a high computational cost. More recently, Chimitt et al. extended this approach with machine learning and demonstrated real-time wavefront sensing using only a single (guidestar-based) point-spread-function (PSF) measurement. Inspired by these works, we introduce a guidestar-free AO framework built around asymmetric apertures and machine learning. Our approach combines three key elements: (1) an asymmetric aperture placed at the system's pupil plane that enables PR-based wavefront sensing, (2) a pair of machine learning algorithms that estimate the PSF from natural scene measurements and reconstruct phase aberrations, and (3) a spatial light modulator that performs optical correction. We experimentally validate this framework on dense natural scenes imaged through unknown obscurants. Our method outperforms state-of-the-art guidestar-free wavefront shaping methods, using an order of magnitude fewer measurements and three orders of magnitude less computation.

2602.01629 2026-05-14 cs.LG cs.RO cs.SY eess.SY

AdaptNC: Adaptive Nonconformity Scores for Conformal Prediction under Distribution Shift

Renukanandan Tumu, Aditya Singh, Rahul Mangharam

AI总结 本文研究了在分布偏移环境下如何提升共形预测(Conformal Prediction)的不确定性量化能力。传统共形预测依赖于数据交换性假设,但在实际机器人系统中这一假设常被违反,导致预测区域过于保守。为此,作者提出AdaptNC框架,同时在线调整非一致性得分函数参数和共形阈值,通过自适应加权和回放缓冲机制提升预测效率与稳定性。实验表明,AdaptNC在多个机器人基准任务中显著减少了预测区域体积,同时保持目标覆盖率。

详情
英文摘要

Rigorous uncertainty quantification is essential for the safe deployment of autonomous systems in unconstrained environments. Conformal Prediction (CP) provides a distribution-free framework for this task, yet its standard formulations rely on exchangeability assumptions that are violated by the distribution shifts inherent in real-world robotics. Existing online CP methods maintain target coverage by adaptively scaling the conformal threshold, but typically employ a static nonconformity score function. We show that this fixed geometry leads to highly conservative, volume-inefficient prediction regions when environments undergo structural shifts. To address this, we propose $\textbf{AdaptNC}$, a framework for the joint online adaptation of both the nonconformity score parameters and the conformal threshold. AdaptNC leverages an adaptive reweighting scheme to optimize score functions, and introduces a replay buffer mechanism to mitigate the coverage instability that occurs during score transitions. We evaluate AdaptNC on diverse robotic benchmarks involving multi-agent policy changes, environmental changes and sensor degradation. Our results demonstrate that AdaptNC significantly reduces prediction region volume compared to state-of-the-art threshold-only baselines while maintaining target coverage levels.

2512.20211 2026-05-14 cs.SD eess.AS eess.SP

Aliasing-Free Neural Audio Synthesis

Yicheng Gu, Junan Zhang, Chaoren Wang, Jerry Li, Zhizheng Wu, Lauri Juvela

AI总结 在神经音频合成中,现有模型在生成高质量音乐和人声演唱时常因非线性激活函数和上采样层引入严重的混叠伪影而表现不足。本文将可微分的抗混叠技术引入激活和上采样模块,提出Pupu-Vocoder和Pupu-Codec模型,有效提升了音频重建质量。实验表明,新模型在音乐、人声演唱和通用音频任务中优于现有系统,在语音任务上也保持了相近性能。

Comments Accepted by TASLP

详情
英文摘要

In neural audio synthesis, neural vocoders and codecs are models that reconstruct waveforms from acoustic and latent representations, which are essential to the resulting audio quality. While current models are capable of generating perceptually natural speech, they still struggle with high-fidelity music and singing voice synthesis, as severe aliasing artifacts are introduced by non-linear activation functions and upsampling layers in existing architectures. Although various anti-aliasing techniques have been proposed in digital signal processing, their integration into neural vocoders and codecs remains under-explored. This paper incorporates differentiable anti-aliasing techniques into the activation and upsampling modules to bridge this gap, and thus presents Pupu-Vocoder and Pupu-Codec. We build a test signal benchmark to evaluate the anti-aliased modules, and validate our proposed models on speech, singing voice, music, and audio. Experimental results show that Pupu-Vocoder and Pupu-Codec outperform existing systems on singing voice, music, and audio, while achieving comparable performance on speech. Demos, codes, and checkpoints are available at VocodexElysium.github.io/AliasingFreeNeuralAudioSynthesis/.