arXivDaily arXiv每日学术速递 周一至周五更新
重置
EESS电气与系统 114
2605.23682 2026-05-25 eess.SP

Tri-Domain Multiuser MIMO Precoding Optimization and Channel Estimation with Spatial-EM Reconfigurable Antenna

三域多用户MIMO预编码优化与空间-电磁可重构天线的信道估计

Yining Li, Ziwei Wan, Zhen Gao, Keke Ying, Lipeng Zhu, Rui Zhang

AI总结 本文提出了一种三域可重构多用户MIMO通信系统,整合了电磁可重构天线(EMRA)与空间可移动天线(SMA),称为空间-电磁可重构天线(SEMRA)。该系统提供了电磁、空间和数字域的自由度,用于联合信道重构,但同时也带来了信道估计(CE)和预编码优化的新挑战。文中提出了一种基于加权最小均方误差(WMMSE)的三域联合优化算法,以提升频谱效率(SE),并设计了一种低开销的移动辅助信道估计方案,通过协调天线重新定位生成更密集的虚拟阵列,实现更准确的角度离开估计和电磁域信道状态信息(eCSI)重构。仿真结果表明,所提的信道估计方案提升了eCSI估计精度,所提的SEMRA在相同试点开销下实现了更高的频谱效率。

详情
AI中文摘要

本文提出了一种三域可重构多用户多输入多输出(MIMO)通信系统,该系统将电磁(EM)可重构天线(EMRA)与空间可移动天线(SMA)相结合,称为空间-电磁可重构天线(SEMRA)。所提出的系统提供了电磁、空间和数字域自由度(DoFs)用于联合信道重构,但也引入了信道估计(CE)和预编码优化的新挑战。具体而言,对于多用户正交频分复用(OFDM)下行链路,预编码设计被表述为一个三域优化问题,涉及天线位置、电磁域辐射方向图权重和数字预编码器。我们首先开发了一种基于迫零(ZF)的基线算法来解耦空间重构的设计,然后提出了一种基于加权最小均方误差(WMMSE)的三域联合优化算法,以进一步提高频谱效率(SE)。此外,我们提出了一种低开销的移动辅助信道估计方案,其中跨导频时隙的协调天线重新定位合成更密集的虚拟阵列,从而在相同每用户导频开销下,相比EMRA基线实现更精确的离开角(AoD)估计和电磁域信道状态信息(eCSI)重构。所得的参数化表示使得无需额外导频即可在期望天线位置组装eCSI。仿真结果表明,所提出的CE方案提高了eCSI估计精度,并且在相同导频开销下,所提出的SEMRA比EMRA基线实现了更高的SE。

英文摘要

In this paper, we propose a tri-domain reconfigurable multiuser multiple-input multiple-output (MIMO) communication system that integrates the electromagnetic (EM) reconfigurable antenna (EMRA) with the spatially movable antenna (SMA), termed the spatial-EM reconfigurable antenna (SEMRA). The proposed system offers EM, spatial, and digital domain degrees of freedom (DoFs) for joint channel reconfiguration, yet introduces new challenges in channel estimation (CE) and precoding optimization. Specifically, for multiuser orthogonal frequency division multiplexing (OFDM) downlink, the precoding design is formulated as a tri-domain optimization problem over antenna positions, EM-domain radiation-pattern weights, and digital precoders. We first develop a zero-forcing (ZF)-based baseline algorithm to decouple the design of spatial reconfiguration, and then propose a weighted minimum mean square error (WMMSE)-based tri-domain joint optimization algorithm for further improving the spectral efficiency (SE). Furthermore, we propose a low-overhead movement-aided channel estimation scheme in which coordinated antenna repositioning across pilot slots synthesizes a denser virtual array, enabling more accurate angle-of-departure (AoD) estimation and EM-domain channel state information (eCSI) reconstruction under the same per-user pilot overhead as the EMRA baseline. The resulting parametric representation enables eCSI assembly at desired antenna positions without additional pilots. Simulation results show that the proposed CE scheme improves eCSI estimation accuracy and the proposed SEMRA achieves higher SE than the EMRA baseline under the same pilot overhead.

2605.23864 2026-05-25 math.OC cs.SY eess.SY

Harnessing Individual Motivation for Collective Efficiency: A Mechanism-Driven Distributed Optimization Method

利用个体动机实现集体效率:一种机制驱动的分布式优化方法

Dongwei Xie, Xuhao Wang, Yujie Tang, Jie Song

AI总结 在涉及多智能体协作决策的工业场景中,由于个体信息难以集中获取以及个体利益与全局性能之间的冲突,传统的集中式决策可能不可行。本文提出了一种机制驱动的分布式优化方法,通过设计激励机制引导各参与者在以自我利益驱动的前提下进行协作。该方法针对具有耦合目标函数和耦合约束的优化问题,设计了相应的分布式算法并提供了收敛性保证,同时引入两种激励机制以确保参与者的协作意愿,形成闭环反馈系统,实验结果验证了算法和机制的有效性。

详情
AI中文摘要

在涉及多智能体集体决策的工业场景中,由于对个体局部信息的访问受限,集中式决策可能不可行,而参与者自利与全局性能之间的冲突也可能阻碍协作式分布式决策。本文提出一种机制驱动的分布式决策方法,其中采用并设计激励措施,以激励参与者以分布式方式协作,即使每个参与者的决策主要受自利驱动。针对具有耦合目标函数和耦合约束的优化问题,我们设计了一种针对此类问题定制的分布式优化算法,并为其收敛性提供了保证。此外,我们设计了两种激励机制:影子定价机制和Vickrey-Clarke-Groves机制,并证明了在这些机制下参与者愿意参与分布式协作。该机制驱动分布式算法的执行,而分布式计算的最优结果指导机制中激励的确定,两者相互关联形成闭环。最后,数值实验说明了所提算法和机制的有效性。

英文摘要

In industrial scenarios involving multi-agent collective decision-making, centralized decision-making may not be admissible due to restrictive access to individual local information, while the conflicts between participants' self-interest and global performance may also impede collaborative distributed decision-making. This paper proposes a mechanism-driven distributed decision-making method, wherein incentives are employed and designed to motivate participants to collaborate in a distributed fashion even though each participant's decision is driven primarily by self-interest. Focusing on optimization problems with coupled objective functions and coupled constraints, we design a distributed optimization algorithm tailored for this class of problems and provide guarantees for its convergence. Furthermore, we design two incentive mechanisms, the shadow pricing mechanism and the Vickrey-Clarke-Groves mechanism, and demonstrate that participants are willing to engage in distributed collaboration under these mechanisms. The mechanism drives the execution of the distributed algorithm, and the optimal result of distributed computation guides the determination of incentives in the mechanism, both of which are interrelated to form a closed loop. Finally, numerical experiments illustrate the effectiveness of the proposed algorithm and mechanisms.

2605.23859 2026-05-25 eess.AS

Natural Yet Challenging to Detect: Robust In-the-Wild TTS through EMA and Dual-Scoring Prompt Selection -- Submission for WildSpoof 2026 TTS Track

自然但难以检测:通过EMA和双评分提示选择实现鲁棒的野外TTS——WildSpoof 2026 TTS赛道提交

Renhe Sun, Jiayi Zhou, Haolin He, Yueying Feng, Jian Liu

AI总结 本文介绍了针对WildSpoof 2026 TTS赛道的提交方案F5-TTS-DPS,旨在提升使用真实场景数据生成文本到语音的鲁棒性。该模型基于F5-TTS架构,引入指数移动平均(EMA)以稳定训练过程,并结合大型语言模型和大型音频语言模型进行双评分提示选择,有效提升合成语音的质量与自然度。实验结果表明,F5-TTS-DPS在多个评估指标上表现优异,尤其在欺骗检测中展现出最强的抗检测能力,验证了其在生成自然且难以被识别的语音方面的有效性。

详情
AI中文摘要

在这份技术报告中,我们描述了针对WildSpoof挑战TTS赛道(使用野外数据的文本到语音)的提交方案。我们提出了F5-TTS-DPS,一个基于F5-TTS架构的模型。我们的方法将指数移动平均(EMA)集成到监督微调中,以稳定训练并提高泛化能力。为了增强合成保真度,我们利用大语言模型(LLMs)和大音频语言模型(LALMs)进行双评分提示选择,过滤参考音频和文本提示以确保质量,同时解决噪声数据集中的对齐问题。实验评估表明,F5-TTS-DPS在开发集上取得了强劲性能,UTMOS为3.20,说话人相似度为0.51。更重要的是,我们的模型在所有提交中,针对三种先进SASV系统取得了最佳a-DCF分数,分别为0.1582、0.5233和0.2562,表明我们的合成语音最难检测,并展现出最高程度的自然度和真实性。结合具有竞争力的WER性能,这些结果验证了我们的方法在生成具有强欺骗能力的自然语音方面的有效性。

英文摘要

In this technical report, we describe our submission for the WildSpoof Challenge TTS Track: Text-to-Speech with In-the-Wild Data. We introduce F5-TTS-DPS, a model built upon the F5-TTS architecture. Our approach integrates Exponential Moving Average (EMA) into supervised fine-tuning to stabilize training and improve generalization. To enhance synthesis fidelity, we leverage large language models (LLMs) and large audio language models (LALMs) for dual-scoring prompt selection, filtering reference audio and text prompts to ensure quality while addressing alignment issues in noisy datasets. Experimental evaluation demonstrates that F5-TTS-DPS achieves strong performance with UTMOS of 3.20 and speaker similarity of 0.51 on the development set. More importantly, our model achieves the best a-DCF scores of 0.1582, 0.5233, and 0.2562 across three advanced SASV systems among all submissions, indicating our synthesized speech is the most difficult to detect and exhibits the highest degree of naturalness and authenticity. Combined with competitive WER performance, these results validate the effectiveness of our approach in generating natural-sounding speech with strong spoofing capabilities.

2605.23851 2026-05-25 eess.SP

A Manifold-Based Framework for Coupling-Aware Surrogate Optimization of Antenna Arrays Using Characteristic Modes

基于流形的耦合感知天线阵列替代优化框架:利用特征模

Leonardo Mörlein, Dirk Manteuffel

AI总结 本文提出了一种基于流形的耦合感知代理优化框架,用于天线阵列的设计,能够在保持计算效率的同时考虑互耦效应。该方法结合了通用的特征模态基、全局模态耦合模型以及逐元的广义散射矩阵(GSMs),并在具有物理意义的流形上对天线阵列的设计变量进行优化,特别是针对互易且无损耗的GSMs使用了单位对称矩阵流形。实验表明,该框架在多波束优化中能够有效满足旁瓣和交叉极化约束,并在单核CPU上实现秒级收敛,验证结果与预测趋势一致,展示了其在耦合感知阵列合成中的实用性和可扩展性。

详情
Comments
14 pages, 16 figures
AI中文摘要

提出了一种基于替代模型的天线阵列合成框架,该框架在保持优化计算效率的同时考虑了互耦效应。该方法结合了公共特征模基、全局模态耦合模型和单元广义散射矩阵(GSM)。阵列设计变量在具有物理意义的流形上制定和优化,特别是针对互易无耗单元GSM的酉对称矩阵流形。采用分阶段惩罚策略,在多波束优化过程中逐步施加旁瓣和交叉极化约束。该框架在一个8×8左旋圆极化贴片相控阵上进行了演示,该阵列在一个主平面内具有扫描特性。比较了不同的自由度分配策略,结果表明,约束非相同单元类可以满足严格的波束方向图要求,而等单元设计则无法满足。对于所演示的案例,优化在单个CPU内核上数秒内收敛,对实现阵列的全波验证证实了预测趋势,旁瓣电平(SLL)吻合良好,交叉极化比(XPR)具有有用的精度。结果表明,所提出的公式是实现耦合感知阵列合成与实现的实用且可扩展的途径。

英文摘要

A surrogate-based synthesis framework for antenna arrays is presented that incorporates mutual coupling while keeping optimization computationally efficient. The method combines a common characteristic-mode basis, a global modal coupling model, and element-wise generalized scattering matrices (GSMs). Array design variables are formulated and optimized on physically meaningful manifolds, in particular the manifold of unitary symmetric matrices for reciprocal and lossless element GSMs. A staged penalty strategy is used to progressively enforce sidelobe and cross-polarization constraints during multi-beam optimization. The framework is demonstrated for an 8x8 left-handed circularly polarized patch phased array with scan behavior in one principal plane. Different degree-of-freedom assignment strategies are compared, showing that constrained non-identical element classes can satisfy stringent pattern requirements where equal-element designs fail. For the demonstrated case, the optimization converges within seconds on a single CPU core, and full-wave verification of the realized arrays confirms the predicted trends, with good agreement for the SLL and useful accuracy for the XPR. The results indicate that the proposed formulation is a practical and scalable route for coupling-aware array synthesis and realization.

2605.23831 2026-05-25 eess.SP

Ray-Tracing vs. 3GPP TDL: Power Delay Profile Analysis in Outdoor-to-Indoor and Indoor Channels

射线追踪与3GPP TDL:室外到室内和室内信道的功率延迟分布分析

Julia Andrusenko, Chloe Makdad

AI总结 本文对比了基于确定性射线追踪模型和3GPP TR 38.901标准中Tapped Delay Line(TDL)信道模型在室外到室内及室内场景中的功率延迟分布(PDP)。研究发现,3GPP TDL模型虽然适用于大规模系统评估,但在捕捉具体场景的多径结构方面存在不足,如无法准确反映延迟能量和不规则脉冲等细节特征。因此,对于需要精确物理层设计的场景,确定性或混合方法更具优势。

详情
Comments
6 pages, 7 figures, 6 tables, submitted to MILCOM 2026 Track 1
AI中文摘要

第三代合作伙伴计划(3GPP)技术报告(TR)38.901信道模型(版本15-19)广泛用于密集城市室外到室内(O2I)和室内环境中的物理层设计和系统级评估。这些模型捕获了集合平均信道统计,但不考虑站点特定几何结构。在本文中,我们将确定性射线追踪模型(Remcom Wireless InSite软件)得出的功率延迟分布(PDP)与3GPP TR 38.901抽头延迟线(TDL)信道模型的PDP进行比较。该比较分析是在华盛顿特区建模的密集城市O2I场景和代表性单层室内布局下,在匹配链路距离和非视距(NLOS)条件下进行的。所有Wireless InSite PDP均经过功率归一化,以便比较相对多径延迟结构。我们评估了均方根(RMS)延迟扩展、平均超额延迟、有效最大延迟和Kullback-Leibler(KL)分布散度。结果表明,3GPP TDL模型通常表现出更长的延迟扩展,并且往往无法捕获确定性的站点特定特征,如晚到达能量和不规则尖峰。虽然TDL模型在某些情况下可以近似主要信道特征,但它们依赖于集合平均统计而非几何结构,限制了它们对精细多径结构的表示。我们得出结论,虽然3GPP TDL模型适用于大规模系统评估,但确定性或混合方法更适合于站点特定的物理层设计。

英文摘要

3rd Generation Partnership Project (3GPP) Technical Report (TR) 38.901 channel models (Releases 15-19) are widely used for physical-layer design and system-level evaluation in dense urban outdoor-to-indoor (O2I) and indoor environments. These models capture ensemble-averaged channel statistics but do not account for site-specific geometry. In this paper, we compare Power Delay Profiles (PDPs) derived from a deterministic ray-tracing model (Remcom Wireless InSite software) with those from the 3GPP TR 38.901 Tapped Delay Line (TDL) channel models. This comparative analysis is performed using a dense urban O2I scenario and a representative single-story indoor layout modeled in Washington, D.C., under matched link-distance and Non-Line-of-Sight (NLOS) conditions. All Wireless InSite PDPs are power-normalized to enable comparison of relative multipath delay structure. We evaluate root-mean-square (RMS) delay spread, mean excess delay, effective maximum delay, and Kullback-Leibler (KL) distribution divergence. Results indicate that 3GPP TDL models generally exhibit longer delay spreads and often fail to capture deterministic, site-specific features such as late-arriving energy and irregular spikes. While TDL models can approximate primary channel features in some cases, their reliance on ensemble-averaged statistics rather than geometry limits their representation of fine multipath structures. We conclude that while 3GPP TDL models are suitable for large-scale system evaluation, deterministic or hybrid approaches are more appropriate for site-specific physical-layer design.

2605.23813 2026-05-25 math.OC cs.SY eess.SY

Minimum Effort Control Using Variational Methods of Analytical Mechanics A New Approach For Optimal Control

使用分析力学变分法的最小努力控制:最优控制的新方法

Ossama Abdelkhalik, Aimar Negrete

AI总结 本文提出了一种基于分析力学变分方法的新最优控制方法,通过将控制执行器视为动态系统的一部分,直接在作用量泛函中引入控制能量项,从而避免了传统最优控制中使用协态变量的需要。该方法通过最小化包含控制能量的作用量来推导控制方程和运动方程,简化了问题复杂度,并为最优控制提供了一种全新的理论框架。文中通过案例研究验证了该方法的有效性。

详情
AI中文摘要

现代最优控制理论涉及使用动态协态变量将已知的动态系统运动方程附加到目标函数上,以约束最优控制解满足运动方程。协态变量的使用增加了变量数量,从而增加了问题的复杂性。另一方面,分析力学的变分方法通过最小化动态系统的作用量泛函来推导运动方程,将控制力视为系统的外部输入。本文提出了一种计算最优控制的新颠覆性方法。该方法采用分析力学的变分方法,除了推导运动方程外,还推导了控制方程。这是通过将控制执行器视为动态系统的一部分来实现的。除了动能和势能外,这种新方法中的作用量泛函还包括代表系统控制能量的额外能量项。提出了两种不同的方法来编写修改后的作用量泛函。所提出的方法显著偏离了现代最优控制理论,并在求解控制时消除了对协态变量的需求。本文通过一个案例研究来演示新方法。

英文摘要

Modern optimal control theory involves adjoining the already known equations of motion of a dynamic system to the objective function using dynamic costates; this is done in order to constrain the optimal control solutions to satisfy the equations of motion. The use of costates increases the number of variables and hence increases the complexity of the problem. On the other hand, variational methods of analytical mechanics finds the equations of motion by minimizing an action functional of the dynamic system, realizing control forces as external input to the system. In this paper a new disruptive approach for computing the optimal control is presented. This approach adopts the variational methods of analytical mechanics to derive equations for the control, in addition to the equations of motion. This is achieved by recognizing the control actuator as part of the dynamic system. In addition to the kinetic energy and potential energy, the action functional in this new approach includes additional energy terms that represent the control energy of the system. Two different methods are presented to write the modified action functional. The proposed approach is a significant departure from the modern optimal control theory, and it eliminates the need for costates when solving for the control. In this paper, a case study is presented to demonstrate the new approach.

2605.23811 2026-05-25 eess.SP

A Machine Learning Framework for Large-Scale Static Wireless Mesh Networks

大规模静态无线Mesh网络的机器学习框架

Julia Andrusenko

AI总结 本文提出了一种用于大规模静态无线网状网络的机器学习框架,旨在为岛状复杂环境中固定站点的155个商用现成无线电节点提供系统设计方法。研究通过结合射频传播建模与约束聚类优化,实现了节点的高效分簇与连接性优化,为复杂地理环境下的静态无线网络规划提供了可扩展的解决方案。

详情
Comments
5 pages, 3 figures, submitted to MILCOM 2026 Track 5
AI中文摘要

本文提出了一种针对大规模静态无线Mesh网络的系统设计方法,该网络由155个商用现成(COTS)无线电节点组成,部署在具有挑战性的岛屿环境中的固定基础设施站点。该架构包含大约十个15节点集群,每个集群设有指定的主网关和次网关节点以支持集群间通信。开发了一种结构化的多阶段规划方法来指导网络设计。利用Remcom的Wireless InSite射线追踪平台生成特定站点的射频(RF)路径损耗预测,考虑了地形、建筑物和茂密植被的影响。为了在物理层和操作约束下优化连接性,应用了谱嵌入结合平衡k-means聚类将节点划分为大小相近的集群。链路预算分析确定了在波形和硬件约束下最大可容忍路径损耗,定义了聚类框架中使用的连接阈值。本工作将确定性射频传播建模与约束聚类优化相结合,为在复杂地理环境中规划静态无线Mesh网络提供了一个可扩展的框架。节点移动性和更高层网络协议不在本研究范围内。

英文摘要

This paper presents a system design methodology for a large-scale static wireless mesh network for 155 commercial off-the-shelf (COTS) radio nodes at fixed infrastructure sites in a challenging island environment. The architecture consists of approximately ten 15-node clusters, each with designated primary and secondary gateway nodes to support inter-cluster communication. A structured, multi-stage planning methodology was developed to guide network design. Site-specific radio frequency (RF) path loss predictions were generated using Remcom's Wireless InSite ray-tracing platform, incorporating terrain, buildings, and dense foliage effects. To optimize connectivity under physical-layer and operational constraints, spectral embedding combined with balanced k-means clustering was applied to partition the nodes into clusters of comparable size. A link budget analysis determined the maximum tolerable path loss under waveform and hardware constraints, defining the connectivity threshold used in the clustering framework. This work integrates deterministic RF propagation modeling with constrained clustering optimization to provide a scalable framework for planning static wireless mesh networks in complex geographic environments. Node mobility and higher-layer networking protocols were outside the scope of this study.

2605.23804 2026-05-25 cs.HC eess.SP

Perceptually Lossless Tactile Texture Synthesis with Compact Spectral Envelope Models

基于紧凑频谱包络模型的感知无损触觉纹理合成

Jagan K. Balasubramanian, Yasemin Vardar

AI总结 该研究提出了一种高效且感知无损的触觉纹理合成方法,通过引入两个紧凑的频谱包络模型——谱beta和谱斜率,来捕捉指尖与表面摩擦信号的时频结构并保留感知相关的信息。实验表明,这些模型在感知评价中能够达到与高保真触觉信号相当的真实感,揭示了触觉纹理感知主要依赖于基本的时频模式,为触觉压缩、传输和合成提供了高效可扩展的框架。

详情
Comments
16 pages and 8 figures
AI中文摘要

现代视听媒体依赖紧凑表示实现高效存储和传输,而真实的数字触觉仍依赖于高分辨率触觉记录。现有的触觉信号表示方法限制了操作和新内容的生成。本文引入两种紧凑表示——频谱β和频谱斜率,它们捕获手指-表面摩擦信号的时间频谱结构,同时保留感知相关信息。频谱β使用双参数β分布对频谱偏度建模,而频谱斜率通过低通和高通阶数定义的非对称带通滤波器近似频谱。我们在一个摩擦调制显示器上使用五种虚拟纹理,对14名参与者进行了感知评估,并将这些表示与物理纹理和高保真再现信号进行比较。频谱β的感知相似度评分与原始高保真再现相当。回归分析进一步表明,匹配九个关键频带上的频谱能量是感知真实感的最强预测因子。这些发现共同表明,触觉纹理感知主要依赖于基本的时间频谱模式,对这些模式建模足以实现感知真实的渲染。这些结果为触觉压缩、通信和合成纹理生成建立了一个高效且可扩展的框架。

英文摘要

Modern audio-visual media rely on compact representations for efficient storage and transmission, whereas realistic digital touch still depends on high-resolution tactile recordings. Existing approaches for representing tactile signals constrain manipulation and limit the generation of new content. Here, we introduce two compact representations, spectral beta and spectral slope, that capture the temporal spectral structure of finger-surface friction signals while preserving perceptually relevant information. Spectral beta models spectral skewness using a two-parameter beta distribution, whereas spectral slope approximates the spectrum with an asymmetric bandpass filter defined by low- and high-pass orders. We evaluated these representations in a perceptual study with 14 participants using five virtual textures rendered on a friction-modulation display and compared them with physical textures and high-fidelity reproductions of recorded signals. Spectral beta achieved perceptual similarity ratings comparable to those of the original high-fidelity reproductions. Regression analysis further showed that matching spectral energy across nine critical frequency bands was the strongest predictor of perceived realism. Together, these findings suggest that tactile texture perception depends primarily on fundamental temporal spectral patterns and that modeling these patterns is sufficient for perceptually realistic rendering. These results establish an efficient and scalable framework for haptic compression, communication, and synthetic texture generation.

2605.23795 2026-05-25 eess.SP

A Measurement-Based Parameterization of Physics Reflection Models for Terahertz Communication

基于测量的太赫兹通信物理反射模型参数化

Taihao Zhang, Chenzhou Lin, Cunhua Pan, Hong Ren, Ruyi Liu, Yongchao He, Tian Qiu, Bingchang Hua, Jiangzhou Wang

AI总结 本文针对太赫兹通信中反射系数的精确建模问题,构建了一个300~400 GHz的信道测量平台,测量多种材料的反射特性,并提出了一种基于扩展参数化洛伦兹/德鲁德模型的单层干涉反射系数模型(SLI-EPLD)。该模型采用子带建模策略描述反射系数随频率的变化,并引入参数化映射方法保证模型参数稳定性,结合加权子带拟合趋势回归(WF-TREND)算法实现高精度参数拟合,实验验证表明该模型在多种材料上优于现有模型,为高太赫兹通信的信道建模提供了重要基础。

详情
AI中文摘要

反射系数的精确建模对于新兴太赫兹(THz)通信中可靠信道模型的开发至关重要。本研究搭建了300~400 GHz信道测量平台,用于测量多种材料的反射系数。基于测量数据分析,我们提出了单层干涉扩展参数化洛伦兹/德鲁德(SLI-EPLD)反射系数模型。该模型采用子带建模策略来表征反射系数随频率的变化,同时采用参数化映射方法确保模型参数的稳定性。此外,引入了加权子带拟合趋势回归(WF-TREND)算法以实现精确的子带参数拟合。验证结果表明,该模型在多种材料上的性能优于现有模型。本文建立的反射系数模型为300~400 GHz高频太赫兹通信的信道建模提供了关键基础。

英文摘要

The accurate modeling of reflection coefficients is pivotal for developing reliable channel models in emerging terahertz (THz) communications. This study establishes a 300$\sim$400 GHz channel measurement platform to measure the reflection coefficients of various materials. Based on the analysis of measured data, we propose the single-layer interference with an extended-parameterized Lorentz/Drude (SLI-EPLD) reflection coefficient model. In this model, a sub-band modeling strategy is adopted to characterize the variation of reflection coefficients with frequency, while a parameterized mapping approach is employed to ensure the stability of model parameters. Furthermore, the weighted sub-band fitting for trend regression (WF-TREND) algorithm is introduced to achieve precise sub-band parameter fitting. Validation results demonstrate superior performance to existing models across multiple materials. The reflection coefficient model established in this work serves as a critical foundation for channel modeling in 300$\sim$400 GHz for high-THz communication.

2605.23782 2026-05-25 cs.GT cs.SY eess.SY

Routing Equilibrium in Mixed-Autonomy Traffic Networks with Altruistic Autonomous Agents

混合自主交通网络中具有利他自主代理的路由均衡

Lihui Yi, Ermin Wei

AI总结 本文研究了混合自动驾驶交通网络中的路由均衡问题,其中人类驾驶者以自我利益为目标最小化自身出行时间,而自动驾驶车辆则具有利他主义特性,旨在最小化整体社会成本。通过将问题建模为变分不等式(VI),作者在无需凸性假设的情况下证明了均衡的存在性,并在特定成本函数下保证了均衡时路网流量和社会成本的唯一性。研究还分析了自动驾驶车辆对社会成本的影响条件,并通过数值实验展示了不同系统参数下社会成本随自动驾驶车辆比例变化的趋势。

详情
AI中文摘要

车辆自主性的最新进展引起了人们对理解自主车辆对交通系统影响的兴趣。在本文中,我们研究了混合自主环境中的交通分配问题,其中人类驾驶车辆和自主车辆共存。我们将交互建模为一个同时路由博弈,其中人类驾驶员是自私的,旨在最小化自己的行驶时间,而自主代理是利他的,旨在最小化总社会成本。标准的非原子拥塞博弈分析在凸成本函数下建立了该博弈均衡的存在性,但未涉及唯一性。在这项工作中,我们将均衡表述为变分不等式(VI),这使我们能够在没有凸性假设的情况下建立均衡存在性,并保证在特定成本函数类下均衡时聚合链路流和社会成本的唯一性。利用这一VI框架,我们提供了包含自主代理改善、恶化或不影响社会成本的充分条件。虽然恶化的可能性已在先前工作中确立,但我们的结果通过明确刻画每种结果发生的充分条件,补充了现有的最坏情况界,从而提供了对混合自主交通系统的更深入理解。此外,我们考虑了一个集中式场景,其中社会规划者优化自主代理的路由,并表明在假设凸成本的情况下,实现了与分散场景相同的均衡。最后,我们进行了数值实验,展示了在不同系统参数下社会成本如何随自主车辆数量变化。

英文摘要

Recent advancements in vehicle autonomy have drawn interest in understanding the impact of autonomous vehicles on traffic systems. In this paper, we study a traffic assignment problem in a mixed-autonomy setting where both human-driven and autonomous vehicles coexist. We model the interaction as a simultaneous routing game where human drivers are self-interested and aim to minimize their own travel times, while autonomous agents are altruistic and aim to minimize the total social cost. The standard nonatomic congestion game analysis establishes the existence of equilibrium to this game under convex cost functions, and does not have any implication of its uniqueness. In this work, we formulate the equilibrium as a variational inequality (VI), which enables us to establish the equilibrium existence without convexity assumption, and guarantees the uniqueness of the aggregated link flow and social cost at equilibrium under a specific class of cost functions. Leveraging this VI framework, we provide sufficient conditions under which including autonomous agents improves, deteriorates, or has no effect on social cost. While the possibility of deterioration has been established in prior work, our results complement existing worst-case bounds by explicitly characterizing sufficient conditions under which each outcome occurs, thereby providing a deeper understanding of mixed-autonomy traffic systems. Furthermore, we consider a centralized scenario where a social planner optimizes the routing of autonomous agents, and show that the same equilibrium is achieved as in the decentralized scenario when assuming convex costs.Finally, we conduct numerical experiments that illustrate how social cost changes with the amount of autonomous vehicles under different system parameters.

2605.23779 2026-05-25 eess.SP

SIM-Aided Near-Field Channel and Localization Estimation With Dimensionality Reduction: A Multiport Network Theory Approach

SIM辅助的近场信道与定位估计及降维:一种多端口网络理论方法

Andrea Abrardo, Bartoli Giulio

AI总结 本文针对6G大规模天线阵列带来的硬件复杂度和干扰问题,提出了一种基于多端口网络理论的SIM辅助近场信道与定位估计框架,通过波域降维有效降低系统复杂度。该方法利用堆叠智能超表面(SIM)进行模拟空间滤波,结合粗略位置信息进行子空间投影,实现了对近场信道的高效估计。研究分析了SIM近似误差对信道估计和定位性能的影响,验证了该架构在保持波前曲率信息的前提下,能够以远少于全数字系统的射频链数量实现相近的定位精度。

详情
Comments
6 pages, 3 figures
AI中文摘要

6G超大规模天线阵列的部署实现了辐射近场感知,但带来了硬件复杂度和干扰方面的重大挑战。堆叠智能超表面(SIM)通过实现波域降维来解决这些限制。本文基于多端口网络理论提出了一种严格的SIM辅助近场信道与定位估计框架,该框架提供了考虑互耦和非单向层间传播效应的电磁一致表征。采用间接估计方法,通过优化SIM执行模拟空间滤波,将接收信号投影到通过粗略先验位置信息识别的相关子空间上。在此现实设置下,我们分析表征了SIM近似误差对信道估计的影响,并量化了由此产生的定位性能影响。结果表明,所提出的架构保留了精确近场定位所需的必要波前曲率信息,实现了与全数字解决方案相当的性能,同时大幅减少了射频链的数量。

英文摘要

The deployment of Extremely Large-Scale Antenna Arrays for 6G enables radiative near-field sensing but poses significant challenges in terms of hardware complexity and interference. Stacked Intelligent Metasurfaces (SIMs) address these limitations by enabling wave-domain dimensionality reduction. This paper proposes a rigorous SIM-aided framework for near-field channel and localization estimation based on Multiport Network Theory, which provides an electromagnetically consistent characterization accounting for mutual coupling and non-unilateral inter-layer propagation effects. An indirect estimation approach is adopted, where the SIM is optimized to perform analog spatial filtering by projecting the received signal onto a relevant subspace identified through coarse prior location information. Within this realistic setting, we analytically characterize the impact of SIM approximation errors on channel estimation and quantify the resulting effects on localization performance. The results show that the proposed architecture preserves the essential wavefront curvature information required for accurate near-field localization, achieving performance comparable to fully digital solutions while drastically reducing the number of radio-frequency chains.

2605.23713 2026-05-25 eess.SP

Stacked Intelligent Metasurfaces (SIM) in the Nonlinear Regime: A Multiport Network Model Approach

非线性体制下的堆叠智能超表面:一种多端口网络模型方法

Andrea Abrardo, Alberto Toccafondi

AI总结 本文提出了一种适用于多层智能超表面(SIM)的物理一致的多端口网络模型,支持线性和显式非线性终端的建模。该模型在线性情况下提供了闭式输入-输出关系,在非线性情况下采用固定点前向计算,并基于伴随方法计算梯度以支持优化。研究显示,在28 GHz近场定位案例中,非线性终端能够提升传输函数匹配性能,显著降低平均定位误差,接近理想基准。

详情
Comments
6 pages, 2 figures
AI中文摘要

我们提出了一种物理一致的多端口框架,用于具有线性和显式非线性终端的堆叠智能超表面(SIM)。该模型在线性情况下提供闭式输入-输出关系,在非线性情况下提供定点前向评估,并在两种设置下提供基于伴随的梯度用于优化。在级隔离的SIM结构下,复杂度保持为$\mathcal{O}(QK^3)$。在28 GHz近场定位案例研究中,非线性终端改善了传递函数匹配并降低了平均定位误差,接近理想基准。

英文摘要

We present a physically consistent multiport framework for stacked intelligent metasurfaces (SIMs) with linear and explicit nonlinear terminations. The model provides closed-form input--output relations in the linear case and fixed-point forward evaluation in the nonlinear case, with adjoint-based gradients for optimization in both settings. Under stage-isolated SIM structure, complexity remains $\mathcal{O}(QK^3)$. In a 28 GHz near-field localization case study, nonlinear terminations improve transfer-function matching and reduce mean localization error, close to the ideal benchmark.

2605.23708 2026-05-25 cs.LG cs.SY eess.SY nlin.AO

Learning Dynamic Stability Landscapes in Synchronization Networks

学习同步网络中的动态稳定景观

Christian Nauck, Junyou Zhu, Michael Lindner, Frank Hellmann

AI总结 本文提出了一种新的上游任务——学习同步网络中的动态稳定性景观,以更深入地理解同步行为,并从中衍生出多种标量稳定性指标。研究首次引入了图到图像的预测范式,直接从图结构学习每个节点的图像状稳定性景观,并发布了两个包含10,000个图的基准数据集。通过结合图神经网络与卷积神经网络,模型能够端到端地学习稳定性景观,实现了良好的泛化能力,为超越传统标量稳定性指标提供了新方法。

详情
Comments
22 pages, 12 figures
AI中文摘要

同步的鲁棒性通常通过标量、节点级稳定性指数来表征,这些指数对拓扑的依赖性通过网络科学或图神经网络(GNN)进行研究。我们提出了一种新颖的上游任务——学习稳定景观,它提供了对同步行为的更深入洞察,并且可以从中推导出许多此类标量指数。关键的是,我们开创了一种图到图像的预测范式:直接从图拓扑学习作为节点级目标的图像状景观,这种表述在文献中我们尚未见到。为了支持这一任务,我们发布了两个数据集,每个数据集包含10,000个图,节点数分别为20和100,并带有节点级景观标签,基于一个概念性振荡器模型,捕捉电网同步行为。GNN编码拓扑,CNN解码器渲染每个节点的图像,以端到端方式学习,具有良好的分布内准确性,并能泛化到不同图大小和实际电网拓扑。这表明,稳定景观虽然超出了传统网络科学的能力范围,但可以从拓扑中学习,并为生物学、神经科学和电网中超越标量稳定性指数开辟了新途径。

英文摘要

The robustness of synchronization is typically characterized by scalar, per-node stability indices whose dependence on topology is studied via network science or graph neural networks (GNNs). We propose a novel upstream task, learning stability landscapes, which provide deeper insights into synchronization behavior and from which many such scalar indices can be derived. Crucially, we pioneer a graph-to-image prediction paradigm: learning image-like landscapes as per-node targets directly from graph topology, a formulation we are not aware of having been established elsewhere in the literature. To support this task, we release two datasets of 10,000 graphs each at 20 and 100 nodes with per-node landscape labels, based on a conceptual oscillator model, capturing power grid synchronization behavior. A GNN encodes topology and a CNN decoder renders per-node images, learned end-to-end with good in-distribution accuracy, generalizing across graph sizes and to realistic power grid topologies. This demonstrates that stability landscapes, while beyond the reach of conventional network science, are learnable from topology and open new avenues for moving beyond scalar stability indices in biology, neuroscience, and power grids.

2605.20418 2026-05-25 physics.soc-ph cs.SI cs.SY eess.SY math.DS math.PR

A Bounded-Confidence Model of Opinion Dynamics with Adaptive Interaction Probabilities

具有自适应交互概率的有界置信意见动力学模型

Leila Thompsky, Yuexuan Yolanda Wu, Mason A. Porter, Jiajie Luo

AI总结 本文研究了一个具有自适应交互概率的有界置信意见动态模型,旨在探讨个体在社交网络中如何通过交互调整其观点。该模型基于经典的Deffuant–Weisbuch模型,但引入了异质且自适应的边权重,以反映人们更倾向于与之前有过共识或积极互动的对象交流。研究证明了该模型的收敛性、边权重的长期动态行为以及“有效图”的特性,并通过数值模拟展示了自适应边权重对不同网络结构下意见演化过程的影响。

详情
Comments
22 pages, 10 figures
AI中文摘要

意见动力学模型旨在捕捉个体在相互交互时意见的变化。一个著名的意见动力学模型是Deffuant-Weisbuch(DW)模型,它是一种有界置信模型(BCM)。在DW模型中,智能体进行成对交互,并且当彼此意见足够接近时,他们会接受对方的意见。在本文中,我们通过在具有异质和自适应边权重的网络上研究DW模型来扩展它。这些边权重控制智能体之间的交互概率,从而编码了人们更倾向于与之前妥协或有过积极互动的个体交流的想法。我们证明了我们的自适应边加权DW模型的收敛性质、其边权重的长期动力学以及模型相关的“有效图”(一个时变子图,仅包含彼此意见可接受的智能体之间的边)的理论保证。我们通过在不同网络上的自适应边加权DW模型的数值模拟来支持我们的理论结果,并发现包含自适应边权重会在不同类型的网络上产生不同的定性动力学。特别是,对于较小的置信界限,我们观察到加入自适应边权重会减少密集网络的收敛时间,但增加稀疏网络的收敛时间。

英文摘要

Models of opinion dynamics aim to capture how individuals' opinions change when they interact with each other. One well-known model of opinion dynamics is the Deffuant--Weisbuch (DW) model, which is a type of bounded-confidence model (BCM). In the DW model, agents have pairwise interactions, and they are receptive to other agents' opinions when their opinions are sufficiently close to each other. In this paper, we extend the DW model by studying it on networks with heterogeneous and adaptive edge weights between pairs of agents. These edge weights govern the interaction probabilities between the agents and thereby encode the idea that people are more likely to communicate with individuals with whom they have previously compromised or had other positive interactions. We prove theoretical guarantees of our adaptive edge-weighted DW model's convergence properties, the long-time dynamics of its edge weights, and the model's associated ``effective graph", which is a time-dependent subgraph that includes edges only between agents that are receptive to each other's opinions. We support our theoretical results with numerical simulations of our adaptive edge-weighted DW model on a variety of networks and find that including adaptive edge weights yields different qualitative dynamics for different types of networks. In particular, for small confidence bounds, we observe that incorporating adaptive edge weights decreases the convergence time for dense networks but increases the convergence time for sparse networks.

2603.24489 2026-05-25 math.OC cs.SY eess.SY

Model Predictive Path Integral Control as Preconditioned Gradient Descent

模型预测路径积分控制作为预条件梯度下降

Mahyar Fazlyab, Sina Sharifi, Jiarui Wang

AI总结 本文研究了模型预测路径积分(MPPI)控制的收敛性质,并通过变分优化方法提供了直接的收敛性分析。通过将受约束的轨迹优化问题转化为决策分布上的KL散度正则化问题,作者推导出一个参数化采样族上的简化自由能目标函数,并分析了其梯度和Hessian形式。研究还表明,在固定协方差的高斯情况下,经典的MPPI更新可以精确地视为预条件梯度下降的一步更新,并给出了下降性和平稳性保证,以及影响MPPI收敛的关键超参数的数值实验结果。

详情
AI中文摘要

模型预测路径积分(MPPI)控制是一种广泛使用的基于采样的轨迹优化方法,但其收敛性质仅被部分理解。本文利用变分优化提供了直接的收敛性分析。通过将带约束的轨迹优化提升为决策分布上的Kullback-Leibler(KL)正则化问题,我们推导了定义在参数化采样族上的约化自由能目标。对于一般参数族,我们推导了该约化目标的梯度和Hessian表示,并分析了采样分布参数上的预条件梯度下降。在固定协方差高斯情况下,经典MPPI更新被精确恢复为单位步长预条件梯度更新。当约化目标的Hessian在预条件子诱导的度量下有界时,我们证明了基于精确期望的迭代的下降和驻点保证。对于高斯族,我们进一步表明预条件Hessian由Gibbs倾斜分布的协方差相对于采样分布的协方差控制,从而给出了精确单位步长MPPI下降的协方差相关充分条件。数值实验说明了理论及关键超参数的影响。

英文摘要

Model Predictive Path Integral (MPPI) control is a widely used sampling-based method for trajectory optimization, yet its convergence properties remain only partially understood. This paper provides a direct convergence analysis using variational optimization. By lifting constrained trajectory optimization to a Kullback-Leibler (KL) regularized problem over decision distributions, we derive a reduced free-energy objective defined over a parametric sampling family. For general parametric families, we derive gradient and Hessian representations of this reduced objective and analyze preconditioned gradient descent on the sampling-distribution parameters. In the fixed-covariance Gaussian case, the classical MPPI update is recovered exactly as a unit-step preconditioned gradient update. We prove descent and stationarity guarantees for the exact expectation-based iteration when the Hessian of the reduced objective is bounded in the metric induced by the preconditioner. For the Gaussian family, we further show that the preconditioned Hessian is governed by the covariance of the Gibbs-tilted distribution relative to the covariance of the sampling distribution, yielding a covariance-dependent sufficient condition for the descent of exact unit-step MPPI. Numerical experiments illustrate the theory and the effect of key hyperparameters.

2602.14671 2026-05-25 eess.AS

Data Augmentation for Pathological Speech Enhancement

病理语音增强的数据增强

Mingchi Hou, Enno Hermann, Ina Kodrasi

AI总结 本文研究了如何通过数据增强技术提升对帕金森病患者等病理语音的语音增强性能。作者系统评估了三种数据增强方法,发现噪声增强效果最显著,而生成式增强在数据量增加时可能适得其反。研究还表明,数据增强对预测型语音增强模型更为有效,但病理语音与正常语音之间仍存在性能差距,凸显了针对病理语音设计专用增强策略的必要性。

详情
Comments
Accepted at EUSIPCO 2026
AI中文摘要

由于非典型声学特征和数据可用性有限,最先进的语音增强(SE)模型在病理语音上的性能显著下降。本文系统地研究了数据增强(DA)策略,以改善受帕金森病影响的病理说话者的SE性能,评估了预测性和生成性SE模型。我们考察了三类DA,即变换性、生成性和噪声增强,并使用客观SE指标评估其影响。实验结果表明,噪声增强始终带来最大且最稳健的提升,变换性增强提供中等改进,而生成性增强收益有限,且随着合成数据量的增加可能损害性能。此外,我们表明DA的有效性取决于SE模型,DA对预测性SE模型更有利。虽然我们的结果表明DA改善了病理说话者的SE性能,但神经典型语音和病理语音之间的性能差距仍然存在,这凸显了未来需要针对病理语音的定向DA策略进行研究。

英文摘要

The performance of state-of-the-art speech enhancement (SE) models considerably degrades for pathological speech due to atypical acoustic characteristics and limited data availability. This paper systematically investigates data augmentation (DA) strategies to improve SE performance for pathological speakers affected by Parkinson`s disease, evaluating both predictive and generative SE models. We examine three DA categories, i.e., transformative, generative, and noise augmentation, assessing their impact with objective SE metrics. Experimental results show that noise augmentation consistently delivers the largest and most robust gains, transformative augmentations provide moderate improvements, while generative augmentation yields limited benefits and can harm performance as the amount of synthetic data increases. Furthermore, we show that the effectiveness of DA varies depending on the SE model, with DA being more beneficial for predictive SE models. While our results demonstrate that DA improves SE performance for pathological speakers, a performance gap between neurotypical and pathological speech persists, highlighting the need for future research on targeted DA strategies for pathological speech.

2511.21300 2026-05-25 eess.SY cs.SY

Data-Driven Reduction of Fault Location Errors in Onshore Wind Farm Collectors

陆上风电场集电线路故障定位误差的数据驱动缩减方法

A. J. Alves Junior, M. J. B. B. Davi, R. A. S. Fernandes, M. Oleskovicz, D. V. Coury

AI总结 本文针对陆上风电场集电系统中由于逆变器资源接入导致的故障定位误差问题,提出了一种基于门控残差网络的数据驱动修正方法。通过特征工程与超参数优化,构建了一个能够适应多种故障场景的改进预测模型,并在基于PSCAD的真实风电场仿真平台上进行了验证。实验结果表明,该方法相比现有方法将故障定位误差降低了76%,具有良好的鲁棒性、可扩展性和适应性。

详情
Journal ref
Sustainable Energy, Grids and Networks, Article 102309 (2026)
AI中文摘要

准确的故障定位对于风电场集电网络的操作可靠性和快速恢复至关重要。然而,逆变器接口资源的日益集成改变了故障期间的电流和电压行为,挑战了传统基于相量的诊断方法的有效性。在此背景下,本文介绍了一种先进的机器学习解决方案,通过引入由门控残差网络驱动的校正模型,专门设计用于最小化残余故障定位误差,从而增强确定性故障距离估计器。通过全面的特征工程和选择过程,开发并训练了一个改进的预测器,该预测器基于在PSCAD真实风电场模型中模拟的多样化故障场景,包括故障类型、电阻、位置、起始角和发电渗透率的变化。使用Optuna框架进行超参数优化,并统计验证了方法的鲁棒性。结果显示,与最先进的方法相比,故障定位误差总体降低了76%,精度显著提高。所提出的方法表现出强大的可扩展性和对拓扑及操作变化的适应性。该方法推进了数据驱动故障定位框架在现代电力系统中的部署。

英文摘要

Accurate fault location is essential for operational reliability and fast restoration in wind farm collector networks. However, the growing integration of inverter-based resources changes the current and voltage behavior during faults, challenging the effectiveness of traditional phasor-based diagnostic methods. In this context, the present paper introduces an advanced machine-learning solution that enhances a deterministic fault distance estimator by incorporating a correction model driven by a Gated Residual Network, specifically designed to minimize residual fault location errors. Through comprehensive feature engineering and selection processes, an improved predictor was developed and trained on a diverse set of fault scenarios simulated in a PSCAD-based real-world wind farm model, including variations in fault type, resistance, location, inception angle, and generation penetration. Hyperparameter optimization was performed using the Optuna framework, and the robustness of the method was statistically validated. Results show a significant improvement in accuracy, with a 76% overall decrease in fault location error compared to state-of-the-art approaches. The proposed method demonstrates strong scalability and adaptability to topological and operational changes. This approach advances the deployment of data-driven fault location frameworks for modern power systems.

2605.23671 2026-05-25 math.OC cs.SY eess.SY

A Non-Iterative Algorithm for Clearing Two-Layer Energy-Sharing Markets with Voltage Constraints

一种考虑电压约束的双层能源共享市场的非迭代清算算法

Tonghua Liu, Yifan Su, Zhaojian Wang, Feng Liu

AI总结 本文提出了一种用于具有电压约束的双层能源共享市场的非迭代清算算法。该算法通过推导下层市场的高效最佳响应函数,并利用定价耦合结构将均衡搜索简化为一维问题,从而将原复杂的双层MPEC问题转化为可计算的单层混合整数二阶锥规划问题。案例研究表明,该方法能够在保证节点电压在规定范围内的同时,快速获得高精度的市场清算方案。

详情
Comments
10 pages, 6 figures; submitted to IEEE Transactions on Smart Grid
AI中文摘要

实时分层能源共享市场有望协调大量产消者。然而,现有大多数清算方法依赖于线性化或直流潮流模型,并未明确处理无功功率或电压安全约束。考虑交流网络约束时,问题成为大规模双层均衡约束数学规划(MPEC),难以实时求解。本文开发了一种考虑电压约束的双层能源共享市场的非迭代清算算法。我们首先为每个下层能源共享市场推导出高效的最优响应函数,并通过利用定价耦合结构将均衡搜索降至一维。然后,我们将该函数嵌入上层网络约束问题,并将双层MPEC重构为单层混合整数二阶锥规划(MISOCP),该问题在计算上易于处理。在包含12,300个产消者的IEEE 123节点系统上的案例研究表明,所提方法将节点电压保持在规定限值内,并在0.829秒内提供最大误差低于0.01%的解。

英文摘要

Real-time hierarchical energy-sharing markets are promising to coordinate large numbers of prosumers. Still, most existing clearing methods rely on linearized or DC power-flow models and do not explicitly handle reactive power or voltage-security constraints. With AC network constraints, the problem becomes a large-scale bilevel Mathematical Program with Equilibrium Constraints (MPEC) that is difficult to solve in real time. This paper develops a non-iterative clearing algorithm for two-layer energy-sharing markets with voltage constraints. We first derive an efficient best-response function for each lower-layer energy-sharing market and reduce the equilibrium search to one dimension by exploiting the pricing-coupling structure. We then embed this function into the upper-layer network-constrained problem and reformulate the bilevel MPEC as a single-level mixed-integer second-order cone program (MISOCP), which is computationally tractable. Case studies on the IEEE 123-bus system with 12,300 prosumers show that the proposed method preserves nodal voltages within prescribed limits and delivers solutions with maximum errors below 0.01\% in 0.829 s.

2605.23661 2026-05-25 eess.SY cs.SY math.OC

Output Feedback MPC with Adaptive Tubes

自适应管道的输出反馈模型预测控制

Anchita Dey, Shubhendu Bhasin

AI总结 本文提出了一种带有自适应管的输出反馈模型预测控制(MPC)框架,用于处理具有参数和加性不确定性的线性时不变系统。该方法通过自适应观测器实时估计系统状态、模型参数和初始条件,并动态更新包含真实参数和初始状态的集合,从而参数化约束优化问题,实现约束收紧、终端项和管形结构的自适应调整。与传统鲁棒管基MPC方法不同,该方法无需在整个参数不确定集上使用统一的二次稳定线性反馈增益,随着不确定性信息的改善,管形结构随之演化,从而提升了控制性能。论文还证明了递归可行性和鲁棒指数稳定性,并给出了数值仿真示例。

详情
AI中文摘要

针对存在参数和加性不确定性的线性时不变系统,提出了一种具有自适应管道的输出反馈模型预测控制(MPC)框架。自适应观测器提供系统状态、模型参数和初始条件的点估计,同时联合更新包含真实参数和初始状态的相应集合。这些估计参数化约束最优控制问题,使得约束收紧、终端成分和管道几何形状能够随着估计的演变而更新。与标准的鲁棒基于管道的MPC公式相比,所提出的方法不需要在参数不确定性集上存在共同的二次稳定线性反馈增益。随着可用不确定性信息的改善,管道几何形状相应演变,形成随时间性能提高的自适应管道MPC框架。建立了递归可行性和鲁棒指数稳定性,并给出了数值示例。

英文摘要

An output feedback model predictive control (MPC) framework with adaptive tubes is proposed for linear time-invariant systems subject to parametric and additive uncertainties. An adaptive observer provides point estimates of the system state, model parameters, and initial condition, while jointly updating the corresponding sets containing the true parameters and initial state. These estimates parameterize the constrained optimal control problem, enabling constraint tightening, terminal ingredients, and tube geometry to be updated as the estimates evolve. In contrast to standard robust tube-based MPC formulations, the proposed approach does not require a common quadratically stabilizing linear feedback gain across the parametric uncertainty set. As the available uncertainty information improves, the tube geometry evolves accordingly, resulting in an adaptive tube MPC framework with improved performance over time. Recursive feasibility and robust exponential stability are established, and a numerical example is presented.

2605.23649 2026-05-25 eess.SP math.ST stat.TH

Diffusion Fluid Antenna Systems for Resilient ISAC

扩散流体天线系统用于弹性ISAC

Noor Waqar, Kai-Kit Wong, Chan-Byoung Chae, Ross Murch

AI总结 本文研究了面向鲁棒集成感知与通信(ISAC)的扩散流体天线系统(Diffusion FAS),旨在从物体侧视角提升系统在复杂电磁环境下的性能。通过引入可重构的空间自由度,该方法利用生成式人工智能框架,在稀疏观测条件下重构空间相关结构,实现对感知特征的动态调控。研究提出了两种新型ISAC模式:生成空间隐身模式可显著抑制目标的感知可见性,目标隔离模式则能有效抑制邻近物体的干扰,为提升ISAC系统的安全性和可靠性提供了新思路。

详情
AI中文摘要

大多数现有的集成感知与通信(ISAC)研究侧重于通过先进的波形设计和功率分配,使基站(BS)能够在共享资源上支持感知和通信。相比之下,对象侧视角仍未得到充分探索。例如,一个对象可能希望保持难以检测以保障安全,而另一个邻近对象可能产生主导反射,混淆BS并损害预期目标的感知可靠性。这些挑战激发了流体天线系统(FAS)范式,该范式引入了一种可重构的空间自由度(DoF),通过端口选择重塑感知特征,其能力超越了波形和功率控制单独所能提供的。在本文中,我们设计了扩散FAS,这是一种生成式人工智能(AI)驱动的框架,利用空间敏捷性在电磁衰落流形上引导ISAC性能。扩散FAS不是仅在功率域优化ISAC,而是将ISAC视为一个动态空间选择问题,其中选择天线状态(即端口)以塑造感知特征,同时保持通信目标。为了在稀疏测量下工作,我们采用条件去噪扩散概率模型(DDPM),从少量观测端口重建潜在的空间相关结构,从而有效探索可重构孔径。我们展示了两种FAS启用的ISAC模式:(1)生成式空间隐身,它识别局部深衰落以将目标的感知可见性抑制多达两个数量级,以及(2)目标隔离,它合成空间零点以抑制来自邻近物体的干扰。

英文摘要

Most existing integrated sensing and communication (ISAC) studies focus on enabling a base station (BS) to support sensing and communication over shared resources through advanced waveform design and power allocation. In contrast, the object-side perspective remains underexplored. For example, an object may wish to remain difficult to detect for security reasons, while another object in close proximity may generate dominant reflections that confuse the BS and impair sensing reliability for the intended target. These challenges motivate the fluid antenna system (FAS) paradigm which introduces a reconfigurable spatial degree of freedom (DoF) that can reshape sensing signatures via port selection, beyond what waveform and power control alone can provide. In this paper, we devise diffusion FAS, a generative artificial intelligence (AI)-driven framework that exploits spatial agility to steer ISAC performance over the electromagnetic fading manifold. Instead of optimizing ISAC solely in the power domain, diffusion FAS casts ISAC as a \emph{dynamic spatial selection} problem in which antenna states (i.e., ports) are chosen to shape sensing signatures while maintaining communication objectives. To work under sparse measurements, we employ a conditional denoising diffusion probabilistic model (DDPM) to reconstruct the latent spatial correlation structure from a small set of observed ports, enabling efficient exploration of the reconfigurable aperture. We demonstrate two FAS-enabled ISAC modes: (1) \emph{generative spatial stealth}, which identifies localized deep fades to suppress a target's sensing visibility by up to two orders of magnitude, and (2) \emph{target isolation}, which synthesizes spatial nulls that reject interference from adjacent objects.

2605.23642 2026-05-25 eess.SP

Fast Fluid Antenna Multiple Access

快速流体天线多址接入

Noor Waqar, Kai-Kit Wong, Chan-Byoung Chae, Ross Murch

AI总结 本文研究了快速流体天线多址接入(FAMA)通信中的干扰抑制问题,提出了一种无需预编码的新型接入方式,通过动态调整接收端天线位置以应对大规模接入场景下的严重干扰。为解决现有研究中对用户终端需全面感知所有天线端口的理想化假设,作者提出了一种基于copula的FAMA框架,通过学习天线端口间信道与干扰的联合依赖结构,仅利用少量观测端口信息即可准确重构未观测的信道和干扰信号。实验表明,当观测端口数量超过空间自由度时,信道、接收信号和干扰的重构归一化均方误差可降至 $10^{-4}$ 量级。

详情
AI中文摘要

快速流体天线多址接入(FAMA)是一种有前景的技术,通过在接收端逐符号重构天线位置,无需预编码或任何其他干扰缓解技术,即可克服大规模接入场景中的严重干扰。然而,该技术通常基于“理想辅助”前提进行研究:每个用户终端(UT)可以在每个符号实例中探测所有流体天线端口,并理想地知道所有端口接收信号的瞬时信号-干扰分离。这种假设不现实,因为它意味着不切实际的硬件和切换限制、导频开销,以及未知的确定信号-干扰分离的能力。本文重新审视快速FAMA通信问题,并提出一个关键问题:UT能否在仅观测一小部分端口的情况下,表现得如同拥有完整的每端口干扰知识?为此,我们提出了一种“copula辅助的FAMA”框架,该框架学习跨端口的复杂三元组$(r_k, h_k, I_k)$的联合依赖结构,其中$r_k$、$h_k$和$I_k$分别表示第$k$个端口的接收信号、信道系数和聚合干扰信号,并利用该学习模型推断未观测的信道和干扰。具体而言,我们设计了一个注意力-copula时间序列模型,该模型在随机部分观测掩码下进行训练,并在丰富和有限散射信道模型下进行评估。仿真结果表明,一旦观测端口数$M$超过空间自由度(DoF),$h$、$r$和$I$的重构归一化均方误差(NMSE)将降至$10^{-4}$量级。

英文摘要

Fast fluid antenna multiple access (FAMA) is an idea that promises to overcome severe interference in massive access scenarios by reconfiguring the antenna's position at the receiver side on a symbol-by-symbol basis, without the need of precoding nor any other interference mitigation techniques. However, this idea is commonly studied under a \emph{genie-aided} premise: each user terminal (UT) can probe \emph{all} fluid-antenna ports in every symbol instance and ideally knows the instantaneous signal-interference split for the received signals at all the ports. Such assumption is unrealistic since it implies impractical hardware and switching limits, pilot overhead, as well as an unknown ability to determine the signal-interference split. This paper revisits the fast FAMA communication problem and asks a key question: can a UT act \emph{as if} it had full per-port interference knowledge while observing only a small fraction of ports? To this end, we propose a \emph{copula-aided FAMA} framework that learns the joint dependence structure of the complex triplets $(r_k,h_k,I_k)$ across ports, where $r_k$, $h_k$ and $I_k$ denote, respectively, the received signal, the channel coefficient and the aggregate interference signal at the $k$-th port, and uses this learned model to infer unobserved channels and interference. Concretely, we devise an attention-copula time-series model that is trained under random partial-observation masks and evaluated under both rich and finite-scattering channel models. Simulation results indicate that the reconstruction normalized mean-square-error (NMSE) for $h$, $r$, and $I$ drops to the order of $10^{-4}$ once the number of observed ports, $M$, exceeds the spatial degrees of freedom (DoF).

2605.23636 2026-05-25 eess.SY cs.SY

RF Instrument Agent (RFIA): Empowering RF Instruments with Natural Language Understanding, Scheduling and Execution of Complex Tasks

RF仪器智能体 (RFIA): 赋予射频仪器自然语言理解、调度与执行复杂任务的能力

Chunhui Li, Wei Fan

AI总结 本文提出了一种名为RF Instrument Agent(RFIA)的自然语言代理框架,旨在提升射频仪器的自动化控制能力。RFIA采用意图-规划-执行解耦架构,利用大语言模型进行任务理解和高层规划,而具体的仪器操作则由确定性运行时处理,从而实现可靠的任务驱动控制。通过结构化的知识库和混合执行图,RFIA支持复杂射频测量任务的闭环执行,并在商用矢量网络分析仪上进行了硬件在环验证,展示了其在多种任务场景下的有效性与安全性。

详情
AI中文摘要

现代射频仪器,如矢量网络分析仪(VNA),已提供成熟的远程控制接口。然而,实际的射频测量工作流程仍依赖手动操作或定制脚本,耗时且需要专业知识。本文提出RF仪器智能体(RFIA),一种用于可靠任务驱动型射频仪器控制的自然语言智能体框架。RFIA采用解耦的意图-规划-执行架构,其中LLM仅用于任务理解和高层规划,而面向仪器的操作由确定性运行时处理。已验证的技能、工作流模板、射频分析工具、仪器特定规则和检索辅助的SCPI知识被组织在结构化知识库中,并采用混合执行图进行闭环测量任务。在商用VNA上实现了硬件在环原型,并使用包含配置、查询、采集、规则感知操作、射频数据分析及闭环测量的16任务基准进行评估。RFIA在预定义执行和安全策略下处理所有基准任务,包括一次预期的安全拒绝。使用230B规模的MiniMax-M2.7模型和较小的27B规模Qwen3.6-27B模型进行的硬件在环结果证实,该解耦架构支持跨不同LLM后端的可靠自然语言射频测量自动化。

英文摘要

Modern radio-frequency (RF) instruments, such as vector network analyzers (VNAs), already provide mature remote-control interfaces. However, practical RF measurement workflows still rely on manual operation or custom scripting, which is time-consuming and expertise-intensive. This paper presents RF Instrument Agent (RFIA), a natural-language agent framework for reliable task-driven RF instrument control. RFIA adopts a decoupled intent--planning--execution architecture, where the LLM is used only for task understanding and high-level planning, while instrument-facing operations are handled by a deterministic runtime. Verified skills, workflow templates, RF analysis tools, instrument-specific rules, and retrieval-assisted SCPI knowledge are organized in a structured knowledge base, and hybrid execution graphs are used for closed-loop measurement tasks. A hardware-in-the-loop prototype is implemented on a commercial VNA and evaluated using a 16-task benchmark covering configuration, query, acquisition, rule-aware operation, RF-data analysis, and closed-loop measurement. RFIA handles all benchmark tasks under predefined execution and safety policies, including one expected safety rejection. Hardware-in-the-loop results with both a 230B-scale MiniMax-M2.7 model and a smaller 27B-scale Qwen3.6-27B model confirm that the decoupled architecture supports reliable natural-language RF measurement automation across different LLM backends.

2605.23619 2026-05-25 eess.AS cs.SD

Frame-Aligned Fusion of Canary and WavLM for Non-Intrusive Intelligibility Prediction of Hearing-Aid-Processed Speech

Canary与WavLM的帧对齐融合用于助听器处理语音的非侵入式清晰度预测

Kazushi Nakazawa

AI总结 本文研究了在无参考条件下预测助听器处理语音可懂度的问题,提出了一种基于Canary和WavLM两个预训练语音编码器的框架对齐融合方法。通过比较多种融合策略,作者发现将WavLM经过可学习的步进卷积处理后,在较粗的Canary时间线上进行融合,能够有效提升预测性能,最终在Eval数据集上取得了较低的RMSE和较高的相关系数。实验分析表明,在池化前建立粗粒度的时序对应关系有助于模型更好地捕捉语音可懂度的关键特征。

详情
Comments
7 pages, 2 figures
AI中文摘要

非侵入式清晰度预测估计听力受损听众对助听器处理语音的理解程度,无需干净参考。我们在第三届清晰度预测挑战赛中研究此任务,使用两个冻结的语音编码器Canary和WavLM。核心问题不仅在于是否应结合互补的预训练表示,还在于它们的交互应发生在何处。我们在共享的左右保留双耳框架下比较了单骨干基线、统一分数平均、池后融合、交叉注意力、帧对齐融合和反向对齐。在比较的系统中,最佳模型使用可学习的步进卷积对WavLM进行时间准备,并在池化前在较粗的Canary时间线上将其与Canary融合,达到Eval RMSE 24.96±0.06和Eval Corr 0.796±0.001。严重性、增强系统、层窗口和时间偏移分析表明,池化前的粗局部时间对应是该任务的有用归纳偏置。

英文摘要

Non-intrusive intelligibility prediction estimates how well hearing-impaired listeners understand hearing-aid-processed speech without a clean reference. We study this task in the 3rd Clarity Prediction Challenge using two frozen speech encoders, Canary and WavLM. The central question is not only whether complementary pretrained representations should be combined, but where their interaction should occur. We compare single-backbone baselines, uniform score averaging, pool-late fusion, cross-attention, frame-aligned fusion, and reverse alignment under a shared left/right-preserving binaural framework. Among the compared systems, the best model temporally prepares WavLM with a learnable strided convolution and fuses it with Canary on the coarser Canary timeline before pooling, reaching Eval RMSE 24.96$\pm$0.06 and Eval Corr 0.796$\pm$0.001. Severity, enhancement-system, layer-window, and temporal-shift analyses indicate that coarse local temporal correspondence before pooling is a useful inductive bias for this task.

2605.23604 2026-05-25 eess.AS cs.SD

Word-Level Modeling with Alignment-Aware Acoustic Fusion for Text-Assisted Intelligibility Prediction in Listeners with Hearing Loss

基于对齐感知声学融合的词级建模用于听力损失患者文本辅助可懂度预测

Kazushi Nakazawa

AI总结 本文研究了如何利用文本辅助预测听力障碍者对语音的可懂度,提出了一种基于词级建模和对齐感知声学融合的方法。该方法结合冻结的Whisper编码器分析降质语音,通过条件解码器结合标准文本进行预测,并引入词对齐的局部声学分支与全局声学分支进行校准,提升了预测性能。实验表明,该方法在多项指标上优于基线模型,验证了细粒度预测与对齐融合的有效性。

详情
Comments
7 pages, 2 figures
AI中文摘要

我们针对CPC3中听力受损者的文本辅助语音可懂度预测问题。尽管目标是句子级百分比,但它由参考词识别结果决定。我们将预测建模为参考条件下的词级正确性建模:冻结的Whisper编码器分析退化语音,教师强制解码器以规范转录为条件,句子可懂度通过对有效参考词的预测正确概率取平均得到。为了补充转录条件解码器状态,我们添加了一个基于字符级交叉注意力对齐的词对齐局部声学分支,以及一个用于校准的语句级全局声学分支。在官方评估集上,解码器基线获得RMSE 24.92和相关系数0.795,而联合融合将错误词F1提升至0.778,MCC 0.626,相关系数0.806,RMSE 24.39。使用Whisper medium的类似趋势表明,增益来自预测粒度和对齐感知融合。

英文摘要

We address text-assisted speech intelligibility prediction for hearing-impaired listeners in CPC3. Although the target is a sentence-level percentage, it is determined by reference-word recognition outcomes. We formulate prediction as reference-conditioned word-level correctness modeling: a frozen Whisper encoder analyzes degraded speech, a teacher-forced decoder conditions on the canonical transcript, and sentence intelligibility is obtained by averaging predicted correctness probabilities over valid reference words. To complement transcript-conditioned decoder states, we add a word-aligned local acoustic branch based on character-level cross-attention alignment and an utterance-level global acoustic branch for calibration. On the official evaluation set, the decoder baseline obtains RMSE 24.92 and correlation 0.795, while joint fusion improves to incorrect-word F1 0.778, MCC 0.626, correlation 0.806, and RMSE 24.39. A similar trend with Whisper medium suggests that the gain comes from prediction granularity and alignment-aware fusion.

2605.23593 2026-05-25 eess.AS

A study on weakly-supervised training approaches for phoneme-level pronunciation scoring

音素级发音评分的弱监督训练方法研究

Jazmín Vidal, Luciana Ferrer

AI总结 本文研究了在没有音素级标注的情况下,如何通过高层次的发音标签来学习音素级别的发音评分。作者提出了一种弱监督训练方法,仅使用句子或词语级别的标签进行训练,并分析了这种方法是否能生成有效的音素级评分。此外,还提出了一种两阶段训练策略,通过少量精心选择的音素级标注数据对模型进行微调,实验表明该方法在性能上可与全音素级监督方法相比,且只需少量音素级标注数据。

详情
AI中文摘要

音素级计算机辅助发音训练系统通常依赖于音素级标注,这些标注成本高昂且稀缺。在这项工作中,我们研究了是否可以通过利用更高级别的发音标签,在没有音素级监督的情况下学习音素级发音错误信息。具体来说,我们研究了一种弱监督设置,其中模型仅使用话语级或单词级发音标签进行训练,并分析这种监督是否能诱导出有用的音素级分数预测。我们进一步考虑了一种两阶段训练场景,其中仅使用话语级标签训练的模型通过有限数量的精心挑选的音素级标注话语进行微调。我们发现,使用我们提出的架构和选择过程,两阶段过程能够获得与完全音素级监督相当的结果,仅需要一小部分音素级标签。

英文摘要

Phoneme-level computer-assisted pronunciation training systems typically rely on phoneme-level annotations, which are costly and scarce. In this work, we investigate whether phoneme-level mispronunciation information can be learned without phoneme-level supervision by exploiting higher-level pronunciation labels. Specifically, we study a weakly supervised setting in which models are trained using only utterance- or word-level pronunciation labels and analyze whether this supervision induces useful phoneme-level score predictions. We further consider a two-stage training scenario in which a model trained only with utterance-level labels is finetuned using a limited number of carefully-selected phoneme-level labeled utterances. We find that, using our proposed architecture and selection process, the two-stage process leads to comparable results to those obtained with full phoneme-level supervision, requiring only a small fraction of phoneme-level labels.

2605.23588 2026-05-25 eess.SP

Low-cost Parallel Transmission for Dense Indoor Data Collection with LoRaWAN: Time Synchronization and Resource Allocation

低成本并行传输用于密集室内数据收集的LoRaWAN:时间同步与资源分配

Junxiao Liu, Xinyu Fan, Luping Xiang, Kun Yang

AI总结 本文针对LoRaWAN在密集室内物联网场景下因默认随机接入机制导致的信道争用严重、数据包丢失率高和吞吐量低的问题,提出了一种低成本的时隙同步与资源分配方案。该方法通过引入一个专用的出带同步信道,结合时分多址(TDMA)机制,实现毫秒级的时间同步,无需网关调度或硬件改造,且不增加下行开销。实验表明,该方法在20节点的室内部署中将系统吞吐量提升了30%以上,数据包丢失率从25.8%降至5.02%,验证了其在高密度场景下的有效性与实用性。

详情
Journal ref
IEEE Internet of Things Journal
AI中文摘要

LoRaWAN凭借其强大的穿透能力和低功耗,成为大规模室内物联网数据回传的低成本解决方案。然而,其默认的纯ALOHA接入机制在密集并发传输下会导致严重的信道竞争、大量丢包和吞吐量下降。为此,我们提出一种轻量级带外(OOB)同步方案,将时分多址(TDMA)机制集成到商用LoRaWAN Class~A网络中。与需要网关调度、频繁下行信令或定制硬件的方法不同,我们的方法引入一个低成本节点,通过专用OOB同步信道提供毫秒级对齐。终端设备通过短暂调谐其现有LoRa收发器即可无缝接入该信道。因此,该方案在稳态报告阶段零下行开销,无需修改网关或终端设备的硬件,并完全向后兼容。该设计在配置的标称资源容量内实现了无冲突的调度信道接入,从而提高了吞吐量并减少了竞争。使用室内定位原型的实际实验表明,所提出的TDMA-LoRaWAN架构在20节点室内部署中,系统吞吐量提升超过30%,丢包率从25.8%降至5.02%。此外,大规模仿真验证了这些实验结果,支持更大网络规模下的可扩展性分析,并表明在密集网络环境下每成功数据包的能效有所提高。这些综合结果证明了所提方法在密集室内物联网数据收集中的有效性,并显示了其在高上行报告需求下的实际潜力。

英文摘要

LoRaWAN is a compelling low-cost solution for large-scale indoor Internet of Things (IoT) data backhaul, owing to its strong penetration capability and low power consumption. However, its default pure ALOHA access mechanism leads to severe channel contention, substantial packet loss, and reduced throughput under dense, concurrent transmissions. To overcome this, we propose a lightweight out-of-band (OOB) synchronization scheme that integrates a time division multiple access (TDMA) mechanism into commercial LoRaWAN Class~A networks. Unlike approaches requiring gateway scheduling, frequent downlink signaling, or custom hardware, our method introduces a single low-cost node providing millisecond-level alignment via a dedicated OOB synchronization channel. End devices seamlessly access this channel by briefly retuning their existing LoRa transceivers. Consequently, the scheme imposes zero downlink overhead during the steady-state reporting phase, requires no hardware modifications to gateways or end devices, and remains fully backward-compatible. This design enables collision-free scheduled channel access within the configured nominal resource capacity, thereby improving throughput and reducing contention. Real-world experiments using an indoor positioning prototype demonstrate that the proposed TDMA-LoRaWAN architecture improves system throughput by over 30\% and reduces the packet loss rate from 25.8\% to 5.02\% in a 20-node indoor deployment. Furthermore, large-scale simulations corroborate these empirical findings, support the scalability analysis under larger network sizes, and indicate improved energy efficiency per successful packet in dense network settings. These combined results demonstrate the effectiveness of the proposed approach for dense indoor IoT data collection and indicate its practical potential under high uplink reporting demands.

2605.23568 2026-05-25 cs.RO cs.SY eess.SY

TactileReflex: Noise-Statistics-Driven Vision-Tactile Reflex Control for Force-Sensitive Manipulation

TactileReflex:基于噪声统计的视觉-触觉反射控制用于力敏感操作

Ziyan Feng, Yulong Fu, Zheng Li, Yuxin He, Jieji Ren, Lujia Wang, Jinni Zhou, Yudong Zhong, Qiang Nie

AI总结 本文提出了一种基于噪声统计特性的视觉-触觉反射控制方法TactileReflex,用于实现对力敏感的精细操作任务,如液体填充的塑料杯的抓取与操作。该方法通过分析触觉传感器的内在噪声特性,直接推导出控制器的阈值,无需外部力标定或手动调参。实验表明,TactileReflex能够有效防止容器不可逆变形,并在动态倒水任务中表现出优异的稳定性与成功率,具有作为高层次操作系统安全层的潜力。

详情
Comments
8 pages, 4 figures, 6 tables
AI中文摘要

操作易变形的柔性容器(如装有液体的一次性塑料杯)需要在极窄的力裕度内实时调整抓取力:力不足会导致滑动,力过大则会使薄壁不可逆变形。现有方法难以完成此类力敏感操作任务。我们提出一种基于噪声统计的标定驱动反射控制范式,结合基于视觉的触觉感知:通过分析传感器的固有噪声特性(通过简短的静态保持-卸载协议),直接推导出所有控制器阈值,消除了外部力标定、试错手动调参或材料特定的物理模型。实现该范式,我们提出了TactileReflex,一个三通道闭环控制器,从双视觉触觉传感器中提取三个图像级代理:剪切强度($S_y$)、接触强度($F_n$)和压力中心($C$),并以约12Hz驱动优先反射通道,用于滑动抑制、重量自适应释放和力保护。每个通道通过噪声导出的阈值直接在其代理上闭环。消融实验表明,只有完整的三通道系统能够防止容器不可逆变形(5/5成功,而部分配置最多1/5成功)。在动态倾倒任务中,固定力基线因姿态漂移在所有10次尝试中均失败,而TactileReflex在两种水量下实现了9/10成功。作为一个自包含且可解释的控制器,TactileReflex可作为高层操作流水线(包括无触觉VR遥操作和视觉-语言-动作策略)的即插即用安全层。

英文摘要

Manipulating fragile deformable containers, such as disposable plastic cups filled with liquid, demands real-time grip-force adaptation within an extremely narrow force margin: insufficient force causes slip, while excessive force irreversibly deforms the thin wall. Existing approaches struggle to achieve such force-sensitive manipulation tasks. We propose a noise-statistics-based calibration-driven reflex control paradigm with vision-based tactile sensing: by analyzing the sensor's intrinsic noise characteristics (via a brief static-hold-and-unload protocol), we directly derive all controller thresholds, eliminating external force calibration, trial-and-error manual tuning, or material-specific physical models. Instantiating this paradigm, we present TactileReflex, a three-channel closed-loop controller that extracts three image-level proxies, shear intensity ($S_y$), contact intensity ($F_n$), and center of pressure ($C$), from dual visuo-tactile sensors and drives prioritized reflex channels at ~12 Hz for slip suppression, weight-adaptive release, and force protection. Each channel closes the loop directly on its proxy via noise-derived thresholds. Ablation demonstrates that only the full three-channel system is able to prevent irreversible container deformation (5/5 success vs. at most 1/5 for partial configurations). In a dynamic pouring task, fixed-effort baselines fail in all 10 attempts due to pose drift, while TactileReflex achieves 9/10 success across two water volumes. As a self-contained and interpretable controller, TactileReflex can serve as a plug-and-play safety layer beneath high-level manipulation pipelines, including haptic-free VR teleoperation and vision-language-action (VLA) policies.

2605.23561 2026-05-25 eess.SP

Reliable UAV Detection with ISAC

基于ISAC的可靠无人机检测

Stephan Saur, Mark Doll, Artjom Grudnitsky, Silvio Mandelli, Lucas Giroto, Marcus Henninger, Thorsten Wild

AI总结 本文研究了在5G-Advanced和未来6G网络中,利用集成感知与通信(ISAC)系统实现可靠无人机检测的问题。研究采用未修改的商用5G设备,基于单站OFDM雷达进行实验,并与基于链路预算和硬件缺陷的模型预测性能进行对比。实验结果表明,在复杂且存在强杂波的无线电环境中,仍能实现超过500米距离、亚米级精度的可靠无人机检测。

详情
AI中文摘要

无人机检测是5G-Advanced和未来6G网络中集成感知与通信系统的一个突出用例。本文展示了使用未修改的商业5G硬件进行单站正交频分复用雷达检测小型无人机的实验结果,并将其与基于链路预算和硬件损伤模型的预期性能进行比较。我们表明,在充满强杂波的高挑战性无线电环境中,仍能在超过500米的距离上实现亚米级精度的可靠检测。

英文摘要

Unmanned Aerial Vehicle (UAV) detection is one prominent use case of Integrated Sensing and Communication (ISAC) systems in 5G-Advanced and future 6G networks. In this paper, we present experimental results for the detection of a small UAV using unmodified commercial 5G hardware for mono-static Orthogonal Frequency-Division Multiplexing (OFDM) radar and compare them with the expected performance based on models for link budget and hardware impairments. We show that reliable detection with sub-meter accuracy is still possible in over 500 meters distance in a challenging radio environment rich of strong clutter.

2605.23537 2026-05-25 stat.ML eess.SP

Concomitant DAG Learning: On the Roles of Noise Adaptivity, Sparsity, and Non-negativity

伴随DAG学习:噪声自适应性、稀疏性和非负性的作用

Gonzalo Mateos, Samuel Rey, Hamed Ajorlou, Mariano Tepper

AI总结 该论文探讨了如何从观测数据中学习有向无环图(DAG),以揭示复杂系统中的因果关系。研究提出了一个连续的评分估计框架,通过邻接矩阵对DAG结构进行建模,克服了传统方法在可扩展性和可识别性上的挑战。文章还介绍了共轭DAG估计方法,能够同时推断稀疏的因果结构和外生噪声水平,提高在异方差性和分布偏移情况下的鲁棒性,为因果推断与大规模图学习的交叉研究提供了新方向。

详情
Comments
Submitted to the IEEE Signal Processing Magazine Special Issue: From Signals to Causes: Methodological Advances in Causal Inference. arXiv admin note: text overlap with arXiv:2310.02895
AI中文摘要

有向无环图(DAG)构成了一种核心建模工具,能够对复杂系统中的因果相互作用进行原则性推理。然而,由于一组变量背后的因果结构通常是未知的,并且干预可能不可行或实施起来有伦理挑战,因此需要解决从观测数据推断DAG的任务。然而,大多数经典的结构识别方法面临两个关键障碍:强制执行无环性的组合挑战(严重限制了可扩展性),以及由潜在混杂或异质噪声引起的可识别性挑战。本教程概述了最近信号处理和优化方面的进展,这些进展通过将DAG结构学习重新表述为关于邻接矩阵的连续、基于得分的估计问题来解决这些问题。我们首先对结构方程模型和因果图恢复的公式进行教学性介绍,然后对基于得分的方法进行历史综述,从早期的组合搜索方案和贪婪启发式方法到利用无环性平滑表征的现代连续框架。在此基础上,我们描述了伴随DAG估计方法,该方法联合推断稀疏因果结构和外生噪声水平,通过使估计器具有噪声自适应性,提高了在异方差性和分布偏移下的鲁棒性。总而言之,本教程向读者介绍了因果推断、高维统计和可扩展图学习交叉领域信号处理研究的挑战和机遇,同时概述了新兴方向,包括在线、非线性和神经因果发现。

英文摘要

Directed acyclic graphs (DAGs) constitute a central modeling tool to enable principled reasoning about cause-effect interactions in complex systems. However, since the causal structure underlying a group of variables is often unknown and interventions may be infeasible or ethically challenging to implement, there is a need to address the task of inferring DAGs from observational data. However, most classical structure identification approaches face two key obstacles: the combinatorial challenge of enforcing acyclicity, which severely limits scalability, and identifiability challenges arising from latent confounding or heterogeneous noise. This tutorial offers an overview of recent signal processing and optimization advances that address these issues by recasting DAG structure learning as a continuous, score-based estimation problem over adjacency matrices. We begin with a didactic introduction to structural equation models and the formulation of causal graph recovery, followed by a historical survey of score-based methods ranging from early combinatorial search schemes and greedy heuristics to modern continuous frameworks that leverage smooth characterizations of acyclicity. Building on this foundation, we describe concomitant DAG estimation methods that jointly infer sparse causal structure and exogenous noise levels, improving robustness under heteroscedasticity and distribution shifts by rendering the estimator noise adaptive. All in all, the tutorial introduces readers to challenges and opportunities for signal processing research at the crossroads of causal inference, high-dimensional statistics, and scalable graph learning, while outlining emerging directions including online, nonlinear, and neural causal discovery.

2605.23536 2026-05-25 eess.SP

Utilizing Missed Detections in Directional Sensitivity-Based DOA Estimation

利用方向灵敏度DOA估计中的漏检

Gustav Zetterqvist, Fredrik Gustafsson, Gustaf Hendeby

AI总结 本文提出了一种基于信号强度的方向到达(DOA)估计算法,专门用于方向性传感器,并明确考虑了漏检信息。与传统基于相位的DOA估计方法不同,该方法仅依赖接收到的信号强度测量值,从而使得漏检自然地成为感知和检测过程的一部分,并通过已知的检测阈值传递有价值的信息。通过将检测到的信号和漏检信息同时纳入似然函数,本文开发了一种概率估计方法,充分利用了测量和检测模型。实验结果表明,该方法在高漏检率等挑战性场景下显著提升了DOA估计的准确性,并在实际的蓝牙低功耗(BLE)信号实验中验证了其有效性。

详情
Comments
This work has been submitted to the IEEE for possible publication
AI中文摘要

本文针对方向传感器提出了一种基于信号强度的到达方向(DOA)估计方法,该方法明确考虑了漏检。在传统的基于相位的DOA估计框架中,低于检测阈值的预期发射源的负面信息超出了标准测量模型的范围。与基于相位的DOA估计方法不同,所提出的方法仅依赖于接收信号强度测量。因此,漏检自然地从感知和检测过程中产生,并通过已知的检测阈值传递有价值的信息。通过将检测到的信号和漏检纳入似然函数,我们开发了一种概率估计方法,充分利用了底层的测量和检测模型。仿真结果表明,与基线技术相比,所提出的方法显著提高了DOA估计精度,特别是在漏检率高的挑战性场景中。使用蓝牙低功耗(BLE)信号和定向天线的实际实验进一步验证了该方法的有效性,展示了显著的性能提升。这些发现强调了在传感器阵列处理中对漏检进行建模的价值,并为增强无线通信系统中的定位性能开辟了新途径。

英文摘要

This paper introduces a signal strength-based direction of arrival (DOA) estimation approach for directional sensors that explicitly accounts for missed detections. In traditional phase-based DOA estimation frameworks, negative information from expected emitters that fall below the detection threshold fall outside the scope of standard measurement models. Unlike phase-based DOA estimation methods, the proposed approach relies only on received signal strength measurements. As a result, missed detections arise naturally from the sensing and detection process and convey valuable information via the known detection thresholds. By incorporating both detected signals and missed detections into the likelihood function, we develop a probabilistic estimation method that fully leverages the underlying measurement and detection models. Simulation results show that the proposed method significantly improves DOA estimation accuracy compared to baseline techniques, particularly in challenging scenarios with high missed-detection rates. Real-world experiments using Bluetooth Low Energy (BLE) signals and directional antennas further validate the effectiveness of the approach, demonstrating substantial performance gains. These findings highlight the value of modeling missed detections in sensor array processing and open new avenues for enhancing localization performance in wireless communication systems.