arXivDaily arXiv每日学术速递 周一至周五更新
2606.20450 2026-06-19 eess.SP 新提交

Max-Min Rate Fairness Optimization for Multi-User Pinching-Antenna NOMA Systems

多用户捏合天线NOMA系统的最大最小速率公平性优化

Mahmoud AlaaEldin, Amy Inwood, Xidong Mu, Michail Matthaiou

AI总结 针对多波导捏合天线NOMA下行系统,提出两阶段优化框架,联合优化天线位置和预编码,以最大化最小用户速率,显著提升性能。

详情
AI中文摘要

捏合天线系统(PAS)通过沿米级波导重新定位介电辐射元件(称为捏合天线,PA)来克服信号阻塞,从而创建视距链路。由于每个波导由单个射频(RF)链驱动,非正交多址(NOMA)非常适合基于PAS的多用户通信。本文研究了一个多波导的PAS使能多用户下行NOMA系统,每个波导配备多个PA。联合优化PA位置和基站发射预编码,以最大化最小用户速率。由于PA间干扰引起的快速振荡相干和,所得问题高度非光滑且非凸。为应对这一挑战,我们提出了一种两阶段结构化优化框架。在第一阶段,使用内点算法进行粗略的PA位置和功率分配优化,同时忽略PA信道相位,从而得到接近真实最优的解。在第二阶段,考虑PA信道相位偏移,对PA位置和发射预编码进行微调。该阶段首先应用相位归零,即局部重新定位每个PA,使相应信道相位归零并促进建设性相干合并。然后使用交替过程,迭代执行前后向PA位置精炼和基于逐次凸近似的复发射预编码优化直至收敛,从而减少残余相位失配。仿真结果表明,所提框架显著优于启发式优化基准,且计算时间更短。结果还展示了相对于可比的多输入多输出下行NOMA系统的巨大增益,并揭示了PA数量、用户数量和发射功率对系统性能的影响。

英文摘要

Pinching-antenna systems (PASs) can overcome signal blockage by repositioning dielectric radiating elements, called pinching antennas (PAs), along meter-scale waveguides to create line-of-sight links. Since each waveguide is driven by a single radio-frequency (RF) chain, non-orthogonal multiple access (NOMA) is well suited for PAS-based multi-user communications. This paper studies a PAS-enabled multi-user downlink NOMA system with multiple waveguides, each equipped with multiple PAs. The PA positions and base-station transmit precoding are jointly optimized to maximize the minimum user rate. The resulting problem is highly non-smooth and non-convex because of the rapidly oscillating coherent sums caused by inter-PA interference. To tackle this challenge, we propose a two-stage structured optimization framework. In the first stage, coarse PA-position and power-allocation optimization is performed using an interior-point algorithm while neglecting the PA channel phases, which gives solutions near the true optima. In the second stage, PA positions and transmit precoding are fine-tuned while accounting for the PA channel phase shifts. This stage first applies phase zeroing, where each PA is locally repositioned to align the corresponding channel phase toward zero and promote constructive coherent combining. It then uses an alternating procedure that iteratively performs forward-backward PA position refinement and successive-convex-approximation-based complex transmit precoding optimization until convergence, thereby reducing residual phase mismatch. Simulation results show that the proposed framework significantly outperforms heuristic optimization benchmarks with much lower computational time. They also demonstrate large gains over a comparable multiple-input multiple-output downlink NOMA system and reveal the impact of the number of PAs, users, and transmit power on system performance.

2606.20222 2026-06-19 eess.SP 新提交

Reliable ORIS-assisted FSO Communications via HARQ

基于HARQ的可靠ORIS辅助自由空间光通信

Georgios D. Chondrogiannis, Athanasios P. Chrysologou, Vasilis K. Papanikolaou, Alexandros-Apostolos A. Boulogeorgos, Nestor D. Chatzidiamantis, Robert Schober

AI总结 研究结合光学可重构智能表面(ORIS)和混合自动重传请求(HARQ)的自由空间光通信链路,推导端到端信道统计模型,给出HARQ-CC的闭式中断概率和HARQ-IR的中断上界,分析分集阶数和延迟特性。

Comments 13 pages, 8 Figures, Journal

详情
AI中文摘要

本文研究了一种由光学可重构智能表面(ORIS)辅助并通过混合自动重传请求(HARQ)方案增强的自由空间光(FSO)链路。ORIS在障碍物周围创建虚拟视距路径,而HARQ通过重传和合并恢复受湍流、指向抖动和几何损耗损坏的帧。我们首先通过联合考虑大气湍流、ORIS引起的指向误差和几何衰减,推导了端到端发射器-ORIS-接收器(Tx-ORIS-Rx)反射信道的易处理统计模型。基于这些结果,我们获得了采用Chase合并的HARQ(HARQ-CC)的闭式中断概率(OP)表达式,以及采用增量冗余的HARQ(HARQ-IR)的解析中断上界,这些表达式对任意最大传输轮次有效。我们进一步进行了高信噪比(SNR)分析,该分析提供了中断行为的全面表征,并揭示了两种方案的分集阶数。此外,我们通过平均传输轮次和给定成功解码的条件平均轮次来表征截断HARQ过程的延迟行为。最后,数值和蒙特卡洛结果验证了所提出的分析,并表明HARQ显著提高了ORIS辅助FSO的可靠性,即使对于少量重传轮次,HARQ-IR也能实现比HARQ-CC更低的中断和延迟。

英文摘要

This paper studies a free-space optical (FSO) link assisted by an optical reconfigurable intelligent surface (ORIS) and enhanced by a hybrid automatic repeat request (HARQ) scheme. The ORIS creates a virtual line-of-sight path around obstacles, while HARQ recovers frames corrupted by turbulence, pointing jitter, and geometric loss through retransmission and combining. We first derive a tractable statistical model for the end-to-end transmitter-ORIS-receiver (Tx-ORIS-Rx) reflected channel by jointly accounting for atmospheric turbulence, ORIS-induced pointing errors, and geometric attenuation. Building on these results, we obtain closed-form outage probability (OP) expressions for HARQ with Chase combining (HARQ-CC) and analytical outage upper bounds for HARQ with incremental redundancy (HARQ-IR), valid for an arbitrary maximum number of transmission rounds. We further conduct a high signal-to-noise ratio (SNR) analysis that provides a thorough characterization of the outage behavior and reveals the diversity order of both schemes. In addition, we characterize the delay behavior of the truncated HARQ process through the mean number of transmission rounds and the conditional mean number of rounds given successful decoding. Finally, numerical and Monte Carlo results validate the proposed analysis and show that HARQ substantially improves ORIS-assisted FSO reliability, with HARQ-IR achieving lower outage and delay than HARQ-CC, even for a small number of retransmission rounds.

2606.20011 2026-06-19 eess.SP 新提交

Amplitude-Phase-Frequency Block Modulation for OFDM-ISAC with SI-Free PAPR Reduction and Pilotless Sensing

用于OFDM-ISAC的幅度-相位-频率块调制:无旁瓣信息PAPR降低和无导频感知

Bensheng Yang, Min Fan, Haitao Zhao, Haiming Wang

AI总结 提出一种幅度-相位-频率块调制方案,通过斯托克斯球映射和分组相位优化,在OFDM中实现无资源分割的通信与感知集成,同时降低PAPR并消除导频需求。

详情
AI中文摘要

基于正交频分复用(OFDM)的集成感知与通信系统需要一种统一波形,同时支持可靠数据传输、低峰均功率比(PAPR)和精确信道感知。现有方法在分离的时间或频率资源上复用通信与感知,或依赖专用导频进行信道估计,限制了系统灵活性并增加了开销。本文提出一种用于OFDM的幅度-相位-频率块调制(APFBM)方案,在不进行资源分割的情况下实现通信与感知的波形级集成。信息符号在斯托克斯球上表示,并通过明确规则映射到能量归一化的琼斯矢量,该规则为每个块建立确定性相位参考。这种映射暴露了信号结构中固有的共相自由度。在发射端,分组相位优化算法利用该结构自由度降低PAPR,无需旁瓣信息(SI)。在接收端,相同的确定性相位结构支持基于维特比的最大似然(ML)序列检测算法,该算法联合恢复优化相位并估计块状信道幅度和相位。无需专用感知导频,因为感知观测量直接从通信波形中提取。推导了闭式错误率和感知精度表达式。在软件无线电链路上的数值仿真和空中测量证实了有效的PAPR降低、精确的信道感知、可靠的相位恢复和稳定的信道状态信息重建。所提方案以适度降低频谱效率为代价,实现了统一波形设计,同时提供无SI的PAPR降低和无导频感知。

英文摘要

Orthogonal Frequency Division Multiplexing (OFDM)-based integrated sensing and communication systems demand a unified waveform that simultaneously supports reliable data transmission, low peak-to-average power ratio (PAPR), and accurate channel sensing. Existing approaches multiplex communication and sensing across separate time or frequency resources, or rely on dedicated pilots for channel estimation, limiting system flexibility and increasing overhead. This paper proposes an amplitude-phase-frequency block modulation (APFBM) scheme for OFDM that achieves waveform-level integration of communication and sensing without resource partitioning. Information symbols are represented on the Stokes sphere and mapped to energy-normalized Jones vectors through an unambiguous rule that establishes a deterministic phase reference per block. This mapping exposes a commonphase degree of freedom inherent in the signal structure. At the transmitter, a grouped phase optimization algorithm exploits this structural freedom to reduce the PAPR without side information (SI). At the receiver, the same deterministic phase structure enables a Viterbi-based maximum-likelihood (ML) sequence detection algorithm that jointly recovers the optimization phases and estimates the block-wise channel amplitude and phase. No dedicated sensing pilots are required, as the sensing observables are extracted directly from the communication waveform. Closed-form error-rate and sensing-accuracy expressions are derived. Numerical simulations and over-the-air measurements on a software-defined radio link confirm effective PAPR reduction, accurate channel sensing, reliable phase recovery, and stable channel state information reconstruction. The proposed scheme trades a moderate reduction in spectral efficiency for a unified waveform design that simultaneously delivers SI-free PAPR reduction and pilotless sensing.

2606.19953 2026-06-19 eess.SP 新提交

ConsisFormer: Compute-Efficient Transformer for Wireless Foundation Models Based on Channel Consistency

ConsisFormer: 基于信道一致性的无线基础模型高效计算Transformer

Yuwei Wang, Li Sun, Tingting Yang, Liwen Jing, Yuxuan Shi, Maged Elkashlan, Mérouane Debbah

AI总结 提出ConsisFormer,利用无线信道短时一致性,通过自适应令牌聚合和特征序列插值降低Transformer计算复杂度,在多种任务上减少83%以上计算量且性能损失极小。

详情
AI中文摘要

无线基础模型(WFM)最近成为AI原生6G网络的一种有前景的范式,能够实现适应各种通信和感知任务的通用信道表示。现有的WFM主要基于Transformer架构,该架构提供了优越的性能,但计算复杂度与输入序列长度的平方成正比,这对其在严格推理延迟约束下的部署构成了重大障碍。为了解决这个问题,本文提出ConsisFormer,一种基于无线信道短时一致性的高效计算Transformer设计,作为WFM的骨干网络。利用相邻时间或频率实例共享相似的散射体簇并因此表现出相似信道特性的观察,我们开发了自适应令牌聚合(ATA)模块,动态合并相邻信道状态信息(CSI)令牌,从而减少自注意力计算中涉及的令牌序列长度以降低计算成本。此外,我们提出了一种特征序列插值(FSI)方法,基于Transformer块输出的稀疏特征序列恢复完整的CSI表示,从而在保持性能不受影响的同时确保低复杂度。此外,我们提出了一种用于WFM的聚合自编码器(AAE)预训练范式,通过压缩和恢复从稀疏化CSI令牌中学习鲁棒的信道表示。仿真结果表明,所提出的设计将WFM的计算复杂度降低了83%以上,同时在包括信道预测、视距/非视距分类、波束预测和定位在内的各种任务上性能损失极小。

英文摘要

Wireless foundation models (WFMs) have recently emerged as a promising paradigm for AI-native 6G networks, enabling universal channel representations adaptable to diverse communication and sensing tasks. Existing WFMs are predominantly built upon the Transformer architecture, which delivers superior performance but incurs computational complexity proportional to the square of the input sequence length, posing a significant barrier to their deployment under stringent inference latency constraints. To address this issue, in this paper, we propose ConsisFormer, a compute-efficient Transformer design based on short-term consistency of wireless channels, as a WFM backbone. By utilizing the observation that adjacent time or frequency instances share similar clusters of scatterers and thus exhibit similar channel characteristics, we develop an adaptive token aggregation (ATA) module to dynamically merge neighboring channel state information (CSI) tokens, thereby reducing the length of the token sequence involved in self-attention calculations to lower the computational cost. Furthermore, we propose a feature sequence interpolation (FSI) method to recover the full CSI representation based on the sparse feature sequence outputted from the Transformer blocks, thus keeping the performance unaffected while ensuring low complexity. Moreover, we propose an aggregated auto-encoder (AAE) pre-training paradigm for WFMs, enabling robust channel representation learning from sparsified CSI tokens via compression and recovery. Simulation results show that the proposed design reduces the computational complexity of WFM by over $83\%$ with negligible performance loss on various tasks including channel prediction, LoS/NLOS classification, beam prediction, and localization.

2606.19724 2026-06-19 eess.SP 新提交

Cyclic-Prefix OFDM Probing for Spatial-ISI-Free Distributed Acoustic Sensing via Frequency-Domain Channel Reconstruction

基于频域信道重构的循环前缀OFDM探测实现无空间ISI分布式声学传感

Huan Huang, Zhiyang Xue, Ziang Chen, Zhongxing Tian, Dongdong Zou, Gangxiang Shen, Yi Cai

AI总结 提出使用循环前缀正交频分复用(CP-OFDM)波形作为传感探头,通过频域信道重构消除匹配滤波脉冲压缩中的空间符号间干扰(ISI),实现无空间ISI的分布式声学传感,并同时恢复通信数据,展示共享波形集成感知与通信(ISAC)。

Comments This manuscript has been submitted for possible publication

详情
AI中文摘要

基于匹配滤波的脉冲压缩分布式声学传感(DAS)存在非零压缩旁瓣,导致确定性距离单元间泄漏,即空间符号间干扰(ISI),并在重建的瑞利背向散射迹中产生虚假响应。我们提出一种用于$\phi$-OTDR的循环前缀正交频分复用(CP-OFDM)DAS系统,使用承载数据的CP-OFDM波形作为传感探头。该系统还恢复前向通信数据,初步展示了共享波形集成感知与通信(ISAC)。据我们所知,这是首次将分布式瑞利背向散射建模为有限记忆传感多径信道。基于该模型,我们证明,如果有用OFDM和CP长度覆盖传感多径记忆,则去除CP、单抽头频域均衡和逆离散傅里叶变换可重建每个距离单元系数,且无确定性波形引起的空间ISI,从而实现无空间ISI的相位解调。在模拟的5.2公里链路上,组内间隔5.31–5.83米的十个同时强、弱事件,所提接收机抑制了事件外泄漏,并将相位迹均方误差相比匹配滤波脉冲压缩提升高达29.55 dB。在5.2公里光纤链路的相干外差实验中,占用带宽111.984 MHz,在5 V和1 V驱动下,500 Hz PZT振动分别被盲定位在5.071公里和5.066公里处,其波形恢复的相关系数分别为0.990和0.962。同一承载数据探头还恢复了一幅图像,误码率为零,误差矢量幅度中位数为-23.14 dB。这些结果验证了CP-OFDM辅助的频域信道重构用于无空间ISI的DAS,并展示了其在共享波形光纤ISAC中的潜力。

英文摘要

Matched-filter-based pulse-compression distributed acoustic sensing (DAS) suffers from nonzero compression sidelobes that cause deterministic inter-range-bin leakage, i.e., spatial inter-symbol interference (ISI), and false responses in reconstructed Rayleigh-backscatter traces. We propose a cyclic-prefix orthogonal frequency-division multiplexing (CP-OFDM) DAS system for $ϕ$-OTDR, using a data-bearing CP-OFDM waveform as the sensing probe. It also recovers forward communication data, providing an initial demonstration of shared-waveform integrated sensing and communication (ISAC). To our knowledge, this is the first formulation of distributed Rayleigh backscattering as a finite-memory sensing multipath channel. Based on this formulation, we prove that, if the useful OFDM and CP lengths cover the sensing multipath memory, CP removal, one-tap frequency-domain equalization, and inverse discrete Fourier transform reconstruct each range-bin coefficient without deterministic waveform-induced spatial ISI, enabling spatial-ISI-free phase demodulation. For a simulated 5.2-km link with ten simultaneous strong and weak events spaced by 5.31--5.83 m within groups, the proposed receiver suppresses off-event leakage and improves phase-trace mean-square error by up to 29.55 dB over matched-filter pulse compression. In a heterodyne coherent experiment over a 5.2-km fiber link with 111.984-MHz occupied bandwidth, 500-Hz PZT vibrations are blindly localized at 5.071 and 5.066 km under 5- and 1-V drives, respectively, and their waveforms are recovered with correlation coefficients of 0.990 and 0.962. The same data-bearing probe also recovers an image with zero measured bit-error rate and a median error vector magnitude of -23.14 dB. These results validate CP-OFDM-aided frequency-domain channel reconstruction for spatial-ISI-free DAS and demonstrate its potential for shared-waveform optical-fiber ISAC.

2606.19720 2026-06-19 eess.SP 新提交

An Optimization Framework for Certain Separable Problems using Neural Networks

基于神经网络的特定可分离问题优化框架

Rohit Negi, Soummya Kar

AI总结 针对参数可分离的约束优化问题,提出离线学习与在线处理两阶段策略,利用ADMM和神经网络降低在线计算复杂度。

Comments 15 pages, 5 figures

详情
AI中文摘要

本文研究一类由实时应用驱动的参数约束优化问题。在参数可分离问题结构下,提出基于离线学习和在线处理的两阶段策略,以在资源受限设备上解决这些优化问题。具体地,利用可分离结构,开发了基于交替方向乘子法(ADMM)的迭代求解过程,该过程允许使用基于学习的函数表示(离线学习但在线可快速计算)来降低整体在线设备实现复杂度。通过精心设计ADMM过程,表明即使参数变化,参数优化问题的相应实例也可通过设备上的轻量级在线计算,借助神经网络协处理器求解。

英文摘要

This paper studies a class of parametric constrained optimization problems that are motivated by applications in real time applications. Under a parameter-separable problem structure that naturally arises in these applications, the paper proposes a two phase strategy, based on offline learning and online processing, to address these optimization problems on resource limited devices. Specifically, by exploiting the separable structure, an iterative Alternating Direction Method of Multipliers (ADMM) based solution procedure is developed that enables the use of certain learning based function representations (learned offline but readily computable online) to reduce the overall online on-device implementation complexity. By carefully crafting the ADMM procedure, it is shown that even as the parameters vary, the corresponding instances of the parametric optimization problem may be solved by lightweight online computations in the device with the assistance of a neural network co-processor.

2606.19666 2026-06-19 eess.SP 新提交

Degrees of Freedom and Beamforming for Large Intelligent Surfaces

大规模智能表面的自由度与波束赋形

Jiawang Li, Alireza Saberkari, Buon Kiong Lau, Mats Gustafsson

AI总结 通过互阴影面积闭式表达式估计大规模智能表面(LIS)的空间自由度(DoF),并验证其与数值奇异值谱的吻合;基于DoF分析设计采样方案和波束赋形,证明可形成约DoF数量的独立波束,超过此限会导致干扰增加;极化研究表明电场分量对DoF贡献不均,总场DoF为单极化分量的两倍。

详情
AI中文摘要

空间自由度(DoF)、采样和波束赋形是多用户大规模智能表面(LIS)的基础,其中电磁场必须在多个近场位置进行成形、分辨和聚焦。本文利用互阴影面积的闭式表达式,针对代表性LIS配置估计了DoF数量。通过数值奇异值谱验证了所得DoF预测,其谱膝点与理论估计紧密吻合。对于线源配置,通过将源或观测线划分为单位DoF区间,开发了一种解析采样方案,从而能够选择空间样本。使用最大比传输和迫零的波束赋形结果表明,可以形成大约DoF数量的独立波束。试图超过此限制会导致干扰增加和性能下降。对于基于表面的LIS配置,采样点则通过离散经验插值方法数值确定。相应的波束赋形结果进一步证实,目标区域可以支持大约与DoF分析预测数量相同的独立波束。最后,一项极化感知研究表明,电场分量对DoF的贡献不相等,且总场DoF是单极化分量DoF的两倍。

英文摘要

Spatial degrees of freedom (DoF), sampling, and beamforming are fundamental to multi-user large intelligent surfaces (LISs), where electromagnetic fields must be shaped, resolved, and focused at multiple near-field locations. This work estimates the number of DoF using closed-form expressions derived from the mutual shadow area for representative LIS configurations. The resulting DoF predictions are validated through numerical singular-value spectra, whose spectral knee points closely match the theoretical estimates. For line-source configurations, an analytic sampling scheme is developed by partitioning the source or observation line into unit-DoF intervals, enabling the selection of spatial samples. Beamforming results using maximum-ratio transmission and zero-forcing demonstrate that approximately the number of DoF independent beams can be formed. Attempting to exceed this limit results in increased interference and degraded performance. For surface-based LIS configurations, sampling points are instead determined numerically using the discrete empirical interpolation method. The corresponding beamforming results further confirm that the target region can support approximately as many independent beams as predicted by the DoF analysis. Finally, a polarization-aware study reveals that the electric-field components contribute unequally to the DoF and that the total-field DoF is twice that of a single polarization component.

2606.19536 2026-06-19 eess.SP 新提交

Multistatic J-Band Radar TX/RX Chipset in SiGe BiCMOS with Integrated x16 Frequency Multiplier Chain and High EIRP

采用SiGe BiCMOS工艺的集成x16倍频链和高EIRP的多基地J波段雷达收发芯片组

Stephan Hauptmeier, Kennet Braasch, Till Ziegler-Bellenberg, Diana P. Cortes N., Tobias T. Braun, Michael Höft, Nils Pohl

AI总结 本文设计并测量了一种多基地J波段雷达芯片组,包含集成x16倍频链的发射和接收MMIC,实现了高EIRP和远距离探测。

详情
AI中文摘要

本文介绍了一种多基地J波段雷达芯片组的设计与测量,该芯片组包括一个发射机和一个接收机MMIC,两者均集成了$\ imes$16倍频链,用于低频本振分配和可扩展雷达配置。多基地雷达架构可以同时维持高发射功率和高接收灵敏度,这一优势在本芯片组中得到了充分利用。为此,发射机MMIC上集成的四路功率合成放大器链提供了11.2 dBm的输出功率。在292 GHz下,使用准直PTFE透镜时测得的EIRP为41 dBm,无透镜时为8.8 dBm。尽管倍频因子较高,但片上谐波抑制优于24 dBc,而通过多个滤波器级实现了约50 dBc的辐射带内谐波抑制。接收机MMIC包含三级低噪声放大器,在292 GHz下整体转换增益为43.3 dB。集成的片上贴片天线便于系统集成,并可使用高方向性介质透镜,使该芯片组适用于长达150米的远距离雷达测量。MMIC采用130 nm SiGe BiCMOS工艺实现,其f_T和f_max分别为500 GHz和610 GHz。

英文摘要

This work presents the design and measurement of a multistatic J-band radar chipset comprising a transmitter and a receiver MMIC both featuring an integrated $times$16 frequency multiplier chain for low-frequency local-oscillator distribution and scalable radar configurations. Multistatic radar architectures can sustain high transmission power and high receiver sensitivity simultaneously an advantage that is fully leveraged in the present chipset. To this end a four-way power-combining amplifier chain integrated on the transmitter MMIC delivers an output power of 11.2 dBm. The resulting measured EIRP is 41 dBm at 292 GHz with a collimating PTFE lens and 8.8 dBm without a lens. Despite the high frequency-multiplication factor an on-chip harmonic rejection better than 24 dBc was measured while a radiated in-band harmonic rejection of approximately 50 dBc was achieved through multiple filter stages. The receiver MMIC incorporates a three-stage low-noise amplifier and exhibits an overall conversion gain of 43.3 dB at 292 GHz. Integrated on-chip patch antennas facilitate system integration and the use of highly directive dielectric lenses making the chipset suitable for long-range radar measurements which are demonstrated up to 150 m. The MMICs are realized in a 130 nm SiGe BiCMOS technology with an f_T and f_max of 500 GHz and 610 GHz respectively.

2606.20074 2026-06-19 eess.SP cs.AI cs.LG 新提交

Evaluation of EEG Foundation Models for Event-Based Burst-Suppression Detection in ICU

用于ICU中基于事件的爆发-抑制检测的EEG基础模型评估

Elisa Vasta, Thorir Mar Ingolfsson, Andrea Cossettini, Luca Benini, Tilman Beck, Emanuela Keller, Una Pale

AI总结 本研究首次评估EEG基础模型在ICU中无需患者校准的爆发检测性能,REVE-base模型在事件级F1分数上达到0.868,并将每分钟爆发错误率分别降低52.1%和36.2%。

Comments 4 pages, 1 figure. Code available upon publication

详情
AI中文摘要

爆发抑制(BS)是一种临床相关的脑电图(EEG)模式,用于监测危重患者的镇静深度和脑活动,特别是在重症监护病房(ICU)的诱导昏迷期间。自动爆发检测仍然具有挑战性,因为BS模式在不同患者之间差异很大,且标注数据集稀缺。最近,EEG基础模型(FMs)在多个下游EEG应用中显示出前景,但它们在BS检测中的实用性尚未被探索。我们提出了第一项研究,评估EEG FMs在减少导联的ICU EEG中无需患者校准的爆发检测性能。我们将REVE-base、LUNA-large和LuMamba-Tiny与自适应阈值基线以及任务特定的EEGNet基线进行比较。此外,我们补充了基于事件的爆发检测评估,以替代传统的EEG窗口分类。这有助于临床评估爆发事件是否被正确检测,减少预期标注变异性的影响。最佳模型REVE-base取得了最高的事件级F1分数($0.868 \pm 0.167$),并且与EEGNet和自适应阈值相比,分别将每分钟爆发错误减少了52.1%和36.2%,支持了FMs在ICU中可扩展的EEG监测。消融实验表明,与冻结骨干训练、两步微调和基于LoRA的适应相比,全微调是最有效的适应策略,对于LUNA-large,事件级F1分数比冻结骨干训练提高了最多$+0.102$。在减少标注数据集的情况下,预训练的REVE-base在25%的队列中比随机初始化高出$+0.723$事件级F1点,证明了在有限标注数据下适应爆发检测时预训练FM表示的优势。

英文摘要

Burst suppression (BS) is a clinically relevant electroencephalographic (EEG) pattern used to monitor sedation depth and brain activity in critically ill patients, particularly during induced coma in Intensive Care Units (ICUs). Automatic burst detection remains challenging because BS patterns vary substantially between patients and annotated datasets are scarce. Recently, EEG Foundation Models (FMs) have shown promise across several downstream EEG applications, but their usefulness for BS detection remains unexplored. We present the first study to evaluate EEG FMs for burst detection in reduced-montage ICU EEG without patient-specific calibration. We compare REVE-base, LUNA-large and LuMamba-Tiny with an adaptive thresholding baseline and a task-specific EEGNet baseline. Additionally, we complement conventional EEG window-based classification with event-based burst detection evaluation. This helps assessing clinically whether burst episodes are correctly detected, reducing the impact of expected annotation variability. The best model, REVE-base, achieved the highest event-based F1-score ($0.868 \pm 0.167$) and reduced burst-per-minute error by 52.1% and 36.2% compared to EEGNet and adaptive thresholding respectively, supporting FMs for scalable EEG monitoring in ICU. Ablation experiments showed that full fine-tuning was the most effective adaptation strategy with respect to frozen-backbone training, two-step fine-tuning, and LoRA-based adaptation, improving event-based F1-score over frozen-backbone training by up to $+0.102$ for LUNA-large. With reduced labeled datasets, pretrained REVE-base outperformed random initialization by $+0.723$ event-based F1 points at 25% of the cohort, demonstrating the benefit of pretraining FM representations when adapted to burst detection with limited labeled data.

2606.20413 2026-06-19 eess.SP cs.IT math.IT 新提交

Hybrid TRP-UE Sensing for Enhanced Target Localization

混合TRP-UE感知用于增强目标定位

Necati Kagan Erkek, Marco Di Renzo, Arman Shojaeifard, Yasser Mestrah, Remun Koirala, Mohammad Heggo, Kunjan Shah

AI总结 提出一种混合TRP-UE感知机制,利用UE辅助感知提升网络感知性能,在室内工厂等复杂传播环境下显著改善目标定位精度。

Comments 6 pages

详情
AI中文摘要

集成感知与通信(ISAC)指的是网络在提供通信服务的同时,能够以可扩展的方式感知环境的能力。ISAC的关键功能之一是对无源和移动感知目标的精确定位。本文介绍了一种新颖的混合TRP-UE感知机制,该机制提升了基于网络的感知性能。使用符合3GPP标准的ISAC信道模型提供了评估结果。结果表明,在室内工厂等具有挑战性的传播环境中,用UE辅助感知补充基于TRP的感知具有显著优势。

英文摘要

Integrated Sensing and Communication (ISAC) refers to the capability for the network to provide communications services whilst also being able to sense the environment in a scalable manner. One of the key functions of ISAC is the accurate localization of passive and mobile sensing targets. This paper introduces a novel hybrid TRP-UE sensing mechanism that improves network-based sensing performance. Evaluation results are provided using 3GPP-compliant ISAC channel models. The results demonstrate the significant benefit in complimenting TRP-based sensing with UE-assisted sensing in challenging propagation environments such as indoor factory.

2606.19715 2026-06-19 eess.SP cs.IT math.IT 新提交

Generalized Pinching-Antenna Systems: A Radio-Stripe-Based Realization

广义夹捏天线系统:基于无线电条带的实现

Yanqing Xu, Zhiguo Ding, Tsung-Hui Chang

AI总结 本文提出基于无线电条带(RS)的广义夹捏天线(RS-GPA)框架,通过主动天线处理单元实现位置灵活的无线接入,并开发稀疏激活与波束成形算法以降低总功耗。

Comments 13 pages, 7 figures

详情
AI中文摘要

本文研究无线电条带(RS)作为广义夹捏天线的实际实现,并提出基于RS的广义夹捏天线(RS-GPA)框架。与依赖导波到自由空间被动耦合的介质波导基被动夹捏天线不同,RS采用沿共享电缆部署的主动天线处理单元(APU)进行本地传输、接收和信号处理。这种类似电缆的主动架构提供了灵活的安装和广泛的频率适用性,同时允许选定的APU作为离散且可控的辐射或接收点,实现位置灵活的无线接入。基于所提出的RS-GPA框架,我们通过考虑距离相关的APU-用户信道建立了系统和信道模型。对于下行传输,我们提出了一个电路功率感知的稀疏APU激活和波束成形问题,并开发了一种重加权群稀疏波束成形算法。为了揭示激活原理,我们分析了单用户下行情况,并通过平衡发射功率节省和电路功率成本来刻画何时应激活额外的APU。受此启发,提出了一种几何引导的低复杂度多用户算法。对于上行传输,我们提出了一个联合APU激活和用户功率控制问题,并开发了一种几何引导的稀疏激活设计。数值结果表明,与基准方案相比,所提出的RS-GPA框架显著降低了总功耗,而几何引导算法在运行时间显著降低的情况下实现了与群稀疏设计几乎相同的功耗性能。

英文摘要

This paper investigates radio stripes (RSs) as a practical realization of generalized pinching antennas and proposes an RS-based generalized pinching-antenna (RS-GPA) framework. Unlike dielectric-waveguide-based passive pinching antennas that rely on passive coupling from a guided wave into free space, RSs employ active antenna processing units (APUs) deployed along a shared cable for local transmission, reception, and signal processing. This cable-like active architecture offers flexible installation and broad frequency applicability, while allowing selected APUs to act as discrete and controllable radiation or reception points for location-flexible wireless access. Based on the proposed RS-GPA framework, we establish the system and channel models by accounting for the distance-dependent APU-user channels. For downlink transmission, we formulate a circuit-power-aware sparse APU activation and beamforming problem and develop a reweighted group-sparse beamforming algorithm. To reveal the activation principle, we analyze the single-user downlink case and characterize when an additional APU should be activated by balancing transmit-power saving and circuit-power cost. Inspired by this insight, a geometry-guided low-complexity multiuser algorithm is proposed. For uplink transmission, we formulate a joint APU activation and user power control problem and develop a geometry-guided sparse activation design. Numerical results show that the proposed RS-GPA framework substantially reduces the total consumed power compared with benchmark schemes, while the geometry-guided algorithm achieves near-identical consumed-power performance to the group-sparse design with significantly lower runtime.

2606.18196 2026-06-19 eess.SP 新提交

Receiver-Aware Analysis and Verification of the Spectral Separation Coefficient Under Interference-Induced Degradation

接收机感知的干扰诱导退化下频谱分离系数的分析与验证

Lucas Heublein, Fabian Benschuh, Alexander Rügamer, Felix Ott

AI总结 本文通过引入接收机前端特性计算依赖接收机的频谱分离系数(SSC),并利用真实和仿真数据集实验验证了干扰影响计算的鲁棒性。

Comments 7 pages, 4 figures

详情
AI中文摘要

干扰对基于卫星的定位系统构成重大挑战,因此准确量化特定干扰类型对接收机性能以及由此产生的位置计算可靠性的影响至关重要。当前实践中,干扰影响通常使用与接收机无关的指标进行量化,而接收机特定的前端特性要么被理想化,要么仅被隐含考虑。在本文中,我们通过将接收机特定的前端特性明确纳入干扰影响的计算中,并通过实验验证所得的依赖接收机的分析,来解决这一局限性。因此,我们记录了一个包含210个不同干扰场景的真实世界开放场数据集,并针对特定接收机模块计算了依赖接收机的频谱分离系数(SSC)和干扰影响。此外,我们使用由射频星座模拟器(RFCS)生成的受控数据集验证了计算,该模拟器采用相同的接收机模块并回放类似的干扰类别。两种环境下获得的结果比较证明了干扰影响计算的鲁棒性。

英文摘要

Interference poses a significant challenge to satellite-based positioning systems, making it essential to accurately quantify the effects of specific interference types on receiver performance and the resulting reliability of position computation. In current practice, interference effects are often quantified using receiver-independent metrics, with receiver-specific front-end characteristics either idealized or only implicitly considered. In this paper, we address this limitation by explicitly incorporating receiver-specific front-end characteristics into the computation of interference effects and validating the resulting receiver-dependent analysis experimentally. Therefore, we record a real-world open-field dataset comprising 210 distinct interference scenarios and compute the receiver-dependent spectral separation coefficient (SSC) and interference impact for a specific receiver module. Furthermore, we verify the computation using a controlled dataset generated with a radio frequency constellation simulator (RFCS), employing the same receiver module and replaying similar interferences classes. The comparison of results obtained in both environments demonstrates the robustness of the interference impact computation.

2606.19797 2026-06-19 eess.AS cs.AI cs.SD eess.SP 交叉投稿

Improving End-to-End Speech Recognition for Dysarthric Speech through In-Domain Data Augmentation

通过域内数据增强改进构音障碍语音的端到端语音识别

Paban Sapkota, Hemant Kumar Kathania, Sudarsana Reddy Kadiri, Shrikanth Narayanan

AI总结 针对构音障碍语音识别中数据稀缺和严重程度差异的问题,本文探索了四种数据增强方法(SRM、PM、FM、VTLP)对预训练Wav2Vec2模型进行微调,在不同严重程度上实现了显著的字错误率降低。

详情
AI中文摘要

构音障碍语音识别对于促进构音障碍患者之间的有效沟通至关重要。然而,由于严重程度不同和数据可用性有限,准确识别构音障碍语音面临重大挑战。在本文中,我们通过微调端到端预训练Wav2Vec2模型,探索了针对构音障碍自动语音识别(ASR)系统的数据增强技术,特别关注严重程度级别。为了解决数据稀缺以及微调预训练ASR系统用于构音障碍语音时需要大量数据的问题,我们研究了四种主要的数据增强方法:语速修改(SRM)、音高修改(PM)、共振峰修改(FM)和声道长度扰动(VTLP),这些方法针对构音障碍的不同方面进行了调整。本研究使用为每个严重程度类别单独微调的Wav2Vec2模型作为基线系统。此外,我们使用增强数据对ASR模型进行了特定严重程度的微调。结果表明,每种增强技术在不同严重程度级别上表现出不同的有效性模式。对于\textit{低}(9.02%)和\textit{中}(38.11%)严重程度,使用SRM($s$=0.8)获得了最佳WER;对于\textit{高}严重程度(55.15%),使用PM($\ au$=0.8)获得了最佳WER,分别相对改进了30.02%、16.64%和15.47%。这些结果证实了增强方法在提高构音障碍ASR性能方面的有效性。

英文摘要

Dysarthric speech recognition is crucial for facilitating effective communication among individuals with dysarthria. However, accurately recognizing dysarthric speech poses significant challenges due to varying severity levels and limited data availability. In this paper, we explore data augmentation techniques for dysarthric automatic speech recognition (ASR) systems by fine-tuning the End-to-End pre-trained Wav2Vec2 model, with a specific focus on severity levels. To address the challenges of data scarcity and the need for extensive data in fine-tuning pre-trained ASR systems for dysarthric speech, we investigate four prominent data augmentation methods: Speaking-Rate Modification (SRM), Pitch Modification (PM), Formant Modification (FM), and vocal tract Length Perturbation (VTLP), tailored to different aspects of dysarthria. The study uses individually fine-tuned Wav2Vec2 models for each severity class as baseline systems. Additionally, we conducted severity-specific fine-tuning of the ASR model using augmented data. Results demonstrate distinct efficacy patterns for each augmentation technique across severity levels. The best WERs were achieved with SRM ($s$=0.8) for \textit{low} (9.02\%) and \textit{medium} (38.11\%) severities, and with PM ($τ$=0.8) for \textit{high} severity (55.15\%), reflecting relative improvements of 30.02\%, 16.64\%, and 15.47\%, respectively. These results confirm the effectiveness of the augmentation methods in improving dysarthric ASR performance.

2606.19793 2026-06-19 eess.AS cs.AI cs.LG cs.SD eess.SP 交叉投稿

Systematic Study of Dysarthric Speech Recognition: Spectral Features and Acoustic Models

构音障碍语音识别的系统研究:频谱特征与声学模型

Paban Sapkota, Hemant Kumar Kathania, Mikko Kurimo, Sudarsana Reddy Kadiri, Shrikanth Narayanan

AI总结 本文系统研究不同频谱特征与声学模型的组合,通过引入音高特征和优化训练帧重叠数,在F-TDNN模型上实现孤立词和句子识别相对提升4.65%和4.63%。

详情
AI中文摘要

识别构音障碍语音的挑战主要源于发音精度受损导致的显著声学变异性。过去的研究表明,通过使用混合DNN/HMM序列区分性训练可以改善识别性能。本文对不同声学模型定制的各种声学特征组合进行了全面研究,为每种模型提供了合适的特征选择。音高特征的引入显著提高了识别性能,特别是对于涉及构音障碍语音的句子识别任务。通过对TORGO数据库的系统检查,我们证明了增强最先进的因子化时延神经网络(F-TDNN)模型识别构音障碍语音性能的潜力。使用F-TDNN模型实现的方法,与先前研究相比,在构音障碍语音的孤立词识别中获得了4.65%的相对改进,在句子识别中获得了4.63%的相对改进。这种改进有效补偿了语音变异性,这归因于我们精心选择了连续训练样本块之间的重叠帧数。

英文摘要

The challenge associated with recognizing dysarthric speech primarily arises from pronounced acoustic variability attributed to impaired articulatory precision. Past research has demonstrated improved recognition through the use of hybrid DNN/HMM sequence discriminative training. This paper presents a comprehensive investigation of various combinations of acoustic features tailored to different Acoustic Models, offering suitable feature selections for each. The incorporation of Pitch features notably improved recognition performance, especially for sentence recognition tasks involving dysarthric speech. Through a systematic examination of the TORGO database, we have demonstrated the potential to enhance the performance of the state-of-the-art Factorized Time Delay Neural Network (F-TDNN) model for recognizing dysarthric speech. Our methods, implemented with the F-TDNN model, resulted in a 4.65\% relative improvement in isolated word recognition and a 4.63\% relative improvement in sentence recognition for dysarthric speech, compared to previous research. This improvement effectively compensates for speech variability, attributable to our deliberate selection of the number of overlapping frames between consecutive training example chunks.

2606.19398 2026-06-19 cs.SD eess.AS eess.SP 交叉投稿

S-JEPA : Soft Clustering Anchors for Self-Supervised Speech Representation Learning

S-JEPA:用于自监督语音表示学习的软聚类锚点

Georgios Ioannides, Adrian Kieback, Judah Goldfeder, Linsey Pang, Aman Chadha, Aaron Elkins, Yann LeCun, Ravid Shwartz-Ziv

发表机构 * Carnegie Mellon University(卡内基梅隆大学) New York University(纽约大学) James Silberrad Brown Center for AI(詹姆斯·西尔伯拉德·布朗人工智能中心) Columbia University(哥伦比亚大学) Northeastern University(东北大学) Stanford University(斯坦福大学) Amazon GenAI(亚马逊生成式人工智能)

AI总结 提出S-JEPA,通过KL散度匹配高斯混合模型的软后验概率训练编码器-预测器对,无需离线重聚类或教师蒸馏,在SUPERB协议下以低于90M参数取得最低WER,并建立新的帕累托前沿。

详情
AI中文摘要

自监督语音编码器主要通过预测掩蔽位置处的离散硬聚类ID进行训练,这种方法会坍缩类别边界处的声学模糊性,并需要在迭代之间中断训练以对整个语料库进行重聚类。我们提出S-JEPA,一种JEPA风格的编码器-预测器对,通过KL散度训练以匹配掩蔽位置处高斯混合模型的软后验概率。训练作为连续优化轨迹分两个阶段进行:首先在MFCC特征上使用固定GMM,然后在编码器特征上使用在线GMM,输入层从无标签信号中自适应选择,从而消除了离线重聚类步骤以及手动选择聚类所在Transformer层的问题。在SUPERB协议下,S-JEPA在评估的低于90M参数的自监督方法中实现了最低的词错误率(WER),并在大约一半参数量的情况下在情感识别任务上与HuBERT-Base相当,无需离线重聚类或教师蒸馏即建立了新的帕累托前沿。对预测器在保留语音上的每帧熵的分析揭示了双峰分布,其中相当一部分帧的熵接近完美两聚类平局的熵,这直接经验性地证明了软目标目标保留了硬目标会坍缩的声学模糊性。代码可在以下网址获取:https://this https URL。

英文摘要

Self-supervised speech encoders are predominantly trained by predicting discrete hard cluster IDs at masked positions, a recipe that collapses acoustic ambiguity at category boundaries and requires interrupting training to re-cluster the entire corpus between iterations. We introduce S-JEPA, a JEPA-style encoder-predictor pair trained to match the soft posteriors of a Gaussian Mixture Model at masked positions via KL divergence. Training runs as one continuous optimization trajectory in two phases: a fixed GMM over MFCC features, then an online GMM over encoder features, with the input layer selected adaptively from a label-free signal, removing both the offline re-cluster step and the hand-tuned choice of which transformer layer to cluster on. Under the SUPERB protocol, S-JEPA achieves the lowest WER among evaluated SSL methods below 90M parameters and matches HuBERT-Base on emotion recognition at roughly half its parameter count, establishing a new Pareto frontier without offline re-clustering or teacher distillation. An analysis of the predictor's per-frame entropy on held-out speech reveals a bimodal distribution with a substantial minority of frames near the entropy of a perfect two-cluster tie, providing direct empirical evidence that the soft-target objective preserves the acoustic ambiguity that hard targets would collapse. Code is available at https://github.com/gioannides/s-jepa.

2606.19366 2026-06-19 cs.LG cs.AI eess.SP 交叉投稿

Information Lattice Learning as Probabilistic Graphical Model Structure Learning

信息格学习作为概率图模型结构学习

Haizi Yu, Lav R. Varshney

发表机构 * Kocree, Inc.(Kocree公司) AI Innovation Institute, Stony Brook University(石溪大学人工智能创新研究所)

AI总结 将信息格学习(ILL)解释为概率图模型结构学习,通过投影到分区格上学习可解释规则,并建立与最大熵和因子图的联系。

详情
AI中文摘要

信息格学习(ILL)通过将信号交替投影到编码抽象层次结构的分区格上,并将选定的规则提升回信号域,来学习信号的可解释规则。当信号是概率质量函数时,我们证明ILL学习的概率规则具有自然的概率图模型(PGM)解释,并详细发展了这一解释。ILL中的分区诱导出一个确定性的商变量,规则是该商变量的边际分布。因此,规则集是可解释抽象上的边际约束集合。一般提升是满足这些约束的所有联合分布的可行族,而特殊提升则选择最大无知重建,在ILL中通过L2均匀性原理实现,该原理与最大熵密切相关。在香农熵提升下,相同的约束产生一个对数线性因子图,其因子由学习的抽象索引。然而,信息格本身不是贝叶斯网络:其边编码抽象的细化与粗化,而非条件依赖。因此,ILL最好被视为商变量上可解释的基于约束的因子图的结构学习。这一观点阐明了ILL如何与图模型和最大熵模型相关,同时为推理、可识别性和混合符号-概率学习提出了新方向。

英文摘要

Information lattice learning (ILL) learns interpretable rules of a signal by alternately projecting the signal onto a partition lattice that encodes a hierarchy of abstractions and lifting selected rules back to the signal domain. When the signal is a probability mass function, we show the probabilistic rules learned by ILL admit a natural probabilistic graphical model (PGM) interpretation and develop this interpretation in detail. A partition in ILL induces a deterministic quotient variable, and a rule is the marginal law of that quotient variable. A rule set is therefore a collection of marginal constraints over interpretable abstractions. General lifting is the feasible family of all joint distributions satisfying those constraints, while special lifting chooses a maximum-ignorance reconstruction, implemented in ILL by an L2 uniformity principle closely related to maximum entropy. Under a Shannon-entropy lifting, the same constraints yield a log-linear factor graph whose factors are indexed by learned abstractions. The information lattice itself, however, is not a Bayesian network: its edges encode refinement and coarsening of abstractions, not conditional dependence. Thus ILL is best viewed as structure learning for interpretable constraint-based factor graphs over quotient variables. This view clarifies how ILL relates to graphical models and maximum entropy models, while suggesting new directions for inference, identifiability, and hybrid symbolic-probabilistic learning.

2606.20098 2026-06-19 cs.IT eess.SP math.IT 交叉投稿

Site-Specific MIMO Channel Generation via Diffusion and Flow Matching: Fidelity, Efficiency, and Downstream Utility

基于扩散和流匹配的特定场地MIMO信道生成:保真度、效率与下游效用

Sina Beyraghi, Masoud Sadeghian, Firdous Bin Ismail, Angel Lozano, Paul Almasan, Giovanni Geraci

AI总结 本文比较条件去噪扩散隐式模型(cDDIM)和条件流匹配模型(cFMM)生成特定场地MIMO信道数据,cFMM在保持质量的同时推理速度快一个数量级,合成数据能显著提升下游物理层任务性能。

详情
AI中文摘要

本文探索使用生成模型合成高质量的、特定场地的多输入多输出(MIMO)信道数据,以解决为AI原生无线网络获取真实数据所需的大量测量活动的高成本问题。比较了两种位置条件生成范式:条件去噪扩散隐式模型(cDDIM)和条件流匹配模型(cFMM)。这两种模型都根据用户坐标生成MIMO信道矩阵,以保持部署场地的空间结构。从三个维度评估这些方法:统计保真度(包括波束一致性和有效秩)、生成效率以及在下游任务中的效用,例如信道状态信息压缩和波束对齐。在多种传播场景(28 GHz和3.5 GHz,视距和非视距)下的结果表明,即使在训练数据稀缺的情况下,两种模型都能准确捕捉特定场地的特征。值得注意的是,cFMM实现了与cDDIM相当的质量,但推理时间大约少一个数量级。与仅使用稀缺数据或随机信道相比,用这些合成信道扩充稀缺的特定场地数据集在下游物理层任务中带来了显著的性能提升。

英文摘要

This paper explores the use of generative models to synthesize high-quality, site-specific multiple-input multiple-output (MIMO) channel data, addressing the high cost of the extensive measurement campaigns required to acquire real-world data for AI-native wireless networks. Two location-conditioned generative paradigms are compared: a conditional denoising diffusion implicit model (cDDIM), and a conditional flow matching model (cFMM). Both these models generate MIMO channel matrices conditioned on user coordinates, to preserve the spatial structure of the deployment site. The approaches are evaluated across three dimensions: statistical fidelity (including beam consistency and effective rank), generation efficiency, and utility in downstream tasks such as channel-state information compression and beam alignment. Results across diverse propagation scenarios (28 GHz and 3.5 GHz, both line-of-sight and non-line-of-sight) demonstrate that both models accurately capture site-specific characteristics, even when trained on scarce ground-truth data. Notably, cFMM achieves a quality comparable to cDDIM with roughly an order of magnitude less inference time. Augmenting scarce site-specific datasets with these synthetic channels yields hefty performance gains in downstream physical layer tasks compared to using scarce data alone or stochastic channels.

2606.16057 2026-06-19 cs.RO cs.SY eess.SP eess.SY 交叉投稿

A Smart-Scheduled Hybrid (SSH) EKF-FGO State Estimation

一种智能调度混合(SSH)EKF-FGO状态估计方法

Eric Levy, Soosan Beheshti

发表机构 * GitHub arXiv

AI总结 本文通过智能调度混合EKF-FGO框架,实验性地将优化调度作为独立设计变量,研究其在平衡估计精度与计算成本中的作用,并在平面SLAM仿真中验证了调度对预优化漂移、瞬态误差和运行时间的显著影响。

Comments This work has been accepted for presentation/publication at the 2026 IEEE Canadian Conference on Electrical and Computer Engineering (CCECE). The final published version will appear in IEEE Xplore

详情
AI中文摘要

在机器人学和控制中,可靠的状态估计需要在估计精度和计算成本之间取得平衡。虽然基于滤波的方法(如扩展卡尔曼滤波器,EKF)提供高效的实时更新,而使用因子图的优化公式化方法改善全局一致性,但优化调度的作用通常被隐式处理,而非作为明确的设计变量进行研究。本文提出了一项实验研究,通过使用智能调度混合(SSH)EKF-FGO框架作为受控测试平台,明确隔离了优化调度。通过将基于EKF的状态传播与定期调用的批量优化相结合,并保持求解器结构和计算量固定,本文的主要贡献是实验性地将优化调度表征为一个独立的设计变量,它控制着中间估计精度与计算成本之间的权衡。在平面SLAM环境中的仿真结果表明,调度强烈影响预优化漂移、瞬态误差行为和运行时间。特别是,结果识别出一些操作区域,在这些区域中,全局优化的大部分好处可以以一小部分计算成本保留,从而突显了优化调度作为混合状态估计系统中一个未被充分探索但至关重要的考虑因素。

英文摘要

Reliable state estimation in robotics and control re quires balancing estimation accuracy against computational cost. While filtering-based methods such as the Extended Kalman Filter (EKF) provide efficient real-time updates, and optimisation based formulations using factor graphs improve global consistency, the role of optimisation scheduling is often treated implicitly rather than examined as an explicit design variable. This paper presents an experimental study that explicitly isolates optimisation scheduling using a Smart Scheduled Hybrid (SSH) EKF-FGO framework as a controlled testbed. By combining EKF-based state propagation with periodically invoked batch optimisation and holding solver structure and effort fixed, the main contribution of this work is the experimental characterisation of optimisation scheduling as an independent design variable governing the trade-off between intermediate estimation accuracy and computational cost. Simulation results in a planar SLAM environment show that scheduling strongly influences pre optimisation drift, transient error behaviour, and runtime. In particular, the results identify operating regimes in which most of the benefit of global optimisation can be retained at a fraction of the computational cost, highlighting optimisation scheduling as an under-explored yet critical consideration in hybrid state estimation systems.

2605.02989 2026-06-19 cs.IT eess.SP math.IT stat.ML 版本更新

Information Theory and Statistical Learning

信息论与统计学习

Abbas El Gamal

AI总结 本文是Cover & Thomas《信息论基础》第三版的章节预印本,系统介绍了散度度量在模型训练中的作用,涵盖线性回归、生成扩散模型等,并给出了扩散模型更系统的推导。

详情
AI中文摘要

本手稿包含即将出版的《Cover and Thomas信息论基础》第三版中一章的预印本,经Wiley许可发布。新版的目录EIT-3 ToC可在此https URL找到。反馈请联系abbas@ee. this http URL。学习与信息论在模型训练和基本性能极限的表征中均有交叉。本手稿对第一个交叉点进行了简洁易懂的处理,仅需高年级本科生或一年级研究生水平的信息论和统计学基础知识。章末习题使材料既适合课堂使用也适合自学。本章重点讨论散度度量在模型训练中的作用,示例涵盖从线性回归、逻辑回归到自回归模型、变分自编码器、扩散模型、生成对抗网络和基于分数的模型。介绍了证据下界(ELBO)、f-散度和Fisher散度。特别是,对生成扩散模型的处理提供了比文献中更系统、更明确的推导。

英文摘要

This manuscript contains preprint of a chapter under consideration for inclusion in the forthcoming third edition of {\em Cover and Thomas's Elements of Information Theory}, posted with permission from Wiley. The table of contents EIT-3 ToC of the new edition can be found at: https://docs.google.com/document/d/1L-m4oQEJw1PJhoxBeMwrrBD8S_HmvzMEkPbYvS24980/edit?usp=sharing . For feedback, please contact abbas@ee.stanford.edu Learning and information theory intersect in both model training and the characterization of fundamental performance limits. This manuscript provides a concise and accessible treatment of the first intersection, requiring only basic background in information theory and statistics at the senior undergraduate or first-year graduate level. End-of-chapter exercises make the material well suited for classroom use as well as self-study. The chapter focuses on the role of divergence measures in model training, with examples ranging from linear and logistic regression to autoregressive models, variational autoencoders, diffusion models, generative adversarial networks, and score-based models. It introduces the evidence lower bound (ELBO), f-divergences, and the Fisher divergence. In particular, the treatment of the generative diffusion model provides a more systematic and explicit derivation than is typical in the literature.

2507.15475 2026-06-19 eess.SP math.PR stat.AP 版本更新

On the Distribution of a Two-Dimensional Random Walk with Restricted Angles

二维受限角度随机游走的分布

Karl-Ludwig Besser

AI总结 研究受限角度二维随机游走的分布,推导两步联合与边缘分布,提供一般步数的数值解及大步数近似,明确支持集的精确描述。

Comments 14 pages, 14 figures

Journal ref IEEE Transactions on Signal Processing, vol. 74, pp. 2316-2330, 2026

详情
AI中文摘要

本文推导了二维(复数)随机游走的分布,其中每一步的角度被限制在圆的一个子集。这种设置出现在信号处理中的空中计算等领域。特别地,我们推导了两步的联合和边缘分布,给出了任意步数的数值解,并对大步数提供了近似解。此外,我们为任意步数提供了支持集的精确描述。本文的结果为未来涉及此类问题的研究提供了参考。

英文摘要

In this paper, we derive the distribution of a two-dimensional (complex) random walk in which the angle of each step is restricted to a subset of the circle. This setting appears in various domains, such as in over-the-air computation in signal processing. In particular, we derive the exact joint and marginal distributions for two steps, numerical solutions for a general number of steps, and approximations for a large number of steps. Furthermore, we provide an exact characterization of the support for an arbitrary number of steps. The results in this work provide a reference for future work involving such problems.

2601.12433 2026-06-19 eess.SP cs.LG 版本更新

Temporal Data and Short-Time Averages Improve Multiphase Mass Flow Metering

Amanda Nyholm, Yessica Arellano, Jinyu Liu, Damian Krakowiak, Pierluigi Salvo Rossi

发表机构 * Dept. Electronic Systems, Norwegian University of Science and Technology(电子系统系,挪威科学与技术大学) Dept. Gas Technology, SINTEF Energy Research(气体技术系,SINTEF能源研究) Dept. Research and Development, KROHNE Ltd.(研发部,KROHNE有限公司)

Comments 9 pages, 6 figures

Journal ref IEEE Sensors Journal, vol. 26, no. 11, pp. 17252-17261, 1 June 2026

详情
英文摘要

Reliable flow measurements are essential in many industries, but current instruments often fail to accurately estimate multiphase flows, which are frequently encountered in real-world operations. Combining machine learning (ML) algorithms with accurate single-phase flowmeters has therefore received extensive research attention in recent years. The Coriolis mass flowmeter is a widely used single-phase meter that provides direct mass flow measurements, which ML models can be trained to correct, thereby reducing measurement errors in multiphase conditions. This paper demonstrates that preserving temporal information significantly improves model performance in such scenarios. We compare a multilayer perceptron, a windowed multilayer perceptron, and a convolutional neural network (CNN) on three-phase air-water-oil flow data from 342 experiments. Whereas prior work typically compresses each experiment into a single averaged sample, we instead compute short-time averages from within each experiment and train models that preserve temporal information at several downsampling intervals. The CNN performed best at 0.25 Hz with approximately 95 % of relative errors below 13 %, a normalized root mean squared error of 0.03, and a mean absolute percentage error of approximately 4.3 %, clearly outperforming the best single-averaged model and demonstrating that short-time averaging within individual experiments is preferable. Results are consistent across multiple data splits and random seeds, demonstrating robustness.

2604.03725 2026-06-19 quant-ph cs.IT eess.SP math.IT 版本更新

Quantum Algebraic Diversity: Single-Copy Density Matrix Estimation via Group-Structured Measurements

量子代数多样性:通过群结构测量进行单副本密度矩阵估计

Mitchell A. Thornton

AI总结 将代数多样性框架扩展到量子测量,提出量子代数多样性定理,通过群结构POVM从单副本量子态估计密度矩阵,实现高保真度,并建立经典-量子对偶映射和最优性继承定理。

Comments v3: copy-reduction claim corrected; fidelities fixed; 1 figure removed

详情
AI中文摘要

我们将代数多样性(AD)框架从经典信号处理扩展到量子测量理论。量子代数多样性(QAD)定理表明,应用于量子态单副本的群结构正算子值测度(POVM)会产生一个满秩的群平均密度矩阵估计量,其特征基和特征值排序追踪真实密度矩阵的特征基和特征值排序,并偏向对称化态,类似于从单个观测中恢复协方差特征结构的经典情况。我们建立了一个经典-量子对偶映射,将经典协方差估计与量子态层析成像联系起来,以及一个最优性继承定理,表明经典群最优性通过Born映射在群平均族内转移到量子设置。SIC-POVM被识别为Heisenberg-Weyl群的AD,互无偏基被识别为Clifford群的AD,揭示了层次结构$\mathrm{HW}(d) \subseteq \mathcal{C}(d) \subseteq S_d$,这镜像了经典的$\mathbb{Z}_M \subseteq G_{\min} \subseteq S_M$。双对易子特征值定理给出了多项式时间自适应POVM选择。一个工作的量子比特示例展示了来自单个计算基测量的群平均估计量,在匹配的$\mathbb{Z}_2$群上平均后,达到保真度0.99,而标准单基层层析成像给出的秩1估计保真度为0.80。对于$d=2$到13的蒙特卡洛模拟证实,来自单个结果的保真度高于0.90,而标准保真度按$\sim 1/d$退化。增长比率反映了秩1标准估计量的崩溃,而不是每个参数的更少副本:有偏的单副本估计量减少了不同测量设置的数目,而不是每个参数的采样成本,并且真正的副本减少仅在精确对称下成立。

英文摘要

We extend the algebraic diversity (AD) framework from classical signal processing to quantum measurement theory. The Quantum Algebraic Diversity (QAD) Theorem establishes that a group-structured positive operator-valued measure (POVM) applied to a single copy of a quantum state produces a full-rank, group-averaged density matrix estimator whose eigenbasis and eigenvalue ordering track those of the true density matrix, with a bias toward the symmetrized state, analogous to the classical recovery of covariance eigenstructure from a single observation. We establish a Classical-Quantum Duality Map connecting classical covariance estimation to quantum state tomography, and an Optimality Inheritance Theorem showing that classical group optimality transfers to quantum settings via the Born map within the group-averaged family. SIC-POVMs are identified as AD with the Heisenberg-Weyl group and mutually unbiased bases as AD with the Clifford group, revealing the hierarchy $\mathrm{HW}(d) \subseteq \mathcal{C}(d) \subseteq S_d$ that mirrors the classical $\mathbb{Z}_M \subseteq G_{\min} \subseteq S_M$. The double-commutator eigenvalue theorem gives polynomial-time adaptive POVM selection. A worked qubit example shows the group-averaged estimator from a single computational-basis measurement, averaged over a matched $\mathbb{Z}_2$ group, reaching fidelity 0.99 where standard single-basis tomography gives a rank-1 estimate of fidelity 0.80. Monte Carlo simulations for $d = 2$ to $13$ confirm fidelity above 0.90 from a single outcome while standard fidelity degrades as $\sim 1/d$. The growing ratio reflects collapse of the rank-1 standard estimator, not fewer copies per parameter: the biased single-copy estimator reduces the number of distinct measurement settings, not the per-parameter sampling cost, and a genuine copy reduction holds only under exact symmetry.

2509.15069 2026-06-19 eess.SP cs.DS cs.NA math.NA 版本更新

Efficient Computation of Time-Index Powered Weighted Sums Using Cascaded Accumulators

使用级联累加器高效计算时间索引加权和

Deijany Rodriguez Linares, Oksana Moryakova, Håkan Johansson

AI总结 提出一种利用级联累加器高效计算时间索引加权和的方法,将乘法次数从K×N减少到K+1次常数乘法,无需存储数据块,适用于实时逐样本处理系统。

Comments This work has been submitted to the IEEE for possible publication

Journal ref IEEE Signal Processing Letters, vol. 33, pp. 893-897, Feb. 2026

详情
AI中文摘要

本文提出了一种新颖的方法,使用级联累加器高效计算形如$\sum_{n=0}^{N-1} n^{K} v[n]$的时间索引加权和。传统的直接计算需要$K{\times}N$次通用乘法,对于大的$N$变得不可行,而基于查找表或信号反转的替代策略需要存储整个数据块。通过利用累加器特性,所提方法消除了此类存储需求,并将乘法成本降低到仅$K{+}1$次常数乘法,实现了高效的实时实现。当需要在逐样本处理系统中高效计算此类和时,该方法特别有用。

英文摘要

This letter presents a novel approach for \mbox{efficiently} computing time-index powered weighted sums of the form $\sum_{n=0}^{N-1} n^{K} v[n]$ using cascaded accumulators. Traditional direct computation requires $K{\times}N$ general multiplications, which become prohibitive for large $N$, while alternative strategies based on lookup tables or signal reversal require storing entire data blocks. By exploiting accumulator properties, the proposed method eliminates the need for such storage and reduces the multiplicative cost to only $K{+}1$ constant multiplications, enabling efficient real-time implementation. The approach is particularly useful when such sums need to be efficiently computed in sample-by-sample processing systems.

2602.20953 2026-06-19 eess.SP 版本更新

Timing Recovery and Sequence Detection for Integrate-and-Fire Time Encoding Receivers

Neil Irwin Bernardo

Comments 6 pages, 3 figures, accepted in 2026 IEEE Wireless Communications and Networking Conference (WCNC 2026)

详情
英文摘要

Recent advances in neuromorphic signal processing have introduced time encoding machines as a promising alternative to conventional uniform sampling for low-power communication receivers. In this paradigm, analog signals are converted into event timings by an integrate-and-fire circuit, allowing information to be represented through spike times rather than amplitude samples. While event-driven sampling eliminates the need for a fixed-rate clock, receivers equipped with integrate-and-fire time encoding machines, called time encoding receivers, often assume perfect symbol synchronization, leaving the problem of symbol timing recovery unresolved. This paper presents a joint timing recovery and data detection framework for integrate-and-fire time encoding receivers. The log-likelihood function is derived to capture the dependence between firing times, symbol timing offset, and transmitted sequence, leading to a maximum likelihood formulation for joint timing estimation and sequence detection. A practical two-stage receiver is developed, consisting of a timing recovery algorithm followed by a zero-forcing detector. Simulation results demonstrate accurate symbol timing offset estimation and improved symbol error rate performance compared to existing time encoding receivers.

2601.00014 2026-06-19 eess.SP cs.AI cs.LG 版本更新

Modeling Day-Long ECG Signals to Predict Heart Failure Risk with Explainable AI

建模全天心电图信号以可解释人工智能预测心力衰竭风险

Eran Zvuloni, Ronit Almog, Michael Glikson, Shany Brimer Biton, Ilan Green, Izhar Laufer, Offer Amir, Joachim A. Behar

发表机构 * Leumit Health Services(Leumit健康服务)

AI总结 提出DeepHHF深度学习模型,利用24小时单导联心电图数据预测五年内心力衰竭风险,AUC达0.80,优于短时片段和临床评分,可解释性分析显示模型关注心律失常和心脏异常。

详情
AI中文摘要

心力衰竭(HF)影响11.8%的65岁及以上成年人,降低生活质量和寿命。预防HF可降低发病率和死亡率。我们假设将人工智能(AI)应用于24小时单导联心电图(ECG)数据可预测五年内HF风险。为此,使用了Technion-Leumit Holter ECG(TLHE)数据集,包括20年间收集的47,729名患者的69,663条记录。我们的深度学习模型DeepHHF在24小时ECG记录上训练,实现了0.80的受试者工作特征曲线下面积,优于使用30秒片段和临床评分的模型。DeepHHF识别的高风险个体住院或死亡事件概率翻倍。可解释性分析显示DeepHHF关注心律失常和心脏异常。本研究强调了深度学习建模24小时连续ECG数据的可行性,捕捉了对可靠风险预测至关重要的阵发性事件。应用于单导联Holter ECG的人工智能无创、廉价且广泛可及,使其成为HF风险预测的有前景工具。

英文摘要

Heart failure (HF) affects 11.8% of adults aged 65 and older, reducing quality of life and longevity. Preventing HF can reduce morbidity and mortality. We hypothesized that artificial intelligence (AI) applied to 24-hour single-lead electrocardiogram (ECG) data could predict the risk of HF within five years. To research this, the Technion-Leumit Holter ECG (TLHE) dataset, including 69,663 recordings from 47,729 patients, collected over 20 years was used. Our deep learning model, DeepHHF, trained on 24-hour ECG recordings, achieved an area under the receiver operating characteristic curve of 0.80 that outperformed a model using 30-second segments and a clinical score. High-risk individuals identified by DeepHHF had a two-fold chance of hospitalization or death incidents. Explainability analysis showed DeepHHF focused on arrhythmias and heart abnormalities. This study highlights the feasibility of deep learning to model 24-hour continuous ECG data, capturing paroxysmal events essential for reliable risk prediction. Artificial intelligence applied to single-lead Holter ECG is non-invasive, inexpensive, and widely accessible, making it a promising tool for HF risk prediction.

2512.17473 2026-06-19 eess.SP cs.LG math.OC stat.ML 版本更新

Alternating Direction Method of Multipliers for Nonlinear Matrix Decompositions

非线性矩阵分解的交替方向乘子法

Atharva Awari, Nicolas Gillis, Arnaud Vandaele

发表机构 * University of Mons(蒙斯大学)

AI总结 提出基于交替方向乘子法(ADMM)的算法求解非线性矩阵分解(NMD),支持多种非线性函数和损失函数,在真实数据集上验证了适用性和效率。

Comments 16 pages, 7 figures. v3: Revised version: added new experiments and comparisons. Code available from https://gitlab.com/Atharva05/admm-for-nmd

详情
AI中文摘要

我们提出了一种基于交替方向乘子法(ADMM)的算法,用于求解非线性矩阵分解(NMD)。给定输入矩阵 $X \in \mathbb{R}^{m \times n}$ 和分解秩 $r \ll \min(m, n)$,NMD 寻求矩阵 $W \in \mathbb{R}^{m \times r}$ 和 $H \in \mathbb{R}^{r \times n}$,使得 $X \approx f(WH)$,其中 $f$ 是逐元素非线性函数。我们在几个代表性非线性模型上评估了我们的方法:适用于非负稀疏数据近似的修正线性单元激活 $f(x) = \max(0, x)$,适用于概率电路表示的逐分量平方 $f(x) = x^2$,以及适用于推荐系统的 MinMax 变换 $f(x) = \min(b, \max(a, x))$。所提出的框架灵活支持多种损失函数,包括最小二乘、$\ell_1$ 范数和 Kullback-Leibler 散度,并且可以轻松扩展到其他非线性和度量。我们在真实世界数据集上展示了该方法的适用性、效率和适应性,突出了其在广泛应用中的潜力。

英文摘要

We present an algorithm based on the alternating direction method of multipliers (ADMM) for solving nonlinear matrix decompositions (NMD). Given an input matrix $X \in \mathbb{R}^{m \times n}$ and a factorization rank $r \ll \min(m, n)$, NMD seeks matrices $W \in \mathbb{R}^{m \times r}$ and $H \in \mathbb{R}^{r \times n}$ such that $X \approx f(WH)$, where $f$ is an element-wise nonlinear function. We evaluate our method on several representative nonlinear models: the rectified linear unit activation $f(x) = \max(0, x)$, suitable for nonnegative sparse data approximation, the component-wise square $f(x) = x^2$, applicable to probabilistic circuit representation, and the MinMax transform $f(x) = \min(b, \max(a, x))$, relevant for recommender systems. The proposed framework flexibly supports diverse loss functions, including least squares, $\ell_1$ norm, and the Kullback-Leibler divergence, and can be readily extended to other nonlinearities and metrics. We illustrate the applicability, efficiency, and adaptability of the approach on real-world datasets, highlighting its potential for a broad range of applications.

2510.00831 2026-06-19 cs.AI cs.LG eess.SP 版本更新

Controlled Comparison of Machine Learning Models for Fault Classification and Localization in Power System Protection

电力系统保护中故障分类与定位的机器学习模型受控比较

Julian Oelhaf, Georg Kordowich, Changhun Kim, Paula Andrea Pérez-Toro, Christian Bergler, Andreas Maier, Johann Jäger, Siming Bayer

发表机构 * Department of Electrical Engineering, Media and Computer Science, Ostbayerische Technische Hochschule Amberg-Weiden(奥贝格-魏登应用技术大学电气工程、媒体与计算机科学系)

AI总结 在统一电磁暂态数据集和10-50ms决策窗口下,对比机器学习模型在故障分类与定位中的性能,发现分类在10ms时F1>0.98,定位误差稳定在约10%线路长度。

Comments Accepted at IEEE PES Innovative Smart Grid Technologies Europe 2026 (ISGT Europe 2026). Pre-camera-ready author version; final proceedings version may differ

详情
AI中文摘要

现代电力系统因逆变器基和分布式能源的集成而日益复杂,挑战了传统保护方案的可靠性,并推动了机器学习在保护任务中的应用。然而,由于不同研究中的数据集、传感假设和决策时域各异,已发表的结果往往难以比较。本文在相同的传感、时序和验证条件下,基于公共电磁暂态数据集,使用10-50ms的决策窗口以反映保护相关时间尺度,对故障分类(FC)和故障定位(FL)的机器学习模型进行了受控比较。对于FC,性能最佳的非线性模型在10ms时F1分数已超过0.98,而低容量模型在较短时域下性能下降,但随窗口延长而改善,表明相关故障类型信息在最早暂态中已存在。对于FL,顶级模型在所有评估时域下达到约10%归一化线路长度的稳定定位误差,而较弱模型形成明显分离的第二性能层级。线路解析分析显示,定位精度随电网段变化,表明存在拓扑依赖的难度而非仅时间上下文不足。这些发现为比较两个信息需求根本不同的保护任务中的机器学习模型提供了受控参考。

英文摘要

The increasing complexity of modern power systems, driven by the integration of inverter-based and distributed energy resources, challenges the reliability of conventional protection schemes and motivates the use of machine learning for protection tasks. However, published results are often difficult to compare because datasets, sensing assumptions, and decision horizons vary across studies. This paper presents a controlled comparison of machine learning models for fault classification (FC) and fault localization (FL) under identical sensing, timing, and validation conditions on a common electromagnetic transient dataset, using decision windows of 10-50 ms to reflect protection-relevant time scales. For FC, the best-performing nonlinear models achieve F1 scores above 0.98 already at 10 ms, while lower-capacity models degrade at shorter horizons but improve with longer windows, indicating that relevant fault-type information is already present in the earliest transient. For FL, the top-performing models reach a stable localization error of about 10 % of normalized line length across all evaluated horizons, while weaker models form a clearly separated second performance tier. Line-resolved analysis shows that localization accuracy varies across grid segments, indicating topology-dependent difficulty rather than insufficient temporal context alone. These findings provide a controlled reference for comparing machine learning models across two protection tasks with fundamentally different information requirements.

2509.03488 2026-06-19 eess.SP 版本更新

Efficient DoA Estimation for Linear and Rectangular Arrays with Hybrid Architectures Using Compact DFT Codebooks

基于紧凑DFT码本的线性和矩形阵列混合架构高效DoA估计

Miguel Rivas-Costa, Carlos Mosquera

AI总结 针对混合架构中维度压缩导致空间协方差矩阵自由度不足的问题,提出利用DFT波束成形后的柯西型位移结构的广义最小二乘框架,实现线性阵列的协方差矩阵高效恢复,复杂度为O(N_RF^2 N_x),逼近CRB并优于现有方法。

详情
AI中文摘要

混合模拟数字(HAD)架构显著降低了硬件开销,但引入了严重的维度压缩,这剥夺了空间协方差矩阵(SCM)进行高分辨率波达方向(DoA)估计所需的自由度。离散傅里叶变换(DFT)模拟波束成形的无源巴特勒矩阵实现避免了有源移相器和放大器,进一步加剧了这一挑战。在本文中,我们提出了一个广义最小二乘(GLS)框架,该框架利用了DFT波束成形后出现的柯西型位移结构。通过利用这种结构,我们开发了一种高效的数值技术来恢复均匀线性阵列的SCM,复杂度为$\mathcal{O}(N_{\text{RF}}^2 N_x)$,其中$N_x$是天线数量,$N_{\text{RF}}$是射频链数量。仿真表明,我们的估计器逼近克拉美-罗界(CRB),同时优于最先进的方法。

英文摘要

Hybrid Analog and Digital (HAD) architectures significantly reduce hardware overhead but introduce severe dimensionality compression, which strips the Spatial Covariance Matrix (SCM) of the degrees of freedom required for high-resolution Direction-of-Arrival (DoA) estimation. This challenge is further compounded by passive Butler-matrix implementations of Discrete Fourier Transform (DFT) analog beamforming, which avoid active phase shifters and amplifiers. In this paper, we propose a Generalized Least Squares (GLS) framework that exploits the Cauchy-like displacement structure that arises after DFT beamforming. By leveraging this structure, we develop a highly efficient numerical technique to recover the SCM for uniform linear arrays with a complexity of $\mathcal{O}(N_{\text{RF}}^2 N_x)$, where $N_x$ is the number of antennas and $N_{\text{RF}}$ the number of RF-chains. Simulations demonstrate that our estimator approaches the Cramér-Rao Bound (CRB) while outperforming state-of-the-art methods.