arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 1755
2606.07347 2026-06-08 eess.SP cs.ET 新提交

CSI Phase Averaging for High-Sensitivity Wi-Fi Sensing in Low-Multipath Environments

低多径环境下的高灵敏度Wi-Fi感知的CSI相位平均

Toshinori Suzuki, Shin-ichiro Ogura, Yu Morishima, Hiroshi Matsuura

AI总结 提出一种基于模型驱动的低复杂度运动检测方法,利用CSI相位结构特性抑制相位偏移误差,并通过相位平均降低噪声,实验证明可在低多径户外环境中检测数米外的飞鸟。

详情
Comments
13 pages, 11 figures, 3 tables
AI中文摘要

本文提出一种基于模型驱动的低复杂度运动检测方法,用于户外Wi-Fi感知。该方法利用低多径传播环境下信道状态信息(CSI)相位分量的结构特性(通常被认为不利于Wi-Fi感知),以减轻源自无线设备的相位偏移误差。此外,相位平均提供了处理增益,降低了包括量化噪声和热噪声在内的随机噪声分量。描述了该方法的理论基础,并使用从商用IEEE 802.11ac设备获取的压缩波束成形帧进行了实验评估。实验主要关注户外果园环境中飞行的野生乌鸦。实验结果表明,即使鸟类在距离发射和接收天线之间的直接视距路径数米外飞行,该方法也能检测到它们。此外,结果表明当风速低于3 m/s时,植被运动引起的波动可忽略不计。所提出的方法预计不仅适用于果园监测,也适用于低多径环境下的其他户外Wi-Fi感知应用。

英文摘要

This paper presents a low-complexity motion detection method for outdoor Wi-Fi sensing based on a model-driven approach. The method exploits the structural characteristics of the phase components in channel state information (CSI) for low-multipath propagation environments, which are generally considered disadvantageous for Wi-Fi sensing, to mitigate the phase offset errors originating from wireless devices. In addition, phase averaging provides a processing gain that reduces the random noise components, including quantization and thermal noise. The theoretical basis of the method is described and its effectiveness is experimentally evaluated using Compressed Beamforming frames obtained from commercial IEEE 802.11ac devices. The experiments primarily focus wild crows flying in an outdoor orchard environment. The experimental results demonstrate that the method can detect birds even when they fly several meters away from the direct line-of-sight path between the transmitter and receiver antennas. Furthermore, the results indicated that fluctuations caused by vegetation movement were negligible when the wind speed was less than 3~m/s. The proposed approach is expected to be applicable not only to orchard monitoring but also to other outdoor Wi-Fi sensing applications in low-multipath environments.

2606.07328 2026-06-08 eess.SP 新提交

Implementation and Calibration of 3GPP-Compliant ISAC Channel Simulator

符合3GPP标准的ISAC信道模拟器的实现与校准

Chien-Han Wu, Ming-Chun Lee, Ta-Sung Lee

AI总结 本文实现了3GPP TR 38.901中指定的ISAC信道模型模拟器,并通过与3GPP公司参考结果对比进行校准分析,为模拟器的实现和校准提供了关键细节。

详情
Comments
6 pages, Codes and other source files are open on GitHub
AI中文摘要

集成感知与通信(ISAC)已成为6G系统的关键技术。为了支持ISAC系统的开发,用于性能评估的精确信道建模与仿真至关重要。最近,3GPP为此引入了标准化的ISAC信道模型及其相关的校准程序。然而,由于建模方法的复杂性以及3GPP报告中缺乏完全明确的实现细节,不同的实现可能导致不一致或不同步的仿真结果。为了解决这个问题,在本工作中,我们实现了TR 38.901中指定的3GPP ISAC信道模型模拟器,并进行了全面的校准分析。我们将仿真结果与3GPP中公司报告的参考结果进行比较,并讨论了几个关键的实现细节,以提供对模拟器实现和校准的见解。为了促进可重复性和进一步研究,所开发的模拟器以及相关数据集和校准结果已作为开源项目在GitHub上发布。

英文摘要

Integrated sensing and communication (ISAC) has emerged as a key technology for 6G systems. To support the development of ISAC systems, accurate channel modeling and simulation for performance evaluation is essential. Recently, 3GPP introduced a standardized ISAC channel model and its associated calibration procedure for this purpose. However, due to the complexity of the modeling methodology and the lack of fully explicit implementation details in the 3GPP reports, different implementations may lead to inconsistent or unsynchronized simulation results. To address this issue, in this work, we implement the 3GPP ISAC channel model simulator specified in TR 38.901 and conduct a comprehensive calibration analysis. We compare the simulation results with the reference results reported by companies in 3GPP and discuss several key implementation details to provide insights into the implementation and calibration of the simulator. To facilitate reproducibility and further research, the developed simulator, together with the relevant datasets and calibration results, has been released as an open-source project on GitHub.

2606.07284 2026-06-08 eess.SP 新提交

RSMA Enabled Hierarchical UAV Networks with Non Linear Energy Harvesting: Outage Probability Analysis and UAV Placement Optimization

具有非线性能量收集的RSMA赋能分层无人机网络:中断概率分析与无人机部署优化

Faicel Khennoufa, Khelil Abdellatif, Metin Ozturk, Halim Yanikomeroglu, Safwan Alfattani

AI总结 针对分层无人机网络中的能量受限和硬件损伤问题,提出结合非线性能量收集与速率分割多址接入的方案,推导中断概率表达式并优化无人机部署,显著提升可靠性。

详情
Comments
Accepted in IEEE Transactions on Vehicular Technology
AI中文摘要

无人机有望增强第六代蜂窝网络的连接性、扩展网络覆盖并支持高级通信服务,特别是在公共和民用应用中。尽管多无人机系统比单无人机部署具有更高的效率和成本效益,但其实现仍面临若干基本挑战,限制了其可靠性、可持续性和可扩展性。有限的机载能量限制了任务持续时间和通信连续性。因此,无线能量收集成为克服这一限制的有前景的解决方案。然而,地面能源存在路径损耗,使得从周围无人机收集能量更具可持续性。此外,在硬件损伤和不完美信道状态信息下,速率分割多址接入在分层无人机网络中尚未得到充分探索。本文提出一种具有非线性能量收集和RSMA的分层自组织无人机网络,以提高能量和成本效率,其中无人机从周围无人机收集能量。针对实际场景,我们在所提系统中考虑了HWI和ICSI的影响。据作者所知,本研究是文献中首次对此类场景进行探讨。推导了地面物联网设备、每个CMU以及所提系统总中断概率的表达式,基于Nakagami-$m$衰落信道,同时考虑了HWI、ICSI和非线性EH等实际约束。此外,还推导了高发射功率区域下的近似中断概率表达式。随后,我们制定了两个优化问题以提高可靠性和性能。结果表明,所提系统在中断概率方面优于所有基准方案。

英文摘要

Uncrewed aerial vehicles (UAVs) are expected to enhance connectivity, extend network coverage, and support advanced communication services in sixth-generation (6G) cellular networks, particularly in public and civil applications. Although multi-UAV systems offer greater efficiency and cost-effectiveness than single-UAV deployments, their implementation still faces several fundamental challenges that limit their reliability, sustainability, and scalability. The limited onboard energy restricts mission duration and communication continuity. Therefore, wireless energy harvesting (EH) emerges as a promising solution to overcome this limitation. However, terrestrial energy sources experience path loss, making EH from surrounding UAVs more sustainable. Moreover, rate-splitting multiple access (RSMA) remains insufficiently explored in hierarchical UAV networks under hardware impairments (HWI) and imperfect channel state information (ICSI). This paper proposes a hierarchical ad hoc UAV network with non-linear EH and RSMA to enhance both energy and cost efficiency, where UAVs harvest energy from surrounding UAVs. For a practical scenario, we consider the effect of HWI and ICSI in our proposed system. To the best of the authors knowledge, this study is the first to investigate such a scenario in the literature. The outage probability expressions for ground Internet of things (IoT) devices, each CMU, and the overall outage probability of the proposed system are derived over Nakagami-$m$ fading channels while considering practical constraints such as HWI, ICSI, and non-linear EH. Additionally, approximate outage probability expressions are derived for high transmit power regimes. Subsequently, we formulate two optimization problems to enhance reliability and performance. Our findings indicate that the proposed system outperforms all benchmarks in terms of outage probability.

2606.07264 2026-06-08 eess.AS 新提交

VISA: A Visual Information Strengthened Audio-Reasoning System for the Interspeech 2026 ARC Agent Track

VISA:面向Interspeech 2026 ARC智能体赛道的视觉信息增强音频推理系统

Wenming Tu, Jian Gao, Yanru Huo, Yixuan Wang, Jing Peng, Bohan Li, Ziyang Ma, Tao Liu, Shuai Fan, Kai Yu, Xie Chen, Zilong Zheng

AI总结 提出VISA系统,通过多模态特征提取、模型投票推理和细粒度类别感知路由,增强大音频语言模型的音频推理能力,在ARC智能体赛道取得66.23%评分和77.40%准确率。

详情
Comments
Submitted to INTERSPEECH 2026
AI中文摘要

音频推理需要对时变动态和声学混合信号进行多步骤、基于证据的推理,超越了传统感知任务如ASR或字幕生成。我们提出VISA,作为提交至Interspeech 2026音频推理挑战赛(智能体赛道)的系统,通过MMAR评分标准评估正确性和推理质量。在“LALM作为工具”范式下,VISA利用辅助多模态证据增强大音频语言模型,同时避免繁重的编排。该系统集成三个组件:多模态特征提取以获取互补的音频和声学-视觉线索,带一致性检查的模型投票推理以获得稳定预测,以及细粒度类别感知路由以解决分歧并选择符合评分标准的推理链。在官方智能体赛道排行榜上,VISA以66.23%的评分排名第二。它还达到了77.40%的准确率,是单模型和智能体赛道所有系统中最高的。

英文摘要

Audio reasoning requires multi-step, evidence-grounded inference over temporally dynamic and acoustically mixed signals, exceeding conventional perception tasks such as ASR or captioning. We present VISA, our submission to the Interspeech 2026 Audio Reasoning Challenge (Agent Track), evaluated via the MMAR Rubrics for correctness and reasoning quality. Under a "LALM as a Tool" paradigm, VISA strengthens large audio language models with auxiliary multi-modal evidence while avoiding heavy orchestration. The system integrates three components: multi-modal feature extraction for complementary audio and acoustic-visual clues, model-voting inference with consistency checking for stable predictions, and fine-grained category-aware routing to resolve disagreements and select rubric-aligned reasoning chains. On the official Agent Track leaderboard, VISA ranks 2nd overall with a 66.23% Rubrics score. It also achieves 77.40% Accuracy, the highest among all systems listed across both the Single Model and Agent tracks.

2606.07182 2026-06-08 eess.AS 新提交

Audio Imitator: Controlling Timbre and Tempo in Video2Audio Synthesis with Audio Reference

Audio Imitator: 通过音频参考控制视频到音频合成中的音色和节奏

Jiahui Zhao, Tianrui Wang, Chunyu Qiang, Cheng Gong, Xijuan Zeng, Feng Deng, Longbiao Wang

AI总结 提出AudioIM框架,通过双编码器分离建模音色和节奏,实现细粒度风格控制,在保持语义一致性的同时提升风格相似度。

详情
AI中文摘要

视频到音频生成在实现无声视频的语义一致性和时间对齐方面取得了显著进展。然而,音频包含丰富的风格属性,如音色和节奏,这些很难仅从视觉和文本输入中推断出来。虽然参考音频可以作为额外的条件,但它通常被视为整体信号,限制了细粒度的风格控制。我们提出AudioIM,一个属性感知框架,明确将音色和节奏建模为独立的控制因素,而不是依赖整体提示条件。双编码器提取互补的音色相关和节奏相关表示,并通过全局条件注入。基于掩码的训练策略使得在推理时能够进行有效的潜在提示条件。在VGGSound上的实验表明,在保持语义对齐和同步的同时,风格相似度得到了提升。音频样本可在以下网址获取:this https URL。

英文摘要

Video-to-audio generation has made significant progress in achieving semantic consistency and temporal alignment from silent videos. However, audio contains rich stylistic attributes such as timbre and tempo that are difficult to infer from visual and textual inputs alone. While reference audio can serve as additional conditioning, it is typically treated as a holistic signal, limiting fine-grained style control. We propose AudioIM, an attribute-aware framework that explicitly models timbre and tempo as separate control factors rather than relying on holistic prompt conditioning. Dual encoders extract complementary timbre-related and tempo-related representations, which are injected through global conditioning. A masking-based training strategy enables effective latent prompt conditioning at inference. Experiments on VGGSound show improved style similarity while preserving semantic alignment and synchronization. Audio samples are available at: https://anonymousdemo757.github.io/.

2606.07104 2026-06-08 eess.SP 新提交

Robust Secure Beamforming for Movable Antenna Enhanced Integrated Sensing and Communications

可移动天线增强集成感知与通信的鲁棒安全波束赋形

Yuan Chen, Ning Wei, Ahmad Bazzi, Xiangyu Dong, Ran Yang, You Li, Yue Xiu

AI总结 针对不完美窃听信道状态信息,提出联合优化发射波束赋形和天线位置的鲁棒波束赋形设计,以最大化雷达信干噪比并保证通信安全,采用基于块坐标下降的算法结合逐次凸近似和分数规划。

详情
AI中文摘要

在这封信中,我们研究了可移动天线增强的安全集成感知与通信系统中,在存在不完美窃听信道状态信息情况下的鲁棒波束赋形设计。为了提升雷达感知性能,我们通过联合优化发射波束赋形和天线位置,同时确保通信数据安全,提出了一个雷达信干噪比最大化问题。然而,由于天线位置到信道系数的非线性映射以及窃听者信道的不确定性,所得到的优化问题本质上是难以处理的。为了应对这些挑战,我们提出了一种基于块坐标下降的算法,结合了逐次凸近似和分数规划技术。仿真结果表明,我们提出的算法具有快速收敛性,并在保证通信安全的同时显著提升了雷达信干噪比。

英文摘要

In this letter, we investigate robust beamforming design for a movable antenna (MA)-enhanced secure integrated sensing and communications (ISAC) system with imperfect eaves?dropping channel state information (CSI). To improve radar sensing performance, we formulate a radar signal-to-interference?plus-noise ratio (SINR) maximization problem by jointly opti?mizing the transmit beamforming and antenna placement while ensuring communication data security. However, the resulting op?timization problem is inherently intractable due to the nonlinea mapping from antenna positions to channel coefficients, as well as the eavesdropper (Eve) channel uncertainty. To handle these challenges, we propose a block coordinate descent (BCD)-based algorithm incorporating successive convex approximation (SCA) and fractional programming (FP) techniques. Simulation results show that our proposed algorithm exhibits fast convergence and achieves a significant improvement in the radar SINR while guaranteeing communication security.

2606.07091 2026-06-08 eess.SP 新提交

Rate-Splitting--Inspired Uplink Near-Field ISAC

速率分裂启发的上行近场ISAC

Anup Mishra, Israel Leyva-Mayorga, Petar Popovski

AI总结 提出速率分裂(RS)启发的上行近场ISAC框架,通过分裂通信消息到感知操作,推导通信速率和感知速率的闭式表达式,表征可达速率区域,证明RS启发边界优于NOMA启发的时间共享区域。

详情
AI中文摘要

集成感知与通信(ISAC)使感知和通信(S&C)功能共享频谱、硬件和信号处理资源,但由此产生的功能间干扰带来了基本的接收机设计挑战,特别是在上行链路操作中。本文开发了一个速率分裂(RS)启发的上行近场ISAC框架。该框架通过将通信消息分裂到感知操作中,推广了非正交多址(NOMA)启发ISAC的感知中心(S-C)和通信中心(C-C)端点顺序。推导了通信速率(CR)和感知速率(SR)的闭式表达式,考虑了来自目标响应估计不确定性的残余感知干扰。在感知匹配照明下表征了可达CR-SR速率区域,其中所提出的单帧RS启发边界包含NOMA启发的时间共享区域。与经典高斯上行多址信道(其中RS恢复时间共享主导面)不同,上行ISAC中的分裂因子也重塑了感知阶段的干扰,使得RS启发边界匹配或严格扩大S&C折衷。高信噪比分析表明,对于非对齐的S&C信道,残余感知干扰改变速率偏移但不改变主导S&C斜率,而在完全对齐的情况下,它变得斜率受限。使用孔径感知的近场信道模型,推导了大阵列极限,表明随着阵列增长,可达速率保持有限。数值结果验证了分析,并展示了RS启发方案的优势、残余感知干扰的影响以及由物理一致近场建模引起的有限大阵列行为。

英文摘要

Integrated sensing and communication (ISAC) enables sensing and communication (S&C) functionalities to share spectrum, hardware, and signal-processing resources, but the resulting inter-functionality interference creates a fundamental receiver-design challenge, particularly in uplink operation. This paper develops a rate-splitting (RS)-inspired framework for uplink near-field ISAC. The framework generalizes the sensing-centric (S-C) and communication-centric (C-C) endpoint orders of non-orthogonal multiple access (NOMA)-inspired ISAC by splitting the communication message across the sensing operation. Closed-form expressions are derived for the communication-rate (CR) and sensing-rate (SR), accounting for residual sensing interference from target-response estimation uncertainty. The achievable CR-SR rate region is characterized under sensing-matched illumination, where the proposed single-frame RS-inspired boundary contains the NOMA-inspired time-sharing region. Unlike the classical Gaussian uplink multiple access channel, where RS recovers the time-sharing dominant face, the split factor in uplink ISAC also reshapes the sensing-stage interference, allowing the RS-inspired boundary to match or strictly enlarge the S&C tradeoff. High-SNR analysis shows that, for non-aligned S&C channels, residual sensing interference changes the rate offsets but not the leading S&C slopes, whereas in the fully-aligned case it becomes slope-limiting. Using an aperture-aware near-field channel model, large-array limits are derived, showing that achievable rates remain finite as the array grows. Numerical results validate the analysis and demonstrate the benefits of the RS-inspired scheme, the impact of residual sensing interference, and the bounded large-array behaviour induced by physically consistent near-field modelling.

2606.07050 2026-06-08 eess.SP 新提交

Optimized Sampling of Angle-Resolved Scatterometry Data Using End-to-End Compressed Learning Model for Nanograss Deficiency Detection

使用端到端压缩学习模型优化角度分辨散射测量数据采样用于纳米草缺陷检测

Mehdi Abdollahpour, Carsten Bockelmann, Armin Dekorsy

AI总结 提出端到端压缩学习框架,集成可学习纬度采样层与CNN,联合优化采样与分类,在减少90%采样点下保持94.2%的五级缺陷分类精度。

详情
Comments
Preprint. 13 pages, 11 figures
AI中文摘要

纳米表面的可靠检测对于确保纳米结构制造质量至关重要。角度分辨散射测量提供了一种非侵入式检测方法,可在线使用,但由于密集的角度采样,通常采集时间较长。本文针对数据采集挑战,提出了一种端到端压缩学习框架,用于使用ARS图像检测氧化锌纳米草中的5级空位缺陷。该框架将可学习的基于纬度的采样层与卷积神经网络集成,使得采样和分类可以在训练过程中联合优化。采样层利用ARS模式的物理结构,学习信息丰富的纬度区域,从而减少采样搜索空间并提高收敛性。评估结果表明,所提方法在不同噪声条件下实现了高且稳定的缺陷级分类性能。使用完整ARS图像,模型在五级缺陷分类中达到94.2%的准确率,在区分缺陷与非缺陷纳米表面时达到98.6%的准确率。所提采样模型在使用多达90%更少的角度采样点时,性能与全图像相当。即使采样点减少99.7%,分类准确率下降不到10个百分点。为了进一步改善有限数据下的训练,我们还研究了基于GAN的增强方法,并使用GAN生成的数据进行模型预训练。增强数据使得仅需少量微调轮次即可快速收敛。

英文摘要

Reliable inspection of nanosurfaces is essential to ensure the quality of nanostructure manufacturing. Angle-resolved scatterometry provides a non-invasive inspection method that can be used in-line but often suffers from long acquisition times due to dense angular sampling. This paper addresses the data acquisition challenge by proposing an end-to-end compressed learning framework for 5-level vacancy deficiency detection in zinc oxide nanograss using ARS images. The proposed framework integrates a learnable latitude-based sampling layer with a convolutional neural network, allowing sampling and classification to be jointly optimized during training. The sampling layer exploits the physical structure of ARS patterns and learns informative latitudinal regions, which reduces the sampling search space and improves convergence. Evaluation results show that the proposed approach achieves high and stable deficiency-level classification performance under different noise conditions. Using full ARS images, the model achieves 94.2% accuracy for five-level deficiency classification and 98.6% accuracy for separating deficient from non-deficient nanosurfaces. The proposed sampling model matches full-image performance while using up to 90% fewer angular sampling points. Even when sampling points are reduced by 99.7%, the classification accuracy decreases by less than 10 percentage points. To further improve training with limited data, we also studied a GAN-based augmentation approach and used GAN-generated data for model pretraining. Augmented data resulted in fast convergence within only a few fine-tuning epochs.

2606.07026 2026-06-08 eess.SP 新提交

A Novel Stripe-based RIS Optimization for UAV Communications and Sensing in Low-Altitude Wireless Networks

基于条带的可重构智能表面优化用于低空无线网络中的无人机通信与感知

Burak Ahmet Celebi, Sefa Kayraklik, Onur Salan, Ibrahim Hokelek, Ali Emre Pusane, Ali Gorcin

AI总结 提出一种低复杂度的条带式RIS相位优化框架,利用相邻元素的结构相位梯度减小搜索空间,在3D移动下增强通信可靠性并提供被动感知能力,仿真和实验验证了其高收敛速度和鲁棒性。

详情
Comments
13 Pages, 14 figures
AI中文摘要

低空无线网络(LAWN)设想了一种可重构的3D网络,能够支持关键任务的空中操作。本文提出了一种可重构智能表面(RIS)辅助的LAWN,以在变化的无线信道条件和信号阻塞下与无人机(UAV)建立可靠通信。提出了一种低复杂度的条带式RIS相移优化框架,以同时增强通信可靠性并为3D移动下的UAV跟踪提供被动感知能力。与高复杂度的优化方法不同,所提方法利用RIS相邻元素固有的结构相位梯度,显著减少了随UAV移动计算和更新RIS配置的搜索空间。分析和仿真结果表明,所提框架在收敛速度和计算效率上优于传统基准,即使在存在相位估计误差和低信噪比(SNR)的情况下,也能保持稳健的高SNR连接。此外,在室外校园环境中使用真实RIS原型进行了测量实验,以证明所提方法的实际可行性。

英文摘要

Low-altitude wireless networks (LAWN) envision a reconfigurable 3D network capable of supporting mission-critical aerial operations. This paper presents a reconfigurable intelligent surface (RIS)-assisted LAWN to establish a reliable communication with an unmanned aerial vehicle (UAV) across varying wireless channel conditions and signal blockages. A low complexity stripe-based RIS phase shift optimization framework is proposed to simultaneously enhance communication reliability and provide passive sensing capability for UAV tracking under 3D mobility. Unlike high-complexity optimization approaches, the proposed method leverages the inherent structural phase-gradient of the RIS adjacent elements to significantly reduce the search space for calculating and updating the RIS configuration as the UAV moves. The analysis and simulation results demonstrate that the proposed framework outperforms conventional benchmarks in convergence speed and computational efficiency, while maintaining robust, high signal-to-noise-ratio (SNR) connectivity even in the presence of phase estimation errors and low SNR regimes. In addition, the measurement experiments using a real RIS prototype in an outdoor campus environment are performed to demonstrate the practical viability of the proposed approach.

2606.06962 2026-06-08 eess.AS 新提交

FSC-Net: Integrating Fast Fourier Convolutions and Progressive Learning for Speech Bandwidth Extension

FSC-Net:融合快速傅里叶卷积与渐进学习的语音带宽扩展

Xinan Chen, Xiaobin Rong, Qinwen Hu, Kai Chen, Jing Lu

AI总结 提出FSC-Net,通过集成快速傅里叶卷积和频率渐进学习,高效建模跨频段谐波依赖,实现窄带到宽带语音的高保真重建,在VCTK 4kHz-48kHz任务上以1.54M参数取得领先的LSD和PESQ分数。

详情
Comments
5 pages, 2 figures
AI中文摘要

语音带宽扩展(BWE)旨在从窄带输入重建高保真宽带音频。尽管近期方法取得了显著进展,但它们通常难以重建真实的高频相位和谐波结构,导致感知伪影。本文提出FSC-Net(全频谱上下文网络),一种参数高效的架构,旨在显式建模跨频段谐波依赖。通过将快速傅里叶卷积(FFCs)集成到复频谱映射框架中,FSC-Net将其感受野扩展到整个频谱,有效捕获长程频率交互。为解决高频生成的不适定性,我们新颖的频率渐进学习课程引导网络从粗到细地重建频谱细节。在VCTK和未见过的EARS数据集上的实验结果表明,FSC-Net提供了持续强劲的重建质量和泛化能力,尤其在具有挑战性的VCTK 4 kHz至48 kHz任务中。与规模更大的基线相比,我们的模型在保持高度紧凑的参数规模(1.54 M)的同时,取得了领先的LSD和PESQ分数。

英文摘要

Speech bandwidth extension (BWE) aims to reconstruct high-fidelity wideband audio from narrowband inputs. While recent approaches have made significant progress, they often struggle to reconstruct realistic high-frequency phase and harmonic structures, leading to perceptual artifacts. In this paper, we propose FSC-Net (Full-Spectrum Context Network), a parameter-efficient architecture designed to explicitly model cross-band harmonic dependencies. By integrating Fast Fourier Convolutions (FFCs) into a complex spectral mapping framework, FSC-Net expands its receptive field to the entire spectrum, capturing long-range frequency interactions effectively. To address the ill-posed nature of high-frequency generation, our novel frequency-progressive learning curriculum guides the network to reconstruct spectral details from coarse to fine. Experimental results on the VCTK and unseen EARS datasets demonstrate that FSC-Net delivers consistently strong reconstruction quality and generalization, particularly in the challenging VCTK 4 kHz-to-48 kHz task. Compared to scaled-up baselines, our model attains leading LSD and PESQ scores while maintaining a highly compact parameter footprint (1.54 M).

2606.06954 2026-06-08 eess.SP 新提交

Learn to Access and Backhaul the Sky: Multi-Scale Radio Map Guided Multi-UAV Cooperation

学会接入和回传天空:多尺度无线电地图引导的多无人机协作

Yifeng Yuan, Shijian Gao

AI总结 针对无人机群在三维场景中因用户移动和建筑遮挡导致的端到端瓶颈问题,提出多尺度无线电地图引导(MRMG)框架,结合全局、局部和链路级地图信息,通过多智能体强化学习实现无人机移动、下一跳选择和功率控制的联合优化,显著提升网络吞吐量和边缘用户速率。

详情
Comments
6 pages, 4 figures
AI中文摘要

受新兴低空经济的驱动,无人机群提供了灵活的集成空地接入和回传。然而,由于用户移动和建筑遮挡在这些三维场景中的相互依赖动态性,提供无缝连接是困难的。这些因素在端到端路径中造成快速变化的瓶颈。此外,联合控制的多维性质限制了传统启发式方法的有效性。为了应对这些挑战,提出了一个多尺度无线电地图引导(MRMG)框架。MRMG框架通过整合三个不同层次的无线电信息来处理异构动态:全局地图提供区域覆盖洞察,局部地图捕获邻域尺度服务条件,链路级地图表征高分辨率信道特征。这种设计有效地解耦了宏观移动和微观链路自适应。为了实现长期性能提升,一个多智能体强化学习(MARL)控制器学习无人机移动、下一跳选择和发射功率控制的协作策略。仿真结果表明,MRMG框架不仅提高了网络吞吐量,还显著增强了小区边缘服务,几乎将第5百分位用户速率翻倍。

英文摘要

Driven by the emerging low-altitude economy, uncrewed aerial vehicle (UAV) swarms offer flexible integrated air-ground access and backhaul. However, providing seamless connectivity is difficult due to the interdependent dynamics of user mobility and building blockages in these 3D scenarios. These factors create rapidly shifting bottlenecks in end-to-end paths. Furthermore, the multi-dimensional nature of joint control limits the effectiveness of traditional heuristics. To address these challenges, a \textbf{\underline{M}}ulti-Scale \textbf{\underline{R}}adio \textbf{\underline{M}}ap-\textbf{\underline{G}}uided (MRMG) framework is proposed. The MRMG framework handles heterogeneous dynamics by integrating three distinct levels of radio information: global-level maps provide regional coverage insights, local-level maps capture neighborhood-scale service conditions, and link-level maps characterize high-resolution channel features. This design effectively decouples macro-movement from micro-link adaptation. To yield long-term performance improvements, A multi-agent reinforcement learning (MARL) controller learns cooperative policies for UAV movement, next-hop selection, and transmit-power control. Simulation results show that the MRMG framework not only improves network throughput but also significantly bolsters cell-edge service, nearly doubling the 5th-percentile user rate.

2606.06933 2026-06-08 eess.IV 新提交

A 3D Formulation of the Extended Phaseless Rytov Approximation

扩展无相位Rytov近似的三维公式

Wanqin Ma, Zan Li, Amartansh Dubey, Alikhan Umirbayev, Yijun Chen, Junhui Rao, Ross Murch

AI总结 提出扩展三维无相位Rytov近似(x3DPRA),将二维无相位RF成像方法扩展到三维,保持实现简单性,实现体积成像,并通过仿真验证其定位、形状重建和材料衰减估计性能。

详情
Comments
12 pages, 6 figures, In processing for IEEE Trans
AI中文摘要

扩展无相位Rytov近似(xPRA)是一种最近提出的无设备射频成像技术,仅使用无相位测量(如接收信号强度RSS)即可提供成像区域的高分辨率重建。由于其无相位公式,可以利用现有无线通信基础设施直接实现。它也优于著名的无设备无相位RF成像方法,如无线电断层成像(RTI)。xPRA(和RTI)中使用的线性无相位公式使得这些方法可能对下一代无线网络中的集成感知与通信(ISAC)系统有用,因为它们不需要宽带宽。然而,到目前为止,xPRA和RTI主要是在二维(2D)中提出的。本文介绍了xPRA的三维扩展,我们称之为扩展三维无相位Rytov近似(x3DPRA)。我们方法的新颖之处在于,它保留了RTI和xPRA的直接实现优势,同时实现了体积(3D)成像。仿真结果表明,x3DPRA提供了良好的位置和形状估计,并且还可以重建物体材料衰减。我们提出了三维公式,通过与二维模型比较进行验证,并报告了展示其性能的仿真结果。

英文摘要

The extended Phaseless Rytov Approximation (xPRA) is a recently proposed device-free RF imaging technique that provides high-resolution reconstructions of the imaging region using only phaseless measurements, such as received signal strength (RSS). Because of its phaseless formulation, it can be implemented straightforwardly using existing wireless commu?nication infrastructure. It also outperforms well-known device?free phaseless RF imaging methods such as Radio Tomographic Imaging (RTI). The linear phaseless formulation used in xPRA(and RTI) makes these methods potentially useful for integrated sensing and communication (ISAC) systems in next generation wireless networks since they do not require wide bandwidths. However, so far, both xPRA and RTI have primarily been formulated in two dimensions (2D). This paper introduces a 3D extension of xPRA, which we call the extended three-dimensional phaseless Rytov approximation (x3DPRA). The novelty of our approach is that it preserves the straightforward implementation advantages of RTI and xPRA while enabling volumetric (3D) imaging. Simulation results show that x3DPRA provides good estimates of location and shape and can also reconstruct object material attenuation. We present the 3D formulation, validate it with a 2D model comparison, and report simulation results demonstrating its performance.

2606.06846 2026-06-08 eess.SP 新提交

Variable-Length Finite-Rate CSI Feedback With Generative Priors

变长有限速率CSI反馈与生成先验

Yangxuan Cheng, Fanyang Meng, Jian Zou, Jiacheng Xie, Zhongqiang Zhang, Ye Wang, Yongsheng Liang

AI总结 提出CsiCoGen,一种基于生成扩散模型的变长CSI反馈结构,通过可迁移码本实现灵活序列长度和量化精度,无需联合训练,在COST2100上达到高码率下室内-31 dB、室外-20 dB NMSE。

详情
AI中文摘要

本文从结构角度研究了变长有限速率CSI反馈,并提出了CsiCoGen,一种新颖的生成式反馈结构,具有无需联合训练的可迁移码本机制。UE将$H_0$映射为有序的码本索引序列,而BS利用共享的去噪先验从接收到的任意部分反馈索引序列中递归恢复CSI。这通过码本大小实现了反馈序列长度和每步量化精度的灵活控制。CsiCoGen不需要联合训练特定任务的反馈编码器或码本与重构器,且相同的在线结构可以搭配不同的预训练去噪器。在本文中,我们使用生成扩散模型实例化解码器。在COST2100上的仿真结果表明,与代表性基线相比,CsiCoGen在速率-NMSE和速率-$\ ho$权衡上表现优异,在高码率下达到约-31 dB室内NMSE和-20 dB室外NMSE,同时展示了可扩展的解码复杂度和可调节的每步量化精度。

英文摘要

This letter studies variable-length finite-rate CSI feedback from a structural perspective and proposes CsiCoGen, a novel generative feedback structure with a transferable codebook mechanism without joint training. The UE maps $H_0$ into an ordered sequence of codebook indices, while the BS recursively recovers CSI from any received partial sequence of feedback indices using a shared denoising prior. This enables flexible control of feedback sequence length and per-step quantization precision through codebook size. CsiCoGen does not require jointly training a task-specific feedback encoder or codebook with the reconstructor, and the same online structure can be paired with different pretrained denoisers. In this work, we instantiate the decoder with a generative diffusion model. Simulation results on COST2100 show favorable rate-NMSE and rate-$ρ$ tradeoffs against representative baselines, with CsiCoGen reaching about -31 dB indoor NMSE and -20 dB outdoor NMSE in the high-rate regime while demonstrating scalable decoding complexity and adjustable per-step quantization precision.

2606.06792 2026-06-08 eess.SP 新提交

Copula Function Parameter Regions in Analyzing Wireless Communications Performances

无线通信性能分析中的Copula函数参数区域

Mona Mohsenzadeh, Saeid Pakravan, Ghosheh Abed Hodtani

AI总结 提出Copula依赖参数区域概念,通过两用户MAC信道中FGM Copula的示例,从通信和概率角度推导参数区域,表明实际需求可显著缩小经典可容许区间。

详情
AI中文摘要

Copula函数已广泛应用于无线通信分析中,用于建模依赖结构和评估系统性能。然而,现有研究通常用Copula依赖参数表达性能指标,而未明确表征其可容许区域。本文介绍了Copula依赖参数区域的概念,并研究了其在无线通信中的重要性。考虑一个由双变量Farlie--Gumbel--Morgenstern (FGM) Copula建模的相关瑞利衰落的两用户无线多址接入信道 (MAC),从中断概率和皮尔逊相关系数 (PCC) 约束出发,从通信理论和概率角度推导出显式参数区域。结果表明,实际通信和统计要求可以显著缩小经典的Copula可容许区间,使得一些理论上可容许的依赖结构变得不可行。数值示例说明了所提出的概念及其实际意义。

英文摘要

Copula functions have been widely employed in wireless communication analysis to model dependence structures and evaluate system performance. However, existing studies generally express performance metrics in terms of copula dependence parameters without explicitly characterizing their admissible regions. This letter introduces the concept of copula dependence parameter regions and investigates its significance in wireless communications. Considering a two-user wireless multiple access channel (MAC) with correlated Rayleigh fading modeled by the bivariate Farlie--Gumbel--Morgenstern (FGM) copula, explicit parameter regions are derived from communication-theoretic and probabilistic perspectives using outage probability and Pearson correlation coefficient (PCC) constraints. The results show that practical communication and statistical requirements can significantly shrink the classical copula admissible interval, rendering some theoretically admissible dependence structures infeasible. Numerical examples illustrate the proposed concept and its practical implications.

2606.06732 2026-06-08 eess.SP 新提交

Angular Sector-Based Sparse Array Design for Adaptive Beamforming Using Deep Learning

基于深度学习的角扇区稀疏阵列设计用于自适应波束成形

John Kobak, Ethan Atiyeh, Syed A Hamza

AI总结 提出一种基于深度学习的稀疏阵列设计框架,通过角扇区分类策略和CNN/ResNet50实现高精度阵列选择,SINR偏差低于1%。

详情
Comments
Presented at the IEEE Radar Conference 2026
AI中文摘要

高效的稀疏阵列可重构性对于动态射频环境中的认知感知至关重要,其中快速干扰变化需要适应性和稳定性。本文提出一个框架,用于设计在宽角扇区上优化的稀疏阵列,实现接近最优的波束成形,从而在干扰角度范围内最大化信号与干扰加噪声比(SINR)。计算候选配置的完整数据相关矩阵,并应用基于角扇区的类别缩减策略合并由相同配置主导的相邻扇区,得到56个代表性类别。通过受控的上采样和下采样生成四个数据集变体,包括高样本数和低样本数、平衡和不平衡数据集,以系统评估数据集大小和类别分布对神经网络性能的影响。使用这些数据集训练和评估轻量级卷积神经网络(CNN)和更深的ResNet 50架构。结果表明,分类准确率高,ResNet 50达到97.3%,而大多数类别的SINR偏差保持在1%以下,即使对于接近法线的挑战性干扰角度,偏差也低于5%。所提出的方法实现了鲁棒的稀疏阵列选择,保持了强大的SINR性能,减少了不必要的重配置,并为实时认知感知和自适应干扰缓解提供了有效框架。

英文摘要

Efficient sparse array reconfigurability is essential for cognitive sensing in dynamic radio frequency environments, where rapid interference variations require both adaptability and stability. This work presents a framework for designing sparse arrays optimized over broad angular sectors, enabling near-optimal beamforming that maximizes the signal-to-interference-plus-noise ratio (SINR) across a range of interferer angles. Full data correlation matrices are computed for candidate configurations, and an angular-sector-based class reduction strategy is applied to merge adjacent sectors dominated by the same configuration, resulting in 56 representative classes. Controlled up- and down-sampling produce four dataset variants involving, high and low sample count, balanced and unbalanced datasets, to systematically evaluate the effects of dataset size and class distribution on neural network performance. A lightweight convolutional neural network (CNN) and a deeper ResNet 50 architecture are trained and evaluated using these datasets. Results demonstrate high classification accuracy, with ResNet 50 achieving up to 97.3%, while SINR deviations remain below 1% for most classes and below 5% even for challenging interference angles near broadside. The proposed approach enables robust sparse array selection, maintains strong SINR performance, reduces unnecessary reconfigurations, and provides an effective framework for real-time cognitive sensing and adaptive interference mitigation.

2606.06723 2026-06-08 eess.SP 新提交

Deep Learning Based Sparse Array Design with Pre-Steering for Adaptive Beamforming

基于预导向的深度学习稀疏阵列设计用于自适应波束形成

Ian Straub, Syed A Hamza

AI总结 提出利用卷积神经网络学习稀疏阵列配置,通过预导向策略避免针对每个源角度重新训练,实现快速重配置并最大化信干噪比,在动态环境中达到90%以上测试精度。

详情
Comments
Accepted for presentation at the IEEE Radar Conference 2026
AI中文摘要

本文研究了使用卷积神经网络(CNN)学习稀疏阵列配置,在变化的源和干扰角度下实现接近最优的波束形成。与传统的或基于凸优化的算法不同,所提出的深度学习方法能够在高度动态的传播环境中快速重配置稀疏阵列。本文考虑单个期望源和单个干扰信号位于任意角度,分析了固定和变化期望源方向两种情况。为避免针对每个可能的源角度重新训练,引入了阵列预导向策略,即网络仅在侧射方向训练,而测试输入被预导向以对齐侧射方向。为考虑实际不完美性,研究了预导向误差的影响,并采用了鲁棒的误差增强训练。该方法在训练过程中系统地引入小的、结构化的预导向扰动,使网络即使在角度不确定下也能保持高分类精度并最大化信干噪比(SINR)。结果表明,所提出的方法在广泛的源和干扰角度范围内实现了超过90%的测试精度,突显了其在动态环境中实时、鲁棒稀疏阵列配置的潜力。

英文摘要

This paper investigates the use of convolutional neural networks (CNNs) for learning sparse array configurations that achieve near-optimal beamforming under varying source and interference angles. Unlike conventional or convex optimization based algorithms, the proposed deep learning approach enables rapid reconfiguration of sparse arrays in highly dynamic propagation environments. The paper considers a single desired source and a single interference signal at arbitrary angles, analyzing scenarios with both fixed and varying desired source directions. To avoid retraining for each possible source angle, an array pre-steering strategy is introduced, whereby the network is trained only at broadside, while test inputs are pre-steered to align with the broadside direction. To account for practical imperfections, the effect of pre-steering errors is examined, and a robust error-augmented training is adopted. The approach systematically incorporates small, structured pre-steering perturbations during training, enabling the network to maintain high classification accuracy and maximize the signal-to-interference-plus-noise ratio (SINR) even under angular uncertainty. The results demonstrate that the proposed method achieves over 90% test accuracy across wide ranges of source and interference angles, highlighting its potential for real-time, robust sparse array configuration in dynamic environments.

2606.06672 2026-06-08 eess.SP 新提交

Variational Bayes Estimation for Affine-Precoded Superimposed Pilots in Partially Connected Dual-Wideband Tera-Hertz MU-MIMO Systems

部分连接双宽带太赫兹MU-MIMO系统中仿射预编码叠加导频的变分贝叶斯估计

Abhisha Garg, Suraj Srivastava, Aditya K. Jagannatham

AI总结 针对部分连接双宽带太赫兹MU-MIMO系统,提出两种仿射预编码模型,利用叠加导频和变分贝叶斯推断实现联合信道估计与稀疏结构学习,并进行了性能权衡分析。

详情
AI中文摘要

本工作构思了两种基于仿射预编码的系统模型:联合信道估计的公共预编码(CP-JCE)和用于解耦信道估计的用户特定预编码(USPDCE)。考虑到受双宽带影响的部分连接架构,我们通过结合吸收、反射和自由空间损耗,严格建模了每个用户对应子阵列的太赫兹(THz)多输入多输出(MIMO)信道。接下来,为了解决传统基于导频的信道估计带来的显著带宽开销,我们采用了叠加导频。在此基础上,我们构建了一个结构化稀疏信道模型,并开发了一种变分贝叶斯推断算法,该算法通过超参数推断联合估计信道系数并学习底层稀疏结构,从而在严重的模型不确定性下实现鲁棒且高精度的叠加导频信道估计。最后,我们比较了两种系统的结果,并提供了它们之间的权衡分析。

英文摘要

This work conceives two affine precoding based system models, common precoding with joint channel estimation (CP-JCE) and user-specific precoding for decoupled channel estimation (USPDCE). Considering a dual-wideband effected partially connected architecture, we rigorously model the terahertz (THz) multiple input multiple output (MIMO) channel for each subarray corresponding to each user by incorporating the absorption, reflection, and freespace losses. Next, to address the significant bandwidth overhead associated with conventional pilot-based channel estimation, we employ superimposed pilots. Building on this, we formulate a structured sparse channel model and develop a variational Bayesian inference algorithm that jointly estimates the channel coefficients and learns the underlying sparsity structure through hyperparameter inference, thereby enabling robust and high-precision superimposed pilotbased channel estimation under severe model uncertainty. Lastly, we compare our results for both systems and provide a trade-off analysis between them.

2606.06640 2026-06-08 eess.SP 新提交

SEMIKHORN: Globally balanced affinities for mmWave Localization in MU mMIMO systems

SEMIKHORN:用于MU mMIMO系统中毫米波定位的全局平衡亲和度

Abhisha Garg, Raghav Shukla, Suraj Srivastava, Aditya K. Jagannatham

AI总结 提出SEMIKHORN框架,利用t-SNEkhorn的全局平衡相似性进行半监督信道图构建,通过融合分布式基站的局部不相似矩阵实现毫米波定位,在模拟环境中以少于15%的标记样本达到6.86%的平均定位误差。

详情
AI中文摘要

本工作提出了SEMIKHORN,一种用于毫米波定位的半监督信道图(CC)框架,它利用t-SNEkhorn——t分布随机邻域嵌入(t-SNE)的双随机变体,该变体利用熵最优传输来构建成对相似性。与标准t-SNE(对每个数据点独立归一化亲和度)不同,t-SNEkhorn生成全局平衡的相似性,确保一致的邻域表示。我们考虑配备多天线的分布式基站(BS)的无线网络,每个基站从信道状态信息(CSI)构建局部不相似矩阵。然后将这些局部不相似矩阵融合以获得单个全局不相似矩阵,通过流形学习处理,将用户嵌入到几何地图上。在模拟室外环境中评估性能,并采用贝叶斯优化对框架超参数进行优化,以最小化平均定位误差(MLE)。实验结果表明,所提出的框架在半径100m的圆形区域内实现了6.86%的MLE,所需标记CSI样本少于15%。

英文摘要

This work conceives SEMIKHORN, a semisupervised channel charting (CC) framework for mmWave localization, which leverages t-SNEkhorn, a doubly stochastic variant of t-distributed Stochastic Neighbor Embedding (t-SNE) that utilizes entropic optimal transport to construct pairwise similarities. Unlike standard t-SNE, which normalizes affinities independently for each data point, t-SNEkhorn generates globally balanced similarities ensuring consistent neighborhood representation. We consider wireless networks with distributed base stations (BSs) equipped with multiple antennas, where each BS constructs a local dissimilarity matrix from the channel state information (CSI). These local dissimilarity matrices are then fused to obtain a single global dissimilarity matrix, which is processed through manifold learning to embed users onto a geometric map. The performance is evaluated in a simulated outdoor environment, and Bayesian optimization is employed on the framework hyperparameters to minimize the mean localization error (MLE). Experimental results demonstrate that the proposed framework achieves an MLE of 6.86% in a circular vicinity of radius 100m, requiring less than 15% of labeled CSI samples.

2606.06512 2026-06-08 eess.IV 新提交

Dilated Symmetric Difference for Binary Image Comparison

二值图像比较的膨胀对称差

Sharon Urieli

AI总结 提出膨胀对称差算子,用于在残差对齐误差有界时有效检测二值图像差异。

详情
Comments
4 pages, 4 figures. Also archived at https://doi.org/10.5281/zenodo.20329139
AI中文摘要

两个二值图像的比较用数学形态学来表述。引入了一个新算子,即膨胀对称差。结果表明,只要残差对齐误差在指定范围内,膨胀对称差就能有效检测二值图像之间的差异。

英文摘要

The comparison of two binary images is formulated in terms of mathematical morphology. A new operator, the dilated symmetric difference, is introduced. It is shown that the dilated symmetric difference effectively detects differences between binary images, provided that the residual alignment error is within specified bounds.

2606.07466 2026-06-08 stat.ME 新提交

Covariance-Adaptive Residualization and Stagewise Calibration for Dependent Multiple Testing

协方差自适应残差化与分步校准用于相依多重检验

Prasenjit Ghosh, Arijit Chakrabarti

AI总结 针对任意协方差相依下的多元高斯均值同时假设检验问题,提出一种结合协方差自适应残差化与广义分步临界常数的分步校准程序,在降低计算复杂度的同时实现更优的信号恢复和错误控制。

详情
AI中文摘要

本文研究在任意协方差相依下多元高斯均值的同步假设检验。基于Cohen等人(2009)的最大残差向下(MRD)程序,我们探索了一种基于Gavrilov等人(2009)的广义分步临界常数的新校准策略。所得程序保留了MRD的协方差自适应残差化机制,同时将原始模型依赖的阈值设定替换为简单的分步校准规则。由于所提程序属于Ghosh和Chakrabarti(2026)研究的单调残差基分步程序类,其可容许性直接由其理论得出。我们还推导了MRD残差统计量的替代表示,将所有活动残差通过单个活动精度矩阵表达,大幅降低了计算复杂度。在广泛相依结构下的模拟研究表明,所提方法通常比几种广泛使用的边际检验程序获得更低的归一化误分类风险。在几种结构化相依模型下,该程序还表现出强大的信号恢复能力,实现了接近名义水平的错误发现率、极小的错误非发现率、接近1的功效以及接近预期真实信号数的平均拒绝数。这些发现提供了经验证据,表明协方差自适应残差化和分步校准在相依多重检验中可能以高度有利的方式相互作用。

英文摘要

In this paper, we study simultaneous hypothesis testing for multivariate Gaussian means under arbitrary covariance dependence. Building on the Maximum Residual Down (MRD) procedure of Cohen et al. (2009), we investigate a new calibration strategy based on the generalized step-down critical constants of Gavrilov et al. (2009). The resulting procedure retains the covariance-adaptive residualization mechanism of MRD while replacing the original model-dependent threshold specification with a simple stagewise calibration rule. Since the proposed procedure belongs to the class of monotone residual-based step-down procedures studied by Ghosh and Chakrabarti (2026), its admissibility follows directly from their theory. We also derive alternative representations of the MRD residual statistics that express all active residuals through a single active precision matrix, substantially reducing computational complexity. Simulation studies across a broad range of dependence structures show that the proposed methodology often achieves a lower normalized misclassification risk than several widely used marginal testing procedures. Under several structured dependence models, the procedure also exhibits strong signal-recovery behavior, attaining false discovery rates near the nominal level, extremely small false non-discovery rates, powers approaching one, and average numbers of rejections close to the expected number of true signals. These findings provide empirical evidence that covariance-adaptive residualization and stagewise calibration may interact in a highly favorable manner for dependent multiple testing.

2606.07447 2026-06-08 stat.ME math.ST stat.TH 新提交

Community Detection on a Randomly Growing Network

随机增长网络上的社区检测

Jianxiang Wang, Min Xu

AI总结 针对非随机块模型的马尔可夫随机网络,提出两阶段算法,先分类高度节点再扩展社区标签,理论证明无法一致恢复所有节点但可恢复中心子集。

详情
Comments
69 pages, 16 figures, 7 tables
AI中文摘要

我们在随机块模型框架之外研究马尔可夫随机网络上的社区检测。具体来说,我们考虑一个随机网络增长过程,该过程生成$K$个独立的优先连接树,并通过Erdős–Rényi边连接它们,使得每棵树代表一个社区,每个节点继承其所属树的标签。该模型能够产生许多在SBM下不太可能出现的真实网络特征,例如幂律度分布以及链和枢纽的存在。仅给定最终图,对增长过程一无所知,我们试图恢复节点的未观察到的社区成员身份。我们首先证明任何算法都无法一致地恢复所有节点的社区标签。然而,我们设计了算法,这些算法能够证明地恢复中心节点子集的社区标签,对于节点中心性的几种不同概念,例如到达时间或度数。我们的过程包括两个阶段,在第一阶段,我们对高度节点进行分类,然后在第二阶段,将社区分配扩展到剩余顶点。数值实验和合著网络上的真实数据应用证明了我们提出方法的有效性。

英文摘要

We study community detection on Markovian random networks outside of the Stochastic Block Model (SBM) framework. Specifically, we consider a random network growth process which generates $K$ separate preferential attachment trees and connects them with Erdős--Rényi edges, so that each tree represents a community and each node inherits the label of the tree to which it belongs. This model is able to produce many features of real world networks that are improbable under SBM, such as power law degree distribution and the existence of chains and hubs. Given only the final graph, without any knowledge of the growth process, we seek to recover the unobserved community membership of the nodes. We first prove that it is impossible for any algorithm to consistently recover the community label of all the nodes. However, we design algorithms which are provably able to recover the community labels of subsets of central nodes, for several different notions of node centrality such as arrival time or degree. Our procedure consists of two stages where, in the first stage, we classify high degree nodes and then, in the second stage, extend the community assignments to the remaining vertices. Numerical experiments and a real data application on a coauthorship network demonstrate the effectiveness of our proposed approach.

2606.07406 2026-06-08 stat.ME 新提交

Deriving the Variance-Minimizing Design for Standard Addition via c-Optimality

通过c-最优性推导标准加入法的方差最小化设计

Gerhard Gössler, Vera Hofer, Walter Goessler

AI总结 本文通过c-最优性理论,针对线性响应下测量误差非递减的情况,证明了标准加入法的最优设计为两点设计,并探讨了测量分配、浓度范围及加权回归的影响。

详情
AI中文摘要

关于标准加入法最优设计的知识似乎分散在文献中,并且至少部分仅存在于数学文献中,对于不熟悉设计最优性理论的读者来说不易快速获取。因此,本工作的想法是总结分析文献中已有的内容,并在需要时将最优性理论的相关结果应用于标准加入法的特殊情况。研究表明,对于非递减的测量误差(例如,随着分析物浓度增加,误差恒定或线性或二次增加),在线性响应的情况下,最优设计是两点设计,无论测量误差方差的具体行为如何。此外,证明测量的最优分配取决于具体设置,这意味着测量的最优分布可能显著偏离50:50的比例。还研究了范围(即最大添加浓度)如何影响结果。最后但同样重要的是,讨论了加权回归的应用问题,并表明,与使用两个以上加标浓度的设计相比,当使用两点设计时,无需加权即可实现最优结果。虽然重点在于浓度估计的精度,但也研究了偏差的影响。

英文摘要

Knowledge about optimal designs for standard addition seems to be scattered among literature and is also, at least partially, only available in mathematical literature that is not quickly accessible for readers not skilled in the field of design optimality theory. Therefore, the idea for this work was to summarize what is already available in analytical literature and to apply the respective results from optimality theory, where needed, to the special case of standard addition. It is shown, for measurement errors that are non-decreasing, e.g., are constant or increase linearly or quadratically with increasing analyte concentration, that the optimal design in the case of a linear response is a two-point design irrespective of the particular behavior of measurement error variance. In addition, it is demonstrated that the optimal allocation of measurements depends on the concrete setting, which means that the optimal distribution of measurements may deviate significantly from a 50:50 ratio. It is also investigated how the range, i.e., the largest added concentration influences the result. Last but not least, also the question of applying weighted regression is discussed and it is shown, that, in contrast to designs using more than two spiked concentrations, no weighting is necessary to achieve optimal results, when a two-point design is used. While the focus lies on the precision of the concentration estimate also the implications for the bias are investigated.

2606.07373 2026-06-08 stat.ME 新提交

Learning Collapsed Patterns in Compositional Data: A Bayesian Heterogeneous Relative-Shift Approach

成分数据中的塌陷模式学习:一种贝叶斯异质性相对位移方法

Maoran Xu, Guanyu Hu

AI总结 提出贝叶斯异质性相对位移回归模型,联合学习潜在聚类和简约效应结构,通过投影收缩先验和有限混合先验实现,并开发了嵌入确定性替代塌缩算子的混合MCMC算法。

详情
AI中文摘要

相对位移回归通过量化当质量从一个成分重新分配到另一个成分时响应如何变化,为建模成分协变量提供了一个原则性框架。然而,许多新兴的成分数据问题超出了这一经典设置,涉及高维预测变量和跨潜在子群体变化的回归效应。这种复杂性对现有方法构成了双重挑战:恢复潜在聚类结构,同时在每个聚类内实现降维。我们提出了一种贝叶斯异质性相对位移回归模型,该模型联合学习潜在聚类和简约效应结构。在方法论上,我们将基于投影的收缩先验(在可识别对比上诱导混合成分内的精确系数绑定)与有限混合先验(推断聚类数量)相结合。在计算上,我们开发了一种可扩展的混合MCMC算法,该算法在NUTS内嵌入了一个确定性替代塌缩算子。在理论上,我们建立了潜在划分和聚类特定效应结构的后验一致性。模拟证实了准确的恢复和强大的预测性能,对跨国宏观经济数据和空间转录组学的应用证明了该方法的可解释性和实用性。

英文摘要

Relative-shift regression provides a principled framework for modeling compositional covariates by quantifying how the response changes when mass is reallocated from one component to another. Yet many emerging compositional data problems extend beyond this classical setting, involving high-dimensional predictors and regression effects that vary across latent subpopulations. This complexity poses a dual challenge unmet by existing methods: recovering latent cluster structure while simultaneously achieving dimension reduction within each cluster. We propose a Bayesian heterogeneous relative-shift regression model that jointly learns latent clusters and parsimonious effect structures. Methodologically, we combine a projection-based shrinkage prior on identifiable contrasts, which induces exact coefficient ties within mixture components, with a mixture of finite mixtures prior that infers the number of clusters. Computationally, we develop a scalable hybrid MCMC algorithm that embeds a deterministic surrogate collapse operator within NUTS. Theoretically, we establish posterior consistency for both the latent partition and cluster-specific effect structures. Simulations confirm accurate recovery and strong predictive performance, and applications to cross-country macroeconomic data and spatial transcriptomics demonstrate the method's interpretability and practical utility.

2606.07364 2026-06-08 stat.AP 新提交

S2A3: Thompson Sampling and Stochastic Exposure Control for High-Stakes CATs

S2A3: 高风险CAT的汤普森采样与随机曝光控制

James Sharpnack, Alexander Tsigler, J. R. Lockwood, Steven Nydick, Alina A. von Davier

AI总结 提出S2A3框架,通过汤普森采样优化项目选择、软评分处理不确定性、随机曝光控制平衡效率与安全,在高风险自适应测试中实现快速项目校准并保持评分可靠性。

详情
AI中文摘要

高风险计算机自适应测试(CAT)需要持续供应已校准的项目,然而传统的项目试测过程缓慢、昂贵且操作风险高。我们引入了S2A3框架——软评分(S2)与自适应自适应管理(A3)——将项目校准和测试管理统一为单一的在线过程。汤普森采样通过从每个项目的后验分布中抽取临时参数,并选择最大化期望Fisher信息的项目来增强项目选择,自然地将不确定项目分配给信息量大的考生,同时保持测量精度。软评分整合了参数不确定性,使得未完全校准的项目对能力估计产生适当减弱的影响。Sympson-Hetter曝光控制的随机变体通过可调温度参数和项目特定权重,平衡测量效率与题库安全。我们在多邻国英语测试的“是/否词汇”和“语境词汇”任务上验证了S2A3,结果表明即使在冷启动项目占活跃题库很大比例的情况下,也能实现快速项目校准并保持评分可靠性。

英文摘要

High-stakes computerized adaptive tests (CATs) require a continuous supply of calibrated items, yet traditional item piloting is slow, expensive, and operationally hazardous. We introduce the S2A3 framework -- Soft Scoring (S2) and Adaptive Adaptive Administration (A3) -- which unifies item calibration and test administration into a single online process. Thompson sampling enhances item selection by drawing provisional parameters from each item's posterior distribution and selecting the item maximizing expected Fisher information, naturally routing uncertain items to informative test-takers while maintaining measurement precision. Soft scoring integrates over parameter uncertainty so that incompletely calibrated items exert appropriately attenuated influence on ability estimates. A stochastic variant of Sympson-Hetter exposure control balances measurement efficiency against bank security via a tunable temperature parameter and item-specific weights. We validate S2A3 on Yes/No Vocabulary and Vocabulary-in-Context tasks from the Duolingo English Test, demonstrating rapid item calibration and preserved scoring reliability even when cold-start items constitute a significant fraction of the active pool.

2606.07213 2026-06-08 stat.ME math.ST stat.ML stat.TH 新提交

Principal Component Analysis for Multivariate Extremes

多元极值的主成分分析

Dan Cooley, Anne Sabourin, Troy Wixson

AI总结 提出一种针对多元极值数据的降维方法,通过主成分分析保留极值相关信息,解决高维极值分析中的维度灾难问题。

详情
Comments
Chapter 11 in "Handbook of of Statistic of Extremes", edited by Miguel de Carvalho, Raphaël Huser, Philippe Naveau, and Brian Reich
AI中文摘要

本章探讨在保留与多元极值分析相关的关键信息的同时,降低数据维度的各种方法。

英文摘要

This chapter explores ways to reduce the dimensionality of the data while preserving key information relevant to the analysis of multivariate extreme values.

2606.07174 2026-06-08 stat.ME 新提交

One-step Outcome Imputation: An Alternative to Multiple Imputation

一步结果插补:多重插补的替代方案

Andreas Nordland, Klaus K. Holst, David Redek, Christian B. Pipper, Aske T. Iversen

AI总结 针对随机对照试验中的缺失结局,提出一种基于影响函数的一步估计方法,避免多重插补中Rubin规则的标准误估计失效问题,并简化计算。

详情
AI中文摘要

随机对照试验中的缺失结局通常通过多重插补(MI)处理。Rubin规则常用于估计标准误,但对于某些常用程序(如基于参考的插补)可能无法提供有效的标准误估计。我们提出一种一步替代方案,通过明确目标给定插补模型所隐含的处理效应,并利用其影响函数构建该处理效应的有效一步估计量。与Rubin规则不同,该方法产生渐近有效的推断。此外,所提方法规避了MI的随机成分和计算负担。我们通过一系列插补模型(包括基于参考的插补和依赖于并发事件的插补)的示例来说明该方法。

英文摘要

Missing outcomes in randomized controlled trials are often handled by multiple imputation (MI). Rubin's rules are routinely used to estimate standard errors but can fail to provide valid standard error estimates for some commonly used procedures, such as reference-based imputation. We propose a one-step alternative by explicitly targeting the treatment effect implied by a given imputation model and constructing an efficient one-step estimator for that treatment effect via its influence function. Unlike Rubin's rules, this approach yields asymptotically valid inference. Moreover, the proposed method circumvents the stochastic component and computational burden of MI. We illustrate the approach with examples spanning a range of imputation models, including reference-based imputation and intercurrent-event-dependent imputation.

2606.07169 2026-06-08 stat.ME math.ST stat.TH 新提交

When can a posterior predictive check identify the learning rate? Exact degeneracy in Gaussian models and implications for Generalised Bayesian Inerence

后验预测检查何时能识别学习率?高斯模型中的精确退化及其对广义贝叶斯推断的影响

Nam Anh Le

AI总结 本文通过精确有限样本分析,揭示了在高斯线性模型中,基于后验预测检查的学习率选择器存在退化现象,即p值不依赖于学习率或数据,导致选择器失效,并提出了数据无关的预筛选诊断方法。

详情
Comments
6 pages, 4 figures
AI中文摘要

广义贝叶斯推断通过学习率$\eta$对似然进行退火以缓解模型误设定,而$\eta$的选择至关重要。Zafar和Nicholls (2024) 提出通过后验预测检查(PPC)选择$\eta$:选择使对数似然PPC $p$值不被拒绝的最小$\eta$。本文给出了该选择器在高斯线性模型上的精确有限样本分析。在方差已知且使用平坦先验时,对于每个$\eta$,PPC $p$值等于$P(\chi^2_n > \mathrm{RSS}/\sigma_0^2)$,因此选择器对$\eta$不变;在方差误设定下,它是双侧非识别的。在方差未知且使用参考先验时,$p$值仅依赖于$(n,d,\eta)$,而不依赖于实际数据或数据生成过程。因此,选择器的输出在观察到任何数据之前就已固定,通常会坍缩到最小的网格值,这会导致过度退火并相对于留出选择扩大预测区间。该现象是高斯尺度-位置族和参考先验特有的枢轴性质;在信息先验下消失。这些结果界定了选择器的适用范围,识别了它无法识别学习率的典型类别,并激发了一种廉价、无数据的预筛选诊断方法。

英文摘要

Generalised Bayesian inference tempers the likelihood by a learning rate $η$ to mitigate model misspecification, and the choice of $η$ is consequential. Zafar and Nicholls (2024) proposed selecting $η$ by a posterior predictive check (PPC): one chooses the smallest $η$ at which a log-likelihood PPC $p$-value is not rejected. An exact, finite-sample analysis of this selector on the Gaussian linear model is given. With known variance and a flat prior, the PPC $p$-value equals $P(χ^2_n > \mathrm{RSS}/σ_0^2)$ for every $η$, so the selector is $η$-invariant; under variance misspecification it is two-sided non-identifying. With unknown variance and the reference prior, the $p$-value depends only on $(n,d,η)$ and not on the realised data or the data-generating process. Consequently the selector's output is fixed before any data are seen, typically collapsing to the smallest grid value, which over-tempers and inflates predictive intervals relative to held-out selection. The phenomenon is a pivotality property specific to the Gaussian scale--location family and the reference prior; it disappears under informative priors. These results delineate the selector's scope, identify a canonical class on which it cannot identify the learning rate, and motivate a cheap, data-free pre-screening diagnostic.

2606.07062 2026-06-08 stat.CO cs.MS 新提交

CATEKAPPA: An R Shiny Application for Design and Analysis of Consistency Tests Based on the Kappa Statistic for Categorical Responses

CATEKAPPA:基于Kappa统计量进行分类响应一致性检验设计与分析的R Shiny应用

Zheng Gai, Li Xincheng, Jiang Wangyingjie, Zhao Panwei

AI总结 针对分类数据一致性检验中样本量确定和Kappa系数计算两大难题,开发了集成样本量规划与一致性分析的R Shiny应用CATEKAPPA,支持Cohen's、Fleiss'和Light's Kappa,并提供自动解释。

详情
Comments
10 pages, 4 figures; This open-source R package CATEKAPPA is available on CRAN at https://CRAN.R-project.org/package=catekappa, source code repository is hosted at https://github.com/satellite837/catekappa. Manuscript planned for submission to Journal of Statistical Software (JSS). Supplementary R package source code uploaded as ancillary file
AI中文摘要

Kappa统计量是分类数据中衡量评估者间一致性的最广泛使用的指标。尽管其流行,应用研究人员常遇到两大障碍:(i) 确定达到给定功效下期望一致性水平所需的样本量,以及(ii) 计算合适的Kappa系数并进行正确解释。现有的R包如irr和kappaSize提供了这些功能,但需要编程技能且缺乏集成的用户友好界面。我们提出CATEKAPPA,一个R包,通过将样本量规划(通过kappaSize)和一致性分析(通过irr)结合到单个基于Shiny的Web应用中,弥合了这一差距。该包支持两位评估者的Cohen's kappa、三位或更多评估者的Fleiss' kappa以及Light's kappa,并使用Landis & Koch量表提供自动解释。用户可以启动交互式图形界面或使用命令行函数进行脚本编写。该包在CRAN上免费提供。

英文摘要

The kappa statistic is the most widely used measure of inter-rater agreement for categorical data. Despite its popularity, applied researchers often encounter two major hurdles: (i) determining the sample size required to achieve a desired level of agreement with given power, and (ii) computing appropriate kappa coefficients with proper interpretation. Existing R packages such as irr and kappaSize provide these functionalities but require programming skills and lack an integrated, user-friendly interface. We present CATEKAPPA, an R package that bridges this gap by combining sample size planning (via kappaSize) and agreement analysis (via irr) into a single Shiny-based web application. The package supports Cohen's kappa for two raters, Fleiss' kappa for three or more raters, and Light's kappa, and provides automatic interpretation using the Landis & Koch scale. Users can either launch an interactive graphical interface or use command-line functions for scripting. The package is freely available on CRAN.

2606.07052 2026-06-08 stat.ME 新提交

Influence of continuous predictor modelling methods on prediction stability in clinical prediction model development: an empirical comparison using real clinical data

连续预测因子建模方法对临床预测模型开发中预测稳定性的影响:基于真实临床数据的实证比较

Phichayut Phinyo, Pakpoom Wongyikul, Noraworn Jirattikanwong, Natthanaphop Isaradech, Wuttipat Kiratipaisarl, Suppachai Lawanaskol, Noppadon Seesuwan, Wachiranun Sirikul

AI总结 本研究利用真实临床数据比较六种连续变量建模方法(二分法、三分法、线性项、二次项、多变量分数多项式、极端梯度提升)对预测稳定性的影响,发现线性项在较小样本中更稳定,而复杂方法需要更大样本。

详情
Comments
30 pages
AI中文摘要

背景与目的:预测稳定性在可靠临床预测模型开发中日益受到重视,但连续预测因子建模选择的影响尚不明确。本研究探讨了连续预测因子建模方法对预测稳定性的影响。方法:我们使用包含19,418名急诊患者的真实临床数据集,创建了从437到8,739名患者的五种样本量场景。比较了六种方法:中位数二分法(DIC)、三分法(TER)、线性项(LIN)、二次项(QUA)、多变量分数多项式(MFP)和极端梯度提升(XGB)。使用基于bootstrap的框架评估预测稳定性。通过内部验证估计了经乐观校正的AUC和校准度。当至少90%的个体预测的平均绝对预测误差(MAPE)<=5%时,认为方法稳定。结果:稳定性随样本量增加而变化,且因方法而异。在n=437时,没有方法达到稳定性标准;LIN最稳定,其次是DIC。在n=874时,DIC和LIN实现了稳定预测且校准度相似,尽管DIC的AUC较低。在n=1,748时,QUA达到稳定,而MFP和XGB未达到。在n=3,496和n=8,739时,所有方法均达到稳定。LIN、QUA、MFP和XGB通常比DIC和TER具有更高的AUC,而XGB显示出最高的AUC但持续存在校准偏差。结论:连续预测因子建模方法似乎影响预测稳定性。LIN从基础样本量开始即实现稳定预测,而QUA、MFP和XGB需要更大样本。尽管XGB具有高区分度,但校准问题持续存在。这些发现表明,在较小数据集中,更简单的方法(尤其是LIN)可能提供更稳定的预测。

英文摘要

Background and objective: Prediction stability is increasingly recognised as important for reliable clinical prediction model development, but the effect of continuous predictor modelling choices is unclear. This study examined how approaches to modelling continuous predictors influence prediction stability. Methods: We used a real clinical dataset of 19,418 emergency department patients to create five sample size scenarios ranging from 437 to 8,739 patients. Six methods were compared: dichotomisation at the median (DIC), tertile categorisation (TER), linear terms (LIN), quadratic terms (QUA), multivariable fractional polynomials (MFP), and extreme gradient boosting (XGB). Prediction stability was evaluated using a bootstrap-based framework. Optimism-corrected AUC and calibration were estimated through internal validation. A method was considered stable when at least 90% of individual predictions had a mean absolute prediction error (MAPE) <=5%. Results: Stability increased with sample size and varied by method. At n = 437, no method met the stability criterion; LIN was the most stable, followed by DIC. At n = 874, DIC and LIN achieved stable predictions with similar calibration, although DIC had lower AUC. At n = 1,748, QUA achieved stability, whereas MFP and XGB did not. At n = 3,496 and n = 8,739, all methods achieved stability. LIN, QUA, MFP, and XGB generally had higher AUCs than DIC and TER, while XGB showed the highest AUC but persistent miscalibration. Conclusion: Continuous predictor modelling methods appeared to influence prediction stability. LIN achieved stable predictions from the base sample size onwards, whereas QUA, MFP, and XGB required larger samples. Although XGB showed high discrimination, calibration concerns persisted. These findings suggest that, in smaller datasets, simpler approaches, particularly LIN, may provide more stable predictions.

2606.07014 2026-06-08 stat.AP 新提交

Networked Spatial Effects in European Electricity Price Forecasting

欧洲电价预测中的网络空间效应

Sultan Mahmud Chomon, Florian Ziel

AI总结 针对欧洲竞价区高度互联的特点,提出网络时空模型(NSTM),利用度量图映射空间信息覆盖,在39个竞价区的高分辨率流式预测中,该模型优于传统孤立模型,揭示了网络结构在跨市场信息传播中的关键作用。

详情
AI中文摘要

由于欧洲竞价区通过物理输电线路高度互联,空间影响通过网络在相邻节点间传播。这反映在欧洲竞价区的日前电价中,因为拍卖算法也使用每个竞价区地理边界之外的信息。为了捕捉这种互联如何影响相邻竞价区的电价,我们使用度量图,通过定义良好的邻域度量来映射信息的空间覆盖。我们提出了网络时空模型(NSTM),它将不规则的空间节点映射到有序网络中,从而能够系统地纳入邻域信息。我们在覆盖大部分欧洲电力市场的39个竞价区中,以高分辨率流式预测设置实现了NSTM。该模型利用自回归、跨小时和季节效应,以及燃料和排放价格和基本面的日前预测,作为互联信息来预测每个竞价区的日前价格。本文呈现的一项全欧洲研究表明,NSTM始终优于传统的孤立纯局部模型。本文提供了一个框架,展示了网络结构在跨互联市场传播信息中的关键作用及其对日前电价预测的重大影响。

英文摘要

As European bidding zones are highly interconnected by physical transmission lines, spatial influences propagate across neighboring nodes through a network. It is reflected in the day-ahead electricity prices across European bidding zones, as the auction algorithm also uses information beyond each bidding zone's geographic boundary. To capture how this interconnection affects the electricity prices in neighboring bidding zones, we have used a metric graph to map the spatial coverage of information using a well-defined neighborhood measure. We propose the Networked Spatio-Temporal Model (NSTM), which maps irregular spatial nodes into an ordered network, enabling the systematic incorporation of neighborhood information. We implement the NSTM across 39 bidding zones covering the majority of European electricity markets in a high-resolution, streaming-forecasting setup. The model uses autoregressive, cross-hour, and seasonal effects, along with fuel and emission prices and day-ahead forecasts of fundamentals, as interconnected information to predict the day-ahead prices for each bidding zone. A Europe-wide study presented in this paper shows that the NSTM consistently outperforms traditional island-based pure local models. This paper provides a framework that demonstrates the critical role the networked structure plays in propagating information across interconnected markets and its vast implications for day-ahead electricity price forecasting.