arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2605.10879 2026-05-12 cs.IT cs.CR cs.NI eess.SP math.IT

Private Information Retrieval With Arbitrary Privacy Requirements for Graph-Based Storage

Mohamed Nomeir, Shreya Meel, Sennur Ulukus

AI总结本文重新定义了私有信息检索（PIR）问题中的隐私要求，以支持更灵活的隐私需求。研究聚焦于基于图结构的存储系统中的PIR问题，允许每个服务器对隐私消息集合有不同且任意的设定，而非要求所有消息对所有服务器都私有。针对路径图和环形图两种具体存储结构，作者分析了多种隐私设置，并特别关注基于服务器邻域范围的隐私集合，从而实现了从局部PIR到标准图复制PIR的平滑过渡，并推导了相关场景下的容量界限。

2605.10872 2026-05-12 cs.IT cs.CR cs.NI eess.SP math.IT

Local Private Information Retrieval: A New Privacy Perspective for Graph-Based Replicated Systems

Shreya Meel, Mohamed Nomeir, Sennur Ulukus

AI总结本文重新定义了多服务器图复制私有信息检索（PIR）系统中的隐私概念，提出了一种新的隐私保护模型——局部用户隐私，即用户仅需隐藏其请求的消息索引，当且仅当该服务器存储了对应消息。研究核心在于分析这种局部隐私下PIR的通信效率提升，并建立了相应的容量理论。研究发现，在由不同图组成的离散图联合中，局部PIR容量具有乘法性优势，且对连通图提出了下界分析，特别地，推导出了环图和奇数顶点路径图的精确局部PIR容量。

2605.10822 2026-05-12 cs.LG eess.SP

Benchmarking Sensor-Fault Robustness in Forecasting

Alexander Windmann, Philipp Wittenberg, Gianluca Manca, Marcel Dix, Jens U. Brandt, Oliver Niggemann

AI总结该论文提出了一种名为SensorFault-Bench的基准测试框架，用于评估预测模型在传感器故障情况下的鲁棒性。研究通过引入标准化的故障严重性模型和多个真实数据集，系统评估了不同预测架构和鲁棒性改进方法在多种故障场景下的表现，揭示了传统基于干净数据误差的模型排名可能与实际故障场景下的性能存在显著差异。该框架还提供了开源代码和数据接口，支持后续研究在统一协议下进行扩展和比较。

详情

英文摘要

Cyber-physical system (CPS) forecasting models depend on sensor streams with noisy, biased, missing, or temporally misaligned readings, yet standard forecasting evaluation often selects models by nominal error without showing whether they remain robust under such faults. We introduce SensorFault-Bench, a shared CPS-grounded sensor-fault stress-test protocol for evaluating forecasting architectures and robustness-improvement methods, and an operational taxonomy organizing the method comparison. Across four real-world datasets and eight scored scenarios governed by a standardized severity model, it reports worst-scenario degradation, clean mean squared error (MSE), and worst-scenario fault-time MSE, separating relative robustness from absolute error. A disjoint fault-transfer split lets explicit fault-training methods train on adjacent fault families while evaluation uses separate benchmark scenarios. Empirically, forecasting architectures favored by clean MSE can degrade sharply under faults, and clean-MSE rankings can disagree with worst-scenario fault-time error rankings. Chronos-2, the evaluated zero-shot foundation-model representative, matches or trails the last-value naive forecaster in clean MSE on the two single-target datasets and has the largest worst-scenario degradation on ETTh1 and Traffic, where all channels are forecast targets. For the evaluated robustness-improvement method set, paired deltas show selective degradation reductions: projected gradient descent adversarial training and randomized training lead where value faults dominate observed degradation, while fault augmentation leads where availability faults dominate. SensorFault-Bench provides open-source code, documented data access, and reproduction and extension guides, so new datasets, architectures, and robustness-improvement methods can be evaluated under the same CPS sensor-fault robustness protocol.

URL PDF HTML ☆

赞 0 踩 0

2605.10772 2026-05-12 cs.CV cs.AI eess.IV

Towards a Large Language-Vision Question Answering Model for MSTAR Automatic Target Recognition

David F. Ramirez, Tim L. Overman, Kristen Jaskie, Marv Kleine, Andreas Spanias

AI总结本文研究了将大语言-视觉模型（LLVM）应用于合成孔径雷达（SAR）图像的目标识别任务，特别是在军事车辆自动目标识别（ATR）中的应用。通过构建基于MSTAR公开数据集的训练与评估基准，并引入描述性文本和问答对，作者探索了LLVM在遥感图像描述和视觉问答（VQA）中的性能。实验表明，使用参数高效的微调方法，模型在识别细粒度目标特征方面达到了98%的准确率，为机器辅助的军事和情报遥感目标识别提供了新的技术路径。

Comments Accepted to SPIE Defense + Commercial Sensing, Automatic Target Recognition XXXV

详情

DOI: 10.1117/12.3053859
Journal ref: Proc. SPIE 13463, Automatic Target Recognition XXXV, 134630D (29 May 2025);

英文摘要

Large language-vision models (LLVM), such as OpenAI's ChatGPT and GPT-4, have gained prominence as powerful tools for analyzing text and imagery. The merging of these data domains represents a significant paradigm shift with far-reaching implications for automatic target recognition (ATR). Recent transformer-based LLVM research has shown substantial improvements for geospatial perception tasks. Our study examines the application of LLVM to remote sensing image captioning and visual question-answering (VQA), with a specific focus on synthetic aperture radar (SAR) imagery. We examine newly published LLVM methods, including CLIP and LLaVA neural network transformer architectures. We have developed a work-in-progress SAR training and evaluation benchmark derived from the MSTAR Public Dataset. This has been extended to include descriptive text captions and question-answer pairs for VQA tasks. This challenge dataset is designed to push the boundaries of an LLVM in identifying nuanced ATR details in SAR imagery. Utilizing parameter-efficient fine-tuning, we train an LLVM method to identify fine-grained target qualities at 98% accuracy. We detail our data setup and experiments, addressing potential pitfalls that could lead to misleading conclusions. Accurately identifying and differentiating military vehicle types in SAR data poses a critical challenge, especially under complex environmental conditions. Mastering this target recognition skill may require a human analyst months of training and years of practice. This research represents a unique effort to apply LLVM to SAR applications, advancing machine-assisted remote sensing ATR for military and intelligence contexts.

URL PDF HTML ☆

赞 0 踩 0

2605.10745 2026-05-12 eess.SP

How Time-Sensitive are IoBNT Networks? An Age of Information Perspective for In-Body Monitoring

Jorge Torres Gómez

AI总结本文从信息新鲜度（AoI）的角度，研究了体内纳米传感器网络（IoBNT）在疾病监测中的时间敏感性。通过构建包含心血管生理特性和纳米通信信道的马尔可夫模型，评估了网络在样本生成、传输和交付过程中的信息更新能力。研究发现，在合理假设下，监测设备可在数十秒内接收到新鲜信息，表明该网络适用于组织层面的疾病监测，如细菌感染，但需更高效的架构来监测更快速的细胞级过程。

详情

Journal ref: Habilitation Thesis in Electrical Engineering, TU Berlin, Germany, 2026

英文摘要

This thesis develops a theoretical framework to evaluate the monitoring capability of IoBNT networks. We consider a scenario in which nanosensors passively flow in the bloodstream and detect biomarkers associated with potential diseases, reporting their detections to external gateways on the skin that host a monitoring device. The nanosensors thus realize an artificial point-to-point communication channel between the disease region and the monitor: some packets reach the destination directly, while others are lost through vessel paths that bypass the gateway. We evaluate the network's monitoring capability over this artificial channel using the \ac{AoI} concept, which jointly integrates sample generation (at the disease region), carrying (nanosensor travel through vessels), and delivery (nanosensor-to-gateway) as random events. These are modeled through (i) a Markov model that follows cardiovascular physiology and (ii) channel models of reported nanocommunication technologies. We compute the Markov transition probabilities using a cardiovascular simulator built as a low-complexity electric circuit model of the human vessels. For the nanosensor-to-gateway link, we model two well-known schemes: ultrasonic and terahertz channels. Integrating these components within the \ac{AoI} framework, we report information freshness via the average \ac{PAoI} metric. Under realistic physiological and communication assumptions, fresh information appears on the monitor within tens of seconds. The network is therefore suitable for monitoring tissue-level processes such as bacterial infections, while more adequate architectures are needed to monitor cellular-scale processes, which occur on timescales below tens of seconds.

URL PDF HTML ☆

赞 0 踩 0

2605.10739 2026-05-12 eess.IV cs.AI cs.CV

Geospatial-Temporal Sensemaking of Remote Sensing Activity Detections with Multimodal Large Language Model

David F. Ramirez, Tim Overman, Kristen Jaskie, Andreas Spanias

AI总结本文提出了一种基于Sentinel-2卫星影像的多模态视觉问答数据集SMART-HC-VQA，用于分析人类活动的时空演变。该数据集通过将施工标注、类型标签、时间阶段标签等信息转化为自然语言问答对，构建了一个时序扩展的自动目标识别与视觉问答挑战任务。研究还引入了一种多图像大语言模型训练框架，能够处理多时相遥感影像并进行语义推理，为理解语言引导下的遥感活动提供了可复现的基础。

Comments Accepted to 2026 SPIE Defense + Security, Automatic Target Recognition XXXVI

2605.10738 2026-05-12 math.OC cs.MA cs.RO cs.SY eess.SY

Decentralized Contingency MPC based on Safe Sets for Nonlinear Multi-agent Collision Avoidance

Max Studt, Georg Schildbach

AI总结本文研究了在非线性多智能体系统中，如何在不共享轨迹信息的情况下实现去中心化的避障控制。提出了一种基于安全集的应急模型预测控制（MPC）框架，每个智能体仅依赖于自身状态进行局部优化，通过耦合主轨迹与应急保证机制，确保在滚动时域操作中具有可行的避障动作。该方法引入了一种新颖的几何安全集更新机制，保证了递归可行性与收敛性，并在多种密集和稀疏场景中验证了其有效性。

2605.10704 2026-05-12 eess.SP cs.RO

xApp Empowered Resource Management for Non-Terrestrial Users in 5G O-RAN Networks

Mohammed M. H. Qazzaz, Syed Ali Zaidi, Aubida A. Al-Hameed, Abdelaziz Salama, Des Mclernon

AI总结本文提出了一种基于深度强化学习的xApp，用于5G开放无线接入网（O-RAN）中非地面用户设备的资源管理，旨在优化无人机沿预设航线飞行时的切换决策。该方法采用结合迁移学习的双重深度Q网络（DDQN）进行预测性优化，提前预判网络状态，从而降低切换频率和断连概率。实验表明，该框架在保证连接可靠性的同时，显著减少了切换次数，验证了智能学习方法在下一代O-RAN架构中管理无人机移动性的有效性。

2605.10688 2026-05-12 cs.LG eess.SP

DANCE: Detect and Classify Events in EEG

Jarod Lévy, Hubert Banville, Jérémy Rapin, Jean-Remi King, Thomas Moreau, Stéphane d'Ascoli

AI总结本文提出了一种名为DANCE的深度学习方法，用于直接从原始未对齐的脑电（EEG）信号中检测和分类事件，解决了传统方法依赖已知事件起始点的局限性。该方法将神经解码任务建模为集合预测问题，实现了端到端的异步解码。实验表明，DANCE在多种认知、临床和脑机接口任务中均优于现有方法，并在癫痫监测任务中达到了新的性能水平。

Comments 29 pages

2605.10621 2026-05-12 cs.LG cs.SY eess.SY

Hierarchical End-to-End Taylor Bounds for Complete Neural Network Verification

Taha Entesari, Mahyar Fazlyab

AI总结该论文研究了神经网络的可达性分析问题，旨在计算或界定给定输入域下网络输出的可能范围，以验证学习驱动的物理系统的安全性与鲁棒性。现有方法多依赖于二阶信息的可追踪近似，而本文提出了一种新的验证框架HiTaB，通过利用Hessian矩阵及其Lipschitz常数，系统性地引入更高阶的平滑性信息，构建了统一的零阶、一阶和二阶界框架，并提出了高效的层间曲率传播算法来计算深层网络中Hessian Lipschitz常数的上界，从而获得更紧致和可靠的安全性证明。

2605.10571 2026-05-12 eess.IV cs.CV

Set-Based Groupwise Registration for Variable-Length, Variable-Contrast Cardiac MRI

Yi Zhang, Yidong Zhao, Tijmen Toxopeus, Maša Božić-Iven, Sebastian Weingärtner, Qian Tao

AI总结该研究针对可变长度、对比度不同的心脏MRI序列，提出了一种基于集合的群组配准方法\emph{\AnyTwoReg}，以解决传统深度学习方法在跨协议配准中的泛化性不足问题。该方法将MRI序列视为无序集合，解耦了网络设计与序列长度和输入顺序的依赖关系，并通过共享编码器和相关性引导的特征聚合构建了排列不变的参考基准，实现了从图像到形变场的排列等变映射。实验表明，该方法在未见过的定量MRI数据集上表现出良好的零样本泛化能力，并有效提升了后续定量映射的质量。

Comments MICCAI 2026. Submitted Version

详情

英文摘要

Quantitative cardiac magnetic resonance imaging (MRI) enables non-invasive myocardial tissue characterization but relies on robust motion correction within these variable-length, variable-contrast image sequences. Groupwise registration, which simultaneously aligns all images, has shown greater robustness than pairwise registration for motion correction. However, current deep-learning-based groupwise registration methods cannot generalize across MRI sequences: the architecture typically encodes input data as a fixed-length channel stack, which rigidly couples network design to protocol-specific sequence length, input ordering, and contrast dynamics. At inference time, any change in imaging protocols will render the network unusable. In this work, we introduce \emph{\AnyTwoReg}, a new set-based groupwise registration framework that takes a quantitative MRI sequence as an unordered set. This set formulation fundamentally decouples network design from sequence length and input ordering. By utilizing a shared encoder and correlation-guided feature aggregation, \emph{\AnyTwoReg} constructs a permutation-invariant canonical reference for registration, and learns a permutation-equivariant mapping from images to deformation fields. Additionally, we extract contrast-insensitive image features from an existing foundation model to handle extreme contrast variations. Trained exclusively on a single public $T_1$ mapping dataset (STONE, sequence length $L=11$), \AnyTwoReg generalizes to two unseen quantitative MRI datasets (MOLLI, ASL) with variable lengths ($L \in [11, 60]$) and different contrast dynamics. It achieves strong cross-protocol generalization in a zero-shot manner, and consistently improves downstream quantitative mapping quality. Notably, while designed for quantitative MRI sequences, our framework is directly applicable to Cine MRI sequences for inter-cardiac-phase registration.

URL PDF HTML ☆

赞 0 踩 0

2605.10565 2026-05-12 eess.SP

Exponential Noise Robustness of Type-Based Multiple Access for Over-the-Air Computation

Marc Martinez-Gost, Ana Pérez-Neira, Miguel Ángel Lagunas

AI总结本文研究了基于类型划分的多址接入（TBMA）在无先验分布知识的非参数估计环境下，用于空中介质计算（AirComp）时的鲁棒性。与传统依赖幅度调制且易受噪声影响的AirComp方法不同，TBMA通过结构化调制格式提升了性能，其信号叠加在接收端诱导出离散的晶格结构，利用最近晶格点投影可有效抑制噪声。该方法实现了均方误差（MSE）随信噪比指数级衰减，相较传统方法具有根本性的鲁棒性优势。

Comments Submitted to GLOBECOM 2026

2605.10558 2026-05-12 cs.MA cs.SY eess.SY

Effect of Graph Gluing on Consensus in Networked Multi-Agent Systems

Rohollah Moghadam, Santosh Kandel

AI总结本文研究了多智能体系统网络中图连接操作对一致性性能的影响。通过分析桥接连接和接口连接两种方式，探讨了子系统间通信链路的数量和结构如何影响组合图的Fiedler特征值，从而影响系统的一致性收敛速度。研究建立了互联策略、代数连通性与系统性能之间的明确关系，并通过仿真实验验证了理论分析的有效性。

2605.10520 2026-05-12 eess.SY cs.SY

Equivariant Observer Design on SL(3) for Image Intensity-Based Homography Estimation

Tarek Bouazza, Pieter van Goor, Robert Mahony, Tarek Hamel

AI总结本文研究了基于图像强度信息的单应性估计问题，提出了一种在特殊线性群 $\mathbf{SL}(3)$ 上设计的等变观测器，避免了传统方法对特征提取和匹配的依赖。该方法通过直接最小化图像像素强度定义的成本函数进行图像配准，分析了成本函数非退化条件，并引入二阶观测器以提升收敛性能。实验结果验证了所提方法在真实图像中的有效性。

Comments 16 pages, 4 figures, preprint submitted to Automatica

2605.10490 2026-05-12 eess.SY cs.SY

Glycemic Safety Tube: A Provably Safe Control Framework for Artificial Pancreas Systems under Parametric Uncertainty

Pukhrambam Akash Singh, Ratnangshu Das, Ahan Basu, Pushpak Jagtap

AI总结该研究提出了一种名为Glycemic Safety Tube Control（GSTC）的控制框架，用于在参数不确定性下实现人工胰腺系统的安全血糖调节。该方法无需依赖精确的患者特异性模型，通过设计保证血糖水平始终处于临床安全范围内，并在存在饮食扰动和估计误差的情况下确保输入约束满足。实验表明，GSTC在不同饮食模式和患者条件下均能保持安全性能，展现出良好的鲁棒性和计算效率，为新一代人工胰腺系统提供了一种安全、高效且无需依赖患者个体差异的控制方案。

2605.10489 2026-05-12 eess.SY cs.SY

Observing the state of networks with directed higher-order interactions

Roberto Rizzello, Davide Salzano, Stefano Boccaletti, Pietro De Lellis

AI总结本文研究了在存在有向高阶相互作用的情况下，如何重构非线性动态系统网络的状态。作者基于分析收敛性结果，提出了一种算法观测器设计方法，能够同时选择被测节点和观测器增益。通过大量数值实验验证了所设计观测器的性能与鲁棒性，并将其应用于群体智能体意见的完整重构。

2605.10433 2026-05-12 cs.IT eess.SP math.IT

Syndrome Adaptive Gain Control for Min-Sum Decoding of Quantum LDPC Codes

Hernan Cordova, Alexios Balatsoukas-Stimming, Yunus Can Gültekin, Gabriele Liga, Alex Alvarado

AI总结本文提出了一种适用于量子低密度奇偶校验（QLDPC）码的综合征自适应增益最小和（SAGMS）解码算法，旨在解决传统最小和（MS）解码中因消息幅值高估而导致的性能下降问题。该方法通过在解码过程中根据未满足稳态器的比例动态调整消息增益，无需针对具体编码或噪声水平进行优化。仿真结果表明，SAGMS在保持MS复杂度的同时，性能接近甚至优于离线优化的MS解码，并在某些情况下超越了信念传播（BP）解码。

Comments 6 pages, 3 figures

2605.10431 2026-05-12 eess.SY cs.SY math.OC

Hierarchical 2-degree-of-freedom control combining Youla-Kucera parameterization and model predictive control

Zhiheng Zhao, Hans Henrik Niemann, John Bagterp Jørgensen

AI总结本文提出了一种结合Youla-Kucera参数化与模型预测控制的分层二自由度控制结构。该方法通过系统的互质因子分解引入辅助前馈通道和控制器参数化通道，前者用于实现级联模型预测控制以优化系统性能，后者通过H2最优控制器设计实现无偏模型预测控制。该方法有效融合了参数化控制与预测控制的优势，提升了系统控制精度与鲁棒性。

Comments 7 pages, 4 figures, accepted for Europan Control Conference 2026 (ECC 2026)

2605.10411 2026-05-12 astro-ph.IM cs.SY eess.SY

High-speed single-photoelectron detection for Cherenkov astronomy

Luca Giangrande, Matthieu Heller, Teresa Montaruli

AI总结该研究提出了一种用于切连科夫天文观测的高速单光电子探测系统，解决了在低噪声、可扩展探测器中实现纳秒级时间分辨和单光电子分辨率的难题。研究设计了一种定制的六边形硅光电倍增管传感器与专用前端集成电路（ASIC）协同工作的系统，具备光学滤波、四像素分割和高精度读出功能，能够在低功耗下实现高时间分辨率和良好线性响应。该系统展示了对单光电子信号的清晰分辨能力，为大视场切连科夫望远镜的成像提供了高效、可扩展的解决方案。

2605.10398 2026-05-12 eess.AS

SF-Flow: Sound field magnitude estimation via flow matching guided by sparse measurements

Ege Erdem, Shoichi Koyama, Tomohiko Nakamura, Orchisama Das, Zoran Cvetković

AI总结本文提出了一种名为SF-Flow的新方法，用于从稀疏麦克风测量中重建三维声场的幅度。该方法基于流匹配（Flow Matching）生成模型，并结合一个具有排列不变性集编码器的3D U-Net网络，实现了对任意数量稀疏输入的稳定高效重建。实验表明，SF-Flow在1 kHz频率范围内具有较高的重建精度，训练速度远超传统自编码器，并且随着数据集规模的增大性能显著提升。

2605.10352 2026-05-12 eess.SP

Quantifying System Level KPI Deviations of Sionna RT: Material and Near-Field Error Analysis Using a 5G OAI Testbed

Faizan Rauf, Srijita Sanyal, Markus Heinrichs, Aydin Sezgin

AI总结本文研究了Sionna RT在5G系统级关键性能指标（KPI）上的偏差，通过OpenAirInterface（OAI）5G NR测试平台，将RT模拟信道与矢量网络分析仪（VNA）实测信道进行对比，揭示了天线近场过渡效应和材料属性不匹配是导致系统级KPI误差的主要因素，并提供了基于数字孪生的5G及未来网络规划的量化基准。

2605.10351 2026-05-12 cs.LG eess.SP

Foundations of Reliable Inference: Reliability-Efficiency Co-Design

Jiayi Huang

AI总结本研究探讨了如何在保证人工智能模型不确定性估计可信度的同时提高推理效率的问题。作者提出了一种统一的框架，从两个角度出发，旨在实现可靠性与计算效率的协同设计。该工作为构建高效且可信的AI推理系统提供了理论基础和方法支持。

Comments PhD Thesis

2605.10350 2026-05-12 eess.SP

Signal-Dependent Shot Noise Modeling of Rydberg Atomic Quantum Receivers: A Design Perspective

Qihao Peng, Qu Luo, Tierui Gong, Neng Ye, Jizhou Wu, Cunhua Pan, Maged Elkashlan, Pei Xiao, Chau Yuen, George K. Karagiannidis, Jiangzhou Wang

AI总结本文提出了一种面向通信的复数基带等效模型，用于超外差 Rydberg 原子量子接收机（RAQR），该模型准确描述了光检测引起的信号依赖性散粒噪声及其与光学工作点的耦合关系。通过构建直接非相干和平衡相干光检测下的复数基带表示，研究揭示了光学工作点对有效接收增益和等效噪声背景的联合影响，确立了由系统设计决定的增益-噪声权衡关系。进一步分析表明，忽略信号依赖性散粒噪声会导致工作点设计不准确，并推导了考虑该噪声的多输入多输出（MIMO）系统可实现的速率下界，验证了 RAQ-MIMO 在特定噪声条件下的性能优势。

2605.10340 2026-05-12 eess.IV cs.CE cs.ET

Learning to Focus Synthetic Aperture Radar On-line with State-Space Models

Sebastian Fieldhouse, Roberto Del Prete, Gabriele Daga, Nathaniel Rensly, Gabriele Meoni, Kea-Tiong Tang

AI总结本文提出了一种在线合成孔径雷达（SAR）处理器（OSP），通过将SAR成像视为数据流进行实时处理，解决了传统SAR聚焦方法延迟高、难以支持闭环认知系统的问题。OSP采用小型状态空间模型，并通过教师-学生蒸馏和多阶段损失进行训练，实现了高效低延迟的图像生成。实验表明，相比传统数字信号处理方法，OSP在单个CPU核心上处理速度提升了70倍，内存占用降低130倍，且保持了足够的成像质量以支持下游任务，如船舶检测和洪水映射。

2605.10337 2026-05-12 cs.AI eess.SP

CORTEG: Foundation Models Enable Cross-Modality Representation Transfer from Scalp to Intracranial Brain Recordings

Liuyin Yang, Qiang Sun, Bob Van Dyck, Eva Calvo Merino, Marc M. Van Hulle

AI总结该研究提出CORTEG框架，旨在将基于头皮EEG的预训练基础模型迁移至颅内ECoG信号，以提升脑机接口的解码性能。CORTEG结合了电极感知的空间适配器、双流分词器和留一被试法微调策略，实现了跨被试学习和快速个性化校准。实验表明，CORTEG在多个任务中达到或超越了专门方法的性能，尤其在数据量有限的情况下表现突出，为高效、可扩展的颅内脑机接口提供了新思路。

2605.10264 2026-05-12 cs.IT cs.SY eess.SY math.IT

Low-Cost GNSS Anti-Jamming Through 2-Bit Phase Shift Beamforming with Machine Learning

Burak Soner, Ekin Uzun, Can Aksoy

AI总结本文研究了一种低成本的GNSS抗干扰方法，通过使用仅具有2比特相位移的波束成形技术，将每个复数阵列权重限制在四个QPSK相位状态中。为了解决由此带来的波束图解空间受限问题，作者提出了一个离散优化框架，并引入机器学习方法以实现低延迟的高性能波束成形。实验表明，该方法在中等和强干扰环境下显著提升了GNSS接收机的信号质量，验证了2比特相位移波束成形在抗干扰方面的有效性。

Comments Accepted for presentation at RAST 2026. Author accepted version. Final version to appear in IEEE Xplore

2605.10213 2026-05-12 eess.SP

Unsupervised Online Channel Estimation for High-Mobility OFDM via Implicit Neural Representation

Bohao Shi, Tianfu Qi, Xiaonan Chen, Jun Wang

AI总结本文研究了高速移动场景下正交频分复用（OFDM）系统的无监督在线信道估计问题，针对多普勒效应引起的严重载波间干扰（ICI），提出了一种基于隐式神经表示（INR）的框架。该方法将时变频率选择性信道建模为连续的时频函数，通过正弦表示网络（SIREN）捕获细粒度信道变化，无需离线预训练或标注数据。实验表明，该方法在真实车联网（V2X）环境中实现了接近最优的链路可靠性，并在分布外鲁棒性方面优于监督学习基线，为物理层提供了一种适应性强、数据高效的解决方案。

2605.10203 2026-05-12 cs.SD eess.AS

Polyphonia: Zero-Shot Timbre Transfer in Polyphonic Music with Acoustic-Informed Attention Calibration

Haowen Li, Tianxiang Li, Yi Yang, Boyu Cao, Qi Liu

AI总结该研究提出了一种名为Polyphonia的零样本音色迁移框架，旨在解决多声部音乐中对特定音轨进行音色编辑时背景伴奏易被破坏的问题。其核心方法是引入基于声学信息的注意力校准机制，通过概率声学先验建立粗略边界，从而在保持非目标音轨语义完整性的同时，更精确地定位并修改目标音轨。实验表明，该方法在目标音轨对齐度上比现有方法提升了15.5%，同时保持了较高的音乐保真度和非目标音轨的完整性。

Comments Accepted by ICML 2026

2605.10199 2026-05-12 cs.CL eess.AS

How Should LLMs Listen While Speaking? A Study of User-Stream Routing in Full-Duplex Spoken Dialogue

Hui Lu, Xueyuan Chen, Huimeng Wang, Shuhai Peng, Shiyin Kang, Xixin Wu, Zhiyong Wu

AI总结本文研究了在全双工语音对话中，大语言模型（LLM）如何在生成自身语音响应的同时持续监听用户输入的问题。作者提出用户流在LLM中的路由方式是影响系统性能的关键架构问题，并设计了两种路由策略进行对比：一种是直接将用户流注入模型输入，另一种是通过交叉注意力机制访问外部记忆。实验表明，直接注入方式在语义理解和问答任务中表现更优，但在用户打断等场景下容易导致上下文混乱；而交叉注意力路由虽然问答性能稍逊，但能更好地保持生成上下文的稳定性，更具鲁棒性。研究为全双工语音对话系统的设计提供了重要的指导。

2605.10192 2026-05-12 eess.SP

LO-Free Receiver: Next-Gen Low-Power Joint Communication and Sensing

Hasan Atalay Gunel, Mohaned Chraiti, Ali Gorcin

AI总结本文提出了一种无需本地振荡器（LO）的接收机架构，用于实现低功耗的联合通信与感知（JCAS）。该方法通过天线间的相对空间相位进行信息嵌入与恢复，利用天线域相关性构建与到达方向（DoA）相关的基带可观测量，从而实现通信与感知的自然解耦。研究构建了完整的流形域信号模型与接收机架构，并分析了其在相位噪声下的误码率与感知精度，适用于大规模物联网场景，尤其在毫米波及以上频段具有显著优势。

Comments The paper was accepted by VTC