arXivDaily arXiv每日学术速递 周一至周五更新
重置
EESS电气与系统 100
2605.27361 2026-05-27 cs.AI cs.SY eess.SY

Natural Language Query to Configuration for Retrieval Agents

面向检索代理的自然语言查询到配置

Melissa Z. Pan, Negar Arabzadeh, Mathew Jacob, Fiodar Kazhamiaka, Esha Choukse, Matei Zaharia

AI总结 提出BRANE方法,利用LLM将查询转换为工作负载特征,并训练轻量级预测器选择最优配置,在多个基准上实现成本-质量帕累托前沿的优化。

详情
AI中文摘要

现代检索代理暴露了许多配置选择——LLM、检索器、文档数量、跳数和合成策略——每个都影响答案质量和服务成本。目前,这些流水线通常针对每个工作负载手动调整一次,留下了大量每查询优化的空间。我们形式化了这个问题:给定一个自然语言查询以及一个准确性或预算目标,从预定义的流水线目录中选择在推理时最小化成本或最大化准确性的配置。我们提出了**BRANE**,它使用LLM将每个查询转换为工作负载特定的特征,然后训练一个轻量级的每配置预测器,估计流水线是否能正确回答查询。在推理时,**BRANE**选择最大化预测正确性(经成本惩罚)的配置,无需重新训练即可暴露可调的成本-质量权衡。在MuSiQue、BrowseComp-Plus和FinanceBench上,**BRANE**持续推动成本-质量帕累托前沿,以高达89%的成本降低匹配最佳固定配置的准确性,并优于LLM路由、基于规则和微调的Qwen3-4B基线。这些结果表明,对整个检索流水线进行每查询配置是静态工作负载级调优的实用替代方案。

英文摘要

Modern retrieval agents expose many configuration choices -- LLM, retriever, number of documents, number of hops, and synthesis strategy -- each shaping both answer quality and serving cost. Today, these pipelines are typically hand-tuned once per workload, leaving substantial per-query optimization untapped. We formulate the problem: given a natural-language query and either an accuracy or a budget target, select from a predefined pipeline catalog the configuration that minimizes cost or maximizes accuracy at inference time. We propose **BRANE**, which uses an LLM to convert each query into workload-specific characteristics, then trains a lightweight per-configuration predictor that estimates whether the pipeline will answer the query correctly. At inference time, **BRANE** selects the configuration that maximizes predicted correctness penalized by cost, exposing a tunable cost-quality tradeoff without retraining. Across MuSiQue, BrowseComp-Plus, and FinanceBench, **BRANE** consistently pushes the cost-quality Pareto frontier, matches the best fixed configuration's accuracy at up to 89% lower cost, and outperforms LLM-routing, rule-based, and fine-tuned Qwen3-4B baselines. These results show that per-query configuration of the full retrieval pipeline is a practical alternative to static workload-level tuning.

2605.27314 2026-05-27 cs.RO cs.SY eess.SY

Riding the Shifting Potential: When Reactive Control Suffices for Multi-Goal Behavior

驾驭变化势场:何时反应控制足以实现多目标行为

Vito Mengers, Oliver Brock

AI总结 本文提出通过零空间投影扩展图基世界模型中的交互结构,动态调整优先级以解决多目标冲突,在非凸障碍导航和非凸物体推拉任务中实现100%成功率,无需演示或重新训练。

详情
AI中文摘要

反应控制通常被认为不足以处理多目标任务,因为冲突目标会导致局部极小值。我们认为这一限制并非固有,而是源于无法反映目标当前交互方式的静态编码。我们利用图基世界模型中编码的交互结构,通过零空间投影对其进行扩展:冲突在产生处通过将低优先级梯度投影到高优先级梯度的零空间来解决,优先级根据当前状态连续确定。我们在两个目标冲突为核心问题的领域中进行了演示:非凸障碍导航(静态势场在此根本失败)和非凸物体推拉(我们的方法在一百个配置中达到100%成功率,而最速下降基线为0%,扩散策略约为55%,无需演示或重新训练)。相同的公式直接迁移到具有额外感知和运动学约束的真实机器人上,通过相同机制适应这些约束。

英文摘要

Reactive control is often considered insufficient for multi-objective tasks because conflicting objectives give rise to local minima. We argue this limitation is not inherent but arises from static encodings that fail to reflect how objectives currently interact. We exploit the interaction structure encoded in a graph-based world model by extending it with nullspace projections: conflicts are resolved where they arise by projecting lower-priority gradients into the nullspace of higher-priority ones, with priorities determined continuously from the current state. We demonstrate this in two domains where conflicts between objectives are central: navigation around non-convex obstacles, where static potential fields fundamentally fail, and planar pushing of non-convex objects, where our method achieves $100\%$ success across one-hundred configurations versus $0\%$ for the steepest-descent baseline and ${\sim}55\%$ for diffusion policy, without demonstrations or retraining. The same formulation transfers directly to a real robot with additional perceptual and kinematic constraints, accommodating them through the same mechanism.

2605.27299 2026-05-27 cs.CR cs.AI cs.HC cs.LG cs.SY eess.SY

Risk Averse Alert Prioritization for IDS Using Subnormal Gaussian Fuzzy Models

使用次正态高斯模糊模型的IDS风险规避警报优先级排序

Murat Moran

AI总结 提出基于次正态高斯模糊数的警报优先级排序框架,通过建模威胁严重性、检测置信度和组织风险态度三种不确定性,利用排序指数实现可调安全姿态,实验证明在检测器退化下比基线方法更鲁棒。

详情
AI中文摘要

现代入侵检测系统每天生成数千条警报,但由于误报或低影响事件过多,警报疲劳严重限制了安全运营的有效性。我们通过提出一个基于次正态高斯模糊数的原则性警报优先级排序框架来解决这个问题,该框架明确建模了三种不确定性来源:威胁严重性、检测置信度和组织风险态度。每个警报被表示为一个模糊数,其核心表示严重性,展度表示不确定性,高度反映检测可靠性。我们应用排序指数对警报进行优先级排序,允许组织通过风险态度参数调整安全姿态。在CIC-IDS2017和NSL-KDD上的实验验证表明,在检测器退化下,该方法比基线方法具有更强的鲁棒性(NDCGrel@100为0.9963对比0.8215),在中等置信度警报中具有明显区分度,在稳健检测器下与基线方法接近。该框架具有理论基础、计算效率高、提供可解释推理,并且在检测器系列和校准错误场景下保持鲁棒性。

英文摘要

Modern intrusion detection systems generate thousands of alerts daily, but alert fatigue severely limits security operations effectiveness due to too many false positives or low-impact events. We address this by proposing a principled framework for alert prioritization based on subnormal Gaussian fuzzy numbers, explicitly modeling three sources of uncertainty: threat severity, detection confidence, and organizational risk attitude. Each alert is represented as a fuzzy number with the core indicating severity, spread indicating uncertainty, and height reflecting detection reliability. We apply ranking indices to prioritize alerts, allowing organizations to tune security posture through a risk-attitude parameter. Experimental validation on CIC-IDS2017 and NSL-KDD demonstrates greater robustness than baselines under detector degradation (0.9963 vs 0.8215 NDCGrel@100), with distinct differentiation in mid-confidence alerts and near-parity with baselines under robust detectors. The framework is theoretically grounded, computationally efficient, provides interpretable reasoning, and remains robust across detector families and miscalibration scenarios.

2605.27205 2026-05-27 eess.IV cs.AI

TWIST: Closed-Loop token Synchronization for Application-Aware Wireless Digital Twins

TWIST:面向应用感知无线数字孪生的闭环令牌同步

Sige Liu, Kezhi Wang

AI总结 提出TWIST框架,通过闭环令牌同步和模式条件不等错误保护,在有限通信资源下实现应用感知的无线数字孪生状态同步,提升交通状态推断性能并降低同步成本。

详情
AI中文摘要

无线数字孪生需要在有限且时变的通信资源下,对随时间演变的物理场景及其数字副本进行重复同步。对于以感知为中心的数字孪生,像素域传输或均匀保护的比特流可能与孪生侧应用消耗的语义状态不匹配。本文提出TWIST,一种面向应用感知无线数字孪生的闭环令牌同步框架。TWIST将每个物理观测表示为一个令牌,并通过无线链路同步该状态,而非优化视觉重建。令牌位置按任务相关性分组,并通过低、中、高同步模式下的模式条件不等错误保护进行保护。在孪生侧,解码置信度将不可靠的硬令牌决策转换为擦除,在更新语义孪生状态之前由补全模型恢复。恢复后的状态支持交通状态推断,并生成紧凑的反馈统计信息,包括信道质量、接收器不确定性、语义漂移和应用优先级,用于后续模式自适应。在动态道路场景数字孪生场景上的实验表明,与固定模式和仅信道自适应策略相比,TWIST改善了交通状态推断和语义孪生状态同步,同时相对于始终高传输降低了平均同步成本。

英文摘要

Wireless digital twins require repeated synchronization between a time-evolving physical scene and its digital counterpart under limited and time-varying communication resources. For perception-centric twins, pixel-domain transmission or uniformly protected bitstreams can be mismatched to the semantic state consumed by twin-side applications. This paper proposes TWIST, a closed-loop token synchronization framework for application-aware wireless digital twins. TWIST represents each physical observation as a token and synchronizes this state over a wireless link, rather than optimizing visual reconstruction. Token positions are grouped by task relevance and protected through mode-conditioned unequal error protection under low-, medium-, and high-synchronization modes. At the twin side, decoding confidence converts unreliable hard token decisions into erasures, which are restored by a completion model before updating the semantic twin state. The recovered state supports traffic-state inference and generates compact feedback statistics, including channel quality, receiver uncertainty, semantic drift, and application priority, for subsequent mode adaptation. Experiments on a dynamic road-scene digital-twin scenario show that TWIST improves traffic-state inference and semantic twin-state synchronization compared with fixed-mode and channel-only adaptation strategies, while reducing the average synchronization cost relative to always-high transmission.

2605.27191 2026-05-27 quant-ph eess.SP math.OC

Statistical and Algorithmic Foundations of Probing Quantum Systems with Compressive Measurements: A Review

压缩测量探测量子系统的统计与算法基础:综述

Zhen Qin, Michael B. Wakin, Zhihui Zhu

AI总结 本文综述了结构化量子态层析成像的统一视角,涵盖紧凑态表示、测量设计和计算算法,并探讨了样本复杂度、测量效率和可扩展恢复的理论基础。

详情
AI中文摘要

量子态层析成像(QST)是量子信息科学中的一项基本任务,旨在从测量数据中重建未知量子态。然而,希尔伯特空间维度随系统大小呈指数增长,使得一般量子态的全层析成像在统计和计算上变得不可行。这一挑战推动了结构化量子态层析成像的广泛研究,其中先验结构(如低秩性、张量网络表示、浅量子电路和神经量子态)可以显著减少有效自由度,并实现可扩展恢复。在本综述中,我们通过三个密切相关的主题为结构化量子态的QST提供了统一视角:紧凑态表示、测量设计和计算算法。在回顾结构化量子态的常见模型后,我们调查了测量框架的几何保持性质方面的现有工作,范围从信息完备的POVM到随机测量,及其对样本复杂度的影响。在算法方面,我们回顾了从经验测量中重建结构化量子态的优化方法。通过将QST与压缩感知、矩阵感知和结构化逆问题的更广泛原理联系起来,本综述强调了样本复杂度、测量效率和可扩展恢复背后的共同理论基础。

英文摘要

Quantum state tomography (QST) is a fundamental task in quantum information science that aims to reconstruct unknown quantum states from measurement data. However, the exponential growth of Hilbert-space dimension with system size makes full tomography of general quantum states statistically and computationally prohibitive. This challenge has motivated extensive research on structured quantum state tomography, where prior structure, such as low-rankness, tensor-network representations, shallow quantum circuits, and neural quantum states, can substantially reduce the effective degrees of freedom and enable scalable recovery. In this review, we provide a unified perspective on QST for structured quantum states through three closely related themes: compact state representations, measurement design, and computational algorithms. After reviewing common models for structured quantum states, we survey existing work on geometric preservation properties of measurement frameworks, ranging from informationally complete POVMs to randomized measurements, and their implications for sample complexity. On the algorithmic side, we review optimization methods for reconstructing structured quantum states from empirical measurements. By connecting QST with broader principles from compressive sensing, matrix sensing, and structured inverse problems, this survey highlights common theoretical foundations underlying sample complexity, measurement efficiency, and scalable recovery.

2605.27189 2026-05-27 cs.CL cs.LG cs.SD eess.AS q-bio.NC

Beyond Binary: Speech Representations Across the Cognitive Score Hierarchy

超越二元:认知评分层级中的语音表征

Serli Kopar, Roshan Prakash Rane, Christian Mychajliw, Lydia Federmann, Gerhard Eschweiler, Daniela Berg, Sam Gijsen, Paula Andrea Perez-Toro, Kerstin Ritter

AI总结 本研究利用5,754份德语神经心理学评估录音,比较手工声学特征与自监督学习嵌入在轻度认知障碍认知评估层级(任务、领域、全局)中的表现,发现任务约束与评估层级之间的关联。

详情
AI中文摘要

本研究考察了轻度认知障碍中语音表征与认知评估层级结构之间的关系。利用5,754份德语神经心理学评估录音,我们在三个评分层级(任务、领域和全局)上评估了六项认知任务。我们比较了手工声学特征与自监督学习(SSL)嵌入。结果表明,尽管SSL表示在较低层级通常优于手工特征,但这种趋势在MCI分类中发生逆转。此外,任务特定约束影响性能:响应自由度较大的任务随着层级增加表现出性能稀释,表明“专家”表示,而高度结构化任务的性能向更高层级增加,表明“通才”表示。这些发现揭示了自动临床语音分析中任务约束与评估层级之间的联系。

英文摘要

This study examines the relationship between speech representations and the hierarchical structure of cognitive assessment in mild cognitive impairment. Utilizing 5,754 German neuropsychological assessment recordings, we evaluate six cognitive tasks across three score levels: task, domain, and global levels. We compare hand-crafted acoustic features with self-supervised learning (SSL) embeddings. Results show that although SSL representations generally outperform hand-crafted features at lower levels, this trend reverses for MCI classification. Furthermore, task-specific constraints influence performance: tasks with greater response freedom exhibit performance dilution as hierarchical levels increase, suggesting ``specialist'' representations, whereas the performance of highly structured tasks increases toward higher levels, suggesting ``generalist'' representations. These findings show links between task constraints and assessment hierarchy in automated clinical speech analysis.

2605.27143 2026-05-27 eess.SY cs.SY

Container Unloading via Reinforcement Learning: Picking Order, Deadlock Avoidance, and Proof-of-Concept Simulation

基于强化学习的集装箱卸载:拣选顺序、死锁避免及概念验证仿真

Jan Rüdiger, Max Schenke, Daniel Weber

AI总结 针对快递包裹行业集装箱卸载的自动化需求,提出一种基于掩码深度Q学习与特殊神经网络架构的策略,在仿真环境中实现平均60%的成功率,显著优于随机策略的20%。

详情
AI中文摘要

在快递、快运和包裹行业中,卸载集装箱是一项体力要求高且劳动密集型的工作。自动化这一过程是提高包裹处理系统效率的重要一步。本研究探讨了强化学习在集装箱卸载场景中学习物品选择策略的潜力。为此,创建了一个仿真环境,并实现了一种具有特殊设计的神经网络架构的掩码深度Q学习。结果表明,智能体能够以平均60%的成功率学习选择物品,这显著优于随机策略的20%随机概率。研究结果表明,强化学习可能是未来自动化物品卸载任务的一种有前景的方法。

英文摘要

Unloading containers in the courier, express and parcel industry is a physically demanding and labor-intensive work. Automatizing this process is an important step towards increasing the efficiency of parcel-handling systems. This work investigates the potential of reinforcement learning to learn a policy for item selection in container unloading scenarios. For that, a simulation environment is created and a masked deep Q-learning with a specially designed neural network architecture is implemented. The results indicate that the agent can learn to select items with an average success rate of 60 %, which is significantly better than a random policy at a random chance of 20 %. The findings suggest that RL could be a promising approach for automatizing item unloading tasks in the future.

2605.27139 2026-05-27 eess.IV cs.CV physics.ins-det

Unsupervised Deep Image Prior for Sparse-View and Limited-Angle Electron Tomography

无监督深度图像先验用于稀疏视角和有限角度电子断层扫描

Serge Brosset, Daniel del Pozo Bueno, Thomas David, Laure Guetaz, Philippe Ciuciu, Zineb Saghi

AI总结 提出无监督深度图像先验方法,在稀疏视角和有限角度条件下实现与监督方法相当的电子断层重建性能,并应用于实验数据验证其可靠性。

详情
Comments
22 pages, 12 figures
AI中文摘要

电子断层扫描(ET)在纳米材料的三维(3D)表征中发挥着重要作用。然而,在有限角度和稀疏视角条件下,传统算法会产生退化的重建结果,影响所得3D数据的质量和可解释性。本文提出深度图像先验(DIP),一种无监督的深度学习(DL)方法,用于高度退化的断层扫描采集,并通过模拟数据证明,即使在倾斜范围仅为60°、倾斜步长为10°的情况下,其性能也与需要训练数据集的监督方法相当。然后,我们将其应用于实验数据,并表明它在稀疏视角和有限角度条件下都能实现可靠的3D量化,突显了其在广泛材料和采集模式中的潜力。

英文摘要

Electron tomography (ET) plays an important role in the three-dimensional (3D) characterization of nanomaterials. However, under limited-angle and sparse-view conditions, conventional algorithms produce degraded reconstructions, which compromise the quality and interpretability of resulting 3D data. In this paper, we present deep image prior (DIP), an unsupervised deep learning (DL) approach, for highly degraded tomography acquisitions and demonstrate, using simulated data, that its performance is comparable to that of supervised approaches requiring training datasets, even for tilt ranges as limited as 60° and tilt increments of 10°. We then apply it to experimental data and show that it enables reliable 3D quantification under both sparse-view and limited-angle conditions, highlighting its potential for a wide range of materials and acquisition modalities.

2605.27094 2026-05-27 eess.SY cs.SY

En-route Charging Coordination for Electric Trucks

电动卡车途中充电协调

Joas Kahlert, Ruiting Wang, Jonas Mårtensson

AI总结 针对长途货运电动化中充电基础设施容量有限和拥堵问题,提出一种基于混合整数规划的协调充电调度框架,可降低充电、运营、电池退化和拥堵延迟成本,相比非协调调度节省高达36%的成本。

详情
AI中文摘要

长途货运的电气化带来了若干新挑战,例如途中充电基础设施容量有限和拥堵。为减少高峰时段的等待时间,本文提出了一种协调充电调度框架。该方法采用混合整数规划,优化充电、运营、电池退化和拥堵延迟相关的充电成本,并考虑一系列场景。结果表明,与非协调调度相比,协调调度可节省高达36%的成本,特别是通过减少电池退化和延迟成本。

英文摘要

The electrification of long-haul freight transport introduces several new challenges, such as the limited capacity and congestion at en-route charging infrastructure. To reduce waiting times during peak periods, this paper proposes a framework for coordinated charging scheduling. The approach employs a mixed-integer formulation to optimize charging-related costs across charging, operation, battery degradation, and congestion delay, considering a range of scenarios. The results demonstrate that coordinated scheduling yields substantial cost savings up to 36% compared to uncoordinated scheduling, particularly by reducing battery degradation and delay costs.

2605.27039 2026-05-27 eess.AS cs.SD

Why Can't They Remember? Uncovering Representation and Retrieval Bottlenecks in Multi-Turn Acoustic Memory

为什么它们记不住?揭示多轮声学记忆中的表征和检索瓶颈

Yang Xiao, Siyi Wang, Han Yin, Hong Jia, Vidhyasaharan Sethu, Eun-Jung Holden, Ting Dang

AI总结 本文通过引入EnvMem基准,发现大型音频语言模型在多轮交互中非语音信息记忆失败的主要原因是表征轨迹漂移,而非注意力分配不足。

详情
AI中文摘要

大型音频语言模型(LALMs)处理语音和环境声学线索,但在多轮交互中难以保留非语音信息。语义(语音)和声学(非语音)理解之间的性能差距仍未被充分理解,其表征和检索的底层机制尚不清楚。本文引入EnvMem,一个受控的多轮基准,用于研究这一差距并识别表征(即潜在嵌入)和检索层面(即注意力分配)失败的根源。我们进一步进行事后干预以探究表征结构和注意力动态。我们的结果揭示表征轨迹漂移是关键失败模式,同时表明注意力分配在解释观察到的退化中作用有限。总体而言,我们提供了一个系统框架,用于分析和改进长上下文LALMs中的非语言记忆,为未来鲁棒声学记忆建模的数据和训练设计提供启示。

英文摘要

Large audio language models (LALMs) process both speech and environmental acoustic cues, yet struggle to retain non-speech information across multi-turn interactions. The performance gap between semantic (speech) and acoustic (non-speech) understanding remains poorly understood, and the underlying mechanisms of representation and retrieval are still unclear. This work introduces EnvMem, a controlled multi-turn benchmark designed to study this gap and identify the root causes of failures at the representation (i.e., latent embeddings) and retrieval levels (i.e., attention allocation). We further conduct post-hoc interventions to probe representational structure and attention dynamics. Our results reveal representational trajectory drift as the key failure mode, while showing that attention allocation plays a limited role in explaining the observed degradation. Overall, we provide a systematic framework for analyzing and improving non-linguistic memory in long-context LALMs, shedding light on future data and training design for robust acoustic memory modeling.

2605.27021 2026-05-27 eess.SY cs.SY

In-Orbit Intelligence or Ground Offloading? Inference Freshness under Intermittent Satellite Connectivity

在轨智能还是地面卸载?间歇性卫星连接下的推理新鲜度

Ayse Nur Pehlivanoglu, Aimin Li, Elif Uysal

AI总结 针对低轨卫星间歇性连接下的推理新鲜度优化问题,提出一种混合策略,通过半马尔可夫决策过程建模并求解,在减少卫星计算和通信资源需求的同时降低平均推理年龄。

详情
AI中文摘要

本文研究在间歇性LEO连接下如何平衡星载和地面计算以优化推理新鲜度。随着连接性随时间变化,系统在星载计算、缓存语义传输、原始数据卸载和等待等动作之间切换。我们定义推理年龄(AoInf)作为性能指标,其中年龄仅在成功完成任务有效更新时重置。我们将长期平均AoInf最小化问题建模为有限状态平均成本半马尔可夫决策过程,其状态捕获地面AoInf、轨道接触阶段、缓存占用和缓存年龄。然后,我们将SMDP转化为等效的平均成本MDP,并通过归一化相对值迭代(RVI)计算解。数值结果表明,所得到的混合策略相对于仅星载和仅卸载基线降低了平均AoInf,同时相比前者需要更少的卫星计算资源,相比后者需要更少的通信资源。

英文摘要

This paper studies how to balance onboard and ground computation under intermittent LEO connectivity for optimized inference freshness. As connectivity varies in time, the system switches among the actions of onboard computation, cached semantic transmission, raw-data offloading, and waiting. We define Age of Inference (AoInf) as the performance metric, where the age resets only upon successful task-valid updates. We formulate long-run average AoInf minimization as a finite-state average-cost semi-Markov decision process whose state captures the ground AoInf, orbital contact phase, cache occupancy, and cache age. We then transform the SMDP into an equivalent average-cost MDP and compute the solution via normalized relative value iteration (RVI). Numerical results indicate that the resulting hybrid policy reduces average AoInf relative to onboard-only and offload-only baselines, while requiring less computational resources on the satellite than the former, and fewer communication resources than the latter.

2605.27017 2026-05-27 eess.SY cs.SY

Graph-Based Modeling, Control, and Optimization for Multi-Domain and Multi-Timescale Energy Systems

基于图的多领域多时间尺度能源系统建模、控制与优化

Joseph M. Pisani, Christopher T. Aksland, Philip M. Renkert, Joseph Broniszewski, Vismay Vyas, Andrew G. Alleyne, Donald J. Docimo, Justin P. Koeln, Neera Jain, Herschel C. Pangborn

AI总结 本文提出一种基于图的建模方法,用于多领域多时间尺度能源系统的建模、分析、控制、优化和设计,并通过案例和开源工具箱展示其应用。

详情
AI中文摘要

现代车辆和建筑基础设施中的能源系统由跨越多个物理领域(如电气、热力、机械)和时间尺度的高维动力学控制。本教程论文提出了一种基于图的建模方法,旨在促进这些系统的建模、分析、控制、估计、优化和设计。该方法经过十多年跨多个学术机构和公司的研究验证,将瞬态能量守恒与系统中能量存储和传输网络的显式数学表示相结合。在基于图模型的数学概述之后,介绍了近期文献中的多领域组件和系统模型示例,包括单相热系统、两相热系统和机电系统。随后综述了在分散和分层模型预测控制、设计优化以及控制协同设计方面的最新应用。最后,本文描述了一个用于促进基于图模型生成和分析的开源工具箱。

英文摘要

Modern energy systems in vehicles and built infrastructure are governed by high-dimensional dynamics spanning multiple physical domains (e.g., electrical, thermal, mechanical) and timescales. This tutorial paper presents a graph-based modeling approach created to facilitate the modeling, analysis, control, estimation, optimization, and design of these systems. Matured and validated through more than a decade of research spanning multiple academic institutions and companies, the graph-based approach combines transient energy conservation with an explicit mathematical representation of the network by which energy is stored and transferred within a system. Following a mathematical overview of graph-based models, examples of multi-domain component and system models from the recent literature are presented, including single-phase thermal systems, two-phase thermal systems, and electro-mechanical systems. This is followed by a survey of recent applications for decentralized and hierarchical model predictive control, design optimization, and control co-design. Lastly, the paper describes an open-source toolbox created to facilitate the generation and analysis of graph-based models.

2605.27005 2026-05-27 eess.SP

Over-the-Air Successive Interference Cancellation for Efficient 5G NR and Wi-Fi Spectrum Reuse

基于空中连续干扰消除的高效5G NR与Wi-Fi频谱复用

Mir Lodro, Francesco Raimondo, Geoffrey S. Hilton, Mark A. Beach, Andrew C. M. Austin

AI总结 提出一种在屏蔽箱环境中利用连续干扰消除(SIC)实现5G NR与Wi-Fi并发传输的空中(OTA)实验方法,通过USRP接收机捕获复合波形并进行样本域SIC,在18 dB衰减点实现11.88 dB消除深度和26.96 dB的5G信道抑制。

详情
Comments
5 pages, 7 figures
AI中文摘要

本文介绍了在屏蔽箱环境中,利用连续干扰消除(SIC)对并发5G新空口(5G NR)和Wi-Fi传输进行的空中(OTA)实验评估。使用USRP作为接收机,捕获包含两种空中接口信号的复合波形,并应用样本域SIC抑制主导的5G NR信号,从残余波形中恢复Wi-Fi信号。该框架报告了误差矢量幅度(EVM)、误码率(BER)、样本域消除深度和信道估计抑制,并在代表性的18 dB衰减点测量到11.88 dB消除深度和26.96 dB的5G信道抑制。所提出的方法为在受控OTA条件下评估跨技术共存和接收端干扰抑制提供了实用基础。

英文摘要

An over-the-air (OTA) experimental evaluation of concurrent 5G New Radio (5G NR) and Wi-Fi transmission using successive interference cancellation (SIC) in a shielded-box environment is presented. A USRP is used as the receiver, which captures the composite waveform containing both air-interface signals and applies sample-domain SIC to suppress the dominant 5G-NR signal and recover Wi-Fi signal from the residual waveform. The framework reports error vector magnitude (EVM), bit error rate (BER), sample-domain cancellation depth, and channel-estimate suppression, and, at the representative \(18\) dB attenuation point, measures \(11.88\) dB cancellation depth and \(26.96\) dB 5G channel suppression. The proposed methodology provides a practical basis for assessing cross-technology coexistence and receiver-side interference suppression under controlled OTA conditions.

2605.26995 2026-05-27 eess.SP

OTA Characterization of Dual-User IEEE 802.11be EHT-MU Under Transmit-Chain Imbalance

发射链路不平衡下双用户IEEE 802.11be EHT-MU的OTA特性

Mir Lodro, Francesco Raimondo, Geoffrey S. Hilton, Mark A. Beach, Andrew C. M. Austin

AI总结 通过OTA实验研究发射链路不平衡对双用户IEEE 802.11be EHT-MU传输的影响,发现主要失效模式为流局部而非包全局,且LDPC编码可提升敏感流的鲁棒性。

详情
Comments
6 pages, 9 figures
AI中文摘要

本文介绍了在发射链路不平衡条件下,对双用户IEEE 802.11be极高吞吐量多用户(EHT-MU)传输进行受控空中(OTA)特性分析。目标是确定施加到一个接入点发射链路的衰减是产生包全局退化,还是在接收机处理后主要表现为依赖于流的有效载荷退化。测量在屏蔽射频箱内使用两个NI USRP-2953R和NI USRP-2942R软件无线电进行,其中一个USRP生成双用户非OFDMA EHT-MU波形,另一个实现同步双支路分组恢复。对第二个AP发射链路(TX2)施加校准的衰减扫描,并使用误码率(BER)、EHT-Data误差矢量幅度(EVM)、控制字段成功概率、有效载荷成功概率和子载波级EVM分布评估性能。结果表明,在测试范围内,解码为用户1的流保持在BER基底,而解码为用户2的流表现出渐进式EVM退化,随后出现阈值状的BER和有效载荷成功崩溃。公共信令字段保持可恢复,表明观察到的占主导地位的失效模式是接收机输出处的流局部而非包全局。将用户2的二进制卷积编码(BCC)替换为低密度奇偶校验(LDPC)编码,使BER和有效载荷成功崩溃延迟约5 dB的TX2衰减,表明对于更敏感的流存在可测量的依赖于编码的鲁棒性裕度。

英文摘要

This paper presents a controlled over-the-air (OTA) characterization of dual-user IEEE 802.11be Extremely High Throughput Multi-User (EHT-MU) transmission under transmit-chain imbalance. The objective is to determine whether attenuation applied to one access-point transmit chain produces packet-global degradation or appears primarily as stream-dependent payload degradation after receiver processing. Measurements are performed in a shielded RF enclosure using two NI USRP-2953R and NI USRP-2942R software-defined radios, with one USRP generating a dual-user non-OFDMA EHT-MU waveform and the other implementing synchronized dual-branch packet recovery. A calibrated attenuation sweep is applied to the second AP transmit chain (TX2), and performance is evaluated using bit error rate (BER), EHT-Data error vector magnitude (EVM), control-field success probability, payload-success probability, and subcarrier-level EVM distributions. The results show that the stream decoded as User~1 remains at the BER floor over the tested range, while the stream decoded as User~2 exhibits progressive EVM degradation followed by threshold-like BER and payload-success collapse. Common signaling fields remain recoverable, indicating that the dominant observed failure mode is stream-local at the receiver output than the packet-global. Replacing User~2 binary convolutional coding (BCC) with low density parity check (LDPC) coding delays the BER and payload-success collapse by approximately \(5\)~dB of TX2 attenuation, demonstrating a measurable coding-dependent robustness margin for the more sensitive stream.

2605.26988 2026-05-27 physics.geo-ph eess.IV physics.app-ph

Unveiling magma plumbing systems for volcanic eruptions and crustal accretion via active-seismic matrix imaging

通过主动源地震矩阵成像揭示火山喷发和地壳增生的岩浆管道系统

Baptiste Hériard-Dubreuil, Milena Marjanović, Arnaud Burtin, Alexandre Aubry

AI总结 应用主动源地震矩阵成像技术,绘制了东太平洋海隆9°50'N的岩浆储层内部结构,发现锥形轴部储层和相互连接的富岩浆带,结合蛇绿岩证据解决了下地壳形成机制中岩浆通道与原位结晶的长期争议。

详情
Comments
34 pages, 9 figures
AI中文摘要

海底喷发占地球火山活动的80%以上,主要发生在洋中脊,那里的浅层岩浆系统可以通过高分辨率成像观测。然而,它们的偏远性常常导致未被探测。最近在东太平洋海隆9°50'N(最活跃的脊段之一)的地震研究成像了最浅岩浆透镜的详细结构,但没有数据约束模型解释岩浆如何积累、迁移或触发喷发。同样,洋壳的形成仍然知之甚少。虽然二维地震数据仅揭示少数垂直堆叠的瞬态岩浆透镜,但我们的研究应用矩阵成像(一种可控源地震学的新技术)来绘制轴部和离轴岩浆储层的内部结构。我们揭示了一个锥形轴部储层和整个地壳中相互连接的富岩浆带。结合蛇绿岩证据,这些发现表明岩浆通道主导了下地壳形成的前3公里,而原位结晶在最后1公里占主导,解决了一个长期存在的争论。

英文摘要

Submarine eruptions, accounting for over 80% of Earth's volcanic activity, primarily occur along mid-ocean ridges, where shallow magmatic systems are accessible to high-resolution imaging. Yet, their remoteness often leaves them undetected. Recent seismic studies at the East Pacific Rise (EPR) 9°50'N-one of the most dynamic ridge segments, imaged the detailed architecture of the shallowest magma lens, but no data-constrained model yet explains how magma accumulates, migrates, or triggers eruptions. Similarly, the formation of oceanic crust remains poorly understood. While 2-D seismic data reveal only a few vertically stacked, transient magma lenses, our study applies matrix imaging, a novel technique in controlled-source seismology, to map the inner structure of on- and off-axis magma reservoirs. We uncover a conical on-axis reservoir and interconnected magma-rich zones throughout the crust. Combined with ophiolite evidence, these findings reveal that magma channels dominate the first 3 km for lower crust formation, while in situ crystallization prevails in the final 1 km, resolving a long-standing debate.

2605.26970 2026-05-27 eess.SY cs.SY

Congestion Forecasting for Electric Vehicle Charging Scheduling with Fluid Queues

基于流体队列的电动汽车充电调度拥塞预测

Joas Kahlert, Ruiting Wang, Jonas Mårtensson

AI总结 提出一种基于流体的拥塞预测方法,用于电动汽车充电调度,在考虑不确定性到达和容量约束下预测充电站可用性,并减少等待时间高达14%。

详情
AI中文摘要

为了支持电动交通系统的普及,公共充电机会变得越来越重要。在这种动态环境中,路线规划和充电调度的一个核心挑战是在波动需求下预测充电站的可用性。在这项工作中,我们提出了一种基于流体的预测方法,该方法考虑了已知和未知电动汽车到达模式的不确定性,同时尊重充电站容量约束。我们进一步通过将拥塞预测方法应用于电动汽车调度问题来评估其性能。与依赖标准基线的调度框架相比,基于流体拥塞预测模型的充电调度将等待相关的停机时间减少了高达14%。最后,我们量化了增加对车辆到达的了解以及不同水平的充电站拥塞如何影响整体系统性能。

英文摘要

To support the adoption of electric transport systems, public charging opportunities are becoming increasingly important. In this dynamic environment, a central challenge for route planning and charging scheduling is forecasting charging-station availability under fluctuating demand. In this work, we propose a fluid-based forecasting method that accounts for uncertainty in both known and unforeseen electric vehicle arrival patterns while respecting station capacity constraints. We further evaluate the congestion forecasting method by applying it to an electric vehicle scheduling problem. Compared to scheduling frameworks that rely on standard baselines, charging schedules based on the fluid congestion forecasting model reduce waiting-related downtime by up to 14%. Finally, we quantify how increased knowledge of vehicle arrivals and different levels of station congestion affect overall system performance.

2605.26950 2026-05-27 eess.SP

Half-Quadratic Criterion based Adaptive Graph Signal Processing Algorithm

基于半二次准则的自适应图信号处理算法

Chong Zhang, Haiquan Zhao, Chengjin Li

AI总结 针对图信号处理中非高斯噪声干扰及参数手动调节复杂的问题,提出基于强凸半二次准则的GSP HQC算法,提高了收敛速度和自适应估计精度。

详情
AI中文摘要

近年来,自适应图信号处理算法的进展为处理定义在图结构上的信号提供了有效解决方案。作为信息论中的经典策略,广义最大相关熵准则(GMCC)对非高斯噪声表现出良好的抵抗性。当非高斯噪声干扰图信号时,基于GMCC的图信号处理算法(GSP GMCC)展现出更好的性能。然而,GSP GMCC算法本身有三个需要手动调节的参数,手动调参过程复杂且繁琐。同时,GMCC函数本身的非凹非凸性限制了其收敛速度和自适应估计精度。为解决上述问题,本文基于强凸函数半二次准则(HQC),提出了GSP HQC算法。本文对GSP HQC算法进行了性能分析。仿真实验表明,GSP HQC算法在保持与现有算法相当的计算复杂度的同时,在收敛速度和自适应估计精度方面实现了优越性能。

英文摘要

In recent years, progress in adaptive graph signal processing algorithms has provided effective solutions for processing signals defined on graph structures. As a classical strategy in information theory, the Generalized Maximum Correntropy Criterion (GMCC) exhibits good resistance to non-Gaussian noises. When non-Gaussian noise interferes with the graph signal, the graph signal processing algorithm based on GMCC (GSP GMCC) algorithm shows better performance. However, the GSP GMCC algorithm itself has three parameters that need to be manually tuned, and the process of manually tuning the parameters is complex and tedious. Meanwhile, the non-concave and non-convex nature of the GMCC function itself limits its own convergence rate and adaptive estimation accuracy. To solve the above problems, based on the strongly convex function half-quadratic criterion (HQC), the GSP HQC algorithm is proposed in this paper. The performance analysis of the GSP HQC algorithm is implemented in this paper. Simulation experiments demonstrate that the GSP HQC algorithm achieves superior performance in terms of convergence rate and adaptive estimation accuracy while maintaining computational complexity comparable to existing algorithms

2605.26943 2026-05-27 eess.SP

On the LEO Satellite Constellation Design for North Atlantic Coverage

面向北大西洋覆盖的LEO卫星星座设计

Alejandro Ramírez-Arroyo, Miguel Villanueva-Fernández, Preben Mogensen

AI总结 针对北大西洋高纬度区域,研究Walker Delta星座的倾角、最小仰角、高度和卫星覆盖范围对可见概率、重访时间、路径损耗和覆盖连续性的影响,发现最小仰角和倾角是关键参数。

详情
Comments
13 pages, 12 figures
AI中文摘要

低地球轨道(LEO)卫星星座因其低延迟和高容量通信能力,正成为非地面网络的关键组成部分。然而,与更高轨道卫星相比,这些轨道上的卫星具有覆盖足迹小和轨道速度高的特点。这导致星座不断变化且动态,需要智能设计轨道参数以确保连续覆盖。现有的星座部署通常针对低中纬度地区或全极地覆盖进行优化,而北大西洋等高纬度区域场景尚未得到充分探索。本文深入探讨了在LEO部署卫星以实现北大西洋覆盖的关键特征。因此,我们研究了星座倾角、最小仰角、高度和卫星覆盖范围如何共同影响可见概率、重访时间、路径损耗和覆盖连续性。结果表明,最小仰角是一个关键设计参数:在1000公里高度、64颗卫星的Walker Delta星座中,对于低于20°的仰角,可以在北纬55°以上提供连续覆盖,而对于更大的仰角,覆盖概率急剧下降。同样,要实现中等规模星座对北大西洋的稳健覆盖,倾角需高于约70°。因此,这些结果为如何设计卫星星座以高效部署并专注于北大西洋覆盖(针对海事、航空和北极连接场景)提供了实用指南。

英文摘要

Low Earth Orbit (LEO) satellite constellations are emerging as a key component of non-terrestrial networks due to their low-latency and high-capacity communication capabilities. However, satellites in these orbits are characterized by a small coverage footprint and high orbital velocity compared to those in higher orbits. This results in constantly changing and dynamic constellations that require smart design of orbital parameters to ensure continuous coverage. Existing constellation deployments are typically optimized either for low- and mid-latitude regions or for full polar coverage, leaving high-latitude regional scenarios such as the North Atlantic insufficiently explored. This work provides insights into the key characteristics associated with the deployment of satellites in LEO for North Atlantic coverage. Therefore, we investigate how constellation inclination, minimum elevation angle, altitude, and satellite footprint jointly affect visibility probability, revisit time, path loss, and coverage continuity. Results show that the minimum elevation angle is a critical design parameter since a Walker Delta constellation with 64 satellites at 1000 km altitude can provide continuous coverage above 55°N for elevations below 20°, whereas coverage probability degrades drastically for larger elevation angles. Similarly, inclinations above approximately 70° are required to achieve robust North Atlantic coverage with medium-size constellations. Thus, these results provide practical guidelines on how a satellite constellation should be designed to achieve an efficient deployment with a focus on coverage over the North Atlantic, targeting maritime, aviation, and Arctic connectivity scenarios.

2605.26928 2026-05-27 eess.SP

NF-TrackLLM: Joint Prediction of UAV Trajectory and Near-Field Beam for LAE XL-MIMO Systems

NF-TrackLLM:用于LAE XL-MIMO系统的无人机轨迹与近场波束联合预测

Qianfan Lu, Mengyuan Li, Jiachen Tian, Yu Han, Xiao Li, Shi Jin

AI总结 提出NF-TrackLLM框架,通过多模态语义感知和GPT-2时空推理,联合预测近场XL-MIMO系统中无人机轨迹和波束,实现高精度跟踪与波束预测。

详情
AI中文摘要

在超大规模多输入多输出(XL-MIMO)系统中,用户定位和波束管理紧密耦合,特别是在密集低空经济(LAE)场景中。然而,XL-MIMO中的近场传播引入了强烈的距离敏感性和复杂的空间耦合,使得联合轨迹和波束预测具有挑战性。同时,大语言模型(LLMs)在物理层传输中因建模长程依赖性而受到关注。本文提出NF-TrackLLM,一种用于XL-MIMO系统中近场无人机(UAV)定位和波束预测的多模态语义感知框架。通过将视觉和LiDAR感知集成到基于Sionna的信道生成流水线中,利用环境语义和GPS指导轨迹和波束预测。基于对齐的多模态表示,采用基于GPT-2的时空推理骨干和级联预测策略,首先推断未来轨迹,然后将其作为几何先验指导波束预测。仿真结果表明,NF-TrackLLM在密集城市低空场景中实现了准确的波束预测和可靠的无人机轨迹跟踪。

英文摘要

User localization and beam management are tightly linked in extremely large-scale multiple-input multiple-output (XL-MIMO) systems, especially in dense low-altitude economy (LAE) scenarios. However, the near-field propagation in XL-MIMO introduces strong distance sensitivity and complex spatial coupling, which makes joint trajectory and beam prediction challenging. Meanwhile, large language models (LLMs) have attracted attention in physical-layer transmission for modeling long-range dependencies. In this paper, we propose NF-TrackLLM, a multi-modal semantic-aware framework for near-field unmanned aerial vehicles (UAVs) positioning and beam prediction in XL-MIMO systems. By incorporating visual and LiDAR sensing into a Sionna-based channel generation pipeline, environmental semantics and GPS are utilized to guide trajectory and beam prediction. Built upon the aligned multi-modal representation, a GPT-2-based spatiotemporal reasoning backbone, and a cascaded prediction strategy are employed, where future trajectories are first inferred and then used to guide beam prediction as geometric priors. Simulation results demonstrate that NF-TrackLLM achieves accurate beam prediction and reliable UAV trajectory tracking in dense urban low-altitude scenarios.

2605.26915 2026-05-27 eess.SP

Gaussian Process-Based Extended Object Estimation for 6G ISAC at Millimeter-Wave Frequencies

基于高斯过程的扩展目标估计用于毫米波频段6G ISAC

M. Ertug Pihtili, Ossi Kaltiokallio, Julia Equi, Jukka Talvitie, Elena Simona Lohan, Ertugrul Basar, Mikko Valkama

AI总结 针对6G ISAC场景,提出基于高斯过程的扩展目标估计方法,通过双基地感知和5G NR实测验证,在毫米波映射和SLAM中提升环境感知能力。

详情
AI中文摘要

本文介绍了一种基于高斯过程(GP)的扩展目标估计(EOE)方法,用于集成感知与通信(ISAC)场景,代表了一种超越传统点散射体假设来增强环境感知的有前景方法。通过符合第五代(5G)新无线电(NR)标准并采用双基地感知的实际测量设置,研究了所提出的基于GP的EOE方法的适用性,并在毫米波(mmWave)频率下对映射和同时定位与映射(SLAM)两种情况进行了评估。结果表明,通信网络的增强能力与双基地感知及基于GP的EOE相结合,能够在未来无线系统中实现更好的环境感知。重要的是,结果证明,在实际条件下,GP在毫米波映射和SLAM场景中均能有效执行EOE。

英文摘要

This paper introduces a Gaussian process (GP)-based method for extended object estimation (EOE) in integrated sensing and communication (ISAC) scenarios, representing a promising approach to enhance environmental awareness beyond the conventional point-scatterer assumption. The suitability of the proposed GP-based method for EOE is investigated through a practical measurement setup compliant with the fifth-generation (5G) New Radio (NR) standard and employing bistatic sensing, with results evaluated for both mapping and simultaneous localization and mapping (SLAM ) cases at millimeter-wave (mmWave) frequencies. The findings reveal that the enhanced capabilities of communication networks, when combined with bistatic sensing and GP-based EOE, enable improved environmental awareness in future wireless systems. Importantly, the results demonstrate that, under practical conditions, GP effectively performs EOE in both mmWave mapping and SLAM scenarios.

2605.26901 2026-05-27 eess.SY cs.SY

Load Management of Distribution Systems via Online Dynamic Pricing

配电网负荷管理:基于在线动态定价

Jiarui Yu, Zhiyu He, Wenbin Wang, Colin N. Jones, Florian Dörfler, Hanmin Cai

AI总结 提出一种仅依赖聚合负荷观测的在线反馈优化(OFO)算法,用于日前电价设计,在保护隐私的同时有效降低峰值负荷。

详情
Comments
18 pages
AI中文摘要

电动汽车的日益普及正在增加配电网的峰值需求,这可能威胁电网稳定性并降低运营效率。动态电价是一种通过转移灵活需求来缓解这些峰值的有效手段。然而,大多数现有方法依赖于详细的用户级消费数据和行为模型,这些在实际中往往难以获取,并可能引发隐私问题。本文提出了一种在线反馈优化(OFO)算法,用于在数据有限的情况下进行日前电价设计,其中仅观测聚合负荷。OFO利用聚合负荷测量值迭代更新电价,从而在无需访问个体用户数据的情况下实现有效的峰值削减。该公式还包括一个惩罚项,用于约束相对于参考电价的总电费偏差。尽管仅依赖聚合负荷测量,OFO的电价更新能够高效收敛到最优电价。在有限时域仿真中,OFO实现的峰值削减接近具有完整模型信息的Stackelberg基准。同时,其计算量显著降低。在多种初始条件和延迟充电窗口不匹配下的额外测试进一步证实了所提方法的鲁棒性。总体而言,这些结果表明OFO是一种可扩展且计算高效的配电网峰值需求管理方法,适用于可观测性有限的情况。

英文摘要

The growing adoption of electric vehicles (EVs) is increasing peak demand in distribution systems, which can threaten grid stability and reduce operational efficiency. Dynamic electricity pricing is a promising means of mitigating these peaks by shifting flexible demand. However, most existing approaches rely on detailed user-level consumption data and behavioral models, which are often difficult to obtain in practice and may raise privacy concerns. This paper proposes an Online Feedback Optimization (OFO) algorithm for day-ahead price design with limited data, where only aggregate loads are observed. OFO updates prices iteratively using aggregate load measurements, enabling effective peak reduction without access to individual user data. The formulation also includes a term that penalizes deviations in total electricity cost relative to a reference tariff. Although relying only on aggregate load measurements, the OFO price updates efficiently converge to the optimal price. In finite-horizon simulations, OFO achieves peak reduction close to that of the Stackelberg benchmark with full model information. Meanwhile, its computational effort is substantially lower. Additional tests under multiple initial conditions and delayed charging-window mismatch further confirm the robustness of the proposed method. Overall, these results show that OFO is a scalable and computationally efficient approach for peak-demand management in distribution systems with limited observability.

2605.26885 2026-05-27 math.OC cs.SY eess.SY

A Fixed-Time Sliding-Mode Framework for Constraint Optimization

面向约束优化的固定时间滑模框架

Baby Diana, Priyanka Singh, Shyam Kamal, Sandip Ghosh, Bijnan Bandyopadhyay

AI总结 提出一种固定时间滑模框架,通过将拉格朗日乘子视为控制输入并嵌入等式约束作为滑模流形,实现约束满足和KKT点的固定时间收敛,同时保证对匹配扰动的鲁棒性。

详情
Comments
6 Pages, 4 Figures, Accepted in VSS 2026 18th International Workshop on Variable Structure Systems
AI中文摘要

本文针对约束问题开发了一个鲁棒的固定时间优化框架,该框架保证在固定时间内精确满足约束并收敛到KKT点,且与初始条件无关。该方法将拉格朗日乘子视为控制输入,由等效控制和切换控制组成,系统状态代表决策变量。等效控制将梯度流渐近地引导至非凸目标的局部KKT点,并在凸目标下于固定时间内引导至唯一全局最优解。通过将等式约束直接嵌入为滑模流形来实现约束强制执行,并采用固定时间切换控制确保快速可靠的可行性。该框架进一步考虑了匹配扰动,提供了鲁棒性保证,并通过球面约束进行了理论表征和说明。在3节点交流最优潮流问题和基于分布式共识的参数估计问题上的数值研究证明了所提方法的有效性、可扩展性和鲁棒性。

英文摘要

This paper develops a robust fixed time optimization framework for constrained problems that guarantees exact constraint satisfaction and convergence to KKT points within fixed time , independent of initial conditions. The approach treats the Lagrange multipliers as control inputs, composed of an equivalent control and a switching control, with the system states representing the decision variables. An equivalent control steers the gradient flow to a local KKT point asymptotically for nonconvex objectives and to unique global optimum in fixed time for convex objectives. Constraint enforcement is achieved by embedding the equality constraints directly as a sliding manifold, with a fixed time switching control ensuring rapid and reliable feasibility. The framework further accounts for the matched disturbances, providing robustness guarantees that are theoretically characterized and illustrated using spherical constraints. Numerical studies on a 3-bus AC optimal power flow problem and distributed consensus=based parameter estimation problem demonstrate the effectiveness, scalability and robustness of proposed approach.

2605.26880 2026-05-27 eess.IV cs.MM

GScomp-QA: A Subjective Dataset for Quality Assessment of Compressed Gaussian Splatting

GScomp-QA:压缩高斯泼溅质量评估的主观数据集

Pedro Martin, António Rodrigues, João Ascenso, Maria Paula Queluz

AI总结 针对高斯泼溅压缩缺乏评估数据集的问题,构建了包含331个视频刺激的主观质量评估数据集GScomp-QA,并通过率失真分析和18种客观指标评估现有压缩方案。

详情
AI中文摘要

高斯泼溅(GS)已成为高质量3D重建和新视角合成的高效表示方法。然而,其大模型尺寸给存储和传输带来挑战。尽管已有多种GS压缩方案被提出,但由于缺乏专门的评估数据集,其感知影响仍不明确。为填补这一空白,本文引入了GScomp-QA,一个用于评估压缩GS模型合成质量的主观质量评估数据集。该数据集包含来自13个真实场景的331个视频刺激,覆盖9种最先进的GS压缩方案。通过使用未压缩模型合成的视频作为参考,GScomp-QA将压缩引起的失真与合成伪影分离开来。进行了20名参与者的主观研究,提供了可靠的感知评分。基于这些数据,通过感知率失真分析评估了GS压缩方案。此外,评估了18种客观质量指标,表明它们未能完全捕捉GS特有的失真。GScomp-QA将公开提供,作为评估GS压缩方案和支持开发针对GS压缩的质量指标的基准。

英文摘要

Gaussian Splatting (GS) has emerged as an efficient representation for high-quality 3D reconstruction and novel view synthesis. However, its large model size poses challenges for storage and transmission. While several GS compression solutions have been proposed, their perceptual impact remains poorly understood due to the lack of dedicated evaluation datasets. To address this gap, this paper introduces GScomp-QA, a subjective quality assessment dataset for evaluating synthesis quality from compressed GS models. The dataset comprises 331 video stimuli from 13 real-world scenes, covering 9 state-of-the-art GS compression solutions. By using videos synthesized from uncompressed models as reference, GScomp-QA isolates compression-induced distortions from synthesis artifacts. A subjective study with 20 participants was conducted, providing reliable perceptual scores. Based on these data, GS compression solutions are evaluated through perceptual rate-distortion analysis. In addition, 18 objective quality metrics are evaluated, showing that they do not fully capture GS-specific distortions. GScomp-QA will be publicly available and provide a benchmark for evaluating GS compression solutions and supporting the development of quality metrics tailored to GS compression.

2605.26838 2026-05-27 eess.SY cs.SY

Critical Infrastructure Defense Against Aerial Swarms Under Sensing Uncertainty: Online Allocation With Finite-Time Guarantees

感知不确定性下针对空中蜂群的关键基础设施防御:具有有限时间保证的在线分配

Shriya Pandey, Devaprakash Muniraj

AI总结 提出一种闭环、感知不确定性感知的框架,通过概率图建模、风险感知关键性模型和鲁棒在线分配机制,防御小型无人机蜂群对保护区的协同入侵,并证明有限时间捕获触发和预期中和时间的混合线性-几何上界。

详情
AI中文摘要

本文提出了一种闭环、不确定性感知的框架,用于防御受保护区域免受小型无人飞机系统(UAS)蜂群的协同入侵。攻击者的交互结构被建模为时变的,而防御者在感知不完善的情况下运行。所提出的关键性驱动的防御者-攻击者分配策略整合了三个组成部分:基于从不确定性观测推断出的攻击蜂群的概率图表示;结合时间-突破紧迫性和不确定性的风险感知攻击者关键性模型;在线防御者分配机制,该机制分配并选择性重新分配防御者,同时通过鲁棒执行约束限制切换引起的不稳定性。在基于过滤的首次命中时间框架内建立了分析保证。特别地,证明了检测后首次捕获事件的有限时间触发,并推导了预期中和时间的显式混合线性-几何上界。蒙特卡洛模拟证明了所提出框架的有效性,在概率感知下实现了85.6%的中和效率,在确定性感知下实现了99.9%。系统的消融和敏感性研究进一步量化了检测阈值和协调参数如何影响可靠性和首次捕获时间。

英文摘要

This article presents a closed-loop, uncertainty-aware framework for defending a protected zone against coordinated incursions by swarms of small uncrewed aircraft systems (UAS). The interaction structure of the attackers is modeled as time-varying, while defenders operate under imperfect sensing. The proposed criticality-driven defender-to-attacker assignment strategy integrates three components: a probabilistic graph-based representation of the attacking swarm inferred from uncertain observations; a risk-aware attacker criticality model combining time-to-breach urgency with uncertainty; an online defender allocation mechanism that assigns and selectively reassigns defenders while limiting switching-induced instability through robust execution constraints. Analytical guarantees are established within a filtration-based first-hitting-time framework. In particular, finite-time triggering of the first capture event following detection is proven, and explicit mixed linear-geometric upper bounds are derived for the expected neutralization time. Monte Carlo simulations demonstrate the effectiveness of the proposed framework, achieving 85.6% neutralization efficiency under probabilistic sensing and 99.9% under deterministic sensing. Systematic ablation and sensitivity studies further quantify how detection thresholds and coordination parameters influence reliability and time-to-first-capture.

2605.26836 2026-05-27 eess.SP

Same Signal, Different Story: Demystifying Receiver Effects in Wi-Fi Channel State Information

相同信号,不同故事:揭秘Wi-Fi信道状态信息中的接收器效应

Fabian Portner, Francesco Gringoli, Matthias Hollick, Arash Asadi

AI总结 本文通过统一实验设置,首次系统比较了不同商用Wi-Fi接收器的信道状态信息,发现自动增益控制和子载波非线性是导致跨设备差异的主要原因,并提出增益对齐预处理方法,可恢复高达75%的跨设备人体活动识别准确率。

详情
Journal ref
in IEEE Internet of Things Journal, vol. 13, no. 10, pp. 20963-20977, 15 May, 2026
Comments
15 pages, 22 figures, IEEE Internet of Things Journal
AI中文摘要

Wi-Fi感知已成为一种多功能工具,用于定位、手势识别和生命体征监测等任务,支持从智能环境到个性化医疗的应用。然而,当预训练模型部署到不同的商用接收器时,感知精度通常会显著下降。我们首次对跨多种商用Wi-Fi感知平台的信道状态信息进行了系统比较。通过统一的实验设置,同时向多个接收器发送精确预编码的信号,我们隔离了接收器特定的变异性。我们发现,主要的跨设备差异源于自动增益控制和一致性的子载波非线性。我们提出了一种简单的增益对齐预处理步骤,在跨设备人体活动识别模型部署中恢复了大部分丢失的精度(高达75%)。没有预处理,模型精度急剧下降,实际上破坏了实际部署。进一步的分析揭示了接收器保真度、灵敏度和噪声方面的可测量固有差异。虽然这些接收器引起的差异不会显著影响稳健的感知任务(如人体活动识别),但在需要高精度的场景(如单次飞行时间测量)中变得相关。我们的发现表明,CSI中的跨设备变异性是真实但可管理的,我们提供了用于稳健、硬件无关的Wi-Fi感知的工具和指南。

英文摘要

Wi-Fi sensing has emerged as a versatile tool for tasks such as localization, gesture recognition, and vital-sign monitoring, enabling applications from smart environments to personalized healthcare. However, sensing accuracy often significantly degrades when pretrained models are deployed across different commodity receivers. We present the first systematic comparison of Channel State Information (CSI) across diverse Commercial Off-The-Shelf Wi-Fi sensing platforms. Using a unified experimental setup delivering precisely precoded signals simultaneously to multiple receivers, we isolate receiver-specific variability. We find that dominant cross-device differences arise from Automatic Gain Control and consistent subcarrier nonlinearities. We propose a simple gain-alignment preprocessing step, recovering most of the lost accuracy (up to 75%) in cross-device Human Activity Recognition model deployments. Without preprocessing, model accuracy sharply drops-effectively breaking practical deployments. Additional analyses reveal measurable inherent differences in receiver faithfulness, sensitivity and noise. While these receiver-induced differences do not significantly affect robust sensing tasks such as Human Activity Recognition, they become relevant in scenarios demanding high precision (e.g., single-shot time of flight). Our findings demonstrate that cross-device variability in CSI is real but manageable, and we provide tools and guidelines for robust, hardware-agnostic Wi-Fi sensing.

2605.26812 2026-05-27 eess.AS

CFMDCTCodec: A Low-Bitrate Neural Speech Codec with Noise-Prior-aware Conditional Flow Matching for MDCT-Spectral Enhancement

CFMDCTCodec: 一种基于噪声先验感知的条件流匹配MDCT谱增强的低比特率神经语音编解码器

Xiao-Hang Jiang, Yang Ai, Hui-Peng Du, Zhen-Hua Ling, Ji Wu

AI总结 提出CFMDCTCodec,一种在MDCT域内结合轻量级编解码器和条件流匹配增强器的低比特率神经语音编解码器,通过非对抗训练在0.65 kbps下实现高质量语音编码。

详情
Comments
Accepted by IEEE Transactions on Audio, Speech and Language Processing
AI中文摘要

低比特率下的高质量语音编码对于带宽受限的应用至关重要,但由于高度压缩表示中质量关键信息的严重丢失,仍然具有挑战性。为了克服这一挑战,我们提出了CFMDCTCodec,一种完全在改进的离散余弦变换(MDCT)域中运行的低比特率神经语音编解码器。CFMDCTCodec集成了一个轻量级的编码器-量化器-解码器风格的MDCT谱编解码器和一个基于噪声先验感知的条件流匹配(CFM)的MDCT谱增强器。在该框架中,编解码器作为基础模块,将语音中提取的MDCT谱紧凑地离散化并产生初始的粗略重建,而增强器进一步恢复细粒度的谱细节。增强器通过将条件MDCT速度场滤波器与常微分方程(ODE)求解器集成,在MDCT导出的幅度自适应噪声先验的指导下,改善解码后的MDCT谱,旨在强调感知上显著的高能量区域,同时稳定低能量和静音区域。最后,增强后的MDCT谱通过逆MDCT重建为解码语音。在优化CFMDCTCodec时,我们采用统一的非对抗训练策略,联合结合了重建、量化和CFM目标。客观和主观评估表明,CFMDCTCodec在低比特率区域(例如0.65 kbps)优于竞争基线,同时在参数和计算量显著减少的情况下接近大规模编解码器的感知质量。

英文摘要

High-quality speech coding at low bitrates is crucial for bandwidth-constrained applications, yet remains challenging due to the severe loss of quality-critical information in highly compressed representations. To overcome this challenge, we propose CFMDCTCodec, a low-bitrate neural speech codec that operates entirely in the modified discrete cosine transform (MDCT) domain. CFMDCTCodec integrates a lightweight encoder-quantizer-decoder-style MDCT-spectral codec with a noise-prior-aware, conditional-flow-matching (CFM)-based MDCT-spectral enhancer. Within this framework, the codec serves as a base module that compactly discretizes the MDCT spectrum extracted from speech and produces an initial coarse reconstruction, while the enhancer further restores fine-grained spectral details. The enhancer improves the decoded MDCT spectrum by integrating a conditional MDCT velocity-field filter with an ordinary differential equation (ODE) solver, under the guidance of an MDCT-derived magnitude-adaptive noise prior, aiming to emphasize perceptually significant high-energy regions while stabilizing low-energy and silent regions. Finally, the enhanced MDCT spectrum is reconstructed into the decoded speech using the inverse MDCT. When optimizing CFMDCTCodec, we adopt a unified non-adversarial training strategy that jointly combines reconstruction, quantization and CFM objectives. Both objective and subjective evaluations show that CFMDCTCodec outperforms competitive baselines in low-bitrate regimes, e.g., 0.65 kbps, while approaching the perceptual quality of large-scale codecs with significantly fewer parameters and computations.

2605.26796 2026-05-27 eess.SY cs.SY

Incentive-Based Load Curtailment with Limited Information: A Bilevel Zeroth-Order Learning Approach

基于有限信息的激励型负荷削减:一种双层零阶学习方法

Zhisen Jiang, Florian Dörfler, Saverio Bolognani

AI总结 针对用户参数未知和响应非光滑问题,提出双层零阶学习算法Bi-ZOL,利用结构分解平滑响应并降低超梯度估计误差,实现近似最优性能。

详情
Comments
9 pages, 5 figures, submitted to PowerUP conference
AI中文摘要

激励型负荷削减释放了关键的需求侧灵活性,但由于用户私有参数知识有限以及物理设备约束导致的响应固有非光滑性而受到阻碍。我们通过一个受约束的双层优化框架来解决这个问题,并提出了Bi-ZOL(双层零阶学习)算法。与传统的黑箱方法不同,Bi-ZOL利用双层结构分解超梯度,将系统运营商目标的精确解析信息与未知响应灵敏度的零阶估计相结合。这种基于结构分解的学习方法在数学上平滑了非光滑的响应景观,并减少了超梯度估计误差。我们提供了收敛到近似平稳点的理论保证,并通过仿真表明Bi-ZOL实现了接近最优的性能。

英文摘要

Incentive-based load curtailment unlocks critical demand-side flexibility but is hindered by the limited knowledge of private user parameters and the inherent nonsmoothness of responses due to physical device constraints. We address this via a constrained bilevel optimization framework and propose the Bi-ZOL (Bilevel Zeroth-Order Learning) algorithm. Unlike conventional black-box methods, Bi-ZOL exploits the bilevel structure to decompose the hypergradient, integrating the exact analytical information of the SO's objective with a zeroth-order estimate of the unknown response sensitivity. This structural decomposition-based learning method mathematically smoothes the nonsmooth response landscape and reduces hypergradient estimation error. We provide theoretical convergence guarantees to an approximate stationary point and demonstrate through simulations that Bi-ZOL achieves near-optimal performance.

2605.26787 2026-05-27 eess.SY cs.SY

Enforcing Soft Monotonicity Constraints for Recursive Gaussian Process Regression in Real Time

递归高斯过程回归中软单调性约束的实时强制执行

Ricus Husmann, Sven Weishaupt, Harald Aschemann

AI总结 提出一种实时算法,通过扩展卡尔曼滤波和伪测量结合ReLU伪测量函数,高效计算递归高斯过程回归梯度并强制执行软单调性约束,显著提升数值鲁棒性。

详情
Comments
This paper was accepted to the Springer "Lecture Notes in Electrical Engineering" as a post-publication of DOI: 10.5220/0013783100003982
AI中文摘要

在这项工作中,我们引入了一种实时算法,用于考虑递归高斯过程回归(RGP)的单调性假设。因此,我们展示了如何在线高效计算RGP梯度。然后,我们利用扩展卡尔曼滤波和伪测量,结合ReLU伪测量函数来强制执行软不等式约束。这项工作基于之前发表的目标相同且基本方法类似的会议论文。然而,与之前的工作相反,我们现在对RGP梯度使用精确的协方差计算。此外,我们还提出了该算法的实时优化版本,与之前发布的版本相比简化更少。这些以及其他几项算法创新使得算法的数值鲁棒性大大提高。该算法通过一个二维数值示例进行验证,并与之前发布的版本进行比较。论文最后通过成功实验验证了所开发算法在单调性保持学习气动阀特性以控制气动系统中的应用,利用了部分输入-输出线性化。

英文摘要

In this work, we introduce a real-time capable algorithm for considering monotonicity assumptions for recursive Gaussian Process regression (RGP). Therefore, we present how to efficiently calculate the RGP gradients online. Then, we utilize an extended Kalman filter and pseudo-measurements in combination with a ReLU pseudo-measurement function to enforce soft inequality constraints. This work builds upon a previously published conference paper with the same goal and a similar fundamental approach. Opposite to our previous work, however, we now use an exact covariance calculation for the RGP gradients. Furthermore, we also present a real-time optimized version of this algorithm with less simplifications compared to the previously published version. These and several other algorithmic innovations lead to an algorithm with greatly improved numerical robustness. The algorithm is validated and compared to its previously published version for a 2D numerical example. The paper is concluded with a successful experimental validation of the developed algorithm for the monotonicity-preserving learning of pneumatic valve characteristics for the control of a pneumatic system, leveraging a partial input - output linearization.

2605.26760 2026-05-27 eess.SP

Multimodal Signal Restoration with Signed Twofold Graph Learning

带符号双图学习的多模态信号恢复

Haruki Yokota, Hiroshi Higashi, Yuichi Tanaka

AI总结 提出一种联合信号恢复与双图学习的方法,通过矩阵正态先验和交替最小化求解非凸目标,并利用可展开网络实现参数学习,在合成与真实数据上优于现有方法。

详情
Comments
34 pages, 10 figures, 2 tables. Submitted to APSIPA Transactions on Signal and Information Processing
AI中文摘要

传感器网络上的多模态信号通常基于双图假设(TGA)建模,该假设将空间结构和模态间关系表示为两个独立的图。然而,现有的基于TGA的信号恢复方法要么假设图已知,要么限制边权重为非负,从而无法捕捉负的模态间相关性。我们通过将联合信号恢复和双图学习公式化为矩阵正态先验下的最大后验估计来解决这两个限制,其中空间图和模态图的拉普拉斯矩阵直接作为精度矩阵。得到的非凸目标通过交替最小化求解:信号通过共轭梯度法应用于产生的Sylvester型线性系统进行更新;图通过原始-对偶混合梯度(PDHG)进行更新。我们进一步提出一种方法,从互补核矩阵的主特征空间中估计模态图的带符号结构,然后将其用于PDHG中更新边幅度。这些迭代求解器随后被展开成一个前馈网络,其中正则化权重和步长作为逐层可训练参数。在合成多模态图信号和真实日本气象数据集上的实验证实,所提出的方法在各种噪声水平和缺失数据模式下均优于现有基线方法。

英文摘要

Multimodal signals on sensor networks are commonly modeled under the twofold graph assumption (TGA), which represents spatial structure and inter-modality relations as two separate graphs. Existing TGA-based signal restoration methods, however, either assume the graphs are known or restrict edge weights to be non-negative, preventing them from capturing negative inter-modal correlations. We address both limitations by formulating joint signal restoration and twofold graph learning as MAP estimation under a matrix normal prior, where the spatial and modality graph Laplacians appear directly as precision matrices. The resulting non-convex objective is solved by alternating minimization: The signal is updated via conjugate gradient applied to the arising Sylvester-type linear system; the graphs are updated via primal-dual hybrid gradient (PDHG). We further propose a method to estimate the signed structure of the modality graph from the dominant eigenspace of a complementary kernel matrix, which is then used in PDHG to update edge magnitudes. These iterative solvers are then unrolled into a feedforward network, with regularization weights and step sizes as layer-wise trainable parameters. Experiments on synthetic multimodal graph signals and a real Japan meteorological dataset confirm that the proposed method outperforms existing baselines across a range of noise levels and missing-data patterns.

2605.26752 2026-05-27 eess.IV

Reconstructing 3D Neural Hemodynamics using Sparse Ultrasound Localization Microscopy Data

利用稀疏超声定位显微镜数据重建三维神经血流动力学

Jipeng Yan, Oscar Bates, Jingwen Zhu, Qingyuan Tan, Biao Huang, John Goodwin, Andriy S. Kozlov, Chris Dunsby, Meng-Xing Tang

AI总结 针对超声定位显微镜数据稀疏导致血流动力学重建受限的问题,提出基于随机变分推断求解层流模型的方法,生成速度、压力梯度和不确定性图,并通过仿真和三维大鼠脑成像验证其有效性。

详情
AI中文摘要

超声定位显微镜(ULM)因其能够重建深层微血管结构而在功能成像中展现出巨大潜力。然而,由于有限数量的微泡轨迹无法采样单个血管内的完整速度分布,ULM数据的稀疏性影响了血流动力学重建。本文提出通过随机变分推断求解层流模型,利用稀疏ULM速度图重建血流动力学。除了血管几何结构和血流速度图,该方法还生成两种新的ULM图——压力梯度图和描述估计不确定性的图。通过研究ULM图中稀疏性对血流动力学量化和可视化的影响,我们通过仿真和三维大鼠脑成像证明了该方法在处理稀疏ULM图方面的有效性。利用稀疏ULM数据准确重建广泛的血流动力学参数及其相关不确定性,可能有助于检测微妙且动态的脑活动。

英文摘要

Ultrasound Localization Microscopy (ULM) has presented great potential in functional imaging, benefiting from its ability to reconstruct deep microvasculature. However, the hemodynamic reconstruction is compromised by sparsity in the ULM data, as a limited number of MB tracks cannot sample the complete speed profile in one vessel. Here, we propose to reconstruct hemodynamics using sparse ULM velocity maps by solving a laminar flow model through stochastic variational inference. In addition to vascular geometry and flow velocity maps, the proposed method generates two new ULM maps - a pressure gradient map and a map describing uncertainty of the estimation. By investigating the effect of sparsity in ULM maps on the quantification and visualization of hemodynamics, we demonstrate the effectiveness of the proposed method in dealing with sparse ULM maps via simulations and 3D rat brain imaging. Accurately reconstructing a broad range of hemodynamic parameters and associate uncertanties using sparse ULM data may help detect subtle and dynamic brain activity.