arXivDaily arXiv每日学术速递 周一至周五更新
2606.20401 2026-06-19 eess.SY cs.SY 交叉投稿

PowerAgentBench-Dyn: A Benchmark for Agentic AI in Power System Dynamic Studies

PowerAgentBench-Dyn:电力系统动态研究中智能体AI的基准测试

Qian Zhang, Andrea Pomarico, Costas Mylonas, Magda Foti, Alberto Berizzi, Le Xie

AI总结 提出PowerAgentBench-Dyn基准,用于评估基于LLM的智能体在电力系统动态分析任务中的能力,涵盖模型质量审查和安全风险筛选两个任务。

详情
AI中文摘要

基于大型语言模型(LLM)的智能体越来越多地被用于通过与软件工具交互、解释中间结果以及自主规划后续行动来自动化多步骤工程工作流。电力系统动态研究是这些智能体一个特别有前景但尚未充分探索的应用领域。与静态计算任务不同,动态研究通常需要更多时间进行模型参数校准、工程判断以及在受限动作空间下的决策。本文介绍了PowerAgentBench-Dyn,一个旨在评估智能体AI系统在电力系统动态分析任务上的基准测试。该基准针对那些不能简化为单一优化或编码任务的问题,而是需要经验丰富的电力系统工程师日常执行的那种推理、工具使用和迭代实验。所提出的框架包括两个初始基准任务。第一个是动态模型质量审查基准,评估智能体根据系统运营商指定的模型质量合规标准验证和诊断动态模型的能力。第二个是动态安全风险筛选基准,评估智能体利用语义记忆和有限的仿真预算从未见故障数据集中识别、排序和分析最关键短路事故,并提出和评估可能的缓解措施的能力。对于每个任务,我们定义了仿真环境、观测和动作空间以及评估指标。该基准在基于度量的意义上是可复现的:发布案例和仿真器设置定义了确定性评估器,而随机智能体行为通过重复运行使用成功率和其他指标进行评估。该基准支持未来用于电力系统运行和规划的智能体AI的开发。

英文摘要

Large Language Model (LLM)-based agents are increasingly being used to automate multi-step engineering work flows by interacting with software tools, interpreting intermediate results, and autonomously planning subsequent actions. Power system dynamic studies represent a particularly promising yet largely unexplored application domain for these agents. Unlike static computational tasks, dynamic studies often require more time on model parameter calibration, engineering judgment, and decision making under constrained action spaces. This paper introduces PowerAgentBench-Dyn, a benchmark designed to evaluate Agentic AI systems on power system dynamic-analysis tasks. The benchmark targets problems that cannot be reduced to a single optimization or coding task, but instead require a type of reasoning, tool usage, and iterative experimentation routinely performed by experienced power system engineers. The proposed framework includes two initial benchmark tasks. The first, the Dynamic Model Quality Review Benchmark, evaluates agents' ability to validate and diagnose dynamic models based on model-quality compliance criteria specified by system operators. The second, the Dynamic Security Risk Screening Benchmark, assesses agents' capability to leverage semantic memory and a limited simulation budget to identify, rank, and analyze the most critical short-circuit contingencies from an unseen fault dataset, as well as propose and evaluate possible mitigation measures. For each task, we define the simulation environment, observation and action spaces, and evaluation metrics. The benchmark is reproducible in a metric-based sense: released cases and simulator settings define a deterministic evaluator, while stochastic agent behavior is assessed over repeated runs using success rates and other metrics. The benchmark supports the development of future Agentic AI for power system operation and planning.

2606.20361 2026-06-19 eess.SY cs.SY 交叉投稿

Sparse add-on controller design: A Youla approach to system-level performance

稀疏附加控制器设计:一种面向系统级性能的Youla方法

M. van der Hulst, N. Dirkx, R. A. González, K. Tiels, J. van de Wijdeven, T. oomen

AI总结 提出一种基于Youla参数化的稀疏附加控制器设计框架,通过凸优化求解稀疏H2综合问题,实现系统级性能与互联复杂度的最优权衡。

详情
AI中文摘要

高科技系统的性能通常由机器中运行的多个闭环控制子系统共享的少数性能目标决定,例如同步、协调和对齐,这需要明确处理这些目标以实现最优性能的控制方法。本文旨在引入一种框架,通过设计系统级控制器作为现有子系统控制结构的附加组件来提高系统性能。所开发的方法使用Youla框架参数化所有稳定的系统级附加控制器,从而能够凸形式化稀疏$\mathcal{H}_2$综合问题。结果是一个稀疏附加控制器,实现了组合性能与互联复杂度之间的最优权衡,如数值模拟所示。

英文摘要

The performance of high-tech systems is often dictated by a few performance objectives shared among the many closed-loop controlled subsystems operating in the machine, such as synchronization, coordination, and alignment, which necessitates control methods that explicitly address them to achieve optimal performance. The aim of this paper is to introduce a framework that improves system performance through system-level controllers designed to be implemented as add-ons to the existing subsystem control structure. The developed method parametrizes all stabilizing system-level add-on controllers using the Youla framework, enabling a convex formulation of the sparse $\mathcal{H}_2$ synthesis problem. The result is a sparse add-on controller that achieves the optimal trade-off between combined performance and interconnection complexity, as demonstrated through numerical simulations.

2606.20301 2026-06-19 eess.SY cs.SY 交叉投稿

Data-Driven Control from Poisoned Data: Fundamental Limitations and Secure DeePC

来自中毒数据的数据驱动控制:基本局限性与安全DeePC

Takumi Shinohara, Henrik Sandberg, Karl Henrik Johansson

AI总结 针对任意数据中毒攻击,提出安全DeePC算法,通过截断输出和在线重建实现有限时间内的MPC等效性能。

详情
AI中文摘要

我们研究了存在任意数据中毒攻击时的数据驱动控制问题。假设一部分离线输出数据存储在未受保护的位置,可能被对手篡改。我们首先建立了由这种中毒数据引起的数据驱动控制的基本局限性:仅从数据集无法检测/识别中毒攻击;未受保护的数据对于具有最坏情况保证的控制器设计是非信息性的;未受保护输出的硬约束是不可认证的。受这些局限性和数据使能预测控制(DeePC)技术的启发,我们提出了安全DeePC,一种能够抵御中毒攻击的数据驱动控制算法。它首先仅使用受保护数据集运行输出截断的DeePC,直到在线输入变得持续激励。然后利用在线测量重建部分离线数据集,最后返回到全输出DeePC。安全DeePC在特定条件下几乎必然在有限时间内实现MPC等效性能。仿真结果证明了所提框架对抗中毒攻击的有效性。

英文摘要

We study a data-driven control problem in the presence of arbitrary data poisoning attacks. We assume that a subset of offline output data is stored in unprotected locations and may be poisoned by an adversary. We first establish fundamental limitations for data-driven control arising from such poisoned data: poisoning attacks are not detected/identified from the dataset alone; unprotected data are non-informative for controller design with worst-case guarantees; and hard constraints on unprotected outputs are not certifiable. Motivated by these limitations and the data-enabled predictive control (DeePC) technique, we propose Secure DeePC, a data-driven control algorithm that is resilient against poisoning attacks. It first runs output-truncated DeePC using only the protected dataset until the online input becomes persistently exciting. It then uses online measurements to reconstruct the partial offline dataset, and finally returns to full-output DeePC. Secure DeePC achieves MPC-equivalent performance in finite time almost surely under certain conditions. Simulation results illustrate the efficacy of the proposed framework against poisoning attacks.

2606.20163 2026-06-19 eess.SY cs.SY 交叉投稿

Techno-Economic Analysis of Shared Mobile Storage for Demand Charge Reduction

用于需求费用削减的共享移动储能技术经济分析

B Hari Kiran Reddy, Ge Chen, Junjie Qin

AI总结 本文提出一个高保真车队管理框架,通过混合整数线性规划模型和启发式算法,评估共享电动汽车在考虑实际物流和运营约束下削减需求费用的技术经济可行性。

Comments 22 pages, 26 figures, journal

详情
AI中文摘要

本文研究了在实际物流和运营约束下,共享电动汽车车队用于削减需求费用的技术经济可行性。与忽略运输开销的理想化模型不同,我们提出了一个高保真车队管理框架,明确考虑了能源消耗的时空耦合、电动汽车驾驶员的人工成本和电池退化。我们将调度问题表述为混合整数线性规划,共同最小化需求费用和总拥有成本。为了解决路径依赖约束带来的计算复杂性,我们开发了一种基于边际价值的启发式算法,该算法以高计算效率实现了接近最优的性能。使用旧金山的真实数据,我们的分析表明,适度数量的电动汽车可以实现显著的需求费用节省,足以收回拥有和运营成本。我们的结果还显示了电价结构、车队规模和成本组成部分如何影响整体盈利能力。

英文摘要

This paper investigates the techno-economic viability of shared electric vehicle (EV) fleets for demand charge reduction under practical logistical and operational constraints. Unlike idealized models that overlook transit overheads, we propose a high-fidelity fleet management framework that explicitly accounts for the spatio-temporal coupling of energy consumption, labor costs for EV drivers, and battery degradation. We formulate the dispatch problem as a mixed-integer linear program (MILP) that jointly minimizes demand charges and total cost of ownership. To address the computational complexity arising from path-dependent constraints, we develop a marginal-value-based heuristic algorithm that achieves near-optimal performance with high computational efficiency. Using real-world data from San Francisco, our analysis reveals that a modest number of EVs can achieve significant demand charge savings, sufficient to recover the ownership and operational expenses. Our results also show how tariff structures, fleet size, and cost components influence overall profitability.

2606.20127 2026-06-19 eess.SY cs.SY 交叉投稿

Contraction-based Neural Control for Cooperative Aerial Payload Transportation with Variable-length Cables

基于收缩的神经控制用于可变长度缆绳的协同空中载荷运输

Yi Lok Lo, Longhao Qian, Hugh H. T. Liu

AI总结 提出一种多无人机吊挂载荷系统的神经非线性控制框架,通过解耦动力学结构,联合训练神经收缩度量控制器和反馈控制器实现载荷轨迹跟踪,并利用可变长度缆绳进行避障。

Comments Submitted for publication in AIAA Scitech 2027

详情
AI中文摘要

本文提出了一种新颖的神经非线性控制框架,用于具有可变长度缆绳和刚体载荷的多无人机吊挂载荷系统。运动方程被表述为解耦结构,其中载荷和缆绳长度动力学由独立控制通道控制,便于在降阶子系统上进行模块化控制器设计。联合训练神经控制收缩度量(CCM)控制器和神经反馈控制器,以强制执行载荷子系统的收缩条件。另外,推导了一种缆绳长度控制律,利用可变长度自由度进行避障。数值模拟展示了在提出的控制框架下,刚体载荷的轨迹跟踪和整个系统的门穿越能力。

英文摘要

This paper presents a novel neural nonlinear control framework for a multi-drone slung payload system with variable-length cables and a rigid-body payload. The equations of motion are formulated into a decoupled structure, where the payload and cable length dynamics are governed by independent control channels, facilitating modularized controller design on reduced-order subsystems. A neural control contraction metric (CCM) controller and a neural feedback controller are jointly trained to enforce contraction conditions for the payload subsystem. Separately, a cable length control law is derived that exploits the variable-length degree of freedom for obstacle avoidance. Numerical simulations demonstrate trajectory tracking of a rigid-body payload and gate traversal capabilities of the overall system under the proposed control framework.

2606.19711 2026-06-19 cs.RO cs.LG cs.SY eess.SY 交叉投稿

A Differentiable Composite Approximation Framework for Autonomous Underwater Vehicle Maneuvering Modeling from Sea-Trial Data

一种可微复合近似框架:基于海试数据的自主水下航行器机动建模

Aobo Wang, Aifei Xia, Zihao Wang, Lizhu Hao

发表机构 * College of Shipbuilding Engineering, Harbin Engineering University(哈尔滨工程大学船舶工程学院) China Academy of Aerospace Aerodynamics(中国航天空气动力技术研究院) Institute of Artificial Intelligence, Shanghai University(上海大学人工智能研究院) China Ship Scientific Research Center(中国船舶科学研究中心)

AI总结 提出可微复合近似框架,结合多项式基与数据自适应基联合校准,并引入转向运动电流估计补偿,提升AUV机动预测精度。

详情
AI中文摘要

基于机载测量的场建模可以生成反映真实运行特性的自主水下航行器(AUV)机动模型。从近似角度看,传统机动模型使用预定义的约束多项式基,而数据驱动模型使用数据自适应基。受此基函数视角启发,本文提出一种可微复合近似公式,其中多项式基分量和数据自适应基分量被视为单个预测器的可微部分并联合校准。开发了一种基于梯度的协同校准方法用于全尺寸AUV机动预测,其中灵敏度感知机制调节有界多项式更新,而神经残差在共享预测目标下捕获剩余非线性差异。为了考虑现场数据中的海流效应,引入了一种基于转向运动的电流估计和补偿程序,以构建电流补偿的学习目标用于训练和滚动预测。该框架使用从7米长AUV在多种机动条件下收集的海试数据进行评估。结果表明,与纯多项式、纯神经网络和冻结先验混合基线相比,所提方法改进了递归轨迹和速度预测,证明了其在基于现场数据的AUV机动建模中的适用性。

英文摘要

Field-based modeling from onboard measurements can produce autonomous underwater vehicle (AUV) maneuvering models that reflect real operating characteristics. From an approximation perspective, conventional maneuvering models use predefined constraint polynomial bases, whereas data-driven models use data-adaptive bases. Motivated by this basis-function view, this paper presents a differentiable composite-approximation formulation, in which the polynomial-basis component and the data-adaptive basis component are treated as differentiable parts of a single predictor and calibrated jointly. A gradient-based co-calibration method is developed for full-scale AUV maneuvering prediction, where a sensitivity-aware mechanism regulates bounded polynomial updates while the neural residual captures remaining nonlinear discrepancies under a shared prediction objective. To account for ocean-current effects in field data, a turning-motion-based current estimation and compensation procedure is incorporated to construct current-compensated learning targets for training and rollout. The framework is evaluated using sea-trial data collected from a 7-meter AUV under multiple maneuvering conditions. Results show that the proposed method improves recursive trajectory and velocity prediction compared with polynomial-only, neural-only, and frozen-prior hybrid baselines, demonstrating its applicability to field-data-based AUV maneuvering modeling.

2606.19699 2026-06-19 cs.RO cs.LG cs.SY eess.SY 交叉投稿

Comparative Study on Agility, Efficiency, and Impact Absorption of Bipedal Robots with Active Toes

具有主动脚趾的双足机器人敏捷性、效率和冲击吸收的比较研究

Joong-Gil Kim, Wontae Ye, Geunwoo Cho, Seong-Ho Yun, Se-Hyoung Cho, Yong-Jae Kim

发表机构 * School of Electrical, Electronics and Communication Engineering, Korea University of Technology and Education(韩国技术教育大学电气、电子与通信工程学院) Artificial Intelligence and Robotics Institute, Korea Institute of Science and Technology(韩国科学技术研究院人工智能与机器人研究所) Robot Innovation Hub, WIRobotics Inc.(WIRobotics公司机器人创新中心)

AI总结 提出一种14自由度双足机器人,模拟人类脚趾的轻量、高扭矩、坚固特性,通过高保真仿真训练环境,对比有无主动脚趾的配置,发现脚趾机器人以1.33米/秒行走时,CoT降低17.5%,脚跟冲击力降低5.0%,路径偏差平均和最大分别降低25.0%和34.0%。

Comments 6 pages, 7 figures

详情
AI中文摘要

人类腿部表现出高效率、敏捷性和冲击吸收能力,其中脚趾在这些能力中起着关键作用。尽管已经有许多尝试在机器人中实现类似人类的脚趾,但它们尚未完全复制人类特征,也没有严格验证其益处。我们提出了一种14自由度的双足机器人,模拟人类脚趾的轻量、高扭矩、坚固特性。为了定量分析主动脚趾在敏捷性、效率和冲击吸收方面的有效性,我们开发了一个高保真仿真训练环境,该环境反映了具有耦合传动和精确功耗的实际执行器。为了确保有和没有主动脚趾的配置之间的公平比较,我们设计了一个最小化强化学习奖励函数,并对两者应用了相同的训练程序。仿真结果表明,在1.33米/秒行走时,与无脚趾配置相比,配备脚趾的机器人将CoT降低了17.5%,脚跟冲击力降低了5.0%。在敏捷性测试中,平均和最大路径偏差分别降低了25.0%和34.0%。

英文摘要

Human legs exhibit high efficiency, agility, and impact absorption, with toes playing a crucial role in these capabilities. While many attempts have been made to implement human-like toes in robots, they have not fully replicated human characteristics nor rigorously validated their benefits. We propose a 14-DOF biped robot emulating human toes' lightweight, high-torque, robust nature. To quantitatively analyze the effectiveness of the active toes in terms of agility, efficiency, and impact absorption, we developed a high-fidelity simulation training environment that reflects actual actuators with coupled transmissions and accurate power consumption. To ensure a fair comparison between configurations with and without active toes, we designed a minimal RL reward function and applied an identical training procedure to both. The simulation results indicate that, at 1.33 m/s walking, the toe-equipped robot reduced CoT by 17.5% and heel-strike GRF by 5.0% compared with the toe-ablation configuration. On the agility test, average and maximum path deviation decreased by 25.0% and 34.0%, respectively.

2606.19683 2026-06-19 cs.AI cs.MA cs.SY eess.SY 交叉投稿

Exit-and-Join Dynamics for Decentralized Coalition Formation

去中心化联盟形成的退出与加入动力学

Quanyan Zhu

发表机构 * New York University Tandon School of Engineering(纽约大学坦登工程学院) Department of Electrical and Computer Engineering(电气与计算机工程系)

AI总结 研究基于单边退出与加入决策的去中心化联盟形成动力学,利用Aumann-Dreze值计算个体收益,建立合作支付分配与非合作最优反应的关联,并分析均衡特征及成本对局部稳定性的影响。

详情
AI中文摘要

本文研究联盟形成作为一种由单边退出与加入决策驱动的去中心化动力学过程。智能体使用Aumann-Dreze值评估局部移动,因此收益在智能体当前联盟内计算,而非通过全局协商的联盟结构。由此产生的模型将合作支付分配与非合作最优反应行为联系起来:一个终端划分恰好是一个没有可接受的、个体有利可图的退出与加入偏离的联盟结构。我们建立了均衡特征,确定了动力学允许标量Lyapunov或精确势函数表示的条件,并分析了切换和接受成本如何塑造局部稳定性。数值实验测试了有限时间稳定、成本敏感性以及一个特殊的凸博弈基准。

英文摘要

This paper studies coalition formation as a decentralized dynamical process driven by unilateral exit-and-join decisions. Agents evaluate local moves using the Aumann-Dreze value, so payoffs are computed within the agent's current coalition rather than through a globally negotiated coalition structure. The resulting model links cooperative payoff allocation with noncooperative best-response behavior: a terminal partition is precisely a coalition structure with no admissible, individually profitable exit-and-join deviation. We establish equilibrium characterizations, identify conditions under which the dynamics admit scalar Lyapunov or exact-potential representations, and analyze how switching and acceptance costs shape local stability. Numerical experiments test finite-time stabilization, cost sensitivity, and a special convex-game benchmark.

2606.19630 2026-06-19 cs.AI cs.DL cs.SY eess.SY 交叉投稿

AI4SE and SE4AI Exploration: A Decade Looking Back and Forward

AI4SE 与 SE4AI 探索:回顾与展望的十年

H. Sinan Bank, Daniel R. Herber, Thomas Bradley

发表机构 * Colorado State University(科罗拉多州立大学)

AI总结 本文回顾了人工智能与系统工程在三个阶段的进展,通过人机一致性文献综述识别出五个关键研究空白,并提供了AI采纳、保障和劳动力转型的指导。

Comments 10 pages, 5 figure

详情
AI中文摘要

2020年3月INCOSE INSIGHT关于人工智能与系统工程的特刊成为该刊历史上下载量最高的一期,并催生了一个研究社区,其年度研讨会现吸引超过250名注册者。在本文中,我们基于作者对该领域核心论文的解读,追溯了人工智能与系统工程在三个阶段(标记为基础、应用和LLM转折点)的进展,并描述了我们对社区已达成共识以及仍存在关键空白的看法。此外,我们进行了一项人机一致性文献综述,利用人类专家和六个人工智能模型评估了1,712篇INCOSE INSIGHT文章和889篇SERC出版物的相关性。结果识别出五个关键研究空白,并为从业者在系统工程中应对AI采纳、保障和劳动力转型提供了指导。我们共享一致性数据以及AI4SE/SE4AI Explorer网络应用程序,以便读者将自己的相关性判断与人类和AI评分者进行比较。

英文摘要

The March 2020 INCOSE INSIGHT special issue on AI and Systems Engineering (SE) became the most downloaded issue in the publication's history and launched a research community that now draws over 250 registrants to its annual workshop. In this article, we trace the progress in AI and SE across three phases (labeled here foundational, applied, and LLM inflection) based on the authors' reading of the field's core papers, and describe our opinions of where the community has converged and where critical gaps remain. Separately, a human-AI agreement literature review leveraging both human expertise and six AI models was performed to assess the relevance of 1,712 INCOSE INSIGHT articles and 889 SERC publications. The results identify five critical research gaps and offer guidance for practitioners navigating AI adoption, assurance, and workforce transformation in SE. We share the agreement data and the AI4SE/SE4AI Explorer web application so readers can compare their own relevance judgments with the human and AI raters.

2606.19599 2026-06-19 eess.SY cs.SY econ.EM 交叉投稿

Ramping Procurement and Bid-Cost Recovery in Real-Time Market

实时市场中的爬坡采购与投标成本回收

Cong Chen, Valentina Norambuena, Lang Tong

AI总结 研究净需求不确定下与经济调度协同优化的爬坡采购,分析单间隔与多间隔协同优化设计,提出评估发电机利润、消费者支付、投标成本回收和运营效率的分析框架,并比较三种定价机制。

Comments 4 figures

详情
AI中文摘要

我们研究了净需求不确定下与经济调度协同优化的爬坡采购。我们考察了电网运营商实施的两种灵活爬坡产品设计:单间隔和多间隔协同优化。两者都依赖于滚动窗口随机优化,包含绑定和咨询间隔决策。我们开发了分析框架来评估发电机利润、消费者支付、投标成本回收(BCR)和运营效率。特别是,净需求不确定性可能导致发电机补偿不足,需要歧视性BCR。虽然运营效率对能量和爬坡价格不变,但生产者利润和消费者支付关键取决于定价。我们研究了节点边际定价(LMP)和两种统一定价:最大调度成本定价(MDCP)和最大时间节点边际定价(MTLMP)。在市场外BCR下,LMP产生歧视性能量价格,而MDCP消除BCR,MTLMP在大多数情况下也是如此。这一性质使我们能够在MDCP下为价格接受型发电机建立真实投标激励。我们的分析突出了单间隔和多间隔协同优化与定价设计之间的权衡:在高预测不确定性和中等爬坡需求下,单间隔能量-爬坡协同优化具有优势,而当净需求预测相对准确且爬坡需求具有挑战性时,多间隔协同优化更优。基于CAISO和ERCOT数据的实证结果表明,与LMP相比,MDCP和MTLMP增加了生产者利润且BCR可忽略,但以消费者支付增加为代价。

英文摘要

We study ramping procurement co-optimized with economic dispatch under net-demand uncertainty. We examine two flexible ramp product designs implemented by grid operators: single-interval and multi-interval co-optimization. Both rely on rolling-window stochastic optimization with binding and advisory interval decisions. We develop analytical frameworks to evaluate generator profits, consumer payments, bid cost recovery (BCR), and operational efficiency. In particular, net-demand uncertainty may lead to generator under-compensation, requiring discriminatory BCR. While operational efficiency is invariant to energy and ramp prices, producer profits and consumer payments depend critically on pricing. We examine locational marginal pricing (LMP) and two uniform pricing: maximum dispatch cost pricing (MDCP) and maximum temporal locational marginal pricing (MTLMP). With out-of-market BCR, LMP yields discriminatory energy prices, whereas MDCP eliminates BCR and MTLMP does so in most cases. This property enables us to establish truthful bidding incentives for price-taking generators under MDCP. Our analysis highlights trade-offs between single- and multi-interval co-optimization and pricing designs: single-interval energy-ramp co-optimization is advantageous under high forecast uncertainty and moderate ramping requirements, whereas multi-interval co-optimization is superior when net-demand forecasts are relatively accurate and ramp needs are challenging. Empirical results on CAISO and ERCOT data show that MDCP and MTLMP increase producer profits with negligible BCR, albeit at the expense of higher consumer payments relative to LMP.

2606.19590 2026-06-19 cs.RO cs.SY eess.SY 交叉投稿

Safe, Real-Time Active Model Discrimination and Fault Diagnosis for Nonlinear Systems via Differentiable Reachability

通过可微可达性实现非线性系统的安全、实时主动模型辨识与故障诊断

Xinpei Ni, Melkior Ornik, Glen Chou, Samuel Coogan

发表机构 * Institute of Robotics and Intelligent Machines (IRIM), Georgia Institute of Technology(佐治亚理工学院机器人与智能机器研究所) Department of Aerospace Engineering, University of Illinois Urbana-Champaign(伊利诺伊大学厄巴纳-香槟分校航空航天工程系)

AI总结 针对不确定非线性系统,提出一种基于可微可达性近似的实时主动故障诊断算法,通过优化控制输入使输出集分离,在保证安全的同时实现快速模型辨识。

详情
AI中文摘要

我们提出了一种安全、实时的算法,用于对具有过程和测量扰动的连续时间不确定非线性系统进行主动故障诊断和模型辨识。给定一组表示正常和故障模式(包括执行器和传感器故障)的候选模型,我们制定了一个输出反馈、时变策略优化问题,该问题(i)在有限时域内鲁棒地强制执行状态输入安全约束,并且(ii)驱动系统产生与至多一个模型一致的采样测量,从而实现确定性诊断。为了实时解决这个问题,我们使用可达状态和输出集的区间过近似开发了一个可处理的近似,并通过一个可微目标函数对诊断能力进行编码,该函数惩罚可能模型的可达输出集之间的重叠。由此产生的优化使用基于梯度的JAX和可微可达性原语在线高效求解。我们在几个高维非线性机器人系统(包括模拟四旋翼和战斗机模型、硬件差速驱动机器人和四足导航)上评估了我们的方法,用于传感器和执行器故障诊断(最多11种故障模式)。在这些案例研究中,我们的方法在50毫秒内实现了可靠的模型辨识,在辨识成功率和速度上优于基线方法,同时提供了形式化的安全保证。

英文摘要

We present a safe, real-time algorithm for active fault diagnosis and model discrimination for uncertain continuous-time nonlinear systems with process and measurement disturbances. Given a finite set of candidate models representing nominal and faulty modes, including actuator and sensor faults, we formulate an output-feedback, time-varying policy optimization problem that (i) robustly enforces state-input safety constraints over a finite horizon and (ii) drives the system to produce sampled measurements consistent with at most one model, enabling deterministic diagnosis. To solve this problem in real time, we develop a tractable approximation using interval over-approximations of reachable state and output sets, and encode diagnosability via a differentiable objective that penalizes overlap between the reachable output sets of possible models. The resulting optimization is solved efficiently online with gradient-based methods using JAX and differentiable reachability primitives. We evaluate our method on sensor and actuator fault diagnosis (up to 11 fault modes) in several high-dimensional nonlinear robotic systems, including a simulated quadrotor and fighter-jet model, a hardware differential-drive robot, and quadrupedal navigation. Across these case studies, our approach achieves reliable model discrimination in under 50 ms, outperforming baselines in discrimination success rate and speed while providing formal safety guarantees.

2606.19566 2026-06-19 eess.SY cs.AI cs.SY 交叉投稿

GDGU: A Gradient Difference-based Graph Unlearning Method for Cyberattack Localization in Electric Vehicle Charging Networks

GDGU:基于梯度差异的图遗忘方法用于电动汽车充电网络中的网络攻击定位

Nanhong Liu, Mucun Sun, Jie Zhang

AI总结 针对电动汽车充电站数据删除需求,提出基于梯度差异的图遗忘方法(GDGU),通过一阶参数校正实现高效遗忘,在保持定位性能的同时显著降低计算开销。

详情
AI中文摘要

电动汽车充电站(EVCS)可能使配电馈线暴露于网络攻击。尽管包括图神经网络在内的机器学习方法可以定位哪个母线被攻破,但在数据共享和模型训练方面仍存在重大挑战。例如,隐私法规允许EVCS所有者从已部署的模型中删除其训练数据,但每次请求都从头重新训练在计算上不可行。为了解决这个问题,我们研究了用于EVCS网络攻击定位的图遗忘(GU),将其形式化为图级多标签分类任务上的特征级遗忘问题。具体来说,我们提出了基于梯度差异的图遗忘(GDGU),通过一阶参数校正消除请求删除数据的影响。该校正基于原始训练数据与修改后数据集之间的梯度差异计算,其中仅遗忘请求的EVCS母线的充电功率特征。然后,应用批归一化重新校准和简短的恢复微调步骤以恢复定位效用。我们在IEEE 34母线、123母线和8500节点配电网络上,使用三种图神经网络骨干网络和累积遗忘场景,将GDGU与两种二阶GU基线进行比较。GDGU在定位效用上与最强基线相当,遗忘保真度接近完全重新训练,同时遗忘速度比从头重新训练快10到12倍,且内存使用远少于二阶GU基线。

英文摘要

Electric vehicle charging stations (EVCSs) can expose distribution feeders to cyberattacks. While machine learning methods, including graph neural networks, can localize which bus is compromised, significant challenges remain in data sharing and model training. For example, privacy regulations grant EVCS owners the right to delete their training data from a deployed model, yet retraining from scratch on every request is computationally prohibitive. To address this, we study graph unlearning (GU) for EVCS cyberattack localization, formulated as a feature-level unlearning problem on a graph-level multi-label classification task. Specifically, we propose gradient difference-based graph unlearning (GDGU), which removes the influence of the requested deletion data through a first-order parameter correction. The correction is computed from the gradient difference between the original training data and a modified dataset in which only the charging power features at the requested EVCS buses are unlearned. Then, a batch-normalization recalibration and a brief recovery fine-tuning step are applied to restore localization utility. We benchmark GDGU against two second-order GU baselines on the IEEE 34-bus, 123-bus, and 8500-node distribution networks across three graph neural network backbones and cumulative unlearning scenarios. GDGU matches the strongest baseline on localization utility and reaches forgetting fidelity close to full-retraining, while unlearning 10 to 12 times faster than retraining from scratch and using far less memory than the second-order GU baselines.

2606.19561 2026-06-19 cs.RO cs.SY eess.SY 交叉投稿

pdSTL: Probabilistic Differentiable Signal Temporal Logic for Stochastic Systems

pdSTL: 面向随机系统的概率可微信号时序逻辑

Bennett Dogbey, Hemanth Manjunatha

发表机构 * Oklahoma State University(俄克拉荷马州立大学)

AI总结 提出pdSTL框架,将概率语义与可微鲁棒性结合,通过区间值概率语义和LSTM式展开实现线性时间可微监控,在障碍物规避、换道和真实四旋翼飞行实验中优于确定性可微STL。

详情
AI中文摘要

在不确定环境中运行的自主机器人必须满足复杂的时序和安全规范,尽管存在随机动力学和感知噪声。虽然信号时序逻辑(STL)为基于梯度的优化提供了鲁棒性度量,但现有的扩展要么缺乏可微性,要么忽略了信念空间的不确定性。我们引入了pdSTL(概率可微信号时序逻辑),这是一个将概率语义与信念轨迹上的可微鲁棒性统一起来的框架。pdSTL采用区间值概率语义来计算保守的满足界限,并通过STL语法树组合传播。我们将时序鲁棒性评估制定为STL算子的循环、LSTM式展开,从而实现适用于端到端轨迹优化的线性时间、可微监控。我们在模拟障碍物规避、换道操作以及真实世界的Crazyflie四旋翼飞行实验中验证了pdSTL,这些实验在气动干扰下进行。结果表明,pdSTL在保持形式化概率保证的同时实现了高效优化,在现实世界的不确定性下,在维持安全裕度方面显著优于确定性可微STL。

英文摘要

Autonomous robots operating in uncertain environments must satisfy complex temporal and safety specifications despite stochastic dynamics and sensing noise. While Signal Temporal Logic (STL) offers robustness measures for gradient-based optimization, existing extensions either lack differentiability or ignore belief-space uncertainty. We introduce pdSTL (probabilistic differentiable Signal Temporal Logic), a framework that unifies probabilistic semantics with differentiable robustness over belief trajectories. pdSTL employs interval-valued probabilistic semantics to compute conservative satisfaction bounds, propagated compositionally through the STL syntax tree. We formulate the temporal robustness evaluation as a recurrent, LSTM-style unfolding of STL operators, enabling linear-time, differentiable monitoring suitable for end-to-end trajectory optimization. We validate pdSTL on simulated obstacle avoidance, lane-change maneuvers, and real-world Crazyflie quadcopter flight experiments under aerodynamic disturbances. Results demonstrate that pdSTL achieves efficient optimization with formal probabilistic guarantees, significantly outperforming deterministic differentiable STL in maintaining safety margins under real-world uncertainty.

2606.19520 2026-06-19 eess.SY cs.SY 交叉投稿

ev-flow: A Reproducible, NHTS-Grounded Generator of Synthetic Plug-in Electric Vehicle Charging Behavior for Eight U.S. Regions

ev-flow: 一个可复现的、基于NHTS的合成插电式电动汽车充电行为生成器,适用于美国八个地区

Bertrand Travacca

AI总结 提出ev-flow开源Python包,基于2017年全国家庭旅行调查数据,通过九阶段流水线生成美国八个地区的合成插电式电动汽车充电行为,填补了美国本土化、NHTS驱动的充电行为生成工具空白。

Comments 20 pages

详情
AI中文摘要

电动汽车并网研究需要大量具有行为真实性的个体充电档案,但实际充电遥测数据稀缺且受隐私限制,现有的开源生成器要么基于非美国出行调查校准,要么忽略了驱动总需求的区域、季节和设备异质性。我们提出\texttt{ev-flow}(导入名\texttt{pev\_synth}),一个MIT许可的开源Python包,基于2017年全国家庭旅行调查(NHTS)微观数据和区域销售组合模型,为美国八个区域生成合成插电式电动汽车充电行为。一个确定性的九阶段流水线(M1-M9)将每辆车从调查记录转换为带时间戳的充电档案:它将调查的人日拼接成捐赠者匹配的365天出行日历,并带有温度依赖的冬季能量提升;从已发表的SPEECh K=16高斯混合参数化中采样行为插电开始时间;评估三层伯努利插电模型;传播连续时间荷电状态账本,并带有明确的PHEV汽油续航扩展项;将插电状态栅格化为15分钟和小时网格。该包生成住宅和工作场所档案类型,并附有描述性EVSE品牌和连接器丰富信息;每个输出均以UTC存储、时区感知,并可从单个主种子实现比特可复现。验证运行器将生成的分布与已发表的边界进行比较,并根据文献出处对每个偏差进行分类:参考的\texttt{bay\_area}住宅档案在21项适用检查中汇总为11项通过、0项未解释失败、6项已解释失败和4项已解释跳过。\texttt{ev-flow}填补了美国本土、基于NHTS的空白,与欧洲生成器(如emobpy和VencoPy)以及充电模拟器(如datafev和ACN-Sim)互补。

英文摘要

Electric-vehicle grid-integration studies need large, behaviorally realistic populations of individual charging profiles, but real charging telemetry is scarce and privacy-restricted, and the existing open generators are calibrated to non-U.S. mobility surveys or flatten the regional, seasonal, and equipment heterogeneity that drives aggregate demand. We present \texttt{ev-flow} (import name \texttt{pev\_synth}), an open-source, MIT-licensed Python package that generates synthetic plug-in electric vehicle charging behavior for eight U.S. regions, grounded in 2017 National Household Travel Survey (NHTS) microdata and regional sales-mix models. A deterministic nine-stage pipeline (M1--M9) carries each vehicle from survey records to a time-stamped charging profile: it stitches survey person-days into donor-matched 365-day travel calendars with a temperature-dependent winter energy uplift, samples behavioral plug-in start times from the published SPEECh K=16 Gaussian-mixture parameterization, evaluates a three-layer Bernoulli plug-in model, propagates a continuous-time state-of-charge ledger with an explicit PHEV gasoline range-extension term, and rasterizes plug status to 15-minute and hourly grids. The package generates residential and workplace profile types with descriptive EVSE brand and connector enrichment; every output is UTC-stored, timezone-aware, and bit-reproducible from a single master seed. A validation runner compares the generated distributions against published bounds and classifies every divergence with literature provenance: the reference \texttt{bay\_area} residential profile rolls up to 11 PASS, 0 unexplained FAIL, 6 explained failures, and 4 explained skips across 21 applicable checks. \texttt{ev-flow} fills a U.S.-focused, NHTS-grounded niche complementary to European generators such as emobpy and VencoPy and to charging simulators such as datafev and ACN-Sim.

2606.19512 2026-06-19 cs.RO cs.SY eess.SY 交叉投稿

Proprioceptive Invariant State Estimation for Humanoid Robots on Non-Inertial Ground

非惯性地面上仿人机器人的本体感觉不变状态估计

Falak Mandali, Zijian He, Yan Gu

发表机构 * Purdue University(普渡大学)

AI总结 提出一种仅使用本体感觉的InEKF方法,利用足部IMU和运动学约束,实现非惯性地面上仿人机器人的实时状态估计,收敛速度提升96%,位置误差降低80%。

详情
AI中文摘要

本文提出了一种不变扩展卡尔曼滤波(InEKF)方法,用于在非惯性地面上运行的仿人机器人仅使用机载本体感觉进行实时状态估计。所提出的方法估计机器人相对于移动地面框架的基座位置和速度,无需直接测量地面运动或外部安装的传感器。通过足部安装的IMU利用支撑脚的运动学约束,该滤波器在保持完全本体感觉的同时,考虑了过程模型和测量模型中的地面引起的非线性。估计器被设计为具有右不变测量模型,从而在较大的初始不确定性下实现有利的误差动态。可观测性分析建立了机器人相对于非惯性地面框架的相对基座位置和速度可观测的条件。在摇摆和俯仰地面上站立和蹲下的Digit仿人机器人实验表明,与现有的InEKF相比,收敛速度提高了96%,位置估计误差减少了80%。在单轴旋转地面上的行走实验实现了平均估计误差小于9厘米,初始误差高达1米。

英文摘要

This paper presents an invariant extended Kalman filtering (InEKF) approach for real-time state estimation of humanoid robots operating on non-inertial ground using only onboard proprioceptive sensing. The proposed approach estimates the robot's base position and velocity relative to the moving ground frame without requiring direct measurements of ground motion or externally mounted sensors. By exploiting kinematic constraints at the stance foot through foot-mounted IMUs, the filter accounts for ground-induced nonlinearities in the process and measurement models while remaining fully proprioceptive. The estimator is formulated to admit a right-invariant measurement model, enabling favorable error dynamics under large initial uncertainties. Observability analysis establishes conditions under which the robot's relative base position and velocity are observable with respect to the non-inertial ground frame. Experiments with the Digit humanoid robot standing and squatting atop a swaying and pitching ground showcase a 96% speedup in convergence rate and an 80% reduction in position estimate errors over existing InEKFs. Walking experiments on a uni-axially rotating ground achieve an average estimation error of less than 9 cm for an initial error of up to 1 m.

2606.19504 2026-06-19 cs.RO cs.SY eess.SY 交叉投稿

Simulating Robotic Locomotion in Sand: Resistive Force Theory in an Open-Source Physics Engine

模拟沙地中的机器人运动:开源物理引擎中的阻力理论

Ryan Walker Brown, Laura K. Treers, Kathryn A. Daltorio

发表机构 * Case Western Reserve University(凯斯西储大学) University of Vermont(佛蒙特大学)

AI总结 将三维颗粒阻力理论(3D RFT)集成到MuJoCo物理引擎中,实现沙地行走模拟,验证了足端形状、速度和负载对运动的影响,并在六足机器人实验中预测行走距离和沉陷误差在20%以内。

Comments 12 pages, 7 figures

详情
AI中文摘要

阻力理论(RFT)的最新进展使得无需模拟单个颗粒相互作用即可近似沙地运动中的地面反作用力,从而降低了计算成本。然而,这些工具在常用于机器人仿真的3D物理引擎中尚不可用。我们探讨了将阻力近似与标准动力学计算相结合,是否能为自由行走的机器人提供稳定的支撑。为此,我们在物理仿真引擎MuJoCo中实现了三维颗粒阻力理论(3D RFT)。我们在多个场景中验证了仿真,证明了由于末端执行器形状、速度和负载引起的关键趋势得以保留。我们的实现预测了12自由度六足机器人在沙地中的行走距离和足部下沉,误差在实验值的20%以内。尽管RFT存在固有近似,但本文描述的开源工具有望帮助开发新的和改进的机器人设计,以穿越颗粒介质基底。

英文摘要

Recent advancements in Resistive Force Theory (RFT) enable approximation of ground reaction forces for locomotion in sand without the computational expense of modeling interactions with individual grains. However, these tools have been absent in 3D physics engines commonly used for robot simulation. We explore if resistive force approximations are sufficient, when integrated with standard dynamics calculations, to provide a stable substrate for a freely walking robot. To determine this, we implement 3D Granular Resistive Force Theory (3D RFT) in a physics simulation engine, MuJoCo. We verify simulations in multiple scenarios to demonstrate that key trends due to end effector shape, speed, and loading are preserved. Our implementation predicts walking distance and foot sinkage of a 12-Degree of Freedom hexapod robot within 20\% of experiments in sand. While RFT has inherent approximations, the open source tool described here has potential to help develop new and improved robot designs to traverse granular media substrates.

2606.20443 2026-06-19 eess.SY cs.LG cs.SY math.AT 交叉投稿

Topological Data Analysis for High-Dimensional Dynamic Process Monitoring

高维动态过程监测的拓扑数据分析

Angan Mukherjee, Tyler A. Soderstrom, Michael J. Kurtz, Victor M. Zavala

AI总结 提出结合拓扑数据分析和机器学习的方法,将多变量时间序列表示为流形,用拓扑描述符总结结构,并用神经常微分方程学习拓扑结构动态演化,实现高效事件检测。

详情
AI中文摘要

实时过程监测需要从高维时间序列数据中提取可操作信息的方法。在这项工作中,我们提出了一种新的过程监测方法,结合了拓扑数据分析(TDA)和机器学习工具。在所提出的方法中,我们将多变量时间序列数据表示为流形,并使用拓扑描述符来总结此类数据的结构;然后,我们使用神经常微分方程来学习系统拓扑结构的动态演化。使用来自工业过程的真实数据,我们表明这种基于轨迹的事件检测方法能有效检测多种类型的事件。我们将该方法与基于重构的方法(如主成分分析和自编码器)以及使用Koopman自编码器的基于轨迹的方法进行了对比。

英文摘要

Real-time process monitoring requires methods that extract actionable information from high-dimensional time-series data. In this work, we present a new approach for process monitoring that combines tools of topological data analysis (TDA) and machine learning. In the proposed approach, we represent multivariate time-series data as manifolds and use topological descriptors to summarize the structure of such data; we then use a neural ordinary differential equation to learn the dynamic evolution of the topological structure of the system. Using real data from an industrial process, we show that this trajectory-based event detection approach is effective at detecting diverse types of events. We contrast this approach against reconstruction-based approaches such as principal component analysis and autoencoders and against a trajectory-based approach that uses Koopman autoencoders.

2606.19695 2026-06-19 eess.SY cs.GT cs.SY math.OC 交叉投稿

A Unified Framework for Joint Sensor Placement and Scheduling for Intrusion Detection

入侵检测中联合传感器放置与调度的统一框架

Jayanth Bhargav, Mahsa Ghasemi, Shreyas Sundaram

AI总结 提出一个统一框架,将传感器放置与方向调度联合优化,通过博弈论设计效用函数并利用弱子模性实现近最优检测性能。

Comments 27 pages, 4 figures

详情
AI中文摘要

我们考虑一个入侵检测任务,其中防御者必须联合优化传感器放置位置和方向,以最小化入侵者穿越受保护环境时被漏检的概率。我们将此问题分解为一个元问题(称为SensorPlacement)和一个嵌入的子问题(称为OrientationScheduling)。对于固定的传感器放置,OrientationScheduling子问题被建模为防御者和入侵者之间的两人零和博弈,其中防御者寻求已部署传感器的方向策略以最小化漏检概率,而入侵者则寻求路径选择策略以最大化该概率。由于防御者的策略空间随传感器数量和方向组合增长,通过标准线性规划求解博弈变得不可行。为此,我们开发了一种迭代且高效的均衡求解算法,该算法利用博弈收益函数的结构,并建立了收敛到博弈纳什均衡(NE)的理论保证。该NE值随后被用作SensorPlacement元问题中的效用度量。我们证明了这个基于博弈值的效用函数在传感器放置集合上是弱子模的,并提出了一个具有近最优性保证的贪婪放置算法。据我们所知,这是第一个将博弈论效用设计与(弱)子模优化相结合的统一框架,实现了传感器放置和方向调度的原则性联合优化。通过大量仿真,我们证明所提出的方法实现了近最优的检测性能,同时与基线相比显著减少了计算时间。

英文摘要

We consider an intrusion detection task in which a defender must jointly optimize sensor placement locations and orientations to minimize the probability of missed detection of an intruder traversing a protected environment. We decompose this problem into a meta problem, termed SensorPlacement, and an embedded subproblem, termed OrientationScheduling. The OrientationScheduling subproblem, for a fixed sensor placement, is modeled as a 2-player zero-sum game between the defender and the intruder, where the defender seeks an orientation strategy for the deployed sensors to minimize the probability of missed detection, while the intruder seeks a path selection strategy to maximize it. Since the defender's strategy space grows combinatorially with the number of sensors and orientations, solving the game via standard linear programming becomes prohibitive. To this end, we develop an iterative and efficient equilibrium-seeking algorithm that exploits the structure of the game's payoff function and establishes theoretical guarantees for convergence to the Nash equilibrium (NE) of the game. This NE value is then used as a utility measure in the SensorPlacement meta problem. We show that this game-value-based utility function is weakly submodular over the set of sensor placements and propose a greedy placement algorithm with near-optimality guarantees. To our knowledge, this is the first unified framework to integrate game-theoretic utility design with (weak) submodular optimization, enabling principled joint optimization of sensor placement and orientation scheduling. Through extensive simulations, we demonstrate that the proposed approach achieves near-optimal detection performance while significantly reducing computation time compared to baselines.

2606.19871 2026-06-19 math.OC cs.MA cs.SY eess.SY 交叉投稿

Semiglobal Input-Delay Tolerance Algorithm for Distributed Nonconvex Optimization of Networked Nonlinear Systems

网络化非线性系统分布式非凸优化的半全局输入延迟容忍算法

Jing-Zhe Xu, Zhi-Wei Liu, Ming-Feng Ge, Yan-Wu Wang, Dinxin He

AI总结 针对存在输入延迟和一致性约束的网络化非线性系统,提出一种半全局输入延迟容忍算法,通过分层设计和输入-状态稳定性分析,在Polyak-Łojasiewicz条件下实现非凸优化的分布式求解。

Comments 36 pages, 5 figures

详情
AI中文摘要

本文研究了一类受输入延迟和一致性约束的网络化非线性系统中的分布式优化问题。引入了输入延迟容忍半全局收敛(IDTSC),即对于任意给定的紧致初始集,存在一个可容许的延迟界,在该界下,最优解在一致性约束内被计算,并且所有节点状态收敛到该解。基于分层设计和输入-状态稳定性分析,开发了一种新的半全局输入延迟容忍(SIDT)算法,该算法在实际中实现了输入延迟与非线性动力学耦合下的分布式优化IDTSC。此外,通过Polyak-Łojasiewicz条件放宽严格凸性要求,SIDT算法将其适用性扩展到非凸优化。最后,数值实验验证了该理论在具有输入延迟的网络化非线性系统上的有效性。

英文摘要

This paper studies a class of distributed optimization problems in networked nonlinear systems (NNSs) subject to input delays and consensus constraints. It introduces input-delay tolerant semiglobal convergence (IDTSC), meaning that for any prescribed compact initial set there exists an admissible delay bound under which the optimal solution is computed within consensus constraints and all node states converge to the solution. Building on a hierarchical design and input-to-state stability analysis, a new semiglobal input-delay tolerant (SIDT) algorithm is developed that practically achieves IDTSC for distributed optimization under the coupling between input delays and nonlinear dynamics. Further, by relaxing strict convexity requirements through the Polyak-Łojasiewicz condition, the SIDT algorithm broadens its applicability to nonconvex optimization. Finally, numerical experiments corroborate the theory on NNSs with input delays.

2606.19669 2026-06-19 math.OC cs.SY eess.SY 交叉投稿

Learning Neural Maximal Lyapunov Functions on $\mathsf{SO}(n)$

在 $\mathsf{SO}(n)$ 上学习神经最大李雅普诺夫函数

Adeel Akhtar, Matthieu Barreau

AI总结 提出基于对数映射的神经李雅普诺夫架构,通过Zubov型表征学习最大吸引域,并推导对数映射导数的显式公式,实现两阶段训练算法。

Comments Accepted to IEEE Control Systems Letters (L-CSS), 6 pages, 2 figures,

详情
AI中文摘要

为李群上的动力系统建立稳定性保证是一个基本挑战,因为为欧几里得空间开发的经典李雅普诺夫方法不能直接转移到弯曲几何上。在本文中,我们提出了一个框架,用于学习在特殊正交群 $\mathsf{SO}(n)$ 上演化的系统的最大李雅普诺夫函数。理论上,我们引入了一种基于对数映射的神经李雅普诺夫架构,具有可证明的逼近能力,并通过最大吸引域的Zubov型表征来形式化学习问题。一个关键的技术贡献是推导了对数映射导数的显式、数值可处理的公式,使得通过一个平衡计算效率和精度的两阶段算法进行训练成为可能。实证上,我们在一个低维非线性系统上验证了该方法。

英文摘要

Establishing stability guarantees for dynamical systems on Lie groups is a fundamental challenge, as classical Lyapunov methods developed for Euclidean spaces do not directly transfer to curved geometries. In this paper, we propose a framework for learning maximal Lyapunov functions for systems evolving on the special orthogonal group $\mathsf{SO}(n)$. Theoretically, we introduce a neural Lyapunov architecture based on the logarithmic map with proven approximation capabilities, and we formulate the learning problem via a Zubov-type characterization of the maximal region of attraction. A key technical contribution is the derivation of explicit, numerically tractable formulas for the derivative of the logarithmic map, enabling training through a two-phase algorithm that balances computational efficiency and accuracy. Empirically, we validate the approach on a low-dimensional nonlinear system.

2606.20060 2026-06-19 nlin.AO cs.SY eess.SY 交叉投稿

Nodal Braess's Paradox and Inertia Destabilization with Dynamic Node and Line Failures in Power Grids

电网中动态节点与线路故障的节点Braess悖论与惯性失稳

Nubius Brandner, Frank Hellmann, Hans Würfel, Jürgen Kurths, Anton Plietzsch, Anna Büttner

AI总结 提出集成节点/线路故障与同步振荡器动力学的新模型,发现高惯性和节点鲁棒性增强可能反常地扩大级联规模,揭示新型Braess悖论。

详情
AI中文摘要

大规模停电通常由级联故障引起。这些故障通过网络动力学与单个组件故障之间的复杂相互作用动态展开。相比之下,物理学中对级联故障的研究集中在准静态状态下分析线路过载。我们引入了一个新模型,将节点和线路故障的动力学与电网同步的典型振荡器模型相结合。这使我们能够首次研究耦合故障的集体级联行为。我们研究了节点鲁棒性(节点承受瞬态扰动的能力)和惯性(节点抵抗频率偏差的能力)对级联规模的影响。我们发现了驱动系统脆弱性的两种新机制:i) 虽然低惯性被广泛认为是电网的主要挑战,但我们发现高惯性会放大级联规模,除非伴随其他动力学特性的适当调整。ii) 此外,我们发现单个节点鲁棒性的增强可能反常地导致更大的级联。后一种现象构成了一种新型的Braess悖论。理解这种反直觉的集体效应对于实现有弹性的未来电网可能至关重要。

英文摘要

Large-scale power outages are typically caused by cascading failures. These unfold dynamically through complex interactions between network dynamics and individual component failures. In contrast, the study of cascading failures in physics has focused on analyzing line overloads in the quasi-static regime. We introduce a new model that integrates the dynamics of node and line failures with a paradigmatic oscillator model for power grid synchronization. This enables us to investigate the collective cascading behavior of coupled failures for the first time. We study the impact of nodal robustness, the ability of nodes to tolerate transient disturbances, and inertia, the ability of nodes to resist frequency deviations, on cascade sizes. We discover two novel mechanisms driving system fragility: i) While low inertia is widely considered a major challenge for power grids, we find that high inertia can amplify cascade sizes unless accompanied by appropriate adjustments of other dynamical properties. ii) Further, we find that an increase in the robustness of individual nodes can paradoxically lead to larger cascades. This latter phenomenon constitutes a novel type of Braess's paradox. Understanding such counterintuitive collective effects may become central for achieving resilient future power grids.

2606.18272 2026-06-19 cs.NI cs.AI cs.SY eess.SY 交叉投稿

Mitigating Anchoring Bias in LLM-Based Agents for Energy-Efficient 6G Autonomous Networks

缓解基于LLM的智能体在节能6G自主网络中的锚定偏差

Hatim Chergui, Claudia Carballo González, Farhad Rezazadeh, Merouane Debbah

发表机构 * i2CAT Foundation(i2CAT基金会) Universitat Politècnica de Catalunya(政治技术大学) Research Institute for Digital Future(数字未来研究院)

AI总结 提出一种基于截断三参数威布尔分布的随机锚定策略,缓解LLM智能体在6G网络切片中的锚定偏差,结合CVaR数字孪生保障SLA尾延迟,实现高达25%的节能。

Comments 7 pages, 4 figures

详情
AI中文摘要

本文提出了一种自主智能体资源协商框架,旨在使用大语言模型(LLM)智能体实现6G架构中的零接触网络切片。虽然LLM提供了强大的推理能力,但我们证明此类智能体固有地遭受锚定偏差,僵化地坚持初始启发式提议,导致严重的网络过度配置。为系统性地缓解这种认知偏差,我们提出了一种新颖的随机锚定策略,通过截断三参数威布尔分布建模。这种数学上有界的方法与采用条件风险价值(CVaR)的突发感知数字孪生(DT)无缝集成,以严格保证严格的服务水平协议(SLA)尾延迟。为验证我们的方法,我们引入并证明了双峰约束避免效用定理,表明虽然可行的协商遵循经典凸界,但高度约束的场景会发生由逆有理衰减包络控制的相变。使用本地托管的1B参数模型(\ exttt{otel-llm-1b-it})生成的实证结果证实了这些双区域界。我们的认知去偏成功瓦解了僵化的协商模式,迫使智能体主动探索以安全地利用SLA边界,并将系统节能提升高达25%。关键的是,轻量级1B LLM实现了亚秒级推理延迟(平均0.95秒),确保我们的多智能体框架与O-RAN非实时RAN智能控制器(non-RT RIC)的操作时间尺度兼容。

英文摘要

This paper presents an autonomous agentic resource negotiation framework designed to enable zero-touch network slicing in 6G architectures using Large Language Model (LLM) agents. While LLMs offer powerful reasoning capabilities, we demonstrate that such agents inherently suffer from anchoring bias, rigidly adhering to initial heuristic proposals and causing severe network over-provisioning. To systematically mitigate this cognitive bias, we propose a novel randomized anchoring strategy modeled via a Truncated 3-Parameter Weibull distribution. This mathematically bounded approach seamlessly integrates with burst-aware Digital Twins (DTs) employing Conditional Value at Risk (CVaR) to rigorously guarantee strict Service Level Agreement (SLA) tail-latencies. To validate our methodology, we introduce and prove the \emph{Bimodal Constraint-Avoidance Utility Theorem}, demonstrating that while feasible negotiations follow classical convex bounds, highly constrained scenarios undergo a phase transition governed by an inverse rational decay envelope. Empirical results generated using a locally hosted 1B-parameter model otel-llm-1b-it confirm these dual-regime bounds. Our cognitive de-biasing successfully dismantles rigid negotiation patterns, forcing agents into active exploration to safely ride SLA boundaries and boost system energy savings up to 25\%. Crucially, the lightweight 1B LLM achieves sub-second inference latencies (0.95s mean), ensuring our multi-agent framework is compatible with the operational timescales of the O-RAN non-Real-Time RAN Intelligent Controller (non-RT RIC)\footnote{Our source code is available for non-commercial use at https://github.com/HatimChergui.

2606.16057 2026-06-19 cs.RO cs.SY eess.SP eess.SY 交叉投稿

A Smart-Scheduled Hybrid (SSH) EKF-FGO State Estimation

一种智能调度混合(SSH)EKF-FGO状态估计方法

Eric Levy, Soosan Beheshti

发表机构 * GitHub arXiv

AI总结 本文通过智能调度混合EKF-FGO框架,实验性地将优化调度作为独立设计变量,研究其在平衡估计精度与计算成本中的作用,并在平面SLAM仿真中验证了调度对预优化漂移、瞬态误差和运行时间的显著影响。

Comments This work has been accepted for presentation/publication at the 2026 IEEE Canadian Conference on Electrical and Computer Engineering (CCECE). The final published version will appear in IEEE Xplore

详情
AI中文摘要

在机器人学和控制中,可靠的状态估计需要在估计精度和计算成本之间取得平衡。虽然基于滤波的方法(如扩展卡尔曼滤波器,EKF)提供高效的实时更新,而使用因子图的优化公式化方法改善全局一致性,但优化调度的作用通常被隐式处理,而非作为明确的设计变量进行研究。本文提出了一项实验研究,通过使用智能调度混合(SSH)EKF-FGO框架作为受控测试平台,明确隔离了优化调度。通过将基于EKF的状态传播与定期调用的批量优化相结合,并保持求解器结构和计算量固定,本文的主要贡献是实验性地将优化调度表征为一个独立的设计变量,它控制着中间估计精度与计算成本之间的权衡。在平面SLAM环境中的仿真结果表明,调度强烈影响预优化漂移、瞬态误差行为和运行时间。特别是,结果识别出一些操作区域,在这些区域中,全局优化的大部分好处可以以一小部分计算成本保留,从而突显了优化调度作为混合状态估计系统中一个未被充分探索但至关重要的考虑因素。

英文摘要

Reliable state estimation in robotics and control re quires balancing estimation accuracy against computational cost. While filtering-based methods such as the Extended Kalman Filter (EKF) provide efficient real-time updates, and optimisation based formulations using factor graphs improve global consistency, the role of optimisation scheduling is often treated implicitly rather than examined as an explicit design variable. This paper presents an experimental study that explicitly isolates optimisation scheduling using a Smart Scheduled Hybrid (SSH) EKF-FGO framework as a controlled testbed. By combining EKF-based state propagation with periodically invoked batch optimisation and holding solver structure and effort fixed, the main contribution of this work is the experimental characterisation of optimisation scheduling as an independent design variable governing the trade-off between intermediate estimation accuracy and computational cost. Simulation results in a planar SLAM environment show that scheduling strongly influences pre optimisation drift, transient error behaviour, and runtime. In particular, the results identify operating regimes in which most of the benefit of global optimisation can be retained at a fraction of the computational cost, highlighting optimisation scheduling as an under-explored yet critical consideration in hybrid state estimation systems.

2606.13794 2026-06-19 eess.SY cs.AI cs.RO cs.SY 交叉投稿

An integrated interpretable control effectiveness learning and nonlinear control allocation methodology for overactuated aircrafts

过驱动飞行器的可解释控制效能学习与非线性控制分配集成方法

Umut Demir, Aamir Ahmad, Walter Fichter

发表机构 * University of Stuttgart, Faculty of Aerospace Engineering and Geodesy, Institute of Flight Mechanics and Control (iFR)(斯图加特大学航空航天工程与大地测量学院飞行力学与控制研究所)

AI总结 提出一种基于稀疏非线性动力学辨识的学习控制效能映射方法,结合在线自适应机制,实现过驱动飞行器的高效非线性控制分配,兼具可解释性和低计算成本。

详情
AI中文摘要

非线性动力学以及多个执行器之间产生的强耦合削弱了传统线性控制分配技术背后的假设。当飞行进入非线性效应主导的模态时,线性分配器因模型失配增加而精度下降,进而降低飞行控制系统的性能和鲁棒性。高保真机载模型和黑箱数据驱动方法可以在整个飞行包线内恢复精度,但分别带来实时分配难以承受的计算负担,并牺牲了验证和故障诊断所需的可解释性。本文通过使用稀疏非线性动力学辨识从代表性飞行数据中学习显式的、受物理约束的控制效能映射解析模型,解决了这些限制。所得映射紧凑、可解释,并允许解析导数,从而能够在非线性求解器中高效计算,同时额外包含执行器动力学,无需机载模型。在线自适应机制监控预测残差,并在检测到显著对象变化时刷新模型,从而在执行器故障和变化工况下提供平滑重构。该方法在一款高保真非线性基准飞行器上经过一系列激进机动评估,达到了与完整非线性机载模型相当的精度,同时相对于现有基线显著降低了计算成本。

英文摘要

Nonlinear dynamics and the strong couplings that arise between multiple effectors undermine the assumptions behind conventional, linear control allocation techniques. When flight enters regimes where nonlinear effects dominate, linear allocators exhibit reduced accuracy due to increased model mismatch, which subsequently degrades performance and robustness of the flight control system. High fidelity onboard models and black box data driven approaches can recover accuracy across the flight envelope, but respectively impose computational burdens prohibitive for real time allocation and sacrifice the interpretability required for verification and fault diagnosis. This paper addresses these limitations by learning an explicit, physics constrained analytical model of the control effectiveness mapping from representative flight data using Sparse Identification of Nonlinear Dynamics. The resulting mapping is compact, interpretable, and admits analytical derivatives, enabling efficient computation within nonlinear solvers that additionally incorporate actuator dynamics, without requiring an onboard model. An online adaptation mechanism monitors prediction residuals and refreshes the model when significant plant changes are detected, providing graceful reconfiguration under actuator failures and varying operating conditions. The methodology is evaluated on a high fidelity nonlinear benchmark aircraft across a range of aggressive maneuvers, achieving accuracy comparable to a full nonlinear onboard model while substantially reducing computational cost relative to established baselines.

2605.28654 2026-06-19 cs.RO cs.SY eess.SY math.OC 版本更新

Integrated Exploration-Aware UAV Route Optimization and Path Planning

集成探索感知的无人机路径优化与轨迹规划

Jimin Choi, Grant Stagg, Cameron K. Peterson, Max Z. Li

发表机构 * Department of Aerospace Engineering, University of Michigan(密歇根大学航空航天工程系) Department of Electrical Engineering, Brigham Young University(BYU 电子工程系) Department of Aerospace Engineering, Department of Civil and Environmental Engineering, and Department of Industrial and Operations Engineering, University of Michigan(密歇根大学航空航天工程系、土木与环境工程系和工业与运营管理工程系)

AI总结 提出一种集成探索感知的无人机路径优化与轨迹规划框架,通过风险地图、不确定兴趣区域建模、B样条轨迹优化和在线重规划,在灾害监测中平衡报告点访问与新信息探索,实现平均KL散度降低15.9%。

详情
AI中文摘要

无人机越来越多地用于危险环境(如灾区、污染场地、野火区域和受损基础设施)中的探索驱动监测,此时有限的飞行续航必须在访问报告位置和收集新信息之间分配。在这些场景中,关于危险的先验信息通常不完整、空间不精确,并且在执行过程中可能发生变化。例如,初始报告可能识别出危险可能存在的区域,但实际危险可能被移动、部分观察到或完全未被报告。我们提出了一种集成的探索感知无人机路径优化与轨迹规划框架,用于在不确定和演变的先验信息下进行危险监测。环境被表示为空间风险地图,每个位置都有相关的危险状况信念。报告的危险被建模为不确定的兴趣区域(ROI),而不是确认的目标位置,要求无人机在检查报告区域的同时,利用有限的飞行续航探索信息丰富的区域。所提出的方法解决了报告ROI上的车辆路径问题,通过辅助伪节点增强路径以改善空间覆盖,将剩余飞行距离预算分配到路径段,并优化局部探索的动态可行B样条轨迹。在执行过程中,无人机测量更新基于网格的信念地图,当新信息和剩余预算证明调整合理时,对剩余轨迹进行重规划。在48种场景配置中,在线重规划相比离线优化规划器平均KL散度降低15.9%,相比直线遍历降低48.6%。

英文摘要

Uncrewed aerial vehicles (UAVs) are increasingly used for exploration-driven monitoring in hazardous environments such as disaster zones, contaminated sites, wildfire areas, and damaged infrastructure, where limited flight endurance must be allocated between visiting reported locations and gathering new information. In these settings, prior information regarding hazards is often incomplete, spatially imprecise, and subject to change during execution. For example, initial reports may identify a region where a hazard is likely to exist, but the actual hazard may be displaced, partially observed, or entirely unreported. We present an integrated exploration-aware UAV route optimization and path planning framework for hazard monitoring under uncertain and evolving prior information. The environment is represented as a spatial risk map, where each location has an associated belief of hazardous conditions. Reported hazards are modeled as uncertain regions of interest (ROIs) rather than confirmed target locations, requiring the UAV to inspect reported areas while also using its limited flight endurance to explore informative regions. The proposed method solves a vehicle routing problem over reported ROIs, augments the route with auxiliary pseudo-nodes to improve spatial coverage, allocates the remaining flight distance budget across route segments, and optimizes dynamically feasible B-spline trajectories for local exploration. During execution, UAV measurements update a grid-based belief map, and the remaining trajectory is replanned when new information and the remaining budget justify adaptation. Across 48 scenario configurations, online replanning improves average KL reduction by 15.9% over the offline optimized planner and 48.6% over straight-line traversal.

2603.19895 2026-06-19 eess.SY cs.SY math.CV math.DG math.DS 版本更新

Complex Frequency as Generalized Eigenvalue

复频率作为广义特征值

Nikolas Sofos, Federico Milano

AI总结 本文研究了复频率在描述线性时不变系统状态时作为特征值的广义形式,通过几何频率的定义和分解,展示了复频率在二维欧几里得平面中的应用,并证明了线性系统中复频率与特征值的等价性,同时指出非线性系统不具有这一等价性。

详情
AI中文摘要

本文证明了复频率的概念,最初用于描述复值信号的动力学,当应用于线性时不变(LTI)系统的状态时,构成了特征值的广义形式。从几何频率的定义出发,该定义为电路中的频率提供了几何解释,并自然分解为对称和反称成分,分别对应于幅度变化和旋转运动。我们展示复频率作为其在二维欧几里得平面上的限制。对于LTI系统,证明了通过非等距变换计算的系统状态的复频率与原系统的特征值一致。该等价性在任何阶数的可对角化系统中均成立。本文提供了一个统一的几何解释,将经典线性系统理论与曲线微分几何联系起来。同时指出,这种等价性一般不适用于非线性系统。另一方面,系统的几何频率总能被定义,从而为系统流提供几何解释。基于线性和非线性电路的多种示例展示了所提出的框架。

英文摘要

This paper shows that the concept of complex frequency, originally introduced to characterize the dynamics of signals with complex values, constitutes a generalization of eigenvalues when applied to the states of linear time-invariant (LTI) systems. Starting from the definition of geometric frequency, which provides a geometrical interpretation of frequency in electric circuits that admits a natural decomposition into symmetric and antisymmetric components associated with amplitude variation and rotational motion, respectively, we show that complex frequency arises as its restriction to the two-dimensional Euclidean plane. For LTI systems, it is shown that the complex frequencies computed from the system's states subject to a non-isometric transformation, coincide with the original system's eigenvalues. This equivalence is demonstrated for diagonalizable systems of any order. The paper provides a unified geometric interpretation of eigenvalues, bridging classical linear system theory with differential geometry of curves. The paper also highlights that this equivalence does not generally hold for nonlinear systems. On the other hand, the geometric frequency of the system can always be defined, providing a geometrical interpretation of the system flow. A variety of examples based on linear and nonlinear circuits illustrate the proposed framework.

2605.10078 2026-06-19 eess.SY cs.SY 版本更新

Scalable Design of Attack-Resilient Controllers for Positive Systems

可扩展的抗攻击控制器设计方法用于正系统

Alba Gurpegui, Sribalaji C. Anand, André M. H. Teixeira

AI总结 本文提出了一种针对正系统在面对网络攻击时的安全和鲁棒控制器设计框架,通过最小化最大损失分析攻击影响,并展示线性策略可优化攻击策略。

Comments 3 figures, submitted to L-CSS and CDC 2026

详情
AI中文摘要

本文提出了一种用于正系统在面对网络攻击时的 secure 和 resilient 控制器设计框架。我们考虑了一个网络控制系统,其中攻击者通过注入虚假数据到执行器通道来增加控制成本(性能度量),同时惩罚攻击努力并受状态依赖约束。使用最小最大公式,我们分析了此类攻击导致的最坏情况性能损失,其由差分方程和当时间范围无限时的代数方程给出。我们证明,在可能的非线性策略中,最优攻击策略是线性的。尽管没有显式的隐蔽约束,我们还展示当测量输出具有一个不稳定的零点,但该零点不是性能度量的不稳定零点时,攻击可以导致性能退化无界。所提出的框架还扩展到具有模型不确定性的系统。数值示例展示了结果,并展示了如何利用正系统和线性调节器理论的工具来以低计算成本缓解网络攻击。

英文摘要

This paper proposes a framework for secure and resilient controller design for positive systems against cyber-attacks. In particular, we consider a network-controlled system where an adversary injects false data into the actuator channels to increase the control cost (performance measure) while penalizing the attack effort and subject to state-dependent constraints. Using a minimax formulation, we analyze the worst-case performance loss caused by such adversaries, which is given by the solution of a difference equation, and an algebraic equation when the time horizon is infinite. We show that the optimal attack policy, among possible nonlinear policies, is linear. Despite the lack of explicit stealthiness constraints, we also show that when the measured output has an unstable zero which is not an unstable zero of the performance measure, the attacks can induce unbounded performance degradation. The proposed framework is also extended to systems with model uncertainty. Numerical examples illustrate the results and demonstrate how tools from positive systems and linear regulator theory can be used to mitigate cyber-attacks with low computational effort.

2605.08525 2026-06-19 cs.RO cs.SY eess.SY 版本更新

Model-Reference Adaptive Flight Control of a 95-mg Insect-Scale Flapping-Wing Aerial Robot

95毫克昆虫尺度扑翼飞行机器人的模型参考自适应飞行控制

Francisco M. F. R. Gonçalves, Conor K. Trygstad, Néstor O. Pérez-Arancibia

发表机构 * Washington State University(华盛顿州立大学)

AI总结 针对昆虫尺度扑翼飞行机器人参数不确定性和扰动问题,提出模型参考自适应控制(MRAC)架构,结合混合乘性扩展卡尔曼滤波,实现高精度位置控制,并通过95毫克机器人实验验证了悬停和轨迹跟踪性能。

Comments Under review, 8 pages, 7 figures

详情
AI中文摘要

由于系统尺度和复杂制造,描述扑翼昆虫尺度飞行机器人动力学的模型存在参数不确定性,例如惯性矩阵和飞行器的执行器映射。此外,由于其低惯性,这种机器人在飞行中受到随机和系统性扰动的严重影响,包括电源线张力、阵风和机翼不对中产生的非期望气动力。因此,在亚分克尺度上执行复杂机动的高性能要求机器人调整其行为以抵消扰动和模型不确定性。为此,我们引入了一种模型参考自适应控制(MRAC)架构,用于可实现为三维空间中刚体的扑翼机器昆虫的高性能位置控制。此外,我们展示了在飞行中实现混合乘性扩展卡尔曼滤波以估计当前和期望角速度,如何显著抑制姿态振动,特别是沿滚转和俯仰自由度,并提高飞行性能。为了展示所提方法的适用性、功能性和高性能,我们使用一个95毫克的昆虫尺度飞行机器人进行了实时悬停和轨迹跟踪六自由度飞行控制实验。

英文摘要

Due to the system's scale and complex fabrication, the model describing the dynamics of a flapping-wing insect-scale aerial robot is subject to parameter uncertainty; for example, in the inertia matrix and the actuator mapping of the flier. Furthermore, due to its low inertia, this type of robot is greatly affected by stochastic and systematic disturbances during flight, including power-wire tension, gusts, and undesired aerodynamic forces produced by wing misalignment. Therefore, the high-performance execution of complex maneuvers at the subdecigram scale requires the robot to adapt its behavior to counteract disturbances and model uncertainty. Toward this objective, we introduce a model-reference adaptive control (MRAC) architecture for high-performance position control of flapping-wing robotic insects that can be modeled as rigid bodies in the three-dimensional (3D) space. In addition, we demonstrate how the implementation of a hybrid multiplicative extended Kálmán filter for estimating current and desired angular velocities during flight significantly dampens attitude vibrations, especially along the roll and pitch degrees of freedom (DOFs), and also improves flight performance. To show the suitability, functionality, and high performance of the proposed approach, we conducted real-time hovering and trajectory-tracking 6-DOF flight control experiments with a 95-mg insect-scale aerial robot.

2605.00457 2026-06-19 cs.NI cs.LG cs.SY eess.SY 版本更新

Utility-Aware DRL-Based TXOP Adaptation for NR-U and Wi-Fi Coexistence Networks

基于策略驱动的DRL的NR-U与Wi-Fi共存中的TXOP自适应

Po-Heng Chou, Yi-Fang Yu, Shou-Yu Chen, Chiapin Wang

发表机构 * Research Center for Information Technology Innovation (CITI), Academia Sinica (AS)(资讯科技创新研究所以(CITI),中华学术界(AS)) Department of Electrical Engineering, National Taiwan Normal University (NTNU)(国立台湾师范大学电子工程系(NTNU))

AI总结 针对NR-U与Wi-Fi在非授权频谱共存中的频谱利用不平衡问题,提出一种基于策略驱动的深度强化学习框架,通过奖励设计实现公平性、吞吐量和效用的灵活权衡控制。

Comments 15 pages, 13 figures, 2 tables, submitted to IEEE Open Journal of the Communications Society

详情
AI中文摘要

NR-U与Wi-Fi在非授权频谱中的共存引入了一个具有挑战性的共存管理问题,其中异构信道接入机制导致频谱利用的显著不平衡和Wi-Fi性能下降。为了解决这一挑战,我们提出了一种基于策略驱动的深度强化学习(DRL)框架,用于自适应传输机会(TXOP)控制,其中共存过程被建模为马尔可夫决策过程(MDP),深度Q网络(DQN)通过在线交互学习控制策略。一个关键贡献是通过奖励设计引入策略层,从而实现对公平性、吞吐量和效用之间共存权衡的显式控制。开发了三种策略,即绝对公平、适度公平和基于效用的公平,以实现不同的工作点。仿真结果表明,所提出的框架在严格公平控制下实现了高于0.9的Jain公平指数。与绝对公平相比,适度公平将总吞吐量提高了68.22%,而基于效用的策略进一步将效用提高了177.6%。这些结果表明,策略驱动控制为管理异构共存网络中的权衡提供了一种灵活有效的解决方案。

英文摘要

The coexistence of NR-U and Wi-Fi in the unlicensed spectrum introduces a challenging resource management problem, where heterogeneous channel access mechanisms can lead to unbalanced spectrum utilization and severe Wi-Fi performance degradation. To address this issue, this paper proposes a utility-aware deep reinforcement learning (DRL) framework for adaptive transmission opportunity (TXOP) control in NR-U/Wi-Fi coexistence networks. The coexistence process is formulated as a Markov decision process (MDP), in which the NR-U TXOP duration is treated as a controllable variable for regulating post-access channel occupancy. A deep Q-network (DQN) is then employed to learn adaptive TXOP control policies through online interaction with the coexistence environment. A key feature of the proposed framework is the integration of a configurable reward and criterion design, which enables explicit control of the fairness-efficiency-utility tradeoff. Three operating policies are developed, namely absolute fairness, moderate fairness, and utility-oriented moderate fairness, to characterize different coexistence operating points. Simulation results show that the proposed framework achieves a Jain fairness index above 0.9 under strict fairness control. Compared with the absolute fairness policy, the moderate fairness policy improves aggregate throughput by 68.22%, while the utility-oriented policy achieves a 177.6% improvement under the adopted utility evaluation metric. These results demonstrate that the proposed utility-aware DRL framework provides an effective and flexible solution for adaptive TXOP control and tradeoff management in heterogeneous unlicensed coexistence networks.

2604.09795 2026-06-19 eess.SY cs.RO cs.SY 版本更新

On Feedback Speed Control for a Planar Tracking

平面跟踪中的反馈速度控制

Xincheng Li, Tengyue Liu, Udit Halder

发表机构 * Department of Mechanical and Aerospace Engineering, University of South Florida(南佛罗里达大学机械与航空航天工程系)

AI总结 针对领航-跟随平面跟踪问题,提出一种反馈速度控制律与恒定方位角转向策略,实现并排编队并证明渐近稳定性,扩展至N-agent链网络。

详情
AI中文摘要

本文研究了领航者和跟随者之间的平面跟踪问题。我们提出了一种新颖的反馈速度控制律,结合恒定方位角转向策略,以保持两个智能体之间的并排编队。我们证明了当领航者的转向已知时,所提出的控制使闭环系统渐近稳定。对于跟随者无法获取领航者转向的情况,我们表明系统相对于被视为输入的领航者转向仍然是输入-状态稳定的。此外,我们证明如果领航者的转向是周期性的,跟随者将渐近收敛到具有相同周期的周期轨道。我们通过数值模拟和移动机器人实验验证了这些结果。最后,我们通过将两智能体控制律扩展到N智能体链网络,展示了所提出方法的可扩展性,并说明了其在生物和工程群体中方向信息传播的意义。

英文摘要

This paper investigates a planar tracking problem between a leader and follower agent. We propose a novel feedback speed control law, paired with a constant bearing steering strategy, to maintain an abreast formation between the two agents. We prove that the proposed control yields asymptotic stability of the closed-loop system when the steering of the leader is known. For the case when the leader's steering is unavailable to the follower, we show that the system is still input-to-state stable with respect to the leader's steering viewed as an input. Furthermore, we demonstrate that if the leader's steering is periodic, the follower will asymptotically converge to a periodic orbit with the same period. We validate these results through numerical simulations and experimental implementations on mobile robots. Finally, we demonstrate the scalability of the proposed approach by extending the two-agent control law to an N-agent chain network, illustrating its implications for directional information propagation in biological and engineered flocks.