arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 21511
1903.05817 2026-06-04 eess.SY cs.LG cs.SY

A New Approach for Distributed Hypothesis Testing with Extensions to Byzantine-Resilience

分布式假设检验的一种新方法及其对拜占庭容错的扩展

Aritra Mitra, John A. Richards, Shreyas Sundaram

发表机构 * School of Electrical and Computer Engineering at Purdue University(普渡大学电气与计算机工程学院) Sandia National Laboratories(桑迪亚国家实验室)

AI总结 本文提出了一种新的分布式学习规则,用于在时间序列中联合观察资料下学习真实的状态,该方法不采用信念平均,且能扩展到处理网络中某些代理的恶意行为。

Comments To appear in the Proceedings of the American Control Conference, 2019

详情
AI中文摘要

我们研究了一个场景,其中一组代理各自接收部分信息的私人观察,试图协作学习能够解释他们随时间变化的联合观察资料的真实状态(在一组假设中)。为了解决这个问题,我们提出了一种分布式学习规则,与现有方法不同,它不采用任何形式的“信念平均”。具体来说,每个代理维护一个本地信念(对每个假设),该信念以贝叶斯方式更新,不受网络影响,同时维护一个实际信念,该信念在更新(除归一化外)时是其自身本地信念和邻居实际信念的最小值。在对代理信号结构和底层通信图的最小要求下,我们建立了所提出信念更新规则的一致性,即我们证明了代理的实际信念几乎必然渐近地集中在真实状态上。作为我们方法的一个关键好处,我们展示了我们的学习规则可以扩展到捕捉网络中某些代理的恶意行为,通过拜占庭对手模型。特别是,我们证明在适当的观察模型和网络拓扑条件下,每个非恶意代理几乎必然渐近地学习世界的真实状态。

英文摘要

We study a setting where a group of agents, each receiving partially informative private observations, seek to collaboratively learn the true state (among a set of hypotheses) that explains their joint observation profiles over time. To solve this problem, we propose a distributed learning rule that differs fundamentally from existing approaches, in the sense, that it does not employ any form of "belief-averaging". Specifically, every agent maintains a local belief (on each hypothesis) that is updated in a Bayesian manner without any network influence, and an actual belief that is updated (up to normalization) as the minimum of its own local belief and the actual beliefs of its neighbors. Under minimal requirements on the signal structures of the agents and the underlying communication graph, we establish consistency of the proposed belief update rule, i.e., we show that the actual beliefs of the agents asymptotically concentrate on the true state almost surely. As one of the key benefits of our approach, we show that our learning rule can be extended to scenarios that capture misbehavior on the part of certain agents in the network, modeled via the Byzantine adversary model. In particular, we prove that each non-adversarial agent can asymptotically learn the true state of the world almost surely, under appropriate conditions on the observation model and the network topology.

1903.04706 2026-06-04 eess.SY cs.RO cs.SY

Control Barrier Functions for Systems with High Relative Degree

用于高相对次数系统的控制屏障函数

Wei Xiao, Calin Belta

发表机构 * Boston University(波士顿大学)

AI总结 本文扩展了控制屏障函数(CBFs)到高阶控制屏障函数(HOCBFs),用于处理高相对次数约束。提出HOCBFs比最近提出的(指数)HOCBFs更通用。我们介绍了高阶屏障函数(HOBF),并展示了其满足Lyapunov-like条件会使得一系列集合的交集向前不变。然后引入HOCBF,并展示任何满足HOCBF约束的控制输入会使一系列集合的交集向前不变。我们提出了具有HOCBF和控制Lyapunov函数(CLF)约束的优化控制问题,并分析了在定义HOCBF时所用的$\mathcal{K}$类函数的选择对可行控制区域大小的影响。我们还提供了一种有前途的方法来解决HOCBF约束与控制限制之间的冲突,通过惩罚$\mathcal{K}$类函数。我们通过自适应巡航控制问题展示了所提出的方法。

Comments 9 pages, 7 figures, submitted to CDC19

详情
AI中文摘要

本文扩展了控制屏障函数(CBFs)到高阶控制屏障函数(HOCBFs),用于处理高相对次数约束。所提出的HOCBFs比最近提出的(指数)HOCBFs更通用。我们介绍了高阶屏障函数(HOBF),并展示了其满足Lyapunov-like条件会使得一系列集合的交集向前不变。然后引入HOCBF,并展示任何满足HOCBF约束的控制输入会使一系列集合的交集向前不变。我们提出了具有HOCBF和控制Lyapunov函数(CLF)约束的优化控制问题,并分析了在定义HOCBF时所用的$\mathcal{K}$类函数的选择对可行控制区域大小的影响。我们还提供了一种有前途的方法来解决HOCBF约束与控制限制之间的冲突,通过惩罚$\mathcal{K}$类函数。我们通过自适应巡航控制问题展示了所提出的方法。

英文摘要

This paper extends control barrier functions (CBFs) to high order control barrier functions (HOCBFs) that can be used for high relative degree constraints. The proposed HOCBFs are more general than recently proposed (exponential) HOCBFs. We introduce high order barrier functions (HOBF), and show that their satisfaction of Lyapunov-like conditions implies the forward invariance of the intersection of a series of sets. We then introduce HOCBF, and show that any control input that satisfies the HOCBF constraints renders the intersection of a series of sets forward invariant. We formulate optimal control problems with constraints given by HOCBF and control Lyapunov functions (CLF) and analyze the influence of the choice of the class $\mathcal{K}$ functions used in the definition of the HOCBF on the size of the feasible control region. We also provide a promising method to address the conflict between HOCBF constraints and control limitations by penalizing the class $\mathcal{K}$ functions. We illustrate the proposed method on an adaptive cruise control problem.

1903.05079 2026-06-04 math.NA cs.CV cs.NA

A total variation based regularizer promoting piecewise-Lipschitz reconstructions

一种基于总变分的正则化器,促进分段Lipschitz重建

Martin Burger, Yury Korolev, Carola-Bibiane Schönlieb, Christiane Stollenwerk

发表机构 * Department of Applied Mathematics and Theoretical Physics, University of Cambridge(剑桥大学应用数学与理论物理系)

AI总结 本文提出了一种新的总变分家族正则化器,促进具有给定Lipschitz常数(可空间变化)的重建。通过证明该功能的正则化性质,并研究其与总变分和infimal convolution类型正则化器TVLp的联系,特别是建立了拓扑等价性。数值实验表明,所提出的正则化器在性能上与总广义变分相似,但具有非常直观的自由参数解释,即只是梯度范数的局部估计。它还提供了一种自然的空间自适应正则化方法。

Comments 12 pages, 4 figures, accepted for publication in SSVM conference proceedings 2019

详情
AI中文摘要

我们介绍了一种新的总变分家族正则化器,该正则化器促进具有给定Lipschitz常数(也可以空间变化)的重建。我们证明了该功能的正则化性质,并研究了其与总变分和infimal convolution类型正则化器TVLp的联系,特别是建立了拓扑等价性。我们的数值实验表明,所提出的正则化器可以达到与总广义变分相似的性能,同时具有非常直观的自由参数解释,其自由参数仅仅是梯度范数的局部估计。它还提供了一种自然的空间自适应正则化方法。

英文摘要

We introduce a new regularizer in the total variation family that promotes reconstructions with a given Lipschitz constant (which can also vary spatially). We prove regularizing properties of this functional and investigate its connections to total variation and infimal convolution type regularizers TVLp and, in particular, establish topological equivalence. Our numerical experiments show that the proposed regularizer can achieve similar performance as total generalized variation while having the advantage of a very intuitive interpretation of its free parameter, which is just a local estimate of the norm of the gradient. It also provides a natural approach to spatially adaptive regularization.

1903.04958 2026-06-04 eess.SY cs.LG cs.SY

Real-Time Boiler Control Optimization with Machine Learning

燃煤电厂实时锅炉控制优化与机器学习

Yukun Ding, Yiyu Shi

发表机构 * University of Notre Dame(诺特达姆大学)

AI总结 本文提出利用机器学习优化燃煤电厂锅炉实时控制,通过优化不同区域的温度分布和炉膛氧含量,提高锅炉稳定性与能源效率。

Comments To appear in TC-CPS Newsletter

详情
AI中文摘要

在燃煤电厂中,提高锅炉运行效率对于可持续发展至关重要。本文将实时锅炉控制建模为一个优化问题,寻找不同区域的最佳温度分布和炉膛氧含量,以提高锅炉的稳定性和能源效率。我们采用一种高效的算法,结合适当的机器学习和优化技术。我们从行业合作伙伴处获得了一个超过两个月的实时锅炉数据集,并进行了广泛的实验,以证明所提算法的有效性和效率。

英文摘要

In coal-fired power plants, it is critical to improve the operational efficiency of boilers for sustainability. In this work, we formulate real-time boiler control as an optimization problem that looks for the best distribution of temperature in different zones and oxygen content from the flue to improve the boiler's stability and energy efficiency. We employ an efficient algorithm by integrating appropriate machine learning and optimization techniques. We obtain a large dataset collected from a real boiler for more than two months from our industry partner, and conduct extensive experiments to demonstrate the effectiveness and efficiency of the proposed algorithm.

1903.04681 2026-06-04 eess.SY cs.LG cs.SY

Estimating multi-class dynamic origin-destination demand through a forward-backward algorithm on computational graphs

通过计算图上的前向-后向算法估计多类动态出行生成需求

Wei Ma, Xidong Pi, Sean Qian

发表机构 * Department of Civil and Environmental Engineering(土木与环境工程系) Carnegie Mellon University(卡内基梅隆大学)

AI总结 本文提出了一种基于计算图的多类动态出行生成需求估计框架(MCDODE),通过前向-后向算法和树基累积曲线估计OD需求梯度,以解决大规模交通网络中多类时空车流估计的挑战。

Comments 31 pages, 21 figures, submitted to Transportation Research Part C: Emerging Technologies

详情
AI中文摘要

交通网络的复杂性前所未有,具有异质性车流。传统上,车辆类别通过车辆分类(如标准乘用车和卡车)来考虑。然而,车辆流的异质性源于许多其他方面,例如网约车与个人车辆、人工驾驶车辆与联网和自动驾驶车辆。在大型交通网络中,为每个类别提供一些车辆流观测,如何估计多类时空车辆流,即时间变化的起源-目的地(OD)需求和路径/链流,仍是一个重大挑战。本文提出了一种多类动态OD需求估计(MCDODE)的解决方案框架,该框架基于具有张量表示的时空流和MCDODE公式中所有中间特征的计算图。提出了一种前向-后向算法,以在计算图上高效求解MCDODE公式。此外,我们提出了一种新的树基累积曲线概念来估计OD需求的梯度。开发了Growing Tree算法来构建树基累积曲线。所提出的框架在小型网络以及实际的大规模网络上进行了检验。实验结果表明,所提出的框架具有竞争力、令人满意且计算上可行。

英文摘要

Transportation networks are unprecedentedly complex with heterogeneous vehicular flow. Conventionally, vehicle classes are considered by vehicle classifications (such as standard passenger cars and trucks). However, vehicle flow heterogeneity stems from many other aspects in general, e.g., ride-sourcing vehicles versus personal vehicles, human driven vehicles versus connected and automated vehicles. Provided with some observations of vehicular flow for each class in a large-scale transportation network, how to estimate the multi-class spatio-temporal vehicular flow, in terms of time-varying Origin-Destination (OD) demand and path/link flow, remains a big challenge. This paper presents a solution framework for multi-class dynamic OD demand estimation (MCDODE) in large-scale networks. The proposed framework is built on a computational graph with tensor representations of spatio-temporal flow and all intermediate features involved in the MCDODE formulation. A forward-backward algorithm is proposed to efficiently solve the MCDODE formulation on computational graphs. In addition, we propose a novel concept of tree-based cumulative curves to estimate the gradient of OD demand. A Growing Tree algorithm is developed to construct tree-based cumulative curves. The proposed framework is examined on a small network as well as a real-world large-scale network. The experiment results indicate that the proposed framework is compelling, satisfactory and computationally plausible.

1903.03763 2026-06-04 eess.SY cs.LG cs.SY math.OC stat.ML

A tractable ellipsoidal approximation for voltage regulation problems

电压调节问题中的可处理椭球近似

Pan Li, Baihong Jin, Ruoxuan Xiong, Dai Wang, Alberto Sangiovanni-Vincentelli, Baosen Zhang

发表机构 * Facebook Inc.(Facebook公司) University of Washington(华盛顿大学) Stanford University(斯坦福大学) Tesla Inc.(特斯拉公司)

AI总结 本文提出了一种基于机器学习的方法来解决电力系统运行中电压调节问题中的机会约束优化问题,通过用椭球近似不确定性可行区域,提出了类似支持向量机的学习模型和高效的采样算法。

Comments accepted by ACC2019 http://acc2019.a2c2.org/

详情
AI中文摘要

我们提出了一种机器学习方法来解决电压调节问题中的机会约束优化问题。我们的方法新颖之处在于用椭球近似不确定性可行区域。我们使用类似于支持向量机(SVM)的学习模型来提出这个问题,并提出了一种高效的采样算法来训练模型。我们使用标准的IEEE配电测试馈线在电压调节问题上展示了我们的方法。

英文摘要

We present a machine learning approach to the solution of chance constrained optimizations in the context of voltage regulation problems in power system operation. The novelty of our approach resides in approximating the feasible region of uncertainty with an ellipsoid. We formulate this problem using a learning model similar to Support Vector Machines (SVM) and propose a sampling algorithm that efficiently trains the model. We demonstrate our approach on a voltage regulation problem using standard IEEE distribution test feeders.

1903.03318 2026-06-04 eess.SY cs.RO cs.SY

Development of an Autonomous Sanding Robot with Structured-Light Technology

基于结构光技术的自主打磨机器人开发

Yingxin Huo, Diancheng Chen, Xiang Li, Peng Li, Yun-Hui Liu

发表机构 * CUHK T Stone Robotics Institute(CUHK T Stone机器人研究所) Innovation and Technology Commission of Hong Kong(香港创新及科技委员会) Harbin Institute of Technology(哈尔滨工业大学)

AI总结 本文提出了一种能够自主完成未知物体打磨工作的机器人,通过结构光相机扫描建模、优化运动规划和阻抗模型控制,实现了无需人工干预的自主打磨。

Comments 7 pages, 11 figures, IEEE/RSJ International Conference on Intelligent Robots and Systems 2019

详情
AI中文摘要

机器人和自动化的大需求在打磨工作中得到了体现,因为当前的手动操作是劳动密集型的,缺乏一致的质量,且存在安全和健康问题。虽然已经开发出几种自动化打磨工作的机器,但现有解决方案的自主能力相对较低,仍需要大量的人工协助或监督来校准目标物体或规划机器人运动和任务。本文提出了一种自主打磨机器人,能够自动对未知物体进行打磨,无需任何先前的校准或人工干预。该机器人工作流程如下:首先,使用结构光相机扫描并建模目标物体。其次,规划机器人运动以覆盖物体的所有表面,采用优化的过渡序列。第三,控制机器人在期望的阻抗模型下对物体进行打磨。制造了一个打磨机器人的原型,并在打磨一批木箱的任务中验证了其性能。凭借足够的自由度(DOFs)和末端执行器的模块化设计,该机器人能够为许多其他不同物体的自主打磨提供通用解决方案。

英文摘要

Large demand for robotics and automation has been reflected in the sanding works, as current manual operations are labor-intensive, without consistent quality, and also subject to safety and health issues. While several machines have been developed to automate one or two steps in the sanding works, the autonomous capability of existing solutions is relatively low, and the human assistance or supervision is still heavily required in the calibration of target objects or the planning of robot motion and tasks. This paper presents the development of an autonomous sanding robot, which is able to perform the sanding works on an unknown object automatically, without any prior calibration or human intervention. The developed robot works as follows. First, the target object is scanned then modeled with the structured-light camera. Second, the robot motion is planned to cover all the surfaces of the object with an optimized transition sequence. Third, the robot is controlled to perform the sanding on the object under the desired impedance model. A prototype of the sanding robot is fabricated and its performance is validated in the task of sanding a batch of wooden boxes. With sufficient degrees of freedom (DOFs) and the module design for the end effector, the developed robot is able to provide a general solution to the autonomous sanding on many other different objects.

1807.09519 2026-06-04 math.NA cs.LG cs.NA

A machine learning framework for data driven acceleration of computations of differential equations

一种用于微分方程计算的数据驱动加速的机器学习框架

Siddhartha Mishra

发表机构 * Seminar for Applied Mathematics (SAM), D-Math ETH Zürich(应用数学研讨会(SAM),ETH Zurich 数学系)

AI总结 本文提出了一种机器学习框架,用于加速时间依赖的常微分方程和偏微分方程的数值计算,通过将现有数值方法转化为人工神经网络,并通过离线训练过程最小化损失函数来确定可训练参数,从而提高计算效率。

详情
AI中文摘要

我们提出了一种机器学习框架,用于加速时间依赖的常微分方程和偏微分方程的数值计算。我们的方法是将(现有数值方法的)泛化形式作为人工神经网络,具有一个可训练的参数集。这些参数通过离线训练过程通过(随机)梯度下降方法(近似)最小化合适的(可能非凸)损失函数来确定。所提出的算法始终与底层微分方程保持一致。涉及线性和非线性ODE和PDE模型问题的数值实验显示,与标准数值方法相比,计算效率有显著提升。

英文摘要

We propose a machine learning framework to accelerate numerical computations of time-dependent ODEs and PDEs. Our method is based on recasting (generalizations of) existing numerical methods as artificial neural networks, with a set of trainable parameters. These parameters are determined in an offline training process by (approximately) minimizing suitable (possibly non-convex) loss functions by (stochastic) gradient descent methods. The proposed algorithm is designed to be always consistent with the underlying differential equation. Numerical experiments involving both linear and non-linear ODE and PDE model problems demonstrate a significant gain in computational efficiency over standard numerical methods.

1812.05591 2026-06-04 eess.SY cs.AI cs.MA cs.SY

TuSeRACT: Turn-Sample-Based Real-Time Traffic Signal Control

TuSeRACT:基于转向的实时交通信号控制

Srishti Dhamija, Pradeep Varakantham

发表机构 * School of Information Systems, Singapore Management University(新加坡管理大学信息学院)

AI总结 本文提出TuSeRACT,一种基于转向的实时交通信号控制方法,通过采样转向流量来优化交通信号调度,从而降低车辆等待时间,相比SURTRAC有更优的性能。

详情
AI中文摘要

实时交通信号控制是一个具有挑战性的问题,由于不断变化的交通需求模式、有限的规划时间和各种不确定性来源(例如转向运动、车辆检测)在现实世界中。SURTRAC(可扩展的Urban交通控制)是一种最近开发的交通信号控制方法,它在实时计算中计算减少延误和协调(跨邻近交通灯)的即将到来车辆集群的调度。为了确保在转向引起的不确定性存在下实时响应性,SURTRAC计算调度以最小化预期转向运动的延误,而不是在转向引起的不确定性下最小化预期延误。这种近似确保了实时可处理性,但在存在转向引起的不确定性时会降低解决方案质量。为了解决这一限制,我们引入了TuSeRACT(基于转向的实时交通信号控制),一种分布式基于采样的调度方法用于交通信号控制。与SURTRAC不同,TuSeRACT计算调度以最小化采样转向运动的观察交通下的预期延误,并与邻近交叉口通信流量样本。我们将这种基于采样的调度问题公式化为一个约束程序,并在合成交通网络上经验性地评估了我们的方法。我们的方法在车辆等待时间方面相对于SURTRAC提供了显著更低的平均值。

英文摘要

Real-time traffic signal control is a challenging problem owing to constantly changing traffic demand patterns, limited planning time and various sources of uncertainty (e.g., turn movements, vehicle detection) in the real world. SURTRAC (Scalable URban TRAffic Control) is a recently developed traffic signal control approach which computes delay-minimizing and coordinated (across neighbouring traffic lights) schedules of oncoming vehicle clusters in real time. To ensure real-time responsiveness in the presence of turn-induced uncertainty, SURTRAC computes schedules which minimize the delay for the expected turn movements as opposed to minimizing the expected delay under turn-induced uncertainty. This approximation ensures real-time tractability, but degrades solution quality in the presence of turn-induced uncertainty. To address this limitation, we introduce TuSeRACT (Turn Sample based Real-time trAffic signal ConTrol), a distributed sample-based scheduling approach to traffic signal control. Unlike SURTRAC, TuSeRACT computes schedules that minimize expected delay over sampled turn movements of observed traffic, and communicates samples of traffic outflows to neighbouring intersections. We formulate this sample-based scheduling problem as a constraint program and empirically evaluate our approach on synthetic traffic networks. Our approach provides substantially lower mean vehicular waiting times relative to SURTRAC.

1903.00220 2026-06-04 eess.SY cs.RO cs.SY

Approximate Robust Control of Uncertain Dynamical Systems

不确定动力系统近似鲁棒控制

Edouard Leurent, Yann Blanco, Denis Efimov, Odalric-Ambrym Maillard

发表机构 * INRIA Lille(INRIA里尔) Renault(雷诺) Non-A team, INRIA Lille SequeL team, INRIA Lille(非A团队,INRIA里尔SequeL团队,INRIA里尔)

AI总结 本文研究了在不确定环境中大型非线性系统安全控制策略的设计,提出两种可处理的鲁棒控制方法,应用于自动驾驶问题。

详情
Journal ref
32nd Conference on Neural Information Processing Systems (NeurIPS 2018) Workshop, Dec 2018, Montr{é}al, Canada
AI中文摘要

本文研究了在不确定环境中大型非线性系统安全控制策略的设计。在这样的情况下,鲁棒控制框架是一种安全性的原理性方法,旨在最大化系统的最坏情况性能。然而,由此产生的优化问题通常对于具有连续状态的非线性系统来说是不可行的。为了解决这个问题,我们引入了两种可处理的方法,这些方法基于采样或对鲁棒目标的保守近似。所提出的方法应用于自动驾驶问题。

英文摘要

This work studies the design of safe control policies for large-scale non-linear systems operating in uncertain environments. In such a case, the robust control framework is a principled approach to safety that aims to maximize the worst-case performance of a system. However, the resulting optimization problem is generally intractable for non-linear systems with continuous states. To overcome this issue, we introduce two tractable methods that are based either on sampling or on a conservative approximation of the robust objective. The proposed approaches are applied to the problem of autonomous driving.

1903.00182 2026-06-04 eess.SY cs.DC cs.LG cs.SY

Distributed Variational Bayesian Algorithms for Extended Object Tracking

分布式变分贝叶斯算法用于扩展目标跟踪

Junhao Hua, Chunguang Li

发表机构 * College of Information Science and Electronic Engineering, Zhejiang University(浙江大学信息科学与电子工程学院)

AI总结 本文研究了分布式扩展目标跟踪问题,提出了一种基于变分贝叶斯方法的集中算法,并扩展到分布式场景,通过交替方向乘子法技术,同时估计扩展目标状态和测量噪声协方差。

Comments 14 pages, 9 figures

详情
AI中文摘要

本文关注分布式扩展目标跟踪问题,旨在通过节点网络协同估计目标的状态和扩展。在传统跟踪应用中,大多数方法将目标视为测量点源,由于传感器分辨率有限。最近,一些研究考虑了扩展对象,即空间结构化的对象,即多个分辨率单元被目标占据。在这种设置中,每个时间步长生成多个测量值。本文提出了一种用于传感器网络中扩展目标跟踪问题的贝叶斯模型。在该模型中,目标扩展由对称正定随机矩阵表示,并假设存在但未知的测量噪声。基于此贝叶斯模型,我们首先提出了一种基于变分贝叶斯方法的新型集中算法用于扩展目标跟踪。然后,我们基于交替方向乘子法(ADMM)技术将其扩展到分布式场景。所提出的算法可以同时估计扩展目标状态(运动状态和扩展)和测量噪声协方差。给出了扩展目标跟踪和群体目标跟踪的仿真以验证所提模型和算法的有效性。

英文摘要

This paper is concerned with the problem of distributed extended object tracking, which aims to collaboratively estimate the state and extension of an object by a network of nodes. In traditional tracking applications, most approaches consider an object as a point source of measurements due to limited sensor resolution capabilities. Recently, some studies consider the extended objects, which are spatially structured, i.e., multiple resolution cells are occupied by an object. In this setting, multiple measurements are generated by each object per time step. In this paper, we present a Bayesian model for extended object tracking problem in a sensor network. In this model, the object extension is represented by a symmetric positive definite random matrix, and we assume that the measurement noise exists but is unknown. Using this Bayesian model, we first propose a novel centralized algorithm for extended object tracking based on variational Bayesian methods. Then, we extend it to the distributed scenario based on the alternating direction method of multipliers (ADMM) technique. The proposed algorithms can simultaneously estimate the extended object state (the kinematic state and extension) and the measurement noise covariance. Simulations on both extended object tracking and group target tracking are given to verify the effectiveness of the proposed model and algorithms.

1902.11136 2026-06-04 eess.SY cs.LG cs.SY math.DS physics.ao-ph

Learning Dynamical Systems from Partial Observations

从部分观测中学习动力系统

Ibrahim Ayed, Emmanuel de Bézenac, Arthur Pajot, Julien Brajard, Patrick Gallinari

发表机构 * Theresis lab, Thales, Thales Research \& Technology Route D\'epartementale, 91120 Palaiseau Sorbonne Universit\'e, CNRS-IRD-MNHN, LOCEAN, Paris, France Remote Sensing Center, Bergen, Norway Criteo AI Lab, Paris, France

AI总结 本文提出了一种数据驱动的框架,用于从部分观测中预测复杂非线性时空过程,通过神经网络估计时间变化的微分方程来建模系统动力学,并在浅水和欧拉模拟中验证了该方法在长期预测和学习隐藏状态方面的有效性。

详情
AI中文摘要

我们考虑在观测仅提供系统状态部分信息的情况下,预测复杂非线性时空过程的问题。我们提出了一种自然的数据驱动框架,其中系统的动力学由一个未知的时间变化微分方程建模,通过神经网络从数据中估计演化项。任何未来的状态都可以通过将关联的微分方程输入ODE求解器来计算。我们首先在浅水和欧拉模拟上评估了我们的方法,发现该方法不仅能够产生高质量的长期预测,还能学习产生接近系统真实状态的隐藏状态,而无需对后者进行直接监督。在具有挑战性的最新海洋模拟中进行的额外实验进一步验证了我们的发现,同时在经典基线方法上展示了显著的改进。

英文摘要

We consider the problem of forecasting complex, nonlinear space-time processes when observations provide only partial information of on the system's state. We propose a natural data-driven framework, where the system's dynamics are modelled by an unknown time-varying differential equation, and the evolution term is estimated from the data, using a neural network. Any future state can then be computed by placing the associated differential equation in an ODE solver. We first evaluate our approach on shallow water and Euler simulations. We find that our method not only demonstrates high quality long-term forecasts, but also learns to produce hidden states closely resembling the true states of the system, without direct supervision on the latter. Additional experiments conducted on challenging, state of the art ocean simulations further validate our findings, while exhibiting notable improvements over classical baselines.

1902.11015 2026-06-04 eess.SY cs.MA cs.RO cs.SY

Mobile Formation Coordination and Tracking Control for Multiple Non-holonomic Vehicles

多非holonomic车辆的移动编队协调与跟踪控制

Xiuhui Peng, Zhiyong Sun, Kexin Guo, Zhiyong Geng

发表机构 * Department of Automatic Control, Lund University(自动化系,吕勒奥大学)

AI总结 本文针对非holonomic车辆在SE(2)上的轨迹跟踪和移动编队协调问题,提出了一种基于中间姿态变量的分阶段控制方法,并证明了严格刚体运动的编队条件,同时通过仿真和实验验证了所提控制器的性能。

详情
AI中文摘要

本文针对在SE(2)上进行轨迹跟踪和移动编队协调的非holonomic车辆的前进运动控制问题。首先,通过构建一个包含车辆位置信息和期望姿态的中间姿态变量,设计了分阶段的平移和旋转控制输入以解决轨迹跟踪问题。其次,深入探讨了非holonomic车辆之间的相对位置和航向的协调关系,以维持具有刚体运动约束的移动编队。证明了除了平行编队和纯 translation 直线编队的情况外,只要每个车辆的线速度与角速度的比值为常数,就可以实现严格刚体运动的编队。还展示了具有弱刚体运动的移动编队的运动特性。之后,基于所提的轨迹跟踪方法,在有向树图上设计了分布式移动编队控制律。所提控制器的性能通过数值仿真和实验进行了验证。

英文摘要

This paper addresses forward motion control for trajectory tracking and mobile formation coordination for a group of non-holonomic vehicles on SE(2). Firstly, by constructing an intermediate attitude variable which involves vehicles' position information and desired attitude, the translational and rotational control inputs are designed in two stages to solve the trajectory tracking problem. Secondly, the coordination relationships of relative positions and headings are explored thoroughly for a group of non-holonomic vehicles to maintain a mobile formation with rigid body motion constraints. We prove that, except for the cases of parallel formation and translational straight line formation, a mobile formation with strict rigid-body motion can be achieved if and only if the ratios of linear speed to angular speed for each individual vehicle are constants. Motion properties for mobile formation with weak rigid-body motion are also demonstrated. Thereafter, based on the proposed trajectory tracking approach, a distributed mobile formation control law is designed under a directed tree graph. The performance of the proposed controllers is validated by both numerical simulations and experiments.

1902.10590 2026-06-04 cs.SE cs.AI cs.LG cs.SY eess.SY

Architecting Dependable Learning-enabled Autonomous Systems: A Survey

构建可靠的学习自主系统:一项综述

Chih-Hong Cheng, Dhiraj Gulati, Rongjie Yan

发表机构 * fortiss - Research Institute of the Free State of Bavaria, Germany(巴伐利亚自由州研究 institute) State Key Laboratory of Computer Science, China(中国计算机科学国家重点实验室)

AI总结 本文综述了构建可靠学习自主系统的方法,重点在于自动驾驶,讨论了多样冗余、信息融合和运行时监控等技术支柱,并总结了提升深度学习组件可靠性的最新方法,最后提出了研究方向。

详情
AI中文摘要

我们提供了一项关于构建可靠学习自主系统架构方法的综述,重点在于自动驾驶。我们考虑了构建可靠自主性的三个技术支柱,即多样化冗余、信息融合和运行时监控。对于学习组件,我们还总结了近年来提高深度学习组件可靠性的最新架构方法。最后,我们以现有方法面临的挑战为导向,提出了一 series of promising research directions.

英文摘要

We provide a summary over architectural approaches that can be used to construct dependable learning-enabled autonomous systems, with a focus on automated driving. We consider three technology pillars for architecting dependable autonomy, namely diverse redundancy, information fusion, and runtime monitoring. For learning-enabled components, we additionally summarize recent architectural approaches to increase the dependability beyond standard convolutional neural networks. We conclude the study with a list of promising research directions addressing the challenges of existing approaches.

1703.00734 2026-06-04 stat.ML cs.DC cs.LG cs.NA math.NA stat.ME

Distributed Bayesian Matrix Factorization with Limited Communication

分布式贝叶斯矩阵分解与有限通信

Xiangju Qin, Paul Blomstedt, Eemeli Leppäaho, Pekka Parviainen, Samuel Kaski

发表机构 * Helsinki Institute for Information Technology HIIT, Department of Computer Science, Aalto University(赫尔辛基信息科技研究院 HIIT,计算机科学系,阿莱大学) Department of Informatics, University of Bergen(信息学院,卑尔根大学)

AI总结 本文提出了一种分布式贝叶斯矩阵分解方法,通过分层分解联合后验分布,结合并行计算和高效近似实现,提高了大规模数据处理效率,同时保持预测准确性。

Comments 28 pages, 8 figures. The paper is published in Machine Learning journal. An implementation of the method is is available in SMURFF software on github (bmfpp branch): https://github.com/ExaScience/smurff

详情
Journal ref
Machine Learning, 2019
AI中文摘要

贝叶斯矩阵分解(BMF)是一种强大的工具,用于生成低秩矩阵表示,并预测缺失值和提供置信区间。对大规模矩阵的后验推断进行扩展具有挑战性,需要将数据和计算分布到多个工人上,使通信成为主要的计算瓶颈。 embarrassingly parallel 推断可以通过在不同数据子集上使用完全独立的计算来消除通信需求,但会受到BMF解的固有不可识别性的影响。我们引入了联合后验分布的分层分解,将子推断耦合起来,允许在最多三个阶段中进行 embarrassingly parallel 计算。使用高效的近似实现,我们在真实和模拟数据上经验性地展示了改进。我们的分布式方法能够实现比完整后验快几乎一个数量级的速度提升,对预测准确性影响微小。我们的方法在准确性上优于最先进的 embarrassingly parallel MCMC 方法,并在结果上与其它可用的分布式和并行BMF实现具有竞争力。

英文摘要

Bayesian matrix factorization (BMF) is a powerful tool for producing low-rank representations of matrices and for predicting missing values and providing confidence intervals. Scaling up the posterior inference for massive-scale matrices is challenging and requires distributing both data and computation over many workers, making communication the main computational bottleneck. Embarrassingly parallel inference would remove the communication needed, by using completely independent computations on different data subsets, but it suffers from the inherent unidentifiability of BMF solutions. We introduce a hierarchical decomposition of the joint posterior distribution, which couples the subset inferences, allowing for embarrassingly parallel computations in a sequence of at most three stages. Using an efficient approximate implementation, we show improvements empirically on both real and simulated data. Our distributed approach is able to achieve a speed-up of almost an order of magnitude over the full posterior, with a negligible effect on predictive accuracy. Our method outperforms state-of-the-art embarrassingly parallel MCMC methods in accuracy, and achieves results competitive to other available distributed and parallel implementations of BMF.

1902.09427 2026-06-04 eess.SP cs.LG cs.SY eess.SY stat.ML

Fault Diagnosis Method Based on Scaling Law for On-line Refrigerant Leak Detection

基于缩放定律的故障诊断方法用于在线制冷剂泄漏检测

Shun Takeuchi, Takahiro Saito

发表机构 * Machine Discovery Technology Project Artificial Intelligence Laboratory Fujitsu Laboratories Ltd., Kanagawa, Japan Machine Learning Technology Project Artificial Intelligence Laboratory Fujitsu Laboratories Ltd., Kanagawa, Japan

AI总结 本文提出了一种基于物理建模和空调系统控制机制的制冷剂泄漏故障诊断方法,通过推导与制冷剂泄漏相关的缩放定律,使模型能够适用于不同配置的空调系统,利用实验室的小规模离线故障测试数据估计缩放指数,并通过真实数据验证,证明了该方法在早期泄漏检测中的有效性。

Comments 8 pages, 6 figures

详情
Journal ref
2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)
AI中文摘要

利用仪器化传感器数据进行早期故障检测是机器学习在工业设施中的一个有前景的应用领域。然而,由于目标诊断系统中复杂的系统配置和不足的故障数据,训练出的故障检测模型的泛化性能难以提高。将训练好的模型应用于其他系统并不容易。本文提出了一种考虑空调系统物理建模和控制机制的制冷剂泄漏故障诊断方法。我们推导出与制冷剂泄漏相关的有用缩放定律。如果控制机制相同,模型可以应用于其他空调系统,而不论系统配置如何。在实验室中获得的小规模离线故障测试数据用于估计缩放指数。我们通过真实数据评估所提出的缩放定律。基于两组之间相互作用的统计假设检验,我们证明了不同空调系统的缩放指数是等价的。此外,我们基于缩放定律对实际过程数据的泄漏程度时间序列进行了估计,并通过与专家评估的比较,证明了该方法在早期泄漏检测中的有效性。

英文摘要

Early fault detection using instrumented sensor data is one of the promising application areas of machine learning in industrial facilities. However, it is difficult to improve the generalization performance of the trained fault-detection model because of the complex system configuration in the target diagnostic system and insufficient fault data. It is not trivial to apply the trained model to other systems. Here we propose a fault diagnosis method for refrigerant leak detection considering the physical modeling and control mechanism of an air-conditioning system. We derive a useful scaling law related to refrigerant leak. If the control mechanism is the same, the model can be applied to other air-conditioning systems irrespective of the system configuration. Small-scale off-line fault test data obtained in a laboratory are applied to estimate the scaling exponent. We evaluate the proposed scaling law by using real-world data. Based on a statistical hypothesis test of the interaction between two groups, we show that the scaling exponents of different air-conditioning systems are equivalent. In addition, we estimated the time series of the degree of leakage of real process data based on the scaling law and confirmed that the proposed method is promising for early leak detection through comparison with assessment by experts.

1902.09426 2026-06-04 eess.SP cs.LG cs.SY eess.SY stat.ML

Semi-supervised Approach to Soft Sensor Modeling for Fault Detection in Industrial Systems with Multiple Operation Modes

基于半监督方法的软传感器建模用于具有多种操作模式的工业系统故障检测

Shun Takeuchi, Takuya Nishino, Takahiro Saito, Isamu Watanabe

发表机构 * Artificial Intelligence Research Center(人工智能研究中心) Knowledge Information Processing Laboratory(知识信息处理实验室) Fujitsu Laboratories Ltd.(Fujitsu实验室有限公司) Japan(日本)

AI总结 本文提出了一种半监督方法用于软传感器建模,以解决在多操作模式系统中因目标变量数据不足而无法有效训练的问题,通过利用操作模式转换点的特性来改进模型预测能力。

Comments 7 pages, 1 figure

详情
Journal ref
International Conference on Advanced Intelligent Systems and Informatics 2017
AI中文摘要

在工业系统中,某些需要监控以检测故障的过程变量往往难以或无法测量。软传感器技术广泛用于从易于测量的变量估计这些难以测量的过程变量。软传感器建模需要包含各种状态信息的训练数据集,但目标变量的故障数据集不足,无法作为训练数据集。本文描述了一种半监督方法用于软传感器建模,以将缺少目标变量的不完整数据集纳入训练数据集。为了整合不完整数据集,我们考虑系统中操作模式转换点的特性。在约束条件下,通过从模式转换信息中获得的约束条件估计操作模式的回归系数。在案例研究中,这种受约束的软传感器建模被用于预测具有加热和制冷操作模式的空调系统中的制冷剂泄漏。结果表明,这种建模方法对于具有多种操作模式的系统中的软传感器具有前景。

英文摘要

In industrial systems, certain process variables that need to be monitored for detecting faults are often difficult or impossible to measure. Soft sensor techniques are widely used to estimate such difficult-to-measure process variables from easy-to-measure ones. Soft sensor modeling requires training datasets including the information of various states such as operation modes, but the fault dataset with the target variable is insufficient as the training dataset. This paper describes a semi-supervised approach to soft sensor modeling to incorporate an incomplete dataset without the target variable in the training dataset. To incorporate the incomplete dataset, we consider the properties of processes at transition points between operation modes in the system. The regression coefficients of the operation modes are estimated under constraint conditions obtained from the information on the mode transitions. In a case study, this constrained soft sensor modeling was used to predict refrigerant leaks in air-conditioning systems with heating and cooling operation modes. The results show that this modeling method is promising for soft sensors in a system with multiple operation modes.

1812.06120 2026-06-04 eess.SY cs.AI cs.RO cs.SY

Simulation to Scaled City: Zero-Shot Policy Transfer for Traffic Control via Autonomous Vehicles

模拟到缩放城市:通过自动驾驶车辆实现交通控制的零样本策略迁移

Kathy Jang, Eugene Vinitsky, Behdad Chalaki, Ben Remer, Logan Beaver, Andreas Malikopoulos, Alexandre Bayen

发表机构 * University of California, Berkeley(加州大学伯克利分校) University of Delaware(德克萨斯大学)

AI总结 本文通过深度强化学习训练自动驾驶车辆在环形交叉口的控制策略,并将训练好的策略迁移至缩放智能城市进行测试,发现注入噪声的策略在迁移后表现更佳,实现了交通流的优化。

Comments To be published at the International Conference on Cyber Physical Systems (ICCPS) 2019. 10 pages, 9 figures

详情
AI中文摘要

使用深度强化学习,我们训练了自动驾驶车辆在车队中通过环形交叉口的控制策略。使用Flow库,我们在微仿真器中训练了两种策略:一种在状态和动作空间中注入噪声,另一种则没有。在模拟中,自动驾驶车辆为两种策略都学习出一种涌现的引导行为,即减速以实现更流畅的合并。随后,我们将该策略直接迁移至德雷克塞尔大学缩放智能城市(UDSSC)测试平台,该平台是连接和自动化车辆的1:25比例测试场。我们对两种策略在缩放城市中的性能进行了表征。结果显示,无噪声策略经常导致碰撞,仅偶尔实现引导;而注入噪声的策略则始终表现出引导行为且无碰撞,表明噪声有助于零样本策略迁移。此外,迁移后的噪声注入策略在UDSSC中使平均行程时间减少了5%,最大行程时间减少了22%。控制器的视频可在https://sites.google.com/view/iccps-policy-transfer查看。

英文摘要

Using deep reinforcement learning, we train control policies for autonomous vehicles leading a platoon of vehicles onto a roundabout. Using Flow, a library for deep reinforcement learning in micro-simulators, we train two policies, one policy with noise injected into the state and action space and one without any injected noise. In simulation, the autonomous vehicle learns an emergent metering behavior for both policies in which it slows to allow for smoother merging. We then directly transfer this policy without any tuning to the University of Delaware Scaled Smart City (UDSSC), a 1:25 scale testbed for connected and automated vehicles. We characterize the performance of both policies on the scaled city. We show that the noise-free policy winds up crashing and only occasionally metering. However, the noise-injected policy consistently performs the metering behavior and remains collision-free, suggesting that the noise helps with the zero-shot policy transfer. Additionally, the transferred, noise-injected policy leads to a 5% reduction of average travel time and a reduction of 22% in maximum travel time in the UDSSC. Videos of the controllers can be found at https://sites.google.com/view/iccps-policy-transfer.

1902.08594 2026-06-04 eess.SY cs.LG cs.MA cs.SY stat.ML

Regression-based Inverter Control for Decentralized Optimal Power Flow and Voltage Regulation

基于回归的逆变器控制用于分布式最优功率流和电压调节

Oscar Sondermeijer, Roel Dobbe, Daniel Arnold, Claire Tomlin, Tamás Keviczky

发表机构 * 2 Department of Electrical Engineering \& Computer Sciences, UC Berkeley, Berkeley, USA 3 Department of Mechanical Engineering, UC Berkeley, Berkeley, USA 4 Delft Center for Systems Control, Delft University of Technology, Delft, The Netherlands

AI总结 本文提出了一种系统化的数据驱动方法,通过本地测量确定逆变器输出无功功率,以实现接近最优的结果,该方法通过网络模型和历史负荷和发电数据进行最优功率流计算,然后利用回归找到每个逆变器的函数,将本地历史数据映射到其最优无功功率注入的近似值,从而实现分布式控制,以在电压和容量约束下最小化损耗并实现电压平坦化,同时允许高效的电压-无功优化(VVO)方案,使传统控制设备与现有逆变器协同工作,以安全运行高分布式发电水平的配电网。

Comments Cite as: Oscar Sondermeijer, Roel Dobbe, Daniel Arnold, Claire Tomlin and Tamás Keviczky, "Regression-based Inverter Control for Decentralized Optimal Power Flow and Voltage Regulation", IEEE Power & Energy Society General Meeting, Boston, July 2016

详情
AI中文摘要

电子功率逆变器能够快速提供无功功率以维持客户电压在运行容差范围内,并减少配电网中的系统损耗。本文提出了一种系统化且数据驱动的方法,以确定无功功率逆变器输出作为本地测量函数的方式,以获得接近最优的结果。首先,我们使用网络模型和历史负荷和发电数据,并进行最优功率流计算,以计算网络中所有可控逆变器的全局最优无功功率注入。随后,我们使用回归找到每个逆变器的函数,将本地历史数据映射到其最优无功功率注入的近似值。所得函数随后作为参与逆变器的分布式控制器,根据新的本地测量预测最优注入。该方法在执行电压和容量约束下的损耗最小化和电压平坦化时能够实现接近最优的结果,并允许高效的电压-无功优化(VVO)方案,其中传统控制设备与现有逆变器协同工作,以安全运行具有更高分布式发电水平的配电网。

英文摘要

Electronic power inverters are capable of quickly delivering reactive power to maintain customer voltages within operating tolerances and to reduce system losses in distribution grids. This paper proposes a systematic and data-driven approach to determine reactive power inverter output as a function of local measurements in a manner that obtains near optimal results. First, we use a network model and historic load and generation data and do optimal power flow to compute globally optimal reactive power injections for all controllable inverters in the network. Subsequently, we use regression to find a function for each inverter that maps its local historical data to an approximation of its optimal reactive power injection. The resulting functions then serve as decentralized controllers in the participating inverters to predict the optimal injection based on a new local measurements. The method achieves near-optimal results when performing voltage- and capacity-constrained loss minimization and voltage flattening, and allows for an efficient volt-VAR optimization (VVO) scheme in which legacy control equipment collaborates with existing inverters to facilitate safe operation of distribution networks with higher levels of distributed generation.

1807.04020 2026-06-04 math.NA cs.LG cs.NA stat.ML

Improved SVD-based Initialization for Nonnegative Matrix Factorization using Low-Rank Correction

改进的基于SVD的非负矩阵分解初始化方法:利用低秩修正

Atif Muhammad Syed, Sameer Qazi, Nicolas Gillis

发表机构 * Graduate School of Science and Engineering(研究生院) PAF-Karachi Institute of Economics and Technology(卡拉奇经济和技术学院) Department of Mathematics and Operational Research(数学与运筹学系)

AI总结 本文提出了一种改进的基于SVD的非负矩阵分解初始化方法,通过考虑被丢弃的SVD因子来降低初始误差,同时生成稀疏初始因子并提高计算效率。

Comments 12 pages, 1 figure, 5 tables, submitted to pattern recognition letters

详情
Journal ref
Pattern Recognition Letters 122, pp. 53-59, 2019
AI中文摘要

由于大多数非负矩阵分解(NMF)算法的迭代性质,初始化是一个关键因素,因为它显著影响收敛性和最终得到的解。许多初始化方案已被提出,其中最受欢迎的一类方法基于奇异值分解(SVD)。然而,这些基于SVD的初始化方法并不满足一个自然条件,即误差应随着因子分解的秩增加而减少。在本文中,我们提出了一种新的基于SVD的NMF初始化方法,专门针对这一不足,通过考虑用于获得非负初始化而被丢弃的SVD因子。这种方法称为非负SVD与低秩修正(NNSVD-LRC),通过利用被丢弃的SVD因子的低秩结构,在可忽略的额外计算成本下显著降低初始误差。与以往基于SVD的初始化方法相比,NNSVD-LRC还有两个其他优势:(1)它能够证明生成稀疏的初始因子;(2)它更快,因为它只需要计算秩为⌈r/2 + 1⌉的截断SVD,其中r是所求NMF分解的因子秩(与其他方法不同,其他方法需要计算秩为r的截断SVD)。我们在多个标准密集和稀疏数据集上展示了我们的新方法在NMF中与最先进的基于SVD的初始化方法竞争性。

英文摘要

Due to the iterative nature of most nonnegative matrix factorization (\textsc{NMF}) algorithms, initialization is a key aspect as it significantly influences both the convergence and the final solution obtained. Many initialization schemes have been proposed for NMF, among which one of the most popular class of methods are based on the singular value decomposition (SVD). However, these SVD-based initializations do not satisfy a rather natural condition, namely that the error should decrease as the rank of factorization increases. In this paper, we propose a novel SVD-based \textsc{NMF} initialization to specifically address this shortcoming by taking into account the SVD factors that were discarded to obtain a nonnegative initialization. This method, referred to as nonnegative SVD with low-rank correction (NNSVD-LRC), allows us to significantly reduce the initial error at a negligible additional computational cost using the low-rank structure of the discarded SVD factors. NNSVD-LRC has two other advantages compared to previous SVD-based initializations: (1) it provably generates sparse initial factors, and (2) it is faster as it only requires to compute a truncated SVD of rank $\lceil r/2 + 1 \rceil$ where $r$ is the factorization rank of the sought NMF decomposition (as opposed to a rank-$r$ truncated SVD for other methods). We show on several standard dense and sparse data sets that our new method competes favorably with state-of-the-art SVD-based initializations for NMF.

1902.07708 2026-06-04 eess.SY cs.RO cs.SY

A Stability Analysis for the Acceleration-based Robust Position Control of Robot Manipulators via Disturbance Observer

基于扰动观测器的机器人操作臂加速度鲁棒位置控制的稳定性分析

Emre Sariyildiz, Hiromu Sekiguchi, Takahiro Nozaki, Barkan Ugurlu, Kouhei Ohnishi

发表机构 * University of Wollongong(沃林戈大学) Keio University(庆应大学)

AI总结 本文提出了一种新的非线性稳定性分析方法,用于通过扰动观测器(DOb)实现机器人操作臂的加速度鲁棒位置控制,证明了在适当调节名义惯性矩阵时,位置误差在调节控制中渐近趋于零,在轨迹跟踪控制中是均匀最终有界的。随着DOb的带宽和名义惯性矩阵的增加,误差的界减小,即位置控制系统的鲁棒稳定性和性能得到改善。然而,由于实际设计限制,DOb的带宽和名义惯性矩阵不能随意增加,例如,当它们增加时,鲁棒位置控制器会变得更加噪声敏感。所提出的稳定性分析为DOb基于鲁棒运动控制系统动态行为提供了见解。理论和实验证明了名义惯性矩阵的非对角元素对稳定性和调整鲁棒性与噪声敏感性之间的权衡是有帮助的。该提议的有效性通过仿真和实验结果得到验证。

Comments 9 pages, 9 figures, Journal

详情
Journal ref
IEEE/ASME Transactions On Mechatronics, Vol. 23, No. 5, October 2018
AI中文摘要

本文提出了一种新的非线性稳定性分析方法,用于通过扰动观测器(DOb)实现机器人操作臂的加速度鲁棒位置控制。证明了如果在DOb的设计中适当调节名义惯性矩阵,则在调节控制中位置误差渐近趋于零,在轨迹跟踪控制中是均匀最终有界的。随着DOb的带宽和名义惯性矩阵的增加,误差的界减小,即位置控制系统的鲁棒稳定性和性能得到改善。然而,由于实际设计限制,DOb的带宽和名义惯性矩阵不能随意增加,例如,当它们增加时,鲁棒位置控制器会变得更加噪声敏感。所提出的稳定性分析为DOb基于鲁棒运动控制系统动态行为提供了见解。理论和实验证明了名义惯性矩阵的非对角元素对稳定性和调整鲁棒性与噪声敏感性之间的权衡是有帮助的。该提议的有效性通过仿真和实验结果得到验证。

英文摘要

This paper proposes a new nonlinear stability analysis for the acceleration-based robust position control of robot manipulators by using Disturbance Observer (DOb). It is shown that if the nominal inertia matrix is properly tuned in the design of DOb, then the position error asymptotically goes to zero in regulation control and is uniformly ultimately bounded in trajectory tracking control. As the bandwidth of DOb and the nominal inertia matrix are increased, the bound of error shrinks, i.e., the robust stability and performance of the position control system are improved. However, neither the bandwidth of DOb nor the nominal inertia matrix can be freely increased due to practical design constraints, e.g., the robust position controller becomes more noise sensitive when they are increased. The proposed stability analysis provides insights regarding the dynamic behavior of DOb-based robust motion control systems. It is theoretically and experimentally proved that non-diagonal elements of the nominal inertia matrix are useful to improve the stability and adjust the trade-off between the robustness and noise sensitivity. The validity of the proposal is verified by simulation and experimental results.

1712.10158 2026-06-04 q-bio.NC cs.LG cs.NE cs.SY eess.SY stat.ML

Non-linear motor control by local learning in spiking neural networks

通过局部学习在脉冲神经网络中实现非线性运动控制

Aditya Gilra, Wulfram Gerstner

发表机构 * School of Computer and Communication Sciences(计算机与通信科学学院) Brain-Mind Institute, School of Life Sciences(脑科学与生命科学研究所)

AI总结 本文提出了一种基于反馈的在线局部学习权重(FOLLOW)方法,用于训练异构脉冲神经网络,以控制双臂并重现期望的状态轨迹,核心贡献是通过局部可塑性规则学习逆模型以实现非线性动力学控制。

详情
Journal ref
Proceedings of the 35th International Conference on Machine Learning, PMLR 80:1773-1782, 2018
AI中文摘要

在具有隐藏神经元的脉冲神经网络中,使用局部、稳定且在线的规则学习权重,以控制非线性身体动力学是一个开放性问题。本文采用监督方案,反馈基于在线局部学习权重(FOLLOW),训练具有隐藏层的异质脉冲神经元网络,以控制双臂以重现期望状态轨迹。网络首先学习非线性动力学的逆模型,即从状态轨迹作为输入,学习推断产生轨迹的连续时间命令。连接权重通过涉及前突触放电和后突触误差反馈的局部可塑性规则进行调整。我们选择了一种称为微分前馈的网络架构,该架构在不同前馈和递归架构中提供了最低的测试误差。学习的逆模型随后用于生成连续时间运动命令以控制手臂,给定期望轨迹。

英文摘要

Learning weights in a spiking neural network with hidden neurons, using local, stable and online rules, to control non-linear body dynamics is an open problem. Here, we employ a supervised scheme, Feedback-based Online Local Learning Of Weights (FOLLOW), to train a network of heterogeneous spiking neurons with hidden layers, to control a two-link arm so as to reproduce a desired state trajectory. The network first learns an inverse model of the non-linear dynamics, i.e. from state trajectory as input to the network, it learns to infer the continuous-time command that produced the trajectory. Connection weights are adjusted via a local plasticity rule that involves pre-synaptic firing and post-synaptic feedback of the error in the inferred command. We choose a network architecture, termed differential feedforward, that gives the lowest test error from different feedforward and recurrent architectures. The learned inverse model is then used to generate a continuous-time motor command to control the arm, given a desired trajectory.

1712.06281 2026-06-04 math.OC cs.LG cs.SY eess.SY physics.chem-ph

A New Data-Driven Sparse-Learning Approach to Study Chemical Reaction Networks

一种新的数据驱动稀疏学习方法用于研究化学反应网络

Farshad Harirchi, Doohyun Kim, Omar A. Khalil, Sijia Liu, Paolo Elvati, Angela Violi, Alfred O. Hero

发表机构 * Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI 48109--2125, USA(电气工程与计算机科学系,密歇根大学,安娜堡,MI 48109--2125,美国) Department of Mechanical Engineering, University of Michigan, Ann Arbor, MI 48109--2125, USA(机械工程系,密歇根大学,安娜堡,MI 48109--2125,美国) Departments of Chemical Engineering, Biomedical Engineering, Macromolecular Science and Engineering, Biophysics Program, University of Michigan, Ann Arbor, MI 48109--2125, USA(化学工程、生物医学工程、大分子科学与工程、生物物理项目系,密歇根大学,安娜堡,MI 48109--2125,美国)

AI总结 本文提出了一种数据驱动的稀疏学习方法,用于识别化学反应网络中关键反应,该方法通过物种浓度和反应速率来确定影响反应,具有低计算成本,无需额外数据或模拟,应用于氢气和丙烷的燃烧化学分析,并展示了简化机制在点火延迟上的良好性能。

详情
AI中文摘要

化学动力学机制可以通过一组基本反应表示,这些反应可以利用物理化学关系轻松转换为数学表达式。反应的示意图表示捕捉了反应物和产物之间的相互作用。确定系统动态行为下的最小化学相互作用是一个主要任务。在本文中,我们介绍了一种新的方法,利用数据驱动的稀疏学习技术来识别化学反应网络中在燃烧应用中的关键反应。所提出的方法利用物种浓度和反应速率来确定一组关键反应,且具有最小的计算成本,无需额外的数据或模拟。新的方法应用于分析恒容均质反应器中氢气和丙烷的燃烧化学。稀疏学习方法识别出的关键反应与当前化学机制的速率理论知识一致。此外,我们还表明,可以将不同时间和条件下识别出的关键反应组合起来,生成一个简化版本的原始机制,并且对于氢气和丙烷,这种简化机制在广泛的条件下表现出与原始机制相近的点火延迟性能。我们的结果展示了稀疏学习方法作为有效且高效的机制分析和机制简化工具的潜力。

英文摘要

Chemical kinetic mechanisms can be represented by sets of elementary reactions that are easily translated into mathematical terms using physicochemical relationships. The schematic representation of reactions captures the interactions between reacting species and products. Determining the minimal chemical interactions underlying the dynamic behavior of systems is a major task. In this paper, we introduce a novel approach for the identification of the influential reactions in chemical reaction networks for combustion applications, using a data-driven sparse-learning technique. The proposed approach identifies a set of influential reactions using species concentrations and reaction rates, with minimal computational cost without requiring additional data or simulations. The new approach is applied to analyze the combustion chemistry of H2 and C3H8 in a constant-volume homogeneous reactor. The influential reactions identified by the sparse-learning method are consistent with the current kinetics knowledge of chemical mechanisms. Additionally, we show that a reduced version of the parent mechanism can be generated as a combination of the influential reactions identified at different times and conditions and that for both H2 and C3H8 this reduced mechanism performs closely to the parent mechanism as a function of ignition delay over a wide range of conditions. Our results demonstrate the potential of the sparse-learning approach as an effective and efficient tool for mechanism analysis and mechanism reduction.

1902.02542 2026-06-04 math.OC cs.LG cs.SY eess.SY

Predict Globally, Correct Locally: Parallel-in-Time Optimal Control of Neural Networks

全局预测,局部校正:神经网络优化的并行时间最优控制

Panos Parpas, Corey Muir

发表机构 * Department of Computing, Imperial College London, London, United Kingdom(帝国理工学院计算系,伦敦,英国)

AI总结 本文提出了一种新的分布式优化算法,通过将神经网络的层视为动力系统离散动力学,利用最优控制的共态(adjoints)与反向传播的关系,实现参数更新无需等待前向或反向传播完成,从而提高效率。

详情
AI中文摘要

动态系统最优控制与神经网络之间的联系在理论和实践中都具有价值。几位研究者利用这些联系来研究不同神经网络架构的稳定性,并开发了内存高效的训练算法。我们同样采用动态系统的观点来看待神经网络,但我们的目标与早期工作不同。我们利用动态系统、最优控制和神经网络之间的联系,开发了一种新的分布式优化算法。所提出的算法解决了分布式神经网络优化算法最显著的障碍:网络权重不能在数据前向传播完成之前更新,且反向传播梯度完成之后才能更新。利用动态系统的观点,我们将(残差)神经网络的层解释为动态系统的离散动力学,并利用最优控制问题的共态(adjoints)与反向传播之间的关系。然后我们开发了一种并行时间方法,该方法在前向或反向传播算法完全完成之前即可更新网络参数。我们建立了所提算法的收敛性。初步的数值结果表明,该算法在竞争性和效率方面优于最先进的方法。

英文摘要

The links between optimal control of dynamical systems and neural networks have proved beneficial both from a theoretical and from a practical point of view. Several researchers have exploited these links to investigate the stability of different neural network architectures and develop memory efficient training algorithms. We also adopt the dynamical systems view of neural networks, but our aim is different from earlier works. We exploit the links between dynamical systems, optimal control, and neural networks to develop a novel distributed optimization algorithm. The proposed algorithm addresses the most significant obstacle for distributed algorithms for neural network optimization: the network weights cannot be updated until the forward propagation of the data, and backward propagation of the gradients are complete. Using the dynamical systems point of view, we interpret the layers of a (residual) neural network as the discretized dynamics of a dynamical system and exploit the relationship between the co-states (adjoints) of the optimal control problem and backpropagation. We then develop a parallel-in-time method that updates the parameters of the network without waiting for the forward or back propagation algorithms to complete in full. We establish the convergence of the proposed algorithm. Preliminary numerical results suggest that the algorithm is competitive and more efficient than the state-of-the-art.

1902.01064 2026-06-04 cs.DC cs.LG cs.SY eess.SY

Hop: Heterogeneity-Aware Decentralized Training

Hop:异质性感知的去中心化训练

Qinyi Luo, Jinkun Lin, Youwei Zhuo, Xuehai Qian

发表机构 * University of Southern California(南加州大学) Tsinghua University(清华大学)

AI总结 本文提出Hop,首个考虑异质性的去中心化训练协议,通过引入迭代间隙这一独特特性,提出基于队列的同步机制以实现备份工作者和有限滞后,同时通过跳过迭代来缓解确定性延迟,实验表明在异质环境中相比标准去中心化训练有显著加速。

详情
AI中文摘要

近期研究表明,在机器学习领域,去中心化算法在异质环境中相较于集中化算法能提供更优的性能。这两种方法的主要区别在于其不同的通信模式,两者在异质环境中都可能受到性能下降的影响。尽管已有大量努力支持集中化算法对抗异质性,但针对去中心化算法的相关研究却十分有限。本文提出Hop,首个异质性感知的去中心化训练协议。基于我们识别出的去中心化训练的一个独特特性,即迭代间隙,我们提出一种基于队列的同步机制,能够高效实现备份工作者和有限滞后。为了应对确定性延迟,我们提出跳过迭代,以进一步减轻较慢工作者的影响。我们基于TensorFlow构建了Hop的原型实现。在CNN和SVM上的实验结果表明,在异质环境中相比标准去中心化训练有显著的加速效果。

英文摘要

Recent work has shown that decentralized algorithms can deliver superior performance over centralized ones in the context of machine learning. The two approaches, with the main difference residing in their distinct communication patterns, are both susceptible to performance degradation in heterogeneous environments. Although vigorous efforts have been devoted to supporting centralized algorithms against heterogeneity, little has been explored in decentralized algorithms regarding this problem. This paper proposes Hop, the first heterogeneity-aware decentralized training protocol. Based on a unique characteristic of decentralized training that we have identified, the iteration gap, we propose a queue-based synchronization mechanism that can efficiently implement backup workers and bounded staleness in the decentralized setting. To cope with deterministic slowdown, we propose skipping iterations so that the effect of slower workers is further mitigated. We build a prototype implementation of Hop on TensorFlow. The experiment results on CNN and SVM show significant speedup over standard decentralized training in heterogeneous settings.

1809.06277 2026-06-04 math.OC cs.LG cs.SY eess.SY

Optimal Matrix Momentum Stochastic Approximation and Applications to Q-learning

最优矩阵动量随机逼近及其在Q学习中的应用

Adithya M. Devraj, Ana Bušić, Sean Meyn

发表机构 * Department of Electrical and Computer Engineering, University of Florida(佛罗里达大学电气与计算机工程系)

AI总结 本文提出两种新的根寻找算法,PolSA和NeSA,用于解决优化问题,并探讨了这些算法在强化学习中的应用,特别是在Q学习中通过随机逼近实现最优渐近协方差。

详情
AI中文摘要

加速是随机优化文献中越来越常见的主题。最常见的例子是Nesterov的方法和Polyak的动量技术。在本文中,针对根寻找问题引入了两种新的算法:1)PolSA是一种具有特别设计的矩阵动量的根寻找算法,2)NeSA可以被视为Nesterov算法的一种变种,或PolSA的简化版本。PolSA算法在优化领域(当作为根寻找问题处理时)是新的。本文研究的调研受到强化学习应用的启发。众所周知,大多数TD-和Q学习的变种可以作为SA(随机逼近)算法来处理,且一般SA理论的工具可用于研究收敛性和收敛速率的界限。特别是,渐近方差是SA算法性能的常见度量标准,也是评估随机优化算法性能的多种度量之一。有两种广为人知的SA技术已知具有最优渐近方差:Ruppert-Polyak平均技术和随机牛顿-拉夫逊(SNR)。前者算法可能具有极差的瞬时性能,而后者计算成本较高。本文证明了新提出的PolSA算法的参数估计与理想(但更复杂)SNR算法的估计耦合。因此,新算法成为获得最优渐近协方差的第三种方法。这些强结果需要对模型的假设。考虑了线性化模型,并假设噪声是一个鞅差序列。在非线性设置中获得了数值结果,这是本文工作的动机:在PolSA实现的Q学习中,在这种非理想设置下观察到与SNR的耦合。

英文摘要

Acceleration is an increasingly common theme in the stochastic optimization literature. The two most common examples are Nesterov's method, and Polyak's momentum technique. In this paper two new algorithms are introduced for root finding problems: 1) PolSA is a root finding algorithm with specially designed matrix momentum, and 2) NeSA can be regarded as a variant of Nesterov's algorithm, or a simplification of PolSA. The PolSA algorithm is new even in the context of optimization (when cast as a root finding problem). The research surveyed in this paper is motivated by applications to reinforcement learning. It is well known that most variants of TD- and Q-learning may be cast as SA (stochastic approximation) algorithms, and the tools from general SA theory can be used to investigate convergence and bounds on convergence rate. In particular, the asymptotic variance is a common metric of performance for SA algorithms, and is also one among many metrics used in assessing the performance of stochastic optimization algorithms. There are two well known SA techniques that are known to have optimal asymptotic variance: the Ruppert-Polyak averaging technique, and stochastic Newton-Raphson (SNR). The former algorithm can have extremely bad transient performance, and the latter can be computationally expensive. It is demonstrated here that parameter estimates from the new PolSA algorithm couple with those of the ideal (but more complex) SNR algorithm. The new algorithm is thus a third approach to obtain optimal asymptotic covariance. These strong results require assumptions on the model. A linearized model is considered, and the noise is assumed to be a martingale difference sequence. Numerical results are obtained in a non-linear setting that is the motivation for this work: In PolSA implementations of Q-learning it is observed that coupling occurs with SNR in this non-ideal setting.

1807.04638 2026-06-04 math.NA cs.CV cs.NA

PDE-constrained LDDMM via geodesic shooting and inexact Gauss-Newton-Krylov optimization using the incremental adjoint Jacobi equations

基于偏微分方程约束的LDDMM通过测地线射击和近似高斯-牛顿-克罗内克优化使用增量伴随雅可比方程

Monica Hernandez

发表机构 * Computer Sciences Department(计算机科学系) Aragon Institute on Engineering Research(阿拉贡工程研究院) University of Zaragoza(萨拉戈塔大学)

AI总结 本文提出了一种基于偏微分方程约束的LDDMM方法,利用测地线射击和近似高斯-牛顿-克罗内克优化,通过增量伴随雅可比方程在初始速度场空间中进行参数化,从而避免了对初始速度场的复杂依赖,提供了高效的测地线路径。

详情
AI中文摘要

在偏微分方程约束的大变形 diffeomorphic度量映射框架下提出的一类非刚性注册方法是一个特别有趣且具有物理意义的 diffeomorphic 注册方法集合。在该框架中,不精确的牛顿-克罗内克优化已显示出卓越的数值精度和极快的收敛速度。然而,非稳态速度场的伽辽金表示法并未提供适当的测地线路径。在本文中,我们提出了一种在初始速度场空间中参数化的偏微分方程约束LDDMM方法。梯度和Hessian-向量积的推导是在最终速度场上进行,并通过伴随和增量伴随雅可比方程反向传输。这样,我们避免了在推导和计算伴随方程及其增量版本时对初始速度场的复杂依赖。所提出的方法在偏微分方程约束LDDMM框架内提供了测地线,并展示了与基准PDE约束LDDMM和EPDiff-LDDMM方法相媲美的性能。

英文摘要

The class of non-rigid registration methods proposed in the framework of PDE-constrained Large Deformation Diffeomorphic Metric Mapping is a particularly interesting family of physically meaningful diffeomorphic registration methods. Inexact Newton-Krylov optimization has shown an excellent numerical accuracy and an extraordinarily fast convergence rate in this framework. However, the Galerkin representation of the non-stationary velocity fields does not provide proper geodesic paths. In this work, we propose a method for PDE-constrained LDDMM parameterized in the space of initial velocity fields under the EPDiff equation. The derivation of the gradient and the Hessian-vector products are performed on the final velocity field and transported backward using the adjoint and the incremental adjoint Jacobi equations. This way, we avoid the complex dependence on the initial velocity field in the derivations and the computation of the adjoint equation and its incremental counterpart. The proposed method provides geodesics in the framework of PDE-constrained LDDMM, and it shows performance competitive to benchmark PDE-constrained LDDMM and EPDiff-LDDMM methods.

1606.02421 2026-06-04 stat.ML cs.AI cs.DC cs.LG cs.SY eess.SY

Gossip Dual Averaging for Decentralized Optimization of Pairwise Functions

基于 gossip 的双重平均法用于分布式优化配对函数

Igor Colin, Aurélien Bellet, Joseph Salmon, Stéphan Clémençon

发表机构 * Magnet Team, INRIA Lille – Nord Europe(磁力团队、法国国家信息与自动化技术研究所里尔-北欧洲分部)

AI总结 本文提出了一种基于 gossip 的双重平均算法,用于在分布式网络中优化配对函数,适用于排名、距离度量学习和图推断等应用,通过同步和异步设置解决优化问题,并展示了其在AUC最大化和度量学习中的实际应用。

详情
AI中文摘要

在分布式网络(如传感器、连接设备等)中,存在对高效算法优化全局成本函数的重要需求,例如从每个计算单元收集的本地数据中学习全局模型。本文针对分布式最小化数据点配对函数的问题,这些点分布在定义网络通信拓扑的图的节点上。该问题在排名、距离度量学习和图推断等领域有广泛应用。我们提出了一种基于双重平均的新型 gossip 算法,旨在在同步和异步设置中解决此类问题。所提出的框架足够灵活,能够处理约束和正则化优化问题的变体。我们的理论分析表明,所提出的算法在保持集中式双重平均收敛速度的同时,仅引入一个加性偏差项。我们还通过在AUC最大化和度量学习问题上的数值模拟,展示了我们方法的实际价值。

英文摘要

In decentralized networks (of sensors, connected objects, etc.), there is an important need for efficient algorithms to optimize a global cost function, for instance to learn a global model from the local data collected by each computing unit. In this paper, we address the problem of decentralized minimization of pairwise functions of the data points, where these points are distributed over the nodes of a graph defining the communication topology of the network. This general problem finds applications in ranking, distance metric learning and graph inference, among others. We propose new gossip algorithms based on dual averaging which aims at solving such problems both in synchronous and asynchronous settings. The proposed framework is flexible enough to deal with constrained and regularized variants of the optimization problem. Our theoretical analysis reveals that the proposed algorithms preserve the convergence rate of centralized dual averaging up to an additive bias term. We present numerical simulations on Area Under the ROC Curve (AUC) maximization and metric learning problems which illustrate the practical interest of our approach.

1511.05464 2026-06-04 stat.ML cs.DC cs.LG cs.SY eess.SY stat.CO

Extending Gossip Algorithms to Distributed Estimation of U-Statistics

将 gossip 算法扩展到分布式 U-统计量估计

Igor Colin, Aurélien Bellet, Joseph Salmon, Stéphan Clémençon

发表机构 * INRIA Lille - Nord Europe(INRIA里尔-北欧洲)

AI总结 本文提出新的同步和异步随机 gossip 算法,用于在分布式网络中同时传播数据并维护局部的 U-统计量估计,证明了同步和异步情况下的收敛率分别为 O(1/t) 和 O(log t / t),并通过数值实验验证了算法的优越性。

Comments to be presented at NIPS 2015

详情
AI中文摘要

高效且稳健的去中心化网络估计算法对于许多分布式系统至关重要。尽管样本均值统计的分布式估计已受到广泛关注,但依赖于对观测对的更昂贵平均的 U-统计量计算却是一个研究较少的领域。然而,这些数据函数对于描述统计总体的全局特性至关重要,重要例子包括曲线下面积、经验方差、基尼均差和簇内点散度。本文提出新的同步和异步随机 gossip 算法,同时在网络中传播数据并维护感兴趣的 U-统计量的局部估计。我们建立了同步和异步情况下的收敛率界分别为 O(1/t) 和 O(log t / t),其中 t 是迭代次数,且具有明确的数据和网络依赖项。除了在速率分析方面的优越比较外,数值实验还提供了实证证据,证明所提出的算法优于之前引入的方法。

英文摘要

Efficient and robust algorithms for decentralized estimation in networks are essential to many distributed systems. Whereas distributed estimation of sample mean statistics has been the subject of a good deal of attention, computation of $U$-statistics, relying on more expensive averaging over pairs of observations, is a less investigated area. Yet, such data functionals are essential to describe global properties of a statistical population, with important examples including Area Under the Curve, empirical variance, Gini mean difference and within-cluster point scatter. This paper proposes new synchronous and asynchronous randomized gossip algorithms which simultaneously propagate data across the network and maintain local estimates of the $U$-statistic of interest. We establish convergence rate bounds of $O(1/t)$ and $O(\log t / t)$ for the synchronous and asynchronous cases respectively, where $t$ is the number of iterations, with explicit data and network dependent terms. Beyond favorable comparisons in terms of rate analysis, numerical experiments provide empirical evidence the proposed algorithms surpasses the previously introduced approach.

1812.06325 2026-06-04 eess.SY cs.LG cs.RO cs.SY

Data-efficient Auto-tuning with Bayesian Optimization: An Industrial Control Study

数据高效自动调优与贝叶斯优化:一项工业控制研究

Matthias Neumann-Brosig, Alonso Marco, Dieter Schwarzmann, Sebastian Trimpe

发表机构 * IAV GmbH(IAV集团) Max Planck Society(马克斯·普朗克学会) Cyber Valley initiative(Cyber Valley倡议) Max Planck Institute for Intelligent Systems(智能系统研究所)

AI总结 本文提出利用贝叶斯优化自动学习最优控制器参数,通过概率模型(高斯过程)建模控制器参数到用户定义成本的未知函数,并通过实验数据迭代优化,以高效找到全局最优参数,实验表明其在 throttle valve 控制中优于手动校准。

Comments 11 pages, 7 figures and 4 tables. To appear in IEEE Transactions on Control Systems Technology

详情
AI中文摘要

贝叶斯优化被提出用于从实验数据自动学习最优控制器参数。通过概率描述(高斯过程)建模控制器参数到用户定义成本的未知函数。概率模型通过在物理系统上测试一组参数并评估成本来更新。为加快学习速度,贝叶斯优化算法系统地选择下一步评估的参数,例如通过最大化关于最优解的信息增益。因此,该算法通过少量实验迭代找到全局最优参数。以节流阀控制为例,所提出的自动调优方法在低实验次数下 consistently 实现更好的性能,优于手动校准。所提出的自动调优框架具有灵活性,可处理不同的控制结构和目标。

英文摘要

Bayesian optimization is proposed for automatic learning of optimal controller parameters from experimental data. A probabilistic description (a Gaussian process) is used to model the unknown function from controller parameters to a user-defined cost. The probabilistic model is updated with data, which is obtained by testing a set of parameters on the physical system and evaluating the cost. In order to learn fast, the Bayesian optimization algorithm selects the next parameters to evaluate in a systematic way, for example, by maximizing information gain about the optimum. The algorithm thus iteratively finds the globally optimal parameters with only few experiments. Taking throttle valve control as a representative industrial control example, the proposed auto-tuning method is shown to outperform manual calibration: it consistently achieves better performance with a low number of experiments. The proposed auto-tuning framework is flexible and can handle different control structures and objectives.