arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 21511
1605.09232 2026-06-04 math.NA cs.LG cs.NA cs.NE math.OC stat.ML

Tradeoffs between Convergence Speed and Reconstruction Accuracy in Inverse Problems

反问题中收敛速度与重建精度之间的权衡

Raja Giryes, Yonina C. Eldar, Alex M. Bronstein, Guillermo Sapiro

发表机构 * School of Electrical Engineering, Tel Aviv University(特拉维夫大学电子工程学院) Electrical Engineering Department, Technion - IIT(技术学院-理工学院电子工程系) Computer Science Department, Technion - IIT(技术学院-理工学院计算机科学系) Electrical and Computer Engineering Department, Duke University(杜克大学电气与计算机工程系)

AI总结 研究探讨了在逆问题中,通过调整迭代算法以加快收敛速度同时保持重建精度的可行性,结合低维集的恢复技术,分析了粗略估计对收敛速度的影响。

Comments To appear in IEEE Transactions on Signal Processing

详情
AI中文摘要

使用迭代算法求解逆问题在大数据中很流行。由于时间限制,迭代次数通常有限,可能影响可实现的精度。给定可接受的误差范围,一个重要问题是是否可以通过修改原始迭代方法,获得更快收敛到达到允许误差的极小值点,而不显著增加每次迭代的计算成本。基于最近为某些低维集信号恢复开发的恢复技术,我们表明使用该集的粗略估计可能以额外的重建误差为代价加快收敛。我们的理论与稀疏恢复、压缩感知和深度学习的最新进展相关。特别是,它可能为神经网络通过层表示迭代来近似l1最小化解的成功提供了可能的解释,如在学习迭代收缩阈值算法(LISTA)中实践的那样。

英文摘要

Solving inverse problems with iterative algorithms is popular, especially for large data. Due to time constraints, the number of possible iterations is usually limited, potentially affecting the achievable accuracy. Given an error one is willing to tolerate, an important question is whether it is possible to modify the original iterations to obtain faster convergence to a minimizer achieving the allowed error without increasing the computational cost of each iteration considerably. Relying on recent recovery techniques developed for settings in which the desired signal belongs to some low-dimensional set, we show that using a coarse estimate of this set may lead to faster convergence at the cost of an additional reconstruction error related to the accuracy of the set approximation. Our theory ties to recent advances in sparse recovery, compressed sensing, and deep learning. Particularly, it may provide a possible explanation to the successful approximation of the l1-minimization solution by neural networks with layers representing iterations, as practiced in the learned iterative shrinkage-thresholding algorithm (LISTA).

1709.07089 2026-06-04 eess.SY cs.LG cs.SY stat.ML

On the Design of LQR Kernels for Efficient Controller Learning

关于为高效控制器学习设计LQR核

Alonso Marco, Philipp Hennig, Stefan Schaal, Sebastian Trimpe

发表机构 * Max Planck Institute for Intelligent Systems(马克斯·普朗克智能系统研究所) Computational Learning and Motor Control Lab(计算学习与运动控制实验室)

AI总结 本文提出两种基于LQR结构的核,用于改进基于贝叶斯优化的控制器学习,通过模拟线性和非线性系统证明其优于传统GP方法。

Comments 8 pages, 5 figures, to appear in 56th IEEE Conference on Decision and Control (CDC 2017)

详情
AI中文摘要

从数据中为非线性动态系统寻找最优反馈控制器是困难的。最近,贝叶斯优化(BO)被提出作为直接从实验试次中调整控制器的强大框架。为了选择下一个查询点并找到全局最优解,BO依赖于潜在目标函数的概率描述,通常为高斯过程(GP)。本文显示,使用常见核的GP在标准二次控制问题上可能导致学习效果差。对于一阶系统,我们构建了两种核,专门利用广为人知的线性二次调节器(LQR)的结构,同时保留贝叶斯非参数学习的灵活性。对不确定线性和非线性系统的模拟显示,LQR核在学习性能上优于传统GP方法。

英文摘要

Finding optimal feedback controllers for nonlinear dynamic systems from data is hard. Recently, Bayesian optimization (BO) has been proposed as a powerful framework for direct controller tuning from experimental trials. For selecting the next query point and finding the global optimum, BO relies on a probabilistic description of the latent objective function, typically a Gaussian process (GP). As is shown herein, GPs with a common kernel choice can, however, lead to poor learning outcomes on standard quadratic control problems. For a first-order system, we construct two kernels that specifically leverage the structure of the well-known Linear Quadratic Regulator (LQR), yet retain the flexibility of Bayesian nonparametric learning. Simulations of uncertain linear and nonlinear systems demonstrate that the LQR kernels yield superior learning performance.

1801.09238 2026-06-04 eess.SY cs.CV cs.SY stat.AP

Performance Analysis of Robust Stable PID Controllers Using Dominant Pole Placement for SOPTD Process Models

基于主导极点放置的鲁棒稳定PID控制器性能分析用于SOPTD过程模型

Saptarshi Das, Kaushik Halder, Amitava Gupta

发表机构 * Department of Mathematics, College of Engineering, Mathematics and Physical Sciences, University of Exeter(数学系,工程、数学与物理科学学院,埃克塞特大学) Department of Power Engineering, Jadavpur University(动力工程系,贾瓦德普大学)

AI总结 本文提出新的主导极点放置PID控制器设计方法,用于处理具有时间延迟的二阶过程。通过三阶Pade近似约束闭环主导和非主导极点位置,分析不同非主导极点类型对稳定性区域的影响。

Comments 50 pages, 42 figures, Knowledge-Based Systems, 2018

详情
AI中文摘要

本文推导了新的基于主导极点放置的PID控制器设计公式,用于处理具有时间延迟的二阶过程(SOPTD)。之前已尝试在无延迟系统中进行极点放置。时间延迟项在Pade近似中表现为具有可变数量交错极点和零点的高阶系统,这使得精确极点放置控制变得困难。本文报告了使用三阶Pade近似来约束闭环主导和非主导极点在复数s平面上的解析表达式。然而,通过增加Pade阶数验证了不同时间延迟近似对闭环性能的不变性,代表了更接近现实的高阶延迟动态。非主导极点的性质(如全部为复数、实数或组合)会影响特征方程并影响可实现的稳定性区域。不同类型的非主导极点及其对应的稳定性区域对九个测试台过程的影响被获得,这些过程表现出不同的开环阻尼比和滞后到延迟比。接下来,通过蒙特卡洛模拟研究不同表达式在设计参数空间中产生更宽稳定性区域的效果。随后,通过成千上万的蒙特卡洛模拟研究了各种时域和频域控制性能参数及其在不确定过程参数下的偏差,围绕每个测试台过程的鲁棒稳定解。

英文摘要

This paper derives new formulations for designing dominant pole placement based proportional-integral-derivative (PID) controllers to handle second order processes with time delays (SOPTD). Previously, similar attempts have been made for pole placement in delay-free systems. The presence of the time delay term manifests itself as a higher order system with variable number of interlaced poles and zeros upon Pade approximation, which makes it difficult to achieve precise pole placement control. We here report the analytical expressions to constrain the closed loop dominant and non-dominant poles at the desired locations in the complex s-plane, using a third order Pade approximation for the delay term. However, invariance of the closed loop performance with different time delay approximation has also been verified using increasing order of Pade, representing a closed to reality higher order delay dynamics. The choice of the nature of non-dominant poles e.g. all being complex, real or a combination of them modifies the characteristic equation and influences the achievable stability regions. The effect of different types of non-dominant poles and the corresponding stability regions are obtained for nine test-bench processes indicating different levels of open-loop damping and lag to delay ratio. Next, we investigate which expression yields a wider stability region in the design parameter space by using Monte Carlo simulations while uniformly sampling a chosen design parameter space. Various time and frequency domain control performance parameters are investigated next, as well as their deviations with uncertain process parameters, using thousands of Monte Carlo simulations, around the robust stable solution for each of the nine test-bench processes.

1801.09657 2026-06-04 math.NA cs.LG cs.NA stat.ME

Matrix Completion for Structured Observations

结构观测的矩阵补全

Denali Molitor, Deanna Needell

发表机构 * University of California, Los Angeles(加州大学洛杉矶分校)

AI总结 本文提出改进的核范数最小化方法,以考虑观测与未观测条目间的结构性差异,提升矩阵补全效果。

详情
AI中文摘要

预测或填补缺失数据(即矩阵补全)是当今数据驱动世界中的常见挑战。以往策略通常假设观测与缺失条目之间无结构性差异。不幸的是,这一假设在许多应用中显得不现实。例如,在经典的Netflix挑战中,预测用户对未观看电影的评分时,观众未观看某部电影可能表明对该电影缺乏兴趣,从而建议评分低于预期。本文提出调整标准核范数最小化策略,通过正则化未观测条目的值来考虑此类结构性差异。我们证明在某些情况下,所提方法优于核范数最小化。

英文摘要

The need to predict or fill-in missing data, often referred to as matrix completion, is a common challenge in today's data-driven world. Previous strategies typically assume that no structural difference between observed and missing entries exists. Unfortunately, this assumption is woefully unrealistic in many applications. For example, in the classic Netflix challenge, in which one hopes to predict user-movie ratings for unseen films, the fact that the viewer has not watched a given movie may indicate a lack of interest in that movie, thus suggesting a lower rating than otherwise expected. We propose adjusting the standard nuclear norm minimization strategy for matrix completion to account for such structural differences between observed and unobserved entries by regularizing the values of the unobserved entries. We show that the proposed method outperforms nuclear norm minimization in certain settings.

1801.09361 2026-06-04 eess.SY cs.RO cs.SY

Safe and Efficient Intersection Control of Connected and Autonomous Intersection Traffic

连接与自动驾驶交叉口交通的安全高效交叉控制

Qiang Lu

发表机构 * Daniel Felix Ritchie School of Engineering and Computer Science(丹尼尔·费利克斯·里奇工程与计算机科学学院)

AI总结 本文提出DICA算法,用于协调自动驾驶车辆在交叉口的安全高效通行,同时通过Reactive DICA算法优化紧急车辆通行优先级,确保应急车辆快速通过而最小化其他车辆的行程影响。

Comments 104 pages, 23 figures, PhD comprehensive thesis

详情
AI中文摘要

本论文针对自动驾驶和连接地面交通在交叉口的安全高效通行问题,提出了一种名为离散时间占用轨迹基于交叉口交通协调算法(DICA)的算法。所有系统中的车辆均为连接和自动驾驶车辆(CAVs),并具备无线车与交叉口通信能力。在所提出的框架中,交叉口基于车辆提出的DTOTs协调其运动,以高效通过交叉口并避免碰撞。在车辆DTOTs发生碰撞时,交叉口修改冲突的DTOTs以避免碰撞,并请求CAVs根据修改后的DTOTs接近和通过交叉口。我们证明了基本DICA是无死锁和无饥饿的,并且在计算复杂度上具有保守性,通过几种计算方法进行了改进。接着,本文还解决了通过自动驾驶和连接交叉口交通快速疏散紧急车辆的问题。所提出的交叉口控制算法Reactive DICA旨在确定一个高效的车辆通行序列,使紧急车辆能够尽快通过交叉口,同时最小化其他车辆的行程时间。当没有紧急车辆在交叉口区域内时,车辆由DICA控制。当有紧急车辆进入通信范围时,我们通过优化车辆的排序优先处理紧急车辆。提出了一种遗传算法来解决优化问题,找到最优的车辆序列,使紧急车辆获得最高优先级。

英文摘要

In this dissertation, we address a problem of safe and efficient intersection crossing traffic management of autonomous and connected ground traffic. Toward this objective, an algorithm that is called the Discrete-time occupancies trajectory based Intersection traffic Coordination Algorithm (DICA) is proposed. All vehicles in the system are Connected and Autonomous Vehicles (CAVs) and capable of wireless Vehicle-to-Intersection communication. In the proposed framework, an intersection coordinates the motions of CAVs based on their proposed DTOTs to let them cross the intersection efficiently while avoiding collisions. In case when there is a collision between vehicles' DTOTs, the intersection modifies conflicting DTOTs to avoid the collision and requests CAVs to approach and cross the intersection according to the modified DTOTs. We then prove that the basic DICA is deadlock free and also starvation free. We also show that the basic DICA is conservative in computational complexity and improve it by several computational approaches. Next, we addressed the problem of evacuating emergency vehicles as quickly as possible through autonomous and connected intersection traffic in this dissertation. The proposed intersection control algorithm Reactive DICA aims to determine an efficient vehicle-passing sequence which allows the emergency vehicle to cross an intersection as soon as possible while the travel times of other vehicles are minimally affected. When there are no emergency vehicles within the intersection area, the vehicles are controlled by DICA. When there are emergency vehicles entering communication range, we prioritize emergency vehicles through optimal ordering of vehicles. A genetic algorithm is proposed to solve the optimization problem which finds the optimal vehicle sequence that gives the emergency vehicles the highest priority.

1801.07132 2026-06-04 cs.CR cs.RO cs.SY eess.SY

SecSens: Secure State Estimation with Application to Localization and Time Synchronization

SecSens: 带有定位和时间同步应用的安全状态估计

Amr Alanwar, Bernhard Etzlinger, Henrique Ferraz, Joao Hespanha, Mani Srivastava

发表机构 * University of California, Los Angeles(加州大学洛杉矶分校) University of California, Santa Barbara(加州大学圣芭芭拉分校) Johannes Kepler University(约翰·凯撒大学)

AI总结 本文提出SecSens,一种用于对抗模型和测量噪声的新型安全非线性状态估计方法,通过SecEKF和SecOPT算法实现安全状态估计,无需专用硬件或加密技术,在强攻击下性能优于现有方案。

详情
AI中文摘要

针对网络物理系统中信息技术和物理传感耦合带来的安全漏洞,本文提出SecSens,一种基于扩展卡尔曼滤波和最大似然估计的新型安全非线性状态估计方法。SecSens包含SecEKF和SecOPT两个独立算法,通过整体方法引入安全意识,无需专用硬件或加密技术。本文将SecSens应用于网络化移动设备的安全定位和时间同步,实验表明其在强攻击下性能优于现有方案。

英文摘要

Research evidence in Cyber-Physical Systems (CPS) shows that the introduced tight coupling of information technology with physical sensing and actuation leads to more vulnerability and security weaknesses. But, the traditional security protection mechanisms of CPS focus on data encryption while neglecting the sensors which are vulnerable to attacks in the physical domain. Accordingly, researchers attach utmost importance to the problem of state estimation in the presence of sensor attacks. In this work, we present SecSens, a novel approach for secure nonlinear state estimation in the presence of modeling and measurement noise. SecSens consists of two independent algorithms, namely, SecEKF and SecOPT, which are based on Extended Kalman Filter and Maximum Likelihood Estimation, respectively. We adopt a holistic approach to introduce security awareness among state estimation algorithms without requiring specialized hardware, or cryptographic techniques. We apply SecSens to securely localize and time synchronize networked mobile devices. SecSens provides good performance at run-time several order of magnitude faster than the state of art solutions under the presence of powerful attacks. Our algorithms are evaluated on a testbed with static nodes and a mobile quadrotor all equipped with commercial ultra-wide band wireless devices.

1801.06637 2026-06-04 stat.ML cs.LG cs.NA math.AP math.NA

Deep Hidden Physics Models: Deep Learning of Nonlinear Partial Differential Equations

深度隐物理模型:深度学习非线性偏微分方程

Maziar Raissi

发表机构 * Division of Applied Mathematics, Brown University(布朗大学应用数学系)

AI总结 本文提出深度学习方法,从散乱且可能含噪声的观测数据中发现非线性偏微分方程,通过两个深度神经网络近似未知解及非线性动力学,验证了该方法在多个科学领域基准问题中的有效性。

详情
AI中文摘要

在人工智能与应用数学的交汇处,长期存在的问题是设计出能够将观测数据转化为物理世界预测数学模型的算法。在数据丰富和机器学习能力先进的时代,自然的问题是:如何自动从高维实验数据中揭示物理定律?本文提出了一种深度学习方法,用于从空间和时间上散乱且可能含噪声的观测中发现非线性偏微分方程。具体而言,我们通过两个深度神经网络近似未知解及非线性动力学。第一个网络作为未知解的先验,本质上使我们能够避免本质上病态且不稳定的数值微分。第二个网络代表非线性动力学,帮助我们提炼支配给定时空数据集演化的机制。我们测试了该方法在多个科学领域基准问题中的有效性,并展示了所提框架如何帮助我们准确学习底层动力学并预测系统未来状态。特别是,我们研究了Burgers'、Korteweg-de Vries(KdV)、Kuramoto-Sivashinsky、非线性Schrödinger和Navier-Stokes方程。

英文摘要

A long-standing problem at the interface of artificial intelligence and applied mathematics is to devise an algorithm capable of achieving human level or even superhuman proficiency in transforming observed data into predictive mathematical models of the physical world. In the current era of abundance of data and advanced machine learning capabilities, the natural question arises: How can we automatically uncover the underlying laws of physics from high-dimensional data generated from experiments? In this work, we put forth a deep learning approach for discovering nonlinear partial differential equations from scattered and potentially noisy observations in space and time. Specifically, we approximate the unknown solution as well as the nonlinear dynamics by two deep neural networks. The first network acts as a prior on the unknown solution and essentially enables us to avoid numerical differentiations which are inherently ill-conditioned and unstable. The second network represents the nonlinear dynamics and helps us distill the mechanisms that govern the evolution of a given spatiotemporal data-set. We test the effectiveness of our approach for several benchmark problems spanning a number of scientific domains and demonstrate how the proposed framework can help us accurately learn the underlying dynamics and forecast future states of the system. In particular, we study the Burgers', Korteweg-de Vries (KdV), Kuramoto-Sivashinsky, nonlinear Schrödinger, and Navier-Stokes equations.

1801.05894 2026-06-04 math.HO cs.LG cs.NA math.NA stat.ML

Deep Learning: An Introduction for Applied Mathematicians

深度学习:应用数学家的入门指南

Catherine F. Higham, Desmond J. Higham

发表机构 * School of Computing Science, University of Glasgow, UK(格拉斯哥大学计算机科学学院,英国) Department of Mathematics and Statistics, University of Strathclyde, UK(斯特拉斯克莱德大学数学与统计学系,英国)

AI总结 本文从应用数学角度介绍深度学习基本概念,面向数学专业研究生及本科生,通过MATLAB代码和图像分类实例展示神经网络原理与训练方法。

详情
AI中文摘要

多层人工神经网络正成为众多应用领域中的普遍工具。深度学习革命的核心概念来自应用数学和计算数学,包括微积分、逼近论、优化和线性代数。本文为应用数学家提供深度学习基础介绍。目标读者为数学专业研究生及大四本科生,也适用于希望在课堂中引入深度学习应用的数学教师。文章聚焦三个核心问题:什么是深度神经网络?如何训练网络?什么是随机梯度方法?通过MATLAB代码展示网络构建与训练,并展示在大规模图像分类问题中使用最新软件的应用。最后提供当前文献的参考文献。

英文摘要

Multilayered artificial neural networks are becoming a pervasive tool in a host of application fields. At the heart of this deep learning revolution are familiar concepts from applied and computational mathematics; notably, in calculus, approximation theory, optimization and linear algebra. This article provides a very brief introduction to the basic ideas that underlie deep learning from an applied mathematics perspective. Our target audience includes postgraduate and final year undergraduate students in mathematics who are keen to learn about the area. The article may also be useful for instructors in mathematics who wish to enliven their classes with references to the application of deep learning techniques. We focus on three fundamental questions: what is a deep neural network? how is a network trained? what is the stochastic gradient method? We illustrate the ideas with a short MATLAB code that sets up and trains a network. We also show the use of state-of-the art software on a large scale image classification problem. We finish with references to the current literature.

1801.04816 2026-06-04 eess.SY cs.RO cs.SY

Localizability-Constrained Deployment of Mobile Robotic Networks with Noisy Range Measurements

具有噪声测距的移动机器人网络的局部化约束部署

Jerome Le Ny, Simon Chauvière

发表机构 * Polytechnique Montreal(蒙特利尔理工学院) GERAD

AI总结 本文研究了在噪声测距下如何部署移动机器人网络以实现高精度定位,通过局部化函数优化网络几何结构,结合梯度下降法进行分布式控制。

Comments 7 pages, 3 figures

详情
AI中文摘要

当移动网络中的节点使用相对于邻居的噪声测距来估计位置时,整体连通性和测量网络的几何结构对可实现的定位精度有关键影响。本文考虑了部署一个仅基于测距的协作定位移动机器人网络的问题,旨在维持有利于高精度估计机器人位置的网络几何结构。网络几何的质量通过一个称为“局部化性”的函数来衡量,该函数作为机器人运动规划的潜在场。该函数基于Cramér-Rao界,为给定的几何结构提供任何无偏位置估计器的协方差矩阵的下界。我们描述了基于梯度下降的机器人运动规划器,试图优化或约束网络局部化函数的不同变体,并讨论了以分布式方式实现这些控制器的方法。最后,本文还建立了统计观点与维护捕捉相对测距的图的加权刚性形式之间的正式联系。

英文摘要

When nodes in a mobile network use relative noisy measurements with respect to their neighbors to estimate their positions, the overall connectivity and geometry of the measurement network has a critical influence on the achievable localization accuracy. This paper considers the problem of deploying a mobile robotic network implementing a cooperative localization scheme based on range measurements only, while attempting to maintain a network geometry that is favorable to estimating the robots' positions with high accuracy. The quality of the network geometry is measured by a "localizability" function serving as potential field for robot motion planning. This function is built from the Cramér-Rao bound, which provides for a given geometry a lower bound on the covariance matrix achievable by any unbiased position estimator that the robots might implement using their relative measurements. We describe gradient descent-based motion planners for the robots that attempt to optimize or constrain different variations of the network's localizability function, and discuss ways of implementing these controllers in a distributed manner. Finally, the paper also establishes formal connections between our statistical point of view and maintaining a form of weighted rigidity for the graph capturing the relative range measurements.

1706.09993 2026-06-04 math.NA cs.IT cs.LG cs.NA math.IT math.PR math.ST stat.TH

Phase Retrieval via Randomized Kaczmarz: Theoretical Guarantees

通过随机Kaczmarz方法进行相位恢复:理论保证

Yan Shuo Tan, Roman Vershynin

发表机构 * Department of Mathematics, University of Michigan(密歇根大学数学系) Department of Mathematics, University of California, Irvine(加州大学尔湾分校数学系)

AI总结 本文提出随机Kaczmarz方法在相位恢复中的理论保障,证明仅需与维度成正比的高斯测量即可保证收敛,引入了测量集的充分条件,并利用VC维和度量熵证明高斯采样向量满足该条件。

Comments Revised after comments from referees

详情
AI中文摘要

我们考虑相位恢复问题,即求解二次方程组的问题。最近提出了一种随机Kaczmarz方法的简单变种,并在数值上显示出比最先进的Wirtinger流方法更高效。在本文中,我们为相位恢复中的随机Kaczmarz方法提供了首次理论保证。我们证明仅需与维度成正比的高斯测量即可保证收敛。在此过程中,我们引入了一个关于测量集的充分条件,以保证随机Kaczmarz方法能够正常工作。我们证明高斯采样向量以高概率满足该性质;这通过链式论证结合VC维和度量熵的界来证明。

英文摘要

We consider the problem of phase retrieval, i.e. that of solving systems of quadratic equations. A simple variant of the randomized Kaczmarz method was recently proposed for phase retrieval, and it was shown numerically to have a computational edge over state-of-the-art Wirtinger flow methods. In this paper, we provide the first theoretical guarantee for the convergence of the randomized Kaczmarz method for phase retrieval. We show that it is sufficient to have as many Gaussian measurements as the dimension, up to a constant factor. Along the way, we introduce a sufficient condition on measurement sets for which the randomized Kaczmarz method is guaranteed to work. We show that Gaussian sampling vectors satisfy this property with high probability; this is proved using a chaining argument coupled with bounds on VC dimension and metric entropy.

1705.03342 2026-06-04 math.NA cs.NA cs.SD

On the eigenmodes of periodic orbits for multiple scattering problems in 2D

关于二维多重散射问题中周期轨道的本征模

Daan Huybrechs, Peter Opsomer

发表机构 * Department of Computer Science(计算机科学系) KU Leuven(库勒韦恩大学)

AI总结 本文研究了二维多重散射问题中周期轨道的本征模,提出了一种基于边界积分方程的渐近方法,通过泰勒展开近似相位,以加速射线追踪方案。

Comments 24 pages, 9 figures and the implementation is available on https://github.com/popsomer/asyBEM/releases

详情
AI中文摘要

波传播和声学散射问题需要大量计算资源来在高频下准确求解。渐近方法可以通过显式提取解的振荡特性使成本可能与频率无关。然而,在存在多个散射障碍物时,高频波模式变得非常复杂。我们考虑了涉及多个障碍物的二维亥姆霍兹方程的边界积分方程形式,其中已提出射线追踪方案。现有的射线追踪方案分析集中在周期轨道之间的一组障碍物之间。观察到每个障碍物上的密度在几次迭代后趋于平衡。在本文中,我们以泰勒级数形式给出了这些密度相位的渐近近似。密度代表了周期轨道中的完整反射周期。我们最初利用对称性处理两个圆形散射体的情况,但还为任意数量的一般二维障碍物提供了显式算法。系数以及计算它们的时间与波数和入射波无关。这些结果可用于在少量初始迭代后加速射线追踪方案。

英文摘要

Wave propagation and acoustic scattering problems require vast computational resources to be solved accurately at high frequencies. Asymptotic methods can make this cost potentially frequency independent by explicitly extracting the oscillatory properties of the solution. However, the high-frequency wave pattern becomes very complicated in the presence of multiple scattering obstacles. We consider a boundary integral equation formulation of the Helmholtz equation in two dimensions involving several obstacles, for which ray tracing schemes have been previously proposed. The existing analysis of ray tracing schemes focuses on periodic orbits between a subset of the obstacles. One observes that the densities on each of the obstacles converge to an equilibrium after a few iterations. In this paper we present an asymptotic approximation of the phases of those densities in equilibrium, in the form of a Taylor series. The densities represent a full cycle of reflections in a periodic orbit. We initially exploit symmetry in the case of two circular scatterers, but also provide an explicit algorithm for an arbitrary number of general 2D obstacles. The coefficients, as well as the time to compute them, are independent of the wavenumber and of the incident wave. The results may be used to accelerate ray tracing schemes after a small number of initial iterations.

1606.09178 2026-06-04 math.NA cs.NA cs.SD

High-frequency asymptotic compression of dense BEM matrices for general geometries without ray tracing

高频率渐近压缩密集BEM矩阵以适应一般几何形状而无需射线追踪

Daan Huybrechs, Peter Opsomer

发表机构 * Department of Computer Science(计算机科学系) KU Leuven(根特大学)

AI总结 本文提出了一种基于渐近压缩的BEM矩阵方法,通过显式局部化格林函数来减少计算规模,提高矩阵-向量乘积速度和条件数,适用于复杂几何形状的高频声学问题。

Comments 24 pages, 13 figures

详情
AI中文摘要

声学中的波传播和散射问题通常通过边界元方法求解。它们导致一个通常密集且大的离散化矩阵:其大小和条件数随着频率的增加而增长。然而,高频散射问题本质上是局部的,这很好地由高度局部化的射线反弹表示。渐进方法可以用来减少线性系统的规模,甚至使其频率无关,通过显式提取解的振荡特性来实现,使用射线追踪或类似技术。然而,在存在(多个)散射障碍物的复杂几何形状中,射线追踪变得昂贵或难以处理。在本文中,我们从构造完全解析的大而密集矩阵的相同离散化开始,通过显式局部化格林函数来实现渐进压缩。这导致一个大但稀疏的矩阵,具有更快的矩阵-向量乘积速度,并且如数值实验所示,条件数显著提高。尽管适当的格林函数局部化也取决于一般几何形状中不可用的渐进信息,我们可以在频率扫描从小到大频率的过程中自适应地构造它,这种方式会自动考虑一般入射波。我们证明了该方法对非凸、多散射和甚至近陷阱域具有鲁棒性,尽管在后者情况下压缩率明显较低。此外,尽管其渐进性质,该方法对低阶离散化如分段常数、线性或立方体,通常在应用中使用,具有鲁棒性。另一方面,我们没有减少与传统经典离散化相比的总自由度数量。该方法的组合...

英文摘要

Wave propagation and scattering problems in acoustics are often solved with boundary element methods. They lead to a discretization matrix that is typically dense and large: its size and condition number grow with increasing frequency. Yet, high frequency scattering problems are intrinsically local in nature, which is well represented by highly localized rays bouncing around. Asymptotic methods can be used to reduce the size of the linear system, even making it frequency independent, by explicitly extracting the oscillatory properties from the solution using ray tracing or analogous techniques. However, ray tracing becomes expensive or even intractable in the presence of (multiple) scattering obstacles with complicated geometries. In this paper, we start from the same discretization that constructs the fully resolved large and dense matrix, and achieve asymptotic compression by explicitly localizing the Green's function instead. This results in a large but sparse matrix, with a faster associated matrix-vector product and, as numerical experiments indicate, a much improved condition number. Though an appropriate localisation of the Green's function also depends on asymptotic information unavailable for general geometries, we can construct it adaptively in a frequency sweep from small to large frequencies in a way which automatically takes into account a general incident wave. We show that the approach is robust with respect to non-convex, multiple and even near-trapping domains, though the compression rate is clearly lower in the latter case. Furthermore, in spite of its asymptotic nature, the method is robust with respect to low-order discretizations such as piecewise constants, linears or cubics, commonly used in applications. On the other hand, we do not decrease the total number of degrees of freedom compared to a conventional classical discretization. The combination of the ...

1801.00048 2026-06-04 eess.SY cs.AI cs.SY q-bio.NC

Characterizing optimal hierarchical policy inference on graphs via non-equilibrium thermodynamics

通过非平衡热力学刻画图上最优分层策略推断

Daniel McNamee

发表机构 * Computational and Biological Learning Lab, University of Cambridge(计算与生物学习实验室,剑桥大学)

AI总结 本文提出一种基于图的非平衡热力学方法,用于构建和推断最优分层策略,解决状态空间在不同空间分辨率下的层次结构构建问题。

Comments NIPS 2017 Workshop on Hierarchical Reinforcement Learning. 8 pages, 1 figure

详情
AI中文摘要

层次结构在随机最优控制和生物控制中具有根本性意义,因其能促进控制算法中的多种有利计算特性,并可能成为传感器运动和认知控制系统的核心原理。然而,理论上尚未明确构建所有空间分辨率下的状态空间层次结构及其通过策略推断过程演变的方法。本文在图的背景下引入了一种形式化方法,用于推导离散马尔可夫决策过程的规范表示。所得到的层次结构对应于一种分层策略推断算法,该算法近似了由先验和最优策略生成的状态空间轨迹密度之间的离散梯度流。

英文摘要

Hierarchies are of fundamental interest in both stochastic optimal control and biological control due to their facilitation of a range of desirable computational traits in a control algorithm and the possibility that they may form a core principle of sensorimotor and cognitive control systems. However, a theoretically justified construction of state-space hierarchies over all spatial resolutions and their evolution through a policy inference process remains elusive. Here, a formalism for deriving such normative representations of discrete Markov decision processes is introduced in the context of graphs. The resulting hierarchies correspond to a hierarchical policy inference algorithm approximating a discrete gradient flow between state-space trajectory densities generated by the prior and optimal policies.

1710.09668 2026-06-04 math.NA cs.LG cs.NA cs.NE stat.ML

PDE-Net: Learning PDEs from Data

PDE-Net:从数据中学习偏微分方程

Zichao Long, Yiping Lu, Xianzhong Ma, Bin Dong

发表机构 * School of Mathematical Sciences(数学科学学院) School of Mathematical Sciences, Peking University(北京大学数学科学学院) Peking University, Beijing, China(北京大学北京中国) Beijing Computational Science Research Center(北京计算科学研究中心) Beijing International Center for Mathematical Research, Peking University(北京大学北京国际数学研究中心) Center for Data Science, Peking University(北京大学数据科学中心) Beijing Institute of Big Data Research(北京大数据研究院)

AI总结 本文提出PDE-Net,通过学习卷积核来获取微分算子,同时近似未知非线性响应,灵活地揭示复杂系统的动力学和隐藏的PDE模型。

Comments 15 pages, 13 figures

详情
AI中文摘要

本文提出了一种新的前馈深度网络PDE-Net,旨在同时准确预测复杂系统的动力学并揭示隐藏的PDE模型。PDE-Net通过学习卷积核来获取微分算子,并利用神经网络或其他机器学习方法近似未知的非线性响应。与现有方法相比,我们的方法通过学习微分算子和非线性响应具有最大的灵活性。PDE-Net的特殊之处在于所有滤波器都受到适当约束,这使我们能够轻松识别 governing PDE 模型,同时保持网络的表达力和预测能力。这些约束通过充分利用微分算子的阶数与滤波器的阶数总和规则(源自小波理论的重要概念)精心设计。我们还讨论了PDE-Net与计算机视觉中的一些现有网络如Network-In-Network (NIN) 和 Residual Neural Network (ResNet) 的关系。数值实验表明,PDE-Net有潜力揭示观测动态的隐藏PDE,并在噪声环境中预测相对较长的时间内的动态行为。

英文摘要

In this paper, we present an initial attempt to learn evolution PDEs from data. Inspired by the latest development of neural network designs in deep learning, we propose a new feed-forward deep network, called PDE-Net, to fulfill two objectives at the same time: to accurately predict dynamics of complex systems and to uncover the underlying hidden PDE models. The basic idea of the proposed PDE-Net is to learn differential operators by learning convolution kernels (filters), and apply neural networks or other machine learning methods to approximate the unknown nonlinear responses. Comparing with existing approaches, which either assume the form of the nonlinear response is known or fix certain finite difference approximations of differential operators, our approach has the most flexibility by learning both differential operators and the nonlinear responses. A special feature of the proposed PDE-Net is that all filters are properly constrained, which enables us to easily identify the governing PDE models while still maintaining the expressive and predictive power of the network. These constrains are carefully designed by fully exploiting the relation between the orders of differential operators and the orders of sum rules of filters (an important concept originated from wavelet theory). We also discuss relations of the PDE-Net with some existing networks in computer vision such as Network-In-Network (NIN) and Residual Neural Network (ResNet). Numerical experiments show that the PDE-Net has the potential to uncover the hidden PDE of the observed dynamics, and predict the dynamical behavior for a relatively long time, even in a noisy environment.

1501.05151 2026-06-04 eess.SY cs.RO cs.SY

Recursive Bayesian Filtering in Circular State Spaces

循环状态空间中的递归贝叶斯滤波

Gerhard Kurz, Igor Gilitschenski, Uwe D. Hanebeck

发表机构 * Intelligent Sensor-Actuator-Systems Laboratory (ISAS)(智能传感器-执行器系统实验室) Institute for Anthropomatics and Robotics(人机学与机器人研究所) Karlsruhe Institute of Technology (KIT), Germany(卡尔斯鲁厄理工学院(KIT),德国)

AI总结 本文提出了一种基于循环统计的递归滤波框架,利用循环分布(如环绕正态分布和von Mises分布)估计循环状态,并通过高效确定性采样技术处理非线性系统和测量函数,引入了分布无关预测算法和改进的环绕正态密度乘法公式。

详情
AI中文摘要

针对基于循环统计的递归循环滤波,我们介绍了一个通用框架,用于基于不同循环分布(特别是环绕正态分布和von Mises分布)估计循环状态。我们提出了一种用于具有非线性系统和测量函数的循环系统的估计方法,通过依赖高效的确定性采样技术实现。此外,我们展示了在多种重要特殊情况下计算如何简化,例如具有加性噪声的系统以及身份系统或测量函数。我们引入了几个新的关键组件,特别是分布无关的预测算法、新的和更优的环绕正态密度乘法公式,以及处理非加性系统噪声的能力。所有提出的方法都经过彻底评估,并与几种最先进的解决方案进行了比较。

英文摘要

For recursive circular filtering based on circular statistics, we introduce a general framework for estimation of a circular state based on different circular distributions, specifically the wrapped normal distribution and the von Mises distribution. We propose an estimation method for circular systems with nonlinear system and measurement functions. This is achieved by relying on efficient deterministic sampling techniques. Furthermore, we show how the calculations can be simplified in a variety of important special cases, such as systems with additive noise as well as identity system or measurement functions. We introduce several novel key components, particularly a distribution-free prediction algorithm, a new and superior formula for the multiplication of wrapped normal densities, and the ability to deal with non-additive system noise. All proposed methods are thoroughly evaluated and compared to several state-of-the-art solutions.

1712.09999 2026-06-04 math.NA cs.LG cs.NA

Parallel Active Subspace Decomposition for Scalable and Efficient Tensor Robust Principal Component Analysis

并行主动子空间分解用于可扩展和高效的张量鲁棒主成分分析

Jonathan Q. Jiang, Michael K. Ng

发表机构 * Department of Mathematics, Hong Kong Baptist University(香港 Baptist 大学数学系)

AI总结 本文提出并行主动子空间分解方法,通过将张量展开的每个模式分解为正交矩阵和小矩阵,降低核范数最小化问题的规模,提升张量鲁棒主成分分析的效率和精度。

Comments 19 pages, 2 figures, 2 tables

详情
AI中文摘要

张量鲁棒主成分分析(TRPCA)在多个领域受到广泛关注。现有方法通常依赖张量核范数最小化,但每次迭代都需要多个奇异值分解(SVD),导致计算成本高昂。为克服这一缺点,我们提出了一种可扩展且高效的方法,称为并行主动子空间分解(PASD),该方法将张量展开的每个模式分解为列正交矩阵(主动子空间)和另一个小矩阵。这种变换导致了一个非凸优化问题,其中核范数最小化的规模通常比原始问题小得多。此外,我们引入交替方向乘子法(ADMM)来解决改写的问题,并提供其收敛性和次优性的严格分析。在合成和真实数据上的实验结果表明,我们的算法比最先进的方法更准确,并且快了多个数量级。

英文摘要

Tensor robust principal component analysis (TRPCA) has received a substantial amount of attention in various fields. Most existing methods, normally relying on tensor nuclear norm minimization, need to pay an expensive computational cost due to multiple singular value decompositions (SVDs) at each iteration. To overcome the drawback, we propose a scalable and efficient method, named Parallel Active Subspace Decomposition (PASD), which divides the unfolding along each mode of the tensor into a columnwise orthonormal matrix (active subspace) and another small-size matrix in parallel. Such a transformation leads to a nonconvex optimization problem in which the scale of nulcear norm minimization is generally much smaller than that in the original problem. Furthermore, we introduce an alternating direction method of multipliers (ADMM) method to solve the reformulated problem and provide rigorous analyses for its convergence and suboptimality. Experimental results on synthetic and real-world data show that our algorithm is more accurate than the state-of-the-art approaches, and is orders of magnitude faster.

1707.07342 2026-06-04 eess.SY cs.LG cs.SY

An Online Learning Approach to Buying and Selling Demand Response

面向购买和销售需求响应的在线学习方法

Kia Khezeli, Eilyan Bitar

发表机构 * School of Electrical and Computer Engineering, Cornell University, Ithaca, NY 14853, USA.(电气与计算机工程系,康奈尔大学,纽约州伊萨卡市,14853,美国)

AI总结 本文提出一种在线学习方法,用于协调聚合商在固定居民客户群中购买需求削减,并在双结算批发市场中销售总体需求削减。研究通过动态定价和合同策略,在未知需求模型下最大化预期利润。

详情
AI中文摘要

我们采用聚合商的视角,旨在协调其从固定居民电力客户群中购买需求削减与在双结算批发市场中销售总体需求削减。聚合商通过向客户提供统一价格来获取需求削减,该价格相对于其预定基准。在实现总体需求削减之前,聚合商还必须确定向双结算能源市场出售多少能源。在日间市场中,聚合商承诺一份远期合同,要求在实时市场交付能源。基础需求曲线,将总体需求削减与聚合商提供的价格相关联,被假设为线性且受不可观测的随机冲击影响。假设聚合商最初不知道需求曲线的参数和随机冲击的分布,我们研究聚合商在T天时间窗口内动态调整报价和远期合同以最大化预期利润的程度。具体而言,我们设计了一种动态定价和合同提供策略,解决聚合商学习未知需求模型与最大化时间累积预期利润之间的需求。特别地,所提出的定价策略被证明在T天内产生的遗憾不超过O(log(T)√T)。

英文摘要

We adopt the perspective of an aggregator, which seeks to coordinate its purchase of demand reductions from a fixed group of residential electricity customers, with its sale of the aggregate demand reduction in a two-settlement wholesale energy market. The aggregator procures reductions in demand by offering its customers a uniform price for reductions in consumption relative to their predetermined baselines. Prior to its realization of the aggregate demand reduction, the aggregator must also determine how much energy to sell into the two-settlement energy market. In the day-ahead market, the aggregator commits to a forward contract, which calls for the delivery of energy in the real-time market. The underlying aggregate demand curve, which relates the aggregate demand reduction to the aggregator's offered price, is assumed to be affine and subject to unobservable, random shocks. Assuming that both the parameters of the demand curve and the distribution of the random shocks are initially unknown to the aggregator, we investigate the extent to which the aggregator might dynamically adapt its offered prices and forward contracts to maximize its expected profit over a time window of $T$ days. Specifically, we design a dynamic pricing and contract offering policy that resolves the aggregator's need to learn the unknown demand model with its desire to maximize its cumulative expected profit over time. In particular, the proposed pricing policy is proven to incur a regret over $T$ days that is no greater than $O(\log(T)\sqrt{T})$.

1710.10737 2026-06-04 math.OC cs.LG cs.NA math.NA stat.ML

Linearly convergent stochastic heavy ball method for minimizing generalization error

用于最小化泛化误差的线性收敛随机重力球方法

Nicolas Loizou, Peter Richtárik

发表机构 * University of Edinburgh, United Kingdom(爱丁堡大学,英国) KAUST, Kingdom of Saudi Arabia(王国沙特阿拉伯的KAUST)

AI总结 本文首次证明了随机重力球方法的线性收敛性,通过固定步长的SGD步骤结合重力球动量项,专注于最小化期望损失而非有限和最小化。

Comments NIPS 2017, Workshop on Optimization for Machine Learning (camera ready version)

详情
AI中文摘要

本文首次建立了随机重力球方法的线性收敛性结果。该方法通过固定步长的SGD步骤结合重力球动量项进行优化。在分析中,我们专注于最小化期望损失,而非通常更困难的有限和最小化问题。尽管分析中我们限制在二次损失下,但总体目标不一定是强凸的。

英文摘要

In this work we establish the first linear convergence result for the stochastic heavy ball method. The method performs SGD steps with a fixed stepsize, amended by a heavy ball momentum term. In the analysis, we focus on minimizing the expected loss and not on finite-sum minimization, which is typically a much harder problem. While in the analysis we constrain ourselves to quadratic loss, the overall objective is not necessarily strongly convex.

1705.08927 2026-06-04 quant-ph cs.AI cs.ET cs.SY eess.SY

Compiling quantum circuits to realistic hardware architectures using temporal planners

利用时间规划器将量子电路编译到现实硬件架构

Davide Venturelli, Minh Do, Eleanor Rieffel, Jeremy Frank

发表机构 * NASA Ames Research Center, Quantum Artificial Intelligence Laboratory(美国国家航空航天局阿姆斯研究中心,量子人工智能实验室) USRA Research Institute for Advanced Computer Science (RIACS)(美国宇航局高级计算机科学研究所(RIACS)) Stinger Ghaffarian Technologies (SGT Inc.)(Stinger Ghaffarian技术(SGT公司)) NASA Ames Research Center, Planning and Scheduling Group(美国国家航空航天局阿姆斯研究中心,计划与调度组)

AI总结 本文研究了将量子电路编译到新兴量子硬件的时空规划方法,重点探讨了超导架构的最近邻约束,并通过QAOA电路的实验验证了时间规划在编译优化中的可行性。

Comments updated manuscript, more planners and results

详情
Journal ref
2017 Quantum Sci. Technol. - also related to proceedings of IJCAI 2017, and ICAPS SPARK Workshop 2017
AI中文摘要

为了在新兴门模型量子硬件上运行量子算法,量子电路必须被编译以考虑硬件的限制。对于近期硬件,由于只能有限地缓解退相干,最小化电路持续时间至关重要。我们研究了将时间规划器应用于量子电路编译到新兴量子硬件的问题。虽然我们的方法是通用的,但我们专注于编译到具有最近邻约束的超导硬件架构。我们的初步实验集中在编译具有高数量交换门的量子交替算子范式(QAOA)电路,这些交换门允许在应用门的顺序上具有极大的灵活性。这种自由度使找到最优编译更具挑战性,但也意味着更优化的编译可能带来更大的收益。我们将这个量子电路编译问题映射到时间规划问题,并为不同大小的QAOA电路生成了一个测试集,以现实硬件架构为目标。我们报告了几个最先进的时间规划器在该测试集上的编译结果。这项早期的实证评估表明,时间规划是量子电路编译的一种可行方法。

英文摘要

To run quantum algorithms on emerging gate-model quantum hardware, quantum circuits must be compiled to take into account constraints on the hardware. For near-term hardware, with only limited means to mitigate decoherence, it is critical to minimize the duration of the circuit. We investigate the application of temporal planners to the problem of compiling quantum circuits to newly emerging quantum hardware. While our approach is general, we focus on compiling to superconducting hardware architectures with nearest neighbor constraints. Our initial experiments focus on compiling Quantum Alternating Operator Ansatz (QAOA) circuits whose high number of commuting gates allow great flexibility in the order in which the gates can be applied. That freedom makes it more challenging to find optimal compilations but also means there is a greater potential win from more optimized compilation than for less flexible circuits. We map this quantum circuit compilation problem to a temporal planning problem, and generated a test suite of compilation problems for QAOA circuits of various sizes to a realistic hardware architecture. We report compilation results from several state-of-the-art temporal planners on this test set. This early empirical evaluation demonstrates that temporal planning is a viable approach to quantum circuit compilation.

1605.06645 2026-06-04 math.OC cs.RO cs.SY eess.SY math.DS

Full-Pose Tracking Control for Aerial Robotic Systems with Laterally-Bounded Input Force

具有横向受限输入力的空中机器人系统的全姿态跟踪控制

Antonio Franchi, Ruggero Carli, Davide Bicego, Markus Ryll

发表机构 * Department of Information Engineering, University of Padova(意大利帕多瓦大学信息工程系)

AI总结 本文提出了一种新的几何控制策略,用于实现具有横向受限输入力的空中机器人系统在SE(3)中的全姿态轨迹跟踪,通过Lyapunov方法证明了可行全姿态参考轨迹的指数跟踪,并展示了该方法在欠驱动和全驱动平台中的应用效果。

详情
Journal ref
IEEE Trascation on Robotics, 2018
AI中文摘要

本文定义了一类通用的抽象空中机器人系统,称为横向受限力(LBF)车辆,其中大部分控制权沿主推力方向表达,而在横向方向上可能利用较小甚至为零的力来实现全姿态跟踪。此类系统能够很好地近似具有非共面/非共线旋翼的平台,这些平台可以使用倾斜螺旋桨略微改变总推力相对于机体框架的方向。对于此类广泛系统,我们引入了一种新的几何控制策略,以在力约束允许的情况下实现位置加姿态轨迹的独立跟踪。使用SE(3)中的Lyapunov技术证明了可行的全姿态参考轨迹的指数跟踪。该方法可以无缝处理欠驱动和全驱动的LBF平台。控制器在提供不可行的全姿态参考轨迹时,至少保证位置部分的跟踪。本文提供了几个实验测试,清晰展示了该方法的实用性以及与现有方法相比的显著改进。

英文摘要

In this paper, we define a general class of abstract aerial robotic systems named Laterally Bounded Force (LBF) vehicles, in which most of the control authority is expressed along a principal thrust direction, while in the lateral directions a (smaller and possibly null) force may be exploited to achieve full-pose tracking. This class approximates well platforms endowed with non-coplanar/non-collinear rotors that can use the tilted propellers to slightly change the orientation of the total thrust w.r.t. the body frame. For this broad class of systems, we introduce a new geometric control strategy in SE(3) to achieve, whenever made possible by the force constraints, the independent tracking of position-plus-orientation trajectories. The exponential tracking of a feasible full-pose reference trajectory is proven using a Lyapunov technique in SE(3). The method can deal seamlessly with both under- and fully-actuated LBF platforms. The controller guarantees the tracking of at least the positional part in the case that an unfeasible full-pose reference trajectory is provided. The paper provides several experimental tests clearly showing the practicability of the approach and the sharp improvement with respect to state of-the-art approaches.

1712.06021 2026-06-04 math.OC cs.RO cs.SY eess.SY

Bendable Cuboid Robot Path Planning with Collision Avoidance using Generalized $L_p$ Norms

可弯曲立方体机器人路径规划与碰撞避免使用广义L_p范数

Nak-seung P. Hyun, Patricio A. Vela, Erik I. Verriest

发表机构 * Georgia Institute of Technology(佐治亚理工学院)

AI总结 本文提出基于广义L_p范数的立方体机器人路径规划方法,通过隐式表面近似和双重优化问题解决碰撞避免问题,验证了方法的有效性。

Comments 12 pages, 6 figures

详情
AI中文摘要

本文考虑了刚性及可变形(可弯曲)立方体机器人的最优路径规划问题,通过提供基于广义L_p范数的解析安全约束来实现。对于常规立方体机器人,加权L_p范数的等高线生成其表面的隐式近似。对于可弯曲立方体机器人,极坐标中的加权L_p范数通过指定等高线隐式近似表面边界。环境中的障碍物体积被假设为加权L_p范数的子等高线近似。利用这些近似表面模型,最优安全路径规划问题被重新表述为一个两阶段优化问题,其中安全约束依赖于机器人上最接近障碍物的点。通过推导一组等式和不等式约束来替代最近点问题,从而为原始路径规划问题定义了额外的解析约束。将所有解析约束通过逻辑与操作结合,得到一个通用的最优安全路径规划问题。数值求解该问题涉及将其转换为非线性规划问题。对刚性和可弯曲立方体机器人进行了仿真验证。

英文摘要

Optimal path planning problems for rigid and deformable (bendable) cuboid robots are considered by providing an analytic safety constraint using generalized $L_p$ norms. For regular cuboid robots, level sets of weighted $L_p$ norms generate implicit approximations of their surfaces. For bendable cuboid robots a weighted $L_p$ norm in polar coordinates implicitly approximates the surface boundary through a specified level set. Obstacle volumes, in the environment to navigate within, are presumed to be approximately described as sub-level sets of weighted $L_p$ norms. Using these approximate surface models, the optimal safe path planning problem is reformulated as a two stage optimization problem, where the safety constraint depends on a point on the robot which is closest to the obstacle in the obstacle's distance metric. A set of equality and inequality constraints are derived to replace the closest point problem, which is then defines additional analytic constraints on the original path planning problem. Combining all the analytic constraints with logical AND operations leads to a general optimal safe path planning problem. Numerically solving the problem involve conversion to a nonlinear programing problem. Simulations for rigid and bendable cuboid robot verify the proposed method.

1712.04612 2026-06-04 q-fin.CP cs.AI cs.CE cs.LG cs.SY eess.SY

Inverse Reinforcement Learning for Marketing

营销中的逆强化学习

Igor Halperin

发表机构 * NYU Tandon School of Engineering(纽约大学坦顿工程学院)

AI总结 本文提出利用逆强化学习研究动态消费者需求,通过最大熵方法构建可 tractable 模型,展示观测噪声可能被误认为消费者异质性。

Comments 18 pages, 5 figures

详情
AI中文摘要

从观察行为中学习顾客偏好是营销文献中的重要课题。结构模型通常将前瞻性顾客或企业建模为效用最大化代理,其效用通过随机最优控制方法估计。本文提出基于逆强化学习(IRL)的替代方法研究动态消费者需求。我们开发了一种最大熵IRL的变种,导致高度可 tractable 的模型公式,最终转化为低维凸优化以寻找最优模型参数。通过消费者需求的模拟,我们显示相同顾客的观测噪声可以轻易被误认为显而易见的消费者异质性。

英文摘要

Learning customer preferences from an observed behaviour is an important topic in the marketing literature. Structural models typically model forward-looking customers or firms as utility-maximizing agents whose utility is estimated using methods of Stochastic Optimal Control. We suggest an alternative approach to study dynamic consumer demand, based on Inverse Reinforcement Learning (IRL). We develop a version of the Maximum Entropy IRL that leads to a highly tractable model formulation that amounts to low-dimensional convex optimization in the search for optimal model parameters. Using simulations of consumer demand, we show that observational noise for identical customers can be easily confused with an apparent consumer heterogeneity.

1701.08735 2026-06-04 eess.SY cs.RO cs.SY math.OC

Real-Time Control for Autonomous Racing Based on Viability Theory

基于可行性理论的实时自主赛车控制

Alexander Liniger, John Lygeros

发表机构 * Automatic Control Laboratory, ETH Zurich(自动控制实验室,苏黎世联邦理工学院)

AI总结 本文基于可行性理论生成有限步前瞻轨迹,结合低层模型预测控制器实现实时自主赛车,通过游戏理论方法改进可行性核计算,提升安全性并减少 lap 时间。

Comments 26 pages, 11 figures

详情
AI中文摘要

本文研究了微型赛车的自主驾驶问题,利用可行性核生成递归可行的有限步前瞻轨迹,以最大化进度并保持在静态障碍物内。该方法结合低层模型预测控制器实现实时自主赛车。可行性核计算基于空间离散化,提出基于游戏理论的新型数值方案,特别是辨别核,以提高计算鲁棒性。结果显示,使用辨别核的保守近似方法在安全性上有所提升,但略微增加 lap 时间。

英文摘要

In this paper we consider autonomous driving of miniature race cars. The viability kernel is used to efficiently generate finite look-ahead trajectories that maximize progress while remaining recursively feasible with respect to static obstacles (e.g., stay inside the track). Together with a low-level model predictive controller, this method makes real-time autonomous racing possible. The viability kernel computation is based on space discretization. To make the calculation robust against discretization errors, we propose a novel numerical scheme based on game theoretical methods, in particular the discriminating kernel. We show that the resulting algorithm provides an inner approximation of the viability kernel and guarantees that, for all states in the cell surrounding a viable grid point, there exists a control that keeps the system within the kernel. The performance of the proposed control method is studied in simulation where we determine the effects of various design choices and parameters and in experiments on an autonomous racing set-up maintained at the Automatic Control Laboratory of ETH Zurich. Both simulation and experimental results suggest that the more conservative approximation using the discriminating kernel results in safer driving style at the cost of a small increase in lap time.

1710.01719 2026-06-04 eess.SY cs.LG cs.SY math.DS math.OC

Decomposition of Nonlinear Dynamical Systems Using Koopman Gramians

利用Koopman格拉姆矩阵分解非线性动力系统

Zhiyuan Liu, Soumya Kundu, Lijun Chen, Enoch Yeung

发表机构 * Pacific Northwest National Laboratory(太平洋西北国家实验室)

AI总结 本文提出了一种新的Koopman算子方法,用于利用Koopman格拉姆矩阵分解非线性动力系统,介绍了输入-Koopman算子,并展示了如何将其用于将非线性系统转换为经典状态空间形式,以及输入和状态可观测函数分离的条件。

Comments 8 pages, submitted to IEEE 2018 ACC

详情
AI中文摘要

在本文中,我们提出了一种新的Koopman算子方法,用于利用Koopman格拉姆矩阵分解非线性动力系统。我们引入了输入-Koopman算子的概念,并展示了如何利用输入-Koopman算子将非线性系统转换为经典状态空间形式,并确定输入和状态可观测函数分离的条件。然后,我们扩展了现有的动态模式分解方法,用于从数据中学习Koopman算子,称为深度动态模式分解,以适用于具有控制或扰动的系统。我们通过学习一个输入-状态分离的Koopman算子来演示该方法的准确性,即使底层系统表现出混合的状态-输入项。我们接下来介绍了一种基于Koopman格拉姆矩阵的非线性分解算法,该算法最大化内部子系统的可观测性,并从其他子系统的噪声中减少扰动。我们推导了基于Koopman格拉姆矩阵和多维分区的放松方法,用于解决由此产生的NP难分解问题。最后,我们用IEEE 39节点系统的摆动动力学来演示所提出的算法。

英文摘要

In this paper we propose a new Koopman operator approach to the decomposition of nonlinear dynamical systems using Koopman Gramians. We introduce the notion of an input-Koopman operator, and show how input-Koopman operators can be used to cast a nonlinear system into the classical state-space form, and identify conditions under which input and state observable functions are well separated. We then extend an existing method of dynamic mode decomposition for learning Koopman operators from data known as deep dynamic mode decomposition to systems with controls or disturbances. We illustrate the accuracy of the method in learning an input-state separable Koopman operator for an example system, even when the underlying system exhibits mixed state-input terms. We next introduce a nonlinear decomposition algorithm, based on Koopman Gramians, that maximizes internal subsystem observability and disturbance rejection from unwanted noise from other subsystems. We derive a relaxation based on Koopman Gramians and multi-way partitioning for the resulting NP-hard decomposition problem. We lastly illustrate the proposed algorithm with the swing dynamics for an IEEE 39-bus system.

1712.00390 2026-06-04 eess.SY cs.RO cs.SY

Gain Scheduling LPV Control Scheme for the Autonomous Guidance Problem using a Dynamic Modelling Approach

基于动态建模方法的自主引导问题的增益调度LPV控制方案

Eugenio Alcalá, Vicenç Puig, Joseba Quevedo, Teresa Escobet

发表机构 * 1 Advanced Control Systems Group, Automatic Control Department, Universitat Polit\` e cnica de Catalunya (UPC), Rambla Sant Nebridi, 10, 08222, Terrassa, Spain 11pt 13.2pt This paper is a preprint of a paper submitted to IET Control Theory \& Applications. If accepted, copy of record will be available at the IET Digital Library.

AI总结 本文提出了一种用于城市自动驾驶车辆纵向和横向控制的增益调度LPV控制方法,通过动力学和运动学模型构建LPV表示,并采用级联控制方法同时控制车辆行为,通过解决两个LPV LMI-LQR问题实现性能优化。

详情
AI中文摘要

本文提出了一种用于城市自动驾驶车辆纵向和横向控制的增益调度LPV控制方法。利用运动学和动力学车辆模型,采用线性参数变化(LPV)表示,并提出了一种用于控制两种车辆行为的级联控制方法。特别是,在控制设计中,分别使用两种模型解决两个LPV LMI-LQR问题。此外,为了实现所需的性能水平,提出了一种基于运动学和动力学控制器级联设计的方法。该级联控制方案基于动态闭环行为比运动学闭环行为更快的设计理念。所获得的增益调度LPV控制方法,与轨迹生成模块结合,在模拟城市驾驶场景中展示了良好的效果。

英文摘要

This work proposes a solution for the longitudinal and lateral control problem of urban autonomous vehicles using a gain scheduling LPV control approach. Using the kinematic and dynamic vehicle models, a linear parameter varying (LPV) representation is adopted and a cascade control methodology is proposed for controlling both vehicle behaviours. In particular, for the control design, the use of both models separately lead to solve two LPV LMI-LQR problems. Furthermore, to achieve the desired levels of performance, an approach based on cascade design of the the kinematic and dynamic controllers has been proposed. This cascade control scheme is based on the idea that the dynamic closed loop behaviour is designed to be faster than the kinematic closed loop one. The obtained gain scheduling LPV control approach, jointly with a trajectory generation module, has presented suitable results in a simulated city driving scenario.

1711.11075 2026-06-04 math.NA cs.CV cs.NA

A fast nonconvex Compressed Sensing algorithm for highly low-sampled MR images reconstruction

一种快速非凸压缩感知算法用于高采样率MRI图像重建

Damiana Lazzaro, Elena Loli Piccolomini, Fabiana Zama

发表机构 * Department of Mathematics, University of Bologna(博洛尼亚大学数学系)

AI总结 本文提出一种快速高效的MRI图像重建算法,通过非凸正则化目标函数和最小二乘数据拟合约束,解决严重欠采样数据的重建问题,证明了算法的收敛性。

详情
AI中文摘要

本文提出了一种快速且高效的MRI图像重建方法,将压缩感知理论建模为具有非凸正则化目标函数的约束最小化问题。我们提出了一种名为快速非凸重加权(FNCR)的算法,基于迭代方案,通过凸线性化近似非凸问题,并自动更新惩罚参数。凸问题通过前向-后向过程求解,其中后向步骤通过分裂Bregman策略实现。此外,我们提出了一种新的高效迭代求解器用于出现的线性系统。我们证明了所提出的FNCR方法的收敛性。在合成假人和真实图像上的结果表明,该算法表现优异且计算高效,即使与文献中表现最佳的方法相比也是如此。

英文摘要

In this paper we present a fast and efficient method for the reconstruction of Magnetic Resonance Images (MRI) from severely under-sampled data. From the Compressed Sensing theory we have mathematically modeled the problem as a constrained minimization problem with a family of non-convex regularizing objective functions depending on a parameter and a least squares data fit constraint. We propose a fast and efficient algorithm, named Fast NonConvex Reweighting (FNCR) algorithm, based on an iterative scheme where the non-convex problem is approximated by its convex linearization and the penalization parameter is automatically updated. The convex problem is solved by a Forward-Backward procedure, where the Backward step is performed by a Split Bregman strategy. Moreover, we propose a new efficient iterative solver for the arising linear systems. We prove the convergence of the proposed FNCR method. The results on synthetic phantoms and real images show that the algorithm is very well performing and computationally efficient, even when compared to the best performing methods proposed in the literature.

1711.11018 2026-06-04 eess.SY cs.RO cs.SY math.OC

PDE-Based Optimization for Stochastic Mapping and Coverage Strategies using Robotic Ensembles

基于偏微分方程的机器人群体映射与覆盖策略优化

Karthik Elamvazhuthi, Hendrik Kuiper, Spring Berman

发表机构 * School for Engineering of Matter, Transport and Energy, Arizona State University(物质、传输与能量工程学院,亚利桑那州立大学) School of Mathematical and Statistical Sciences, Arizona State University(数学与统计科学学院,亚利桑那州立大学)

AI总结 本文提出基于偏微分方程的机器人群体控制框架,用于解决具有随机行为的机器人在有限感知与驱动能力下的映射与覆盖任务,通过凸优化和最优控制方法实现区域空间分布的重建与活动率调控。

详情
AI中文摘要

本文提出了一种基于偏微分方程(PDE)的新型框架,用于控制具有有限感知和驱动能力且表现出随机行为的机器人群体,以执行映射和覆盖任务。我们将机器人群体的动态建模为一个输运-扩散-反应PDE模型,并将映射和覆盖任务建模为该模型的识别与控制问题。在映射任务中,机器人被部署在封闭域内以收集无定位信息的数据,用于重建目标区域的未知空间分布。我们将该任务建模为一个凸优化问题,其解表示为PDE模型中的空间依赖系数。然后考虑覆盖问题,其中机器人必须以可编程的概率率执行预期活动以实现目标区域的活动分布。我们将该任务建模为一个最优控制问题,其中PDE模型被表示为双线性控制系统,机器人覆盖活动率和速度场定义为控制输入。我们通过在两个环境中进行联合映射与覆盖场景的模拟来验证我们的方法。

英文摘要

This paper presents a novel partial differential equation (PDE)-based framework for controlling an ensemble of robots, which have limited sensing and actuation capabilities and exhibit stochastic behaviors, to perform mapping and coverage tasks. We model the ensemble population dynamics as an advection-diffusion-reaction PDE model and formulate the mapping and coverage tasks as identification and control problems for this model. In the mapping task, robots are deployed over a closed domain to gather data, which is unlocalized and independent of robot identities, for reconstructing the unknown spatial distribution of a region of interest. We frame this task as a convex optimization problem whose solution represents the region as a spatially-dependent coefficient in the PDE model. We then consider a coverage problem in which the robots must perform a desired activity at a programmable probability rate to achieve a target spatial distribution of activity over the reconstructed region of interest. We formulate this task as an optimal control problem in which the PDE model is expressed as a bilinear control system, with the robots' coverage activity rate and velocity field defined as the control inputs. We validate our approach with simulations of a combined mapping and coverage scenario in two environments with three target coverage distributions.

1711.09867 2026-06-04 math.NA cs.CV cs.NA

Accelerated Optimization in the PDE Framework: Formulations for the Active Contour Case

在PDE框架中实现加速优化:活动轮廓情况的公式化

Anthony Yezzi, Ganesh Sundaramoorthi

发表机构 * School of Electrical and Computer Engineering, Georgia Institute of Technology(电子工程学院,佐治亚理工学院) Electrical Engineering, King Abdullah University of Science and Technology(电气工程,国王阿卜杜勒-阿齐兹大学)

AI总结 本文探讨了在PDE框架中利用加速优化方法提升参数估计性能,通过变分框架和Bregman散度推导连续极限ODE,并扩展至无限维流形,引入共进化质量模型连接最优质量传输的流体力学公式化。

详情
AI中文摘要

在Nesterov开创性工作的基础上,加速优化方法已被用于显著提升一阶梯度基参数估计在第二阶优化策略不可用或不实际的场景中的性能。不仅加速梯度下降比传统梯度下降收敛更快,而且通过初始超调和随后振荡回荡,更稳健地搜索参数空间,从而选择仅局部最小值,其吸引基足够大以包含初始超调。这种行为使加速和随机梯度搜索方法在机器学习社区中特别受欢迎。在最近的PNAS 2016论文中,Wibisono、Wilson和Jordan展示了如何将广泛的一类加速方案用变分框架形式化,围绕Bregman散度,从而得到连续极限ODE。我们展示了其公式如何进一步扩展到无限维流形(从几何空间曲线和曲面开始)通过将Bregman散度替换为切空间上的内积,并显式引入一个与目标对象同时演变的分布式质量模型。这种共进化质量模型,仅为使优化具备有益的动力学而引入,也将由此得到的一类基于PDE的加速优化方案与最优质量传输的流体力学公式化联系起来。

英文摘要

Following the seminal work of Nesterov, accelerated optimization methods have been used to powerfully boost the performance of first-order, gradient-based parameter estimation in scenarios where second-order optimization strategies are either inapplicable or impractical. Not only does accelerated gradient descent converge considerably faster than traditional gradient descent, but it also performs a more robust local search of the parameter space by initially overshooting and then oscillating back as it settles into a final configuration, thereby selecting only local minimizers with a basis of attraction large enough to contain the initial overshoot. This behavior has made accelerated and stochastic gradient search methods particularly popular within the machine learning community. In their recent PNAS 2016 paper, Wibisono, Wilson, and Jordan demonstrate how a broad class of accelerated schemes can be cast in a variational framework formulated around the Bregman divergence, leading to continuum limit ODE's. We show how their formulation may be further extended to infinite dimension manifolds (starting here with the geometric space of curves and surfaces) by substituting the Bregman divergence with inner products on the tangent space and explicitly introducing a distributed mass model which evolves in conjunction with the object of interest during the optimization process. The co-evolving mass model, which is introduced purely for the sake of endowing the optimization with helpful dynamics, also links the resulting class of accelerated PDE based optimization schemes to fluid dynamical formulations of optimal mass transport.

1711.08512 2026-06-04 eess.SY cs.AI cs.SY

A Study on Modeling of Inputting Electrical Power of Ultra High Power Electric Furnace by using Fuzzy Rule and Regression Model

基于模糊规则和回归模型的超高压电炉输入电力建模研究

Choe Un-Chol, Yun Kum-Il, Kwak Son-Il

发表机构 * Faculty of Electronics & Automation, Kim Il Sung University(电子自动化学院,金日成大学)

AI总结 本文提出利用模糊规则和回归模型建立影响高超功率电炉熔炼过程的电力输入模型,并通过仿真实验验证其有效性。

Comments 8 pages, 3 figures, 1 table

详情
AI中文摘要

本文提出了一种方法,该方法通过模糊规则和回归模型来建立影响高超功率(UHP)电炉熔炼过程的电力输入模型,并通过仿真实验验证了该方法的有效性。

英文摘要

: In this paper a method to make inputting electrical model upon factors that affect melting process of high ultra power(UHP) electric furnace by using fuzzy rule and regression model is suggested and its effectiveness is verified with simulation experiment.

1709.02126 2026-06-04 eess.SY cs.AI cs.SY

Proceedings First Workshop on Formal Verification of Autonomous Vehicles

第一届自动驾驶车辆形式验证研讨会论文集

Lukas Bulwahn, Maryam Kamali, Sven Linker

发表机构 * International Conference on integrated Formal Methods(国际形式化方法会议) EPTCS(电子程序技术报告)

AI总结 本文集聚焦自动驾驶车辆的形式验证,汇集了形式验证领域研究人员及控制理论、机器人学等领域的专家,探讨验证技术在自动驾驶开发中的应用。

详情
Journal ref
EPTCS 257, 2017
AI中文摘要

这些是2017年9月19日在意大利都灵举行的自动驾驶车辆形式验证研讨会的论文集,作为国际集成形式方法会议(iFM 2017)的附属研讨会。研讨会旨在汇集形式验证社区中开发用于自动驾驶车辆的形式方法的研究人员,以及在控制理论或机器人学等领域工作的研究人员,探讨验证技术在自动驾驶车辆设计与开发中的应用。

英文摘要

These are the proceedings of the workshop on Formal Verification of Autonomous Vehicles, held on September 19th, 2017 in Turin, Italy, as an affiliated workshop of the International Conference on integrated Formal Methods (iFM 2017). The workshop aim is to bring together researchers from the formal verification community that are developing formal methods for autonomous vehicles as well as researchers working, e.g., in the area of control theory or robotics, interested in applying verification techniques for designing and developing of autonomous vehicles.