arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

1905.11011 2026-06-04 math.OC cs.AI cs.LG cs.SY eess.SY

Robustness of accelerated first-order algorithms for strongly convex optimization problems

强凸优化问题中加速一阶算法的鲁棒性

Hesameddin Mohammadi, Meisam Razaviyayn, Mihailo R. Jovanović

发表机构 * Ming Hsieh Department of Electrical and Computer Engineering（明希德电气与计算机工程系）； Daniel J. Epstein Department of Industrial and Systems Engineering（丹尼尔·J·埃普斯坦工业与系统工程系）

AI总结本文研究了在梯度评估中存在随机不确定性的加速一阶算法的鲁棒性，分析了噪声对优化变量均方误差的影响，并探讨了噪声放大与收敛速率之间的根本权衡。

Comments 45 pages, 6 figures

详情

AI中文摘要

我们研究了在梯度评估中存在随机不确定性的加速一阶算法的鲁棒性。具体而言，针对无约束、光滑、强凸优化问题，我们考察了在迭代项受到加性白噪声扰动时优化变量的均方误差。这种不确定性可能出现在通过真实系统的测量来近似梯度或在分布式网络计算中。尽管此类问题的一阶算法的动力学是非线性的，我们建立了均方偏离最优解的上界，这些上界在常数因子范围内是紧致的。我们的分析量化了通过任何类似于Nesterov或重力球方法的加速方案所获得的噪声放大与收敛速率之间的根本权衡。为了获得额外的分析洞察，对于强凸二次问题，我们明确地将优化变量的稳态方差表示为目标函数Hessian矩阵特征值的函数。我们证明了Hessian的整个谱，而不仅仅是极值特征值，影响噪声算法的鲁棒性。我们将这一结果专门应用于无向网络上的分布式平均问题，并考察了网络大小和拓扑结构对噪声加速算法鲁棒性的影响。

英文摘要

We study the robustness of accelerated first-order algorithms to stochastic uncertainties in gradient evaluation. Specifically, for unconstrained, smooth, strongly convex optimization problems, we examine the mean-squared error in the optimization variable when the iterates are perturbed by additive white noise. This type of uncertainty may arise in situations where an approximation of the gradient is sought through measurements of a real system or in a distributed computation over a network. Even though the underlying dynamics of first-order algorithms for this class of problems are nonlinear, we establish upper bounds on the mean-squared deviation from the optimal solution that are tight up to constant factors. Our analysis quantifies fundamental trade-offs between noise amplification and convergence rates obtained via any acceleration scheme similar to Nesterov's or heavy-ball methods. To gain additional analytical insight, for strongly convex quadratic problems, we explicitly evaluate the steady-state variance of the optimization variable in terms of the eigenvalues of the Hessian of the objective function. We demonstrate that the entire spectrum of the Hessian, rather than just the extreme eigenvalues, influence robustness of noisy algorithms. We specialize this result to the problem of distributed averaging over undirected networks and examine the role of network size and topology on the robustness of noisy accelerated algorithms.

URL PDF HTML ☆

赞 0 踩 0

1711.00493 2026-06-04 eess.SY cs.RO cs.SY eess.SP

Event-Triggered Diffusion Kalman Filters

事件触发扩散卡尔曼滤波器

Amr Alanwar, Hazem Said, Ankur Mehta, Matthias Althoff

发表机构 * Technical University of Munich（慕尼黑技术大学）； Ain Shams University（艾因夏姆斯大学）； University of California, Los Angeles（加州大学洛杉矶分校）

AI总结本文提出了一种事件触发的扩散卡尔曼滤波器，通过本地信号指示估计误差来收集测量并交换信息，从而减少资源消耗并提高分布式状态估计的有效性。

详情

AI中文摘要

分布式状态估计强烈依赖于协作信号处理，这通常需要在资源受限的传感器节点上执行过多的通信和计算。为了解决这个问题，我们提出了一种事件触发的扩散卡尔曼滤波器，该滤波器根据本地信号指示的估计误差收集测量并交换信息。在此基础上，我们开发了一种节能的状态估计算法，该算法调节无线网络中的资源消耗，并确保每个消耗的资源的有效性。所提出的算法不需要节点共享其本地协方差矩阵，从而允许显著减少传输信息的数量。为了验证其效率，我们将所提出算法应用于分布式的同时定位和时间同步问题，并在移动四旋翼节点和静止的定制超宽带无线设备的物理测试台上进行评估。获得的实验结果表明，所提出的算法在通信开销方面节省了与原始扩散卡尔曼滤波器相关的86%，而仅导致性能下降16%。我们在线提供了Matlab代码和实际测试数据。

英文摘要

Distributed state estimation strongly depends on collaborative signal processing, which often requires excessive communication and computation to be executed on resource-constrained sensor nodes. To address this problem, we propose an event-triggered diffusion Kalman filter, which collects measurements and exchanges messages between nodes based on a local signal indicating the estimation error. On this basis, we develop an energy-aware state estimation algorithm that regulates the resource consumption in wireless networks and ensures the effectiveness of every consumed resource. The proposed algorithm does not require the nodes to share its local covariance matrices, and thereby allows considerably reducing the number of transmission messages. To confirm its efficiency, we apply the proposed algorithm to the distributed simultaneous localization and time synchronization problem and evaluate it on a physical testbed of a mobile quadrotor node and stationary custom ultra-wideband wireless devices. The obtained experimental results indicate that the proposed algorithm allows saving 86% of the communication overhead associated with the original diffusion Kalman filter while causing deterioration of performance by 16% only. We make the Matlab code and the real testing data available online.

URL PDF HTML ☆

赞 0 踩 0

1602.02726 2026-06-04 math.OC cs.LG cs.NA math.NA

Local and Global Convergence of a General Inertial Proximal Splitting Scheme

局部和全局收敛性的一般惯性近端分裂方案

Patrick R. Johnstone, Pierre Moulin

发表机构 * Coordinated Science Laboratory, University of Illinois, Urbana, IL 61801, USA（协调科学实验室，伊利诺伊大学，厄巴纳，伊利诺伊州，61801，美国）

AI总结本文研究了希尔伯特空间中的凸复合最优化问题，提出了一种通用的惯性近端分裂算法（GIPSA），并证明了其迭代序列的平方增量和累积点的最优性，以及在最小值存在时的弱收敛性。进一步分析了ℓ1正则化优化问题，展示了GIPSA在特定参数选择下的局部收敛性和局部线性收敛性，以及其在FISTA变体中的应用。

Comments 33 pages 1 figure

详情

DOI: 10.1007/s10589-017-9896-7
Journal ref: Comput Optim Appl 67, 259-292 (2017)

AI中文摘要

本文关注希尔伯特空间中的凸复合最优化问题。在这些问题中，目标函数是两个闭合、proper且凸函数的和，其中一个是光滑的，另一个具有计算成本较低的近端算子。我们分析了一种通用的惯性近端分裂算法（GIPSA）以解决此类问题。我们建立了迭代序列平方增量之和的有限性和累积点的最优性。如果最小值被达到，则整个序列的弱收敛性随之成立。我们的分析统一并扩展了之前的一些结果。然后我们专注于ℓ1正则化优化，这是最常见的特殊情况，其中非光滑项是ℓ1范数。对于某些参数选择，GIPSA适用于此问题的局部分析。对于这些选择，我们证明GIPSA在有限次迭代内收敛到最优支持和符号，之后GIPSA减少到最小化局部光滑函数。在某些条件下，局部线性收敛性成立。我们以惯性、步长和局部曲率来确定收敛率。我们的局部分析适用于某些最近的快速迭代收缩阈值算法（FISTA）变体，我们在此类FISTA变体中建立了主动流形识别和局部线性收敛性。我们的分析促使在这些FISTA变体中使用动量重启方案以获得最优的局部线性收敛率。

英文摘要

This paper is concerned with convex composite minimization problems in a Hilbert space. In these problems, the objective is the sum of two closed, proper, and convex functions where one is smooth and the other admits a computationally inexpensive proximal operator. We analyze a general family of inertial proximal splitting algorithms (GIPSA) for solving such problems. We establish finiteness of the sum of squared increments of the iterates and optimality of the accumulation points. Weak convergence of the entire sequence then follows if the minimum is attained. Our analysis unifies and extends several previous results. We then focus on $\ell_1$-regularized optimization, which is the ubiquitous special case where the nonsmooth term is the $\ell_1$-norm. For certain parameter choices, GIPSA is amenable to a local analysis for this problem. For these choices we show that GIPSA achieves finite "active manifold identification", i.e. convergence in a finite number of iterations to the optimal support and sign, after which GIPSA reduces to minimizing a local smooth function. Local linear convergence then holds under certain conditions. We determine the rate in terms of the inertia, stepsize, and local curvature. Our local analysis is applicable to certain recent variants of the Fast Iterative Shrinkage-Thresholding Algorithm (FISTA), for which we establish active manifold identification and local linear convergence. Our analysis motivates the use of a momentum restart scheme in these FISTA variants to obtain the optimal local linear convergence rate.

URL PDF HTML ☆

赞 0 踩 0

1811.06838 2026-06-04 stat.ML cs.LG cs.NA math.NA

The Trace Criterion for Kernel Bandwidth Selection for Support Vector Data Description

核带宽选择的迹准则用于支持向量数据描述

Arin Chaudhuri, Carol Sadek, Deovrat Kakde, Wenhao Hu, Hansi Jiang, Seunghyun Kong, Yuewei Liao, Sergiy Peredriy, Haoyu Wang

发表机构 * Internet of Things, SAS Institute Inc., Cary, NC, 27513（物联网，SAS公司，北卡罗来纳州卡里，27513）

AI总结本文提出了一种新的无监督方法，用于选择支持向量数据描述（SVDD）中高斯核的带宽，通过利用核矩阵的低秩表示来建议带宽值，该方法在低维数据中与当前最佳方法竞争，并在许多高维数据类别中表现极佳。

Comments note: some text overlap with arXiv:1708.05106 because common background material is covered in both papers

详情

AI中文摘要

支持向量数据描述（SVDD）是一种流行的异常检测技术。SVDD分类器将整个数据空间划分为内群区域和外群区域。计算SVDD分类器需要一个核函数，高斯核是一个常见选择。高斯核有一个带宽参数，正确设置该参数对获得良好结果至关重要。小带宽会导致过拟合，使得SVDD分类器高估异常数量，而大带宽会导致欠拟合，无法检测许多异常。本文提出了一种新的无监督方法，用于选择高斯核的带宽。我们的方法利用核矩阵的低秩表示来建议带宽值。我们的新方法在低维数据中与当前最佳方法竞争，并在许多高维数据类别中表现极佳。由于当使用高斯核时，SVDD的数学公式与单类支持向量机（OCSVM）的数学公式相同，因此我们的方法同样适用于OCSVM的高斯核带宽调整。

英文摘要

Support vector data description (SVDD) is a popular anomaly detection technique. The SVDD classifier partitions the whole data space into an inlier region, which consists of the region near the training data, and an outlier region, which consists of points away from the training data. The computation of the SVDD classifier requires a kernel function, for which the Gaussian kernel is a common choice. The Gaussian kernel has a bandwidth parameter, and it is important to set the value of this parameter correctly for good results. A small bandwidth leads to overfitting such that the resulting SVDD classifier overestimates the number of anomalies, whereas a large bandwidth leads to underfitting and an inability to detect many anomalies. In this paper, we present a new unsupervised method for selecting the Gaussian kernel bandwidth. Our method exploits a low-rank representation of the kernel matrix to suggest a kernel bandwidth value. Our new technique is competitive with the current state of the art for low-dimensional data and performs extremely well for many classes of high-dimensional data. Because the mathematical formulation of SVDD is identical with the mathematical formulation of one-class support vector machines (OCSVM) when the Gaussian kernel is used, our method is equally applicable to Gaussian kernel bandwidth tuning for OCSVM.

URL PDF HTML ☆

赞 0 踩 0

1905.03416 2026-06-04 eess.SY cs.RO cs.SY math.OC

Prioritized Inverse Kinematics: Nonsmoothness, Trajectory Existence, Task Convergence, Stability

优先级逆运动学：非光滑性、轨迹存在性、任务收敛性、稳定性

Sang-ik An, Dongheui Lee

发表机构 * German Excellence Initiative（德国卓越计划）； Institute of Robotics and Mechatronics, German Aerospace Center（机器人与机电研究所，德国航空航天中心）

AI总结本文研究了一类优先级逆运动学（PIK）解的理论性质，探讨了其作为动态系统优先级多输出的输出调节或跟踪控制律的特性。首先，开发了研究PIK解非光滑性的工具，并发现非光滑性的充分条件。这表明经典定理无法保证满足PIK解的关节轨迹存在性和唯一性。因此，构建了一个利用PIK解结构信息的替代存在性和唯一性定理。接着，将PIK解的类别缩小到所有任务都遵循某些期望任务轨迹的情况，并发现与任务收敛性相关的性质。进一步分析了当所有任务都达到某些期望任务位置时，由PIK解作为右侧的微分方程的平衡点的稳定性。最后，通过一个双臂机械臂的例子，展示了如何利用这些发现来分析由PIK解生成的关节轨迹行为。

Comments 18 pages

详情

AI中文摘要

在本文中，我们研究了一类优先级逆运动学（PIK）解的各种理论性质，这些解可以被视为动态系统优先级多输出的输出调节或跟踪控制律。我们首先开发了研究PIK解非光滑性的工具，并发现非光滑性的充分条件。这表明经典定理无法保证满足PIK解的关节轨迹存在性和唯一性。因此，我们构建了一个利用PIK解结构信息的替代存在性和唯一性定理。接着，我们将PIK解的类别缩小到所有任务都遵循某些期望任务轨迹的情况，并发现与任务收敛性相关的性质。研究进一步分析了当所有任务都达到某些期望任务位置时，由PIK解作为右侧的微分方程的平衡点的稳定性。最后，我们通过一个双臂机械臂的例子，展示了如何利用这些发现来分析由PIK解生成的关节轨迹行为。

英文摘要

In this paper, we study various theoretical properties of a class of prioritized inverse kinematics (PIK) solutions that can be considered as a class of (output regulation or tracking) control laws of a dynamical system with prioritized multiple outputs. We first develop tools to investigate nonsmoothness of PIK solutions and find a sufficient condition for nonsmoothness. It implies that existence and uniqueness of a joint trajectory satisfying a PIK solution cannot be guaranteed by the classical theorems. So, we construct an alternative existence and uniqueness theorem that uses structural information of PIK solutions. Then, we narrow the class of PIK solutions down to the case that all tasks are designed to follow some desired task trajectories and discover a few properties related to task convergence. The study goes further to analyze stability of equilibrium points of the differential equation whose right hand side is a PIK solution when all tasks are designed to reach some desired task positions. Finally, we furnish an example with a two-link manipulator that shows how our findings can be used to analyze the behavior of a joint trajectory generated from a PIK solution.

URL PDF HTML ☆

赞 0 踩 0

1712.05870 2026-06-04 math.NA cs.CV cs.NA

Multi-dimensional imaging data recovery via minimizing the partial sum of tubal nuclear norm

通过最小化管核范数的偏和进行多维成像数据恢复

Tai-Xiang Jiang, Ting-Zhu Huang, Xi-Le Zhao, Liang-Jian Deng

发表机构 * School of Mathematical Sciences/Research Center for Image and Vision Computing（数学科学学院/图像与视觉计算研究中心）； University of Electronic Science and Technology of China（电子科技大学）； FinTech Innovation Center（金融科技创新中心）； Financial Intelligence and Financial Engineering Research Key Laboratory of Sichuan province（四川省金融 intelligence 和金融工程研究重点实验室）； School of Economic Information Engineering（经济信息工程学院）； Southwestern University of Finance and Economics（西南财经大学）

AI总结本文在张量奇异值分解（t-SVD）框架下研究张量恢复问题，提出张量管核秩的替代物偏和管核范数（PSTNN），并构建两个基于PSTNN的最小化模型用于张量补全和主成分分析，通过交替方向乘子法（ADMM）算法解决，并在合成数据和实际数据上验证了PSTNN的优越性。

1905.11266 2026-06-04 math.OC cs.LG cs.NA math.NA

One Method to Rule Them All: Variance Reduction for Data, Parameters and Many New Methods

一法统御诸法：数据、参数及许多新方法的方差缩减

Filip Hanzely, Peter Richtárik

发表机构 * King Abdullah University of Science and Technology（国王阿卜杜勒·阿齐兹科技大学）

AI总结本文提出了一种通用的方差缩减方法，适用于解决具有大量训练样例或大模型维度的正则化经验风险最小化问题。该方法在特殊情况下可以退化为多种已知且以前被认为无关的方法，如SAGA、LSVRG、JacSketch、SEGA和ISEGA及其任意采样和近端泛化。同时，本文还提出了许多具有有趣性质的新具体算法，并提供了一个单一定理，证明在光滑性和拟强凸性假设下方法的线性收敛性。

Comments 61 pages, 6 figures, 3 tables

1809.06646 2026-06-04 eess.SY cs.AI cs.SY

Model-Free Adaptive Optimal Control of Episodic Fixed-Horizon Manufacturing Processes using Reinforcement Learning

基于强化学习的无模型自适应最优控制用于周期固定时间制造过程

Johannes Dornheim, Norbert Link, Peter Gumbsch

发表机构 * Institute Intelligent Systems Research Group, Karlsruhe University of Applied Sciences（智能系统研究组，卡尔斯鲁厄应用科学大学）； Institute for Applied Materials (IAM-CMS), Karlsruhe Institute of Technology（应用材料研究所（IAM-CMS），卡尔斯鲁厄理工学院）

AI总结本文提出了一种用于周期固定时间制造过程的自学习最优控制算法，通过强化学习在连续过程中构建控制模型，并利用测量的产品质量作为奖励，从而避免了传统模型预测控制和近似动态规划算法所需的先验模型公式，解决了非线性动态和随机影响带来的系统辨识、精确建模和运行复杂度问题。

Comments Journal preprint version

详情

DOI: 10.1007/s12555-019-0120-7
Journal ref: International Journal of Control, Automation and Systems (2019)

AI中文摘要

本文提出了一种用于周期固定时间制造过程的自学习最优控制算法，通过强化学习在连续过程中构建控制模型，并利用测量的产品质量作为奖励，从而避免了传统模型预测控制和近似动态规划算法所需的先验模型公式，解决了非线性动态和随机影响带来的系统辨识、精确建模和运行复杂度问题。该算法通过与过程交互在线学习期望函数，以推导最优的过程控制决策。所提出的算法考虑了过程条件的随机变化，并能够应对部分可观测性。开发并研究了一种基于Q学习的方法，用于部分可观测的周期固定时间制造过程的自适应最优控制。通过将其应用于模拟的随机最优控制问题，即金属板深拉伸过程，对所得到的算法进行了实例化和评估。

英文摘要

A self-learning optimal control algorithm for episodic fixed-horizon manufacturing processes with time-discrete control actions is proposed and evaluated on a simulated deep drawing process. The control model is built during consecutive process executions under optimal control via reinforcement learning, using the measured product quality as reward after each process execution. Prior model formulation, which is required by state-of-the-art algorithms from model predictive control and approximate dynamic programming, is therefore obsolete. This avoids several difficulties namely in system identification, accurate modelling, and runtime complexity, that arise when dealing with processes subject to nonlinear dynamics and stochastic influences. Instead of using pre-created process and observation models, value function-based reinforcement learning algorithms build functions of expected future reward, which are used to derive optimal process control decisions. The expectation functions are learned online, by interacting with the process. The proposed algorithm takes stochastic variations of the process conditions into account and is able to cope with partial observability. A Q-learning-based method for adaptive optimal control of partially observable episodic fixed-horizon manufacturing processes is developed and studied. The resulting algorithm is instantiated and evaluated by applying it to a simulated stochastic optimal control problem in metal sheet deep drawing.

URL PDF HTML ☆

赞 0 踩 0

1807.01739 2026-06-04 math.OC cs.AI cs.LG cs.SY eess.SY

Proximal algorithms for large-scale statistical modeling and sensor/actuator selection

大规模统计建模和传感器/执行器选择的近端算法

Armin Zare, Hesameddin Mohammadi, Neil K. Dhingra, Tryphon T. Georgiou, Mihailo R. Jovanović

发表机构 * Ming Hsieh Department of Electrical and Computer Engineering, University of Southern California（南加州大学明希赫电子与计算机工程系）； Numerica Corporation（Numerica公司）

AI总结本文提出了一种统一的近端算法框架，用于解决大规模系统建模与控制中的正则化半定规划问题，通过近端方法实现了对统计建模和传感器/执行器选择的高效处理，展示了算法的线性收敛性和有效性。

Comments To appear in IEEE Trans. Automat. Control

详情

DOI: 10.1109/TAC.2019.2948268

AI中文摘要

若干在随机驱动动态系统建模与控制中的问题可以被表述为正则化半定规划。我们考察了两个具有代表性的此类问题，并展示了它们可以以类似的方式进行表述。第一个问题在统计建模中寻求通过适当且最小的扰动来协调观测统计数据。第二个问题则旨在为控制目的最优选择可用的传感器和执行器子集。为了应对大规模系统的建模与控制，我们开发了一种统一的算法框架，利用近端方法。我们的定制算法利用问题结构，使得能够处理统计建模以及传感器和执行器选择，比当前通用求解器可以处理的规模大得多。我们建立了近端梯度算法的线性收敛性，对比了所提出的近端算法与交替方向乘子法，并提供了示例以说明我们框架的优势和有效性。

英文摘要

Several problems in modeling and control of stochastically-driven dynamical systems can be cast as regularized semi-definite programs. We examine two such representative problems and show that they can be formulated in a similar manner. The first, in statistical modeling, seeks to reconcile observed statistics by suitably and minimally perturbing prior dynamics. The second seeks to optimally select a subset of available sensors and actuators for control purposes. To address modeling and control of large-scale systems we develop a unified algorithmic framework using proximal methods. Our customized algorithms exploit problem structure and allow handling statistical modeling, as well as sensor and actuator selection, for substantially larger scales than what is amenable to current general-purpose solvers. We establish linear convergence of the proximal gradient algorithm, draw contrast between the proposed proximal algorithms and alternating direction method of multipliers, and provide examples that illustrate the merits and effectiveness of our framework.

URL PDF HTML ☆

赞 0 踩 0

1904.10379 2026-06-04 cs.GR cs.CV cs.NA math.NA

Multi-modal 3D Shape Reconstruction Under Calibration Uncertainty using Parametric Level Set Methods

在校准不确定性下利用参数水平集方法进行多模态3D形状重建

Moshe Eliasof, Andrei Sharf, Eran Treister

发表机构 * Computer Science Department, Ben-Gurion University of the Negev（本古里安大学计算机科学系）

AI总结本文提出了一种参数化水平集方法，用于在存在校准不确定性时从多模态数据中重建3D形状，该方法能够有效处理不同数据模态，如稀疏点集、体素切片、2D照片等，并在复杂对象的紧凑表示和校准噪声鲁棒性方面取得了显著贡献。

详情

AI中文摘要

我们考虑了在存在不确定校准参数的情况下，从多模态数据中重建3D形状的问题。通常，3D数据模态可以是多种多样的形式，例如稀疏点集、体素切片、2D照片等。为了联合处理这些数据模态，我们利用了一种参数化水平集方法，该方法使用椭球径向基函数。这种方法不仅允许我们以解析且紧凑的方式表示物体，还赋予我们克服源自不准确获取参数的校准相关噪声的能力。这种本质上隐式的正则化导致了高度鲁棒且可扩展的重建，超越了其他传统方法。在我们的结果中，我们首先展示了该方法对复杂物体进行紧凑表示的能力。然后我们展示了我们的重建方法在少量测量和获取参数中的噪声方面都具有鲁棒性。最后，我们展示了从不同模态，如通过液体位移获得的体素切片（类似于CT扫描和X射线）以及从形状轮廓获得的视觉测量中，我们的重建能力。

英文摘要

We consider the problem of 3D shape reconstruction from multi-modal data, given uncertain calibration parameters. Typically, 3D data modalities can be in diverse forms such as sparse point sets, volumetric slices, 2D photos and so on. To jointly process these data modalities, we exploit a parametric level set method that utilizes ellipsoidal radial basis functions. This method not only allows us to analytically and compactly represent the object, it also confers on us the ability to overcome calibration related noise that originates from inaccurate acquisition parameters. This essentially implicit regularization leads to a highly robust and scalable reconstruction, surpassing other traditional methods. In our results we first demonstrate the ability of the method to compactly represent complex objects. We then show that our reconstruction method is robust both to a small number of measurements and to noise in the acquisition parameters. Finally, we demonstrate our reconstruction abilities from diverse modalities such as volume slices obtained from liquid displacement (similar to CTscans and XRays), and visual measurements obtained from shape silhouettes.

URL PDF HTML ☆

赞 0 踩 0

1605.06311 2026-06-04 stat.CO cs.CV cs.SY eess.SY

Poisson multi-Bernoulli conjugate prior for multiple extended object filtering

泊松多伯努利共轭先验用于多扩展目标滤波

Karl Granstrom, Maryam Fatemi, Lennart Svensson

发表机构 * Department of Signals and Systems, Chalmers University of Technology（信號與系統系，查爾姆斯理工大学）； Zenuity

AI总结本文提出了一种用于多扩展目标滤波的泊松多伯努利混合（PMBM）共轭先验。通过泊松点过程描述尚未检测到的目标存在，而多伯努利混合描述已检测到的目标分布。预测和更新方程针对标准转移密度和测量似然性进行推导。预测和更新均保持密度的PMBM形式，因此PMBM密度是一种共轭先验。然而，未知的数据关联导致PMBM密度中出现难以处理的大量项，因此需要近似方法。本文给出了伽马高斯逆 Wishart 实现，并提供了处理数据关联问题的方法。模拟研究显示，扩展目标PMBM滤波器在与扩展目标d-GLMB和LMB滤波器的比较中表现良好。使用激光雷达数据的实验展示了同时跟踪已检测和未检测目标的优势。

详情

DOI: 10.1109/TAES.2019.2920220

AI中文摘要

本文提出了一种用于多扩展目标滤波的泊松多伯努利混合（PMBM）共轭先验。通过泊松点过程描述尚未检测到的目标存在，而多伯努利混合描述已检测到的目标分布。预测和更新方程针对标准转移密度和测量似然性进行推导。预测和更新均保持密度的PMBM形式，因此PMBM密度是一种共轭先验。然而，未知的数据关联导致PMBM密度中出现难以处理的大量项，因此需要近似方法。本文给出了伽马高斯逆 Wishart 实现，并提供了处理数据关联问题的方法。模拟研究显示，扩展目标PMBM滤波器在与扩展目标d-GLMB和LMB滤波器的比较中表现良好。使用激光雷达数据的实验展示了同时跟踪已检测和未检测目标的优势。

英文摘要

This paper presents a Poisson multi-Bernoulli mixture (PMBM) conjugate prior for multiple extended object filtering. A Poisson point process is used to describe the existence of yet undetected targets, while a multi-Bernoulli mixture describes the distribution of the targets that have been detected. The prediction and update equations are presented for the standard transition density and measurement likelihood. Both the prediction and the update preserve the PMBM form of the density, and in this sense the PMBM density is a conjugate prior. However, the unknown data associations lead to an intractably large number of terms in the PMBM density, and approximations are necessary for tractability. A gamma Gaussian inverse Wishart implementation is presented, along with methods to handle the data association problem. A simulation study shows that the extended target PMBM filter performs well in comparison to the extended target d-GLMB and LMB filters. An experiment with Lidar data illustrates the benefit of tracking both detected and undetected targets.

URL PDF HTML ☆

赞 0 踩 0

1905.06518 2026-06-04 eess.SY cs.LG cs.SY

Efficient hinging hyperplanes neural network and its application in nonlinear system identification

高效铰接超平面神经网络及其在非线性系统辨识中的应用

Jun Xu, Qinghua Tao, Zhen Li, Xiangming Xi, Johan A. K. Suykens, Shuning Wang

发表机构 * School of Mechanical Engineering（机械工程学院）； Automation, Harbin Institute of Technology, Shenzhen, 518055, China（自动化学院，哈尔滨工业大学深圳校区，中国，518055）； BNRist, Department of Automation, Tsinghua University, Beijing, 100084, China（BNRist，自动化系，清华大学，北京，中国，100084）

AI总结本文提出了一种高效的铰接超平面（EHH）神经网络，该网络通过求解多个凸优化问题进行训练，具有快速的训练速度。研究证明，每个EHH神经网络等价于一个自适应铰接超平面（AHH）树，并在系统辨识中表现出良好的应用效果。EHH神经网络具有可解释性，可通过ANOVA分解或交互矩阵获得，可作为输入变量选择的建议。

Comments submitted to Automatica

详情

AI中文摘要

本文提出了一种基于铰接超平面（HH）模型的高效铰接超平面（EHH）神经网络。EHH神经网络是一种分布式表示，其训练涉及求解多个凸优化问题，并且训练速度快。证明了对于每一个EHH神经网络，都存在一个等价的自适应铰接超平面（AHH）树，该AHH树也是基于HH模型提出的，并在系统辨识中找到了良好的应用。EHH神经网络的构建包括两个阶段。首先，EHH神经网络的初始结构是随机确定的，使用Lasso回归选择合适的网络。为了减轻随机性的影响，第二阶段采用堆叠策略来形成更一般的网络结构。与其他神经网络不同，EHH神经网络具有可解释性，可以通过其ANOVA分解（或交互矩阵）轻松获得。这种可解释性可以用于输入变量选择的建议。EHH神经网络应用于非线性系统辨识，仿真结果表明所选回归向量合理，识别速度较快，同时仿真精度也令人满意。

英文摘要

In this paper, the efficient hinging hyperplanes (EHH) neural network is proposed based on the model of hinging hyperplanes (HH). The EHH neural network is a distributed representation, the training of which involves solving several convex optimization problems and is fast. It is proved that for every EHH neural network, there is an equivalent adaptive hinging hyperplanes (AHH) tree, which was also proposed based on the model of HH and find good applications in system identification. The construction of the EHH neural network includes 2 stages. First the initial structure of the EHH neural network is randomly determined and the Lasso regression is used to choose the appropriate network. To alleviate the impact of randomness, secondly, the stacking strategy is employed to formulate a more general network structure. Different from other neural networks, the EHH neural network has interpretability ability, which can be easily obtained through its ANOVA decomposition (or interaction matrix). The interpretability can then be used as a suggestion for input variable selection. The EHH neural network is applied in nonlinear system identification, the simulation results show that the regression vector selected is reasonable and the identification speed is fast, while at the same time, the simulation accuracy is satisfactory.

URL PDF HTML ☆

赞 0 踩 0

1804.10273 2026-06-04 math.OC cs.LG cs.NA math.FA math.NA

A telescoping Bregmanian proximal gradient method without the global Lipschitz continuity assumption

一种无需全局Lipschitz连续性假设的 telescoping Bregmanian 近端梯度方法

Daniel Reem, Simeon Reich, Alvaro De Pierro

发表机构 * The Technion - Israel Institute of Technology（技术Ion-以色列理工学院）

AI总结本文提出了一种无需全局Lipschitz连续性假设的近端梯度方法变体，通过在约束集上进行 telescoping 分解，并利用Bregman散度来改进收敛性分析。

Comments Journal of Optimization Theory and Applications (JOTA): accepted for publication; very minor modifications; this version contains full proofs and alphabetically ordered list of references (in contrast with the journal version)

详情

DOI: 10.1007/s10957-019-01509-8
Journal ref: J. Optim. Theory. Appl. 182 (2019), 851--884

AI中文摘要

最小化两个凸函数之和的问题在理论和实际应用中都有广泛的应用。解决此类问题的一种流行方法是近端梯度方法（近端前向-后向算法）。在使用该方法时，一个常见的假设是光滑项的梯度具有全局Lipschitz连续性。然而，这种假设在实践中并不总是成立，从而限制了该方法的使用。在本文中，我们讨论了一种新的近端梯度方法变体，在广泛的有限和无限维空间中，该方法不假定上述全局Lipschitz连续性假设。该方法的关键贡献是迭代步长依赖于约束集的某种telescoping分解。此外，我们使用Bregman散度在近端前向-后向操作中。在某些实际条件下，建立了非渐近收敛率（即函数值的收敛率），以及整个序列弱收敛到极小值点。我们还获得了一些具有独立兴趣的辅助结果。

英文摘要

The problem of minimization of the sum of two convex functions has various theoretical and real-world applications. One of the popular methods for solving this problem is the proximal gradient method (proximal forward-backward algorithm). A very common assumption in the use of this method is that the gradient of the smooth term is globally Lipschitz continuous. However, this assumption is not always satisfied in practice, thus casting a limitation on the method. In this paper, we discuss, in a wide class of finite and infinite-dimensional spaces, a new variant of the proximal gradient method which does not impose the above-mentioned global Lipschitz continuity assumption. A key contribution of the method is the dependence of the iterative steps on a certain telescopic decomposition of the constraint set into subsets. Moreover, we use a Bregman divergence in the proximal forward-backward operation. Under certain practical conditions, a non-asymptotic rate of convergence (that is, in the function values) is established, as well as the weak convergence of the whole sequence to a minimizer. We also obtain a few auxiliary results of independent interest.

URL PDF HTML ☆

赞 0 踩 0

1904.03537 2026-06-04 math.OC cs.CV cs.LG cs.NA math.NA

Convex-Concave Backtracking for Inertial Bregman Proximal Gradient Algorithms in Non-Convex Optimization

凸凹回溯法用于非凸优化中的惯性Bregman近似梯度算法

Mahesh Chandra Mukkamala, Peter Ochs, Thomas Pock, Shoham Sabach

发表机构 * Faculty of Mathematics and Computer Science, Saarland University（萨尔兰大学数学与计算机科学学院）； Institute of Computer Graphics and Vision, Graz University of Technology（格拉茨技术大学计算机图形与视觉研究所）； Faculty of Industrial Engineering, The Technion（技术学院工业工程学院）

AI总结本文提出了一种凸凹回溯方法，用于非凸优化中的惯性Bregman近似梯度算法，通过寻找目标函数的凸上界和凹下界，实现步长和外推参数的自适应选择，并证明算法全局收敛到临界点。

Comments 29 pages

详情

AI中文摘要

回溯线搜索是一种古老而强大的策略，用于在近似梯度算法中寻找更好的步长。其主要原理是局部寻找目标函数的简单凸上界，从而控制使用的步长。在惯性近似梯度算法中，情况变得更加复杂，通常导致对外推参数的非常严格的限制。在本文中，我们展示通过局部寻找目标函数的简单凹下界，可以控制外推参数。这导致了一种双凸凹回溯过程，允许自适应地选择步长和外推参数。我们将此过程应用于惯性Bregman近似梯度方法的类别，并证明由这些算法生成的任何序列都全局收敛到函数的临界点。在图像处理和机器学习中的多个具有挑战性的非凸问题上的数值实验显示，结合惯性步和双回溯策略能够实现性能的提升。

英文摘要

Backtracking line-search is an old yet powerful strategy for finding a better step sizes to be used in proximal gradient algorithms. The main principle is to locally find a simple convex upper bound of the objective function, which in turn controls the step size that is used. In case of inertial proximal gradient algorithms, the situation becomes much more difficult and usually leads to very restrictive rules on the extrapolation parameter. In this paper, we show that the extrapolation parameter can be controlled by locally finding also a simple concave lower bound of the objective function. This gives rise to a double convex-concave backtracking procedure which allows for an adaptive choice of both the step size and extrapolation parameters. We apply this procedure to the class of inertial Bregman proximal gradient methods, and prove that any sequence generated by these algorithms converges globally to a critical point of the function at hand. Numerical experiments on a number of challenging non-convex problems in image processing and machine learning were conducted and show the power of combining inertial step and double backtracking strategy in achieving improved performances.

URL PDF HTML ☆

赞 0 踩 0

1812.03446 2026-06-04 math.NA cs.CV cs.NA

A New Variational Model for Joint Image Reconstruction and Motion Estimation in Spatiotemporal Imaging

一种新的变分模型用于时空成像中的联合图像重建和运动估计

Chong Chen, Barbara Gris, Ozan Öktem

发表机构 * LSEC, ICMSEC, Academy of Mathematics and Systems Science, Chinese Academy of Sciences（LSEC，ICMSEC，数学系统科学研究所，中国科学院）； Department of Mathematics, KTH–Royal Institute of Technology（数学系，皇家理工学院）

AI总结本文提出了一种新的变分模型，用于时空成像中的联合图像重建和运动估计，该模型基于形状理论框架，结合了改进的静态图像重建和顺序间接图像配准，通过理论分析和数值实验展示了其在稀疏和高噪声数据下的有效性。

Comments 35 pages, 5 figures, 3 tables, revised

详情

DOI: 10.1137/18M1234047
Journal ref: SIAM Journal on Imaging Sciences 2019

AI中文摘要

我们提出了一种新的变分模型，用于时空成像中的联合图像重建和运动估计，该模型基于我们提出的一般框架，结合了形状理论。该模型由两个组成部分组成，一个用于执行改进的静态图像重建，另一个依次执行间接图像配准。对于后者，我们将大变形各向同性度量映射框架推广到顺序间接配准设置中。所提出的模型在理论上与替代方法（基于光学流的模型和各向同性运动模型）进行了比较，证明了所提模型在最优解方面具有良好的性质。此外，还给出了所提模型时间离散化场景的理论推导和高效算法，表明时间离散化版本的最优解与时间连续版本一致，并且大部分计算组件都是易于实现的线性化变形。还分析了该算法的复杂度。本文最后通过2D空间+时间断层成像中非常稀疏和/或高噪声数据的数值示例进行了总结。

英文摘要

We propose a new variational model for joint image reconstruction and motion estimation in spatiotemporal imaging, which is investigated along a general framework that we present with shape theory. This model consists of two components, one for conducting modified static image reconstruction, and the other performs sequentially indirect image registration. For the latter, we generalize the large deformation diffeomorphic metric mapping framework into the sequentially indirect registration setting. The proposed model is compared theoretically against alternative approaches (optical flow based model and diffeomorphic motion models), and we demonstrate that the proposed model has desirable properties in terms of the optimal solution. The theoretical derivations and efficient algorithms are also presented for a time-discretized scenario of the proposed model, which show that the optimal solution of the time-discretized version is consistent with that of the time-continuous one, and most of the computational components is the easy-implemented linearized deformation. The complexity of the algorithm is analyzed as well. This work is concluded by some numerical examples in 2D space + time tomography with very sparse and/or highly noisy data.

URL PDF HTML ☆

赞 0 踩 0

1706.04048 2026-06-04 math.NA cs.CV cs.NA math.DS math.FA math.OC

Indirect Image Registration with Large Diffeomorphic Deformations

具有大变形的间接图像配准

Chong Chen, Ozan Öktem

发表机构 * Department of Mathematics, KTH–Royal Institute of Technology（数学系，皇家理工学院）； LSEC, ICMSEC, Academy of Mathematics and Systems Science, Chinese Academy of Sciences（数学和系统科学研究所，中国科学院）

AI总结本文提出了一种基于大变形可微映射框架的间接图像配准方法，解决了在间接噪声观测下模板与目标的配准问题，并证明了该方法在数据误差趋近于零时的稳定性与收敛性。

Comments 43 pages, 4 figures, 1 table; revised

1810.00697 2026-06-04 eess.SY cs.AI cs.LG cs.SY

Data-driven Discovery of Cyber-Physical Systems

基于数据的物理系统发现

Ye Yuan, Xiuchuan Tang, Wei Pan, Xiuting Li, Wei Zhou, Hai-Tao Zhang, Han Ding, Jorge Goncalves

发表机构 * School of Automation, Huazhong University of Science and Technology（华中科技大学自动化学院）； State Key Lab of Digital Manufacturing Equipment and Technology（数字制造装备与技术国家重点实验室）； School of Mechanical Science and Engineering, Huazhong University of Science and Technology（华中科技大学机械科学与工程学院）； Department of Cognitive Robotics, Delft University of Technology（代尔夫特理工大学认知机器人系）； Department of Engineering, University of Cambridge（剑桥大学工程系）； Luxembourg Centre for Systems Biomedicine, University of Luxembourg（卢森堡系统生物医学中心，卢森堡大学）

AI总结本文提出了一种从数据直接反向工程物理系统的通用框架，通过识别物理系统和推断转移逻辑，成功应用于机械、电气系统和医疗应用，为预测CPS轨迹、评估性能、设计容错系统和制定新系统设计指南提供了新方法。

详情

DOI: 10.1038/s41467-019-12490-1

AI中文摘要

物理系统（CPSs）将软件嵌入物理世界，广泛应用于智能电网、机器人、智能制造和医疗监测等领域。由于其固有的复杂性，来自物理组件和网络组件的组合以及它们之间的相互作用，CPSs在建模方面表现出抗性。本文提出了一种从数据直接反向工程CPSs的通用框架。该方法涉及识别物理系统以及推断转移逻辑。它已成功应用于从机械和电气系统到医疗应用的多个现实世界示例。该新颖的框架旨在使研究人员能够基于发现的模型预测CPS的轨迹。此类信息已被证明对于评估CPS性能、设计容错CPS以及为新CPS制定设计指南至关重要。

英文摘要

Cyber-physical systems (CPSs) embed software into the physical world. They appear in a wide range of applications such as smart grids, robotics, intelligent manufacture and medical monitoring. CPSs have proved resistant to modeling due to their intrinsic complexity arising from the combination of physical components and cyber components and the interaction between them. This study proposes a general framework for reverse engineering CPSs directly from data. The method involves the identification of physical systems as well as the inference of transition logic. It has been applied successfully to a number of real-world examples ranging from mechanical and electrical systems to medical applications. The novel framework seeks to enable researchers to make predictions concerning the trajectory of CPSs based on the discovered model. Such information has been proven essential for the assessment of the performance of CPS, the design of failure-proof CPS and the creation of design guidelines for new CPSs.

URL PDF HTML ☆

赞 0 踩 0

1811.07624 2026-06-04 math.NA cs.DS cs.LG cs.NA stat.ML

Approximate Eigenvalue Decompositions of Linear Transformations with a Few Householder Reflectors

利用少量Householder反射子进行线性变换的近似本征值分解

Cristian Rusu

发表机构 * Istituto Italiano di Tecnologia（意大利技术研究院）

AI总结本文提出了一种利用少量Householder反射子构造高效或thonormal矩阵的方法，用于近似或thonormal或对称变换，并应用于快速Mahalanobis距离度量变换的学习。

详情

AI中文摘要

将信号分解为正交基（一组正交分量，每个分量归一化为单位长度）的能力，是许多信号处理方法和应用的核心。经典例子是傅里叶变换和小波变换，它们具有数值高效的实现（FFT和FWT）。不幸的是，正交变换通常结构不规则，因此通常不具有低计算复杂度的性质。在本文中，基于Householder反射子，我们引入了一类正交矩阵，这些矩阵在数值上易于操作：我们通过一个给定参数控制这些矩阵与向量的乘法复杂度。我们提供了数值算法，用于近似任何正交或对称变换，通过给定数量Householder反射子的乘积构造新的正交或对称结构。我们展示了分析和数值证据，以突出所提近似的准确性，并提供了一个应用于快速Mahalanobis距离度量变换学习的应用。

英文摘要

The ability to decompose a signal in an orthonormal basis (a set of orthogonal components, each normalized to have unit length) using a fast numerical procedure rests at the heart of many signal processing methods and applications. The classic examples are the Fourier and wavelet transforms that enjoy numerically efficient implementations (FFT and FWT, respectively). Unfortunately, orthonormal transformations are in general unstructured, and therefore they do not enjoy low computational complexity properties. In this paper, based on Householder reflectors, we introduce a class of orthonormal matrices that are numerically efficient to manipulate: we control the complexity of matrix-vector multiplications with these matrices using a given parameter. We provide numerical algorithms that approximate any orthonormal or symmetric transform with a new orthonormal or symmetric structure made up of products of a given number of Householder reflectors. We show analyses and numerical evidence to highlight the accuracy of the proposed approximations and provide an application to the case of learning fast Mahanalobis distance metric transformations.

URL PDF HTML ☆

赞 0 踩 0

1905.07875 2026-06-04 eess.SY cs.LG cs.NA cs.SY math.NA

Investigating Flight Envelope Variation Predictability of Impaired Aircraft using Least-Squares Regression Analysis

利用最小二乘回归分析研究受损飞机飞行包线变化的可预测性

Ramin Norouzi, Amirreza Kosari, Mohammad Hossein Sabour

发表机构 * University of Tehran（塔里哈大学）

AI总结本文通过线性和非线性最小二乘估计方法，研究了受损飞机飞行包线内Trim点数量及其质心的可预测性，并开发并比较了多种多项式模型和基于双曲正切函数的非线性模型，以预测不同故障程度下的飞行包线变化。

Comments Accepted version, Journal of Aerospace Information Systems

详情

DOI: 10.2514/1.I010760

AI中文摘要

飞机故障会改变飞机的动态特性并导致飞行包线发生变化。此类包线变化是非线性的，通常无法被飞行员预测，因为它们受飞机复杂动态的支配。因此，为了防止飞行中失去控制，必须能够实际预测任何事先未知故障程度下受损飞机的飞行包线变化。本文通过线性和非线性最小二乘估计方法，研究了飞行包线内Trim点数量及其质心的可预测性。为此，开发并比较了多种多项式模型和基于双曲正切函数的非线性模型，这些模型将影响包线变化的因素作为输入，并在任何预期故障程度下估计飞行包线的Trim点数量和质心。结果表明，多项式和基于双曲正切函数的模型都能以高精度预测受损飞行包线的变化。此外，还证明了最佳多项式拟合的回归方程能够直接评估受损飞机的飞行包线收缩和位移对特定飞机故障和飞行条件参数的敏感性。

英文摘要

Aircraft failures alter the aircraft dynamics and cause maneuvering flight envelope to change. Such envelope variations are nonlinear and generally unpredictable by the pilot as they are governed by the aircraft's complex dynamics. Hence, in order to prevent in-flight Loss of Control it is crucial to practically predict the impaired aircraft's flight envelope variation due to any a-priori unknown failure degree. This paper investigates the predictability of the number of trim points within the maneuvering flight envelope and its centroid using both linear and nonlinear least-squares estimation methods. To do so, various polynomial models and nonlinear models based on hyperbolic tangent function are developed and compared which incorporate the influencing factors on the envelope variations as the inputs and estimate the centroid and the number of trim points of the maneuvering flight envelope at any intended failure degree. Results indicate that both the polynomial and hyperbolic tangent function-based models are capable of predicting the impaired fight envelope variation with good precision. Furthermore, it is shown that the regression equation of the best polynomial fit enables direct assessment of the impaired aircraft's flight envelope contraction and displacement sensitivity to the specific parameters characterizing aircraft failure and flight condition.

URL PDF HTML ☆

赞 0 踩 0

1809.03674 2026-06-04 eess.SY cs.RO cs.SY

Planar Cooperative Extremum Seeking with Guaranteed Convergence Using A Three-Robot Formation

平面协作极值搜索与保证收敛的三机器人编队

Anna Skobeleva, Baris Fidan, V. Ugrinovskii, Ian R. Petersen

发表机构 * Mechanical and Mechatronics Engineering Department, University of Waterloo（滑铁卢大学机械与机电工程系）

AI总结本文提出了一种结合编队获取和协作极值搜索控制的方案，用于在平面上移动的三机器人团队。极值搜索任务是找到一个未知二维函数在平面上的最大值点。该函数表示由于位于最大值点的源产生的信号强度场，并假设该函数在最大值点附近局部凹凸且随距离源点的增加而单调递减。通过在特定领头机器人位置和最大值点处对场函数进行泰勒展开，并结合基于机器人信号强度测量的梯度估计器，设计并分析了所提出的控制方案。所提出的方案被证明可以指数收敛并同时（i）获取指定的几何编队，（ii）将领头机器人驱动到指定的最大值点附近的盘内，该盘的半径取决于指定的期望编队大小以及场函数的海森矩阵范数上限。通过一组仿真实验评估了所提出控制方案的性能。

Comments Presented at the 2018 IEEE Conference on Decision and Control (CDC), Miami Beach, FL, USA

详情

DOI: 10.1109/CDC.2018.8619784

AI中文摘要

本文提出了一种结合编队获取和协作极值搜索控制的方案，用于在平面上移动的三机器人团队。极值搜索任务是找到一个未知二维函数在平面上的最大值点。该函数表示由于位于最大值点的源产生的信号强度场，并假设该函数在最大值点附近局部凹凸且随距离源点的增加而单调递减。通过在特定领头机器人位置和最大值点处对场函数进行泰勒展开，并结合基于机器人信号强度测量的梯度估计器，设计并分析了所提出的控制方案。所提出的方案被证明可以指数收敛并同时（i）获取指定的几何编队，（ii）将领头机器人驱动到指定的最大值点附近的盘内，该盘的半径取决于指定的期望编队大小以及场函数的海森矩阵范数上限。通过一组仿真实验评估了所提出控制方案的性能。

英文摘要

In this paper, a combined formation acquisition and cooperative extremum seeking control scheme is proposed for a team of three robots moving on a plane. The extremum seeking task is to find the maximizer of an unknown two-dimensional function on the plane. The function represents the signal strength field due to a source located at maximizer, and is assumed to be locally concave around maximizer and monotonically decreasing in distance to the source location. Taylor expansions of the field function at the location of a particular lead robot and the maximizer are used together with a gradient estimator based on signal strength measurements of the robots to design and analyze the proposed control scheme. The proposed scheme is proven to exponentially and simultaneously (i) acquire the specified geometric formation and (ii) drive the lead robot to a specified neighborhood disk around maximizer, whose radius depends on the specified desired formation size as well as the norm bounds of the Hessian of the field function. The performance of the proposed control scheme is evaluated using a set of simulation experiments.

URL PDF HTML ☆

赞 0 踩 0

1402.3735 2026-06-04 cs.MA cs.RO cs.SY eess.SY

Decentralized Goal Assignment and Safe Trajectory Generation in Multi-Robot Networks via Multiple Lyapunov Functions

多机器人网络中基于多重李雅普诺夫函数的去中心化目标分配与安全轨迹生成

Dimitra Panagou, Matthew Turpin, Vijay Kumar

发表机构 * GRASP Lab, School of Engineering and Applied Science, University of Pennsylvania（宾夕法尼亚大学工程与应用科学学院GRASP实验室）

AI总结本文研究了在仅本地通信可用的情况下多机器人网络的去中心化目标分配与轨迹生成问题，提出了一种基于切换系统和集合不变性方法的解决方案。通过使用一组李雅普诺夫-like函数编码候选目标分配的局部决策，使连接的代理选择最短总距离到目标的分配。当最优分配可能导致碰撞轨迹时，使用另一组李雅普诺夫-like屏障函数来维持系统安全并保持收敛保证。所提出的切换策略产生计算高效且可扩展的反馈控制策略，适用于在有限信息共享下机器人网络的快速响应部署。

详情

AI中文摘要

本文考虑了在仅本地通信可用的情况下多机器人网络的去中心化目标分配与轨迹生成问题，并提出了一种基于与切换系统和集合不变性相关的方法的解决方案。采用一组李雅普诺夫-like函数来编码候选目标分配之间的（局部）决策，使得一组连接的代理选择导致到目标的最短总距离的分配。在最优分配可能导致碰撞轨迹的情况下，激活另一组李雅普诺夫-like屏障函数，从而在保持系统安全的同时保留收敛保证。所提出的切换策略产生了计算高效且可扩展的反馈控制策略，因此适用于在有限信息共享下机器人网络的快速响应部署。通过模拟结果和六台地面机器人的实验验证了所提出方法的有效性。

英文摘要

This paper considers the problem of decentralized goal assignment and trajectory generation for multi-robot networks when only local communication is available, and proposes an approach based on methods related to switched systems and set invariance. A family of Lyapunov-like functions is employed to encode the (local) decision making among candidate goal assignments, under which a group of connected agents chooses the assignment that results in the shortest total distance to the goals. An additional family of Lyapunov-like barrier functions is activated in the case when the optimal assignment may lead to colliding trajectories, maintaining thus system safety while preserving the convergence guarantees. The proposed switching strategies give rise to feedback control policies that are computationally efficient and scalable with the number of agents, and therefore suitable for applications including first-response deployment of robotic networks under limited information sharing. The efficacy of the proposed method is demonstrated via simulation results and experiments with six ground robots.

URL PDF HTML ☆

赞 0 踩 0

1811.12819 2026-06-04 eess.SY cs.RO cs.SY

Structure-Preserving Constrained Optimal Trajectory Planning of a Wheeled Inverted Pendulum

保持结构的约束最优轨迹规划 of 一个轮式倒置摆

Klaus Albert, Karmvir Singh Phogat, Felix Anhalt, Ravi N Banavar, Debasish Chatterjee, Boris Lohmann

发表机构 * Systems and Control Engineering（系统与控制工程）

AI总结本文研究了轮式倒置摆的约束最优轨迹规划问题，通过离散力学推导了离散时间模型，并利用离散时间约束最优控制问题生成最优轨迹，同时设计了非线性连续时间模型和闭环LQ控制器，通过实验验证了非线性模型和控制方案的有效性。

Comments 12 pages, 8 figures, 1 table. arXiv admin note: text overlap with arXiv:1710.10932

详情

AI中文摘要

轮式倒置摆（WIP）是一个欠驱动、非完整机电系统，已被商业化为Segway。设计一个控制律进行运动规划，同时考虑状态和控制约束，并尊重配置流形，是一个具有挑战性的问题。本文通过离散力学推导了WIP系统的离散时间模型，并通过求解离散时间约束最优控制问题生成WIP系统的最优轨迹。进一步，我们描述了一个带有参数的非线性连续时间模型，用于设计闭环LQ控制器。所设计的最优轨迹作为参考提供给机器人，同时最优控制轨迹作为前馈控制作用，反馈模式下的LQ控制器用于抑制噪声和干扰，以确保WIP系统的稳定运动。在进行涉及剧烈操作和较为急转弯的WIP系统实验时，我们发现设计的最优轨迹与机器人跟踪这些轨迹时所走的路径高度一致。这证实了非线性模型和控制方案的有效性。最后，这些实验展示了WIP系统的高度非线性特性和控制方案的鲁棒性。

英文摘要

The Wheeled Inverted Pendulum (WIP) is an underactuated, nonholonomic mechatronic system, and has been popularized commercially as the Segway. Designing a control law for motion planning, that incorporates the state and control constraints, while respecting the configuration manifold, is a challenging problem. In this article we derive a discrete-time model of the WIP system using discrete mechanics and generate optimal trajectories for the WIP system by solving a discrete-time constrained optimal control problem. Further, we describe a nonlinear continuous-time model with parameters for designing a closed loop LQ-controller. A dual control architecture is implemented in which the designed optimal trajectory is then provided as a reference to the robot with the optimal control trajectory as a feedforward control action, and an LQ-controller in the feedback mode is employed to mitigate noise and disturbances for ensuing stable motion of the WIP system. While performing experiments on the WIP system involving aggressive maneuvers with fairly sharp turns, we found a high degree of congruence in the designed optimal trajectories and the path traced by the robot while tracking these trajectories. This corroborates the validity of the nonlinear model and the control scheme. Finally, these experiments demonstrate the highly nonlinear nature of the WIP system and robustness of the control scheme.

URL PDF HTML ☆

赞 0 踩 0

1904.11538 2026-06-04 eess.SY cs.LG cs.SY

Zap Q-Learning for Optimal Stopping Time Problems

Shuhang Chen, Adithya M. Devraj, Ana Bušić, Sean P. Meyn

发表机构 * Department of ECE at the University of Florida（佛罗里达大学电子与计算机工程系）； Inria International Chair, Paris（巴黎Inria国际席位）

AI总结本文研究了在不可约、均匀递归的马尔可夫链上，通过快速收敛的强化学习算法近似求解折扣成本最优停止问题，提出了一种名为Zap-Q-learning的算法，证明其在线性函数近似设置下的收敛性。

详情

AI中文摘要

本文的目标是获得快速收敛的强化学习算法，以近似求解在不可约、均匀递归的马尔可夫链上，其状态空间为$\mathbb{R}^n$的紧子集中的折扣成本最优停止问题的解。我们基于Tsitsikilis和Van Roy所采用的动态规划方法，其中他们提出了一种Q-learning算法来估计最优状态-动作价值函数，从而定义最优停止规则。我们探讨了该算法收敛速度慢的原因，并提出了一种快速收敛的替代算法，即“Zap-Q-learning”算法，旨在实现最优的收敛速度。首次在假设线性函数近似设置下证明了Zap-Q-learning算法的收敛性。我们通过ODE分析进行证明，并通过金融示例中的最优渐近方差性质反映该算法的快速收敛性。

英文摘要

The objective in this paper is to obtain fast converging reinforcement learning algorithms to approximate solutions to the problem of discounted cost optimal stopping in an irreducible, uniformly ergodic Markov chain, evolving on a compact subset of $\mathbb{R}^n$. We build on the dynamic programming approach taken by Tsitsikilis and Van Roy, wherein they propose a Q-learning algorithm to estimate the optimal state-action value function, which then defines an optimal stopping rule. We provide insights as to why the convergence rate of this algorithm can be slow, and propose a fast-converging alternative, the "Zap-Q-learning" algorithm, designed to achieve optimal rate of convergence. For the first time, we prove the convergence of the Zap-Q-learning algorithm under the assumption of linear function approximation setting. We use ODE analysis for the proof, and the optimal asymptotic variance property of the algorithm is reflected via fast convergence in a finance example.

URL PDF HTML ☆

赞 0 踩 0

1904.04211 2026-06-04 astro-ph.SR cs.AI cs.NA math.NA

Desaturating EUV observations of solar flaring storms

淡化日冕层太阳耀斑风暴的观测

Sabrina Guastavino, Michele Piana, Anna Maria Massone, Richard Schwartz, Federico Benvenuto

发表机构 * CNR - SPIN（意大利国家研究委员会-SPIN）； NASA Goddard Space Flight Center（美国国家航空航天局戈达德空间飞行中心）

AI总结本文提出了一种新颖的去饱和方法，能够通过利用图像本身的信息恢复AIA图像中饱和区域的信号，为构建AIA数据重建流程提供了可靠工具。

详情

DOI: 10.3847/1538-4357/ab35d8

AI中文摘要

图像饱和一直是太阳天文观测中多个仪器面临的问题，特别是在EUV波长范围内。然而，随着太阳动态观测站（SDO）任务载荷中大气成像装配（AIA）的发射，图像饱和已成为大数据问题，涉及自2010年2月以来每年提供的 impressive 数据集中的约10^$帧。本文介绍了一种新颖的去饱和方法，该方法能够通过利用图像本身包含的信息来恢复任何AIA图像中饱和区域的信号。这种独特的方法学特性，加上去饱和图像前所未有的统计可靠性，可能使该算法成为实现AIA数据重建流程的完美工具，即使在长时间、高能耀斑事件的情况下也能正常工作。

英文摘要

Image saturation has been an issue for several instruments in solar astronomy, mainly at EUV wavelengths. However, with the launch of the Atmospheric Imaging Assembly (AIA) as part of the payload of the Solar Dynamic Observatory (SDO) image saturation has become a big data issue, involving around 10^$ frames of the impressive dataset this beautiful telescope has been providing every year since February 2010. This paper introduces a novel desaturation method, which is able to recover the signal in the saturated region of any AIA image by exploiting no other information but the one contained in the image itself. This peculiar methodological property, jointly with the unprecedented statistical reliability of the desaturated images, could make this algorithm the perfect tool for the realization of a reconstruction pipeline for AIA data, able to work properly even in the case of long-lasting, very energetic flaring events.

URL PDF HTML ☆

赞 0 踩 0

1905.05992 2026-06-04 eess.SY cs.LG cs.SY math.OC

Deep reinforcement learning for scheduling in large-scale networked control systems

在大规模网络化控制系统中使用深度强化学习进行调度

Adrian Redder, Arunselvan Ramaswamy, Daniel E. Quevedo

发表机构 * Faculty of Computer Science, Electrical Engineering and Mathematics（计算机科学、电气工程与数学系）； Paderborn University（帕德博恩大学）

AI总结本文提出了一种基于深度强化学习的迭代资源分配算法DIRA，用于解决网络化系统中的控制与资源调度问题，通过联合优化控制与调度以提高性能。

1603.07421 2026-06-04 eess.SY cs.LG cs.SY math.OC

On the Powerball Method for Optimization

关于优化的Powerball方法

Ye Yuan, Mu Li, Jun Liu, Claire J. Tomlin

发表机构 * School of Automation, Huazhong University of Science and Technology（华中科技大学自动化学院）； Department of Computer Science, Carnegie Mellon University（卡内基梅隆大学计算机科学系）； Department of Applied Mathematics, University of Waterloo（滑铁卢大学应用数学系）； Department of Electrical Engineering and Computer Sciences, University of California, Berkeley（加州大学伯克利分校电气工程与计算机科学系）

AI总结本文提出了一种新的方法来加速优化算法的收敛，通过在优化过程中添加一个功率系数γ∈[0,1)，称为Powerball方法，并分析了该方法在强凸函数中的收敛率。尽管理论上Powerball方法与梯度方法有相同的线性收敛率，但实验证明其在初始迭代中显著优于梯度下降和牛顿方法，尤其在多个真实数据集上提供了10倍的收敛加速。

1610.04391 2026-06-04 eess.SY cs.RO cs.SY math.OC

A guiding vector field algorithm for path following control of nonholonomic mobile robots

一种基于引导矢量场的路径跟随控制非holonomic移动机器人的算法

Yuri A. Kapitanyuk, Anton V. Proskurnikov, Ming Cao

发表机构 * Delft Center for Systems and Control at Delft University of Technology（代尔夫特理工大学系统与控制中心）

AI总结本文提出了一种基于引导矢量场（GVF）思想的非holonomic移动机器人路径跟随控制算法，该算法能够将任意光滑曲线作为期望路径，通过设计引导矢量场使其积分曲线收敛到轨迹，并通过非线性运动控制器使机器人沿该积分曲线运动，最终到达期望路径，同时通过实验验证了算法的全局收敛性和有效性。

Comments under review in IEEE Transactions on Control Systems Technology

1712.09379 2026-06-04 math.OC cs.DS cs.LG cs.NA math.NA stat.ML

IHT dies hard: Provable accelerated Iterative Hard Thresholding

IHT死守：可证明的加速迭代硬阈值法

Rajiv Khanna, Anastasios Kyrillidis

发表机构 * University of Texas at Austin（德克萨斯大学奥斯汀分校）； IBM T.J. Watson Research Center（IBM 沃森研究中心）

AI总结本文研究了在理论和实践中经典迭代硬阈值（IHT）方法中动量运动的使用，通过简单修改普通IHT，探讨了其在具有非凸约束的凸优化标准下的收敛行为，并观察到IHT的加速在投影梯度下降和Frank-Wolfe变体中带来了显著改进。

Comments accepted to AISTATS 2018

1905.05574 2026-06-04 cs.IT cs.DC cs.PF cs.RO cs.SY eess.SY math.IT

Coded Distributed Tracking

编码分布式追踪

Albin Severinson, Eirik Rosnes, Alexandre Graell i Amat

发表机构 * Department of Electrical Engineering, Chalmers University of Technology（电气工程系，查尔姆斯理工大学）

AI总结本文提出了一种云辅助的分布式追踪方案，利用编码分布式计算方法提高追踪的及时性和准确性，并通过MDS码进一步提升大规模更新间隔下的估计精度，同时揭示了信息年龄与估计精度之间的权衡。

Comments Accepted for publication at IEEE GLOBECOM 2019

详情

AI中文摘要

我们考虑在分布式环境中跟踪随时间演化的过程状态的问题，其中多个观察者各自观测状态的部分。我们提出了一种云辅助方案，其中追踪在云上进行。为了提供及时且准确的更新，并缓解云计算中的straggler问题，我们提出了一种编码分布式计算方法，其中编码的观测被分布到多个工作者上。所提出的方案基于一种编码版本的卡尔曼滤波器，该滤波器在使用擦除纠正码编码的数据上运行，使得状态可以从部分由子集工作者计算的更新中进行估计。我们将所提出的方法应用于跟踪多个车辆的问题。我们显示，复制实现了比相应未编码方案显著更高的精度。使用最大距离可分离（MDS）码进一步在较大的更新间隔下提高精度。在两种情况下，所提出的方案在更新间隔足够大时接近理想集中式方案的精度。最后，我们观察到MDS码在信息年龄和估计精度之间存在权衡。

英文摘要

We consider the problem of tracking the state of a process that evolves over time in a distributed setting, with multiple observers each observing parts of the state, which is a fundamental information processing problem with a wide range of applications. We propose a cloud-assisted scheme where the tracking is performed over the cloud. In particular, to provide timely and accurate updates, and alleviate the straggler problem of cloud computing, we propose a coded distributed computing approach where coded observations are distributed over multiple workers. The proposed scheme is based on a coded version of the Kalman filter that operates on data encoded with an erasure correcting code, such that the state can be estimated from partial updates computed by a subset of the workers. We apply the proposed scheme to the problem of tracking multiple vehicles. We show that replication achieves significantly higher accuracy than the corresponding uncoded scheme. The use of maximum distance separable (MDS) codes further improves accuracy for larger update intervals. In both cases, the proposed scheme approaches the accuracy of an ideal centralized scheme when the update interval is large enough. Finally, we observe a trade-off between age-of-information and estimation accuracy for MDS codes.

URL PDF HTML ☆

赞 0 踩 0

1903.12605 2026-06-04 eess.SY cs.RO cs.SY

Stable, Concurrent Controller Composition for Multi-Objective Robotic Tasks

多目标机器人任务的稳定并发控制器组合

Anqi Li, Ching-An Cheng, Byron Boots, Magnus Egerstedt

发表机构 * Institute for Robotics and Intelligent Machines, Georgia Institute of Technology（机器人与智能机器研究所，佐治亚理工学院）

AI总结本文提出了一种稳定且并发的控制器组合方法，用于多目标机器人任务，通过分解任务为子任务并独立设计子任务控制器，再利用RMPflow框架结合生成整体控制策略，同时通过CLF分析确保系统稳定性。

Comments The 58th IEEE Conference on Decision and Control (CDC), 2019

详情

AI中文摘要

机器人系统往往需要同时考虑多个任务。这一挑战要求控制器合成算法能够在满足多个控制规范的同时保持系统稳定性。本文将多目标任务分解为子任务，其中每个子任务控制器独立设计，然后结合生成整体控制策略。特别地，我们采用最近提出的机器人控制器结构Riemannian Motion Policies（RMPs），以及其相关的计算框架RMPflow，用于结合RMP控制器。我们通过严格的控制Lyapunov函数（CLF）处理重新建立并扩展了RMPflow的稳定性结果。然后我们证明RMPflow能够稳定地结合满足一定CLF约束的独立设计的子任务控制器。这一新见解导致了一种高效的基于CLF的计算框架，用于生成同时考虑所有子任务的稳定控制器。与原始使用RMPflow相比，我们的框架为用户提供了一种通过名义控制器将设计启发式融入子任务的灵活性。我们通过数值模拟和机器人实现验证了所提出的计算框架。

英文摘要

Robotic systems often need to consider multiple tasks concurrently. This challenge calls for controller synthesis algorithms that fulfill multiple control specifications while maintaining the stability of the overall system. In this paper, we decompose multi-objective tasks into subtasks, where individual subtask controllers are designed independently and then combined to generate the overall control policy. In particular, we adopt Riemannian Motion Policies (RMPs), a recently proposed controller structure in robotics, and, RMPflow, its associated computational framework for combining RMP controllers. We re-establish and extend the stability results of RMPflow through a rigorous Control Lyapunov Function (CLF) treatment. We then show that RMPflow can stably combine individually designed subtask controllers that satisfy certain CLF constraints. This new insight leads to an efficient CLF-based computational framework to generate stable controllers that consider all the subtasks simultaneously. Compared with the original usage of RMPflow, our framework provides users the flexibility to incorporate design heuristics through nominal controllers for the subtasks. We validate the proposed computational framework through numerical simulation and robotic implementation.

URL PDF HTML ☆

赞 0 踩 0