arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 8085
1212.4507 2026-06-03 stat.ML cs.LG cs.NA math.NA

Variational Optimization

变分优化

Joe Staines, David Barber

AI总结 本文提出一种通用技术,通过构造可微边界来优化不可微或离散目标函数,并应用于稀疏学习与支持向量分类。

详情
AI中文摘要

我们讨论了一种通用技术,可用于形成不可微或离散目标函数最优值的可微边界。我们形成了这些方法的统一描述,并考虑了在何种情况下该边界是凹的。特别地,我们考虑了该方法的两个具体应用,即稀疏学习和支持向量分类。

英文摘要

We discuss a general technique that can be used to form a differentiable bound on the optima of non-differentiable or discrete objective functions. We form a unified description of these methods and consider under which circumstances the bound is concave. In particular we consider two concrete applications of the method, namely sparse learning and support vector classification.

1212.1735 2026-06-03 math.OC cs.AI cs.NI cs.SY eess.SY

Towards Design of System Hierarchy (research survey)

系统层次结构设计(研究综述)

Mark Sh. Levin

AI总结 本文综述了树状和层次系统结构的设计/构建框架,包括基于专家的方法、层次聚类、生成树问题、组织最优层次设计、多层k连通网络设计以及层次/网络的修改,并考虑组合优化问题。

Comments 36 pages, 41 figures, 9 tables

详情
AI中文摘要

本文讨论了某些树状和层次系统结构的设计/构建框架。考察了以下方法:(1)基于专家的程序;(2)层次聚类;(3)生成树问题(例如,最小生成树、最小斯坦纳树、最大叶子生成树问题);(4)组织“最优”层次设计;(5)多层(例如,三层)k连通网络设计;(6)层次或网络的修改:(i)通过合并相邻节点修改树,(ii)热链接分配,(iii)将树转换为斯坦纳树,(iv)重构作为将初始结构解修改为最接近目标解且考虑修改成本的解。组合优化问题被视为基本问题(例如,分类、背包问题、多选问题、分配问题)。一些数值示例说明了所提出的问题和求解框架。

英文摘要

The paper addresses design/building frameworks for some kinds of tree-like and hierarchical structures of systems. The following approaches are examined: (1) expert-based procedures, (2) hierarchical clustering; (3) spanning problems (e.g., minimum spanning tree, minimum Steiner tree, maximum leaf spanning tree problem; (4) design of organizational 'optimal' hierarchies; (5) design of multi-layer (e.g., three-layer) k-connected network; (6) modification of hierarchies or networks: (i) modification of tree via condensing of neighbor nodes, (ii) hotlink assignment, (iii) transformation of tree into Steiner tree, (iv) restructuring as modification of an initial structural solution into a solution that is the most close to a goal solution while taking into account a cost of the modification. Combinatorial optimization problems are considered as basic ones (e.g., classification, knapsack problem, multiple choice problem, assignment problem). Some numerical examples illustrate the suggested problems and solving frameworks.

1211.7369 2026-06-03 stat.ML cs.LG cs.NA math.NA

Approximate Rank-Detecting Factorization of Low-Rank Tensors

低秩张量的近似秩检测分解

Franz J. Király, Andreas Ziehe

AI总结 提出AROFAC2算法,通过检测三阶张量的CP秩并分解为秩一分量,具有内在检测真实秩、避免虚假分量、对异常值和非高斯噪声鲁棒的优势。

详情
AI中文摘要

我们提出了一种算法AROFAC2,该算法能够检测三阶张量的(CP-)秩并将其分解为秩一分量。我们给出了算法有效的生成条件,并在合成数据和真实世界数据上证明,AROFAC2是黄金标准PARAFAC的潜在更优替代方案,其优势在于能够内在检测真实秩、避免虚假分量,并且对异常值和非高斯噪声具有稳定性。

英文摘要

We present an algorithm, AROFAC2, which detects the (CP-)rank of a degree 3 tensor and calculates its factorization into rank-one components. We provide generative conditions for the algorithm to work and demonstrate on both synthetic and real world data that AROFAC2 is a potentially outperforming alternative to the gold standard PARAFAC over which it has the advantages that it can intrinsically detect the true rank, avoids spurious components, and is stable with respect to outliers and non-Gaussian noise.

1211.5414 2026-06-03 cs.DS cs.LG cs.NA math.NA stat.ML

Analysis of a randomized approximation scheme for matrix multiplication

矩阵乘法的随机近似方案分析

Daniel Hsu, Sham M. Kakade, Tong Zhang

AI总结 本文分析了Sarlos (2006)提出的基于随机旋转和均匀列采样的矩阵乘法随机近似方案,利用矩阵Bernstein不等式和次高斯随机向量二次型的尾部不等式给出简单分析。

详情
AI中文摘要

本文对Sarlos (2006)提出的基于随机旋转后均匀列采样的矩阵乘法随机近似方案进行了简单分析。结果来自于矩阵版本的Bernstein不等式以及次高斯随机向量中二次型的尾部不等式。

英文摘要

This note gives a simple analysis of a randomized approximation scheme for matrix multiplication proposed by Sarlos (2006) based on a random rotation followed by uniform column sampling. The result follows from a matrix version of Bernstein's inequality and a tail inequality for quadratic forms in subgaussian random vectors.

1103.1417 2026-06-03 math.ST cs.LG cs.SY eess.SY math.OC math.PR stat.TH

Localization from Incomplete Noisy Distance Measurements

基于不完整含噪距离测量的定位

Adel Javanmard, Andrea Montanari

AI总结 针对含噪部分距离测量下的欧氏空间点云定位问题,提出基于半定规划的算法,并刻画其在随机几何图模型下的性能边界。

Comments 46 pages, 8 figures, numerical experiments added. Journal version (v1,v2: Conference versions, ISIT 2011); Journal of Foundations of Computational Mathematics, 2012

详情
AI中文摘要

我们考虑在欧氏空间 $\mathbb{R}^d$ 中利用部分成对距离的含噪测量来定位点云的问题。该任务在传感器网络定位和从NMR测量重建蛋白质构象等领域有应用。此外,它与降维问题和流形学习密切相关,后者的目标是通过局部(或部分)度量信息学习数据集的潜在全局几何结构。本文提出一种基于半定规划的重建算法。对于随机几何图模型和一致有界噪声,我们精确刻画了算法的性能:在无噪声情况下,我们找到一个半径 $r_0$,超过该半径算法能重建精确位置(直至刚性变换)。在存在噪声的情况下,我们得到的重建误差上下界仅相差一个依赖于维度 $d$ 和图中节点平均度的因子。

英文摘要

We consider the problem of positioning a cloud of points in the Euclidean space $\mathbb{R}^d$, using noisy measurements of a subset of pairwise distances. This task has applications in various areas, such as sensor network localization and reconstruction of protein conformations from NMR measurements. Also, it is closely related to dimensionality reduction problems and manifold learning, where the goal is to learn the underlying global geometry of a data set using local (or partial) metric information. Here we propose a reconstruction algorithm based on semidefinite programming. For a random geometric graph model and uniformly bounded noise, we provide a precise characterization of the algorithm's performance: In the noiseless case, we find a radius $r_0$ beyond which the algorithm reconstructs the exact positions (up to rigid transformations). In the presence of noise, we obtain upper and lower bounds on the reconstruction error that match up to a factor that depends only on the dimension $d$, and the average degree of the nodes in the graph.

1211.4038 2026-06-03 eess.SY cs.RO cs.SY math.OC

Stochastic receding horizon control of nonlinear stochastic systems with probabilistic state constraints

具有概率状态约束的非线性随机系统的随机滚动时域控制

Shridhar K. Shah, Herbert G. Tanner, Chetan D. Pahlajani

AI总结 针对受概率状态约束的连续时间随机非线性系统,提出一种将滚动时域参考路径设计与随机最优控制器相结合的实时可实施控制框架,并证明无控制输入约束下的闭环收敛性。

Comments Draft of submission to IEEE Transactions of Automatic Control

详情
AI中文摘要

本文描述了一种针对受概率状态约束的连续时间随机非线性系统的滚动时域控制设计框架。其目标是推导出可在当前移动处理器上实时实现的解决方案。该方法将问题分解为基于系统动力学漂移分量设计滚动时域参考路径,然后实施随机最优控制器以使系统保持接近并跟随参考路径。在某些情况下,随机最优控制器可以闭式获得;在更一般的情况下,预计算的数值解可以实时实现,无需在线计算。假设控制输入无约束,建立了闭环系统的收敛性,并提供了仿真结果以验证理论预测。

英文摘要

The paper describes a receding horizon control design framework for continuous-time stochastic nonlinear systems subject to probabilistic state constraints. The intention is to derive solutions that are implementable in real-time on currently available mobile processors. The approach consists of decomposing the problem into designing receding horizon reference paths based on the drift component of the system dynamics, and then implementing a stochastic optimal controller to allow the system to stay close and follow the reference path. In some cases, the stochastic optimal controller can be obtained in closed form; in more general cases, pre-computed numerical solutions can be implemented in real-time without the need for on-line computation. The convergence of the closed loop system is established assuming no constraints on control inputs, and simulation results are provided to corroborate the theoretical predictions.

1209.5805 2026-06-03 eess.SY cs.RO cs.SY math.OC

Memoryless Control Design for Persistent Surveillance under Safety Constraints

安全约束下持久监视的无记忆控制设计

Eduardo Arvelo, Eric Kim, Nuno C. Martins

AI总结 针对有限二维网格中移动机器人的持久监视问题,提出一种基于熵最大化原理的有限参数凸规划方法,设计时间不变无记忆控制策略,在避免进入禁止区域的同时最大化被持久监视的状态数。

详情
AI中文摘要

本文研究在存在禁止区域的有限二维网格中移动的机器人的时间不变无记忆控制策略设计,这些机器人被任务为持久监视该区域。我们将每个机器人建模为一个受控马尔可夫链,其状态包括在网格中的位置和运动方向。目标是找到最少数量的机器人和相关的时间不变无记忆控制策略,保证在不访问禁止状态的情况下持久监视最大数量的状态。我们提出了一种基于熵最大化原理的有限参数凸规划设计方法。提供了数值示例。

英文摘要

This paper deals with the design of time-invariant memoryless control policies for robots that move in a finite two- dimensional lattice and are tasked with persistent surveillance of an area in which there are forbidden regions. We model each robot as a controlled Markov chain whose state comprises its position in the lattice and the direction of motion. The goal is to find the minimum number of robots and an associated time-invariant memoryless control policy that guarantees that the largest number of states are persistently surveilled without ever visiting a forbidden state. We propose a design method that relies on a finitely parametrized convex program inspired by entropy maximization principles. Numerical examples are provided.

1211.0056 2026-06-03 math.OC cs.LG cs.NA math.NA stat.CO stat.ML

Iterative Hard Thresholding Methods for $l_0$ Regularized Convex Cone Programming

$l_0$ 正则化凸锥规划问题的迭代硬阈值方法

Zhaosong Lu

AI总结 提出迭代硬阈值方法及其变体求解 $l_0$ 正则化凸锥规划,证明收敛到局部极小点并建立迭代复杂度。

Comments 25 pages

详情
AI中文摘要

本文考虑 $l_0$ 正则化凸锥规划问题。具体地,我们首先提出一种迭代硬阈值(IHT)方法及其变体,用于求解 $l_0$ 正则化箱约束凸规划。我们证明这些方法生成的序列收敛到局部极小点。同时,我们建立了 IHT 方法寻找 $\epsilon$-局部最优解的迭代复杂度。然后,我们通过将 IHT 方法应用于二次罚松弛,提出一种求解 $l_0$ 正则化凸锥规划的方法,并建立其寻找 $\epsilon$-近似局部极小解的迭代复杂度。最后,我们提出该方法的变体,其中相关的罚参数动态更新,并证明每个聚点是问题的局部极小点。

英文摘要

In this paper we consider $l_0$ regularized convex cone programming problems. In particular, we first propose an iterative hard thresholding (IHT) method and its variant for solving $l_0$ regularized box constrained convex programming. We show that the sequence generated by these methods converges to a local minimizer. Also, we establish the iteration complexity of the IHT method for finding an $ε$-local-optimal solution. We then propose a method for solving $l_0$ regularized convex cone programming by applying the IHT method to its quadratic penalty relaxation and establish its iteration complexity for finding an $ε$-approximate local minimizer. Finally, we propose a variant of this method in which the associated penalty parameter is dynamically updated, and show that every accumulation point is a local minimizer of the problem.

1202.5298 2026-06-03 eess.SY cs.LG cs.SY

Min Max Generalization for Two-stage Deterministic Batch Mode Reinforcement Learning: Relaxation Schemes

两阶段确定性批量模式强化学习的最小最大泛化:松弛方案

Raphael Fonteneau, Damien Ernst, Bernard Boigelot, Quentin Louveaux

AI总结 针对确定性批量模式强化学习中的最小最大优化问题,提出两种松弛方案(约束丢弃和拉格朗日对偶化)以降低计算复杂度,并证明其优于现有方法。

详情
AI中文摘要

我们研究了[22]中提出的用于计算确定性环境下批量模式强化学习策略的最小最大优化问题。首先,我们证明该问题是NP难的。在两阶段情况下,我们提供了两种松弛方案。第一种松弛方案通过丢弃一些约束来获得一个可在多项式时间内求解的问题。第二种松弛方案基于拉格朗日松弛,将所有约束对偶化,得到一个圆锥二次规划问题。我们还从理论上证明并实验说明,两种松弛方案均能提供比[22]中更好的结果。

英文摘要

We study the minmax optimization problem introduced in [22] for computing policies for batch mode reinforcement learning in a deterministic setting. First, we show that this problem is NP-hard. In the two-stage case, we provide two relaxation schemes. The first relaxation scheme works by dropping some constraints in order to obtain a problem that is solvable in polynomial time. The second relaxation scheme, based on a Lagrangian relaxation where all constraints are dualized, leads to a conic quadratic programming problem. We also theoretically prove and empirically illustrate that both relaxation schemes provide better results than those given in [22].

1210.4231 2026-06-03 eess.SY cs.AI cs.SY

An example illustrating the imprecision of the efficient approach for diagnosis of Petri nets via integer linear programming

一个说明通过整数线性规划诊断Petri网的高效方法不精确性的例子

Alban Grastien

AI总结 本文通过反例证明,即使系统是可诊断的,基于整数线性规划的Petri网高效诊断方法也可能无法检测到故障。

Comments 3 pages

详情
AI中文摘要

本文证明,即使系统是可诊断的,通过整数线性规划诊断Petri网的高效方法也可能无法检测到故障。

英文摘要

This document demonstrates that the efficient approach for diagnosis of Petri nets via integer linear programming may be unable to detect a fault even if the system is diagnosable.

1210.4081 2026-06-03 math.NA cs.CV cs.DS cs.LG cs.NA math.OC

Getting Feasible Variable Estimates From Infeasible Ones: MRF Local Polytope Study

从不可行变量估计获得可行变量估计:MRF局部多面体研究

Bogdan Savchynskyy, Stefan Schmidt

AI总结 针对具有可分离性的大规模优化问题,提出一种从对偶解构造近似可行原始解的方法,并应用于马尔可夫随机场推理问题的局部多面体松弛,证明其优于现有方法。

Comments 20 page, 4 figures

详情
AI中文摘要

本文提出了一种方法,用于从对偶解构造具有特定可分离性的大规模优化问题的近似可行原始解。虽然通常可以从对偶函数的(次)梯度产生不可行的原始估计,但将其投影到原始可行集往往并不容易,因为投影本身的复杂度与初始问题的复杂度相当。我们提出了一种替代的有效方法来获得可行性,并证明了其影响收敛到最优性的性质与欧几里得投影的性质相似。我们将我们的方法应用于马尔可夫随机场推理问题的局部多面体松弛,并证明了其优于现有方法。

英文摘要

This paper proposes a method for construction of approximate feasible primal solutions from dual ones for large-scale optimization problems possessing certain separability properties. Whereas infeasible primal estimates can typically be produced from (sub-)gradients of the dual function, it is often not easy to project them to the primal feasible set, since the projection itself has a complexity comparable to the complexity of the initial problem. We propose an alternative efficient method to obtain feasibility and show that its properties influencing the convergence to the optimum are similar to the properties of the Euclidean projection. We apply our method to the local polytope relaxation of inference problems for Markov Random Fields and demonstrate its superiority over existing methods.

1107.3090 2026-06-03 cs.CC cs.LG cs.SY eess.SY math.OC

On the Computational Complexity of Stochastic Controller Optimization in POMDPs

关于POMDP中随机控制器优化的计算复杂度

Nikos Vlassis, Michael L. Littman, David Barber

AI总结 本文证明在马尔可夫决策过程中寻找最优随机“盲”控制器是NP难问题,相应的决策问题属于PSPACE且是SQRT-SUM难的,并指出POMDP中更一般的随机控制器优化问题也是NP难的,但存在一个凸的特殊情况可高效求解。

Comments Corrected error in the proof of Theorem 2, and revised Section 5

详情
AI中文摘要

我们证明了在马尔可夫决策过程中寻找最优随机“盲”控制器是一个NP难问题。相应的决策问题是NP难的、属于PSPACE且是SQRT-SUM难的,因此将其置于NP中将意味着计算机科学中长期未解难题的突破。我们的结果确立了POMDP中更一般的随机控制器优化问题也是NP难的。尽管如此,我们概述了一个凸的特殊情况,该情况允许高效的全局解。

英文摘要

We show that the problem of finding an optimal stochastic 'blind' controller in a Markov decision process is an NP-hard problem. The corresponding decision problem is NP-hard, in PSPACE, and SQRT-SUM-hard, hence placing it in NP would imply breakthroughs in long-standing open problems in computer science. Our result establishes that the more general problem of stochastic controller optimization in POMDPs is also NP-hard. Nonetheless, we outline a special case that is convex and admits efficient global solutions.

1210.0822 2026-06-03 math.NA cs.CV cs.NA

Discrete geodesic calculus in the space of viscous fluidic objects

粘性流体对象空间中的离散测地线计算

Martin Rumpf, Benedikt Wirth

AI总结 基于黎曼距离的局部近似,提出了一种时间离散的测地线计算方法,并应用于形状空间中的变形、外推和特征传递。

详情
AI中文摘要

基于流形上黎曼距离的局部近似(通过计算成本低的相异性度量),发展了一种时间离散的测地线计算,并探索了在形状空间中的应用。该相异性度量源自变形能量,其Hessian矩阵再现了底层的黎曼度量,并用于定义形状空间中离散路径的长度和能量。离散测地线定义为能量最小化路径,由此引出了离散对数映射、离散指数映射的变分定义以及时间离散的平行传输。这一新概念应用于形状空间,其中形状被视为由粘性材料构成的物理对象的边界轮廓。通过保持拓扑的形状变形、将局部形状变化作为路径生成器来表示形状空间中的路径、通过离散测地线流进行形状外推以及几何特征的传递,展示了该方法的灵活性和计算效率。

英文摘要

Based on a local approximation of the Riemannian distance on a manifold by a computationally cheap dissimilarity measure, a time discrete geodesic calculus is developed, and applications to shape space are explored. The dissimilarity measure is derived from a deformation energy whose Hessian reproduces the underlying Riemannian metric, and it is used to define length and energy of discrete paths in shape space. The notion of discrete geodesics defined as energy minimizing paths gives rise to a discrete logarithmic map, a variational definition of a discrete exponential map, and a time discrete parallel transport. This new concept is applied to a shape space in which shapes are considered as boundary contours of physical objects consisting of viscous material. The flexibility and computational efficiency of the approach is demonstrated for topology preserving shape morphing, the representation of paths in shape space via local shape variations as path generators, shape extrapolation via discrete geodesic flow, and the transfer of geometric features.

1209.5826 2026-06-03 math.NA cs.CV cs.NA

Refinability of splines from lattice Voronoi cells

来自格点Voronoi细胞的样条的可细化性

Jorg Peters

AI总结 本文提出简单准则,证明只有少数样条族(如箱样条和张量积样条)是可细化的,而六边形样条等不可细化样条在格点细化时近似误差可能增大。

详情
AI中文摘要

样条可以通过对格点的Voronoi细胞的指示函数进行卷积来构造。本文提出了简单的准则,表明只有少数这样的样条族是可细化的:本质上就是众所周知的箱样条和张量积样条。许多不可细化的构造包括六边形样条及其在非笛卡尔格点上的推广。一个例子展示了不可细化样条在格点细化时如何表现出增加的近似误差。

英文摘要

Splines can be constructed by convolving the indicator function of the Voronoi cell of a lattice. This paper presents simple criteria that imply that only a small subset of such spline families can be refined: essentially the well-known box splines and tensor-product splines. Among the many non-refinable constructions are hex-splines and their generalization to non-Cartesian lattices. An example shows how non-refinable splines can exhibit increased approximation error upon refinement of the lattice.

1206.4481 2026-06-03 math.NA cs.LG cs.NA

Parsimonious Mahalanobis Kernel for the Classification of High Dimensional Data

用于高维数据分类的简约马氏核

M. Fauvel, A. Villa, J. Chanussot, J. A. Benediktsson

AI总结 利用高维空间的空性,基于马氏距离提出一种简约核,通过高维判别分析模型估计信号和噪声子空间,实现稳定逆协方差矩阵,并在SVM框架下优化半径-间隔界,实验表明该核优于高斯核。

详情
AI中文摘要

本文考虑使用核方法对高维数据进行分类。利用高维空间的空性,提出了一种基于马氏距离的核。计算马氏距离需要协方差矩阵的逆。在高维空间中,估计的协方差矩阵是病态的,其逆不稳定或不可能。使用简约统计模型,即高维判别分析模型,为每个考虑的类别估计特定的信号和噪声子空间,使得类别特定协方差矩阵的逆显式且稳定,从而定义了简约马氏核。采用基于SVM的框架,通过优化所谓的半径-间隔界来选择简约马氏核的超参数。在三个高维数据集上的实验结果表明,所提出的核适用于高维数据分类,比传统高斯核提供更好的分类精度。

英文摘要

The classification of high dimensional data with kernel methods is considered in this article. Exploit- ing the emptiness property of high dimensional spaces, a kernel based on the Mahalanobis distance is proposed. The computation of the Mahalanobis distance requires the inversion of a covariance matrix. In high dimensional spaces, the estimated covariance matrix is ill-conditioned and its inversion is unstable or impossible. Using a parsimonious statistical model, namely the High Dimensional Discriminant Analysis model, the specific signal and noise subspaces are estimated for each considered class making the inverse of the class specific covariance matrix explicit and stable, leading to the definition of a parsimonious Mahalanobis kernel. A SVM based framework is used for selecting the hyperparameters of the parsimonious Mahalanobis kernel by optimizing the so-called radius-margin bound. Experimental results on three high dimensional data sets show that the proposed kernel is suitable for classifying high dimensional data, providing better classification accuracies than the conventional Gaussian kernel.

1203.4345 2026-06-03 eess.SY cs.AI cs.RO cs.SY stat.ML

Robust Filtering and Smoothing with Gaussian Processes

基于高斯过程的鲁棒滤波与平滑

Marc Peter Deisenroth, Ryan Turner, Marco F. Huber, Uwe D. Hanebeck, Carl Edward Rasmussen

AI总结 提出一种基于非参数高斯过程模型的非线性随机动态系统鲁棒贝叶斯滤波与平滑算法,通过解析平滑实现鲁棒性,数值实验表明在其它先进方法失效时仍保持稳健。

Comments 7 pages, 1 figure, draft version of paper accepted at IEEE Transactions on Automatic Control

详情
AI中文摘要

我们提出了一种原则性算法,用于在非线性随机动态系统中进行鲁棒贝叶斯滤波和平滑,其中转移函数和测量函数均由非参数高斯过程(GP)模型描述。在信号处理、机器学习、机器人和控制领域,GP通过后验概率分布表示未知系统函数,其重要性日益增加。这种现代的“系统辨识”方式比寻找参数函数表示的点估计更为鲁棒。在本文中,我们提出了一种原则性算法,用于在GP动态系统中进行鲁棒解析平滑,该系统在机器人和控制领域应用日益广泛。我们的数值评估表明,在其它最先进的高斯滤波器和平滑器可能失败的情况下,所提方法具有鲁棒性。

英文摘要

We propose a principled algorithm for robust Bayesian filtering and smoothing in nonlinear stochastic dynamic systems when both the transition function and the measurement function are described by non-parametric Gaussian process (GP) models. GPs are gaining increasing importance in signal processing, machine learning, robotics, and control for representing unknown system functions by posterior probability distributions. This modern way of "system identification" is more robust than finding point estimates of a parametric function representation. In this article, we present a principled algorithm for robust analytic smoothing in GP dynamic systems, which are increasingly used in robotics and control. Our numerical evaluations demonstrate the robustness of the proposed approach in situations where other state-of-the-art Gaussian filters and smoothers can fail.

1208.0864 2026-06-03 math.OC cs.LG cs.SY eess.SY

Statistical Results on Filtering and Epi-convergence for Learning-Based Model Predictive Control

基于学习的模型预测控制的滤波与上收敛统计结果

Anil Aswani, Humberto Gonzalez, S. Shankar Sastry, Claire Tomlin

AI总结 本文证明了基于学习的模型预测控制中测量模型选择的合理性,并给出了随机收敛性证明,同时证明了用于LBMPC的非参数估计器的统计性质。

详情
AI中文摘要

基于学习的模型预测控制(LBMPC)是一种提供鲁棒性确定性保证的技术,同时使用统计识别工具来识别更丰富的系统模型以提高性能。本技术说明提供了证明,阐明我们选择测量模型的原因,并给出了关于LBMPC随机收敛性的证明。第一部分讨论了可用常微分方程(ODE)描述的动力系统的同时状态估计和未建模动力学的统计识别(或学习)。第二部分提供了关于可与基于学习的模型预测控制(LBMPC)技术一起使用的不同统计估计器的上收敛的证明。特别地,我们证明了一种非参数估计器的统计性质,该估计器设计用于与LBMPC结合使用时具有正确的确定性和随机性数值实现性质。

英文摘要

Learning-based model predictive control (LBMPC) is a technique that provides deterministic guarantees on robustness, while statistical identification tools are used to identify richer models of the system in order to improve performance. This technical note provides proofs that elucidate the reasons for our choice of measurement model, as well as giving proofs concerning the stochastic convergence of LBMPC. The first part of this note discusses simultaneous state estimation and statistical identification (or learning) of unmodeled dynamics, for dynamical systems that can be described by ordinary differential equations (ODE's). The second part provides proofs concerning the epi-convergence of different statistical estimators that can be used with the learning-based model predictive control (LBMPC) technique. In particular, we prove results on the statistical properties of a nonparametric estimator that we have designed to have the correct deterministic and stochastic properties for numerical implementation when used in conjunction with LBMPC.

1107.2487 2026-06-03 math.OC cs.LG cs.SY eess.SY math.ST stat.TH

Provably Safe and Robust Learning-Based Model Predictive Control

可证明安全且鲁棒的基于学习的模型预测控制

Anil Aswani, Humberto Gonzalez, S. Shankar Sastry, Claire Tomlin

AI总结 提出一种基于学习的模型预测控制(LBMPC)方案,通过解耦安全与性能,利用统计学习改进性能并保证鲁棒性。

详情
AI中文摘要

控制器设计面临鲁棒性与性能之间的权衡,线性控制器的可靠性使得许多从业者关注前者。然而,为了应对日益增长的能源约束,提高系统性能重新引起兴趣。本文描述了一种基于学习的模型预测控制(LBMPC)方案,该方案提供鲁棒性的确定性保证,同时使用统计识别工具来识别更丰富的系统模型以提高性能;该框架的优点在于它处理状态和输入约束,根据成本函数优化系统性能,并且可以设计使用各种参数或非参数统计工具。LBMPC的主要见解是,在优化框架中,通过维护两个系统模型,可以在合理条件下解耦安全性和性能。第一个是具有不确定性界限的近似模型,第二个模型通过统计方法更新。LBMPC通过选择最小化成本的输入(受学习动力学约束)来提高性能,并通过检查这些相同的输入是否在不确定性下保持近似模型稳定来确保安全性和鲁棒性。此外,我们证明如果系统充分激励,则LBMPC控制动作概率收敛到使用真实动力学计算的MPC的控制动作。

英文摘要

Controller design faces a trade-off between robustness and performance, and the reliability of linear controllers has caused many practitioners to focus on the former. However, there is renewed interest in improving system performance to deal with growing energy constraints. This paper describes a learning-based model predictive control (LBMPC) scheme that provides deterministic guarantees on robustness, while statistical identification tools are used to identify richer models of the system in order to improve performance; the benefits of this framework are that it handles state and input constraints, optimizes system performance with respect to a cost function, and can be designed to use a wide variety of parametric or nonparametric statistical tools. The main insight of LBMPC is that safety and performance can be decoupled under reasonable conditions in an optimization framework by maintaining two models of the system. The first is an approximate model with bounds on its uncertainty, and the second model is updated by statistical methods. LBMPC improves performance by choosing inputs that minimize a cost subject to the learned dynamics, and it ensures safety and robustness by checking whether these same inputs keep the approximate model stable when it is subject to uncertainty. Furthermore, we show that if the system is sufficiently excited, then the LBMPC control action probabilistically converges to that of an MPC computed using the true dynamics.

1207.6051 2026-06-03 eess.SY cs.AI cs.SY math.OC

Composition of Modular Telemetry System with Interval Multiset Estimates

基于区间多集估计的模块化遥测系统组合

Mark Sh. Levin

AI总结 本文提出一种基于区间多集估计的组合综合方法,用于模块化遥测系统的建模、分析、设计和改进,通过分层形态多准则设计(HMMD)实现系统组件的多准则选择与合成。

Comments 9 pages, 9 figures, 6 tables

详情
AI中文摘要

本文描述了一种组合综合方法,该方法利用系统元素的区间多集估计来对模块化遥测系统进行建模、分析、设计和改进。形态(模块化)系统设计与改进被视为遥测系统元素(组件)配置的组合。求解过程基于分层形态多准则设计(HMMD):(i) 系统组件备选方案的多准则选择,(ii) 将所选备选方案合成为结果组合(同时考虑上述备选方案的质量及其兼容性)。使用区间多集估计来评估遥测系统元素的设计备选方案。还研究了两个附加系统问题:(a) 改进所获得的解,(b) 将所获得的解聚合成一个结果系统配置。改进和聚合过程基于具有区间多集估计的多重选择问题。通过一个机载遥测子系统的数值示例说明了设计和改进过程。

英文摘要

The paper describes combinatorial synthesis approach with interval multset estimates of system elements for modeling, analysis, design, and improvement of a modular telemetry system. Morphological (modular) system design and improvement are considered as composition of the telemetry system elements (components) configuration. The solving process is based on Hierarchical Morphological Multicriteria Design (HMMD): (i) multicriteria selection of alternatives for system components, (ii) synthesis of the selected alternatives into a resultant combination (while taking into account quality of the alternatives above and their compatibility). Interval multiset estimates are used for assessment of design alternatives for telemetry system elements. Two additional systems problems are examined: (a) improvement of the obtained solutions, (b) aggregation of the obtained solutions into a resultant system configuration. The improvement and aggregation processes are based on multiple choice problem with interval multiset estimates. Numerical examples for an on-board telemetry subsystem illustrate the design and improvement processes.

1207.3438 2026-06-03 stat.ML cs.LG cs.NA math.NA

MahNMF: Manhattan Non-negative Matrix Factorization

MahNMF: 曼哈顿非负矩阵分解

Naiyang Guan, Dacheng Tao, Zhigang Luo, John Shawe-Taylor

AI总结 针对重尾噪声和异常值问题,提出基于曼哈顿距离的MahNMF模型,并开发了秩一残差迭代和Nesterov平滑两种快速优化算法。

Comments 43 pages, 20 figures, 2 tables, submission to Journal of Machine Learning Research

详情
AI中文摘要

非负矩阵分解(NMF)通过两个非负低秩因子矩阵 $W$ 和 $H$ 的乘积来逼近非负矩阵 $X$。NMF 及其扩展通过最小化 $X$ 与 $W^T H$ 之间的 Kullback-Leibler 散度或欧氏距离来建模泊松噪声或高斯噪声。然而,当噪声分布具有重尾特性时,这些方法表现不佳。本文提出曼哈顿 NMF(MahNMF),通过最小化 $X$ 与 $W^T H$ 之间的曼哈顿距离来建模重尾拉普拉斯噪声。与稀疏和低秩矩阵分解类似,MahNMF 能够鲁棒地估计非负矩阵的低秩部分和稀疏部分,从而在数据受到异常值污染时有效工作。我们通过开发带盒约束的 MahNMF、流形正则化 MahNMF、组稀疏 MahNMF、弹性网诱导 MahNMF 和对称 MahNMF,将 MahNMF 扩展到各种实际应用。本文的主要贡献在于为 MahNMF 及其扩展提出了两种快速优化算法:秩一残差迭代(RRI)方法和 Nesterov 平滑方法。具体地,通过将 MahNMF 中的残差矩阵近似为 $W$ 的一行和 $H$ 的一行的外积,我们开发了 RRI 方法,以闭式解迭代更新 $W$ 和 $H$ 的每个变量。尽管 RRI 对于小规模 MahNMF 及其某些扩展是高效的,但它既不能扩展到大规模矩阵,也不够灵活以优化所有 MahNMF 扩展。由于 MahNMF 及其扩展的目标函数既非凸也不光滑,我们应用 Nesterov 平滑方法,在固定一个因子矩阵的情况下递归优化另一个因子矩阵。通过将平滑参数设置为与迭代次数成反比,我们逐步提高了 MahNMF 及其扩展的逼近精度。

英文摘要

Non-negative matrix factorization (NMF) approximates a non-negative matrix $X$ by a product of two non-negative low-rank factor matrices $W$ and $H$. NMF and its extensions minimize either the Kullback-Leibler divergence or the Euclidean distance between $X$ and $W^T H$ to model the Poisson noise or the Gaussian noise. In practice, when the noise distribution is heavy tailed, they cannot perform well. This paper presents Manhattan NMF (MahNMF) which minimizes the Manhattan distance between $X$ and $W^T H$ for modeling the heavy tailed Laplacian noise. Similar to sparse and low-rank matrix decompositions, MahNMF robustly estimates the low-rank part and the sparse part of a non-negative matrix and thus performs effectively when data are contaminated by outliers. We extend MahNMF for various practical applications by developing box-constrained MahNMF, manifold regularized MahNMF, group sparse MahNMF, elastic net inducing MahNMF, and symmetric MahNMF. The major contribution of this paper lies in two fast optimization algorithms for MahNMF and its extensions: the rank-one residual iteration (RRI) method and Nesterov's smoothing method. In particular, by approximating the residual matrix by the outer product of one row of W and one row of $H$ in MahNMF, we develop an RRI method to iteratively update each variable of $W$ and $H$ in a closed form solution. Although RRI is efficient for small scale MahNMF and some of its extensions, it is neither scalable to large scale matrices nor flexible enough to optimize all MahNMF extensions. Since the objective functions of MahNMF and its extensions are neither convex nor smooth, we apply Nesterov's smoothing method to recursively optimize one factor matrix with another matrix fixed. By setting the smoothing parameter inversely proportional to the iteration number, we improve the approximation accuracy iteratively for both MahNMF and its extensions.

1206.6474 2026-06-03 cs.DS cs.LG cs.NA math.NA stat.ML

Estimation of Simultaneously Sparse and Low Rank Matrices

同时稀疏和低秩矩阵的估计

Emile Richard, Pierre-Andre Savalle, Nicolas Vayatis

AI总结 本文提出一种凸混合惩罚方法,同时使用ℓ1范数和迹范数,以估计同时稀疏和低秩的矩阵,并推导了预言不等式和链接预测的泛化误差界,通过近端下降算法高效求解。

Comments Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012)

详情
AI中文摘要

本文介绍了一种惩罚矩阵估计过程,旨在同时实现稀疏和低秩的解。这种结构出现在社交网络或蛋白质相互作用的背景下,其中底层图的邻接矩阵在适当基下是块对角化的。我们引入了一种凸混合惩罚,同时涉及ℓ1范数和迹范数。我们得到了一个预言不等式,指示了两种效应如何根据目标矩阵的性质相互作用。我们界定了链接预测问题中的泛化误差。我们还开发了近端下降策略来高效求解优化问题,并在合成和真实数据集上评估了性能。

英文摘要

The paper introduces a penalized matrix estimation procedure aiming at solutions which are sparse and low-rank at the same time. Such structures arise in the context of social networks or protein interactions where underlying graphs have adjacency matrices which are block-diagonal in the appropriate basis. We introduce a convex mixed penalty which involves $\ell_1$-norm and trace norm simultaneously. We obtain an oracle inequality which indicates how the two effects interact according to the nature of the target matrix. We bound generalization error in the link prediction problem. We also develop proximal descent strategies to solve the optimization problem efficiently and evaluate performance on synthetic and real data sets.

1206.4640 2026-06-03 math.NA cs.LG cs.NA stat.ML

Stability of matrix factorization for collaborative filtering

协同过滤中矩阵分解的稳定性

Yu-Xiang Wang, Huan Xu

AI总结 研究矩阵分解算法在矩阵补全中对抗性噪声的稳定性,通过误差界、子空间分析和个体预测误差分析,为协同过滤系统设计提供指导。

Comments ICML2012

详情
AI中文摘要

我们研究了矩阵分解算法在矩阵补全中对抗性噪声的稳定性。具体地,我们的结果包括:(I)我们以均方根误差为度量,给出了分解方法解矩阵与真实值之间的差距的界;(II)我们将矩阵分解视为子空间拟合问题,并分析了求解子空间与真实子空间之间的差异;(III)我们基于子空间稳定性分析了单个用户的预测误差。我们将这些结果应用于操纵者攻击下的协同过滤问题,从而为协同过滤系统设计提供了有用的见解和指导。

英文摘要

We study the stability vis a vis adversarial noise of matrix factorization algorithm for matrix completion. In particular, our results include: (I) we bound the gap between the solution matrix of the factorization method and the ground truth in terms of root mean square error; (II) we treat the matrix factorization as a subspace fitting problem and analyze the difference between the solution subspace and the ground truth; (III) we analyze the prediction error of individual users based on the subspace stability. We apply these results to the problem of collaborative filtering under manipulator attack, which leads to useful insights and guidelines for collaborative filtering system design.

1206.4602 2026-06-03 math.NA cs.LG cs.NA stat.ML

Quasi-Newton Methods: A New Direction

拟牛顿方法:一个新方向

Philipp Hennig, Martin Kiefel

AI总结 本文通过将拟牛顿方法解释为贝叶斯线性回归的近似,揭示了经典算法的缺陷,并提出了一种新的非参数拟牛顿方法,在相似计算成本下更高效地利用信息。

Comments ICML2012

详情
AI中文摘要

在拟牛顿方法发明四十年后,它们仍然是无约束数值优化中的最先进技术。虽然通常不被这样解释,但这些是拟合目标函数的局部二次逼近的学习算法。我们表明,许多(包括最流行的)拟牛顿方法可以解释为在不同先验假设下贝叶斯线性回归的近似。这一新概念阐明了经典算法的一些缺陷,并为一种新颖的非参数拟牛顿方法指明了道路,该方法能够在与之前方法相似的计算成本下更有效地利用可用信息。

英文摘要

Four decades after their invention, quasi-Newton methods are still state of the art in unconstrained numerical optimization. Although not usually interpreted thus, these are learning algorithms that fit a local quadratic approximation to the objective function. We show that many, including the most popular, quasi-Newton methods can be interpreted as approximations of Bayesian linear regression under varying prior assumptions. This new notion elucidates some shortcomings of classical algorithms, and lights the way to a novel nonparametric quasi-Newton method, which is able to make more efficient use of available information at computational cost similar to its predecessors.

1206.2061 2026-06-03 math.NA cs.CV cs.NA

Comments on "On Approximating Euclidean Metrics by Weighted t-Cost Distances in Arbitrary Dimension"

关于“任意维度中通过加权t-代价距离逼近欧几里得度量”的评论

M. Emre Celebi, Hassan A. Kingravi, Fatih Celiker

AI总结 本文评论了Mukherjee提出的加权t-代价距离逼近欧几里得范数的方法,指出其在ℝⁿ中的平均误差过于乐观,并提出了改进精度的归一化方案。

Comments 7 pages, 1 figure, 3 tables. arXiv admin note: substantial text overlap with arXiv:1008.4870

详情
Journal ref
Pattern Recognition Letters 33 (2012) 1422--1425
AI中文摘要

Mukherjee(Pattern Recognition Letters, vol. 32, pp. 824-831, 2011)最近引入了一类称为加权t-代价距离的距离函数,它推广了m-邻域、八边形和t-代价距离。他证明了加权t-代价距离构成一个度量族,并推导了在$\mathbb{Z}^n$中欧几里得范数的近似。在本注释中,我们将此近似与先前提出的两种欧几里得范数近似进行比较,并证明Mukherjee给出的经验平均误差在$\mathbb{R}^n$中显著乐观。我们还提出了一种简单的归一化方案,该方案在平均相对误差和最大相对误差方面都显著提高了其近似的精度。

英文摘要

Mukherjee (Pattern Recognition Letters, vol. 32, pp. 824-831, 2011) recently introduced a class of distance functions called weighted t-cost distances that generalize m-neighbor, octagonal, and t-cost distances. He proved that weighted t-cost distances form a family of metrics and derived an approximation for the Euclidean norm in $\mathbb{Z}^n$. In this note we compare this approximation to two previously proposed Euclidean norm approximations and demonstrate that the empirical average errors given by Mukherjee are significantly optimistic in $\mathbb{R}^n$. We also propose a simple normalization scheme that improves the accuracy of his approximation substantially with respect to both average and maximum relative errors.

1109.2363 2026-06-03 stat.AP cs.RO cs.SY eess.SY math.OC

Sensor Management: Past, Present, and Future

传感器管理:过去、现在与未来

Alfred O. Hero, Douglas Cochran

AI总结 本文综述了传感器管理的理论、算法和应用,涵盖其发展历程和当前现状,并展望未来方向。

Comments 15 pages, 112 references

详情
Journal ref
IEEE Sensors Journal, vol. 11, issue 12, pp. 3064-3075, December 2011
AI中文摘要

传感器系统通常在资源约束下运行,这些约束阻止了所有资源同时使用。当传感系统具有主动管理这些资源的能力时,即能够在部署期间根据先前的测量改变其运行配置,传感器管理就变得相关。当前或近期可能使用传感器管理的系统示例包括自主机器人、监视和侦察网络以及波形捷变雷达。本文概述了传感器管理的理论、算法和应用,如其过去几十年发展至今的状况。

英文摘要

Sensor systems typically operate under resource constraints that prevent the simultaneous use of all resources all of the time. Sensor management becomes relevant when the sensing system has the capability of actively managing these resources; i.e., changing its operating configuration during deployment in reaction to previous measurements. Examples of systems in which sensor management is currently used or is likely to be used in the near future include autonomous robots, surveillance and reconnaissance networks, and waveform-agile radars. This paper provides an overview of the theory, algorithms, and applications of sensor management as it has developed over the past decades and as it stands today.

1008.5373 2026-06-03 math.OC cs.LG cs.NA cs.SY eess.SY math.NA q-fin.CP q-fin.ST

Penalty Decomposition Methods for Rank Minimization

秩最小化的罚分解方法

Zhaosong Lu, Yong Zhang

AI总结 本文提出罚分解方法求解目标或约束中含秩的秩最小化问题,通过块坐标下降法求解子问题,并证明序列聚点满足一阶最优性条件,在矩阵补全和最近低秩相关矩阵问题上表现优于或持平现有方法。

Comments This paper has been withdrawn by the author

详情
AI中文摘要

本文考虑一般的秩最小化问题,其中秩出现在目标函数或约束中。我们首先建立一类特殊的秩最小化问题具有闭式解。利用这一结果,我们随后提出针对一般秩最小化问题的罚分解方法,其中每个子问题通过块坐标下降法求解。在适当假设下,我们证明由罚分解方法生成的序列的任何聚点都满足问题的非线性重构的一阶最优性条件。最后,我们将方法应用于矩阵补全和最近低秩相关矩阵问题以测试性能。计算结果表明,我们的方法在解的质量方面通常与现有方法相当或更优。

英文摘要

In this paper we consider general rank minimization problems with rank appearing in either objective function or constraint. We first establish that a class of special rank minimization problems has closed-form solutions. Using this result, we then propose penalty decomposition methods for general rank minimization problems in which each subproblem is solved by a block coordinate descend method. Under some suitable assumptions, we show that any accumulation point of the sequence generated by the penalty decomposition methods satisfies the first-order optimality conditions of a nonlinear reformulation of the problems. Finally, we test the performance of our methods by applying them to the matrix completion and nearest low-rank correlation matrix problems. The computational results demonstrate that our methods are generally comparable or superior to the existing methods in terms of solution quality.

1205.3997 2026-06-03 stat.ML cs.AI cs.GT cs.SY eess.SY

Free Energy and the Generalized Optimality Equations for Sequential Decision Making

自由能与序列决策的广义最优性方程

Pedro A. Ortega, Daniel A. Braun

AI总结 本文应用自由能原理到包含对抗和随机环境的通用决策树,推导出广义序列最优性方程,该方程包含Bellman最优性方程作为极限情况,并导出Expectimax、Minimax和Expectiminimax等决策规则,为每个节点分配资源参数以表达计算成本。

Comments 10 pages, 2 figures

详情
Journal ref
European Workshop on Reinforcement Learning 2012
AI中文摘要

自由能泛函最近被提出作为有界理性决策的变分原理,因为它实例化了效用增益与信息处理成本之间的自然权衡,并且可以从公理推导出来。这里我们将自由能原理应用于包含对抗和随机环境的通用决策树。我们推导出广义序列最优性方程,该方程不仅包含Bellman最优性方程作为极限情况,而且导出了众所周知的决策规则,如Expectimax、Minimax和Expectiminimax。我们展示了如何从单一的自由能原理推导出这些决策规则,该原理为决策树中的每个节点分配一个资源参数。这些资源参数表达了一个具体的计算成本,可以测量为从属于每个节点的分布所需的样本数量。因此,自由能原理为考虑对抗和随机环境的广义最优性方程提供了规范基础。

英文摘要

The free energy functional has recently been proposed as a variational principle for bounded rational decision-making, since it instantiates a natural trade-off between utility gains and information processing costs that can be axiomatically derived. Here we apply the free energy principle to general decision trees that include both adversarial and stochastic environments. We derive generalized sequential optimality equations that not only include the Bellman optimality equations as a limit case, but also lead to well-known decision-rules such as Expectimax, Minimax and Expectiminimax. We show how these decision-rules can be derived from a single free energy principle that assigns a resource parameter to each node in the decision tree. These resource parameters express a concrete computational cost that can be measured as the amount of samples that are needed from the distribution that belongs to each node. The free energy principle therefore provides the normative basis for generalized optimality equations that account for both adversarial and stochastic environments.

1008.5372 2026-06-03 math.OC cs.CV cs.IT cs.LG cs.NA math.IT math.NA stat.ME

Penalty Decomposition Methods for $L0$-Norm Minimization

L0-范数最小化的罚分解方法

Zhaosong Lu, Yong Zhang

AI总结 提出罚分解方法求解含L0-范数的优化问题,通过转化为秩最小化问题并利用向量化操作,在压缩感知等应用中优于现有方法。

Comments This paper has been withdrawn by the author because an updated version has been resubmitted

详情
AI中文摘要

本文考虑一般的l0-范数最小化问题,即目标函数或约束中出现l0-范数的问题。特别地,我们首先将l0-范数约束问题重新表述为等价的秩最小化问题,然后应用[33]中提出的罚分解(PD)方法求解后者。通过利用特殊结构,我们将该方法的所有矩阵运算转化为向量运算,得到仅涉及向量运算的PD方法。在适当的假设下,我们证明PD方法生成的序列的任何聚点满足一阶最优性条件,该条件通常比一个自然最优性条件更强。我们进一步扩展PD方法以求解目标函数中出现l0-范数的问题。最后,通过将PD方法应用于压缩感知、稀疏逻辑回归和稀疏逆协方差选择来测试其性能。计算结果表明,我们的方法在解质量和/或速度方面通常优于现有方法。

英文摘要

In this paper we consider general l0-norm minimization problems, that is, the problems with l0-norm appearing in either objective function or constraint. In particular, we first reformulate the l0-norm constrained problem as an equivalent rank minimization problem and then apply the penalty decomposition (PD) method proposed in [33] to solve the latter problem. By utilizing the special structures, we then transform all matrix operations of this method to vector operations and obtain a PD method that only involves vector operations. Under some suitable assumptions, we establish that any accumulation point of the sequence generated by the PD method satisfies a first-order optimality condition that is generally stronger than one natural optimality condition. We further extend the PD method to solve the problem with the l0-norm appearing in objective function. Finally, we test the performance of our PD methods by applying them to compressed sensing, sparse logistic regression and sparse inverse covariance selection. The computational results demonstrate that our methods generally outperform the existing methods in terms of solution quality and/or speed.

1205.2046 2026-06-03 eess.SY cs.AI cs.SY math.OC

Multiset Estimates and Combinatorial Synthesis

多重集估计与组合综合

Mark Sh. Levin

AI总结 本文提出基于多重集估计的序数评估方法,研究其运算(集成、邻近性、比较、聚合、对齐)及在组合综合(形态学方法、背包问题)中的应用。

Comments 30 pages, 24 figures, 10 tables

详情
AI中文摘要

本文探讨了一种基于将元素分配到序数量表上的备选方案序数评估方法。在考虑基本序数量表[1,2,...,l]的层级数和分配元素个数(例如1,2,3)的情况下,提出了评估问题的基本版本。得到的估计是多重集(或袋)(多重集的基数等于常数)。给出了所研究评估问题的尺度偏序集。提出了“区间多重集估计”。进一步,研究了多重集估计上的运算:(a) 多重集估计的集成,(b) 多重集估计的邻近性,(c) 多重集估计的比较,(d) 多重集估计的聚合,以及(e) 多重集估计的对齐。研究了基于形态学方法的组合综合,包括带有设计备选方案多重集估计的改进版本。还简要描述了带有多重集估计的背包类问题。通过数值例子说明了评估方法、多重集估计以及相应的组合问题。

英文摘要

The paper addresses an approach to ordinal assessment of alternatives based on assignment of elements into an ordinal scale. Basic versions of the assessment problems are formulated while taking into account the number of levels at a basic ordinal scale [1,2,...,l] and the number of assigned elements (e.g., 1,2,3). The obtained estimates are multisets (or bags) (cardinality of the multiset equals a constant). Scale-posets for the examined assessment problems are presented. 'Interval multiset estimates' are suggested. Further, operations over multiset estimates are examined: (a) integration of multiset estimates, (b) proximity for multiset estimates, (c) comparison of multiset estimates, (d) aggregation of multiset estimates, and (e) alignment of multiset estimates. Combinatorial synthesis based on morphological approach is examined including the modified version of the approach with multiset estimates of design alternatives. Knapsack-like problems with multiset estimates are briefly described as well. The assessment approach, multiset-estimates, and corresponding combinatorial problems are illustrated by numerical examples.

1108.5359 2026-06-03 math.NA cs.CV cs.NA

Solving Principal Component Pursuit in Linear Time via $l_1$ Filtering

通过 $l_1$ 滤波在线性时间内求解主成分追踪

Risheng Liu, Zhouchen Lin, Siming Wei, Zhixun Su

AI总结 提出一种名为 $l_1$ 滤波的算法,以 $O(r^2(m+n))$ 复杂度精确求解主成分追踪问题,实现线性时间内的核范数最小化,并具有高度可并行性。

详情
AI中文摘要

在过去的几十年中,从被破坏的观测数据中精确恢复内在数据结构(即鲁棒主成分分析,RPCA)引起了极大的兴趣,并在计算机视觉中找到了许多应用。最近,该问题被表述为从观测数据矩阵中恢复低秩分量和稀疏分量。已证明,在适当的条件下,该问题可以通过主成分追踪(PCP)精确求解,即最小化核范数和 $l_1$ 范数的组合。现有的求解 PCP 的方法大多需要对数据矩阵进行奇异值分解(SVD),导致计算复杂度高,从而阻碍了 RPCA 在超大规模计算机视觉问题中的应用。在本文中,我们提出了一种新颖的算法,称为 $l_1$ 滤波,用于以 $O(r^2(m+n))$ 的复杂度精确求解 PCP,其中 $m\times n$ 是数据矩阵的大小,$r$ 是要恢复矩阵的秩,假设远小于 $m$ 和 $n$。此外,$l_1$ 滤波是高度可并行的。它是第一个能够以线性时间(相对于数据大小)精确求解核范数最小化问题的算法。在合成数据和实际应用上的实验证明了 $l_1$ 滤波在速度上相对于最先进算法的巨大优势。

英文摘要

In the past decades, exactly recovering the intrinsic data structure from corrupted observations, which is known as robust principal component analysis (RPCA), has attracted tremendous interests and found many applications in computer vision. Recently, this problem has been formulated as recovering a low-rank component and a sparse component from the observed data matrix. It is proved that under some suitable conditions, this problem can be exactly solved by principal component pursuit (PCP), i.e., minimizing a combination of nuclear norm and $l_1$ norm. Most of the existing methods for solving PCP require singular value decompositions (SVD) of the data matrix, resulting in a high computational complexity, hence preventing the applications of RPCA to very large scale computer vision problems. In this paper, we propose a novel algorithm, called $l_1$ filtering, for \emph{exactly} solving PCP with an $O(r^2(m+n))$ complexity, where $m\times n$ is the size of data matrix and $r$ is the rank of the matrix to recover, which is supposed to be much smaller than $m$ and $n$. Moreover, $l_1$ filtering is \emph{highly parallelizable}. It is the first algorithm that can \emph{exactly} solve a nuclear norm minimization problem in \emph{linear time} (with respect to the data size). Experiments on both synthetic data and real applications testify to the great advantage of $l_1$ filtering in speed over state-of-the-art algorithms.