arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

1212.4507 2026-06-03 stat.ML cs.LG cs.NA math.NA

Variational Optimization

变分优化

Joe Staines, David Barber

AI总结本文提出一种通用技术，通过构造可微边界来优化不可微或离散目标函数，并应用于稀疏学习与支持向量分类。

1212.1735 2026-06-03 math.OC cs.AI cs.NI cs.SY eess.SY

Towards Design of System Hierarchy (research survey)

系统层次结构设计（研究综述）

Mark Sh. Levin

AI总结本文综述了树状和层次系统结构的设计/构建框架，包括基于专家的方法、层次聚类、生成树问题、组织最优层次设计、多层k连通网络设计以及层次/网络的修改，并考虑组合优化问题。

Comments 36 pages, 41 figures, 9 tables

详情

AI中文摘要

本文讨论了某些树状和层次系统结构的设计/构建框架。考察了以下方法：（1）基于专家的程序；（2）层次聚类；（3）生成树问题（例如，最小生成树、最小斯坦纳树、最大叶子生成树问题）；（4）组织“最优”层次设计；（5）多层（例如，三层）k连通网络设计；（6）层次或网络的修改：（i）通过合并相邻节点修改树，（ii）热链接分配，（iii）将树转换为斯坦纳树，（iv）重构作为将初始结构解修改为最接近目标解且考虑修改成本的解。组合优化问题被视为基本问题（例如，分类、背包问题、多选问题、分配问题）。一些数值示例说明了所提出的问题和求解框架。

英文摘要

The paper addresses design/building frameworks for some kinds of tree-like and hierarchical structures of systems. The following approaches are examined: (1) expert-based procedures, (2) hierarchical clustering; (3) spanning problems (e.g., minimum spanning tree, minimum Steiner tree, maximum leaf spanning tree problem; (4) design of organizational 'optimal' hierarchies; (5) design of multi-layer (e.g., three-layer) k-connected network; (6) modification of hierarchies or networks: (i) modification of tree via condensing of neighbor nodes, (ii) hotlink assignment, (iii) transformation of tree into Steiner tree, (iv) restructuring as modification of an initial structural solution into a solution that is the most close to a goal solution while taking into account a cost of the modification. Combinatorial optimization problems are considered as basic ones (e.g., classification, knapsack problem, multiple choice problem, assignment problem). Some numerical examples illustrate the suggested problems and solving frameworks.

URL PDF HTML ☆

赞 0 踩 0

1211.7369 2026-06-03 stat.ML cs.LG cs.NA math.NA

Approximate Rank-Detecting Factorization of Low-Rank Tensors

低秩张量的近似秩检测分解

Franz J. Király, Andreas Ziehe

AI总结提出AROFAC2算法，通过检测三阶张量的CP秩并分解为秩一分量，具有内在检测真实秩、避免虚假分量、对异常值和非高斯噪声鲁棒的优势。

1211.5414 2026-06-03 cs.DS cs.LG cs.NA math.NA stat.ML

Analysis of a randomized approximation scheme for matrix multiplication

矩阵乘法的随机近似方案分析

Daniel Hsu, Sham M. Kakade, Tong Zhang

AI总结本文分析了Sarlos (2006)提出的基于随机旋转和均匀列采样的矩阵乘法随机近似方案，利用矩阵Bernstein不等式和次高斯随机向量二次型的尾部不等式给出简单分析。

1103.1417 2026-06-03 math.ST cs.LG cs.SY eess.SY math.OC math.PR stat.TH

Localization from Incomplete Noisy Distance Measurements

基于不完整含噪距离测量的定位

Adel Javanmard, Andrea Montanari

AI总结针对含噪部分距离测量下的欧氏空间点云定位问题，提出基于半定规划的算法，并刻画其在随机几何图模型下的性能边界。

Comments 46 pages, 8 figures, numerical experiments added. Journal version (v1,v2: Conference versions, ISIT 2011); Journal of Foundations of Computational Mathematics, 2012

详情

DOI: 10.1007/s10208-012-9129-5

AI中文摘要

我们考虑在欧氏空间 $\mathbb{R}^d$ 中利用部分成对距离的含噪测量来定位点云的问题。该任务在传感器网络定位和从NMR测量重建蛋白质构象等领域有应用。此外，它与降维问题和流形学习密切相关，后者的目标是通过局部（或部分）度量信息学习数据集的潜在全局几何结构。本文提出一种基于半定规划的重建算法。对于随机几何图模型和一致有界噪声，我们精确刻画了算法的性能：在无噪声情况下，我们找到一个半径 $r_0$，超过该半径算法能重建精确位置（直至刚性变换）。在存在噪声的情况下，我们得到的重建误差上下界仅相差一个依赖于维度 $d$ 和图中节点平均度的因子。

英文摘要

We consider the problem of positioning a cloud of points in the Euclidean space $\mathbb{R}^d$, using noisy measurements of a subset of pairwise distances. This task has applications in various areas, such as sensor network localization and reconstruction of protein conformations from NMR measurements. Also, it is closely related to dimensionality reduction problems and manifold learning, where the goal is to learn the underlying global geometry of a data set using local (or partial) metric information. Here we propose a reconstruction algorithm based on semidefinite programming. For a random geometric graph model and uniformly bounded noise, we provide a precise characterization of the algorithm's performance: In the noiseless case, we find a radius $r_0$ beyond which the algorithm reconstructs the exact positions (up to rigid transformations). In the presence of noise, we obtain upper and lower bounds on the reconstruction error that match up to a factor that depends only on the dimension $d$, and the average degree of the nodes in the graph.

URL PDF HTML ☆

赞 0 踩 0

1211.4038 2026-06-03 eess.SY cs.RO cs.SY math.OC

Stochastic receding horizon control of nonlinear stochastic systems with probabilistic state constraints

具有概率状态约束的非线性随机系统的随机滚动时域控制

Shridhar K. Shah, Herbert G. Tanner, Chetan D. Pahlajani

AI总结针对受概率状态约束的连续时间随机非线性系统，提出一种将滚动时域参考路径设计与随机最优控制器相结合的实时可实施控制框架，并证明无控制输入约束下的闭环收敛性。

Comments Draft of submission to IEEE Transactions of Automatic Control

1209.5805 2026-06-03 eess.SY cs.RO cs.SY math.OC

Memoryless Control Design for Persistent Surveillance under Safety Constraints

安全约束下持久监视的无记忆控制设计

Eduardo Arvelo, Eric Kim, Nuno C. Martins

AI总结针对有限二维网格中移动机器人的持久监视问题，提出一种基于熵最大化原理的有限参数凸规划方法，设计时间不变无记忆控制策略，在避免进入禁止区域的同时最大化被持久监视的状态数。

1211.0056 2026-06-03 math.OC cs.LG cs.NA math.NA stat.CO stat.ML

Iterative Hard Thresholding Methods for $l_0$ Regularized Convex Cone Programming

$l_0$ 正则化凸锥规划问题的迭代硬阈值方法

Zhaosong Lu

AI总结提出迭代硬阈值方法及其变体求解 $l_0$ 正则化凸锥规划，证明收敛到局部极小点并建立迭代复杂度。

Comments 25 pages

1202.5298 2026-06-03 eess.SY cs.LG cs.SY

Min Max Generalization for Two-stage Deterministic Batch Mode Reinforcement Learning: Relaxation Schemes

两阶段确定性批量模式强化学习的最小最大泛化：松弛方案

Raphael Fonteneau, Damien Ernst, Bernard Boigelot, Quentin Louveaux

AI总结针对确定性批量模式强化学习中的最小最大优化问题，提出两种松弛方案（约束丢弃和拉格朗日对偶化）以降低计算复杂度，并证明其优于现有方法。

1210.4231 2026-06-03 eess.SY cs.AI cs.SY

An example illustrating the imprecision of the efficient approach for diagnosis of Petri nets via integer linear programming

一个说明通过整数线性规划诊断Petri网的高效方法不精确性的例子

Alban Grastien

AI总结本文通过反例证明，即使系统是可诊断的，基于整数线性规划的Petri网高效诊断方法也可能无法检测到故障。

Comments 3 pages

1210.4081 2026-06-03 math.NA cs.CV cs.DS cs.LG cs.NA math.OC

Getting Feasible Variable Estimates From Infeasible Ones: MRF Local Polytope Study

从不可行变量估计获得可行变量估计：MRF局部多面体研究

Bogdan Savchynskyy, Stefan Schmidt

AI总结针对具有可分离性的大规模优化问题，提出一种从对偶解构造近似可行原始解的方法，并应用于马尔可夫随机场推理问题的局部多面体松弛，证明其优于现有方法。

Comments 20 page, 4 figures

1107.3090 2026-06-03 cs.CC cs.LG cs.SY eess.SY math.OC

On the Computational Complexity of Stochastic Controller Optimization in POMDPs

关于POMDP中随机控制器优化的计算复杂度

Nikos Vlassis, Michael L. Littman, David Barber

AI总结本文证明在马尔可夫决策过程中寻找最优随机“盲”控制器是NP难问题，相应的决策问题属于PSPACE且是SQRT-SUM难的，并指出POMDP中更一般的随机控制器优化问题也是NP难的，但存在一个凸的特殊情况可高效求解。

Comments Corrected error in the proof of Theorem 2, and revised Section 5

1210.0822 2026-06-03 math.NA cs.CV cs.NA

Discrete geodesic calculus in the space of viscous fluidic objects

粘性流体对象空间中的离散测地线计算

Martin Rumpf, Benedikt Wirth

AI总结基于黎曼距离的局部近似，提出了一种时间离散的测地线计算方法，并应用于形状空间中的变形、外推和特征传递。

详情

AI中文摘要

基于流形上黎曼距离的局部近似（通过计算成本低的相异性度量），发展了一种时间离散的测地线计算，并探索了在形状空间中的应用。该相异性度量源自变形能量，其Hessian矩阵再现了底层的黎曼度量，并用于定义形状空间中离散路径的长度和能量。离散测地线定义为能量最小化路径，由此引出了离散对数映射、离散指数映射的变分定义以及时间离散的平行传输。这一新概念应用于形状空间，其中形状被视为由粘性材料构成的物理对象的边界轮廓。通过保持拓扑的形状变形、将局部形状变化作为路径生成器来表示形状空间中的路径、通过离散测地线流进行形状外推以及几何特征的传递，展示了该方法的灵活性和计算效率。

英文摘要

Based on a local approximation of the Riemannian distance on a manifold by a computationally cheap dissimilarity measure, a time discrete geodesic calculus is developed, and applications to shape space are explored. The dissimilarity measure is derived from a deformation energy whose Hessian reproduces the underlying Riemannian metric, and it is used to define length and energy of discrete paths in shape space. The notion of discrete geodesics defined as energy minimizing paths gives rise to a discrete logarithmic map, a variational definition of a discrete exponential map, and a time discrete parallel transport. This new concept is applied to a shape space in which shapes are considered as boundary contours of physical objects consisting of viscous material. The flexibility and computational efficiency of the approach is demonstrated for topology preserving shape morphing, the representation of paths in shape space via local shape variations as path generators, shape extrapolation via discrete geodesic flow, and the transfer of geometric features.

URL PDF HTML ☆

赞 0 踩 0

1209.5826 2026-06-03 math.NA cs.CV cs.NA

Refinability of splines from lattice Voronoi cells

来自格点Voronoi细胞的样条的可细化性

Jorg Peters

AI总结本文提出简单准则，证明只有少数样条族（如箱样条和张量积样条）是可细化的，而六边形样条等不可细化样条在格点细化时近似误差可能增大。

1206.4481 2026-06-03 math.NA cs.LG cs.NA

Parsimonious Mahalanobis Kernel for the Classification of High Dimensional Data

用于高维数据分类的简约马氏核

M. Fauvel, A. Villa, J. Chanussot, J. A. Benediktsson

AI总结利用高维空间的空性，基于马氏距离提出一种简约核，通过高维判别分析模型估计信号和噪声子空间，实现稳定逆协方差矩阵，并在SVM框架下优化半径-间隔界，实验表明该核优于高斯核。

详情

AI中文摘要

本文考虑使用核方法对高维数据进行分类。利用高维空间的空性，提出了一种基于马氏距离的核。计算马氏距离需要协方差矩阵的逆。在高维空间中，估计的协方差矩阵是病态的，其逆不稳定或不可能。使用简约统计模型，即高维判别分析模型，为每个考虑的类别估计特定的信号和噪声子空间，使得类别特定协方差矩阵的逆显式且稳定，从而定义了简约马氏核。采用基于SVM的框架，通过优化所谓的半径-间隔界来选择简约马氏核的超参数。在三个高维数据集上的实验结果表明，所提出的核适用于高维数据分类，比传统高斯核提供更好的分类精度。

英文摘要

The classification of high dimensional data with kernel methods is considered in this article. Exploit- ing the emptiness property of high dimensional spaces, a kernel based on the Mahalanobis distance is proposed. The computation of the Mahalanobis distance requires the inversion of a covariance matrix. In high dimensional spaces, the estimated covariance matrix is ill-conditioned and its inversion is unstable or impossible. Using a parsimonious statistical model, namely the High Dimensional Discriminant Analysis model, the specific signal and noise subspaces are estimated for each considered class making the inverse of the class specific covariance matrix explicit and stable, leading to the definition of a parsimonious Mahalanobis kernel. A SVM based framework is used for selecting the hyperparameters of the parsimonious Mahalanobis kernel by optimizing the so-called radius-margin bound. Experimental results on three high dimensional data sets show that the proposed kernel is suitable for classifying high dimensional data, providing better classification accuracies than the conventional Gaussian kernel.

URL PDF HTML ☆

赞 0 踩 0

1203.4345 2026-06-03 eess.SY cs.AI cs.RO cs.SY stat.ML

Robust Filtering and Smoothing with Gaussian Processes

基于高斯过程的鲁棒滤波与平滑

Marc Peter Deisenroth, Ryan Turner, Marco F. Huber, Uwe D. Hanebeck, Carl Edward Rasmussen

AI总结提出一种基于非参数高斯过程模型的非线性随机动态系统鲁棒贝叶斯滤波与平滑算法，通过解析平滑实现鲁棒性，数值实验表明在其它先进方法失效时仍保持稳健。

Comments 7 pages, 1 figure, draft version of paper accepted at IEEE Transactions on Automatic Control

详情

DOI: 10.1109/TAC.2011.2179426

AI中文摘要

我们提出了一种原则性算法，用于在非线性随机动态系统中进行鲁棒贝叶斯滤波和平滑，其中转移函数和测量函数均由非参数高斯过程（GP）模型描述。在信号处理、机器学习、机器人和控制领域，GP通过后验概率分布表示未知系统函数，其重要性日益增加。这种现代的“系统辨识”方式比寻找参数函数表示的点估计更为鲁棒。在本文中，我们提出了一种原则性算法，用于在GP动态系统中进行鲁棒解析平滑，该系统在机器人和控制领域应用日益广泛。我们的数值评估表明，在其它最先进的高斯滤波器和平滑器可能失败的情况下，所提方法具有鲁棒性。

英文摘要

We propose a principled algorithm for robust Bayesian filtering and smoothing in nonlinear stochastic dynamic systems when both the transition function and the measurement function are described by non-parametric Gaussian process (GP) models. GPs are gaining increasing importance in signal processing, machine learning, robotics, and control for representing unknown system functions by posterior probability distributions. This modern way of "system identification" is more robust than finding point estimates of a parametric function representation. In this article, we present a principled algorithm for robust analytic smoothing in GP dynamic systems, which are increasingly used in robotics and control. Our numerical evaluations demonstrate the robustness of the proposed approach in situations where other state-of-the-art Gaussian filters and smoothers can fail.

URL PDF HTML ☆

赞 0 踩 0

1208.0864 2026-06-03 math.OC cs.LG cs.SY eess.SY

Statistical Results on Filtering and Epi-convergence for Learning-Based Model Predictive Control

基于学习的模型预测控制的滤波与上收敛统计结果

Anil Aswani, Humberto Gonzalez, S. Shankar Sastry, Claire Tomlin

AI总结本文证明了基于学习的模型预测控制中测量模型选择的合理性，并给出了随机收敛性证明，同时证明了用于LBMPC的非参数估计器的统计性质。

详情

AI中文摘要

基于学习的模型预测控制（LBMPC）是一种提供鲁棒性确定性保证的技术，同时使用统计识别工具来识别更丰富的系统模型以提高性能。本技术说明提供了证明，阐明我们选择测量模型的原因，并给出了关于LBMPC随机收敛性的证明。第一部分讨论了可用常微分方程（ODE）描述的动力系统的同时状态估计和未建模动力学的统计识别（或学习）。第二部分提供了关于可与基于学习的模型预测控制（LBMPC）技术一起使用的不同统计估计器的上收敛的证明。特别地，我们证明了一种非参数估计器的统计性质，该估计器设计用于与LBMPC结合使用时具有正确的确定性和随机性数值实现性质。

英文摘要

Learning-based model predictive control (LBMPC) is a technique that provides deterministic guarantees on robustness, while statistical identification tools are used to identify richer models of the system in order to improve performance. This technical note provides proofs that elucidate the reasons for our choice of measurement model, as well as giving proofs concerning the stochastic convergence of LBMPC. The first part of this note discusses simultaneous state estimation and statistical identification (or learning) of unmodeled dynamics, for dynamical systems that can be described by ordinary differential equations (ODE's). The second part provides proofs concerning the epi-convergence of different statistical estimators that can be used with the learning-based model predictive control (LBMPC) technique. In particular, we prove results on the statistical properties of a nonparametric estimator that we have designed to have the correct deterministic and stochastic properties for numerical implementation when used in conjunction with LBMPC.

URL PDF HTML ☆

赞 0 踩 0

1107.2487 2026-06-03 math.OC cs.LG cs.SY eess.SY math.ST stat.TH

Provably Safe and Robust Learning-Based Model Predictive Control

可证明安全且鲁棒的基于学习的模型预测控制

Anil Aswani, Humberto Gonzalez, S. Shankar Sastry, Claire Tomlin

AI总结提出一种基于学习的模型预测控制（LBMPC）方案，通过解耦安全与性能，利用统计学习改进性能并保证鲁棒性。

详情

AI中文摘要

控制器设计面临鲁棒性与性能之间的权衡，线性控制器的可靠性使得许多从业者关注前者。然而，为了应对日益增长的能源约束，提高系统性能重新引起兴趣。本文描述了一种基于学习的模型预测控制（LBMPC）方案，该方案提供鲁棒性的确定性保证，同时使用统计识别工具来识别更丰富的系统模型以提高性能；该框架的优点在于它处理状态和输入约束，根据成本函数优化系统性能，并且可以设计使用各种参数或非参数统计工具。LBMPC的主要见解是，在优化框架中，通过维护两个系统模型，可以在合理条件下解耦安全性和性能。第一个是具有不确定性界限的近似模型，第二个模型通过统计方法更新。LBMPC通过选择最小化成本的输入（受学习动力学约束）来提高性能，并通过检查这些相同的输入是否在不确定性下保持近似模型稳定来确保安全性和鲁棒性。此外，我们证明如果系统充分激励，则LBMPC控制动作概率收敛到使用真实动力学计算的MPC的控制动作。

英文摘要

Controller design faces a trade-off between robustness and performance, and the reliability of linear controllers has caused many practitioners to focus on the former. However, there is renewed interest in improving system performance to deal with growing energy constraints. This paper describes a learning-based model predictive control (LBMPC) scheme that provides deterministic guarantees on robustness, while statistical identification tools are used to identify richer models of the system in order to improve performance; the benefits of this framework are that it handles state and input constraints, optimizes system performance with respect to a cost function, and can be designed to use a wide variety of parametric or nonparametric statistical tools. The main insight of LBMPC is that safety and performance can be decoupled under reasonable conditions in an optimization framework by maintaining two models of the system. The first is an approximate model with bounds on its uncertainty, and the second model is updated by statistical methods. LBMPC improves performance by choosing inputs that minimize a cost subject to the learned dynamics, and it ensures safety and robustness by checking whether these same inputs keep the approximate model stable when it is subject to uncertainty. Furthermore, we show that if the system is sufficiently excited, then the LBMPC control action probabilistically converges to that of an MPC computed using the true dynamics.

URL PDF HTML ☆

赞 0 踩 0

1207.6051 2026-06-03 eess.SY cs.AI cs.SY math.OC

Composition of Modular Telemetry System with Interval Multiset Estimates

基于区间多集估计的模块化遥测系统组合

Mark Sh. Levin

AI总结本文提出一种基于区间多集估计的组合综合方法，用于模块化遥测系统的建模、分析、设计和改进，通过分层形态多准则设计（HMMD）实现系统组件的多准则选择与合成。

Comments 9 pages, 9 figures, 6 tables

详情

AI中文摘要

本文描述了一种组合综合方法，该方法利用系统元素的区间多集估计来对模块化遥测系统进行建模、分析、设计和改进。形态（模块化）系统设计与改进被视为遥测系统元素（组件）配置的组合。求解过程基于分层形态多准则设计（HMMD）：(i) 系统组件备选方案的多准则选择，(ii) 将所选备选方案合成为结果组合（同时考虑上述备选方案的质量及其兼容性）。使用区间多集估计来评估遥测系统元素的设计备选方案。还研究了两个附加系统问题：(a) 改进所获得的解，(b) 将所获得的解聚合成一个结果系统配置。改进和聚合过程基于具有区间多集估计的多重选择问题。通过一个机载遥测子系统的数值示例说明了设计和改进过程。

英文摘要

The paper describes combinatorial synthesis approach with interval multset estimates of system elements for modeling, analysis, design, and improvement of a modular telemetry system. Morphological (modular) system design and improvement are considered as composition of the telemetry system elements (components) configuration. The solving process is based on Hierarchical Morphological Multicriteria Design (HMMD): (i) multicriteria selection of alternatives for system components, (ii) synthesis of the selected alternatives into a resultant combination (while taking into account quality of the alternatives above and their compatibility). Interval multiset estimates are used for assessment of design alternatives for telemetry system elements. Two additional systems problems are examined: (a) improvement of the obtained solutions, (b) aggregation of the obtained solutions into a resultant system configuration. The improvement and aggregation processes are based on multiple choice problem with interval multiset estimates. Numerical examples for an on-board telemetry subsystem illustrate the design and improvement processes.

URL PDF HTML ☆

赞 0 踩 0

1207.3438 2026-06-03 stat.ML cs.LG cs.NA math.NA

MahNMF: Manhattan Non-negative Matrix Factorization

MahNMF: 曼哈顿非负矩阵分解

Naiyang Guan, Dacheng Tao, Zhigang Luo, John Shawe-Taylor

AI总结针对重尾噪声和异常值问题，提出基于曼哈顿距离的MahNMF模型，并开发了秩一残差迭代和Nesterov平滑两种快速优化算法。

Comments 43 pages, 20 figures, 2 tables, submission to Journal of Machine Learning Research

详情

AI中文摘要

非负矩阵分解（NMF）通过两个非负低秩因子矩阵 $W$ 和 $H$ 的乘积来逼近非负矩阵 $X$。NMF 及其扩展通过最小化 $X$ 与 $W^T H$ 之间的 Kullback-Leibler 散度或欧氏距离来建模泊松噪声或高斯噪声。然而，当噪声分布具有重尾特性时，这些方法表现不佳。本文提出曼哈顿 NMF（MahNMF），通过最小化 $X$ 与 $W^T H$ 之间的曼哈顿距离来建模重尾拉普拉斯噪声。与稀疏和低秩矩阵分解类似，MahNMF 能够鲁棒地估计非负矩阵的低秩部分和稀疏部分，从而在数据受到异常值污染时有效工作。我们通过开发带盒约束的 MahNMF、流形正则化 MahNMF、组稀疏 MahNMF、弹性网诱导 MahNMF 和对称 MahNMF，将 MahNMF 扩展到各种实际应用。本文的主要贡献在于为 MahNMF 及其扩展提出了两种快速优化算法：秩一残差迭代（RRI）方法和 Nesterov 平滑方法。具体地，通过将 MahNMF 中的残差矩阵近似为 $W$ 的一行和 $H$ 的一行的外积，我们开发了 RRI 方法，以闭式解迭代更新 $W$ 和 $H$ 的每个变量。尽管 RRI 对于小规模 MahNMF 及其某些扩展是高效的，但它既不能扩展到大规模矩阵，也不够灵活以优化所有 MahNMF 扩展。由于 MahNMF 及其扩展的目标函数既非凸也不光滑，我们应用 Nesterov 平滑方法，在固定一个因子矩阵的情况下递归优化另一个因子矩阵。通过将平滑参数设置为与迭代次数成反比，我们逐步提高了 MahNMF 及其扩展的逼近精度。

英文摘要

Non-negative matrix factorization (NMF) approximates a non-negative matrix $X$ by a product of two non-negative low-rank factor matrices $W$ and $H$. NMF and its extensions minimize either the Kullback-Leibler divergence or the Euclidean distance between $X$ and $W^T H$ to model the Poisson noise or the Gaussian noise. In practice, when the noise distribution is heavy tailed, they cannot perform well. This paper presents Manhattan NMF (MahNMF) which minimizes the Manhattan distance between $X$ and $W^T H$ for modeling the heavy tailed Laplacian noise. Similar to sparse and low-rank matrix decompositions, MahNMF robustly estimates the low-rank part and the sparse part of a non-negative matrix and thus performs effectively when data are contaminated by outliers. We extend MahNMF for various practical applications by developing box-constrained MahNMF, manifold regularized MahNMF, group sparse MahNMF, elastic net inducing MahNMF, and symmetric MahNMF. The major contribution of this paper lies in two fast optimization algorithms for MahNMF and its extensions: the rank-one residual iteration (RRI) method and Nesterov's smoothing method. In particular, by approximating the residual matrix by the outer product of one row of W and one row of $H$ in MahNMF, we develop an RRI method to iteratively update each variable of $W$ and $H$ in a closed form solution. Although RRI is efficient for small scale MahNMF and some of its extensions, it is neither scalable to large scale matrices nor flexible enough to optimize all MahNMF extensions. Since the objective functions of MahNMF and its extensions are neither convex nor smooth, we apply Nesterov's smoothing method to recursively optimize one factor matrix with another matrix fixed. By setting the smoothing parameter inversely proportional to the iteration number, we improve the approximation accuracy iteratively for both MahNMF and its extensions.

URL PDF HTML ☆

赞 0 踩 0

1206.6474 2026-06-03 cs.DS cs.LG cs.NA math.NA stat.ML

Estimation of Simultaneously Sparse and Low Rank Matrices

同时稀疏和低秩矩阵的估计

Emile Richard, Pierre-Andre Savalle, Nicolas Vayatis

AI总结本文提出一种凸混合惩罚方法，同时使用ℓ1范数和迹范数，以估计同时稀疏和低秩的矩阵，并推导了预言不等式和链接预测的泛化误差界，通过近端下降算法高效求解。

Comments Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012)

1206.4640 2026-06-03 math.NA cs.LG cs.NA stat.ML

Stability of matrix factorization for collaborative filtering

协同过滤中矩阵分解的稳定性

Yu-Xiang Wang, Huan Xu

AI总结研究矩阵分解算法在矩阵补全中对抗性噪声的稳定性，通过误差界、子空间分析和个体预测误差分析，为协同过滤系统设计提供指导。

Comments ICML2012

1206.4602 2026-06-03 math.NA cs.LG cs.NA stat.ML

Quasi-Newton Methods: A New Direction

拟牛顿方法：一个新方向

Philipp Hennig, Martin Kiefel

AI总结本文通过将拟牛顿方法解释为贝叶斯线性回归的近似，揭示了经典算法的缺陷，并提出了一种新的非参数拟牛顿方法，在相似计算成本下更高效地利用信息。

Comments ICML2012

1206.2061 2026-06-03 math.NA cs.CV cs.NA

Comments on "On Approximating Euclidean Metrics by Weighted t-Cost Distances in Arbitrary Dimension"

关于“任意维度中通过加权t-代价距离逼近欧几里得度量”的评论

M. Emre Celebi, Hassan A. Kingravi, Fatih Celiker

AI总结本文评论了Mukherjee提出的加权t-代价距离逼近欧几里得范数的方法，指出其在ℝⁿ中的平均误差过于乐观，并提出了改进精度的归一化方案。

Comments 7 pages, 1 figure, 3 tables. arXiv admin note: substantial text overlap with arXiv:1008.4870

详情

DOI: 10.1016/j.patrec.2012.03.002
Journal ref: Pattern Recognition Letters 33 (2012) 1422--1425

AI中文摘要

Mukherjee（Pattern Recognition Letters, vol. 32, pp. 824-831, 2011）最近引入了一类称为加权t-代价距离的距离函数，它推广了m-邻域、八边形和t-代价距离。他证明了加权t-代价距离构成一个度量族，并推导了在$\mathbb{Z}^n$中欧几里得范数的近似。在本注释中，我们将此近似与先前提出的两种欧几里得范数近似进行比较，并证明Mukherjee给出的经验平均误差在$\mathbb{R}^n$中显著乐观。我们还提出了一种简单的归一化方案，该方案在平均相对误差和最大相对误差方面都显著提高了其近似的精度。

英文摘要

Mukherjee (Pattern Recognition Letters, vol. 32, pp. 824-831, 2011) recently introduced a class of distance functions called weighted t-cost distances that generalize m-neighbor, octagonal, and t-cost distances. He proved that weighted t-cost distances form a family of metrics and derived an approximation for the Euclidean norm in $\mathbb{Z}^n$. In this note we compare this approximation to two previously proposed Euclidean norm approximations and demonstrate that the empirical average errors given by Mukherjee are significantly optimistic in $\mathbb{R}^n$. We also propose a simple normalization scheme that improves the accuracy of his approximation substantially with respect to both average and maximum relative errors.

URL PDF HTML ☆

赞 0 踩 0

1109.2363 2026-06-03 stat.AP cs.RO cs.SY eess.SY math.OC

Sensor Management: Past, Present, and Future

传感器管理：过去、现在与未来

Alfred O. Hero, Douglas Cochran

AI总结本文综述了传感器管理的理论、算法和应用，涵盖其发展历程和当前现状，并展望未来方向。

Comments 15 pages, 112 references

1008.5373 2026-06-03 math.OC cs.LG cs.NA cs.SY eess.SY math.NA q-fin.CP q-fin.ST

Penalty Decomposition Methods for Rank Minimization

秩最小化的罚分解方法

Zhaosong Lu, Yong Zhang

AI总结本文提出罚分解方法求解目标或约束中含秩的秩最小化问题，通过块坐标下降法求解子问题，并证明序列聚点满足一阶最优性条件，在矩阵补全和最近低秩相关矩阵问题上表现优于或持平现有方法。

Comments This paper has been withdrawn by the author

1205.3997 2026-06-03 stat.ML cs.AI cs.GT cs.SY eess.SY

Free Energy and the Generalized Optimality Equations for Sequential Decision Making

自由能与序列决策的广义最优性方程

Pedro A. Ortega, Daniel A. Braun

AI总结本文应用自由能原理到包含对抗和随机环境的通用决策树，推导出广义序列最优性方程，该方程包含Bellman最优性方程作为极限情况，并导出Expectimax、Minimax和Expectiminimax等决策规则，为每个节点分配资源参数以表达计算成本。

Comments 10 pages, 2 figures

详情

Journal ref: European Workshop on Reinforcement Learning 2012

AI中文摘要

自由能泛函最近被提出作为有界理性决策的变分原理，因为它实例化了效用增益与信息处理成本之间的自然权衡，并且可以从公理推导出来。这里我们将自由能原理应用于包含对抗和随机环境的通用决策树。我们推导出广义序列最优性方程，该方程不仅包含Bellman最优性方程作为极限情况，而且导出了众所周知的决策规则，如Expectimax、Minimax和Expectiminimax。我们展示了如何从单一的自由能原理推导出这些决策规则，该原理为决策树中的每个节点分配一个资源参数。这些资源参数表达了一个具体的计算成本，可以测量为从属于每个节点的分布所需的样本数量。因此，自由能原理为考虑对抗和随机环境的广义最优性方程提供了规范基础。

英文摘要

The free energy functional has recently been proposed as a variational principle for bounded rational decision-making, since it instantiates a natural trade-off between utility gains and information processing costs that can be axiomatically derived. Here we apply the free energy principle to general decision trees that include both adversarial and stochastic environments. We derive generalized sequential optimality equations that not only include the Bellman optimality equations as a limit case, but also lead to well-known decision-rules such as Expectimax, Minimax and Expectiminimax. We show how these decision-rules can be derived from a single free energy principle that assigns a resource parameter to each node in the decision tree. These resource parameters express a concrete computational cost that can be measured as the amount of samples that are needed from the distribution that belongs to each node. The free energy principle therefore provides the normative basis for generalized optimality equations that account for both adversarial and stochastic environments.

URL PDF HTML ☆

赞 0 踩 0

1008.5372 2026-06-03 math.OC cs.CV cs.IT cs.LG cs.NA math.IT math.NA stat.ME

Penalty Decomposition Methods for $L0$-Norm Minimization

L0-范数最小化的罚分解方法

Zhaosong Lu, Yong Zhang

AI总结提出罚分解方法求解含L0-范数的优化问题，通过转化为秩最小化问题并利用向量化操作，在压缩感知等应用中优于现有方法。

Comments This paper has been withdrawn by the author because an updated version has been resubmitted

详情

AI中文摘要

本文考虑一般的l0-范数最小化问题，即目标函数或约束中出现l0-范数的问题。特别地，我们首先将l0-范数约束问题重新表述为等价的秩最小化问题，然后应用[33]中提出的罚分解（PD）方法求解后者。通过利用特殊结构，我们将该方法的所有矩阵运算转化为向量运算，得到仅涉及向量运算的PD方法。在适当的假设下，我们证明PD方法生成的序列的任何聚点满足一阶最优性条件，该条件通常比一个自然最优性条件更强。我们进一步扩展PD方法以求解目标函数中出现l0-范数的问题。最后，通过将PD方法应用于压缩感知、稀疏逻辑回归和稀疏逆协方差选择来测试其性能。计算结果表明，我们的方法在解质量和/或速度方面通常优于现有方法。

英文摘要

In this paper we consider general l0-norm minimization problems, that is, the problems with l0-norm appearing in either objective function or constraint. In particular, we first reformulate the l0-norm constrained problem as an equivalent rank minimization problem and then apply the penalty decomposition (PD) method proposed in [33] to solve the latter problem. By utilizing the special structures, we then transform all matrix operations of this method to vector operations and obtain a PD method that only involves vector operations. Under some suitable assumptions, we establish that any accumulation point of the sequence generated by the PD method satisfies a first-order optimality condition that is generally stronger than one natural optimality condition. We further extend the PD method to solve the problem with the l0-norm appearing in objective function. Finally, we test the performance of our PD methods by applying them to compressed sensing, sparse logistic regression and sparse inverse covariance selection. The computational results demonstrate that our methods generally outperform the existing methods in terms of solution quality and/or speed.

URL PDF HTML ☆

赞 0 踩 0

1205.2046 2026-06-03 eess.SY cs.AI cs.SY math.OC

Multiset Estimates and Combinatorial Synthesis

多重集估计与组合综合

Mark Sh. Levin

AI总结本文提出基于多重集估计的序数评估方法，研究其运算（集成、邻近性、比较、聚合、对齐）及在组合综合（形态学方法、背包问题）中的应用。

Comments 30 pages, 24 figures, 10 tables

详情

AI中文摘要

本文探讨了一种基于将元素分配到序数量表上的备选方案序数评估方法。在考虑基本序数量表[1,2,...,l]的层级数和分配元素个数（例如1,2,3）的情况下，提出了评估问题的基本版本。得到的估计是多重集（或袋）（多重集的基数等于常数）。给出了所研究评估问题的尺度偏序集。提出了“区间多重集估计”。进一步，研究了多重集估计上的运算：(a) 多重集估计的集成，(b) 多重集估计的邻近性，(c) 多重集估计的比较，(d) 多重集估计的聚合，以及(e) 多重集估计的对齐。研究了基于形态学方法的组合综合，包括带有设计备选方案多重集估计的改进版本。还简要描述了带有多重集估计的背包类问题。通过数值例子说明了评估方法、多重集估计以及相应的组合问题。

英文摘要

The paper addresses an approach to ordinal assessment of alternatives based on assignment of elements into an ordinal scale. Basic versions of the assessment problems are formulated while taking into account the number of levels at a basic ordinal scale [1,2,...,l] and the number of assigned elements (e.g., 1,2,3). The obtained estimates are multisets (or bags) (cardinality of the multiset equals a constant). Scale-posets for the examined assessment problems are presented. 'Interval multiset estimates' are suggested. Further, operations over multiset estimates are examined: (a) integration of multiset estimates, (b) proximity for multiset estimates, (c) comparison of multiset estimates, (d) aggregation of multiset estimates, and (e) alignment of multiset estimates. Combinatorial synthesis based on morphological approach is examined including the modified version of the approach with multiset estimates of design alternatives. Knapsack-like problems with multiset estimates are briefly described as well. The assessment approach, multiset-estimates, and corresponding combinatorial problems are illustrated by numerical examples.

URL PDF HTML ☆

赞 0 踩 0

1108.5359 2026-06-03 math.NA cs.CV cs.NA

Solving Principal Component Pursuit in Linear Time via $l_1$ Filtering

通过 $l_1$ 滤波在线性时间内求解主成分追踪

Risheng Liu, Zhouchen Lin, Siming Wei, Zhixun Su

AI总结提出一种名为 $l_1$ 滤波的算法，以 $O(r^2(m+n))$ 复杂度精确求解主成分追踪问题，实现线性时间内的核范数最小化，并具有高度可并行性。

详情

AI中文摘要

在过去的几十年中，从被破坏的观测数据中精确恢复内在数据结构（即鲁棒主成分分析，RPCA）引起了极大的兴趣，并在计算机视觉中找到了许多应用。最近，该问题被表述为从观测数据矩阵中恢复低秩分量和稀疏分量。已证明，在适当的条件下，该问题可以通过主成分追踪（PCP）精确求解，即最小化核范数和 $l_1$ 范数的组合。现有的求解 PCP 的方法大多需要对数据矩阵进行奇异值分解（SVD），导致计算复杂度高，从而阻碍了 RPCA 在超大规模计算机视觉问题中的应用。在本文中，我们提出了一种新颖的算法，称为 $l_1$ 滤波，用于以 $O(r^2(m+n))$ 的复杂度精确求解 PCP，其中 $m\times n$ 是数据矩阵的大小，$r$ 是要恢复矩阵的秩，假设远小于 $m$ 和 $n$。此外，$l_1$ 滤波是高度可并行的。它是第一个能够以线性时间（相对于数据大小）精确求解核范数最小化问题的算法。在合成数据和实际应用上的实验证明了 $l_1$ 滤波在速度上相对于最先进算法的巨大优势。

英文摘要

In the past decades, exactly recovering the intrinsic data structure from corrupted observations, which is known as robust principal component analysis (RPCA), has attracted tremendous interests and found many applications in computer vision. Recently, this problem has been formulated as recovering a low-rank component and a sparse component from the observed data matrix. It is proved that under some suitable conditions, this problem can be exactly solved by principal component pursuit (PCP), i.e., minimizing a combination of nuclear norm and $l_1$ norm. Most of the existing methods for solving PCP require singular value decompositions (SVD) of the data matrix, resulting in a high computational complexity, hence preventing the applications of RPCA to very large scale computer vision problems. In this paper, we propose a novel algorithm, called $l_1$ filtering, for \emph{exactly} solving PCP with an $O(r^2(m+n))$ complexity, where $m\times n$ is the size of data matrix and $r$ is the rank of the matrix to recover, which is supposed to be much smaller than $m$ and $n$. Moreover, $l_1$ filtering is \emph{highly parallelizable}. It is the first algorithm that can \emph{exactly} solve a nuclear norm minimization problem in \emph{linear time} (with respect to the data size). Experiments on both synthetic data and real applications testify to the great advantage of $l_1$ filtering in speed over state-of-the-art algorithms.

URL PDF HTML ☆

赞 0 踩 0