arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

1204.4202 2026-06-03 cs.AI cs.LG cs.NE cs.SY eess.SY

Fuzzy Dynamical Genetic Programming in XCSF

XCSF中的模糊动态遗传编程

Richard J. Preen, Larry Bull

AI总结研究在XCSF学习分类器系统中使用模糊动态遗传编程表示，通过异步模糊逻辑网络实现自适应性开放演化，解决连续值测试问题。

Comments 2 page GECCO 2011 poster paper

1211.7045 2026-06-03 cs.LG cs.NA math.NA math.OC q-bio.BM

Orientation Determination from Cryo-EM images Using Least Unsquared Deviation

使用最小未平方偏差从冷冻电镜图像确定方向

Lanhui Wang, Amit Singer, Zaiwen Wen

AI总结针对冷冻电镜单颗粒重构中方向未知的二维投影图像，提出基于最小未平方偏差的鲁棒全局自洽误差模型，通过半定松弛和谱范数约束/正则化求解，显著降低低共线检测率下的方向估计误差。

详情

AI中文摘要

冷冻电镜单颗粒重构的一个主要挑战是利用未知方向的二维投影图像建立可靠的三维初始模型。基于共线的方法无需额外几何信息即可估计方向。然而，当图像噪声水平过高导致共线检测率过低时，此类方法会失效。通过半定规划的凸松弛，得到了最小二乘全局自洽误差的近似。本文引入一种更鲁棒的全局自洽误差，并证明相应的优化问题可通过半定松弛求解。为了防止估计视角的人为聚类，我们进一步引入一个谱范数项，作为约束或正则化项添加到松弛的最小化问题中。所得问题通过交替方向乘子法或迭代重加权最小二乘过程求解。模拟和真实图像的数值实验表明，当共线检测率较低时，所提方法显著降低了方向估计误差。

英文摘要

A major challenge in single particle reconstruction from cryo-electron microscopy is to establish a reliable ab-initio three-dimensional model using two-dimensional projection images with unknown orientations. Common-lines based methods estimate the orientations without additional geometric information. However, such methods fail when the detection rate of common-lines is too low due to the high level of noise in the images. An approximation to the least squares global self consistency error was obtained using convex relaxation by semidefinite programming. In this paper we introduce a more robust global self consistency error and show that the corresponding optimization problem can be solved via semidefinite relaxation. In order to prevent artificial clustering of the estimated viewing directions, we further introduce a spectral norm term that is added as a constraint or as a regularization term to the relaxed minimization problem. The resulted problems are solved by using either the alternating direction method of multipliers or an iteratively reweighted least squares procedure. Numerical experiments with both simulated and real images demonstrate that the proposed methods significantly reduce the orientation estimation error when the detection rate of common-lines is low.

URL PDF HTML ☆

赞 0 踩 0

1304.2367 2026-06-03 cs.CV cs.AI cs.SY eess.SY

Utility-Based Control for Computer Vision

基于效用的计算机视觉控制

Tod S. Levitt, Thomas O. Binford, Gil J. Ettinger, Patrice Gelband

AI总结针对贝叶斯网络实现计算机视觉中的计算效率问题，提出通过最大化效用而非概率来控制视觉任务，以优化传感器信息收集和数据分析。

Comments Appears in Proceedings of the Fourth Conference on Uncertainty in Artificial Intelligence (UAI1988)

详情

AI中文摘要

在利用贝叶斯网络实现计算机视觉识别世界对象时，出现了几个关键问题。计算效率是驱动力。感知网络非常深，通常有十五层结构。图像很宽，例如，在512×512像素或更大的图像中，未指定数量的边缘可能出现在任何位置。为了提高效率，我们动态实例化观察到的对象的假设。网络不是固定的，而是在运行时逐步创建。世界对象假设的生成和识别模型的索引很重要，但本文不讨论[4,11]。这项工作旨在近期通过并行计算在雷达监视系统ADRIES[5,15]和工业零件识别系统SUCCESSOR[2]中实现。对于许多应用，视觉必须更快才能实用，因此有效控制机器视觉过程至关重要。感知操作可能扫描百万像素，并可能需要数分钟的计算时间。必须避免不必要的传感器动作和计算。并行计算在多个处理器能力级别上可用。用于高层视觉的并行分布式计算的潜力意味着分配非均匀计算。本文解决了基于贝叶斯概率模型的机器视觉系统中的任务控制问题。我们将控制与推理分离，以扩展先前的工作[3]，最大化效用而非概率。最大化效用允许采用感知策略，以有效收集传感器信息并分析传感器数据。本文展示了通过效用控制机器视觉以识别军事场景的结果。未来工作将将其扩展到SUCCESSOR的工业零件识别。

英文摘要

Several key issues arise in implementing computer vision recognition of world objects in terms of Bayesian networks. Computational efficiency is a driving force. Perceptual networks are very deep, typically fifteen levels of structure. Images are wide, e.g., an unspecified-number of edges may appear anywhere in an image 512 x 512 pixels or larger. For efficiency, we dynamically instantiate hypotheses of observed objects. The network is not fixed, but is created incrementally at runtime. Generation of hypotheses of world objects and indexing of models for recognition are important, but they are not considered here [4,11]. This work is aimed at near-term implementation with parallel computation in a radar surveillance system, ADRIES [5, 15], and a system for industrial part recognition, SUCCESSOR [2]. For many applications, vision must be faster to be practical and so efficiently controlling the machine vision process is critical. Perceptual operators may scan megapixels and may require minutes of computation time. It is necessary to avoid unnecessary sensor actions and computation. Parallel computation is available at several levels of processor capability. The potential for parallel, distributed computation for high-level vision means distributing non-homogeneous computations. This paper addresses the problem of task control in machine vision systems based on Bayesian probability models. We separate control and inference to extend the previous work [3] to maximize utility instead of probability. Maximizing utility allows adopting perceptual strategies for efficient information gathering with sensors and analysis of sensor data. Results of controlling machine vision via utility to recognize military situations are presented in this paper. Future work extends this to industrial part recognition for SUCCESSOR.

URL PDF HTML ☆

赞 0 踩 0

1204.1259 2026-06-03 cs.LG cs.IR cs.NA math.NA

Fast ALS-based tensor factorization for context-aware recommendation from implicit feedback

基于快速ALS的张量分解用于隐式反馈的上下文感知推荐

Balázs Hidasi, Domonkos Tikk

AI总结提出iTALS算法，利用基于ALS的张量分解方法线性扩展至非零元素，整合上下文信息（如季节性和序列模式），在隐式反馈数据集上显著提升推荐质量。

Comments Accepted for ECML/PKDD 2012, presented on 25th September 2012, Bristol, UK

详情

DOI: 10.1007/978-3-642-33486-3_5
Journal ref: Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II

AI中文摘要

尽管基于隐式反馈的推荐问题（仅用户历史可用而无评分）是实际应用中最典型的场景，但其研究远少于显式反馈情况。在显式情况下高效的最先进算法，若要保持可扩展性，无法直接转化为隐式情况。隐式反馈基准数据集很少，因此新想法通常基于显式基准进行实验。本文提出了一种通用的上下文感知隐式反馈推荐算法，称为iTALS。iTALS应用了一种快速的、基于ALS的张量分解学习方法，其规模与张量中非零元素数量呈线性关系。该方法还允许我们在保持计算效率的同时，将多样化的上下文信息融入模型。特别地，我们提出了iTALS的两种上下文感知实现变体。第一种融入季节性，能够区分不同时间间隔的用户行为。另一种将用户历史视为序列信息，能够识别特定物品组的典型使用模式，例如自动区分通常重复购买（收藏品、杂货）或一次性购买（家用电器）的产品类型或类别。在三个隐式数据集（两个专有数据集和Netflix数据集的隐式变体）上进行的实验表明，通过将上下文感知信息与我们的分解框架集成到最先进的隐式推荐算法中，推荐质量显著提高。

英文摘要

Albeit, the implicit feedback based recommendation problem - when only the user history is available but there are no ratings - is the most typical setting in real-world applications, it is much less researched than the explicit feedback case. State-of-the-art algorithms that are efficient on the explicit case cannot be straightforwardly transformed to the implicit case if scalability should be maintained. There are few if any implicit feedback benchmark datasets, therefore new ideas are usually experimented on explicit benchmarks. In this paper, we propose a generic context-aware implicit feedback recommender algorithm, coined iTALS. iTALS apply a fast, ALS-based tensor factorization learning method that scales linearly with the number of non-zero elements in the tensor. The method also allows us to incorporate diverse context information into the model while maintaining its computational efficiency. In particular, we present two such context-aware implementation variants of iTALS. The first incorporates seasonality and enables to distinguish user behavior in different time intervals. The other views the user history as sequential information and has the ability to recognize usage pattern typical to certain group of items, e.g. to automatically tell apart product types or categories that are typically purchased repetitively (collectibles, grocery goods) or once (household appliances). Experiments performed on three implicit datasets (two proprietary ones and an implicit variant of the Netflix dataset) show that by integrating context-aware information with our factorization framework into the state-of-the-art implicit recommender algorithm the recommendation quality improves significantly.

URL PDF HTML ☆

赞 0 踩 0

1303.4434 2026-06-03 cs.LG cs.NA math.NA stat.CO stat.ML

A General Iterative Shrinkage and Thresholding Algorithm for Non-convex Regularized Optimization Problems

非凸正则化优化问题的一般迭代收缩与阈值算法

Pinghua Gong, Changshui Zhang, Zhaosong Lu, Jianhua Huang, Jieping Ye

AI总结针对非凸稀疏诱导惩罚的优化问题，提出一种通用迭代收缩与阈值算法（GIST），通过近端算子闭式解和BB规则线搜索实现高效求解，并给出收敛性分析。

详情

AI中文摘要

非凸稀疏诱导惩罚近年来在稀疏学习中受到广泛关注。最近的理论研究表明，在若干稀疏学习场景中，非凸惩罚优于其凸对应物。然而，与非凸惩罚相关的非凸优化问题的求解仍然是一个重大挑战。一种常用方法是多阶段（MS）凸松弛（或DC规划），它将原始非凸问题松弛为一系列凸问题。这种方法通常不适用于大规模问题，因为其计算成本是求解单个凸问题的倍数。在本文中，我们提出了一种通用迭代收缩与阈值（GIST）算法，用于求解一大类非凸惩罚的非凸优化问题。GIST算法迭代求解一个近端算子问题，而该问题对于许多常用惩罚具有闭式解。在算法的每次外迭代中，我们使用由Barzilai-Borwein（BB）规则初始化的线搜索，以快速找到合适的步长。本文还给出了GIST算法的详细收敛性分析。通过在大规模数据集上的大量实验，证明了所提算法的效率。

英文摘要

Non-convex sparsity-inducing penalties have recently received considerable attentions in sparse learning. Recent theoretical investigations have demonstrated their superiority over the convex counterparts in several sparse learning settings. However, solving the non-convex optimization problems associated with non-convex penalties remains a big challenge. A commonly used approach is the Multi-Stage (MS) convex relaxation (or DC programming), which relaxes the original non-convex problem to a sequence of convex problems. This approach is usually not very practical for large-scale problems because its computational cost is a multiple of solving a single convex problem. In this paper, we propose a General Iterative Shrinkage and Thresholding (GIST) algorithm to solve the nonconvex optimization problem for a large class of non-convex penalties. The GIST algorithm iteratively solves a proximal operator problem, which in turn has a closed-form solution for many commonly used penalties. At each outer iteration of the algorithm, we use a line search initialized by the Barzilai-Borwein (BB) rule that allows finding an appropriate step size quickly. The paper also presents a detailed convergence analysis of the GIST algorithm. The efficiency of the proposed algorithm is demonstrated by extensive experiments on large-scale data sets.

URL PDF HTML ☆

赞 0 踩 0

1301.3527 2026-06-03 cs.LG cs.NA math.NA

Block Coordinate Descent for Sparse NMF

块坐标下降法用于稀疏非负矩阵分解

Vamsi K. Potluru, Sergey M. Plis, Jonathan Le Roux, Barak A. Pearlmutter, Vince D. Calhoun, Thomas P. Hayes

AI总结针对稀疏NMF问题，提出基于L1/L2混合范数的块坐标下降算法，在保持稀疏性的同时显著提升计算速度，适用于大规模数据集。

详情

AI中文摘要

非负矩阵分解（NMF）已成为数据分析中无处不在的工具。一个重要变体是稀疏NMF问题，当我们明确要求学习到的特征稀疏时就会出现。稀疏性的自然度量是L$_0$范数，但其优化是NP难的。混合范数（如L$_1$/L$_2$度量）已被证明能够基于这些度量需要满足的直观属性来稳健地建模稀疏性。这与计算上更便宜的替代方案（如普通L$_1$范数）形成对比。然而，当前为优化混合范数L$_1$/L$_2$而设计的算法速度较慢，并且已经提出了其他稀疏NMF的公式，例如基于L$_1$和L$_0$范数的公式。我们提出的算法允许我们在不牺牲计算时间的情况下解决混合范数稀疏约束。我们在真实世界数据集上的实验证据表明，与当前最先进的优化混合范数的求解器相比，我们的新算法速度快一个数量级，并且适用于大规模数据集。

英文摘要

Nonnegative matrix factorization (NMF) has become a ubiquitous tool for data analysis. An important variant is the sparse NMF problem which arises when we explicitly require the learnt features to be sparse. A natural measure of sparsity is the L$_0$ norm, however its optimization is NP-hard. Mixed norms, such as L$_1$/L$_2$ measure, have been shown to model sparsity robustly, based on intuitive attributes that such measures need to satisfy. This is in contrast to computationally cheaper alternatives such as the plain L$_1$ norm. However, present algorithms designed for optimizing the mixed norm L$_1$/L$_2$ are slow and other formulations for sparse NMF have been proposed such as those based on L$_1$ and L$_0$ norms. Our proposed algorithm allows us to solve the mixed norm sparsity constraints while not sacrificing computation time. We present experimental evidence on real-world datasets that shows our new algorithm performs an order of magnitude faster compared to the current state-of-the-art solvers optimizing the mixed norm and is suitable for large-scale datasets.

URL PDF HTML ☆

赞 0 踩 0

1303.1264 2026-06-03 cs.LG cs.NA math.NA

Discovery of factors in matrices with grades

带等级矩阵中的因子发现

Radim Belohlavek, Vilem Vychodil

AI总结提出一种针对有序数据矩阵的分解与因子分析方法，基于完全剩余格结构，利用几何洞察识别矩形子矩阵作为最优因子，并设计贪心近似算法实现少量因子的分解。

详情

AI中文摘要

我们提出了一种处理有序数据矩阵的分解与因子分析的方法。矩阵中的条目是对象（由行表示）满足属性（由列表示）的等级，例如图像红色的程度、产品具有给定特征的程度或一个人在测试中表现良好的程度。我们假设这些等级构成一个有界尺度，配备特定的聚合算子，并符合完全剩余格的结构。我们提出了一种贪心近似算法，用于在因子数量较小的限制下，将此类矩阵分解为两个带等级矩阵的乘积。我们的算法基于一个定理提供的几何洞察，该定理将特定的矩形子矩阵识别为分解的最优因子。这些因子对应于输入数据的形式概念，并允许对分解进行简单解释。我们提供了说明性示例和实验评估。

英文摘要

We present an approach to decomposition and factor analysis of matrices with ordinal data. The matrix entries are grades to which objects represented by rows satisfy attributes represented by columns, e.g. grades to which an image is red, a product has a given feature, or a person performs well in a test. We assume that the grades form a bounded scale equipped with certain aggregation operators and conforms to the structure of a complete residuated lattice. We present a greedy approximation algorithm for the problem of decomposition of such matrix in a product of two matrices with grades under the restriction that the number of factors be small. Our algorithm is based on a geometric insight provided by a theorem identifying particular rectangular-shaped submatrices as optimal factors for the decompositions. These factors correspond to formal concepts of the input data and allow an easy interpretation of the decomposition. We present illustrative examples and experimental evaluation.

URL PDF HTML ☆

赞 0 踩 0

1302.7283 2026-06-03 cs.LG cs.NA math.NA

Source Separation using Regularized NMF with MMSE Estimates under GMM Priors with Online Learning for The Uncertainties

基于GMM先验下MMSE估计的正则化NMF源分离及其不确定性在线学习

Emad M. Grais, Hakan Erdogan

AI总结提出一种在非负矩阵分解中引入高斯混合模型先验的最小均方误差估计正则化方法，用于单通道源分离，通过在线学习不确定性提升分离性能。

详情

AI中文摘要

我们提出了一种在非负矩阵分解（NMF）解上施加先验的新方法。所提出的算法可用于去噪或单通道源分离（SCSS）应用。NMF解被引导遵循源信号的高斯混合先验模型（GMM）下的最小均方误差（MMSE）估计。在SCSS应用中，观测到的混合信号的频谱被分解为每个源使用NMF训练的基向量的加权线性组合。在这项工作中，NMF分解权重矩阵被视为被失真算子扭曲的图像，该失真算子直接从观测信号中学习。然后，在GMM先验和对数正态分布失真下找到权重矩阵的MMSE估计，以改进NMF分解结果。MMSE估计被嵌入优化目标中，形成一个新的正则化NMF代价函数。本文推导了新目标的相应更新规则。实验结果表明，与不使用先验或使用其他先验模型的NMF相比，所提出的正则化NMF算法提高了源分离性能。

英文摘要

We propose a new method to enforce priors on the solution of the nonnegative matrix factorization (NMF). The proposed algorithm can be used for denoising or single-channel source separation (SCSS) applications. The NMF solution is guided to follow the Minimum Mean Square Error (MMSE) estimates under Gaussian mixture prior models (GMM) for the source signal. In SCSS applications, the spectra of the observed mixed signal are decomposed as a weighted linear combination of trained basis vectors for each source using NMF. In this work, the NMF decomposition weight matrices are treated as a distorted image by a distortion operator, which is learned directly from the observed signals. The MMSE estimate of the weights matrix under GMM prior and log-normal distribution for the distortion is then found to improve the NMF decomposition results. The MMSE estimate is embedded within the optimization objective to form a novel regularized NMF cost function. The corresponding update rules for the new objectives are derived in this paper. Experimental results show that, the proposed regularized NMF algorithm improves the source separation performance compared with using NMF without prior or with other prior models.

URL PDF HTML ☆

赞 0 踩 0

1301.7389 2026-06-03 cs.AI cs.SY eess.SY

Dealing with Uncertainty on the Initial State of a Petri Net

处理Petri网初始状态的不确定性

Iman Jarkass, Michele Rombaut

AI总结提出一种基于Dempster-Shafer理论的方法，利用传感器信息和Petri网模型，在初始状态未知的情况下确定动态系统的实际状态。

Comments Appears in Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI1998)

1301.6747 2026-06-03 cs.AI cs.SY eess.SY

Bayesian Control for Concentrating Mixed Nuclear Waste

混合核废料浓缩的贝叶斯控制

Robert L. Welch, Clayton Smith

AI总结提出一种基于条件高斯贝叶斯网络的批处理混合废料控制算法，网络在批处理阶段编译以实现对传感器输入的实时响应。

Comments Appears in Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence (UAI1999)

1301.6721 2026-06-03 cs.AI cs.SY eess.SY

Learning Finite-State Controllers for Partially Observable Environments

学习部分可观测环境的有限状态控制器

Nicolas Meuleau, Leonid Peshkin, Kee-Eung Kim, Leslie Pack Kaelbling

AI总结针对部分可观测马尔可夫决策过程，提出一种基于随机梯度下降的VAPS扩展算法，学习局部最优的有限状态自动机控制器，并通过实验验证其利用历史观测信息补偿不可观测性的能力。

Comments Appears in Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence (UAI1999)

详情

AI中文摘要

在完全可观测的马尔可夫决策过程（MDP）中，反应式（无记忆）策略是足够的，但对于部分可观测MDP的最优控制，通常需要某种形式的记忆。具有有限记忆的策略可以表示为有限状态自动机。在本文中，我们将Baird和Moore的VAPS算法扩展到学习一般有限状态自动机的问题。由于该算法执行随机梯度下降，可以证明它收敛到局部最优的有限状态控制器。我们提供了算法的细节，然后考虑在什么条件下随机梯度下降将优于精确梯度下降的问题。最后，我们通过实证结果比较了随机和精确梯度下降的性能，并展示了我们的算法从过去观测序列中提取有用信息以补偿每个时间步不可观测性的能力。

英文摘要

Reactive (memoryless) policies are sufficient in completely observable Markov decision processes (MDPs), but some kind of memory is usually necessary for optimal control of a partially observable MDP. Policies with finite memory can be represented as finite-state automata. In this paper, we extend Baird and Moore's VAPS algorithm to the problem of learning general finite-state automata. Because it performs stochastic gradient descent, this algorithm can be shown to converge to a locally optimal finite-state controller. We provide the details of the algorithm and then consider the question of under what conditions stochastic gradient descent will outperform exact gradient descent. We conclude with empirical results comparing the performance of stochastic and exact gradient descent, and showing the ability of our algorithm to extract the useful information contained in the sequence of past observations to compensate for the lack of observability at each time-step.

URL PDF HTML ☆

赞 0 踩 0

1301.3537 2026-06-03 cs.AI cs.NA math.NA

Learning Stable Group Invariant Representations with Convolutional Networks

使用卷积网络学习稳定的群不变表示

Joan Bruna, Arthur Szlam, Yann LeCun

AI总结本文通过卷积网络将不变性构建为稳定的群不变性，网络架构决定不变群，可训练滤波器系数刻画群作用，并探索深层卷积层通过群分解实现更抽象的不变表示。

Comments 4 pages

详情

AI中文摘要

变换群，如平移或旋转，有效地表达了在许多识别问题中观察到的部分变异性。群结构使得构建具有吸引人数学性质的不变信号表示成为可能，其中卷积与池化算子为输入的加性和几何扰动带来了稳定性。尽管物理变换群在图像和音频应用中无处不在，但它们并不能解释复杂信号类别的所有变异性。我们表明，深度卷积网络构建的不变性属性可以视为一种稳定的群不变性。网络布线架构决定了不变群，而可训练的滤波器系数刻画了群作用。我们给出了解释性示例，说明网络架构如何控制最终的不变群。我们还探讨了额外的卷积层通过群分解诱导更抽象、更强大的不变表示的原理。

英文摘要

Transformation groups, such as translations or rotations, effectively express part of the variability observed in many recognition problems. The group structure enables the construction of invariant signal representations with appealing mathematical properties, where convolutions, together with pooling operators, bring stability to additive and geometric perturbations of the input. Whereas physical transformation groups are ubiquitous in image and audio applications, they do not account for all the variability of complex signal classes. We show that the invariance properties built by deep convolutional networks can be cast as a form of stable group invariance. The network wiring architecture determines the invariance group, while the trainable filter coefficients characterize the group action. We give explanatory examples which illustrate how the network architecture controls the resulting invariance group. We also explore the principle by which additional convolutional layers induce a group factorization enabling more abstract, powerful invariant representations.

URL PDF HTML ☆

赞 0 踩 0

1301.2273 2026-06-03 cs.AI cs.SY eess.SY

Robust Combination of Local Controllers

局部控制器的鲁棒组合

Carlos E. Guestrin, Dirk Ormoneit

AI总结针对高维连续MDP规划问题，提出非参数化组合局部控制器的方法，并分别应用于随机最短路径和折扣MDP，前者保证高概率到达目标，后者通过鲁棒线性规划处理模型不确定性。

Comments Appears in Proceedings of the Seventeenth Conference on Uncertainty in Artificial Intelligence (UAI2001)

详情

AI中文摘要

规划问题是困难的，例如运动规划是PSPACE-hard的。在存在不确定性的情况下，这些问题更加困难。尽管马尔可夫决策过程（MDP）为此类问题提供了形式化框架，但求解高维连续MDP通常很困难，尤其是当动作和时间测量是连续的时候。幸运的是，问题特定知识使我们能够设计出局部表现良好的控制器，尽管没有全局保证。我们提出了一种非参数化组合局部控制器的方法，以获得全局良好的解。我们将此公式应用于两类问题：运动规划（随机最短路径）和折扣MDP。对于运动规划，我们认为通常的MDP最优性准则（期望成本）可能在实际中不相关。我们提出了一种替代方案：在机器人必须以高概率到达目标的约束下，寻找最小成本路径。对于这个问题，我们证明了多项式数量的样本足以获得高概率路径。对于折扣MDP，我们提出了一种明确处理模型不确定性的公式，即转移概率不完全已知时引入的问题。我们将该问题表述为一个鲁棒线性规划，直接纳入这种不确定性。

英文摘要

Planning problems are hard, motion planning, for example, isPSPACE-hard. Such problems are even more difficult in the presence of uncertainty. Although, Markov Decision Processes (MDPs) provide a formal framework for such problems, finding solutions to high dimensional continuous MDPs is usually difficult, especially when the actions and time measurements are continuous. Fortunately, problem-specific knowledge allows us to design controllers that are good locally, though having no global guarantees. We propose a method of nonparametrically combining local controllers to obtain globally good solutions. We apply this formulation to two types of problems : motion planning (stochastic shortest path) and discounted MDPs. For motion planning, we argue that usual MDP optimality criterion (expected cost) may not be practically relevant. Wepropose an alternative: finding the minimum cost path,subject to the constraint that the robot must reach the goal withhigh probability. For this problem, we prove that a polynomial number of samples is sufficient to obtain a high probability path. For discounted MDPs, we propose a formulation that explicitly deals with model uncertainty, i.e., the problem introduced when transition probabilities are not known exactly. We formulate the problem as a robust linear program which directly incorporates this type of uncertainty.

URL PDF HTML ☆

赞 0 踩 0

1301.0584 2026-06-03 cs.AI cs.LG cs.SY eess.SY

Decayed MCMC Filtering

衰减MCMC滤波

Bhaskara Marthi, Hanna Pasula, Stuart Russell, Yuval Peres

AI总结提出一种基于衰减MCMC的随机近似滤波算法，通过偏向翻转近期状态变量的提议分布对状态轨迹进行采样，并证明在观测序列增长时收敛时间有界，实验表明与粒子滤波等算法性能相当。

Comments Appears in Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence (UAI2002)

详情

AI中文摘要

滤波——从观测序列中估计部分可观测马尔可夫过程的状态——是控制理论、人工智能和计算统计学中研究最广泛的问题之一。对于大型离散系统和非线性连续系统，后验分布的精确计算通常是难以处理的，因此大量工作致力于开发鲁棒的近似算法。本文描述了一种简单的随机近似滤波算法，称为衰减MCMC。该算法对状态轨迹空间应用马尔可夫链蒙特卡罗采样，使用偏向翻转较新状态变量的提议分布。该算法的形式化分析涉及MCMC收敛的标准耦合论证的推广。我们证明，对于任何遍历的底层马尔可夫过程，随着观测序列长度的增长，具有逆多项式衰减的衰减MCMC的收敛时间保持有界。实验表明，衰减MCMC至少与粒子滤波等其他近似算法具有竞争力。

英文摘要

Filtering---estimating the state of a partially observable Markov process from a sequence of observations---is one of the most widely studied problems in control theory, AI, and computational statistics. Exact computation of the posterior distribution is generally intractable for large discrete systems and for nonlinear continuous systems, so a good deal of effort has gone into developing robust approximation algorithms. This paper describes a simple stochastic approximation algorithm for filtering called {em decayed MCMC}. The algorithm applies Markov chain Monte Carlo sampling to the space of state trajectories using a proposal distribution that favours flips of more recent state variables. The formal analysis of the algorithm involves a generalization of standard coupling arguments for MCMC convergence. We prove that for any ergodic underlying Markov process, the convergence time of decayed MCMC with inverse-polynomial decay remains bounded as the length of the observation sequence grows. We show experimentally that decayed MCMC is at least competitive with other approximation algorithms such as particle filtering.

URL PDF HTML ☆

赞 0 踩 0

1212.3998 2026-06-03 cs.AI cs.SY eess.SY

Online Learning for Ground Trajectory Prediction

地面轨迹预测的在线学习

Areski Hadjaz, Gaétan Marceau, Pierre Savéant, Marc Schoenauer

AI总结提出基于混合系统的模型用于数值模拟飞机爬升阶段，结合CMA-ES优化算法调整参数以提高轨迹预测精度，并通过在线更新预测实现更准确的结果。

Comments SESAR 2nd Innovation Days (2012)

详情

AI中文摘要

本文提出一个基于混合系统的模型，用于数值模拟飞机的爬升阶段。该模型随后被用于轨迹预测工具中。最后，采用协方差矩阵自适应进化策略（CMA-ES）优化算法来调整五个选定参数，从而提高模型的精度。集成在轨迹预测工具中，该模型可用于推导预测误差随时间变化的量级，从而确定轨迹预测的有效域。所提模型的第一个验证实验基于一次起飞时轨迹预测随时间变化的误差，与理论BADA模型的默认值进行比较。该实验假设完全信息，也显示了模型的局限性。第二个实验部分介绍了在线轨迹预测，其中预测基于当前飞机位置持续更新。这种方法引发了几个问题，针对这些问题提出了基本模型的改进，由此得到的轨迹预测工具在统计上显著优于默认模型的结果。

英文摘要

This paper presents a model based on an hybrid system to numerically simulate the climbing phase of an aircraft. This model is then used within a trajectory prediction tool. Finally, the Covariance Matrix Adaptation Evolution Strategy (CMA-ES) optimization algorithm is used to tune five selected parameters, and thus improve the accuracy of the model. Incorporated within a trajectory prediction tool, this model can be used to derive the order of magnitude of the prediction error over time, and thus the domain of validity of the trajectory prediction. A first validation experiment of the proposed model is based on the errors along time for a one-time trajectory prediction at the take off of the flight with respect to the default values of the theoretical BADA model. This experiment, assuming complete information, also shows the limit of the model. A second experiment part presents an on-line trajectory prediction, in which the prediction is continuously updated based on the current aircraft position. This approach raises several issues, for which improvements of the basic model are proposed, and the resulting trajectory prediction tool shows statistically significantly more accurate results than those of the default model.

URL PDF HTML ☆

赞 0 踩 0

1212.3996 2026-06-03 cs.AI cs.SY eess.SY

Increasing Air Traffic: What is the Problem?

日益增长的空中交通：问题是什么？

Areski Hadjaz, Gaétan Marceau, Pierre Savéant, Marc Schoenauer

AI总结本文提出一个框架，通过贝叶斯网络建模轨迹不确定性，并利用优化和监控过程最小化扇区拥堵和延误概率，以桥接空中交通管理与控制以及不同空域部门之间的差距。

Comments SESAR 2nd Innovation Days (2012)

详情

AI中文摘要

如今，为了应对不确定性、复杂性和次优性，人们正在大力推动空中交通管理系统的现代化。一个答案是加强利益相关者之间的信息共享。本文介绍了一个框架，一方面弥合了空中交通管理与空中交通控制之间的差距，另一方面弥合了地面、进近和航路中心之间的差距。提出了一个原始系统，该系统包含三个基本组成部分：轨迹模型、优化过程和监控过程。轨迹的不确定性通过贝叶斯网络建模，其中节点与两类随机变量相关联：空域计量点的飞越时间以及连接这些点的航线的行驶时间。由此产生的贝叶斯网络覆盖整个空域，并通过蒙特卡洛模拟来估计扇区拥堵和延误的概率。在此轨迹模型之上，优化过程通过调整与计量点飞越时间相关的贝叶斯轨迹模型参数来最小化这些概率。最后一个组成部分是监控过程，它持续更新空域状态，根据飞机的实际位置修改轨迹的不确定性。每次更新后，计算新的最优飞越时间集，并可以作为指令传达给空中交通管制员，再传递给飞行员。本文给出了这一全局优化问题的形式化规范，其基本逻辑是在泰雷兹空中系统公司的空中交通管制员的帮助下得出的。

英文摘要

Nowadays, huge efforts are made to modernize the air traffic management systems to cope with uncertainty, complexity and sub-optimality. An answer is to enhance the information sharing between the stakeholders. This paper introduces a framework that bridges the gap between air traffic management and air traffic control on the one hand, and bridges the gap between the ground, the approach and the en-route centers on the other hand. An original system is presented, that has three essential components: the trajectory models, the optimization process, and the monitoring process. The uncertainty of the trajectory is modeled with a Bayesian Network, where the nodes are associated to two types of random variables: the time of overflight on metering points of the airspace, and the traveling time of the routes linking these points. The resulting Bayesian Network covers the complete airspace, and Monte- Carlo simulations are done to estimate the probabilities of sector congestion and delays. On top of this trajectory model, an optimization process minimizes these probabilities by tuning the parameters of the Bayesian trajectory model related to overflight times on metering points. The last component is the monitoring process, that continuously updates the situation of the airspace, modifying the trajectories uncertainties according to actual positions of aircraft. After each update, a new optimal set of overflight times is computed, and can be communicated to the controllers as clearances for the aircraft pilots. The paper presents a formal specification of this global optimization problem, whose underlying rationale was derived with the help of air traffic controllers at Thales Air Systems.

URL PDF HTML ☆

赞 0 踩 0

1212.2499 2026-06-03 cs.AI cs.SY eess.SY

Marginalizing Out Future Passengers in Group Elevator Control

在群控电梯调度中边缘化未来乘客

Daniel N. Nikovski, Matthew Brand

AI总结针对群控电梯调度中未来乘客对等待时间的影响，提出一种概率模型并集成到现有方法中，显著降低平均等待时间。

Comments Appears in Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI2003)

1212.2495 2026-06-03 cs.RO cs.AI cs.SY eess.SY

Policy-contingent abstraction for robust robot control

基于策略抽象的鲁棒机器人控制

Joelle Pineau, Geoffrey Gordon, Sebastian Thrun

AI总结提出一种可扩展的控制算法，使移动机器人系统在充分考虑概率信念的情况下做出高层决策，并成功部署于护理机构。

Comments Appears in Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI2003)

1212.2475 2026-06-03 cs.LG cs.SY eess.SY

Efficient Gradient Estimation for Motor Control Learning

运动控制学习的高效梯度估计

Gregory Lawrence, Noah Cowan, Stuart Russell

AI总结针对存在输入噪声的梯度估计问题，提出两种降低估计误差的方法：基于局部线性模型的强化基线法和方差折扣法，并应用于模拟三连杆机械臂的投镖任务，显著改善了奖励函数梯度估计和学习曲线。

Comments Appears in Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI2003)

详情

AI中文摘要

在噪声存在的情况下估计函数梯度的任务是几种强化学习形式的核心，包括策略搜索方法。我们提出了两种技术，用于在可观测输入噪声应用于控制信号时减少梯度估计误差。第一种方法通过拟合一个局部线性模型到被估计梯度的函数，扩展了强化基线的思想；我们展示了如何找到最小化梯度估计方差的线性模型，以及如何从数据中估计该模型。第二种方法通过折扣具有高方差的梯度向量分量进一步改进了这一点。这些方法被应用于运动控制学习问题，其中执行器噪声对行为有显著影响。特别地，我们将这些技术应用于使用模拟三连杆机械臂的投镖任务中学习局部最优控制器；我们证明了所提出的方法显著改善了奖励函数梯度估计，并因此改善了学习曲线，优于现有方法。

英文摘要

The task of estimating the gradient of a function in the presence of noise is central to several forms of reinforcement learning, including policy search methods. We present two techniques for reducing gradient estimation errors in the presence of observable input noise applied to the control signal. The first method extends the idea of a reinforcement baseline by fitting a local linear model to the function whose gradient is being estimated; we show how to find the linear model that minimizes the variance of the gradient estimate, and how to estimate the model from data. The second method improves this further by discounting components of the gradient vector that have high variance. These methods are applied to the problem of motor control learning, where actuator noise has a significant influence on behavior. In particular, we apply the techniques to learn locally optimal controllers for a dart-throwing task using a simulated three-link arm; we demonstrate that proposed methods significantly improve the reward function gradient estimate and, consequently, the learning curve, over existing methods.

URL PDF HTML ☆

赞 0 踩 0

1212.2471 2026-06-03 cs.LG cs.AI cs.NA math.NA

Monte Carlo Matrix Inversion Policy Evaluation

蒙特卡洛矩阵求逆策略评估

Fletcher Lu, Dale Schuurmans

AI总结提出使用蒙特卡洛矩阵求逆（MCMI）进行强化学习策略评估，通过重要性采样降低方差，并在运行时间和准确性上优于最大似然模型和时序差分方法。

Comments Appears in Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI2003)

详情

AI中文摘要

1950年，Forsythe和Leibler（1950）引入了一种统计技术，通过将矩阵逆的元素表征为一系列随机游走的期望值来求矩阵的逆。Barto和Duff（1994）随后展示了该技术与标准动态规划和时序差分方法之间的关系。蒙特卡洛矩阵求逆（MCMI）方法的优势在于，它相对于其他技术，在状态空间大小方面具有更好的可扩展性。在本文中，我们介绍了一种使用MCMI进行强化学习策略评估的算法。我们证明，MCMI在运行时间上优于基于最大似然模型的策略评估方法，并且在运行时间和准确性上都优于时序差分（TD）策略评估方法。我们进一步通过向算法添加重要性采样技术来降低估计器的方差，从而改进了MCMI策略评估。最后，我们展示了将MCMI扩展到大规模状态空间以进行策略改进的技术。

英文摘要

In 1950, Forsythe and Leibler (1950) introduced a statistical technique for finding the inverse of a matrix by characterizing the elements of the matrix inverse as expected values of a sequence of random walks. Barto and Duff (1994) subsequently showed relations between this technique and standard dynamic programming and temporal differencing methods. The advantage of the Monte Carlo matrix inversion (MCMI) approach is that it scales better with respect to state-space size than alternative techniques. In this paper, we introduce an algorithm for performing reinforcement learning policy evaluation using MCMI. We demonstrate that MCMI improves on runtime over a maximum likelihood model-based policy evaluation approach and on both runtime and accuracy over the temporal differencing (TD) policy evaluation approach. We further improve on MCMI policy evaluation by adding an importance sampling technique to our algorithm to reduce the variance of our estimator. Lastly, we illustrate techniques for scaling up MCMI to large state spaces in order to perform policy improvement.

URL PDF HTML ☆

赞 0 踩 0

1212.2005 2026-06-03 cs.AI cs.SY eess.SY

The Dynamic Controllability of Conditional STNs with Uncertainty

含不确定性的条件STN的动态可控性

Luke Hunsberger, Roberto Posenato, Carlo Combi

AI总结本文定义了一种结合时间约束、条件节点和不确定持续时间的条件简单时间网络（CSTNU），并提出了其动态可控性的概念及约束传播规则。

详情

Journal ref: PlanEX Workshop, ICAPS-2012, pages 21-29, 2012

AI中文摘要

最近自动化业务流程和医疗流程的尝试揭示了对一个正式框架的需求，该框架不仅能容纳时间约束，还能容纳具有不可控持续时间的观测和动作。为满足这一需求，本文定义了一种含不确定性的条件简单时间网络（CSTNU），它结合了简单时间网络（STN）的简单时间约束、条件简单时间问题（CSTP）的条件节点以及含不确定性的简单时间网络（STNU）的应急链接。定义了CSTNU的动态可控性概念，该概念推广了CTP的动态一致性和STNU的动态可控性。本文还提出了一些用于动态可控性的可靠约束传播规则，这些规则有望构成CSTNU动态可控性检查算法的基础。

英文摘要

Recent attempts to automate business processes and medical-treatment processes have uncovered the need for a formal framework that can accommodate not only temporal constraints, but also observations and actions with uncontrollable durations. To meet this need, this paper defines a Conditional Simple Temporal Network with Uncertainty (CSTNU) that combines the simple temporal constraints from a Simple Temporal Network (STN) with the conditional nodes from a Conditional Simple Temporal Problem (CSTP) and the contingent links from a Simple Temporal Network with Uncertainty (STNU). A notion of dynamic controllability for a CSTNU is defined that generalizes the dynamic consistency of a CTP and the dynamic controllability of an STNU. The paper also presents some sound constraint-propagation rules for dynamic controllability that are expected to form the backbone of a dynamic-controllability-checking algorithm for CSTNUs.

URL PDF HTML ☆

赞 0 踩 0

1212.1143 2026-06-03 cs.AI cs.SY eess.SY math.OC stat.ML

Multiscale Markov Decision Problems: Compression, Solution, and Transfer Learning

多尺度马尔可夫决策问题：压缩、求解与迁移学习

Jake Bouvrie, Mauro Maggioni

AI总结提出一种多尺度压缩马尔可夫决策过程的快速算法，自动构建层次结构，解耦子任务并加速收敛，同时实现跨问题的策略迁移。

Comments 86 pages, 15 figures

详情

AI中文摘要

序列决策和随机控制中的许多问题通常具有自然的多尺度结构：子任务被组合在一起以完成复杂目标。系统性地推断和利用层次结构，尤其是超越单一抽象层次，一直是一个长期挑战。我们描述了一种快速的多尺度过程，用于重复压缩或均质化马尔可夫决策过程（MDP），其中自动确定不同尺度上的子问题层次结构。粗化后的MDP本身是独立的确定性MDP，可以使用现有算法求解。该过程提供的多尺度表示将子任务相互解耦，可以在子问题内部局部和跨子问题全局上显著提高收敛速度，从而节省大量计算。这项工作的第二个基本方面是，这些多尺度分解为不同问题之间提供了新的迁移机会，其中层次结构中不同级别的子任务的解可能适用于迁移到新问题。强调了在任意尺度上策略和势算子的局部迁移。最后，我们在一个说明性领域集合中展示了压缩和迁移，包括涉及离散和连续状态空间的示例。

英文摘要

Many problems in sequential decision making and stochastic control often have natural multiscale structure: sub-tasks are assembled together to accomplish complex goals. Systematically inferring and leveraging hierarchical structure, particularly beyond a single level of abstraction, has remained a longstanding challenge. We describe a fast multiscale procedure for repeatedly compressing, or homogenizing, Markov decision processes (MDPs), wherein a hierarchy of sub-problems at different scales is automatically determined. Coarsened MDPs are themselves independent, deterministic MDPs, and may be solved using existing algorithms. The multiscale representation delivered by this procedure decouples sub-tasks from each other and can lead to substantial improvements in convergence rates both locally within sub-problems and globally across sub-problems, yielding significant computational savings. A second fundamental aspect of this work is that these multiscale decompositions yield new transfer opportunities across different problems, where solutions of sub-tasks at different levels of the hierarchy may be amenable to transfer to new problems. Localized transfer of policies and potential operators at arbitrary scales is emphasized. Finally, we demonstrate compression and transfer in a collection of illustrative domains, including examples involving discrete and continuous statespaces.

URL PDF HTML ☆

赞 0 踩 0

1112.3166 2026-06-03 cs.CV cs.NA math.NA

Higher-Order Momentum Distributions and Locally Affine LDDMM Registration

高阶动量分布与局部仿射LDDMM配准

Stefan Sommer, Mads Nielsen, Sune Darkner, Xavier Pennec

AI总结本文在LDDMM框架中引入高阶动量分布，通过一阶动量实现局部仿射变换的紧凑表示，从而以极少数参数完成非刚性配准，并直接提供可解释的数学和建模信息。

详情

AI中文摘要

为了实现允许直观分析的稀疏参数化，我们旨在用包含可解释元素的基来表示变形，并希望使用具有描述能力的元素来紧凑地表示变形。为此，本文在LDDMM配准框架中引入了高阶动量分布。先前在LDDMM中使用的零阶动量仅描述局部位移，而本文提出的一阶动量表示一个基，允许局部描述仿射变换，进而紧凑地描述全局非刚性变形中的非平移运动。所得表示从数学和建模角度都包含直接可解释的信息。我们开发了具有高阶动量的配准框架的数学构造，展示了其对稀疏图像配准和变形描述的意义，并提供了参数化如何以极少数参数实现配准的示例。使用高阶动量的参数化的能力和可解释性导致了关节运动的自然建模，该方法有望用于量化阿尔茨海默病期间的心室扩张和进行性萎缩。

英文摘要

To achieve sparse parametrizations that allows intuitive analysis, we aim to represent deformation with a basis containing interpretable elements, and we wish to use elements that have the description capacity to represent the deformation compactly. To accomplish this, we introduce in this paper higher-order momentum distributions in the LDDMM registration framework. While the zeroth order moments previously used in LDDMM only describe local displacement, the first-order momenta that are proposed here represent a basis that allows local description of affine transformations and subsequent compact description of non-translational movement in a globally non-rigid deformation. The resulting representation contains directly interpretable information from both mathematical and modeling perspectives. We develop the mathematical construction of the registration framework with higher-order momenta, we show the implications for sparse image registration and deformation description, and we provide examples of how the parametrization enables registration with a very low number of parameters. The capacity and interpretability of the parametrization using higher-order momenta lead to natural modeling of articulated movement, and the method promises to be useful for quantifying ventricle expansion and progressing atrophy during Alzheimer's disease.

URL PDF HTML ☆

赞 0 踩 0

1011.4104 2026-06-03 cs.LG cs.NA math.NA math.SP

Clustering and Latent Semantic Indexing Aspects of the Singular Value Decomposition

奇异值分解的聚类和潜在语义索引方面

Andri Mirzal

AI总结本文解释了奇异值分解（SVD）如何用于聚类，并指出其聚类与潜在语义索引（LSI）源于同一原理，进而设计了一种模拟SVD聚类能力的LSI算法，无需指定分解秩，性能与SVD相当。

Comments 38 pages, submitted to Pattern Recognition

详情

AI中文摘要

本文讨论了奇异值分解（SVD）的聚类和潜在语义索引（LSI）方面。本文的目的有两个。第一是解释奇异向量如何以及为何可用于聚类。第二是表明这两个看似无关的SVD方面实际上源于同一来源：在低秩近似矩阵的图表示中，相关顶点比在原始语义图中更倾向于聚集在一起。因此，SVD可以提高信息检索系统的检索性能，因为对近似矩阵的查询比原始矩阵的相同查询能检索到更多相关文档并过滤掉更多不相关文档。利用这一事实，我们将设计一种LSI算法，模拟SVD在聚类相关顶点方面的能力。收敛性分析表明该算法收敛，并对每个输入产生唯一解。使用LSI研究中一些标准数据集的实验结果表明，该算法的检索性能与SVD相当。此外，该算法更实用且更易使用，因为无需确定分解秩，而分解秩对驱动SVD的检索性能至关重要。

英文摘要

This paper discusses clustering and latent semantic indexing (LSI) aspects of the singular value decomposition (SVD). The purpose of this paper is twofold. The first is to give an explanation on how and why the singular vectors can be used in clustering. And the second is to show that the two seemingly unrelated SVD aspects actually originate from the same source: related vertices tend to be more clustered in the graph representation of lower rank approximate matrix using the SVD than in the original semantic graph. Accordingly, the SVD can improve retrieval performance of an information retrieval system since queries made to the approximate matrix can retrieve more relevant documents and filter out more irrelevant documents than the same queries made to the original matrix. By utilizing this fact, we will devise an LSI algorithm that mimicks SVD capability in clustering related vertices. Convergence analysis shows that the algorithm is convergent and produces a unique solution for each input. Experimental results using some standard datasets in LSI research show that retrieval performances of the algorithm are comparable to the SVD's. In addition, the algorithm is more practical and easier to use because there is no need to determine decomposition rank which is crucial in driving retrieval performance of the SVD.

URL PDF HTML ☆

赞 0 踩 0

1211.3444 2026-06-03 cs.LG cs.NA math.NA stat.ML

Spectral Clustering: An empirical study of Approximation Algorithms and its Application to the Attrition Problem

谱聚类：近似算法的实证研究及其在员工流失问题中的应用

B. Cung, T. Jin, J. Ramirez, A. Thompson, C. Boutsidis, D. Needell

AI总结本文通过实验评估多种谱聚类近似方法，并应用于员工流失预测问题，展示了近似谱聚类在保持分类准确性的同时降低计算成本的有效性。

详情

AI中文摘要

聚类是将一组对象分成组（称为簇）的问题，使得同一簇内的对象比不同簇中的对象更相似。谱聚类是一种众所周知的聚类方法，它利用数据相似性矩阵的谱来进行这种分离。由于该方法依赖于求解特征向量问题，对于大型数据集计算成本很高。为了克服这一限制，人们开发了近似方法，旨在减少运行时间同时保持准确的分类。在本文中，我们总结并实验评估了几种谱聚类的近似方法。从应用的角度，我们使用谱聚类来解决所谓的员工流失问题，其目标是从一组员工中识别出那些可能自愿离开公司的人。我们的研究揭示了现有近似谱聚类方法的实证性能，并展示了这些方法在一个重要的业务优化相关问题中的适用性。

英文摘要

Clustering is the problem of separating a set of objects into groups (called clusters) so that objects within the same cluster are more similar to each other than to those in different clusters. Spectral clustering is a now well-known method for clustering which utilizes the spectrum of the data similarity matrix to perform this separation. Since the method relies on solving an eigenvector problem, it is computationally expensive for large datasets. To overcome this constraint, approximation methods have been developed which aim to reduce running time while maintaining accurate classification. In this article, we summarize and experimentally evaluate several approximation methods for spectral clustering. From an applications standpoint, we employ spectral clustering to solve the so-called attrition problem, where one aims to identify from a set of employees those who are likely to voluntarily leave the company from those who are not. Our study sheds light on the empirical performance of existing approximate spectral clustering methods and shows the applicability of these methods in an important business optimization related problem.

URL PDF HTML ☆

赞 0 踩 0

1211.1550 2026-06-03 cs.LG cs.NA math.NA math.OC

A Riemannian geometry for low-rank matrix completion

低秩矩阵补全的黎曼几何

B. Mishra, K. Adithya Apuroop, R. Sepulchre

AI总结针对低秩矩阵补全问题，提出一种新的固定秩矩阵的黎曼几何，通过调节商空间的度量来适配最小二乘代价函数，并开发了梯度下降和信赖域算法，实现了与最先进算法LMaFit竞争的性能。

Comments Title modified, Typos removed. arXiv admin note: text overlap with arXiv:1209.0430

详情

AI中文摘要

我们提出了一种新的固定秩矩阵的黎曼几何，专门针对低秩矩阵补全问题。利用商空间的自由度，我们将搜索空间上的度量调整为特定的最小二乘代价函数。一方面，它以一种新颖的方式展示了如何利用商流形优化的灵活框架。另一方面，我们的算法可以被视为LMaFit（最先进的高斯-赛德尔算法）的改进版本。我们开发了执行一阶和二阶优化所需的必要工具。特别地，我们提出了梯度下降方案（最速下降和共轭梯度）以及信赖域算法。我们还表明，由于代价函数的简单性，在给定搜索方向时进行精确线搜索在数值上是廉价的，这使得我们的算法在标准低秩矩阵补全实例上与最先进算法具有竞争力。

英文摘要

We propose a new Riemannian geometry for fixed-rank matrices that is specifically tailored to the low-rank matrix completion problem. Exploiting the degree of freedom of a quotient space, we tune the metric on our search space to the particular least square cost function. At one level, it illustrates in a novel way how to exploit the versatile framework of optimization on quotient manifold. At another level, our algorithm can be considered as an improved version of LMaFit, the state-of-the-art Gauss-Seidel algorithm. We develop necessary tools needed to perform both first-order and second-order optimization. In particular, we propose gradient descent schemes (steepest descent and conjugate gradient) and trust-region algorithms. We also show that, thanks to the simplicity of the cost function, it is numerically cheap to perform an exact linesearch given a search direction, which makes our algorithms competitive with the state-of-the-art on standard low-rank matrix completion instances.

URL PDF HTML ☆

赞 0 踩 0

1211.1690 2026-06-03 cs.RO cs.CV cs.LG cs.SY eess.SY

Learning Monocular Reactive UAV Control in Cluttered Natural Environments

学习在杂乱自然环境中进行单目反应式无人机控制

Stephane Ross, Narek Melik-Barkhudarov, Kumar Shaurya Shankar, Andreas Wendel, Debadeepta Dey, J. Andrew Bagnell, Martial Hebert

AI总结本文使用单目相机和模仿学习训练控制器，使小型四旋翼飞行器能在自然森林环境中以1.5m/s速度自主避障导航。

Comments 8 pages, 10 figures

详情

AI中文摘要

大型无人机的自主导航相对简单，因为可以使用昂贵的传感器和监控设备。相比之下，在杂乱环境中低空飞行的微型飞行器（MAV）的避障仍然是一项具有挑战性的任务。与大型飞行器不同，MAV只能携带非常轻的传感器，如摄像头，这使得通过障碍物的自主导航更具挑战性。本文描述了一个系统，该系统能够使小型四旋翼直升机在自然森林环境中低空自主导航。仅使用单个廉价摄像头感知环境，我们能够保持高达1.5m/s的恒定速度。通过少量人类飞行员演示，我们使用最新的模仿学习技术训练了一个控制器，该控制器通过调整MAV的航向来避免树木。我们在室内更受控的环境和室外真实自然森林环境中展示了系统的性能。

英文摘要

Autonomous navigation for large Unmanned Aerial Vehicles (UAVs) is fairly straight-forward, as expensive sensors and monitoring devices can be employed. In contrast, obstacle avoidance remains a challenging task for Micro Aerial Vehicles (MAVs) which operate at low altitude in cluttered environments. Unlike large vehicles, MAVs can only carry very light sensors, such as cameras, making autonomous navigation through obstacles much more challenging. In this paper, we describe a system that navigates a small quadrotor helicopter autonomously at low altitude through natural forest environments. Using only a single cheap camera to perceive the environment, we are able to maintain a constant velocity of up to 1.5m/s. Given a small set of human pilot demonstrations, we use recent state-of-the-art imitation learning techniques to train a controller that can avoid trees by adapting the MAVs heading. We demonstrate the performance of our system in a more controlled environment indoors, and in real natural forest environments outdoors.

URL PDF HTML ☆

赞 0 踩 0

1210.5034 2026-06-03 cs.LG cs.CV cs.NA math.NA

Optimal Computational Trade-Off of Inexact Proximal Methods

非精确近端方法的最优计算权衡

Pierre Machart, Sandrine Anthoine, Luca Baldassarre

AI总结本文研究近端梯度方法在计算代价与收敛速度之间的权衡，提出了一种计算高效且易于实现的快速非精确近端梯度算法（SIP）。

详情

AI中文摘要

在本文中，我们研究了在使用近端梯度方法（机器学习中流行的优化工具）最小化复合泛函时，收敛速度与计算代价之间的权衡。我们考虑近端算子通过迭代过程计算的情况，该过程提供了精确近端算子的近似。在这种情况下，我们得到具有两个嵌套循环的算法。我们表明，在有限时间内达到所需精度的解时，最小化计算代价的策略是将内迭代次数设置为常数，这与收敛速度分析所指示的策略不同。在此过程中，我们还提出了一种称为SIP（快速非精确近端梯度算法）的新程序，该程序既计算高效又易于实现。我们的数值实验证实了理论发现，并表明SIP可以成为标准程序的非常有竞争力的替代方案。

英文摘要

In this paper, we investigate the trade-off between convergence rate and computational cost when minimizing a composite functional with proximal-gradient methods, which are popular optimisation tools in machine learning. We consider the case when the proximity operator is computed via an iterative procedure, which provides an approximation of the exact proximity operator. In that case, we obtain algorithms with two nested loops. We show that the strategy that minimizes the computational cost to reach a solution with a desired accuracy in finite time is to set the number of inner iterations to a constant, which differs from the strategy indicated by a convergence rate analysis. In the process, we also present a new procedure called SIP (that is Speedy Inexact Proximal-gradient algorithm) that is both computationally efficient and easy to implement. Our numerical experiments confirm the theoretical findings and suggest that SIP can be a very competitive alternative to the standard procedure.

URL PDF HTML ☆

赞 0 踩 0

1210.4883 2026-06-03 cs.LG cs.NA math.NA stat.ML

A Model-Based Approach to Rounding in Spectral Clustering

基于模型的谱聚类舍入方法

Leonard K. M. Poon, April H. Liu, Tengfei Liu, Nevin Lianwen Zhang

AI总结提出一种基于潜树模型的谱聚类舍入方法，同时解决特征向量选择、聚类数确定和数据划分三个子问题。

Comments Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

详情

AI中文摘要

在谱聚类中，首先为数据点集合定义相似矩阵，然后变换矩阵得到拉普拉斯矩阵，接着计算拉普拉斯矩阵的特征向量，最后利用前导特征向量获得数据的划分。最后一步有时称为舍入，需要决定使用多少个前导特征向量、确定聚类数以及划分数据点。本文提出了一种新的舍入方法。该方法在三个方面与以往方法不同。首先，我们放宽了聚类数等于所用特征向量数的假设。其次，在决定使用多少个前导特征向量时，我们不仅依赖前导特征向量本身包含的信息，还使用后续特征向量。第三，我们的方法是基于模型的，并使用一类称为潜树模型的图模型来解决舍入的三个子问题。我们在合成数据和真实数据上评估了该方法。结果表明，在理想情况下（即类间相似度为0），我们的方法能够正确工作，并且随着偏离理想情况，性能会优雅地下降。

英文摘要

In spectral clustering, one defines a similarity matrix for a collection of data points, transforms the matrix to get the Laplacian matrix, finds the eigenvectors of the Laplacian matrix, and obtains a partition of the data using the leading eigenvectors. The last step is sometimes referred to as rounding, where one needs to decide how many leading eigenvectors to use, to determine the number of clusters, and to partition the data points. In this paper, we propose a novel method for rounding. The method differs from previous methods in three ways. First, we relax the assumption that the number of clusters equals the number of eigenvectors used. Second, when deciding the number of leading eigenvectors to use, we not only rely on information contained in the leading eigenvectors themselves, but also use subsequent eigenvectors. Third, our method is model-based and solves all the three subproblems of rounding using a class of graphical models called latent tree models. We evaluate our method on both synthetic and real-world data. The results show that our method works correctly in the ideal case where between-clusters similarity is 0, and degrades gracefully as one moves away from the ideal case.

URL PDF HTML ☆

赞 0 踩 0

1202.3772 2026-06-03 cs.LG cs.NA math.NA stat.ML

Rank/Norm Regularization with Closed-Form Solutions: Application to Subspace Clustering

具有闭式解的秩/范数正则化：应用于子空间聚类

Yao-Liang Yu, Dale Schuurmans

AI总结本文通过推广Eckart-Young-Mirsky定理到所有酉不变范数，得到秩/范数正则化问题的闭式解，并应用于子空间聚类，获得新理论见解和实验效果。

Comments 11 pages, 1 figure, appeared in UAI 2011. One footnote corrected and appendix added