arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 8098
2506.02018 2026-06-03 cs.CL

Enhancing Paraphrase Type Generation: The Impact of DPO and RLHF Evaluated with Human-Ranked Data

增强释义类型生成:基于人工排序数据的DPO和RLHF评估影响

Christopher Lee Lübbers

AI总结 本研究利用人工排序的释义类型数据集,结合直接偏好优化(DPO)使模型输出与人类判断对齐,将释义类型生成准确率提升3个百分点,人类偏好评分提升7个百分点,并创建了新的标注数据集以支持更严格的评估。

Comments 21 pages, 11 figures. Master's thesis, University of Goettingen, December 2024. Code: https://github.com/cluebbers/dpo-rlhf-paraphrase-types. Models: https://huggingface.co/collections/cluebbers/enhancing-paraphrase-type-generation-673ca8d75dfe2ce962a48ac0

详情
AI中文摘要

释义通过重新表达含义来增强文本简化、机器翻译和问答等应用。特定的释义类型有助于精确的语义分析和鲁棒的语言模型。然而,现有的释义类型生成方法由于依赖自动评估指标和有限的人工标注训练数据,常常与人类偏好不一致,掩盖了语义保真度和语言转换的关键方面。本研究通过利用人工排序的释义类型数据集,并整合直接偏好优化(DPO)使模型输出直接与人类判断对齐,填补了这一空白。基于DPO的训练将释义类型生成准确率比监督基线提高了3个百分点,并将人类偏好评分提高了7个百分点。新创建的人工标注数据集支持更严格的未来评估。此外,一个释义类型检测模型在增删、同极性替换和标点变化上的F1分数分别达到0.91、0.78和0.70。这些发现表明,偏好数据和DPO训练能产生更可靠、语义更准确的释义,从而改进摘要生成和更鲁棒的问答等下游应用。PTD模型超越了自动评估指标,为评估释义质量提供了更可靠的框架,推动释义类型研究向更丰富、与用户对齐的语言生成发展,并为基于人类中心标准的未来评估奠定了更坚实的基础。

英文摘要

Paraphrasing re-expresses meaning to enhance applications like text simplification, machine translation, and question-answering. Specific paraphrase types facilitate accurate semantic analysis and robust language models. However, existing paraphrase-type generation methods often misalign with human preferences due to reliance on automated metrics and limited human-annotated training data, obscuring crucial aspects of semantic fidelity and linguistic transformations. This study addresses this gap by leveraging a human-ranked paraphrase-type dataset and integrating Direct Preference Optimization (DPO) to align model outputs directly with human judgments. DPO-based training increases paraphrase-type generation accuracy by 3 percentage points over a supervised baseline and raises human preference ratings by 7 percentage points. A newly created human-annotated dataset supports more rigorous future evaluations. Additionally, a paraphrase-type detection model achieves F1 scores of 0.91 for addition/deletion, 0.78 for same polarity substitution, and 0.70 for punctuation changes. These findings demonstrate that preference data and DPO training produce more reliable, semantically accurate paraphrases, enabling downstream applications such as improved summarization and more robust question-answering. The PTD model surpasses automated metrics and provides a more reliable framework for evaluating paraphrase quality, advancing paraphrase-type research toward richer, user-aligned language generation and establishing a stronger foundation for future evaluations grounded in human-centric criteria.

2502.08006 2026-06-03 cs.LG cs.AI stat.ML

Greed is Good: A Unifying Perspective on Guided Generation

贪婪即美德:引导生成的统一视角

Zander W. Blasingame, Chen Liu

AI总结 本文通过将后验引导视为端到端引导的贪婪策略,统一了两种梯度引导方法,并提出了在计算与精度之间权衡的插值方法,在逆图像问题和分子生成任务上验证了有效性。

Comments Accepted at NeurIPS 2025

详情
AI中文摘要

无训练引导生成是一种广泛使用且强大的技术,允许最终用户对流/扩散模型的生成过程施加进一步控制。一般来说,针对基于梯度的引导,已经出现了两种技术系列:即后验引导(即通过目标预测模型将当前样本投影到目标分布进行引导)和端到端引导(即通过在整个ODE求解过程中执行反向传播进行引导)。在这项工作中,我们表明这两个看似分离的系列实际上可以通过将后验引导视为端到端引导的贪婪策略来统一。我们探索了这两个系列之间的理论联系,并深入分析了这两种技术相对于连续理想梯度的关系。基于这一分析,我们提出了一种在这两个系列之间插值的方法,从而在引导梯度的计算与精度之间实现权衡。然后,我们在几个逆图像问题和性质引导的分子生成任务上验证了这项工作。

英文摘要

Training-free guided generation is a widely used and powerful technique that allows the end user to exert further control over the generative process of flow/diffusion models. Generally speaking, two families of techniques have emerged for solving this problem for gradient-based guidance: namely, posterior guidance (i.e., guidance via projecting the current sample to the target distribution via the target prediction model) and end-to-end guidance (i.e., guidance by performing backpropagation throughout the entire ODE solve). In this work, we show that these two seemingly separate families can actually be unified by looking at posterior guidance as a greedy strategy of end-to-end guidance. We explore the theoretical connections between these two families and provide an in-depth theoretical of these two techniques relative to the continuous ideal gradients. Motivated by this analysis we then show a method for interpolating between these two families enabling a trade-off between compute and accuracy of the guidance gradients. We then validate this work on several inverse image problems and property-guided molecular generation.

2505.08886 2026-06-03 cs.CV cs.LG

Optimizing Neuro-Fuzzy and Colonial Competition Algorithms for Skin Cancer Diagnosis in Dermatoscopic Images

优化神经模糊与殖民竞争算法用于皮肤镜图像中的皮肤癌诊断

Hamideh Khaleghpour, Brett McKinney

AI总结 本研究融合图像处理、神经模糊和殖民竞争算法,在ISIC数据库的560张皮肤镜图像上实现94%准确率,旨在辅助临床早期黑色素瘤检测。

Comments 7 pages, 10 figures. Accepted at the 2nd Asia Pacific Computer Systems Conference (APCS 2024), March 15-17, 2024

详情
Journal ref
Proceedings of the 2024 7th International Conference on Information and Computer Technologies, pages 166-172, IEEE, March 2024
AI中文摘要

皮肤癌发病率的上升,加上公众意识有限和临床专业知识的不足,凸显了对先进诊断辅助工具的迫切需求。人工智能(AI)已成为该领域有前景的工具,特别是在区分恶性与良性皮肤病变方面。利用公开可用的皮肤病变数据集,研究人员一直在开发基于AI的诊断解决方案。然而,此类计算机系统在临床环境中的整合仍处于初期阶段。本研究旨在通过融合图像处理技术和机器学习算法(特别是神经模糊和殖民竞争方法)来弥合这一差距。应用于ISIC数据库中的皮肤镜图像,我们的方法在560张图像的数据集上达到了94%的显著准确率。这些结果强调了我们的方法在帮助临床医生早期检测黑色素瘤方面的潜力,从而为皮肤癌诊断做出重要贡献。

英文摘要

The rising incidence of skin cancer, coupled with limited public awareness and a shortfall in clinical expertise, underscores an urgent need for advanced diagnostic aids. Artificial Intelligence (AI) has emerged as a promising tool in this domain, particularly for distinguishing malignant from benign skin lesions. Leveraging publicly available datasets of skin lesions, researchers have been developing AI-based diagnostic solutions. However, the integration of such computer systems in clinical settings is still nascent. This study aims to bridge this gap by employing a fusion of image processing techniques and machine learning algorithms, specifically neuro-fuzzy and colonial competition approaches. Applied to dermoscopic images from the ISIC database, our method achieved a notable accuracy of 94% on a dataset of 560 images. These results underscore the potential of our approach in aiding clinicians in the early detection of melanoma, thereby contributing significantly to skin cancer diagnostics.

2412.05123 2026-06-03 cs.SD eess.AS

Differentiable Optimization of Linear Differential Microphone Arrays: A Joint Geometry and Filter Design Framework

线性差分麦克风阵列的可微优化:联合几何与滤波器设计框架

Siminfar Samakoush Galougah, Ramani Duraiswami

AI总结 提出一种可微优化框架,通过联合优化麦克风位置和滤波器权重,实现线性差分麦克风阵列的最优波束模式,在保证无失真约束的同时兼顾指向性、鲁棒性和硬件效率。

Comments 5 pages, 4 figures, 2 tables

详情
AI中文摘要

本文提出了一种用于约束线性差分麦克风阵列(LDMA)设计的可微优化框架。该方法采用非均匀延迟求和波束形成器作为轻量级基础系统模型,通过联合优化麦克风位置和滤波器权重,证明了其能够实现LDMA的最优波束模式。该公式能够在期望声方向实现无失真约束的滤波器优化设计,同时对麦克风定位施加约束以确保一致性能。通过多个指标的评估,包括均方误差(MSE)、指向性指数(DI)、白噪声增益(WNG)和计算时间,并与最先进方法进行比较,该方法展示了一种灵活、指向性强、鲁棒且硬件高效的设计。

英文摘要

This paper presents a differentiable optimization framework for the design of constrained Linear Differential Microphone Arrays (LDMAs). The proposed method leverages a non-uniform delay-and-sum beamformer as a light-weight base system model, proving its ability to achieve the optimal beampattern of LDMAs by jointly optimizing microphone positions and filter weights. The formulation enables the optimized design of a filter with a distortion-free constraint in the desired sound direction, while also imposing constraints on microphone positioning to ensure consistent performance. Through evaluation on multiple metrics, including Mean Squared Error (MSE), Directivity Index (DI), White Noise Gain (WNG), and computation time, and comparison with state-of-the-art methods, this approach demonstrates a flexible, directive, robust, and hardware-efficient design.

2412.05109 2026-06-03 cs.LG cs.IT math.IT math.PR math.ST stat.ML stat.TH

Generating Rectifiable Measures through Neural Networks

通过神经网络生成可求积测度

Erwin Riegler, Alex Bühler, Yang Pan, Helmut Bölcskei

AI总结 本文证明可数m-可求积测度可通过ReLU神经网络将[0,1]上的一维勒贝格测度推前得到,在Wasserstein距离下达到任意小逼近误差,且所需网络数量上界为2^{O(ε^{-m} log^2 ε)},该率等于可求积参数m。

详情
AI中文摘要

我们推导了(可数)$m$-可求积测度类的通用逼近结果。具体地,我们证明$m$-可求积测度可以通过ReLU神经网络将$[0,1]$上的一维勒贝格测度推前得到,在Wasserstein距离下达到任意小的逼近误差。此外,所考虑网络的权重是量化和有界的,达到逼近误差$\varepsilon$所需的ReLU神经网络数量不超过$2^{b(\varepsilon)}$,其中$b(\varepsilon)=\mathcal{O}(\varepsilon^{-m}\log^2(\varepsilon))$。这一结果改进了Perekrestenko等人的引理IX.4,因为它表明当$\varepsilon$趋于零时$b(\varepsilon)$趋于无穷的速率等于可求积参数$m$,而$m$可能远小于环境维度。我们将此结果推广到可数$m$-可求积测度,并证明该速率仍然等于可求积参数$m$,前提是(除其他技术假设外)测度在可数$m$-可求积支撑集的各个分量上指数衰减。

英文摘要

We derive universal approximation results for the class of (countably) $m$-rectifiable measures. Specifically, we prove that $m$-rectifiable measures can be approximated as push-forwards of the one-dimensional Lebesgue measure on $[0,1]$ using ReLU neural networks with arbitrarily small approximation error in terms of Wasserstein distance. What is more, the weights in the networks under consideration are quantized and bounded and the number of ReLU neural networks required to achieve an approximation error of $\varepsilon$ is no larger than $2^{b(\varepsilon)}$ with $b(\varepsilon)=\mathcal{O}(\varepsilon^{-m}\log^2(\varepsilon))$. This result improves Lemma IX.4 in Perekrestenko et al. as it shows that the rate at which $b(\varepsilon)$ tends to infinity as $\varepsilon$ tends to zero equals the rectifiability parameter $m$, which can be much smaller than the ambient dimension. We extend this result to countably $m$-rectifiable measures and show that this rate still equals the rectifiability parameter $m$ provided that, among other technical assumptions, the measure decays exponentially on the individual components of the countably $m$-rectifiable support set.

2108.09403 2026-06-03 cs.RO cs.DC

Deadlock and Noise in Self-Organized Aggregation Without Computation

无计算的自组织聚合中的死锁与噪声

Joshua J. Daymude, Noble C. Harasha, Andréa W. Richa, Ryan Yiu

AI总结 研究无计算自组织聚合算法在多机器人系统中的死锁问题,证明确定性运动下存在死锁构型,并发现少量误差可避免死锁,同时提出一种离散噪声版本。

Comments 17 pages, 11 figures

详情
Journal ref
Stabilization, Safety, and Security of Distributed Systems (SSS 2021), pp. 51-65
AI中文摘要

聚合是群体机器人学中的基本行为,要求系统聚集在一个紧凑、连通的集群中。2014年,Gauci等人提出了一种令人惊讶的算法,仅使用二进制视线传感器且无需算术计算或持久内存,即可可靠地实现群体聚合。该算法已被严格证明能够将一个机器人聚合到另一个机器人,但尚不清楚它是否总能像实验和模拟中观察到的那样聚合$n > 2$个机器人的系统。我们证明,当机器人的运动是均匀且确定性的时,对于$n > 3$个机器人,存在死锁构型,使得该算法无法实现聚合。从积极方面看,我们表明该算法(i)对小量误差具有鲁棒性,从而能够避免死锁,并且(ii)在使用锥形视线传感器时,对于$n = 2$的情况,可证明实现线性运行时间加速。最后,我们引入了该算法的一种带噪声的离散改编,更易于进行噪声的严格分析,其模拟结果与原始的连续算法定性一致。

英文摘要

Aggregation is a fundamental behavior for swarm robotics that requires a system to gather together in a compact, connected cluster. In 2014, Gauci et al. proposed a surprising algorithm that reliably achieves swarm aggregation using only a binary line-of-sight sensor and no arithmetic computation or persistent memory. It has been rigorously proven that this algorithm will aggregate one robot to another, but it remained open whether it would always aggregate a system of $n > 2$ robots as was observed in experiments and simulations. We prove that there exist deadlocked configurations from which this algorithm cannot achieve aggregation for $n > 3$ robots when the robots' motion is uniform and deterministic. On the positive side, we show that the algorithm (i) is robust to small amounts of error, enabling deadlock avoidance, and (ii) provably achieves a linear runtime speedup for the $n = 2$ case when using a cone-of-sight sensor. Finally, we introduce a noisy, discrete adaptation of this algorithm that is more amenable to rigorous analysis of noise and whose simulation results align qualitatively with the original, continuous algorithm.

1007.3881 2026-06-03 cs.CV cs.NA math.NA

Orthogonal multifilters image processing of astronomical images from scanned photographic plates

扫描照相底片天文图像的正交多滤波器处理

Vasil Kolev

AI总结 本文提出基于Haar和Daubechies正交小波构造新的正交多滤波器,用于天文图像的多尺度分析,并应用于扫描照相底片的天文图像分解。

Comments 6 pages, The ACM proceedings of CompSysTech 2010

详情
AI中文摘要

本文介绍了用于天文图像处理的正交多滤波器。我们基于Haar和Daubechies正交小波获得了新的正交多滤波器。最近,多小波作为一种更强大的多尺度分析工具被引入。它在多滤波器设计中增加了若干自由度,并使得同时具有多个有用属性成为可能,如对称性、正交性、短支撑和更高的消失矩。对带有天文图像的扫描照相底片进行了多滤波器分解。

英文摘要

In this paper orthogonal multifilters for astronomical image processing are presented. We obtained new orthogonal multifilters based on the orthogonal wavelet of Haar and Daubechies. Recently, multiwavelets have been introduced as a more powerful multiscale analysis tool. It adds several degrees of freedom in multifilter design and makes it possible to have several useful properties such as symmetry, orthogonality, short support, and a higher number of vanishing moments simultaneously. Multifilter decomposition of scanned photographic plates with astronomical images is made.

0710.0736 2026-06-03 cs.CV cs.NA math.NA

Colour image segmentation by the vector-valued Allen-Cahn phase-field model: a multigrid solution

基于向量值Allen-Cahn相场模型的彩色图像分割:多重网格解法

David A Kay, Alessandro Tomasi

AI总结 提出结合向量值Allen-Cahn相场方程与初始数据拟合项的彩色图像分割PDE模型,并采用多重网格有限元方法实现高效鲁棒的分割。

Comments 17 pages, 9 figures

详情
Journal ref
IEEE Trans. Im. Proc. 18.10 (2009)
AI中文摘要

我们提出了一种用于彩色图像分割的PDE驱动模型数值解的新方法,并给出了结果的数值示例。该方法将向量值Allen-Cahn相场方程与初始数据拟合项相结合。已知该方法与Mumford-Shah问题以及Chan和Vese的水平集分割密切相关。我们的数值解使用有限元空间的多重网格分裂进行,从而为大型图像的分割产生了一种高效且鲁棒的方法。

英文摘要

We propose a new method for the numerical solution of a PDE-driven model for colour image segmentation and give numerical examples of the results. The method combines the vector-valued Allen-Cahn phase field equation with initial data fitting terms. This method is known to be closely related to the Mumford-Shah problem and the level set segmentation by Chan and Vese. Our numerical solution is performed using a multigrid splitting of a finite element space, thereby producing an efficient and robust method for the segmentation of large images.

1011.1939 2026-06-03 cs.RO cs.SY eess.SY math.OC

Discrete Partitioning and Coverage Control for Gossiping Robots

面向闲聊机器人的离散分区与覆盖控制

Joseph W. Durham, Ruggero Carli, Paolo Frasca, Francesco Bullo

AI总结 针对非凸环境,提出基于图表示和短程不可靠成对通信的分布式算法,实现机器人团队的分区与覆盖控制,并证明收敛到成对最优分区。

Comments Accepted to IEEE TRO. 14 double-column pages, 10 figures. v2 is a thorough revision of v1, including new algorithms and revised mathematical and simulation results

详情
AI中文摘要

我们提出了分布式算法,用于自动部署一组移动机器人以对非凸环境进行分区和覆盖。为处理任意非凸环境,我们将其表示为图。我们的分区和覆盖算法仅需要短程、不可靠的成对“闲聊”通信。该算法包含两个部分:(1) 一个运动协议,确保相邻机器人至少间歇性地通信;(2) 一个成对分区规则,用于在两个机器人通信时更新领地所有权。通过研究图顶点分区空间上的适当动力系统,我们证明了领地所有权在有限时间内收敛到成对最优分区。这一新的平衡集代表了比常见Lloyd类型算法更优的性能。此外,我们详细说明了算法如何在大规模团队和大规模环境中良好扩展,以及计算如何在有限资源下随时运行。最后,我们报告了在复杂环境中的大规模仿真和使用Player/Stage机器人控制系统的硬件实验。

英文摘要

We propose distributed algorithms to automatically deploy a team of mobile robots to partition and provide coverage of a non-convex environment. To handle arbitrary non-convex environments, we represent them as graphs. Our partitioning and coverage algorithm requires only short-range, unreliable pairwise "gossip" communication. The algorithm has two components: (1) a motion protocol to ensure that neighboring robots communicate at least sporadically, and (2) a pairwise partitioning rule to update territory ownership when two robots communicate. By studying an appropriate dynamical system on the space of partitions of the graph vertices, we prove that territory ownership converges to a pairwise-optimal partition in finite time. This new equilibrium set represents improved performance over common Lloyd-type algorithms. Additionally, we detail how our algorithm scales well for large teams in large environments and how the computation can run in anytime with limited resources. Finally, we report on large-scale simulations in complex environments and hardware experiments using the Player/Stage robot control system.

1003.2022 2026-06-03 cs.CV cs.CE cs.IT cs.NA math.IT math.NA

Fast space-variant elliptical filtering using box splines

使用盒样条进行快速空间变椭圆滤波

Kunal Narayan Chaudhury, Arrate Munoz-Barrutia, Michael Unser

AI总结 本文提出一种基于径向均匀盒样条的方法,通过预积分和局部有限差分实现每像素固定计算量的空间变高斯椭圆滤波,支持连续控制尺寸、伸长和方向。

Comments 12 figures; IEEE Transactions on Image Processing, vol. 19, 2010

详情
Journal ref
IEEE Transactions on Image Processing, vol. 19(9), pp. 2290 - 2306, 2010
AI中文摘要

线性空间变(非卷积)滤波器的高效实现是图像处理中一个具有挑战性的计算问题。在本文中,我们证明可以使用每像素固定数量的计算来对图像进行具有变化大小、伸长和方向的高斯型椭圆窗口滤波。相关算法基于一族光滑紧支撑分段多项式——径向均匀盒样条,通过预积分和局部有限差分实现。径向均匀盒样条是通过重复卷积固定数量的盒分布构造的,这些盒分布经过适当缩放并以均匀方式径向分布。这些盒样条的吸引人特性包括其渐近行为、简单的协方差结构以及准可分离性。随着阶数的增加,它们收敛到高斯函数,并可通过控制组成盒分布的尺度来近似具有不同协方差的各向异性高斯函数。基于第二个特性,我们开发了一种连续控制这些高斯型函数大小、伸长和方向的技术。最后,利用准可分离结构以及盒分布的某种缩放性质,高效实现了相关的空间变椭圆滤波,该滤波每像素需要O(1)次计算,与滤波器的形状和大小无关。

英文摘要

The efficient realization of linear space-variant (non-convolution) filters is a challenging computational problem in image processing. In this paper, we demonstrate that it is possible to filter an image with a Gaussian-like elliptic window of varying size, elongation and orientation using a fixed number of computations per pixel. The associated algorithm, which is based on a family of smooth compactly supported piecewise polynomials, the radially-uniform box splines, is realized using pre-integration and local finite-differences. The radially-uniform box splines are constructed through the repeated convolution of a fixed number of box distributions, which have been suitably scaled and distributed radially in an uniform fashion. The attractive features of these box splines are their asymptotic behavior, their simple covariance structure, and their quasi-separability. They converge to Gaussians with the increase of their order, and are used to approximate anisotropic Gaussians of varying covariance simply by controlling the scales of the constituent box distributions. Based on the second feature, we develop a technique for continuously controlling the size, elongation and orientation of these Gaussian-like functions. Finally, the quasi-separable structure, along with a certain scaling property of box distributions, is used to efficiently realize the associated space-variant elliptical filtering, which requires O(1) computations per pixel irrespective of the shape and size of the filter.

1202.0253 2026-06-03 cs.RO cs.SY eess.SY

High-speed Flight in an Ergodic Forest

遍历森林中的高速飞行

Sertac Karaman, Emilio Frazzoli

AI总结 本文研究在仅已知障碍物生成过程统计特性的随机障碍物场中高速导航的理论基础,通过遍历性和渗流理论揭示了无限无碰撞轨迹存在的相变现象,并推导了临界速度的上下界。

Comments Manuscript submitted to the IEEE Transactions on Robotics

详情
AI中文摘要

受鸟类在密集森林等杂乱环境中飞行的启发,本文研究了一个新颖运动规划问题的理论基础:在仅已知障碍物生成过程统计特性的情况下,通过随机生成的障碍物场进行高速导航。类似于平面森林环境,假设障碍物生成过程决定了圆盘形障碍物的位置和大小。当该过程是遍历的,并且在鸟类动力学的温和技术条件下,证明了通过森林的无限无碰撞轨迹的存在性表现出相变。一方面,如果鸟的飞行速度超过某个临界速度,那么以概率1,不存在无限无碰撞轨迹,即无论控制鸟运动的规划算法如何,鸟几乎必然最终会与某棵树碰撞。另一方面,如果鸟的飞行速度低于该临界速度,那么几乎必然存在至少一条无限无碰撞轨迹。针对齐次泊松森林的特殊情况,考虑鸟动力学的简单模型,推导了临界速度的上下界。对于相同情况,给出了一个等价渗流模型。利用该模型,通过蒙特卡洛模拟近似了相图。本文还通过遍历理论和渗流理论建立了机器人运动规划与统计物理之间的新联系,这可能具有独立的研究价值。

英文摘要

Inspired by birds flying through cluttered environments such as dense forests, this paper studies the theoretical foundations of a novel motion planning problem: high-speed navigation through a randomly-generated obstacle field when only the statistics of the obstacle generating process are known a priori. Resembling a planar forest environment, the obstacle generating process is assumed to determine the locations and sizes of disk-shaped obstacles. When this process is ergodic, and under mild technical conditions on the dynamics of the bird, it is shown that the existence of an infinite collision-free trajectory through the forest exhibits a phase transition. On one hand, if the bird flies faster than a certain critical speed, then, with probability one, there is no infinite collision-free trajectory, i.e., the bird will eventually collide with some tree, almost surely, regardless of the planning algorithm governing the bird's motion. On the other hand, if the bird flies slower than this critical speed, then there exists at least one infinite collision-free trajectory, almost surely. Lower and upper bounds on the critical speed are derived for the special case of a homogeneous Poisson forest considering a simple model for the bird's dynamics. For the same case, an equivalent percolation model is provided. Using this model, the phase diagram is approximated in Monte-Carlo simulations. This paper also establishes novel connections between robot motion planning and statistical physics through ergodic theory and percolation theory, which may be of independent interest.

1303.4778 2026-06-03 cs.LG cs.NA math.NA stat.ML

Greedy Feature Selection for Subspace Clustering

子空间聚类的贪婪特征选择

Eva L. Dyer, Aswin C. Sankaranarayanan, Richard G. Baraniuk

AI总结 本文研究使用贪婪方法(正交匹配追踪)进行子空间聚类的精确特征选择,并证明其在稀疏采样条件下优于最近邻方法。

Comments 32 pages, 7 figures, 1 table

详情
Journal ref
Journal of Machine Learning Research, Vol.14, Issue 1, pp. 2487-2517, January 2013
AI中文摘要

子空间的并集为高维数据集合提供了对线性子空间模型的强大推广。为了从数据集合中学习子空间的并集,必须识别集合中属于同一子空间的信号集,以获得数据中存在的子空间结构的准确估计。最近,稀疏恢复方法已被证明为精确特征选择(EFS)提供了可证明且稳健的策略——从集合中恢复位于同一子空间的点集。与最近关于L1最小化EFS的研究并行,本文为使用贪婪方法(即正交匹配追踪(OMP))进行稀疏信号恢复的EFS发展了充分条件。在分析之后,我们提供了对生活在子空间并集上的信号的特征选择策略的实证研究,并刻画了稀疏恢复方法与基于最近邻(NN)的方法之间的差距。特别是,我们证明了稀疏恢复方法比NN方法具有显著优势,并且当数据集中子空间的采样稀疏时,这两种方法之间的差距尤为明显。我们的结果表明,在NN方法无法揭示集合中点所属子空间的许多情况下,OMP可以可靠地恢复精确的特征集。

英文摘要

Unions of subspaces provide a powerful generalization to linear subspace models for collections of high-dimensional data. To learn a union of subspaces from a collection of data, sets of signals in the collection that belong to the same subspace must be identified in order to obtain accurate estimates of the subspace structures present in the data. Recently, sparse recovery methods have been shown to provide a provable and robust strategy for exact feature selection (EFS)--recovering subsets of points from the ensemble that live in the same subspace. In parallel with recent studies of EFS with L1-minimization, in this paper, we develop sufficient conditions for EFS with a greedy method for sparse signal recovery known as orthogonal matching pursuit (OMP). Following our analysis, we provide an empirical study of feature selection strategies for signals living on unions of subspaces and characterize the gap between sparse recovery methods and nearest neighbor (NN)-based approaches. In particular, we demonstrate that sparse recovery methods provide significant advantages over NN methods and the gap between the two approaches is particularly pronounced when the sampling of subspaces in the dataset is sparse. Our results suggest that OMP may be employed to reliably recover exact feature sets in a number of regimes where NN approaches fail to reveal the subspace membership of points in the ensemble.

1111.2262 2026-06-03 cs.LG cs.NA math.NA

Improved Bound for the Nystrom's Method and its Application to Kernel Classification

Nyström方法的改进界及其在核分类中的应用

Rong Jin, Tianbao Yang, Mehrdad Mahdavi, Yu-Feng Li, Zhi-Hua Zhou

AI总结 本文通过积分算子集中不等式和压缩感知理论改进了Nyström方法的谱范数逼近误差界,并应用于核分类,证明在特征值服从p次幂律时可将支持向量数量减少至N^{2p/(p^2-1)}。

详情
AI中文摘要

我们开发了两种分析Nyström方法逼近误差界的方法,一种基于积分算子的集中不等式,另一种基于压缩感知理论。我们表明,在大特征间隙的情况下,以谱范数度量的逼近误差可以从$O(N/\sqrt{m})$改进到$O(N/m^{1 - ρ})$,其中$N$是数据点总数,$m$是采样数据点数,$ρ\in (0, 1/2)$是刻画特征间隙的正常数。当核矩阵的特征值服从$p$次幂律时,基于压缩感知理论的分析在非相干性假设下进一步将界改进为$O(N/m^{p - 1})$,这解释了为什么Nyström方法对特征值倾斜的核矩阵效果良好。我们提出了一种基于Nyström方法的核分类方法,并利用改进的界推导了其泛化性能。我们表明,当核矩阵的特征值服从$p$次幂律时,我们可以将支持向量数量减少到$N^{2p/(p^2 - 1)}$,当$p > 1+\sqrt{2}$时该数量小于$N$,而不会严重牺牲其泛化性能。

英文摘要

We develop two approaches for analyzing the approximation error bound for the Nyström method, one based on the concentration inequality of integral operator, and one based on the compressive sensing theory. We show that the approximation error, measured in the spectral norm, can be improved from $O(N/\sqrt{m})$ to $O(N/m^{1 - ρ})$ in the case of large eigengap, where $N$ is the total number of data points, $m$ is the number of sampled data points, and $ρ\in (0, 1/2)$ is a positive constant that characterizes the eigengap. When the eigenvalues of the kernel matrix follow a $p$-power law, our analysis based on compressive sensing theory further improves the bound to $O(N/m^{p - 1})$ under an incoherence assumption, which explains why the Nyström method works well for kernel matrix with skewed eigenvalues. We present a kernel classification approach based on the Nyström method and derive its generalization performance using the improved bound. We show that when the eigenvalues of kernel matrix follow a $p$-power law, we can reduce the number of support vectors to $N^{2p/(p^2 - 1)}$, a number less than $N$ when $p > 1+\sqrt{2}$, without seriously sacrificing its generalization performance.

1205.0207 2026-06-03 cs.RO cs.SY eess.SY

Shortest Path Set Induced Vertex Ordering and its Application to Distributed Distance Optimal Multi-agent Formation Path Planning

最短路径集诱导的顶点排序及其在分布式距离最优多智能体编队路径规划中的应用

Jingjin Yu

AI总结 针对无向图中不可区分智能体移动到任意目标编队的距离最优路径规划问题,提出一种基于最短路径集诱导顶点排序的集中式算法,并首次实现分布式调度,同时保证相同的收敛时间。

Comments Extended the earlier version to 8 Pages, complete with literature review. One additional section on a distributed scheduling algorithm is added

详情
AI中文摘要

对于在单位边长的连通图上将一组不可区分的智能体移动到任意目标编队的任务,先前的研究表明,使用完全集中式算法可以调度距离最优路径,并具有严格的收敛时间保证。在本研究中,我们表明问题公式实际上在底层图网络上诱导出更基本的顶点排序,这直接导致更直观的调度算法,保证相同的收敛时间且运行更快。更重要的是,这种结构使得在将个体路径分配给智能体后能够实现分布式调度算法,这是以前不可能的。顶点排序也容易扩展到更一般的图——那些具有非单位容量和边长的图——对于这些图,我们再次保证达到期望编队的收敛时间。

英文摘要

For the task of moving a group of indistinguishable agents on a connected graph with unit edge lengths into an arbitrary goal formation, it was previously shown that distance optimal paths can be scheduled to complete with a tight convergence time guarantee, using a fully centralized algorithm. In this study, we show that the problem formulation in fact induces a more fundamental ordering of the vertices on the underlying graph network, which directly leads to a more intuitive scheduling algorithm that assures the same convergence time and runs faster. More importantly, this structure enables a distributed scheduling algorithm once individual paths are assigned to the agents, which was not possible before. The vertex ordering also readily extends to more general graphs - those with non-unit capacities and edge lengths - for which we again guarantee the convergence time until the desired formation is achieved.

1204.3830 2026-06-03 cs.RO cs.AI cs.SY eess.SY

Planning Optimal Paths for Multiple Robots on Graphs

图上多机器人路径规划的最优路径

Jingjin Yu, Steven M. LaValle

AI总结 提出两种基于多流整数线性规划的模型,分别求解多机器人路径规划的最小最后到达时间和最小总距离问题,算法完备且保证最优解。

Comments Changed "agents" to "robots"

详情
AI中文摘要

在本文中,我们研究了图上多机器人路径规划(MPP)的最优问题。我们提出了两种基于多流的整数线性规划(ILP)模型,分别计算MPP公式的最小最后到达时间和最小总距离解。这些ILP模型产生的算法是完备的,并保证得到真正的最优解。此外,我们的灵活框架可以轻松适应MPP问题的其他变体。专注于时间最优算法,我们评估了其性能,既作为独立算法,也作为快速解决大规模问题实例的通用启发式方法。计算结果证实了我们方法的有效性。

英文摘要

In this paper, we study the problem of optimal multi-robot path planning (MPP) on graphs. We propose two multiflow based integer linear programming (ILP) models that computes minimum last arrival time and minimum total distance solutions for our MPP formulation, respectively. The resulting algorithms from these ILP models are complete and guaranteed to yield true optimal solutions. In addition, our flexible framework can easily accommodate other variants of the MPP problem. Focusing on the time optimal algorithm, we evaluate its performance, both as a stand alone algorithm and as a generic heuristic for quickly solving large problem instances. Computational results confirm the effectiveness of our method.

1202.6429 2026-06-03 cs.CV cs.IT cs.NA math.IT math.NA

Stable image reconstruction using total variation minimization

利用全变差最小化的稳定图像重建

Deanna Needell, Rachel Ward

AI总结 本文利用全变差最小化,从欠采样噪声测量中实现图像的高精度鲁棒重建,并给出了近最优保证。

Comments 25 pages

详情
AI中文摘要

本文提出了利用全变差最小化从欠采样噪声测量中实现准确且鲁棒的图像恢复的近最优保证。特别地,我们证明从 O(slog(N)) 个非自适应线性测量中,图像可以重建到其梯度最佳 s 项近似的对数因子范围内,并且通过略微增加测量次数可以消除该因子。在此过程中,我们证明了对于位于适当不相干矩阵零空间中的函数,存在一个加强的 Sobolev 不等式。

英文摘要

This article presents near-optimal guarantees for accurate and robust image recovery from under-sampled noisy measurements using total variation minimization. In particular, we show that from O(slog(N)) nonadaptive linear measurements, an image can be reconstructed to within the best s-term approximation of its gradient up to a logarithmic factor, and this factor can be removed by taking slightly more measurements. Along the way, we prove a strengthened Sobolev inequality for functions lying in the null space of suitably incoherent matrices.

1104.4251 2026-06-03 cs.RO cs.MA cs.SY eess.SY math.OC

Distributed Self-Organization Of Swarms To Find Globally $ε$-Optimal Routes To Locally Sensed Targets

群体分布式自组织以找到局部感知目标的全局$ε$-最优路径

Ishanu Chattopadhyay

AI总结 针对大规模群体,提出一种仅利用局部信息的分布式路径规划算法,通过信息渗透和梯度涌现实现接近最优的路径选择,并严格分析了收敛性、鲁棒性、可扩展性及系统参数的影响。

Comments 38 pages 10 Figures

详情
AI中文摘要

在大规模群体的背景下,研究了局部感知目标的近最优分布式路径规划问题。所提出的算法仅使用可以局部查询的信息,并建立了关于收敛性、鲁棒性和可扩展性的严格理论结果,同时分析了系统参数(如个体通信半径和个体速度)对全局性能的影响。该方法的基本思想是让局部信息在整个群体中渗透,使个体能够间接访问全局上下文。通过相邻个体之间的局部信息交换,以分布式方式计算反映个体性能的梯度。研究表明,为了沿着接近最优的路径到达只能局部感知且位置未知的目标,个体只需向其“最佳”邻居移动,其中“最佳”的概念是通过计算底层概率有限状态自动机的状态特定语言度量得到的。理论结果在超过$10^4$个个体的高保真仿真实验中得到了验证。

英文摘要

The problem of near-optimal distributed path planning to locally sensed targets is investigated in the context of large swarms. The proposed algorithm uses only information that can be locally queried, and rigorous theoretical results on convergence, robustness, scalability are established, and effect of system parameters such as the agent-level communication radius and agent velocities on global performance is analyzed. The fundamental philosophy of the proposed approach is to percolate local information across the swarm, enabling agents to indirectly access the global context. A gradient emerges, reflecting the performance of agents, computed in a distributed manner via local information exchange between neighboring agents. It is shown that to follow near-optimal routes to a target which can be only sensed locally, and whose location is not known a priori, the agents need to simply move towards its "best" neighbor, where the notion of "best" is obtained by computing the state-specific language measure of an underlying probabilistic finite state automata. The theoretical results are validated in high-fidelity simulation experiments, with excess of $10^4$ agents.

1101.4003 2026-06-03 cs.AI cs.LG cs.SY eess.SY math.OC

Dyna-H: a heuristic planning reinforcement learning algorithm applied to role-playing-game strategy decision systems

Dyna-H:一种应用于角色扮演游戏策略决策系统的启发式规划强化学习算法

Matilde Santos, Jose Antonio Martin H., Victoria Lopez, Guillermo Botella

AI总结 提出Dyna-H算法,结合启发式搜索与Dyna框架,在角色扮演游戏策略决策中实现无模型在线强化学习,实验表明其性能显著优于Q-Learning和Dyna-Q。

详情
AI中文摘要

在角色扮演游戏中,寻找最优轨迹是最重要的任务之一。实际上,策略决策系统成为游戏引擎的关键组成部分。决策方式(在线、批处理或模拟)以及决策所消耗的资源(如执行时间、内存)将在很大程度上影响游戏性能。当可以使用经典搜索算法(如A*)时,它们是最优先的选择。然而,这些方法依赖于搜索空间的精确和完整模型,在许多有趣的场景中无法应用。此时,无模型的序贯决策方法(在不确定性下)是最佳选择。本文提出一种启发式规划策略,将启发式搜索在路径规划中的能力融入Dyna智能体。所提出的Dyna-H算法,与A*一样,会选择更有可能产生结果的路径分支。此外,它具有无模型在线强化学习算法的优点。该方案与单步Q-Learning和Dyna-Q算法进行了对比评估,获得了优异的实验结果:Dyna-H在所有实验中显著优于这两种方法。我们还提出了一个功能类比,即从最差轨迹中采样的启发式与人类行为中梦境(如噩梦)的作用类似。

英文摘要

In a Role-Playing Game, finding optimal trajectories is one of the most important tasks. In fact, the strategy decision system becomes a key component of a game engine. Determining the way in which decisions are taken (online, batch or simulated) and the consumed resources in decision making (e.g. execution time, memory) will influence, in mayor degree, the game performance. When classical search algorithms such as A* can be used, they are the very first option. Nevertheless, such methods rely on precise and complete models of the search space, and there are many interesting scenarios where their application is not possible. Then, model free methods for sequential decision making under uncertainty are the best choice. In this paper, we propose a heuristic planning strategy to incorporate the ability of heuristic-search in path-finding into a Dyna agent. The proposed Dyna-H algorithm, as A* does, selects branches more likely to produce outcomes than other branches. Besides, it has the advantages of being a model-free online reinforcement learning algorithm. The proposal was evaluated against the one-step Q-Learning and Dyna-Q algorithms obtaining excellent experimental results: Dyna-H significantly overcomes both methods in all experiments. We suggest also, a functional analogy between the proposed sampling from worst trajectories heuristic and the role of dreams (e.g. nightmares) in human behavior.

1005.2146 2026-06-03 cs.LG cs.NA math.NA

On the Finite Time Convergence of Cyclic Coordinate Descent Methods

关于循环坐标下降法的有限时间收敛性

Ankan Saha, Ambuj Tewari

AI总结 本文证明了在等调性假设下,两种循环坐标下降变体具有O(1/k)的收敛速率,并通过与梯度下降的比较展示了其优越性。

Comments 20 pages

详情
AI中文摘要

循环坐标下降是一种经典的优化方法,在机器学习中重新引起了兴趣。原因包括其简单性、速度和稳定性,以及在$\ell_1$正则化光滑优化问题上的竞争性能。令人惊讶的是,关于这些问题的有限时间收敛行为知之甚少。大多数现有结果要么仅证明收敛,要么提供渐近速率。我们通过证明在等调性假设下,两种循环坐标下降变体具有$O(1/k)$的收敛速率(其中$k$是迭代次数),填补了这一文献空白。我们的分析通过比较两种变体所达到的目标值以及梯度下降算法来进行。我们表明,循环坐标下降方法生成的迭代点在整个时间上始终优于梯度下降。

英文摘要

Cyclic coordinate descent is a classic optimization method that has witnessed a resurgence of interest in machine learning. Reasons for this include its simplicity, speed and stability, as well as its competitive performance on $\ell_1$ regularized smooth optimization problems. Surprisingly, very little is known about its finite time convergence behavior on these problems. Most existing results either just prove convergence or provide asymptotic rates. We fill this gap in the literature by proving $O(1/k)$ convergence rates (where $k$ is the iteration counter) for two variants of cyclic coordinate descent under an isotonicity assumption. Our analysis proceeds by comparing the objective values attained by the two variants with each other, as well as with the gradient descent algorithm. We show that the iterates generated by the cyclic coordinate descent methods remain better than those of gradient descent uniformly over time.

1209.2058 2026-06-03 cs.RO cs.DC cs.MA cs.SY eess.SY

Safe and Stabilizing Distributed Multi-Path Cellular Flows

安全且稳定的分布式多路径蜂窝流

Taylor T. Johnson, Sayan Mitra

AI总结 针对分区平面中的分布式交通控制问题,提出一种保证实体间最小安全距离并能在单目标下自稳定、多目标下避免死锁的协议,通过临时阻塞和局部地理路由实现安全与进展。

Comments An earlier version of this paper appeared in the 30th IEEE International Conference on Distributed Computing Systems (ICDCS 2010)

详情
AI中文摘要

我们研究了分区平面中的分布式交通控制问题,其中每个分区(单元)内所有实体(机器人、车辆等)的运动是耦合的。在此类系统中建立活性具有挑战性,但这种分析对于将分布式交通控制算法应用于协调机器人群体和智能高速公路系统等场景是必要的。我们提出了一个分布式交通控制协议的正式模型,该模型保证实体之间的最小安全距离,即使某些单元发生故障。一旦新故障停止发生,在单目标情况下,协议保证自稳定,并且具有到目标单元可行路径的实体能够向目标前进。对于多目标情况,故障可能导致系统死锁,因此我们识别了一类非死锁故障,其中所有实体都能向各自目标前进。该算法依赖于两个通用原则:临时阻塞以维护安全性,以及局部地理路由以保证进展。我们的断言式证明可作为其他分布式交通控制协议分析的模板。我们给出了仿真结果,提供了吞吐量作为实体速度、安全距离、单目标路径复杂度、故障恢复率和多目标路径复杂度的函数估计。

英文摘要

We study the problem of distributed traffic control in the partitioned plane, where the movement of all entities (robots, vehicles, etc.) within each partition (cell) is coupled. Establishing liveness in such systems is challenging, but such analysis will be necessary to apply such distributed traffic control algorithms in applications like coordinating robot swarms and the intelligent highway system. We present a formal model of a distributed traffic control protocol that guarantees minimum separation between entities, even as some cells fail. Once new failures cease occurring, in the case of a single target, the protocol is guaranteed to self-stabilize and the entities with feasible paths to the target cell make progress towards it. For multiple targets, failures may cause deadlocks in the system, so we identify a class of non-deadlocking failures where all entities are able to make progress to their respective targets. The algorithm relies on two general principles: temporary blocking for maintenance of safety and local geographical routing for guaranteeing progress. Our assertional proofs may serve as a template for the analysis of other distributed traffic control protocols. We present simulation results that provide estimates of throughput as a function of entity velocity, safety separation, single-target path complexity, failure-recovery rates, and multi-target path complexity.

1112.3010 2026-06-03 cs.CV cs.NA math.NA

A new variational principle for the Euclidean distance function: Linear approach to the non-linear eikonal problem

欧几里得距离函数的新变分原理:非线性程函问题的线性方法

Karthik S. Gurumoorthy, Anand Rangarajan

AI总结 提出一种基于卷积的快速算法,通过求解线性微分方程并取负对数来近似计算欧几里得距离函数,利用快速傅里叶变换高效实现,避免了传统方法对非线性Hamilton-Jacobi方程的直接求解。

详情
AI中文摘要

我们提出了一种基于卷积的快速技术,用于在二维和三维网格位置上计算近似的有符号欧几里得距离函数 $S$。我们的方法不是求解非线性的静态Hamilton-Jacobi方程($\\|\nabla S\\|=1$),而是首先求解线性微分方程中的标量场 $\phi$,然后通过取负对数推导出 $S$ 的解。换句话说,当 $S$ 和 $\phi$ 通过 $\phi = \exp\left(-\frac{S}{\tau}\right)$ 关联,且 $\phi$ 满足对应于变分问题极值的特定线性微分方程时,我们得到近似的欧几里得距离函数 $S = -\tau\log(\phi)$,该函数在 $\tau\rightarrow 0$ 的极限下收敛于真实解。这与快速行进法和快速扫描法等直接通过Godunov迎风离散格式求解Hamilton-Jacobi方程的技术形成鲜明对比。我们的线性公式导致近似欧几里得距离函数的闭式解可表示为离散卷积,因此可通过快速傅里叶变换(FFT)高效计算。我们的解还避免了对导数算子进行空间离散化的需要。当 $\tau\rightarrow 0$ 时,我们展示了结果收敛于真实解,并针对给定的 $\tau$ 值限定了误差。我们解的可微性允许我们通过一组卷积计算近似距离函数的一阶和二阶导数。为了确定距离函数的符号(定义为在封闭区域内为正,区域外为负),我们计算二维中的缠绕数和三维中的拓扑度,这些计算也可以通过快速卷积进行。我们通过一组实验结果证明了我们方法的有效性。

英文摘要

We present a fast convolution-based technique for computing an approximate, signed Euclidean distance function $S$ on a set of 2D and 3D grid locations. Instead of solving the non-linear, static Hamilton-Jacobi equation ($\|\nabla S\|=1$), our solution stems from first solving for a scalar field $ϕ$ in a linear differential equation and then deriving the solution for $S$ by taking the negative logarithm. In other words, when $S$ and $ϕ$ are related by $ϕ= \exp \left(-\frac{S}τ \right)$ and $ϕ$ satisfies a specific linear differential equation corresponding to the extremum of a variational problem, we obtain the approximate Euclidean distance function $S = -τ\log(ϕ)$ which converges to the true solution in the limit as $τ\rightarrow 0$. This is in sharp contrast to techniques like the fast marching and fast sweeping methods which directly solve the Hamilton-Jacobi equation by the Godunov upwind discretization scheme. Our linear formulation results in a closed-form solution to the approximate Euclidean distance function expressible as a discrete convolution, and hence efficiently computable using the fast Fourier transform (FFT). Our solution also circumvents the need for spatial discretization of the derivative operator. As $τ\rightarrow0$ we show the convergence of our results to the true solution and also bound the error for a given value of $τ$. The differentiability of our solution allows us to compute---using a set of convolutions---the first and second derivatives of the approximate distance function. In order to determine the sign of the distance function (defined to be positive inside a closed region and negative outside), we compute the winding number in 2D and the topological degree in 3D, whose computations can also be performed via fast convolutions. We demonstrate the efficacy of our method through a set of experimental results.

1201.5604 2026-06-03 cs.AI cs.LG cs.NE cs.SY eess.SY math.OC

Discrete and fuzzy dynamical genetic programming in the XCSF learning classifier system

XCSF学习分类系统中的离散与模糊动态遗传编程

Richard J. Preen, Larry Bull

AI总结 本文在XCSF框架内使用离散和模糊动态系统表示(异步随机布尔网络和模糊逻辑网络),通过自适应的开放式进化设计集成系统,解决多个经典测试问题。

详情
Journal ref
Soft Computing (2014), 18(1):153-167
AI中文摘要

学习分类系统中已经提出了多种表示方案,从二进制编码到神经网络。本文报告了在XCSF学习分类系统中使用离散和模糊动态系统表示的研究结果。具体而言,在离散情况下使用异步随机布尔网络表示传统的条件-动作生产系统规则,在连续值情况下使用异步模糊逻辑网络。研究表明,可以在XCSF中使用自适应的开放式进化来设计此类动态系统的集成,以解决多个著名的测试问题。

英文摘要

A number of representation schemes have been presented for use within learning classifier systems, ranging from binary encodings to neural networks. This paper presents results from an investigation into using discrete and fuzzy dynamical system representations within the XCSF learning classifier system. In particular, asynchronous random Boolean networks are used to represent the traditional condition-action production system rules in the discrete case and asynchronous fuzzy logic networks in the continuous-valued case. It is shown possible to use self-adaptive, open-ended evolution to design an ensemble of such dynamical systems within XCSF to solve a number of well-known test problems.

1210.7559 2026-06-03 cs.LG cs.NA math.NA stat.ML

Tensor decompositions for learning latent variable models

用于学习潜变量模型的张量分解

Anima Anandkumar, Rong Ge, Daniel Hsu, Sham M. Kakade, Matus Telgarsky

AI总结 本文利用低阶可观测矩的张量结构,通过对称张量分解(类似矩阵SVD的推广)实现高斯混合模型、隐马尔可夫模型等潜变量模型的参数估计,并提供了鲁棒张量幂法的详细分析。

详情
Journal ref
Journal of Machine Learning Research, 15(Aug):2773-2832, 2014
AI中文摘要

本文研究了一类广泛的潜变量模型(包括高斯混合模型、隐马尔可夫模型和潜在狄利克雷分配)的计算和统计高效参数估计方法,该方法利用了其低阶可观测矩(通常是二阶和三阶)中的特定张量结构。具体地,参数估计被简化为从矩导出的对称张量中提取某种(正交)分解的问题;这种分解可以看作是矩阵奇异值分解的自然推广。尽管张量分解通常难以计算,但这些特殊结构张量的分解可以通过多种方法高效获得,包括幂迭代和最大化方法(类似于矩阵的情况)。本文提供了鲁棒张量幂方法的详细分析,建立了类似于矩阵奇异向量的Wedin扰动定理的类比。这意味着对于几种流行的潜变量模型,存在一种鲁棒且计算可行的估计方法。

英文摘要

This work considers a computationally and statistically efficient parameter estimation method for a wide class of latent variable models---including Gaussian mixture models, hidden Markov models, and latent Dirichlet allocation---which exploits a certain tensor structure in their low-order observable moments (typically, of second- and third-order). Specifically, parameter estimation is reduced to the problem of extracting a certain (orthogonal) decomposition of a symmetric tensor derived from the moments; this decomposition can be viewed as a natural generalization of the singular value decomposition for matrices. Although tensor decompositions are generally intractable to compute, the decomposition of these specially structured tensors can be efficiently obtained by a variety of approaches, including power iterations and maximization approaches (similar to the case of matrices). A detailed analysis of a robust tensor power method is provided, establishing an analogue of Wedin's perturbation theorem for the singular vectors of matrices. This implies a robust and computationally tractable estimation approach for several popular latent variable models.

1204.4200 2026-06-03 cs.AI cs.LG cs.NE cs.SY eess.SY

Discrete Dynamical Genetic Programming in XCS

XCS中的离散动力遗传编程

Richard J. Preen, Larry Bull

AI总结 本文研究在XCS学习分类器系统中使用异步随机布尔网络作为离散动力系统表示,通过自适应的开放式进化设计集成系统以解决多个经典测试问题。

Comments arXiv admin note: substantial text overlap with arXiv:1201.5604

详情
Journal ref
In Proceedings of the 11th annual conference on genetic and evolutionary computation, GECCO '09, pp. 1299-1306. ACM, 2009
AI中文摘要

在XCS学习分类器系统中,已有多种表示方案,从二进制编码到神经网络。本文研究了在XCS中使用离散动力系统表示的结果。特别地,使用异步随机布尔网络来表示传统的条件-动作生产系统规则。结果表明,可以通过自适应的开放式进化在XCS中设计这样的离散动力系统集成,以解决多个著名的测试问题。

英文摘要

A number of representation schemes have been presented for use within Learning Classifier Systems, ranging from binary encodings to neural networks. This paper presents results from an investigation into using a discrete dynamical system representation within the XCS Learning Classifier System. In particular, asynchronous random Boolean networks are used to represent the traditional condition-action production system rules. It is shown possible to use self-adaptive, open-ended evolution to design an ensemble of such discrete dynamical systems within XCS to solve a number of well-known test problems.

1211.4116 2026-06-03 cs.LG cs.NA math.AG math.CO math.NA stat.ML

The Algebraic Combinatorial Approach for Low-Rank Matrix Completion

低秩矩阵补全的代数组合方法

Franz J. Király, Louis Theran, Ryota Tomioka

AI总结 本文提出一种基于代数几何和拟阵理论的代数组合新视角,通过研究少量条目间的关系来解决低秩矩阵补全问题,并给出概率为1的算法判断特定条目是否可补全、从少量条目补全该条目及估计补全误差。

Comments 37 pages, with an appendix by Takeaki Uno

详情
AI中文摘要

我们提出了一种新颖的低秩矩阵补全的代数组合观点,该观点基于使用代数几何和拟阵理论的工具研究少量条目之间的关系。该方法的固有局部性允许在封闭的理论和实践框架中处理单个条目。更具体地说,除了引入低秩矩阵补全的代数组合理论外,我们还提出了概率为1的算法,以决定矩阵的特定条目是否可以被补全。我们还描述了从少量其他条目补全该条目的方法,以及估计任何补全该条目的方法所产生的误差。此外,我们展示了关于矩阵补全的已知结果及其采样假设如何与我们的新视角相关联,并可根据可补全性相变进行解释。

英文摘要

We present a novel algebraic combinatorial view on low-rank matrix completion based on studying relations between a few entries with tools from algebraic geometry and matroid theory. The intrinsic locality of the approach allows for the treatment of single entries in a closed theoretical and practical framework. More specifically, apart from introducing an algebraic combinatorial theory of low-rank matrix completion, we present probability-one algorithms to decide whether a particular entry of the matrix can be completed. We also describe methods to complete that entry from a few others, and to estimate the error which is incurred by any method completing that entry. Furthermore, we show how known results on matrix completion and their sampling assumptions can be related to our new perspective and interpreted in terms of a completability phase transition.

1107.5528 2026-06-03 cs.AI cs.SY eess.SY math.OC

Time Consistent Discounting

时间一致折现

Tor Lattimore, Marcus Hutter

AI总结 本文通过引入随年龄变化的折现函数,刻画了时间一致与不一致的折现函数,并证明了即使折现函数时间不一致,智能体仍存在理性策略。

Comments 17 LaTeX pages, 5 figures

详情
Journal ref
Proc. 22nd International Conf. on Algorithmic Learning Theory (ALT-2011) pages 383-397
AI中文摘要

一个可能永生的智能体试图最大化其随时间累积的折现奖励,其中折现用于避免无限效用并鼓励智能体更重视当前奖励而非未来奖励。一些常用的折现函数会导致时间不一致行为,即智能体会随时间改变其计划。这些不一致可能导致非常糟糕的行为。我们将通常的折现效用模型推广到折现函数随智能体年龄变化的形式。然后,我们给出了时间(不)一致折现函数的简单刻画,并证明了对于知道其折现函数是时间不一致的智能体,存在一个理性策略。

英文摘要

A possibly immortal agent tries to maximise its summed discounted rewards over time, where discounting is used to avoid infinite utilities and encourage the agent to value current rewards more than future ones. Some commonly used discount functions lead to time-inconsistent behavior where the agent changes its plan over time. These inconsistencies can lead to very poor behavior. We generalise the usual discounted utility model to one where the discount function changes with the age of the agent. We then give a simple characterisation of time-(in)consistent discount functions and show the existence of a rational policy for an agent that knows its discount function is time-inconsistent.

1301.3584 2026-06-03 cs.LG cs.NA math.NA

Revisiting Natural Gradient for Deep Networks

重新审视深度网络的自然梯度

Razvan Pascanu, Yoshua Bengio

AI总结 本文重新评估了自然梯度算法在深度模型学习中的应用,揭示了其与Hessian-Free、Krylov子空间下降和TONGA方法的联系,并提出了利用无标签数据改进泛化误差、评估对训练集顺序的鲁棒性以及结合二阶信息的扩展算法。

详情
AI中文摘要

我们评估了最初由Amari (1997)提出的自然梯度算法在深度模型学习中的应用。本文的贡献如下:我们展示了自然梯度与其他三种最近提出的深度模型训练方法——Hessian-Free (Martens, 2010)、Krylov子空间下降 (Vinyals and Povey, 2012)和TONGA (Le Roux et al., 2008)——之间的联系。我们描述了如何利用无标签数据来改善自然梯度所获得的泛化误差,并实证评估了该算法相对于随机梯度下降对训练集顺序的鲁棒性。最后,我们将自然梯度扩展到结合流形信息与二阶信息,并使用截断牛顿方法(而非对角近似)来求逆度量矩阵,从而对新算法进行了基准测试。

英文摘要

We evaluate natural gradient, an algorithm originally proposed in Amari (1997), for learning deep models. The contributions of this paper are as follows. We show the connection between natural gradient and three other recently proposed methods for training deep models: Hessian-Free (Martens, 2010), Krylov Subspace Descent (Vinyals and Povey, 2012) and TONGA (Le Roux et al., 2008). We describe how one can use unlabeled data to improve the generalization error obtained by natural gradient and empirically evaluate the robustness of the algorithm to the ordering of the training set compared to stochastic gradient descent. Finally we extend natural gradient to incorporate second order information alongside the manifold information and provide a benchmark of the new algorithm using a truncated Newton approach for inverting the metric matrix instead of using a diagonal approximation of it.

1208.4391 2026-06-03 cs.CV cs.SY eess.SY

Shape Tracking With Occlusions via Coarse-To-Fine Region-Based Sobolev Descent

基于粗到细区域Sobolev下降的遮挡形状跟踪

Yanchao Yang, Ganesh Sundaramoorthi

AI总结 提出一种在参数化区域黎曼流形上通过粗到细优化处理自遮挡和去遮挡的联合形状与外观跟踪方法,实现精确形状检测。

Comments Extension of ICCV paper, added coarse-to-fine optimization based on new Riemannian manifold of parameterized regions

详情
AI中文摘要

我们提出了一种方法,基于参数化区域的新型黎曼流形上的新建模和优化,跟踪视频中物体的精确形状。联合动态形状和外观模型,其中物体的模板被传播以匹配下一帧中的物体形状和辐射度,在复杂物体辐射度和杂乱背景的情况下优于使用全局图像统计的方法。在3D物体运动和视点变化的情况下,物体的自遮挡和去遮挡很突出,当前使用联合形状和外观模型的方法无法适应新的形状和外观信息,导致形状检测不准确。在这项工作中,我们在联合形状和外观跟踪框架中建模自遮挡和去遮挡。自遮挡和用于传播模板的扭曲是耦合的,因此提出了一个联合问题。我们推导了一个粗到细的优化方案,在物体跟踪中具有优势,该方案首先通过粗扰动扰动模板,然后过渡到更细尺度的扰动,无缝且自动地遍历所有尺度。该方案是在我们引入的新型无限维黎曼流形上的梯度下降。该流形由平面参数化区域组成,我们引入的度量是定义在区域上的无穷小向量场上的新型Sobolev型度量。该度量的性质是,梯度下降自动优先考虑粗尺度变形(当它们减少能量时),然后才转向更细尺度的变形。在展示遮挡/去遮挡、复杂辐射度和背景的视频上的实验表明,与最近使用联合形状/外观模型或使用全局统计的方法相比,遮挡/去遮挡建模导致更优越的形状精度。

英文摘要

We present a method to track the precise shape of an object in video based on new modeling and optimization on a new Riemannian manifold of parameterized regions. Joint dynamic shape and appearance models, in which a template of the object is propagated to match the object shape and radiance in the next frame, are advantageous over methods employing global image statistics in cases of complex object radiance and cluttered background. In cases of 3D object motion and viewpoint change, self-occlusions and dis-occlusions of the object are prominent, and current methods employing joint shape and appearance models are unable to adapt to new shape and appearance information, leading to inaccurate shape detection. In this work, we model self-occlusions and dis-occlusions in a joint shape and appearance tracking framework. Self-occlusions and the warp to propagate the template are coupled, thus a joint problem is formulated. We derive a coarse-to-fine optimization scheme, advantageous in object tracking, that initially perturbs the template by coarse perturbations before transitioning to finer-scale perturbations, traversing all scales, seamlessly and automatically. The scheme is a gradient descent on a novel infinite-dimensional Riemannian manifold that we introduce. The manifold consists of planar parameterized regions, and the metric that we introduce is a novel Sobolev-type metric defined on infinitesimal vector fields on regions. The metric has the property of resulting in a gradient descent that automatically favors coarse-scale deformations (when they reduce the energy) before moving to finer-scale deformations. Experiments on video exhibiting occlusion/dis-occlusion, complex radiance and background show that occlusion/dis-occlusion modeling leads to superior shape accuracy compared to recent methods employing joint shape/appearance models or employing global statistics.

1210.2380 2026-06-03 cs.CV cs.IT cs.NA math.IT math.NA

Stable and robust sampling strategies for compressive imaging

压缩成像的稳定鲁棒采样策略

Felix Krahmer, Rachel Ward

AI总结 针对傅里叶测量与Haar小波稀疏的压缩成像,提出基于局部相干性的变密度采样策略,证明近最优嵌入维度的限制等距性质,实现稳定鲁棒的重建。

Comments 17 pages, 4 figures

详情
AI中文摘要

在许多信号处理应用中,人们希望通过频域采样获取在变换域(如空间有限差分或小波)中稀疏的图像。对于此类应用,大量经验证据表明,通过集中于低频的变密度采样策略可以获得更优的图像重建。小波和傅里叶变换域并非不相干,因为低阶小波和低阶频率是相关的,因此压缩感知理论并不能直接推出采样策略和重建保证。本文转向一种更精细的相干性概念——所谓的局部相干性——分别测量每个感知向量与稀疏基的相关程度。对于傅里叶测量和Haar小波稀疏性,局部相干性可以被显式控制和界定,因此对于由从合适的逆平方幂律密度中采样的频率构成的矩阵,我们可以证明具有近最优嵌入维度的限制等距性质。因此,我们提供的变密度采样策略允许对稀疏缺陷稳定且对测量噪声鲁棒的图像重建。我们的结果涵盖了通过ℓ1最小化和全变差最小化的重建。本文开发的局部相干性框架在更一般的稀疏恢复问题中应具有独立意义,因为它表明,对于最优稀疏恢复结果,只要采样策略相应调整,只需感知基到稀疏基的有界平均相干性——而非有界最大相干性——就足够了。

英文摘要

In many signal processing applications, one wishes to acquire images that are sparse in transform domains such as spatial finite differences or wavelets using frequency domain samples. For such applications, overwhelming empirical evidence suggests that superior image reconstruction can be obtained through variable density sampling strategies that concentrate on lower frequencies. The wavelet and Fourier transform domains are not incoherent because low-order wavelets and low-order frequencies are correlated, so compressive sensing theory does not immediately imply sampling strategies and reconstruction guarantees. In this paper we turn to a more refined notion of coherence -- the so-called local coherence -- measuring for each sensing vector separately how correlated it is to the sparsity basis. For Fourier measurements and Haar wavelet sparsity, the local coherence can be controlled and bounded explicitly, so for matrices comprised of frequencies sampled from a suitable inverse square power-law density, we can prove the restricted isometry property with near-optimal embedding dimensions. Consequently, the variable-density sampling strategy we provide allows for image reconstructions that are stable to sparsity defects and robust to measurement noise. Our results cover both reconstruction by $\ell_1$-minimization and by total variation minimization. The local coherence framework developed in this paper should be of independent interest in sparse recovery problems more generally, as it implies that for optimal sparse recovery results, it suffices to have bounded \emph{average} coherence from sensing basis to sparsity basis -- as opposed to bounded maximal coherence -- as long as the sampling strategy is adapted accordingly.

1303.4207 2026-06-03 cs.LG cs.NA math.NA

Improving CUR Matrix Decomposition and the Nyström Approximation via Adaptive Sampling

通过自适应采样改进CUR矩阵分解和Nyström近似

Shusen Wang, Zhihua Zhang

AI总结 本文通过建立自适应列/行采样算法的更一般误差界,提出了具有预期相对误差界的更精确CUR和Nyström算法,并给出了标准Nyström方法和集成Nyström方法的低误差界理论分析。

详情
Journal ref
Journal of Machine Learning Research, 14: 2549-2589, 2013
AI中文摘要

CUR矩阵分解和Nyström近似是两种重要的低秩矩阵近似技术。Nyström方法通过少量列来近似对称半正定矩阵,而CUR通过少量列和行来近似任意数据矩阵。因此,CUR分解可以看作是Nyström近似的扩展。在本文中,我们为自适应列/行采样算法建立了更一般的误差界,基于此我们提出了具有预期相对误差界的更精确CUR和Nyström算法。所提出的CUR和Nyström算法也具有低时间复杂度,并且可以避免将整个数据矩阵保存在内存中。此外,我们对标准Nyström方法和集成Nyström方法的低误差界进行了理论分析。本文建立的主要理论结果是新颖的,并且我们的分析对数据矩阵没有特殊假设。

英文摘要

The CUR matrix decomposition and the Nyström approximation are two important low-rank matrix approximation techniques. The Nyström method approximates a symmetric positive semidefinite matrix in terms of a small number of its columns, while CUR approximates an arbitrary data matrix by a small number of its columns and rows. Thus, CUR decomposition can be regarded as an extension of the Nyström approximation. In this paper we establish a more general error bound for the adaptive column/row sampling algorithm, based on which we propose more accurate CUR and Nyström algorithms with expected relative-error bounds. The proposed CUR and Nyström algorithms also have low time complexity and can avoid maintaining the whole data matrix in RAM. In addition, we give theoretical analysis for the lower error bounds of the standard Nyström method and the ensemble Nyström method. The main theoretical results established in this paper are novel, and our analysis makes no special assumption on the data matrices.