arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 8085
2308.07867 2026-06-03 eess.SY cs.LG cs.SY

Learning Power Flow with Confidence: A Probabilistic Guarantee Framework for Voltage Risk

学习潮流与置信度:电压风险的概率保证框架

Parikshit Pareek, Sidhant Misra, Deepjyoti Deka

AI总结 针对机器学习在电力系统安全应用中缺乏形式化性能保证的问题,提出基于高斯过程回归的概率保证框架,通过顶点度核和网络扫描主动学习算法实现数据高效且可靠的电压风险评估。

Comments 10 pages

详情
AI中文摘要

机器学习缺乏形式化性能保证限制了其在安全关键的电力系统应用中的采用,在这些应用中,置信度和可解释性与准确性同样重要。在这项工作中,我们通过高斯过程回归框架,为潮流学习和电压风险估计提供了概率保证。具体来说,我们建立了期望估计误差的界限,将GP的预测方差与电压风险估计的置信度联系起来,确保与基于蒙特卡洛的ACPF风险量化在统计上等价。为了在低数据情况下增强模型的可学习性,我们首先设计了顶点度核,这是一种拓扑感知的加性核,将电压-负荷相互作用分解为局部邻域,以实现高效的大规模学习。在此基础上,我们引入了一种网络扫描主动学习算法,该算法自适应地采样信息丰富的运行点,并提供了原则性的停止准则,无需样本外验证。这些进展通过结合数据效率和分析保证,缓解了基于机器学习的潮流的主要瓶颈——缺乏可靠的保证。在IEEE 118、500和1354节点系统上的实证评估证实,所提出的VDK-GP实现了低于1E-03 p.u.的平均绝对电压误差,以15倍更少的ACPF计算复现了蒙特卡洛级别的电压风险估计,并在保守地约束违规概率的同时实现了超过120倍的评估时间减少。

英文摘要

The absence of formal performance guarantees in machine learning (ML) has limited its adoption for safety-critical power system applications, where confidence and interpretability are as vital as accuracy. In this work, we present a probabilistic guarantee for power flow learning and voltage risk estimation, derived through the framework of Gaussian Process (GP) regression. Specifically, we establish a bound on the expected estimation error that connects the GP's predictive variance to confidence in voltage risk estimates, ensuring statistical equivalence with Monte Carlo-based ACPF risk quantification. To enhance model learnability in the low-data regime, we first design the Vertex-Degree Kernel (VDK), a topology-aware additive kernel that decomposes voltage-load interactions into local neighborhoods for efficient large-scale learning. Building on this, we introduce a network-swipe active learning (AL) algorithm that adaptively samples informative operating points and provides a principled stopping criterion without requiring out-of-sample validation. Together, these developments mitigate the principal bottleneck of ML-based power flow, its lack of guaranteed reliability, by combining data efficiency with analytical assurance. Empirical evaluations across IEEE 118-, 500-, and 1354-bus systems confirm that the proposed VDK-GP achieves mean absolute voltage errors below 1E-03 p.u., reproduces Monte Carlo-level voltage risk estimates with 15x fewer ACPF computations, and achieves over 120x reduction in evaluation time while conservatively bounding violation probabilities.

2205.15412 2026-06-03 cs.DC cs.MA cs.RO

Asynchronous Deterministic Leader Election in Three-Dimensional Programmable Matter

三维可编程物质中的异步确定性领导者选举

Joseph L. Briones, Tishya Chhabra, Joshua J. Daymude, Andréa W. Richa

AI总结 针对三维可编程物质,提出基于面心立方晶格的分布式算法,在非公平顺序敌手下O(n)轮内确定性选举唯一领导者,并利用并发控制框架转化为非公平异步敌手下首个领导者选举算法。

Comments 18 pages, 4 figures, 2 tables. Accepted at ICDCN 2023

详情
Journal ref
Proceedings of the 24th International Conference on Distributed Computing and Networking (ICDCN 2023), pp. 38-47
AI中文摘要

经过三十多年的科学努力实现可编程物质(一种可以根据用户输入或对环境响应改变其物理特性的物质),在模块化机器人系统的工程和相应的集体行为算法理论方面都取得了许多进展。然而,虽然模块化机器人的设计通常处理真实三维(3D)空间的挑战,但算法理论仍然主要关注二维(2D)抽象,如平面和平面图。在这项工作中,我们使用面心立方(FCC)晶格来表示空间并定义局部空间方向,为可编程物质的规范阿米巴模型形式化了3D几何空间变体。然后,我们给出了一种用于连通、可收缩的2D或3D几何阿米巴系统中领导者选举的分布式算法,该算法在非公平顺序敌手下确定性选举出恰好一个领导者,时间复杂度为O(n)轮,其中n是系统中的阿米巴数量。接着,我们展示了如何使用阿米巴算法的并发控制框架(DISC 2021)转换该算法,以获得已知的第一个在非公平异步敌手下解决领导者选举的阿米巴算法,适用于2D和3D空间。

英文摘要

Over three decades of scientific endeavors to realize programmable matter, a substance that can change its physical properties based on user input or responses to its environment, there have been many advances in both the engineering of modular robotic systems and the corresponding algorithmic theory of collective behavior. However, while the design of modular robots routinely addresses the challenges of realistic three-dimensional (3D) space, algorithmic theory remains largely focused on 2D abstractions such as planes and planar graphs. In this work, we formalize the 3D geometric space variant for the canonical amoebot model of programmable matter, using the face-centered cubic (FCC) lattice to represent space and define local spatial orientations. We then give a distributed algorithm for leader election in connected, contractible 2D or 3D geometric amoebot systems that deterministically elects exactly one leader in $\mathcal{O}(n)$ rounds under an unfair sequential adversary, where $n$ is the number of amoebots in the system. We then demonstrate how this algorithm can be transformed using the concurrency control framework for amoebot algorithms (DISC 2021) to obtain the first known amoebot algorithm, both in 2D and 3D space, to solve leader election under an unfair asynchronous adversary.

2105.02420 2026-06-03 cs.DC cs.ET cs.RO

The Canonical Amoebot Model: Algorithms and Concurrency Control

规范变形虫模型:算法与并发控制

Joshua J. Daymude, Andréa W. Richa, Christian Scheideler

AI总结 提出规范变形虫模型,通过消息传递和对抗性激活模型形式化并发执行,并给出两种并发算法设计方法(直接嵌入并发控制和基于锁的转换框架),以六边形形成算法为例验证。

Comments 48 pages, 7 figures, 2 tables

详情
Journal ref
Distributed Computing (2023) 36, pp. 159-192
AI中文摘要

变形虫模型将主动可编程物质抽象为称为变形虫的简单计算元素的集合,这些元素局部交互以共同完成协调和移动任务。自2014年在SPAA上引入以来,越来越多的文献针对各种问题调整了其假设;然而,如果没有标准化的假设层次结构,很难在变形虫模型下对结果进行精确的系统比较。我们提出了规范变形虫模型,这是一种更新的形式化,区分了核心模型特征和假设变体族。规范变形虫模型解决的一个关键改进是并发性。现有文献大多隐含地假设变形虫的动作是隔离且可靠的,将分析简化为至多一个变形虫同时活动的顺序设置。然而,真实的可编程物质系统是并发的。规范变形虫模型将所有变形虫通信形式化为消息传递,利用并发执行的对抗性激活模型。在这种对时间的精细处理下,我们采用了两种互补的方法来设计并发算法。我们首先建立了一组在任何并发执行下算法正确性的充分条件,将并发控制直接嵌入算法设计。然后,我们提出了一个使用锁的并发控制框架,将在顺序设置中终止并满足特定约定的变形虫算法转换为在并发设置中表现出等效行为的算法。作为案例研究,我们使用一个简单的六边形形成算法演示了这两种方法。规范变形虫模型和这些互补的并发算法设计方法共同为可编程物质的分布式计算研究开辟了新的方向。

英文摘要

The amoebot model abstracts active programmable matter as a collection of simple computational elements called amoebots that interact locally to collectively achieve tasks of coordination and movement. Since its introduction at SPAA 2014, a growing body of literature has adapted its assumptions for a variety of problems; however, without a standardized hierarchy of assumptions, precise systematic comparison of results under the amoebot model is difficult. We propose the canonical amoebot model, an updated formalization that distinguishes between core model features and families of assumption variants. A key improvement addressed by the canonical amoebot model is concurrency. Much of the existing literature implicitly assumes amoebot actions are isolated and reliable, reducing analysis to the sequential setting where at most one amoebot is active at a time. However, real programmable matter systems are concurrent. The canonical amoebot model formalizes all amoebot communication as message passing, leveraging adversarial activation models of concurrent executions. Under this granular treatment of time, we take two complementary approaches to concurrent algorithm design. We first establish a set of sufficient conditions for algorithm correctness under any concurrent execution, embedding concurrency control directly in algorithm design. We then present a concurrency control framework that uses locks to convert amoebot algorithms that terminate in the sequential setting and satisfy certain conventions into algorithms that exhibit equivalent behavior in the concurrent setting. As a case study, we demonstrate both approaches using a simple algorithm for hexagon formation. Together, the canonical amoebot model and these complementary approaches to concurrent algorithm design open new directions for distributed computing research on programmable matter.

1301.3535 2026-06-03 eess.SY cs.AI cs.SY

Airport Gate Scheduling for Passengers, Aircraft, and Operation

面向乘客、飞机和运营的机场登机口调度

Sang Hyun Kim, Eric Feron, John-Paul Clarke, Aude Marzuoli, Daniel Delahaye

AI总结 本文研究机场登机口调度问题,提出兼顾乘客、飞机和运营三个目标的平衡目标函数,以提升乘客体验、交通流效率和运营鲁棒性。

Comments This paper is submitted to the tenth USA/Europe ATM 2013 seminar

详情
AI中文摘要

乘客体验正成为评估航空运输系统性能的关键指标。需要高效且稳健的工具来处理机场运营,同时更好地理解乘客的兴趣和关切。在各种机场运营中,本文研究机场登机口调度以改善乘客体验。提出了三个目标,分别考虑乘客、飞机和运营。分析了这些目标之间的权衡,并提出了一个平衡目标函数。结果表明,平衡目标可以提高客运航站楼和停机坪交通流的效率,以及登机口运营的鲁棒性。

英文摘要

Passengers' experience is becoming a key metric to evaluate the air transportation system's performance. Efficient and robust tools to handle airport operations are needed along with a better understanding of passengers' interests and concerns. Among various airport operations, this paper studies airport gate scheduling for improved passengers' experience. Three objectives accounting for passengers, aircraft, and operation are presented. Trade-offs between these objectives are analyzed, and a balancing objective function is proposed. The results show that the balanced objective can improve the efficiency of traffic flow in passenger terminals and on ramps, as well as the robustness of gate operations.

2004.07506 2026-06-03 cs.LO cs.AI math.LO

On Reductions of Hintikka Sets for Higher-Order Logic

关于高阶逻辑的Hintikka集归约

Alexander Steen, Christoph Benzmüller

AI总结 本文通过将Steen (2018)基于原始等式的Church类型论Hintikka集性质归约到Brown (2007)的Hintikka集性质,推导出Steen性质的一个模型存在定理。

Comments 10 pages; improved version

详情
Journal ref
Journal of Applied Logics IfCoLog Journal of Logics and their Applications, Vol. 12(7), 2025, pp. 1813-1834
AI中文摘要

Steen (2018) 基于原始等式的Church类型论的Hintikka集性质被归约到Brown (2007)的Hintikka集性质。利用这一归约,推导出Steen性质的一个模型存在定理。

英文摘要

Steen's (2018) Hintikka set properties for Church's type theory based on primitive equality are reduced to the Hintikka set properties of Brown (2007). Using this reduction, a model existence theorem for Steen's properties is derived.

2007.04377 2026-06-03 cs.DC cs.RO

Bio-Inspired Energy Distribution for Programmable Matter

可编程物质的仿生能量分布

Joshua J. Daymude, Andréa W. Richa, Jamison W. Weber

AI总结 受枯草芽孢杆菌生物膜生长行为启发,提出一种基于通信的算法,通过抑制饥饿模块的能量消耗,确保可编程物质系统中所有模块获得足够能量,并扩展了amoebot模型的生成树原语以支持崩溃故障自稳定。

详情
Journal ref
Proceedings of the 22nd International Conference on Distributed Computing and Networking (ICDCN 2021), pp. 86-95
AI中文摘要

在主动可编程物质系统中,单个模块需要持续的能量供应才能参与系统的集体行为。这些系统通常由至少一个模块可访问的外部能源供电,并依赖模块间的能量传输在整个系统中分配能量。尽管在解决可编程物质硬件中能量管理的挑战方面投入了大量精力,但可编程物质的算法理论在很大程度上忽略了能量使用和分布对算法可行性和效率的影响。在这项工作中,我们提出了一种受枯草芽孢杆菌生物膜生长行为启发的amoebot模型能量分布算法。这些细菌使用化学信号传递其代谢状态并调节整个生物膜中的营养消耗,确保所有细菌获得所需的营养。我们的算法类似地使用通信来在存在饥饿模块时抑制能量消耗,使所有模块能够获得足够的能量以满足其需求。作为一个支持性但独立的结果,我们扩展了amoebot模型成熟的生成树原语,使其在崩溃故障存在时能够自稳定。最后,我们展示了如何利用这一自稳定原语将我们的能量分布算法与现有的amoebot模型算法组合,从而有效地将先前的工作推广到也考虑能量约束。

英文摘要

In systems of active programmable matter, individual modules require a constant supply of energy to participate in the system's collective behavior. These systems are often powered by an external energy source accessible by at least one module and rely on module-to-module power transfer to distribute energy throughout the system. While much effort has gone into addressing challenging aspects of power management in programmable matter hardware, algorithmic theory for programmable matter has largely ignored the impact of energy usage and distribution on algorithm feasibility and efficiency. In this work, we present an algorithm for energy distribution in the amoebot model that is loosely inspired by the growth behavior of Bacillus subtilis bacterial biofilms. These bacteria use chemical signaling to communicate their metabolic states and regulate nutrient consumption throughout the biofilm, ensuring that all bacteria receive the nutrients they need. Our algorithm similarly uses communication to inhibit energy usage when there are starving modules, enabling all modules to receive sufficient energy to meet their demands. As a supporting but independent result, we extend the amoebot model's well-established spanning forest primitive so that it self-stabilizes in the presence of crash failures. We conclude by showing how this self-stabilizing primitive can be leveraged to compose our energy distribution algorithm with existing amoebot model algorithms, effectively generalizing previous work to also consider energy constraints.

1212.5524 2026-06-03 eess.SY cs.LG cs.SY

Reinforcement learning for port-Hamiltonian systems

面向端口-哈密顿系统的强化学习

Olivier Sprangers, Gabriel A. D. Lopes, Robert Babuska

AI总结 针对端口-哈密顿系统的无源控制中性能优化与PDE求解困难的问题,提出一种基于演员-评论家强化学习的参数化能量平衡无源控制方法,实现近最优控制策略学习并保持系统稳定性。

Comments submitted

详情
Journal ref
IEEE Transactions on Cybernetics, Volume: 45 , Issue: 5 , May 2015
AI中文摘要

端口-哈密顿系统的无源控制(PBC)通过使系统相对于期望的存储函数无源,提供了一种直观的稳定化方法。然而,在大多数情况下,控制律的获得没有考虑任何性能指标,并且必须通过求解复杂的偏微分方程(PDE)来计算。为了解决这些问题,我们将强化学习方法引入能量平衡无源控制(EB-PBC)方法中,这是一种PBC形式,其中闭环能量等于存储能量与供给能量之差。我们提出了一种参数化EB-PBC的技术,该技术保留了系统的PDE匹配条件,不需要指定全局期望哈密顿量,包含性能标准,并且对额外的非线性(如控制输入饱和)具有鲁棒性。控制律的参数通过演员-评论家强化学习找到,从而能够学习满足期望闭环能量景观的近最优控制策略。其优点是,可以使用标准的能量整形技术生成近最优控制器,并且学习到的解可以在能量整形和阻尼注入方面进行解释,从而使得利用无源性理论对稳定性进行数值评估成为可能。从强化学习的角度来看,我们的方法允许将端口-哈密顿系统类纳入演员-评论家框架,通过策略的参数化加速学习。该方法已成功应用于仿真和实际实验中的摆锤起摆问题。

英文摘要

Passivity-based control (PBC) for port-Hamiltonian systems provides an intuitive way of achieving stabilization by rendering a system passive with respect to a desired storage function. However, in most instances the control law is obtained without any performance considerations and it has to be calculated by solving a complex partial differential equation (PDE). In order to address these issues we introduce a reinforcement learning approach into the energy-balancing passivity-based control (EB-PBC) method, which is a form of PBC in which the closed-loop energy is equal to the difference between the stored and supplied energies. We propose a technique to parameterize EB-PBC that preserves the systems's PDE matching conditions, does not require the specification of a global desired Hamiltonian, includes performance criteria, and is robust to extra non-linearities such as control input saturation. The parameters of the control law are found using actor-critic reinforcement learning, enabling learning near-optimal control policies satisfying a desired closed-loop energy landscape. The advantages are that near-optimal controllers can be generated using standard energy shaping techniques and that the solutions learned can be interpreted in terms of energy shaping and damping injection, which makes it possible to numerically assess stability using passivity theory. From the reinforcement learning perspective, our proposal allows for the class of port-Hamiltonian systems to be incorporated in the actor-critic framework, speeding up the learning thanks to the resulting parameterization of the policy. The method has been successfully applied to the pendulum swing-up problem in simulations and real-life experiments.

1903.02091 2026-06-03 math.OC cs.RO cs.SY eess.SY

Geometric Adaptive Control with Neural Networks for a Quadrotor UAV in Wind fields

风场中四旋翼无人机的几何自适应神经网络控制

Mahdis Bisheban, Taeyoung Lee

AI总结 针对风场引起的非结构力和力矩扰动,提出一种基于多层神经网络在线调整权重的几何自适应控制器,实现位置和航向跟踪误差的一致最终有界。

详情
AI中文摘要

本文提出了一种带有人工神经网络的四旋翼无人机几何自适应控制器。假设四旋翼动力学受到风引起的任意非结构力和力矩的干扰。为了解决这个问题,所提出的控制系统增加了多层神经网络,并根据自适应律在线调整神经网络的权重。利用通用逼近定理,表明未知扰动的影响可以得到缓解。更具体地说,在所提出的控制系统下,位置和航向方向的跟踪误差是一致最终有界的,并且最终界可以任意减小。这些方法直接在特殊欧几里得群上开发,以避免局部参数化固有的复杂性或奇异性。首先通过数值例子说明了所提出控制系统的有效性。然后,通过几个室内飞行实验证明,即使对于激进的、敏捷的机动,所提出的控制器也能成功抑制风扰动的影响。

英文摘要

This paper proposes a geometric adaptive controller for a quadrotor unmanned aerial vehicle with artificial neural networks. It is assumed that the dynamics of a quadrotor is disturbed by arbitrary, unstructured forces and moments caused by wind. To address this, the proposed control system is augmented with multilayer neural networks, and the weights of neural networks are adjusted online according to an adaptive law. By utilizing the universal approximation theorem, it is shown that the effects of unknown disturbances can be mitigated. More specifically, under the proposed control system, the tracking errors in the position and the heading direction are uniformly ultimately bounded where the ultimate bound can be reduced arbitrarily. These are developed directly on the special Euclidean group to avoid complexities or singularities inherent to local parameterizations. The efficacy of the proposed control system is first illustrated by numerical examples. Then, several indoor flight experiments are presented to demonstrate that the proposed controller successfully rejects the effects of wind disturbances even for aggressive, agile maneuvers.

1803.06363 2026-06-03 math.OC cs.RO cs.SY eess.SY

Geometric Adaptive Control for a Quadrotor UAV with Wind Disturbance Rejection

四旋翼无人机抗风扰动的几何自适应控制

Mahdis Bisheban, Taeyoung Lee

AI总结 针对四旋翼无人机,提出一种基于在线调整多层神经网络的几何自适应控制方案,以抑制未知非结构扰动,并利用特殊欧几里得群上的李雅普诺夫稳定性理论证明跟踪误差一致最终有界,且可通过数值示例验证其抗风扰动能力。

详情
AI中文摘要

本文提出了一种四旋翼无人机的几何自适应控制方案,其中未知、非结构扰动的影响通过在线调整的多层神经网络来减轻。利用特殊欧几里得群上的李雅普诺夫稳定性理论分析了所提控制器的稳定性,并证明跟踪误差一致最终有界,且其最终界可任意缩小。给出了四旋翼动力学中风扰动的数学模型,并表明所提自适应控制器能够成功抑制风扰动的影响。这些通过数值示例进行了说明。

英文摘要

This paper presents a geometric adaptive control scheme for a quadrotor unmanned aerial vehicle, where the effects of unknown, unstructured disturbances are mitigated by a multilayer neural network that is adjusted online. The stability of the proposed controller is analyzed with Lyapunov stability theory on the special Euclidean group, and it is shown that the tracking errors are uniformly ultimately bounded with an ultimate bound that can be abridged arbitrarily. A mathematical model of wind disturbance on the quadrotor dynamics is presented, and it is shown that the proposed adaptive controller is capable of rejecting the effects of wind disturbances successfully. These are illustrated by numerical examples.

1105.1302 2026-06-03 q-bio.QM cs.CV cs.NA math.NA

A Modified Cross Correlation Algorithm for Reference-free Image Alignment of Non-Circular Projections in Single-Particle Electron Microscopy

一种改进的互相关算法用于单颗粒电子显微镜中非圆形投影的无参考图像对齐

Wooram Park, Gregory S. Chirikjian

AI总结 针对单颗粒电子显微镜中高度非球形结构的图像对齐问题,提出一种改进的互相关方法,通过粗对齐和基于统计噪声的搜索空间缩减,结合人工模糊图像和中间类平均分割,在低信噪比下优于经典互相关和最大似然方法。

Comments 29pages

详情
AI中文摘要

本文提出了一种改进的互相关方法,用于对齐单颗粒电子显微镜中高度非球形结构的同一类图像。在该新方法中,首先对投影图像进行粗对齐,然后使用互相关(CC)方法重新对齐所得图像。粗对齐通过匹配图像的质心和主轴实现。基于加性背景噪声的统计特性,可以量化粗对齐中的未对准分布。因此,互相关方法中重新对齐的搜索空间可以缩小以实现更好的对齐。为了克服互相关函数中虚假峰值相关的问题,我们在迭代互相关方法的早期阶段使用人工模糊图像,并从每次迭代步骤中分割中间类平均。这两种额外的操作与互相关方法中缩小的搜索空间相结合,对于低信噪比图像,比经典互相关和最大似然(ML)方法产生更好的对齐效果。

英文摘要

In this paper we propose a modified cross correlation method to align images from the same class in single-particle electron microscopy of highly non-spherical structures. In this new method, First we coarsely align projection images, and then re-align the resulting images using the cross correlation (CC) method. The coarse alignment is obtained by matching the centers of mass and the principal axes of the images. The distribution of misalignment in this coarse alignment can be quantified based on the statistical properties of the additive background noise. As a consequence, the search space for re-alignment in the cross correlation method can be reduced to achieve better alignment. In order to overcome problems associated with false peaks in the cross correlations function, we use artificially blurred images for the early stage of the iterative cross correlation method and segment the intermediate class average from every iteration step. These two additional manipulations combined with the reduced search space size in the cross correlation method yield better alignments for low signal-to-noise ratio images than both classical cross correlation and maximum likelihood(ML) methods.

1206.3582 2026-06-03 math.OC cs.LG cs.SY eess.SY

Decentralized Learning for Multi-player Multi-armed Bandits

多人多臂老虎机的分散式学习

Dileep Kalathil, Naumaan Nayyar, Rahul Jain

AI总结 针对多人多臂老虎机问题,提出了一种无需协调的分散式在线学习算法dUCB_4,实现了近O(log^2 T)的期望遗憾。

Comments 33 pages, 3 figures. Submitted to IEEE Transactions on Information Theory

详情
AI中文摘要

我们考虑多人多臂老虎机(MAB)模型中的分布式在线学习问题。每个玩家可以选择多个臂。当玩家选择一个臂时,它会获得奖励。我们考虑独立同分布奖励模型和马尔可夫奖励模型。在独立同分布模型中,每个臂被建模为具有未知均值的未知分布的独立同分布过程。在马尔可夫模型中,每个臂被建模为具有未知概率转移矩阵和平稳分布的有限、不可约、非周期且可逆的马尔可夫链。不同玩家从臂中获得不同奖励。如果两个玩家选择同一个臂,则发生“碰撞”,两者均无法获得任何奖励。玩家之间没有专用的控制信道用于协调或通信。用户之间的任何其他通信都是有代价的,并会增加遗憾。我们提出了一种基于索引的在线分布式学习策略,称为${ t dUCB_4}$算法,该算法以正确的方式权衡探索与利用,并实现期望遗憾增长不超过近$O(\log^2 T)$。该研究的动机来自认知无线电网络中多个次要用户的机会频谱接入,他们必须在不同用户看起来不同的各种无线信道中进行选择。据我们所知,这是首个针对多人MAB的分布式学习算法。

英文摘要

We consider the problem of distributed online learning with multiple players in multi-armed bandits (MAB) models. Each player can pick among multiple arms. When a player picks an arm, it gets a reward. We consider both i.i.d. reward model and Markovian reward model. In the i.i.d. model each arm is modelled as an i.i.d. process with an unknown distribution with an unknown mean. In the Markovian model, each arm is modelled as a finite, irreducible, aperiodic and reversible Markov chain with an unknown probability transition matrix and stationary distribution. The arms give different rewards to different players. If two players pick the same arm, there is a "collision", and neither of them get any reward. There is no dedicated control channel for coordination or communication among the players. Any other communication between the users is costly and will add to the regret. We propose an online index-based distributed learning policy called ${\tt dUCB_4}$ algorithm that trades off \textit{exploration v. exploitation} in the right way, and achieves expected regret that grows at most as near-$O(\log^2 T)$. The motivation comes from opportunistic spectrum access by multiple secondary users in cognitive radio networks wherein they must pick among various wireless channels that look different to different users. This is the first distributed learning algorithm for multi-player MABs to the best of our knowledge.

1203.2995 2026-06-03 eess.SY cs.CV cs.SY

Marginal multi-Bernoulli filters: RFS derivation of MHT, JIPDA and association-based MeMBer

边缘多伯努利滤波器:MHT、JIPDA和基于关联的MeMBer的RFS推导

Jason L. Williams

AI总结 本文通过随机有限集推导全贝叶斯RFS滤波器,揭示数据关联隐式存在,并通过近似关联分布得到与JIPDA和MeMBer相关的两种算法,在复杂环境下提升性能。

Comments Journal version at http://ieeexplore.ieee.org/document/7272821. Matlab code of simple implementation included with ancillary files

详情
Journal ref
IEEE Transactions on Aerospace and Electronic Systems, vol 51, no 3, pp 1664-1687, July 2015
AI中文摘要

随机有限集(RFS)的最新发展产生了多种避免数据关联的跟踪方法。本文推导了全贝叶斯RFS滤波器的一种形式,并观察到数据关联隐式存在于类似于MHT的数据结构中。随后,通过近似关联分布得到算法。得到两种算法:一种与JIPDA几乎相同,另一种与MeMBer滤波器相关。两者均在具有挑战性的环境中提升了性能。

英文摘要

Recent developments in random finite sets (RFSs) have yielded a variety of tracking methods that avoid data association. This paper derives a form of the full Bayes RFS filter and observes that data association is implicitly present, in a data structure similar to MHT. Subsequently, algorithms are obtained by approximating the distribution of associations. Two algorithms result: one nearly identical to JIPDA, and another related to the MeMBer filter. Both improve performance in challenging environments.

1207.2940 2026-06-03 stat.ML cs.LG cs.SY eess.SY

Expectation Propagation in Gaussian Process Dynamical Systems: Extended Version

高斯过程动态系统中的期望传播:扩展版

Marc Peter Deisenroth, Shakir Mohamed

AI总结 本文提出基于期望传播的消息传递算法用于高斯过程动态系统的近似推理,通过前向后向平滑迭代获得更精确的潜在结构后验分布,提升预测性能,并统一了现有GPDS平滑器。

详情
Journal ref
Advances in Neural Information Processing Systems 25 (NIPS), pp. 2609-2617, 2012
AI中文摘要

丰富且复杂的时间序列数据,例如来自工程系统、金融市场、视频或神经记录的生成数据,现在已成为现代数据分析的常见特征。解释这些多样化数据集背后的现象需要灵活且准确的模型。在本文中,我们推广高斯过程动态系统(GPDS)作为适合此类分析的丰富模型类。特别地,我们提出了一种基于期望传播的GPDS近似推理消息传递算法。通过将推理视为一般的消息传递问题,我们迭代前向后向平滑。因此,我们获得了更准确的潜在结构后验分布,与最先进的GPDS平滑器(这些平滑器是我们一般消息传递算法的特例)相比,预测性能得到改善。因此,我们提供了一种统一的方法,在其中将消息传递置于GPDS的上下文中。

英文摘要

Rich and complex time-series data, such as those generated from engineering systems, financial markets, videos or neural recordings, are now a common feature of modern data analysis. Explaining the phenomena underlying these diverse data sets requires flexible and accurate models. In this paper, we promote Gaussian process dynamical systems (GPDS) as a rich model class that is appropriate for such analysis. In particular, we present a message passing algorithm for approximate inference in GPDSs based on expectation propagation. By posing inference as a general message passing problem, we iterate forward-backward smoothing. Thus, we obtain more accurate posterior distributions over latent structures, resulting in improved predictive performance compared to state-of-the-art GPDS smoothers, which are special cases of our general message passing algorithm. Hence, we provide a unifying approach within which to contextualize message passing in GPDSs.

1102.5597 2026-06-03 math.NA cs.LG cs.NA

Fast and Faster: A Comparison of Two Streamed Matrix Decomposition Algorithms

快速与更快:两种流式矩阵分解算法的比较

Radim Řeh{ů}řek

AI总结 本文比较了单遍分布式算法和两遍流式随机算法在恒定内存下处理大规模矩阵分解的性能与精度,以英文维基百科为数据集进行潜在语义分析实验。

详情
Journal ref
NIPS Workshop on Low-Rank Methods for Large-Scale Machine Learning, 2010
AI中文摘要

随着数字数据集规模的爆炸式增长,分解算法的限制因素是对输入的 extit{遍历次数},因为输入通常存储在外存甚至异地。此外,我们只关注相对于输入大小在 extit{恒定内存}中运行的算法,以便处理任意大的输入。在本文中,我们提出了两种此类算法的实际比较:一种是对输入进行单遍操作的分布式方法,另一种是流式两遍随机算法。实验跟踪了分布式计算、过采样和内存权衡对两种算法精度和性能的影响。为了确保有意义的结果,我们选择真实数据集,即整个英文维基百科,作为潜在语义分析的应用场景。

英文摘要

With the explosion of the size of digital dataset, the limiting factor for decomposition algorithms is the \emph{number of passes} over the input, as the input is often stored out-of-core or even off-site. Moreover, we're only interested in algorithms that operate in \emph{constant memory} w.r.t. to the input size, so that arbitrarily large input can be processed. In this paper, we present a practical comparison of two such algorithms: a distributed method that operates in a single pass over the input vs. a streamed two-pass stochastic algorithm. The experiments track the effect of distributed computing, oversampling and memory trade-offs on the accuracy and performance of the two algorithms. To ensure meaningful results, we choose the input to be a real dataset, namely the whole of the English Wikipedia, in the application settings of Latent Semantic Analysis.

1302.6105 2026-06-03 math.OC cs.CV cs.NA math.NA

Image restoration using sparse approximations of spatially varying blur operators in the wavelet domain

利用小波域中空间变化模糊算子的稀疏近似进行图像恢复

Paul Escande, Pierre Weiss, Francois Malgouyres

AI总结 针对空间变化模糊图像恢复问题,提出在小波域中用稀疏矩阵近似模糊算子,并从数学上证明其合理性,数值验证近似质量,且稀疏模式可预定义,适用于盲反卷积等任务。

Comments 6 pages

详情
AI中文摘要

在摄影、卫星或显微成像中,恢复由空间变化模糊降质的图像是一个日益重要的问题。解决这一问题的主要困难之一在于模糊矩阵的巨大维度,这阻碍了使用朴素方法进行矩阵-向量乘法。在本文中,我们提出在小波域中用稀疏矩阵近似模糊算子。我们从数学角度证明了该方法的合理性,并数值研究了近似质量。最后,我们表明矩阵的稀疏模式可以预定义,这在盲反卷积等任务中至关重要。

英文摘要

Restoration of images degraded by spatially varying blurs is an issue of increasing importance in the context of photography, satellite or microscopy imaging. One of the main difficulty to solve this problem comes from the huge dimensions of the blur matrix. It prevents the use of naive approaches for performing matrix-vector multiplications. In this paper, we propose to approximate the blur operator by a matrix sparse in the wavelet domain. We justify this approach from a mathematical point of view and investigate the approximation quality numerically. We finish by showing that the sparsity pattern of the matrix can be pre-defined, which is central in tasks such as blind deconvolution.

1210.6649 2026-06-03 astro-ph.IM cs.CV cs.NA math.NA

Extended object reconstruction in adaptive-optics imaging: the multiresolution approach

自适应光学成像中的扩展目标重建:多分辨率方法

Roberto Baena Gallé, Jorge Núñez, Szymon Gladysz

AI总结 提出使用小波和曲波等多分辨率变换重建自适应光学系统获取的扩展目标图像,通过静态PSF的多通道反卷积方法优于传统的盲/近视反卷积方法。

Comments In revision in Astronomy & Astrophysics. 19 pages, 13 figures

详情
AI中文摘要

我们提出将多分辨率变换(如小波变换(WT)和曲波变换(CT))应用于自适应光学(AO)系统获取的扩展目标图像重建。这种多通道方法通常利用概率工具来区分显著结构与噪声和重建残差。此外,我们旨在检验历史假设:使用静态PSF的图像重建算法不适用于AO成像。我们将哈勃太空望远镜(HST)拍摄的土星图像与帕洛马天文台5米海尔望远镜的AO PSF进行卷积,并添加散粒噪声和读出噪声。随后,我们对模糊和噪声数据应用不同方法以恢复原始目标。这些方法包括多帧盲反卷积(使用IDAC算法)、带正则化的近视反卷积(使用MISTRAL)以及基于小波或曲波的静态PSF反卷积(AWMLE和ACMLE算法)。我们使用均方误差(MSE)和结构相似性指数(SSIM)来比较结果。我们讨论了这两种指标的优缺点。我们发现,根据MSE和SSIM的测量,CT比WT产生更好的结果。使用静态PSF的多通道反卷积产生的结果通常优于近视/盲方法(对于我们测试的图像),这表明方法抑制噪声和跟踪底层迭代过程的能力与近视/盲方法更新PSF的能力同样关键。

英文摘要

We propose the application of multiresolution transforms, such as wavelets (WT) and curvelets (CT), to the reconstruction of images of extended objects that have been acquired with adaptive optics (AO) systems. Such multichannel approaches normally make use of probabilistic tools in order to distinguish significant structures from noise and reconstruction residuals. Furthermore, we aim to check the historical assumption that image-reconstruction algorithms using static PSFs are not suitable for AO imaging. We convolve an image of Saturn taken with the Hubble Space Telescope (HST) with AO PSFs from the 5-m Hale telescope at the Palomar Observatory and add both shot and readout noise. Subsequently, we apply different approaches to the blurred and noisy data in order to recover the original object. The approaches include multi-frame blind deconvolution (with the algorithm IDAC), myopic deconvolution with regularization (with MISTRAL) and wavelets- or curvelets-based static PSF deconvolution (AWMLE and ACMLE algorithms). We used the mean squared error (MSE) and the structural similarity index (SSIM) to compare the results. We discuss the strengths and weaknesses of the two metrics. We found that CT produces better results than WT, as measured in terms of MSE and SSIM. Multichannel deconvolution with a static PSF produces results which are generally better than the results obtained with the myopic/blind approaches (for the images we tested) thus showing that the ability of a method to suppress the noise and to track the underlying iterative process is just as critical as the capability of the myopic/blind approaches to update the PSF.

1210.3098 2026-06-03 math.NA cs.CV cs.IT cs.NA math.IT

Near-optimal compressed sensing guarantees for total variation minimization

全变差最小化的近最优压缩感知保证

Deanna Needell, Rachel Ward

AI总结 针对多维信号压缩感知重建问题,本文证明通过全变差最小化,从 O(sd*log(N^d)) 个线性测量中可重建信号,误差与梯度最佳 s 项近似成比例,并证明该保证在空间维度 d 上多项式因子内最优。

详情
AI中文摘要

考虑压缩感知设置中从欠定测量集重建多维信号的问题。没有任何额外假设,该问题是不适定的。然而,对于自然图像或电影等信号,与测量一致的最小全变差估计通常能产生对潜在信号的良好近似,即使测量数量远小于环境维度。本文将二维图像的最新重建保证推广到任意维度 d>1 的信号和各向同性全变差问题。具体来说,我们证明多维信号 x 可以从 O(sd*log(N^d)) 个线性测量中通过全变差最小化重建,重建误差在其梯度最佳 s 项近似的因子内。我们提供的重建保证在空间维度 d 的多项式因子内必然是最优的。

英文摘要

Consider the problem of reconstructing a multidimensional signal from an underdetermined set of measurements, as in the setting of compressed sensing. Without any additional assumptions, this problem is ill-posed. However, for signals such as natural images or movies, the minimal total variation estimate consistent with the measurements often produces a good approximation to the underlying signal, even if the number of measurements is far smaller than the ambient dimensionality. This paper extends recent reconstruction guarantees for two-dimensional images to signals of arbitrary dimension d>1 and to isotropic total variation problems. To be precise, we show that a multidimensional signal x can be reconstructed from O(sd*log(N^d)) linear measurements using total variation minimization to within a factor of the best s-term approximation of its gradient. The reconstruction guarantees we provide are necessarily optimal up to polynomial factors in the spatial dimension d.

1205.4133 2026-06-03 math.NA cs.LG cs.NA

Constrained Overcomplete Analysis Operator Learning for Cosparse Signal Modelling

约束过完全分析算子学习用于共稀疏信号建模

Mehrdad Yaghoobi, Sangnam Nam, Remi Gribonval, Mike E. Davies

AI总结 提出一种基于L1优化的约束学习框架,通过投影次梯度和Douglas-Rachford分裂技术学习过完全分析算子,实现共稀疏信号建模,并验证了其在干净和噪声训练集上的鲁棒恢复能力。

Comments 29 pages, 13 figures, accepted to be published in TSP

详情
AI中文摘要

我们考虑从训练样本集合中学习低维信号模型的问题。主流方法是学习一个过完全字典,利用稀疏合成系数对训练样本提供良好近似。这个著名的稀疏模型有一个不太为人知的对应物,即分析形式的共稀疏分析模型。在这个新模型中,信号的特征在于它们在使用过完全(线性)分析算子的变换域中的简约性。我们提出基于L1优化的约束优化框架,从训练语料库中学习分析算子。在优化框架中引入约束的原因是为了排除平凡解。尽管目前还没有最终答案确定哪个约束最相关,但我们研究了模型自适应领域的一些常规约束,并为此使用了均匀归一化紧框架(UNTF)。然后,我们推导了一个实用的学习算法,基于投影次梯度和Douglas-Rachford分裂技术,并展示了当提供足够大小的干净训练集时,该算法能够稳健地恢复真实分析算子。我们还使用一些含噪的共稀疏信号找到了图像的分析算子,这确实是一个更现实的实验。由于推导出的优化问题不是凸规划,我们通常使用此类变分方法找到局部最小值。针对两种不同设置推导了局部最优性条件,为学习问题在适当条件下的适定性提供了初步的理论支持。

英文摘要

We consider the problem of learning a low-dimensional signal model from a collection of training samples. The mainstream approach would be to learn an overcomplete dictionary to provide good approximations of the training samples using sparse synthesis coefficients. This famous sparse model has a less well known counterpart, in analysis form, called the cosparse analysis model. In this new model, signals are characterised by their parsimony in a transformed domain using an overcomplete (linear) analysis operator. We propose to learn an analysis operator from a training corpus using a constrained optimisation framework based on L1 optimisation. The reason for introducing a constraint in the optimisation framework is to exclude trivial solutions. Although there is no final answer here for which constraint is the most relevant constraint, we investigate some conventional constraints in the model adaptation field and use the uniformly normalised tight frame (UNTF) for this purpose. We then derive a practical learning algorithm, based on projected subgradients and Douglas-Rachford splitting technique, and demonstrate its ability to robustly recover a ground truth analysis operator, when provided with a clean training set, of sufficient size. We also find an analysis operator for images, using some noisy cosparse signals, which is indeed a more realistic experiment. As the derived optimisation problem is not a convex program, we often find a local minimum using such variational methods. Some local optimality conditions are derived for two different settings, providing preliminary theoretical support for the well-posedness of the learning problem under appropriate conditions.

1110.3649 2026-06-03 math.NA cs.CV cs.GR cs.NA

Algorithms to automatically quantify the geometric similarity of anatomical surfaces

自动量化解剖表面几何相似性的算法

D. Boyer, Y. Lipman, E. St. Clair, J. Puente, T. Funkhouser, B. Patel, J. Jernvall, I. Daubechies

AI总结 提出利用局部结构和全局几何关系自动计算二维表面间距离与对应关系的多项式算法,无需人工标记,实现大规模数字化表面的高效比较。

Comments Changes with respect to v1, v2: an Erratum was added, correcting the references for one of the three datasets. Note that the datasets and code for this paper can be obtained from the Data Conservancy (see Download column on v1, v2)

详情
Journal ref
PNAS 2011 108 (45) 18221-18226
AI中文摘要

我们描述了用于计算(嵌入三维空间的)二维表面对之间距离的新方法,这些方法利用局部结构以及结构间几何关系所包含的全局信息。我们提出了自动确定这些距离以及几何对应关系的算法。这一研究源于自然科学学生对理解统一生命多样性的形态连续性的追求。目前,科学家利用物理特征研究现存和灭绝动物之间的进化关系时,分析的是从精心定义的解剖对应点(地标)中提取的数据。识别和记录这些地标耗时且只能由训练有素的形态学家准确完成。这使得非形态学家无法进行这些研究,并导致表型组学在阐明进化模式方面落后于基因组学。与已提出的其他形态对应算法不同,我们的方法不需要用户预先标记任何特殊特征或地标。它也与计算几何中的其他开创性工作不同,因为我们的算法本质上是多项式的,因此更快,使得对大量数字化表面进行成对比较成为可能。我们使用代表灵长类和人类牙齿及不同骨骼的三个数据集展示了我们的方法,并表明它能产生高度准确的结果。

英文摘要

We describe new approaches for distances between pairs of 2-dimensional surfaces (embedded in 3-dimensional space) that use local structures and global information contained in inter-structure geometric relationships. We present algorithms to automatically determine these distances as well as geometric correspondences. This is motivated by the aspiration of students of natural science to understand the continuity of form that unites the diversity of life. At present, scientists using physical traits to study evolutionary relationships among living and extinct animals analyze data extracted from carefully defined anatomical correspondence points (landmarks). Identifying and recording these landmarks is time consuming and can be done accurately only by trained morphologists. This renders these studies inaccessible to non-morphologists, and causes phenomics to lag behind genomics in elucidating evolutionary patterns. Unlike other algorithms presented for morphological correspondences our approach does not require any preliminary marking of special features or landmarks by the user. It also differs from other seminal work in computational geometry in that our algorithms are polynomial in nature and thus faster, making pairwise comparisons feasible for significantly larger numbers of digitized surfaces. We illustrate our approach using three datasets representing teeth and different bones of primates and humans, and show that it leads to highly accurate results.

0911.1419 2026-06-03 cs.DS cond-mat.stat-mech cs.DM cs.LG cs.NA math.NA math.OC

Belief Propagation and Loop Calculus for the Permanent of a Non-Negative Matrix

非负矩阵积和式的信念传播与环路微积分

Yusuke Watanabe, Michael Chertkov

AI总结 针对非负矩阵积和式的计算问题,利用信念传播固定点导出了精确的积和式表达式,并提供了基于Bethe自由能和Ihara图zeta函数的两种推导。

Comments 11 pages; submitted to Journal of Physics A: Mathematical Theoretical

详情
AI中文摘要

我们考虑计算一个正$(N\times N)$非负矩阵$P=(P_i^j|i,j=1,\cdots,N)$的积和式,或等价地,完全二分图$K_{N,N}$上完美匹配的加权计数问题。该问题已知具有指数复杂度。作为图模型的配分函数$Z$,该问题允许精确的环路微积分表示[Chertkov, Chernyak '06],该表示基于Bethe自由能泛函在非整数双随机边际信念矩阵$\beta=(\beta_i^j|i,j=1,\cdots,N)$上的内部最小值,该矩阵也对应于信念传播(BP)型迭代消息传递算法的固定点。我们的主要结果是给出精确配分函数(积和式)用BP边际矩阵$\beta$表示的显式表达式:$Z=\mbox{Perm}(P)=Z_{BP} \mbox{Perm}(\beta_i^j(1-\beta_i^j))/\prod_{i,j}(1-\beta_i^j)$,其中$Z_{BP}$是用$\beta$显式表示的BP积和式表达式。我们给出了该公式的两种推导:一种直接基于Bethe自由能,另一种结合了Ihara图$\zeta$函数和环路微积分方法。假设已计算出信念传播边际矩阵$\beta$,我们提供了两个下界和一个上界来估计乘积项。两个互补的下界分别基于Gurvits-van der Waerden定理以及修正积和式与行列式之间的关系。

英文摘要

We consider computation of permanent of a positive $(N\times N)$ non-negative matrix, $P=(P_i^j|i,j=1,\cdots,N)$, or equivalently the problem of weighted counting of the perfect matchings over the complete bipartite graph $K_{N,N}$. The problem is known to be of likely exponential complexity. Stated as the partition function $Z$ of a graphical model, the problem allows exact Loop Calculus representation [Chertkov, Chernyak '06] in terms of an interior minimum of the Bethe Free Energy functional over non-integer doubly stochastic matrix of marginal beliefs, $β=(β_i^j|i,j=1,\cdots,N)$, also correspondent to a fixed point of the iterative message-passing algorithm of the Belief Propagation (BP) type. Our main result is an explicit expression of the exact partition function (permanent) in terms of the matrix of BP marginals, $β$, as $Z=\mbox{Perm}(P)=Z_{BP} \mbox{Perm}(β_i^j(1-β_i^j))/\prod_{i,j}(1-β_i^j)$, where $Z_{BP}$ is the BP expression for the permanent stated explicitly in terms if $β$. We give two derivations of the formula, a direct one based on the Bethe Free Energy and an alternative one combining the Ihara graph-$ζ$ function and the Loop Calculus approaches. Assuming that the matrix $β$ of the Belief Propagation marginals is calculated, we provide two lower bounds and one upper-bound to estimate the multiplicative term. Two complementary lower bounds are based on the Gurvits-van der Waerden theorem and on a relation between the modified permanent and determinant respectively.

1208.4773 2026-06-03 eess.SY cs.AI cs.LG cs.SY

Optimized Look-Ahead Tree Policies: A Bridge Between Look-Ahead Tree Policies and Direct Policy Search

优化前瞻树策略:连接前瞻树策略与直接策略搜索的桥梁

Tobias Jung, Louis Wehenkel, Damien Ernst, Francis Maes

AI总结 提出一种混合策略学习方案,通过直接策略搜索学习节点评分函数来指导小型前瞻树的构建,从而结合直接策略搜索和前瞻树策略的优势。

Comments In Submission

详情
AI中文摘要

直接策略搜索(DPS)和前瞻树(LT)策略是两类广泛使用的技术,用于为序列决策问题产生高性能策略。要使DPS方法有效工作,一个关键问题是针对目标问题选择合适的参数化策略空间。LT方法的一个基本问题是,为了做出好的决策,这类策略必须开发非常大的前瞻树,这可能需要过多的在线计算资源。在本文中,我们提出了一种新的混合策略学习方案,它位于DPS和LT的交集,其中策略是一种算法,以有向方式开发一个小型前瞻树,由通过DPS学习的节点评分函数引导。基于LT的表示被证明是在DPS方案中表示策略的一种通用方式,同时,DPS能够显著减少做出高质量决策所需的前瞻树的大小。我们通过实验将我们的方法与两种其他最先进的DPS技术和四种常见的LT策略在四个基准领域进行比较,并表明它结合了其起源的两种技术的优势。特别是,我们表明我们的方法:(1)总体上比纯DPS和纯LT策略产生更好的性能策略,(2)需要的策略评估次数远少于其他DPS技术,(3)易于调整,(4)产生的策略对初始条件的扰动具有相当的鲁棒性。

英文摘要

Direct policy search (DPS) and look-ahead tree (LT) policies are two widely used classes of techniques to produce high performance policies for sequential decision-making problems. To make DPS approaches work well, one crucial issue is to select an appropriate space of parameterized policies with respect to the targeted problem. A fundamental issue in LT approaches is that, to take good decisions, such policies must develop very large look-ahead trees which may require excessive online computational resources. In this paper, we propose a new hybrid policy learning scheme that lies at the intersection of DPS and LT, in which the policy is an algorithm that develops a small look-ahead tree in a directed way, guided by a node scoring function that is learned through DPS. The LT-based representation is shown to be a versatile way of representing policies in a DPS scheme, while at the same time, DPS enables to significantly reduce the size of the look-ahead trees that are required to take high-quality decisions. We experimentally compare our method with two other state-of-the-art DPS techniques and four common LT policies on four benchmark domains and show that it combines the advantages of the two techniques from which it originates. In particular, we show that our method: (1) produces overall better performing policies than both pure DPS and pure LT policies, (2) requires a substantially smaller number of policy evaluations than other DPS techniques, (3) is easy to tune and (4) results in policies that are quite robust with respect to perturbations of the initial conditions.

1205.2584 2026-06-03 math.NA cs.LG cs.NA math.OC

Low Complexity Damped Gauss-Newton Algorithms for CANDECOMP/PARAFAC

低复杂度阻尼高斯-牛顿算法用于CANDECOMP/PARAFAC分解

Anh Huy Phan, Petr Tichavský, Andrzej Cichocki

AI总结 针对CP分解中阻尼高斯-牛顿算法计算复杂度过高的问题,提出基于分块逆近似Hessian的快速实现,显著降低计算和内存需求。

详情
AI中文摘要

用于CANDECOMP/PARAFAC (CP) 分解的阻尼高斯-牛顿 (dGN) 算法可以处理因子共线性和不同因子量级的挑战;然而,对于大小为 $I_1\times I_N$、秩为 $R$ 的 $N$ 维张量分解,该算法由于需要构建大小为 $(RT \times RT)$ 的大型近似Hessian矩阵并求逆(其中 $T = \sum_n I_n$),计算量巨大。本文提出了一种dGN算法的快速实现,基于分块形式的逆近似Hessian的新表达式。新实现具有较低的计算复杂度,除了梯度计算(这部分两种方法相同)外,只需要对一个大小为 $NR^2\times NR^2$ 的矩阵求逆,如果 $T \gg NR$,这远小于整个近似Hessian矩阵。此外,该实现具有更低的内存需求,因为Hessian矩阵及其逆矩阵都不需要完整存储。还提出了处理复数数据的算法变体。在困难基准张量示例上,将所提算法的复杂度和性能与dGN和带线搜索的ALS进行了比较。

英文摘要

The damped Gauss-Newton (dGN) algorithm for CANDECOMP/PARAFAC (CP) decomposition can handle the challenges of collinearity of factors and different magnitudes of factors; nevertheless, for factorization of an $N$-D tensor of size $I_1\times I_N$ with rank $R$, the algorithm is computationally demanding due to construction of large approximate Hessian of size $(RT \times RT)$ and its inversion where $T = \sum_n I_n$. In this paper, we propose a fast implementation of the dGN algorithm which is based on novel expressions of the inverse approximate Hessian in block form. The new implementation has lower computational complexity, besides computation of the gradient (this part is common to both methods), requiring the inversion of a matrix of size $NR^2\times NR^2$, which is much smaller than the whole approximate Hessian, if $T \gg NR$. In addition, the implementation has lower memory requirements, because neither the Hessian nor its inverse never need to be stored in their entirety. A variant of the algorithm working with complex valued data is proposed as well. Complexity and performance of the proposed algorithm is compared with those of dGN and ALS with line search on examples of difficult benchmark tensors.

1204.5717 2026-06-03 cs.DS cs.RO cs.SY eess.SY

Multi-agent Path Planning and Network Flow

多智能体路径规划与网络流

Jingjin Yu, Steven M. LaValle

AI总结 将图上的多智能体路径规划问题归约到网络流问题,利用组合网络流算法和线性规划技术求解,并证明当目标置换不变时存在最长完成时间不超过 n+V-1 步的可行解路径集,给出 O(nVE) 时间算法,进一步研究时间和距离最优性及其帕累托最优结构。

Comments Corrected an inaccuracy on time optimal solution for average arrival time

详情
AI中文摘要

本文将图(路标图)上的多智能体路径规划与网络流问题联系起来,表明前者可以归约到后者,从而使得组合网络流算法以及一般的线性规划技术能够应用于图上的多智能体路径规划问题。利用这一联系,我们证明当目标是置换不变时,该问题总是存在一个可行解路径集,其最长完成时间不超过 $n + V - 1$ 步,其中 $n$ 是智能体数量,$V$ 是底层图的顶点数。然后,我们给出一个完整算法,在 $O(nVE)$ 时间内找到这样的解,其中 $E$ 是图的边数。进一步,我们研究可行解的时间和距离最优性,表明它们具有成对的帕累托最优结构,并再次为优化这两个实际目标提供了高效算法。

英文摘要

This paper connects multi-agent path planning on graphs (roadmaps) to network flow problems, showing that the former can be reduced to the latter, therefore enabling the application of combinatorial network flow algorithms, as well as general linear program techniques, to multi-agent path planning problems on graphs. Exploiting this connection, we show that when the goals are permutation invariant, the problem always has a feasible solution path set with a longest finish time of no more than $n + V - 1$ steps, in which $n$ is the number of agents and $V$ is the number of vertices of the underlying graph. We then give a complete algorithm that finds such a solution in $O(nVE)$ time, with $E$ being the number of edges of the graph. Taking a further step, we study time and distance optimality of the feasible solutions, show that they have a pairwise Pareto optimal structure, and again provide efficient algorithms for optimizing two of these practical objectives.

1204.3820 2026-06-03 eess.SY cs.AI cs.RO cs.SY

Distance Optimal Formation Control on Graphs with a Tight Convergence Time Guarantee

图上具有紧收敛时间保证的距离最优编队控制

Jingjin Yu, Steven M. LaValle

AI总结 针对连通图上单位边距下无碰撞移动多个不可区分智能体到任意目标顶点集的任务,提出一种快速距离最优控制算法,并给出紧收敛时间保证。

Comments Brought to be in-sync with final version submitted to CDC 2012 with only minor updates

详情
AI中文摘要

对于在单位边距的连通图上将一组不可区分智能体无碰撞地移动到任意目标顶点集的任务,我们提出了一种快速距离最优控制算法,引导智能体进入期望编队。此外,我们证明了该算法还提供了紧收敛时间保证(时间最优性和距离最优性无法同时满足)。我们的通用图表述允许该算法应用于诸如具有孔洞(模拟障碍物)的任意维度网格等场景。在线可用的仿真验证了我们的理论发展。

英文摘要

For the task of moving a set of indistinguishable agents on a connected graph with unit edge distance to an arbitrary set of goal vertices, free of collisions, we propose a fast distance optimal control algorithm that guides the agents into the desired formation. Moreover, we show that the algorithm also provides a tight convergence time guarantee (time optimality and distance optimality cannot be simultaneously satisfied). Our generic graph formulation allows the algorithm to be applied to scenarios such as grids with holes (modeling obstacles) in arbitrary dimensions. Simulations, available online, confirm our theoretical developments.

1203.2992 2026-06-03 eess.SY cs.CV cs.SY

Hybrid Poisson and multi-Bernoulli filters

混合泊松和多伯努利滤波器

Jason L. Williams

AI总结 提出一种结合概率假设密度和多目标多伯努利滤波器的混合方法,通过维持未检测目标的泊松分量和回收低存在概率的伯努利分量,实现快速航迹起始并减少伯努利分量数量。

Comments Submitted to 15th International Conference on Information Fusion (2012)

详情
AI中文摘要

概率假设密度(PHD)和多目标多伯努利(MeMBer)滤波器是基于随机有限集(RFS)的两种主要算法。本文研究了一种结合这两种方法的方法。我们的工作受到一篇姊妹论文的启发,该论文证明了全贝叶斯RFS滤波器自然包含一个代表从未被检测到的目标的泊松分量,以及一个代表跟踪中目标的多伯努利分量的线性组合。这里我们展示了维持未检测目标的泊松分量所带来的好处(在航迹起始速度方面)。随后,我们提出了一种回收方法,将存在概率较低的伯努利分量投影到泊松分量上(而不是删除它们)。我们表明,这使我们能够使用更少的伯努利分量(即航迹)实现相似的跟踪性能。

英文摘要

The probability hypothesis density (PHD) and multi-target multi-Bernoulli (MeMBer) filters are two leading algorithms that have emerged from random finite sets (RFS). In this paper we study a method which combines these two approaches. Our work is motivated by a sister paper, which proves that the full Bayes RFS filter naturally incorporates a Poisson component representing targets that have never been detected, and a linear combination of multi-Bernoulli components representing targets under track. Here we demonstrate the benefit (in speed of track initiation) that maintenance of a Poisson component of undetected targets provides. Subsequently, we propose a method of recycling, which projects Bernoulli components with a low probability of existence onto the Poisson component (as opposed to deleting them). We show that this allows us to achieve similar tracking performance using a fraction of the number of Bernoulli components (i.e., tracks).

1109.0217 2026-06-03 math.NA cs.CV cs.NA

Vessel Segmentation in Medical Imaging Using a Tight-Frame Based Algorithm

基于紧框架算法的医学图像血管分割

Xiaohao Cai, Raymond Chan, Serena Morigi, Fiorella Sgallari

AI总结 提出一种基于紧框架的迭代算法,用于磁共振血管造影图像中管状结构(如血管)的自动分割,通过去噪、平滑和锐化边界区域,在少量迭代内收敛,并优于现有PDE和变分方法。

详情
AI中文摘要

紧框架作为正交小波的推广,已成功应用于图像处理中的多种问题,包括修复、脉冲噪声去除、超分辨率图像恢复等。分割是识别图像中物体轮廓的过程。目前存在多种基于变分方法和偏微分方程(PDE)建模的高效分割算法。本文提出应用紧框架方法自动识别磁共振血管造影(MRA)图像中的管状结构(如血管)。我们的方法迭代地细化一个包围血管可能边界或表面的区域。在每次迭代中,我们应用紧框架算法对可能边界进行去噪和平滑,并锐化该区域。我们证明了算法的收敛性。在真实2D/3D MRA图像上的数值实验表明,我们的方法非常高效,通常在几次迭代内收敛,并且由于能够提取图像中更多的管状目标和精细细节,优于现有的PDE和变分方法。

英文摘要

Tight-frame, a generalization of orthogonal wavelets, has been used successfully in various problems in image processing, including inpainting, impulse noise removal, super-resolution image restoration, etc. Segmentation is the process of identifying object outlines within images. There are quite a few efficient algorithms for segmentation that depend on the variational approach and the partial differential equation (PDE) modeling. In this paper, we propose to apply the tight-frame approach to automatically identify tube-like structures such as blood vessels in Magnetic Resonance Angiography (MRA) images. Our method iteratively refines a region that encloses the possible boundary or surface of the vessels. In each iteration, we apply the tight-frame algorithm to denoise and smooth the possible boundary and sharpen the region. We prove the convergence of our algorithm. Numerical experiments on real 2D/3D MRA images demonstrate that our method is very efficient with convergence usually within a few iterations, and it outperforms existing PDE and variational methods as it can extract more tubular objects and fine details in the images.

1108.3405 2026-06-03 eess.SY cs.MA cs.RO cs.SY math.OC

Hybrid 3-D Formation Control for Unmanned Helicopters

无人直升机的混合三维编队控制

A. Karimoddini, H. Lin, B. M. Chen, T. H. Lee

AI总结 针对无人直升机编队控制,提出一种混合监督控制框架,通过状态空间球形抽象和双相似性实现离散逻辑与连续动态的交互,并嵌入防碰撞机制。

Comments Submitted for publication

详情
AI中文摘要

无人机团队构成典型的网络化信息物理系统,涉及离散逻辑与连续动态的交互。本文提出一种用于无人直升机三维领航-跟随编队控制的混合监督控制框架。所提出的混合控制框架捕捉了决策单元与路径规划器连续动态之间的内部交互,从而提高了系统的整体可靠性。为设计此类混合控制器,提出了一种状态空间的球形抽象作为新的抽象方法。利用划分空间上的多仿射函数性质,得到有限状态离散事件系统模型,该模型被证明与原始连续变量动态系统是双相似的。然后,在离散域中,为抽象模型模块化设计了一个逻辑监督器。由于抽象DES模型与原始无人机动态之间的双相似性,设计的逻辑监督器可通过接口层实现为混合控制器。该监督器驱动无人机动态以满足设计要求。换言之,混合控制器能够从控制视界内的任意初始状态将无人机引导至期望编队,并维持编队。此外,在设计的监督器中嵌入了防碰撞机制。最后,通过为无人直升机开发的硬件在环仿真平台验证了该算法。结果表明了算法的有效性。

英文摘要

Teams of Unmanned Aerial Vehicles (UAVs) form typical networked cyber-physical systems that involve the interaction of discrete logic and continuous dynamics. This paper presents a hybrid supervisory control framework for the three-dimensional leader follower formation control of unmanned helicopters. The proposed hybrid control framework captures internal interactions between the decision making unit and the path planner continuous dynamics of the system, and hence improves the system's overall reliability. To design such a hybrid controller, a spherical abstraction of the state space is proposed as a new method of abstraction. Utilizing the properties of multi-affine functions over the partitioned space leads to a finite state Discrete Event System (DES) model, which is shown to be bisimilar to the original continuous-variable dynamical system. Then, in the discrete domain, a logic supervisor is modularly designed for the abstracted model. Due to the bisimilarity between the abstracted DES model and the original UAV dynamics, the designed logic supervisor can be implemented as a hybrid controller through an interface layer. This supervisor drives the UAV dynamics to satisfy the design requirements. In other words, the hybrid controller is able to bring the UAVs to the desired formation starting from any initial state inside the control horizon and then, maintain the formation. Moreover, a collision avoidance mechanism is embedded in the designed supervisor. Finally, the algorithm has been verified by a hardware-in-the-loop simulation platform, which is developed for unmanned helicopters. The presented results show the effectiveness of the algorithm.

1104.3792 2026-06-03 stat.ML cs.LG cs.NA math.NA

A sufficient condition on monotonic increase of the number of nonzero entry in the optimizer of L1 norm penalized least-square problem

L1范数惩罚最小二乘问题优化器中非零条目数单调递增的充分条件

J. Duan, Charles Soussen, David Brie, Jerome Idier, Y. -P. Wang

AI总结 本文针对L1范数惩罚最小二乘问题(LASSO),提出了一个充分条件,在该条件下当超参数减小时优化器中非零条目数单调递增,并将结果推广到全变分情形。

详情
AI中文摘要

基于$\ell$-1范数的优化广泛应用于信号处理,尤其是近期的压缩感知理论。本文研究$\ell$-1范数惩罚最小二乘问题的解路径,其约束形式称为最小绝对收缩和选择算子(LASSO)。解路径是随着超参数(拉格朗日乘子)变化的所有优化器的集合。解路径的研究对于理解和观察近似项与正则化项之间的权衡曲线具有重要意义。如果已知给定问题的解路径,它可以帮助我们在给定准则(如Akaike信息准则)下找到最优超参数。本文提出了$\ell$-1范数惩罚最小二乘问题的一个充分条件。在该充分条件下,当超参数减小时,优化器或解向量中的非零条目数单调递增。我们还将结果推广到常用的全变分情形,其中$\ell$-1范数作用于解向量的一阶导数。我们证明所提出的条件与Donoho等人\cite{Donoho08}给出的条件以及Efron等人\cite{Efron04}的正锥条件具有内在联系。然而,所提出的条件不需要像Donoho等人的条件那样假设信号的稀疏水平,并且在用于实际应用时比Efron等人的正锥条件更容易验证。

英文摘要

The $\ell$-1 norm based optimization is widely used in signal processing, especially in recent compressed sensing theory. This paper studies the solution path of the $\ell$-1 norm penalized least-square problem, whose constrained form is known as Least Absolute Shrinkage and Selection Operator (LASSO). A solution path is the set of all the optimizers with respect to the evolution of the hyperparameter (Lagrange multiplier). The study of the solution path is of great significance in viewing and understanding the profile of the tradeoff between the approximation and regularization terms. If the solution path of a given problem is known, it can help us to find the optimal hyperparameter under a given criterion such as the Akaike Information Criterion. In this paper we present a sufficient condition on $\ell$-1 norm penalized least-square problem. Under this sufficient condition, the number of nonzero entries in the optimizer or solution vector increases monotonically when the hyperparameter decreases. We also generalize the result to the often used total variation case, where the $\ell$-1 norm is taken over the first order derivative of the solution vector. We prove that the proposed condition has intrinsic connections with the condition given by Donoho, et al \cite{Donoho08} and the positive cone condition by Efron {\it el al} \cite{Efron04}. However, the proposed condition does not need to assume the sparsity level of the signal as required by Donoho et al's condition, and is easier to verify than Efron, et al's positive cone condition when being used for practical applications.

1104.1159 2026-06-03 math.OC cs.RO cs.SY eess.SY

LTL Control in Uncertain Environments with Probabilistic Satisfaction Guarantees

不确定环境下具有概率满足保证的LTL控制

Xu Chu Ding, Stephen L. Smith, Calin Belta, Daniela Rus

AI总结 提出一种最大化任务完成概率的机器人控制策略生成方法,任务由线性时序逻辑公式描述,通过将问题转化为马尔可夫决策过程的最优策略求解,并利用概率模型检验技术给出完整解决方案。

Comments Technical Report accompanying IFAC 2011

详情
AI中文摘要

我们提出一种生成机器人控制策略的方法,该策略最大化完成任务的概率。任务由一组属性的线性时序逻辑(LTL)公式给出,这些属性可以在划分环境的区域中满足。我们假设属性在区域中满足的概率已知,并且机器人只能在当前区域确定命题的真值。受分区抽象相关结果的启发,我们假设运动在图上进行。为了考虑噪声传感器和执行器,我们假设一个控制动作会启用多个具有已知概率的转移。我们证明该问题可以简化为为马尔可夫决策过程(MDP)生成控制策略的问题,使得在其状态上满足LTL公式的概率最大化。我们基于概率模型检验的现有结果,为后一个问题提供了完整解决方案。我们包含一个说明性案例研究。

英文摘要

We present a method to generate a robot control strategy that maximizes the probability to accomplish a task. The task is given as a Linear Temporal Logic (LTL) formula over a set of properties that can be satisfied at the regions of a partitioned environment. We assume that the probabilities with which the properties are satisfied at the regions are known, and the robot can determine the truth value of a proposition only at the current region. Motivated by several results on partitioned-based abstractions, we assume that the motion is performed on a graph. To account for noisy sensors and actuators, we assume that a control action enables several transitions with known probabilities. We show that this problem can be reduced to the problem of generating a control policy for a Markov Decision Process (MDP) such that the probability of satisfying an LTL formula over its states is maximized. We provide a complete solution for the latter problem that builds on existing results from probabilistic model checking. We include an illustrative case study.

1103.4065 2026-06-03 eess.SY cs.RO cs.SY math.OC

Probabilistically Safe Vehicle Control in a Hostile Environment

敌对环境中概率安全的车辆控制

Igor Cizelj, Xu Chu Ding, Morteza Lahijanian, Alessandro Pinto, Calin Belta

AI总结 本文提出一种在静态障碍和移动对手的敌对环境中控制车辆的方法,通过将对手运动建模为泊松过程、车辆穿越时间建模为指数分布,并利用马尔可夫决策过程和概率计算树逻辑最大化任务完成概率。

详情
AI中文摘要

本文提出了一种在具有静态障碍和移动对手的敌对环境中控制车辆的方法。车辆需要满足一个任务目标,该目标表示为在分区环境区域上满足的一组属性的时序逻辑规范。我们将对手在环境区域之间的运动建模为泊松过程。此外,我们假设车辆在每个区域的两个面之间穿越所需的时间服从指数分布,并从环境模拟器中获得该指数分布的速率。我们将车辆的运动和对手分布的车辆更新捕获为马尔可夫决策过程。利用概率计算树逻辑中的工具,我们为车辆找到一种控制策略,以最大化完成任务目标的概率。我们通过说明性案例研究展示了我们的方法。

英文摘要

In this paper we present an approach to control a vehicle in a hostile environment with static obstacles and moving adversaries. The vehicle is required to satisfy a mission objective expressed as a temporal logic specification over a set of properties satisfied at regions of a partitioned environment. We model the movements of adversaries in between regions of the environment as Poisson processes. Furthermore, we assume that the time it takes for the vehicle to traverse in between two facets of each region is exponentially distributed, and we obtain the rate of this exponential distribution from a simulator of the environment. We capture the motion of the vehicle and the vehicle updates of adversaries distributions as a Markov Decision Process. Using tools in Probabilistic Computational Tree Logic, we find a control strategy for the vehicle that maximizes the probability of accomplishing the mission objective. We demonstrate our approach with illustrative case studies.