arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

1707.00281 2026-06-04 cs.CV cs.NA math.NA math.OC

A Batch-Incremental Video Background Estimation Model using Weighted Low-Rank Approximation of Matrices

一种基于矩阵加权低秩近似的批量增量视频背景估计模型

Aritra Dutta, Xin Li, Peter Richtárik

AI总结本文提出一种批量增量视频背景估计模型，通过加权低秩近似改进传统方法，在实测和合成视频上优于GRASTA、ReProCS等算法。

1706.09597 2026-06-04 cs.AI cs.SY eess.SY

Path Integral Networks: End-to-End Differentiable Optimal Control

路径积分网络：端到端可微最优控制

Masashi Okada, Luca Rigazio, Takenobu Aoshima

AI总结本文提出路径积分网络（PI-Net），一种基于路径积分最优控制算法的递归网络表示，用于最优控制规划。PI-Net通过反向传播和随机梯度下降端到端学习系统动态和成本模型，具备规划能力，可泛化到未见状态，适用于连续控制任务，并支持多种学习方案。

1609.00932 2026-06-04 cs.LG cs.AI cs.SY eess.SY math.PR physics.data-an

Spectral learning of dynamic systems from nonequilibrium data

从非平衡数据中学习动态系统的谱方法

Hao Wu, Frank Noé

AI总结本文研究了在不假设数据同分布的情况下，通过施加平衡约束从非平衡观测数据中提取系统平衡动力学的谱学习特性，并提出了一种适用于连续数据的无bin扩展方法，实现线性复杂度下的稳定估计。

详情

Journal ref: Proceedings of the 29th conference on Neural Information Processing Systems (NIPS), Barcelona, Spain, 2016, pp. 4179-4187

AI中文摘要

可观测操作模型（OOMs）及相关模型是建模和分析随机系统的重要且强大的工具。它们精确描述有限秩系统的动力学，并可通过谱学习在假设数据同分布的情况下高效一致地估计。本文研究了在分析长时间尺度系统时不假设数据同分布的谱学习特性，并展示通过施加平衡约束可从非平衡观测数据中提取系统平衡动力学。此外，本文提出了一种适用于连续数据的无bin扩展谱学习方法。与其他连续值谱算法相比，无bin算法仅需线性复杂度即可实现平衡动力学的一致估计。

英文摘要

Observable operator models (OOMs) and related models are one of the most important and powerful tools for modeling and analyzing stochastic systems. They exactly describe dynamics of finite-rank systems and can be efficiently and consistently estimated through spectral learning under the assumption of identically distributed data. In this paper, we investigate the properties of spectral learning without this assumption due to the requirements of analyzing large-time scale systems, and show that the equilibrium dynamics of a system can be extracted from nonequilibrium observation data by imposing an equilibrium constraint. In addition, we propose a binless extension of spectral learning for continuous data. In comparison with the other continuous-valued spectral algorithms, the binless algorithm can achieve consistent estimation of equilibrium dynamics with only linear complexity.

URL PDF HTML ☆

赞 0 踩 0

1706.02869 2026-06-04 cs.LG cs.NA cs.SY eess.SY math.NA

Adaptive Consensus ADMM for Distributed Optimization

自适应共识ADMM用于分布式优化

Zheng Xu, Gavin Taylor, Hao Li, Mario Figueiredo, Xiaoming Yuan, Tom Goldstein

AI总结本文提出自适应共识ADMM方法，通过为每个节点定制参数提升分布式优化性能，并证明其O(1/k)收敛速率。

Comments ICML 2017

1602.07764 2026-06-04 cs.AI cs.LG cs.NA math.NA math.OC stat.ML

Reinforcement Learning of POMDPs using Spectral Methods

使用谱方法进行POMDP的强化学习

Kamyar Azizzadenesheli, Alessandro Lazaric, Animashree Anandkumar

AI总结本文提出基于谱分解方法的POMDP强化学习算法，通过轨迹学习参数并利用优化 oracle 得到最优无记忆策略，证明了与最优无记忆策略的最优 regret 绑定和高维空间的高效扩展性。

详情

Journal ref: 29th Annual Conference on Learning Theory, PMLR 49:193-256, 2016

AI中文摘要

我们提出了一种新的基于谱分解方法的POMDP强化学习算法。尽管谱方法之前已被用于一致学习隐马尔可夫模型等被动潜在变量模型，但POMDP更具挑战性，因为学习者与环境交互可能会改变未来的观测。我们设计了一种通过回合运行的算法，每个回合中利用谱技术从由固定策略生成的轨迹中学习POMDP参数。回合结束时，优化 oracle 返回基于估计POMDP模型的最优无记忆规划策略，该策略最大化预期奖励。我们证明了与最优无记忆策略相比的最优 regret 绑定以及在观测和动作空间维度上的高效扩展性。

英文摘要

We propose a new reinforcement learning algorithm for partially observable Markov decision processes (POMDP) based on spectral decomposition methods. While spectral methods have been previously employed for consistent learning of (passive) latent variable models such as hidden Markov models, POMDPs are more challenging since the learner interacts with the environment and possibly changes the future observations in the process. We devise a learning algorithm running through episodes, in each episode we employ spectral techniques to learn the POMDP parameters from a trajectory generated by a fixed policy. At the end of the episode, an optimization oracle returns the optimal memoryless planning policy which maximizes the expected reward based on the estimated POMDP model. We prove an order-optimal regret bound with respect to the optimal memoryless policy and efficient scaling with respect to the dimensionality of observation and action spaces.

URL PDF HTML ☆

赞 0 踩 0

1405.6341 2026-06-04 cs.RO cs.AI cs.LG cs.SY eess.SY

Efficient Model Learning for Human-Robot Collaborative Tasks

高效的人机协作任务模型学习

Stefanos Nikolaidis, Keren Gu, Ramya Ramakrishnan, Julie Shah

AI总结本文提出一种框架，通过联合动作演示学习人类用户模型，使机器人能自动计算稳健的协作策略。采用无监督学习聚类动作序列，学习逆强化学习奖励函数，并在混合可观测马尔可夫决策过程框架中应用，实现对新用户的类型推断和策略计算。

详情

DOI: 10.1145/2696454.2696455
Journal ref: Proceedings of the Tenth Annual ACM/IEEE International Conference on Human-Robot Interaction (HRI 2015)

AI中文摘要

我们提出了一种框架，用于从联合动作演示中学习人类用户模型，使机器人能够计算协作任务的稳健策略。学习过程完全自动，无需人工干预。首先，我们描述了使用无监督学习算法将演示的动作序列聚类为不同的人类类型。这些演示序列还被机器人用来通过逆强化学习算法学习代表每种类型的奖励函数。学习的模型随后作为混合可观测马尔可夫决策过程（MO-MDP）的一部分使用，其中人类类型是部分可观测变量。通过该框架，我们可以推断新用户类型（未包含在训练集中），并计算与新用户偏好一致且对人类动作偏离具有鲁棒性的机器人策略。最后，我们通过人类受试者实验数据验证了该方法，并进行了概念验证演示，其中一个人与小型工业机器人进行协作任务。

英文摘要

We present a framework for learning human user models from joint-action demonstrations that enables the robot to compute a robust policy for a collaborative task with a human. The learning takes place completely automatically, without any human intervention. First, we describe the clustering of demonstrated action sequences into different human types using an unsupervised learning algorithm. These demonstrated sequences are also used by the robot to learn a reward function that is representative for each type, through the employment of an inverse reinforcement learning algorithm. The learned model is then used as part of a Mixed Observability Markov Decision Process formulation, wherein the human type is a partially observable variable. With this framework, we can infer, either offline or online, the human type of a new user that was not included in the training set, and can compute a policy for the robot that will be aligned to the preference of this new user and will be robust to deviations of the human actions from prior demonstrations. Finally we validate the approach using data collected in human subject experiments, and conduct proof-of-concept demonstrations in which a person performs a collaborative task with a small industrial robot.

URL PDF HTML ☆

赞 0 踩 0

1706.04097 2026-06-04 cs.LG cs.DS cs.NA math.NA stat.ML

Provable Alternating Gradient Descent for Non-negative Matrix Factorization with Strong Correlations

可证明的非负矩阵分解交替梯度下降法用于强相关性情况

Yuanzhi Li, Yingyu Liang

AI总结本文提出了一种简单的交替梯度下降算法，证明在强相关性下能有效恢复真实特征矩阵，并展示了其在噪声下的鲁棒性。

Comments Accepted to the International Conference on Machine Learning (ICML), 2017

1702.07944 2026-06-04 cs.LG cs.AI cs.SY eess.SY math.OC stat.ML

Stochastic Variance Reduction Methods for Policy Evaluation

基于随机方差缩减的方法用于策略评估

Simon S. Du, Jianshu Chen, Lihong Li, Lin Xiao, Dengyong Zhou

AI总结本文提出基于线性函数逼近的策略评估方法，通过将经验策略评估问题转化为二次凸-凹鞍点问题，并设计了双变量批量梯度方法及两种随机方差缩减算法，实现线性缩放和线性收敛。

Comments Accepted by ICML 2017

1706.01127 2026-06-04 cs.RO cs.SY eess.SY math.DS math.OC

Virtual Constraints and Hybrid Zero Dynamics for Realizing Underactuated Bipedal Locomotion

虚拟约束与混合零动力学用于实现欠驱动双足运动

Jessy W Grizzle, Christine Chevallereau

AI总结本文提出了一种协调理论，用于设计反馈控制器实现欠驱动双足机器人稳定行走。通过引入虚拟约束和混合零动力学，同步关节相位变量并捕捉欠驱动特性。

Comments 17 pages, 4 figures, bookchapter

详情

AI中文摘要

欠驱动在人类运动中普遍存在，也应普遍存在于双足机器人运动中。本章提出了一种协调理论，用于设计反馈控制器实现欠驱动双足机器人稳定行走。引入了两个基本工具：虚拟约束和混合零动力学。虚拟约束是通过时间不变反馈控制器施加在机械模型状态变量上的关系。其作用之一是同步机器人关节到内部步态相位变量。另一个作用是诱导一个低维系统，即零动力学，捕捉机器人模型的欠驱动特性，而无需任何近似。为增强直观，首先建立了物理约束与虚拟约束之间的关系。从这里，开发了欠驱动双足模型的混合零动力学，并确立了其在设计渐近稳定的行走运动中的基本作用。本章包含大量参考了已实现这些技术的机器人。

英文摘要

Underactuation is ubiquitous in human locomotion and should be ubiquitous in bipedal robotic locomotion as well. This chapter presents a coherent theory for the design of feedback controllers that achieve stable walking gaits in underactuated bipedal robots. Two fundamental tools are introduced, virtual constraints and hybrid zero dynamics. Virtual constraints are relations on the state variables of a mechanical model that are imposed through a time-invariant feedback controller. One of their roles is to synchronize the robot's joints to an internal gait phasing variable. A second role is to induce a low dimensional system, the zero dynamics, that captures the underactuated aspects of a robot's model, without any approximations. To enhance intuition, the relation between physical constraints and virtual constraints is first established. From here, the hybrid zero dynamics of an underactuated bipedal model is developed, and its fundamental role in the design of asymptotically stable walking motions is established. The chapter includes numerous references to robots on which the highlighted techniques have been implemented.

URL PDF HTML ☆

赞 0 踩 0

1706.00241 2026-06-04 cs.LG cs.NA math.NA stat.ML

Krylov Subspace Recycling for Fast Iterative Least-Squares in Machine Learning

Krylov子空间回收用于机器学习中的快速迭代最小二乘法

Filip de Roos, Philipp Hennig

AI总结本文研究了利用Krylov子空间回收方法提高机器学习中对称正定线性问题求解效率，通过迭代优化低秩近似以平衡计算成本与数值精度。

详情

AI中文摘要

求解对称正定线性问题是机器学习中的基础计算任务。精确解，众所周知，其计算复杂度与矩阵大小呈立方关系。为缓解这一问题，已提出几种线性时间的近似方法，如谱方法和诱导点方法，这些方法现在被广泛应用。这些方法是低秩近似，提前选择低秩空间，并不随时间迭代优化。虽然这允许数据集大小的线性成本，但也导致有限的、无法纠正的近似误差。数值线性代数领域的作者探索了如何迭代优化此类低秩近似，其成本仅为少量矩阵-向量乘法。这一想法尤其在机器学习中许多情况下具有吸引力，其中需要解决一系列相关的对称正定线性问题。从机器学习的角度来看，此类消减方法可以被解释为在时间序列的数值任务中，低秩近似的迁移学习。我们研究了此类方法在我们领域中的应用。我们的实验证明，在中等规模的回归和分类问题上，这种方法可以介于低计算成本和数值精度之间。

英文摘要

Solving symmetric positive definite linear problems is a fundamental computational task in machine learning. The exact solution, famously, is cubicly expensive in the size of the matrix. To alleviate this problem, several linear-time approximations, such as spectral and inducing-point methods, have been suggested and are now in wide use. These are low-rank approximations that choose the low-rank space a priori and do not refine it over time. While this allows linear cost in the data-set size, it also causes a finite, uncorrected approximation error. Authors from numerical linear algebra have explored ways to iteratively refine such low-rank approximations, at a cost of a small number of matrix-vector multiplications. This idea is particularly interesting in the many situations in machine learning where one has to solve a sequence of related symmetric positive definite linear problems. From the machine learning perspective, such deflation methods can be interpreted as transfer learning of a low-rank approximation across a time-series of numerical tasks. We study the use of such methods for our field. Our empirical results show that, on regression and classification problems of intermediate size, this approach can interpolate between low computational cost and numerical precision.

URL PDF HTML ☆

赞 0 踩 0

1705.10432 2026-06-04 cs.AI cs.RO cs.SY eess.SY

Fine-grained acceleration control for autonomous intersection management using deep reinforcement learning

基于深度强化学习的细粒度加速控制用于自动驾驶交叉口管理

Hamid Mirzaei, Tony Givargis

AI总结本文利用信任区域策略优化方法，实现自动驾驶车辆在网格街道中的细粒度加速控制，以达成全局管理目标。

Comments Accepted in IEEE Smart World Congress 2017

1705.08566 2026-06-04 cs.RO cs.SY eess.SY

A Near-Optimal Separation Principle for Nonlinear Stochastic Systems Arising in Robotic Path Planning and Control

非线性随机系统在机器人路径规划与控制中的近最优分离原理

Mohammadhussein Rafieisakhaei, Suman Chakravorty, P. R. Kumar

AI总结本文提出了一种针对机器人路径规划与控制中非线性随机系统的近最优分离方法，通过小噪声假设实现可计算的近最优控制设计，同时推导出具有高斯噪声的非线性随机系统的轨迹优化线性二次调节器设计。

Comments 7 pages, 4 Figures, Submitted to 56th IEEE Conference on Decision and Control (CDC), 2017

1705.05804 2026-06-04 cs.CV cs.NA math.NA stat.ML

The Incremental Multiresolution Matrix Factorization Algorithm

增量多分辨率矩阵分解算法

Vamsi K. Ithapu, Risi Kondor, Sterling C. Johnson, Vikas Singh

AI总结本文提出增量多分辨率矩阵分解算法，用于揭示对称矩阵的层次块结构，通过逐特征分析提升大规模矩阵处理能力，并在医学影像回归任务中验证其有效性。

Comments Computer Vision and Pattern Recognition (CVPR) 2017, 10 pages

详情

AI中文摘要

多分辨率分析和矩阵分解是计算机视觉的基础工具。本文研究了这两个不同领域的交汇，并获得揭示对称矩阵层次块结构的技术，这对许多视觉问题的成功至关重要。我们的新算法，增量多分辨率矩阵分解，逐特征揭示此类结构，因此能有效扩展至大规模矩阵。我们描述了这种多尺度分析比直接全局分解能识别的更多。我们通过医学影像数据评估所得到的分解在回归任务中的有效性。我们还利用该分解在由流行深度网络学习的表示上进行操作，提供证据表明这些网络即使未显式训练以执行此类推断，也能推断语义关系。我们展示了该算法可作为探索工具来改进网络架构，并在视觉的众多其他设置中使用。

英文摘要

Multiresolution analysis and matrix factorization are foundational tools in computer vision. In this work, we study the interface between these two distinct topics and obtain techniques to uncover hierarchical block structure in symmetric matrices -- an important aspect in the success of many vision problems. Our new algorithm, the incremental multiresolution matrix factorization, uncovers such structure one feature at a time, and hence scales well to large matrices. We describe how this multiscale analysis goes much farther than what a direct global factorization of the data can identify. We evaluate the efficacy of the resulting factorizations for relative leveraging within regression tasks using medical imaging data. We also use the factorization on representations learned by popular deep networks, providing evidence of their ability to infer semantic relationships even when they are not explicitly trained to do so. We show that this algorithm can be used as an exploratory tool to improve the network architecture, and within numerous other settings in vision.

URL PDF HTML ☆

赞 0 踩 0

1705.05727 2026-06-04 cs.RO cs.SY eess.SY nlin.CD

A General Scheme Implicit Force Control for a Flexible-Link Manipulator

一种灵活连杆机械臂的通用隐式力控方案

Cecilia Murrugarra, Osberth De Castro, Juan Carlos Grieco, Gerardo Fernandez

AI总结本文提出了一种针对与柔顺环境交互的单连杆柔顺机械臂的隐式力控方案，基于机械臂的数学模型，考虑了柔性梁的动力学和重力作用，通过结构参数确定控制器参数，利用李雅普诺夫理论保证稳定性，采用内外闭环控制结构实现位置和力的跟踪控制及振动抑制。

Comments 16 pages, 14 figures

1705.05475 2026-06-04 cs.LG cs.NA cs.NE math.NA q-bio.NC

Sparse Coding by Spiking Neural Networks: Convergence Theory and Computational Results

稀疏编码的脉冲神经网络：收敛理论与计算结果

Ping Tak Peter Tang, Tsung-Han Lin, Mike Davies

AI总结本文提出一种脉冲神经网络模型，证明其能可靠解决稀疏编码问题，为非冯·诺依曼架构计算机提供了理论保障。

Comments 13 pages, 3 figures

1705.05344 2026-06-04 cs.RO cs.SY eess.SY

GP-ILQG: Data-driven Robust Optimal Control for Uncertain Nonlinear Dynamical Systems

GP-ILQG：基于数据的鲁棒最优控制用于不确定非线性动力学系统

Gilwoo Lee, Siddhartha S. Srinivasa, Matthew T. Mason

AI总结本文提出GP-ILQG算法，结合数据驱动的系统辨识方法与基于微分动态规划的鲁棒最优控制方法，有效解决现实与仿真之间的差距问题，实现快速修正模型并提升控制鲁棒性。

1705.05116 2026-06-04 cs.RO cs.AI cs.CV cs.LG cs.SY eess.SY

Tuning Modular Networks with Weighted Losses for Hand-Eye Coordination

通过加权损失调节模块网络以提升手眼协调

Fangyi Zhang, Jürgen Leitner, Michael Milford, Peter I. Corke

AI总结本文提出端到端微调方法，通过加权损失提升模块化深度视觉-运动策略在平面抓取任务中的手眼协调性能。

Comments 2 pages, to appear in the Deep Learning for Robotic Vision (DLRV) Workshop in CVPR 2017

1705.04763 2026-06-04 cs.RO cs.SY eess.SY

High-Precision Trajectory Tracking in Changing Environments Through $\mathcal{L}_1$ Adaptive Feedback and Iterative Learning

通过L1自适应反馈和迭代学习实现高精度轨迹跟踪

Karime Pereida, Rikky R. P. R. Duivenvoorden, Angela P. Schoellig

AI总结本文提出结合L1自适应反馈与迭代学习控制的框架，提升系统在未知动态扰动下的轨迹跟踪性能，通过实验验证其优于纯ILC方法的鲁棒性和泛化能力。

Comments 7 pages, 5 figures, Proc. of the 2017 IEEE International Conference on Robotics and Automation

详情

AI中文摘要

随着机器人等自动化系统被引入未知和动态环境，需要鲁棒且自适应的控制策略以应对扰动、未建模动态和参数不确定性。本文提出并证明了一种结合L1自适应反馈和迭代学习控制（ILC）的框架，以提高受未知和变化扰动影响的系统轨迹跟踪性能。L1自适应控制器使系统在存在未知和变化扰动的情况下表现出可重复的预定义行为，但并不意味着实现完美轨迹跟踪。ILC通过以往执行的经验改进跟踪性能。ILC的性能受限于底层系统的鲁棒性和可重复性，而在本方法中，这由L1自适应控制器处理。特别地，我们能够将学习的轨迹泛化到不同的系统配置，因为L1自适应控制器处理了系统底层的变化。我们通过四旋翼在未知动态扰动下的实验，展示了结合方法相比纯ILC在轨迹跟踪性能和泛化能力上的改进。这是首次在实验中展示L1自适应控制与ILC结合的工作。

英文摘要

As robots and other automated systems are introduced to unknown and dynamic environments, robust and adaptive control strategies are required to cope with disturbances, unmodeled dynamics and parametric uncertainties. In this paper, we propose and provide theoretical proofs of a combined $\mathcal{L}_1$ adaptive feedback and iterative learning control (ILC) framework to improve trajectory tracking of a system subject to unknown and changing disturbances. The $\mathcal{L}_1$ adaptive controller forces the system to behave in a repeatable, predefined way, even in the presence of unknown and changing disturbances; however, this does not imply that perfect trajectory tracking is achieved. ILC improves the tracking performance based on experience from previous executions. The performance of ILC is limited by the robustness and repeatability of the underlying system, which, in this approach, is handled by the $\mathcal{L}_1$ adaptive controller. In particular, we are able to generalize learned trajectories across different system configurations because the $\mathcal{L}_1$ adaptive controller handles the underlying changes in the system. We demonstrate the improved trajectory tracking performance and generalization capabilities of the combined method compared to pure ILC in experiments with a quadrotor subject to unknown, dynamic disturbances. This is the first work to show $\mathcal{L}_1$ adaptive control combined with ILC in experiment.

URL PDF HTML ☆

赞 0 踩 0

1702.02453 2026-06-04 cs.LG cs.RO cs.SY eess.SY

Preparing for the Unknown: Learning a Universal Policy with Online System Identification

为未知做准备：学习通用策略与在线系统识别

Wenhao Yu, Jie Tan, C. Karen Liu, Greg Turk

AI总结本文提出了一种学习通用策略的方法，通过在线系统识别和大量训练示例，使策略在未知动态模型下具备鲁棒性，适用于多种动态模型和环境变化。

Comments Accepted as a conference paper at RSS 2017

详情

AI中文摘要

我们提出了一种学习控制策略的新方法，该方法能够在未知动态模型下有效运行。我们通过利用大量由物理模拟器生成的训练示例来创建此类策略。系统由两个组件组成：通用策略（UP）和在线系统识别（OSI）函数。我们描述我们的控制策略为通用，因为它是在广泛动态模型上训练的。这些动态模型的变化可能包括机器人组件的质量和惯性差异、摩擦系数变化或未知被操作物体的质量。通过在这些变化上训练通用策略，控制策略在未知环境中准备了更广泛的可能条件。系统第二部分利用系统的近期状态和动作历史来预测动态模型参数mu。在线系统识别的mu值然后作为输入提供给控制策略（连同系统状态）。UP-OSI是一种在广泛动态模型上适用且对环境突然变化具有响应性的稳健控制策略。我们评估了该系统在多种任务上的性能，包括cart-pole翻转问题、双倒立摆、跳蛙器的运动和机械臂的块投掷任务。UP-OSI在各种动态模型上均有效。此外，当测试动态模型超出训练范围时，UP-OSI在UP单独的情况下表现更优，即使UP被给予实际的动态模型值。除了创建更稳健的控制器的好处外，UP-OSI还具有缩小模拟与真实物理系统现实差距的潜力。

英文摘要

We present a new method of learning control policies that successfully operate under unknown dynamic models. We create such policies by leveraging a large number of training examples that are generated using a physical simulator. Our system is made of two components: a Universal Policy (UP) and a function for Online System Identification (OSI). We describe our control policy as universal because it is trained over a wide array of dynamic models. These variations in the dynamic model may include differences in mass and inertia of the robots' components, variable friction coefficients, or unknown mass of an object to be manipulated. By training the Universal Policy with this variation, the control policy is prepared for a wider array of possible conditions when executed in an unknown environment. The second part of our system uses the recent state and action history of the system to predict the dynamics model parameters mu. The value of mu from the Online System Identification is then provided as input to the control policy (along with the system state). Together, UP-OSI is a robust control policy that can be used across a wide range of dynamic models, and that is also responsive to sudden changes in the environment. We have evaluated the performance of this system on a variety of tasks, including the problem of cart-pole swing-up, the double inverted pendulum, locomotion of a hopper, and block-throwing of a manipulator. UP-OSI is effective at these tasks across a wide range of dynamic models. Moreover, when tested with dynamic models outside of the training range, UP-OSI outperforms the Universal Policy alone, even when UP is given the actual value of the model dynamics. In addition to the benefits of creating more robust controllers, UP-OSI also holds out promise of narrowing the Reality Gap between simulated and real physical systems.

URL PDF HTML ☆

赞 0 踩 0

1704.04722 2026-06-04 cs.RO cs.SY eess.SY

Bounded Distributed Flocking Control of Nonholonomic Mobile Robots

非holonomic移动机器人有界分布式编队控制

Thang Nguyen, Hung La, Vahid Azimi, Thanh-Trung Han

AI总结本文提出基于有界反馈的非holonomic移动机器人分布式编队控制方法，解决动态特性带来的挑战，实现速度一致性、避障和聚拢维持。

1702.04077 2026-06-04 cs.LG cs.NA math.NA stat.ML

Mutual Kernel Matrix Completion

互核矩阵补全

Tsuyoshi Kato, Rachelle Rivero

AI总结本文提出互核矩阵补全算法，通过融合数据与核矩阵补全方法，提升生物数据分类任务中缺失核矩阵的补全效果。

Comments 10 pages, 4 figures

详情

AI中文摘要

随着各种数据的大量涌入，从其中提取知识已成为数据科学家的一项有趣但繁琐的任务，特别是当数据形式异构且存在缺失信息时。许多数据补全技术已被引入，尤其是在核方法出现后。然而，在现有文献中，关于同时补全多个不完整核矩阵的研究却很少受到关注。本文提出了一种新的方法，称为互核矩阵补全（MKMC）算法，通过结合数据融合和核矩阵补全的概念，应用于生物数据集以用于分类任务。我们首先引入了一个目标函数，通过利用EM算法进行最小化，从而得到涉及的核矩阵中缺失条目的估计。补全后的核矩阵随后被结合以生成一个模型矩阵，可用于进一步改进获得的估计。我们的研究结果表明，E步和M步以闭合形式给出，使我们的算法在时间和内存方面都高效。完成补全后，补全的核矩阵用于训练SVM分类器，以测试数据点之间关系的保持程度。我们的实证结果表明，所提出的算法在保持数据点之间关系和准确恢复缺失核矩阵条目方面优于传统补全技术。目前，MKMC为多个相关不完整核矩阵的相互估计问题提供了一个有前途的解决方案。

英文摘要

With the huge influx of various data nowadays, extracting knowledge from them has become an interesting but tedious task among data scientists, particularly when the data come in heterogeneous form and have missing information. Many data completion techniques had been introduced, especially in the advent of kernel methods. However, among the many data completion techniques available in the literature, studies about mutually completing several incomplete kernel matrices have not been given much attention yet. In this paper, we present a new method, called Mutual Kernel Matrix Completion (MKMC) algorithm, that tackles this problem of mutually inferring the missing entries of multiple kernel matrices by combining the notions of data fusion and kernel matrix completion, applied on biological data sets to be used for classification task. We first introduced an objective function that will be minimized by exploiting the EM algorithm, which in turn results to an estimate of the missing entries of the kernel matrices involved. The completed kernel matrices are then combined to produce a model matrix that can be used to further improve the obtained estimates. An interesting result of our study is that the E-step and the M-step are given in closed form, which makes our algorithm efficient in terms of time and memory. After completion, the (completed) kernel matrices are then used to train an SVM classifier to test how well the relationships among the entries are preserved. Our empirical results show that the proposed algorithm bested the traditional completion techniques in preserving the relationships among the data points, and in accurately recovering the missing kernel matrix entries. By far, MKMC offers a promising solution to the problem of mutual estimation of a number of relevant incomplete kernel matrices.

URL PDF HTML ☆

赞 0 踩 0

1605.00609 2026-06-04 cs.LG cs.IT cs.NA math.IT math.NA stat.ML

Algorithms for Learning Sparse Additive Models with Interactions in High Dimensions

高维空间中包含交互项的稀疏加法模型的学习算法

Hemant Tyagi, Anastasios Kyrillidis, Bernd Gärtner, Andreas Krause

AI总结本文提出了一种在高维空间中学习包含稀疏交互项的加法模型的算法，通过压缩感知方法有效恢复模型结构并保证误差界。

Comments To appear in Information and Inference: A Journal of the IMA. Made following changes after review process: (a) Corrected typos throughout the text. (b) Corrected choice of sampling distribution in Section 5, see eqs. (5.2), (5.3). (c) More detailed comparison with existing work in Section 8. (d) Added Section B in appendix on roots of cubic equation

详情

AI中文摘要

一个函数$f: \mathbb{R}^d \rightarrow \mathbb{R}$是稀疏加法模型（SPAM），如果其形式为$f(\mathbf{x}) = \sum_{l \in \mathcal{S}}ϕ_{l}(x_l)$，其中$\mathcal{S} \subset [d]$，且$|\mathcal{S}| \ll d$。假设$ϕ$和$\mathcal{S}$未知，已有大量工作致力于从样本中估计$f$。本文考虑了一种广义的SPAMs，允许存在少量的二次交互项。对于某些$\mathcal{S}_1 \subset [d], \mathcal{S}_2 \subset {[d] \choose 2}$，其中$|\mathcal{S}_1| \ll d, |\mathcal{S}_2| \ll d^2$，函数$f$现在被假设为形式：$\sum_{p \in \mathcal{S}_1}ϕ_{p} (x_p) + \sum_{(l,l^{\prime}) \in \mathcal{S}_2}ϕ_{(l,l^{\prime})} (x_l,x_{l^{\prime}})$。假设我们能够任意查询$f$的域内任意点，我们推导出高效的算法，能够以有限样本界证明恢复$\mathcal{S}_1,\mathcal{S}_2$。我们的分析涵盖了无噪声设置，即获得精确的$f$样本，也扩展到有噪声设置，其中查询被噪声污染。特别是对于有噪声设置，我们考虑了两种噪声模型：独立同分布高斯噪声和任意但有界的噪声。我们的主要方法依赖于稀疏Hessian矩阵的估计，为此我们提供了两种新的压缩感知方案。一旦$\mathcal{S}_1, \mathcal{S}_2$已知，我们展示了如何通过额外的$f$查询估计个体组件$ϕ_p$, $ϕ_{(l,l^{\prime})}$，并保证均匀误差界。最后，我们通过合成数据的模拟结果验证了我们的理论发现。

英文摘要

A function $f: \mathbb{R}^d \rightarrow \mathbb{R}$ is a Sparse Additive Model (SPAM), if it is of the form $f(\mathbf{x}) = \sum_{l \in \mathcal{S}}ϕ_{l}(x_l)$ where $\mathcal{S} \subset [d]$, $|\mathcal{S}| \ll d$. Assuming $ϕ$'s, $\mathcal{S}$ to be unknown, there exists extensive work for estimating $f$ from its samples. In this work, we consider a generalized version of SPAMs, that also allows for the presence of a sparse number of second order interaction terms. For some $\mathcal{S}_1 \subset [d], \mathcal{S}_2 \subset {[d] \choose 2}$, with $|\mathcal{S}_1| \ll d, |\mathcal{S}_2| \ll d^2$, the function $f$ is now assumed to be of the form: $\sum_{p \in \mathcal{S}_1}ϕ_{p} (x_p) + \sum_{(l,l^{\prime}) \in \mathcal{S}_2}ϕ_{(l,l^{\prime})} (x_l,x_{l^{\prime}})$. Assuming we have the freedom to query $f$ anywhere in its domain, we derive efficient algorithms that provably recover $\mathcal{S}_1,\mathcal{S}_2$ with finite sample bounds. Our analysis covers the noiseless setting where exact samples of $f$ are obtained, and also extends to the noisy setting where the queries are corrupted with noise. For the noisy setting in particular, we consider two noise models namely: i.i.d Gaussian noise and arbitrary but bounded noise. Our main methods for identification of $\mathcal{S}_2$ essentially rely on estimation of sparse Hessian matrices, for which we provide two novel compressed sensing based schemes. Once $\mathcal{S}_1, \mathcal{S}_2$ are known, we show how the individual components $ϕ_p$, $ϕ_{(l,l^{\prime})}$ can be estimated via additional queries of $f$, with uniform error bounds. Lastly, we provide simulation results on synthetic data that validate our theoretical findings.

URL PDF HTML ☆

赞 0 踩 0

1601.04037 2026-06-04 cs.RO cs.AI cs.SY eess.SY math.DS math.OC

Funnel Libraries for Real-Time Robust Feedback Motion Planning

用于实时鲁棒反馈运动规划的 funnel 库

Anirudha Majumdar, Russ Tedrake

AI总结本文提出利用预计算的 funnel 库实现实时鲁棒反馈运动规划，通过凸优化计算 funnel 并在运行时安全组合运动计划，验证了在复杂环境中高动态机器人系统鲁棒性和安全性。

Comments International Journal of Robotics Research (To Appear)

详情

AI中文摘要

我们考虑了在存在环境不确定性、参数模型不确定性和扰动时，生成保证成功的机器人运动计划的问题。此外，我们还考虑了必须在实时中生成这些计划的场景，因为环境中的约束（如障碍物）可能在运行时通过有噪声的传感器感知到。我们的方法是预先计算不同系统操作的“funnels”库，这些 funnels 确保在执行对应操作的反馈控制器时，状态在扰动范围内保持。我们利用凸优化（特别是求和平方编程）的强大计算能力来计算这些 funnels。所得到的 funnel 库然后在运行时被顺序组合以生成运动计划，同时确保机器人的安全性。本文的一个主要优势是通过显式考虑不确定性的影响，机器人可以根据运动计划对扰动的脆弱性来评估。我们通过大量硬件实验（在高速（约12英里/小时）下避障的小型固定翼飞机）和彻底的仿真实验（地面车辆和四旋翼模型在复杂环境中导航）来演示和验证我们的方法。据我们所知，这些演示构成了首次证明安全且鲁棒的控制方法，用于具有复杂非线性动力学的机器人系统，在具有复杂几何约束的环境中实时规划。

英文摘要

We consider the problem of generating motion plans for a robot that are guaranteed to succeed despite uncertainty in the environment, parametric model uncertainty, and disturbances. Furthermore, we consider scenarios where these plans must be generated in real-time, because constraints such as obstacles in the environment may not be known until they are perceived (with a noisy sensor) at runtime. Our approach is to pre-compute a library of "funnels" along different maneuvers of the system that the state is guaranteed to remain within (despite bounded disturbances) when the feedback controller corresponding to the maneuver is executed. We leverage powerful computational machinery from convex optimization (sums-of-squares programming in particular) to compute these funnels. The resulting funnel library is then used to sequentially compose motion plans at runtime while ensuring the safety of the robot. A major advantage of the work presented here is that by explicitly taking into account the effect of uncertainty, the robot can evaluate motion plans based on how vulnerable they are to disturbances. We demonstrate and validate our method using extensive hardware experiments on a small fixed-wing airplane avoiding obstacles at high speed (~12 mph), along with thorough simulation experiments of ground vehicle and quadrotor models navigating through cluttered environments. To our knowledge, these demonstrations constitute one of the first examples of provably safe and robust control for robotic systems with complex nonlinear dynamics that need to plan in real-time in environments with complex geometric constraints.

URL PDF HTML ☆

赞 0 踩 0

1704.06803 2026-06-04 cs.LG cs.IR cs.NA math.NA stat.ML

Geometric Matrix Completion with Recurrent Multi-Graph Neural Networks

基于循环多图神经网络的几何矩阵补全

Federico Monti, Michael M. Bronstein, Xavier Bresson

AI总结本文提出利用几何深度学习改进矩阵补全，结合图卷积网络和循环神经网络，学习图结构模式和非线性扩散过程，以提升推荐系统性能，参数数量与矩阵规模无关。

1610.06688 2026-06-04 cs.CV cs.NA math.NA

Multispectral image denoising with optimized vector non-local mean filter

多光谱图像去噪的优化向量非局部均值滤波

Ahmed Ben Said, Rachid Hadjidj, Kamel Eddine Melkemi, Sebti Foufou

AI总结本文提出将非局部均值滤波扩展至向量域，用于多光谱图像去噪，通过优化参数和计算复杂度提升去噪性能。

Comments 30 pages, 17 figures, journal paper

详情

DOI: 10.1016/j.dsp.2016.07.017

AI中文摘要

如今，许多应用依赖高质量图像以确保任务执行性能。然而，噪声是大多数应用中不可避免的问题。因此，开发技术以减轻噪声影响，同时保持图像相关信息的完整性至关重要。本文提出将非局部均值滤波（NLM）扩展至向量情况，并应用于多光谱图像去噪。目标是利用多光谱成像系统带来的额外信息。NLM滤波器利用图像中的信息冗余来去除噪声。恢复的像素是图像中所有像素的加权平均。在我们的贡献中，我们提出了一种优化框架，其中动态调整NLM滤波器参数，并通过考虑最相似像素来降低计算复杂度。滤波器参数使用Stein的无偏风险估计器（SURE）而非随意方法进行优化。实验在受加性白高斯噪声污染的多光谱图像上进行，并提供了PSNR和与其他方法的相似性比较，以展示本方法在去噪性能和计算复杂度方面的效率。

英文摘要

Nowadays, many applications rely on images of high quality to ensure good performance in conducting their tasks. However, noise goes against this objective as it is an unavoidable issue in most applications. Therefore, it is essential to develop techniques to attenuate the impact of noise, while maintaining the integrity of relevant information in images. We propose in this work to extend the application of the Non-Local Means filter (NLM) to the vector case and apply it for denoising multispectral images. The objective is to benefit from the additional information brought by multispectral imaging systems. The NLM filter exploits the redundancy of information in an image to remove noise. A restored pixel is a weighted average of all pixels in the image. In our contribution, we propose an optimization framework where we dynamically fine tune the NLM filter parameters and attenuate its computational complexity by considering only pixels which are most similar to each other in computing a restored pixel. Filter parameters are optimized using Stein's Unbiased Risk Estimator (SURE) rather than using ad hoc means. Experiments have been conducted on multispectral images corrupted with additive white Gaussian noise and PSNR and similarity comparison with other approaches are provided to illustrate the efficiency of our approach in terms of both denoising performance and computation complexity.

URL PDF HTML ☆

赞 0 踩 0

1704.05249 2026-06-04 cs.LG cs.NI cs.SY eess.SY

Hot or not? Forecasting cellular network hot spots using sector performance indicators

热点与否？利用扇区性能指标预测蜂窝网络热点

Joan Serrà, Ilias Leontiadis, Alexandros Karatzoglou, Konstantina Papagiannaki

AI总结本文研究蜂窝网络热点评分的时空模式，利用树形机器学习模型预测热点，发现树模型在预测常规和非常规热点时分别提升14%和153%的准确性。

Comments Accepted for publication at ICDE 2017 - Industrial Track

详情

AI中文摘要

为管理维护大规模蜂窝网络，运营商需了解何时哪些扇区表现不佳。为此，他们使用所谓的热点评分，即多种网络测量的组合结果，反映单个扇区的即时整体性能。尽管运营商对网络当前性能和整体趋势有良好理解，但预测每个扇区随时间的变化却极具挑战性，因为其受常规和非常规事件影响，由人类行为和硬件故障触发。本文研究热点评分的时空模式，揭示其规律性。基于观察，我们探索利用近期测量历史预测未来热点的可能性。为此，我们考虑基于树的机器学习模型，并研究其性能随时间、历史数据量和预测时间跨度的变化。结果表明，与最佳基线相比，树模型在预测常规热点时可提升14%，在预测非常规热点时可提升153%。后者为中等时间跨度内预测孤立、非常规行为的热点提供了有力证据。整体而言，本文为蜂窝扇区动态及其可预测性提供了见解，并为更具前瞻性的网络运营和更长的预测时间跨度铺平了道路。

英文摘要

To manage and maintain large-scale cellular networks, operators need to know which sectors underperform at any given time. For this purpose, they use the so-called hot spot score, which is the result of a combination of multiple network measurements and reflects the instantaneous overall performance of individual sectors. While operators have a good understanding of the current performance of a network and its overall trend, forecasting the performance of each sector over time is a challenging task, as it is affected by both regular and non-regular events, triggered by human behavior and hardware failures. In this paper, we study the spatio-temporal patterns of the hot spot score and uncover its regularities. Based on our observations, we then explore the possibility to use recent measurements' history to predict future hot spots. To this end, we consider tree-based machine learning models, and study their performance as a function of time, amount of past data, and prediction horizon. Our results indicate that, compared to the best baseline, tree-based models can deliver up to 14% better forecasts for regular hot spots and 153% better forecasts for non-regular hot spots. The latter brings strong evidence that, for moderate horizons, forecasts can be made even for sectors exhibiting isolated, non-regular behavior. Overall, our work provides insight into the dynamics of cellular sectors and their predictability. It also paves the way for more proactive network operations with greater forecasting horizons.

URL PDF HTML ☆

赞 0 踩 0

1611.05317 2026-06-04 cs.LG cs.SY eess.SY

A Learning Scheme for Microgrid Islanding and Reconnection

微电网孤岛与重新连接的学习方案

Carter Lassetter, Eduardo Cotilla-Sanchez, Jinsub Kim

AI总结本文提出一种学习方案，通过实时数据预测微电网重新连接到主电网的稳定性，利用支持向量机和动态模拟器提高预测准确性。

Comments 10 pages, 5 figures

详情

AI中文摘要

本文介绍了一种潜在的学习方案，能够动态预测子网络重新连接到主电网的稳定性。随着电力系统趋向智能化和绿色化，自给自足的微电网部署变得更为可能。微电网可能独立运行或与主电网同步，因此控制方法需考虑孤岛和重新连接。目前，最优且安全的重新连接能力尚不明确，目前仅限于连接点之间的简单同步。本文提出一种利用实时数据从相量测量单元（PMUs）的支持向量机（SVM）来预测子网络重新连接是否会导致稳定或不稳定。通过动态模拟器生成训练数据，用于在不同运行状态下训练SVM。分类器在多种情况下进行测试以确保多样性。在大多数条件下，动态预测的准确率约为85%。

英文摘要

This paper introduces a potential learning scheme that can dynamically predict the stability of the reconnection of sub-networks to a main grid. As the future electrical power systems tend towards smarter and greener technology, the deployment of self sufficient networks, or microgrids, becomes more likely. Microgrids may operate on their own or synchronized with the main grid, thus control methods need to take into account islanding and reconnecting of said networks. The ability to optimally and safely reconnect a portion of the grid is not well understood and, as of now, limited to raw synchronization between interconnection points. A support vector machine (SVM) leveraging real-time data from phasor measurement units (PMUs) is proposed to predict in real time whether the reconnection of a sub-network to the main grid would lead to stability or instability. A dynamics simulator fed with pre-acquired system parameters is used to create training data for the SVM in various operating states. The classifier was tested on a variety of cases and operating points to ensure diversity. Accuracies of approximately 85% were observed throughout most conditions when making dynamic predictions of a given network.

URL PDF HTML ☆

赞 0 踩 0

1704.03103 2026-06-04 cs.RO cs.AI cs.CG cs.SY eess.SY

Minkowski Operations of Sets with Application to Robot Localization

Minkowski运算与机器人定位的应用

Benoit Desrochers, Luc Jaulin

AI总结本文通过引入Minkowski和与差的分离器，高效解决机器人在非结构化环境中基于声呐测量的定位问题，并通过测试案例验证了方法的有效性。

Comments In Proceedings SNR 2017, arXiv:1704.02421

1605.07367 2026-06-04 cs.LG cs.NA math.NA math.OC stat.ML

Riemannian stochastic variance reduced gradient on Grassmann manifold

黎曼流形上的随机方差缩减梯度算法

Hiroyuki Kasai, Hiroyuki Sato, Bamdev Mishra

AI总结本文提出了一种在紧凑流形搜索空间中扩展欧几里得随机方差缩减梯度算法的黎曼扩展方法，针对格拉斯曼流形进行研究，解决了多个梯度的平均、加法和减法问题，并在不同步长下分析了算法的收敛性。

详情

AI中文摘要

随机方差缩减算法近年来在最小化大量但有限的损失函数的平均值方面变得流行。本文提出了一种新颖的黎曼扩展欧几里得随机方差缩减梯度算法（R-SVRG）到紧凑流形搜索空间。为此，我们展示了在格拉斯曼流形上的发展。通过在格拉斯曼流形上引入对数映射和向量的平行翻译来解决多个梯度的平均、加法和减法的关键挑战。我们展示了所提出算法在衰减步长下的全局收敛性分析，并在固定步长下在某些自然假设下进行了局部收敛率分析。所提出算法被应用于格拉斯曼流形上的多个问题，如主成分分析、低秩矩阵补全和Karcher均值计算。在所有这些情况下，所提出算法都优于标准的黎曼随机梯度下降算法。

英文摘要

Stochastic variance reduction algorithms have recently become popular for minimizing the average of a large, but finite, number of loss functions. In this paper, we propose a novel Riemannian extension of the Euclidean stochastic variance reduced gradient algorithm (R-SVRG) to a compact manifold search space. To this end, we show the developments on the Grassmann manifold. The key challenges of averaging, addition, and subtraction of multiple gradients are addressed with notions like logarithm mapping and parallel translation of vectors on the Grassmann manifold. We present a global convergence analysis of the proposed algorithm with decay step-sizes and a local convergence rate analysis under fixed step-size with some natural assumptions. The proposed algorithm is applied on a number of problems on the Grassmann manifold like principal components analysis, low-rank matrix completion, and the Karcher mean computation. In all these cases, the proposed algorithm outperforms the standard Riemannian stochastic gradient descent algorithm.

URL PDF HTML ☆

赞 0 踩 0

1704.02075 2026-06-04 cs.RO cs.SY eess.SY

On Sensing, Agility, and Computation Requirements for a Data-gathering Agile Robotic Vehicle

关于数据收集敏捷机器人车辆的感知、敏捷性和计算需求

Fangchang Ma, Sertac Karaman

AI总结本文研究了机器人车辆在未知数据源位置下收集信息的性能，分析了感知、敏捷性和计算能力对信息获取的影响，提出了理论结果和实验验证。

Comments 22 pages, 11 figures

详情

AI中文摘要

我们考虑了一种任务为通过访问一组空间分布的数据源来收集信息的机器人车辆，这些数据源的位置在事先未知，但可以在飞行中发现。我们假设涉及漂移的一阶机器人动力学，并假设数据源的位置是泊松分布的。在此设定下，我们以感知、敏捷性和计算能力来表征机器人的性能。更具体地说，机器人性能的表征包括其从远处感知目标位置的能力、快速移动的能力以及进行推理和规划的计算能力。我们还以每个数据源可以获取的信息量和分布来表征机器人的性能。我们的理论结果包括：目标位置的信息量分布对从远处感知目标的需求影响巨大；性能随着移动能力的增加而增加，但回报递减；并且计算需求随着规划的增加而比推理更快增加，同时随着感知范围和移动能力的增加。我们提供了计算实验来验证我们的理论结果。最后，我们展示了这些结果可以用于移动机器人系统的感知、驱动和计算能力的协同设计，以执行信息收集任务。我们的证明技术建立了机器人信息收集基本问题与统计力学中的最后通路渗透问题之间的新联系，这可能本身就有价值。

英文摘要

We consider a robotic vehicle tasked with gathering information by visiting a set of spatially-distributed data sources, the locations of which are not known a priori, but are discovered on the fly. We assume a first-order robot dynamics involving drift and that the locations of the data sources are Poisson-distributed. In this setting, we characterize the performance of the robot in terms of its sensing, agility, and computation capabilities. More specifically, the robot's performance is characterized in terms of its ability to sense the target locations from a distance, to maneuver quickly, and to perform computations for inference and planning. We also characterize the performance of the robot in terms of the amount and distribution of information that can be acquired at each data source. The following are among our theoretical results: the distribution of the amount of information among the target locations immensely impacts the requirements for sensing targets from a distance; performance increases with increasing maneuvering capability, but with diminishing returns; and the computation requirements increase more rapidly for planning as opposed to inference, with both increasing sensing range and maneuvering ability. We provide computational experiments to validate our theoretical results. Finally, we demonstrate that these results can be utilized in the co-design of sensing, actuation, and computation capabilities of mobile robotic systems for an information-gathering mission. Our proof techniques establish novel connections between the fundamental problems of robotic information-gathering and the last-passage percolation problem of statistical mechanics, which may be of interest on its own right.

URL PDF HTML ☆

赞 0 踩 0