arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 21506
1612.06176 2026-06-04 cs.CV cs.NA math.NA stat.ML

An extended Perona-Malik model based on probabilistic models

基于概率模型扩展的Perona-Malik模型

Lars M. Mescheder, Dirk A. Lorenz

AI总结 本文基于高斯尺度混合模型扩展了Perona-Malik模型,通过EM算法推导出滞后扩散算法,并改进其以更好地捕捉恢复中的不确定性,同时提出计算可行的放松方法,实验显示改进算法在恢复纹理区域和模糊边缘方面表现更优。

详情
AI中文摘要

Perona-Malik模型在从噪声输入中恢复图像方面非常成功。本文将该模型重新诠释为高斯尺度混合物的语言,并推导出一些扩展。具体来说,我们展示了将EM算法应用于高斯尺度混合物导致滞后扩散算法用于计算Perona-Malik扩散方程的稳态点。此外,我们展示了这些高斯尺度混合物的均场近似如何导致一种改进的滞后扩散算法,更准确地捕捉恢复中的不确定性。由于这种改进在实践中难以计算,我们提出对均场目标进行放松以使算法计算可行。我们的数值实验表明,这种改进的滞后扩散算法在恢复纹理区域和模糊边缘方面通常比未改进的算法表现更好。作为高斯尺度混合框架的第二个应用,我们展示了如何通过高效采样过程获得概率模型,使计算条件均值和其他期望在算法上可行。同样,所得到的算法与滞后扩散算法有很强的相似性。最后,我们展示了在相同框架下,通过离散边缘先验可以得到概率版本的Mumford-Shah分割模型。

英文摘要

The Perona-Malik model has been very successful at restoring images from noisy input. In this paper, we reinterpret the Perona-Malik model in the language of Gaussian scale mixtures and derive some extensions of the model. Specifically, we show that the expectation-maximization (EM) algorithm applied to Gaussian scale mixtures leads to the lagged-diffusivity algorithm for computing stationary points of the Perona-Malik diffusion equations. Moreover, we show how mean field approximations to these Gaussian scale mixtures lead to a modification of the lagged-diffusivity algorithm that better captures the uncertainties in the restoration. Since this modification can be hard to compute in practice we propose relaxations to the mean field objective to make the algorithm computationally feasible. Our numerical experiments show that this modified lagged-diffusivity algorithm often performs better at restoring textured areas and fuzzy edges than the unmodified algorithm. As a second application of the Gaussian scale mixture framework, we show how an efficient sampling procedure can be obtained for the probabilistic model, making the computation of the conditional mean and other expectations algorithmically feasible. Again, the resulting algorithm has a strong resemblance to the lagged-diffusivity algorithm. Finally, we show that a probabilistic version of the Mumford-Shah segementation model can be obtained in the same framework with a discrete edge-prior.

1612.06008 2026-06-04 cs.RO cs.SY eess.SY

Optimal Control-Based UAV Path Planning with Dynamically-Constrained TSP with Neighborhoods

基于最优控制的无人机路径规划与动态约束TSP带邻居问题

Dae-Sung Jang, Hyeok-Joo Chae, Han-Lim Choi

AI总结 本文提出一种基于采样 roadmap 算法的无人机路径规划方法,通过最优控制生成路径以减少计算时间并提升解的质量,解决动态约束 TSP 带邻居问题。

Comments 17 pages, 7 figures

详情
AI中文摘要

本文针对具备遥感能力的无人机路径规划问题,将其视为动态约束的旅行商问题带邻居,提出一种结合最优控制的采样 roadmap 算法,通过减少局部路径优化的数值计算和提取 roadmap 中的信息来提高计算效率。数值仿真验证了该算法在降低计算时间及提升解质量方面优于传统 roadmap 基路径规划方法。

英文摘要

This paper addresses path planning of an unmanned aerial vehicle (UAV) with remote sensing capabilities (or wireless communication capabilities). The goal of the path planning is to find a minimum-flight-time closed tour of the UAV visiting all executable areas of given remote sensing and communication tasks; in order to incorporate the nonlinear vehicle dynamics, this problem is regarded as a dynamically-constrained traveling salesman problem with neighborhoods. To obtain a close-to-optimal solution for the path planning in a tractable manner, a sampling-based roadmap algorithm that embeds an optimal control-based path generation process is proposed. The algorithm improves the computational efficiency by reducing numerical computations required for optimizing inefficient local paths, and by extracting additional information from a roadmap of a fixed number of samples. Comparative numerical simulations validate the efficiency of the presented algorithm in reducing computation time and improving the solution quality compared to previous roadmap-based planning methods.

1612.05323 2026-06-04 cs.CV cs.NA math.NA

A Stochastic Large Deformation Model for Computational Anatomy

计算解剖学中的一种随机大变形模型

Alexis Arnaudon, Darryl D. Holm, Akshay Pai, Stefan Sommer

AI总结 本文提出一种随机模型,用于在大变形流形度量映射框架中引入随机变化,通过几何性质定制的设置,解决带噪声地标点模板估计问题,并提出两种高效估计噪声场参数的方法。

详情
AI中文摘要

在使用计算解剖学研究人体器官形状时,发现变异来源于受试者间解剖差异、疾病特异性效应和测量噪声。本文介绍了一种随机模型,用于将随机变化纳入大变形流形度量映射(LDDMM)框架中。通过在特定设置中考虑随机性,该设置适合LDDMM的几何性质,我们为带噪声的地标点模板估计问题建立了公式,并给出了两种高效估计噪声场参数的方法。一种方法直接用有限组微分方程近似每个地标点的方差时间演化,另一种基于期望最大化算法。在第二种方法中,通过应用随机扰动的大变形梯度流算法的桥采样技术,在不注册地标点的情况下评估数据似然性。该方法和估计算法在合成示例和人类胼胝体形状数据上进行了实验验证。

英文摘要

In the study of shapes of human organs using computational anatomy, variations are found to arise from inter-subject anatomical differences, disease-specific effects, and measurement noise. This paper introduces a stochastic model for incorporating random variations into the Large Deformation Diffeomorphic Metric Mapping (LDDMM) framework. By accounting for randomness in a particular setup which is crafted to fit the geometrical properties of LDDMM, we formulate the template estimation problem for landmarks with noise and give two methods for efficiently estimating the parameters of the noise fields from a prescribed data set. One method directly approximates the time evolution of the variance of each landmark by a finite set of differential equations, and the other is based on an Expectation-Maximisation algorithm. In the second method, the evaluation of the data likelihood is achieved without registering the landmarks, by applying bridge sampling using a stochastically perturbed version of the large deformation gradient flow algorithm. The method and the estimation algorithms are experimentally validated on synthetic examples and shape data of human corpora callosa.

1612.04915 2026-06-04 cs.RO cs.SY eess.SY

Collaborative Object Transportation Using MAVs via Passive Force Control

利用MAVs的被动力控进行协同物体运输

Andrea Tagliabue, Mina Kamel, Sebastian Verling, Roland Siegwart, Juan Nieto

AI总结 本文提出一种基于被动力控的MAVs协同运输策略,通过双六旋翼运输大型物体,无需通信链路、负载形状或抓取点位置信息。

Comments under review for the IEEE International Conference on Robotics and Automation (ICRA) 2017

详情
AI中文摘要

本文提出了一种基于被动力控的MAVs协同运输策略,旨在开发一种不依赖MAV之间通信链路、负载形状知识或抓取点位置的鲁棒方法。所提出的方法基于主从范式,其中从动代理通过顺应控制器保证对主代理施加于负载的外部力的顺应性。外部力作用于从动代理的估计是通过基于无迹卡尔曼滤波(UKF)的非线性估计器从视觉惯性导航系统提供的信息中进行估计。实验结果展示了力估计器的性能,并展示了1.2米长物体的协同运输。

英文摘要

This paper shows a strategy based on passive force control for collaborative object transportation using Micro Aerial Vehicles (MAVs), focusing on the transportation of a bulky object by two hexacopters. The goal is to develop a robust approach which does not rely on: (a) communication links between the MAVs, (b) the knowledge of the payload shape and (c) the position of grasping point. The proposed approach is based on the master-slave paradigm, in which the slave agent guarantees compliance to the external force applied by the master to the payload via an admittance controller. The external force acting on the slave is estimated using a non-linear estimator based on the Unscented Kalman Filter (UKF) from the information provided by a visual inertial navigation system. Experimental results demonstrate the performance of the force estimator and show the collaborative transportation of a 1.2 m long object.

1607.07797 2026-06-04 cs.RO cs.FL cs.SY eess.SY

Combined Top-Down and Bottom-Up Approaches to Performance-guaranteed Integrated Task and Motion Planning of Cooperative Multi-agent Systems

结合自上而下与自下而上方法的性能保证协同多智能体系统集成任务与运动规划

Rafael Rodrigues da Silva, Bo Wu, Jin Dai, Hai Lin

AI总结 本文提出一种分层框架,通过结合自下而上的反应式运动控制器与自上而下的任务计划,实现协同多智能体系统的性能保证任务与运动规划,通过假设-保证范式验证局部任务与全局任务的一致性。

Comments Submitted to Automatica

详情
AI中文摘要

我们提出了一种分层设计框架,用于自动合成协同多智能体系统的协调方案和控制策略,以满足正式的性能要求。通过将自下而上的反应式运动控制器与自上而下的任务计划相结合。一方面,从一个全局任务开始,该任务以所有智能体任务能力的正则语言指定,任务规划层位于所提出框架的顶部,将全局任务分解为与每个智能体个体能力一致的局部任务,并通过假设-保证范式论证局部任务的完成是否意味着全局任务的满足。另一方面,每个智能体关联的自下而上运动计划通过组合基本运动原语生成,这些原语通过差分动态逻辑(d$\mathcal{L}$)验证安全,通过可满足性模理论(SMT)求解器在面对局部任务要求和环境描述所施加的约束时搜索可行解。研究表明,所提出的框架能够处理动态环境,因为运动原语具有反应特性,使运动计划能够适应局部环境变化。此外,当SMT求解器无法找到可行解时,任务重新配置可以由运动规划层触发。整体设计框架的有效性通过自动化仓库案例研究得到验证。

英文摘要

We propose a hierarchical design framework to automatically synthesize coordination schemes and control policies for cooperative multi-agent systems to fulfill formal performance requirements, by associating a bottom-up reactive motion controller with a top-down mission plan. On one hand, starting from a global mission that is specified as a regular language over all the agents' mission capabilities, a mission planning layer sits on the top of the proposed framework, decomposing the global mission into local tasks that are in consistency with each agent's individual capabilities, and compositionally justifying whether the achievement of local tasks implies the satisfaction of the global mission via an assume-guarantee paradigm. On the other hand, bottom-up motion plans associated with each agent are synthesized corresponding to the obtained local missions by composing basic motion primitives, which are verified safe by differential dynamic logic (d$\mathcal{L}$), through a Satisfiability Modulo Theories (SMT) solver that searches feasible solutions in face of constraints imposed by local task requirements and the environment description. It is shown that the proposed framework can handle dynamical environments as the motion primitives possess reactive features, making the motion plans adaptive to local environmental changes. Furthermore, on-line mission reconfiguration can be triggered by the motion planning layer once no feasible solutions can be found through the SMT solver. The effectiveness of the overall design framework is validated by an automated warehouse case study.

1609.05258 2026-06-04 cs.RO cs.AI cs.CV cs.SY eess.SY

The ACRV Picking Benchmark (APB): A Robotic Shelf Picking Benchmark to Foster Reproducible Research

ACRV 摘取基准 (APB):一个促进可重复研究的机器人货架摘取基准

Jürgen Leitner, Adam W. Tow, Jake E. Dean, Niko Suenderhauf, Joseph W. Durham, Matthew Cooper, Markus Eich, Christopher Lehnert, Ruben Mangels, Christopher McCool, Peter Kujala, Lachlan Nicholson, Trung Pham, James Sergeant, Liao Wu, Fangyi Zhang, Ben Upcroft, Peter Corke

AI总结 本文提出ACRV摘取基准(APB),通过42个常见物品、广泛可用的货架和精确的物品排列指南,提供可重复的机器人摘取基准,支持完整机器人系统的比较。

Comments 8 pages, submitted to RA:Letters

详情
AI中文摘要

机器人挑战如亚马逊摘取挑战(APC)或DARPA挑战是推动科学进步的重要方式。它们使研究在明确的基准上进行比较,所有参与者享有相同的测试条件。然而,此类挑战事件仅偶尔举行,参赛人数有限,且测试条件难以在主事件后复制。我们提出一个新的物理基准挑战:ACRV摘取基准(APB)。该基准设计为可重复,包含42个常见物品、广泛可用的货架和精确的物品排列指南。明确的评估协议使完整机器人系统(包括感知和操作)的比较成为可能,而不仅仅是子系统。本文还描述并报告了基于Baxter机器人开放基线系统的实验结果。

英文摘要

Robotic challenges like the Amazon Picking Challenge (APC) or the DARPA Challenges are an established and important way to drive scientific progress. They make research comparable on a well-defined benchmark with equal test conditions for all participants. However, such challenge events occur only occasionally, are limited to a small number of contestants, and the test conditions are very difficult to replicate after the main event. We present a new physical benchmark challenge for robotic picking: the ACRV Picking Benchmark (APB). Designed to be reproducible, it consists of a set of 42 common objects, a widely available shelf, and exact guidelines for object arrangement using stencils. A well-defined evaluation protocol enables the comparison of \emph{complete} robotic systems -- including perception and manipulation -- instead of sub-systems only. Our paper also describes and reports results achieved by an open baseline system based on a Baxter robot.

1612.04324 2026-06-04 cs.RO cs.SY eess.SY

Stabilization and Trajectory Control of a Quadrotor with Uncertain Suspended Load

四旋翼载具带不确定悬挂负载的稳定与轨迹控制

Xu Zhou, Xiaoli Zhang, Jiucai Zhang, Rui Liu

AI总结 本文研究四旋翼搭载不确定悬挂负载时的稳定与轨迹控制问题,通过比较三种控制器性能,发现负载质量不确定性主要影响稳定而非轨迹跟踪,提出关键运动质量概念并验证鲁棒控制器的有效性。

Comments 56 pages, 12 figures, article submitted to ASME Journal of Dynamic Systems Measurement and Control, 2016 April

详情
AI中文摘要

近年来,四旋翼搭载固定已知质量悬挂负载的稳定与轨迹控制已被广泛研究。然而,负载质量不总是事先已知,或在实际运输中可能变化。这种质量不确定性会给四旋翼系统带来不确定扰动,导致现有控制器的稳定性和轨迹跟踪性能变差。为提高这种情况下四旋翼的稳定性与轨迹跟踪能力,本文全面研究了不确定负载质量对四旋翼的影响。通过比较三种不同控制器——比例导数(PD)控制器、滑模控制器(SMC)和模型预测控制器(MPC)的性能,证明稳定而非轨迹跟踪误差是负载质量不确定性的主要影响因素。存在一个关键运动质量,使四旋翼能够维持期望的运输性能。此外,仿真结果验证了具有强抗扰能力的控制器在实际应用中的有效性。

英文摘要

Stabilization and trajectory control of a quadrotor carrying a suspended load with a fixed known mass has been extensively studied in recent years. However, the load mass is not always known beforehand or may vary during the practical transportations. This mass uncertainty brings uncertain disturbances to the quadrotor system, causing existing controllers to have worse stability and trajectory tracking performance. To improve the quadrotor stability and trajectory tracking capability in this situation, we fully investigate the impacts of the uncertain load mass on the quadrotor. By comparing the performances of three different controllers -- the proportional-derivative (PD) controller, the sliding mode controller (SMC), and the model predictive controller (MPC) -- stabilization rather than trajectory tracking error is proved to be the main influence in the load mass uncertainty. A critical motion mass exists for the quadrotor to maintain a desired transportation performance. Moreover, simulation results verify that a controller with strong robustness against disturbances is a good choice for practical applications.

1607.08481 2026-06-04 cs.CV cs.NA math.NA

A Nonlocal Denoising Algorithm for Manifold-Valued Images Using Second Order Statistics

基于二阶统计的非局部去噪算法用于流形值图像

Friederike Laus, Mila Nikolova, Johannes Persch, Gabriele Steidl

AI总结 本文首次将非局部块方法推广到流形值图像,通过最小均方误差估计提出新的估计器,用于恢复流形值图像。

详情
AI中文摘要

非局部块方法,特别是Lebrun等人(2013)的贝叶斯方法,被认为是去噪(彩色)图像的有效方法,这些图像受到白高斯噪声影响。本文首次尝试将该技术推广到流形值图像。此类图像,例如具有相位或方向信息或值在对称正定矩阵流形上的图像,在现实应用中很常见。将正态分布推广到流形不是标准的,已有不同尝试。本文聚焦于一个直接的内在模型,并讨论特定流形的其他方法。我们将Lebrun等人的贝叶斯方法重新解释为最小均方误差估计,这促使我们定义相应的估计器。有了这个估计器,我们提出了一种非局部块方法用于恢复流形值图像。各种概念验证示例展示了所提算法的潜力。

英文摘要

Nonlocal patch-based methods, in particular the Bayes' approach of Lebrun, Buades and Morel (2013), are considered as state-of-the-art methods for denoising (color) images corrupted by white Gaussian noise of moderate variance. This paper is the first attempt to generalize this technique to manifold-valued images. Such images, for example images with phase or directional entries or with values in the manifold of symmetric positive definite matrices, are frequently encountered in real-world applications. Generalizing the normal law to manifolds is not canonical and different attempts have been considered. Here we focus on a straightforward intrinsic model and discuss the relation to other approaches for specific manifolds. We reinterpret the Bayesian approach of Lebrun et al. (2013) in terms of minimum mean squared error estimation, which motivates our definition of a corresponding estimator on the manifold. With this estimator at hand we present a nonlocal patch-based method for the restoration of manifold-valued images. Various proof of concept examples demonstrate the potential of the proposed algorithm.

1612.02739 2026-06-04 cs.RO cs.AI cs.LG cs.SY eess.SY

Controlling Robot Morphology from Incomplete Measurements

从不完整测量中控制机器人形态

Martin Pecka, Karel Zimmermann, Michal Reinštein, Tomáš Svoboda

AI总结 针对复杂形态机器人在城市搜索与救援任务中的地形穿越需求,提出通过自主控制处理不完整数据并确保安全性的方法。

Comments Accepted into IEEE Transactions to Industrial Electronics, Special Section on Motion Control for Novel Emerging Robotic Devices and Systems

详情
AI中文摘要

复杂形态的移动机器人对于在城市搜索与救援任务中穿越粗糙地形至关重要。由于远程操作复杂形态会增加操作员的认知负担,因此需要自主控制。自主控制会测量机器人状态和周围地形,通常只能部分观测,因此数据往往不完整。我们对缺失测量进行边缘化,并评估一个显式安全条件。如果安全条件被违反,身体安装的机械臂通过触觉探索收集缺失数据。

英文摘要

Mobile robots with complex morphology are essential for traversing rough terrains in Urban Search & Rescue missions (USAR). Since teleoperation of the complex morphology causes high cognitive load of the operator, the morphology is controlled autonomously. The autonomous control measures the robot state and surrounding terrain which is usually only partially observable, and thus the data are often incomplete. We marginalize the control over the missing measurements and evaluate an explicit safety condition. If the safety condition is violated, tactile terrain exploration by the body-mounted robotic arm gathers the missing data.

1606.07315 2026-06-04 cs.LG cs.NA math.NA

Nearly-optimal Robust Matrix Completion

近优鲁棒矩阵补全

Yeshwanth Cherapanamjeri, Kartik Gupta, Prateek Jain

AI总结 本文提出一种简单投影梯度下降方法,通过交替进行投影梯度下降和硬阈值清理来估计低秩矩阵,实现近最优观测和损坏数量的鲁棒矩阵补全,同时改进了低秩矩阵补全的时间复杂度。

详情
AI中文摘要

本文提出了一种简单的投影梯度下降方法,通过交替进行投影梯度下降和硬阈值清理来估计低秩矩阵,实现近最优观测和损坏数量的鲁棒矩阵补全,同时改进了低秩矩阵补全的时间复杂度。

英文摘要

In this paper, we consider the problem of Robust Matrix Completion (RMC) where the goal is to recover a low-rank matrix by observing a small number of its entries out of which a few can be arbitrarily corrupted. We propose a simple projected gradient descent method to estimate the low-rank matrix that alternately performs a projected gradient descent step and cleans up a few of the corrupted entries using hard-thresholding. Our algorithm solves RMC using nearly optimal number of observations as well as nearly optimal number of corruptions. Our result also implies significant improvement over the existing time complexity bounds for the low-rank matrix completion problem. Finally, an application of our result to the robust PCA problem (low-rank+sparse matrix separation) leads to nearly linear time (in matrix dimensions) algorithm for the same; existing state-of-the-art methods require quadratic time. Our empirical results corroborate our theoretical results and show that even for moderate sized problems, our method for robust PCA is an an order of magnitude faster than the existing methods.

1612.01034 2026-06-04 cs.RO cs.SY eess.SY

Localization of networked robot systems subject to random delay and packet loss

网络机器人系统在随机延迟和丢包下的定位

Manh Duong Phung, Thi Thanh Van Nguyen, Thuan Hoang Tran, Quang Vinh Tran

AI总结 本文针对通信延迟和丢包影响下的移动机器人定位问题,提出统一的状态空间表示法和最优线性估计器,通过相关因子整合延迟测量,验证了方法在仿真和真实机器人系统中的有效性。

Comments In 2013 IEEE/ASME International Conference on Advanced Intelligent Mechatronics (AIM

详情
AI中文摘要

本文针对移动机器人在通信延迟和丢包下的定位问题,构建了统一的状态空间表示法以描述混合不确定性。基于该表示法,开发了最优线性估计器,通过相关因子整合延迟测量以提高估计精度。该估计器进一步扩展至非线性系统。在MATLAB仿真和真实机器人系统实验中验证了该方法的性能,良好的定位结果证明了其在网络化移动机器人定位中的有效性。

英文摘要

This paper deals with the localization problem of mobile robot subject to communication delay and packet loss. The delay and loss may appear in a random fashion in both control inputs and observation measurements. A unified state-space representation is constructed to describe these mixed uncertainties. Based on it, the optimal linear estimator is developed. The main idea is the derivation of a relevance factor to incorporate delayed measurements to the being estimate. The estimator is then extended for nonlinear systems. The performance of this method is tested within the simulations in MATLAB and the experiments in a real robot system. The good localization results prove the efficiency of the method for the purpose of localization of networked mobile robot.

1611.09436 2026-06-04 cs.RO cs.SY eess.SY

Proposal of algorithms for navigation and obstacles avoidance of autonomous mobile robot

自主移动机器人导航与障碍物避障算法的提出

T. T. Hoang, D. T. Hiep, P. M. Duong, N. T. T. Van, B. G. Duong, T. Q. Vinh

AI总结 本文提出算法用于室内自主移动机器人导航与避障,利用激光雷达获取3D环境图像,通过改进的向量场直方图算法生成2D地图并控制轨迹跟踪,实验结果良好。

Comments In 2013 8th IEEE Conference on Industrial Electronics and Applications (ICIEA)

详情
AI中文摘要

本文提出了用于室内自主移动机器人导航和避障的算法。使用激光雷达获取环境的3D图像,提出新的3D到2D图像压力和障碍物检测(IPaBD)算法,用于生成2D全局地图。该地图是设计轨迹的基础。开发了跟踪控制器以使机器人跟随轨迹。利用超声波传感器处理障碍物避障问题。提出了改进的向量场直方图(改进的VFH)算法,以克服原始VFH的一些限制。已进行实验,结果令人鼓舞。

英文摘要

This paper presents algorithms to navigate and avoid obstacles for an in-door autonomous mobile robot. A laser range finder is used to obtain 3D images of the environment. A new algorithm, namely 3D-to-2D image pressure and barriers detection (IPaBD), is proposed to create a 2D global map from the 3D images. This map is basic to design the trajectory. A tracking controller is developed to control the robot to follow the trajectory. The obstacle avoidance is addressed with the use of sonar sensors. An improved vector field histogram (Improved-VFH) algorithm is presented with improvements to overcome some limitations of the original VFH. Experiments have been conducted and the result is encouraged.

1611.09433 2026-06-04 cs.RO cs.HC cs.SY eess.SY

A novel platform for internet-based mobile robot systems

一种基于互联网的移动机器人系统新平台

P. M. Duong, T. T. Hoang, N. T. T. Van, D. A. Viet, T. Q. Vinh

AI总结 本文提出一种基于互联网的移动机器人系统软硬件架构,通过3G网络连接多传感器智能机器人,采用客户端-服务器架构实现数据传输,并通过避障和安全点达成等自主机制确保安全,为远程控制算法等研究提供实验平台。

Comments In 2012 7th IEEE Conference on Industrial Electronics and Applications (ICIEA)

详情
AI中文摘要

本文介绍了一种用于在线移动机器人系统的软硬件结构。硬件部分主要由通过3G移动网络连接到互联网的多传感器智能机器人组成。系统采用客户端-服务器软件架构,客户端和服务器之间的数据传输通过不同的传输协议进行。自主机制如避障和安全点达成被实现以确保机器人安全。该架构已在真实互联网上投入使用,初步结果令人鼓舞。通过采用这种结构,将非常容易构建用于研究各种远程操作主题(如远程控制算法、界面设计、网络协议和应用等)的实验平台。

英文摘要

In this paper, we introduce a software and hardware structure for on-line mobile robotic systems. The hardware mainly consists of a Multi-Sensor Smart Robot connected to the Internet through 3G mobile network. The system employs a client-server software architecture in which the exchanged data between the client and the server is transmitted through different transport protocols. Autonomous mechanisms such as obstacle avoidance and safe-point achievement are implemented to ensure the robot safety. This architecture is put into operation on the real Internet and the preliminary result is promising. By adopting this structure, it will be very easy to construct an experimental platform for the research on diverse tele-operation topics such as remote control algorithms, interface designs, network protocols and applications etc.

1611.09431 2026-06-04 cs.RO cs.SY eess.SY

Localization of a unicycle-like mobile robot using LRF and omni-directional camera

使用LRF和 omnidirectional相机对类双轮车移动机器人进行定位

Tran Hiep Dinh, Manh Duong Phung, Thuan Hoang Tran, Quang Vinh Tran

AI总结 本文提出利用扩展卡尔曼滤波器对配备LRF和 omnidirectional相机的类双轮车移动机器人进行定位,通过改进的最小二乘二次方法提取环境线段并利用线匹配算法提高定位精度。

Comments In 2012 IEEE International Conference on Control System, Computing and Engineering (ICCSCE)

详情
AI中文摘要

本文针对定位问题,采用扩展卡尔曼滤波器(EKF)对配备激光雷达(LRF)传感器和 omnidirectional相机的类双轮车移动机器人进行定位。LRF用于扫描环境并用线段描述,线段通过改进的最小二乘二次方法提取,其中引入动态阈值。相机用于确定机器人方位。EKF的预测步骤通过提取机器人运动学模型和输入信号参数进行,修正步骤通过实现线匹配算法和比较局部与全局地图中线段参数进行。线匹配算法中引入转换矩阵以降低计算成本。在真实移动机器人系统中进行了实验,结果证明了该方法在定位中的适用性。

英文摘要

This paper addresses the localization problem. The extended Kalman filter (EKF) is employed to localize a unicycle-like mobile robot equipped with a laser range finder (LRF) sensor and an omni-directional camera. The LRF is used to scan the environment which is described with line segments. The segments are extracted by a modified least square quadratic method in which a dynamic threshold is injected. The camera is employed to determine the robot's orientation. The prediction step of the EKF is performed by extracting parameters from the kinematic model and input signal of the robot. The correction step is conducted with the implementation of a line matching algorithm and the comparison between line's parameters of the local and global maps. In the line matching algorithm, a conversion matrix is introduced to reduce the computation cost. Experiments have been carried out in a real mobile robot system and the results prove the applicability of the method for the purpose of localization.

1611.09424 2026-06-04 cs.RO cs.SY eess.SY

Development of a multi-sensor perceptual system for mobile robot and EKF-based localization

移动机器人多传感器感知系统的发展及基于EKF的定位方法

T. T. Hoang, P. M. Duong, N. T. T. Van, D. A. Viet, T. Q. Vinh

AI总结 本文提出一种基于现代传感器和多点通信通道的移动机器人感知系统,采用传感器融合模型处理数据以提高机器人定位与控制的准确性,通过扩展卡尔曼滤波器优化系统状态。

Comments In 2012 International Conference on Systems and Informatics (ICSAI). arXiv admin note: substantial text overlap with arXiv:1611.07112, arXiv:1611.07114

详情
AI中文摘要

本文介绍了移动机器人感知系统的设计与实现,利用现代传感器和多点通信通道获取数据,并通过传感器融合模型处理以获得对机器人定位和控制有意义的信息。由于数据获取的不确定性,应用扩展卡尔曼滤波器以获得系统的最优状态。已进行了若干实验,结果证实了感知系统的有效运行和卡尔曼滤波方法的高效性。

英文摘要

This paper presents the design and implementation of a perceptual system for the mobile robot using modern sensors and multi-point communication channels. The data extracted from the perceptual system is processed by a sensor fusion model to obtain meaningful information for the robot localization and control. Due to the uncertainties of acquiring data, an extended Kalman filter was applied to get optimal states of the system. Several experiments have been conducted and the results confirmed the functioning operation of the perceptual system and the efficiency of the Kalman filter approach.

1611.05977 2026-06-04 cs.LG cs.NA math.NA stat.AP stat.ML

Robust and Scalable Column/Row Sampling from Corrupted Big Data

鲁棒且可扩展的列/行采样从受腐蚀的大数据

Mostafa Rahmani, George Atia

AI总结 本文提出新的采样算法,能在严重数据腐蚀下定位信息列,并开发可扩展的随机化设计,同时对稀疏腐蚀和异常值具有鲁棒性,实验显示优于现有鲁棒采样算法。

详情
AI中文摘要

传统采样技术在数据严重腐蚀时无法生成数据描述性草图,因为此类腐蚀破坏了其所需的低秩结构。本文提出新的采样算法,可在存在严重数据腐蚀时定位信息列,并开发新的可扩展随机化设计。所提方法同时对稀疏腐蚀和异常值具有鲁棒性,并通过真实和合成数据的实验表明显著优于现有鲁棒采样算法。

英文摘要

Conventional sampling techniques fall short of drawing descriptive sketches of the data when the data is grossly corrupted as such corruptions break the low rank structure required for them to perform satisfactorily. In this paper, we present new sampling algorithms which can locate the informative columns in presence of severe data corruptions. In addition, we develop new scalable randomized designs of the proposed algorithms. The proposed approach is simultaneously robust to sparse corruption and outliers and substantially outperforms the state-of-the-art robust sampling algorithms as demonstrated by experiments conducted using both real and synthetic data.

1311.3979 2026-06-04 cs.RO cs.SY eess.SY

Precision improvement of MEMS gyros for indoor mobile robots with horizontal motion inspired by methods of TRIZ

为室内移动机器人改进MEMS陀螺仪精度的水平运动启发的TRIZ方法

Dongmyoung Shin, Sung Gil Park, Byung Soo Song, Eung Su Kim, Oleg Kupervasser, Denis Pivovartchuk, Ilya Gartseev, Oleg Antipov, Evgeniy Kruchenkov, Alexey Milovanov, Andrey Kochetov, Igor Sazonov, Igor Nogtev, Sun Woo Hyun

AI总结 本文利用TRIZ方法解决室内移动机器人中水平运动下MEMS陀螺仪精度提升问题,通过创新方法提高传感器性能。

Comments 6 pages, the paper is accepted to 9th IEEE International Conference on Nano/Micro Engineered and Molecular Systems, Hawaii, USA (IEEE-NEMS 2014) as an oral presentation

详情
Journal ref
Proceedings of 9th IEEE International Conference on Nano/Micro Engineered and Molecular Systems (IEEE-NEMS 2014) April 13-16, 2014,Hawaii,USA, pp 102-107
AI中文摘要

在本文中,通过TRIZ(

英文摘要

In the paper, the problem of precision improvement for the MEMS gyrosensors on indoor robots with horizontal motion is solved by methods of TRIZ ("the theory of inventive problem solving").

1602.05703 2026-06-04 cs.LG cs.SY eess.SY

Adaptive Least Mean Squares Estimation of Graph Signals

自适应最小均方图信号估计

Paolo Di Lorenzo, Sergio Barbarossa, Paolo Banelli, Stefania Sardellitti

AI总结 本文提出一种自适应图信号估计方法,通过最小均方策略实现带限图信号的重建与跟踪,结合理论分析与数值实验验证了方法的有效性,并提出在线适应的图采样策略。

Comments Submitted to IEEE Transactions on Signal and Information Processing over Networks

详情
AI中文摘要

本文旨在提出一种最小均方(LMS)策略,用于自适应估计定义在图上的信号。假设图信号在已知带宽下带限,该方法能够在有限观测下实现保证均方误差性能的重建与跟踪。详细的均方分析提供了所提方法的性能,并导致了设计有用的图信号采样策略的若干见解。数值结果验证了我们的理论发现,并展示了所提方法的性能。此外,为应对带宽未知的情况,我们提出了一种在图频域中进行稀疏在线估计信号支持的方法,从而实现了图采样策略的在线适应。最后,我们应用所提方法在认知网络环境中构建给定操作区域的功率空间密度制图。

英文摘要

The aim of this paper is to propose a least mean squares (LMS) strategy for adaptive estimation of signals defined over graphs. Assuming the graph signal to be band-limited, over a known bandwidth, the method enables reconstruction, with guaranteed performance in terms of mean-square error, and tracking from a limited number of observations over a subset of vertices. A detailed mean square analysis provides the performance of the proposed method, and leads to several insights for designing useful sampling strategies for graph signals. Numerical results validate our theoretical findings, and illustrate the performance of the proposed method. Furthermore, to cope with the case where the bandwidth is not known beforehand, we propose a method that performs a sparse online estimation of the signal support in the (graph) frequency domain, which enables online adaptation of the graph sampling strategy. Finally, we apply the proposed method to build the power spatial density cartography of a given operational region in a cognitive network environment.

1511.05261 2026-06-04 cs.CV cs.LG cs.NA math.NA stat.ML

Robust PCA via Nonconvex Rank Approximation

通过非凸秩近似实现鲁棒PCA

Zhao Kang, Chong Peng, Qiang Cheng

AI总结 本文提出非凸秩近似方法,以改进鲁棒PCA中核范数的局限性,通过高效算法提升准确性和效率。

Comments IEEE International Conference on Data Mining

详情
AI中文摘要

在数据挖掘和机器学习中,许多应用需要恢复低秩矩阵。鲁棒主成分分析(RPCA)是处理此类问题的通用框架。RPCA中核范数作为秩函数的凸替代物被广泛研究。在某些假设下,它可以以高概率恢复底层低秩矩阵。然而,这些假设可能在实际应用中不成立。由于核范数通过将所有奇异值相加来近似秩,即本质上是奇异值的ℓ1范数,因此产生的近似误差并不 trivial,导致最终的矩阵估计器可能有显著偏差。为寻求更接近的近似并缓解核范数的上述限制,我们提出了一种非凸秩近似。这种对矩阵秩的近似比核范数更紧密。为了解决相关的非凸最小化问题,我们开发了高效的增广拉格朗日乘子优化算法。实验结果表明,我们的方法在准确性和效率上均优于当前最先进的算法。

英文摘要

Numerous applications in data mining and machine learning require recovering a matrix of minimal rank. Robust principal component analysis (RPCA) is a general framework for handling this kind of problems. Nuclear norm based convex surrogate of the rank function in RPCA is widely investigated. Under certain assumptions, it can recover the underlying true low rank matrix with high probability. However, those assumptions may not hold in real-world applications. Since the nuclear norm approximates the rank by adding all singular values together, which is essentially a $\ell_1$-norm of the singular values, the resulting approximation error is not trivial and thus the resulting matrix estimator can be significantly biased. To seek a closer approximation and to alleviate the above-mentioned limitations of the nuclear norm, we propose a nonconvex rank approximation. This approximation to the matrix rank is tighter than the nuclear norm. To solve the associated nonconvex minimization problem, we develop an efficient augmented Lagrange multiplier based optimization algorithm. Experimental results demonstrate that our method outperforms current state-of-the-art algorithms in both accuracy and efficiency.

1510.06469 2026-06-04 cs.RO cs.SY eess.SY

Optimal Temporal Logic Planning in Probabilistic Semantic Maps

在概率语义地图中进行最优时间逻辑规划

Jie Fu, Nikolay Atanasov, Ufuk Topcu, George J. Pappas

AI总结 本文研究了在概率语义地图中基于时间逻辑约束的机器人运动规划问题,提出通过引入置信度参数delta将随机控制问题转化为确定性最短路径问题,并设计启发函数以提高A*算法的效率和正确性。

Comments 8 pages, 6 figures. submitted to IEEE International Conference on Robotics and Automation 2016

详情
AI中文摘要

本文考虑了在通过语义同时定位与建图(SLAM)获得的概率地图中,机器人运动规划受时间逻辑约束的问题。地图分布的不确定性对保证线性时间逻辑(LTL)规范的正确性构成重大挑战。我们展示该问题可转化为一个最优控制问题,其中语义地图和逻辑公式评估都是随机的。我们的第一项贡献是通过引入置信度参数delta,将LTL子类的随机控制问题转化为确定性最短路径问题。从确定性问题获得的机器人轨迹在真实环境中具有最小成本并以概率delta满足逻辑规范。我们的第二项贡献是设计了一个可接受的启发函数,引导确定性问题的规划朝着满足时间逻辑规范的方向进行。这使我们能够使用A*算法获得最优且高效的解决方案。在模拟语义环境中使用差分驱动机器人验证了我们方法的性能和正确性。

英文摘要

This paper considers robot motion planning under temporal logic constraints in probabilistic maps obtained by semantic simultaneous localization and mapping (SLAM). The uncertainty in a map distribution presents a great challenge for obtaining correctness guarantees with respect to the linear temporal logic (LTL) specification. We show that the problem can be formulated as an optimal control problem in which both the semantic map and the logic formula evaluation are stochastic. Our first contribution is to reduce the stochastic control problem for a subclass of LTL to a deterministic shortest path problem by introducing a confidence parameter $delta$. A robot trajectory obtained from the deterministic problem is guaranteed to have minimum cost and to satisfy the logic specification in the true environment with probability $delta$. Our second contribution is to design an admissible heuristic function that guides the planning in the deterministic problem towards satisfying the temporal logic specification. This allows us to obtain an optimal and very efficient solution using the A* algorithm. The performance and correctness of our approach are demonstrated in a simulated semantic environment using a differential-drive robot.

1611.05095 2026-06-04 cs.LG cs.RO cs.SY eess.SY

Learning Dexterous Manipulation Policies from Experience and Imitation

从经验与模仿中学习灵巧操作策略

Vikash Kumar, Abhishek Gupta, Emanuel Todorov, Sergey Levine

AI总结 本文研究了通过经验与模仿学习反馈控制灵巧五指手非抓取操作的任务,提出基于轨迹优化的局部控制器,并通过深度学习和最近邻方法进行泛化,展示了小数据训练下的有效性和盲操作优势。

Comments Initial draft for a journal submission

详情
AI中文摘要

我们探索了基于学习的反馈控制方法,用于控制执行非抓取操作的灵巧五指手。首先,我们学习了能够从预定义初始状态开始执行任务的局部控制器。这些控制器是通过轨迹优化构建的,基于从传感器数据直接学习到的局部线性时变模型。在某些情况下,我们使用通过虚拟环境中的遥控收集的人类示范来初始化优化器。我们证明,这些控制器在模拟和物理平台上都能在初始条件的有限范围内稳健地执行任务。然后,我们考虑了两种泛化方法:深度学习和最近邻。我们发现最近邻方法性能更高。然而,神经网络也有其优势:它仅使用触觉和本体感觉反馈,而没有关于物体的视觉反馈(即盲操作),并且学习了一个时间不变的策略。相比之下,最近邻方法根据运动捕捉感知的初始物体状态切换时间变化的局部控制器。尽管两种泛化方法仍有改进空间,我们的工作表明(i)复杂的非抓取操作任务的局部轨迹控制器可以从惊人的少量训练数据中构建,(ii)此类控制器的集合可以插值形成更全局的控制器。结果总结在补充视频中:https://youtu.be/E0wmO6deqjo

英文摘要

We explore learning-based approaches for feedback control of a dexterous five-finger hand performing non-prehensile manipulation. First, we learn local controllers that are able to perform the task starting at a predefined initial state. These controllers are constructed using trajectory optimization with respect to locally-linear time-varying models learned directly from sensor data. In some cases, we initialize the optimizer with human demonstrations collected via teleoperation in a virtual environment. We demonstrate that such controllers can perform the task robustly, both in simulation and on the physical platform, for a limited range of initial conditions around the trained starting state. We then consider two interpolation methods for generalizing to a wider range of initial conditions: deep learning, and nearest neighbors. We find that nearest neighbors achieve higher performance. Nevertheless, the neural network has its advantages: it uses only tactile and proprioceptive feedback but no visual feedback about the object (i.e. it performs the task blind) and learns a time-invariant policy. In contrast, the nearest neighbors method switches between time-varying local controllers based on the proximity of initial object states sensed via motion capture. While both generalization methods leave room for improvement, our work shows that (i) local trajectory-based controllers for complex non-prehensile manipulation tasks can be constructed from surprisingly small amounts of training data, and (ii) collections of such controllers can be interpolated to form more global controllers. Results are summarized in the supplementary video: https://youtu.be/E0wmO6deqjo

1308.5133 2026-06-04 cs.RO cs.NE cs.SY eess.SY

Performance Measurement Under Increasing Environmental Uncertainty In The Context of Interval Type-2 Fuzzy Logic Based Robotic Sailing

在区间型2模糊逻辑基础上的机器人航海中环境不确定性增加时的性能测量

Naisan Benatar, Uwe Aickelin, Jonathan M. Garibald

AI总结 本文探讨了在环境不确定性显著变化时,传统性能指标如RMSE的不足,提出更复杂的比较方法,应用于机器人控制问题,证明其比简单方法更稳健。

Comments International Conference on Fuzzy Systems 2013 (Fuzz-IEEE 2013)

详情
AI中文摘要

基于模糊逻辑的机器人控制器在不确定环境下的性能测量是一个被当前文献忽视的领域。本文指出标准指标如RMSE在环境不确定性显著变化时不合适。本文综述了其他作者应用的方法,设计了更复杂的比较方法,并将其应用于机器人控制问题,与单一指标进行比较。结果表明,所描述的技术比更简单的方法提供了更稳健的性能比较,允许更好的比较。

英文摘要

Performance measurement of robotic controllers based on fuzzy logic, operating under uncertainty, is a subject area which has been somewhat ignored in the current literature. In this paper standard measures such as RMSE are shown to be inappropriate for use under conditions where the environmental uncertainty changes significantly between experiments. An overview of current methods which have been applied by other authors is presented, followed by a design of a more sophisticated method of comparison. This method is then applied to a robotic control problem to observe its outcome compared with a single measure. Results show that the technique described provides a more robust method of performance comparison than less complex methods allowing better comparisons to be drawn.

1410.7057 2026-06-04 cs.LG cs.DC cs.SY eess.SY stat.ML

Sparse Distributed Learning via Heterogeneous Diffusion Adaptive Networks

稀疏分布式学习 via 异质扩散自适应网络

Bijit Kumar Das, Mrityunjoy Chakraborty, Jerónimo Arenas-García

AI总结 本文提出通过异质扩散自适应网络实现稀疏参数向量的分布式估计,通过选择性应用凸正则化方法减少计算开销,同时保持最优性能。

Comments 4 pages, 1 figure, conference, submitted to IEEE ISCAS 2015, Lisbon, Portugal

详情
AI中文摘要

近年来,关于通过扩散LMS策略在网内进行稀疏参数向量分布式估计的研究已有所涉及。在所有现有工作中,每个网络节点都使用了一些凸正则化方法,以实现优于简单扩散LMS的整体网络性能,尽管这导致了计算开销的增加。本文提供了分析和实验结果,表明凸正则化可以仅应用于某些选定的节点,其余节点保持稀疏性无感知,同时仍能实现与在所有节点上部署凸正则化相同最优行为。由于在部分节点中采用无正则化学习,所提出的方法需要更少的计算成本。我们还提供了一条选择稀疏感知节点的指南和最优正则化参数的闭式表达式。

英文摘要

In-network distributed estimation of sparse parameter vectors via diffusion LMS strategies has been studied and investigated in recent years. In all the existing works, some convex regularization approach has been used at each node of the network in order to achieve an overall network performance superior to that of the simple diffusion LMS, albeit at the cost of increased computational overhead. In this paper, we provide analytical as well as experimental results which show that the convex regularization can be selectively applied only to some chosen nodes keeping rest of the nodes sparsity agnostic, while still enjoying the same optimum behavior as can be realized by deploying the convex regularization at all the nodes. Due to the incorporation of unregularized learning at a subset of nodes, less computational cost is needed in the proposed approach. We also provide a guideline for selection of the sparsity aware nodes and a closed form expression for the optimum regularization parameter.

1611.03372 2026-06-04 cs.RO cs.AI cs.SE cs.SY eess.SY

A stochastically verifiable autonomous control architecture with reasoning

一种具有推理能力的随机可验证自主控制架构

Paolo Izzo, Hongyang Qu, Sandor M. Veres

AI总结 本文提出一种具有推理能力的随机可验证自主控制架构LISA,通过将系统抽象为DTMC和MDP模型,实现代理与环境的概率验证,提升设计与运行时的验证效率。

Comments Accepted at IEEE Conf. Decision and Control, 2016

详情
AI中文摘要

本文介绍了一种名为有限指令集代理(LISA)的新代理架构,用于自主控制。该架构基于先前的AgentSpeak实现,结构比其前身更简单,旨在促进设计时和运行时的验证方法。研究并展示了将LISA系统抽象为两种不同的离散概率模型(DTMC和MDP)的过程。LISA系统为代理和环境的完整建模提供了工具,用于概率验证。代理程序可以自动编译为DTMC或MDP模型进行验证,使用Prism工具。自动生成的Prism模型可用于设计时和运行时的验证。运行时验证在LISA系统中作为内部建模机制,用于预测未来的 outcomes。

英文摘要

A new agent architecture called Limited Instruction Set Agent (LISA) is introduced for autonomous control. The new architecture is based on previous implementations of AgentSpeak and it is structurally simpler than its predecessors with the aim of facilitating design-time and run-time verification methods. The process of abstracting the LISA system to two different types of discrete probabilistic models (DTMC and MDP) is investigated and illustrated. The LISA system provides a tool for complete modelling of the agent and the environment for probabilistic verification. The agent program can be automatically compiled into a DTMC or a MDP model for verification with Prism. The automatically generated Prism model can be used for both design-time and run-time verification. The run-time verification is investigated and illustrated in the LISA system as an internal modelling mechanism for prediction of future outcomes.

1610.05838 2026-06-04 cs.LG cs.NA math.NA

CuMF_SGD: Fast and Scalable Matrix Factorization

CuMF_SGD:快速且可扩展的矩阵分解

Xiaolong Xie, Wei Tan, Liana L. Fong, Yun Liang

AI总结 本文提出CuMF_SGD,利用GPU高带宽内存和快节点连接加速大规模矩阵分解,通过批量Hogwild!和波前更新方案及优化内核,在单CPU和多GPU上实现3.1X-28.2X的加速。

详情
AI中文摘要

矩阵分解(MF)已广泛应用于推荐系统、主题建模和词嵌入等领域。随机梯度下降(SGD)因其能处理大数据集和易于增量学习而流行。我们发现SGD用于MF是内存受限的。单节点CPU系统带缓存仅适用于小数据集;分布式系统具有更高的聚合内存带宽但网络连接相对较慢。这一观察启发我们通过利用GPU的高内存带宽和快速节点连接来加速MF。我们提出了cuMF_SGD,一种基于CUDA的SGD解决方案用于大规模MF问题。在单个CPU上,我们设计了两种工作负载调度方案,即批量Hogwild!和波前更新,充分利用大量核心。特别是,批量Hogwild!作为Hogwild!的向量版本克服了内存不连续的问题。我们还开发了高度优化的SGD更新内核,利用缓存、 warp-shuffle指令和半精度浮点数。我们还设计了分区方案以利用多个GPU,同时解决SGD并行化时的收敛问题。在仅使用一个Maxwell或Pascal GPU的三个数据集上,cuMF_SGD相比1-64个CPU节点的最新CPU解决方案快3.1X-28.2X。评估还显示cuMF_SGD在大数据集上能良好扩展到多个GPU。

英文摘要

Matrix factorization (MF) has been widely used in e.g., recommender systems, topic modeling and word embedding. Stochastic gradient descent (SGD) is popular in solving MF problems because it can deal with large data sets and is easy to do incremental learning. We observed that SGD for MF is memory bound. Meanwhile, single-node CPU systems with caching performs well only for small data sets; distributed systems have higher aggregated memory bandwidth but suffer from relatively slow network connection. This observation inspires us to accelerate MF by utilizing GPUs's high memory bandwidth and fast intra-node connection. We present cuMF_SGD, a CUDA-based SGD solution for large-scale MF problems. On a single CPU, we design two workload schedule schemes, i.e., batch-Hogwild! and wavefront-update that fully exploit the massive amount of cores. Especially, batch-Hogwild! as a vectorized version of Hogwild! overcomes the issue of memory discontinuity. We also develop highly-optimized kernels for SGD update, leveraging cache, warp-shuffle instructions and half-precision floats. We also design a partition scheme to utilize multiple GPUs while addressing the well-known convergence issue when parallelizing SGD. On three data sets with only one Maxwell or Pascal GPU, cuMF_SGD runs 3.1X-28.2X as fast compared with state-of-art CPU solutions on 1-64 CPU nodes. Evaluations also show that cuMF_SGD scales well on multiple GPUs in large data sets.

1611.01142 2026-06-04 cs.LG cs.SY eess.SY

Using a Deep Reinforcement Learning Agent for Traffic Signal Control

使用深度强化学习代理进行交通信号控制

Wade Genders, Saiedeh Razavi

AI总结 本文提出一种基于深度强化学习的交通信号控制系统,通过离散交通状态编码和Q-learning训练,有效减少交通延误、队列长度和旅行时间。

详情
AI中文摘要

确保交通系统高效是现代社会的优先事项。技术进步使得交通系统能够收集前所未有的大量多样化数据。我们提出了一种利用这种高质量数据的交通信号控制系统,与现有系统相比抽象程度较低。我们应用现代深度强化学习方法,在交通微观模拟器SUMO中构建了一个真正自适应的交通信号控制代理。我们提出了一种新的状态空间,即离散交通状态编码,信息密度高。该离散交通状态编码作为输入传递给深度卷积神经网络,通过经验回放进行训练。我们的代理与一个具有单隐藏层的神经网络交通信号控制代理进行了比较,平均累计延迟减少了82%,平均队列长度减少了66%,平均旅行时间减少了20%。

英文摘要

Ensuring transportation systems are efficient is a priority for modern society. Technological advances have made it possible for transportation systems to collect large volumes of varied data on an unprecedented scale. We propose a traffic signal control system which takes advantage of this new, high quality data, with minimal abstraction compared to other proposed systems. We apply modern deep reinforcement learning methods to build a truly adaptive traffic signal control agent in the traffic microsimulator SUMO. We propose a new state space, the discrete traffic state encoding, which is information dense. The discrete traffic state encoding is used as input to a deep convolutional neural network, trained using Q-learning with experience replay. Our agent was compared against a one hidden layer neural network traffic signal control agent and reduces average cumulative delay by 82%, average queue length by 66% and average travel time by 20%.

1610.08127 2026-06-04 cs.LG cs.AI cs.NA math.NA stat.ML

Fast Bayesian Non-Negative Matrix Factorisation and Tri-Factorisation

快速的贝叶斯非负矩阵分解与三因子分解

Thomas Brouwer, Jes Frellsen, Pietro Lio'

AI总结 本文提出一种快速变分贝叶斯算法,用于非负矩阵分解和三因子分解,相比Gibbs采样和非概率方法,该方法在迭代和时间步收敛速度更快,且无需额外样本估计后验。

Comments NIPS 2016 Workshop on Advances in Approximate Bayesian Inference

详情
AI中文摘要

我们提出了一种快速的变分贝叶斯算法,用于执行非负矩阵分解和三因子分解。我们证明了我们的方法在每次迭代和时间步(墙钟时间)上的收敛速度比Gibbs采样和非概率方法更快,并且不需要额外的样本来估计后验。我们特别展示了对于矩阵三因子分解,收敛具有挑战性,但我们的变分贝叶斯方法提供了一种快速的解决方案,使三因子分解方法能够更有效地使用。

英文摘要

We present a fast variational Bayesian algorithm for performing non-negative matrix factorisation and tri-factorisation. We show that our approach achieves faster convergence per iteration and timestep (wall-clock) than Gibbs sampling and non-probabilistic approaches, and do not require additional samples to estimate the posterior. We show that in particular for matrix tri-factorisation convergence is difficult, but our variational Bayesian approach offers a fast solution, allowing the tri-factorisation approach to be used more effectively.

1606.00119 2026-06-04 cs.LG cs.SY eess.SY stat.ML

Contextual Bandits with Latent Confounders: An NMF Approach

具有潜在混杂因素的上下文老虎机:一种NMF方法

Rajat Sen, Karthikeyan Shanmugam, Murat Kocaoglu, Alexandros G. Dimakis, Sanjay Shakkottai

AI总结 本文提出基于NMF的ε-贪心算法,通过低维结构学习与最优臂选择平衡,实现在线矩阵补全的 regret 保障,适用于高维数据场景。

Comments 37 pages, 2 figures

详情
AI中文摘要

受在线推荐和广告系统启发,本文考虑了具有潜在低维混杂因子的随机上下文老虎机因果模型。在该模型中,L个观察到的上下文和K个臂之间通过潜在混杂因子相关联。臂选择和潜在混杂因子因果决定奖励,而观察到的上下文与混杂因子相关。在此模型下,L×K的均值奖励矩阵U可分解为非负因子A和W。本文提出ε-贪心NMF-Bandit算法,通过干预序列选择臂,实现学习低维结构与最小化遗憾的平衡。算法在时间T时的遗憾为O(Lpoly(m,logK)logT),相较于传统上下文老虎机的O(LKlogT)更优。这些保证基于较弱的统计RIP条件。此外,本文提出一类生成模型满足充分条件,并推导出O(KmlogT)的下界。这些是首次针对在线矩阵补全与老虎机反馈的regret保证,当秩大于一时。

英文摘要

Motivated by online recommendation and advertising systems, we consider a causal model for stochastic contextual bandits with a latent low-dimensional confounder. In our model, there are $L$ observed contexts and $K$ arms of the bandit. The observed context influences the reward obtained through a latent confounder variable with cardinality $m$ ($m \ll L,K$). The arm choice and the latent confounder causally determines the reward while the observed context is correlated with the confounder. Under this model, the $L \times K$ mean reward matrix $\mathbf{U}$ (for each context in $[L]$ and each arm in $[K]$) factorizes into non-negative factors $\mathbf{A}$ ($L \times m$) and $\mathbf{W}$ ($m \times K$). This insight enables us to propose an $ε$-greedy NMF-Bandit algorithm that designs a sequence of interventions (selecting specific arms), that achieves a balance between learning this low-dimensional structure and selecting the best arm to minimize regret. Our algorithm achieves a regret of $\mathcal{O}\left(L\mathrm{poly}(m, \log K) \log T \right)$ at time $T$, as compared to $\mathcal{O}(LK\log T)$ for conventional contextual bandits, assuming a constant gap between the best arm and the rest for each context. These guarantees are obtained under mild sufficiency conditions on the factors that are weaker versions of the well-known Statistical RIP condition. We further propose a class of generative models that satisfy our sufficient conditions, and derive a lower bound of $\mathcal{O}\left(Km\log T\right)$. These are the first regret guarantees for online matrix completion with bandit feedback, when the rank is greater than one. We further compare the performance of our algorithm with the state of the art, on synthetic and real world data-sets.

1610.07722 2026-06-04 cs.LG cs.NA math.NA

Sparse Hierarchical Tucker Factorization and its Application to Healthcare

稀疏分层Tucker分解及其在医疗领域的应用

Ioakeim Perros, Robert Chen, Richard Vuduc, Jimeng Sun

AI总结 本文提出稀疏分层Tucker分解方法,用于处理稀疏高阶张量数据。该方法通过嵌套采样技术解决传统分层Tucker方法的可扩展性问题,提升了效率和准确性,并在医疗数据集上验证了其性能。

Comments This is an extended version of a paper presented at the 15th IEEE International Conference on Data Mining (ICDM 2015)

详情
AI中文摘要

我们提出了一种新的张量分解方法,称为稀疏分层-Tucker(Sparse H-Tucker),用于稀疏和高阶数据张量。Sparse H-Tucker受经典分层Tucker方法启发,旨在计算输入数据集的树状结构分解,可被领域专家解释。然而,Sparse H-Tucker采用嵌套采样技术克服了分层Tucker的关键可扩展性问题,即创建不可行的密集核心张量;我们的方法结果是一种更快、更节省空间且更准确的方法。我们广泛测试了该方法在一个真实医疗数据集上,该数据集来自30,000名患者,产生一个18阶稀疏数据张量。与竞争方法不同,Sparse H-Tucker可以在单个多线程机器上分析完整数据集。它比最先进的方法更准确且更快:在输入数据的12阶子集上,Sparse H-Tucker比之前最先进的方法准确度提高了18倍,速度提高了7.5倍。即使对于低阶张量(如4阶),我们的方法所需时间也接近一个数量级,内存使用也减少两个数量级,相比传统张量分解方法如CP和Tucker。此外,我们发现Sparse H-Tucker在非零张量元素数量上几乎线性扩展。所得到的模型还提供可解释的疾病层级,这已由临床专家验证。

英文摘要

We propose a new tensor factorization method, called the Sparse Hierarchical-Tucker (Sparse H-Tucker), for sparse and high-order data tensors. Sparse H-Tucker is inspired by its namesake, the classical Hierarchical Tucker method, which aims to compute a tree-structured factorization of an input data set that may be readily interpreted by a domain expert. However, Sparse H-Tucker uses a nested sampling technique to overcome a key scalability problem in Hierarchical Tucker, which is the creation of an unwieldy intermediate dense core tensor; the result of our approach is a faster, more space-efficient, and more accurate method. We extensively test our method on a real healthcare dataset, which is collected from 30K patients and results in an 18th order sparse data tensor. Unlike competing methods, Sparse H-Tucker can analyze the full data set on a single multi-threaded machine. It can also do so more accurately and in less time than the state-of-the-art: on a 12th order subset of the input data, Sparse H-Tucker is 18x more accurate and 7.5x faster than a previously state-of-the-art method. Even for analyzing low order tensors (e.g., 4-order), our method requires close to an order of magnitude less time and over two orders of magnitude less memory, as compared to traditional tensor factorization methods such as CP and Tucker. Moreover, we observe that Sparse H-Tucker scales nearly linearly in the number of non-zero tensor elements. The resulting model also provides an interpretable disease hierarchy, which is confirmed by a clinical expert.

1610.03518 2026-06-04 cs.RO cs.AI cs.LG cs.SY eess.SY

Transfer from Simulation to Real World through Learning Deep Inverse Dynamics Model

通过学习深度逆动力学模型实现仿真到现实世界的迁移

Paul Christiano, Zain Shah, Igor Mordatch, Jonas Schneider, Trevor Blackwell, Joshua Tobin, Pieter Abbeel, Wojciech Zaremba

AI总结 本文提出通过学习深度逆动力学模型,在仿真与现实世界之间实现控制策略的迁移,解决仿真与现实差异导致的性能下降问题。

详情
AI中文摘要

在仿真中开发控制策略通常比直接在现实世界中运行实验更实用和安全。这适用于通过规划和优化获得的策略,甚至更适用于通过强化学习获得的策略,后者通常非常数据密集。然而,仿真中成功的策略在部署到现实机器人时往往无法工作。然而,策略在仿真中执行的整体思路在现实世界中通常仍然有效。本文研究了此类场景,其中仿真中遍历的状态序列在现实世界中仍然合理,即使控制细节不同,例如摩擦、接触、质量和几何属性的差异。在执行过程中,我们的方法在每个时间步计算仿真基于的控制策略会做什么,但不执行这些控制在现实机器人上,而是计算仿真期望的下一个状态,并依赖于学习的深度逆动力学模型来决定最合适的现实世界动作以达到这些状态。深度模型只有在训练数据足够的情况下才有效,我们还提出了一种数据收集方法来(逐步)学习深度逆动力学模型。我们的实验表明,我们的方法在处理仿真到现实世界模型差异的各种基线方法中表现良好,包括输出误差控制和高斯动态适应。

英文摘要

Developing control policies in simulation is often more practical and safer than directly running experiments in the real world. This applies to policies obtained from planning and optimization, and even more so to policies obtained from reinforcement learning, which is often very data demanding. However, a policy that succeeds in simulation often doesn't work when deployed on a real robot. Nevertheless, often the overall gist of what the policy does in simulation remains valid in the real world. In this paper we investigate such settings, where the sequence of states traversed in simulation remains reasonable for the real world, even if the details of the controls are not, as could be the case when the key differences lie in detailed friction, contact, mass and geometry properties. During execution, at each time step our approach computes what the simulation-based control policy would do, but then, rather than executing these controls on the real robot, our approach computes what the simulation expects the resulting next state(s) will be, and then relies on a learned deep inverse dynamics model to decide which real-world action is most suitable to achieve those next states. Deep models are only as good as their training data, and we also propose an approach for data collection to (incrementally) learn the deep inverse dynamics model. Our experiments shows our approach compares favorably with various baselines that have been developed for dealing with simulation to real world model discrepancy, including output error control and Gaussian dynamics adaptation.