arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 21511
1705.01426 2026-06-04 cs.RO cs.SY eess.SY

A Nonlinear Model Predictive Control Scheme for Cooperative Manipulation with Singularity and Collision Avoidance

一种用于协同操作的非线性模型预测控制方案,包含奇异性和避障

Alexandros Nikou, Christos Verginis, Shahab Heshmati-alamdari, Dimos V. Dimarogonas

AI总结 本文提出了一种非线性模型预测控制方案,用于在存在障碍物的有限工作空间中安全导航被N个机器人代理抓取的物体,同时避免碰撞和奇异配置。

Comments Simulation results with 3 agents added

详情
AI中文摘要

本文解决了由N个机器人代理共同运输一个刚性抓取物体的问题。特别地,我们提出了一种非线性模型预测控制(NMPC)方案,该方案保证在有障碍物的有限工作空间中将物体导航到期望姿态,同时满足代理的输入饱和条件。此外,所提出的方法确保代理之间及与工作空间障碍物之间不发生碰撞,且不进入奇异配置。NMPC的可行性和收敛性分析被明确提供。最后,仿真结果展示了所提方法的有效性和效率。

英文摘要

This paper addresses the problem of cooperative transportation of an object rigidly grasped by $N$ robotic agents. In particular, we propose a Nonlinear Model Predictive Control (NMPC) scheme that guarantees the navigation of the object to a desired pose in a bounded workspace with obstacles, while complying with certain input saturations of the agents. Moreover, the proposed methodology ensures that the agents do not collide with each other or with the workspace obstacles as well as that they do not pass through singular configurations. The feasibility and convergence analysis of the NMPC are explicitly provided. Finally, simulation results illustrate the validity and efficiency of the proposed method.

1602.06667 2026-06-04 cs.RO cs.AI cs.SY eess.SY

A Motion Planning Strategy for the Active Vision-Based Mapping of Ground-Level Structures

一种用于主动视觉建图的地面结构运动规划策略

Manikandasriram Srinivasan Ramanagopal, André Phu-Van Nguyen, Jerome Le Ny

AI总结 本文提出了一种指导配备摄像头或深度传感器的地面机器人自主建图有限三维结构可见部分的策略,通过运动规划算法确定合适视角并自动填补点云中的空洞,适用于建筑、施工和检测领域。

Comments Accepted for publication in IEEE Transactions on Automation Science and Engineering. Available in IEEE Xplore at http://ieeexplore.ieee.org/document/8093664

详情
AI中文摘要

本文提出了一种策略,用于指导配备摄像头或深度传感器的地面机器人,以自主建图有限三维结构的可见部分。我们描述了确定合适连续视角的运动规划算法,并尝试自动填补由感知和感知层产生的点云中的空洞。重点是准确重建中等大小结构的3D模型,而非映射大型开放环境。所提出的算法不需要以网格模型或包围盒形式的初始化,生成的路径适用于视觉传感器同时用于建图和机器人局部化的情况,特别是在没有额外绝对定位系统时。我们分析了我们的策略的覆盖性质,并将其性能与经典前沿探索算法进行比较。我们展示了其在不同结构大小、局部化精度水平和深度传感器范围下的有效性,并在真实世界实验中验证了我们的设计。

英文摘要

This paper presents a strategy to guide a mobile ground robot equipped with a camera or depth sensor, in order to autonomously map the visible part of a bounded three-dimensional structure. We describe motion planning algorithms that determine appropriate successive viewpoints and attempt to fill holes automatically in a point cloud produced by the sensing and perception layer. The emphasis is on accurately reconstructing a 3D model of a structure of moderate size rather than mapping large open environments, with applications for example in architecture, construction and inspection. The proposed algorithms do not require any initialization in the form of a mesh model or a bounding box, and the paths generated are well adapted to situations where the vision sensor is used simultaneously for mapping and for localizing the robot, in the absence of additional absolute positioning system. We analyze the coverage properties of our policy, and compare its performance to the classic frontier based exploration algorithm. We illustrate its efficacy for different structure sizes, levels of localization accuracy and range of the depth sensor, and validate our design on a real-world experiment.

1711.03906 2026-06-04 cs.LG cs.DC cs.NI cs.RO cs.SY eess.SY

D-SLATS: Distributed Simultaneous Localization and Time Synchronization

D-SLATS:分布式的同时定位与时间同步

Amr Alanwar, Henrique Ferraz, Kevin Hsieh, Rohit Thazhath, Paul Martin, Joao Hespanha, Mani Srivastava

AI总结 本文提出D-SLATS框架,通过分布式扩展卡尔曼滤波和优化技术联合解决时间同步与定位问题,实现3微秒精度和30厘米误差。

详情
AI中文摘要

通过过去十年,我们见证了物联网(IoT)设备数量的激增,随之而来的是一次对时间和空间上协同行动的更大需求。尽管时间同步和定位这两个问题在许多方面有共同点,但传统上它们被分别处理或在集中式方法中结合,导致资源利用效率低下,或在设备数量方面不可扩展的解决方案。因此,我们提出D-SLATS,一个由三种不同且独立算法组成的框架,以分布式方式联合解决时间和定位问题。前两个算法主要基于分布式扩展卡尔曼滤波(EKF),而第三个算法使用优化技术。不需要融合中心,设备仅与邻居通信。所提出的方法在定制的超宽带通信测试平台和四旋翼无人机上进行了评估,代表了静态和移动节点的网络。我们的算法实现了高达三微秒的时间同步精度和30厘米的定位误差。

英文摘要

Through the last decade, we have witnessed a surge of Internet of Things (IoT) devices, and with that a greater need to choreograph their actions across both time and space. Although these two problems, namely time synchronization and localization, share many aspects in common, they are traditionally treated separately or combined on centralized approaches that results in an ineffcient use of resources, or in solutions that are not scalable in terms of the number of IoT devices. Therefore, we propose D-SLATS, a framework comprised of three different and independent algorithms to jointly solve time synchronization and localization problems in a distributed fashion. The First two algorithms are based mainly on the distributed Extended Kalman Filter (EKF) whereas the third one uses optimization techniques. No fusion center is required, and the devices only communicate with their neighbors. The proposed methods are evaluated on custom Ultra-Wideband communication Testbed and a quadrotor, representing a network of both static and mobile nodes. Our algorithms achieve up to three microseconds time synchronization accuracy and 30 cm localization error.

1701.08585 2026-06-04 cs.LG cs.SI cs.SY eess.SY math.OC

Variational Policy for Guiding Point Processes

变分策略用于引导点过程

Yichen Wang, Grady Williams, Evangelos Theodorou, Le Song

AI总结 本文提出基于最优测度和变分推断的凸优化框架,用于设计点过程的最优控制策略,以更高效准确地引导系统状态。

Comments ICML 2017

详情
AI中文摘要

时间点过程已被广泛应用于建模由在线用户生成的事件序列数据。本文考虑如何设计点过程的最优控制策略,以将由点过程驱动的随机系统引导至目标状态。我们从最优测度和变分推断的角度提出关键洞察,并进一步提出一个凸优化框架和高效的算法,用于自适应更新策略。在合成和真实数据上的实验表明,我们的算法比其他随机控制方法在引导用户活动方面更加准确和高效。

英文摘要

Temporal point processes have been widely applied to model event sequence data generated by online users. In this paper, we consider the problem of how to design the optimal control policy for point processes, such that the stochastic system driven by the point process is steered to a target state. In particular, we exploit the key insight to view the stochastic optimal control problem from the perspective of optimal measure and variational inference. We further propose a convex optimization framework and an efficient algorithm to update the policy adaptively to the current system state. Experiments on synthetic and real-world data show that our algorithm can steer the user activities much more accurately and efficiently than other stochastic control methods.

1711.02857 2026-06-04 cs.LG cs.AI cs.CV cs.NA math.NA stat.ML

Learning Sparse Visual Representations with Leaky Capped Norm Regularizers

通过泄漏受限范数正则化器学习稀疏视觉表示

Jianqiao Wangni, Dahua Lin

AI总结 本文提出泄漏受限范数正则化器,用于学习过完备视觉表示,证明了其在3D形状恢复中的收敛性,优于ℓ1和非凸正则化方法。

详情
AI中文摘要

诱导稀疏性的正则化是学习过完备视觉表示的重要组成部分。尽管ℓ1正则化广受欢迎,本文研究了非凸正则化在该问题中的应用。我们的贡献包括三个部分:首先,我们提出了泄漏受限范数正则化器(LCNR),允许模型权重低于一定阈值的部分被更强地正则化,从而实现强稀疏性,仅引入可控的估计偏差。我们提出了一种主要化-最小化算法来优化联合目标函数。其次,我们的研究显示,在单目3D形状恢复和神经网络中,LCNR优于ℓ1和其他非凸正则化方法,实现了最先进的性能和更快的收敛速度。第三,我们证明了在3D恢复问题上的理论全局收敛速度。到目前为止,这是首次对3D恢复问题的收敛性分析。

英文摘要

Sparsity inducing regularization is an important part for learning over-complete visual representations. Despite the popularity of $\ell_1$ regularization, in this paper, we investigate the usage of non-convex regularizations in this problem. Our contribution consists of three parts. First, we propose the leaky capped norm regularization (LCNR), which allows model weights below a certain threshold to be regularized more strongly as opposed to those above, therefore imposes strong sparsity and only introduces controllable estimation bias. We propose a majorization-minimization algorithm to optimize the joint objective function. Second, our study over monocular 3D shape recovery and neural networks with LCNR outperforms $\ell_1$ and other non-convex regularizations, achieving state-of-the-art performance and faster convergence. Third, we prove a theoretical global convergence speed on the 3D recovery problem. To the best of our knowledge, this is the first convergence analysis of the 3D recovery problem.

1711.00946 2026-06-04 cs.LG cs.SY eess.SY math.OC stat.ML

Learning Linear Dynamical Systems via Spectral Filtering

通过谱滤波学习线性动力系统

Elad Hazan, Karan Singh, Cyril Zhang

AI总结 本文提出一种高效算法,通过过度参数化线性动力系统实现在线预测,利用谱滤波技术获得近优 regret 保证。

Comments Published as a conference paper at NIPS 2017

详情
AI中文摘要

我们提出了一种高效且实用的算法,用于在线预测具有对称转移矩阵的离散时间线性动力系统。我们通过不当学习避免非凸优化问题:通过多项式对数因子过度参数化LDS类,在换取损失函数的凸性。由此产生一个具有近优 regret 保证的多项式时间算法,具有类似的一般学习样本复杂度界。我们的算法基于一种新颖的过滤技术,可能具有独立兴趣:我们将时间序列与某个Hankel矩阵的特征向量进行卷积。

英文摘要

We present an efficient and practical algorithm for the online prediction of discrete-time linear dynamical systems with a symmetric transition matrix. We circumvent the non-convex optimization problem using improper learning: carefully overparameterize the class of LDSs by a polylogarithmic factor, in exchange for convexity of the loss functions. From this arises a polynomial-time algorithm with a near-optimal regret guarantee, with an analogous sample complexity bound for agnostic learning. Our algorithm is based on a novel filtering technique, which may be of independent interest: we convolve the time series with the eigenvectors of a certain Hankel matrix.

1708.01930 2026-06-04 cs.AI cs.MA cs.RO cs.SY eess.SY

Enhanced Emotion Enabled Cognitive Agent Based Rear End Collision Avoidance Controller for Autonomous Vehicles

增强型情感驱动认知代理基于后方碰撞避免控制器用于自动驾驶车辆

Faisal Riaz, Muaz A. Niazi

AI总结 本文提出一种基于增强型情感驱动认知代理的后方碰撞避免控制器,通过引入恐惧情绪生成机制,提高自动驾驶车辆的碰撞避免效率和规则数量。

Comments 39 pages, 17 figures

详情
AI中文摘要

后方碰撞是自然中最致命的事故,导致大多数交通伤亡和伤害。现有研究提出了许多后方碰撞避免解决方案,但这些方案高度依赖精确的数学模型。然而,实际道路驾驶受非线性因素如路面状况、驾驶员反应时间、行人流量和车辆动力学影响,因此获得车辆控制系统精确数学模型具有挑战性。这个问题通过模糊逻辑解决了,但过多的模糊规则直接影响其效率。此外,这些基于模糊逻辑的控制器未使用适当的代理建模来模拟人工驾驶员执行这些模糊规则的功能。鉴于这些限制,我们提出了一种增强型情感驱动认知代理(EEEC_Agent)控制器,帮助自动驾驶车辆(AVs)以较少的规则进行后方碰撞避免,设计基于恐惧情绪,并具有高效率。为了在EEEC_Agent中引入恐惧情绪生成机制,采用了Orton, Clore & Collins(OCC)模型。EEEC_Agent的恐惧生成机制通过NetLogo模拟验证。此外,通过特别设计的原型AV平台对EEEC_Agent的功能进行了实际验证。最终,与现有最先进研究的定性比较研究表明,所提出的模型优于近期研究。

英文摘要

Rear end collisions are deadliest in nature and cause most of traffic casualties and injuries. In the existing research, many rear end collision avoidance solutions have been proposed. However, the problem with these proposed solutions is that they are highly dependent on precise mathematical models. Whereas, the real road driving is influenced by non-linear factors such as road surface situations, driver reaction time, pedestrian flow and vehicle dynamics, hence obtaining the accurate mathematical model of the vehicle control system is challenging. This problem with precise control based rear end collision avoidance schemes has been addressed using fuzzy logic, but the excessive number of fuzzy rules straightforwardly prejudice their efficiency. Furthermore, these fuzzy logic based controllers have been proposed without using proper agent based modeling that helps in mimicking the functions of an artificial human driver executing these fuzzy rules. Keeping in view these limitations, we have proposed an Enhanced Emotion Enabled Cognitive Agent (EEEC_Agent) based controller that helps the Autonomous Vehicles (AVs) to perform rear end collision avoidance with less number of rules, designed after fear emotion, and high efficiency. To introduce a fear emotion generation mechanism in EEEC_Agent, Orton, Clore & Collins (OCC) model has been employed. The fear generation mechanism of EEEC_Agent has been verified using NetLogo simulation. Furthermore, practical validation of EEEC_Agent functions has been performed using specially built prototype AV platform. Eventually, the qualitative comparative study with existing state of the art research works reflect that proposed model outperforms recent research.

1710.02555 2026-06-04 cs.RO cs.SY eess.SY

Model Predictive Path-Following for Constrained Differentially Flat Systems

基于约束微分平坦系统的模型预测路径跟踪

Melissa Greeff, Angela P. Schoellig

AI总结 本文提出一种结合前馈线性化与基于路径的模型预测控制的新型预测路径跟踪方法,通过微分平坦性将非线性问题转化为凸优化问题,并通过动态路径参考提高系统鲁棒性,实验验证了在四旋翼上优于传统轨迹跟踪控制器的性能。

Comments 8 pages, submitted to ICRA 2018

详情
AI中文摘要

对于许多任务,预测路径跟踪控制可以通过优先考虑接近路径而非沿路径的时间进展,并提前考虑路径变化来显著提高自主机器人性能和鲁棒性。本文提出了一种新颖的预测路径跟踪方法,将前馈线性化与基于路径的模型预测控制相结合。我们的方法有几个关键优势。通过利用微分平坦性,我们将基于路径的模型预测控制问题从非线性问题转化为凸优化问题。通过动态路径参考,可以实现对干扰的鲁棒性,该参考根据机器人进展调整其速度。我们还考虑了关键系统约束。我们在四旋翼上进行了实验,展示了在正常条件、初始偏移和风扰情况下,相比传统轨迹跟踪控制器,保持四旋翼更接近期望路径的改进性能。

英文摘要

For many tasks, predictive path-following control can significantly improve the performance and robustness of autonomous robots over traditional trajectory tracking control. It does this by prioritizing closeness to the path over timed progress along the path and by looking ahead to account for changes in the path. We propose a novel predictive path-following approach that couples feedforward linearization with path-based model predictive control. Our approach has a few key advantages. By utilizing the differential flatness property, we reduce the path-based model predictive control problem from a nonlinear to a convex optimization problem. Robustness to disturbances is achieved by a dynamic path reference, which adjusts its speed based on the robot's progress. We also account for key system constraints. We demonstrate these advantages in experiment on a quadrotor. We show improved performance over a baseline trajectory tracking controller by keeping the quadrotor closer to the desired path under nominal conditions, with an initial offset and under a wind disturbance.

1710.00620 2026-06-04 cs.CV cs.NA math.NA

Out-of-focus Blur: Image De-blurring

失焦模糊:图像去模糊

Yuzhen Lu

AI总结 本文研究通过模拟研究解决因失焦模糊导致的图像去模糊问题,采用正则化方法和共轭梯度法提升去模糊效果,提出最优参数选择策略。

Comments 11 pages

详情
AI中文摘要

图像去模糊在许多真实场景或物体成像中至关重要。本项目通过模拟研究,针对由失焦模糊扭曲的图像进行去模糊处理。首先探索伪逆滤波器,但因噪声放大而失败。随后采用Tikhonov正则化方法,相比伪逆滤波器有显著改进。在Tikhonov正则化中,正则化参数的选择对获得高质量图像至关重要,正则化解具有半收敛性质。当使用预设的不一致原理确定最优值时,相对恢复误差为8.49%。此外,采用共轭梯度法进行图像去模糊,计算速度快且结果更优,相对恢复误差为8.22%。迭代次数在CG中充当正则化参数,迭代解也具有半收敛性质。

英文摘要

Image de-blurring is important in many cases of imaging a real scene or object by a camera. This project focuses on de-blurring an image distorted by an out-of-focus blur through a simulation study. A pseudo-inverse filter is first explored but it fails because of severe noise amplification. Then Tikhonov regularization methods are employed, which produce greatly improved results compared to the pseudo-inverse filter. In Tikhonov regularization, the choice of the regularization parameter plays a critical rule in obtaining a high-quality image, and the regularized solutions possess a semi-convergence property. The best result, with the relative restoration error of 8.49%, is achieved when the prescribed discrepancy principle is used to decide an optimal value. Furthermore, an iterative method, Conjugated Gradient, is employed for image de-blurring, which is fast in computation and leads to an even better result with the relative restoration error of 8.22%. The number of iteration in CG acts as a regularization parameter, and the iterates have a semi-convergence property as well.

1710.11040 2026-06-04 cs.RO cs.AI cs.SY eess.SY math.OC

How Should a Robot Assess Risk? Towards an Axiomatic Theory of Risk in Robotics

机器人应如何评估风险?迈向机器人学中的风险轴理论

Anirudha Majumdar, Marco Pavone

AI总结 本文探讨了机器人风险评估的理论基础,提出风险度量应满足的公理,讨论了风险度量的表示定理及其在机器人应用中的实例,并分析了常用风险度量的局限性。

Comments Extended version of paper published in International Symposium on Robotics Research (ISRR) 2017

详情
AI中文摘要

赋予机器人评估风险和做出风险感知决策的能力被视为确保在不确定环境下运作的机器人安全的关键步骤。但,机器人应如何量化风险?一种自然且常见的方法是考虑一种框架,即随机结果被赋予成本——这种分配由一个成本随机变量捕捉。量化风险则对应于评估风险度量,即从成本随机变量到实数的映射。然而,什么是构成

英文摘要

Endowing robots with the capability of assessing risk and making risk-aware decisions is widely considered a key step toward ensuring safety for robots operating under uncertainty. But, how should a robot quantify risk? A natural and common approach is to consider the framework whereby costs are assigned to stochastic outcomes - an assignment captured by a cost random variable. Quantifying risk then corresponds to evaluating a risk metric, i.e., a mapping from the cost random variable to a real number. Yet, the question of what constitutes a "good" risk metric has received little attention within the robotics community. The goal of this paper is to explore and partially address this question by advocating axioms that risk metrics in robotics applications should satisfy in order to be employed as rational assessments of risk. We discuss general representation theorems that precisely characterize the class of metrics that satisfy these axioms (referred to as distortion risk metrics), and provide instantiations that can be used in applications. We further discuss pitfalls of commonly used risk metrics in robotics, and discuss additional properties that one must consider in sequential decision making tasks. Our hope is that the ideas presented here will lead to a foundational framework for quantifying risk (and hence safety) in robotics applications.

1710.09854 2026-06-04 cs.LG cs.NA math.NA stat.ML

Gradient Sparsification for Communication-Efficient Distributed Optimization

梯度稀疏化用于通信高效的分布式优化

Jianqiao Wangni, Jialei Wang, Ji Liu, Tong Zhang

AI总结 本文提出通过凸优化方法减少梯度通信开销,设计高效算法实现梯度稀疏化,验证了在逻辑回归、支持向量机和卷积神经网络中的有效性。

详情
AI中文摘要

现代大规模机器学习应用需要在分布式计算架构上实现随机优化算法。关键瓶颈是不同工作者之间交换信息(如随机梯度)的通信开销。本文提出了一种凸优化公式,以最小化随机梯度的编码长度。为高效求解最优稀疏化,提出了几种简单快速的算法用于近似解,具有理论保证的稀疏性。在ℓ2正则化逻辑回归、支持向量机和卷积神经网络上的实验验证了我们的稀疏化方法。

英文摘要

Modern large scale machine learning applications require stochastic optimization algorithms to be implemented on distributed computational architectures. A key bottleneck is the communication overhead for exchanging information such as stochastic gradients among different workers. In this paper, to reduce the communication cost we propose a convex optimization formulation to minimize the coding length of stochastic gradients. To solve the optimal sparsification efficiently, several simple and fast algorithms are proposed for approximate solution, with theoretical guaranteed for sparseness. Experiments on $\ell_2$ regularized logistic regression, support vector machines, and convolutional neural networks validate our sparsification approaches.

1710.09657 2026-06-04 cs.LG cs.SY eess.SY stat.ML

Segment Parameter Labelling in MCMC Mean-Shift Change Detection

MCMC均值迁移中的分段参数标记

Alireza Ahrabian, Shirin Enshaeifar, Clive Cheong-Took, Payam Barnaghi

AI总结 本文提出一种基于贝叶斯均值迁移的分段变化检测算法,利用分段参数重复性提升性能。

详情
AI中文摘要

本文解决时间序列数据在贝叶斯模型中关于感兴趣统计参数的分段问题。通常假设每个分段内的参数是不同的,因此许多贝叶斯变化点检测模型未利用分段参数模式,这可能提高性能。本文提出了一种贝叶斯均值迁移变化点检测算法,通过引入利用狄利克雷过程先验的分段类别标签来利用分段参数的重复性。所提出方法在合成和真实数据上的性能评估表明,使用参数标记可提高性能。

英文摘要

This work addresses the problem of segmentation in time series data with respect to a statistical parameter of interest in Bayesian models. It is common to assume that the parameters are distinct within each segment. As such, many Bayesian change point detection models do not exploit the segment parameter patterns, which can improve performance. This work proposes a Bayesian mean-shift change point detection algorithm that makes use of repetition in segment parameters, by introducing segment class labels that utilise a Dirichlet process prior. The performance of the proposed approach was assessed on both synthetic and real world data, highlighting the enhanced performance when using parameter labelling.

1710.09627 2026-06-04 cs.AI cs.NI cs.SY eess.SY

SRE: Semantic Rules Engine For the Industrial Internet-Of-Things Gateways

SRE:面向工业互联网-of-things网关的语义规则引擎

Charbel El Kaed, Imran Khan, Andre Van Den Berg, Hicham Hossayni, Christophe Saint-Marcel

AI总结 本文提出一种面向工业网关的语义规则引擎SRE,用于实现动态灵活的基于规则的控制策略,支持实时管理规则并提供语义查询结果。

Comments Accepted for publication in forthcoming issue of IEEE Transactions on Industrial Informatics. The content is final but has NOT been proof-read

详情
Journal ref
IEEE Transactions on Industrial Informatics, 2017
AI中文摘要

物联网范式的发展为解决现实问题提供了机会。例如,能源管理吸引了学术界、工业界、政府和监管机构的广泛关注。它涉及收集能源使用数据、分析数据并通过控制策略优化能源消耗。然而,在工业环境中,进行此类优化并不简单。业务规则的变化、过程控制和客户要求的变化使问题更加具有挑战性。本文提出了一种面向工业网关的语义规则引擎(SRE),允许实现动态且灵活的基于规则的控制策略。它简单、表达能力强,并允许在不造成任何服务中断的情况下实时管理规则。此外,它能够处理语义查询,并通过从已定义的概念中推断额外知识来提供结果。SRE已在不同硬件平台和商业产品上得到验证和测试。还提供了性能评估以验证其对客户要求的符合性。

英文摘要

The Advent of the Internet-of-Things (IoT) paradigm has brought opportunities to solve many real-world problems. Energy management, for example, has attracted huge interest from academia, industries, governments and regulatory bodies. It involves collecting energy usage data, analyzing it, and optimizing the energy consumption by applying control strategies. However, in industrial environments, performing such optimization is not trivial. The changes in business rules, process control, and customer requirements make it much more challenging. In this paper, a Semantic Rules Engine (SRE) for industrial gateways is presented that allows implementing dynamic and flexible rule-based control strategies. It is simple, expressive, and allows managing rules on-the-fly without causing any service interruption. Additionally, it can handle semantic queries and provide results by inferring additional knowledge from previously defined concepts in ontologies. SRE has been validated and tested on different hardware platforms and in commercial products. Performance evaluations are also presented to validate its conformance to the customer requirements.

1710.07147 2026-06-04 cs.AI cs.SY eess.SY

A Two-Phase Safe Vehicle Routing and Scheduling Problem: Formulations and Solution Algorithms

两阶段安全车辆路径与调度问题:建模与求解算法

Aschkan Omidvar, Eren Erman Ozguven, O. Arda Vanli, R. Tavakkoli-Moghaddam

AI总结 本文提出一种两阶段时间依赖车辆路径与调度优化模型,通过避免重复拥堵和选择事故概率较低的路线,替代传统最短距离或行驶时间目标。第一阶段利用混合整数规划模型确定安全路径;第二阶段通过调整出发时间和速度避免拥堵,采用改进的模拟退火算法求解。

详情
AI中文摘要

我们提出一个两阶段时间依赖车辆路径与调度优化模型,通过(1)避免重复拥堵和(2)选择事故概率较低的路线,替代文献中常见的最短距离或行驶时间目标。第一阶段根据时间动态考虑道路网络上的速度变化,解决混合整数规划模型以确定车队和节点序列的安全路径。第二阶段将每条路线视为独立的交通路径(固定路线和节点序列),通过调整车辆从每个节点的出发时间和调整各边的次优速度来避免拥堵。提出的改进模拟退火(SA)算法用于迭代求解这两个复杂模型,能够以较短的时间提供解决方案。

英文摘要

We propose a two phase time dependent vehicle routing and scheduling optimization model that identifies the safest routes, as a substitute for the classical objectives given in the literature such as shortest distance or travel time, through (1) avoiding recurring congestions, and (2) selecting routes that have a lower probability of crash occurrences and non-recurring congestion caused by those crashes. In the first phase, we solve a mixed-integer programming model which takes the dynamic speed variations into account on a graph of roadway networks according to the time of day, and identify the routing of a fleet and sequence of nodes on the safest feasible paths. Second phase considers each route as an independent transit path (fixed route with fixed node sequences), and tries to avoid congestion by rescheduling the departure times of each vehicle from each node, and by adjusting the sub-optimal speed on each arc. A modified simulated annealing (SA) algorithm is formulated to solve both complex models iteratively, which is found to be capable of providing solutions in a considerably short amount of time.

1709.03153 2026-06-04 cs.LG cs.AI cs.RO cs.SY eess.SY

MBMF: Model-Based Priors for Model-Free Reinforcement Learning

MBMF:基于模型的先验用于无模型强化学习

Somil Bansal, Roberto Calandra, Kurtland Chua, Sergey Levine, Claire Tomlin

AI总结 本文提出一种结合模型与无模型强化学习的方法,通过学习概率动力学模型作为先验,提升数据效率和成本效益。

Comments After we submitted the paper for consideration in CoRL 2017 we found a paper published in the recent past with a similar method (see related work for a discussion). Considering the similarities between the two papers, we have decided to retract our paper from CoRL 2017

详情
AI中文摘要

强化学习主要分为无模型和有模型两种范式。每种范式都有其优势和局限性,并已成功应用于适合其相应优势的真实世界领域。本文提出一种新方法,旨在弥合这两种范式的差距。我们通过学习概率动力学模型,并将其作为交织的无模型优化的先验,结合两种范式的优点,从而实现数据高效和成本节约。结果表明,我们的方法在性能上优于纯有模型和纯无模型方法,以及简单切换范式的方法。

英文摘要

Reinforcement Learning is divided in two main paradigms: model-free and model-based. Each of these two paradigms has strengths and limitations, and has been successfully applied to real world domains that are appropriate to its corresponding strengths. In this paper, we present a new approach aimed at bridging the gap between these two paradigms. We aim to take the best of the two paradigms and combine them in an approach that is at the same time data-efficient and cost-savvy. We do so by learning a probabilistic dynamics model and leveraging it as a prior for the intertwined model-free optimization. As a result, our approach can exploit the generality and structure of the dynamics model, but is also capable of ignoring its inevitable inaccuracies, by directly incorporating the evidence provided by the direct observation of the cost. Preliminary results demonstrate that our approach outperforms purely model-based and model-free approaches, as well as the approach of simply switching from a model-based to a model-free setting.

1707.09095 2026-06-04 cs.AI cs.SY eess.SY

Toward the Starting Line: A Systems Engineering Approach to Strong AI

迈向起点:一种系统工程方法用于强人工智能

Tansu Alpcan, Sarah M. Erfani, Christopher Leckie

AI总结 本文提出一种基于系统工程的方法,旨在解决强人工智能的起点问题,通过跨学科融合推动主流研究。

Comments 11 pages, 3 figures

详情
AI中文摘要

人工一般智能(AGI)或强人工智能旨在创造具有人类水平智能的机器,相较于现有计算和人工智能系统仍是一个雄心勃勃的目标。在经历了多次 hype 周期和 AI 历史教训后,显然需要一个重大的概念飞跃才能跨越起点,从而启动主流 AGI 研究。本文旨在为达到这一起点做出小的理论贡献。通过对 AGI 问题从不同视角进行广泛分析,介绍了一种基于系统理论和工程研究的方法,该方法建立在现有的主流 AI 和系统基础之上。识别了系统学科与 AI 研究之间的几个有前途的交叉促进机会。讨论了具体潜在的研究方向。

英文摘要

Artificial General Intelligence (AGI) or Strong AI aims to create machines with human-like or human-level intelligence, which is still a very ambitious goal when compared to the existing computing and AI systems. After many hype cycles and lessons from AI history, it is clear that a big conceptual leap is needed for crossing the starting line to kick-start mainstream AGI research. This position paper aims to make a small conceptual contribution toward reaching that starting line. After a broad analysis of the AGI problem from different perspectives, a system-theoretic and engineering-based research approach is introduced, which builds upon the existing mainstream AI and systems foundations. Several promising cross-fertilization opportunities between systems disciplines and AI research are identified. Specific potential research directions are discussed.

1710.06232 2026-06-04 cs.CV cs.SY eess.SY

Analysis of feature detector and descriptor combinations with a localization experiment for various performance metrics

基于多种性能指标的特征检测器与描述符组合分析:定位实验

Ertugrul Bayraktar, Pinar Boyraz

AI总结 本文通过移动机器人室内实验,比较不同特征检测器与描述符组合在图像匹配中的性能,分析不同组合在精度、时间、角度差等五项指标下的表现。

Comments 11 pages, 3 figures, 1 table

详情
Journal ref
Turkish Journal of Electrical Engineering & Computer Sciences, (2017) 25: 2444 - 2454
AI中文摘要

本研究旨在提供特征检测器/描述符方法的详细性能比较,特别是当其各种组合用于图像匹配时的表现。通过移动机器人在室内环境中的定位实验作为案例研究,使用3090张查询图像和127张数据集图像。研究包括五种特征检测器方法(FAST、ORB、SURF、SIFT、BRISK)和五种特征描述符方法(BRIEF、BRISK、SIFT、SURF、ORB)。这些方法在23种不同组合中使用,通过本研究定义的性能标准获得有意义且一致的比较结果。所有方法作为独立的特征检测器或描述符分别使用。性能分析展示了各种检测器和描述符组合的判别能力。分析使用五个参数:(i)准确性,(ii)时间,(iii)关键点之间的角度差,(iv)正确匹配的数量,(v)正确匹配关键点之间的距离。在60°范围内,覆盖系统五个旋转姿态点,FAST-SURF组合具有最低的距离和角度差值以及最高的匹配关键点数量。SIFT-SURF是准确度最高的组合,正确分类率为98.41%。最快的算法是ORB-BRIEF,匹配560张在运动中捕获的图像和127张数据集图像的总运行时间为21,303.30秒。

英文摘要

The purpose of this study is to provide a detailed performance comparison of feature detector/descriptor methods, particularly when their various combinations are used for image-matching. The localization experiments of a mobile robot in an indoor environment are presented as a case study. In these experiments, 3090 query images and 127 dataset images were used. This study includes five methods for feature detectors (features from accelerated segment test (FAST), oriented FAST and rotated binary robust independent elementary features (BRIEF) (ORB), speeded-up robust features (SURF), scale invariant feature transform (SIFT), and binary robust invariant scalable keypoints (BRISK)) and five other methods for feature descriptors (BRIEF, BRISK, SIFT, SURF, and ORB). These methods were used in 23 different combinations and it was possible to obtain meaningful and consistent comparison results using the performance criteria defined in this study. All of these methods were used independently and separately from each other as either feature detector or descriptor. The performance analysis shows the discriminative power of various combinations of detector and descriptor methods. The analysis is completed using five parameters: (i) accuracy, (ii) time, (iii) angle difference between keypoints, (iv) number of correct matches, and (v) distance between correctly matched keypoints. In a range of 60°, covering five rotational pose points for our system, the FAST-SURF combination had the lowest distance and angle difference values and the highest number of matched keypoints. SIFT-SURF was the most accurate combination with a 98.41% correct classification rate. The fastest algorithm was ORB-BRIEF, with a total running time of 21,303.30 s to match 560 images captured during motion with 127 dataset images.

1710.05472 2026-06-04 cs.LG cs.SY eess.SY

Safe Learning of Quadrotor Dynamics Using Barrier Certificates

使用屏障证书安全学习四旋翼动力学

Li Wang, Evangelos A. Theodorou, Magnus Egerstedt

AI总结 本文提出基于高斯过程的数据驱动方法,通过屏障证书确保四旋翼在部分未知环境中的安全学习,结合自适应采样方案和递归高斯过程预测实现动态建模。

Comments Submitted to ICRA 2018, 8 pages

详情
AI中文摘要

为了有效控制复杂动力系统,通常需要准确的非线性模型。然而,这些模型并不总是已知的。在本文中,我们提出了一种基于高斯过程的数据驱动方法,用于学习在部分未知环境中运行的四旋翼动力学模型。挑战在于,若学习过程未被谨慎控制,系统将不稳定,即四旋翼将坠毁。为此,采用屏障证书进行安全学习。屏障证书建立了一个非保守的正向不变安全区域,基于高斯过程的统计特性提供高概率的安全保证。设计了一个学习控制器,以高效探索不确定状态并扩展基于自适应采样方案的屏障认证安全区域。此外,开发了一种递归高斯过程预测方法,用于实时学习复杂的四旋翼动力学。仿真结果证明了所提方法的有效性。

英文摘要

To effectively control complex dynamical systems, accurate nonlinear models are typically needed. However, these models are not always known. In this paper, we present a data-driven approach based on Gaussian processes that learns models of quadrotors operating in partially unknown environments. What makes this challenging is that if the learning process is not carefully controlled, the system will go unstable, i.e., the quadcopter will crash. To this end, barrier certificates are employed for safe learning. The barrier certificates establish a non-conservative forward invariant safe region, in which high probability safety guarantees are provided based on the statistics of the Gaussian Process. A learning controller is designed to efficiently explore those uncertain states and expand the barrier certified safe region based on an adaptive sampling scheme. In addition, a recursive Gaussian Process prediction method is developed to learn the complex quadrotor dynamics in real-time. Simulation results are provided to demonstrate the effectiveness of the proposed approach.

1608.01431 2026-06-04 cs.CV cs.NA math.NA

An efficient iterative thresholding method for image segmentation

一种高效的迭代阈值方法用于图像分割

Dong Wang, Haohan Li, Xiaoyu Wei, Xiaoping Wang

AI总结 本文提出了一种高效的迭代阈值方法用于多相图像分割,通过非局部多相能量近似轮廓长度,实现最优复杂度O(N log N)的高效分割。

Comments 14 pages, 21 figures

详情
AI中文摘要

我们提出了一种高效的迭代阈值方法用于多相图像分割。该算法基于最小化分段常数Mumford-Shah功能,在其中轮廓长度(或周长)被近似为非局部多相能量。通过迭代方法求解最小化问题。每次迭代包括计算简单的卷积后跟随阈值步骤。该算法易于实现且具有最优复杂度O(N log N)每迭代。我们还展示了迭代算法具有总能量衰减性质。我们展示了一些数值结果以显示我们方法的效率。

英文摘要

We proposed an efficient iterative thresholding method for multi-phase image segmentation. The algorithm is based on minimizing piecewise constant Mumford-Shah functional in which the contour length (or perimeter) is approximated by a non-local multi-phase energy. The minimization problem is solved by an iterative method. Each iteration consists of computing simple convolutions followed by a thresholding step. The algorithm is easy to implement and has the optimal complexity $O(N \log N)$ per iteration. We also show that the iterative algorithm has the total energy decaying property. We present some numerical results to show the efficiency of our method.

1710.02242 2026-06-04 cs.LG cs.NA math.NA

Solving differential equations with unknown constitutive relations as recurrent neural networks

利用未知本构关系求解微分方程作为循环神经网络

Tobias Hagge, Panos Stinis, Enoch Yeung, Alexandre M. Tartakovsky

AI总结 本文提出用循环神经网络学习未知的反应速率项,通过离散化的常微分方程作为训练问题的一部分,解决部分可用状态变量测量数据下的微分方程问题,应用于fedbatch生物反应器模拟。

Comments 19 pages, 8 figures

详情
AI中文摘要

我们解决一个具有未知功能形式的sink(反应速率)项的常微分方程组。我们假设状态变量的测量(时间序列)部分可用,并利用循环神经网络来“学习”反应速率。这通过将离散化的常微分方程作为循环神经网络训练问题的一部分来实现。我们扩展了TensorFlow的循环神经网络架构,创建了一个简单但可扩展且有效的求解器,用于未知函数的求解,并应用于fedbatch生物反应器模拟问题。使用最近深度学习文献中的技术使训练具有在数千个时间步上表现的行为的函数成为可能。我们的网络在结构上类似于循环神经网络,但设计和功能上的差异要求对训练此类网络的传统智慧进行修改。

英文摘要

We solve a system of ordinary differential equations with an unknown functional form of a sink (reaction rate) term. We assume that the measurements (time series) of state variables are partially available, and we use recurrent neural network to "learn" the reaction rate from this data. This is achieved by including a discretized ordinary differential equations as part of a recurrent neural network training problem. We extend TensorFlow's recurrent neural network architecture to create a simple but scalable and effective solver for the unknown functions, and apply it to a fedbatch bioreactor simulation problem. Use of techniques from recent deep learning literature enables training of functions with behavior manifesting over thousands of time steps. Our networks are structurally similar to recurrent neural networks, but differences in design and function require modifications to the conventional wisdom about training such networks.

1710.00489 2026-06-04 cs.RO cs.AI cs.CV cs.NE cs.SY eess.SY

SE3-Pose-Nets: Structured Deep Dynamics Models for Visuomotor Planning and Control

SE3-姿态网络:用于视觉-运动规划和控制的结构深度动力学模型

Arunkumar Byravan, Felix Leeb, Franziska Meier, Dieter Fox

AI总结 本文提出了一种基于结构深度动力学模型的深度视觉-运动控制方法,通过编码器-解码器结构学习低维姿态嵌入,实现场景分割和姿态预测,并在现实世界中实现了闭环控制。

Comments 8 pages, Initial submission to IEEE International Conference on Robotics and Automation (ICRA) 2018

详情
AI中文摘要

本文提出了一种基于结构深度动力学模型的深度视觉-运动控制方法。我们的深度动力学模型是一种SE3-Nets的变体,通过编码器-解码器结构学习低维姿态嵌入用于视觉-运动控制。与以往工作不同,我们的动力学模型是结构化的:给定一个输入场景,我们的网络明确学习分割显著部分并预测其姿态嵌入以及其运动作为姿态空间中的变化。我们通过一对相隔动作的点云训练我们的模型,并展示在仅提供帧间点对数据关联的监督下,我们的网络能够学习有意义的场景分割以及一致的姿态。我们进一步展示我们的模型可以直接在学习的低维姿态空间中用于闭环控制,其中动作通过最小化姿态空间中的误差使用基于梯度的方法计算,类似于传统模型驱动控制。我们展示了在模拟和现实世界中控制Baxter机器人从原始深度数据的结果,并与两种基线深度网络进行了比较。我们的方法在实时运行,实现了良好的场景动态预测,并在多个控制运行中优于基线方法。视频结果可在:https://rse-lab.cs.washington.edu/se3-structured-deep-ctrl/

英文摘要

In this work, we present an approach to deep visuomotor control using structured deep dynamics models. Our deep dynamics model, a variant of SE3-Nets, learns a low-dimensional pose embedding for visuomotor control via an encoder-decoder structure. Unlike prior work, our dynamics model is structured: given an input scene, our network explicitly learns to segment salient parts and predict their pose-embedding along with their motion modeled as a change in the pose space due to the applied actions. We train our model using a pair of point clouds separated by an action and show that given supervision only in the form of point-wise data associations between the frames our network is able to learn a meaningful segmentation of the scene along with consistent poses. We further show that our model can be used for closed-loop control directly in the learned low-dimensional pose space, where the actions are computed by minimizing error in the pose space using gradient-based methods, similar to traditional model-based control. We present results on controlling a Baxter robot from raw depth data in simulation and in the real world and compare against two baseline deep networks. Our method runs in real-time, achieves good prediction of scene dynamics and outperforms the baseline methods on multiple control runs. Video results can be found at: https://rse-lab.cs.washington.edu/se3-structured-deep-ctrl/

1709.09578 2026-06-04 cs.LG cs.NA math.NA

Neural networks for topology optimization

神经网络用于拓扑优化

Ivan Sosnovik, Ivan Oseledets

AI总结 本文提出基于深度学习的拓扑优化加速方法,将布局问题转化为图像分割任务,利用卷积编码器-解码器架构实现高效优化,实验表明方法显著提升优化速度并具有良好的泛化能力。

详情
AI中文摘要

在本研究中,我们提出了一种基于深度学习的方法,以加速拓扑优化方法。我们试图解决布局问题。本工作的主要创新点是将问题表述为图像分割任务。我们利用深度学习方法的高效像素级图像标注技术来进行拓扑优化。我们引入了卷积编码器-解码器架构,并介绍了通过高性能方法解决上述问题的整体方法。进行的实验展示了优化过程的显著加速。所提出的方法具有出色的泛化能力。我们展示了所提出模型应用于其他问题的能力。成功的实验结果以及当前方法的缺点均进行了讨论。

英文摘要

In this research, we propose a deep learning based approach for speeding up the topology optimization methods. The problem we seek to solve is the layout problem. The main novelty of this work is to state the problem as an image segmentation task. We leverage the power of deep learning methods as the efficient pixel-wise image labeling technique to perform the topology optimization. We introduce convolutional encoder-decoder architecture and the overall approach of solving the above-described problem with high performance. The conducted experiments demonstrate the significant acceleration of the optimization process. The proposed approach has excellent generalization properties. We demonstrate the ability of the application of the proposed model to other problems. The successful results, as well as the drawbacks of the current method, are discussed.

1709.07032 2026-06-04 cs.RO cs.MA cs.SY eess.SY stat.AP

Data-Driven Model Predictive Control of Autonomous Mobility-on-Demand Systems

数据驱动的自动驾驶按需出行系统模型预测控制

Ramon Iglesias, Federico Rossi, Kevin Wang, David Hallac, Jure Leskovec, Marco Pavone

AI总结 本文提出一种端到端的数据驱动框架,用于控制自动驾驶按需出行系统,通过时间扩展网络建模并设计MPC算法,利用历史数据预测短期需求,减少乘客等待时间达89.6%。

Comments Submitted to the International Conference on Robotics and Automation 2018

详情
AI中文摘要

本文旨在提出一种端到端的数据驱动框架,用于控制自动驾驶按需出行系统(AMoD,即自动驾驶车队)。我们首先使用时间扩展网络建模AMoD系统,并提出一种计算最优再平衡策略(即预置重新定位)和给定旅行需求的最小可行车队规模的公式。然后,我们适应此公式,设计出一种模型预测控制(MPC)算法,利用基于历史数据的短期需求预测来计算再平衡策略。我们使用最先进的LSTM神经网络和滴滴出行的真实客户数据测试该控制器的端到端性能,证明该方法在大规模系统中表现优异(MPC算法的计算复杂度不依赖于系统中的客户和车辆数量),并且在减少平均乘客等待时间方面优于现有再平衡策略,最高可减少89.6%。

英文摘要

The goal of this paper is to present an end-to-end, data-driven framework to control Autonomous Mobility-on-Demand systems (AMoD, i.e. fleets of self-driving vehicles). We first model the AMoD system using a time-expanded network, and present a formulation that computes the optimal rebalancing strategy (i.e., preemptive repositioning) and the minimum feasible fleet size for a given travel demand. Then, we adapt this formulation to devise a Model Predictive Control (MPC) algorithm that leverages short-term demand forecasts based on historical data to compute rebalancing strategies. We test the end-to-end performance of this controller with a state-of-the-art LSTM neural network to predict customer demand and real customer data from DiDi Chuxing: we show that this approach scales very well for large systems (indeed, the computational complexity of the MPC algorithm does not depend on the number of customers and of vehicles in the system) and outperforms state-of-the-art rebalancing strategies by reducing the mean customer wait time by up to to 89.6%.

1703.01250 2026-06-04 cs.RO cs.LG cs.SY eess.SY

Virtual vs. Real: Trading Off Simulations and Physical Experiments in Reinforcement Learning with Bayesian Optimization

虚拟与现实:在强化学习中权衡模拟与物理实验

Alonso Marco, Felix Berkenkamp, Philipp Hennig, Angela P. Schoellig, Andreas Krause, Stefan Schaal, Sebastian Trimpe

AI总结 本文提出利用模拟数据优化强化学习,通过结合低成本但不准确的模拟信息与高成本但准确的物理实验,提高效率。

Comments 7 pages, 6 figures, to appear in IEEE 2017 International Conference on Robotics and Automation (ICRA)

详情
AI中文摘要

在实践中,控制策略的参数通常手动调整,这耗时且令人沮丧。强化学习是一种有前途的替代方法,旨在自动化此过程,但通常需要太多实验才实用。本文提出了一种解决方案,通过利用可用于大多数机器人平台的模拟先验知识。具体而言,我们扩展了熵搜索,一种最大化每次实验信息增益的贝叶斯优化算法,以处理多个信息源的情况。结果是一种原则性的方法,可以有效地将低成本但不准确的模拟信息与高成本且准确的物理实验结合起来。我们将其应用于摆杆系统,证明该算法可以在比仅使用物理系统标准贝叶斯优化更少的实验中找到良好的控制策略。

英文摘要

In practice, the parameters of control policies are often tuned manually. This is time-consuming and frustrating. Reinforcement learning is a promising alternative that aims to automate this process, yet often requires too many experiments to be practical. In this paper, we propose a solution to this problem by exploiting prior knowledge from simulations, which are readily available for most robotic platforms. Specifically, we extend Entropy Search, a Bayesian optimization algorithm that maximizes information gain from each experiment, to the case of multiple information sources. The result is a principled way to automatically combine cheap, but inaccurate information from simulations with expensive and accurate physical experiments in a cost-effective manner. We apply the resulting method to a cart-pole system, which confirms that the algorithm can find good control policies with fewer experiments than standard Bayesian optimization on the physical system only.

1605.01950 2026-06-04 cs.RO cs.LG cs.SY eess.SY

Automatic LQR Tuning Based on Gaussian Process Global Optimization

基于高斯过程全局优化的自动LQR调优

Alonso Marco, Philipp Hennig, Jeannette Bohg, Stefan Schaal, Sebastian Trimpe

AI总结 本文提出一种结合线性最优控制的自动控制器调优框架,利用贝叶斯优化算法提升控制器参数,通过实验数据优化性能目标,以七自由度机械臂平衡倒立杆为例验证方法有效性。

Comments 8 pages, 5 figures, to appear in IEEE 2016 International Conference on Robotics and Automation. Video demonstration of the experiments available at https://am.is.tuebingen.mpg.de/publications/marco_icra_2016

详情
AI中文摘要

本文提出一种基于线性最优控制与贝叶斯优化的自动控制器调优框架。该框架根据预定义的性能目标,利用实验数据自动改进初始控制器参数。所采用的贝叶斯优化算法为熵搜索,将潜在目标表示为高斯过程,并构建关于目标最小值位置的显式信念。通过最大化每次实验评估的信息增益,该框架能够在较少评估次数下获得改进的控制器。实验演示使用了七自由度机械臂平衡倒立杆的任务,二、四维调优问题的结果展示了该方法在机器人平台上的自动控制器调优潜力。

英文摘要

This paper proposes an automatic controller tuning framework based on linear optimal control combined with Bayesian optimization. With this framework, an initial set of controller gains is automatically improved according to a pre-defined performance objective evaluated from experimental data. The underlying Bayesian optimization algorithm is Entropy Search, which represents the latent objective as a Gaussian process and constructs an explicit belief over the location of the objective minimum. This is used to maximize the information gain from each experimental evaluation. Thus, this framework shall yield improved controllers with fewer evaluations compared to alternative approaches. A seven-degree-of-freedom robot arm balancing an inverted pole is used as the experimental demonstrator. Results of a two- and four-dimensional tuning problems highlight the method's potential for automatic controller tuning on robotic platforms.

1709.06080 2026-06-04 cs.LG cs.AI cs.NA math.NA

Feedforward and Recurrent Neural Networks Backward Propagation and Hessian in Matrix Form

前馈和循环神经网络的反向传播与Hessian矩阵形式

Maxim Naumov

AI总结 本文研究了前馈和循环神经网络的线性代数理论,推导了Hessian的精确表达式,并展示了权重梯度和Hessian的矩阵形式。

Comments 23 pages, 4 figures

详情
AI中文摘要

本文聚焦于前馈(FNN)和循环(RNN)神经网络背后的线性代数理论。我们回顾了反向传播,包括通过时间反向传播(BPTT)。此外,我们推导出Hessian的新的精确表达式,代表了二次效应。我们证明,对于t个时间步,权重梯度可以表示为秩-t矩阵,而权重Hessian则可以表示为t²个Kronecker积之和,这些Kronecker积由秩-1和W^TAW矩阵组成,其中A和W是某些矩阵。此外,我们还证明,对于大小为r的mini-batch,权重更新可以表示为秩-rt矩阵。最后,我们简要评论了Hessian矩阵的特征值。

英文摘要

In this paper we focus on the linear algebra theory behind feedforward (FNN) and recurrent (RNN) neural networks. We review backward propagation, including backward propagation through time (BPTT). Also, we obtain a new exact expression for Hessian, which represents second order effects. We show that for $t$ time steps the weight gradient can be expressed as a rank-$t$ matrix, while the weight Hessian is as a sum of $t^{2}$ Kronecker products of rank-$1$ and $W^{T}AW$ matrices, for some matrix $A$ and weight matrix $W$. Also, we show that for a mini-batch of size $r$, the weight update can be expressed as a rank-$rt$ matrix. Finally, we briefly comment on the eigenvalues of the Hessian matrix.

1709.05843 2026-06-04 cs.RO cs.MA cs.SY eess.SY

Decentralized Collision-Free Control of Multiple Robots in 2D and 3D Spaces

多机器人在二维和三维空间中去碰撞的去中心化控制

Xiaotian Yang

AI总结 本文提出在任意未知二维和三维区域中实现多机器人去中心化无碰撞控制的方法,通过网格选择和算法设计解决完全覆盖和搜索任务,算法在MATLAB中验证并与其他算法对比。

详情
AI中文摘要

去中心化机器人控制吸引了大量研究兴趣。然而,一些研究使用了不现实的假设而未考虑碰撞避免。本报告聚焦于在任意未知的二维和三维区域内实现多机器人无碰撞控制,以解决完全覆盖和搜索任务。所有算法均为去中心化,因为机器人能力有限,并且已数学证明。报告首先讨论了两种任务中的网格选择。网格模式简化了区域表示,机器人只需在邻居顶点之间直线移动。对于100%完全二维覆盖,提出了等边三角形网格。对于忽略边界效应的完全覆盖,每种情况都计算出在二维和三维区域中顶点最少的网格。第二部分针对二维和三维区域的完全覆盖,提出了一种去中心化无碰撞算法,驱动机器人前往离参考点最远的区域。该区域可以是静态或扩展的,并在MATLAB中模拟。第三部分提供了三种基于网格的去中心化随机算法,用于在二维或三维区域中搜索目标。目标数量可以是已知或未知的。在第一个算法中,机器人随机选择空闲邻居,优先选择未访问的邻居。第二个算法在机器人靠近时添加排斥力以分散机器人。第三个算法中,如果被已访问的顶点包围,机器人将使用广度优先搜索算法前往最近的未访问顶点。第二个搜索算法在Pioneer 3-DX机器人上验证。展示了生成公式以估计搜索时间的一般方法。算法在MATLAB中与其他五个算法比较,以展示其有效性。

英文摘要

Decentralized control of robots has attracted huge research interests. However, some of the research used unrealistic assumptions without collision avoidance. This report focuses on the collision-free control for multiple robots in both complete coverage and search tasks in 2D and 3D areas which are arbitrary unknown. All algorithms are decentralized as robots have limited abilities and they are mathematically proved. The report starts with the grid selection in the two tasks. Grid patterns simplify the representation of the area and robots only need to move straightly between neighbor vertices. For the 100% complete 2D coverage, the equilateral triangular grid is proposed. For the complete coverage ignoring the boundary effect, the grid with the fewest vertices is calculated in every situation for both 2D and 3D areas. The second part is for the complete coverage in 2D and 3D areas. A decentralized collision-free algorithm with the above selected grid is presented driving robots to sections which are furthest from the reference point. The area can be static or expanding, and the algorithm is simulated in MATLAB. Thirdly, three grid-based decentralized random algorithms with collision avoidance are provided to search targets in 2D or 3D areas. The number of targets can be known or unknown. In the first algorithm, robots choose vacant neighbors randomly with priorities on unvisited ones while the second one adds the repulsive force to disperse robots if they are close. In the third algorithm, if surrounded by visited vertices, the robot will use the breadth-first search algorithm to go to one of the nearest unvisited vertices via the grid. The second search algorithm is verified on Pioneer 3-DX robots. The general way to generate the formula to estimate the search time is demonstrated. Algorithms are compared with five other algorithms in MATLAB to show their effectiveness.

1602.04436 2026-06-04 cs.LG cs.SY eess.SY stat.ML

Autoregressive Moving Average Graph Filtering

自回归移动平均图滤波器

Elvin Isufi, Andreas Loukas, Andrea Simonetto, Geert Leus

AI总结 本文提出了一种自回归移动平均图滤波器,能够近似任意图频响应并实现信号去噪和插值。该方法适用于静态和时变场景,通过二维滤波处理时变图信号。

详情
Journal ref
IEEE Transactions on Signal Processing, vol. 67 (2), pages 274 - 288, 2017
AI中文摘要

本文提出了一种自回归移动平均图滤波器,能够近似任意图频响应并实现信号去噪和插值。该方法适用于静态和时变场景,通过二维滤波处理时变图信号。

英文摘要

One of the cornerstones of the field of signal processing on graphs are graph filters, direct analogues of classical filters, but intended for signals defined on graphs. This work brings forth new insights on the distributed graph filtering problem. We design a family of autoregressive moving average (ARMA) recursions, which (i) are able to approximate any desired graph frequency response, and (ii) give exact solutions for tasks such as graph signal denoising and interpolation. The design philosophy, which allows us to design the ARMA coefficients independently from the underlying graph, renders the ARMA graph filters suitable in static and, particularly, time-varying settings. The latter occur when the graph signal and/or graph are changing over time. We show that in case of a time-varying graph signal our approach extends naturally to a two-dimensional filter, operating concurrently in the graph and regular time domains. We also derive sufficient conditions for filter stability when the graph and signal are time-varying. The analytical and numerical results presented in this paper illustrate that ARMA graph filters are practically appealing for static and time-varying settings, as predicted by theoretical derivations.

1709.03248 2026-06-04 cs.RO cs.SY eess.SY

Vector Field Guidance for Convoy Monitoring Using Elliptical Orbits

利用椭圆轨道的矢量场引导用于车队监控

Aseem V. Borkar, Vivek S. Borkar, Arpita Sinha

AI总结 本文提出了一种基于矢量场的新型引导方案,用于无人机沿可能非线性轨迹跟踪和监控地面车队。通过回归算法计算时间变化的椭圆,确保无人机轨迹反复穿越该移动椭圆。

详情
AI中文摘要

我们提出了一种基于矢量场的新型引导方案,用于跟踪和监控沿可能非线性轨迹移动的车队,由空中代理执行。该方案首先使用简单的回归算法计算一个时间变化的椭圆,该椭圆包含车队中的所有目标。然后确保代理收敛到一个轨迹,该轨迹反复穿越这个移动的椭圆。该方案通过非线性微分方程的扰动理论进行分析,并提供了支持性模拟。还讨论了一些相关实现问题,并强调了该方案的优势。

英文摘要

We propose a novel vector field based guidance scheme for tracking and surveillance of a convoy, moving along a possibly nonlinear trajectory on the ground, by an aerial agent. The scheme first computes a time varying ellipse that encompasses all the targets in the convoy using a simple regression based algorithm. It then ensures convergence of the agent to a trajectory that repeatedly traverses this moving ellipse. The scheme is analyzed using perturbation theory of nonlinear differential equations and supporting simulations are provided. Some related implementation issues are discussed and advantages of the scheme are highlighted.

1709.04073 2026-06-04 cs.LG cs.SY eess.SY stat.ML

Linear Stochastic Approximation: Constant Step-Size and Iterate Averaging

线性随机逼近:固定步长和迭代平均

Chandrashekar Lakshminarayanan, Csaba Szepesvári

AI总结 本文研究了固定步长和Polyak-Ruppert平均的线性随机逼近算法,分析了其均方误差随迭代次数的变化,并探讨了在不同数据分布下固定步长的选择条件及启发式调整方法。

Comments 16 pages, 2 figures, was submitted to NIPS 2017

详情
AI中文摘要

本文研究了具有固定步长和Polyak-Ruppert(PR)迭代平均的$d$维线性随机逼近算法(LSAs)。LSAs广泛应用于机器学习和强化学习(RL)中,其目标是利用噪声数据和每个迭代$O(d)$次更新来计算合适的$θ_*∈\mathbb{R}^d$(即最优解或固定点)。本文受RL中从经验回放中评估策略的问题启发,探讨了属于时间差分(TD)类学习算法的LSAs。对于具有固定步长和PR平均的LSAs,我们提供了$t$次迭代后的均方误差(MSE)的界限。我们假设数据是独立同分布且具有有限方差(底层分布为$P$)且期望动力学是Hurwitz的。对于给定的LSA与PR平均,以及满足上述假设的数据分布$P$,我们证明存在一个常数步长范围,使得其MSE衰减为$O(1/t)$。我们还探讨了在数据分布$\mathcal{P}$中选择统一常数步长的条件,并证明并非所有数据分布都允许这样的统一常数步长。此外,我们建议一种启发式步长调整算法,用于为给定的数据分布$P$选择LSA的常数步长。我们还比较了我们的结果与相关工作,并讨论了我们的结果在TD算法作为LSAs的上下文中的意义。

英文摘要

We consider $d$-dimensional linear stochastic approximation algorithms (LSAs) with a constant step-size and the so called Polyak-Ruppert (PR) averaging of iterates. LSAs are widely applied in machine learning and reinforcement learning (RL), where the aim is to compute an appropriate $θ_{*} \in \mathbb{R}^d$ (that is an optimum or a fixed point) using noisy data and $O(d)$ updates per iteration. In this paper, we are motivated by the problem (in RL) of policy evaluation from experience replay using the \emph{temporal difference} (TD) class of learning algorithms that are also LSAs. For LSAs with a constant step-size, and PR averaging, we provide bounds for the mean squared error (MSE) after $t$ iterations. We assume that data is \iid with finite variance (underlying distribution being $P$) and that the expected dynamics is Hurwitz. For a given LSA with PR averaging, and data distribution $P$ satisfying the said assumptions, we show that there exists a range of constant step-sizes such that its MSE decays as $O(\frac{1}{t})$. We examine the conditions under which a constant step-size can be chosen uniformly for a class of data distributions $\mathcal{P}$, and show that not all data distributions `admit' such a uniform constant step-size. We also suggest a heuristic step-size tuning algorithm to choose a constant step-size of a given LSA for a given data distribution $P$. We compare our results with related work and also discuss the implication of our results in the context of TD algorithms that are LSAs.