arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 21511
1904.08833 2026-06-04 cs.RO cs.SY eess.SY

A Passivity-based Nonlinear Admittance Control with Application to Powered Upper-limb Control under Unknown Environmental Interactions

一种基于被动性的非线性阻抗控制及其在未知环境交互下的动力上肢控制应用

Min Jun Kim, Woongyong Lee, Jae Yeon Choi, Goobong Chung, Kyung-Lyong Han, Il Seop Choi, Christian Ott, Wan Kyun Chung

AI总结 本文提出了一种基于被动性理论的动力上肢外骨骼机器人的阻抗控制器,通过非线性运动方程建模,利用被动性理论将人类操作员和环境交互纳入控制回路,通过力/扭矩传感器与人类交互,通过末端执行器与环境交互,尽管环境交互无法被任何传感器检测到(未知),但被动性允许自然交互。分析表明,当控制增益趋于无穷大时,实际系统的行为与名义模型相似,表明所提出的方法是一种阻抗控制器。然而,由于实际中控制增益无法无限增长,根据可实现的控制增益性能限制也被分析。分析结果表明,按无限范数意义,性能与控制增益成线性关系。在实验中,使用1自由度测试台验证了所提出的方法,并用实际的动力上肢外骨骼设备来提升和操控未知负载。

Comments Accepted in IEEE/ASME Transactions on Mechatronics (T-MECH)

详情
AI中文摘要

本文提出了一种基于被动性理论的动力上肢外骨骼机器人的阻抗控制器,该机器人由非线性运动方程支配。被动性允许我们将人类操作员和环境交互纳入控制回路。机器人通过F/T传感器与人类操作员交互,主要通过末端执行器与环境交互。尽管环境交互无法被任何传感器检测到(因此未知),被动性允许我们实现自然交互。分析表明,当控制增益趋于无穷大时,实际系统的行为与名义模型相似,这表明所提出的方法是一种阻抗控制器。然而,由于实际中控制增益无法无限增长,根据可实现的控制增益的性能限制也被分析。分析结果表明,按无限范数意义,性能与控制增益成线性关系。在实验中,所提出的方法通过1自由度测试台进行了验证,并使用实际的动力上肢外骨骼设备来提升和操控未知负载。

英文摘要

This paper presents an admittance controller based on the passivity theory for a powered upper-limb exoskeleton robot which is governed by the nonlinear equation of motion. Passivity allows us to include a human operator and environmental interaction in the control loop. The robot interacts with the human operator via F/T sensor and interacts with the environment mainly via end-effectors. Although the environmental interaction cannot be detected by any sensors (hence unknown), passivity allows us to have natural interaction. An analysis shows that the behavior of the actual system mimics that of a nominal model as the control gain goes to infinity, which implies that the proposed approach is an admittance controller. However, because the control gain cannot grow infinitely in practice, the performance limitation according to the achievable control gain is also analyzed. The result of this analysis indicates that the performance in the sense of infinite norm increases linearly with the control gain. In the experiments, the proposed properties were verified using 1 degree-of-freedom testbench, and an actual powered upper-limb exoskeleton was used to lift and maneuver the unknown payload.

1904.05814 2026-06-04 cs.CV cs.GR cs.LG cs.NA cs.RO math.NA

Probabilistic Permutation Synchronization using the Riemannian Structure of the Birkhoff Polytope

利用Birkhoff多面体的Riemannian结构的概率排列同步

Tolga Birdal, Umut Şimşekli

AI总结 本文提出了一种新的几何和概率方法,用于在多个对象或图像集合之间同步对应关系。核心方法包括基于Birkhoff-Riemannian L-BFGS优化放松后的循环一致性损失,以及基于Birkhoff-Riemannian Langevin Monte Carlo生成Birkhoff多面体样本并估计解的置信度。

Comments To appear as oral presentation at CVPR 2019. 20 pages including the supplementary material

详情
AI中文摘要

我们提出了一种全新的几何和概率方法,用于在多个对象或图像集合之间同步对应关系。具体而言,我们提出了两个算法:(1) Birkhoff-Riemannian L-BFGS用于以系统化的方式优化放松后的循环一致性损失的松弛版本;(2) Birkhoff-Riemannian Langevin Monte Carlo用于在Birkhoff多面体上生成样本并估计找到的解的置信度。为此,我们首先介绍了最近发展出的Birkhoff多面体的Riemannian几何。接着,我们引入了一种新的概率同步模型,形式为马尔可夫随机场(MRF)。最后,基于一阶retraction算子,我们将问题 formulation 为模拟随机微分方程,并设计了新的积分器。我们在合成和真实数据集上展示,我们能够以更快的收敛速度和可靠的置信度/不确定性估计获得高质量的多图匹配结果。

英文摘要

We present an entirely new geometric and probabilistic approach to synchronization of correspondences across multiple sets of objects or images. In particular, we present two algorithms: (1) Birkhoff-Riemannian L-BFGS for optimizing the relaxed version of the combinatorially intractable cycle consistency loss in a principled manner, (2) Birkhoff-Riemannian Langevin Monte Carlo for generating samples on the Birkhoff Polytope and estimating the confidence of the found solutions. To this end, we first introduce the very recently developed Riemannian geometry of the Birkhoff Polytope. Next, we introduce a new probabilistic synchronization model in the form of a Markov Random Field (MRF). Finally, based on the first order retraction operators, we formulate our problem as simulating a stochastic differential equation and devise new integrators. We show on both synthetic and real datasets that we achieve high quality multi-graph matching results with faster convergence and reliable confidence/uncertainty estimates.

1803.07187 2026-06-04 cs.CV cs.NA eess.IV math.NA

Unveiling the invisible - mathematical methods for restoring and interpreting illuminated manuscripts

揭示无形之物 - 用于修复和解释手稿的数学方法

Luca Calatroni, Marie d'Autume, Rob Hocking, Stella Panayotova, Simone Parisotto, Paola Ricciardi, Carola-Bibiane Schönlieb

AI总结 本文探讨了用于修复和可视化手稿的数学方法,强调了数字图像处理在艺术领域中的应用和重要性。

详情
AI中文摘要

过去五十年来,数学方法在数字图像分析和处理方面的快速发展,主要集中在摄影、生物医学成像和各种工程领域。然而,艺术领域在此过程中大多被忽视,除了最近十年中少数例外。然而,随着艺术领域数字化的迅速兴起,艺术领域对数字图像处理方法的接受度正在增加,因此关注这一点的重要性也随之增加。本文讨论了一系列用于数字图像修复和数字可视化的方法,特别是手稿,因为它们传统上保持物理未受干扰,为数字操作提供了有趣的机会。同时,它们也展示了数学和数字修复作为通用和客观工具包在艺术领域中的可能性。

英文摘要

The last fifty years have seen an impressive development of mathematical methods for the analysis and processing of digital images, mostly in the context of photography, biomedical imaging and various forms of engineering. The arts have been mostly overlooked in this process, apart from a few exceptional works in the last ten years. With the rapid emergence of digitisation in the arts, however, the arts domain is becoming increasingly receptive to digital image processing methods and the importance of paying attention to this therefore increases. In this paper we discuss a range of mathematical methods for digital image restoration and digital visualisation for illuminated manuscripts. The latter provide an interesting opportunity for digital manipulation because they traditionally remain physically untouched. At the same time they also serve as an example for the possibilities mathematics and digital restoration offer as a generic and objective toolkit for the arts.

1903.01032 2026-06-04 cs.LG cs.SY eess.SY stat.ML

A Fundamental Performance Limitation for Adversarial Classification

对抗分类中的基本性能限制

Abed AlRahman Al Makdah, Vaibhav Katewa, Fabio Pasqualetti

AI总结 本文研究了对抗分类中的基本性能限制,指出在优化准确率的过程中,二分类算法不可避免地会变得更加敏感于数据的对抗操纵,并且准确率与敏感度之间的根本权衡曲线仅取决于数据的统计特性,无法通过调整算法来改进。

详情
AI中文摘要

尽管机器学习算法被广泛用于解决技术、经济和社会相关的问题,但对这些数据驱动算法性能的可证明保证却严重不足,尤其是在数据来自不可靠来源并通过未保护和易受攻击的通道传输时。在本文中,我们采取了重要的步骤来弥合这一差距,并正式证明,在试图优化其准确率时,二分类算法——包括基于机器学习技术的算法——不可避免地会变得更加敏感于数据的对抗操纵。进一步地,对于具有相同复杂度(即分类边界数量)的给定算法类,准确率与敏感度之间的根本权衡曲线仅取决于数据的统计特性,无法通过调整算法来改进。

英文摘要

Despite the widespread use of machine learning algorithms to solve problems of technological, economic, and social relevance, provable guarantees on the performance of these data-driven algorithms are critically lacking, especially when the data originates from unreliable sources and is transmitted over unprotected and easily accessible channels. In this paper we take an important step to bridge this gap and formally show that, in a quest to optimize their accuracy, binary classification algorithms -- including those based on machine-learning techniques -- inevitably become more sensitive to adversarial manipulation of the data. Further, for a given class of algorithms with the same complexity (i.e., number of classification boundaries), the fundamental tradeoff curve between accuracy and sensitivity depends solely on the statistics of the data, and cannot be improved by tuning the algorithm.

1903.02219 2026-06-04 cs.RO cs.LG cs.SY eess.SY

Training in Task Space to Speed Up and Guide Reinforcement Learning

在任务空间中训练以加速和引导强化学习

Guillaume Bellegarda, Katie Byl

AI总结 本文提出在任务空间中训练以提高强化学习的效率和稳定性,通过简化高自由度系统模型、利用正逆运动学以及在笛卡尔空间中学习运动策略,从而减少样本复杂度和训练时间。

详情
AI中文摘要

最近强化学习(RL)领域的突破在学习和部署真实世界机器人系统策略方面取得了显著进展。然而,即使使用当前最先进的算法和计算资源,这些算法仍然面临高样本复杂度的问题,导致训练时间长,尤其是对于高自由度(DOF)系统。此外,新兴策略缺乏感知稳定性和鲁棒性保证也引发了担忧。本文旨在通过以下方法缓解这些缺点:(1)用一个代表性简单的模型来建模复杂的高DOF系统;(2)明确使用正逆运动学,而不需要让RL算法自行学习;(3)在笛卡尔空间中学习运动策略,而不是关节空间。本文将这些方法应用于JPL的Robosimian,但可以轻松应用于任何具有基座和末端执行器的系统。这些运动策略可以在几分钟内生成,并在单台笔记本电脑上训练。我们比较了所学策略的鲁棒性与其他控制方法的鲁棒性。本文的配套视频可在https://youtu.be/xDxxSw5ahnc找到。

英文摘要

Recent breakthroughs in the reinforcement learning (RL) community have made significant advances towards learning and deploying policies on real world robotic systems. However, even with the current state-of-the-art algorithms and computational resources, these algorithms are still plagued with high sample complexity, and thus long training times, especially for high degree of freedom (DOF) systems. There are also concerns arising from lack of perceived stability or robustness guarantees from emerging policies. This paper aims at mitigating these drawbacks by: (1) modeling a complex, high DOF system with a representative simple one, (2) making explicit use of forward and inverse kinematics without forcing the RL algorithm to "learn" them on its own, and (3) learning locomotion policies in Cartesian space instead of joint space. In this paper these methods are applied to JPL's Robosimian, but can be readily used on any system with a base and end effector(s). These locomotion policies can be produced in just a few minutes, trained on a single laptop. We compare the robustness of the resulting learned policies to those of other control methods. An accompanying video for this paper can be found at https://youtu.be/xDxxSw5ahnc .

1903.00927 2026-06-04 cs.RO cs.SY eess.SY

Topological Information-Theoretic Belief Space Planning with Optimality Guarantees

具有最优性保证的拓扑信息论信念空间规划

Andrej Kitanov, Vadim Indelman

AI总结 本文提出了一种高效确定t-bsp误差界限的方法,从而为该方法提供全局最优性保证或解的不确定性边际,该方法基于信息论BSP的最优解和之前引入的拓扑度量。

详情
AI中文摘要

在高维状态空间中确定信念空间规划(BSP)的全局最优解计算成本很高,因为它需要对每个候选动作进行信念传播和目标函数评估。我们最近引入的拓扑信念空间规划t-bsp则仅考虑因子图的拓扑结构来做出决策。在本文中,我们为这一领域贡献了一种新的方法,用于高效确定t-bsp的误差界限,从而提供全局最优性保证或解的不确定性边际。这些界限是基于信息论BSP的最优解,并考虑了之前引入的拓扑度量,该度量基于生成树的数量。在现实和合成模拟中,我们分析了这些界限的紧致性,并展示了该度量如何与另一种计算上更高效的t-bsp度量紧密相关,即图的von Neumann熵近似值,后者可以实现在线性能。

英文摘要

Determining a globally optimal solution of belief space planning (BSP) in high-dimensional state spaces is computationally expensive, as it involves belief propagation and objective function evaluation for each candidate action. Our recently introduced topological belief space planning t-bsp instead performs decision making considering only topologies of factor graphs that correspond to posterior future beliefs. In this paper we contribute to this body of work a novel method for efficiently determining error bounds of t-bsp, thereby providing global optimality guarantees or uncertainty margin of its solution. The bounds are given with respect to an optimal solution of information theoretic BSP considering the previously introduced topological metric which is based on the number of spanning trees. In realistic and synthetic simulations, we analyze tightness of these bounds and show empirically how this metric is closely related to another computationally more efficient t-bsp metric, an approximation of the von Neumann entropy of a graph, which can achieve online performance.

1902.09626 2026-06-04 cs.RO cs.LG cs.SY eess.SY

Learning Extreme Hummingbird Maneuvers on Flapping Wing Robots

在扑翼机器人上学习极端蜂鸟动作

Fan Fei, Zhan Tu, Jian Zhang, Xinyan Deng

AI总结 研究通过模仿蜂鸟的极端机动动作,开发了一种混合控制策略,利用模型驱动的非线性控制和模型无关的强化学习,实现了在12克仿生蜂鸟机器人上实现快速逃避机动。

Comments 6 pages, accepted at ICRA 2019

详情
AI中文摘要

生物学研究表明,蜂鸟在快速逃避时可以执行极端空战动作。在悬停时突然出现的视觉刺激下,蜂鸟会启动快速的后退平移并伴随180度的偏转,随后在不到10次振翅之间完成瞬间姿态稳定。考虑到振翅频率为40Hz,这种激进的动作仅在0.2秒内完成。受蜂鸟在这些极端动作中接近最大性能的启发,我们开发了一种飞行控制系统,并实验表明,这种机动性可通过配备两个执行器的12克仿生蜂鸟机器人实现。所提出的混合控制策略结合了基于模型的非线性控制和无模型强化学习。我们使用基于模型的非线性控制进行正常飞行控制,因为这些条件下的动态模型相对准确。然而,在极端机动中,建模误差变得无法控制。通过在仿真中训练的无模型强化学习策略被优化以'破坏'系统并最大化机动期间的性能。混合策略表现出接近蜂鸟观察到的机动动作。直接仿真到现实的转移得以实现,证明了仿生蜂鸟机器人上蜂鸟式的快速逃避机动。

英文摘要

Biological studies show that hummingbirds can perform extreme aerobatic maneuvers during fast escape. Given a sudden looming visual stimulus at hover, a hummingbird initiates a fast backward translation coupled with a 180-degree yaw turn, which is followed by instant posture stabilization in just under 10 wingbeats. Consider the wingbeat frequency of 40Hz, this aggressive maneuver is carried out in just 0.2 seconds. Inspired by the hummingbirds' near-maximal performance during such extreme maneuvers, we developed a flight control strategy and experimentally demonstrated that such maneuverability can be achieved by an at-scale 12-gram hummingbird robot equipped with just two actuators. The proposed hybrid control policy combines model-based nonlinear control with model-free reinforcement learning. We use model-based nonlinear control for nominal flight control, as the dynamic model is relatively accurate for these conditions. However, during extreme maneuver, the modeling error becomes unmanageable. A model-free reinforcement learning policy trained in simulation was optimized to 'destabilize' the system and maximize the performance during maneuvering. The hybrid policy manifests a maneuver that is close to that observed in hummingbirds. Direct simulation-to-real transfer is achieved, demonstrating the hummingbird-like fast evasive maneuvers on the at-scale hummingbird robot.

1710.11088 2026-06-04 cs.RO cs.SY eess.SY

Robust Cooperative Manipulation without Force/Torque Measurements: Control Design and Experiments

无需力/扭矩测量的鲁棒协同操作:控制设计与实验

Christos K. Verginis, Matteo Mastellaro, Dimos V. Dimarogonas

AI总结 本文提出两种新型控制方法,用于由N个机器人代理协同操作物体,通过四元数反馈避免表示奇异,同时保证物体轨迹的暂态和稳态性能,且具有去中心化和抗扰动特性,无需力/扭矩测量,并通过仿真和实验验证理论结果。

详情
AI中文摘要

本文提出了两种新颖的控制方法,用于由N个机器人代理协同操作物体。首先,我们设计了一种自适应控制协议,利用四元数反馈来避免潜在的表示奇点。其次,我们提出了一种保证物体轨迹预定义暂态和稳态性能的控制协议。两种方法都是去中心化的,因为代理可以自行计算信号而不进行相互通信,同时对外部扰动和模型不确定性具有鲁棒性。此外,我们考虑抓取点是刚性的,从而避免了对力/扭矩测量的需求。通过抓握矩阵伪逆来考虑代理之间可能的功率能力差异,以实现负载分布。最后,通过两个机器人臂的仿真和实验结果验证了理论发现。

英文摘要

This paper presents two novel control methodologies for the cooperative manipulation of an object by N robotic agents. Firstly, we design an adaptive control protocol which employs quaternion feedback for the object orientation to avoid potential representation singularities. Secondly, we propose a control protocol that guarantees predefined transient and steady-state performance for the object trajectory. Both methodologies are decentralized, since the agents calculate their own signals without communicating with each other, as well as robust to external disturbances and model uncertainties. Moreover, we consider that the grasping points are rigid, and avoid the need for force/torque measurements. Load distribution is also included via a grasp matrix pseudo-inverse to account for potential differences in the agents' power capabilities. Finally, simulation and experimental results with two robotic arms verify the theoretical findings.

1711.04178 2026-06-04 cs.LG cs.CV cs.NA math.NA stat.ML

CUR Decompositions, Similarity Matrices, and Subspace Clustering

CUR分解、相似矩阵与子空间聚类

Akram Aldroubi, Keaton Hamm, Ahmet Bugra Koku, Ali Sekmen

AI总结 本文提出了一种利用CUR分解解决子空间聚类问题的通用框架,通过构造相似矩阵实现无噪声情况下的精确聚类,并展示了如何通过CUR分解生成多种相似矩阵以处理噪声数据,同时推导出两种已知的子空间聚类方法。

Comments Approximately 30 pages. Current version contains improved algorithm and numerical experiments from the previous version

详情
AI中文摘要

本文提出了一种利用CUR分解解决子空间聚类问题的通用框架。CUR分解提供了一种自然方法来构造数据来自未知子空间联盟$\mathscr{U}=\underset{i=1}{\overset{M}\bigcup}S_i$的相似矩阵。由此构造的相似矩阵在无噪声情况下能够实现精确聚类。此外,这种分解还能从给定数据集生成多种不同的相似矩阵,从而具有足够的灵活性以对含噪声数据进行准确聚类。我们还展示了两种已知的子空间聚类方法可以从CUR分解中推导出来。本文还提出了一种基于相似矩阵理论构造的算法,并在合成和真实数据上进行了实验以测试该方法。此外,本文还利用了基于CUR的相似矩阵的改进版本,提供了一种启发式算法用于子空间聚类;该算法在Hopkins155运动分割数据集上的聚类性能目前最佳。

英文摘要

A general framework for solving the subspace clustering problem using the CUR decomposition is presented. The CUR decomposition provides a natural way to construct similarity matrices for data that come from a union of unknown subspaces $\mathscr{U}=\underset{i=1}{\overset{M}\bigcup}S_i$. The similarity matrices thus constructed give the exact clustering in the noise-free case. Additionally, this decomposition gives rise to many distinct similarity matrices from a given set of data, which allow enough flexibility to perform accurate clustering of noisy data. We also show that two known methods for subspace clustering can be derived from the CUR decomposition. An algorithm based on the theoretical construction of similarity matrices is presented, and experiments on synthetic and real data are presented to test the method. Additionally, an adaptation of our CUR based similarity matrices is utilized to provide a heuristic algorithm for subspace clustering; this algorithm yields the best overall performance to date for clustering the Hopkins155 motion segmentation dataset.

1811.11573 2026-06-04 cs.RO cs.SY eess.SY

Distributed Impedance Control of Latency-Prone Robotic Systems with Series Elastic Actuation

具有串联弹性执行器的延迟敏感机器人系统的分布式阻抗控制

Ye Zhao, Luis Sentis

AI总结 本文研究了具有串联弹性执行器的延迟敏感机器人系统的分布式阻抗控制问题,提出了一种关键阻尼增益设计方法,用于优化SEA级联控制架构的阻抗控制器设计,并通过频率域方法分析了时间延迟、滤波和负载惯量对SEA阻抗性能的影响。

Comments 24 pages, 16 figures. arXiv admin note: text overlap with arXiv:1501.02854

详情
AI中文摘要

机器人系统越来越多地依赖分布式反馈控制器来解决复杂且延迟敏感的传感和决策问题。这些需求带来了计算负担的增加,从而导致更大的控制器延迟。为了最大化对机械扰动的鲁棒性和实现高性能控制,我们强调需要在靠近控制对象的位置执行阻尼反馈,并在延迟敏感的集中式控制过程中分配刚度反馈。此外,串联弹性执行器(SEAs)近年来在力控机器人中变得越来越普遍,以实现与环境和人类的顺应性交互。然而,设计最优的阻抗控制器和表征具有时间延迟和滤波的SEAs的阻抗性能仍然是未充分研究的问题。本文通过设计一种关键阻尼增益设计方法,针对一类SEA级联控制架构(由外阻抗和内扭矩反馈环组成)解决最优控制器设计问题。通过所提出的控制器设计准则,我们采用频域方法对时间延迟、滤波和负载惯量对SEA阻抗性能的影响进行了深入分析。这些结果通过在高性能执行器和 omnidirectional 移动基座上的分析、仿真和实验测试进一步验证。

英文摘要

Robotic systems are increasingly relying on distributed feedback controllers to tackle complex and latency-prone sensing and decision problems. These demands come at the cost of a growing computational burden and, as a result, larger controller latencies. To maximize robustness to mechanical disturbances and achieve high control performance, we emphasize the necessity for executing damping feedback in close proximity to the control plant while allocating stiffness feedback in a latency-prone centralized control process. Additionally, series elastic actuators (SEAs) are becoming prevalent in torque-controlled robots during recent years to achieve compliant interactions with environments and humans. However, designing optimal impedance controllers and characterizing impedance performance for SEAs with time delays and filtering are still under-explored problems. The presented study addresses the optimal controller design problem by devising a critically-damped gain design method for a class of SEA cascaded control architectures, which is composed of outer-impedance and inner-torque feedback loops. Via the proposed controller design criterion, we adopt frequency-domain methods to thoroughly analyze the effects of time delays, filtering and load inertia on SEA impedance performance. These results are further validated through the analysis, simulation, and experimental testing on high-performance actuators and on an omnidirectional mobile base.

1803.11247 2026-06-04 cs.RO cs.SY eess.SY

Scalable Integrated Task and Motion Planning from Signal Temporal Logic Specifications

可扩展的信号时序逻辑规范下的集成任务和运动规划

Rafael Rodrigues da Silva, Hai Lin

AI总结 本文提出了一种可扩展且可证明完整的算法,直接合成连续轨迹以满足非凸的信号时序逻辑规范,通过分离离散任务规划和连续运动规划,并利用高效的求解器找到高维系统中满足非凸STL规范的动态可行轨迹。

Comments 13 pages, report

详情
AI中文摘要

许多安全关键系统必须实现高阶任务规范,并保证安全性和正确性。为实现这一目标,最近通过从信号时序逻辑(STL)规范中合成控制器取得了许多进展。然而,现有方法要么考虑了状态空间的先验离散化,要么只处理STL的凸片段,或无法证明是完整的。我们提出了一种可扩展、可证明完整的算法,直接合成连续轨迹以满足非凸STL规范。我们在线分离离散任务规划和连续运动规划,并利用高效的可满足性模理论(SMT)和线性规划(LP)求解器,为高维系统找到满足非凸STL规范的动态可行轨迹。所提出的设计算法已被证明是正确且完整的,仿真结果展示了我们方法的可扩展性。

英文摘要

Many safety-critical systems must achieve high-level task specifications with guaranteed safety and correctness. Much recent progress towards this goal has been made through controller synthesis from signal temporal logic (STL) specifications. Existing approaches, however, either consider some a priori discretization of the state-space, deal only with a convex fragment of STL, or are not provably complete. We propose a scalable, provably complete algorithm that directly synthesizes continuous trajectories to satisfy non-convex STL specifications. We separate discrete task planning and continuous motion planning on the fly and harness highly efficient satisfiability modulo theories (SMT) and linear programming (LP) solvers to find dynamically feasible trajectories for high dimensional systems that satisfies non-convex STL specifications. The proposed design algorithms are proved sound and complete, and simulation results demonstrate the scalability of our approach.

1809.10012 2026-06-04 cs.RO cs.LG cs.SY eess.SY

Using Neural Networks to Generate Information Maps for Mobile Sensors

用神经网络为移动传感器生成信息图

Louis Dressel, Mykel J. Kochenderfer

AI总结 本文提出利用卷积神经网络实时生成移动传感器的信息图,以提高轨迹生成的效率和准确性。

Comments Accepted to the 2018 IEEE Conference on Decision and Control (CDC)

详情
AI中文摘要

目标定位是移动传感器的关键任务,具有多种应用。然而,为这些传感器生成信息丰富的轨迹是一个具有挑战性的问题。一种常用方法是使用信息图来估计在传感器状态空间中的任意点进行测量的价值。这些信息图用于生成轨迹;例如,轨迹可能被设计成其测量分布与信息图的分布匹配。无论轨迹生成方法如何,生成信息图作为新观察结果出现是至关重要的。然而,在实时计算这些图可能会有挑战。我们提出使用卷积神经网络从目标估计和传感器模型中实时生成信息图。模拟显示,生成的图准确且计算时间减少了多个数量级。

英文摘要

Target localization is a critical task for mobile sensors and has many applications. However, generating informative trajectories for these sensors is a challenging research problem. A common method uses information maps that estimate the value of taking measurements from any point in the sensor state space. These information maps are used to generate trajectories; for example, a trajectory might be designed so its distribution of measurements matches the distribution of the information map. Regardless of the trajectory generation method, generating information maps as new observations are made is critical. However, it can be challenging to compute these maps in real-time. We propose using convolutional neural networks to generate information maps from a target estimate and sensor model in real-time. Simulations show that maps are accurately rendered while offering orders of magnitude reduction in computation time.

1809.03826 2026-06-04 cs.RO cs.SY eess.SY

Real-time force control of an SEA-based body weight support unit with the 2-DOF control structure

基于SEA的体位支撑装置的实时力控研究:采用2自由度控制结构

Yubo Sun, Yuqi Lei, Wulin Zou, Jianmin Li, Ningbo Yu

AI总结 本文提出一种基于SEA的体位支撑装置,采用2自由度控制结构实现实时力控,通过仿真和实验验证了其在康复中的有效性。

Comments In proceedings of the IEEE International Conference on Real-time Computing and Robotics 2018

详情
AI中文摘要

体位支撑(BWS)是康复领域的重要技术。随着康复科学与工程的快速发展,BWS正迅速发展并受到广泛关注。本文构建了一种新型重力卸载系统,允许患者在三维笛卡尔空间内自由移动并受到重力支撑。因此,对于患有神经损伤(如中风或脊髓损伤)的患者,可以将残余运动控制能力专注于平衡和行走的治疗训练。实时力控性能对BWS单元提供合适支撑并避免干扰至关重要。在本文中,我们重新设计了BWS单元,采用一系列弹性执行机构结构以提高人机交互性能。进一步,采用了2自由度(2-DOF)控制方法以实现精确且鲁棒的BWS力控。仿真和实验结果验证了BWS设计和实时控制方法的有效性。

英文摘要

Body weight support (BWS) is a fundamental technique in rehabilitation. Along with the dramatic progressing of rehabilitation science and engineering, BWS is quickly evolving with new initiatives and has attracted deep research effort in recent years. We have built up a novel gravity offloading system, in which the patient is allowed to move freely in the three-dimensional Cartesian space and receives support against gravity. Thus, the patients, especially for those that suffer from neurological injury such as stroke or spinal cord injury, can focus their residual motor control capabilities on essential therapeutic trainings of balance and gait. The real-time force control performance is critical for the BWS unit to provide suitable support and avoid disturbance. In this work, we have re-designed our BWS unit with a series elastic actuation structure to improve the human-robot interaction performance. Further, the 2 degrees of freedom (2-DOF) control approach was taken for accurate and robust BWS force control. Both simulation and experimental results have validated the efficacy of the BWS design and real-time control methods.

1809.01353 2026-06-04 cs.LG cs.NA math.NA stat.ML

IKA: Independent Kernel Approximator

IKA:独立核近似器

Matteo Ronchetti

AI总结 本文提出IKA方法,通过线性组合任意选择的函数进行低秩核近似,优于Nyström方法,在STL-10数据集上表现更优。

详情
AI中文摘要

本文提出IKA方法,通过线性组合任意选择的函数进行低秩核近似,优于Nyström方法,在STL-10数据集上表现更优。

英文摘要

This paper describes a new method for low rank kernel approximation called IKA. The main advantage of IKA is that it produces a function $ψ(x)$ defined as a linear combination of arbitrarily chosen functions. In contrast the approximation produced by Nyström method is a linear combination of kernel evaluations. The proposed method consistently outperformed Nyström method in a comparison on the STL-10 dataset. Numerical results are reproducible using the source code available at https://gitlab.com/matteo-ronchetti/IKA

1808.00869 2026-06-04 cs.RO cs.SY eess.SY

Developing Robot Driver Etiquette Based on Naturalistic Human Driving Behavior

基于自然人类驾驶行为发展机器人驾驶礼仪

Xianan Huang, Songan Zhang, Huei Peng

AI总结 本文研究机器人驾驶礼仪问题,通过分析自然驾驶数据库提取人类驾驶行为关键参数,为未来高自动化车辆算法设计和仿真中的人类驾驶行为建模提供指导。

详情
AI中文摘要

自动化车辆可通过提升安全、出行和燃油效率改变社会。然而,由于成本较高和商业模式变化,未来几十年内,高度自动化车辆很可能继续与人类驾驶车辆交互。过去,高度自动化(机器人)车辆的控制/设计主要考虑安全和效率,但未能解决周围人类驾驶车辆的“驾驶文化”问题。因此,机器人车辆可能表现出与其他车辆截然不同的行为。本文研究这一“驾驶礼仪”问题。作为第一步,我们报告了从大规模自然驾驶数据库中提取的人类驾驶车辆的关键行为参数。这些结果可用于指导未来高自动化车辆的算法设计,或在仿真中开发现实的人类驾驶车辆行为模型。

英文摘要

Automated vehicles can change the society by improved safety, mobility and fuel efficiency. However, due to the higher cost and change in business model, over the coming decades, the highly automated vehicles likely will continue to interact with many human-driven vehicles. In the past, the control/design of the highly automated (robotic) vehicles mainly considers safety and efficiency but failed to address the "driving culture" of surrounding human-driven vehicles. Thus, the robotic vehicles may demonstrate behaviors very different from other vehicles. We study this "driving etiquette" problem in this paper. As the first step, we report the key behavior parameters of human driven vehicles derived from a large naturalistic driving database. The results can be used to guide future algorithm design of highly automated vehicles or to develop realistic human-driven vehicle behavior model in simulations.

1807.09905 2026-06-04 cs.RO cs.SY eess.SY

Toward Efficient and Robust Biped Walking Optimization

迈向高效且稳健的双足行走优化

Nihar Talele, Katie Byl

AI总结 本文研究双足机器人行走的高效与稳健优化,探讨步态的能耗与鲁棒性量化,以及运动轨迹与机器人参数的联合优化,通过五连杆平面行走模型验证效率与稳健性的平衡。

详情
AI中文摘要

实用的双足机器人运动需要同时具备能量效率和对变化与不确定性的鲁棒性。本文基于最近的轨迹优化研究,有两个主要目标:首先,展示考虑并量化步态的能耗与鲁棒性的重要性,以及优化名义运动轨迹和机器人设计参数及反馈控制策略的重要性;其次,通过五连杆平面行走模型的优化研究,提供关于提高效率与鲁棒性之间权衡和一般趋势的初步数据。在解决重要的开放性挑战时,特别讨论了在选择始终是近似优化的指标以及在结构和调节反馈控制时的选择影响。

英文摘要

Practical bipedal robot locomotion needs to be both energy efficient and robust to variability and uncertainty. In this paper, we build upon recent works in trajectory optimization for robot locomotion with two primary goals. First, we wish to demonstrate the importance of (a) considering and quantifying not only energy efficiency but also robustness of gaits, and (b) optimization not only of nominal motion trajectories but also of robot design parameters and feedback control policies. As a second, complementary focus, we present results from optimization studies on a 5-link planar walking model, to provide preliminary data on particular trade-offs and general trends in improving efficiency versus robustness. In addressing important, open challenges, we focus in particular on discussions of the effects of choices made (a) in formulating what is always, necessarily only an approximate optimization, in choosing metrics for performance, and (b) in structuring and tuning feedback control.

1709.04940 2026-06-04 cs.RO cs.SY eess.SY

A Robust Model Predictive Control Approach for Autonomous Underwater Vehicles Operating in a Constrained workspace

一种用于在受限工作空间中运行的自主水下机器人鲁棒模型预测控制方法

Shahab Heshmati-alamdari, George C. Karras, Panos Marantos, Kostas J. Kyriakopoulos

AI总结 本文提出了一种新型非线性模型预测控制方案,用于在存在静态障碍物的受限工作空间中引导水下机器人到达特定路径点,通过考虑障碍物、工作空间边界、推进器饱和度和预定义的车辆速度上限等约束条件,提高控制性能。

Comments IEEE International Conference on Robotics and Automation (ICRA-2018), Accepted

详情
AI中文摘要

本文提出了一种新型非线性模型预测控制(NMPC)方案,用于在存在静态障碍物的受限工作空间中运行的水下机器人。控制器的目标是引导车辆朝着特定路径点前进。各种限制,如障碍物、工作空间边界、推进器饱和度和预定义的车辆速度上限,被作为状态和输入约束,并在控制设计中得到保证。所提出的方案结合了车辆的全部动力学特性,包括海洋 currents。因此,所提出方案计算出的控制输入以一种方式制定,使车辆能够利用有利的海洋 currents,从而减少推进器的能耗。所提出控制策略的性能通过在带有障碍物的受限测试水槽中使用4自由度水下机器人进行实验验证。

英文摘要

This paper presents a novel Nonlinear Model Predictive Control (NMPC) scheme for underwater robotic vehicles operating in a constrained workspace including static obstacles. The purpose of the controller is to guide the vehicle towards specific way points. Various limitations such as: obstacles, workspace boundary, thruster saturation and predefined desired upper bound of the vehicle velocity are captured as state and input constraints and are guaranteed during the control design. The proposed scheme incorporates the full dynamics of the vehicle in which the ocean currents are also involved. Hence, the control inputs calculated by the proposed scheme are formulated in a way that the vehicle will exploit the ocean currents, when these are in favor of the way-point tracking mission which results in reduced energy consumption by the thrusters. The performance of the proposed control strategy is experimentally verified using a $4$ Degrees of Freedom (DoF) underwater robotic vehicle inside a constrained test tank with obstacles.

1806.03145 2026-06-04 cs.LG cs.SY eess.SY stat.ML

Fidelity-based Probabilistic Q-learning for Control of Quantum Systems

基于保真度的概率Q学习用于量子系统的控制

Chunlin Chen, Daoyi Dong, Han-Xiong Li, Jian Chu, Tzyh-Jong Tarn

AI总结 本文提出基于保真度的概率Q学习方法,用于解决强化学习中探索与利用的平衡问题,并应用于量子系统控制,通过迭代更新动作概率实现自然探索策略,提升学习效率。

Comments 13 pages, 16 figures

详情
Journal ref
IEEE Transactions on Neural Networks and Learning Systems, VOL. 25, NO. 5, pp.920-933, MAY 2014
AI中文摘要

在强化学习中,探索与利用的平衡是一个关键问题,尤其是对于Q学习。本文提出一种基于保真度的概率Q学习(FPQL)方法,以自然解决此问题并应用于量子系统控制。该方法利用保真度指导学习过程,迭代更新每个状态的动作概率,从而实现自然探索策略而非依赖配置参数的尖锐策略。首先提出概率Q学习(PQL)算法以展示概率动作选择的基本思想,随后针对量子系统控制提出FPQL算法。通过两个例子(自旋-1/2系统和λ型原子系统)测试FPQL算法的性能。结果表明,FPQL算法在探索与利用之间取得更好的平衡,能够避免局部最优策略并加速学习过程。

英文摘要

The balance between exploration and exploitation is a key problem for reinforcement learning methods, especially for Q-learning. In this paper, a fidelity-based probabilistic Q-learning (FPQL) approach is presented to naturally solve this problem and applied for learning control of quantum systems. In this approach, fidelity is adopted to help direct the learning process and the probability of each action to be selected at a certain state is updated iteratively along with the learning process, which leads to a natural exploration strategy instead of a pointed one with configured parameters. A probabilistic Q-learning (PQL) algorithm is first presented to demonstrate the basic idea of probabilistic action selection. Then the FPQL algorithm is presented for learning control of quantum systems. Two examples (a spin- 1/2 system and a lamda-type atomic system) are demonstrated to test the performance of the FPQL algorithm. The results show that FPQL algorithms attain a better balance between exploration and exploitation, and can also avoid local optimal policies and accelerate the learning process.

1806.00727 2026-06-04 cs.RO cs.AI cs.SY eess.SY

Closed-loop Bayesian Semantic Data Fusion for Collaborative Human-Autonomy Target Search

闭环贝叶斯语义数据融合用于协同人机目标搜索

Luke Burks, Ian Loefgren, Luke Barbier, Jeremy Muesing, Jamison McGinley, Sousheel Vunnam, Nisar Ahmed

AI总结 本文提出一种闭环贝叶斯语义数据融合方法,通过CPOMDP规划生成最优轨迹,结合不完美传感器数据和人类提供的语义观察,提升动态目标搜索效率。

Comments Final version accepted and submitted to 2018 FUSION Conference (Cambridge, UK, July 2018)

详情
AI中文摘要

在搜索应用中,自主无人车辆必须能够高效重新获取和定位长时间可能处于视线外的大空间中移动目标。为此,本文开发并验证了一种新的协同人机感知解决方案。我们的方法利用连续部分可观测马尔可夫决策过程(CPOMDP)规划,生成最优利用不完美传感器数据和可请求的语义自然语言观察的车辆轨迹。关键创新是可扩展的层次高斯混合模型形式,用于在连续动态状态空间中高效求解包含语义观察的CPOMDPs。该方法在定制测试平台上通过真实的人机团队在动态室内目标搜索和捕捉场景中进行了演示和验证。

英文摘要

In search applications, autonomous unmanned vehicles must be able to efficiently reacquire and localize mobile targets that can remain out of view for long periods of time in large spaces. As such, all available information sources must be actively leveraged -- including imprecise but readily available semantic observations provided by humans. To achieve this, this work develops and validates a novel collaborative human-machine sensing solution for dynamic target search. Our approach uses continuous partially observable Markov decision process (CPOMDP) planning to generate vehicle trajectories that optimally exploit imperfect detection data from onboard sensors, as well as semantic natural language observations that can be specifically requested from human sensors. The key innovation is a scalable hierarchical Gaussian mixture model formulation for efficiently solving CPOMDPs with semantic observations in continuous dynamic state spaces. The approach is demonstrated and validated with a real human-robot team engaged in dynamic indoor target search and capture scenarios on a custom testbed.

1801.02686 2026-06-04 cs.CV cs.SY eess.SY

Towards Multi-Object Detection and Tracking in Urban Scenario under Uncertainties

面向城市场景下的多目标检测与跟踪在不确定性中的研究

Achim Kampker, Mohsen Sefati, Arya Abdul Rachman, Kai Kreisköther, Pascual Campoy

AI总结 本文提出一种实时框架,结合3D激光雷达的遮挡感知检测与高效启发式过滤,以应对城市环境中传感器限制和目标运动复杂性带来的不确定性,实现高效的多目标跟踪。

Comments Some significant editorial/editing issues are found upon review. Paper will undergo language re-proofing before resubmitted

详情
Journal ref
4th.VEHITS.Proc. 109 (2018) 156-167
AI中文摘要

面向自动驾驶车辆的城市应用需要可靠的感知技术来应对高不确定性。最近引入的紧凑型3D激光雷达传感器提供了周围空间信息,可用于增强车辆感知。我们提出了一种实时集成框架,用于使用3D激光雷达的多目标检测和跟踪,旨在城市使用。我们的方法结合了遮挡感知的检测方法,计算高效的启发式规则过滤和自适应概率跟踪,以处理3D激光雷达的传感限制和目标运动复杂性带来的不确定性。使用真实世界预录制的3D激光雷达数据进行评估并与最新作品进行比较的结果表明,我们的框架能够在城市环境中实现有希望的跟踪性能。

英文摘要

Urban-oriented autonomous vehicles require a reliable perception technology to tackle the high amount of uncertainties. The recently introduced compact 3D LIDAR sensor offers a surround spatial information that can be exploited to enhance the vehicle perception. We present a real-time integrated framework of multi-target object detection and tracking using 3D LIDAR geared toward urban use. Our approach combines sensor occlusion-aware detection method with computationally efficient heuristics rule-based filtering and adaptive probabilistic tracking to handle uncertainties arising from sensing limitation of 3D LIDAR and complexity of the target object movement. The evaluation results using real-world pre-recorded 3D LIDAR data and comparison with state-of-the-art works shows that our framework is capable of achieving promising tracking performance in the urban situation.

1803.08137 2026-06-04 cs.CV cs.AI cs.NA math.NA stat.ML

Robust Blind Deconvolution via Mirror Descent

通过镜像下降实现鲁棒盲去卷积

Sathya N. Ravi, Ronak Mehta, Vikas Singh

AI总结 本文研究盲去卷积的鲁棒性和收敛性,提出一种具有理论保证的算法,在实践中表现优异。

详情
AI中文摘要

我们重新审视盲去卷积问题,重点在于理解其鲁棒性和收敛性属性。可证明的鲁棒性对噪声和其他扰动的容忍能力最近在视觉领域受到关注,从获得对抗攻击的免疫性到评估和描述关键任务应用中算法的失败模式。此外,许多基于深度架构的盲去卷积方法内部使用或优化基本公式,因此更清楚地理解该子模块的行为、何时可以求解以及它可以容忍多少噪声注入是首要要求。我们推导了盲去卷积理论基础的新见解。出现的算法具有良好的收敛保证,并在我们论文中正式定义的意义上被证明是鲁棒的。有趣的是,这些技术结果在实践中表现非常出色,其中在标准数据集上,我们的算法结果与或优于现有最先进方法。关键词:盲去卷积,鲁棒连续优化

英文摘要

We revisit the Blind Deconvolution problem with a focus on understanding its robustness and convergence properties. Provable robustness to noise and other perturbations is receiving recent interest in vision, from obtaining immunity to adversarial attacks to assessing and describing failure modes of algorithms in mission critical applications. Further, many blind deconvolution methods based on deep architectures internally make use of or optimize the basic formulation, so a clearer understanding of how this sub-module behaves, when it can be solved, and what noise injection it can tolerate is a first order requirement. We derive new insights into the theoretical underpinnings of blind deconvolution. The algorithm that emerges has nice convergence guarantees and is provably robust in a sense we formalize in the paper. Interestingly, these technical results play out very well in practice, where on standard datasets our algorithm yields results competitive with or superior to the state of the art. Keywords: blind deconvolution, robust continuous optimization

1802.05730 2026-06-04 cs.RO cs.SY eess.SY

Pedestrian-Robot Interaction Experiments in an Exit Corridor

行人-机器人在出口走廊中的交互实验

Zhuo Chen, Chao Jiang, Yi Guo

AI总结 本文通过实验研究机器人与行人交互对群体流动的影响,发现机器人运动减缓行人速度,为未来设计机器人辅助疏散算法提供指导。

Comments Submitted to the 15th International Conference on Ubiquitous Robots, Honolulu, 2018

详情
AI中文摘要

人类-机器人交互(HRI)的研究因机器人在人群中的导航问题而受到越来越多的关注。本文提出了在单向出口走廊中进行行人-机器人交互的实证研究。我们部署了一个沿行人流动方向垂直移动的移动机器人,并安装了行人运动跟踪系统以记录集体运动。我们分析了行人个体和集体运动,并测量了机器人运动对整体行人流动的影响。实验结果表明,被动HRI的效果是,当有机器人存在时,行人整体速度减慢,机器人运动越快,平均行人速度越低。实验结果显示出集体HRI效果与之前报告的模拟结果在定性上一致。本研究可用于指导未来机器人辅助行人疏散算法的设计。

英文摘要

The study of human-robot interaction (HRI) has received increasing research attention for robot navigation in pedestrian crowds. In this paper, we present empirical study of pedestrian-robot interaction in an uni-directional exit corridor. We deploy a mobile robot moving in a direction perpendicular to that of the pedestrian flow, and install a pedestrian motion tracking system to record the collective motion. We analyze both individual and collective motion of pedestrians, and measure the effect of the robot motion on the overall pedestrian flow. The experimental results show the effect of passive HRI, where the pedestrians' overall speed is slowed down in the presence of the robot, and the faster the robot moves, the lower the average pedestrian velocity becomes. Experiment results show qualitative consistency of the collective HRI effect with simulation results that was previously reported. The study can be used to guide future design of robot-assisted pedestrian evacuation algorithms.

1801.07229 2026-06-04 cs.AI cs.SY eess.SY

Combinatorial framework for planning in geological exploration

地质勘探规划的组合框架

Mark Sh. Levin

AI总结 本文提出了一种用于油气田地质勘探规划的组合框架,通过多属性评估、层次化设计和区域整合,优化勘探方案。

Comments 14 pages, 15 figures, 11 tables

详情
AI中文摘要

本文描述了一种用于油气田地质勘探规划的组合框架。该框架包括构建四层树状模型、生成局部设计替代方案、多属性评估、层次化设计、区域整合以及计划聚合。第二至第五阶段基于层次化多属性形态学设计方法,第六阶段基于检测替代方案的'核心'并扩展其元素。替代方案的评估基于专家判断,并通过亚姆拉半岛的数值示例进行了说明。

英文摘要

The paper describes combinatorial framework for planning of geological exploration for oil-gas fields. The suggested scheme of the geological exploration involves the following stages: (1) building of special 4-layer tree-like model (layer of geological exploration): productive layer, group of productive layers, oil-gas field, oil-gas region (or group of the fields); (2) generations of local design (exploration) alternatives for each low-layer geological objects: conservation, additional search, independent utilization, joint utilization; (3) multicriteria (i.e., multi-attribute) assessment of the design (exploration) alternatives and their interrelation (compatibility) and mapping if the obtained vector estimates into integrated ordinal scale; (4) hierarchical design ('bottom-up') of composite exploration plans for each oil-gas field; (5) integration of the plans into region plans and (6) aggregation of the region plans into a general exploration plan. Stages 2, 3, 4, and 5 are based on hierarchical multicriteria morphological design (HMMD) method (assessment of ranking of alternatives, selection and composition of alternatives into composite alternatives). The composition problem is based on morphological clique model. Aggregation of the obtained modular alternatives (stage 6) is based on detection of a alternatives 'kernel' and its extension by addition of elements (multiple choice model). In addition, the usage of multiset estimates for alternatives is described as well. The alternative estimates are based on expert judgment. The suggested combinatorial planning methodology is illustrated by numerical examples for geological exploration of Yamal peninsula.

1611.04706 2026-06-04 cs.RO cs.SY eess.SY

High-Dimensional Stochastic Optimal Control using Continuous Tensor Decompositions

高维随机最优控制的连续张量分解应用

Alex A. Gorodetsky, Sertac Karaman, Youssef M. Marzouk

AI总结 本文提出基于连续张量分解的动态规划算法,解决高维状态空间中的随机最优控制问题,通过压缩表示实现多项式时间复杂度,显著提升计算效率。

Comments 32 pages, 20 figures

详情
AI中文摘要

运动规划和控制问题在几乎所有机器人应用中都是嵌入和关键的。这些问题通常被公式化为随机最优控制问题,并通过动态规划算法解决。不幸的是,大多数保证收敛到最优解的现有算法受到维度诅咒的影响:算法的运行时间随着系统状态空间维度的增长呈指数级增长。我们提出新的动态规划算法,以减轻在具有某些低秩结构的问题中的维度诅咒。所提出的算法基于最近由作者开发的连续张量分解。本质上,这些算法以压缩格式表示高维函数(例如价值函数),并在这种格式中直接执行动态规划计算(例如价值迭代、策略迭代)。在某些技术假设下,新算法保证能够以任意精度收敛到最优解。此外,新算法的运行时间与状态维度和价值函数的秩呈多项式增长。这种方法在

英文摘要

Motion planning and control problems are embedded and essential in almost all robotics applications. These problems are often formulated as stochastic optimal control problems and solved using dynamic programming algorithms. Unfortunately, most existing algorithms that guarantee convergence to optimal solutions suffer from the curse of dimensionality: the run time of the algorithm grows exponentially with the dimension of the state space of the system. We propose novel dynamic programming algorithms that alleviate the curse of dimensionality in problems that exhibit certain low-rank structure. The proposed algorithms are based on continuous tensor decompositions recently developed by the authors. Essentially, the algorithms represent high-dimensional functions (e.g., the value function) in a compressed format, and directly perform dynamic programming computations (e.g., value iteration, policy iteration) in this format. Under certain technical assumptions, the new algorithms guarantee convergence towards optimal solutions with arbitrary precision. Furthermore, the run times of the new algorithms scale polynomially with the state dimension and polynomially with the ranks of the value function. This approach realizes substantial computational savings in "compressible" problem instances, where value functions admit low-rank approximations. We demonstrate the new algorithms in a wide range of problems, including a simulated six-dimensional agile quadcopter maneuvering example and a seven-dimensional aircraft perching example. In some of these examples, we estimate computational savings of up to ten orders of magnitude over standard value iteration algorithms. We further demonstrate the algorithms running in real time on board a quadcopter during a flight experiment under motion capture.

1603.07567 2026-06-04 cs.RO cs.SY eess.SY

Dynamics, Control, and Estimation for Aerial Robots Tethered by Cables or Bars

缆绳或刚性杆连接的空中机器人动力学、控制与估计

Marco Tognon, Antonio Franchi

AI总结 研究缆绳或刚性杆连接的空中机器人动力学特性,提出基于加速度计和陀螺仪的非线性观测器和控制器,实现对系统状态的高精度估计与轨迹跟踪。

详情
Journal ref
IEEE Transaction on Robotics Volume: 33, Issue: 4, Aug. 2017
AI中文摘要

我们考虑了通过被动缆绳或被动刚性连接器连接到地面的空中机器人控制问题。我们详细阐述了该非线性机械系统的特性,包括微分平坦性、可控性和可观测性。我们证明该机器人系统相对于两个输出对微分平坦:连接器的高度和车辆的姿态;连接器的高度和纵向连接力(如缆绳张力或杆压缩)。我们展示了仅使用机载加速度计和陀螺仪设计的近全球收敛非线性观测器,用于估计完整状态。我们还设计了两个近全球收敛的非线性控制器,以跟踪任何足够平滑的时间变化轨迹。最后,我们通过数值测试在远离名义条件下的鲁棒性:非线性交叉耦合效应、参数偏差、测量噪声和非理想执行器。

英文摘要

We consider the problem of controlling an aerial robot connected to the ground by a passive cable or a passive rigid link. We provide a thorough characterization of this nonlinear dynamical robotic system in terms of fundamental properties such as differential flatness, controllability, and observability. We prove that the robotic system is differentially flat with respect to two output pairs: elevation of the link and attitude of the vehicle; elevation of the link and longitudinal link force (e.g., cable tension, or bar compression). We show the design of an almost globally convergent nonlinear observer of the full state that resorts only to an onboard accelerometer and a gyroscope. We also design two almost globally convergent nonlinear controllers to track any sufficiently smooth time-varying trajectory of the two output pairs. Finally we numerically test the robustness of the proposed method in several far-from-nominal conditions: nonlinear cross-coupling effects, parameter deviations, measurements noise and non ideal actuators.

1712.02917 2026-06-04 cs.RO cs.SY eess.SY

Using Intermittent Synchronization to Compensate for Rhythmic Body Motion During Autonomous Surgical Cutting and Debridement

利用间歇同步来补偿自主手术切割和清创中的节律性身体运动

Vatsal Patel, Sanjay Krishnan, Aimee Goncalves, Carolyn Chen, Walter Doug Boyd, Ken Goldberg

AI总结 本文提出利用间歇同步技术来补偿手术中因呼吸、心跳和节律性运动导致的不稳定性,通过实验验证该方法在切割和清创任务中具有更高的鲁棒性和精度。

详情
AI中文摘要

手术过程中解剖结构由于呼吸、心跳和节律性运动而很少保持静止。受专家外科医生的启发,我们提出了一种与节律性运动极值(即最低速度窗口)进行间歇同步的方法。我们进行了两项实验:(1)模式切割,(2)清创。在(1)中,间歇同步方法虽然比跟踪运动慢1.8倍,但对噪声和控制延迟的鲁棒性显著提高,并将最大切割误差减少了2.6倍。在(2)中,无同步的基线方法每次移除的成功率为62%,而间歇同步方法达到80%。

英文摘要

Anatomical structures are rarely static during a surgical procedure due to breathing, heartbeats, and peristaltic movements. Inspired by observing an expert surgeon, we propose an intermittent synchronization with the extrema of the rhythmic motion (i.e., the lowest velocity windows). We performed 2 experiments: (1) pattern cutting, and (2) debridement. In (1), we found that the intermittent synchronization approach, while 1.8x slower than tracking motion, was significantly more robust to noise and control latency, and it reduced the max cutting error by 2.6x In (2), a baseline approach with no synchronization achieves 62% success rate for each removal, while intermittent synchronization achieves 80%.

1712.00634 2026-06-04 cs.LG cs.AI cs.RO cs.SY eess.SY math.OC

PFAx: Predictable Feature Analysis to Perform Control

PFAx:可预测特征分析用于控制

Stefan Richthofer, Laurenz Wiskott

AI总结 PFAx通过整合补充信息提升预测性能,并透明展示补充信息对特征选择的影响,应用于强化学习环境中的智能体控制优化。

详情
AI中文摘要

可预测特征分析(PFA)(Richthofer, Wiskott, ICMLA 2015)是一种对高维输入信号进行降维的算法,提取最可预测的子信号。本文扩展了PFA,考虑补充信息以提高预测。补充信息不参与特征提取,特征仅从主输入中提取。PFAx透明地展示补充信息如何提升预测质量,并可生成补充信息以实现主信号的特定目标。该方法应用于强化学习环境,使智能体局部优化状态,接近目标。后续论文将扩展此方法以实现全局优化。

英文摘要

Predictable Feature Analysis (PFA) (Richthofer, Wiskott, ICMLA 2015) is an algorithm that performs dimensionality reduction on high dimensional input signal. It extracts those subsignals that are most predictable according to a certain prediction model. We refer to these extracted signals as predictable features. In this work we extend the notion of PFA to take supplementary information into account for improving its predictions. Such information can be a multidimensional signal like the main input to PFA, but is regarded external. That means it won't participate in the feature extraction - no features get extracted or composed of it. Features will be exclusively extracted from the main input such that they are most predictable based on themselves and the supplementary information. We refer to this enhanced PFA as PFAx (PFA extended). Even more important than improving prediction quality is to observe the effect of supplementary information on feature selection. PFAx transparently provides insight how the supplementary information adds to prediction quality and whether it is valuable at all. Finally we show how to invert that relation and can generate the supplementary information such that it would yield a certain desired outcome of the main signal. We apply this to a setting inspired by reinforcement learning and let the algorithm learn how to control an agent in an environment. With this method it is feasible to locally optimize the agent's state, i.e. reach a certain goal that is near enough. We are preparing a follow-up paper that extends this method such that also global optimization is feasible.

1310.7443 2026-06-04 cs.CV cs.NA math.NA

On Convergent Finite Difference Schemes for Variational - PDE Based Image Processing

关于变分-PDE基图像处理的收敛有限差分方案

V. B. S. Prasath, Juan C. Moreno

AI总结 本文提出一种自适应各向异性Huber函数图像修复方案,结合L2-L1正则化函数,通过Split Bregman方法实现图像去噪与边缘保持,实验表明该算法具有最佳收敛性。

Comments 23 pages, 12 figures, 2 tables

详情
Journal ref
Computational and Applied Mathematics, 2017
AI中文摘要

我们研究了一种基于自适应各向异性Huber函数的图像修复方案。通过结合L2-L1正则化函数,自适应Huber函数基于的能量最小化模型在噪声数字图像中提供去噪和边缘保持。我们研究了一种基于连续分段线性函数的收敛有限差分方案,并使用变量分割方案,即Split Bregman,以获得离散最小化器。给出的实验结果包括图像去噪,并与加性操作分割、双固定点和投影梯度方案的比较表明,我们的算法获得了最佳的收敛速率。

英文摘要

We study an adaptive anisotropic Huber functional based image restoration scheme. By using a combination of L2-L1 regularization functions, an adaptive Huber functional based energy minimization model provides denoising with edge preservation in noisy digital images. We study a convergent finite difference scheme based on continuous piecewise linear functions and use a variable splitting scheme, namely the Split Bregman, to obtain the discrete minimizer. Experimental results are given in image denoising and comparison with additive operator splitting, dual fixed point, and projected gradient schemes illustrate that the best convergence rates are obtained for our algorithm.

1703.09499 2026-06-04 cs.CV cs.NA math.NA

Locality preserving projection on SPD matrix Lie group: algorithm and analysis

局部保持投影在SPD矩阵李群上的应用:算法与分析

Yangyang Li, Ruqian Lu

AI总结 本文提出在SPD矩阵李群上进行降维的算法,通过局部保持投影思想构建Laplacian矩阵,有效处理高维SPD矩阵,提升人脸识别和动作识别性能。

Comments 15 pages, 3 tables

详情
AI中文摘要

对用于图像识别的对称正定(SPD)矩阵作为特征描述符通常是高维的。传统流形学习仅适用于降维高维向量数据。对于高维SPD矩阵,直接使用流形学习算法降维矩阵数据是不可能的。SPD矩阵必须首先转换为长向量,然后降维此向量。然而,这种方法破坏了SPD矩阵空间的空间结构。为克服这一限制,我们提出了一种新的在SPD矩阵空间上的降维算法,将高维SPD矩阵转换为低维SPD矩阵。我们的工作基于所有相同大小的SPD矩阵集具有李群结构的事实,并旨在将流形学习转换到SPD矩阵李群。我们使用局部保持投影(LPP)算法的基本思想,构建对应的Laplacian矩阵在SPD矩阵李群上。因此,我们称我们的方法为Lie-LPP以强调其李群特性。我们展示了详细的算法分析,并通过实验表明Lie-LPP在人类动作识别和人类面孔识别上实现了有效的结果。

英文摘要

Symmetric positive definite (SPD) matrices used as feature descriptors in image recognition are usually high dimensional. Traditional manifold learning is only applicable for reducing the dimension of high-dimensional vector-form data. For high-dimensional SPD matrices, directly using manifold learning algorithms to reduce the dimension of matrix-form data is impossible. The SPD matrix must first be transformed into a long vector, and then the dimension of this vector must be reduced. However, this approach breaks the spatial structure of the SPD matrix space. To overcome this limitation, we propose a new dimension reduction algorithm on SPD matrix space to transform high-dimensional SPD matrices into low-dimensional SPD matrices. Our work is based on the fact that the set of all SPD matrices with the same size has a Lie group structure, and we aim to transform the manifold learning to the SPD matrix Lie group. We use the basic idea of the manifold learning algorithm called locality preserving projection (LPP) to construct the corresponding Laplacian matrix on the SPD matrix Lie group. Thus, we call our approach Lie-LPP to emphasize its Lie group character. We present a detailed algorithm analysis and show through experiments that Lie-LPP achieves effective results on human action recognition and human face recognition.

1711.04683 2026-06-04 cs.LG cs.RO cs.SY eess.SY stat.ML

Tensor Decompositions for Modeling Inverse Dynamics

张量分解用于逆动力学建模

Stephan Baier, Volker Tresp

AI总结 本文提出利用张量分解方法建模逆动力学,通过处理位置、速度和加速度的三重交互,实现对高非线性函数的近似,并在SARCOS机械臂数据集上验证了其优越性。

详情
AI中文摘要

建模逆动力学对于精确的前馈机器人控制至关重要。该模型计算所需的关节扭矩,以执行预期的运动。高度非线性的动态系统逆函数可以通过回归技术近似。我们提出了一种回归方法,即利用位置x速度x加速度的三重交互的张量分解模型。大多数张量分解工作都解决了密集张量的分解问题。本文在稀疏张量的分解基础上进行扩展,仅包含少量非零条目。稀疏张量的分解已成功应用于关系学习,例如大规模知识图谱的建模。最近,该方法已扩展到多类分类问题,涉及离散输入变量。在高维稀疏张量中表示数据可以近似复杂的高非线性函数。本文展示了稀疏张量分解如何应用于回归问题。此外,我们通过学习从连续输入到张量分解的潜在表示的映射,利用基函数将方法扩展到连续输入。我们在具有七自由度SARCOS机械臂轨迹的数据集上评估了所提出的模型。实验结果表明,所提出的功能张量模型相比挑战性的最新方法具有优越的性能。

英文摘要

Modeling inverse dynamics is crucial for accurate feedforward robot control. The model computes the necessary joint torques, to perform a desired movement. The highly non-linear inverse function of the dynamical system can be approximated using regression techniques. We propose as regression method a tensor decomposition model that exploits the inherent three-way interaction of positions x velocities x accelerations. Most work in tensor factorization has addressed the decomposition of dense tensors. In this paper, we build upon the decomposition of sparse tensors, with only small amounts of nonzero entries. The decomposition of sparse tensors has successfully been used in relational learning, e.g., the modeling of large knowledge graphs. Recently, the approach has been extended to multi-class classification with discrete input variables. Representing the data in high dimensional sparse tensors enables the approximation of complex highly non-linear functions. In this paper we show how the decomposition of sparse tensors can be applied to regression problems. Furthermore, we extend the method to continuous inputs, by learning a mapping from the continuous inputs to the latent representations of the tensor decomposition, using basis functions. We evaluate our proposed model on a dataset with trajectories from a seven degrees of freedom SARCOS robot arm. Our experimental results show superior performance of the proposed functional tensor model, compared to challenging state-of-the art methods.