机器人 / 具身智能 - arXivDaily 专题

2510.18085 2026-06-18 cs.RO cs.AI cs.MA 版本更新 90%

R2BC: Multi-Agent Imitation Learning from Single-Agent Demonstrations

R2BC: 从单智能体演示进行多智能体模仿学习

Connor Mattson, Varun Raveendra, Ellen Novoseller, Nicholas Waytowich, Vernon J. Lawhern, Daniel S. Brown

发表机构 * Kahlert School of Computing, University of Utah（犹他大学凯勒尔计算学院）； DEVCOM Army Research Laboratory（陆军研究实验室）

专题命中机器人学习：多机器人模仿学习，核心是机器人学习

AI总结提出R2BC方法，通过轮换单智能体演示训练多机器人系统，无需联合动作空间演示，在模拟和实物任务中性能媲美或超越基于特权同步演示的基线方法。

Comments 8 pages, 6 figures. In Proceedings: IEEE International Conference on Robotics & Automation (ICRA 2026)

详情

AI中文摘要

模仿学习（IL）是人类教授机器人的自然方式，尤其是在高质量演示易于获取的情况下。虽然IL已广泛应用于单机器人场景，但将其扩展到多智能体系统的研究相对较少，尤其是在单个人类必须为协作机器人团队提供演示的场景中。本文介绍并研究了轮换行为克隆（R2BC），该方法使单个人类操作员能够通过顺序的单智能体演示有效训练多机器人系统。我们的方法允许人类一次远程操作一个智能体，并逐步向整个系统教授多智能体行为，无需联合多智能体动作空间的演示。我们表明，在四个多智能体模拟任务中，R2BC方法的性能与基于特权同步演示的Oracle行为克隆方法相当，甚至在某些情况下超越后者。最后，我们在两个使用真实人类演示训练的物理机器人任务上部署了R2BC。

英文摘要

Imitation Learning (IL) is a natural way for humans to teach robots, particularly when high-quality demonstrations are easy to obtain. While IL has been widely applied to single-robot settings, relatively few studies have addressed the extension of these methods to multi-agent systems, especially in settings where a single human must provide demonstrations to a team of collaborating robots. In this paper, we introduce and study Round-Robin Behavior Cloning (R2BC), a method that enables a single human operator to effectively train multi-robot systems through sequential, single-agent demonstrations. Our approach allows the human to teleoperate one agent at a time and incrementally teach multi-agent behavior to the entire system, without requiring demonstrations in the joint multi-agent action space. We show that R2BC methods match, and in some cases surpass, the performance of an oracle behavior cloning approach trained on privileged synchronized demonstrations across four multi-agent simulated tasks. Finally, we deploy R2BC on two physical robot tasks trained using real human demonstrations.

URL PDF HTML ☆

赞 0 踩 0

2512.11736 2026-06-18 cs.RO 版本更新 85%

Bench-Push: Benchmarking Pushing-based Navigation and Manipulation Tasks for Mobile Robots

Bench-Push：基于推动的移动机器人导航与操作任务基准测试

Ninghan Zhong, Steven Caro, Megnath Ramesh, Rishi Bhatnagar, Avraiem Iskandar, Stephen L. Smith

发表机构 * Institute for Robotics and Intelligent Machines, Georgia Institute of Technology（机器人与智能机器研究所，佐治亚理工学院）； Department of Electrical and Computer Engineering, University of Waterloo（电气与计算机工程系，滑铁卢大学）； Department of Mechanical Engineering, University of Alberta（机械工程系，阿尔伯塔大学）

专题命中机器人学习：提出推动式移动机器人导航与操作基准

AI总结提出首个统一的推动式移动机器人导航与操作基准Bench-Push，包含多种模拟环境、新评估指标和基线实现，用于解决可移动障碍物环境中的机器人推动任务评估问题。

Comments Published in CRV 2026

详情

AI中文摘要

移动机器人越来越多地部署在具有可移动物体的杂乱环境中，这对禁止交互的传统方法提出了挑战。在这种环境中，移动机器人必须超越传统的避障策略，利用推动或轻推策略来实现其目标。尽管基于推动的机器人研究正在增长，但评估依赖于临时设置，限制了可重复性和交叉比较。为了解决这个问题，我们提出了Bench-Push，这是首个用于基于推动的移动机器人导航和操作任务的统一基准。Bench-Push包括多个组件：1）一系列全面的模拟环境，捕捉推动任务中的基本挑战，包括在具有可移动障碍物的迷宫中导航、自主船舶在冰覆盖水域中导航、箱子递送和区域清理，每个任务都有不同复杂程度；2）新的评估指标，用于捕捉效率、交互努力和部分任务完成；3）使用Bench-Push评估跨环境的已建立基线的示例实现。Bench-Push作为Python库开源，采用模块化设计。代码、文档和训练模型可在https://this URL找到。

英文摘要

Mobile robots are increasingly deployed in cluttered environments with movable objects, posing challenges for traditional methods that prohibit interaction. In such settings, the mobile robot must go beyond traditional obstacle avoidance, leveraging pushing or nudging strategies to accomplish its goals. While research in pushing-based robotics is growing, evaluations rely on ad hoc setups, limiting reproducibility and cross-comparison. To address this, we present Bench-Push, the first unified benchmark for pushing-based mobile robot navigation and manipulation tasks. Bench-Push includes multiple components: 1) a comprehensive range of simulated environments that capture the fundamental challenges in pushing-based tasks, including navigating a maze with movable obstacles, autonomous ship navigation in ice-covered waters, box delivery, and area clearing, each with varying levels of complexity; 2) novel evaluation metrics to capture efficiency, interaction effort, and partial task completion; and 3) demonstrations using Bench-Push to evaluate example implementations of established baselines across environments. Bench-Push is open-sourced as a Python library with a modular design. The code, documentation, and trained models can be found at https://github.com/IvanIZ/BenchNPIN.

URL PDF HTML ☆

赞 0 踩 0

2602.01700 2026-06-18 cs.RO 版本更新 80%

Tilt-Ropter: A Fully Actuated Hybrid Aerial-Terrestrial Vehicle with Tilt Rotors and Passive Wheels

Tilt-Ropter: 一种带有倾转旋翼和被动轮的全驱动混合空中-地面车辆

Ruoyu Wang, Xuchen Liu, Zongzhou Wu, Zixuan Guo, Wendi Ding, Ben M. Chen

发表机构 * Department of Mechanical and Automation Engineering, The Chinese University of Hong Kong（机械与自动化工程系，香港中文大学）； Faculty of Engineering, The University of Hong Kong（工程学院，香港大学）； Peng Cheng Laboratory（鹏城实验室）

专题命中机器人学习：提出混合空中-地面车辆Tilt-Ropter，属于机器人。

AI总结提出全驱动混合空中-地面车辆Tilt-Ropter，通过倾转旋翼和被动轮实现高效多模态运动，并设计统一非线性模型预测控制器实现低跟踪误差和地面运动功耗降低92.8%。

Comments 8 pages, 10 figures. Accepted by the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2026)

详情

AI中文摘要

在这项工作中，我们提出了Tilt-Ropter，一种全驱动的混合空中-地面车辆（HATV），它集成了倾转旋翼和被动轮，以实现高效的多模态运动。与传统的欠驱动HATV不同，Tilt-Ropter的全驱动设计允许力和扭矩解耦控制，提高了机动性和地面运动效率。开发了一个统一的非线性模型预测控制器（NMPC）来跟踪参考轨迹，强制执行非完整约束，并适应运动模式间的接触效应，同时通过专门的控制分配确保执行器可行性。为了解决复杂的轮地动力学问题，集成了一个外部力估计器来提供实时交互力估计。该系统通过仿真和实际实验进行了验证，包括无缝的空地过渡和轨迹跟踪任务。实验结果表明，两种模式下的跟踪误差都很低，并且地面运动期间的功耗相比飞行降低了92.8%，突显了该平台在能源受限环境中执行长时间任务的适用性。

英文摘要

In this work, we present Tilt-Ropter, a fully actuated hybrid aerial-terrestrial vehicle (HATV) that integrates tilt rotors with passive wheels to enable efficient multi-modal locomotion. Unlike conventional underactuated HATVs, the fully actuated design of Tilt-Ropter allows decoupled force and torque control, improving maneuverability and ground locomotion efficiency. A unified nonlinear model predictive controller (NMPC) is developed to track reference trajectories, enforce non-holonomic constraints, and accommodate contact effects across locomotion modes, while ensuring actuator feasibility through dedicated control allocation. To address complex wheel-ground dynamics, an external wrench estimator is incorporated to provide real-time interaction wrench estimates. The system is validated through simulation and real-world experiments, including seamless air-ground transitions and trajectory tracking tasks. Experimental results demonstrate low tracking errors in both modes and reveal a 92.8% reduction in power consumption during ground locomotion compared to flight, highlighting the platform's suitability for long-duration missions in energy-constrained environments.

URL PDF HTML ☆

赞 0 踩 0

2503.08895 2026-06-18 cs.RO 版本更新 80%

Mutual Adaptation in Human-Robot Co-Transportation with Human Preference Uncertainty

人机协同运输中考虑人类偏好不确定性的相互适应

Al Jaber Mahmud, Weizi Li, Xuan Wang

发表机构 * George Mason University（乔治·马歇尔大学）； University of California, Riverside（加州大学河滨分校）

专题命中机器人学习：人机协同运输中的相互适应

AI总结针对人机协同运输中人类偏好参数不确定及适应策略平衡问题，提出统一框架，通过建模偏好概率分布、时变固执度及协调规划模型，结合位姿优化策略，实现相互适应以提升任务性能。

Comments 9 pages, 6 figures

详情

AI中文摘要

相互适应可以通过整合机器人和人类对环境的理解来增强人机协同运输的整体任务性能。虽然人类建模有助于捕捉人类的主观偏好，但存在两个挑战：（i）人类偏好参数的不确定性，以及（ii）需要平衡对人和机器都有利的适应策略。在本文中，我们提出了一个统一的框架来应对这些挑战，并通过相互适应提高任务性能。首先，我们不依赖固定参数，而是通过纳入一系列不确定的人类偏好参数来建模人类选择的概率分布。在此基础上，我们引入时变固执度量和协调规划模型，该模型允许机器人领导团队的轨迹，或者如果人类偏好的路径与机器人的计划冲突且其固执度超过阈值，则机器人转为跟随人类。最后，我们引入一种用于低级控制的位姿优化策略，以减轻人类领导时的不确定行为。为了验证该框架，我们设计并进行了包含二十名人类参与者反馈的研究。然后，通过仿真，我们展示了我们的模型在通过相互适应和位姿优化增强任务性能方面的有效性。

英文摘要

Mutual adaptation can enhance overall task performance in human-robot co-transportation by integrating both the robot's and the human's understanding of the environment. While human modeling helps capture humans' subjective preferences, two challenges persist: (i) the uncertainty of human preference parameters and (ii) the need to balance adaptation strategies that benefit both humans and robots. In this paper, we propose a unified framework to address these challenges and improve task performance through mutual adaptation. First, instead of relying on fixed parameters, we model a probability distribution of human choices by incorporating a range of uncertain human preference parameters. Building on this, we introduce a time-varying stubbornness measure and a coordinated planning model, which allows either the robot to lead the team's trajectory or, if a human's preferred path conflicts with the robot's plan and their stubbornness exceeds a threshold, the robot to transition to following the human. Finally, we introduce a pose optimization strategy for low-level control to mitigate the uncertain human behaviors when they are leading. To validate the framework, we design and perform a study with human feedback from twenty human participants. We then demonstrate, through simulations, the effectiveness of our models in enhancing task performance with mutual adaptation and pose optimization.

URL PDF HTML ☆

赞 0 踩 0

2511.02036 2026-06-18 cs.RO 版本更新 70%

TurboMap: GPU-Accelerated Local Mapping for Visual SLAM

TurboMap: 面向视觉SLAM的GPU加速局部建图

Parsa Hosseininejad, Kimia Khabiri, Shishir Gopinath, Soudabeh Mohammadhashemi, Karthik Dantu, Steven Y. Ko

发表机构 * Simon Fraser University（西蒙弗雷泽大学）； University at Buffalo（布法罗大学）

专题命中机器人学习：SLAM是机器人感知的核心技术

AI总结针对视觉SLAM中局部建图延迟问题，提出GPU并行化与CPU优化结合的TurboMap后端，通过重构地图点创建、融合及关键帧管理，实现1.3-1.6倍加速且保持精度。

Comments Accepted for presentation at IROS 2026, preprint

详情

AI中文摘要

在实时视觉SLAM系统中，局部建图必须在严格的延迟约束下运行，因为延迟会降低地图质量并增加跟踪失败的风险。GPU并行化是降低延迟的有效途径。然而，由于同步共享状态更新以及将大型地图数据结构传输到GPU的开销，并行化局部建图具有挑战性。本文提出TurboMap，一个GPU并行化且CPU优化的局部建图后端，全面解决了这些挑战。我们重构了地图点创建，以在GPU上实现并行关键点对应搜索，重新设计并并行化了地图点融合，在CPU上优化了冗余关键帧剔除，并集成了基于GPU的快速局部光束法平差求解器。为最小化数据传输和同步成本，我们引入了持久化的GPU驻留关键帧存储。在EuRoC和TUM-VI数据集上的实验表明，平均局部建图速度分别提升1.3倍和1.6倍，同时保持精度不变。

英文摘要

In real-time Visual SLAM systems, local mapping must operate under strict latency constraints, as delays degrade map quality and increase the risk of tracking failure. GPU parallelization offers a promising way to reduce latency. However, parallelizing local mapping is challenging due to synchronized shared-state updates and the overhead of transferring large map data structures to the GPU. This paper presents TurboMap, a GPU-parallelized and CPU-optimized local mapping backend that holistically addresses these challenges. We restructure Map Point Creation to enable parallel Keypoint Correspondence Search on the GPU, redesign and parallelize Map Point Fusion, optimize Redundant Keyframe Culling on the CPU, and integrate a fast GPU-based Local Bundle Adjustment solver. To minimize data transfer and synchronization costs, we introduce persistent GPU-resident keyframe storage. Experiments on the EuRoC and TUM-VI datasets show average local mapping speedups of 1.3x and 1.6x, respectively, while preserving accuracy.

URL PDF HTML ☆

赞 0 踩 0