自动驾驶 - arXivDaily 专题

2606.19227 2026-06-18 cs.RO 新提交 80%

Constant Time-Delay Leader Following with Neural Networks and Invariant Extended Kalman Filters for Arbitrary Trajectories

基于神经网络与不变扩展卡尔曼滤波的任意轨迹恒定时间延迟领航跟随

Luka Antonyshyn, Paulo Ricardo Marques de Araujo, Sidney Givigi

发表机构 * University of Toronto Institute for Aerospace Studies（多伦多大学航空航天研究所）； School of Computing（计算学院）

专题命中规划控制：车辆队列轨迹跟踪，属于自动驾驶规划控制

AI总结提出一种结合概率Seq2Seq神经网络与不变扩展卡尔曼滤波的恒定时间延迟轨迹跟踪方法，用于无通信、无全局坐标的车队，在SE(2)流形上准确估计领车轨迹，并利用几何模型预测控制提升性能。

Comments 9 pages, 6 figures

详情

AI中文摘要

本文提出了一种用于车辆队列的恒定时间延迟轨迹跟踪方法，该方法无需车辆间通信、公共坐标系或全球定位。该方法将概率序列到序列（Seq2Seq）神经网络与不变扩展卡尔曼滤波（IEKF）相结合，以热启动预测过程，从而在SE(2)流形上准确估计领车相对轨迹。进一步引入几何模型预测控制器，以充分利用基于流形的轨迹预测来改善控制性能。该系统能够处理具有不同速度和运动轮廓的任意非线性轨迹，同时减少了对基于专家领域知识的轨迹跟踪系统设计的需求，即使在长轨迹延迟下也是如此。通过运动学仿真中与纯IEKF基线、基于学习的方法以及真实轨迹的对比，以及使用真实机器人车辆的实验，验证了该方法的有效性。

英文摘要

This paper proposes a constant time-delay trajectory tracking method for vehicle convoys operating without inter-vehicle communication, a common coordinate system, or global positioning. The method integrates a probabilistic sequence-to-sequence (Seq2Seq) neural network with an invariant extended Kalman filter (IEKF) to warm-start the prediction process, allowing accurate estimation of a leader vehicle's relative trajectory on the SE(2) manifold. A geometric model predictive controller is further incorporated to fully exploit the manifold-based trajectory predictions for improved control performance. The system can handle arbitrary nonlinear trajectories with varying speeds and motion profiles while reducing the need for expert-based domain knowledge for the design of trajectory following systems, even under long trajectory delays. The effectiveness of the method is validated through comparisons with a pure IEKF baseline, learning-based methods, and the ground-truth trajectory in kinematic simulations, as well as in experiments using real robotic vehicles.

URL PDF HTML ☆

赞 0 踩 0

2606.18630 2026-06-18 cs.RO 新提交 80%

DNN Koopman-Based Deviation Compensation for UGV Path Tracking Control on Coupled Slope and Potholed Road

基于DNN Koopman的偏差补偿用于耦合坡度和坑洼道路上的UGV路径跟踪控制

Jian Zhao, Wenbo Zhou, Zhicheng Chen, Bing Zhu, Jiayi Han, Dongjian Song, Yinju Lin, Peixing Zhang

发表机构 * Xiamen King Long United Automotive Industry Co., Ltd.（厦门金龙联合汽车工业有限公司）

专题命中规划控制：UGV路径跟踪控制，补偿坡道和坑洼扰动

AI总结提出基于DNN Koopman的偏差补偿策略，结合自适应遗忘递推最小二乘估计轮胎刚度、Laguerre模型预测控制与事件触发协同补偿，在耦合坡度和坑洼道路上提升UGV路径跟踪精度超11.5%

Comments 22 pages, 13 figures

详情

AI中文摘要

在越野场景中运行的无人地面车辆面临复杂地形扰动，这些扰动会显著降低路径跟踪性能。针对这一挑战，本文提出了一种基于深度神经网络Koopman的偏差补偿策略，用于无人地面车辆路径跟踪控制。首先，基于耦合坡度上的车辆动力学函数，设计了一种带有解耦误差项的自适应遗忘递推最小二乘法来估计轮胎侧偏刚度。在此基础上，通过引入Laguerre函数，设计了一种Laguerre模型预测控制路径跟踪控制策略，该策略可在不同耦合坡度场景下降低计算资源消耗的同时保持可靠的跟踪性能。然后，通过将Koopman算子理论与深度神经网络相结合，提出了一种深度神经网络Koopman路径偏差补偿方法，该方法显著提高了无人地面车辆在坑洼道路扰动下的路径跟踪精度。此外，基于补偿激活准则和可信度验证，建立了一种将Laguerre模型预测控制与深度神经网络Koopman耦合的事件触发并行协同补偿机制。该机制提高了坑洼道路上的路径跟踪精度，同时确保了整体转向指令的可行性和深度神经网络Koopman补偿后车辆的稳定性。最后，构建了硬件在环实验平台进行验证。实验结果表明，所提出的无人地面车辆路径跟踪策略在多种工况下跟踪性能提升超过11.5%。

英文摘要

Unmanned ground vehicles (UGVs) operating in off-road scenarios are confronted with complex terrain disturbances that can substantially degrade path tracking performance. To address this challenge, this paper proposes a deep neural network (DNN) Koopman-based deviation compensation strategy for UGV path tracking control. Firstly, based on the vehicle dynamic function on coupled slope, an adaptive forgetting recursive least squares method with decoupled error terms is designed to estimate tire cornering stiffness. On this basis, a Laguerre model predictive control (LMPC) path tracking control strategy is designed by incorporating Laguerre functions, which can reduce computational resource usage while maintaining reliable tracking performance across different coupled slope scenarios. Then, by integrating Koopman operator theory with DNN, a DNN Koopman (DK) path deviation compensation method is proposed, which significantly improves the path tracking accuracy of UGV under potholed road disturbances. Furthermore, an event-triggered parallel cooperative (EPC) compensation mechanism that couples LMPC with DK is established based on compensation activation criteria and credibility verification. This mechanism improves path tracking accuracy on potholed road while ensuring the feasibility of overall steering command and stability of vehicle after DK compensation. Finally, a hardware-in-the-loop (HiL) experimental platform is constructed for validation. Experimental results demonstrate that the proposed UGV path tracking strategy improves tracking performance by more than 11.5% across multiple operating conditions.

URL PDF HTML ☆

赞 0 踩 0

2604.25848 2026-06-18 cs.AI 版本更新 75%

A Distributionally Robust Reinforcement Learning Framework for Constrained Urban EV Dispatch

面向约束城市电动汽车调度的分布鲁棒强化学习框架

An Nguyen, Hoang Nguyen, Phuong Le, Hung Pham, Cuong Do, Laurent El Ghaoui

发表机构 * College of Engineering and Computer Science, VinUniversity, Hanoi, Vietnam（VinUniversity 工程与计算机科学学院，河内，越南）； Center for Environmental Intelligence, VinUniversity, Hanoi, Vietnam（VinUniversity 环境智能中心，河内，越南）

专题命中规划控制：城市电动汽车调度，涉及充电和路径规划

AI总结针对城市电动汽车调度中充电站和馈线容量约束及不确定需求，提出基于半马尔可夫决策过程与分布鲁棒软演员-评论家算法，通过图卷积编码器和滚动混合整数线性规划保证可行性，在纽约出租车数据仿真中实现最高净利润且零违规。

详情

AI中文摘要

我们研究城市规模的电动汽车（EV）网约车车队控制，其中调度、重新定位和充电决策必须在不确定且空间相关的出行需求和旅行时间下，遵守充电器和馈线限制。我们将问题建模为六边形网格半马尔可夫决策过程（semi-MDP），具有混合动作——用于服务、重新定位和充电的离散动作，以及连续充电功率——和可变动作持续时间。为了保证训练和部署期间的物理可行性，策略在由掩码温度退火actor产生的高层意图上学习。这些意图在每个决策步骤通过一个时间受限的滚动混合整数线性规划（MILP）进行投影，该规划严格强制执行荷电状态、充电端口和馈线约束。为了缓解分布偏移，我们针对一个Wasserstein-1模糊集优化软演员-评论家（SAC）智能体，该模糊集使用图对齐的马氏基础度量来捕捉空间相关性。鲁棒备份使用Kantorovich-Rubinstein对偶、投影次梯度内环和原始-对偶风险预算更新。我们的架构结合了两层图卷积网络（GCN）编码器、双评论家和一个驱动对手的价值网络。基于纽约出租车数据构建的大规模电动汽车车队模拟器上的实验表明，PD-RSAC实现了最高的净利润，达到122万美元，而强启发式、单智能体RL和多智能体RL基线（包括Greedy、SAC、MAPPO和MADDPG）的净利润为58万至70万美元，同时保持零馈线限制违规。

英文摘要

We study city-scale control of electric-vehicle (EV) ride-hailing fleets where dispatch, repositioning, and charging decisions must respect charger and feeder limits under uncertain, spatially correlated demand and travel times. We formulate the problem as a hex-grid semi-Markov decision process (semi-MDP) with mixed actions -- discrete actions for serving, repositioning, and charging, together with continuous charging power -- and variable action durations. To guarantee physical feasibility during both training and deployment, the policy learns over high-level intentions produced by a masked, temperature-annealed actor. These intentions are projected at every decision step through a time-limited rolling mixed-integer linear program (MILP) that strictly enforces state-of-charge, port, and feeder constraints. To mitigate distributional shifts, we optimize a Soft Actor-Critic (SAC) agent against a Wasserstein-1 ambiguity set with a graph-aligned Mahalanobis ground metric that captures spatial correlations. The robust backup uses the Kantorovich-Rubinstein dual, a projected subgradient inner loop, and a primal-dual risk-budget update. Our architecture combines a two-layer Graph Convolutional Network (GCN) encoder, twin critics, and a value network that drives the adversary. Experiments on a large-scale EV fleet simulator built from NYC taxi data show that PD-RSAC achieves the highest net profit, reaching \$1.22M, compared with \$0.58M-\$0.70M for strong heuristic, single-agent RL, and multi-agent RL baselines, including Greedy, SAC, MAPPO, and MADDPG, while maintaining zero feeder-limit violations.

URL PDF HTML ☆

赞 0 踩 0

2606.18883 2026-06-18 cs.RO 新提交 65%

ZiMPedance: Impedance-Aware ZMP Modeling and Control for Payload Carrying with Quadruped Robots

ZiMPedance：面向四足机器人负载搬运的阻抗感知ZMP建模与控制

Giovanni B. Dessy, Lorenzo Amatucci, Victor Barasuol, Claudio Semini

发表机构 * Dynamic Legged Systems Lab, Istituto Italiano di Tecnologia (IIT)（动态腿部系统实验室，意大利技术研究院（IIT））

专题命中规划控制：涉及ZMP建模与控制，可迁移至自动驾驶

AI总结提出扩展零力矩点（ZMP）公式以包含被动负载接口动力学，结合模型预测控制减少稳定性违规达10倍，并提高运动效率。

详情

AI中文摘要

四足机器人的负载运输受到机器人与负载之间物理接口动力学的强烈影响。与主动机械臂相比，被动弹簧臂减轻了重量和复杂性，但其弹簧-阻尼动力学可能引入振荡力，降低运动稳定性。本文推导了一个扩展的零力矩点（ZMP）公式，该公式包含被动负载接口动力学，将刚度、阻尼和负载质量与稳定性裕度联系起来。分析表明，欠阻尼配置可能与运动谐波共振。基于这一见解，我们通过被动子系统动力学增强了单刚体动力学模型，并将其集成到模型预测控制框架中。在仿真中，所提出的控制器将稳定性违规减少高达10倍（从7.0%降至0.7%），并通过将水平地面反作用力努力降低高达15%来提高运动效率。硬件实验表明，在标称控制器失效的拉放扰动下，携带2公斤负载的机器人能够稳定运动。同一模型还使得通过被动臂动力学实现末端执行器跟踪成为可能，而无需直接驱动臂。

英文摘要

Load transportation with quadruped robots is strongly affected by the dynamics of the physical interface between the robot and the load. Passive spring-based arms reduce weight and complexity compared to active manipulators, but their spring-damper dynamics can introduce oscillatory forces that degrade locomotion stability. This paper derives an extended Zero Moment Point (ZMP) formulation that includes passive payload-interface dynamics, relating stiffness, damping, and payload mass to the stability margin. The analysis shows that underdamped configurations can resonate with locomotion harmonics. Based on this insight, we augment a Single Rigid Body Dynamics model with passive subsystem dynamics and integrate it into a Model Predictive Control framework. In simulation, the proposed controller reduces stability violations by up to $10\times$, from $7.0\%$ to $0.7\%$, and increase locomotion efficiency by lowering horizontal ground reaction force effort by up to $15\%$ compared to a nominal baseline. Hardware experiments with a $2\,\mathrm{kg}$ payload show stable locomotion under pull-release disturbances where the nominal controller fails. The same model also enables end-effector tracking through passive arm dynamics without direct arm actuation.

URL PDF HTML ☆

赞 0 踩 0

2606.18516 2026-06-18 cs.RO 新提交 60%

Task Allocation and Motion Planning in Dynamic, Cluttered Environments via CBBA and Graphs of Convex Sets

动态杂乱环境下的任务分配与运动规划：基于CBBA与凸集图

Matthew D. Osburn, Cameron K. Peterson, John L. Salmon

发表机构 * Electrical and Computer Engineering（电气与计算机工程系）； Mechanical Engineering（机械工程系）

专题命中规划控制：动态环境中的轨迹规划

AI总结针对动态杂乱环境中的多智能体任务规划，提出结合凸集图（GCS）进行轨迹优化与共识捆绑算法（CBBA）进行分布式任务分配的方法，实现安全高效的轨迹规划和任务协调。

Comments 15 pages single column, 10 figures, AIAA-Scitech 2027 Submission

详情

AI中文摘要

在杂乱、动态环境中的多智能体任务规划需要在分配任务给智能体的同时，确定通过环境的安全、时间高效的轨迹。当任务是动态的（例如会合目标）时，分配决策不仅取决于哪个智能体最适合某项任务，还取决于该任务何时何地可以到达。本文提出了一个解决该问题的方法，该方法将凸集图（GCS）用于轨迹优化，与共识捆绑算法（CBBA）用于分布式任务分配相结合。在我们的方法中，GCS通过使用时间扩展（3D+时间）配置空间找到通过动态环境的最优轨迹。同时，CBBA协调跨智能体的任务分配，使得在移动环境中能够做出明智的决策。然后，我们连接分配和规划，使智能体能够在3D+时间配置空间中避免碰撞，并提供准确的任务完成时间估计。我们在具有静态和动态任务的模拟杂乱环境中展示了我们方法的有效性。

英文摘要

Multi-agent task planning in cluttered, dynamic environments requires assigning tasks to agents while simultaneously determining safe, time-efficient trajectories through the environment. When tasks are dynamic, such as rendezvous objectives, allocation decisions depend not only on which agent is best suited for a task, but also on when and where that task can be reached. This paper presents a solution to this problem, which combines Graphs of Convex Sets (GCS) for trajectory optimization with the Consensus-Based Bundle Algorithm (CBBA) for distributed task allocation. In our approach, GCS finds optimal trajectories through dynamic environments using a time-extended (3D+time) configuration space. At the same time, CBBA coordinates task assignments across agents, enabling informed decision-making in a moving environment. We then connect allocation and planning to allow the agents to avoid collisions in the 3D+time configuration space and provide accurate time estimates for task completion. We demonstrate the effectiveness of our approach in simulated cluttered environments with static and dynamic tasks.

URL PDF HTML ☆

赞 0 踩 0