自动驾驶 - arXivDaily 专题

2606.20336 2026-06-19 cs.RO 新提交 90%

Autonomous Driving with Priority-Ordered STL Specifications Under Multimodal Uncertainty

多模态不确定性下基于优先级排序STL规范的自动驾驶

Taha Bouzid, Shuhao Qi, Mircea Lazar, Sofie Haesaert

发表机构 * Eindhoven University of Technology（埃因霍温理工大学）

专题命中规划控制：基于STL的自动驾驶轨迹规划

AI总结提出一种不确定性感知的轨迹规划框架，通过信号时序逻辑的词典序优先级处理冲突目标，并结合模型预测路径积分控制实现，在仿真中验证了有效性。

详情

AI中文摘要

自动驾驶车辆必须规划满足安全、乘客舒适度和交通规则等多重要求的轨迹。然而，在安全关键场景中，不可能同时满足所有要求，因此需要根据重要性进行优先级排序。同时，在这些安全关键场景中，应明确考虑周围交通（如其他车辆和行人）轨迹预测的不确定性。在这项工作中，我们提出了一种不确定性感知的轨迹规划框架，该框架结合了信号时序逻辑（STL）规范上的预定义词典序，该排序在不确定性下仍然有效。我们使用模型预测路径积分（MPPI）控制实现了该公式，并在仿真场景中展示了我们方法的有效性，表明我们的框架在现实的多模态不确定性下有效处理了冲突目标。

英文摘要

Autonomous vehicles must plan trajectories that satisfy a multitude of requirements on safety, passenger comfort, and compliance with traffic rules. However, in safety-critical scenarios, it is not always possible to satisfy all requirements simultaneously, necessitating their prioritization based on importance. At the same time, in these safety-critical scenarios, the uncertainty in trajectory predictions of the surrounding traffic, such as other vehicles and pedestrians, should be explicitly accounted for. In this work, we propose an uncertainty-aware trajectory planning framework that incorporates a predefined lexicographic ordering over Signal Temporal Logic (STL) specifications that stays valid under uncertainty. We implement this formulation with Model Predictive Path Integral (MPPI) control and we demonstrate the effectiveness of our method on simulation scenarios, showing that our framework efficiently handles conflicting objectives under realistic multi-modal uncertainty.

URL PDF HTML ☆

赞 0 踩 0

2606.20274 2026-06-19 cs.AI 新提交 90%

Lagrange: An Open-Vocabulary, Energy-Based Sparse Framework for Generalized End-to-End Driving

Lagrange: 一种面向通用端到端驾驶的开放词汇、基于能量的稀疏框架

Shihao Ji, HongXi Li, Zihui Song, Mingyu Li

专题命中规划控制：提出端到端驾驶框架Lagrange

AI总结提出Lagrange框架，利用掩码潜在场和视觉语言模型实现开放词汇、稀疏计算，通过拉格朗日动作最小化确保运动学约束，在nuScenes和CODA基准上验证了鲁棒性和可解释性。

详情

AI中文摘要

将端到端自动驾驶扩展到复杂的开放世界环境，需要能够泛化到异常场景的感知模型和能够产生运动学有效轨迹的规划器。现有范式在表示效率和泛化能力之间存在明显分歧。密集模型（如占用网络）虽然几何鲁棒，但存在关键计算瓶颈，且难以进行高层语义推理。相反，稀疏的基于查询的规划器效率高，但依赖于封闭集定义，使其容易受到分布外事件的影响。尽管最近的视觉-语言-动作模型提供了开放词汇推理，但其自回归离散令牌生成从根本上与车辆动力学的连续高频控制需求相冲突。为解决这一问题，我们提出了Lagrange，一种基于掩码潜在场的开放词汇、计算稀疏的驾驶框架。Lagrange不依赖密集体积重建或封闭集查询机制，而是利用视觉语言模型将类别无关的目标提议编码为连续语义视觉令牌。我们引入了一种意图驱动的掩码交叉注意力模块，该模块在时间上过滤不相关实体，并将注意力令牌解码为定义在空间坐标上的隐式连续能量场。通过将决策制定为跨越该能量场的拉格朗日动作最小化问题，我们在执行碰撞避免的同时强制遵守车辆运动学。在标准（nuScenes）和长尾（CODA）基准上的大量离线评估表明，Lagrange为鲁棒、可解释且运动学可行的开放世界自主性建立了一个有前景的框架。

英文摘要

Scaling end-to-end autonomous driving to complex, open-world environments requires perceptual models that generalize to anomalous scenarios and planners that produce kinematically valid trajectories. Existing paradigms face a distinct dichotomy between representational efficiency and generalization capacity. Dense models (e.g., occupancy networks), while geometrically robust, incur critical computational bottlenecks and struggle with high-level semantic reasoning. Conversely, sparse, query-based planners are efficient but reliant on closed-set definitions, rendering them vulnerable to out-of-distribution (OOD) events. Although recent Vision-Language-Action (VLA) models offer open-vocabulary reasoning, their autoregressive, discrete token generation fundamentally conflicts with the continuous, high-frequency control requirements of vehicle dynamics. To address this, we propose Lagrange, an open-vocabulary, computationally sparse driving framework based on Masked Latent Fields (MLF). Rather than relying on dense volumetric reconstructions or closed-set query mechanisms, Lagrange exploits Vision-Language Models (VLMs) to encode class-agnostic object proposals into continuous semantic visual tokens. We introduce an intent-driven masked cross-attention module that temporally filters irrelevant entities, decoding the attended tokens into an implicit continuous energy field defined over spatial coordinates. By framing decision-making as a Lagrangian action minimization problem spanning this energy field, we enforce strict compliance with vehicle kinematics while executing collision avoidance. Extensive offline evaluations on both standard (nuScenes) and long-tail (CODA) benchmarks demonstrate that Lagrange establishes a promising framework for robust, interpretable, and kinematically feasible open-world autonomy.

URL PDF HTML ☆

赞 0 踩 0

2606.19370 2026-06-19 cs.LG cs.AI cs.MA 新提交 85%

Human-like autonomy emerges from self-play and a pinch of human data

类人自主性从自我对弈和少量人类数据中涌现

Daphne Cornelisse, Julian Hunt, Zixu Zhang, Waël Doulazmi, Kevin Joseph, Jaime Fernández Fisac, Eugene Vinitsky

发表机构 * NYU Tandon School of Engineering（纽约大学坦登工程学院）； NYU Courant（纽约大学库朗数学科学研究所）； Princeton University（普林斯顿大学）； Centre for Robotics, Mines Paris（巴黎矿业大学机器人中心）； Valeo（法雷奥）

专题命中规划控制：自我对弈强化学习训练驾驶策略。

AI总结提出一种结合自我对弈强化学习与少量人类演示的正则化方法，仅用30分钟人类数据即可训练出与人类协调的驾驶策略，训练时间仅15小时。

Comments 10 pages

详情

AI中文摘要

自我对弈强化学习最近成为一种无需任何人类数据即可训练驾驶策略的方法。它利用廉价的大规模模拟来替代昂贵的大规模人类驾驶演示。这种方法的一个关键局限性是，通过纯自我对弈训练的策略可以学习有效但不符合人类习惯的驾驶惯例。先前的工作试图通过广泛的奖励工程和领域随机化来缓解这种行为偏差，但这些方法脆弱且劳动密集。我们的方法没有完全抛弃人类演示，而是将其作为最小安全目标达到奖励之上的正则化目标。就像好炖菜中的香料一样，我们发现少量人类数据大有裨益：我们的方法仅使用30分钟的人类演示，比同类模仿学习方法少2500倍。由此产生的策略与保留的人类轨迹协调，并在单个消费级GPU上15小时内完成训练。视频和完整源代码见https://this URL。

英文摘要

Self-play reinforcement learning has recently emerged as a way to train driving policies without any human data. It uses cheap, large-scale simulations to substitute expensive, large-scale human driving demonstrations. A key limitation of this approach is that policies trained through pure self-play can learn effective but alien driving conventions incompatible with people. Previous works attempt to mitigate such behavioral misalignments through extensive reward engineering and domain randomization, which are brittle and labor-intensive. Instead of completely discarding human demonstrations, our method treats them as a regularization objective on top of a minimal safe goal-reaching reward. Like the spice in a good stew, we find that a little human data goes a long way: our method uses only 30 minutes of human demonstrations, 2500x fewer than comparable imitation learning approaches. Resulting policies coordinate with held-out human trajectories and complete training in 15 hours on a single consumer-grade GPU. Videos and full source code are available at https://spiced-self-play.com/.

URL PDF HTML ☆

赞 0 踩 0

2606.19672 2026-06-19 cs.RO 新提交 75%

Safe Local Navigation for Ackermann-Steered Robots in Unmapped Environments

阿克曼转向机器人在未映射环境中的安全局部导航

Christian Schaible, Shahin Sirouspour

发表机构 * McMaster University（麦克马斯特大学）

专题命中规划控制：阿克曼转向机器人在未映射环境中的安全局部导航

AI总结提出一种控制框架，通过局部障碍物检测确定最安全航向角，构建边界线并优化车辆-障碍物间距，实现阿克曼转向机器人在无全局目标环境中的安全局部导航。

Comments Presented at the 23rd Conference on Robots and Vision (CRV 2026)

Journal ref Proc. 23rd Conference on Robots and Vision (CRV), 2026

详情

DOI: 10.21428/d82e957c.84a2e865

AI中文摘要

提出了一种控制框架，用于在缺乏全局目标的未映射环境中，对配备阿克曼转向的移动机器人进行安全局部导航。基于局部障碍物检测，沿车辆前方最大开阔空间方向确定最安全航向角。在该方向引导下，在车辆左右两侧构建边界线以实现障碍物分离。这些边界线通过求解一个最大化车辆-障碍物间距的凸二次优化获得。可选地，对边界线施加约束以保持平行性并平滑先前控制步骤的突变。然后使用反馈线性化控制器调节车辆与一条或两条边界线的距离，从而有效跟踪通过最大化障碍物间距保证安全的局部参考路径。该控制方案包含开源代码。实验结果表明，与一些现有的基于探索的规划器相比，所提方法生成的导航路径更安全，计算时间显著缩短。

英文摘要

A control framework is proposed for safe local navigation of mobile robots equipped with Ackermann steering in unmapped environments where a global goal is absent. Based on local obstacle detections, the safest heading angle is determined along the direction of the largest open space ahead of the vehicle. Guided by this direction, bounding lines are constructed on the left and right sides of the vehicle to achieve obstacle separation. These bounding lines are obtained by solving a convex quadratic optimization that maximizes vehicle-to-obstacle clearance. Optionally, conditions are imposed on the bounding lines to preserve parallelism and smooth abrupt changes from prior control steps. A feedback-linearizing controller is then used to regulate the vehicle's distance from one or both bounding lines, effectively enabling tracking of a local reference path that preserves safety through obstacle clearance maximization. Open-source code is included for the application of this control scheme. Experimental results demonstrate that the proposed method produces safer navigation paths with significantly shorter computation times, compared to some existing exploration-based planners.

URL PDF HTML ☆

赞 0 踩 0

2606.13794 2026-06-19 eess.SY cs.AI cs.RO cs.SY 新提交 70%

An integrated interpretable control effectiveness learning and nonlinear control allocation methodology for overactuated aircrafts

过驱动飞行器的可解释控制效能学习与非线性控制分配集成方法

Umut Demir, Aamir Ahmad, Walter Fichter

发表机构 * University of Stuttgart, Faculty of Aerospace Engineering and Geodesy, Institute of Flight Mechanics and Control (iFR)（斯图加特大学航空航天工程与大地测量学院飞行力学与控制研究所）

专题命中规划控制：过驱动飞行器控制分配

AI总结提出一种基于稀疏非线性动力学辨识的学习控制效能映射方法，结合在线自适应机制，实现过驱动飞行器的高效非线性控制分配，兼具可解释性和低计算成本。

详情

AI中文摘要

非线性动力学以及多个执行器之间产生的强耦合削弱了传统线性控制分配技术背后的假设。当飞行进入非线性效应主导的模态时，线性分配器因模型失配增加而精度下降，进而降低飞行控制系统的性能和鲁棒性。高保真机载模型和黑箱数据驱动方法可以在整个飞行包线内恢复精度，但分别带来实时分配难以承受的计算负担，并牺牲了验证和故障诊断所需的可解释性。本文通过使用稀疏非线性动力学辨识从代表性飞行数据中学习显式的、受物理约束的控制效能映射解析模型，解决了这些限制。所得映射紧凑、可解释，并允许解析导数，从而能够在非线性求解器中高效计算，同时额外包含执行器动力学，无需机载模型。在线自适应机制监控预测残差，并在检测到显著对象变化时刷新模型，从而在执行器故障和变化工况下提供平滑重构。该方法在一款高保真非线性基准飞行器上经过一系列激进机动评估，达到了与完整非线性机载模型相当的精度，同时相对于现有基线显著降低了计算成本。

英文摘要

Nonlinear dynamics and the strong couplings that arise between multiple effectors undermine the assumptions behind conventional, linear control allocation techniques. When flight enters regimes where nonlinear effects dominate, linear allocators exhibit reduced accuracy due to increased model mismatch, which subsequently degrades performance and robustness of the flight control system. High fidelity onboard models and black box data driven approaches can recover accuracy across the flight envelope, but respectively impose computational burdens prohibitive for real time allocation and sacrifice the interpretability required for verification and fault diagnosis. This paper addresses these limitations by learning an explicit, physics constrained analytical model of the control effectiveness mapping from representative flight data using Sparse Identification of Nonlinear Dynamics. The resulting mapping is compact, interpretable, and admits analytical derivatives, enabling efficient computation within nonlinear solvers that additionally incorporate actuator dynamics, without requiring an onboard model. An online adaptation mechanism monitors prediction residuals and refreshes the model when significant plant changes are detected, providing graceful reconfiguration under actuator failures and varying operating conditions. The methodology is evaluated on a high fidelity nonlinear benchmark aircraft across a range of aggressive maneuvers, achieving accuracy comparable to a full nonlinear onboard model while substantially reducing computational cost relative to established baselines.

URL PDF HTML ☆

赞 0 踩 0