arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2605.01516 2026-05-05 cs.RO

Dynamics Distillation for Efficient and Transferable Control Learning

Xunjiang Gu, Kashyap Chitta, Mahsa Golchoubian, Vladimir Suplin, Igor Gilitschenski

Comments 9 pages, 3 figures, under review

详情

英文摘要

Robust control policy learning for autonomous driving requires training environments to be both physically realistic and computationally scalable, properties that existing simulators provide only in isolation. We introduce Sim2Sim2Sim, a framework that bridges high-fidelity vehicle simulation and scalable reinforcement learning by distilling simulator dynamics into a highly parallelizable learned dynamics model. By training control policies purely within this distilled environment and deploying them back into the high-fidelity source simulator, we demonstrate more efficient policy optimization and reliable transfer under challenging dynamics. We further show that predictive accuracy alone does not fully characterize a learned dynamics model's suitability as a reinforcement learning training environment, which should also be assessed by the quality of the policies it enables.

URL PDF HTML ☆

赞 0 踩 0

2605.01515 2026-05-05 cs.SD cs.CR

MelShield: Robust Mel-Domain Audio Watermarking for Provenance Attribution of AI Generated Synthesized Speech

Yutong Jin, Qi Li, Lingshuang Liu, Jianbing Ni

Comments Accepted by ACISP 2026

2605.01513 2026-05-05 cs.LG cs.AI

Protein-Conditioned Multi-Objective Reinforcement Learning for Full-Length mRNA Design

Zixi Shao, Tao Wang, Yibei Xiao, Tianyi Huang

2605.01510 2026-05-05 cs.CV

SwiftPie: Lightning-fast Subject-driven Image Personalization via One step Diffusion

Huy Duong, Trong-Tung Nguyen, Cuong Pham, Anh Tran, Khoi Nguyen, Minh Hoai

Comments CVPR26 Finding

2605.01506 2026-05-05 cs.CV

OmniEncoder: See, Hear, and Feel Continuous Motion Like Humans With One Encoder

Detao Bai, Shimin Yao, Weixuan Chen, Chengen Lai, Yuanming Li, Zhiheng Ma, Xihan Wei

2605.01502 2026-05-05 cs.CV

RADMI: Latent Information Aggregation as a Proxy for Model Uncertainty

William Stevens, Mohit Prabhushankar, Ghassan AlRegib

Comments 7 pages, 4 figures, 3 tables, accepted to IEEE ICIP 2026

2605.01501 2026-05-05 cs.RO cs.MA

Distributed Algorithm with Emergent Area Partitioning and Base Station's Situation Awareness for Multi-Robot Patrolling

Kazuho Kobayashi, Shohei Kobayashi, Seiya Ueno, Takehiro Higuchi

2605.01498 2026-05-05 cs.CV

Towards Visual Query Localization in the 3D World

Liang Peng, Bohan Tan, Zhipeng Zhang, Haobo Li, Yifan Jiao, Xingping Dong, Libo Zhang

Comments Accepted to CVPR 2026. 8 pages

2605.01496 2026-05-05 cs.CV

SF20K Competition 2025: Summary and findings

Ridouane Ghermi, Xi Wang, Vicky Kalogeiton, Ivan Laptev

2605.01495 2026-05-05 cs.CL cs.AI

FT-RAG: A Fine-grained Retrieval-Augmented Generation Framework for Complex Table Reasoning

Zebin Guo, Weidong Geng, Ruichen Mao

2605.01490 2026-05-05 cs.CV cs.AI cs.LG

CGFformer: Cluster-Guidance Frequency Transformer for Pansharpening

Zijian Zhou, Jianing Zhang, Kai Sun, Xiangyu Zhao, Chunxia Zhang, Xiangyong Cao

Comments 35 pages, 12 pages

2605.01485 2026-05-05 cs.RO physics.soc-ph

Cut-In Gap Acceptance Toward Autonomous vs. Human-Driven Vehicles: Evidence from the Waymo Open Motion Dataset

Abdulaziz Alhuraish, Yuhang Wang, Hao Zhou

2605.01484 2026-05-05 cs.LG stat.ML

Evaluating LLMs on Large-Scale Graph Property Estimation via Random Walks

Sunil Kumar Maurya, Xin Liu

Comments Accepted to ACL 2026 Main Conference

2605.01483 2026-05-05 cs.CV cs.AI

Research on Vision-Language Question Answering Models for Industrial Robots

Ping Li, Bartlomiej Brzozka

Comments 8 Pages, 5 figures

2605.01480 2026-05-05 cs.CV

AttnRouter: Per-Category Attention Routing for Training-Free Image Editing on MMDiT

Guandong Li, Mengxia Ye

Comments 11 pages, 7 figures

2605.01479 2026-05-05 cs.CV

CSGuard: Toward Forgery-Resistant Watermarking in Diffusion Models via Compressed Sensing Constraint

Jiewei Lai, Lan Zhang, Chen Tang, Pengcheng Sun, Zhaopeng Zhang, Yunhao Wang, Hui Jin

2605.01478 2026-05-05 cs.CV cs.AI cs.LG

LIE: LiDAR-only HD Map Construction with Intensity Enhancement via Online Knowledge Distillation

Kanak Mazumder, Fabian B. Flohr

Comments This work has been accepted for publication in International Conference on Intelligent Transportation Systems (ITSC), IEEE, 2026. The final published version will be available via IEEE Xplore

2605.01477 2026-05-05 cs.RO

Action Agent: Agentic Video Generation Meets Flow-Constrained Diffusion

Jeffrin Sam, Nguyen Khang, Yara Mahmoud, Miguel Altamirano Cabrera, Dzmitry Tsetserukou

Comments 8 pages, 5 figures

2605.01474 2026-05-05 cs.CL

ReMedi: Reasoner for Medical Clinical Prediction

Yushi Cao, Yiming Chen, Hongchao Jiang, Hung-yi Lee, Robby T. Tan

Comments ACL 2026 findings

2605.01468 2026-05-05 cs.CV cs.AI

Decision Boundary-aware Generation for Long-tailed Learning

Jiacheng Yang, Ruichi Zhang, Chikai Shang, Mengke Li, Xinyi Shang, Junlong Gao, Yonggang Zhang, Yang Lu

Comments Accepted by CVPR 2026

2605.01461 2026-05-05 cs.RO cs.MA

LLM-Foraging: Large Language Models for Decentralized Swarm Robot Foraging

Peihan Li, Joanna Gutierrez, Fabian Hernandez, Qi Lu, Lifeng Zhou

2605.01451 2026-05-05 cs.CL

Auditing demographic bias in AI-based emergency police dispatch: a cross-lingual evaluation of eleven large language models

William Guey, Wei Zhang, Pierrick Bougault, Yi Wang, Bertan Ucar, Vitor D. de Moura, José O. Gomes

Comments 26 pages, 7 figures. Submitted to Humanities and Social Sciences Communications (Nature) collection on Artificial Intelligence and Emerging Technologies in Public Safety. Code and data: https://github.com/williamguey/llmdispatchbias

2605.01450 2026-05-05 cs.CV

Registration-Free Learnable Multi-View Capture of Faces in Dense Semantic Correspondence

Panagiotis P. Filntisis, George Retsinas, Radek Daněček, Vanessa Sklyarova, Petros Maragos, Timo Bolkart

Comments 19 pages, CVPR 2026

2605.01448 2026-05-05 cs.RO cs.CV

Decompose and Recompose: Reasoning New Skills from Existing Abilities for Cross-Task Robotic Manipulation

Xitie Zhang, Aming Wu, Yahong Han

Comments Accepted by ICML 2026

2605.01442 2026-05-05 cs.AI math.LO

Rethinking Explanations: Formalizing Contrast in Description Logics

Yasir Mahmood, Arnab Sharma, Axel-Cyrille Ngonga Ngomo, Balram Tiwari

Comments Pre-print to the paper accepted at XAI World conference, 2024 (https://xaiworldconference.com/)

2605.01441 2026-05-05 cs.CL cs.CY cs.HC

Artificial intelligence language technologies in multilingual healthcare: Grand challenges ahead

Vicent Briva-Iglesias

2605.01434 2026-05-05 cs.RO

High-Speed, Scalable Sensor Readout for Dexterous Robotic Hands via Shift-Register Multiplexing

Jaehoon Kim, Lazaros Christoforidis, Michalis Papadakis, Victor Kartsch, Robert K. Katzschmann

2605.01432 2026-05-05 cs.RO

Evidence-Based Landing Site Selection and Vison-Based Landing for UAVs in Unstructured Environments

Sina Sajjadi, Jacopo Panerati, Sina Soleymanpour, Varunkumar Mehta, Farrokh Janabi-Sharifi, Iraj Mantegh

2605.01429 2026-05-05 cs.AI cs.LG

SCALE-LoRA: Auditing Post-Retrieval LoRA Composition with Residual Merging and View Reliability

Shuaipeng Zhou, Yu Zhang

Comments 12 pages, 1 figure, 6 tables

2605.01428 2026-05-05 cs.CL

Hallucinations Undermine Trust; Metacognition is a Way Forward

Gal Yona, Mor Geva, Yossi Matias

Comments To appear in ICML 2026 (Position Track)