arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2605.01427 2026-05-05 cs.RO

SixthSense: Task-Agnostic Proprioception-Only Whole-Body Wrench Estimation for Humanoids

Xingzhou Chen, Xiayan Xu, Yan Ning, Jiyu Yu, Yizheng Zhang, Siyi Qian, Lingzhu Xiang, Jiahao Chen, Yuquan Wang, Haodong Zhang, Ling Shi

详情

英文摘要

Humanoid robots are entering our physical world at scale, yet as oversized toys--good at singing and dancing, but short on force-interaction capabilities for practical tasks. Bridging this gap necessitates prioritizing reliable contact perception as a fundamental requirement. Estimating external wrenches in humanoids is complicated by floating-base dynamics and indeterminate contact locations. Existing analytical frameworks require idealistic assumptions and hard-to-obtain measurements, which are often unavailable in practice. To bridge this gap, we propose SixthSense, a task-agnostic approach that infers whole-body contact timing, location, and wrenches from proprioception and IMU data alone. To capture the multi-modal dynamics between unstructured contact inputs and the uncertain motion outputs, we employ conditional flow matching to tokenize proprioceptive histories and estimate a spatiotemporally sparse contact-event flow. SixthSense serves as a plug-and-play perception module for applications including collision detection, physical human-robot interaction, and force-feedback teleoperation. Experiments across standing, walking, and whole-body motion-tracking policies showcased unprecedented performance in diverse behaviors.

URL PDF HTML ☆

赞 0 踩 0

2605.01425 2026-05-05 cs.LG

Barriers to Counterfactual Credit Attribution for Autoregressive Models

Aloni Cohen, Chenhao Zhang

Comments ICML 2026

2605.01424 2026-05-05 cs.LG cs.AI

Quantifying Multimodal Capabilities: Formal Generalization Guarantees in Pairwise Metric Learning

Richeng Zhou, Xuelin Zhang, Liyuan Liu

2605.01420 2026-05-05 cs.AI

Artificial Jagged Intelligence as Uneven Optimization Energy Allocation Capability Concentration, Redistribution, and Optimization Governance

Wesley Shu, Peng Wei

2605.01418 2026-05-05 cs.AI

TimeTok: Granularity-Controllable Time-Series Generation via Hierarchical Tokenization

Seokhyun Lee, Jaeho Kim, Changjun Oh, Mihaela van der Schaar, Changhee Lee

2605.01417 2026-05-05 cs.CL cs.AI

Medmarks: A Comprehensive Open-Source LLM Benchmark Suite for Medical Tasks

Benjamin Warner, Ratna Sagari Grandhi, Max Kieffer, Aymane Ouraq, Saurav Panigrahi, Geetu Ambwani, Kunal Bagga, Nikhil Khandekar, Arya Hariharan, Nishant Mishra, Manish Ram, Shamus Sim Zi Yang, Ahmed Essouaied, Adepoju Jeremiah Moyondafoluwa, Robert Scholz, Bofeng Huang, Molly Beavers, Srishti Gureja, Anish Mahishi, Sameed Khan, Maxime Griot, Hunar Batra, Jean-Benoit Delbrouck, Siddhant Bharadwaj, Ronald Clark, Ashish Vashist, Anas Zafar, Leema Krishna Murali, Harsh Deshpande, Ameen Patel, William Brown, Johannes Hagemann, Connor Lane, Paul Steven Scotti, Tanishq Mathew Abraham

Comments website: https://medmarks.ai

2605.01415 2026-05-05 cs.AI cs.CY

AI Safety as Control of Irreversibility: A Systems Framework for Decision-Energy and Sovereignty Boundaries

Wesley Shu, Peng Wei

2605.01403 2026-05-05 cs.LG

Rethinking Multi-Label Node Classification: Do Tuned Classic GNNs Suffice?

Yuxuan Xiao, Shengzhong Zhang

2605.01399 2026-05-05 cs.CL cs.AI cs.IR

Verbal-R3: Verbal Reranker as the Missing Bridge between Retrieval and Reasoning

Sangkwon Park, Donghun Kang, Jisoo Mok, Sungroh Yoon

Comments ACL 2026 Main Conference

2605.01393 2026-05-05 cs.CV

Recall to Predict: Grounding Motion Forecasting in Interpretable Motion Bank

Abhishek Vivekanandan, Ahmed Abouelazm, J. Marius Zöllner

Comments Sumitted for PeerReview

2605.01383 2026-05-05 cs.LG cond-mat.dis-nn physics.comp-ph

Sequential Learning and Catastrophic Forgetting in Differentiable Resistor Networks

Maniru Ibrahim

2605.01382 2026-05-05 cs.CV cs.AI

Sparse Representation Learning for Vessels

Chinmay Prabhakar, Bastian Wittmann, Paul Büschl, Hongwei Bran Li, Bjoern Menze, Suprosanna Shit

2605.01381 2026-05-05 cs.CL cs.LG

A framework for analyzing concept representations in neural models

Burin Naowarat, Hao Tang, Sharon Goldwater

Comments CoNLL 2026

2605.01376 2026-05-05 cs.AI

A Cellular Doctrine of Morality: Intrinsic Active Precision and the Mind-Reality Overload Dilemma

Ahsan Adeel

2605.01373 2026-05-05 cs.CL cs.AI

Focus on the Core: Empowering Diffusion Large Language Models by Self-Contrast

Jinyuan Feng, Xin Yu, Yiqun Chen, Xiaochi Wei, Yan Gao, Yi Wu, Yao Hu, Zhiqiang Pu

2605.01372 2026-05-05 cs.CL

Embedding-based In-Context Prompt Training for Enhancing LLMs as Text Encoders

Ailiang Lin, Zhuoyun Li, Keyu Mao, Kotaro Funakoshi, Manabu Okumura

2605.01371 2026-05-05 cs.RO

ESARBench: A Benchmark for Agentic UAV Embodied Search and Rescue

Daoxuan Zhang, Ping Chen, Jianyi Zhou, Shuo Yang

Comments 20 pages, 7 figures

详情

英文摘要

The rapid advancement of Multimodal Large Language Models (MLLMs) has empowered Unmanned Aerial Vehicle (UAV) with exceptional capabilities in spatial reasoning, semantic understanding, and complex decision-making, making them inherently suited for UAV Search and Rescue (SAR). However, existing UAV SAR research is dominated by traditional vision and path-planning methods and lacks a comprehensive and unified benchmark for embodied agents. To bridge this gap, we first propose the novel task of \textbf{Embodied Search and Rescue (ESAR)}, which requires aerial agents to autonomously explore complex environments, identify rescue clues, and reason about victim locations to execute informed decision-making. Additionally, we present \textbf{ESARBench}, the first comprehensive benchmark designed to evaluate MLLM-driven UAV agents in highly realistic SAR scenarios. Leveraging Unreal Engine 5 and AirSim, we construct four high-fidelity, large-scale open environments mapped directly from real-world Geographic Information System (GIS) data to ensure photorealistic landscapes. To rigorously simulate actual rescue operations, our benchmark incorporates dynamic variables including weather conditions, time of day, and stochastic clue placement. Furthermore, we create a dataset of 600 tasks modeled after real-world rescue cases and propose a robust set of evaluation metrics. We evaluate diverse baselines, ranging from traditional heuristics to advanced ground and aerial MLLM-based ObjectNav agents. Experimental results highlight the challenges in ESAR, revealing critical bottlenecks in spatial memory, aerial adaptation, and the trade-off between search efficiency and flight safety. We hope ESARBench serves as a valuable resource to advance research on Embodied Search and Rescue domain. Source code and project page: https://4amgodvzx.github.io/ESAR.github.io.

URL PDF HTML ☆

赞 0 踩 0

2605.01368 2026-05-05 cs.RO

Assistance Without Interruption: A Benchmark and LLM-based Framework for Non-Intrusive Human-Robot Assistance

Yuedi Zhang, Shuanghao Bai, Wanqi Zhou, Haoran Zhang, Qi Zhang, Zhirong Luan, Badong Chen

2605.01365 2026-05-05 cs.CV cs.RO

VoxAfford: Multi-Scale Voxel-Token Fusion for Open-Vocabulary 3D Affordance Detection

Haowen Sun, Shaolong Zhang, Mingyang Li, Chengzhong Ma, Xinzhe Chen, Qiongjie Cui, Xingyu Chen, Zeyang Liu, Xuguang Lan

2605.01364 2026-05-05 cs.LG cs.SY eess.SY

Toward a foundational thermal model for residential buildings

Ting-Yu Dai, Kingsley Nweye, Dev Niyogi, Zoltan Nagy

2605.01359 2026-05-05 cs.AI

Structural Ranking of the Cognitive Plausibility of Computational Models of Analogy and Metaphors with the Minimal Cognitive Grid

Alessio Donvito, Antonio Lieto

Comments 35 pages

2605.01358 2026-05-05 cs.LG

PACE: Parameter Change for Unsupervised Environment Design

Fang Yuan, Quanjun Yin, Siqi Shen, Yuxiang Xie, Junqiang Yang, Long Qin, Junjie Zeng, Qinglun Li

2605.01357 2026-05-05 cs.CL

On Stable Long-Form Generation: Benchmarking and Mitigating Length Volatility

Zhitao He, Haolin Yang, Rui Min, Zeyu Qin, Yi R. Fung

2605.01356 2026-05-05 cs.LG cs.AI

Model-Based Proactive Cost Generation for Learning Safe Policies Offline with Limited Violation Data

Ruiqi Xue, Lei Yuan, Kainuo Cheng, Jing-Wen Yang, Yang Yu

2605.01350 2026-05-05 cs.CL

LLM Output Detectability and Task Performance Can be Jointly Optimized

Koshiro Saito, Ryuto Koike, Masahiro Kaneko, Naoaki Okazaki

Comments Preprint. Under review

2605.01347 2026-05-05 cs.CL cs.AI cs.LG

MAD-OPD: Breaking the Ceiling in On-Policy Distillation via Multi-Agent Debate

Jianze Wang, Ying Liu, Jinlong Chen, Xuchun Hu, Qilong Zhang, Yu Cao, Jun Wang, Hua Yang, Yong Xie, Qianglong Chen

Comments Preprint. 9-page main paper + appendix. 8 figures, 7 tables. Code: https://github.com/chiefovoavicii/MAD-OPD

2605.01346 2026-05-05 cs.CV

CHASE: Competing Hypotheses for Ambiguity-Aware Selective Prediction

Kartik Jhawar, Yuhao Geng, Atul N. Parikh, Lipo Wang

2605.01340 2026-05-05 cs.RO eess.SP

Terrain Perception for Agricultural UAVs in Complex Farmland via Rotating mmWave Radar

Zhihao Zhan, Le Tao, Shaobin Li, Chenxin Fang, Xingrui Yang, Liang Li, Rui Fan, Yuhang Ming

2605.01339 2026-05-05 cs.LG

Robust Parameter Learning for Uncertain MDPs

Yannik Schnitzer, Alessandro Abate, David Parker

2605.01338 2026-05-05 cs.AI

DiagramNet: An End-to-End Recognition Framework and Dataset for Non-Standard System-Level Diagrams

Jincheng Lou, Ruohan Xu, Jiapeng Li, Junyin Pi, Runzhe Tao, Weijian Fan, Xiao Tan, Guojie Luo, Yibo Lin

Comments 13 pages, 7 figures. Preprint