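arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems, and related fields.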

2604.08341 2026-04-10 cs.RO

A Unified Multi-Layer Framework for Skill Acquisition from Imperfect Human Demonstrations

Zi-Qi Yang, Mehrdad R. Kermani

Comments 6 pages, 4 figures. Submitted to a conference proceeding

English abstract

Current Human-Robot Interaction (HRI) systems for skill teaching are fragmented, and existing approaches in the literature do not offer a cohesive framework that is simultaneously efficient, intuitive, and universally safe. This paper presents a novel, layered control framework that addresses this fundamental gap by enabling robust, compliant Learning from Demonstration (LfD) built upon a foundation of universal robot compliance. The proposed approach is structured in three progressive and interconnected stages. First, we introduce a real-time LfD method that learns both the trajectory and variable impedance from a single demonstration, significantly improving efficiency and reproduction fidelity. To ensure high-quality and intuitive kinesthetic teaching, we then present a null-space optimization strategy that proactively manages singularities and provides a consistent interaction feel during human demonstration. Finally, to ensure generalized safety, we introduce a foundational null-space compliance method that enables the entire robot body to compliantly adapt to post-learning external interactions without compromising main task performance. This final contribution transforms the system into a versatile HRI platform, moving beyond end-effector (EE)-specific applications. We validate the complete framework through comprehensive comparative experiments on a 7-DOF KUKA LWR robot. The results demonstrate a safer, more intuitive, and more efficient unified system for a wide range of human-robot collaborative tasks.

2604.08340 2026-04-10 cs.CV cs.AI

PokeGym: A Visually-Driven Long-Horizon Benchmark for Vision-Language Models

Ruizhi Zhang, Ye Huang, Yuangang Pan, Chuanfu Shen, Zhilin Liu, Ting Xie, Wen Li, Lixin Duan

Comments Tech report

2604.08337 2026-04-10 cs.CV cs.AI

InstAP: Instance-Aware Vision-Language Pre-Train for Spatial-Temporal Understanding

Ashutosh Kumar, Rajat Saini, Jingjing Pan, Mustafa Erdogan, Mingfang Zhang, Betty Le Dem, Norimasa Kobori, Quan Kong

2604.08336 2026-04-10 cs.LG

Leveraging Complementary Embeddings for Replay Selection in Continual Learning with Small Buffers

Danit Yanowsky, Daphna Weinshall

2604.08335 2026-04-10 cs.LG cs.AI

Dead Weights, Live Signals: Feedforward Graphs of Frozen Language Models

Marcus Armstrong, Navid Ayoobi, Arjun Mukherjee

2604.08333 2026-04-10 cs.CV cs.AI cs.LG

Lost in the Hype: Revealing and Dissecting the Performance Degradation of Medical Multimodal Large Language Models in Image Classification

Xun Zhu, Fanbin Mo, Xi Chen, Kaili Zheng, Shaoshuai Yang, Yiming Shi, Jian Gao, Miao Li, Ji Wu

2604.08326 2026-04-10 cs.AI

ProMedical: Hierarchical Fine-Grained Criteria Modeling for Medical LLM Alignment via Explicit Injection

He Geng, Yangmin Huang, Lixian Lai, Qianyun Du, Hui Chu, Zhiyang He, Jiaxue Hu, Xiaodong Tao

Comments ACL 2026

2604.08322 2026-04-10 cs.CV

Fundus-R1: Training a Fundus-Reading MLLM with Knowledge-Aware Reasoning on Public Data

Yuchuan Deng, Qijie Wei, Kaiheng Qian, Jiazhen Liu, Zijie Xin, Bangxiang Lan, Jingyu Liu, Jianfeng Dong, Xirong Li

2604.08301 2026-04-10 cs.CV

GroundingAnomaly: Spatially-Grounded Diffusion for Few-Shot Anomaly Synthesis

Yishen Liu, Hongcang Chen, Pengcheng Zhao, Yunfan Bao, Yuxi Tian, Jieming Zhang, Hao Chen, Zheng Zhi, Yongchun Liu, Ying Li, Dongpu Cao

Comments 32 pages, 15 figures

2604.08294 2026-04-10 cs.CV cs.AI cs.CL

Can Vision Language Models Judge Action Quality? An Empirical Evaluation

Miguel Monte e Freitas, Rui Henriques, Ricardo Rei, Pedro Henrique Martins

2604.08292 2026-04-10 cs.RO

EMMa: End-Effector Stability-Oriented Mobile Manipulation for Tracked Rescue Robots

Yifei Wang, Hao Zhang, Jidong Huang, Shuohang Fang, Haoyao Chen

Comments 14 pages, 17 figures

2604.08284 2026-04-10 cs.CL cs.AI

Distributed Multi-Layer Editing for Rule-Level Knowledge in Large Language Models

Yating Wang, Wenting Zhao, Yaqi Zhao, Yongshun Gong, Yilong Yin, Haoliang Sun

Comments 17 pages, 3 figures. Under review

2604.08282 2026-04-10 cs.CV

Revisiting Radar Perception With Spectral Point Clouds

Hamza Alsharif, Jing Gu, Pavol Jancura, Satish Ravindran, Gijs Dubbelman

Comments CVPR 2026 Workshop (PBVS 2026). Project page: https://www.tue-mps.org/Spectral-Point-Clouds-Radar/

2604.08276 2026-04-10 cs.AI cs.CR

ACF: A Collaborative Framework for Agent Covert Communication under Cognitive Asymmetry

Wansheng Wu, Kaibo Huang, Yukun Wei, Zhongliang Yang, Linna Zhou

Comments 5 pages, 3 figures. Submitted to IEEE Signal Processing Letters (SPL). Source code is available at https://github.com/Dwinovo/ACF-Stego

2604.08275 2026-04-10 cs.CL

Floating or Suggesting Ideas? A Large-Scale Contrastive Analysis of Metaphorical and Literal Verb-Object Constructions

Prisca Piccirilli, Alexander Fraser, Sabine Schulte im Walde

Comments 17 pages, 4 figures, 3 tables. Accepted at CMCL@LREC2026

2604.08272 2026-04-10 cs.CV eess.IV

Preventing Overfitting in Deep Image Prior for Hyperspectral Image Denoising

Panagiotis Gkotsis, Athanasios A. Rontogiannis

Comments 7 pages, 5 figures

2604.08271 2026-04-10 cs.LG

An Illusion of Unlearning? Assessing Machine Unlearning Through Internal Representations

Yichen Gao, Altay Unal, Akshay Rangamani, Zhihui Zhu

Comments 9 pages main text, 21 pages total, 6 figures. Accepted at AISTATS 2026

2604.08266 2026-04-10 cs.CV

Orion-Lite: Distilling LLM Reasoning into Efficient Vision-Only Driving Models

Jing Gu, Niccolò Cavagnero, Gijs Dubbelman

2604.08263 2026-04-10 cs.AI

Neural-Symbolic Knowledge Tracing: Injecting Educational Knowledge into Deep Learning for Responsible Learner Modelling

Danial Hooshyar, Gustav Šír, Yeongwook Yang, Tommi Kärkkäinen, Raija Hämäläinen, Ekaterina Krivich, Mutlu Cukurova, Dragan Gašević, Roger Azevedo

English abstract

The growing use of artificial intelligence (AI) in education, particularly large language models (LLMs), has increased interest in intelligent tutoring systems. However, LLMs often show limited adaptivity and struggle to model learners' evolving knowledge over time, highlighting the need for dedicated learner modelling approaches. Although deep knowledge tracing methods achieve strong predictive performance, their opacity and susceptibility to bias can limit alignment with pedagogical principles. To address this, we propose Responsible-DKT, a neural-symbolic deep knowledge tracing approach that integrates symbolic educational knowledge (e.g., mastery and non-mastery rules) into sequential neural models for responsible learner modelling. Experiments on a real-world dataset of students' math interactions show that Responsible-DKT outperforms both a neural-symbolic baseline and a fully data-driven PyTorch DKT model across training settings. The model achieves over 0.80 AUC with only 10% of training data and up to 0.90 AUC, improving performance by up to 13%. It also demonstrates improved temporal reliability, producing lower early- and mid-sequence prediction errors and the lowest prediction inconsistency rates across sequence lengths, indicating that prediction updates remain directionally aligned with observed student responses over time. Furthermore, the neural-symbolic approach offers intrinsic interpretability via a grounded computation graph that exposes the logic behind each prediction, enabling both local and global explanations. It also allows empirical evaluation of pedagogical assumptions, revealing that repeated incorrect responses (non-mastery) strongly influence prediction updates. These results indicate that neural-symbolic approaches enhance both performance and interpretability, mitigate data limitations, and support more responsible, human-centered AI in education.

2604.08260 2026-04-10 cs.CL cs.AI

Behavior-Aware Item Modeling via Dynamic Procedural Solution Representations for Knowledge Tracing

Jun Seo, Sangwon Ryu, Heejin Do, Hyounghun Kim, Gary Geunbae Lee

Comments ACL Findings 2026

2604.08258 2026-04-10 cs.RO

EvoGymCM: Harnessing Continuous Material Stiffness for Soft Robot Co-Design

Le Shen, Kangyao Huang, Wentao Zhao, Huaping Liu

Comments 8 pages, 11 figures. Preprint. Under review at IROS 2026

2604.08245 2026-04-10 cs.AI

From Phenomenological Fitting to Endogenous Deduction: A Paradigm Leap via Meta-Principle Physics Architecture

Helong Hu, HongDan Pan, ShuiQing Hu

Comments 23 pages, 4 figures, 11 tables

2604.08238 2026-04-10 cs.CV

$\oslash$ Source Models Leak What They Shouldn't $\nrightarrow$: Unlearning Zero-Shot Transfer in Domain Adaptation Through Adversarial Optimization

Arnav Devalapally, Poornima Jain, Kartik Srinivas, Vineeth N. Balasubramanian

Comments CVPR 2026

2604.08232 2026-04-10 cs.AI

HiRO-Nav: Hybrid ReasOning Enables Efficient Embodied Navigation

He Zhao, Yijun Yang, Zichuan Lin, Deheng Ye, Chunyan Miao

2604.08230 2026-04-10 cs.CV

Generalization Under Scrutiny: Cross-Domain Detection Progresses, Pitfalls, and Persistent Challenges

Saniya M. Deshmukh, Kailash A. Hambarde, Hugo Proença

Comments 44 pages, 8 figures, 4 tables

2604.08226 2026-04-10 cs.AI cs.HC cs.SY eess.SY

Grounding Clinical AI Competency in Human Cognition Through the Clinical World Model and Skill-Mix Framework

Seyed Amir Ahmad Safavi-Naini, Elahe Meftah, Josh Mohess, Pooya Mohammadi Kazaj, Georgios Siontis, Zahra Atf, Peter R. Lewis, Mauricio Reyes, Girish Nadkarni, Roland Wiest, Stephan Windecker, Christoph Grani, Ali Soroush, Isaac Shiri

Comments Code, data (Clinical AI Skill-Mix dimension specifications), and an exploratory dashboard are available at https://github.com/Sdamirsa/Clinical-World-Model

2604.08212 2026-04-10 cs.CV

Vision-Language Foundation Models for Comprehensive Automated Pavement Condition Assessment

Blessing Agyei Kyem, Joshua Kofi Asamoah, Anthony Dontoh, Armstrong Aboah

2604.08211 2026-04-10 cs.CV

SciFigDetect: A Benchmark for AI-Generated Scientific Figure Detection

You Hu, Chenzhuo Zhao, Changfa Mo, Haotian Liu, Xiaobai Li

2604.08209 2026-04-10 cs.CV

OmniJigsaw: Enhancing Omni-Modal Reasoning via Modality-Orchestrated Reordering

Yiduo Jia, Muzhi Zhu, Hao Zhong, Mingyu Liu, Yuling Xi, Hao Chen, Bin Qin, Yongjie Yang, Zhenbo Luo, Chunhua Shen

Comments Project page: https://aim-uofa.github.io/OmniJigsaw/

2604.08204 2026-04-10 cs.LG cs.NE

Introducing Echo Networks for Computational Neuroevolution

Christian Kroos, Fabian Küch

Comments Accepted for AMLDS 2026 (International Conference on Advanced Machine Learning and Data Science)