arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2409.14590 2026-04-09 cs.LG cs.AI stat.ML

Explainable AI needs formalization

Stefan Haufe, Rick Wilming, Benedict Clark, Rustam Zhumagambetov, Ahcène Boubekki, Jörg Martin, Danny Panknin

详情

英文摘要

The field of "explainable artificial intelligence" (XAI) seemingly addresses the desire that decisions of machine learning systems should be human-understandable. However, in its current state, XAI itself needs scrutiny. Popular methods cannot reliably answer relevant questions about ML models, their training data, or test inputs, because they systematically attribute importance to input features that are independent of the prediction target. This limits the utility of XAI for diagnosing and correcting data and models, for scientific discovery, and for identifying intervention targets. The fundamental reason for this is that current XAI methods do not address well-defined problems and are not evaluated against targeted criteria of explanation correctness. Researchers should formally define the problems they intend to solve and design methods accordingly. This will lead to diverse use-case-dependent notions of explanation correctness and objective metrics of explanation performance that can be used to validate XAI algorithms.

URL PDF HTML ☆

赞 0 踩 0

2409.09298 2026-04-09 cs.LG cs.AI cs.DB

Matrix Profile for Anomaly Detection on Multidimensional Time Series

Chin-Chia Michael Yeh, Audrey Der, Uday Singh Saini, Vivian Lai, Yan Zheng, Junpeng Wang, Xin Dai, Zhongfang Zhuang, Yujie Fan, Huiyuan Chen, Prince Osei Aboagye, Liang Wang, Wei Zhang, Eamonn Keogh

Comments https://github.com/mcyeh/mmpad_tsb

2409.06490 2026-04-09 cs.CV stat.AP

UAVDB: Point-Guided Masks for UAV Detection and Segmentation

Yu-Hsi Chen

Comments 14 pages, 4 figures, 4 tables

2409.01633 2026-04-09 cs.LG cs.AI cs.CV

SleepNet and DreamNet: Enriching and Reconstructing Representations for Consolidated Visual Classification

Mingze Ni, Wei Liu

2405.16240 2026-04-09 cs.LG

AFL: A Single-Round Analytic Approach for Federated Learning with Pre-trained Models

Run He, Kai Tong, Di Fang, Han Sun, Ziqian Zeng, Haoran Li, Tianyi Chen, Huiping Zhuang

Comments Published in CVPR 2025

2405.11619 2026-04-09 cs.LG cs.AI

Novel Interpretable and Robust Web-based AI Platform for Phishing Email Detection

Abdulla Al-Subaiey, Mohammed Al-Thani, Naser Abdullah Alam, Kaniz Fatema Antora, Amith Khandakar, SM Ashfaq Uz Zaman

Comments 19 pages, 7 figures, dataset link: https://www.kaggle.com/datasets/naserabdullahalam/phishing-email-dataset/

2403.06568 2026-04-09 cs.AI

Better Understandings and Configurations in MaxSAT Local Search Solvers via Anytime Performance Analysis

Furong Ye, Chuan Luo, Shaowei Cai

2402.02249 2026-04-09 cs.LG

Don't Label Twice: Quantity Beats Quality when Comparing Binary Classifiers on a Budget

Florian E. Dorner, Moritz Hardt

Comments 34 pages, 3 Figures, Published at ICML 2024

2309.08780 2026-04-09 cs.RO

STERN: Simultaneous Trajectory Estimation and Relative Navigation for Autonomous Underwater Proximity Operations

Aldo Terán Espinoza, Antonio Terán Espinoza, John Folkesson, Clemens Deutsch, Niklas Rolleberg, Peter Sigray, Jakob Kuttenkeuler

Comments v2 updated after revision. Article contains 24 pages and 18 figures. Published in the IEEE Journal of Oceanic Engineering, available at: https://doi.org/10.1109/JOE.2025.3624470

详情

DOI: 10.1109/JOE.2025.3624470
Journal ref: IEEE Journal of Oceanic Engineering, vol. 51, no. 1, pp. 293-316, Jan. 2026

英文摘要

Due to the challenges regarding the limits of their endurance and autonomous capabilities, underwater docking for autonomous underwater vehicles (AUVs) has become a topic of interest for many academic and commercial applications. Herein, we take on the problem of relative navigation for the generalized version of the docking operation, which we address as proximity operations. Proximity operations typically involve only two actors, a chaser and a target. We leverage the similarities to proximity operations (prox-ops) from spacecraft robotic missions to frame the diverse docking scenarios with a set of phases the chaser undergoes on the way to its target. We emphasize the versatility on the use of factor graphs as a generalized representation to model the underlying simultaneous trajectory estimation and relative navigation (STERN) problem that arises with any prox-ops scenario, regardless of the sensor suite or the agents' dynamic constraints. To emphasize the flexibility of factor graphs as the modeling foundation for arbitrary underwater prox-ops, we compile a list of state-of-the-art research in the field and represent the different scenario using the same factor graph representation. We detail the procedure required to model, design, and implement factor graph-based estimators by addressing a long-distance acoustic homing scenario of an AUV to a moving mothership using datasets from simulated and real-world deployments; an analysis of these results is provided to shed light on the flexibility and limitations of the dynamic assumptions of the moving target. A description of our front- and back-end is also presented together with a timing breakdown of all processes to show its potential deployment on a real-time system.

URL PDF HTML ☆

赞 0 踩 0

2306.14685 2026-04-09 cs.CV cs.AI

DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models

Ximing Xing, Chuang Wang, Haitao Zhou, Jing Zhang, Qian Yu, Dong Xu

Comments Accepted by NeurIPS 2023. Project page: https://ximinng.github.io/DiffSketcher-project/

2303.11789 2026-04-09 cs.LG cs.DC cs.SY eess.SY math.PR

Decentralized Online Learning for Random Inverse Problems Over Graphs

Xiwei Zhang, Tao Li, Yan Chen, Qianyuan Long

2604.07350 2026-04-09 cs.CV cs.GR cs.LG

Fast Spatial Memory with Elastic Test-Time Training

Ziqiao Ma, Xueyang Yu, Haoyu Zhen, Yuncong Yang, Joyce Chai, Chuang Gan

Comments Project Page: https://fast-spatial-memory.github.io/

2604.07348 2026-04-09 cs.CV cs.AI cs.GR cs.LG cs.RO

MoRight: Motion Control Done Right

Shaowei Liu, Xuanchi Ren, Tianchang Shen, Huan Ling, Saurabh Gupta, Shenlong Wang, Sanja Fidler, Jun Gao

Comments Project Page: https://research.nvidia.com/labs/sil/projects/moright

2604.07343 2026-04-09 cs.CL cs.LG

Personalized RewardBench: Evaluating Reward Models with Human Aligned Personalization

Qiyao Ma, Dechen Gao, Rui Cai, Boqi Zhao, Hanchu Zhou, Junshan Zhang, Zhe Zhao

2604.07340 2026-04-09 cs.CV

TC-AE: Unlocking Token Capacity for Deep Compression Autoencoders

Teng Li, Ziyuan Huang, Cong Chen, Yangfu Li, Yuanhuiyi Lyu, Dandan Zheng, Chunhua Shen, Jun Zhang

2604.07338 2026-04-09 cs.CV cs.CL cs.MM

Appear2Meaning: A Cross-Cultural Benchmark for Structured Cultural Metadata Inference from Images

Yuechen Jiang, Enze Zhang, Md Mohsinul Kabir, Qianqian Xie, Stavroula Golfomitsou, Konstantinos Arvanitis, Sophia Ananiadou

2604.07337 2026-04-09 cs.CV

From Blobs to Spokes: High-Fidelity Surface Reconstruction via Oriented Gaussians

Diego Gomez, Antoine Guédon, Nissim Maruani, Bingchen Gong, Maks Ovsjanikov

Comments Our project page is available in http://diego1401.github.io/BlobsToSpokesWebsite/index.html

2604.07335 2026-04-09 cs.RO

TAMEn: Tactile-Aware Manipulation Engine for Closed-Loop Data Collection in Contact-Rich Tasks

Longyan Wu, Jieji Ren, Chenghang Jiang, Junxi Zhou, Shijia Peng, Ran Huang, Guoying Gu, Li Chen, Hongyang Li

详情

英文摘要

Handheld paradigms offer an efficient and intuitive way for collecting large-scale demonstration of robot manipulation. However, achieving contact-rich bimanual manipulation through these methods remains a pivotal challenge, which is substantially hindered by hardware adaptability and data efficacy. Prior hardware designs remain gripper-specific and often face a trade-off between tracking precision and portability. Furthermore, the lack of online feasibility checking during demonstration leads to poor replayability. More importantly, existing handheld setups struggle to collect interactive recovery data during robot execution, lacking the authentic tactile information necessary for robust policy refinement. To bridge these gaps, we present TAMEn, a tactile-aware manipulation engine for closed-loop data collection in contact-rich tasks. Our system features a cross-morphology wearable interface that enables rapid adaptation across heterogeneous grippers. To balance data quality and environmental diversity, we implement a dual-modal acquisition pipeline: a precision mode leveraging motion capture for high-fidelity demonstrations, and a portable mode utilizing VR-based tracking for in-the-wild acquisition and tactile-visualized recovery teleoperation. Building on this hardware, we unify large-scale tactile pretraining, task-specific bimanual demonstrations, and human-in-the-loop recovery data into a pyramid-structured data regime, enabling closed-loop policy refinement. Experiments show that our feasibility-aware pipeline significantly improves demonstration replayability, and that the proposed visuo-tactile learning framework increases task success rates from 34% to 75% across diverse bimanual manipulation tasks. We further open-source the hardware and dataset to facilitate reproducibility and support research in visuo-tactile manipulation.

URL PDF HTML ☆

赞 0 踩 0

2604.07331 2026-04-09 cs.RO cs.AI cs.CV

RoSHI: A Versatile Robot-oriented Suit for Human Data In-the-Wild

Wenjing Margaret Mao, Jefferson Ng, Luyang Hu, Daniel Gehrig, Antonio Loquercio

Comments 8 pages, 4 figures. *Equal contribution by first three authors. Project webpage: https://roshi-mocap.github.io/

2604.07329 2026-04-09 cs.CV

Distilling Photon-Counting CT into Routine Chest CT through Clinically Validated Degradation Modeling

Junqi Liu, Xinze Zhou, Wenxuan Li, Scott Ye, Arkadiusz Sitek, Xiaofeng Yang, Yucheng Tang, Daguang Xu, Kai Ding, Kang Wang, Yang Yang, Alan L. Yuille, Zongwei Zhou

2604.07320 2026-04-09 cs.CL cs.AI

Evaluating In-Context Translation with Synchronous Context-Free Grammar Transduction

Jackson Petty, Jaulie Goe, Tal Linzen

2604.07316 2026-04-09 cs.LG

SL-FAC: A Communication-Efficient Split Learning Framework with Frequency-Aware Compression

Zehang Lin, Miao Yang, Haihan Zhu, Zheng Lin, Jianhao Huang, Jing Yang, Guangjin Pan, Dianxin Luan, Zihan Fang, Shunzhi Zhu, Wei Ni, John Thompson

Comments 6 pages, 4 figures

2604.07306 2026-04-09 cs.CV cs.LG

Beyond Loss Values: Robust Dynamic Pruning via Loss Trajectory Alignment

Huaiyuan Qin, Muli Yang, Gabriel James Goenawan, Kai Wang, Zheng Wang, Peng Hu, Xi Peng, Hongyuan Zhu

Comments Published in CVPR 2026 Findings

2604.07298 2026-04-09 cs.CV cs.AI eess.IV

Region-Graph Optimal Transport Routing for Mixture-of-Experts Whole-Slide Image Classification

Xin Tian, Jiuliu Lu, Ephraim Tsalik, Bart Wanders, Colleen Knoth, Julian Knight

Comments 10 pages, 2 figures, 2 tables

2604.07286 2026-04-09 cs.RO cs.AI cs.LG

CADENCE: Context-Adaptive Depth Estimation for Navigation and Computational Efficiency

Timothy K Johnsen, Marco Levorato

Comments 7 pages, 7 figures, Accepted for publication at IEEE World AI IoT Congress (AIIoT) 2026

2604.07285 2026-04-09 cs.CL cs.CY

Why teaching resists automation in an AI-inundated era: Human judgment, non-modular work, and the limits of delegation

Songhee Han

2604.07282 2026-04-09 cs.CV cs.LG

Are Face Embeddings Compatible Across Deep Neural Network Models?

Fizza Rubab, Yiying Tong, Arun Ross

2604.07279 2026-04-09 cs.CV

Mem3R: Streaming 3D Reconstruction with Hybrid Memory via Test-Time Training

Changkun Liu, Jiezhi Yang, Zeman Li, Yuan Deng, Jiancong Guo, Luca Ballan

Comments Project page: https://lck666666.github.io/Mem3R/

2604.07274 2026-04-09 cs.CL cs.AI cs.LG

A Systematic Study of Retrieval Pipeline Design for Retrieval-Augmented Medical Question Answering

Nusrat Sultana, Abdullah Muhammad Moosa, Kazi Afzalur Rahman, Sajal Chandra Banik

2604.07272 2026-04-09 cs.CL

ClickGuard: A Trustworthy Adaptive Fusion Framework for Clickbait Detection

Chhavi Dhiman, Naman Chawla, Riya Dhami, Gaurav Kumar, Ganesh Naik