arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.20101 2026-03-23 cs.AI

Pitfalls in Evaluating Interpretability Agents

Tal Haklay, Nikhil Prakash, Sana Pandey, Antonio Torralba, Aaron Mueller, Jacob Andreas, Tamar Rott Shaham, Yonatan Belinkov

详情

英文摘要

Automated interpretability systems aim to reduce the need for human labor and scale analysis to increasingly large models and diverse tasks. Recent efforts toward this goal leverage large language models (LLMs) at increasing levels of autonomy, ranging from fixed one-shot workflows to fully autonomous interpretability agents. This shift creates a corresponding need to scale evaluation approaches to keep pace with both the volume and complexity of generated explanations. We investigate this challenge in the context of automated circuit analysis -- explaining the roles of model components when performing specific tasks. To this end, we build an agentic system in which a research agent iteratively designs experiments and refines hypotheses. When evaluated against human expert explanations across six circuit analysis tasks in the literature, the system appears competitive. However, closer examination reveals several pitfalls of replication-based evaluation: human expert explanations can be subjective or incomplete, outcome-based comparisons obscure the research process, and LLM-based systems may reproduce published findings via memorization or informed guessing. To address some of these pitfalls, we propose an unsupervised intrinsic evaluation based on the functional interchangeability of model components. Our work demonstrates fundamental challenges in evaluating complex automated interpretability systems and reveals key limitations of replication-based evaluation.

URL PDF HTML ☆

赞 0 踩 0

2603.20100 2026-03-23 cs.CL cs.AI

An Empirical Study of SFT-DPO Interaction and Parameterization in Small Language Models

Yuming Feng, Christy Yang

2603.20086 2026-03-23 cs.CV

Preference-Guided Debiasing for No-Reference Enhancement Image Quality Assessment

Shiqi Gao, Kang Fu, Zitong Xu, Huiyu Duan, Xiongkuo Min, Jia Wang, Guangtao Zhai

2603.20077 2026-03-23 cs.CV cs.RO eess.IV

A Unified Platform and Quality Assurance Framework for 3D Ultrasound Reconstruction with Robotic, Optical, and Electromagnetic Tracking

Lewis Howell, Manisha Waterston, Tze Min Wah, James H. Chandler, James R. McLaughlan

Comments This work has been submitted to the IEEE for possible publication

2603.20076 2026-03-23 cs.RO

Uncertainty Matters: Structured Probabilistic Online Mapping for Motion Prediction in Autonomous Driving

Pritom Gogoi, Faris Janjoš, Bin Yang, Andreas Look

2603.20074 2026-03-23 cs.CV

MFil-Mamba: Multi-Filter Scanning for Spatial Redundancy-Aware Visual State Space Models

Puskal Khadka, KC Santosh

2603.20063 2026-03-23 cs.LG cs.AI

Fine-tuning Timeseries Predictors Using Reinforcement Learning

Hugo Cazaux, Ralph Rudd, Hlynur Stefánsson, Sverrir Ólafsson, Eyjólfur Ingi Ásgeirsson

2603.20059 2026-03-23 cs.AI

DIAL-KG: Schema-Free Incremental Knowledge Graph Construction via Dynamic Schema Induction and Evolution-Intent Assessment

Weidong Bao, Yilin Wang, Ruyu Gao, Fangling Leng, Yubin Bao, Ge Yu

Comments Accepted to DASFAA 2026. 16 pages, 4 figures

2603.20046 2026-03-23 cs.AI

Experience is the Best Teacher: Motivating Effective Exploration in Reinforcement Learning for LLMs

Wenjian Zhang, Kongcheng Zhang, Jiaxin Qi, Baisheng Lai, Jianqiang Huang

2603.20042 2026-03-23 cs.CL cs.AI

LoASR-Bench: Evaluating Large Speech Language Models on Low-Resource Automatic Speech Recognition Across Language Families

Jianan Chen, Xiaoxue Gao, Tatsuya Kawahara, Nancy F. Chen

2603.20037 2026-03-23 cs.LG cs.NI

Federated Hyperdimensional Computing for Resource-Constrained Industrial IoT

Nikita Zeulin, Olga Galinina, Nageen Himayat, Sergey Andreev

Comments Submitted to the IEEE for possible publication

2603.20036 2026-03-23 cs.LG

Continual Learning as Shared-Manifold Continuation Under Compatible Shift

Henry J. Kobs

Comments 11 pages, 4 figures, repo: https://github.com/kkryon/spma

2603.20021 2026-03-23 cs.LG

ODySSeI: An Open-Source End-to-End Framework for Automated Detection, Segmentation, and Severity Estimation of Lesions in Invasive Coronary Angiography Images

Anand Choudhary, Xiaowu Sun, Thabo Mahendiran, Ortal Senouf, Denise Auberson, Bernard De Bruyne, Stephane Fournier, Olivier Muller, Emmanuel Abbé, Pascal Frossard, Dorina Thanou

2603.20017 2026-03-23 cs.CL cs.DB cs.IR

RouterKGQA: Specialized--General Model Routing for Constraint-Aware Knowledge Graph Question Answering

Bo Yuan, Hexuan Deng, Xuebo Liu, Min Zhang

2603.20016 2026-03-23 cs.CV

CFCML: A Coarse-to-Fine Crossmodal Learning Framework For Disease Diagnosis Using Multimodal Images and Tabular Data

Tianling Liu, Hongying Liu, Fanhua Shang, Lequan Yu, Tong Han, Liang Wan

2603.20014 2026-03-23 cs.LG

AgenticRS-EnsNAS: Ensemble-Decoupled Self-Evolving Architecture Search

Yun Chen, Moyu Zhang, Jinxin Hu, Yu Zhang, Xiaoyi Zeng

2603.20009 2026-03-23 cs.LG cs.DB cs.IR

A Super Fast K-means for Indexing Vector Embeddings

Leonardo Kuffo, Sven Hepkema, Peter Boncz

2603.20005 2026-03-23 cs.CV

NEC-Diff: Noise-Robust Event-RAW Complementary Diffusion for Seeing Motion in Extreme Darkness

Haoyue Liu, Jinghan Xu, Luxin Feng, Hanyu Zhou, Haozhi Zhao, Yi Chang, Luxin Yan

Comments Accepted by CVPR 2026

2603.20003 2026-03-23 cs.CL

An Agentic Approach to Generating XAI-Narratives

Yifan He, David Martens

2603.19997 2026-03-23 cs.CL

When Contextual Inference Fails: Cancelability in Interactive Instruction Following

Natalia Bila, Kata Naszádi, Alexandra Mayn, Christof Monz

2603.19994 2026-03-23 cs.CV cs.LG eess.IV eess.SP

Evaluating Test-Time Adaptation For Facial Expression Recognition Under Natural Cross-Dataset Distribution Shifts

John Turnbull, Shivam Grover, Amin Jalali, Ali Etemad

Comments Accepted at ICASSP 2026

2603.19993 2026-03-23 cs.CV

MedSPOT: A Workflow-Aware Sequential Grounding Benchmark for Clinical GUI

Rozain Shakeel, Abdul Rahman Mohammad Ali, Muneeb Mushtaq, Tausifa Jan Saleem, Tajamul Ashraf

Comments Project page: https://rozainmalik.github.io/MedSPOT_web/

2603.19987 2026-03-23 cs.LG cs.AI cs.CL

Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov States

Yurun Yuan, Tengyang Xie

2603.19970 2026-03-23 cs.LG cs.AI

Graph2TS: Structure-Controlled Time Series Generation via Quantile-Graph VAEs

Shaoshuai Du, Joze M. Rozanec, Andy Pimentel, Ana-Lucia Varbanescu

2603.19958 2026-03-23 cs.RO

Radar-Inertial Odometry with Online Spatio-Temporal Calibration via Continuous-Time IMU Modeling

Vlaho-Josip Štironja, Luka Petrović, Juraj Peršić, Ivan Marković, Ivan Petrović

2603.19957 2026-03-23 cs.CV cs.AI cs.LG

HiPath: Hierarchical Vision-Language Alignment for Structured Pathology Report Prediction

Ruicheng Yuan, Zhenxuan Zhang, Anbang Wang, Liwei Hu, Xiangqian Hua, Yaya Peng, Jiawei Luo, Guang Yang

Comments 10 pages, 1 figures, 3 tables

2603.19954 2026-03-23 cs.AI cs.CL cs.LG

On the Ability of Transformers to Verify Plans

Yash Sarrof, Yupei Du, Katharina Stein, Alexander Koller, Sylvie Thiébaux, Michael Hahn

2603.19940 2026-03-23 cs.CL

Hybrid topic modelling for computational close reading: Mapping narrative themes in Pushkin's Evgenij Onegin

Angelo Maria Sabatini

Comments 25 pages, 4 figures, 2 supplementary materials; submitted to Digital Scholarship in the Humanities (under review)

2603.19939 2026-03-23 cs.CV

Timestep-Aware Block Masking for Efficient Diffusion Model Inference

Haodong He, Yuan Gao, Weizhong Zhang, Gui-Song Xia

Comments 10 pages

2603.19936 2026-03-23 cs.CV cs.RO

LIORNet: Self-Supervised LiDAR Snow Removal Framework for Autonomous Driving under Adverse Weather Conditions

Ji-il Park, Inwook Shim

Comments 14 pages, 6 figures, 2 tables

详情

英文摘要

LiDAR sensors provide high-resolution 3D perception and long-range detection, making them indispensable for autonomous driving and robotics. However, their performance significantly degrades under adverse weather conditions such as snow, rain, and fog, where spurious noise points dominate the point cloud and lead to false perception. To address this problem, various approaches have been proposed: distance-based filters exploiting spatial sparsity, intensity-based filters leveraging reflectance distributions, and learning-based methods that adapt to complex environments. Nevertheless, distance-based methods struggle to distinguish valid object points from noise, intensity-based methods often rely on fixed thresholds that lack adaptability to changing conditions, and learning-based methods suffer from the high cost of annotation, limited generalization, and computational overhead. In this study, we propose LIORNet, which eliminates these drawbacks and integrates the strengths of all three paradigms. LIORNet is built upon a U-Net++ backbone and employs a self-supervised learning strategy guided by pseudo-labels generated from multiple physical and statistical cues, including range-dependent intensity thresholds, snow reflectivity, point sparsity, and sensing range constraints. This design enables LIORNet to distinguish noise points from environmental structures without requiring manual annotations, thereby overcoming the difficulty of snow labeling and the limitations of single-principle approaches. Extensive experiments on the WADS and CADC datasets demonstrate that LIORNet outperforms state-of-the-art filtering algorithms in both accuracy and runtime while preserving critical environmental features. These results highlight LIORNet as a practical and robust solution for LiDAR perception in extreme weather, with strong potential for real-time deployment in autonomous driving systems.

URL PDF HTML ☆

赞 0 踩 0