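arXivDaily daily academic digest: synced with the full arXiv data set, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.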

2603.09448 2026-03-11 cs.CV cs.AI

A Guideline-Aware AI Agent for Zero-Shot Target Volume Auto-Delineation

Yoon Jo Kim, Wonyoung Cho, Jongmin Lee, Han Joo Chae, Hyunki Park, Sang Hoon Seo, Noh Jae Myung, Kyungmi Yang, Dongryul Oh, Jin Sung Kim

Comments Submitted to MICCAI 2026

Abstract

Delineating the clinical target volume (CTV) in radiotherapy involves complex margins constrained by tumor location and anatomical barriers. While deep learning models automate this process, their rigid reliance on expert-annotated data requires costly retraining whenever clinical guidelines update. To overcome this limitation, we introduce OncoAgent, a novel guideline-aware AI agent framework that seamlessly converts textual clinical guidelines into three-dimensional target contours in a training-free manner. Evaluated on esophageal cancer cases, the agent achieves a zero-shot Dice similarity coefficient of 0.842 for the CTV and 0.880 for the planning target volume, demonstrating performance highly comparable to a fully supervised nnU-Net baseline. Notably, in a blinded clinical evaluation, physicians strongly preferred OncoAgent over the supervised baseline, rating it higher in guideline compliance, modification effort, and clinical acceptability. Furthermore, the framework generalizes zero-shot to alternative esophageal guidelines and other anatomical sites (e.g., prostate) without any retraining. Beyond mere volumetric overlap, our agent-based paradigm offers near-instantaneous adaptability to alternative guidelines, providing a scalable and transparent pathway toward interpretability in radiotherapy treatment planning.
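For readers unfamiliar with the overlap metric quoted in the abstract, the following is a minimal sketch of the standard Dice similarity coefficient, 2|A ∩ B| / (|A| + |B|), computed on two binary masks. It is not the authors' evaluation code: the function name `dice_coefficient`, the representation of masks as sets of voxel coordinates, and the toy masks are all assumptions made for illustration.

```python
def dice_coefficient(pred, truth):
    """Dice similarity coefficient between two binary masks.

    Each mask is given as an iterable of voxel coordinates that are "on".
    This representation is an assumption chosen to keep the sketch dependency-free.
    """
    pred, truth = set(pred), set(truth)
    if not pred and not truth:
        return 1.0  # convention assumed here: two empty masks agree perfectly
    # Twice the overlap divided by the total number of "on" voxels in both masks.
    return 2 * len(pred & truth) / (len(pred) + len(truth))


# Toy 3D masks as sets of (z, y, x) voxel coordinates; illustrative values only.
predicted = {(0, 0, 0), (0, 0, 1), (0, 1, 0), (0, 1, 1)}
reference = {(0, 0, 1), (0, 1, 0), (0, 1, 1), (1, 1, 1)}

print(dice_coefficient(predicted, reference))  # 3 shared voxels out of 4 + 4 -> 0.75
```

A value of 1.0 means the two masks coincide exactly and 0.0 means they share no voxels, so the reported 0.842 and 0.880 indicate substantial but not complete overlap with the reference contours.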

2603.09446 2026-03-11 cs.CV

GIIM: Graph-based Learning of Inter- and Intra-view Dependencies for Multi-view Medical Image Diagnosis

Tran Bao Sam, Hung Vu, Dao Trung Kien, Tran Dat Dang, Van Ha Tang, Steven Truong

Comments To appear in the 40th AAAI Conference on Artificial Intelligence (AAAI-26). 10 pages, 2 figures

2603.09436 2026-03-11 cs.LG

From Weighting to Modeling: A Nonparametric Estimator for Off-Policy Evaluation

Rong J. B. Zhu

2603.09435 2026-03-11 cs.AI

AI Act Evaluation Benchmark: An Open, Transparent, and Reproducible Evaluation Dataset for NLP and RAG Systems

Athanasios Davvetas, Michael Papademas, Xenia Ziouvelou, Vangelis Karkaletsis

Comments 10 pages, 1 figure, 4 tables, 2 equations

2603.09434 2026-03-11 cs.CL cs.AI

Common Sense vs. Morality: The Curious Case of Narrative Focus Bias in LLMs

Saugata Purkayastha, Pranav Kushare, Pragya Paramita Pal, Sukannya Purkayastha

Comments Accepted at LREC 2026

2603.09419 2026-03-11 cs.CV

MetaDAT: Generalizable Trajectory Prediction via Meta Pre-training and Data-Adaptive Test-Time Updating

Yuning Wang, Pu Zhang, Yuan He, Ke Wang, Jianru Xue

Comments ICRA 2026

2603.09416 2026-03-11 cs.CL cs.AI

Investigating Gender Stereotypes in Large Language Models via Social Determinants of Health

Trung Hieu Ngo, Adrien Bazoge, Solen Quiniou, Pierre-Antoine Gourraud, Emmanuel Morin

Comments Accepted as Findings at EACL 2026

2603.09415 2026-03-11 cs.RO cs.AI

From Flow to One Step: Real-Time Multi-Modal Trajectory Policies via Implicit Maximum Likelihood Estimation-based Distribution Distillation

Ju Dong, Liding Zhang, Lei Zhang, Yu Fu, Kaixin Bai, Zoltan-Csaba Marton, Zhenshan Bing, Zhaopeng Chen, Alois Christian Knoll, Jianwei Zhang

Comments https://sites.google.com/view/flow2one, 8 pages

2603.09414 2026-03-11 cs.CV cs.AI

PromptDLA: A Domain-aware Prompt Document Layout Analysis Framework with Descriptive Knowledge as a Cue

Zirui Zhang, Yaping Zhang, Lu Xiang, Yang Zhao, Feifei Zhai, Yu Zhou, Chengqing Zong

Comments Accepted by IEEE TMM

2603.09411 2026-03-11 cs.CV

RiO-DETR: DETR for Real-time Oriented Object Detection

Zhangchi Hu, Yifan Zhao, Yansong Peng, Wenzhang Sun, Xiangchen Yin, Jie Chen, Peixi Wu, Hebei Li, Xinghao Wang, Dongsheng Jiang, Xiaoyan Sun

Comments 30 pages, 9 figures

2603.09408 2026-03-11 cs.CV cs.AI cs.LG

Reviving ConvNeXt for Efficient Convolutional Diffusion Models

Taesung Kwon, Lorenzo Bianchi, Lennart Wittke, Felix Watine, Fabio Carrara, Jong Chul Ye, Romann Weber, Vinicius Azevedo

Comments CVPR 2026. Official implementation: https://github.com/star-kwon/FCDM

2603.09400 2026-03-11 cs.CL

Reward Prediction with Factorized World States

Yijun Shen, Delong Chen, Xianming Hu, Jiaming Mi, Hongbo Zhao, Kai Zhang, Pascale Fung

2603.09399 2026-03-11 cs.RO

Vision-Augmented On-Track System Identification for Autonomous Racing via Attention-Based Priors and Iterative Neural Correction

Zhiping Wu, Cheng Hu, Yiqin Wang, Lei Xie, Hongye Su

2603.09392 2026-03-11 cs.CV cs.AI

ICDAR 2025 Competition on End-to-End Document Image Machine Translation Towards Complex Layouts

Yaping Zhang, Yupu Liang, Zhiyang Zhang, Zhiyuan Chen, Lu Xiang, Yang Zhao, Yu Zhou, Chengqing Zong

Comments accepted by ICDAR 2025

2603.09385 2026-03-11 cs.CV

EventVGGT: Exploring Cross-Modal Distillation for Consistent Event-based Depth Estimation

Yinrui Ren, Jinjing Zhu, Kanghao Chen, Zhuoxiao Li, Jing Ou, Zidong Cao, Tongyan Hua, Peilun Shi, Yingchun Fu, Wufan Zhao, Hui Xiong

2603.09374 2026-03-11 cs.CV cs.AI

MIL-PF: Multiple Instance Learning on Precomputed Features for Mammography Classification

Nikola Jovišić, Milica Škipina, Nicola Dall'Asen, Dubravko Ćulibrk

Comments 10 pages, 2 figures, 4 tables. Code will be released

2603.09373 2026-03-11 cs.CL

Quantifying and extending the coverage of spatial categorization data sets

Wanchun Li, Alexandra Carstensen, Yang Xu, Terry Regier, Charles Kemp

2603.09370 2026-03-11 cs.LG

From Representation to Clusters: A Contrastive Learning Approach for Attributed Hypergraph Clustering

Li Ni, Shuaikang Zeng, Lin Mu, Longlong Lin

Comments Accepted at The Web Conference 2026. 12 pages, 5 figures

2603.09367 2026-03-11 cs.CV cs.AI

M3GCLR: Multi-View Mini-Max Infinite Skeleton-Data Game Contrastive Learning For Skeleton-Based Action Recognition

Yanshan Li, Ke Ma, Miaomiao Wei, Linhui Dai

Abstract

In recent years, contrastive learning has drawn significant attention as an effective approach to reducing reliance on labeled data. However, existing methods for self-supervised skeleton-based action recognition still face three major limitations: insufficient modeling of view discrepancies, lack of effective adversarial mechanisms, and uncontrollable augmentation perturbations. To tackle these issues, we propose the Multi-view Mini-Max infinite skeleton-data Game Contrastive Learning for skeleton-based action Recognition (M3GCLR), a game-theoretic contrastive framework. First, we establish the Infinite Skeleton-data Game (ISG) model and the ISG equilibrium theorem, and further provide a rigorous proof, enabling mini-max optimization based on multi-view mutual information. Then, we generate normal-extreme data pairs through multi-view rotation augmentation and adopt temporally averaged input as a neutral anchor to achieve structural alignment, thereby explicitly characterizing perturbation strength. Next, leveraging the proposed equilibrium theorem, we construct a strongly adversarial mini-max skeleton-data game to encourage the model to mine richer action-discriminative information. Finally, we introduce the dual-loss equilibrium optimizer to optimize the game equilibrium, allowing the learning process to maximize action-relevant information while minimizing encoding redundancy, and we prove the equivalence between the proposed optimizer and the ISG model. Extensive experiments show that M3GCLR achieves three-stream accuracies of 82.1% and 85.8% on NTU RGB+D 60 (X-Sub, X-View) and 72.3% and 75.0% on NTU RGB+D 120 (X-Sub, X-Set). On PKU-MMD Part I and Part II, it attains three-stream accuracies of 89.1% and 45.2%, respectively, with all results matching or outperforming state-of-the-art performance. Ablation studies confirm the effectiveness of each component.

2603.09359 2026-03-11 cs.CV

Evidential Perfusion Physics-Informed Neural Networks with Residual Uncertainty Quantification

Junhyeok Lee, Minseo Choi, Han Jang, Young Hun Jeon, Heeseong Eum, Joon Jang, Chul-Ho Sohn, Kyu Sung Choi

2603.09356 2026-03-11 cs.LG cs.AI cs.CR

Democratising Clinical AI through Dataset Condensation for Classical Clinical Models

Anshul Thakur, Soheila Molaei, Pafue Christy Nganjimi, Joshua Fieggen, Andrew A. S. Soltan, Danielle Belgrave, Lei Clifton, David A. Clifton

Comments 22 pages, 5 figures, 5 tables

2603.09353 2026-03-11 cs.LG

Interactive 3D visualization of surface roughness predictions in additive manufacturing: A data-driven framework

Engin Deniz Erkan, Elif Surer, Ulas Yaman

2603.09349 2026-03-11 cs.LG cs.AI

TA-GGAD: Testing-time Adaptive Graph Model for Generalist Graph Anomaly Detection

Xiong Zhang, Hong Peng, Changlong Fu, Xin Jin, Yun Yang, Cheng Xie

2603.09341 2026-03-11 cs.CL cs.AI

TaSR-RAG: Taxonomy-guided Structured Reasoning for Retrieval-Augmented Generation

Jiashuo Sun, Yixuan Xie, Jimeng Shi, Shaowen Wang, Jiawei Han

Comments 14 pages, 7 tables, 5 figures

2603.09338 2026-03-11 cs.CV

Predictive Spectral Calibration for Source-Free Test-Time Regression

Nguyen Viet Tuan Kiet, Huynh Thanh Trung, Pham Huy Hieu

2603.09337 2026-03-11 cs.CV cs.AI

Beyond Scaling: Assessing Strategic Reasoning and Rapid Decision-Making Capability of LLMs in Zero-sum Environments

Yang Li, Xing Chen, Yutao Liu, Gege Qi, Yanxian BI, Zizhe Wang, Yunjian Zhang, Yao Zhu

Comments Code available

2603.09332 2026-03-11 cs.SD cs.AI

TimberAgent: Gram-Guided Retrieval for Executable Music Effect Control

Shihao He, Yihan Xia, Fang Liu, Taotao Wang, Shengli Zhang

2603.09331 2026-03-11 cs.LG

Reward-Zero: Language Embedding Driven Implicit Reward Mechanisms for Reinforcement Learning

Heng Zhang, Haddy Alchaer, Arash Ajoudani, Yu She

Comments under review

2603.09320 2026-03-11 cs.CV cs.AI

SpaceSense-Bench: A Large-Scale Multi-Modal Benchmark for Spacecraft Perception and Pose Estimation

Aodi Wu, Jianhong Zuo, Zeyuan Zhao, Xubo Luo, Ruisuo Wang, Xue Wan

Comments 8 pages, 5 figures

2603.09319 2026-03-11 cs.RO cs.CV

NLiPsCalib: An Efficient Calibration Framework for High-Fidelity 3D Reconstruction of Curved Visuotactile Sensors

Xuhao Qin, Feiyu Zhao, Yatao Leng, Runze Hu, Chenxi Xiao

Comments 8 pages, 8 figures, accepted to 2026 IEEE International Conference on Robotics & Automation (ICRA 2026)