arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.13969 2026-03-17 cs.CV eess.IV

Leveraging a Statistical Shape Model for Efficient Generation of Annotated Training Data: A Case Study on Liver Landmarks Segmentation

Denis Krnjaca, Lorena Krames, Werner Nahm

详情

英文摘要

Anatomical landmark segmentation serves as a critical initial step for robust multimodal registration during computer-assisted interventions. Current approaches predominantly rely on deep learning, which often necessitates the extensive manual generation of annotated datasets. In this paper, we present a novel strategy for creating large annotated datasets using a statistical shape model (SSM) based on a mean shape that is manually labeled only once. We demonstrate the method's efficacy through its application to deep-learning-based anatomical landmark segmentation, specifically targeting the detection of the anterior ridge and the falciform ligament in 3D liver shapes. A specialized deep learning network was trained with 8,800 annotated liver shapes generated by the SSM. The network's performance was evaluated on 500 unseen synthetic SSM shapes, yielding a mean Intersection over Union of 91.4% (87.4% for the anterior ridge and 87.6% for the falciform ligament). Subsequently, the network was applied to clinical patient liver shapes, with qualitative evaluation indicating promising results and highlighting the generalizability of the proposed approach. Our findings suggest that the SSM-based data generation approach alleviates the labor-intensive process of manual labeling while enabling the creation of large annotated training datasets for machine learning. Although our study focuses on liver anatomy, the proposed methodology holds potential for a broad range of applications where annotated training datasets play a pivotal role in developing accurate deep-learning models.

URL PDF HTML ☆

赞 0 踩 0

2603.13964 2026-03-17 cs.CV

VID-AD: A Dataset for Image-Level Logical Anomaly Detection under Vision-Induced Distraction

Hiroto Nakata, Yawen Zou, Shunsuke Sakai, Shun Maeda, Chunzhi Gu, Yijin Wei, Shangce Gao, Chao Zhang

2603.13960 2026-03-17 cs.CV

IMS3: Breaking Distributional Aggregation in Diffusion-Based Dataset Distillation

Chenru Wang, Yunyi Chen, Zijun Yang, Joey Tianyi Zhou, Chi Zhang

Comments CVPR26 Accepted

2603.13956 2026-03-17 cs.AI

EviAgent: Evidence-Driven Agent for Radiology Report Generation

Tuoshi Qi, Shenshen Bu, Yingfei Xiang, Zhiming Dai

2603.13951 2026-03-17 cs.CV

DCP-CLIP:A Coarse-to-Fine Framework for Open-Vocabulary Semantic Segmentation with Dual Interaction

Jing Wang, Huimin Shi, Quan Zhou, Qibo Liu, Suofei Zhang, Huimin Lu

Comments 13 pages, 7 figures

2603.13950 2026-03-17 cs.CL

ToolFlood: Beyond Selection -- Hiding Valid Tools from LLM Agents via Semantic Covering

Hussein Jawad, Nicolas J-B Brunel

2603.13944 2026-03-17 cs.RO

ToMPC: Task-oriented Model Predictive Control via ADMM for Safe Robotic Manipulation

Xinyu Jia, Wenxin Wang, Jun Yang, Yongping Pan, Haoyong Yu

Comments 8 pages, 10 figures, accepted by IEEE Robotics and Automation Letters (RAL)

2603.13943 2026-03-17 cs.CV cs.LG

Sat-JEPA-Diff: Bridging Self-Supervised Learning and Generative Diffusion for Remote Sensing

Kursat Komurcu, Linas Petkevicius

Comments ICLR 2026 Workshop ML4RS Main Track: https://openreview.net/forum?id=WBHfQLbgZR

2603.13940 2026-03-17 cs.AI

GroupGuard: A Framework for Modeling and Defending Collusive Attacks in Multi-Agent Systems

Yiling Tao, Xinran Zheng, Shuo Yang, Meiling Tao, Xingjun Wang

2603.13931 2026-03-17 cs.LG

True 4-Bit Quantized Convolutional Neural Network Training on CPU: Achieving Full-Precision Parity

Shivnath Tathe

Comments 6 pages, 4 figures, 9 tables. Code available at https://github.com/shivnathtathe/vgg4bit-and-simpleresnet4bit

2603.13928 2026-03-17 cs.CV cs.AI

Discriminative Flow Matching Via Local Generative Predictors

Om Govind Jha, Manoj Bamniya, Ayon Borthakur

2603.13927 2026-03-17 cs.LG

Close to Reality: Interpretable and Feasible Data Augmentation for Imbalanced Learning

Matheus Camilo da Silva, Gabriel Gustavo Costanzo, Andrea de Lorenzo, Sylvio Barbon Junior

2603.13925 2026-03-17 cs.RO cs.AI

SmoothVLA: Aligning Vision-Language-Action Models with Physical Constraints via Intrinsic Smoothness Optimization

Jiashun Li, Xiaoyu Shi, Hong Xie, Mingsheng Shang, Yun Lu

2603.13919 2026-03-17 cs.CV

OpenCOOD-Air: Prompting Heterogeneous Ground-Air Collaborative Perception with Spatial Conversion and Offset Prediction

Xianke Wu, Songlin Bai, Chengxiang Li, Zhiyao Luo, Yulin Tian, Fenghua Zhu, Yisheng Lv, Yonglin Tian

2603.13917 2026-03-17 cs.CV

Evaluation of Visual Place Recognition Methods for Image Pair Retrieval in 3D Vision and Robotics

Dennis Haitz, Athradi Shritish Shetty, Michael Weinmann, Markus Ulrich

Comments Accepted at the XXV ISPRS Congress 2026; to appear in the ISPRS Annals

2603.13173 2026-03-17 cs.AI cs.CL

Semantic Invariance in Agentic AI

I. de Zarzà, J. de Curtò, Jordi Cabot, Pietro Manzoni, Carlos T. Calafate

Comments Accepted for publication in 20th International Conference on Agents and Multi-Agent Systems: Technologies and Applications (AMSTA 2026), to appear in Springer Nature proceedings (KES Smart Innovation Systems and Technologies). The final authenticated version will be available online at Springer

2603.13099 2026-03-17 cs.AI cs.CV cs.IR cs.MM

Beyond Final Answers: CRYSTAL Benchmark for Transparent Multimodal Reasoning Evaluation

Wayner Barrios, SouYoung Jin

2603.12932 2026-03-17 cs.CL

DS$^2$-Instruct: Domain-Specific Data Synthesis for Large Language Models Instruction Tuning

Ruiyao Xu, Noelle I. Samia, Han Liu

Comments EACL 2026 Findings

2603.12840 2026-03-17 cs.SD cs.AI

DAST: A Dual-Stream Voice Anonymization Attacker with Staged Training

Ridwan Arefeen, Xiaoxiao Miao, Rong Tong, Aik Beng Ng, Simon See, Timothy Liu

2603.12244 2026-03-17 cs.LG cs.AI

Separable neural architectures as a primitive for unified predictive and generative intelligence

Reza T. Batley, Apurba Sarker, Rajib Mostakim, Andrew Klichine, Sourav Saha

2603.12185 2026-03-17 cs.RO

ComFree-Sim: A GPU-Parallelized Analytical Contact Physics Engine for Scalable Contact-Rich Robotics Simulation and Control

Chetan Borse, Zhixian Xie, Wei-Cheng Huang, Wanxin Jin

Comments 9 pages

2603.12165 2026-03-17 cs.CL

QAQ: Bidirectional Semantic Coherence for Selecting High-Quality Synthetic Code Instructions

Jiayin Lei, Ming Ma, Yunxi Duan, Chenxi Li, Tianming Yang

Comments 14 pages, 5 figures. Under review at ACL 2026

2603.12071 2026-03-17 cs.CV cs.AI

LoV3D: Grounding Cognitive Prognosis Reasoning in Longitudinal 3D Brain MRI via Regional Volume Assessments

Zhaoyang Jiang, Zhizhong Fu, David McAllister, Yunsoo Kim, Honghan Wu

2603.12064 2026-03-17 cs.CV

Dense Dynamic Scene Reconstruction and Camera Pose Estimation from Multi-View Videos

Shuo Sun, Unal Artan, Malcolm Mielle, Achim J. Lilienthaland, Martin Magnusson

Comments fix typos

2603.11935 2026-03-17 cs.LG cs.AI

MobileKernelBench: Can LLMs Write Efficient Kernels for Mobile Devices?

Xingze Zou, Jing Wang, Yuhua Zheng, Xueyi Chen, Haolei Bai, Lingcheng Kong, Syed A. R. Abu-Bakar, Zhaode Wang, Chengfei Lv, Haoji Hu, Huan Wang

Comments Paper webpage: https://zeezou-isee.github.io/Mobilekernelbench/

2603.11618 2026-03-17 cs.CV cs.LG

Shape-of-You: Fused Gromov-Wasserstein Optimal Transport for Semantic Correspondence in-the-Wild

Jiin Im, Sisung Liu, Je Hyeong Hong

Comments Accepted at CVPR 2026. Supplementary material included after references. 18 pages, 11 figures, 10 tables

2603.11501 2026-03-17 cs.LG cs.AI cs.CR

KEPo: Knowledge Evolution Poison on Graph-based Retrieval-Augmented Generation

Qizhi Chen, Chao Qi, Yihong Huang, Muquan Li, Rongzheng Wang, Dongyang Zhang, Ke Qin, Shuang Liang

Comments Accepted in the ACM Web Conference 2026 (WWW 2026)

2603.11473 2026-03-17 cs.LG cs.SY eess.SY math.OC

Slack More, Predict Better: Proximal Relaxation for Probabilistic Latent Variable Model-based Soft Sensors

Zehua Zou, Yiran Ma, Yulong Zhang, Zhengnan Li, Zeyu Yang, Jinhao Xie, Xiaoyu Jiang, Zhichao Chen

Comments This paper has been provisionally accepted for publication in the "IEEE Transactions on Industrial Informatics"

2603.11445 2026-03-17 cs.AI cs.MA

Verified Multi-Agent Orchestration: A Plan-Execute-Verify-Replan Framework for Complex Query Resolution

Xing Zhang, Yanwei Cui, Guanghui Wang, Wei Qiu, Ziyuan Li, Fangwei Han, Yajing Huang, Hengzhi Qiu, Bing Zhu, Peiyang He

Comments ICLR 2026 Workshop on MALGAI

2603.10789 2026-03-17 cs.CL

LuxBorrow: From Pompier to Pompjee, Tracing Borrowing in Luxembourgish

Nina Hosseini-Kivanani, Fred Philippy

Comments Paper got accepted to LREC2026, 4 Figures and 2 Tables