arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

Yipei Wang, Yinsong Xu, Weixi Yi, Shaheer Ullah Saeed, Natasha Thorley, Alexander Ng, Yukun Zhou, Wen Yan, Dean Barratt, Shonit Punwani, Veeru Kasivisvanathan, Mark Emberton, Daniel C. Alexander, Yipeng Hu

2603.03960 2026-03-05 cs.RO cs.CV

Structural Action Transformer for 3D Dexterous Manipulation

Xiaohan Lei, Min Wang, Bohong Weng, Wengang Zhou, Houqiang Li

Comments Accepted by CVPR

2603.03957 2026-03-05 cs.RO

ArthroCut: Autonomous Policy Learning for Robotic Bone Resection in Knee Arthroplasty

Xu Lu, Yiling Zhang, Wenquan Cheng, Longfei Ma, Fang Chen, Hongen Liao

Comments Accepted for publication at the 2026 IEEE International Conference on Robotics and Automation (ICRA)

2603.03956 2026-03-05 cs.CV cs.AI

Towards Generalized Multimodal Homography Estimation

Jinkun You, Jiaxin Cheng, Jie Zhang, Yicong Zhou

2603.03953 2026-03-05 cs.RO cs.AI cs.CV

RVN-Bench: A Benchmark for Reactive Visual Navigation

Jaewon Lee, Jaeseok Heo, Gunmin Lee, Howoong Jun, Jeongwoo Oh, Songhwai Oh

2603.03946 2026-03-05 cs.LG

Lang2Str: Two-Stage Crystal Structure Generation with LLMs and Continuous Flow Models

Cong Liu, Chengyue Gong, Zhenyu Liu, Jiale Zhao, Yuxuan Zhang

2603.03942 2026-03-05 cs.RO

Lightweight Visual Reasoning for Socially-Aware Robots

Alessio Galatolo, Ronald Cumbal, Alexandros Rouchitsas, Katie Winkle, Didem Gürdür Broo, Ginevra Castellano

Comments ICRA26

2603.03941 2026-03-05 cs.CV

Slice-wise quality assessment of high b-value breast DWI via deep learning-based artifact detection

Ameya Markale, Luise Brock, Ihor Horishnyi, Dominika Skwierawska, Tri-Thien Nguyen, Hannes Schreiter, Shirin Heidarikahkesh, Lorenz A. Kapsner, Michael Uder, Sabine Ohlmeyer, Frederik B Laun, Andrzej Liebert, Sebastian Bickelhaupt

详情

英文摘要

Diffusion-weighted imaging (DWI) can support lesion detection and characterization in breast magnetic resonance imaging (MRI), however especially high b-value diffusion-weighted acquisitions can be prone to intensity artifacts that can affect diagnostic image assessment. This study aims to detect both hyper- and hypointense artifacts on high b-value diffusion-weighted images (b=1500 s/mm2) using deep learning, employing either a binary classification (artifact presence) or a multiclass classification (artifact intensity) approach on a slice-wise dataset.This IRB-approved retrospective study used the single-center dataset comprising n=11806 slices from routine 3T breast MRI examinations performed between 2022 and mid-2023. Three convolutional neural network (CNN) architectures (DenseNet121, ResNet18, and SEResNet50) were trained for binary classification of hyper- and hypointense artifacts. The best performing model (DenseNet121) was applied to an independent holdout test set and was further trained separately for multiclass classification. Evaluation included area under receiver operating characteristic curve (AUROC), area under precision recall curve (AUPRC), precision, and recall, as well as analysis of predicted bounding box positions, derived from the network Grad-CAM heatmaps. DenseNet121 achieved AUROCs of 0.92 and 0.94 for hyper- and hypointense artifact detection, respectively, and weighted AUROCs of 0.85 and 0.88 for multiclass classification on single-slice high b-value diffusion-weighted images. A radiologist evaluated bounding box precision on a 1-5 Likert-like scale across 200 slices, achieving mean scores of 3.33+-1.04 for hyperintense artifacts and 2.62+-0.81 for hypointense artifacts. Hyper- and hypointense artifact detection in slice-wise breast DWI MRI dataset (b=1500 s/mm2) using CNNs particularly DenseNet121, seems promising and requires further validation.

URL PDF HTML ☆

赞 0 踩 0

2603.03939 2026-03-05 cs.CV cs.AI

Cross-Modal Mapping and Dual-Branch Reconstruction for 2D-3D Multimodal Industrial Anomaly Detection

Radia Daci, Vito Renò, Cosimo Patruno, Angelo Cardellicchio, Abdelmalik Taleb-Ahmed, Marco Leo, Cosimo Distante

2603.03935 2026-03-05 cs.CV cs.RO

DISC: Dense Integrated Semantic Context for Large-Scale Open-Set Semantic Mapping

Felix Igelbrink, Lennart Niecksch, Martin Atzmueller, Joachim Hertzberg

2603.03922 2026-03-05 cs.LG stat.ML

Hierarchical Inference and Closure Learning via Adaptive Surrogates for ODEs and PDEs

Pengyu Zhang, Arnaud Vadeboncoeur, Alex Glyn-Davies, Mark Girolami

2603.03915 2026-03-05 cs.CL cs.AI

Rethinking Role-Playing Evaluation: Anonymous Benchmarking and a Systematic Study of Personality Effects

Ji-Lun Peng, Yun-Nung Chen

2603.03911 2026-03-05 cs.AI cs.CL cs.CR

From Threat Intelligence to Firewall Rules: Semantic Relations in Hybrid AI Agent and Expert System Architectures

Chiara Bonfanti, Davide Colaiacomo, Luca Cagliero, Cataldo Basile

2603.03907 2026-03-05 cs.CV

Fine-grained Image Aesthetic Assessment: Learning Discriminative Scores from Relative Ranks

Zhichao Yang, Jianjie Wang, Zhixianhe Zhang, Pangu Xie, Xiangfei Sheng, Pengfei Chen, Leida Li

Comments The paper has been accepted by CVPR 2026

2603.03903 2026-03-05 cs.CV cs.LG

From Misclassifications to Outliers: Joint Reliability Assessment in Classification

Yang Li, Youyang Sha, Yinzhi Wang, Timothy Hospedales, Xi Shen, Shell Xu Hu, Xuanlong Yu

Comments 15 pages, 3 figures. The source code is publicly available at https://github.com/Intellindust-AI-Lab/SUREPlus

2603.03902 2026-03-05 cs.LG cs.AI

PatchDecomp: Interpretable Patch-Based Time Series Forecasting

Hiroki Tomioka, Genta Yoshimura

2603.03892 2026-03-05 cs.CV cs.AI

A novel network for classification of cuneiform tablet metadata

Frederik Hagelskjær

Comments Point cloud, deep learning, cuneiform

2603.03884 2026-03-05 cs.CL cs.AI

CzechTopic: A Benchmark for Zero-Shot Topic Localization in Historical Czech Documents

Martin Kostelník, Michal Hradiš, Martin Dočekal

2603.03882 2026-03-05 cs.CV

UniSync: Towards Generalizable and High-Fidelity Lip Synchronization for Challenging Scenarios

Ruidi Fan, Yang Zhou, Siyuan Wang, Tian Yu, Yutong Jiang, Xusheng Liu

Comments 9 pages, 5 figures

2603.03879 2026-03-05 cs.CV

Yolo-Key-6D: Single Stage Monocular 6D Pose Estimation with Keypoint Enhancements

Kemal Alperen Çetiner, Hazım Kemal Ekenel

Comments Accepted to VISAPP 2026

2603.03872 2026-03-05 cs.LG

Believe Your Model: Distribution-Guided Confidence Calibration

Xizhong Yang, Haotian Zhang, Huiming Wang, Mofei Song

Comments 38 pages

2603.03871 2026-03-05 cs.CV

Bridging Human Evaluation to Infrared and Visible Image Fusion

Jinyuan Liu, Xingyuan Li, Qingyun Mei, Haoyuan Xu, Zhiying Jiang, Long Ma, Risheng Liu, Xin Fan

2603.03867 2026-03-05 cs.LG

k-hop Fairness: Addressing Disparities in Graph Link Prediction Beyond First-Order Neighborhoods

Lilian Marey, Tiphaine Viard, Charlotte Laclau

2603.03865 2026-03-05 cs.LG cs.AI cs.CR

Structure-Aware Distributed Backdoor Attacks in Federated Learning

Wang Jian, Shen Hong, Ke Wei, Liu Xue Hua

Comments 17pages,12 figures

2603.03862 2026-03-05 cs.CL

Assessing the Effectiveness of LLMs in Delivering Cognitive Behavioral Therapy

Navdeep Singh Bedi, Ana-Maria Bucur, Noriko Kando, Fabio Crestani

Comments Accepted to LREC 2026

2603.03857 2026-03-05 cs.CV

DeepScan: A Training-Free Framework for Visually Grounded Reasoning in Large Vision-Language Models

Yangfu Li, Hongjian Zhan, Jiawei Chen, Yuning Gong, Qi Liu, Yue Lu

Comments 18 pages 17 figures

Journal ref Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026

2603.03856 2026-03-05 cs.CL

Coupling Local Context and Global Semantic Prototypes via a Hierarchical Architecture for Rhetorical Roles Labeling

Anas Belfathi, Nicolas Hernandez, Laura Monceaux, Warren Bonnard, Mary Catherine Lavissiere, Christine Jacquin, Richard Dufour

Comments Accepted at EACL 2026

2603.03846 2026-03-05 cs.CL

Benchmarking Motivational Interviewing Competence of Large Language Models

Aishwariya Jha, Prakrithi Shivaprakash, Lekhansh Shukla, Animesh Mukherjee, Prabhat Chand, Pratima Murthy

Comments 17 pages, 6 figures, 2 tables

详情

英文摘要

Motivational interviewing (MI) promotes behavioural change in substance use disorders. Its fidelity is measured using the Motivational Interviewing Treatment Integrity (MITI) framework. While large language models (LLMs) can potentially generate MI-consistent therapist responses, their competence using MITI is not well-researched, especially in real world clinical transcripts. We aim to benchmark MI competence of proprietary and open-source models compared to human therapists in real-world transcripts and assess distinguishability from human therapists. Methods: We shortlisted 3 proprietary and 7 open-source LLMs from LMArena, evaluated performance using MITI 4.2 framework on two datasets (96 handcrafted model transcripts, 34 real-world clinical transcripts). We generated parallel LLM-therapist utterances iteratively for each transcript while keeping client responses static, and ranked performance using a composite ranking system with MITI components and verbosity. We conducted a distinguishability experiment with two independent psychiatrists to identify human-vs-LLM responses. Results: All 10 tested LLMs had fair (MITI global scores >3.5) to good (MITI global scores >4) competence across MITI measures, and three best-performing models (gemma-3-27b-it, gemini-2.5-pro, grok-3) were tested on real-world transcripts. All showed good competence, with LLMs outperforming human-expert in Complex Reflection percentage (39% vs 96%) and Reflection-Question ratio (1.2 vs >2.8). In the distinguishability experiment, psychiatrists identified LLM responses with only 56% accuracy, with d-prime: 0.17 and 0.25 for gemini-2.5-pro and gemma-3-27b-it respectively. Conclusion: LLMs can achieve good MI proficiency in real-world clinical transcripts using MITI framework. These findings suggest that even open-source LLMs are viable candidates for expanding MI counselling sessions in low-resource settings.

URL PDF HTML ☆

赞 0 踩 0

2603.03844 2026-03-05 cs.CL

Semantic Bridging Domains: Pseudo-Source as Test-Time Connector

Xizhong Yang, Huiming Wang, Ning Xu, Mofei Song

Comments 25 pages

2603.03839 2026-03-05 cs.CV

All-in-One Image Restoration via Causal-Deconfounding Wavelet-Disentangled Prompt Network

Bingnan Wang, Bin Qin, Jiangmeng Li, Fanjiang Xu, Fuchun Sun, Hui Xiong

Comments Accepted by IEEE TIP 2026