arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.15967 2026-03-19 cs.CV

A Comprehensive Benchmark of Histopathology Foundation Models for Kidney Digital Pathology Images

Harishwar Reddy Kasireddy, Patricio S. La Rosa, Akshita Gupta, Anindya S. Paul, Jamie L. Fermin, William L. Clapp, Meryl A. Waldman, Tarek M. El-Ashkar, Sanjay Jain, Luis Rodrigues, Kuang Yu Jen, Avi Z. Rosenberg, Michael T. Eadon, Jeffrey B. Hodgin, Pinaki Sarder

Comments 31 Pages, 14 Tables, 12 figures, Co-correspondence to jhodgin@med.umich.edu and pinaki.sarder@ufl.edu

详情

英文摘要

Histopathology foundation models (HFMs), pretrained on large-scale cancer datasets, have advanced computational pathology. However, their applicability to non-cancerous chronic kidney disease remains underexplored, despite coexistence of renal pathology with malignancies such as renal cell and urothelial carcinoma. We systematically evaluate 11 publicly available HFMs across 11 kidney-specific downstream tasks spanning multiple stains (PAS, H&E, PASM, and IHC), spatial scales (tile and slide-level), task types (classification, regression, and copy detection), and clinical objectives, including detection, diagnosis, and prognosis. Tile-level performance is assessed using repeated stratified group cross-validation, while slide-level tasks are evaluated using repeated nested stratified cross-validation. Statistical significance is examined using Friedman test followed by pairwise Wilcoxon signed-rank testing with Holm-Bonferroni correction and compact letter display visualization. To promote reproducibility, we release an open-source Python package, kidney-hfm-eval, available at https://pypi.org/project/kidney-hfm-eval/ , that reproduces the evaluation pipelines. Results show moderate to strong performance on tasks driven by coarse meso-scale renal morphology, including diagnostic classification and detection of prominent structural alterations. In contrast, performance consistently declines for tasks requiring fine-grained microstructural discrimination, complex biological phenotypes, or slide-level prognostic inference, largely independent of stain type. Overall, current HFMs appear to encode predominantly static meso-scale representations and may have limited capacity to capture subtle renal pathology or prognosis-related signals. Our results highlight the need for kidney-specific, multi-stain, and multimodal foundation models to support clinically reliable decision-making in nephrology.

URL PDF HTML ☆

赞 0 踩 0

2603.15363 2026-03-19 cs.LG math.DS

Deep learning and the rate of approximation by flows

Jingpu Cheng, Qianxiao Li, Ting Lin, Zuowei Shen

2603.15359 2026-03-19 cs.RO

NavThinker: Action-Conditioned World Models for Coupled Prediction and Planning in Social Navigation

Tianshuai Hu, Zeying Gong, Lingdong Kong, XiaoDong Mei, Yiyi Ding, Qi Zeng, Ao Liang, Rong Li, Yangyi Zhong, Junwei Liang

2603.15352 2026-03-19 cs.SD cs.AI eess.AS

NV-Bench: Benchmark of Nonverbal Vocalization Synthesis for Expressive Text-to-Speech Generation

Qinke Ni, Huan Liao, Dekun Chen, Yuxiang Wang, Zhizheng Wu

Comments Submit to Interspeech 2026

2603.15026 2026-03-19 cs.CV cs.LG

Training-free Detection of Generated Videos via Spatial-Temporal Likelihoods

Omer Ben Hayun, Roy Betser, Meir Yossef Levi, Levi Kassel, Guy Gilboa

Comments Accepted to CVPR 2026

2603.14925 2026-03-19 cs.CV cs.GR

Workflow-Aware Structured Layer Decomposition for Illustration Production

Tianyu Zhang, Dongchi Li, Keiichi Sawada, Haoran Xie

Comments 17 pages, 15 figures

2603.14827 2026-03-19 cs.CV

SemanticFace: Semantic Facial Action Estimation via Semantic Distillation in Interpretable Space

Zejian Kang, Kai Zheng, Yuanchen Fei, Wentao Yang, Hongyuan Zou, Xiangru Huang

2603.13698 2026-03-19 cs.RO cs.AI

SAATT Nav: a Socially Aware Autonomous Transparent Transportation Navigation Framework for Wheelchairs

Yutong Zhang, Shaiv Y. Mehra, Bradley S. Duerstock, Juan P. Wachs

Comments 8 pages, 4 figures, 2 tables, 1 algorithm. Submitted to IROS 2026

2603.13402 2026-03-19 cs.CV cs.LG

Event-Driven Video Generation

Chika Maduabuchi

2603.11426 2026-03-19 cs.RO

Grounding Robot Generalization in Training Data via Retrieval-Augmented VLMs

Jensen Gao, Dorsa Sadigh, Sandy Huang, Dhruv Shah

Comments 12 pages

2603.10744 2026-03-19 cs.CV

Just-in-Time: Training-Free Spatial Acceleration for Diffusion Transformers

Wenhao Sun, Ji Li, Zhaoqiang Liu

Comments Accepted by CVPR2026. Project Page: https://wenhao-sun77.github.io/JiT/

2603.10492 2026-03-19 cs.CL

Human-AI Co-reasoning for Clinical Diagnosis with Evidence-Integrated Language Agent

Zhongzhen Huang, Yan Ling, Hong Chen, Ye Feng, Li Wu, Linjie Mu, Shaoting Zhang, Xiaofan Zhang, Kun Qian, Xiaomu Li

Comments After further evaluation, we have decided to withdraw the current version of this manuscript for further revision. We plan to add new experiments, improve the writing and overall presentation for greater clarity and coherence, and re-examine the dataset and related descriptions to ensure rigor and reliability before submitting an updated version

2603.09565 2026-03-19 cs.RO

ReTac-ACT: A State-Gated Vision-Tactile Fusion Transformer for Precision Assembly

Minchi Ruan, LiangQing Zhou, Hongtong Li, Zongtao Wang, ZhaoMing Lu, Jianwei Zhang, Bin Fang

2603.06313 2026-03-19 cs.CV

WMoE-CLIP: Wavelet-Enhanced Mixture-of-Experts Prompt Learning for Zero-Shot Anomaly Detection

Peng Chen, Chao Huang

2603.05538 2026-03-19 cs.LG cs.AI physics.comp-ph

JAWS: Enhancing Long-term Rollout of Neural PDE Solvers via Spatially-Adaptive Jacobian Regularization

Fengxiang Nie, Yasuhiro Suzuki

Comments 22 pages, 18 figures

2603.00340 2026-03-19 cs.LG

Detecting Transportation Mode Using Dense Smartphone GPS Trajectories and Transformer Models

Yuandong Zhang, Othmane Echchabi, Tianshu Feng, Wenyi Zhang, Hsuai-Kai Liao, Charles Chang

Comments Accepted for publication in the International Journal of Geographical Information Science, February 2026. This is the accepted manuscript. The final version of record will appear in IJGIS (Taylor and Francis)

2602.18735 2026-03-19 cs.CV cs.RO

LaS-Comp: Zero-shot 3D Completion with Latent-Spatial Consistency

Weilong Yan, Haipeng Li, Hao Xu, Nianjin Ye, Yihao Ai, Shuaicheng Liu, Jingyu Hu

Comments Accepted by CVPR2026

2601.21592 2026-03-19 cs.CV

Unifying Heterogeneous Degradations: Uncertainty-Aware Diffusion Bridge Model for All-in-One Image Restoration

Luwei Tu, Jiawei Wu, Xing Luo, Zhi Jin

2601.12882 2026-03-19 cs.CV cs.AI

YOLO26: An Analysis of NMS-Free End to End Framework for Real-Time Object Detection

Sudip Chakrabarty

2601.12224 2026-03-19 cs.CV cs.AI

Where It Moves, It Matters: Referring Surgical Instrument Segmentation via Motion

Meng Wei, Kun Yuan, Shi Li, Yue Zhou, Long Bai, Nassir Navab, Hongliang Ren, Hong Joo Lee, Tom Vercauteren, Nicolas Padoy

2601.11639 2026-03-19 cs.LG

Global Optimization By Gradient From Hierarchical Score-Matching Spaces

Ming Li

Comments Correct inconsistencies in title capitalization, fix tiny error of one formula and modify it's formatting

2601.09233 2026-03-19 cs.LG cs.AI cs.CL

GIFT: Reconciling Post-Training Objectives via Finite-Temperature Gibbs Initialization

Zhengyang Zhao, Lu Ma, Yizhen Jiang, Xiaochen Ma, Zimo Meng, Chengyu Shen, Lexiang Tang, Haoze Sun, Peng Pei, Wentao Zhang

2601.03981 2026-03-19 cs.CL

RADAR: Retrieval-Augmented Detector with Adversarial Refinement for Robust Fake News Detection

Song-Duo Ma, Yi-Hung Liu, Hsin-Yu Lin, Pin-Yu Chen, Hong-Yan Huang, Shau-Yung Hsu, Yun-Nung Chen

2512.20991 2026-03-19 cs.AI cs.MA

FinAgent: An Agentic AI Framework Integrating Personal Finance and Nutrition Planning

Toqeer Ali Syed, Abdulaziz Alshahrani, Ali Ullah, Ali Akarma, Sohail Khan, Muhammad Nauman, Salman Jan

Comments This paper was presented at the IEEE International Conference on Computing and Applications (ICCA 2025), Bahrain

2512.16923 2026-03-19 cs.CV

Generative Refocusing: Flexible Defocus Control from a Single Image

Chun-Wei Tuan Mu, Cheng-De Fan, Jia-Bin Huang, Yu-Lun Liu

Comments Project website: https://generative-refocusing.github.io/

2512.15248 2026-03-19 cs.CL

The Moralization Corpus: Frame-Based Annotation and Analysis of Moralizing Speech Acts across Diverse Text Genres

Maria Becker, Mirko Sommer, Lars Tapken, Yi Wan Teh, Bruno Brocai

2512.12220 2026-03-19 cs.CV

TechImage-Bench: Rubric-Based Evaluation for Technical Image Generation

Minheng Ni, Zhengyuan Yang, Yaowen Zhang, Linjie Li, Chung-Ching Lin, Kevin Lin, Zhendong Wang, Xiaofei Wang, Shujie Liu, Lei Zhang, Wangmeng Zuo, Lijuan Wang

2512.11903 2026-03-19 cs.RO cs.CV

Aion: Towards Hierarchical 4D Scene Graphs with Temporal Flow Dynamics

Iacopo Catalano, Eduardo Montijano, Javier Civera, Julio A. Placed, Jorge Pena-Queralta

Comments Accepted at ICRA 2026, 8 pages

2512.01899 2026-03-19 cs.LG stat.ML

Provably Safe Model Updates

Leo Elmecker-Plakolm, Pierre Fasterling, Philip Sosnin, Calvin Tsay, Matthew Wicker

Comments 12 pages, 9 figures. This work has been accepted for publication at SaTML 2026. The final version will be available on IEEE Xplore

2511.20292 2026-03-19 cs.RO

Dynamic-ICP: Doppler-Aware Iterative Closest Point Registration for Dynamic Scenes

Dong Wang, Daniel Casado Herraez, Stefan May, Andreas Nüchter

Comments 8 pages, 5 figures