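arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems science, and more.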

2604.25376 2026-04-29 cs.CV cs.AI

CoRE: Concept-Reasoning Expansion for Continual Brain Lesion Segmentation

Qianqian Chen, Anglin Liu, Jingyang Zhang, Yudong Zhang

Abstract

Accurate brain lesion segmentation in MRI is vital for effective clinical diagnosis and treatment planning. Due to high annotation costs and strict data privacy regulations, universal models must employ Continual Learning (CL) to adapt to evolving clinical tasks without losing previously acquired knowledge. However, existing CL paradigms often suffer from capacity limits or redundant parameter growth, and even advanced dynamic methods rely mostly on image-perception strategies that struggle to handle the substantial pathological and multimodal heterogeneity inherent in brain imaging. To address this issue, we propose the Concept-Reasoning Expansion (CoRE) framework, which establishes a joint decision-making mechanism by integrating visual features with structured concepts. Through the alignment of image tokens with a hierarchical concept library, CoRE simulates clinical reasoning to guide both interpretable expert routing and demand-based model growth. This collaborative process ensures model evolution is grounded in clinical priors, preventing redundant parameter expansion while maximizing knowledge reuse. Extensive evaluations across 12 sequential brain lesion MRI tasks demonstrate that CoRE achieves state-of-the-art performance and provides a knowledge-rich starting point for efficient future adaptation. Its superior few-shot transferability and clinical interpretability further validate its effectiveness in managing non-stationary clinical data streams. Our code will be released soon.

2604.25374 2026-04-29 cs.CL cs.AI

Language corpora for the Dutch medical domain

B. van Es

Comments 11 pages, no figures

2604.25370 2026-04-29 cs.CV cs.AI

GPT-Image-2 in the Wild: A Twitter Dataset of Self-Reported AI-Generated Images from the First Week of Deployment

Kidus Zewde, Simiao Ren, Xingyu Shen, Jenny Wu, Yuchen Zhou, Tommy Duong, Zikang Zhang, Ethan Traister

Comments 11 pages; GPT-image-2 social media dataset; Twitter API collection and multilingual curation; C2PA watermark stripping on platform upload; browser-automated AI badge verification; CLIP semantic clustering; AI-generated image provenance and attribution

2604.25369 2026-04-29 cs.AI

Multi-action Tangled Program Graphs for Multi-task Reinforcement Learning with Continuous Control

Quentin Vacher, Nicolas Beuve, Mickaël Dardaillon, Karol Desnos

2604.25361 2026-04-29 cs.CV

HuM-Eval: A Coarse-to-Fine Framework for Human-Centric Video Evaluation

Bingzi Zhang, Kaisi Guan, Ruihua Song

Comments Accepted to the 2026 IEEE International Conference on Multimedia and Expo (ICME 2026)

2604.25359 2026-04-29 cs.CL cs.AI

The Structured Output Benchmark: A Multi-Source Benchmark for Evaluating Structured Output Quality in Large Language Models

Abhinav Kumar Singh, Harsha Vardhan Khurdula, Yoeven D Khemlani, Vineet Agarwal

Comments 19 pages, 4 figures, 11 tables, submitted to NeurIPS 2026

2604.25358 2026-04-29 cs.CV

Benchmarking Layout-Guided Diffusion Models through Unified Semantic-Spatial Evaluation in Closed and Open Settings

Luca Parolari, Nicla Faccioli, Lamberto Ballan

Comments Accepted to CVPRF 2026

2604.25352 2026-04-29 cs.LG cs.AI

GraphPL: Leveraging GNN for Efficient and Robust Modalities Imputation in Patchwork Learning

Xingjian Hu, Zuoyu Yan, Jianhua Zhu, Liangcai Gao, Fei Wang, Tengfei Ma

Comments Accepted at ICASSP 2026. This is a preprint of the work

2604.25345 2026-04-29 cs.AI astro-ph.IM

Plausible but Wrong: A case study on Agentic Failures in Astrophysical Workflows

Shivam Rawat, Lucie Flek

2604.25334 2026-04-29 cs.LG cs.AI

VAE-Inf: A statistically interpretable generative paradigm for imbalanced classification

Hongfei Wu, Ruijian Han, Yancheng Yuan

2604.25329 2026-04-29 cs.RO

ProDrive: Proactive Planning for Autonomous Driving via Ego-Environment Co-Evolution

Chuyao Fu, Shengzhe Gan, Zhuoli Ouyang, Yuhan Rui, Xiaowei Chi, Sirui Han, Jiankun Wang, Hong Zhang

Comments Accepted to CVPR 2026 GigaBrain Challenge Workshop

2604.25323 2026-04-29 cs.RO

ANCHOR: A Physically Grounded Closed-Loop Framework for Robust Home-Service Mobile Manipulation

Jinhao Jiang, Shengyu Fang, Sibo Zuo, Yujie Tang, Yirui Li

Abstract

Recent advances in open-vocabulary mobile manipulation have brought robots into real domestic environments. In such settings, reliable long-horizon execution under open-set object references and frequent disturbances becomes essential. However, many failures persist. These are not caused by semantic misunderstanding but by inconsistencies between symbolic plans and the evolving physical world, manifested as three recurring limitations: (i) existing systems often rely on pre-scanned semantic maps that become inconsistent after scene changes and disturbances; (ii) they select navigation endpoints without considering downstream manipulation feasibility, causing the "arrived but inoperable" problem; and (iii) they handle anomalies through undifferentiated global replanning, which often fails to contain local errors. To address this execution inconsistency, we present ANCHOR, a physically grounded closed-loop framework that aligns symbolic reasoning with verifiable physical state during execution. ANCHOR integrates three mechanisms: (i) physically anchored task planning, which binds symbolic predicates to observable geometric anchors and re-validates them after each action; (ii) operability-aware base alignment, which ensures that navigation endpoints satisfy kinematic reachability and local collision feasibility; and (iii) minimum-responsible-layer hierarchical recovery, which localizes failures across perception, base-arm coordination, and execution layers to prevent cascading retries. Across 60 real-robot trials in previously unseen environments, ANCHOR improves task success from 53.3% to 71.7% and achieves a 71.4% recovery rate under perturbations, demonstrating that explicit physical grounding and structured failure containment are critical for robust mobile manipulation. Our project page is available at https://anchor9178.github.io/ANCHOR/.

2604.25322 2026-04-29 cs.CV

Assessment of the quantitative impact of occlusal positioning splints on temporomandibular joint conditions

Agnieszka Anna Tomaka, Krzysztof Domino, Dariusz Pojda, Michał Tarnawski

Comments 27 pages, 9 figures

2604.25319 2026-04-29 cs.CV

Edge-Cloud Collaborative Reconstruction via Structure-Aware Latent Diffusion for Downstream Remote Sensing Perception

Yun Li, Xianju Li

Comments 6 pages, 3 figures

2604.25316 2026-04-29 cs.CV

Towards Robust Deep Learning-based Rumex Obtusifolius Detection from Drone Images

Fabian Dionys Schrag, Mehmet Ozgur Turkoglu, Konrad Schindler, Ralph Lukas Stoop

Comments under review

2604.25315 2026-04-29 cs.CV

SaliencyDecor: Enhancing Neural Network Interpretability through Feature Decorrelation

Ali Karkehabadi, Jamshid Hassanpour, Houman Homayoun, Avesta Sasan

Comments Accepted for publication at the International Joint Conference on Neural Networks (IJCNN 2026)

2604.25314 2026-04-29 cs.CV

Golden RPG: Confidence-Adaptive Region-Aware Noise for Compositional Text-to-Image Generation

Hao Li

Comments 13 pages

2604.25310 2026-04-29 cs.CV eess.IV

Rapid tracking through strongly scattering media with physics-informed neuromorphic speckle analysis

Yuqing Cao, Shuo Zhu, Rongzhou Chen, Jingyan Chen, Ni Chen, Edmund Y. Lam

2604.25306 2026-04-29 cs.LG cs.AI

QFlash: Bridging Quantization and Memory Efficiency in Vision Transformer Attention

Sehyeon Oh, Yongin Kwon, Jemin Lee

Comments 11 pages, 6 figures

2604.25304 2026-04-29 cs.LG

RCProb: Probabilistic Rule Extraction for Efficient Simplification of Tree Ensembles

Josue Obregon

Comments 20 pages, 3 figures. Submitted to Information Sciences, currently under review

2604.25300 2026-04-29 cs.CV eess.IV

DenseScout: Algorithm-System Co-design for Budgeted Tiny Object Selection on Edge Platforms

Xiong Zhouzhi, Zimo Zeng, Yi Chen, Shuqi Xu, Yunfeng Yan, Donglian Qi

Comments 19 pages, 8 figures

2604.25299 2026-04-29 cs.CV cs.AI

The Thinking Pixel: Recursive Sparse Reasoning in Multimodal Diffusion Latents

Yuwei Sun, Yuxuan Yao, Hui Li, Siyu Zhu

2604.25297 2026-04-29 cs.CL cs.AI

LegalMidm: Use-Case-Driven Legal Domain Specialization for Korean Large Language Model

Youngjoon Jang, Chanhee Park, Hyeonseok Moon, Young-kyoung Ham, Jiwon Moon, Jinhyeon Kim, JuKyung Jung, Heuiseok Lim

Comments ICLR 2026 DATA-FM Workshop

2604.25296 2026-04-29 cs.CL

Learning from Medical Entity Trees: An Entity-Centric Medical Data Engineering Framework for MLLMs

Jianghang Lin, Haihua Yang, Deli Yu, Kai Wu, Kai Ye, Jinghao Lin, Zihan Wang, Yuhang Wu, Liujuan Cao

2604.25295 2026-04-29 cs.LG

Optimization-Free Topological Sort for Causal Discovery via the Schur Complement of Score Jacobians

Rui Wu, Hong Xie

Comments 18 pages, 3 figures, 7 tables

2604.25292 2026-04-29 cs.RO cs.SY eess.SY

Slot-hopping Enabled Loiter Guidance and Automation for Fixed-wing UAV Corridors

Pradeep J, Siddhardha Kedarisetty, Ashwini Ratnoo

2604.25289 2026-04-29 cs.LG cs.CV

Exploring Time Conditioning in Diffusion Generative Models from Disjoint Noisy Data Manifolds

Liuzhuozheng Li, Zhiyuan Zhan, Shuhong Liu, Dengyang Jiang, Zanyi Wang, Guang Dai, Jingdong Wang, Mengmeng Wang

2604.25276 2026-04-29 cs.CV

OmniVTG: A Large-Scale Dataset and Training Paradigm for Open-World Video Temporal Grounding

Minghang Zheng, Zihao Yin, Yi Yang, Yuxin Peng, Yang Liu

Comments CVPR 2026

2604.25273 2026-04-29 cs.CV

Combating Visual Neglect and Semantic Drift in Large Multimodal Models for Enhanced Cross-Modal Retrieval

Guosheng Zhang, Linkai Liu, Keyao Wang, Haixiao Yue, Zhiwen Tan, Xiao Tan

2604.25269 2026-04-29 cs.LG stat.ML

Online combinatorial optimization with stochastic decision sets and adversarial losses

Gergely Neu, Michal Valko

Comments Published at Neural Information Processing Systems (NeurIPS) 2014