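arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems science, and other fields.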

2604.00684 2026-04-02 cs.CV

TP-Seg: Task-Prototype Framework for Unified Medical Lesion Segmentation

Jiawei Xu, Qiangqiang Zhou, Dandan Zhu, Yong Chen, Yugen Yi, Xiaoqi Zhao

Abstract

Building a unified model with a single set of parameters to efficiently handle diverse types of medical lesion segmentation has become a crucial objective for AI-assisted diagnosis. Existing unified segmentation approaches typically rely on shared encoders across heterogeneous tasks and modalities, which often leads to feature entanglement, gradient interference, and suboptimal lesion discrimination. In this work, we propose TP-Seg, a task-prototype framework for unified medical lesion segmentation. On one hand, the task-conditioned adapter effectively balances shared and task-specific representations through a dual-path expert structure, enabling adaptive feature extraction across diverse medical imaging modalities and lesion types. On the other hand, the prototype-guided task decoder introduces learnable task prototypes as semantic anchors and employs a cross-attention mechanism to achieve fine-grained modeling of task-specific foreground and background semantics. Without bells and whistles, TP-Seg consistently outperforms specialized, general and unified segmentation methods across 8 different medical lesion segmentation tasks covering multiple imaging modalities, demonstrating strong generalization, scalability and clinical applicability.
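
The prototype-guided decoding idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify dimensions, the number of prototypes per task, or which side supplies the queries. The sketch assumes that image features act as queries and that each task has two prototypes (foreground and background) serving as keys and values in scaled dot-product cross-attention. The names `cross_attend`, `task_prototypes`, and `task_a` are hypothetical, and the prototype values are fixed by hand rather than learned.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attend(queries, prototypes):
    """Scaled dot-product cross-attention with prototypes as keys and values.

    queries:    list of feature vectors (e.g. one per pixel or patch)
    prototypes: list of prototype vectors (assumed: foreground, background)
    Returns (outputs, weights): attended features and attention weights.
    """
    dim = len(prototypes[0])
    scale = 1.0 / math.sqrt(dim)
    outputs, weights = [], []
    for q in queries:
        scores = [scale * sum(a * b for a, b in zip(q, p)) for p in prototypes]
        w = softmax(scores)
        out = [sum(w[k] * prototypes[k][d] for k in range(len(prototypes)))
               for d in range(dim)]
        outputs.append(out)
        weights.append(w)
    return outputs, weights


# Hypothetical prototypes for one task; in a trained model these would be learned.
task_prototypes = {
    "task_a": [[1.0, 0.0, 0.0, 0.0],   # assumed foreground anchor
               [0.0, 1.0, 0.0, 0.0]],  # assumed background anchor
}

features = [[2.0, 0.1, 0.0, 0.0],  # resembles the foreground anchor
            [0.1, 2.0, 0.0, 0.0]]  # resembles the background anchor
attended, attn = cross_attend(features, task_prototypes["task_a"])
labels = ["foreground" if w[0] > w[1] else "background" for w in attn]
print(labels)
```

Selecting a different entry of `task_prototypes` changes the semantic anchors while the attention code stays shared, which is one way to read the abstract's "single set of parameters" across tasks.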

2604.00682 2026-04-02 cs.CV

MoonAnything: A Vision Benchmark with Large-Scale Lunar Supervised Data

Clémentine Grethen, Yuang Shi, Simone Gasparini, Géraldine Morin

Comments Accepted to ACM MMSys 2026

2604.00677 2026-04-02 cs.CV

CL-VISTA: Benchmarking Continual Learning in Video Large Language Models

Haiyang Guo, Yichen Shi, Fei Zhu, Wenzhuo Liu, Hongbo Zhao, Fanhu Zeng, Shijie Ma, Da-Han Wang, Xu-Yao Zhang

Comments Preprint

2604.00669 2026-04-02 cs.LG math.DS

Embedded Variational Neural Stochastic Differential Equations for Learning Heterogeneous Dynamics

Sandeep Kumar Samota, Reema Gupta, Snehashish Chakraverty

2604.00666 2026-04-02 cs.CL

TRIMS: Trajectory-Ranked Instruction Masked Supervision for Diffusion Language Models

Lingjie Chen, Ruizhong Qiu, Yuyu Fan, Yanjun Zhao, Hanghang Tong

Comments 10 pages, 7 figures, 1 algorithm

2604.00651 2026-04-02 cs.CV

When AI and Experts Agree on Error: Intrinsic Ambiguity in Dermatoscopic Images

Loris Cino, Pier Luigi Mazzeo, Alessandro Martella, Giulia Radi, Renato Rossi, Cosimo Distante

2604.00628 2026-04-02 cs.RO cs.HC

StretchBot: A Neuro-Symbolic Framework for Adaptive Guidance with Assistive Robots

Luca Vogelgesang, Ahmed Mehdi Soltani, Mohammadhossein Khojasteh, Xinrui Zu, Stefano De Giorgis, Madalina Croitoru, Filip Ilievski

2604.00613 2026-04-02 cs.CL

English to Central Kurdish Speech Translation: Corpus Creation, Evaluation, and Orthographic Standardization

Mohammad Mohammadamini, Daban Q. Jaff, Josep Crego, Marie Tahon, Antoine Laurent

2604.00610 2026-04-02 cs.CL

Speech LLMs are Contextual Reasoning Transcribers

Keqi Deng, Ruchao Fan, Bo Ren, Yiming Wang, Jinyu Li

2604.00609 2026-04-02 cs.CV

TALENT: Target-aware Efficient Tuning for Referring Image Segmentation

Shuo Jin, Siyue Yu, Bingfeng Zhang, Chao Yao, Meiqin Liu, Jimin Xiao

Comments Accepted by CVPR26 Findings

2604.00601 2026-04-02 cs.CV

KG-CMI: Knowledge graph enhanced cross-Mamba interaction for medical visual question answering

Xianyao Zheng, Hong Yu, Hui Cui, Changming Sun, Xiangyu Li, Ran Su, Leyi Wei, Jia Zhou, Junbo Wang, Qiangguo Jin

2604.00599 2026-04-02 cs.LG

Predicting Dynamics of Ultra-Large Complex Systems by Inferring Governing Equations

Qi Shao, Duxin Chen, Jiawen Chen, Yujie Zeng, Athen Ma, Wenwu Yu, Vito Latora, Wei Lin

Comments 15 pages, 5 figures, under review

2604.00597 2026-04-02 cs.CV

Towards Viewpoint-Robust End-to-End Autonomous Driving with 3D Foundation Model Priors

Hiroki Hashimoto, Hiromichi Goto, Hiroyuki Sugai, Hiroshi Kera, Kazuhiko Kawamoto

Comments Accepted at CVPR Workshop on Simulation for Autonomous Driving 2026

2604.00594 2026-04-02 cs.AI

Agent psychometrics: Task-level performance prediction in agentic coding benchmarks

Chris Ge, Daria Kryvosheieva, Daniel Fried, Uzay Girit, Kaivalya Hariharan

2604.00592 2026-04-02 cs.CV cs.HC

HarassGuard: Detecting Harassment Behaviors in Social Virtual Reality with Vision-Language Models

Junhee Lee, Minseok Kim, Hwanjo Heo, Seungwon Woo, Jinwoo Kim

Comments To appear in the 2026 TVCG Special Issue on the 2026 IEEE Conference on Virtual Reality and 3D User Interfaces (VR)

2604.00586 2026-04-02 cs.CL

More Human, More Efficient: Aligning Annotations with Quantized SLMs

Jiayu Wang, Junyoung Lee

2604.00580 2026-04-02 cs.LG q-bio.BM

Representation choice shapes the interpretation of protein conformational dynamics

Axel Giottonini, Thomas Lemmin

2604.00568 2026-04-02 cs.CL

A Japanese Benchmark for Evaluating Social Bias in Reasoning Based on Attribution Theory

Taihei Shiotani, Masahiro Kaneko, Naoaki Okazaki

2604.00559 2026-04-02 cs.CV

FecalFed: Privacy-Preserving Poultry Disease Detection via Federated Learning

Tien-Yu Chi

Comments Accepted to the CVPR 2026 Workshop on Vision for Agriculture

2604.00558 2026-04-02 cs.CV

STAR: Mitigating Cascading Errors in Spatial Reasoning via Turn-point Alignment and Segment-level DPO

Pukun Zhao, Longxiang Wang, Chen Chen, Peicheng Wang, Fanqing Zhou, Runze Li, Haojian Huang

Comments 9 pages, 6 figures, 4 tables, Accepted by ICME 2026

2604.00557 2026-04-02 cs.RO cs.CV cs.LG

Multi-Camera View Scaling for Data-Efficient Robot Imitation Learning

Yichen Xie, Yixiao Wang, Shuqi Zhao, Cheng-En Wu, Masayoshi Tomizuka, Jianwen Xie, Hao-Shu Fang

2604.00556 2026-04-02 cs.LG cs.AI cs.ET q-fin.CP q-fin.RM

HabitatAgent: An End-to-End Multi-Agent System for Housing Consultation

Hongyang Yang, Yanxin Zhang, Yang She, Yue Xiao, Hao Wu, Yiyang Zhang, Jiapeng Hou, Rongshan Zhang

Comments Accepted at the DMO-FinTech Workshop (PAKDD 2026)

2604.00550 2026-04-02 cs.AI

BloClaw: An Omniscient, Multi-Modal Agentic Workspace for Next-Generation Scientific Discovery

Yao Qin, Yangyang Yan, Jinhua Pang, Xiaoming Zhang

2604.00549 2026-04-02 cs.CV

TF-SSD: A Strong Pipeline via Synergic Mask Filter for Training-free Co-salient Object Detection

Zhijin He, Shuo Jin, Siyue Yu, Shuwei Wu, Bingfeng Zhang, Li Yu, Jimin Xiao

Comments Accepted by CVPR26

2604.00548 2026-04-02 cs.CV

Reliev3R: Relieving Feed-forward Reconstruction from Multi-View Geometric Annotations

Youyu Chen, Junjun Jiang, Yueru Luo, Kui Jiang, Xianming Liu, Xu Yan, Dave Zhenyu Chen

Comments Accepted by CVPR2026

2604.00547 2026-04-02 cs.AI cs.LG

Does Unification Come at a Cost? Uni-SafeBench: A Safety Benchmark for Unified Multimodal Large Models

Zixiang Peng, Yongxiu Xu, Qinyi Zhang, Jiexun Shen, Yifan Zhang, Hongbo Xu, Yubin Wang, Gaopeng Gou

2604.00545 2026-04-02 cs.CV

Neuropsychiatric Deviations From Normative Profiles: An MRI-Derived Marker for Early Alzheimer's Disease Detection

Synne Hjertager Osenbroch, Lisa Ramona Rosvold, Yao Lu, Alvaro Fernandez-Quilez

Comments Accepted and to be presented (ORAL) in ISBI 2026

2604.00538 2026-04-02 cs.CV

TRiGS: Temporal Rigid-Body Motion for Scalable 4D Gaussian Splatting

Suwoong Yeom, Joonsik Nam, Seunggyu Choi, Lucas Yunkyu Lee, Sangmin Kim, Jaesik Park, Joonsoo Kim, Kugjin Yun, Kyeongbo Kong, Sukju Kang

Comments Project page: https://wwwjjn.github.io/TRiGS-project_page/

2604.00537 2026-04-02 cs.CV cs.AI

MATHENA: Mamba-based Architectural Tooth Hierarchical Estimator and Holistic Evaluation Network for Anatomy

Kyeonghun Kim, Jaehyung Park, Youngung Han, Anna Jung, Seongbin Park, Sumin Lee, Jiwon Yang, Jiyoon Han, Subeen Lee, Junsu Lim, Hyunsu Go, Eunseob Choi, Hyeonseok Jung, Soo Yong Kim, Woo Kyoung Jeong, Won Jae Lee, Pa Hong, Hyuk-Jae Lee, Ken Ying-Kai Liao, Nam-Joon Kim

Comments 10 pages, 3 figures, 4 tables

2604.00536 2026-04-02 cs.CL cs.AI

Optimsyn: Influence-Guided Rubrics Optimization for Synthetic Data Generation

Zhiting Fan, Ruizhe Chen, Tianxiang Hu, Ru Peng, Zenan Huang, Haokai Xu, Yixin Chen, Jian Wu, Junbo Zhao, Zuozhu Liu