arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.21206 2026-03-24 cs.CV

Boundary-Aware Instance Segmentation in Microscopy Imaging

Thomas Mendelson, Joshua Francois, Galit Lahav, Tammy Riklin-Raviv

Comments Accepted for publication in IEEE International Symposium on Biomedical Imaging (ISBI) 2026

详情

英文摘要

Accurate delineation of individual cells in microscopy videos is essential for studying cellular dynamics, yet separating touching or overlapping instances remains a persistent challenge. Although foundation-model for segmentation such as SAM have broadened the accessibility of image segmentation, they still struggle to separate nearby cell instances in dense microscopy scenes without extensive prompting. We propose a prompt-free, boundary-aware instance segmentation framework that predicts signed distance functions (SDFs) instead of binary masks, enabling smooth and geometry-consistent modeling of cell contours. A learned sigmoid mapping converts SDFs into probability maps, yielding sharp boundary localization and robust separation of adjacent instances. Training is guided by a unified Modified Hausdorff Distance (MHD) loss that integrates region- and boundary-based terms. Evaluations on both public and private high-throughput microscopy datasets demonstrate improved boundary accuracy and instance-level performance compared to recent SAM-based and foundation-model approaches. Source code is available at: https://github.com/ThomasMendelson/BAISeg.git

URL PDF HTML ☆

赞 0 踩 0

2603.21195 2026-03-24 cs.RO

GAPG: Geometry Aware Push-Grasping Synergy for Goal-Oriented Manipulation in Clutter

Lijingze Xiao, Jinhong Du, Yang Cong, Supeng Diao, Yu Ren

Comments Accepted to ICRA 2026

2603.21193 2026-03-24 cs.CL cs.AI cs.DL

Context Selection for Hypothesis and Statistical Evidence Extraction from Full-Text Scientific Articles

Sai Koneru, Jian Wu, Sarah Rajtmajer

2603.21192 2026-03-24 cs.CV cs.MM

DSCSNet: A Dynamic Sparse Compression Sensing Network for Closely-Spaced Infrared Small Target Unmixing

Zhiyang Tang, Yiming Zhu, Ruimin Huang, Meng Yang, Yong Ma, Jun Huang, Fan Fan

Comments 13 pages, 8 figures

2603.21191 2026-03-24 cs.LG math.OC stat.ML

On the Role of Batch Size in Stochastic Conditional Gradient Methods

Rustem Islamov, Roman Machacek, Aurelien Lucchi, Antonio Silveti-Falls, Eduard Gorbunov, Volkan Cevher

2603.21183 2026-03-24 cs.RO cs.LG cs.MA

Architecture for Multi-Unmanned Aerial Vehicles based Autonomous Precision Agriculture Systems

Ebasa Temesgen, Nathnael Minyelshowa, Lebsework Negash

2603.21177 2026-03-24 cs.LG cs.AI

Prompt replay: speeding up grpo with on-policy reuse of high-signal prompts

Andrei Baroian, Rutger Berger

2603.21176 2026-03-24 cs.CV

GIDE: Unlocking Diffusion LLMs for Precise Training-Free Image Editing

Zifeng Zhu, Jiaming Han, Jiaxiang Zhao, Minnan Luo, Xiangyu Yue

Comments 25 pages, 7 figures

2603.21175 2026-03-24 cs.LG cs.AI

Reward Sharpness-Aware Fine-Tuning for Diffusion Models

Kwanyoung Kim, Byeongsu Sim

Comments Cam ready version of CVPR26

2603.21173 2026-03-24 cs.LG cs.AI

Rethinking Plasticity in Deep Reinforcement Learning

Zhiqiang He

2603.21172 2026-03-24 cs.CL

Entropy Alone is Insufficient for Safe Selective Prediction in LLMs

Edward Phillips, Fredrik K. Gustafsson, Sean Wu, Anshul Thakur, David A. Clifton

2603.21170 2026-03-24 cs.LG

Pruned Adaptation Modules: A Simple yet Strong Baseline for Continual Foundation Models

Elif Ceren Gok Yildirim, Murat Onur Yildirim, Joaquin Vanschoren

Comments Published at CPAL 2026

2603.21169 2026-03-24 cs.LG

Model Evolution Under Zeroth-Order Optimization: A Neural Tangent Kernel Perspective

Chen Zhang, Yuxin Cheng, Chenchen Ding, Shuqi Wang, Jingreng Lei, Runsheng Yu, Yik-Chung WU, Ngai Wong

Comments ICLR 2026 Workshop on Scientific Methods for Understanding Deep Learning (20 pages, 18 figures)

2603.21166 2026-03-24 cs.CV

Training-Free Instance-Aware 3D Scene Reconstruction and Diffusion-Based View Synthesis from Sparse Images

Jiatong Xia, Lingqiao Liu

Comments Accepted by SIGGRAPH Asia 2025

2603.21162 2026-03-24 cs.AI cs.LG

Revisiting Tree Search for LLMs: Gumbel and Sequential Halving for Budget-Scalable Reasoning

Leonid Ugadiarov, Yuri Kuratov, Aleksandr Panov, Alexey Skrynnik

Comments The paper has been accepted to the ICAPS-2026 conference. 5 pages, 2 figures

2603.21160 2026-03-24 cs.LG cs.CV

Beyond a Single Signal: SPECTREG2, A Unified MultiExpert Anomaly Detector for Unknown Unknowns

Rahul D Ray

2603.21155 2026-03-24 cs.AI

Can LLMs Fool Graph Learning? Exploring Universal Adversarial Attacks on Text-Attributed Graphs

Zihui Chen, Yuling Wang, Pengfei Jiao, Kai Wu, Xiao Wang, Xiang Ao, Dalin Zhang

Comments Accepted by TheWebConf (WWW) 2026

2603.21153 2026-03-24 cs.LG

Learning from Label Proportions with Dual-proportion Constraints

Tianhao Ma, Ximing Li, Changchun Li, Renchu Guan

2603.21143 2026-03-24 cs.RO

Affordance-Guided Enveloping Grasp Demonstration Toward Non-destructive Disassembly of Pinch-Infeasible Mating Parts

Masaki Tsutsumi, Takuya Kiyokawa, Gen Sako, Kensuke Harada

Comments 6 pages, 7 figures

2603.21142 2026-03-24 cs.RO

Dynamic Control Barrier Function Regulation with Vision-Language Models for Safe, Adaptive, and Realtime Visual Navigation

Jeffrey Chen, Rohan Chandra

2603.21140 2026-03-24 cs.AI

ORACLE: Optimizing Reasoning Abilities of Large Language Models via Constraint-Led Synthetic Data Elicitation

Zhuojie Yang, Wentao Wan, Keze Wang

Comments Accepted by AAAI 2026

2603.21138 2026-03-24 cs.CV

Incentivizing Generative Zero-Shot Learning via Outcome-Reward Reinforcement Learning with Visual Cues

Wenjin Hou, Xiaoxiao Sun, Hehe Fan

2603.21136 2026-03-24 cs.CV

MS-CustomNet: Controllable Multi-Subject Customization with Hierarchical Relational Semantics

Pengxiang Cai, Mengyang Li

2603.21135 2026-03-24 cs.CV cs.AI

One Pool Is Not Enough: Multi-Cluster Memory for Practical Test-Time Adaptation

Yu-Wen Tseng, Xingyi Zheng, Ya-Chen Wu, I-Bin Liao, Yung-Hui Li, Hong-Han Shuai, Wen-Huang Cheng

Comments 14 pages, 6 figures

详情

英文摘要

Test-time adaptation (TTA) adapts pre-trained models to distribution shifts at inference using only unlabeled test data. Under the Practical TTA (PTTA) setting, where test streams are temporally correlated and non-i.i.d., memory has become an indispensable component for stable adaptation, yet existing methods universally store amples in a single unstructured pool. We show that this single-cluster design is fundamentally mismatched to PTTA: a stream clusterability analysis reveals that test streams are inherently multi-modal, with the optimal number of mixture components consistently far exceeding one. To close this structural gap, we propose Multi-Cluster Memory (MCM), a plug-and-play framework that organizes stored samples into multiple clusters using lightweight pixel-level statistical descriptors. MCM introduces three complementary mechanisms: descriptor-based cluster assignment to capture distinct distributional modes, Adjacent Cluster Consolidation (ACC) to bound memory usage by merging the most similar temporally adjacent clusters, and Uniform Cluster Retrieval (UCR) to ensure balanced supervision across all modes during adaptation. Integrated with three contemporary TTA methods on CIFAR-10-C, CIFAR-100-C, ImageNet-C, and DomainNet, MCM achieves consistent improvements across all 12 configurations, with gains up to 5.00% on ImageNet-C and 12.13% on DomainNet. Notably, these gains scale with distributional complexity: larger label spaces with greater multi-modality benefit most from multi-cluster organization. GMM-based memory diagnostics further confirm that MCM maintains near-optimal distributional balance, entropy, and mode coverage, whereas single-cluster memory exhibits persistent imbalance and progressive mode loss. These results establish memory organization as a key design axis for practical test-time adaptation.

URL PDF HTML ☆

赞 0 踩 0

2603.21134 2026-03-24 cs.RO cs.CV

Anatomical Prior-Driven Framework for Autonomous Robotic Cardiac Ultrasound Standard View Acquisition

Zhiyan Cao, Zhengxi Wu, Yiwei Wang, Pei-Hsuan Lin, Li Zhang, Zhen Xie, Huan Zhao, Han Ding

Comments Accepted for publication at the IEEE ICRA 2026. 8 pages, 5 figures, 3 tables

2603.21129 2026-03-24 cs.CV

ReDiffuse: Rotation Equivariant Diffusion Model for Multi-focus Image Fusion

Bo Li, Tingting Bao, Lingling Zhang, Weiping Fu, Yaxian Wang, Jun Liu

Comments 10 pages, 9 figures

2603.21123 2026-03-24 cs.RO

VisFly-Lab: Unified Differentiable Framework for First-Order Reinforcement Learning of Quadrotor Control

Fanxing Li, Fangyu Sun, Tianbao Zhang, Shuyu Wu, Dexin Zuo, yufei Yan, Wenxian Yu, Danping Zou

2603.21115 2026-03-24 cs.CV

LiFR-Seg: Anytime High-Frame-Rate Segmentation via Event-Guided Propagation

Xiaoshan Wu, Xiaoyang Lyu, Yifei Yu, Bo Wang, Zhongrui Wang, Xiaojuan Qi

Comments Accepted at ICLR 2026

2603.21114 2026-03-24 cs.CV

CVT-Bench: Counterfactual Viewpoint Transformations Reveal Unstable Spatial Representations in Multimodal LLMs

Shanmukha Vellamcheti, Uday Kiran Kothapalli, Disharee Bhowmick, Sathyanarayanan N. Aakur

Comments 28 pages, 10 figures, 3 tables. Project page: https://shanmukha-here.github.io/CVT-Bench

2603.21111 2026-03-24 cs.CV cs.LG

Frequency Switching Mechanism for Parameter-E!cient Multi-Task Learning

Shih-Wen Liu, Yen-Chang Chen, Wei-Ta Chu, Fu-En Yang, Yu-Chiang Frank Wang

Comments Accepted to CVPR 2026