arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2602.21904 2026-02-26 cs.CV cs.RO

UNet-Based Keypoint Regression for 3D Cone Localization in Autonomous Racing

Mariia Baidachna, James Carty, Aidan Ferguson, Joseph Agrane, Varad Kulkarni, Aubrey Agub, Michael Baxendale, Aaron David, Rachel Horton, Elliott Atkinson

Comments 8 pages, 9 figures. Accepted to ICCV End-to-End 3D Learning Workshop 2025 and presented as a poster; not included in the final proceedings due to a conference administrative error

2602.21899 2026-02-26 cs.RO cs.NI

Enhancing Cellular-enabled Collaborative Robots Planning through GNSS data for SAR Scenarios

Arnau Romero, Carmen Delgado, Jana Baguer, Raúl Suárez, Xavier Costa-Pérez

Comments arXiv admin note: substantial text overlap with arXiv:2403.09177

2602.21887 2026-02-26 cs.CL

ExpLang: Improved Exploration and Exploitation in LLM Reasoning with On-Policy Thinking Language Selection

Changjiang Gao, Zixian Huang, Kaichen Yang, Jiajun Chen, Jixing Li, Shujian Huang

2602.21873 2026-02-26 cs.CV cs.LG

GFPL: Generative Federated Prototype Learning for Resource-Constrained and Data-Imbalanced Vision Task

Shiwei Lu, Yuhang He, Jiashuo Li, Qiang Wang, Yihong Gong

2602.21864 2026-02-26 cs.CV cs.AI cs.CL cs.GR

DynamicGTR: Leveraging Graph Topology Representation Preferences to Boost VLM Capabilities on Graph QAs

Yanbin Wei, Jiangyue Yan, Chun Kang, Yang Chen, Hua Liu, James Kwok, Yu Zhang

Comments CVPR 2026

2602.21862 2026-02-26 cs.CL

Personalized Graph-Empowered Large Language Model for Proactive Information Access

Chia Cheng Chang, An-Zi Yen, Hen-Hsen Huang, Hsin-Hsi Chen

2602.21857 2026-02-26 cs.AI cs.CL cs.LG

Distill and Align Decomposition for Enhanced Claim Verification

Jabez Magomere, Elena Kochkina, Samuel Mensah, Simerjot Kaur, Fernando Acero, Arturo Oncevay, Charese H. Smiley, Xiaomo Liu, Manuela Veloso

Comments EACL Findings 2026

2602.21855 2026-02-26 cs.CV cs.AI

Understanding Annotation Error Propagation and Learning an Adaptive Policy for Expert Intervention in Barrett's Video Segmentation

Lokesha Rasanjalee, Jin Lin Tan, Dileepa Pitawela, Rajvinder Singh, Hsiang-Ting Chen

Comments Accepted at IEEE ISBI 2026

2602.21854 2026-02-26 cs.CL

FewMMBench: A Benchmark for Multimodal Few-Shot Learning

Mustafa Dogan, Ilker Kesen, Iacer Calixto, Aykut Erdem, Erkut Erdem

Comments Preprint. 49 pages, 38 Figures, 5 Tables

2602.21849 2026-02-26 cs.CV

Meta-FC: Meta-Learning with Feature Consistency for Robust and Generalizable Watermarking

Yuheng Li, Weitong Chen, Chengcheng Zhu, Jiale Zhang, Chunpeng Ge, Di Wu, Guodong Long

2602.21845 2026-02-26 cs.LG cs.AI cs.CY

xai-cola: A Python library for sparsifying counterfactual explanations

Lin Zhu, Lei You

Comments 5pages, 1 figure

2602.21844 2026-02-26 cs.LG cs.DC cs.GT

JSAM: Privacy Straggler-Resilient Joint Client Selection and Incentive Mechanism Design in Differentially Private Federated Learning

Ruichen Xu, Ying-Jun Angela Zhang, Jianwei Huang

2602.21829 2026-02-26 cs.CV cs.AI

StoryMovie: A Dataset for Semantic Alignment of Visual Stories with Movie Scripts and Subtitles

Daniel Oliveira, David Martins de Matos

Comments 15 pages, submitted to Journal of Visual Communication and Image Representation

2602.21824 2026-02-26 cs.LG

DocDjinn: Controllable Synthetic Document Generation with VLMs and Handwriting Diffusion

Marcel Lamott, Saifullah Saifullah, Nauman Riaz, Yves-Noel Weweler, Tobias Alt-Veit, Ahmad Sarmad Ali, Muhammad Armaghan Shakir, Adrian Kalwa, Momina Moetesum, Andreas Dengel, Sheraz Ahmed, Faisal Shafait, Ulrich Schwanecke, Adrian Ulges

2602.21816 2026-02-26 cs.RO

Self-Curriculum Model-based Reinforcement Learning for Shape Control of Deformable Linear Objects

Zhaowei Liang, Song Wang, Zhao Jin, Shirui Wu, Dan Wu

2602.21811 2026-02-26 cs.RO

DexRepNet++: Learning Dexterous Robotic Manipulation with Geometric and Spatial Hand-Object Representations

Qingtao Liu, Zhengnan Sun, Yu Cui, Haoming Li, Gaofeng Li, Lin Shao, Jiming Chen, Qi Ye

Comments Accepted by IEEE Transactions on Robotics (T-RO), 2026

2602.21810 2026-02-26 cs.CV

GeoMotion: Rethinking Motion Segmentation via Latent 4D Geometry

Xiankang He, Peile Lin, Ying Cui, Dongyan Guo, Chunhua Shen, Xiaoqin Zhang

2602.21798 2026-02-26 cs.LG cs.AI

Excitation: Momentum For Experts

Sagi Shaier

2602.21786 2026-02-26 cs.CL

D-COT: Disciplined Chain-of-Thought Learning for Efficient Reasoning in Small Language Models

Shunsuke Ubukata

Comments 9 pages, 3 figures. Code: https://github.com/gitpullpull/DisciplinedChainOfThought | Benchmarks: https://huggingface.co/datasets/gitpullpull/D-CoT-Benchmarks | Dataset: https://huggingface.co/datasets/gitpullpull/D-CoT-datasets

2602.21783 2026-02-26 cs.RO cs.LG

Therapist-Robot-Patient Physical Interaction is Worth a Thousand Words: Enabling Intuitive Therapist Guidance via Remote Haptic Control

Beatrice Luciani, Alex van den Berg, Matti Lang, Alexandre L. Ratschat, Laura Marchal-Crespo

Comments 14 pages, 5 figures, 3 tables

2602.21780 2026-02-26 cs.CV

XStreamVGGT: Extremely Memory-Efficient Streaming Vision Geometry Grounded Transformer with KV Cache Compression

Zunhai Su, Weihao Ye, Hansen Feng, Keyu Fan, Jing Zhang, Dahai Yu, Zhengwu Liu, Ngai Wong

Comments Submission to the Journal of the Society for Information Display

2602.21779 2026-02-26 cs.CV cs.AI

Beyond Static Artifacts: A Forensic Benchmark for Video Deepfake Reasoning in Vision Language Models

Zheyuan Gu, Qingsong Zhao, Yusong Wang, Zhaohong Huang, Xinqi Li, Cheng Yuan, Jiaowei Shao, Chi Zhang, Xuelong Li

Comments 16 pages, 9 figures. Submitted to CVPR 2026

2602.21773 2026-02-26 cs.LG cs.CV

Easy to Learn, Yet Hard to Forget: Towards Robust Unlearning Under Bias

JuneHyoung Kwon, MiHyeon Kim, Eunju Lee, Yoonji Lee, Seunghoon Lee, YoungBin Kim

Comments Accepted to AAAI 2026

2602.21765 2026-02-26 cs.LG cs.AI stat.ML

Generalisation of RLHF under Reward Shift and Clipped KL Regularisation

Kenton Tang, Yuzhu Chen, Fengxiang He

2602.21763 2026-02-26 cs.CL

Improving Implicit Discourse Relation Recognition with Natural Language Explanations from LLMs

Heng Wang, Changxing Wu

Comments AAAI26'0ral

2602.21762 2026-02-26 cs.CV

SAPNet++: Evolving Point-Prompted Instance Segmentation with Semantic and Spatial Awareness

Zhaoyang Wei, Xumeng Han, Xuehui Yu, Xue Yang, Guorong Li, Zhenjun Han, Jianbin Jiao

Comments 18 pages

详情

DOI: 10.1109/TPAMI.2026.3667694
Journal ref: TPAMI 2026

英文摘要

Single-point annotation is increasingly prominent in visual tasks for labeling cost reduction. However, it challenges tasks requiring high precision, such as the point-prompted instance segmentation (PPIS) task, which aims to estimate precise masks using single-point prompts to train a segmentation network. Due to the constraints of point annotations, granularity ambiguity and boundary uncertainty arise the difficulty distinguishing between different levels of detail (eg. whole object vs. parts) and the challenge of precisely delineating object boundaries. Previous works have usually inherited the paradigm of mask generation along with proposal selection to achieve PPIS. However, proposal selection relies solely on category information, failing to resolve the ambiguity of different granularity. Furthermore, mask generators offer only finite discrete solutions that often deviate from actual masks, particularly at boundaries. To address these issues, we propose the Semantic-Aware Point-Prompted Instance Segmentation Network (SAPNet). It integrates Point Distance Guidance and Box Mining Strategy to tackle group and local issues caused by the point's granularity ambiguity. Additionally, we incorporate completeness scores within proposals to add spatial granularity awareness, enhancing multiple instance learning (MIL) in proposal selection termed S-MIL. The Multi-level Affinity Refinement conveys pixel and semantic clues, narrowing boundary uncertainty during mask refinement. These modules culminate in SAPNet++, mitigating point prompt's granularity ambiguity and boundary uncertainty and significantly improving segmentation performance. Extensive experiments on four challenging datasets validate the effectiveness of our methods, highlighting the potential to advance PPIS.

URL PDF HTML ☆

赞 0 踩 0

2602.21760 2026-02-26 cs.CV

Accelerating Diffusion via Hybrid Data-Pipeline Parallelism Based on Conditional Guidance Scheduling

Euisoo Jung, Byunghyun Kim, Hyunjin Kim, Seonghye Cho, Jae-Gil Lee

2602.21757 2026-02-26 cs.LG cs.AI

Learning from Yesterday's Error: An Efficient Online Learning Method for Traffic Demand Prediction

Xiannan Huang, Quan Yuan, Chao Yang

2602.21754 2026-02-26 cs.CV

LiREC-Net: A Target-Free and Learning-Based Network for LiDAR, RGB, and Event Calibration

Aditya Ranjan Dash, Ramy Battrawy, René Schuster, Didier Stricker

Comments Accepted in CVPR 2026

2602.21745 2026-02-26 cs.AI cs.CY

The ASIR Courage Model: A Phase-Dynamic Framework for Truth Transitions in Human and AI Systems

Hyo Jin Kim

Comments 13 pages, 5 figures. Version 1. Includes recursive feedback extension and simulation results. Data available via DOI: 10.5281/zenodo.18754266