arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2602.20496 2026-02-25 cs.CV

Pip-Stereo: Progressive Iterations Pruner for Iterative Optimization based Stereo Matching

Jintu Zheng, Qizhe Liu, HuangXin Xu, Zhuojie Chen

Comments Accepted to CVPR 2026 (3D vision track)

详情

英文摘要

While iterative stereo matching achieves high accuracy, its dependence on Recurrent Neural Networks (RNN) hinders edge deployment, a challenge underexplored in existing researches. We analyze iterative refinement and reveal that disparity updates are spatially sparse and temporally redundant. First, we introduce a progressive iteration pruning strategy that suppresses redundant update steps, effectively collapsing the recursive computation into a near-single-pass inference. Second, we propose a collaborative monocular prior transfer framework that implicitly embeds depth priors without requiring a dedicated monocular encoder, thereby eliminating its associated computational burden. Third, we develop FlashGRU, a hardware-aware RNN operator leveraging structured sparsity and I/O-conscious design, achieving a 7.28$\times$ speedup, 76.6\% memory peak reduction and 80.9\% global memory requests reduction over natvie ConvGRUs under 2K resolution. Our PipStereo enables real-time, high-fidelity stereo matching on edge hardware: it processes 320$\times$640 frames in just 75ms on an NVIDIA Jetson Orin NX (FP16) and 19ms on RTX 4090, matching the accuracy of large iterative based models, and our generalization ability and accuracy far exceeds that of existing real-time methods. Our embedded AI projects will be updated at: https://github.com/XPENG-Aridge-AI.

URL PDF HTML ☆

赞 0 踩 0

2602.20494 2026-02-25 cs.AI

KairosVL: Orchestrating Time Series and Semantics for Unified Reasoning

Haotian Si, Changhua Pei, Xiao He, Zeyan Li, Zhe Xie, Zexin Wang, Jiyao Hu, Zhaoyang Yu, Tieying Zhang, Dan Pei, Jianhui Li, Gaogang Xie

2602.20492 2026-02-25 cs.LG cs.AI

Wireless Federated Multi-Task LLM Fine-Tuning via Sparse-and-Orthogonal LoRA

Nuocheng Yang, Sihua Wang, Ouwen Huan, Mingzhe Chen, Tony Q. S. Quek, Changchuan Yin

Comments 13 pages, 5 figures

2602.20480 2026-02-25 cs.LG cs.AI

VINA: Variational Invertible Neural Architectures

Shubhanshu Shekhar, Mohammad Javad Khojasteh, Ananya Acharya, Tony Tohme, Kamal Youcef-Toumi

Comments 57 pages, 11 figures, 5 tables

2602.20479 2026-02-25 cs.CV

Path-Decoupled Hyperbolic Flow Matching for Few-Shot Adaptation

Lin Li, Ziqi Jiang, Gefan Ye, Zhenqi He, Jiahui Li, Jun Xiao, Kwang-Ting Cheng, Long Chen

2602.20476 2026-02-25 cs.CV

SceMoS: Scene-Aware 3D Human Motion Synthesis by Planning with Geometry-Grounded Tokens

Anindita Ghosh, Vladislav Golyanik, Taku Komura, Philipp Slusallek, Christian Theobalt, Rishabh Dabral

Comments 13 pages, 6 figures, 4 tables

2602.20468 2026-02-25 cs.LG

CGSTA: Cross-Scale Graph Contrast with Stability-Aware Alignment for Multivariate Time-Series Anomaly Detection

Zhongpeng Qi, Jun Zhang, Wei Li, Zhuoxuan Liang

Comments Accepted by DASFAA'26

2602.20467 2026-02-25 cs.LG cs.AI

Elimination-compensation pruning for fully-connected neural networks

Enrico Ballini, Luca Muscarnera, Alessio Fumagalli, Anna Scotti, Francesco Regazzoni

2602.20466 2026-02-25 cs.RO

Grasp to Act: Dexterous Grasping for Tool Use in Dynamic Settings

Harsh Gupta, Mohammad Amin Mirzaee, Wenzhen Yuan

Comments Result videos can be found at https://grasp2act.github.io/

2602.20463 2026-02-25 cs.LG

A Long-Short Flow-Map Perspective for Drifting Models

Zhiqi Li, Bo Zhu

Comments 25 pages, 7 figures

2602.20461 2026-02-25 cs.LG

Nonparametric Teaching of Attention Learners

Chen Zhang, Jianghui Wang, Bingyang Cheng, Zhongtao Chen, Wendong XU, Cong Wang, Marco Canini, Francesco Orabona, Yik Chung WU, Ngai Wong

Comments ICLR 2026 (36 pages, 6 figures)

2602.20459 2026-02-25 cs.AI cs.CL

PreScience: A Benchmark for Forecasting Scientific Contributions

Anirudh Ajith, Amanpreet Singh, Jay DeYoung, Nadav Kunievsky, Austin C. Kozlowski, Oyvind Tafjord, James Evans, Daniel S. Weld, Tom Hope, Doug Downey

Comments 10 pages (53 with bibliography and appendix), 4 figures (13 with appendix), 4 tables (10 with appendix), 1 algorithm

2602.20457 2026-02-25 cs.LG stat.ML

Oracle-Robust Online Alignment for Large Language Models

Zimeng Li, Mudit Gaur, Vaneet Aggarwal

2602.20449 2026-02-25 cs.LG cs.AI cs.CL q-bio.BM

Protein Language Models Diverge from Natural Language: Comparative Analysis and Improved Inference

Anna Hart, Chi Han, Jeonghwan Kim, Huimin Zhao, Heng Ji

2602.20442 2026-02-25 cs.LG cs.AI

Imputation of Unknown Missingness in Sparse Electronic Health Records

Jun Han, Josue Nassar, Sanjit Singh Batra, Aldo Cordova-Palomera, Vijay Nori, Robert E. Tillman

2602.20433 2026-02-25 cs.CL

Disentangling Geometry, Performance, and Training in Language Models

Atharva Kulkarni, Jacob Mitchell Springer, Arjun Subramonian, Swabha Swayamdipta

2602.20424 2026-02-25 cs.AI

Implicit Intelligence -- Evaluating Agents on What Users Don't Say

Ved Sirdeshmukh, Marc Wetter

2602.20423 2026-02-25 cs.CV cs.CL

MedCLIPSeg: Probabilistic Vision-Language Adaptation for Data-Efficient and Generalizable Medical Image Segmentation

Taha Koleilat, Hojat Asgariandehkordi, Omid Nejati Manzari, Berardino Barile, Yiming Xiao, Hassan Rivaz

Comments CVPR 2026; Project Page: https://tahakoleilat.github.io/MedCLIPSeg

2602.20422 2026-02-25 cs.AI cs.LG

Diffusion Modulation via Environment Mechanism Modeling for Planning

Hanping Zhang, Yuhong Guo

2602.20419 2026-02-25 cs.LG

CREDIT: Certified Ownership Verification of Deep Neural Networks Against Model Extraction Attacks

Bolin Shen, Zhan Cheng, Neil Zhenqiang Gong, Fan Yao, Yushun Dong

2602.20418 2026-02-25 cs.LG

CITED: A Decision Boundary-Aware Signature for GNNs Towards Model Extraction Defense

Bolin Shen, Md Shamim Seraj, Zhan Cheng, Shayok Chakraborty, Yushun Dong

2602.20417 2026-02-25 cs.CV

gQIR: Generative Quanta Image Reconstruction

Aryan Garg, Sizhuo Ma, Mohit Gupta

Comments CVPR 2026

2602.20412 2026-02-25 cs.CV

SimLBR: Learning to Detect Fake Images by Learning to Detect Real Images

Aayush Dhakal, Subash Khanal, Srikumar Sastry, Jacob Arndt, Philipe Ambrozio Dias, Dalton Lunga, Nathan Jacobs

Comments Accepted to CVPR 2026

2602.20404 2026-02-25 cs.LG

$κ$-Explorer: A Unified Framework for Active Model Estimation in MDPs

Xihe Gu, Urbashi Mitra, Tara Javidi

2602.20403 2026-02-25 cs.LG math.OC stat.ML

Wasserstein Distributionally Robust Online Learning

Guixian Chen, Salar Fattahi, Soroosh Shafiee

2602.20400 2026-02-25 cs.LG cs.AI

Three Concrete Challenges and Two Hopes for the Safety of Unsupervised Elicitation

Callum Canavan, Aditya Shrivastava, Allison Qi, Jonathan Michala, Fabien Roger

Comments 19 pages, 9 figures

2602.20379 2026-02-25 cs.CL cs.AI

Case-Aware LLM-as-a-Judge Evaluation for Enterprise-Scale RAG Systems

Mukul Chhabra, Luigi Medrano, Arush Verma

Comments 12 pages including appendix, 6 figures

2602.20375 2026-02-25 cs.RO

Generalizing from References using a Multi-Task Reference and Goal-Driven RL Framework

Jiashun Wang, M. Eva Mungai, He Li, Jean Pierre Sleiman, Jessica Hodgins, Farbod Farshidian

2602.20372 2026-02-25 cs.CL

How communicatively optimal are exact numeral systems? Once more on lexicon size and morphosyntactic complexity

Chundra Cathcart, Arne Rubehn, Katja Bocklage, Luca Ciucci, Kellen Parker van Dam, Alžběta Kučerová, Jekaterina Mažara, Carlo Y. Meloni, David Snee, Johann-Mattis List

2602.20363 2026-02-25 cs.CV

Aesthetic Camera Viewpoint Suggestion with 3D Aesthetic Field

Sheyang Tang, Armin Shafiee Sarvestani, Jialu Xu, Xiaoyu Xu, Zhou Wang

Comments 14 pages, 10 figures