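arXivDaily: a daily academic digest that syncs the full arXiv dataset and provides AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems science, and more.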

2603.25072 2026-03-27 cs.CV

GIFT: Global Irreplaceability Frame Targeting for Efficient Video Understanding

Junpeng Ma, Sashuai Zhou, Guanghao Li, Xin Gao, Yue Cao, Hengyu Zeng, Yuxiang Yan, Zhibin Wang, Jun Song, Bo Zheng, Shanghang Zhang, Jian Pu

Comments 11 pages, 3 figures

English abstract

Video Large Language Models (VLMs) have achieved remarkable success in video understanding, but the significant computational cost from processing dense frames severely limits their practical application. Existing methods alleviate this by selecting keyframes, but their greedy decision-making, combined with a decoupled evaluation of relevance and diversity, often falls into local optima and results in erroneously selecting irrelevant noise frames. To address these challenges, we propose GIFT: Global Irreplaceability Frame Targeting, a novel training-free framework that selects frames by assessing their intrinsic irreplaceability. Specifically, we first introduce Directed Diversity to quantify a frame's uniqueness conditioned on relevance, which allows us to formulate a unified irreplaceability score. Subsequently, our Budget-Aware Refinement strategy employs an adaptive iterative process that first secures a core set of frames with the highest irreplaceability, and then shifts its priority to building crucial temporal context around these selections as the budget expands. Extensive experiments demonstrate that GIFT achieves a maximum average improvement of 12.5% across long-form video benchmarks on LLaVA-Video-7B compared to uniform sampling.
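
The abstract gives no formulas, so the following is only a toy sketch of the general idea, not the authors' method. It contrasts the uniform-sampling baseline with a single score that combines relevance and diversity. The scoring rule (relevance multiplied by one minus the highest similarity to any more relevant frame) and the names `uniform_sample`, `irreplaceability_scores`, and `select_frames` are all assumptions made for illustration. Budget-Aware Refinement is not modeled.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def uniform_sample(num_frames, budget):
    """Baseline: pick `budget` evenly spaced frame indices."""
    if budget >= num_frames:
        return list(range(num_frames))
    step = num_frames / budget
    return [int(i * step + step / 2) for i in range(budget)]


def irreplaceability_scores(frames, query):
    """Hypothetical score: relevance discounted by redundancy.

    Redundancy of a frame is measured only against frames that are MORE
    relevant to the query (an assumed reading of "uniqueness conditioned
    on relevance"; the paper's actual definition may differ).
    """
    rel = [cosine(f, query) for f in frames]
    scores = []
    for i, f in enumerate(frames):
        sims = [cosine(f, frames[j]) for j in range(len(frames)) if rel[j] > rel[i]]
        redundancy = max(max(sims, default=0.0), 0.0)
        scores.append(rel[i] * (1.0 - redundancy))
    return scores


def select_frames(frames, query, budget):
    """Keep the `budget` highest-scoring frames, returned in temporal order."""
    scores = irreplaceability_scores(frames, query)
    ranked = sorted(range(len(frames)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:budget])


# Toy 2-D "embeddings": frame 1 nearly duplicates frame 0, frame 2 is irrelevant.
frames = [[1.0, 0.0], [1.0, 0.01], [0.0, 1.0], [0.7, 0.7]]
query = [1.0, 0.0]
selected = select_frames(frames, query, budget=2)
baseline = uniform_sample(len(frames), 2)
```

In this toy input the near-duplicate frame 1 is highly relevant but almost fully redundant with frame 0, so it scores close to zero and the partially relevant but distinct frame 3 is kept instead.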

2603.25070 2026-03-27 cs.LG cs.AI

An Explainable Ensemble Learning Framework for Crop Classification with Optimized Feature Pyramids and Deep Networks

Syed Rayhan Masud, SK Muktadir Hossain, Md. Ridoy Sarkar, Mohammad Sakib Mahmood, Md. Kishor Morol, Rakib Hossain Sajib

2603.25062 2026-03-27 cs.LG

SIGMA: Structure-Invariant Generative Molecular Alignment for Chemical Language Models via Autoregressive Contrastive Learning

Xinyu Wang, Fei Dou, Jinbo Bi, Minghu Song

Comments 15 pages, 6 figures. Submitted to ICML 2026. Primary category: cs.LG (Machine Learning); Secondary: cs.AI, q-bio.QM

2603.25058 2026-03-27 cs.CV

Learning Explicit Continuous Motion Representation for Dynamic Gaussian Splatting from Monocular Videos

Xuankai Zhang, Junjin Xiao, Shangwei Huang, Wei-shi Zheng, Qing Zhang

Comments Accepted to CVPR 2026

2603.25054 2026-03-27 cs.CV

Synergistic Event-SVE Imaging for Quantitative Propellant Combustion Diagnostics

Jing Tao, Taihang Lei, Banglei Guan, Ying Qu, Xudong Na, Likun Ma, Yang Shang, Qifeng Yu

2603.25047 2026-03-27 cs.LG stat.ML

The Order Is The Message

Jordan LeDoux

Comments 51 pages, 12 figures

2603.25046 2026-03-27 cs.AI cs.LG

MP-MoE: Matrix Profile-Guided Mixture of Experts for Precipitation Forecasting

Huyen Ngoc Tran, Dung Trung Tran, Hong Nguyen, Xuan Vu Phan, Nam-Phong Nguyen

2603.25042 2026-03-27 cs.CV

MoRGS: Efficient Per-Gaussian Motion Reasoning for Streamable Dynamic 3D Scenes

Wonjoon Lee, Sungmin Woo, Donghyeong Kim, Jungho Lee, Sangheon Park, Sangyoun Lee

2603.25038 2026-03-27 cs.RO

$π$, But Make It Fly: Physics-Guided Transfer of VLA Models to Aerial Manipulation

Johnathan Tucker, Denis Liu, Aiden Swann, Allen Ren, Javier Yu, Jiankai Sun, Brandon Kim, Lachlain McGranahan, Quan Vuong, Mac Schwager

2603.25035 2026-03-27 cs.AI

Mechanistically Interpreting Compression in Vision-Language Models

Veeraraju Elluru, Arth Singh, Roberto Aguero, Ajay Agarwal, Debojyoti Das, Hreetam Paul

Comments 15 pages, 7 figures, 12 tables

2603.25033 2026-03-27 cs.LG

Epistemic Compression: The Case for Deliberate Ignorance in High-Stakes AI

Steffen Lukas

Comments 28 pages, 6 figures

2603.25031 2026-03-27 cs.AI

From Stateless to Situated: Building a Psychological World for LLM-Based Emotional Support

Boning Zhao, Clover Hu, Xinnuo Li

2603.25026 2026-03-27 cs.CV

CARE: Training-Free Controllable Restoration for Medical Images via Dual-Latent Steering

Xu Liu

2603.25025 2026-03-27 cs.AI

System-Anchored Knee Estimation for Low-Cost Context Window Selection in PDE Forecasting

Wenshuo Wang, Fan Zhang

2603.25022 2026-03-27 cs.AI cs.CR cs.CY cs.LG

A Public Theory of Distillation Resistance via Constraint-Coupled Reasoning Architectures

Peng Wei, Wesley Shu

2603.25020 2026-03-27 cs.CV

GDPO-Listener: Expressive Interactive Head Generation via Auto-Regressive Flow Matching and Group reward-Decoupled Policy Optimization

Zhangyu Jin, Maksim Siniukov, Deuksin Kwon, Ashutosh Chaubey, Mohammad Soleymani

2603.25015 2026-03-27 cs.CL cs.AI cs.SE

Imperative Interference: Social Register Shapes Instruction Topology in Large Language Models

Tony Mason

2603.25009 2026-03-27 cs.LG

A Systematic Empirical Study of Grokking: Depth, Architecture, Activation, and Regularization

Shalima Binta Manir, Anamika Paul Rupa

2603.25006 2026-03-27 cs.CV cs.AI

Improving Fine-Grained Rice Leaf Disease Detection via Angular-Compactness Dual Loss Learning

Md. Rokon Mia, Rakib Hossain Sajib, Abdullah Al Noman, Abir Ahmed, B M Taslimul Haque

2603.25004 2026-03-27 cs.CV cs.MM

Interpretable Zero-shot Referring Expression Comprehension with Query-driven Scene Graphs

Yike Wu, Necva Bolucu, Stephen Wan, Dadong Wang, Jiahao Xia, Jian Zhang

Comments Accepted by T-MM

2603.25001 2026-03-27 cs.AI

Rethinking Failure Attribution in Multi-Agent Systems: A Multi-Perspective Benchmark and Evaluation

Yeonjun In, Mehrab Tanjim, Jayakumar Subramanian, Sungchul Kim, Uttaran Bhattacharya, Wonjoong Kim, Sangwu Park, Somdeb Sarkhel, Chanyoung Park

Comments Under review

2603.25000 2026-03-27 cs.CV

Distributed Real-Time Vehicle Control for Emergency Vehicle Transit: A Scalable Cooperative Method

WenXi Wang, JunQi Zhang

Comments Submitted to IEEE Transactions on Cybernetics

English abstract

Rapid transit of emergency vehicles is critical for saving lives and reducing property loss but often relies on surrounding ordinary vehicles to cooperatively adjust their driving behaviors. It is important to ensure rapid transit of emergency vehicles while minimizing the impact on ordinary vehicles. Centralized mathematical solvers and reinforcement learning are the state-of-the-art methods. The former obtain optimal solutions but are only practical for small-scale scenarios. The latter learns implicitly through extensive centralized training, but the trained model exhibits limited scalability to different traffic conditions. Hence, existing methods suffer from two fundamental limitations: high computational cost and lack of scalability. To overcome these limitations, this work proposes a scalable distributed vehicle control method, where vehicles adjust their driving behaviors online in a distributed manner using only local instead of global information. We prove that the proposed distributed method using only local information is approximately equivalent to the one using global information, which enables vehicles to evaluate their candidate states and make approximately optimal decisions in real time without pre-training and with natural adaptability to varying traffic conditions. Then, a distributed conflict resolution mechanism is further proposed to guarantee vehicles' safety by avoiding their decision conflicts, which eliminates the single-point-of-failure risk of centralized methods and provides deterministic safety guarantees that learned methods cannot offer. Simulation experiments based on real-world traffic datasets demonstrate that, compared with existing methods, the proposed method achieves faster decision-making and less impact on ordinary vehicles, and maintains much stronger scalability across different traffic densities and road configurations.

2603.24991 2026-03-27 cs.CV

Towards Video Anomaly Detection from Event Streams: A Baseline and Benchmark Datasets

Peng Wu, Yuting Yan, Guansong Pang, Yujia Sun, Qingsen Yan, Peng Wang, Yanning Zhang

2603.24981 2026-03-27 cs.CL

Exons-Detect: Identifying and Amplifying Exonic Tokens via Hidden-State Discrepancy for Robust AI-Generated Text Detection

Xiaowei Zhu, Yubing Ren, Fang Fang, Shi Wang, Yanan Cao, Li Guo

2603.24979 2026-03-27 cs.CL

LLM-Driven Reasoning for Constraint-Aware Feature Selection in Industrial Systems

Yuhang Zhou, Zhuokai Zhao, Ke Li, Spilios Evmorfos, Gökalp Demirci, Mingyi Wang, Qiao Liu, Qifei Wang, Serena Li, Weiwei Li, Tingting Wang, Mingze Gao, Gedi Zhou, Abhishek Kumar, Xiangjun Fan, Lizhu Zhang, Jiayi Liu

Comments 11 pages, 2 tables

2603.24969 2026-03-27 cs.CV

PASDiff: Physics-Aware Semantic Guidance for Joint Real-world Low-Light Face Enhancement and Restoration

Yilin Ni, Wenjie Li, Zhengxue Wang, Juncheng Li, Guangwei Gao, Jian Yang

2603.24967 2026-03-27 cs.AI

The Anatomy of Uncertainty in LLMs

Aditya Taparia, Ransalu Senanayake, Kowshik Thopalli, Vivek Narayanaswamy

Comments 10 pages, 6 figures

2603.24965 2026-03-27 cs.CV cs.AI

Self-Corrected Image Generation with Explainable Latent Rewards

Yinyi Luo, Hrishikesh Gokhale, Marios Savvides, Jindong Wang, Shengfeng He

Comments CVPR 2026

2603.24961 2026-03-27 cs.AI cs.CL cs.CV

Can MLLMs Read Students' Minds? Unpacking Multimodal Error Analysis in Handwritten Math

Dingjie Song, Tianlong Xu, Yi-Fan Zhang, Hang Li, Zhiling Yan, Xing Fan, Haoyang Li, Lichao Sun, Qingsong Wen

Comments Accepted by the 27th International Conference on Artificial Intelligence in Education (AIED'26)

2603.24955 2026-03-27 cs.CL cs.AI

Toward domain-specific machine translation and quality estimation systems

Javad Pourmostafa Roshan Sharami

Comments PhD Dissertation