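arXivDaily daily academic digest: synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems science, and more.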

2603.01162 2026-03-24 cs.LG stat.ML

Demystifying Group Relative Policy Optimization: Its Policy Gradient is a U-Statistic

Hongyi Zhou, Kai Ye, Erhan Xu, Jin Zhu, Ying Yang, Shijin Gong, Chengchun Shi

Comments 5 pages, 53 figures

English abstract

Group relative policy optimization (GRPO), a core methodological component of DeepSeekMath and DeepSeek-R1, has emerged as a cornerstone for scaling reasoning capabilities of large language models. Despite its widespread adoption and the proliferation of follow-up works, the theoretical properties of GRPO remain less studied. This paper provides a unified framework to understand GRPO through the lens of classical U-statistics. We demonstrate that the GRPO policy gradient is inherently a U-statistic, allowing us to characterize its mean squared error (MSE), derive the finite-sample error bound and asymptotic distribution of the suboptimality gap for its learned policy. Our findings reveal that GRPO is asymptotically equivalent to an oracle policy gradient algorithm -- one with access to a value function that quantifies the goodness of its learning policy at each training iteration -- and achieves asymptotically optimal performance within a broad class of policy gradient algorithms. Furthermore, we establish a universal scaling law that offers principled guidance for selecting the optimal group size. Empirical experiments further validate our theoretical findings, demonstrating that the optimal group size is universal, and verify the oracle property of GRPO.
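The U-statistic view mentioned in the abstract can be illustrated with a small numerical check. The sketch below is not taken from the paper: it assumes a simplified GRPO-style estimator that uses only the group-mean baseline (no standard-deviation normalization, clipping, or KL term), and the function names `grpo_style_gradient` and `pairwise_u_statistic` are hypothetical. Under those assumptions, the mean-baselined gradient equals (G-1)/G times an order-2 U-statistic with the symmetric kernel 0.5 * (r_i - r_j) * (g_i - g_j), where r_i is a reward and g_i a score-function gradient.

```python
import random


def grpo_style_gradient(rewards, score_grads):
    """Average of (reward - group mean) * score gradient over one group.

    Simplified on purpose: no std normalization, clipping, or KL penalty.
    """
    g = len(rewards)
    mean_r = sum(rewards) / g
    dim = len(score_grads[0])
    out = [0.0] * dim
    for r, grad in zip(rewards, score_grads):
        for k in range(dim):
            out[k] += (r - mean_r) * grad[k] / g
    return out


def pairwise_u_statistic(rewards, score_grads):
    """Order-2 U-statistic with kernel 0.5 * (r_i - r_j) * (g_i - g_j)."""
    g = len(rewards)
    dim = len(score_grads[0])
    out = [0.0] * dim
    for i in range(g):
        for j in range(g):
            if i == j:
                continue
            for k in range(dim):
                out[k] += 0.5 * (rewards[i] - rewards[j]) * (
                    score_grads[i][k] - score_grads[j][k]
                )
    n_ordered_pairs = g * (g - 1)
    return [v / n_ordered_pairs for v in out]


if __name__ == "__main__":
    random.seed(0)
    group_size, dim = 6, 3  # arbitrary illustrative sizes
    rewards = [random.random() for _ in range(group_size)]
    grads = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(group_size)]
    lhs = grpo_style_gradient(rewards, grads)
    rhs = pairwise_u_statistic(rewards, grads)
    scale = (group_size - 1) / group_size
    # The two estimators agree up to the constant factor (G - 1) / G.
    print(all(abs(a - scale * b) < 1e-12 for a, b in zip(lhs, rhs)))
```

The identity behind the check is that subtracting the group mean from each reward is the same as averaging the pairwise differences r_i - r_j, which is what turns the estimator into an average over pairs of samples.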

2603.00919 2026-03-24 cs.CV cs.RO

DriveCode: Domain Specific Numerical Encoding for LLM-Based Autonomous Driving

Zhiye Wang, Yanbo Jiang, Rui Zhou, Bo Zhang, Fang Zhang, Zhenhua Xu, Yaqin Zhang, Jianqiang Wang

Comments The project page is available at https://shiftwilliam.github.io/DriveCode

2603.00431 2026-03-24 cs.CV cs.AI

Taxonomy-Aware Representation Alignment for Hierarchical Visual Recognition with Large Multimodal Models

Hulingxiao He, Zhi Tan, Yuxin Peng

Comments Published as a conference paper at CVPR 2026

2602.23306 2026-03-24 cs.CV

ThinkOmni: Lifting Textual Reasoning to Omni-modal Scenarios via Guidance Decoding

Yiran Guan, Sifan Tu, Dingkang Liang, Linghao Zhu, Jianzhong Ju, Zhenbo Luo, Jian Luan, Yuliang Liu, Xiang Bai

Comments Accepted by ICLR 2026, Code: https://github.com/1ranGuan/thinkomni

2602.22271 2026-03-24 cs.LG math.PR math.ST stat.TH

Support Tokens, Stability Margins, and a New Foundation for Robust LLMs

Deepak Agarwal, Dhyey Dharmendrakumar Mavani, Suyash Gupta, Karthik Sethuraman, Tejas Dharamsi

Comments 45 pages, 9 figures

2602.21499 2026-03-24 cs.CV

Easy3E: Feed-Forward 3D Asset Editing via Rectified Voxel Flow

Shimin Hu, Yuanyi Wei, Fei Zha, Yudong Guo, Juyong Zhang

Comments CVPR 2026, Project Page: https://ustc3dv.github.io/Easy3E/

2602.18922 2026-03-24 cs.CL cs.AI cs.LG

Why Agent Caching Fails and How to Fix It: Structured Intent Canonicalization with Few-Shot Learning

Abhinaba Basu

Comments Added github repo and Hugging Face dataset link

2602.16127 2026-03-24 cs.RO cs.SY eess.SY

Reactive Slip Control in Multifingered Grasping: Hybrid Tactile Sensing and Internal-Force Optimization

Théo Ayral, Saifeddine Aloui, Mathieu Grossard

Comments Accepted to IEEE International Conference on Robotics and Automation (ICRA), 2026

2602.15856 2026-03-24 cs.CL cs.AI cs.IR

Rethinking Soft Compression in Retrieval-Augmented Generation: A Query-Conditioned Selector Perspective

Yunhao Liu, Zian Jia, Xinyu Gao, Kanjun Xu, Yun Xiong

Comments Accepted by WWW 2026

2602.15677 2026-03-24 cs.LG q-bio.QM

CAMEL: An ECG Language Model for Forecasting Cardiac Events

Neelay Velingker, Alaia Solko-Breslin, Mayank Keoliya, Seewon Choi, Jiayi Xin, Anika Marathe, Alireza Oraii, Rajat Deo, Sameed Khatana, Rajeev Alur, Mayur Naik, Eric Wong

Comments 24 pages, 6 figures

2602.13091 2026-03-24 cs.CV

BAAF: Universal Transformation of One-Class Classifiers for Unsupervised Image Anomaly Detection

Declan McIntosh, Alexandra Branzan Albu

Comments 6 figures, 14 pages main paper, 25 pages total with supplemental

2602.07446 2026-03-24 cs.CV

PTB-XL-Image-17K: A Large-Scale Synthetic ECG Image Dataset with Comprehensive Ground Truth for Deep Learning-Based Digitization

Naqcho Ali Mehdi, Aamir Ali Drigh

Comments 8 pages, 4 figures, dataset paper

2602.07077 2026-03-24 cs.SD cs.AI

CALM: Class-Conditional Sparse Attention Vectors for Large Audio-Language Models

Videet Mehta, Liming Wang, Hilde Kuehne, Rogerio Feris, James R. Glass, M. Jehanzeb Mirza

Comments 11 pages, 6 figures

2602.04577 2026-03-24 cs.CL

Semantic Self-Distillation for Language Model Uncertainty

Edward Phillips, Sean Wu, Fredrik K. Gustafsson, Boyan Gao, David A. Clifton

Comments Added experiments on MMLU dataset, investigating utility of likelihood for multiple-choice answer selection

2602.03773 2026-03-24 cs.LG

Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL

Ian Wu, Yuxiao Qu, Amrith Setlur, Aviral Kumar

Comments preprint v2; revised 2026-03-22 (updated IMO-AnswerBench results)

2602.02223 2026-03-24 cs.CV

Evaluating OCR Performance for Assistive Technology: Effects of Walking Speed, Camera Placement, and Camera Type

Junchi Feng, Nikhil Ballem, Mahya Beheshti, Giles Hamilton-Fletcher, Todd Hudson, Maurizio Porfiri, William H. Seiple, John-Ross Rizzo

2601.21998 2026-03-24 cs.CV cs.RO

Causal World Modeling for Robot Control

Lin Li, Qihang Zhang, Yiming Luo, Shuai Yang, Ruilin Wang, Fei Han, Mingrui Yu, Zelin Gao, Nan Xue, Xing Zhu, Yujun Shen, Yinghao Xu

Comments Project page: https://technology.robbyant.com/lingbot-va Code: https://github.com/robbyant/lingbot-va

2601.20009 2026-03-24 cs.CL cs.AI cs.LG

LinguaMap: Which Layers of LLMs Speak Your Language and How to Tune Them?

J. Ben Tamo, Daniel Carlander-Reuterfelt, Jonathan Rubin, Dezhi Hong, Mingxian Wang, Oleg Poliannikov

2601.18486 2026-03-24 cs.CL cs.CY

Different Demographic Cues Yield Inconsistent Conclusions About LLM Personalization and Bias

Manuel Tonneau, Neil K. R. Seghal, Niyati Malhotra, Sharif Kazemi, Victor Orozco-Olvera, Ana María Muñoz Boudet, Lakshmi Subramanian, Samuel P. Fraiberger, Sharath Chandra Guntuku, Valentin Hofmann

2601.11930 2026-03-24 cs.CV

SupScene: Scene-Structured Overlap Supervision for Image Retrieval in Unconstrained SfM

Xulei Shi, Maoyu Wang, Yuning Peng, Guanbo Wang, Xin Wang, Yifan Liao, Qi Chen, Pengjie Tao

2601.10744 2026-03-24 cs.AI cs.CV

Explore with Long-term Memory: A Benchmark and Multimodal LLM-based Reinforcement Learning Framework for Embodied Exploration

Sen Wang, Bangwei Liu, Zhenkun Gao, Lizhuang Ma, Xuhong Wang, Yuan Xie, Xin Tan

Comments Accepted by CVPR 2026

2601.10679 2026-03-24 cs.AI cs.LG

Are Your Reasoning Models Reasoning or Guessing? A Mechanistic Analysis of Hierarchical Reasoning Models

Zirui Ren, Ziming Liu

2601.09237 2026-03-24 cs.LG

XLinear: A Lightweight and Accurate MLP-Based Model for Long-Term Time Series Forecasting with Exogenous Inputs

Xinyang Chen, Huidong Jin, Yu Huang, Zaiwen Feng

Comments Accepted by AAAI 2026

Details

DOI: 10.1609/aaai.v40i24.39121
Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 40, No. 24, 2026

English abstract

Despite the prevalent assumption of uniform variable importance in long-term time series forecasting models, real world applications often exhibit asymmetric causal relationships and varying data acquisition costs. Specifically, cost-effective exogenous data (e.g., local weather) can unilaterally influence dynamics of endogenous variables, such as lake surface temperature. Exploiting these links enables more effective forecasts when exogenous inputs are readily available. Transformer-based models capture long-range dependencies but incur high computation and suffer from permutation invariance. Patch-based variants improve efficiency yet can miss local temporal patterns. To efficiently exploit informative signals across both the temporal dimension and relevant exogenous variables, this study proposes XLinear, a lightweight time series forecasting model built upon MultiLayer Perceptrons (MLPs). XLinear uses a global token derived from an endogenous variable as a pivotal hub for interacting with exogenous variables, and employs MLPs with sigmoid activation to extract both temporal patterns and variate-wise dependencies. Its prediction head then integrates these signals to forecast the endogenous series. We evaluate XLinear on seven standard benchmarks and five real-world datasets with exogenous inputs. Compared with state-of-the-art models, XLinear delivers superior accuracy and efficiency for both multivariate forecasts and univariate forecasts influenced by exogenous inputs.

2601.07242 2026-03-24 cs.RO cs.CV

HERE: Hierarchical Active Exploration of Radiance Field with Epistemic Uncertainty Minimization

Taekbeom Lee, Dabin Kim, Youngseok Jang, H. Jin Kim

Comments Accepted to IEEE RA-L. The first two authors contributed equally

2601.05237 2026-03-24 cs.CV

ObjectForesight: Predicting Future 3D Object Trajectories from Human Videos

Rustin Soraki, Homanga Bharadhwaj, Ali Farhadi, Roozbeh Mottaghi

Comments Preprint. Project Website: objectforesight.github.io

2601.05175 2026-03-24 cs.CV

VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice

Shuming Liu, Mingchen Zhuge, Changsheng Zhao, Jun Chen, Lemeng Wu, Zechun Liu, Chenchen Zhu, Zhipeng Cai, Chong Zhou, Haozhe Liu, Ernie Chang, Saksham Suri, Hongyu Xu, Qi Qian, Wei Wen, Balakrishnan Varadarajan, Zhuang Liu, Hu Xu, Florian Bordes, Raghuraman Krishnamoorthi, Bernard Ghanem, Vikas Chandra, Yunyang Xiong

Comments Accepted to CVPR 2026. Project page: https://ivul-kaust.github.io/projects/videoauto-r1/

2601.05105 2026-03-24 cs.CV

UniLiPs: Unified LiDAR Pseudo-Labeling with Geometry-Grounded Dynamic Scene Decomposition

Filippo Ghilotti, Samuel Brucker, Nahku Saidy, Matteo Matteucci, Mario Bijelic, Felix Heide

2601.02763 2026-03-24 cs.CV

ClearAIR: A Human-Visual-Perception-Inspired All-in-One Image Restoration

Xu Zhang, Huan Zhang, Guoli Wang, Qian Zhang, Lefei Zhang

Comments Accepted to AAAI 2026. Project page: https://github.com/House-yuyu/ClearAIR

2601.01781 2026-03-24 cs.CV cs.AI cs.LG

Subimage Overlap Prediction: Task-Aligned Self-Supervised Pretraining For Semantic Segmentation In Remote Sensing Imagery

Lakshay Sharma, Alex Marin

Comments Accepted at CV4EO Workshop at WACV 2026

2601.01003 2026-03-24 cs.LG cs.RO

Contractive Diffusion Policies: Robust Action Diffusion via Contractive Score-Based Sampling with Differential Equations

Amin Abyaneh, Charlotte Morissette, Mohamad H. Danesh, Anas El Houssaini, David Meger, Gregory Dudek, Hsiu-Chin Lin

Comments Published as a conference paper at ICLR 2026