arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

We introduce Leap+Verify, a framework that applies speculative execution -- predicting future model weights and validating predictions before acceptance -- to accelerate neural network training. Inspired by speculative decoding in language model inference and by the Automatically Scalable Computation (ASC) architecture for program execution, Leap+Verify decomposes training into three dynamically detected regimes (chaotic, transition, stable) using activation-space cosine similarity as a real-time Lyapunov proxy signal. Within each regime, analytic weight predictors (momentum, linear, quadratic extrapolation) attempt to forecast model parameters K training steps ahead; predictions are accepted only when validated against a held-out loss criterion. We evaluate Leap+Verify on GPT-2 124M and Qwen 2.5-1.5B trained on WikiText-103 across five random seeds, sweeping prediction depth K in {5, 10, 25, 50, 75, 100}. Momentum-based prediction (Adam moment extrapolation) fails catastrophically at both scales, with predicted losses exceeding actuals by 100-10,000x -- a universal norm explosion in optimizer-state extrapolation. Finite-difference predictors (linear, quadratic) succeed where momentum fails: at 124M, they achieve 24% strict acceptance at K=5 in stable regimes; at 1.5B, they achieve 37% strict acceptance in transition regimes. The scale-dependent finding is in regime distribution: GPT-2 124M spends 34% of training in stable regime, while Qwen 1.5B spends 64% in chaotic regime and reaches stable in only 0-2 of 40 checkpoints. Larger models are more predictable when predictable, but less often predictable -- the practical bottleneck shifts from predictor accuracy to regime availability. Cross-seed results are highly consistent (less than 1% validation loss variance), and the three-regime framework produces identical phase boundaries (plus or minus 50 steps) across seeds.

URL PDF HTML ☆

赞 0 踩 0

2602.19571 2026-02-24 cs.CV

HOCA-Bench: Beyond Semantic Perception to Predictive World Modeling via Hegelian Ontological-Causal Anomalies

Chang Liu, Yunfan Ye, Qingyang Zhou, Xichen Tan, Mengxuan Luo, Zhenyu Qiu, Wei Peng, Zhiping Cai

2602.19569 2026-02-24 cs.CL cs.AI

Temporal-Aware Heterogeneous Graph Reasoning with Multi-View Fusion for Temporal Question Answering

Wuzhenghong Wen, Bowen Zhou, Jinwen Huang, Xianjie Wu, Yuwei Sun, Su Pan, Liang Li, Jianting Liu

Comments 6pages

2602.19562 2026-02-24 cs.AI cs.CV

A Multimodal Framework for Aligning Human Linguistic Descriptions with Visual Perceptual Data

Joseph Bingham

Comments 19 Pages, 6 figures, preprint

2602.19552 2026-02-24 cs.LG cs.CC cs.DS

The Sample Complexity of Replicable Realizable PAC Learning

Kasper Green Larsen, Markus Engelund Mathiasen, Chirag Pabbaraju, Clement Svendsen

2602.19548 2026-02-24 cs.CL cs.LG

Beyond a Single Extractor: Re-thinking HTML-to-Text Extraction for LLM Pretraining

Jeffrey Li, Josh Gardner, Doug Kang, Fangping Shi, Karanjeet Singh, Chun-Liang Li, Herumb Shandilya, David Hall, Oncel Tuzel, Percy Liang, Ludwig Schmidt, Hadi Pour Ansari, Fartash Faghri

2602.19543 2026-02-24 cs.CL cs.IR

Hyper-KGGen: A Skill-Driven Knowledge Extractor for High-Quality Knowledge Hypergraph Generation

Rizhuo Huang, Yifan Feng, Rundong Xue, Shihui Ying, Jun-Hai Yong, Chuan Shi, Shaoyi Du, Yue Gao

2602.19542 2026-02-24 cs.CV

Vinedresser3D: Agentic Text-guided 3D Editing

Yankuan Chi, Xiang Li, Zixuan Huang, James M. Rehg

Comments CVPR 2026, Project website:https://vinedresser3d.github.io/

2602.19540 2026-02-24 cs.CV cs.AI

A Green Learning Approach to LDCT Image Restoration

Wei Wang, Yixing Wu, C. -C. Jay Kuo

Comments Published in IEEE International Conference on Image Processing (ICIP), 2025, pp. 1762-1767. Final version available at IEEE Xplore

Journal ref Proceedings of the IEEE International Conference on Image Processing (ICIP), 2025, pp. 1762-1767

2602.19539 2026-02-24 cs.CV cs.CR cs.LG

Can a Teenager Fool an AI? Evaluating Low-Cost Cosmetic Attacks on Age Estimation Systems

Xingyu Shen, Tommy Duong, Xiaodong An, Zengqi Zhao, Zebang Hu, Haoyu Hu, Ziyou Wang, Finn Guo, Simiao Ren

Comments 13 pages, 6 figures

2602.19538 2026-02-24 cs.RO cs.AI cs.LG

Cost-Aware Diffusion Active Search

Arundhati Banerjee, Jeff Schneider

Comments In submission

2602.19536 2026-02-24 cs.CV cs.AI

Fore-Mamba3D: Mamba-based Foreground-Enhanced Encoding for 3D Object Detection

Zhiwei Ning, Xuanang Gao, Jiaxi Cao, Runze Yang, Huiying Xu, Xinzhong Zhu, Jie Yang, Wei Liu

2602.19528 2026-02-24 cs.LG stat.ML

Beyond Accuracy: A Unified Random Matrix Theory Diagnostic Framework for Crash Classification Models

Ibne Farabi Shihab, Sanjeda Akter, Anuj Sharma

2602.19526 2026-02-24 cs.CL

How to Train Your Deep Research Agent? Prompt, Reward, and Policy Optimization in Search-R1

Yinuo Xu, Shuo Lu, Jianjie Cheng, Meng Wang, Qianlong Xie, Xingxing Wang, Ran He, Jian Liang

2602.19523 2026-02-24 cs.CV

OSInsert: Towards High-authenticity and High-fidelity Image Composition

Jingyuan Wang, Li Niu

2602.19519 2026-02-24 cs.AI cs.LG

Ada-RS: Adaptive Rejection Sampling for Selective Thinking

Yirou Ge, Yixi Li, Alec Chiu, Shivani Shekhar, Zijie Pan, Avinash Thangali, Yun-Shiuan Chuang, Chaitanya Kulkarni, Uma Kona, Linsey Pang, Prakhar Mehrotra

2602.19518 2026-02-24 cs.RO

Anticipate, Adapt, Act: A Hybrid Framework for Task Planning

Nabanita Dash, Ayush Kaura, Shivam Singh, Ramandeep Singh, Snehasis Banerjee, Mohan Sridharan, K. Madhava Krishna

Comments Accepted at IEEE European Conference on Mobile Robots (ECMR)

2602.19512 2026-02-24 cs.LG cs.CV

Variational Trajectory Optimization of Anisotropic Diffusion Schedules

Pengxi Liu, Zeyu Michael Li, Xiang Cheng

2602.19510 2026-02-24 cs.LG math.OC stat.ML

Less is More: Convergence Benefits of Fewer Data Weight Updates over Longer Horizon

Rudrajit Das, Neel Patel, Meisam Razaviyayn, Vahab Mirrokni

2602.19506 2026-02-24 cs.CV cs.LG

Relational Feature Caching for Accelerating Diffusion Transformers

Byunggwan Son, Jeimin Jeon, Jeongwoo Choi, Bumsub Ham

Comments Accepted to ICLR 2026

2602.19505 2026-02-24 cs.CV

Test-Time Computing for Referring Multimodal Large Language Models

Mingrui Wu, Hao Chen, Jiayi Ji, Xiaoshuai Sun, Zhiyuan Liu, Liujuan Cao, Ming-Ming Cheng, Rongrong Ji

Comments arXiv admin note: substantial text overlap with arXiv:2407.21534

2602.19503 2026-02-24 cs.CV

A Text-Guided Vision Model for Enhanced Recognition of Small Instances

Hyun-Ki Jung

Comments Accepted for publication in Applied Computer Science (2026)

Journal ref Applied Computer Science, Vol. 22, No. 1, 2026

2602.19498 2026-02-24 cs.LG cs.AI

Softmax is not Enough (for Adaptive Conformal Classification)

Navid Akhavan Attar, Hesam Asadollahzadeh, Ling Luo, Uwe Aickelin

2602.19497 2026-02-24 cs.CV

MICON-Bench: Benchmarking and Enhancing Multi-Image Context Image Generation in Unified Multimodal Models

Mingrui Wu, Hang Liu, Jiayi Ji, Xiaoshuai Sun, Rongrong Ji

Comments CVPR2026

2602.19491 2026-02-24 cs.RO cs.AI cs.HC

Botson: An Accessible and Low-Cost Platform for Social Robotics Research

Samuel Bellaire, Abdalmalek Abu-raddaha, Natalie Kim, Nathan Morhan, William Elliott, Samir Rawashdeh

Comments 5 pages, 7 figures

2602.19471 2026-02-24 cs.CV

Forgetting-Resistant and Lesion-Aware Source-Free Domain Adaptive Fundus Image Analysis with Vision-Language Model

Zheang Huai, Hui Tang, Hualiang Wang, Xiaomeng Li

Comments 10 pages

2602.19461 2026-02-24 cs.CV cs.LG

Laplacian Multi-scale Flow Matching for Generative Modeling

Zelin Zhao, Petr Molodyk, Haotian Xue, Yongxin Chen

Comments Accepted to appear in ICLR 2026

2602.19458 2026-02-24 cs.AI cs.HC

ComplLLM: Fine-tuning LLMs to Discover Complementary Signals for Decision-making

Ziyang Guo, Yifan Wu, Jason Hartline, Kenneth Holstein, Jessica Hullman

2602.19455 2026-02-24 cs.LG cs.AI cs.CL stat.ML

SenTSR-Bench: Thinking with Injected Knowledge for Time-Series Reasoning

Zelin He, Boran Han, Xiyuan Zhang, Shuai Zhang, Haotian Lin, Qi Zhu, Haoyang Fang, Danielle C. Maddix, Abdul Fatir Ansari, Akash Chandrayan, Abhinav Pradhan, Bernie Wang, Matthew Reimherr

Comments Accepted by the 29th International Conference on Artificial Intelligence and Statistics (AISTATS 2026)

2602.19454 2026-02-24 cs.CV

HD-TTA: Hypothesis-Driven Test-Time Adaptation for Safer Brain Tumor Segmentation

Kartik Jhawar, Lipo Wang

Comments 11 pages, 3 figures, 2 tables