arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

Shiyu Wang, Haolin Chen, Liangwei Yang, Jielin Qiu, Rithesh Murthy, Ming Zhu, Zixiang Chen, Silvio Savarese, Caiming Xiong, Shelby Heinecke, Huan Wang

2602.18447 2026-02-24 cs.CL cs.AI

ConfSpec: Efficient Step-Level Speculative Reasoning via Confidence-Gated Verification

Siran Liu, Cyril Y. He

2602.18446 2026-02-24 cs.CL cs.AI

ReportLogic: Evaluating Logical Quality in Deep Research Reports

Jujia Zhao, Zhaoxin Huan, Zihan Wang, Xiaolu Zhang, Jun Zhou, Suzan Verberne, Zhaochun Ren

2602.17155 2026-02-24 cs.LG

Powering Up Zeroth-Order Training via Subspace Gradient Orthogonalization

Yicheng Lang, Changsheng Wang, Yihua Zhang, Mingyi Hong, Zheng Zhang, Wotao Yin, Sijia Liu

2602.17048 2026-02-24 cs.CV

StructCore: Structure-Aware Image-Level Scoring for Training-Free Unsupervised Anomaly Detection

Joongwon Chae, Lihui Luo, Yang Liu, Runming Wang, Dongmei Yu, Zeming Liang, Xi Yuan, Dayan Zhang, Zhenglin Chen, Peiwu Qin, Ilmoon Chae

2602.17009 2026-02-24 cs.LG

Action-Graph Policies: Learning Action Co-dependencies in Multi-Agent Reinforcement Learning

Nikunj Gupta, James Zachary Hare, Jesse Milzman, Rajgopal Kannan, Viktor Prasanna

2602.16412 2026-02-24 cs.CV

ReMoRa: Multimodal Large Language Model based on Refined Motion Representation for Long-Video Understanding

Daichi Yashima, Shuhei Kurita, Yusuke Oda, Komei Sugiura

Comments Accepted to CVPR 2026

2602.16336 2026-02-24 cs.LG cs.AI

HAWX: A Hardware-Aware FrameWork for Fast and Scalable ApproXimation of DNNs

Samira Nazari, Mohammad Saeed Almasi, Mahdi Taheri, Ali Azarpeyvand, Ali Mokhtari, Ali Mahani, Christian Herglotz

2602.15950 2026-02-24 cs.CV cs.LG

Can Vision-Language Models See Squares? Text-Recognition Mediates Spatial Reasoning Across Three Model Families

Yuval Levental

Comments 9 pages, 3 figures, 2 tables. Workshop-length paper

2602.13870 2026-02-24 cs.CL

ADAB: Arabic Dataset for Automated Politeness Benchmarking -- A Large-Scale Resource for Computational Sociopragmatics

Hend Al-Khalifa, Nadia Ghezaiel, Maria Bounnit, Hend Hamed Alhazmi, Noof Abdullah Alfear, Reem Fahad Alqifari, Ameera Masoud Almasoud, Sharefah Al-Ghamdi

Comments Paper accepted @ The Fifteenth biennial Language Resources and Evaluation Conference (LREC2026)

2602.12268 2026-02-24 cs.AI

CM2: Reinforcement Learning with Checklist Rewards for Multi-Turn and Multi-Step Agentic Tool Use

Zhen Zhang, Kaiqiang Song, Xun Wang, Yebowen Hu, Weixiang Yan, Chenyang Zhao, Henry Peng Zou, Haoyun Deng, Sathish Reddy Indurthi, Shujian Liu, Simin Ma, Xiaoyang Wang, Xin Eric Wang, Song Wang

2602.10116 2026-02-24 cs.CV cs.RO

SAGE: Scalable Agentic 3D Scene Generation for Embodied AI

Hongchi Xia, Xuan Li, Zhaoshuo Li, Qianli Ma, Jiashu Xu, Ming-Yu Liu, Yin Cui, Tsung-Yi Lin, Wei-Chiu Ma, Shenlong Wang, Shuran Song, Fangyin Wei

Comments Project Page: https://research.nvidia.com/labs/dir/sage/

2602.09515 2026-02-24 cs.CV

Energy-Efficient Fast Object Detection on Edge Devices for IoT Systems

Mas Nurul Achmadiah, Afaroj Ahamad, Chi-Chia Sun, Wen-Kai Kuo

Comments 14 pages, 12 figures

Journal ref IEEE Internet of Things Journal, vol. 12, no. 11, pp. 16681-16693, June 2025

2602.09084 2026-02-24 cs.CV

Agent Banana: High-Fidelity Image Editing with Agentic Thinking and Tooling

Ruijie Ye, Jiayi Zhang, Zhuoxin Liu, Zihao Zhu, Siyuan Yang, Li Li, Tianfu Fu, Franck Dernoncourt, Yue Zhao, Jiacheng Zhu, Ryan Rossi, Wenhao Chai, Zhengzhong Tu

Comments Project Website: agent-banana.github.io

2602.07854 2026-02-24 cs.CV

Geometry-Aware Rotary Position Embedding for Consistent Video World Model

Chendong Xiang, Jiajun Liu, Jintao Zhang, Xiao Yang, Zhengwei Fang, Shizun Wang, Zijun Wang, Yingtian Zou, Hang Su, Jun Zhu

2602.07135 2026-02-24 cs.LG cs.AI

Landscaper: Understanding Loss Landscapes Through Multi-Dimensional Topological Analysis

Jiaqing Chen, Nicholas Hadler, Tiankai Xie, Rostyslav Hnatyshyn, Caleb Geniesse, Yaoqing Yang, Michael W. Mahoney, Talita Perciano, John F. Hartwig, Ross Maciejewski, Gunther H. Weber

2602.05495 2026-02-24 cs.CL cs.AI

Transport and Merge: Cross-Architecture Merging for Large Language Models

Chenhang Cui, Binyun Yang, Fei Shen, Yuxin Chen, Jingnan Zheng, Xiang Wang, An Zhang, Tat-Seng Chua

2602.03003 2026-02-24 cs.AI cs.LG

Open Problems in Differentiable Social Choice: Learning Mechanisms, Decisions, and Alignment

Zhiyu An, Wan Du

2602.01696 2026-02-24 cs.CV cs.AI

Cross-Modal Purification and Fusion for Small-Object RGB-D Transmission-Line Defect Detection

Jiaming Cui, Wenqiang Li, Shuai Zhou, Ruifeng Qin, Feng Shen

2601.21363 2026-02-24 cs.RO

Towards Bridging the Gap between Large-Scale Pretraining and Efficient Finetuning for Humanoid Control

Weidong Huang, Zhehan Li, Hangxin Liu, Biao Hou, Yao Su, Jingwen Zhang

Comments ICLR 2026

2601.20369 2026-02-24 cs.CV

RepSFNet : A Single Fusion Network with Structural Reparameterization for Crowd Counting

Mas Nurul Achmadiah, Chi-Chia Sun, Wen-Kai Kuo, Jun-Wei Hsieh

Comments 6 pages. Published in Proceedings of the IEEE International Conference on Advanced Video and Signal-Based Surveillance (AVSS) 2025

Journal ref Proceedings of the IEEE International Conference on Advanced Visual and Signal-Based Systems (AVSS), pp. 1-6, 2025

2601.18936 2026-02-24 cs.LG

Bi-Level Online Provisioning and Scheduling with Switching Costs and Cross-Level Constraints

Jialei Liu, C. Emre Koksal, Ming Shi

2601.18650 2026-02-24 cs.LG cs.AI

FaLW: A Forgetting-aware Loss Reweighting for Long-tailed Unlearning

Liheng Yu, Zhe Zhao, Yuxuan Wang, Pengkun Wang, Xiaofeng Cao, Binwu Wang, Yang Wang

Comments camera-ready for iclr2026

2601.11231 2026-02-24 cs.RO cs.SY eess.SY

Adaptive Monitoring of Stochastic Fire Front Processes via Information-seeking Predictive Control

Savvas Papaioannou, Panayiotis Kolios, Christos G. Panayiotou, Marios M. Polycarpou

Comments 2025 IEEE 64th Conference on Decision and Control (CDC)

2601.11036 2026-02-24 cs.LG

Self-Augmented Mixture-of-Experts for QoS Prediction

Kecheng Cai, Chao Peng, Chenyang Xu, Xia Chen, Yi Wang, Shuo Shi, Qiyuan Liang

Comments There was an error in the test dataset leakage, leading to an inaccurate improvement magnitude. However, the method and framework remain valid. The paper and data will be revised and resubmitted

2601.04205 2026-02-24 cs.CL cs.AI

STaRR: Spatial-Temporal Token-Dynamics-Aware Responsive Remasking for Diffusion Language Models

Xinhao Sun, Huaijin Zhao, Maoliang Li, Zihao Zheng, Jiayu Chen, Yun Liang, Xiang Chen

2601.01678 2026-02-24 cs.LG

HeurekaBench: A Benchmarking Framework for AI Co-scientist

Siba Smarak Panigrahi, Jovana Videnović, Maria Brbić

Comments 33 pages, 5 figures, 7 tables. Code available at https://github.com/mlbio-epfl/HeurekaBench. Accepted to ICLR 2026

2512.20908 2026-02-24 cs.CL

Where Did This Sentence Come From? Tracing Provenance in LLM Reasoning Distillation

Kaiyuan Liu, Shaotian Yan, Rui Miao, Bing Wang, Chen Shen, Jun Zhang, Jieping Ye

详情

英文摘要

Reasoning distillation has attracted increasing attention. It typically leverages a large teacher model to generate reasoning paths, which are then used to fine-tune a student model so that it mimics the teacher's behavior in training contexts. However, previous approaches have lacked a detailed analysis of the origins of the distilled model's capabilities. It remains unclear whether the student can maintain consistent behaviors with the teacher in novel test-time contexts, or whether it regresses to its original output patterns, raising concerns about the generalization of distillation models. To analyse this question, we introduce a cross-model Reasoning Distillation Provenance Tracing framework. For each action (e.g., a sentence) produced by the distilled model, we obtain the predictive probabilities assigned by the teacher, the original student, and the distilled model under the same context. By comparing these probabilities, we classify each action into different categories. By systematically disentangling the provenance of each action, we experimentally demonstrate that, in test-time contexts, the distilled model can indeed generate teacher-originated actions, which correlate with and plausibly explain observed performance on distilled model. Building on this analysis, we further propose a teacher-guided data selection method. Unlike prior approach that rely on heuristics, our method directly compares teacher-student divergences on the training data, providing a principled selection criterion. We validate the effectiveness of our approach across multiple representative teacher models and diverse student models. The results highlight the utility of our provenance-tracing framework and underscore its promise for reasoning distillation. We hope to share Reasoning Distillation Provenance Tracing and our insights into reasoning distillation with the community.

URL PDF HTML ☆

赞 0 踩 0

2512.20363 2026-02-24 cs.LG cs.AI cs.DC stat.AP stat.ML

Clust-PSI-PFL: A Population Stability Index Approach for Clustered Non-IID Personalized Federated Learning

Daniel M. Jimenez-Gutierrez, Mehrdad Hassanzadeh, David Solans, Mohammed Elbamby, Nicolas Kourtellis, Aris Anagnostopoulos, Ioannis Chatzigiannakis, Andrea Vitaletti

Comments Accepted for publication to the 40th IEEE International Parallel & Distributed Processing Symposium (IPDPS 2026)

2512.12132 2026-02-24 cs.LG cs.NA math.NA

Approximation with SiLU Networks: Constant Depth and Exponential Rates for Basic Operations

Koffi O. Ayena

Comments 22 pages, 18 figures, submitted to the journal

AI 大模型

视觉与机器人

科学与医疗

Prompt Optimization Via Diffusion Language Models