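arXivDaily daily academic digest: synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.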

2602.03214 2026-03-09 cs.CV

FARTrack: Fast Autoregressive Visual Tracking with High Performance

Guijie Wang, Tong Lin, Yifan Bai, Anjia Cao, Shiyi Liang, Wangbo Zhao, Xing Wei

Abstract

Inference speed and tracking performance are two critical evaluation metrics in the field of visual tracking. However, high-performance trackers often suffer from slow processing speeds, making them impractical for deployment on resource-constrained devices. To alleviate this issue, we propose FARTrack, a Fast Auto-Regressive Tracking framework. Since autoregression emphasizes the temporal nature of the trajectory sequence, it can maintain high performance while achieving efficient execution across various devices. FARTrack introduces Task-Specific Self-Distillation and Inter-frame Autoregressive Sparsification, designed from the perspectives of shallow-yet-accurate distillation and redundant-to-essential token optimization, respectively. Task-Specific Self-Distillation achieves model compression by distilling task-specific tokens layer by layer, enhancing the model's inference speed while avoiding suboptimal manual assignment of teacher-student layer pairs. Meanwhile, Inter-frame Autoregressive Sparsification sequentially condenses multiple templates, avoiding additional runtime overhead while learning a temporally global optimal sparsification strategy. FARTrack demonstrates outstanding speed and competitive performance. It delivers an AO of 70.6% on GOT-10k in real time. In addition, our fastest model achieves a speed of 343 FPS on the GPU and 121 FPS on the CPU.

2602.01719 2026-03-09 cs.CL

COMI: Coarse-to-fine Context Compression via Marginal Information Gain

Jiwei Tang, Shilei Liu, Zhicheng Zhang, Yujin Yuan, Libin Zheng, Wenbo Su, Bo Zheng

Comments Accepted at ICLR 2026

2602.01288 2026-03-09 cs.LG

EDIS: Diagnosing LLM Reasoning via Entropy Dynamics

Chenghua Zhu, Siyan Wu, Xiangkang Zeng, Zishan Xu, Zhaolu Kang, Yifu Guo, Yuquan Lu, Junduan Huang, Guojing Zhou

Comments 16 pages, 12 figures

2601.22302 2026-03-09 cs.LG cs.CR cs.DC

ZK-HybridFL: Zero-Knowledge Proof-Enhanced Hybrid Ledger for Federated Learning

Amirhossein Taherpour, Xiaodong Wang

Comments Accepted for publication in IEEE Transactions on Neural Networks and Learning Systems (TNNLS)

2601.17830 2026-03-09 cs.CV

SRA 2: Variational Autoencoder Self-Representation Alignment for Efficient Diffusion Training

Mengmeng Wang, Dengyang Jiang, Liuzhuozheng Li, Yucheng Lin, Guojiang Shen, Xiangjie Kong, Yong Liu, Guang Dai, Jingdong Wang

2601.16538 2026-03-09 cs.CV

OnlineSI: Taming Large Language Model for Online 3D Understanding and Grounding

Zixian Liu, Zhaoxi Chen, Liang Pan, Ziwei Liu

Comments Project Page: https://onlinesi.github.io/

2601.15160 2026-03-09 cs.AI cs.CL

Knowledge Graphs are Implicit Reward Models: Path-Derived Signals Enable Compositional Reasoning

Yuval Kansal, Niraj K. Jha

2601.14895 2026-03-09 cs.CV cs.AI

SpatialMem: Metric-Aligned Long-Horizon Video Memory for Language Grounding and QA

Xinyi Zheng, Yunze Liu, Chi-Hao Wu, Fan Zhang, Hao Zheng, Wenqi Zhou, Walterio W. Mayol-Cuevas, Junxiao Shen

2601.13350 2026-03-09 cs.LG

Beyond Mapping: Domain-Invariant Representations via Spectral Embedding of Optimal Transport Plans

Abdel Djalil Sad Saoud, Fred Maurice Ngolè Mboula, Hanane Slimani

Comments Accepted at The IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2026)

2601.09648 2026-03-09 cs.CL

Creating a Hybrid Rule and Neural Network Based Semantic Tagger using Silver Standard Data: the PyMUSAS framework for Multilingual Semantic Annotation

Andrew Moore, Paul Rayson, Dawn Archer, Tim Czerniak, Dawn Knight, Daisy Lal, Gearóid Ó Donnchadha, Mícheál Ó Meachair, Scott Piao, Elaine Uí Dhonnchadha, Johanna Vuorinen, Yan Yabo, Xiaobin Yang

Comments 12 pages, 2 figures, accepted to LREC 2026

2601.05747 2026-03-09 cs.CV cs.RO

FlyPose: Towards Robust Human Pose Estimation From Aerial Views

Hassaan Farooq, Marvin Brenner, Peter Stütz

Comments 11 pages, 9 figures, IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2026

2601.03869 2026-03-09 cs.CV cs.GR cs.LG cs.RO

Bayesian Monocular Depth Refinement via Neural Radiance Fields

Arun Muthukkumar

Comments IEEE 8th International Conference on Algorithms, Computing and Artificial Intelligence (ACAI 2025)

2601.02751 2026-03-09 cs.CL cs.AI cs.CR

Window-based Membership Inference Attacks Against Fine-tuned Large Language Models

Yuetian Chen, Yuntao Du, Kaiyuan Zhang, Ashish Kundu, Charles Fleming, Bruno Ribeiro, Ninghui Li

Comments Accepted to USENIX Security 2026. This extended arXiv version includes complete experimental results. The source code is publicly available at: https://github.com/Stry233/WBC/

2512.22266 2026-03-09 cs.LG cs.AI

LLMTM: Benchmarking and Optimizing LLMs for Temporal Motif Analysis in Dynamic Graphs

Bing Hao, Minglai Shao, Zengyi Wo, Yunlong Chu, Yuhang Liu, Ruijie Wang

Comments Accepted to AAAI 2026

2512.14202 2026-03-09 cs.LG cs.AI

Understanding and Improving Hyperbolic Deep Reinforcement Learning

Timo Klein, Thomas Lang, Andrii Shkabrii, Alexander Sturm, Kevin Sidak, Lukas Miklautz, Claudia Plant, Yllka Velaj, Sebastian Tschiatschek

Comments ICLR 2026 Camera-ready

2512.08535 2026-03-09 cs.CV

Photo3D: Advancing Photorealistic 3D Generation through Structure-Aligned Detail Enhancement

Xinyue Liang, Zhinyuan Ma, Lingchen Sun, Yanjun Guo, Lei Zhang

2512.08445 2026-03-09 cs.CV cs.LG

Uncertainty-Aware Subset Selection for Robust Visual Explainability under Distribution Shifts

Madhav Gupta, Vishak Prasad C, Ganesh Ramakrishnan

Comments Accepted to the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2026

2512.06547 2026-03-09 cs.LG cs.AI cs.DC

A-3PO: Accelerating Asynchronous LLM Training with Staleness-aware Proximal Policy Approximation

Xiaocan Li, Shiliang Wu, Zheng Shen

2512.06306 2026-03-09 cs.CV cs.AI

Exploiting Spatiotemporal Properties for Efficient Event-Driven Human Pose Estimation

Haoxian Zhou, Chuanzhi Xu, Langyi Chen, Pengfei Ye, Haodong Chen, Yuk Ying Chung, Qiang Qu

2512.06261 2026-03-09 cs.RO

Safe Model Predictive Diffusion with Shielding

Taekyung Kim, Keyvan Majd, Hideki Okamoto, Bardh Hoxha, Dimitra Panagou, Georgios Fainekos

Comments 2026 IEEE International Conference on Robotics and Automation (ICRA). Project page: https://www.taekyung.me/safe-mpd

2512.05962 2026-03-09 cs.LG cs.AI

Whatever Remains Must Be True: Filtering Drives Reasoning in LLMs, Shaping Diversity

Germán Kruszewski, Pierre Erbacher, Jos Rozen, Marc Dymetman

Comments Published as an ICLR 2026 conference paper

2512.05270 2026-03-09 cs.RO cs.AI cs.HC cs.MA cs.SY eess.SY

XR-DT: Extended Reality-Enhanced Digital Twin for Safe Motion Planning via Human-Aware Model Predictive Path Integral Control

Tianyi Wang, Jiseop Byeon, Ahmad Yehia, Yiming Xu, Jihyung Park, Tianyi Zeng, Sikai Chen, Ziran Wang, Junfeng Jiao, Christian Claudel

Comments 8 pages, 6 figures, 3 tables

2512.04559 2026-03-09 cs.LG cs.AI

Diffusion Fine-Tuning via Reparameterized Policy Gradient of the Soft Q-Function

Hyeongyu Kang, Jaewoo Lee, Woocheol Shin, Kiyoung Om, Jinkyoo Park

Comments ICLR 2026

2512.04461 2026-03-09 cs.CV

UniTS: Unified Spatio-Temporal Generative Model for Remote Sensing

Yuxiang Zhang, Shunlin Liang, Wenyuan Li, Han Ma, Jianglei Xu, Yichuan Ma, Jiangwei Xie, Wei Li, Mengmeng Zhang, Ran Tao, Xiang-Gen Xia

2512.02702 2026-03-09 cs.CV

A method for tissue-mask supported whole-body image registration in the UK Biobank

Yasemin Utkueri, Elin Lundström, Håkan Ahlström, Johan Öfverstedt, Joel Kullberg

Comments 35 pages, 10 figures

2511.22829 2026-03-09 cs.RO cs.HC

Safe Autonomous Lane Changing: Planning with Dynamic Risk Fields and Time-Varying Convex Space Generation

Yijun Lu, Zhihao Lin, Zhen Tian

2511.19319 2026-03-09 cs.CV

SyncMV4D: Synchronized Multi-view Joint Diffusion of Appearance and Motion for Hand-Object Interaction Synthesis

Lingwei Dang, Zonghan Li, Juntong Li, Hongwen Zhang, Liang An, Yebin Liu, Qingyao Wu

Comments The structure and logic of writing will undergo a complete revision

2511.18112 2026-03-09 cs.RO

EchoVLA: Synergistic Declarative Memory for VLA-Driven Mobile Manipulation

Min Lin, Xiwen Liang, Bingqian Lin, Liu Jingzhi, Zijian Jiao, Kehan Li, Yu Sun, Weijia Liufu, Yuhan Ma, Yuecheng Liu, Shen Zhao, Yuzheng Zhuang, Xiaodan Liang

2511.17938 2026-03-09 cs.CL cs.LG

SPINE: Token-Selective Test-Time Reinforcement Learning with Entropy-Band Regularization

Jianghao Wu, Yasmeen George, Jin Ye, Yicheng Wu, Daniel F. Schmidt, Jianfei Cai

2511.17581 2026-03-09 cs.LG cs.CV

EgoCogNav: Cognition-aware Human Egocentric Navigation

Zhiwen Qiu, Ziang Liu, Wenqian Niu, Tapomayukh Bhattacharjee, Saleh Kalantari

Comments 11 pages, 4 figures