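arXivDaily daily academic digest: synced with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.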

2604.10898 2026-04-15 cs.LG cs.AI cs.CL

ZoomR: Memory Efficient Reasoning through Multi-Granularity Key Value Retrieval

David H. Yang, Yuxuan Zhu, Mohammad Mohammadi Amiri, Keerthiram Murugesan, Tejaswini Pedapati, Subhajit Chaudhury, Pin-Yu Chen

Abstract (English)

Large language models (LLMs) have shown great performance on complex reasoning tasks but often require generating long intermediate thoughts before reaching a final answer. During generation, LLMs rely on a key-value (KV) cache for autoregressive decoding. However, the memory footprint of the KV cache grows with output length. Prior work on KV cache optimization mostly focuses on compressing the long input context, while retaining the full KV cache for decoding. For tasks requiring long output generation, this leads to increased computational and memory costs. In this paper, we introduce ZoomR, a novel approach that enables LLMs to adaptively compress verbose reasoning thoughts into summaries and uses a dynamic KV cache selection policy that leverages these summaries while also strategically "zooming in" on fine-grained details. By using summary keys as a coarse-grained index during decoding, ZoomR uses the query to retrieve details for only the most important thoughts. This hierarchical strategy significantly reduces memory usage by avoiding full-cache attention at each step. Experiments across math and reasoning tasks show that our approach achieves competitive performance compared to baselines, while reducing inference memory requirements by more than $4\times$. These results demonstrate that multi-granularity KV selection enables more memory-efficient decoding, especially for long output generation.
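
The coarse-to-fine selection the abstract describes can be illustrated with a toy sketch. This is not the authors' implementation: the segment layout, the dot-product scoring, the `top_k` parameter, and all function names (`select_kv`, `attend`, `make_segment`) are assumptions made for illustration only. Each "thought" is stored as one summary key/value pair plus its fine-grained detail pairs; the query is scored against summary keys, and details are included only for the highest-scoring segments.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def softmax(scores):
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def select_kv(query, segments, top_k):
    """Keep every summary pair; add detail pairs only for the top_k
    segments whose summary key scores highest against the query.
    (Hypothetical selection rule, chosen for illustration.)"""
    ranked = sorted(
        range(len(segments)),
        key=lambda i: dot(query, segments[i]["summary"][0]),
        reverse=True,
    )
    zoomed = set(ranked[:top_k])
    selected = []
    for i, seg in enumerate(segments):
        selected.append(seg["summary"])
        if i in zoomed:
            selected.extend(seg["details"])
    return selected


def attend(query, kv_pairs):
    """Scaled dot-product attention over the selected (key, value) pairs."""
    scale = math.sqrt(len(query))
    weights = softmax([dot(query, k) / scale for k, _ in kv_pairs])
    dim = len(kv_pairs[0][1])
    return [
        sum(w * v[d] for w, (_, v) in zip(weights, kv_pairs))
        for d in range(dim)
    ]


def make_segment(key, n_details):
    # Toy data: values equal keys, and details reuse the summary key.
    return {"summary": (key, key), "details": [(key, key)] * n_details}


segments = [
    make_segment([1.0, 0.0], 4),
    make_segment([0.0, 1.0], 4),
    make_segment([-1.0, 0.0], 4),
]
query = [1.0, 0.0]

selected = select_kv(query, segments, top_k=1)
output = attend(query, selected)
print(len(selected))  # 7 entries attended, versus 15 in the full toy cache
```

In this toy setup, attention touches 3 summaries plus the 4 details of one segment rather than all 15 cached pairs; how summaries are produced and how many segments are zoomed into in the actual method is not specified in the abstract.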

2604.10655 2026-04-15 cs.CV cs.AI cs.MM

LoViF 2026 The First Challenge on Weather Removal in Videos

Chenghao Qian, Xin Li, Yeying Jin, Shangguan Sun, Yilian Zhong, Yuxiang Chen, Shibo Yin, Yushun Fang, Xilei Zhu, Yahui Wang, Chen Lu, Ying Fu, Jianan Tian, Jifan Zhang, Chen Zhou, Junyang Jiang, Yuping Sun, Zhuohang Shi, Xiaojing Liu, Jiao Liu, Yatong Zhou, Shuai Liu, Qiang Deng, Jiajia Mi, Qianhao Luo, Weiling Li

Comments CVPR Workshop Challenge Report

2604.10291 2026-04-15 cs.AI

TimeSeriesExamAgent: Creating Time Series Reasoning Benchmarks at Scale

Malgorzata Gwiazda, Yifu Cai, Mononito Goswami, Arjun Choudhry, Artur Dubrawski

2604.10288 2026-04-15 cs.AI

Dead Cognitions: A Census of Misattributed Insights

Aaron Tuor, claude.ai

2604.10200 2026-04-15 cs.AI cs.CV

Edu-MMBias: A Three-Tier Multimodal Benchmark for Auditing Social Bias in Vision-Language Models under Educational Contexts

Ruijia Li, Mingzi Zhang, Zengyi Yu, Yuang Wei, Bo Jiang

2604.10151 2026-04-15 cs.CL

Nationality encoding in language model hidden states: Probing culturally differentiated representations in persona-conditioned academic text

Paul Jackson, Ruizhe Li, Elspeth Edelstein

Comments 42 pages, 6 tables

2604.09853 2026-04-15 cs.CV

Do vision models perceive illusory motion in static images like humans?

Isabella Elaine Rosario, Fan L. Cheng, Zitang Sun, Nikolaus Kriegeskorte

Comments Accepted to CVPR 2026 Findings

2604.09689 2026-04-15 cs.CV cs.AI cs.LG

Face Density as a Proxy for Data Complexity: Quantifying the Hardness of Instance Count

Abolfazl Mohammadi-Seif, Ricardo Baeza-Yates

Comments This work has been accepted for publication in the Proceedings of IEEE CAI 2026. The final published version should be cited

2604.09443 2026-04-15 cs.CL cs.AI

Many-Tier Instruction Hierarchy in LLM Agents

Jingyu Zhang, Tianjian Li, William Jurayj, Hongyuan Zhan, Benjamin Van Durme, Daniel Khashabi

2604.09166 2026-04-15 cs.LG

Automated Batch Distillation Process Simulation for a Large Hybrid Dataset for Deep Anomaly Detection

Jennifer Werner, Justus Arweiler, Indra Jungjohann, Jochen Schmid, Fabian Jirasek, Hans Hasse, Michael Bortz

2604.06063 2026-04-15 cs.CV cs.MM

EDGE-Shield: Efficient Denoising-staGE Shield for Violative Content Filtering via Scalable Reference-Based Matching

Takara Taniguchi, Ryohei Shimizu, Duc Minh Vo, Kota Izumi, Shiqi Yang, Teppei Suzuki

2604.05818 2026-04-15 cs.CV cs.CL cs.IR

WikiSeeker: Rethinking the Role of Vision-Language Models in Knowledge-Based Visual Question Answering

Yingjian Zhu, Xinming Wang, Kun Ding, Ying Wang, Bin Fan, Shiming Xiang

Comments Accepted by ACL 2026 Findings

2604.05795 2026-04-15 cs.CL

Measuring What Matters!! Assessing Therapeutic Principles in Mental-Health Conversation

Abdullah Mazhar, Het Riteshkumar Shah, Aseem Srivastava, Smriti Joshi, Md Shad Akhtar

Comments Accepted at ACL 2026 (Main)

2604.05643 2026-04-15 cs.CL

Graph-Based Chain-of-Thought Pruning for Reducing Redundant Reflections in Reasoning LLMs

Hongyuan Yuan, Xinran He, Run Shao, Bolei He, Xianwei Xue, Mengke Chen, Qiutong Pan, Haiwei Wang, Haifeng Li

Comments Accepted by ACL2026 Findings

2604.02830 2026-04-15 cs.CL

GRADE: Probing Knowledge Gaps in LLMs through Gradient Subspace Dynamics

Yujing Wang, Yuanbang Liang, Yukun Lai, Hainan Zhang, Hanqi Yan

2604.01315 2026-04-15 cs.LG

Detecting Complex Money Laundering Patterns with Incremental and Distributed Graph Modeling

Haseeb Tariq, Alen Kaja, Marwan Hassani

2603.27552 2026-04-15 cs.LG cs.DC

BLOSSOM: Block-wise Federated Learning Over Shared and Sparse Observed Modalities

Pranav M R, Jayant Chandwani, Ahmed M. Abdelmoniem, Arnab K. Paul

Comments Accepted to IJCNN 2026 (6 pages, 2 figures, 3 tables)

2603.23488 2026-04-15 cs.CV

One View Is Enough! Monocular Training for In-the-Wild Novel View Generation

Adrien Ramanana Rahary, Nicolas Dufour, Patrick Perez, David Picard

Comments 36 pages, 17 figures

2603.22607 2026-04-15 cs.CV

Dress-ED: Instruction-Guided Editing for Virtual Try-On and Try-Off

Fulvio Sanguigni, Davide Lobba, Bin Ren, Marcella Cornia, Nicu Sebe, Rita Cucchiara

Comments Project page: https://furio1999.github.io/Dress-ED/

2603.21440 2026-04-15 cs.CL cs.AI

KG-Hopper: Empowering Compact Open LLMs with Knowledge Graph Reasoning via Reinforcement Learning

Shuai Wang, Yinan Yu

Comments Accepted to IJCNN 2026

2603.21045 2026-04-15 cs.CV cs.AI

LPNSR: Optimal Noise-Guided Diffusion Image Super-Resolution Via Learnable Noise Prediction

Shuwei Huang, Shizhuo Liu, Zijun Wei

2603.10652 2026-04-15 cs.CV cs.AI

Are Video Reasoning Models Ready to Go Outside?

Yangfan He, Changgyu Boo, Jaehong Yoon

Comments Project Page: https://robust-video-reason.github.io/

2603.08291 2026-04-15 cs.AI

A Survey of Multimodal Mathematical Reasoning: From Perception, Alignment to Reasoning

Tianyu Yang, Sihong Wu, Yilun Zhao, Zhenwen Liang, Lisen Dai, Chen Zhao, Minhao Cheng, Arman Cohan, Xiangliang Zhang

Comments ACL 2026

2603.07083 2026-04-15 cs.LG

Dreamer-CDP: Improving Reconstruction-free World Models Via Continuous Deterministic Representation Prediction

Michael Hauri, Friedemann Zenke

2603.06552 2026-04-15 cs.CL

KCLarity at SemEval-2026 Task 6: Encoder and Zero-Shot Approaches to Political Evasion Detection

Archie Sage, Salvatore Greco

Comments Camera-ready version to appear in the SemEval 2026 Proceedings

2603.05004 2026-04-15 cs.LG cs.AI

Poisoning the Inner Prediction Logic of Graph Neural Networks for Clean-Label Backdoor Attacks

Yuxiang Zhang, Bin Ma, Enyan Dai

Comments Under review as TMLR regular paper

2603.00655 2026-04-15 cs.CV

Mema: Memory-Augmented Adapter for Enhanced Vision-Language Understanding

Ying Liu, Yudong Han, Kean Shi, Liyuan Pan

2602.22394 2026-04-15 cs.CV

Vision Transformers Need More Than Registers

Cheng Shi, Yizhou Yu, Sibei Yang

Comments Accepted by CVPR 2026

2602.18502 2026-04-15 cs.CV cs.LG

Mitigating Shortcut Learning via Feature Disentanglement in Medical Imaging: A Benchmark Study

Sarah Müller, Philipp Berens

Comments Minor edits: formatting improvements and typo fixes; no changes to content or results

2602.02370 2026-04-15 cs.CV

Uncertainty-Aware Image Classification In Biomedical Imaging Using Spectral-normalized Neural Gaussian Processes

Uma Meleti, Jeffrey J. Nirschl

Comments Published at the IEEE International Symposium on Biomedical Imaging (ISBI) 2026