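arXivDaily: a daily academic digest synchronized with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems, and more.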

2602.12278 2026-02-13 cs.IR cs.AI

AttentionRetriever: Attention Layers are Secretly Long Document Retrievers

David Jiahao Fu, Lam Thanh Do, Jiayu Li, Kevin Chen-Chuan Chang

2602.12276 2026-02-13 cs.AI cs.CL

Agentic Test-Time Scaling for WebAgents

Nicholas Lee, Lutfi Eren Erdogan, Chris Joseph John, Surya Krishnapillai, Michael W. Mahoney, Kurt Keutzer, Amir Gholami

2602.12273 2026-02-13 math.OC cs.LG cs.NA math.NA

Learning to Control: The iUzawa-Net for Nonsmooth Optimal Control of Linear PDEs

Yongcun Song, Xiaoming Yuan, Hangrui Yue, Tianyou Zeng

2602.12271 2026-02-13 cs.CV cs.LG

MonarchRT: Efficient Attention for Real-Time Video Generation

Krish Agarwal, Zhuoming Chen, Cheng Luo, Yongqi Chen, Haizhong Zheng, Xun Huang, Atri Rudra, Beidi Chen

Abstract

Real-time video generation with Diffusion Transformers is bottlenecked by the quadratic cost of 3D self-attention, especially in real-time regimes that are both few-step and autoregressive, where errors compound across time and each denoising step must carry substantially more information. In this setting, we find that prior sparse-attention approximations break down, despite showing strong results for bidirectional, many-step diffusion. Specifically, we observe that video attention is not reliably sparse, but instead combines pronounced periodic structure driven by spatiotemporal position with dynamic, sparse semantic correspondences and dense mixing, exceeding the representational capacity of even oracle top-k attention. Building on this insight, we propose Monarch-RT, a structured attention parameterization for video diffusion models that factorizes attention using Monarch matrices. Through appropriately aligned block structure and our extended tiled Monarch parameterization, we achieve high expressivity while preserving computational efficiency. We further overcome the overhead of parameterization through finetuning, with custom Triton kernels. We first validate the high efficacy of Monarch-RT over existing sparse baselines designed only for bidirectional models. We further observe that Monarch-RT attains up to 95% attention sparsity with no loss in quality when applied to the state-of-the-art model Self-Forcing, making Monarch-RT a pioneering work on highly-capable sparse attention parameterization for real-time video generation. Our optimized implementation outperforms FlashAttention-2, FlashAttention-3, and FlashAttention-4 kernels on Nvidia RTX 5090, H100, and B200 GPUs respectively, providing kernel speedups in the range of 1.4-11.8X. This enables us, for the first time, to achieve true real-time video generation with Self-Forcing at 16 FPS on a single RTX 5090.

2602.12270 2026-02-13 econ.TH cs.AI cs.GT

Creative Ownership in the Age of AI

Annie Liang, Jay Lu

2602.12264 2026-02-13 cs.IT cs.NI eess.SP math.IT

Transmit or Idle: Efficient AoI Optimal Transmission Policy for Gossiping Receivers

Irtiza Hasan, Ahmed Arafa

Comments To appear in IEEE ICC 2026

2602.12257 2026-02-13 math.PR cs.AI

On the implicit regularization of Langevin dynamics with projected noise

Govind Menon, Austin J. Stromme, Adrien Vacher

Comments 30 pages, 1 figure

2602.12256 2026-02-13 cs.SE

Automated Test Suite Enhancement Using Large Language Models with Few-shot Prompting

Alex Chudic, Gül Çalıklı

Comments 13 pages, 3 figures, accepted to ICPC 2026 (34th International Conference on Program Comprehension)

2602.12253 2026-02-13 cs.GT cs.LG

Is Online Linear Optimization Sufficient for Strategic Robustness?

Yang Cai, Haipeng Luo, Chen-Yu Wei, Weiqiang Zheng

Comments 26 pages

2602.12251 2026-02-13 cs.CL cs.AI cs.HC

A technical curriculum on language-oriented artificial intelligence in translation and specialised communication

Ralph Krüger

Comments 10 pages, 1 figure, EAMT 2026, TAITT Workshop

2602.12250 2026-02-13 cs.LG cs.CR cs.SI

Community Concealment from Unsupervised Graph Learning-Based Clustering

Dalyapraz Manatova, Pablo Moriano, L. Jean Camp

2602.12245 2026-02-13 cs.LG cs.AI

Intrinsic-Energy Joint Embedding Predictive Architectures Induce Quasimetric Spaces

Anthony Kobanda, Waris Radji

2602.12244 2026-02-13 cs.RO

Any House Any Task: Scalable Long-Horizon Planning for Abstract Human Tasks

Zhihong Liu, Yang Li, Rengming Huang, Cewu Lu, Panpan Cai

2602.12243 2026-02-13 cs.MA

Federated Gaussian Process Learning via Pseudo-Representations for Large-Scale Multi-Robot Systems

Sanket A. Salunkhe, George P. Kontoudis

Comments Accepted at 25th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2026)

Journal ref 25th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2026)

2602.12241 2026-02-13 cs.CL cs.LG cs.SD

Moonshine v2: Ergodic Streaming Encoder ASR for Latency-Critical Speech Applications

Manjunath Kudlur, Evan King, James Wang, Pete Warden

Comments 7 pages, 5 figures

2602.12237 2026-02-13 cs.LG cs.AI cs.CL

Olmix: A Framework for Data Mixing Throughout LM Development

Mayee F. Chen, Tyler Murray, David Heineman, Matt Jordan, Hannaneh Hajishirzi, Christopher Ré, Luca Soldaini, Kyle Lo

2602.12233 2026-02-13 cs.LG

Categorical Flow Maps

Daan Roos, Oscar Davis, Floor Eijkelboom, Michael Bronstein, Max Welling, İsmail İlkan Ceylan, Luca Ambrogioni, Jan-Willem van de Meent

2602.12231 2026-02-13 cs.GT

Adjusted Winner: from Splitting to Selling

Robert Bredereck, Bin Sun, Eyal Briman, Nimrod Talmon

2602.12229 2026-02-13 cs.LG

Diffusion Alignment Beyond KL: Variance Minimisation as Effective Policy Optimiser

Zijing Ou, Jacob Si, Junyi Zhu, Ondrej Bohdal, Mete Ozay, Taha Ceritli, Yingzhen Li

2602.12218 2026-02-13 cs.LG cs.AI

The Observer Effect in World Models: Invasive Adaptation Corrupts Latent Physics

Christian Internò, Jumpei Yamaguchi, Loren Amdahl-Culleton, Markus Olhofer, David Klindt, Barbara Hammer

2602.12209 2026-02-13 cs.CR cs.CC cs.DS

Keeping a Secret Requires a Good Memory: Space Lower-Bounds for Private Algorithms

Alessandro Epasto, Xin Lyu, Pasin Manurangsi

Comments comments welcome

2602.12204 2026-02-13 cs.LG

Learning to Forget Attention: Memory Consolidation for Adaptive Compute Reduction

Ibne Farabi Shihab, Sanjeda Akter, Anuj Sharma

2602.12203 2026-02-13 cs.CL

ExStrucTiny: A Benchmark for Schema-Variable Structured Information Extraction from Document Images

Mathieu Sibue, Andres Muñoz Garza, Samuel Mensah, Pranav Shetty, Zhiqiang Ma, Xiaomo Liu, Manuela Veloso

Comments EACL 2026, main conference

2602.12199 2026-02-13 cs.RO cs.NA math.NA

Sub-Riemannian boundary value problems for Optimal Geometric Locomotion

Oliver Gross, Florine Hartwig, Martin Rumpf, Peter Schröder

2602.12196 2026-02-13 cs.CL cs.AI

Visual Reasoning Benchmark: Evaluating Multimodal LLMs on Classroom-Authentic Visual Problems from Primary Education

Mohamed Huti, Alasdair Mackintosh, Amy Waldock, Dominic Andrews, Maxime Lelièvre, Moritz Boos, Tobias Murray, Paul Atherton, Robin A. A. Ince, Oliver G. B. Garrod

2602.12189 2026-02-13 cs.LG

WaveFormer: Wavelet Embedding Transformer for Biomedical Signals

Habib Irani, Bikram De, Vangelis Metsis

2602.12187 2026-02-13 cs.IR cs.AI

SAGEO Arena: A Realistic Environment for Evaluating Search-Augmented Generative Engine Optimization

Sunghwan Kim, Wooseok Jeong, Serin Kim, Sangam Lee, Dongha Lee

Comments Work in Progress

2602.12183 2026-02-13 cs.CR cs.SE

Unknown Attack Detection in IoT Networks using Large Language Models: A Robust, Data-efficient Approach

Shan Ali, Feifei Niu, Paria Shirani, Lionel C. Briand

Comments 13 pages, 2 figures

2602.12182 2026-02-13 cs.IT math.IT

Rate-Reliability Tradeoff for Deterministic Identification over Gaussian Channels

Pau Colomer, Christian Deppe, Holger Boche, Andreas Winter

Comments 10 pages, 1 figure. The first half of this preprint will be presented at the 2026 IEEE International Conference on Communications, Glasgow, 24-28 May 2026

2602.12181 2026-02-13 cs.GT cs.LG cs.MA

Convex Markov Games and Beyond: New Proof of Existence, Characterization and Learning Algorithms for Nash Equilibria

Anas Barakat, Ioannis Panageas, Antonios Varvitsiotis

Comments AISTATS 2026