arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2603.20449 2026-03-24 cs.SE cs.AI

Solver-Aided Verification of Policy Compliance in Tool-Augmented LLM Agents

Cailin Winston, Claris Winston, René Just

2603.20434 2026-03-24 eess.SY cs.LG cs.SY

Verifiable Error Bounds for Physics-Informed Neural KKL Observers

Hannah Berin-Costain, Harry Wang, Kirsten Morris, Jun Liu

Comments 6 pages, 4 figures

2603.20408 2026-03-24 cs.GT cs.AI cs.LG cs.SY eess.SY math.OC

Meta-Learning for Repeated Bayesian Persuasion

Ata Poyraz Turna, Asrin Efe Yorulmaz, Tamer Başar

Comments 40 pages

2603.20404 2026-03-24 cs.NI cs.LG cs.MA

Hetero-Net: An Energy-Efficient Resource Allocation and 3D Placement in Heterogeneous LoRa Networks via Multi-Agent Optimization

Abdullahi Isa Ahmed, Ana Maria Drăgulinescu, El Mehdi Amhoud

Comments 6 pages, 7 figures

2603.20389 2026-03-24 cond-mat.mtrl-sci cs.LG physics.chem-ph

A chemical language model for reticular materials design

Dhruv Menon, Vivek Singh, Xu Chen, Mohammad Reza Alizadeh Kiapi, Ivan Zyuzin, Hamish W. Macleod, Nakul Rampal, William Shepard, Omar M. Yaghi, David Fairen-Jimenez

Comments 45 pages, 26 figures, Supplementary Information included; code available at: https://github.com/fairen-group/nexerra-r1

2603.20388 2026-03-24 math.ST cs.LG econ.EM stat.ML stat.TH

From Cross-Validation to SURE: Asymptotic Risk of Tuned Regularized Estimators

Karun Adusumilli, Maximilian Kasy, Ashia Wilson

2603.20387 2026-03-24 eess.AS cs.SD

End-to-End Multi-Task Learning for Adjustable Joint Noise Reduction and Hearing Loss Compensation

Philippe Gonzalez, Vera Margrethe Frederiksen, Torsten Dau, Tobias May

2603.20366 2026-03-24 cs.IR cs.AI

WebNavigator: Global Web Navigation via Interaction Graph Retrieval

Xuanwang Zhang, Yuteng Han, Jinnan Qi, Mulong Xie, Zhen Wu, Xinyu Dai

Comments 24 pages, 3 figures

2603.20365 2026-03-24 stat.ML cs.AI cs.LG

Comprehensive Description of Uncertainty in Measurement for Representation and Propagation with Scalable Precision

Ali Darijani, Jürgen Beyerer, Zahra Sadat Hajseyed Nasrollah, Luisa Hoffmann, Michael Heizmann

2603.20357 2026-03-24 cs.CR cs.AI

Memory poisoning and secure multi-agent systems

Vicenç Torra, Maria Bras-Amorós

Comments 15 pages, 2 figures

2603.20354 2026-03-24 cs.MM cs.AI

Leum-VL Technical Report

Yuxuan He, Chaiming Huang, Yifan Wu, Hongjun Wang, Chenkui Shen, Jifan Zhang, Long Li

Comments 27 pages, 5 figures

详情

英文摘要

A short video succeeds not simply because of what it shows, but because of how it schedules attention -- yet current multimodal models lack the structural grammar to parse or produce this organization. Existing models can describe scenes, answer event-centric questions, and read on-screen text, but they are far less reliable at identifying timeline-grounded units such as hooks, cut rationales, shot-induced tension, and platform-facing packaging cues. We propose SV6D (Structured Video in Six Dimensions), inspired by professional storyboard practice in film and television production, a representation framework that decomposes internet-native video into six complementary structural dimensions -- subject, aesthetics, camera language, editing, narrative, and dissemination -- with each label tied to physically observable evidence on the timeline. We formalize a unified optimization objective over SV6D that combines Hungarian-matched temporal alignment, dimension-wise semantic label distance, and quality regularization. Building on this framework, we present Leum-VL-8B, an 8B video-language model that realizes the SV6D objective through an expert-driven post-training pipeline, further refined through verifiable reinforcement learning on perception-oriented tasks. Leum-VL-8B achieves 70.8 on VideoMME (w/o subtitles), 70.0 on MVBench, and 61.6 on MotionBench, while remaining competitive on general multimodal evaluations such as MMBench-EN. We also construct FeedBench, a benchmark for structure-sensitive short-video understanding. Our results indicate that the missing layer in video AI is not pixel generation but structural representation: grounded on the timeline, linked to visible evidence, and directly consumable by downstream workflows such as editing, retrieval, recommendation, and generation control, including text-heavy internet video formats with overlays and image-text layouts.

URL PDF HTML ☆

赞 0 踩 0

2603.20351 2026-03-24 cs.CR cs.AI

MANA: Towards Efficient Mobile Ad Detection via Multimodal Agentic UI Navigation

Yizhe Zhao, Yongjian Fu, Zihao Feng, Hao Pan, Yongheng Deng, Yaoxue Zhang, Ju Ren

2603.20346 2026-03-24 q-bio.GN cs.LG

G2DR: A Genotype-First Framework for Genetics-Informed Target Prioritization and Drug Repurposing

Muhammad Muneeb, David B. Ascher

2603.20338 2026-03-24 cs.IR cs.AI cs.LG

Low-pass Personalized Subgraph Federated Recommendation

Wooseok Sim, Hogun Park

Comments Accepted at ICLR 2026. 31 pages, 3 figures, 12 tables

2603.20336 2026-03-24 cs.IR cs.AI cs.DB

GEM: A Native Graph-based Index for Multi-Vector Retrieval

Yao Tian, Zhoujin Tian, Xi Zhao, Ruiyuan Zhang, Xiaofang Zhou

Comments This paper has been accepted by SIGMOD 2026

2603.20324 2026-03-24 cs.MA cs.AI

When Agents Disagree: The Selection Bottleneck in Multi-Agent LLM Pipelines

Artem Maryanskyy

Comments 12 pages, 3 figures, 5 tables

2603.20321 2026-03-24 q-bio.MN cs.AI cs.CL

GIP-RAG: An Evidence-Grounded Retrieval-Augmented Framework for Interpretable Gene Interaction and Pathway Impact Analysis

Fujian Jia, Jiwen Gu, Cheng Lu, Dezhi Zhao, Mengjiang Huang, Yuanzhi Lu, Xin Liu, Kang Liu

Comments 29 pages

2603.20320 2026-03-24 cs.SE cs.AI cs.LG

The Causal Impact of Tool Affordance on Safety Alignment in LLM Agents

Shasha Yu, Fiona Carroll, Barry L. Bentley

2603.20316 2026-03-24 cs.IR cs.AI

Bypassing Document Ingestion: An MCP Approach to Financial Q&A

Sasan Mansouri, Edoardo Pilla, Mark Wahrenburg, Fabian Woebbeking

Comments 19 pages, 10 figures

2603.20313 2026-03-24 cs.SE cs.AI

Semantic Tool Discovery for Large Language Models: A Vector-Based Approach to MCP Tool Selection

Sarat Mudunuri, Jian Wan, Ally Qin, Srinivasan Manoharan

2603.20311 2026-03-24 cs.SE cs.AI cs.CL

kRAIG: A Natural Language-Driven Agent for Automated DataOps Pipeline Generation

Rohan Siva, Kai Cheung, Lichi Li, Ganesh Sundaram

Comments 9 pages, 7 figures

2603.20308 2026-03-24 cs.MA cs.AI

Reason-to-Transmit: Deliberative Adaptive Communication for Cooperative Perception

Aayam Bansal, Ishaan Gangwani

2603.20300 2026-03-24 cs.SE cs.AI

From Human Interfaces to Agent Interfaces: Rethinking Software Design in the Age of AI-Native Systems

Shaolin Wang, Yi Mei, Haoyang Che, He Jiang, Shui Yu, Ying Gu

Comments 4 pages, 1 figure, 1 table

2603.20299 2026-03-24 cs.SE cs.AI

HCAG: Hierarchical Abstraction and Retrieval-Augmented Generation on Theoretical Repositories with LLMs

Yusen Wu, Xiaotie Deng

详情

英文摘要

Existing Retrieval-Augmented Generation (RAG) methods for code struggle to capture the high-level architectural patterns and cross-file dependencies inherent in complex, theory-driven codebases, such as those in algorithmic game theory (AGT), leading to a persistent semantic and structural gap between abstract concepts and executable implementations. To address this challenge, we propose Hierarchical Code/Architecture-guided Agent Generation (HCAG), a framework that reformulates repository-level code generation as a structured, planning-oriented process over hierarchical knowledge. HCAG adopts a two-phase design: an offline hierarchical abstraction phase that recursively parses code repositories and aligned theoretical texts to construct a multi-resolution semantic knowledge base explicitly linking theory, architecture, and implementation; and an online hierarchical retrieval and scaffolded generation phase that performs top-down, level-wise retrieval to guide LLMs in an architecture-then-module generation paradigm. To further improve robustness and consistency, HCAG integrates a multi-agent discussion inspired by cooperative game. We provide a theoretical analysis showing that hierarchical abstraction with adaptive node compression achieves cost-optimality compared to flat and iterative RAG baselines. Extensive experiments on diverse game-theoretic system generation tasks demonstrate that HCAG substantially outperforms representative repository-level methods in code quality, architectural coherence, and requirement pass rate. In addition, HCAG produces a large-scale, aligned theory-implementation dataset that effectively enhances domain-specific LLMs through post-training. Although demonstrated in AGT, HCAG paradigm also offers a general blueprint for mining, reusing, and generating complex systems from structured codebases in other domains.

URL PDF HTML ☆

赞 0 踩 0

2603.20281 2026-03-24 cs.GT cs.AI

On the Fragility of AI Agent Collusion

Jussi Keppo, Yuze Li, Gerry Tsoukalas, Nuo Yuan

Comments 48 pages, 7 figures, 8 tables (including appendix)

2603.20279 2026-03-24 cs.CR cs.AI cs.LG cs.MA

Learning Communication Between Heterogeneous Agents in Multi-Agent Reinforcement Learning for Autonomous Cyber Defence

Alex Popa, Adrian Taylor, Ranwa Al Mallah

Comments 6 pages, 3 figures, 1 algorithm, conference paper. CyMARL-CommFormer code available at https://github.com/Poly-AIvsAI/CyMARL-CommFormer/tree/main

2603.20278 2026-03-24 cs.IR cs.AI cs.CL

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

Zhuofeng Li, Dongfu Jiang, Xueguang Ma, Haoxiang Zhang, Ping Nie, Yuyu Zhang, Kai Zou, Jianwen Xie, Yu Zhang, Wenhu Chen

2603.20274 2026-03-24 cs.FL cs.LG

Solomonoff induction

Tom F. Sterkenburg

2603.20265 2026-03-24 cs.IT cs.AI cs.LG cs.MA cs.SY eess.SY math.IT

JCAS-MARL: Joint Communication and Sensing UAV Networks via Resource-Constrained Multi-Agent Reinforcement Learning

Islam Guven, Mehmet Parlak

Comments 6 pages, 8 figures, submitted to the conference

2603.20263 2026-03-24 eess.IV cs.CV cs.LG

MiSiSUn: Minimum Simplex Semisupervised Unmixing

Behnood Rasti, Bikram Koirala, Paul Scheunders