arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2601.21301 2026-01-30 cs.LG stat.ML

Achieving $\varepsilon^{-2}$ Dependence for Average-Reward Q-Learning with a New Contraction Principle

Zijun Chen, Zaiwei Chen, Nian Si, Shengbo Wang

2601.21296 2026-01-30 cs.LG cs.AI

Grounding and Enhancing Informativeness and Utility in Dataset Distillation

Shaobo Wang, Yantai Yang, Guo Chen, Peiru Li, Kaixin Li, Yufa Zhou, Zhaorun Chen, Linfeng Zhang

Comments Accepted by ICLR 2026, 20 pages, 9 figures, 11 tables

2601.21291 2026-01-30 cs.CV

Gaussian Belief Propagation Network for Depth Completion

Jie Tang, Pingping Xie, Jian Li, Ping Tan

2601.21284 2026-01-30 cs.LG cs.AI cs.ET math.AP

PILD: Physics-Informed Learning via Diffusion

Tianyi Zeng, Tianyi Wang, Jiaru Zhang, Zimo Zeng, Feiyang Zhang, Yiming Xu, Sikai Chen, Yajie Zou, Yangyang Wang, Junfeng Jiao, Christian Claudel, Xinbo Chen

2601.21282 2026-01-30 cs.CV

WorldBench: Disambiguating Physics for Diagnostic Evaluation of World Models

Rishi Upadhyay, Howard Zhang, Jim Solomon, Ayush Agrawal, Pranay Boreddy, Shruti Satya Narayana, Yunhao Ba, Alex Wong, Celso M de Melo, Achuta Kadambi

Comments Webpage: https://world-bench.github.io/

2601.21281 2026-01-30 cs.LG

EGAM: Extended Graph Attention Model for Solving Routing Problems

Licheng Wang, Yuzi Yan, Mingtao Huang, Yuan Shen

2601.21269 2026-01-30 cs.CV cs.AI

Lightweight High-Fidelity Low-Bitrate Talking Face Compression for 3D Video Conference

Jianglong Li, Jun Xu, Bingcong Lu, Zhengxue Cheng, Hongwei Hu, Ronghua Wu, Li Song

2601.21255 2026-01-30 cs.CV cs.AI cs.LG

Hypersolid: Emergent Vision Representations via Short-Range Repulsion

Esteban Rodríguez-Betancourt, Edgar Casasola-Murillo

Comments 17 pages, 16 figures

2601.21251 2026-01-30 cs.RO

Abstracting Robot Manipulation Skills via Mixture-of-Experts Diffusion Policies

Ce Hao, Xuanran Zhai, Yaohua Liu, Harold Soh

2601.21249 2026-01-30 cs.AI cs.LG

Position: Certifiable State Integrity in Cyber-Physical Systems -- Why Modular Sovereignty Solves the Plasticity-Stability Paradox

Enzo Nicolás Spotorno, Antônio Augusto Medeiros Fröhlich

Comments 14 pages, (8 main text, 6 references and appendices), 2 figures

2601.21248 2026-01-30 cs.CV

NFCDS: A Plug-and-Play Noise Frequency-Controlled Diffusion Sampling Strategy for Image Restoration

Zhen Wang, Hongyi Liu, Jianing Li, Zhihui Wei

2601.21246 2026-01-30 cs.LG cs.AI

Conditional Generative Framework with Peak-Aware Attention for Robust Chemical Detection under Interferences

Namkyung Yoon, Sanghong Kim, Hwangnam Kim

Comments 24 pages, 5 figures

2601.21242 2026-01-30 cs.LG cs.AI

Understanding Diffusion Models via Ratio-Based Function Approximation with SignReLU Networks

Luwei Sun, Dongrui Shen, Jianfe Li, Yulong Zhao, Han Feng

Comments 34 pages

2601.21238 2026-01-30 cs.CV cs.AI

PTQ4ARVG: Post-Training Quantization for AutoRegressive Visual Generation Models

Xuewen Liu, Zhikai Li, Jing Zhang, Mengjuan Chen, Qingyi Gu

Comments ICLR 2026

2601.21235 2026-01-30 cs.CL cs.AI

SHARP: Social Harm Analysis via Risk Profiles for Measuring Inequities in Large Language Models

Alok Abhishek, Tushar Bandopadhyay, Lisa Erickson

Comments Pre Print, 29 pages. key words: Social harm evaluation in LLMs, Large language models, Risk sensitive model selection, Evaluation for high-stakes domains, Worst-case behavior in LLMs, Algorithmic bias, Fairness in machine learning

2601.21234 2026-01-30 cs.LG

PHDME: Physics-Informed Diffusion Models without Explicit Governing Equations

Kaiyuan Tan, Kendra Givens, Peilun Li, Thomas Beckers

2601.21226 2026-01-30 cs.AI

Delegation Without Living Governance

Wolfgang Rohde

2601.21220 2026-01-30 cs.CV

LAMP: Learning Universal Adversarial Perturbations for Multi-Image Tasks via Pre-trained Models

Alvi Md Ishmam, Najibul Haque Sarker, Zaber Ibn Abdul Hakim, Chris Thomas

Comments Accepted in main technical track AAAI 2026

2601.21219 2026-01-30 cs.LG cond-mat.dis-nn

Soft Quantization: Model Compression Via Weight Coupling

Daniel T. Bernstein, Luca Di Carlo, David Schwab

Comments 7 pages, 6 figures

2601.21215 2026-01-30 cs.LG cs.AI eess.SP

Temporal Context and Architecture: A Benchmark for Naturalistic EEG Decoding

Mehmet Ergezer

Journal ref ICASSP 2026

2601.21212 2026-01-30 cs.AI cs.CY

Intelli-Planner: Towards Customized Urban Planning via Large Language Model Empowered Reinforcement Learning

Xixian Yong, Peilin Sun, Zihe Wang, Xiao Zhou

Comments The Web Conference 2026

2601.21210 2026-01-30 cs.AI

Uncovering Hidden Correctness in LLM Causal Reasoning via Symbolic Verification

Paul He, Yinya Huang, Mrinmaya Sachan, Zhijing Jin

Comments EACL 2026 Main

2601.21208 2026-01-30 cs.AI cs.IR

When should I search more: Adaptive Complex Query Optimization with Reinforcement Learning

Wei Wen, Sihang Deng, Tianjun Wei, Keyu Chen, Ruizhi Qiao, Xing Sun

Comments 16 pages, 7 figures

详情

英文摘要

Query optimization is a crucial component for the efficacy of Retrieval-Augmented Generation (RAG) systems. While reinforcement learning (RL)-based agentic and reasoning methods have recently emerged as a promising direction on query optimization, most existing approaches focus on the expansion and abstraction of a single query. However, complex user queries are prevalent in real-world scenarios, often requiring multiple parallel and sequential search strategies to handle disambiguation and decomposition. Directly applying RL to these complex cases introduces significant hurdles. Determining the optimal number of sub-queries and effectively re-ranking and merging retrieved documents vastly expands the search space and complicates reward design, frequently leading to training instability. To address these challenges, we propose a novel RL framework called Adaptive Complex Query Optimization (ACQO). Our framework is designed to adaptively determine when and how to expand the search process. It features two core components: an Adaptive Query Reformulation (AQR) module that dynamically decides when to decompose a query into multiple sub-queries, and a Rank-Score Fusion (RSF) module that ensures robust result aggregation and provides stable reward signals for the learning agent. To mitigate training instabilities, we adopt a Curriculum Reinforcement Learning (CRL) approach, which stabilizes the training process by progressively introducing more challenging queries through a two-stage strategy. Our comprehensive experiments demonstrate that ACQO achieves state-of-the-art performance on three complex query benchmarks, significantly outperforming established baselines. The framework also showcases improved computational efficiency and broad compatibility with different retrieval architectures, establishing it as a powerful and generalizable solution for next-generation RAG systems.

URL PDF HTML ☆

赞 0 踩 0

2601.21203 2026-01-30 cs.LG

Rethinking Self-Training Based Cross-Subject Domain Adaptation for SSVEP Classification

Weiguang Wang, Yong Liu, Yingjie Gao, Guangyuan Xu

Comments Accepted to ICASSP 2026

2601.21199 2026-01-30 cs.CV cs.AI

Thinker: A vision-language foundation model for embodied intelligence

Baiyu Pan, Daqin Luo, Junpeng Yang, Jiyuan Wang, Yixuan Zhang, Hailin Shi, Jichao Jiao

Comments IROS 2025, 4 pages, 3 figures

2601.21193 2026-01-30 cs.CV

Generative Recall, Dense Reranking: Learning Multi-View Semantic IDs for Efficient Text-to-Video Retrieval

Zecheng Zhao, Zhi Chen, Zi Huang, Shazia Sadiq, Tong Chen

Comments 10 pages

详情

英文摘要

Text-to-Video Retrieval (TVR) is essential in video platforms. Dense retrieval with dual-modality encoders leads in accuracy, but its computation and storage scale poorly with corpus size. Thus, real-time large-scale applications adopt two-stage retrieval, where a fast recall model gathers a small candidate pool, which is reranked by an advanced dense retriever. Due to hugely reduced candidates, the reranking model can use any off-the-shelf dense retriever without hurting efficiency, meaning the recall model bounds two-stage TVR performance. Recently, generative retrieval (GR) replaces dense video embeddings with discrete semantic IDs and retrieves by decoding text queries into ID tokens. GR offers near-constant inference and storage complexity, and its semantic IDs capture high-level video features via quantization, making it ideal for quickly eliminating irrelevant candidates during recall. However, as a recall model in two-stage TVR, GR suffers from (i) semantic ambiguity, where each video satisfies diverse queries but is forced into one semantic ID; and (ii) cross-modal misalignment, as semantic IDs are solely derived from visual features without text supervision. We propose Generative Recall and Dense Reranking (GRDR), designing a novel GR method to uplift recalled candidate quality. GRDR assigns multiple semantic IDs to each video using a query-guided multi-view tokenizer exposing diverse semantic access paths, and jointly trains the tokenizer and generative retriever via a shared codebook to cast semantic IDs as the semantic bridge between texts and videos. At inference, trie-constrained decoding generates a compact candidate set reranked by a dense model for fine-grained matching. Experiments on TVR benchmarks show GRDR matches strong dense retrievers in accuracy while reducing index storage by an order of magnitude and accelerating up to 300$\times$ in full-corpus retrieval.

URL PDF HTML ☆

赞 0 踩 0

2601.21192 2026-01-30 cs.AI cs.CL

Do Reasoning Models Enhance Embedding Models?

Wun Yu Chan, Shaojin Chen, Huihao Jing, Kwun Hang Lau, Elton Chun-Chai Li, Zihao Wang, Haoran Li, Yangqiu Song

Comments 10 main pages, 18 appendix pages, 13 figures, 11 tables, 4 prompts

2601.21188 2026-01-30 cs.RO

Disturbance-Aware Flight Control of Robotic Gliding Blimp via Moving Mass Actuation

Hao Cheng, Feitian Zhang

2601.21182 2026-01-30 cs.LG cs.AI

Rethinking Refinement: Correcting Generative Bias without Noise Injection

Xin Peng, Ang Gao

2601.21181 2026-01-30 cs.AI

MAD: Modality-Adaptive Decoding for Mitigating Cross-Modal Hallucinations in Multimodal Large Language Models

Sangyun Chung, Se Yeon Kim, Youngchae Chee, Yong Man Ro

AI 大模型

视觉与机器人

科学与医疗