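arXivDaily daily academic digest: synchronized with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems, and more.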


2507.12753 2026-03-04 cs.RO

osmAG-LLM: Zero-Shot Open-Vocabulary Object Navigation via Semantic Maps and Large Language Models Reasoning

Fujing Xie, Sören Schwertfeger, Hermann Blum

Comments accepted at RA-L 2026

2507.08965 2026-03-04 cs.LG cs.AI stat.ML

Improving Classifier-Free Guidance in Masked Diffusion: Low-Dim Theoretical Insights with High-Dim Impact

Kevin Rojas, Ye He, Chieh-Hsin Lai, Yuhta Takida, Yuki Mitsufuji, Molei Tao

2507.08334 2026-03-04 cs.CV cs.AI

CoBELa: Steering Transparent Generation via Concept Bottlenecks on Energy Landscapes

Sangwon Kim, Kyoungoh Lee, Jeyoun Dong, Kwang-Ju Kim

Comments The original version was accepted by ICCV2025 Workshops

2507.08207 2026-03-04 cs.AI

Toward a Dynamic Stackelberg Game-Theoretic Framework for Agentic AI Defense Against LLM Jailbreaking

Zhengye Han, Quanyan Zhu

Comments Accepted to ICLR 2026 AIMS Workshop. 13 pages, 3 figures

2507.02494 2026-03-04 cs.CV cs.LG

MC-INR: Efficient Encoding of Multivariate Scientific Simulation Data using Meta-Learning and Clustered Implicit Neural Representations

Hyunsoo Son, Jeonghyun Noh, Suemin Jeon, Chaoli Wang, Won-Ki Jeong

Comments 5 pages

Journal ref 2025 IEEE Visualization and Visual Analytics (VIS)

2507.01352 2026-03-04 cs.CL cs.AI cs.LG

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Chris Yuhao Liu, Liang Zeng, Yuzhen Xiao, Jujie He, Jiacai Liu, Chaojie Wang, Rui Yan, Wei Shen, Fuxiang Zhang, Jiacheng Xu, Yang Liu, Yahui Zhou

Comments ICLR 2026 Poster

2507.01335 2026-03-04 cs.CL cs.AI

LEDOM: Reverse Language Model

Xunjian Yin, Sitao Cheng, Yuxi Xie, Xinyu Hu, Li Lin, Xinyi Wang, Liangming Pan, William Yang Wang, Xiaojun Wan

Comments Work in progress; Models can be found at: https://huggingface.co/Corning/Reverse-Model-7B-348B/tree/main

2506.17871 2026-03-04 cs.CL cs.AI cs.LG

LLM Probability Concentration: How Alignment Shrinks the Generative Horizon

Chenghao Yang, Sida Li, Ari Holtzman

Comments Codebase: https://github.com/yangalan123/LLMBranchingFactor. V3: Significantly rewrite the whole paper for a clearer structure. Correct problems in the theory parts (Remove emphasis on AEP, discussions on variable LLM generation lengths) and strengthen asymptotic analysis. Add Qwen and OLMo2 experiments. Preliminary SFT v.s. RL comparison to better understand the alignment effects on BF

Details

Abstract

Despite their impressive capabilities, aligned large language models (LLMs) often generate outputs that lack diversity. What drives this consistency in the generation? We investigate this phenomenon through the lens of probability concentration in the model's output distribution. To quantify this concentration, we introduce the *Branching Factor* (BF) -- a token-invariant measure of the effective number of plausible next steps during generation. Our empirical analysis reveals two key findings: (1) BF often decreases as generation progresses, suggesting that LLMs become more predictable as they generate. (2) alignment tuning substantially sharpens the model's output distribution from the outset, reducing BF by a factor of 2-5 overall, and up to an order of magnitude (e.g., from 12 to 1.2) at the beginning positions. This stark reduction helps explain why aligned models often appear less sensitive to decoding strategies. Building on this insight, we find this consistency has surprising implications for complex reasoning. Aligned Chain-of-Thought (CoT) models (e.g., DeepSeek-distilled models), for instance, leverage this effect; by generating longer reasoning chains, they push generation into later, more deterministic (lower BF) stages, resulting in more stable outputs. We hypothesize that alignment tuning does not fundamentally change a model's behavior, but instead steers it toward stylistic tokens (e.g., "Sure") that unlock low-entropy trajectories already present in the base model. This view is supported by nudging experiments, which show prompting base models with such tokens can similarly reduce BF. Together, our findings establish BF as a powerful diagnostic for understanding and controlling LLM outputs - clarifying how alignment reduces variability, how CoT promotes stable generations, and how base models can be steered away from diversity.


2506.15682 2026-03-04 cs.CV

Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model

Anirud Aggarwal, Abhinav Shrivastava, Matthew Gwilliam

Comments 39 pages, 29 figures, 15 tables. Accepted at ICLR 2026. Project page and code: https://research.aniaggarwal.com/ecad

2506.13259 2026-03-04 cs.LG math.OC

An Explainable and Interpretable Composite Indicator Based on Decision Rules

Salvatore Corrente, Salvatore Greco, Roman Słowiński, Silvano Zappalà

Details

DOI: 10.1016/j.omega.2026.103513

Abstract

Composite indicators are widely used to score or classify units evaluated on multiple criteria. Their construction typically involves aggregating criteria evaluations, a common practice in Multiple Criteria Decision Aiding (MCDA). Beyond producing a final score or classification, however, ensuring explainability, interpretability, and transparency is crucial. This paper proposes a novel framework for constructing explainable and interpretable composite indicators using if-then decision rules. We explore four scenarios: (i) decision rules explaining classifications derived from the sum of ordinal indicator codes; (ii) interpretation of an opaque numerical composite indicator used to classify units into quantiles; (iii) construction of a composite indicator from decision-maker preference information, given as classifications of reference units; and (iv) explanation of classifications generated by an existing MCDA method. To induce the rules from scored or classified units, we apply the Dominance-based Rough Set Approach. The resulting rules relate class assignments or scores to threshold conditions on indicator values in a clear and intelligible way, clarifying the underlying rationale and supporting the assessment of new units. Our main methodological contribution is the introduction of a decision-rule-based framework for constructing composite indicators. Moreover, the framework extends naturally to continuous composite indicators by treating each distinct score as an ordered class. This is enabled by a new algorithm that efficiently induces all minimal rules in a single run. Although this may yield many rules, explainability is preserved by showing only those satisfied by the unit of interest. Finally, the methodology can handle datasets with missing values, enhancing its practical applicability.


2506.11103 2026-03-04 cs.CL cs.AI

You Only Fine-tune Once: Many-Shot In-Context Fine-Tuning for Large Language Models

Wenchong He, Liqian Peng, Zhe Jiang, Alex Go

Comments 20 pages, 6 figures, 12 tables

2506.08862 2026-03-04 cs.CV cs.LG

StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams

Zike Wu, Qi Yan, Xuanyu Yi, Lele Wang, Renjie Liao

Comments Accepted by ICLR 2026, Project page: https://streamsplat3d.github.io/

2506.07218 2026-03-04 cs.LG cs.AI cs.CV

Perception-R1: Advancing Multimodal Reasoning Capabilities of MLLMs via Visual Perception Reward

Tong Xiao, Xin Xu, Zhenya Huang, Hongyu Gao, Quan Liu, Qi Liu, Enhong Chen

2506.05334 2026-03-04 cs.CL cs.IR cs.LG

Search Arena: Analyzing Search-Augmented LLMs

Mihran Miroyan, Tsung-Han Wu, Logan King, Tianle Li, Jiayi Pan, Xinyan Hu, Wei-Lin Chiang, Anastasios N. Angelopoulos, Trevor Darrell, Narges Norouzi, Joseph E. Gonzalez

Comments Accepted to ICLR 2026. Code: https://github.com/lmarena/search-arena. Dataset: https://huggingface.co/datasets/lmarena-ai/search-arena-24k

2506.03533 2026-03-04 cs.CL

Go-Browse: Training Web Agents with Structured Exploration

Apurva Gandhi, Graham Neubig

2506.03230 2026-03-04 cs.LG cs.AI cs.CL math.OC

DiaBlo: Diagonal Blocks Are Sufficient For Finetuning

Selcuk Gurses, Aozhong Zhang, Yanxia Deng, Xun Dong, Xin Li, Naigang Wang, Penghang Yin, Zi Yang

Comments Accepted by ICLR 2026

2506.02950 2026-03-04 cs.LG cs.AI cs.CV

Interaction Field Matching: Overcoming Limitations of Electrostatic Models

Stepan I. Manukhov, Alexander Kolesov, Vladimir V. Palyulin, Alexander Korotin

2506.01502 2026-03-04 cs.LG cs.AI stat.ML

Learning of Population Dynamics: Inverse Optimization Meets JKO Scheme

Mikhail Persiianov, Jiawei Chen, Petr Mokrov, Alexander Tyurin, Evgeny Burnaev, Alexander Korotin

2506.01153 2026-03-04 cs.LG

Weight-Space Linear Recurrent Neural Networks

Roussel Desmond Nzoyem, Nawid Keshtmand, Enrique Crespo Fernandez, Idriss Tsayem, Raul Santos-Rodriguez, David A. W. Barton, Tom Deakin

Comments Accepted as a main track publication at ICLR 2026. Contains 40 pages, 23 figures, and 16 tables

2505.22499 2026-03-04 cs.CV

SABER: Spatially Consistent 3D Universal Adversarial Objects for BEV Detectors

Aixuan Li, Mochu Xiang, Bosen Hou, Zhexiong Wan, Jing Zhang, Yuchao Dai

Comments Accepted to CVPR 2026

2505.20934 2026-03-04 cs.LG

NatADiff: Adversarial Boundary Guidance for Natural Adversarial Diffusion

Max Collins, Jordan Vice, Tim French, Ajmal Mian

Comments 10 pages, 3 figures, 2 tables

2505.17561 2026-03-04 cs.CV cs.AI

Model Already Knows the Best Noise: Bayesian Active Noise Selection via Attention in Video Diffusion Model

Kwanyoung Kim, Sanghyun Kim

Comments Cam ready version of ICLR 26

2505.14899 2026-03-04 cs.RO cs.CL

REFLEX: Metacognitive Reasoning for Reflective Zero-Shot Robotic Planning with Large Language Models

Wenjie Lin, Jin Wei-Kocsis, Jiansong Zhang, Byung-Cheol Min, Dongming Gan, Paul Asunda, Ragu Athinarayanan

2505.13909 2026-03-04 cs.AI cs.CL cs.LG

Efficient Agent Training for Computer Use

Yanheng He, Jiahe Jin, Pengfei Liu

Comments ICLR 2026

2505.13614 2026-03-04 cs.LG stat.ML

Deterministic Bounds and Random Estimates of Metric Tensors on Neuromanifolds

Ke Sun

Comments Published at the Fourteenth International Conference on Learning Representations (ICLR 2026)

2505.13180 2026-03-04 cs.AI

ViPlan: A Benchmark for Visual Planning with Symbolic Predicates and Vision-Language Models

Matteo Merler, Nicola Dainese, Minttu Alakuijala, Giovanni Bonetta, Pietro Ferrazzi, Yu Tian, Bernardo Magnini, Pekka Marttinen

Comments 8 pages, 5 figures and 1 table in the main text; 50 pages, 16 figures and 19 tables including supplementary material

2505.02156 2026-03-04 cs.CL cs.AI cs.LG

Adaptive Social Learning via Mode Policy Optimization for Language Agents

Minzheng Wang, Yongbin Li, Haobo Wang, Xinghua Zhang, Nan Xu, Bingli Wu, Fei Huang, Haiyang Yu, Wenji Mao

Comments Proceedings of ICLR 2026. The code and data are available, see https://github.com/MozerWang/AMPO

2504.21023 2026-03-04 cs.CL cs.AI cs.LG

Param$Δ$ for Direct Weight Mixing: Post-Train Large Language Model at Zero Cost

Sheng Cao, Mingrui Wu, Karthik Prasad, Yuandong Tian, Zechun Liu

Comments Published as a conference paper at ICLR 2025

Journal ref ICLR 2025

2503.22165 2026-03-04 cs.LG

Landscape of Thoughts: Visualizing the Reasoning Process of Large Language Models

Zhanke Zhou, Zhaocheng Zhu, Xuan Li, Mikhail Galkin, Xiao Feng, Sanmi Koyejo, Jian Tang, Bo Han