arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2510.04067 2026-03-03 cs.LG cs.AI cs.CL

What Scales in Cross-Entropy Scaling Law?

Junxi Yan, Zixi Wei, Qingyao Ai, Yiqun Liu, Jingtao Zhan

2510.04040 2026-03-03 cs.AI

FaithCoT-Bench: Benchmarking Instance-Level Faithfulness of Chain-of-Thought Reasoning

Xu Shen, Song Wang, Zhen Tan, Laura Yao, Xinyu Zhao, Kaidi Xu, Xin Wang, Tianlong Chen

2510.02245 2026-03-03 cs.LG cs.AI cs.CL

ExGRPO: Learning to Reason from Experience

Runzhe Zhan, Yafu Li, Zhi Wang, Xiaoye Qu, Dongrui Liu, Jing Shao, Derek F. Wong, Yu Cheng

Comments ICLR 2026 Camera Ready version

2510.00819 2026-03-03 cs.LG cs.AI

Stabilizing Policy Gradients for Sample-Efficient Reinforcement Learning in LLM Reasoning

Luckeciano C. Melo, Alessandro Abate, Yarin Gal

Comments Published at ICLR 2026

2510.00041 2026-03-03 cs.CV cs.AI

Culture In a Frame: C$^3$B as a Comic-Based Benchmark for Multimodal Culturally Awareness

Yuchen Song, Andong Chen, Wenxin Zhu, Kehai Chen, Xuefeng Bai, Muyun Yang, Tiejun Zhao

2509.26601 2026-03-03 cs.CL cs.AI cs.LG

MENLO: From Preferences to Proficiency -- Evaluating and Modeling Native-like Quality Across 47 Languages

Chenxi Whitehouse, Sebastian Ruder, Tony Lin, Oksana Kurylo, Haruka Takagi, Janice Lam, Nicolò Busetto, Denise Diaz, Francisco Guzmán

Comments ICLR 2026

2509.26544 2026-03-03 cs.LG

Bayesian Influence Functions for Hessian-Free Data Attribution

Philipp Alexander Kreer, Wilson Wu, Maxwell Adam, Zach Furman, Jesse Hoogland

Comments 37 pages, 20 figures, ICLR 2026 - camera-ready version

2509.26455 2026-03-03 cs.CV

Stylos: Multi-View 3D Stylization with Single-Forward Gaussian Splatting

Hanzhou Liu, Jia Huang, Mi Lu, Srikanth Saripalli, Peng Jiang

Comments Accepted by ICLR 2026

2509.26346 2026-03-03 cs.CV cs.AI cs.CL

EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing

Keming Wu, Sicong Jiang, Max Ku, Ping Nie, Minghao Liu, Wenhu Chen

Comments Accepted by ICLR 2026. Project Page: https://tiger-ai-lab.github.io/EditReward

2509.26324 2026-03-03 cs.RO cs.AI cs.MA

COMRES-VLM: Coordinated Multi-Robot Exploration and Search using Vision Language Models

Ruiyang Wang, Hao-Lun Hsu, David Hunt, Jiwoo Kim, Shaocheng Luo, Miroslav Pajic

2509.25678 2026-03-03 cs.LG

Massively Multimodal Foundation Models: A Framework for Capturing Interactions with Specialized Mixture-of-Experts

Xing Han, Hsing-Huan Chung, Joydeep Ghosh, Paul Pu Liang, Suchi Saria

Comments Published at International Conference on Learning Representations (ICLR) 2026 as a conference paper. 28 pages, 16 figures, 10 tables

2509.25532 2026-03-03 cs.CL cs.AI

Calibrating Verbalized Confidence with Self-Generated Distractors

Victor Wang, Elias Stengel-Eskin

Comments ICLR 2026. Code: https://github.com/victorwang37/dinco

2509.25390 2026-03-03 cs.CV cs.AI

SpinBench: Perspective and Rotation as a Lens on Spatial Reasoning in VLMs

Yuyou Zhang, Radu Corcodel, Chiori Hori, Anoop Cherian, Ding Zhao

Comments ICLR 2026

2509.24393 2026-03-03 cs.AI cs.CL

Towards Safe Reasoning in Large Reasoning Models via Corrective Intervention

Yichi Zhang, Yue Ding, Jingwen Yang, Tianwei Luo, Dongbai Li, Ranjie Duan, Qiang Liu, Hang Su, Yinpeng Dong, Jun Zhu

Comments ICLR 2026

2509.24332 2026-03-03 cs.LG cs.AI

Towards Generalizable PDE Dynamics Forecasting via Physics-Guided Invariant Learning

Siyang Li, Yize Chen, Yan Guo, Ming Huang, Hui Xiong

Comments Accepted to ICLR 2026

2509.24203 2026-03-03 cs.LG cs.AI cs.CL

Group-Relative REINFORCE Is Secretly an Off-Policy Algorithm: Demystifying Some Myths About GRPO and Its Friends

Chaorui Yao, Yanxi Chen, Yuchang Sun, Yushuo Chen, Wenhao Zhang, Xuchen Pan, Yaliang Li, Bolin Ding

Comments Accepted to ICLR 2026. arXiv v2 update: add references and experiments

2509.24198 2026-03-03 cs.LG

Negative Pre-activations Differentiate Syntax

Linghao Kong, Angelina Ning, Micah Adler, Nir Shavit

Comments 10 pages, 7 figures

详情

英文摘要

Modern large language models increasingly use smooth activation functions such as GELU or SiLU, allowing negative pre-activations to carry both signal and gradient. Nevertheless, many neuron-level interpretability analyses have historically focused on large positive activations, often implicitly treating the negative region as less informative, a carryover from the ReLU-era. We challenge this assumption and ask whether and how negative pre-activations are leveraged by models. We address this question by studying a sparse subpopulation of Wasserstein neurons whose output distributions deviate strongly from a Gaussian baseline and that functionally differentiate similar inputs. We show that this negative region plays an active role rather than reflecting a mere gradient optimization side effect. A minimal, sign-specific intervention that zeroes only the negative pre-activations of a small set of Wasserstein neurons substantially increases perplexity and sharply degrades grammatical performance on BLiMP and TSE, whereas both random and perplexity-matched ablations of many more non-Wasserstein neurons in their negative pre-activations leave grammatical performance largely intact. Conversely, on a suite of non-grammatical benchmarks, the perplexity-matched control ablation is more damaging than the Wasserstein neuron ablation, yielding a double dissociation between syntax and other capabilities. Part-of-speech analysis localizes the excess surprisal to syntactic scaffolding tokens, layer-specific interventions show that small local degradations accumulate across depth, and training-dynamics analysis reveals that the same sign-specific ablation becomes more harmful as Wasserstein neurons emerge and stabilize. Together, these results identify negative pre-activations in a sparse subpopulation of Wasserstein neurons as an actively used substrate for syntax in smooth-activation language models.

URL PDF HTML ☆

赞 0 踩 0

2509.23993 2026-03-03 cs.CV cs.RO

Advancing Multi-agent Traffic Simulation via R1-Style Reinforcement Fine-Tuning

Muleilan Pei, Shaoshuai Shi, Shaojie Shen

Comments Accepted by ICLR 2026

2509.23624 2026-03-03 cs.CV

DiffInk: Glyph- and Style-Aware Latent Diffusion Transformer for Text to Online Handwriting Generation

Wei Pan, Huiguo He, Hiuyi Cheng, Yilin Shi, Lianwen Jin

Comments Accepted by ICLR 2026

2509.23566 2026-03-03 cs.CV

Towards Interpretable Visual Decoding with Attention to Brain Representations

Pinyuan Feng, Hossein Adeli, Wenxuan Guo, Fan Cheng, Ethan Hwang, Nikolaus Kriegeskorte

Comments Accepted by ICLR 2026

2509.23357 2026-03-03 cs.LG math.OC stat.ML

Landing with the Score: Riemannian Optimization through Denoising

Andrey Kharitenko, Zebang Shen, Riccardo de Santi, Niao He, Florian Doerfler

Comments 41 pages, 9 figures

2509.22611 2026-03-03 cs.LG cs.AI

Quantile Advantage Estimation: Stabilizing RLVR for LLM Reasoning

Junkang Wu, Kexin Huang, Jiancan Wu, An Zhang, Xiang Wang, Xiangnan He

2509.22339 2026-03-03 cs.CV

CircuitSense: A Hierarchical MLLM Benchmark Bridging Visual Comprehension and Symbolic Reasoning in Engineering Design Process

Arman Akbari, Jian Gao, Yifei Zou, Mei Yang, Jinru Duan, Dmitrii Torbunov, Yanzhi Wang, Yihui Ren, Xuan Zhang

2509.22134 2026-03-03 cs.CL cs.AI

Bridging Draft Policy Misalignment: Group Tree Optimization for Speculative Decoding

Shijing Hu, Jingyang Li, Zhihui Lu, Pan Zhou

2509.21950 2026-03-03 cs.CV

Customizing Visual Emotion Evaluation for MLLMs: An Open-vocabulary, Multifaceted, and Scalable Approach

Daiqing Wu, Dongbao Yang, Sicheng Zhao, Can Ma, Yu Zhou

Comments Accepted by ICLR 2026

2509.21835 2026-03-03 cs.LG

On the $ε$-Free Inference Complexity of Absorbing Discrete Diffusion

Xunpeng Huang, Yingyu Lin, Nishant Jain, Kaibo Wang, Difan Zou, Yian Ma, Tong Zhang

Comments 48 pages

2509.21420 2026-03-03 cs.CV

QuadGPT: Native Quadrilateral Mesh Generation with Autoregressive Models

Jian Liu, Chunshi Wang, Song Guo, Haohan Weng, Zhen Zhou, Zhiqi Li, Jiaao Yu, Yiling Zhu, Jing Xu, Biwen Lei, Zhuo Chen, Chunchao Guo

Comments ICLR 2026

2509.21278 2026-03-03 cs.CV cs.AI cs.LG

Does FLUX Already Know How to Perform Physically Plausible Image Composition?

Shilin Lu, Zhuming Lian, Zihan Zhou, Shaocong Zhang, Chen Zhao, Adams Wai-Kin Kong

Comments Accepted by ICLR 2026

2509.21256 2026-03-03 cs.RO

BiNoMaP: Learning Category-Level Bimanual Non-Prehensile Manipulation Primitives

Huayi Zhou, Kui Jia

Comments Under review. The project link is https://hnuzhy.github.io/projects/BiNoMaP

2509.21097 2026-03-03 cs.LG cs.AI

GraphUniverse: Synthetic Graph Generation for Evaluating Inductive Generalization

Louis Van Langendonck, Guillermo Bernárdez, Nina Miolane, Pere Barlet-Ros

Comments Accepted as a conference paper at ICLR 2026

AI 大模型

视觉与机器人

科学与医疗

What Scales in Cross-Entropy Scaling Law?

FaithCoT-Bench: Benchmarking Instance-Level Faithfulness of Chain-of-Thought Reasoning

ExGRPO: Learning to Reason from Experience

Stabilizing Policy Gradients for Sample-Efficient Reinforcement Learning in LLM Reasoning

Culture In a Frame: C$^3$B as a Comic-Based Benchmark for Multimodal Culturally Awareness

MENLO: From Preferences to Proficiency -- Evaluating and Modeling Native-like Quality Across 47 Languages

Bayesian Influence Functions for Hessian-Free Data Attribution

Stylos: Multi-View 3D Stylization with Single-Forward Gaussian Splatting

EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing

COMRES-VLM: Coordinated Multi-Robot Exploration and Search using Vision Language Models

Massively Multimodal Foundation Models: A Framework for Capturing Interactions with Specialized Mixture-of-Experts

Calibrating Verbalized Confidence with Self-Generated Distractors

SpinBench: Perspective and Rotation as a Lens on Spatial Reasoning in VLMs

Towards Safe Reasoning in Large Reasoning Models via Corrective Intervention

Towards Generalizable PDE Dynamics Forecasting via Physics-Guided Invariant Learning

Group-Relative REINFORCE Is Secretly an Off-Policy Algorithm: Demystifying Some Myths About GRPO and Its Friends

Negative Pre-activations Differentiate Syntax

Advancing Multi-agent Traffic Simulation via R1-Style Reinforcement Fine-Tuning

DiffInk: Glyph- and Style-Aware Latent Diffusion Transformer for Text to Online Handwriting Generation

Towards Interpretable Visual Decoding with Attention to Brain Representations

Landing with the Score: Riemannian Optimization through Denoising

Quantile Advantage Estimation: Stabilizing RLVR for LLM Reasoning

CircuitSense: A Hierarchical MLLM Benchmark Bridging Visual Comprehension and Symbolic Reasoning in Engineering Design Process

Bridging Draft Policy Misalignment: Group Tree Optimization for Speculative Decoding

Customizing Visual Emotion Evaluation for MLLMs: An Open-vocabulary, Multifaceted, and Scalable Approach

On the $ε$-Free Inference Complexity of Absorbing Discrete Diffusion

QuadGPT: Native Quadrilateral Mesh Generation with Autoregressive Models

Does FLUX Already Know How to Perform Physically Plausible Image Composition?

BiNoMaP: Learning Category-Level Bimanual Non-Prehensile Manipulation Primitives

GraphUniverse: Synthetic Graph Generation for Evaluating Inductive Generalization