arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2503.21288 2026-02-06 cs.RO

Haptic bilateral teleoperation system for free-hand dental procedures

Lorenzo Pagliara, Vincenzo Petrone, Enrico Ferrentino, Andrea Chiacchio, Giovanni Russo

Comments 13 pages, 8 figures

2502.11655 2026-02-06 cs.CV

TextOCVP: Object-Centric Video Prediction with Language Guidance

Angel Villar-Corrales, Gjergj Plepi, Sven Behnke

Comments Published at TMLR 02/2026

2502.07244 2026-02-06 cs.LG cs.AI stat.ML

Linear Transformers as VAR Models: Aligning Autoregressive Attention Mechanisms with Autoregressive Forecasting

Jiecheng Lu, Shihao Yang

Comments Camera-ready version. Accepted at ICML 2025

Journal ref Proceedings of the Forty-second International Conference on Machine Learning (ICML 2025)

2502.03740 2026-02-06 cs.LG cs.AI

Multiple Invertible and Partial-Equivariant Function for Latent Vector Transformation to Enhance Disentanglement in VAEs

Hee-Jun Jung, Jaehyoung Jeong, Kangil Kim

Comments Accepted in AISTATS 2026

2502.01411 2026-02-06 cs.CV

Human Body Restoration with One-Step Diffusion Model and A New Benchmark

Jue Gong, Jingkai Wang, Zheng Chen, Xing Liu, Hong Gu, Yulun Zhang, Xiaokang Yang

Comments 8 pages, 9 figures. Accepted at ICML 2025

2412.09191 2026-02-06 cs.CV

RAD: Region-Aware Diffusion Models for Image Inpainting

Sora Kim, Sungho Suh, Minsik Lee

Comments Code: https://github.com/srk1995/RAD

2410.03159 2026-02-06 cs.LG cs.AI stat.ML

WAVE: Weighted Autoregressive Varying Gate for Time Series Forecasting

Jiecheng Lu, Xu Han, Yan Sun, Shihao Yang

Comments Camera-ready version. Accepted at ICML 2025

Journal ref Proceedings of the Forty-second International Conference on Machine Learning (ICML 2025)

2409.15176 2026-02-06 cs.CV

SpikeGS: Learning 3D Gaussian Fields from Continuous Spike Stream

Jinze Yu, Xin Peng, Zhengda Lu, Laurent Kneip, Yiqun Wang

Comments Accepted by ACCV 2024

2408.10463 2026-02-06 cs.SD cs.LG eess.AS

Adversarial training of Keyword Spotting to Minimize TTS Data Overfitting

Hyun Jin Park, Dhruuv Agarwal, Neng Chen, Rentao Sun, Kurt Partridge, Justin Chen, Harry Zhang, Pai Zhu, Jacob Bartel, Kyle Kastner, Gary Wang, Andrew Rosenberg, Quan Wang

Comments to be published in a Workshop at Interspeech 2024, Synthetic Data's Transformative Role in Foundational Speech Models

Journal ref Proc. Synthetic Data's Transformative Role in Foundational Speech Models 2024, 86-90

2407.18879 2026-02-06 cs.SD cs.LG eess.AS

Utilizing TTS Synthesized Data for Efficient Development of Keyword Spotting Model

Hyun Jin Park, Dhruuv Agarwal, Neng Chen, Rentao Sun, Kurt Partridge, Justin Chen, Harry Zhang, Pai Zhu, Jacob Bartel, Kyle Kastner, Gary Wang, Andrew Rosenberg, Quan Wang

Comments to be published in a Workshop at Interspeech 2024, Synthetic Data's Transformative Role in Foundational Speech Models

Journal ref Proc. Synthetic Data's Transformative Role in Foundational Speech Models 2024, 16-20

2404.15617 2026-02-06 cs.LG cs.AI math.OC math.ST stat.TH

A Differential and Pointwise Control Approach to Reinforcement Learning

Minh Nguyen, Chandrajit Bajaj

Comments NeurIPS 2025

2308.02594 2026-02-06 cs.LG cs.AI cs.SE

SMARLA: A Safety Monitoring Approach for Deep Reinforcement Learning Agents

Amirhossein Zolfagharian, Manel Abdellatif, Lionel C. Briand, Ramesh S

Journal ref in IEEE Transactions on Software Engineering, vol. 51, no. 01, pp. 82-105, Jan. 2025

详情

DOI: 10.1109/TSE.2024.3491496

英文摘要

Deep Reinforcement Learning (DRL) has made significant advancements in various fields, such as autonomous driving, healthcare, and robotics, by enabling agents to learn optimal policies through interactions with their environments. However, the application of DRL in safety-critical domains presents challenges, particularly concerning the safety of the learned policies. DRL agents, which are focused on maximizing rewards, may select unsafe actions, leading to safety violations. Runtime safety monitoring is thus essential to ensure the safe operation of these agents, especially in unpredictable and dynamic environments. This paper introduces SMARLA, a black-box safety monitoring approach specifically designed for DRL agents. SMARLA utilizes machine learning to predict safety violations by observing the agent's behavior during execution. The approach is based on Q-values, which reflect the expected reward for taking actions in specific states. SMARLA employs state abstraction to reduce the complexity of the state space, enhancing the predictive capabilities of the monitoring model. Such abstraction enables the early detection of unsafe states, allowing for the implementation of corrective and preventive measures before incidents occur. We quantitatively and qualitatively validated SMARLA on three well-known case studies widely used in DRL research. Empirical results reveal that SMARLA is accurate at predicting safety violations, with a low false positive rate, and can predict violations at an early stage, approximately halfway through the execution of the agent, before violations occur. We also discuss different decision criteria, based on confidence intervals of the predicted violation probabilities, to trigger safety mechanisms aiming at a trade-off between early detection and low false positive rates.

URL PDF HTML ☆

赞 0 踩 0

2602.05997 2026-02-06 stat.ML cs.LG stat.ME

Causal Inference on Stopped Random Walks in Online Advertising

Jia Yuan Yu

2602.05948 2026-02-06 cs.DC cs.DS cs.MA cs.RO

Location-Aware Dispersion on Anonymous Graphs

Himani, Supantha Pandit, Gokarna Sharma

Comments 3 tables, 2 figures, 6 pseudo-codes

2602.05930 2026-02-06 cs.DL cs.AI

Compound Deception in Elite Peer Review: A Failure Mode Taxonomy of 100 Fabricated Citations at NeurIPS 2025

Samar Ansari

2602.05927 2026-02-06 stat.ML cs.LG

Transformers Are Born Biased: Structural Inductive Biases at Random Initialization and Their Practical Consequences

Siquan Li, Yao Tong, Haonan Wang, Tianyang Hu

详情

英文摘要

Transformers underpin modern large language models (LLMs) and are commonly assumed to be behaviorally unstructured at random initialization, with all meaningful preferences emerging only through large-scale training. We challenge this assumption by showing that randomly initialized transformers already exhibit strong and systematic structural biases. In particular, untrained models display extreme token preferences: across random input sequences, certain tokens are predicted with probabilities orders of magnitude larger. We provide a mechanistic explanation for this phenomenon by dissecting the transformer architecture at initialization. We show that extreme token preference arises from a contraction of token representations along a random seed-dependent direction. This contraction is driven by two interacting forces: (i) asymmetric nonlinear activations in MLP sublayers induce global (inter-sequence) representation concentration, and (ii) self-attention further amplifies this effect through local (intra-sequence) aggregation. Together, these mechanisms align hidden representations along a direction determined solely by the random initialization, producing highly non-uniform next-token predictions. Beyond mechanistic insight, we demonstrate that these initialization-induced biases persist throughout training, forming a stable and intrinsic model identity. Leveraging this property, we introduce SeedPrint, a fingerprinting method that can reliably distinguish models that differ only in their random initialization, even after extensive training and under substantial distribution shift. Finally, we identify a fundamental positional discrepancy inherent to the attention mechanism's intra-sequence contraction that is causally linked to the attention-sink phenomenon. This discovery provides a principled explanation for the emergence of sinks and offers a pathway for their control.

URL PDF HTML ☆

赞 0 踩 0

2602.05898 2026-02-06 math.PR cs.LG q-fin.MF

Universal approximation with signatures of non-geometric rough paths

Mihriban Ceylan, Anna P. Kwossek, David J. Prömel

2602.05848 2026-02-06 cs.NE cs.AI cs.CL

DARWIN: Dynamic Agentically Rewriting Self-Improving Network

Henry Jiang

Comments 6 pages, 3 figures, 2 tables

2602.05846 2026-02-06 stat.ML cs.LG

Optimal scaling laws in learning hierarchical multi-index models

Leonardo Defilippis, Florent Krzakala, Bruno Loureiro, Antoine Maillard

2602.05799 2026-02-06 math.OC cs.LG stat.ML

Non-Stationary Inventory Control with Lead Times

Nele H. Amiri, Sean R. Sinclair, Maximiliano Udenio

2602.05798 2026-02-06 stat.ME cs.LG eess.SP stat.ML

Learning False Discovery Rate Control via Model-Based Neural Networks

Arnau Vilella, Jasin Machkour, Michael Muma, Daniel P. Palomar

Comments Accepted to IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2026

2602.05780 2026-02-06 cs.SE cs.AI

Automated Customization of LLMs for Enterprise Code Repositories Using Semantic Scopes

Ulrich Finkler, Irene Manotas, Wei Zhang, Geert Janssen, Octavian Popescu, Shyam Ramji

2602.05767 2026-02-06 hep-ex cs.LG

PMT Waveform Simulation and Reconstruction with Conditional Diffusion Network

Kainan Liu, Jingyu Huang, Guihong Huang, Jianyi Luo

2602.05738 2026-02-06 eess.IV cs.CV

Disc-Centric Contrastive Learning for Lumbar Spine Severity Grading

Sajjan Acharya, Pralisha Kansakar

2602.05734 2026-02-06 cs.IR cs.AI

Evaluating the impact of word embeddings on similarity scoring in practical information retrieval

Niall McCarroll, Kevin Curran, Eugene McNamee, Angela Clist, Andrew Brammer

2602.05712 2026-02-06 cs.SE cs.AI

Towards Green AI: Decoding the Energy of LLM Inference in Software Development

Lola Solovyeva, Fernando Castor

2602.05710 2026-02-06 cs.CY cs.CL cs.CV cs.LG

Ethology of Latent Spaces

Philippe Boisnard

Comments 23. pages, 14 figures, presented Hyperheritage International Symposium 9 ( https://paragraphe.univ-paris8.fr/IMG/pdf/programme_colloque_his9_campuscondorcet_v3.pdf ) and accepted for publication in double-blind peer review in French in 2026-2027

详情

英文摘要

This study challenges the presumed neutrality of latent spaces in vision language models (VLMs) by adopting an ethological perspective on their algorithmic behaviors. Rather than constituting spaces of homogeneous indeterminacy, latent spaces exhibit model-specific algorithmic sensitivities, understood as differential regimes of perceptual salience shaped by training data and architectural choices. Through a comparative analysis of three models (OpenAI CLIP, OpenCLIP LAION, SigLIP) applied to a corpus of 301 artworks (15th to 20th), we reveal substantial divergences in the attribution of political and cultural categories. Using bipolar semantic axes derived from vector analogies (Mikolov et al., 2013), we show that SigLIP classifies 59.4% of the artworks as politically engaged, compared to only 4% for OpenCLIP. African masks receive the highest political scores in SigLIP while remaining apolitical in OpenAI CLIP. On an aesthetic colonial axis, inter-model discrepancies reach 72.6 percentage points. We introduce three operational concepts: computational latent politicization, describing the emergence of political categories without intentional encoding; emergent bias, irreducible to statistical or normative bias and detectable only through contrastive analysis; and three algorithmic scopic regimes: entropic (LAION), institutional (OpenAI), and semiotic (SigLIP), which structure distinct modes of visibility. Drawing on Foucault's notion of the archive, Jameson's ideologeme, and Simondon's theory of individuation, we argue that training datasets function as quasi-archives whose discursive formations crystallize within latent space. This work contributes to a critical reassessment of the conditions under which VLMs are applied to digital art history and calls for methodologies that integrate learning architectures into any delegation of cultural interpretation to algorithmic agents.

URL PDF HTML ☆

赞 0 踩 0

2602.05708 2026-02-06 cs.DB cs.CL

Cost-Efficient RAG for Entity Matching with LLMs: A Blocking-based Exploration

Chuangtao Ma, Zeyu Zhang, Arijit Khan, Sebastian Schelter, Paul Groth

2602.05702 2026-02-06 cond-mat.mtrl-sci cs.LG

Broken neural scaling laws in materials science

Max Großmann, Malte Grunert, Erich Runge

2602.05644 2026-02-06 eess.SY cs.LG cs.SY

UAV Trajectory Optimization via Improved Noisy Deep Q-Network

Zhang Hengyu, Maryam Cheraghy, Liu Wei, Armin Farhadi, Meysam Soltanpour, Zhong Zhuoqing

AI 大模型

视觉与机器人

科学与医疗

Haptic bilateral teleoperation system for free-hand dental procedures

TextOCVP: Object-Centric Video Prediction with Language Guidance

Linear Transformers as VAR Models: Aligning Autoregressive Attention Mechanisms with Autoregressive Forecasting

Multiple Invertible and Partial-Equivariant Function for Latent Vector Transformation to Enhance Disentanglement in VAEs

Human Body Restoration with One-Step Diffusion Model and A New Benchmark

RAD: Region-Aware Diffusion Models for Image Inpainting

WAVE: Weighted Autoregressive Varying Gate for Time Series Forecasting

SpikeGS: Learning 3D Gaussian Fields from Continuous Spike Stream

Adversarial training of Keyword Spotting to Minimize TTS Data Overfitting

Utilizing TTS Synthesized Data for Efficient Development of Keyword Spotting Model

A Differential and Pointwise Control Approach to Reinforcement Learning

SMARLA: A Safety Monitoring Approach for Deep Reinforcement Learning Agents

Causal Inference on Stopped Random Walks in Online Advertising

Location-Aware Dispersion on Anonymous Graphs

Compound Deception in Elite Peer Review: A Failure Mode Taxonomy of 100 Fabricated Citations at NeurIPS 2025

Transformers Are Born Biased: Structural Inductive Biases at Random Initialization and Their Practical Consequences

Universal approximation with signatures of non-geometric rough paths

DARWIN: Dynamic Agentically Rewriting Self-Improving Network

Optimal scaling laws in learning hierarchical multi-index models

Non-Stationary Inventory Control with Lead Times

Learning False Discovery Rate Control via Model-Based Neural Networks

Automated Customization of LLMs for Enterprise Code Repositories Using Semantic Scopes

PMT Waveform Simulation and Reconstruction with Conditional Diffusion Network

Disc-Centric Contrastive Learning for Lumbar Spine Severity Grading

Evaluating the impact of word embeddings on similarity scoring in practical information retrieval

Towards Green AI: Decoding the Energy of LLM Inference in Software Development

Ethology of Latent Spaces

Cost-Efficient RAG for Entity Matching with LLMs: A Blocking-based Exploration

Broken neural scaling laws in materials science

UAV Trajectory Optimization via Improved Noisy Deep Q-Network