arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2602.17749 2026-02-23 eess.AS cs.AI cs.CV cs.SD

Detection and Classification of Cetacean Echolocation Clicks using Image-based Object Detection Methods applied to Advanced Wavelet-based Transformations

Christopher Hauer

Comments My Master thesis CLICK-SPOT from 2025

2602.17747 2026-02-23 q-bio.GN cs.LG

AgriVariant: Variant Effect Prediction using DeepChem-Variant for Precision Breeding in Rice

Ankita Vaishnobi Bisoi, Bharath Ramsundar

Comments 8 pages, 7 figures, 5 tables

2602.17745 2026-02-23 eess.SP cs.RO

Driving-Over Detection in the Railway Environment

Tobias Herrmann, Nikolay Chenkov, Florian Stark, Matthias Härter, Martin Köppel

Journal ref The 12th International Conference on Industrial Engineering and Applications (Europe), 2025

2602.17734 2026-02-23 cs.SE cs.AI

Five Fatal Assumptions: Why T-Shirt Sizing Systematically Fails for AI Projects

Raja Soundaramourty, Ozkan Kilic, Ramu Chenchaiah

2602.17732 2026-02-23 eess.AS cs.SD eess.SP

SIRUP: A diffusion-based virtual upmixer of steering vectors for highly-directive spatialization with first-order ambisonics

Emilio Picard, Diego Di Carlo, Aditya Arie Nugraha, Mathieu Fontaine, Kazuyoshi Yoshii

Journal ref ICASSP, May 2026, Barcelone, Spain

2602.17730 2026-02-23 physics.chem-ph cond-mat.mtrl-sci cs.LG

Clever Materials: When Models Identify Good Materials for the Wrong Reasons

Kevin Maik Jablonka

2602.17720 2026-02-23 cs.CY cs.AI

"Everyone's using it, but no one is allowed to talk about it": College Students' Experiences Navigating the Higher Education Environment in a Generative AI World

Yue Fu, Yifan Lin, Yessica Wang, Sarah Tran, Alexis Hiniker

2602.17708 2026-02-23 physics.chem-ph astro-ph.IM cs.LG physics.ao-ph physics.comp-ph physics.plasm-ph

Spectral Homogenization of the Radiative Transfer Equation via Low-Rank Tensor Train Decomposition

Y. Sungtaek Ju

Comments 30 pages; submitted for publication

2602.17701 2026-02-23 eess.SP cs.LG

Deep Neural Network Architectures for Electrocardiogram Classification: A Comprehensive Evaluation

Yun Song, Wenjia Zheng, Tiedan Chen, Ziyu Wang, Jiazhao Shi, Yisong Chen

2602.17687 2026-02-23 cs.IR cs.AI cs.CL cs.LG

IRPAPERS: A Visual Document Benchmark for Scientific Retrieval and Question Answering

Connor Shorten, Augustas Skaburskas, Daniel M. Jones, Charles Pierse, Roberto Esposito, John Trengrove, Etienne Dilocker, Bob van Luijt

Comments 23 pages, 6 figures

详情

英文摘要

AI systems have achieved remarkable success in processing text and relational data, yet visual document processing remains relatively underexplored. Whereas traditional systems require OCR transcriptions to convert these visual documents into text and metadata, recent advances in multimodal foundation models offer retrieval and generation directly from document images. This raises a key question: How do image-based systems compare to established text-based methods? We introduce IRPAPERS, a benchmark of 3,230 pages from 166 scientific papers, with both an image and an OCR transcription for each page. Using 180 needle-in-the-haystack questions, we compare image- and text-based retrieval and question answering systems. Text retrieval using Arctic 2.0 embeddings, BM25, and hybrid text search achieved 46% Recall@1, 78% Recall@5, and 91% Recall@20, while image-based retrieval reaches 43%, 78%, and 93%, respectively. The two modalities exhibit complementary failures, enabling multimodal hybrid search to outperform either alone, achieving 49% Recall@1, 81% Recall@5, and 95% Recall@20. We further evaluate efficiency-performance tradeoffs with MUVERA and assess multiple multi-vector image embedding models. Among closed-source models, Cohere Embed v4 page image embeddings outperform Voyage 3 Large text embeddings and all tested open-source models, achieving 58% Recall@1, 87% Recall@5, and 97% Recall@20. For question answering, text-based RAG systems achieved higher ground-truth alignment than image-based systems (0.82 vs. 0.71), and both benefit substantially from increased retrieval depth, with multi-document retrieval outperforming oracle single-document retrieval. We analyze the complementary limitations of unimodal text and image representations and identify question types that require one modality over the other. The IRPAPERS dataset and all experimental code are publicly available.

URL PDF HTML ☆

赞 0 踩 0

2602.17675 2026-02-23 cs.DC cs.AI

Mind the Boundary: Stabilizing Gemini Enterprise A2A via a Cloud Run Hub Across Projects and Accounts

Takao Morita

Comments 7 pages. Implementation and evaluation study of cross-boundary agent orchestration for Gemini Enterprise UI

2602.17674 2026-02-23 cs.HC cs.CL

Lost Before Translation: Social Information Transmission and Survival in AI-AI Communication

Bijean Ghafouri, Emilio Ferrara

2602.17672 2026-02-23 cs.HC cs.AI cs.CL cs.CR cs.CY

Assessing LLM Response Quality in the Context of Technology-Facilitated Abuse

Vijay Prakash, Majed Almansoori, Donghan Hu, Rahul Chatterjee, Danny Yuxing Huang

2602.17671 2026-02-23 cs.HC cs.AI cs.CL

AI Hallucination from Students' Perspective: A Thematic Analysis

Abdulhadi Shoufan, Ahmad-Azmi-Abdelhamid Esmaeil

详情

英文摘要

As students increasingly rely on large language models, hallucinations pose a growing threat to learning. To mitigate this, AI literacy must expand beyond prompt engineering to address how students should detect and respond to LLM hallucinations. To support this, we need to understand how students experience hallucinations, how they detect them, and why they believe they occur. To investigate these questions, we asked university students three open-ended questions about their experiences with AI hallucinations, their detection strategies, and their mental models of why hallucinations occur. Sixty-three students responded to the survey. Thematic analysis of their responses revealed that reported hallucination issues primarily relate to incorrect or fabricated citations, false information, overconfident but misleading responses, poor adherence to prompts, persistence in incorrect answers, and sycophancy. To detect hallucinations, students rely either on intuitive judgment or on active verification strategies, such as cross-checking with external sources or re-prompting the model. Students' explanations for why hallucinations occur reflected several mental models, including notable misconceptions. Many described AI as a research engine that fabricates information when it cannot locate an answer in its "database." Others attributed hallucinations to issues with training data, inadequate prompting, or the model's inability to understand or verify information. These findings illuminate vulnerabilities in AI-supported learning and highlight the need for explicit instruction in verification protocols, accurate mental models of generative AI, and awareness of behaviors such as sycophancy and confident delivery that obscure inaccuracy. The study contributes empirical evidence for integrating hallucination awareness and mitigation into AI literacy curricula.

URL PDF HTML ☆

赞 0 踩 0

2602.11495 2026-02-23 cs.CR cs.CL

Jailbreaking Leaves a Trace: Understanding and Detecting Jailbreak Attacks from Internal Representations of Large Language Models

Sri Durga Sai Sowmya Kadali, Evangelos E. Papalexakis

详情

英文摘要

Jailbreaking large language models (LLMs) has emerged as a critical security challenge with the widespread deployment of conversational AI systems. Adversarial users exploit these models through carefully crafted prompts to elicit restricted or unsafe outputs, a phenomenon commonly referred to as Jailbreaking. Despite numerous proposed defense mechanisms, attackers continue to develop adaptive prompting strategies, and existing models remain vulnerable. This motivates approaches that examine the internal behavior of LLMs rather than relying solely on prompt-level defenses. In this work, we study jailbreaking from both security and interpretability perspectives by analyzing how internal representations differ between jailbreak and benign prompts. We conduct a systematic layer-wise analysis across multiple open-source models, including GPT-J, LLaMA, Mistral, and the state-space model Mamba, and identify consistent latent-space patterns associated with harmful inputs. We then propose a tensor-based latent representation framework that captures structure in hidden activations and enables lightweight jailbreak detection without model fine-tuning or auxiliary LLM-based detectors. We further demonstrate that the latent signals can be used to actively disrupt jailbreak execution at inference time. On an abliterated LLaMA-3.1-8B model, selectively bypassing high-susceptibility layers blocks 78% of jailbreak attempts while preserving benign behavior on 94% of benign prompts. This intervention operates entirely at inference time and introduces minimal overhead, providing a scalable foundation for achieving stronger coverage by incorporating additional attack distributions or more refined susceptibility thresholds. Our results provide evidence that jailbreak behavior is rooted in identifiable internal structures and suggest a complementary, architecture-agnostic direction for improving LLM security.

URL PDF HTML ☆

赞 0 踩 0

2602.05208 2026-02-23 eess.IV cs.CV

Context-Aware Asymmetric Ensembling for Interpretable Retinopathy of Prematurity Screening via Active Query and Vascular Attention

Md. Mehedi Hassan, Taufiq Hasan

Comments 16 pages, 6 figures

2601.01944 2026-02-23 cs.SE cs.AI cs.CL cs.IR cs.PL

The Invisible Hand of AI Libraries Shaping Open Source Projects and Communities

Matteo Esposito, Andrea Janes, Valentina Lenarduzzi, Davide Taibi

Comments ACCEPTED REGISTERED REPORT AT SANER (CORE A*) 2026

2601.01703 2026-02-23 cs.SI cs.AI cs.DB cs.IR

Beyond Homophily: Community Search on Heterophilic Graphs

Qing Sima, Xiaoyang Wang, Wenjie Zhang

2601.01679 2026-02-23 stat.ML cs.LG

Simplex Deep Linear Discriminant Analysis

Maxat Tezekbayev, Arman Bolatov, Zhenisbek Assylbekov

Comments Accepted at CPAL 2026. Camera-ready version

2511.18555 2026-02-23 stat.ME cs.LG math.DS stat.ML

A joint optimization approach to identifying sparse dynamics using least squares kernel collocation

Alexander W. Hsu, Ike Griss Salas, Jacob M. Stevens-Haas, J. Nathan Kutz, Aleksandr Aravkin, Bamdad Hosseini

2511.05983 2026-02-23 stat.ML cs.LG

Benchmarking of Clustering Validity Measures Revisited

Connor Simpson, Ricardo J. G. B. Campello, Elizabeth Stojanovski

Comments 48 pages, 17 tables, 17 figures

2510.17561 2026-02-23 math.ST cond-mat.dis-nn cs.LG stat.ML stat.TH

Spectral Thresholds in Correlated Spiked Models and Fundamental Limits of Partial Least Squares

Pierre Mergny, Lenka Zdeborová

Comments 24 pages, 4 figures

Journal ref AISTATS 2026

2510.00545 2026-02-23 stat.ML cs.LG

Bayesian Neural Networks for Functional ANOVA model

Seokhun Park, Choeun Kim, Jihu Lee, Yunseop Shin, Insung Kong, Yongdai Kim

2509.02073 2026-02-23 stat.ML cond-mat.dis-nn cond-mat.stat-mech cs.LG physics.soc-ph

Inference in Spreading Processes with Neural-Network Priors

Davide Ghio, Fabrizio Boncoraglio, Lenka Zdeborová

Comments 26 pages, 13 figures

Journal ref Phys. Rev. E 113, 015301, 2026

详情

DOI: 10.1103/8mww-brdk

英文摘要

Stochastic processes on graphs are a powerful tool for modelling complex dynamical systems such as epidemics. A recent line of work focused on the inference problem where one aims to estimate the state of every node at every time, starting from partial observation of a subset of nodes at a subset of times. In these works, the initial state of the process was assumed to be random i.i.d. over nodes. Such an assumption may not be realistic in practice, where one may have access to a set of covariate variables for every node that influence the initial state of the system. In this work, we will assume that the initial state of a node is an unknown function of such covariate variables. Given that functions can be represented by neural networks, we will study a model where the initial state is given by a simple neural network -- notably the single-layer perceptron acting on the known node-wise covariate variables. Within a Bayesian framework, we study how such neural-network prior information enhances the recovery of initial states and spreading trajectories. We derive a hybrid belief propagation and approximate message passing (BP-AMP) algorithm that handles both the spreading dynamics and the information included in the node covariates, and we assess its performance against the estimators that either use only the spreading information or use only the information from the covariate variables. We show that in some regimes, the model can exhibit first-order phase transitions when using a Rademacher distribution for the neural-network weights. These transitions create a statistical-to-computational gap where even the BP-AMP algorithm, despite the theoretical possibility of perfect recovery, fails to achieve it.

URL PDF HTML ☆

赞 0 踩 0

2509.00479 2026-02-23 eess.IV cs.AI cs.LG

A Novel Method to Determine Total Oxidant Concentration Produced by Non-Thermal Plasma Based on Image Processing and Machine Learning

Mirkan Emir Sancak, Unal Sen, Ulker Diler Keris-Sen

Comments 42 pages, 11 figures, 6 tables. Machine learning assisted colorimetric analysis framework for oxidant quantification in non-thermal plasma systems. This paper will be published later on

2508.06118 2026-02-23 q-bio.NC cs.LG

Ensemble-based graph representation of fMRI data for cognitive brain state classification

Daniil Vlasenko, Vadim Ushakov, Alexey Zaikin, Denis Zakharov

2507.17316 2026-02-23 stat.ML cs.LG

Nearly Minimax Discrete Distribution Estimation in Kullback-Leibler Divergence with High Probability

Dirk van der Hoeven, Julia Olkhovskaia, Tim van Erven

2506.20102 2026-02-23 cs.CR cs.LG cs.SY eess.SY

Autonomous Cyber Resilience via a Co-Evolutionary Arms Race within a Fortified Digital Twin Sandbox

Malikussaid, Sutiyo

Comments 6 pages, 2 figures, 4 equations, 1 algorithm, 3 tables, to be published in ISPACS 2025, unabridged version exists as arXiv:2506.20102v1

Journal ref Proc. 2025 Int. Symp. on Intell. Signal Process. and Commun. Syst. (ISPACS), 2025, pp. 1-6

2506.02394 2026-02-23 stat.ME cs.LG

Joint modeling for learning decision-making dynamics in behavioral experiments

Yuan Bian, Xingche Guo, Yuanjia Wang

Journal ref The Annals of Applied Statistics, 19(4): 3372-3393, 2025

详情

DOI: 10.1214/25-AOAS2112

英文摘要

Major depressive disorder (MDD), a leading cause of disability and mortality, is associated with reward-processing abnormalities and concentration issues. Motivated by the probabilistic reward task from the Establishing Moderators and Biosignatures of Antidepressant Response in Clinical Care (EMBARC) study, we propose a novel framework that integrates the reinforcement learning (RL) model and drift-diffusion model (DDM) to jointly analyze reward-based decision-making with response times. To account for emerging evidence suggesting that decision-making may alternate between multiple interleaved strategies, we model latent state switching using a hidden Markov model (HMM). In the ''engaged'' state, decisions follow an RL-DDM, simultaneously capturing reward processing, decision dynamics, and temporal structure. In contrast, in the ''lapsed'' state, decision-making is modeled using a simplified DDM, where specific parameters are fixed to approximate random guessing with equal probability. The proposed method is implemented using a computationally efficient generalized expectation-maximization (EM) algorithm with forward-backward procedures. Through extensive numerical studies, we demonstrate that our proposed method outperforms competing approaches across various reward-generating distributions, under both strategy-switching and non-switching scenarios, as well as in the presence of input perturbations. When applied to the EMBARC study, our framework reveals that MDD patients exhibit lower overall engagement than healthy controls and experience longer decision times when they do engage. Additionally, we show that neuroimaging measures of brain activities are associated with decision-making characteristics in the ''engaged'' state but not in the ''lapsed'' state, providing evidence of brain-behavior association specific to the ''engaged'' state.

URL PDF HTML ☆

赞 0 踩 0

2505.23147 2026-02-23 cs.HC cs.RO

Eye-tracking-Driven Shared Control for Robotic Arms: Wizard of Oz Studies to Assess Design Choices

Anke Fischer-Janzen, Thomas M. Wendt, Daniel Görlich, Kristof Van Laerhoven

Comments Preprint, 23 pages

Journal ref J. Hum.-Robot Interact. (February 2026)

AI 大模型

视觉与机器人

科学与医疗

Detection and Classification of Cetacean Echolocation Clicks using Image-based Object Detection Methods applied to Advanced Wavelet-based Transformations

AgriVariant: Variant Effect Prediction using DeepChem-Variant for Precision Breeding in Rice

Driving-Over Detection in the Railway Environment

Five Fatal Assumptions: Why T-Shirt Sizing Systematically Fails for AI Projects

SIRUP: A diffusion-based virtual upmixer of steering vectors for highly-directive spatialization with first-order ambisonics

Clever Materials: When Models Identify Good Materials for the Wrong Reasons

"Everyone's using it, but no one is allowed to talk about it": College Students' Experiences Navigating the Higher Education Environment in a Generative AI World

Spectral Homogenization of the Radiative Transfer Equation via Low-Rank Tensor Train Decomposition

Deep Neural Network Architectures for Electrocardiogram Classification: A Comprehensive Evaluation

IRPAPERS: A Visual Document Benchmark for Scientific Retrieval and Question Answering

Mind the Boundary: Stabilizing Gemini Enterprise A2A via a Cloud Run Hub Across Projects and Accounts

Lost Before Translation: Social Information Transmission and Survival in AI-AI Communication

Assessing LLM Response Quality in the Context of Technology-Facilitated Abuse

AI Hallucination from Students' Perspective: A Thematic Analysis

Jailbreaking Leaves a Trace: Understanding and Detecting Jailbreak Attacks from Internal Representations of Large Language Models

Context-Aware Asymmetric Ensembling for Interpretable Retinopathy of Prematurity Screening via Active Query and Vascular Attention

The Invisible Hand of AI Libraries Shaping Open Source Projects and Communities

Beyond Homophily: Community Search on Heterophilic Graphs

Simplex Deep Linear Discriminant Analysis

A joint optimization approach to identifying sparse dynamics using least squares kernel collocation

Benchmarking of Clustering Validity Measures Revisited

Spectral Thresholds in Correlated Spiked Models and Fundamental Limits of Partial Least Squares

Bayesian Neural Networks for Functional ANOVA model

Inference in Spreading Processes with Neural-Network Priors

A Novel Method to Determine Total Oxidant Concentration Produced by Non-Thermal Plasma Based on Image Processing and Machine Learning

Ensemble-based graph representation of fMRI data for cognitive brain state classification

Nearly Minimax Discrete Distribution Estimation in Kullback-Leibler Divergence with High Probability

Autonomous Cyber Resilience via a Co-Evolutionary Arms Race within a Fortified Digital Twin Sandbox

Joint modeling for learning decision-making dynamics in behavioral experiments

Eye-tracking-Driven Shared Control for Robotic Arms: Wizard of Oz Studies to Assess Design Choices