arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2603.28809 2026-04-01 cs.DB cs.AI cs.LG

WAter: A Workload-Adaptive Knob Tuning System based on Workload Compression

Yibo Wang, Jiale Lao, Chen Zhang, Cehua Yang, Jianguo Wang, Mingjie Tang

2603.28804 2026-04-01 physics.ins-det cs.LG hep-ex nucl-ex

Generalizable Foundation Models for Calorimetry via Mixtures-of-Experts and Parameter Efficient Fine Tuning

Carlos Cardona-Giraldo, Cristiano Fanelli, James Giroux, Cole Granger, Benjamin Nachman, Gerald Sabin

Comments 18 pages, 11 figures, 1 table

2603.28798 2026-04-01 cs.CR cs.AI

Design and Development of an ML/DL Attack Resistance of RC-Based PUF for IoT Security

Joy Acharya, Smit Patel, Paawan Sharma, Mohendra Roy

Comments This paper has been accepted for the IEEE GCON 2026 conference, organized by IIT Guwahati

2603.28796 2026-04-01 cs.LO cs.AI

GaloisSAT: Differentiable Boolean Satisfiability Solving via Finite Field Algebra

Curie Kim, Carsten Portner, Mingju Liu, Steve Dai, Haoxing Ren, Brucek Khailany, Alvaro Velasquez, Ismail Alkhouri, Cunxi Yu

2603.28795 2026-04-01 cs.OS cs.AI cs.CL cs.DC

StepCache: Step-Level Reuse with Lightweight Verification and Selective Patching for LLM Serving

Azam Nouri

Comments 9 pages, 1 figure

2603.28790 2026-04-01 cs.DC cs.LG

Mitigating Temporal Blindness in Kubernetes Autoscaling: An Attention-Double-LSTM Framework

Faraz Shaikh, Gianluca Reali, Mauro Femminella

Comments Submitted for journal publication

2603.28787 2026-04-01 eess.SP cs.AI cs.CE cs.DC cs.HC cs.LG

Smartphone-Based Identification of Unknown Liquids via Active Vibration Sensing

Yongzhi Huang

Comments Conference on Mobile Computing and Networking (MobiCom),10 pages, 5 figures

Journal ref Proc. of the 27th Annual International Conference on Mobile Computing and Networking (MobiCom 2021), pages 174-187, 2021

2603.28786 2026-04-01 cs.CY cs.AI stat.AP

AI in Work-Based Learning: Understanding the Purposes and Effects of Intelligent Tools Among Student Interns

John Paul P. Miranda, Rhiziel P. Manalese, Sheila M. Geronimo, Vernon Grace M. Maniago, Charlie K. Padilla, Aileen P. De Leon, Santa L. Merle, Mark Anthony A. Castro

Comments 5 pages, 2 tables, conference proceedings

Journal ref 2025 International Workshop on Artificial Intelligence and Education (2026) 411-415

2603.28784 2026-04-01 eess.SP cs.AI

A Multi-Modal Dataset for Ground Reaction Force Estimation Using Consumer Wearable Sensors

Parvin Ghaffarzadeh, Debarati Chakraborty, Koorosh Aslansefat, Ali Dostan, Yiannis Papadopoulos

2603.28780 2026-04-01 cs.DC cs.AI

Byzantine-Robust and Communication-Efficient Distributed Training: Compressive and Cyclic Gradient Coding

Chengxi Li, Youssef Allouah, Rachid Guerraoui, Mikael Skoglund, Ming Xiao

2603.28774 2026-04-01 cs.HC cs.AI cs.MM

Focus360: Guiding User Attention in Immersive Videos for VR

Paulo Vitor S. Silva, Lucas L. Neves, Rafael A. Goiás, Diogo F. C. Silva, Rafael T. Sousa, Arlindo R. Galvão Filho

Comments 2025 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops (VRW)

2603.28773 2026-04-01 cs.IR cs.CL cs.LG

UltRAG: a Universal Simple Scalable Recipe for Knowledge Graph RAG

Dobrik Georgiev, Kheeran Naidu, Alberto Cattaneo, Federico Monti, Carlo Luschi, Daniel Justus

2603.28769 2026-04-01 cs.DC cs.CL cs.LG

Spark-LLM-Eval: A Distributed Framework for Statistically Rigorous Large Language Model Evaluation

Subhadip Mitra

Comments 16 pages, 2 figures, 6 tables. Open source: https://github.com/bassrehab/spark-llm-eval. Cross-list requested: cs.CL, cs.LG

2603.24750 2026-04-01 cs.IR cs.AI cs.LG

Pseudo Label NCF for Sparse OHC Recommendation: Dual Representation Learning and the Separability Accuracy Trade off

Pronob Kumar Barman, Tera L. Reynolds, James Foulds

2603.24436 2026-04-01 cs.NE cs.AI cs.LG cs.SC

Enes Causal Discovery

Alexis Kafantaris

2603.24176 2026-04-01 eess.IV cs.CV q-bio.NC

Modeling Spatiotemporal Neural Frames for High Resolution Brain Dynamic

Wanying Qu, Jianxiong Gao, Wei Wang, Yanwei Fu

Comments CVPR 2026

2603.23171 2026-04-01 cs.CR cs.AI cs.CY cs.LG

Robust Safety Monitoring of Language Models via Activation Watermarking

Toluwani Aremu, Daniil Ognev, Samuele Poppi, Nils Lukas

Comments 23 pages, 19 figures

2603.22779 2026-04-01 cs.IR cs.AI cs.LG

KARMA: Knowledge-Action Regularized Multimodal Alignment for Personalized Search at Taobao

Zhi Sun, Wenming Zhang, Yi Wei, Liren Yu, Zhixuan Zhang, Dan Ou, Haihong Tang

2603.22519 2026-04-01 cs.SE cs.AI cs.PL

LLMON: An LLM-native Markup Language to Leverage Structure and Semantics at the LLM Interface

Michael Hind, Basel Shbita, Bo Wu, Farhan Ahmed, Chad DeLuca, Nathan Fulton, David Cox, Dan Gutfreund

Comments 28 pages

2603.20004 2026-04-01 cs.DB cs.CL

ReViSQL: Achieving Human-Level Text-to-SQL

Yuxuan Zhu, Tengjun Jin, Yoojin Choi, Daniel Kang

详情

英文摘要

Translating natural language to SQL (Text-to-SQL) is a critical challenge in both database research and data analytics applications. Recent efforts have focused on enhancing SQL reasoning by developing large language models and AI agents that decompose Text-to-SQL tasks into manually designed, step-by-step pipelines. However, despite these extensive architectural engineering efforts, a significant gap remains: even state-of-the-art (SOTA) AI agents have not yet achieved the human-level accuracy on the BIRD benchmark. In this paper, we show that closing this gap does not require further architectural complexity, but rather clean training data to improve SQL reasoning of the underlying models. We introduce ReViSQL, a streamlined framework that achieves human-level accuracy on BIRD for the first time. Instead of complex AI agents, ReViSQL leverages reinforcement learning with verifiable rewards (RLVR) on BIRD-Verified, a dataset we curated comprising 2.5k verified Text-to-SQL instances based on the BIRD Train set. To construct BIRD-Verified, we design a data correction and verification workflow involving SQL experts. We identified and corrected data errors in 61.1% of a subset of BIRD Train. By training on BIRD-Verified, we show that improving data quality alone boosts the single-generation accuracy by 8.2-13.9% under the same RLVR algorithm. To further enhance performance, ReViSQL performs inference-time scaling via execution-based reconciliation and majority voting. Empirically, we demonstrate the superiority of our framework with two model scales: ReViSQL-235B-A22B and ReViSQL-30B-A3B. On an expert-verified BIRD Mini-Dev set, ReViSQL-235B-A22B achieves 93.2% execution accuracy, exceeding the proxy human-level accuracy (92.96%) and outperforming the prior open-source SOTA method by 9.8%. Our lightweight ReViSQL-30B-A3B matches the prior SOTA at a 7.5$\times$ lower per-query cost.

URL PDF HTML ☆

赞 0 踩 0

2603.16949 2026-04-01 cs.NI cs.LG cs.SY eess.SY

Entropy-Aware Task Offloading in Mobile Edge Computing

Mohsen Sahraei Ardakani, Hong Wan, Rui Song

Comments 13 pages, submitted to Journal of Blockchain Research

2603.10062 2026-04-01 cs.AR cs.AI cs.MA

Multi-Agent Memory from a Computer Architecture Perspective: Visions and Challenges Ahead

Zhongming Yu, Naicheng Yu, Hejia Zhang, Wentao Ni, Mingrui Yin, Jiaying Yang, Yujie Zhao, Jishen Zhao

2602.20206 2026-04-01 cs.SE cs.AI cs.CY cs.ET cs.MA

Mitigating "Epistemic Debt" in Generative AI-Scaffolded Novice Programming using Metacognitive Scripts

Sreecharan Sankaranarayanan

详情

英文摘要

The democratization of Large Language Models has given rise to vibe coding, where novice programmers prioritize semantic intent over syntactic implementation. Without pedagogical guardrails, we argue this is fundamentally misaligned with cognitive skill acquisition. Drawing on Kirschner's distinction between cognitive offloading and outsourcing, unrestricted AI encourages novices to outsource the intrinsic cognitive load required for schema formation rather than merely offloading extraneous load. This accumulation of epistemic debt creates fragile experts: developers whose high functional utility masks critically low corrective competence. To quantify and mitigate this debt, we conducted a between-subjects experiment (N=78) using a custom Cursor IDE plugin backed by Claude 3.5 Sonnet. Participants were recruited via Prolific and UserInterviews.com to represent AI-native learners. We compared three conditions: manual (control), unrestricted AI (outsourcing), and scaffolded AI (offloading). The scaffolded condition employed a novel Explanation Gate -- a real-time LLM-as-a-Judge framework enforcing a teach-back protocol before generated code could be integrated. Results reveal a collapse of competence: both AI groups significantly outperformed the manual control on functional utility (p < .001) and did not differ from each other (p = .64), yet unrestricted AI users suffered a 77% failure rate on a subsequent 30-minute AI-blackout maintenance task, vs. only 39% in the scaffolded group. Qualitative analysis suggests successful vibe coders naturally self-scaffold, treating AI as a consultant rather than a contractor. We discuss implications for AI-generated software maintainability and propose that future learning systems must enforce metacognitive friction to prevent mass production of unmaintainable code. Replication package: https://github.com/sreecharansankaranarayanan/vibecheck

URL PDF HTML ☆

赞 0 踩 0

2602.10149 2026-04-01 cs.CR cs.AI

Semantic Labeling for Third-Party Cybersecurity Risk Assessment: A Semi-Supervised Approach to Intent-Aware Question Retrieval

Ali Nour Eldin, Mohamed Sellami, Mehdi Acheli, Walid Gaaloul, Julien Steunou

2601.19066 2026-04-01 cs.SE cs.AI

Dynamic Cogeneration of Bug Reproduction Test in Agentic Program Repair

Runxiang Cheng, Michele Tufano, José Cambronero, Renyao Wei, Sherry Shi, Grant Uy, Pat Rondon, Franjo Ivančić

2601.11691 2026-04-01 eess.IV cs.LG q-bio.QM

Explainable histomorphology-based survival prediction of glioblastoma, IDH-wildtype

Jan-Philipp Redlich, Friedrich Feuerhake, Stefan Nikolin, Nadine Sarah Schaadt, Sarah Teuber-Hanselmann, Joachim Weis, Sabine Luttmann, Andrea Eberle, Christoph Buck, Timm Intemann, Pascal Birnstill, Klaus Kraywinkel, Jonas Ort, Peter Boor, André Homeyer

2512.16081 2026-04-01 cs.HC cs.AI cs.MA

Evaluation of Generative Models for Emotional 3D Animation Generation in VR

Kiran Chhatre, Renan Guarese, Andrii Matviienko, Christopher Peters

Comments 20 pages, 5 figures. Webpage: https://emotional3dhumans.github.io/

详情

DOI: 10.3389/fcomp.2025.1598099

英文摘要

Social interactions incorporate nonverbal signals to convey emotions alongside speech, including facial expressions and body gestures. Generative models have demonstrated promising results in creating full-body nonverbal animations synchronized with speech; however, evaluations using statistical metrics in 2D settings fail to fully capture user-perceived emotions, limiting our understanding of model effectiveness. To address this, we evaluate emotional 3D animation generative models within a Virtual Reality (VR) environment, emphasizing user-centric metrics emotional arousal realism, naturalness, enjoyment, diversity, and interaction quality in a real-time human-agent interaction scenario. Through a user study (N=48), we examine perceived emotional quality for three state of the art speech-driven 3D animation methods across two emotions happiness (high arousal) and neutral (mid arousal). Additionally, we compare these generative models against real human expressions obtained via a reconstruction-based method to assess both their strengths and limitations and how closely they replicate real human facial and body expressions. Our results demonstrate that methods explicitly modeling emotions lead to higher recognition accuracy compared to those focusing solely on speech-driven synchrony. Users rated the realism and naturalness of happy animations significantly higher than those of neutral animations, highlighting the limitations of current generative models in handling subtle emotional states. Generative models underperformed compared to reconstruction-based methods in facial expression quality, and all methods received relatively low ratings for animation enjoyment and interaction quality, emphasizing the importance of incorporating user-centric evaluations into generative model development. Finally, participants positively recognized animation diversity across all generative models.

URL PDF HTML ☆

赞 0 踩 0

2512.05411 2026-04-01 cs.IR cs.AI

A Systematic Framework for Enterprise Knowledge Retrieval: Leveraging LLM-Generated Metadata to Enhance RAG Systems

Pranav Pushkar Mishra, Kranti Prakash Yeole, Ramyashree Keshavamurthy, Mokshit Bharat Surana, Fatemeh Sarayloo

Comments Accepted to 2026 IEEE Conference on Artificial Intelligence (CAI). 8 pages, 1 figures, 9 tables

2511.17744 2026-04-01 eess.IV cs.CV

Robust Detection of Retinal Neovascularization in Widefield Optical Coherence Tomography

Jinyi Hao, Jie Wang, Liqin Gao, Tristan T. Hormel, Yukun Guo, An-Lun Wu, Christina J. Flaxel, Steven T. Bailey, Kotaro Tsuboi, Thomas S. Hwang, Yali Jia

Comments 21 pages, 12 figures. Submitted to Optica. Corresponding author: Yali Jia

Journal ref Optica 13(4), 628-641 (2026)

2511.03849 2026-04-01 cs.IT cs.LG math.IT q-bio.PE

Which Similarity-Sensitive Entropy (Sentropy)?

Phuc Nguyen, Josiah Couch, Rahul Bansal, Alexandra Morgan, Chris Tam, Miao Li, Rima Arnaout, Ramy Arnaout

Comments 17 pages, two columns, 9 figures

AI 大模型

视觉与机器人

科学与医疗

WAter: A Workload-Adaptive Knob Tuning System based on Workload Compression

Generalizable Foundation Models for Calorimetry via Mixtures-of-Experts and Parameter Efficient Fine Tuning

Design and Development of an ML/DL Attack Resistance of RC-Based PUF for IoT Security

GaloisSAT: Differentiable Boolean Satisfiability Solving via Finite Field Algebra

StepCache: Step-Level Reuse with Lightweight Verification and Selective Patching for LLM Serving

Mitigating Temporal Blindness in Kubernetes Autoscaling: An Attention-Double-LSTM Framework

Smartphone-Based Identification of Unknown Liquids via Active Vibration Sensing

AI in Work-Based Learning: Understanding the Purposes and Effects of Intelligent Tools Among Student Interns

A Multi-Modal Dataset for Ground Reaction Force Estimation Using Consumer Wearable Sensors

Byzantine-Robust and Communication-Efficient Distributed Training: Compressive and Cyclic Gradient Coding

Focus360: Guiding User Attention in Immersive Videos for VR

UltRAG: a Universal Simple Scalable Recipe for Knowledge Graph RAG

Spark-LLM-Eval: A Distributed Framework for Statistically Rigorous Large Language Model Evaluation

Pseudo Label NCF for Sparse OHC Recommendation: Dual Representation Learning and the Separability Accuracy Trade off

Enes Causal Discovery

Modeling Spatiotemporal Neural Frames for High Resolution Brain Dynamic

Robust Safety Monitoring of Language Models via Activation Watermarking

KARMA: Knowledge-Action Regularized Multimodal Alignment for Personalized Search at Taobao

LLMON: An LLM-native Markup Language to Leverage Structure and Semantics at the LLM Interface

ReViSQL: Achieving Human-Level Text-to-SQL

Entropy-Aware Task Offloading in Mobile Edge Computing

Multi-Agent Memory from a Computer Architecture Perspective: Visions and Challenges Ahead

Mitigating "Epistemic Debt" in Generative AI-Scaffolded Novice Programming using Metacognitive Scripts

Semantic Labeling for Third-Party Cybersecurity Risk Assessment: A Semi-Supervised Approach to Intent-Aware Question Retrieval

Dynamic Cogeneration of Bug Reproduction Test in Agentic Program Repair

Explainable histomorphology-based survival prediction of glioblastoma, IDH-wildtype

Evaluation of Generative Models for Emotional 3D Animation Generation in VR

A Systematic Framework for Enterprise Knowledge Retrieval: Leveraging LLM-Generated Metadata to Enhance RAG Systems

Robust Detection of Retinal Neovascularization in Widefield Optical Coherence Tomography

Which Similarity-Sensitive Entropy (Sentropy)?