arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2602.13286 2026-02-17 cs.CV cs.AI cs.LG

Explanatory Interactive Machine Learning for Bias Mitigation in Visual Gender Classification

Nathanya Satriani, Djordje Slijepčević, Markus Schedl, Matthias Zeppelzauer

Comments 8 pages, 4 figures, CBMI2025

Journal ref International Conference on Content-Based Multimedia Indexing (2025) 1-8

2602.13283 2026-02-17 cs.AI cs.CY cs.HC

Accuracy Standards for AI at Work vs. Personal Life: Evidence from an Online Survey

Gaston Besanson, Federico Todeschini

2602.13280 2026-02-17 cs.AI

BEAGLE: Behavior-Enforced Agent for Grounded Learner Emulation

Hanchen David Wang, Clayton Cohn, Zifan Xu, Siyuan Guo, Gautam Biswas, Meiyi Ma

Comments paper under submission at IJCAI

2602.13275 2026-02-17 cs.AI cs.CL

Artificial Organisations

William Waites

2602.13274 2026-02-17 cs.AI cs.CL

ProMoral-Bench: Evaluating Prompting Strategies for Moral Reasoning and Safety in LLMs

Rohan Subramanian Thomas, Shikhar Shiromani, Abdullah Chaudhry, Ruizhe Li, Vasu Sharma, Kevin Zhu, Sunishchal Dev

2602.13272 2026-02-17 cs.AI cs.LG

TemporalBench: A Benchmark for Evaluating LLM-Based Agents on Contextual and Event-Informed Time Series Tasks

Muyan Weng, Defu Cao, Wei Yang, Yashaswi Sharma, Yan Liu

2602.13264 2026-02-17 cs.LG cs.AI cs.CL

Directional Concentration Uncertainty: A representational approach to uncertainty quantification for generative models

Souradeep Chattopadhyay, Brendan Kennedy, Sai Munikoti, Soumik Sarkar, Karl Pazdernik

2602.13263 2026-02-17 cs.CL cs.SD eess.AS

Multimodal Consistency-Guided Reference-Free Data Selection for ASR Accent Adaptation

Ligong Lei, Wenwen Lu, Xudong Pang, Zaokere Kadeer, Aishan Wumaier

2602.13262 2026-02-17 cs.AI cs.CL

General learned delegation by clones

Darren Li, Meiqi Chen, Chenze Shao, Fandong Meng, Jie Zhou

Comments Code available at https://github.com/SuffixAutomata/SELFCEST

2602.13259 2026-02-17 cs.SD cs.AI eess.AS

Learning Physiology-Informed Vocal Spectrotemporal Representations for Speech Emotion Recognition

Xu Zhang, Longbing Cao, Runze Yang, Zhangkai Wu

Comments 13 pages, 5 figures

2602.13258 2026-02-17 cs.AI cs.CL cs.MA

MAPLE: A Sub-Agent Architecture for Memory, Learning, and Personalization in Agentic AI Systems

Deepak Babu Piskala

Comments 12 pages, 5 figures. Accepted to ALA Workshop at AAMAS 2026. Code: [](https://github.com/prdeepakbabu/maple-framework)<https://github.com/prdeepakbabu/maple-framework>

2602.13252 2026-02-17 cs.RO cs.NI

DORA: Dataflow Oriented Robotic Architecture

Xiaodong Zhang, Baorui Lv, Xavier Tao, Xiong Wang, Jie Bao, Yong He, Yue Chen, Zijiang Yang

2602.13248 2026-02-17 cs.AI cs.CL cs.RO

X-Blocks: Linguistic Building Blocks of Natural Language Explanations for Automated Vehicles

Ashkan Y. Zadeh, Xiaomeng Li, Andry Rakotonirainy, Ronald Schroeter, Sebastien Glaser, Zishuo Zhu

2602.13240 2026-02-17 cs.AI cs.SE

AST-PAC: AST-guided Membership Inference for Code

Roham Koohestani, Ali Al-Kaswan, Jonathan Katzy, Maliheh Izadi

2602.13237 2026-02-17 cs.AI cs.CL

NL2LOGIC: AST-Guided Translation of Natural Language into First-Order Logic with Large Language Models

Rizky Ramadhana Putra, Raihan Sultan Pasha Basuki, Yutong Cheng, Peng Gao

Comments Accepted to Findings of EACL 2026. 17 pages, 6 figures

2602.13234 2026-02-17 cs.AI

Stay in Character, Stay Safe: Dual-Cycle Adversarial Self-Evolution for Safety Role-Playing Agents

Mingyang Liao, Yichen Wan, shuchen wu, Chenxi Miao, Xin Shen, Weikang Li, Yang Li, Deguo Xia, Jizhou Huang

2602.13230 2026-02-17 cs.AI cs.LG

Intelligence as Trajectory-Dominant Pareto Optimization

Truong Xuan Khanh, Truong Quynh Hoa

Comments 13 pages, 3 figures

2602.13226 2026-02-17 cs.AI cs.CL

Variation is the Key: A Variation-Based Framework for LLM-Generated Text Detection

Xuecong Li, Xiaohong Li, Qiang Hu, Yao Zhang, Junjie Wang

2602.13217 2026-02-17 cs.AI

VeRA: Verified Reasoning Data Augmentation at Scale

Zerui Cheng, Jiashuo Liu, Chunjie Wu, Jianzhu Yao, Pramod Viswanath, Ge Zhang, Wenhao Huang

Comments 36 pages; VeRA technical report

详情

英文摘要

The main issue with most evaluation schemes today is their "static" nature: the same problems are reused repeatedly, allowing for memorization, format exploitation, and eventual saturation. To measure genuine AI progress, we need evaluation that is robust by construction, not by post-hoc detection. In response, we propose VeRA (Verified Reasoning Data Augmentation), a framework that converts benchmark problems into executable specifications, comprising (i) a natural language template with placeholder slots, (ii) a coherent generator that samples valid configurations, and (iii) a deterministic verifier that validates parameters and calculates the corresponding correct answers for each configuration. From a single seed problem, VeRA automatically creates unlimited verified variants with reliable labels at near-zero marginal cost without human involvement. VeRA operates in two complementary modes. VeRA-E (equivalent) rewrites problems while keeping the underlying logic intact, useful for detecting memorization versus genuine reasoning. VeRA-H (hardened) systematically increases complexity while remaining verifiable, enabling reliable creation and labelling of fresh difficult tasks at the boundary of intelligence. Evaluating 16 frontier models with VeRA, we find: (i) VeRA-E improves evaluation quality and reveals contamination patterns. (ii) VeRA-H enables human-free generation of hard tasks with reliable labels. (iii) VeRA establishes verified benchmarks as a general paradigm. VeRA reconceptualizes benchmarks from static objects used until exhausted, to executable specifications generating fresh, verified instances on demand, enhancing robustness and cost-effectiveness for evaluation. With VeRA, we envision that evaluation in any verifiable domain can scale indefinitely without sacrificing label integrity. To stimulate future research, we have open-sourced all code and datasets.

URL PDF HTML ☆

赞 0 踩 0

2602.13214 2026-02-17 cs.AI

BotzoneBench: Scalable LLM Evaluation via Graded AI Anchors

Lingfeng Li, Yunlong Lu, Yuefei Zhang, Jingyu Yao, Yixin Zhu, KeYuan Cheng, Yongyi Wang, Qirui Zheng, Xionghui Yang, Wenxin Li

2602.13213 2026-02-17 cs.AI cs.HC cs.LG

Agentic AI for Commercial Insurance Underwriting with Adversarial Self-Critique

Joyjit Roy, Samaresh Kumar Singh

Comments 9 pages, 8 figuers, 6 tables, submitted aty 9th International Conference on Modern Computing, Networking and Applications (MCNA2026)

2602.13212 2026-02-17 cs.RO cs.MA cs.SY eess.SY

UAVGENT: A Language-Guided Distributed Control Framework

Ziyi Zhang, Xiyu Deng, Guannan Qu, Yorie Nakahira

2602.12247 2026-02-17 cs.LG cs.AI

ExtractBench: A Benchmark and Evaluation Methodology for Complex Structured Extraction

Nick Ferguson, Josh Pennington, Narek Beghian, Aravind Mohan, Douwe Kiela, Sheshansh Agrawal, Thien Hang Nguyen

2602.10285 2026-02-17 cs.RO

Adaptive Time Step Flow Matching for Autonomous Driving Motion Planning

Ananya Trivedi, Anjian Li, Mohamed Elnoor, Yusuf Umut Ciftci, Avinash Singh, Jovin D'sa, Sangjae Bae, David Isele, Taskin Padir, Faizan M. Tariq

Comments Accepted to Intelligent Vehicles Symposium 2026

2602.10098 2026-02-17 cs.RO cs.CV

VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model

Jingwen Sun, Wenyao Zhang, Zekun Qi, Shaojie Ren, Zezhi Liu, Hanxin Zhu, Guangzhong Sun, Xin Jin, Zhibo Chen

2602.07506 2026-02-17 cs.RO cs.AI cs.HC

VividFace: Real-Time and Realistic Facial Expression Shadowing for Humanoid Robots

Peizhen Li, Longbing Cao, Xiao-Ming Wu, Yang Zhang

Comments Accepted to the 2026 IEEE International Conference on Robotics and Automation (ICRA)

2602.04419 2026-02-17 cs.RO

Integrated Exploration and Sequential Manipulation on Scene Graph with LLM-based Situated Replanning

Heqing Yang, Ziyuan Jiao, Shu Wang, Yida Niu, Si Liu, Hangxin Liu

Comments 8 pages, 7 figures; accepted by ICRA 2026

2602.03796 2026-02-17 cs.CV

3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation

Zhixue Fang, Xu He, Songlin Tang, Haoxian Zhang, Qingfeng Li, Xiaoqiang Liu, Pengfei Wan, Kun Gai

Comments Project Page: https://hjrphoebus.github.io/3DiMo/

2602.03546 2026-02-17 cs.LG cond-mat.dis-nn cond-mat.mes-hall cond-mat.soft cs.ET

How to Train Your Resistive Network: Generalized Equilibrium Propagation and Analytical Learning

Jonathan Lin, Aman Desai, Frank Barrows, Francesco Caravelli

Comments 8 pages double column; plus 16 supp mat.;

2602.03195 2026-02-17 cs.LG cs.AI

Reinforcement Learning with Promising Tokens for Large Language Models

Jing-Cheng Pang, Liang Lu, Xian Tang, Kun Jiang, Sijie Wu, Kai Zhang, Xubin Li

AI 大模型

视觉与机器人

科学与医疗