arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2601.13975 2026-01-21 cs.CV cs.LG

Harmonizing the Deep: A Unified Information Pipeline for Robust Marine Biodiversity Assessment Across Heterogeneous Domains

Marco Piccolo, Qiwei Han, Astrid van Toor, Joachim Vanneste

Comments 9 pages, 4 figures 8 tables

2601.13974 2026-01-21 cs.CV

STEC: A Reference-Free Spatio-Temporal Entropy Coverage Metric for Evaluating Sampled Video Frames

Shih-Yao Lin

Comments This paper corresponds to the camera-ready version of a WACV 2026 Workshop paper

2601.13954 2026-01-21 cs.CV

DExTeR: Weakly Semi-Supervised Object Detection with Class and Instance Experts for Medical Imaging

Adrien Meyer, Didier Mutter, Nicolas Padoy

2601.13951 2026-01-21 cs.CV

VTONGuard: Automatic Detection and Authentication of AI-Generated Virtual Try-On Content

Shengyi Wu, Yan Hong, Shengyao Chen, Zheng Wang, Xianbing Sun, Jiahui Zhan, Jun Lan, Jianfu Zhang

2601.13945 2026-01-21 cs.RO cs.LG

Efficient Coordination with the System-Level Shared State: An Embodied-AI Native Modular Framework

Yixuan Deng, Tongrun Wu, Donghao Wu, Zeyu Wei, Jiayuan Wang, Zhenglong Sun, Yuqing Tang, Xiaoqiang Ji

2601.13935 2026-01-21 cs.CV cs.LG

TrackletGPT: A Language-like GPT Framework for White Matter Tract Segmentation

Anoushkrit Goel, Simroop Singh, Ankita Joshi, Ranjeet Ranjan Jha, Chirag Ahuja, Aditya Nigam, Arnav Bhavsar

Comments Accepted at 23rd IEEE International Symposium on Biomedical Imaging (ISBI), 2026

2601.13931 2026-01-21 cs.SD cs.IR cs.LG

Towards Effective Negation Modeling in Joint Audio-Text Models for Music

Yannis Vasilakis, Rachel Bittner, Johan Pauwels

Comments Accepted at IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2026

2601.13922 2026-01-21 cs.CL

Automatic Prompt Optimization for Dataset-Level Feature Discovery

Adrian Cosma, Oleg Szehr, David Kletz, Alessandro Antonucci, Olivier Pelletier

Comments 5 Figures, 1 Table

2601.13919 2026-01-21 cs.CL cs.CV

HyperWalker: Dynamic Hypergraph-Based Deep Diagnosis for Multi-Hop Clinical Modeling across EHR and X-Ray in Medical VLMs

Yuezhe Yang, Hao Wang, Yige Peng, Jinman Kim, Lei Bi

Comments Under Review

2601.13918 2026-01-21 cs.CL

AgentEHR: Advancing Autonomous Clinical Decision-Making via Retrospective Summarization

Yusheng Liao, Chuan Xuan, Yutong Cai, Lina Yang, Zhe Chen, Yanfeng Wang, Yu Wang

Comments 37 pages, 12 figures

2601.13913 2026-01-21 cs.CV

On the Role of Rotation Equivariance in Monocular 3D Human Pose Estimation

Pavlo Melnyk, Cuong Le, Urs Waldmann, Per-Erik Forssén, Bastian Wandt

2601.13897 2026-01-21 cs.LG cs.AI

TractRLFusion: A GPT-Based Multi-Critic Policy Fusion Framework for Fiber Tractography

Ankita Joshi, Ashutosh Sharma, Anoushkrit Goel, Ranjeet Ranjan Jha, Chirag Ahuja, Arnav Bhavsar, Aditya Nigam

Comments Accepted at 23rd IEEE International Symposium on Biomedical Imaging (ISBI), 2026

2601.13892 2026-01-21 cs.LG

Multi-Objective Hierarchical Optimization with Large Language Models

Andrej Schwanke, Lyubomir Ivanov, David Salinas, Frank Hutter, Arber Zela

Comments 23 pages, 21 figures, 9 tables

2601.13886 2026-01-21 cs.CV

Revisiting Multi-Task Visual Representation Learning

Shangzhe Di, Zhonghua Zhai, Weidi Xie

Comments Code: https://github.com/Becomebright/MTV

2601.13885 2026-01-21 cs.CL cs.AI

Confident Rankings with Fewer Items: Adaptive LLM Evaluation with Continuous Scores

Esma Balkır, Alice Pernthaller, Marco Basaldella, José Hernández-Orallo, Nigel Collier

2601.13882 2026-01-21 cs.CL

OpenLearnLM Benchmark: A Unified Framework for Evaluating Knowledge, Skill, and Attitude in Educational Large Language Models

Unggi Lee, Sookbun Lee, Heungsoo Choi, Jinseo Lee, Haeun Park, Younghoon Jeon, Sungmin Cho, Minju Kang, Junbo Koh, Jiyeong Bae, Minwoo Nam, Juyeon Eun, Yeonji Jung, Yeil Jeong

2601.13880 2026-01-21 cs.AI

LifeAgentBench: A Multi-dimensional Benchmark and Agent for Personal Health Assistants in Digital Health

Ye Tian, Zihao Wang, Onat Gungor, Xiaoran Fan, Tajana Rosing

2601.13876 2026-01-21 cs.CL

Pedagogical Alignment for Vision-Language-Action Models: A Comprehensive Framework for Data, Architecture, and Evaluation in Education

Unggi Lee, Jahyun Jeong, Sunyoung Shin, Haeun Park, Jeongsu Moon, Youngchang Song, Jaechang Shim, JaeHwan Lee, Yunju Noh, Seungwon Choi, Ahhyun Kim, TaeHyeon Kim, Kyungtae Joo, Taeyeong Kim, Gyeonggeon Lee

2601.13871 2026-01-21 cs.CV

OCCAM: Class-Agnostic, Training-Free, Prior-Free and Multi-Class Object Counting

Michail Spanakis, Iason Oikonomidis, Antonis Argyros

2601.13852 2026-01-21 cs.CV cs.LG

Probabilistic Deep Discriminant Analysis for Wind Blade Segmentation

Raül Pérez-Gonzalo, Andreas Espersen, Antonio Agudo

Comments Accepted to ICASSP 2026

2601.13847 2026-01-21 cs.SD

Emotion and Acoustics Should Agree: Cross-Level Inconsistency Analysis for Audio Deepfake Detection

Jinhua Zhang, Zhenqi Jia, Rui Liu

Comments Accepted by ICASSP 2026

2601.13846 2026-01-21 cs.AI cs.CY cs.LG

Virtual Urbanism: An AI-Driven Framework for Quantifying Urban Identity. A Tokyo-Based Pilot Study Using Diffusion-Generated Synthetic Environments

Glinskaya Maria

2601.13835 2026-01-21 cs.CL

The Role of Prosodic and Lexical Cues in Turn-Taking with Self-Supervised Speech Representations

Sam OConnor Russell, Delphine Charuau, Naomi Harte

Comments Accepted to ICASSP 2026

2601.13816 2026-01-21 cs.CV cs.LG

Discriminant Learning-based Colorspace for Blade Segmentation

Raül Pérez-Gonzalo, Andreas Espersen, Antonio Agudo

Comments Accepted to ICASSP 2026

2601.13806 2026-01-21 cs.CL cs.LG

Knowledge Graph-Assisted LLM Post-Training for Enhanced Legal Reasoning

Dezhao Song, Guglielmo Bonifazi, Frank Schilder, Jonathan Richard Schwarz

2601.13801 2026-01-21 cs.RO

HoverAI: An Embodied Aerial Agent for Natural Human-Drone Interaction

Yuhua Jin, Nikita Kuzmin, Georgii Demianchuk, Mariya Lezina, Fawad Mehboob, Issatay Tokmurziyev, Miguel Altamirano Cabrera, Muhammad Ahsan Mustafa, Dzmitry Tsetserukou

Comments This paper has been accepted for publication at LBR HRI 2026 conference

2601.13797 2026-01-21 cs.CV

PREGEN: Uncovering Latent Thoughts in Composed Video Retrieval

Gabriele Serussi, David Vainshtein, Jonathan Kouchly, Dotan Di Castro, Chaim Baskin

2601.13793 2026-01-21 cs.LG

PAtt: A Pattern Attention Network for ETA Prediction Using Historical Speed Profiles

ByeoungDo Kim, JunYeop Na, Kyungwook Tak, JunTae Kim, DongHyeon Kim, Duckky Kim

Comments 7 pages, 3 figures, ITSC 2025, to be published

2601.13777 2026-01-21 cs.RO

Sample Efficient Learning of Body-Environment Interaction of an Under-Actuated System

Zvi Chapnik, Yizhar Or, Shai Revzen

2601.13776 2026-01-21 cs.LG stat.ML

Orthogonium : A Unified, Efficient Library of Orthogonal and 1-Lipschitz Building Blocks

Thibaut Boissin, Franck Mamalet, Valentin Lafargue, Mathieu Serrurier

Journal ref ICML 2025 Workshop on Championing Open- source Development in Machine Learning (CODEML '25), Jul 2025, Vancouver, France

AI 大模型

视觉与机器人

科学与医疗

Harmonizing the Deep: A Unified Information Pipeline for Robust Marine Biodiversity Assessment Across Heterogeneous Domains

STEC: A Reference-Free Spatio-Temporal Entropy Coverage Metric for Evaluating Sampled Video Frames

DExTeR: Weakly Semi-Supervised Object Detection with Class and Instance Experts for Medical Imaging

VTONGuard: Automatic Detection and Authentication of AI-Generated Virtual Try-On Content

Efficient Coordination with the System-Level Shared State: An Embodied-AI Native Modular Framework

TrackletGPT: A Language-like GPT Framework for White Matter Tract Segmentation

Towards Effective Negation Modeling in Joint Audio-Text Models for Music

Automatic Prompt Optimization for Dataset-Level Feature Discovery

HyperWalker: Dynamic Hypergraph-Based Deep Diagnosis for Multi-Hop Clinical Modeling across EHR and X-Ray in Medical VLMs

AgentEHR: Advancing Autonomous Clinical Decision-Making via Retrospective Summarization

On the Role of Rotation Equivariance in Monocular 3D Human Pose Estimation

TractRLFusion: A GPT-Based Multi-Critic Policy Fusion Framework for Fiber Tractography

Multi-Objective Hierarchical Optimization with Large Language Models

Revisiting Multi-Task Visual Representation Learning

Confident Rankings with Fewer Items: Adaptive LLM Evaluation with Continuous Scores

OpenLearnLM Benchmark: A Unified Framework for Evaluating Knowledge, Skill, and Attitude in Educational Large Language Models

LifeAgentBench: A Multi-dimensional Benchmark and Agent for Personal Health Assistants in Digital Health

Pedagogical Alignment for Vision-Language-Action Models: A Comprehensive Framework for Data, Architecture, and Evaluation in Education

OCCAM: Class-Agnostic, Training-Free, Prior-Free and Multi-Class Object Counting

Probabilistic Deep Discriminant Analysis for Wind Blade Segmentation

Emotion and Acoustics Should Agree: Cross-Level Inconsistency Analysis for Audio Deepfake Detection

Virtual Urbanism: An AI-Driven Framework for Quantifying Urban Identity. A Tokyo-Based Pilot Study Using Diffusion-Generated Synthetic Environments

The Role of Prosodic and Lexical Cues in Turn-Taking with Self-Supervised Speech Representations

Discriminant Learning-based Colorspace for Blade Segmentation

Knowledge Graph-Assisted LLM Post-Training for Enhanced Legal Reasoning

HoverAI: An Embodied Aerial Agent for Natural Human-Drone Interaction

PREGEN: Uncovering Latent Thoughts in Composed Video Retrieval

PAtt: A Pattern Attention Network for ETA Prediction Using Historical Speed Profiles

Sample Efficient Learning of Body-Environment Interaction of an Under-Actuated System

Orthogonium : A Unified, Efficient Library of Orthogonal and 1-Lipschitz Building Blocks