arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2601.22356 2026-02-02 cs.LG cs.RO

PoSafeNet: Safe Learning with Poset-Structured Neural Nets

Kiwan Wong, Wei Xiao, Daniela Rus

2601.22355 2026-02-02 cs.LG

Relative Wasserstein Angle and the Problem of the $W_2$-Nearest Gaussian Distribution

Binshuai Wang, Peng Wei

2601.22352 2026-02-02 cs.LG cs.AI

Recoverability Has a Law: The ERR Measure for Tool-Augmented Agents

Sri Vatsa Vuddanti, Satwik Kumar Chittiprolu

Comments Preprint for ICML Submission

2601.22350 2026-02-02 cs.LG cs.AI

Learning Policy Representations for Steerable Behavior Synthesis

Beiming Li, Sergio Rozada, Alejandro Ribeiro

2601.22345 2026-02-02 cs.LG

Failing to Explore: Language Models on Interactive Tasks

Mahdi JafariRaviz, Keivan Rezaei, Arshia Soltani Moakhar, Zahra Sodagar, Yize Cheng, Soheil Feizi

2601.22339 2026-02-02 cs.LG quant-ph

Quantum-Inspired Reinforcement Learning for Secure and Sustainable AIoT-Driven Supply Chain Systems

Muhammad Bilal Akram Dastagir, Omer Tariq, Shahid Mumtaz, Saif Al-Kuwari, Ahmed Farouk

2601.22335 2026-02-02 cs.LG stat.ML

Knowledge Gradient for Preference Learning

Kaiwen Wu, Jacob R. Gardner

2601.22331 2026-02-02 cs.LG stat.CO

Scalable Batch Correction for Cell Painting via Batch-Dependent Kernels and Adaptive Sampling

Aditya Narayan Ravi, Snehal Vadvalkar, Abhishek Pandey, Ilan Shomorony

Comments 40 pages, many figures

2601.22329 2026-02-02 cs.AI cs.CY

Sparks of Rationality: Do Reasoning LLMs Align with Human Judgment and Choice?

Ala N. Tak, Amin Banayeeanzade, Anahita Bolourani, Fatemeh Bahrani, Ashutosh Chaubey, Sai Praneeth Karimireddy, Norbert Schwarz, Jonathan Gratch

2601.22327 2026-02-02 cs.LG

Molecular Representations in Implicit Functional Space via Hyper-Networks

Zehong Wang, Xiaolong Han, Qi Yang, Xiangru Tang, Fang Wu, Xiaoguang Guo, Weixiang Sun, Tianyi Ma, Pietro Lio, Le Cong, Sheng Wang, Chuxu Zhang, Yanfang Ye

2601.22326 2026-02-02 cs.LG stat.AP

Label-Efficient Monitoring of Classification Models via Stratified Importance Sampling

Lupo Marsigli, Angel Lopez de Haro

Comments 24 pages

2601.22322 2026-02-02 cs.LG eess.SP

Spatially-Adaptive Conformal Graph Transformer for Indoor Localization in Wi-Fi Driven Networks

Ayesh Abu Lehyeh, Anastassia Gharib, Safwan Wshah

Comments Accepted to IEEE ICC 2026

2601.22318 2026-02-02 cs.LG

Federate the Router: Learning Language Model Routers with Sparse and Decentralized Evaluations

Baris Askin, Shivam Patel, Anupam Nayak, Andrea Vigano, Jiin Woo, Gauri Joshi, Carlee Joe-Wong

2601.22315 2026-02-02 cs.LG

Gaussian Process Bandit Optimization with Machine Learning Predictions and Application to Hypothesis Generation

Xin Jennifer Chen, Yunjin Tong

2601.22313 2026-02-02 cs.LG

Hair-Trigger Alignment: Black-Box Evaluation Cannot Guarantee Post-Update Alignment

Yavuz Bakman, Duygu Nur Yaldiz, Salman Avestimehr, Sai Praneeth Karimireddy

2601.22312 2026-02-02 cs.LG cond-mat.mtrl-sci cs.CE

SCALAR: Quantifying Structural Hallucination, Consistency, and Reasoning Gaps in Materials Foundation Models

Can Polat, Erchin Serpedin, Mustafa Kurban, Hasan Kurban

2601.22311 2026-02-02 cs.AI cs.CL cs.LG

Why Reasoning Fails to Plan: A Planning-Centric Analysis of Long-Horizon Decision Making in LLM Agents

Zehong Wang, Fang Wu, Hongru Wang, Xiangru Tang, Bolian Li, Zhenfei Yin, Yijun Ma, Yiyang Li, Weixiang Sun, Xiusi Chen, Yanfang Ye

2601.22305 2026-02-02 cs.LG

BayesFlow: A Probability Inference Framework for Meta-Agent Assisted Workflow Generation

Bo Yuan, Yun Zhou, Zhichao Xu, Kiran Ramnath, Aosong Feng, Balasubramaniam Srinivasan

Comments EACL 2026 Finding

2601.22298 2026-02-02 cs.LG cs.AI physics.ao-ph

Conformal Prediction for Generative Models via Adaptive Cluster-Based Density Estimation

Qidong Yang, Qianyu Julie Zhu, Jonathan Giezendanner, Youssef Marzouk, Stephen Bates, Sherrie Wang

2601.22290 2026-02-02 cs.AI

The Six Sigma Agent: Achieving Enterprise-Grade Reliability in LLM Systems Through Consensus-Driven Decomposed Execution

Khush Patel, Siva Surendira, Jithin George, Shreyas Kapale

Comments 25 pages, 7 figures, 2 tables

2601.22289 2026-02-02 cs.RO

ReloPush-BOSS: Optimization-guided Nonmonotone Rearrangement Planning for a Car-like Robot Pusher

Jeeho Ahn, Christoforos Mavrogiannis

Comments Preprint of final version, accepted to RA-L 2026

2601.22284 2026-02-02 cs.LG

Riemannian Lyapunov Optimizer: A Unified Framework for Optimization

Yixuan Wang, Omkar Sudhir Patil, Warren E. Dixon

Comments 22 pages, 4 figures

2601.22275 2026-02-02 cs.CV cs.AI

VMonarch: Efficient Video Diffusion Transformers with Structured Attention

Cheng Liang, Haoxian Chen, Liang Hou, Qi Fan, Gangshan Wu, Xin Tao, Limin Wang

2601.22269 2026-02-02 cs.AI cs.CL cs.LG

JAF: Judge Agent Forest

Sahil Garg, Brad Cheezum, Sridhar Dutta, Vishal Agarwal

详情

英文摘要

Judge agents are fundamental to agentic AI frameworks: they provide automated evaluation, and enable iterative self-refinement of reasoning processes. We introduce JAF: Judge Agent Forest, a framework in which the judge agent conducts joint inference across a cohort of query--response pairs generated by a primary agent, rather than evaluating each in isolation. This paradigm elevates the judge from a local evaluator to a holistic learner: by simultaneously assessing related responses, the judge discerns cross-instance patterns and inconsistencies, whose aggregate feedback enables the primary agent to improve by viewing its own outputs through the judge's collective perspective. Conceptually, JAF bridges belief propagation and ensemble-learning principles: overlapping in-context neighborhoods induce a knowledge-graph structure that facilitates propagation of critique, and repeated, randomized evaluations yield a robust ensemble of context-sensitive judgments. JAF can be instantiated entirely via ICL, with the judge prompted for each query using its associated primary-agent response plus a small, possibly noisy set of peer exemplars. While kNN in embedding space is a natural starting point for exemplars, this approach overlooks categorical structure, domain metadata, or nuanced distinctions accessible to modern LLMs. To overcome these limitations, we develop a flexible locality-sensitive hashing (LSH) algorithm that learns informative binary codes by integrating semantic embeddings, LLM-driven hash predicates, supervision from categorical labels, and relevant side information. These hash codes support efficient, interpretable, and relation-aware selection of diverse exemplars, and further optimize exploration of CoT reasoning paths. We validate JAF with an empirical study on the demanding task of cloud misconfigs triage in large-scale cloud environments.

URL PDF HTML ☆

赞 0 踩 0

2601.22265 2026-02-02 cs.LG cs.NI

Privacy-Preserving Sensor-Based Human Activity Recognition for Low-Resource Healthcare Using Classical Machine Learning

Ramakant Kumar, Pravin Kumar

2601.22259 2026-02-02 cs.LG

Tabular Foundation Models Can Do Survival Analysis

Da In Kim, Wei Siang Lai, Kelly W. Zhang

2601.22249 2026-02-02 cs.LG cs.SE

FunPRM: Function-as-Step Process Reward Model with Meta Reward Correction for Code Generation

Ruiyi Zhang, Peijia Qin, Qi Cao, Eric Xue, Pengtao Xie

2601.22231 2026-02-02 cs.CV

Geometry without Position? When Positional Embeddings Help and Hurt Spatial Reasoning

Jian Shi, Michael Birsak, Wenqing Cui, Zhenyu Li, Peter Wonka

2601.22230 2026-02-02 cs.LG

DAJ: Data-Reweighted LLM Judge for Test-Time Scaling in Code Generation

Peijia Qin, Ruiyi Zhang, Qi Cao, Pengtao Xie

2601.22218 2026-02-02 cs.CV cs.DL

What Lies Beneath: A Call for Distribution-based Visual Question & Answer Datasets

Jill P. Naiman, Daniel J. Evans, JooYoung Seo

Comments Accepted to ACM/IEEE Joint Conference on Digital Libraries JCDL 2025, 4 pages, 2 figures

AI 大模型

视觉与机器人

科学与医疗