arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2602.02918 2026-02-04 cs.CV cs.AI cs.LG q-bio.TO

A Multi-scale Linear-time Encoder for Whole-Slide Image Analysis

Jagan Mohan Reddy Dwarampudi, Joshua Wong, Hien Van Nguyen, Tania Banerjee

Comments Accepted to ISBI 2026, 4 pages with 2 figures

2602.02917 2026-02-04 cs.LG

Weighted Temporal Decay Loss for Learning Wearable PPG Data with Sparse Clinical Labels

Yunsung Chung, Keum San Chun, Migyeong Gwak, Han Feng, Yingshuo Liu, Chanho Lim, Viswam Nathan, Nassir Marrouche, Sharanya Arcot Desai

Comments ICASSP 2026

2602.02915 2026-02-04 cs.RO

Modular Isoperimetric Soft Robotic Truss for Lunar Applications

Mihai Stanciu, Isaac Weaver, Adam Rose, James Wade, Kaden Paxton, Chris Paul, Spencer Stowell, Nathan Usevitch

2602.02912 2026-02-04 cs.LG cs.AI stat.ML

Notes on the Reward Representation of Posterior Updates

Pedro A. Ortega

Comments Technical report, 9 pages

2602.02908 2026-02-04 cs.LG cs.AI cs.CV stat.ML

A Random Matrix Theory Perspective on the Consistency of Diffusion Models

Binxu Wang, Jacob Zavatone-Veth, Cengiz Pehlevan

Comments 65 pages; 53 figures

2602.02905 2026-02-04 cs.AI

FIRE-Bench: Evaluating Agents on the Rediscovery of Scientific Insights

Zhen Wang, Fan Bai, Zhongyan Luo, Jinyan Su, Kaiser Sun, Xinle Yu, Jieyuan Liu, Kun Zhou, Claire Cardie, Mark Dredze, Eric P. Xing, Zhiting Hu

Comments 30 pages, 4 figures, 10 tables

2602.02903 2026-02-04 cs.LG cs.AI cs.SY eess.SY

Spatiotemporal Decision Transformer for Traffic Coordination

Haoran Su, Yandong Sun, Hanxiao Deng

2602.02900 2026-02-04 cs.LG cs.AI

Manifold-Constrained Energy-Based Transition Models for Offline Reinforcement Learning

Zeyu Fang, Zuyuan Zhang, Mahdi Imani, Tian Lan

2602.02899 2026-02-04 cs.LG cs.DC

Controlled disagreement improves generalization in decentralized training

Zesen Wang, Mikael Johansson

2602.02894 2026-02-04 cs.CV cs.LG

DoubleTake: Contrastive Reasoning for Faithful Decision-Making in Medical Imaging

Daivik Patel, Shrenik Patel

2602.02891 2026-02-04 cs.LG cs.CL

TraceNAS: Zero-shot LLM Pruning via Gradient Trace Correlation

Prajna G. Malettira, Manish Nagaraj, Arjun Roy, Shubham Negi, Kaushik Roy

Comments Preprint

2602.02888 2026-02-04 cs.CL cs.AI

HALT: Hallucination Assessment via Log-probs as Time series

Ahmad Shapiro, Karan Taneja, Ashok Goel

2602.02877 2026-02-04 cs.LG

A Geometry-Aware Efficient Algorithm for Compositional Entropic Risk Minimization

Xiyuan Wei, Linli Zhou, Bokun Wang, Chih-Jen Lin, Tianbao Yang

Comments 36 pages, 7 figures

2602.02873 2026-02-04 cs.CV

ViThinker: Active Vision-Language Reasoning via Dynamic Perceptual Querying

Weihang You, Qingchan Zhu, David Liu, Yi Pan, Geng Yuan, Hanqi Jiang

2602.02864 2026-02-04 cs.RO

Accelerating Structured Chain-of-Thought in Autonomous Vehicles

Yi Gu, Yan Wang, Yuxiao Chen, Yurong You, Wenjie Luo, Yue Wang, Wenhao Ding, Boyi Li, Heng Yang, Boris Ivanovic, Marco Pavone

2602.02863 2026-02-04 cs.AI cs.LG

"I May Not Have Articulated Myself Clearly": Diagnosing Dynamic Instability in LLM Reasoning at Inference Time

Jinkun Chen, Fengxiang Cheng, Sijia Han, Vlado Keselj

Comments 21 pages, 12 figures, 15 tables

2602.02862 2026-02-04 cs.AI cs.LG

STEER: Inference-Time Risk Control via Constrained Quality-Diversity Search

Eric Yang, Jong Ha Lee, Jonathan Amar, Elissa Ye, Yugang Jia

Comments 20 pages

2602.02859 2026-02-04 cs.LG

Late-Stage Generalization Collapse in Grokking: Detecting anti-grokking with Weightwatcher

Hari K Prakash, Charles H Martin

Comments 27 pages

2602.02858 2026-02-04 cs.RO cs.LG cs.MA cs.NI cs.SY eess.SY

IMAGINE: Intelligent Multi-Agent Godot-based Indoor Networked Exploration

Tiago Leite, Maria Conceição, António Grilo

Comments 12 pages, submitted to a journal

详情

英文摘要

The exploration of unknown, Global Navigation Satellite System (GNSS) denied environments by an autonomous communication-aware and collaborative group of Unmanned Aerial Vehicles (UAVs) presents significant challenges in coordination, perception, and decentralized decision-making. This paper implements Multi-Agent Reinforcement Learning (MARL) to address these challenges in a 2D indoor environment, using high-fidelity game-engine simulations (Godot) and continuous action spaces. Policy training aims to achieve emergent collaborative behaviours and decision-making under uncertainty using Network-Distributed Partially Observable Markov Decision Processes (ND-POMDPs). Each UAV is equipped with a Light Detection and Ranging (LiDAR) sensor and can share data (sensor measurements and a local occupancy map) with neighbouring agents. Inter-agent communication constraints include limited range, bandwidth and latency. Extensive ablation studies evaluated MARL training paradigms, reward function, communication system, neural network (NN) architecture, memory mechanisms, and POMDP formulations. This work jointly addresses several key limitations in prior research, namely reliance on discrete actions, single-agent or centralized formulations, assumptions of a priori knowledge and permanent connectivity, inability to handle dynamic obstacles, short planning horizons and architectural complexity in Recurrent NNs/Transformers. Results show that the scalable training paradigm, combined with a simplified architecture, enables rapid autonomous exploration of an indoor area. The implementation of Curriculum-Learning (five increasingly complex levels) also enabled faster, more robust training. This combination of high-fidelity simulation, MARL formulation, and computational efficiency establishes a strong foundation for deploying learned cooperative strategies in physical robotic systems.

URL PDF HTML ☆

赞 0 踩 0

2602.02857 2026-02-04 cs.RO

Latent Perspective-Taking via a Schrödinger Bridge in Influence-Augmented Local Models

Kevin Alcedo, Pedro U. Lima, Rachid Alami

Comments Extended Abstract & Poster, Presented at World Modeling Workshop 2026

2602.02848 2026-02-04 cs.LG

Zero Sum SVD: Balancing Loss Sensitivity for Low Rank LLM Compression

Ali Abbasi, Chayne Thrash, Haoran Qin, Shansita Sharma, Sepehr Seifi, Soheil Kolouri

2602.02847 2026-02-04 cs.LG cs.AI cs.RO

Causal Flow Q-Learning for Robust Offline Reinforcement Learning

Mingxuan Li, Junzhe Zhang, Elias Bareinboim

2602.02846 2026-02-04 cs.RO cs.DC

Kino-PAX$^+$: Near-Optimal Massively Parallel Kinodynamic Sampling-based Motion Planner

Nicolas Perrault, Qi Heng Ho, Morteza Lahijanian

Comments 10 pages, 8 figures

2602.02842 2026-02-04 cs.AI cs.CL cs.LG

Chain of Simulation: A Dual-Mode Reasoning Framework for Large Language Models with Dynamic Problem Routing

Saeid Sheikhi

2602.02831 2026-02-04 cs.RO

Adaptive Linear Path Model-Based Diffusion

Yutaka Shimizu, Masayoshi Tomizuka

Comments ICRA 2026

2602.02828 2026-02-04 cs.LG

A Single Revision Step Improves Token-Efficient LLM Reasoning

Yingchuan Zhang, Terry Ma, Wenxuan Zhong, Ping Ma

2602.02824 2026-02-04 cs.CL

CATNIP: LLM Unlearning via Calibrated and Tokenized Negative Preference Alignment

Zhengbang Yang, Yisheng Zhong, Junyuan Hong, Zhuangdi Zhu

2602.02820 2026-02-04 cs.LG cs.AI cs.CV

From Tokens to Numbers: Continuous Number Modeling for SVG Generation

Michael Ogezi, Martin Bell, Freda Shi, Ethan Smith

2602.02808 2026-02-04 cs.CV cs.AI cs.LG

LmPT: Conditional Point Transformer for Anatomical Landmark Detection on 3D Point Clouds

Matteo Bastico, Pierre Onghena, David Ryckelynck, Beatriz Marcotegui, Santiago Velasco-Forero, Laurent Corté, Caroline Robine--Decourcelle, Etienne Decencière

Comments This paper has been accepted at International Symposium on Biomedical Imaging (ISBI) 2026

Journal ref 2026 IEEE International Symposium on Biomedical Imaging (ISBI)

2602.02793 2026-02-04 cs.LG cs.AI

Causality--Δ: Jacobian-Based Dependency Analysis in Flow Matching Models

Reza Rezvan, Gustav Gille, Moritz Schauer, Richard Torkar

Comments 11 pages, 5 figures. Code: https://github.com/rezaarezvan/causdiff

AI 大模型

视觉与机器人

科学与医疗