arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2505.00527 2026-02-17 cs.RO

DeCo: Task Decomposition and Skill Composition for Zero-Shot Generalization in Long-Horizon 3D Manipulation

Zixuan Chen, Junhui Yin, Yangtao Chen, Jing Huo, Pinzhuo Tian, Jieqi Shi, Yiwen Hou, Yinchuan Li, Yang Gao

Comments RAL 2026

2504.17880 2026-02-17 cs.RO

Autonomous Navigation of Quadrupeds Using Coverage Path Planning with Morphological Skeleton Map

Alexander James Becoy, Kseniia Khomenko, Luka Peternel, Raj Thilak Rajan

Comments 15 pages, published to Fronters In Robotics (currently in production), major revision: title change, abstract revised, grammar fixed, mathematical notations fixed and made consistent, conclusion revised, related works extended, Algorithm 1-3 revised

Journal ref Frontiers in Robotics and AI, Volume 31, 1601862, July 2025

2504.06193 2026-02-17 cs.LG cs.AI

Heuristic Methods are Good Teachers to Distill MLPs for Graph Link Prediction

Zongyue Qin, Shichang Zhang, Mingxuan Ju, Tong Zhao, Neil Shah, Yizhou Sun

2503.12385 2026-02-17 cs.CV

Car-1000: A New Large Scale Fine-Grained Visual Categorization Dataset

Yutao Hu, Sen Li, Jincheng Yan, Wenqi Shao, Xiaoyan Luo

Comments accepted to The Eleventh Workshop on Fine-Grained Visual Categorization in CVPR 2024

2503.09027 2026-02-17 cs.CV

Measure Twice, Cut Once: A Semantic-Oriented Approach to Video Temporal Localization with Video LLMs

Zongshang Pang, Mayu Otani, Yuta Nakashima

Comments ICLR2026

2503.01884 2026-02-17 cs.LG cs.AI

Contextual Quantum Neural Networks for Stock Price Prediction

Sharan Mourya, Hannes Leipold, Bibhas Adhikari

Journal ref Mourya, S., Leipold, H., & Adhikari, B. (2026). Contextual quantum neural networks for stock price prediction. Scientific Reports, 16, Article 34413

详情

DOI: 10.1038/s41598-025-34413-5

英文摘要

In this paper, we apply quantum machine learning (QML) to predict the stock prices of multiple assets using a contextual quantum neural network. Our approach captures recent trends to predict future stock price distributions, moving beyond traditional models that focus on entire historical data, enhancing adaptability and precision. Utilizing the principles of quantum superposition, we introduce a new training technique called the quantum batch gradient update (QBGU), which accelerates the standard stochastic gradient descent (SGD) in quantum applications and improves convergence. Consequently, we propose a quantum multi-task learning (QMTL) architecture, specifically, the share-and-specify ansatz, that integrates task-specific operators controlled by quantum labels, enabling the simultaneous and efficient training of multiple assets on the same quantum circuit as well as enabling efficient portfolio representation with logarithmic overhead in the number of qubits. This architecture represents the first of its kind in quantum finance, offering superior predictive power and computational efficiency for multi-asset stock price forecasting. Through extensive experimentation on S\&P 500 data for Apple, Google, Microsoft, and Amazon stocks, we demonstrate that our approach not only outperforms quantum single-task learning (QSTL) models but also effectively captures inter-asset correlations, leading to enhanced prediction accuracy. Our findings highlight the transformative potential of QML in financial applications, paving the way for more advanced, resource-efficient quantum algorithms in stock price prediction and other complex financial modeling tasks.

URL PDF HTML ☆

赞 0 踩 0

2502.17315 2026-02-17 cs.CL

HIPPO: Enhancing the Table Understanding Capability of LLMs through Hybrid-Modal Preference Optimization

Haolan Wang, Zhenghao Liu, Xinze Li, Xiaocui Yang, Yu Gu, Yukun Yan, Qi Shi, Fangfang Li, Chong Chen, Ge Yu

2502.14560 2026-02-17 cs.LG cs.AI cs.CL

Less is More: Improving LLM Alignment via Preference Data Selection

Xun Deng, Han Zhong, Rui Ai, Fuli Feng, Zheng Wang, Xiangnan He

2502.09980 2026-02-17 cs.CV cs.RO

V2V-LLM: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multimodal Large Language Models

Hsu-kuang Chiu, Ryo Hachiuma, Chien-Yi Wang, Stephen F. Smith, Yu-Chiang Frank Wang, Min-Hung Chen

Comments Accepted by ICRA 2026 (IEEE International Conference on Robotics and Automation). Project: https://eddyhkchiu.github.io/v2vllm.github.io/ Code: https://github.com/eddyhkchiu/V2V-LLM Dataset: https://huggingface.co/datasets/eddyhkchiu/V2V-GoT-QA

2502.07443 2026-02-17 cs.AI cs.GT

Approximating Human Strategic Reasoning with LLM-Enhanced Recursive Reasoners Leveraging Multi-agent Hypergames

Vince Trencsenyi, Agnieszka Mensfelt, Kostas Stathis

2502.02415 2026-02-17 cs.LG

Fast Graph Generation via Autoregressive Noisy Filtration Modeling

Markus Krimmel, Jenna Wiens, Karsten Borgwardt, Dexiong Chen

Journal ref Transactions on Machine Learning Research, 2026

2502.00194 2026-02-17 cs.LG cs.AI physics.comp-ph

Physics-Informed Neural Network based Damage Identification for Truss Railroad Bridges

Althaf Shajihan, Kirill Mechitov, Girish Chowdhary, Billie F. Spencer

Comments 30 pages, 15 figures

Journal ref Structure and Infrastructure Engineering, 1-22 (2026)

详情

DOI: 10.1080/15732479.2026.2628861

英文摘要

Railroad bridges are a crucial component of the U.S. freight rail system, which moves over 40 percent of the nation's freight and plays a critical role in the economy. However, aging bridge infrastructure and increasing train traffic pose significant safety hazards and risk service disruptions. The U.S. rail network includes over 100,000 railroad bridges, averaging one every 1.4 miles of track, with steel bridges comprising over 50% of the network's total bridge length. Early identification and assessment of damage in these bridges remain challenging tasks. This study proposes a physics-informed neural network (PINN) based approach for damage identification in steel truss railroad bridges. The proposed approach employs an unsupervised learning approach, eliminating the need for large datasets typically required by supervised methods. The approach utilizes train wheel load data and bridge response during train crossing events as inputs for damage identification. The PINN model explicitly incorporates the governing differential equations of the linear time-varying (LTV) bridge-train system. Herein, this model employs a recurrent neural network (RNN) based architecture incorporating a custom Runge-Kutta (RK) integrator cell, designed for gradient-based learning. The proposed approach updates the bridge finite element model while also quantifying damage severity and localizing the affected structural members. A case study on the Calumet Bridge in Chicago, Illinois, with simulated damage scenarios, is used to demonstrate the model's effectiveness in identifying damage while maintaining low false-positive rates. Furthermore, the damage identification pipeline is designed to seamlessly integrate prior knowledge from inspections and drone surveys, also enabling context-aware updating and assessment of bridge's condition.

URL PDF HTML ☆

赞 0 踩 0

2501.16717 2026-02-17 cs.RO

Strawberry Robotic Operation Interface: An Open-Source Device for Collecting Dexterous Manipulation Data in Robotic Strawberry Farming

Linsheng Hou, Wenwu Lu, Yanan Wang, Chen Peng, Zhenghao Fei

2501.15889 2026-02-17 cs.LG cs.AI

Adaptive Width Neural Networks

Federico Errica, Henrik Christiansen, Viktor Zaverkin, Mathias Niepert, Francesco Alesiani

Comments International Conference on Learning Representations (ICLR 2026)

2501.13354 2026-02-17 cs.CV

ATRNet-STAR: A Large Dataset and Benchmark Towards Remote Sensing Object Recognition in the Wild

Yongxiang Liu, Weijie Li, Li Liu, Jie Zhou, Bowen Peng, Yafei Song, Xuying Xiong, Wei Yang, Tianpeng Liu, Zhen Liu, Xiang Li

Comments 17 pages, 12 figures; Homepage: https://github.com/waterdisappear/ATRNet-STAR . in IEEE Transactions on Pattern Analysis and Machine Intelligence (2026)

详情

DOI: 10.1109/TPAMI.2026.3658649

英文摘要

The absence of publicly available, large-scale, high-quality datasets for Synthetic Aperture Radar Automatic Target Recognition (SAR ATR) has significantly hindered the application of rapidly advancing deep learning techniques, which hold huge potential to unlock new capabilities in this field. This is primarily because collecting large volumes of diverse target samples from SAR images is prohibitively expensive, largely due to privacy concerns, the characteristics of microwave radar imagery perception, and the need for specialized expertise in data annotation. Throughout the history of SAR ATR research, there have been only a number of small datasets, mainly including targets like ships, airplanes, buildings, etc. There is only one vehicle dataset MSTAR collected in the 1990s, which has been a valuable source for SAR ATR. To fill this gap, this paper introduces a large-scale, new dataset named ATRNet-STAR with 40 different vehicle categories collected under various realistic imaging conditions and scenes. It marks a substantial advancement in dataset scale and diversity, comprising over 190,000 well-annotated samples, 10 times larger than its predecessor, the famous MSTAR. Building such a large dataset is a challenging task, and the data collection scheme will be detailed. Secondly, we illustrate the value of ATRNet-STAR via extensively evaluating the performance of 15 representative methods with 7 different experimental settings on challenging classification and detection benchmarks derived from the dataset. Finally, based on our extensive experiments, we identify valuable insights for SAR ATR and discuss potential future research directions in this field. We hope that the scale, diversity, and benchmark of ATRNet-STAR can significantly facilitate the advancement of SAR ATR.

URL PDF HTML ☆

赞 0 踩 0

2501.07575 2026-02-17 cs.CV cs.AI

Dataset Distillation via Committee Voting

Jiacheng Cui, Zhaoyi Li, Xiaochen Ma, Xinyue Bi, Yaxin Luo, Zhiqiang Shen

Comments Code at: https://github.com/Jiacheng8/CV-DD

2501.05633 2026-02-17 cs.LG cs.IT eess.SP math.IT

Regularized Top-$k$: A Bayesian Framework for Gradient Sparsification

Ali Bereyhi, Ben Liang, Gary Boudreau, Ali Afana

Comments This paper has been published in IEEE Transactions on Signal Processing, vol. 73, pp. 4463 - 4478, 2025. The present arXiv version contains additional experimental results. 27 pages, 8 figures, 2 tables

Journal ref IEEE Transactions on Signal Processing, vol. 73, pp. 4463 - 4478, 2025

详情

DOI: 10.1109/TSP.2025.3624791

英文摘要

Error accumulation is effective for gradient sparsification in distributed settings: initially-unselected gradient entries are eventually selected as their accumulated error exceeds a certain level. The accumulation essentially behaves as a scaling of the learning rate for the selected entries. Although this property prevents the slow-down of lateral movements in distributed gradient descent, it can deteriorate convergence in some settings. This work proposes a novel sparsification scheme that controls the learning rate scaling of error accumulation. The development of this scheme follows two major steps: first, gradient sparsification is formulated as an inverse probability (inference) problem, and the Bayesian optimal sparsification mask is derived as a maximum-a-posteriori estimator. Using the prior distribution inherited from Top-k, we derive a new sparsification algorithm which can be interpreted as a regularized form of Top-k. We call this algorithm regularized Top-k (RegTop-k). It utilizes past aggregated gradients to evaluate posterior statistics of the next aggregation. It then prioritizes the local accumulated gradient entries based on these posterior statistics. We validate our derivation through various numerical experiments. In distributed linear regression, it is observed that while Top-k remains at a fixed distance from the global optimum, RegTop-k converges to the global optimum at significantly higher compression ratios. We further demonstrate the generalization of this observation by employing RegTop-k in distributed training of ResNet-18 on CIFAR-10, as well as fine-tuning of multiple computer vision models on the ImageNette dataset. Our numerical results confirm that as the compression ratio increases, RegTop-k sparsification noticeably outperforms Top-k.

URL PDF HTML ☆

赞 0 踩 0

2412.20110 2026-02-17 cs.CV

Cross-Modal Mapping: Mitigating the Modality Gap for Few-Shot Image Classification

Xi Yang, Pai Peng, Wulin Xie, Xiaohuan Lu, Jie Wen

Comments The authors request withdrawal of this article. This version was submitted in error. Compared to the intended final version, it contains inaccuracies and fails to accurately reflect the authors' work and conclusions

2412.14294 2026-02-17 cs.CV cs.LG

TRecViT: A Recurrent Video Transformer

Viorica Pătrăucean, Xu Owen He, Joseph Heyward, Chuhan Zhang, Mehdi S. M. Sajjadi, George-Cristian Muraru, Artem Zholus, Mahdi Karami, Ross Goroshin, Yutian Chen, Simon Osindero, João Carreira, Razvan Pascanu

2412.13474 2026-02-17 cs.RO cs.SY eess.SY

Planning Human-Robot Co-manipulation with Human Motor Control Objectives and Multi-component Reaching Strategies

Kevin Haninger, Luka Peternel

Comments 10 Pages

Journal ref IEEE Robotics and Automation Letters, Volume 10, Issue 2, February 2025

2412.01168 2026-02-17 cs.RO cs.SY eess.SY

On the Surprising Effectiveness of Spectral Clipping in Learning Stable Linear and Latent-Linear Dynamical Systems

Hanyao Guo, Yunhai Han, Harish Ravichandar

2412.00686 2026-02-17 cs.CV cs.AI

LVLM-COUNT: Enhancing the Counting Ability of Large Vision-Language Models

Muhammad Fetrat Qharabagh, Mohammadreza Ghofrani, Kimon Fountoulakis

Comments 38 pages, 24 Figures, 19 Tables

2411.16085 2026-02-17 cs.LG cs.AI cs.CL cs.CV cs.DM

Cautious Optimizers: Improving Training with One Line of Code

Kaizhao Liang, Lizhang Chen, Bo Liu, Qiang Liu

2411.06403 2026-02-17 cs.AI

Mastering NIM and Impartial Games with Weak Neural Networks: An AlphaZero-inspired Multi-Frame Approach

Søren Riis

2410.18784 2026-02-17 cs.LG cs.NA eess.SP math.NA math.ST stat.ML stat.TH

Denoising diffusion probabilistic models are optimally adaptive to unknown low dimensionality

Zhihan Huang, Yuting Wei, Yuxin Chen

Comments Accepted to Mathematics of Operations Research

2410.10481 2026-02-17 cs.LG cs.AI cs.CR

Model-based Large Language Model Customization as Service

Zhaomin Wu, Jizhou Guo, Junyi Hou, Bingsheng He, Lixin Fan, Qiang Yang

Comments Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025)

Journal ref Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (2025)

2410.06244 2026-02-17 cs.CV

Story-Iter: A Training-free Iterative Paradigm for Long Story Visualization

Jiawei Mao, Xiaoke Huang, Yunfei Xie, Yuanqi Chang, Mude Hui, Bingjie Xu, Zeyu Zheng, Zirui Wang, Cihang Xie, Yuyin Zhou

Comments 31 pages, 33 figures, The project page and associated code can be accessed via https://jwmao1.github.io/storyiter/

2410.03919 2026-02-17 cs.LG stat.ML

Online Posterior Sampling with a Diffusion Prior

Branislav Kveton, Boris Oreshkin, Youngsuk Park, Aniket Deshmukh, Rui Song

Comments Advances in Neural Information Processing Systems 37

2410.02081 2026-02-17 cs.LG

MixLinear: Extreme Low Resource Multivariate Time Series Forecasting with 0.1K Parameters

Aitian Ma, Dongsheng Luo, Mo Sha

2409.14823 2026-02-17 cs.SD eess.AS

HiFi-Glot: High-Fidelity Neural Formant Synthesis with Differentiable Resonant Filters

Yicheng Gu, Pablo Pérez Zarazaga, Chaoren Wang, Zhizheng Wu, Zofia Malisz, Gustav Eje Henter, Lauri Juvela

AI 大模型

视觉与机器人

科学与医疗

DeCo: Task Decomposition and Skill Composition for Zero-Shot Generalization in Long-Horizon 3D Manipulation

Autonomous Navigation of Quadrupeds Using Coverage Path Planning with Morphological Skeleton Map

Heuristic Methods are Good Teachers to Distill MLPs for Graph Link Prediction

Car-1000: A New Large Scale Fine-Grained Visual Categorization Dataset

Measure Twice, Cut Once: A Semantic-Oriented Approach to Video Temporal Localization with Video LLMs

Contextual Quantum Neural Networks for Stock Price Prediction

HIPPO: Enhancing the Table Understanding Capability of LLMs through Hybrid-Modal Preference Optimization

Less is More: Improving LLM Alignment via Preference Data Selection

V2V-LLM: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multimodal Large Language Models

Approximating Human Strategic Reasoning with LLM-Enhanced Recursive Reasoners Leveraging Multi-agent Hypergames

Fast Graph Generation via Autoregressive Noisy Filtration Modeling

Physics-Informed Neural Network based Damage Identification for Truss Railroad Bridges

Strawberry Robotic Operation Interface: An Open-Source Device for Collecting Dexterous Manipulation Data in Robotic Strawberry Farming

Adaptive Width Neural Networks

ATRNet-STAR: A Large Dataset and Benchmark Towards Remote Sensing Object Recognition in the Wild

Dataset Distillation via Committee Voting

Regularized Top-$k$: A Bayesian Framework for Gradient Sparsification

Cross-Modal Mapping: Mitigating the Modality Gap for Few-Shot Image Classification

TRecViT: A Recurrent Video Transformer

Planning Human-Robot Co-manipulation with Human Motor Control Objectives and Multi-component Reaching Strategies

On the Surprising Effectiveness of Spectral Clipping in Learning Stable Linear and Latent-Linear Dynamical Systems

LVLM-COUNT: Enhancing the Counting Ability of Large Vision-Language Models

Cautious Optimizers: Improving Training with One Line of Code

Mastering NIM and Impartial Games with Weak Neural Networks: An AlphaZero-inspired Multi-Frame Approach

Denoising diffusion probabilistic models are optimally adaptive to unknown low dimensionality

Model-based Large Language Model Customization as Service

Story-Iter: A Training-free Iterative Paradigm for Long Story Visualization

Online Posterior Sampling with a Diffusion Prior

MixLinear: Extreme Low Resource Multivariate Time Series Forecasting with 0.1K Parameters

HiFi-Glot: High-Fidelity Neural Formant Synthesis with Differentiable Resonant Filters