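arXivDaily daily academic digest: synchronized with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.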

2603.04470 2026-03-06 cs.RO

Efficient Autonomous Navigation of a Quadruped Robot in Underground Mines on Edge Hardware

Yixiang Gao, Kwame Awuah-Offei

Details

English abstract

Embodied navigation in underground mines faces significant challenges, including narrow passages, uneven terrain, near-total darkness, GPS-denied conditions, and limited communication infrastructure. While recent learning-based approaches rely on GPU-accelerated inference and extensive training data, we present a fully autonomous navigation stack for a Boston Dynamics Spot quadruped robot that runs entirely on a low-power Intel NUC edge computer with no GPU and no network connectivity requirements. The system integrates LiDAR-inertial odometry, scan-matching localization against a prior map, terrain segmentation, and visibility-graph global planning with a velocity-regulated local path follower, achieving real-time perception-to-action at consistent control rates. After a single mapping pass of the environment, the system handles arbitrary goal locations within the known map without any environment-specific training or learned components. We validate the system through repeated field trials using four target locations of varying traversal difficulty in an experimental underground mine, accumulating over 700 m of fully autonomous traverse with a 100% success rate across all 20 trials (5 repetitions × 4 targets) and an overall Success weighted by Path Length (SPL) of 0.73 ± 0.09.
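The abstract reports SPL without stating the formula. The sketch below assumes the definition commonly used in the embodied-navigation literature (Anderson et al., 2018): the mean over trials of S_i · l_i / max(p_i, l_i), where S_i is the success flag, l_i the shortest-path length, and p_i the length actually traveled. The function name `spl` and the trial values are hypothetical illustrations, not data from the paper.

```python
from typing import Sequence


def spl(successes: Sequence[bool],
        shortest: Sequence[float],
        actual: Sequence[float]) -> float:
    """Success weighted by Path Length, assuming the common definition:
    mean over trials of S_i * l_i / max(p_i, l_i)."""
    if not successes or not (len(successes) == len(shortest) == len(actual)):
        raise ValueError("inputs must be non-empty and of equal length")
    total = 0.0
    for ok, l, p in zip(successes, shortest, actual):
        if ok:
            # A successful trial contributes the ratio of the shortest path
            # to the path actually taken, capped at 1; a failure contributes 0.
            total += l / max(p, l)
    return total / len(successes)


# Hypothetical trials (not from the paper): two successes and one failure.
score = spl([True, True, False], [10.0, 20.0, 15.0], [10.0, 25.0, 30.0])
print(round(score, 3))
```

Under this definition, a 100% success rate with SPL below 1 would mean that every trial reached its goal but the traveled paths were, on average, longer than the reference shortest paths.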

2603.04466 2026-03-06 cs.RO cs.LG

Act-Observe-Rewrite: Multimodal Coding Agents as In-Context Policy Learners for Robot Manipulation

Vaishak Kumar

2603.04464 2026-03-06 cs.LG cs.AI

Understanding the Dynamics of Demonstration Conflict in In-Context Learning

Difan Jiao, Di Wang, Lijie Hu

Comments 19 pages, 12 figures, 4 tables

2603.04463 2026-03-06 cs.RO

GAIDE: Graph-based Attention Masking for Spatial- and Embodiment-aware Motion Planning

Davood Soleymanzadeh, Xiao Liang, Minghui Zheng

2603.04461 2026-03-06 cs.LG cs.AI

MAD-SmaAt-GNet: A Multimodal Advection-Guided Neural Network for Precipitation Nowcasting

Samuel van Wonderen, Siamak Mehrkanoon

Comments 12 pages, 5 figs

2603.04460 2026-03-06 cs.LG cs.AI

VSPrefill: Vertical-Slash Sparse Attention with Lightweight Indexing for Long-Context Prefilling

Chen Guanzhong

2603.04458 2026-03-06 cs.LG cs.AI

Learning Unified Distance Metric for Heterogeneous Attribute Data Clustering

Yiqun Zhang, Mingjie Zhao, Yizhou Chen, Yang Lu, Yiu-ming Cheung

Comments ESWA 2025 paper

Details

DOI: 10.1016/j.eswa.2025.126738
Journal ref: Expert Systems with Applications 273 (2025): 126738

English abstract

Datasets composed of numerical and categorical attributes (also called mixed data hereinafter) are common in real clustering tasks. Differing from numerical attributes that indicate tendencies between two concepts (e.g., high and low temperature) with their values in a well-defined Euclidean distance space, categorical attribute values are different concepts (e.g., different occupations) embedded in an implicit space. Simultaneously exploiting these two very different types of information is an unavoidable but challenging problem, and most advanced attempts either encode the heterogeneous numerical and categorical attributes into one type, or define a unified metric for them for mixed data clustering, leaving their inherent connection unrevealed. This paper, therefore, studies the connection among attributes of any type and proposes a novel Heterogeneous Attribute Reconstruction and Representation (HARR) learning paradigm accordingly for cluster analysis. The paradigm transforms heterogeneous attributes into a homogeneous status for distance metric learning, and integrates the learning with clustering to automatically adapt the metric to different clustering tasks. Unlike most existing works, which directly adopt predefined distance metrics or learn attribute weights to search for clusters in a subspace, we propose to project the values of each attribute into unified learnable multiple spaces to more finely represent and learn the distance metric for categorical data. HARR is parameter-free, convergence-guaranteed, and can more effectively self-adapt to different sought numbers of clusters $k$. Extensive experiments illustrate its superiority in terms of accuracy and efficiency.

2603.04457 2026-03-06 cs.AI cs.CE physics.soc-ph

Capability Thresholds and Manufacturing Topology: How Embodied Intelligence Triggers Phase Transitions in Economic Geography

Xinmin Fang, Lingfeng Tao, Zhengxiong Li

Details

English abstract

The fundamental topology of manufacturing has not undergone a paradigm-level transformation since Henry Ford's moving assembly line in 1913. Every major innovation of the past century, from the Toyota Production System to Industry 4.0, has optimized within the Fordist paradigm without altering its structural logic: centralized mega-factories, located near labor pools, producing at scale. We argue that embodied intelligence is poised to break this century-long stasis, not by making existing factories more efficient, but by triggering phase transitions in manufacturing economic geography itself. When embodied AI capabilities cross critical thresholds in dexterity, generalization, reliability, and tactile-vision fusion, the consequences extend far beyond cost reduction: they restructure where factories are built, how supply chains are organized, and what constitutes viable production scale. We formalize this by defining a Capability Space C = (d, g, r, t) and showing that the site-selection objective function undergoes topological reorganization when capability vectors cross critical surfaces. Through three pathways, weight inversion, batch collapse, and human-infrastructure decoupling, we show that embodied intelligence enables demand-proximal micro-manufacturing, eliminates "manufacturing deserts," and reverses geographic concentration driven by labor arbitrage. We further introduce Machine Climate Advantage: once human workers are removed, optimal factory locations are determined by machine-optimal conditions (low humidity, high irradiance, thermal stability), factors orthogonal to traditional siting logic, creating a production geography with no historical precedent. This paper establishes Embodied Intelligence Economics, the study of how physical AI capability thresholds reshape the spatial and structural logic of production.

2603.04454 2026-03-06 cs.CL cs.AI

Query Disambiguation via Answer-Free Context: Doubling Performance on Humanity's Last Exam

Michael Majurski, Cynthia Matuszek

2603.04453 2026-03-06 cs.CL cs.AI cs.LG

Induced Numerical Instability: Hidden Costs in Multimodal Large Language Models

Wai Tuck Wong, Jun Sun, Arunesh Sinha

2603.04452 2026-03-06 cs.CL cs.AI

A unified foundational framework for knowledge injection and evaluation of Large Language Models in Combustion Science

Zonglin Yang, Runze Mao, Tianhao Wu, Han Li, QingGuo Zhou, Zhi X. Chen

Comments 5 figures, 1 table

2603.04451 2026-03-06 cs.LG cs.AI quant-ph

On Emergences of Non-Classical Statistical Characteristics in Classical Neural Networks

Hanyu Zhao, Yang Wu, Yuexian Hou

2603.04449 2026-03-06 cs.LG cs.AI

An Explainable Ensemble Framework for Alzheimer's Disease Prediction Using Structured Clinical and Cognitive Data

Nishan Mitra

Comments 6 pages, 7 figures, 2 tables. Preprint version

2603.04448 2026-03-06 cs.AI cs.CL cs.CV cs.LG cs.MA

SkillNet: Create, Evaluate, and Connect AI Skills

Yuan Liang, Ruobin Zhong, Haoming Xu, Chen Jiang, Yi Zhong, Runnan Fang, Jia-Chen Gu, Shumin Deng, Yunzhi Yao, Mengru Wang, Shuofei Qiao, Xin Xu, Tongtong Wu, Kun Wang, Yang Liu, Zhen Bi, Jungang Lou, Yuchen Eleanor Jiang, Hangcheng Zhu, Gang Yu, Haiwen Hong, Longtao Huang, Hui Xue, Chenxi Wang, Yijun Wang, Zifei Shan, Xi Chen, Zhaopeng Tu, Feiyu Xiong, Xin Xie, Peng Zhang, Zhengke Gui, Lei Liang, Jun Zhou, Chiyu Wu, Jin Shang, Yu Gong, Junyu Lin, Changliang Xu, Hongjie Deng, Wen Zhang, Keyan Ding, Qiang Zhang, Fei Huang, Ningyu Zhang, Jeff Z. Pan, Guilin Qi, Haofen Wang, Huajun Chen

Comments http://skillnet.openkg.cn/

2603.04437 2026-03-06 cs.LG cs.AI

ASFL: An Adaptive Model Splitting and Resource Allocation Framework for Split Federated Learning

Chuiyang Meng, Ming Tang, Vincent W. S. Wong

2603.04436 2026-03-06 cs.LG cs.AI

ZorBA: Zeroth-order Federated Fine-tuning of LLMs with Heterogeneous Block Activation

Chuiyang Meng, Ming Tang, Vincent W. S. Wong

2603.04431 2026-03-06 cs.LG cs.AI

Uncertainty-Calibrated Spatiotemporal Field Diffusion with Sparse Supervision

Kevin Valencia, Xihaier Luo, Shinjae Yoo, David Keetae Park

Comments 18 pages, 9 figures, 6 tables

2603.04243 2026-03-06 cs.CV

A Unified Framework for Joint Detection of Lacunes and Enlarged Perivascular Spaces

Lucas He, Krinos Li, Hanyuan Zhang, Runlong He, Silvia Ingala, Luigi Lorenzini, Marleen de Bruijne, Frederik Barkhof, Rhodri Davies, Carole Sudre

2603.04179 2026-03-06 cs.CV

NOVA3R: Non-pixel-aligned Visual Transformer for Amodal 3D Reconstruction

Weirong Chen, Chuanxia Zheng, Ganlin Zhang, Andrea Vedaldi, Daniel Cremers

Comments Accepted to ICLR 2026. Project Page: https://wrchen530.github.io/nova3r

2603.04162 2026-03-06 cs.CL cs.AI

Bielik-Q2-Sharp: A Comparative Study of Extreme 2-bit Quantization Methods for a Polish 11B Language Model

Jakub Prejzner

Comments 17 pages, 13 tables. All models and Hessians available at https://huggingface.co/Jakubrd4

2603.04058 2026-03-06 cs.CV

TumorFlow: Physics-Guided Longitudinal MRI Synthesis of Glioblastoma Growth

Valentin Biller, Niklas Bubeck, Lucas Zimmer, Ayhan Can Erdur, Sandeep Nagar, Anke Meyer-Baese, Daniel Rückert, Benedikt Wiestler, Jonas Weidner

2603.03769 2026-03-06 cs.CV cs.AI cs.LG

DMD-augmented Unpaired Neural Schrödinger Bridge for Ultra-Low Field MRI Enhancement

Youngmin Kim, Jaeyun Shin, Jeongchan Kim, Taehoon Lee, Jaemin Kim, Peter Hsu, Jelle Veraart, Jong Chul Ye

2603.03510 2026-03-06 cs.CL cs.AI

A theoretical model of dynamical grammatical gender shifting based on set-valued set function

Mohamed El Idrissi

Comments 20 pages, 2 figures, 4 tables

2603.03388 2026-03-06 cs.LG cs.AI

RADAR: Learning to Route with Asymmetry-aware DistAnce Representations

Hang Yi, Ziwei Huang, Yining Ma, Zhiguang Cao

Comments Accepted by ICLR

2603.03137 2026-03-06 cs.RO

RL-Based Coverage Path Planning for Deformable Objects on 3D Surfaces

Yuhang Zhang, Jinming Ma, Feng Wu

Comments 8 pages, 8 figures. Accepted to the 2026 IEEE International Conference on Robotics and Automation (ICRA)

2603.03056 2026-03-06 cs.LG cs.CL

Incremental Graph Construction Enables Robust Spectral Clustering of Texts

Marko Pranjić, Boshko Koloski, Nada Lavrač, Senja Pollak, Marko Robnik-Šikonja

Comments MP and BK contributed equally

2603.03043 2026-03-06 cs.LG cs.AI cs.CR cs.CV

IoUCert: Robustness Verification for Anchor-based Object Detectors

Benedikt Brückner, Alejandro J. Mercado, Yanghao Zhang, Panagiotis Kouvaros, Alessio Lomuscio

2603.02743 2026-03-06 cs.CV

MultiShadow: Multi-Object Shadow Generation for Image Compositing via Diffusion Model

Waqas Ahmed, Dean Diepeveen, Ferdous Sohel

Comments This work has been submitted to the IEEE for possible publication

2603.02573 2026-03-06 cs.CV

Track4World: Feedforward World-centric Dense 3D Tracking of All Pixels

Jiahao Lu, Jiayi Xu, Wenbo Hu, Ruijie Zhu, Chengfeng Zhao, Sai-Kit Yeung, Ying Shan, Yuan Liu

Comments Project Page: https://jiah-cloud.github.io/Track4World.github.io/ Code: https://github.com/TencentARC/Track4World

2603.01776 2026-03-06 cs.CL cs.AI cs.CV

FreeAct: Freeing Activations for LLM Quantization

Xiaohao Liu, Xiaobo Xia, Manyi Zhang, Ji-Fu Li, Xianzhi Yu, Fei Shen, Xiu Su, See-Kiong Ng, Tat-Seng Chua

Comments 26 pages, 18 figures, 2 tables