arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.15194 2026-03-17 cs.LG

PiGRAND: Physics-informed Graph Neural Diffusion for Intelligent Additive Manufacturing

Benjamin Uhrich, Tim Häntschel, Erhard Rahm

Comments 36 pages, 29 figures

详情

英文摘要

A comprehensive understanding of heat transport is essential for optimizing various mechanical and engineering applications, including 3D printing. Recent advances in machine learning, combined with physics-based models, have enabled a powerful fusion of numerical methods and data-driven algorithms. This progress is driven by the availability of limited sensor data in various engineering and scientific domains, where the cost of data collection and the inaccessibility of certain measurements are high. To this end, we present PiGRAND, a Physics-informed graph neural diffusion framework. In order to reduce the computational complexity of graph learning, an efficient graph construction procedure was developed. Our approach is inspired by the explicit Euler and implicit Crank-Nicolson methods for modeling continuous heat transport, leveraging sub-learning models to secure the accurate diffusion across graph nodes. To enhance computational performance, our approach is combined with efficient transfer learning. We evaluate PiGRAND on thermal images from 3D printing, demonstrating significant improvements in prediction accuracy and computational performance compared to traditional graph neural diffusion (GRAND) and physics-informed neural networks (PINNs). These enhancements are attributed to the incorporation of physical principles derived from the theoretical study of partial differential equations (PDEs) into the learning model. The PiGRAND code is open-sourced on GitHub: https://github.com/bu32loxa/PiGRAND

URL PDF HTML ☆

赞 0 踩 0

2603.15188 2026-03-17 cs.LG cs.NI

Joint Routing and Model Pruning for Decentralized Federated Learning in Bandwidth-Constrained Multi-Hop Wireless Networks

Xiaoyu He, Weicai Li, Tiejun Lv, Xi Yu

2603.15187 2026-03-17 cs.CL

The Hrunting of AI: Where and How to Improve English Dialectal Fairness

Wei Li, Adrian de Wynter

2603.15186 2026-03-17 cs.RO

NavGSim: High-Fidelity Gaussian Splatting Simulator for Large-Scale Navigation

Jiahang Liu, Yuanxing Duan, Jiazhao Zhang, Minghan Li, Shaoan Wang, Zhizheng Zhang, He Wang

2603.15185 2026-03-17 cs.RO cs.AI cs.CV

What Matters for Scalable and Robust Learning in End-to-End Driving Planners?

David Holtz, Niklas Hanselmann, Simon Doll, Marius Cordts, Bernt Schiele

Comments To be published in CVPR Findings 2026

2603.15184 2026-03-17 cs.LG cs.AI cs.NE eess.IV

CATFormer: When Continual Learning Meets Spiking Transformers With Dynamic Thresholds

Vaishnavi Nagabhushana, Kartikay Agrawal, Ayon Borthakur

Comments Accepted for publication in the proceedings of the Neuro for AI & AI for Neuro Workshop at AAAI 2026 (PMLR)

2603.15179 2026-03-17 cs.RO

KiRAS: Keyframe Guided Self-Imitation for Robust and Adaptive Skill Learning in Quadruped Robots

Xiaoyi Wei, Peng Zhai, Jiaxin Tu, Yueqi Zhang, Yuqi Li, Zonghao Zhang, Hu Zhou, Lihua Zhang

Comments Received by 2026 IEEE International Conference on Robotics and Automation (ICRA)

2603.15169 2026-03-17 cs.RO

ForceVLA2: Unleashing Hybrid Force-Position Control with Force Awareness for Contact-Rich Manipulation

Yang Li, Zhaxizhuoma, Hongru Jiang, Junjie Xia, Hongquan Zhang, Jinda Du, Yunsong Zhou, Jia Zeng, Ce Hao, Jieji Ren, Qiaojun Yu, Cewu Lu, Yu Qiao, Jiangmiao Pang

Comments Accepted by CVPR 2026

2603.15168 2026-03-17 cs.CV cs.AI

Multimodal Connectome Fusion via Cross-Attention for Autism Spectrum Disorder Classification Using Graph Learning

Ansar Rahman, Hassan Shojaee-Mend, Sepideh Hatamikia

Comments 29 Pages; 5 Figures

详情

英文摘要

Autism spectrum disorder (ASD) is a complex neurodevelopmental condition characterized by atypical functional brain connectivity and subtle structural alterations. rs-fMRI has been widely used to identify disruptions in large-scale brain networks, while structural MRI provides complementary information about morphological organization. Despite their complementary nature, effectively integrating these heterogeneous imaging modalities within a unified framework remains challenging. This study proposes a multimodal graph learning framework that preserves the dominant role of functional connectivity while integrating structural imaging and phenotypic information for ASD classification. The proposed framework is evaluated on ABIDE-I dataset. Each subject is represented as a node within a population graph. Functional and structural features are extracted as modality-specific node attributes, while inter-subject relationships are modeled using a pairwise association encoder (PAE) based on phenotypic information. Two Edge Variational GCNs are trained to learn subject-level embeddings. To enable effective multimodal integration, we introduce a novel asymmetric transformer-based cross-attention mechanism that allows functional embeddings to selectively incorporate complementary structural information while preserving functional dominance. The fused embeddings are then passed to a MLP for ASD classification. Using stratified 10-fold cross-validation, the framework achieved an AUC of 87.3% and an accuracy of 84.4%. Under leave-one-site-out cross-validation (LOSO-CV), the model achieved an average cross-site accuracy of 82.0%, outperforming existing methods by approximately 3% under 10-fold cross-validation and 7% under LOSO-CV. The proposed framework effectively integrates heterogeneous multimodal data from the multi-site ABIDE-I dataset, improving automated ASD classification across imaging sites.

URL PDF HTML ☆

赞 0 踩 0

2603.15167 2026-03-17 cs.CV

Question-guided Visual Compression with Memory Feedback for Long-Term Video Understanding

Sosuke Yamao, Natsuki Miyahara, Yuankai Qi, Shun Takeuchi

Comments Accepted to CVPR 2026. The first two authors contributed equally to this work

2603.15166 2026-03-17 cs.CV

DAIT: Distillation from Vision-Language Models to Lightweight Classifiers with Adaptive Intermediate Teacher Transfer

Zhengxu He, Jun Li, Zhijian Wu

2603.15153 2026-03-17 cs.CV

TextOVSR: Text-Guided Real-World Opera Video Super-Resolution

Hua Chang, Xin Xu, Wei Liu, Jiayi Wu, Kui Jiang, Fei Ma, Qi Tian

2603.15152 2026-03-17 cs.RO

Master Micro Residual Correction with Adaptive Tactile Fusion and Force-Mixed Control for Contact-Rich Manipulation

Xingting Li, Yifan Xie, Han Liu, Wei Hou, Guangyu Chen, Shoujie Li, Wenbo Ding

2603.15150 2026-03-17 cs.CV

SNCE: Geometry-Aware Supervision for Scalable Discrete Image Generation

Shufan Li, Jiuxiang Gu, Kangning Liu, Zhe Lin, Aditya Grover, Jason Kuen

Comments 21 pages, 4 figures

2603.15137 2026-03-17 cs.CV

Context-Aware Sensor Modeling for Asynchronous Multi-Sensor Tracking in Stone Soup

Martin Vonheim Larsen, Kim Mathiassen

2603.15136 2026-03-17 cs.LG cs.AI

Safe Flow Q-Learning: Offline Safe Reinforcement Learning with Reachability-Based Flow Policies

Mumuksh Tayal, Manan Tayal, Ravi Prakash

Comments 24 pages, 6 figures, 4 tables

2603.15134 2026-03-17 cs.RO

Confusion-Aware In-Context-Learning for Vision-Language Models in Robotic Manipulation

Yayun He, Zuheng Kang, Botao Zhao, Zhouyin Wu, Junqing Peng, Jianzong Wang

Comments Accepted by the 29th International Conference on Computer Supported Cooperative Work in Design (CSCWD 2026)

2603.15131 2026-03-17 cs.CV

Low-light Image Enhancement with Retinex Decomposition in Latent Space

Bolun Zheng, Qingshan Lei, Quan Chen, Qianyu Zhang, Kainan Yu, Xu Jia, Lingyu Zhu

Comments Submit to IEEE TIP

2603.15121 2026-03-17 cs.LG stat.ML

Establishing Construct Validity in LLM Capability Benchmarks Requires Nomological Networks

Timo Freiesleben

2603.15117 2026-03-17 cs.CL

MMKU-Bench: A Multimodal Update Benchmark for Diverse Visual Knowledge

Baochen Fu, Yuntao Du, Cheng Chang, Baihao Jin, Wenzhi Deng, Muhao Xu, Hongmei Yan, Weiye Song, Yi Wan

2603.15110 2026-03-17 cs.LG cs.CV

Sampling-guided exploration of active feature selection policies

Gabriel Bernardino, Anders Jonsson, Patrick Clarysse, Nicolas Duchateau

2603.15109 2026-03-17 cs.CV

PAKAN: Pixel Adaptive Kolmogorov-Arnold Network Modules for Pansharpening

Haoyu Zhang, Haojing Chen, Zhen Zhong, Liangjian Deng

Comments 16 pages,5 figures,4 tables

2603.15108 2026-03-17 cs.RO

BodyGuards: Escorting by Multiple Robots in Unknown Environment under Limited Communication

Zhuoli Tian, Yanze Bao, Meng Guo

Comments Accept by ICRA 2026

2603.15106 2026-03-17 cs.AI

PrototypeNAS: Rapid Design of Deep Neural Networks for Microcontroller Units

Mark Deutel, Simon Geis, Axel Plinge

Comments 16 pages, 6 figures, 4 tables

详情

英文摘要

Enabling efficient deep neural network (DNN) inference on edge devices with different hardware constraints is a challenging task that typically requires DNN architectures to be specialized for each device separately. To avoid the huge manual effort, one can use neural architecture search (NAS). However, many existing NAS methods are resource-intensive and time-consuming because they require the training of many different DNNs from scratch. Furthermore, they do not take the resource constraints of the target system into account. To address these shortcomings, we propose PrototypeNAS, a zero-shot NAS method to accelerate and automate the selection, compression, and specialization of DNNs to different target microcontroller units (MCUs). We propose a novel three-step search method that decouples DNN design and specialization from DNN training for a given target platform. First, we present a novel search space that not only cuts out smaller DNNs from a single large architecture, but instead combines the structural optimization of multiple architecture types, as well as optimization of their pruning and quantization configurations. Second, we explore the use of an ensemble of zero-shot proxies during optimization instead of a single one. Third, we propose the use of Hypervolume subset selection to distill DNN architectures from the Pareto front of the multi-objective optimization that represent the most meaningful tradeoffs between accuracy and FLOPs. We evaluate the effectiveness of PrototypeNAS on 12 different datasets in three different tasks: image classification, time series classification, and object detection. Our results demonstrate that PrototypeNAS is able to identify DNN models within minutes that are small enough to be deployed on off-the-shelf MCUs and still achieve accuracies comparable to the performance of large DNN models.

URL PDF HTML ☆

赞 0 踩 0

2603.15100 2026-03-17 cs.CV

Learning from Limited and Incomplete Data: A Multimodal Framework for Predicting Pathological Response in NSCLC

Alice Natalina Caragliano, Giulia Farina, Fatih Aksu, Camillo Maria Caruso, Claudia Tacconi, Carlo Greco, Lorenzo Nibid, Edy Ippolito, Michele Fiore, Giuseppe Perrone, Sara Ramella, Paolo Soda, Valerio Guarrasi

2603.15097 2026-03-17 cs.RO

AeroGrab: A Unified Framework for Aerial Grasping in Cluttered Environments

Shivansh Pratap Singh, Naveen Sudheer Nair, Samaksh Ujjawal, Sarthak Mishra, Soham Patil, Rishabh Dev Yadav, Spandan Roy

2603.15094 2026-03-17 cs.CL cs.AI

Bridging National and International Legal Data: Two Projects Based on the Japanese Legal Standard XML Schema for Comparative Law Studies

Makoto Nakamura

Comments 21 pages, 5 figures

2603.15084 2026-03-17 cs.RO

HALO:Closing Sim-to-Real Gap for Heavy-loaded Humanoid Agile Motion Skills via Differentiable Simulation

Xingyi Wang, Chenyun Zhang, Weiji Xie, Chao Yu, Wei Song, Chenjia Bai, Shiqiang Zhu

Comments 9 pages, 5 figures, conference

2603.15083 2026-03-17 cs.CV cs.AI cs.HC cs.MM cs.SD

ReactMotion: Generating Reactive Listener Motions from Speaker Utterance

Cheng Luo, Bizhu Wu, Bing Li, Jianfeng Ren, Ruibin Bai, Rong Qu, Linlin Shen, Bernard Ghanem

Comments 42 pages, 11 tables, 8 figures

2603.15079 2026-03-17 cs.LG math.AT

Interpretable Classification of Time Series Using Euler Characteristic Surfaces

Salam Rabindrajit Luwang, Sushovan Majhi, Vishal Mandal, Atish J. Mitra, Md. Nurujjaman, Buddha Nath Sharma