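arXivDaily daily academic digest: synchronized with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.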

2510.14612 2026-03-27 cs.RO

Proprioceptive Image: An Image Representation of Proprioceptive Data from Quadruped Robots for Contact Estimation Learning

Gabriel Fischer Abati, João Carlos Virgolino Soares, Giulio Turrisi, Victor Barasuol, Claudio Semini

Comments Accepted to the IEEE International Conference on Robotics and Automation (ICRA) 2026

Abstract

This paper presents a novel approach for representing proprioceptive time-series data from quadruped robots as structured two-dimensional images, enabling the use of convolutional neural networks for learning locomotion-related tasks. The proposed method encodes temporal dynamics from multiple proprioceptive signals, such as joint positions, IMU readings, and foot velocities, while preserving the robot's morphological structure in the spatial arrangement of the image. This transformation captures inter-signal correlations and gait-dependent patterns, providing a richer feature space than direct time-series processing. We apply this concept in the problem of contact estimation, a key capability for stable and adaptive locomotion on diverse terrains. Experimental evaluations on both real-world datasets and simulated environments show that our image-based representation consistently enhances prediction accuracy and generalization over conventional sequence-based models, underscoring the potential of cross-modal encoding strategies for robotic state learning. Our method achieves superior performance on the contact dataset, improving contact state accuracy from 87.7% to 94.5% over the recently proposed MI-HGNN method, using a 15 times shorter window size.

2510.13772 2026-03-27 cs.LG

Tensor Gaussian Processes: Efficient Solvers for Nonlinear PDEs

Qiwei Yuan, Zhitong Xu, Yinghao Chen, Yiming Xu, Houman Owhadi, Shandian Zhe

Comments Accepted at AISTATS 2026

2510.13675 2026-03-27 cs.CV cs.LG

Seeing and Knowing in the Wild: Open-domain Visual Entity Recognition with Large-scale Knowledge Graphs via Contrastive Learning

Hongkuan Zhou, Lavdim Halilaj, Sebastian Monka, Stefan Schmid, Yuqicheng Zhu, Jingcheng Wu, Nadeem Nazer, Steffen Staab

Comments Accepted by AAAI 2026

2510.13046 2026-03-27 cs.CV

One Dimensional CNN ECG Mamba for Multilabel Abnormality Classification in 12 Lead ECG

Huawei Jiang, Husna Mutahira, Gan Huang, Mannan Saeed Muhammad

Comments 6 Pages, 2 figures

2510.12453 2026-03-27 cs.LG

Time-Correlated Video Bridge Matching

Viacheslav Vasilev, Arseny Ivanov, Nikita Gushchin, Maria Kovaleva, Alexander Korotin

2510.10627 2026-03-27 cs.CL

FactAppeal: Identifying Epistemic Factual Appeals in News Media

Guy Mor-Lan, Tamir Sheafer, Shaul R. Shenhav

Comments Accepted to EACL Findings 2026

2510.04900 2026-03-27 cs.LG cs.SY eess.SY

Benchmarking M-LTSF: Frequency and Noise-Based Evaluation of Multivariate Long Time Series Forecasting Models

Nick Janssen, Melanie Schaller, Bodo Rosenhahn

Comments 13 pages, 16 figures, 1 table

2510.00810 2026-03-27 cs.CL

Family Matters: Language Transfer and Merging for Adapting Small LLMs to Faroese

Jenny Kunz, Iben Nyholm Debess, Annika Simonsen

2509.26087 2026-03-27 cs.CV

Easy3D-Labels: Supervising Semantic Occupancy Estimation with 3D Pseudo-Labels for Automotive Perception

Seamie Hayes, Ganesh Sistu, Tim Brophy, Ciaran Eising

Abstract

In perception for automated vehicles, safety is critical not only for the driver but also for other agents in the scene, particularly vulnerable road users such as pedestrians and cyclists. Previous representation methods, such as Bird's Eye View, collapse vertical information, leading to ambiguity in 3D object localisation and limiting accurate understanding of the environment for downstream tasks such as motion planning and scene forecasting. In contrast, semantic occupancy provides a full 3D representation of the surroundings, addressing these limitations. Furthermore, self-supervised semantic occupancy has seen increased attention in the automated vehicle domain. Unlike supervised methods that rely on manually annotated data, these approaches use 2D pseudo-labels, improving scalability by reducing the need for labour-intensive annotation. Consequently, such models employ techniques such as novel view synthesis, cross-view rendering, and depth estimation to allow for model supervision against the 2D labels. However, such approaches often incur high computational and memory costs during training, especially for novel view synthesis. To address these issues, we propose Easy3D-Labels, which are 3D pseudo-ground-truth labels generated using Grounded-SAM and Metric3Dv2, with temporal aggregation for densification, permitting supervision directly in 3D space. Easy3D-Labels can be readily integrated into existing models to provide model supervision, yielding substantial performance gains, with mIoU increasing by 45% and RayIoU by 49% when applied to OccNeRF on the Occ3D-nuScenes dataset. Additionally, we introduce EasyOcc, a streamlined model trained solely on these 3D pseudo-labels, avoiding the need for complex rendering strategies, and achieving 15.7 mIoU on Occ3D-nuScenes. Easy3D-Labels improve scene understanding by reducing object duplication and enhancing depth estimation accuracy.

2509.24296 2026-03-27 cs.CL cs.AI

DiffuGuard: How Intrinsic Safety is Lost and Found in Diffusion Large Language Models

Zherui Li, Zheng Nie, Zhenhong Zhou, Yue Liu, Yitong Zhang, Yu Cheng, Qingsong Wen, Kun Wang, Yufei Guo, Jiaheng Zhang

Comments Accepted by ICLR 2026

2509.23768 2026-03-27 cs.AI cs.CL

From What to Why: A Multi-Agent System for Evidence-based Chemical Reaction Condition Reasoning

Cheng Yang, Jiaxuan Lu, Haiyuan Wan, Junchi Yu, Feiwei Qin

Comments Accepted by ICLR 2026

2509.19354 2026-03-27 cs.CL cs.AI

GeoResponder: Towards Building Geospatial LLMs for Time-Critical Disaster Response

Ahmed El Fekih Zguir, Ferda Ofli, Muhammad Imran

Comments 16 pages, 5 figures, Major revision with new geospatial reasoning framework (GeoResponder), previously titled "RoadMind"

2509.15917 2026-03-27 cs.RO cs.SY eess.SY math.OC

An MPC framework for efficient navigation of mobile robots in cluttered environments

Johannes Köhler, Daniel Zhang, Raffaele Soloperto, Andrea Carron, Melanie Zeilinger

Comments - Code available at: https://github.com/IntelligentControlSystems/ClutteredEnvironment - Supplementary video: https://youtu.be/Hn_hpAmGgq0

2509.15199 2026-03-27 cs.LG cs.DB

CausalPre: Scalable and Effective Data Pre-Processing for Causal Fairness

Ying Zheng, Yangfan Jiang, Kian-Lee Tan

Comments Accepted at ICDE 2026

2509.08617 2026-03-27 cs.LG

Towards Interpretable Deep Neural Networks for Tabular Data

Khawla Elhadri, Jörg Schlötterer, Christin Seifert

Comments Presented at the 3rd Workshop on Unifying Representations in Neural Models (UniReps) at NeurIPS 2025

2509.08522 2026-03-27 cs.RO

RoboMatch: A Unified Mobile-Manipulation Teleoperation Platform with Auto-Matching Network Architecture for Long-Horizon Tasks

Hanyu Liu, Yunsheng Ma, Jiaxin Huang, Keqiang Ren, Jiayi Wen, Yilin Zheng, Haoru Luan, Baishu Wan, Pan Li, Jiejun Hou, Zhihua Wang, Zhigong Song

Comments Accepted to the 2026 IEEE International Conference on Robotics and Automation (ICRA)

2509.03345 2026-03-27 cs.AI cs.CL

Do Language Models Follow Occam's Razor? An Evaluation of Parsimony in Inductive and Abductive Reasoning

Yunxin Sun, Abulhair Saparov

2508.15090 2026-03-27 cs.CL cs.AI

Mapping the Course for Prompt-based Structured Prediction

Matt Pauk, Maria Leonor Pacheco

2508.03272 2026-03-27 cs.LG cs.IT math.IT stat.ML

The alpha-beta divergence for real and complex data

Sergio Cruces

2508.02013 2026-03-27 cs.CL

SpeechRole: A Large-Scale Dataset and Benchmark for Evaluating Speech Role-Playing Agents

Changhao Jiang, Jiajun Sun, Yifei Cao, Jiabao Zhuang, Xinmeng Che, Hui Li, Xiaoran Fan, Ming Zhang, Junjie Ye, Shihan Dou, Zhiheng Xi, Jingqi Tong, Yilong Wu, Baoyu Fan, Tao Ji, Tao Gui, Qi Zhang, Xuanjing Huang

2507.20423 2026-03-27 cs.CL cs.AI

CodeNER: Code Prompting for Named Entity Recognition

Sungwoo Han, Hyeyeon Kim, Jingun Kwon, Hidetaka Kamigaito, Manabu Okumura

Comments 18 pages, 7 figures

2507.19737 2026-03-27 cs.LG cs.AI cs.CY

Predicting Human Mobility during Extreme Events via LLM-Enhanced Cross-City Learning

Yinzhou Tang, Huandong Wang, Xiaochen Fan, Yong Li

2507.16507 2026-03-27 cs.AI cs.IR

Agentic RAG with Knowledge Graphs for Complex Multi-Hop Reasoning in Real-World Applications

Jean Lelong, Adnane Errazine, Annabelle Blangero

Comments ECAI 2025 demo track, 4 pages

2507.14237 2026-03-27 cs.SD cs.AI eess.AS eess.SP

U-DREAM: Unsupervised Dereverberation guided by a Reverberation Model

Louis Bahrman, Marius Rodrigues, Mathieu Fontaine, Gaël Richard

2507.10800 2026-03-27 cs.CV

ThinkingViT: Matryoshka Thinking Vision Transformer for Elastic Inference

Ali Hojjat, Janek Haberer, Soren Pirk, Olaf Landsiedel

Comments Accepted at CVPR'26, please cite the conference version

2507.05631 2026-03-27 cs.CV

OFFSET: Segmentation-based Focus Shift Revision for Composed Image Retrieval

Zhiwei Chen, Yupeng Hu, Zixu Li, Zhiheng Fu, Xuemeng Song, Liqiang Nie

2506.13734 2026-03-27 cs.CL cs.AI cs.LG

Instruction Following by Principled Boosting Attention of Large Language Models

Vitoria Guardieiro, Avishree Khare, Adam Stein, Eric Wong

2506.02533 2026-03-27 cs.CL cs.HC

Machine Learning for Enhancing Deliberation in Online Political Discussions and Participatory Processes: A Survey

Maike Behrendt, Stefan Sylvius Wagner, Carina Weinmann, Marike Bormann, Mira Warne, Stefan Harmeling

2506.01061 2026-03-27 cs.CV

AceVFI: A Comprehensive Survey of Advances in Video Frame Interpolation

Dahyeon Kye, Changhyun Roh, Sukhun Ko, Chanho Eom, Jihyong Oh

Comments Accepted to IEEE Transactions on Circuits and Systems for Video Technology (TCSVT). Please visit our project page at https://github.com/CMLab-Korea/Awesome-Video-Frame-Interpolation

2505.24840 2026-03-27 cs.CV cs.AI cs.CL cs.LG

The LLM Bottleneck: Why Open-Source Vision LLMs Struggle with Hierarchical Visual Recognition

Yuwen Tan, Yuan Qing, Boqing Gong

Comments Accepted to CVPR 2026. Project page and code: https://yuanqing-ai.github.io/llm-hierarchy/