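arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.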

2502.00835 2026-03-03 cs.RO cs.LG

CAIMAN: Causal Action Influence Detection for Sample-efficient Loco-manipulation

Yuanchen Yuan, Jin Cheng, Núria Armengol Urpí, Stelian Coros

Abstract

Enabling legged robots to perform non-prehensile loco-manipulation is crucial for enhancing their versatility. Learning behaviors such as whole-body object pushing often requires sophisticated planning strategies or extensive task-specific reward shaping, especially in unstructured environments. In this work, we present CAIMAN, a practical reinforcement learning framework that encourages the agent to gain control over other entities in the environment. CAIMAN leverages causal action influence as an intrinsic motivation objective, allowing legged robots to efficiently acquire object pushing skills even under sparse task rewards. We employ a hierarchical control strategy, combining a low-level locomotion module with a high-level policy that generates task-relevant velocity commands and is trained to maximize the intrinsic reward. To estimate causal action influence, we learn the dynamics of the environment by integrating a kinematic prior with data collected during training. We empirically demonstrate CAIMAN's superior sample efficiency and adaptability to diverse scenarios in simulation, as well as its successful transfer to real-world systems without further fine-tuning. A video demo is available at https://www.youtube.com/watch?v=dNyvT04Cqaw.

2411.17513 2026-03-03 cs.CV cs.GR cs.LG

Human Vision Constrained Super-Resolution

Volodymyr Karpenko, Taimoor Tariq, Jorge Condor, Piotr Didyk

2410.23450 2026-03-03 cs.LG cs.AI cs.RO stat.ML

Return Augmented Decision Transformer for Off-Dynamics Reinforcement Learning

Ruhan Wang, Yu Yang, Zhishuai Liu, Dongruo Zhou, Pan Xu

Comments: 26 pages, 11 tables, 8 figures. Published in Transactions on Machine Learning Research (TMLR)

2410.01143 2026-03-03 cs.RO

StraightTrack: Towards Mixed Reality Navigation System for Percutaneous K-wire Insertion

Han Zhang, Benjamin D. Killeen, Yu-Chun Ku, Lalithkumar Seenivasan, Yuxuan Zhao, Mingxu Liu, Yue Yang, Suxi Gu, Alejandro Martin-Gomez, Russell H. Taylor, Greg Osgood, Mathias Unberath

DOI: 10.1049/htl2.12103
Journal ref: Healthcare Technology Letters Vol. 11 Issue 6 2024

Abstract

In percutaneous pelvic trauma surgery, accurate placement of Kirschner wires (K-wires) is crucial to ensure effective fracture fixation and avoid complications due to breaching the cortical bone along an unsuitable trajectory. Surgical navigation via mixed reality (MR) can help achieve precise wire placement in a low-profile form factor. Current approaches in this domain are as yet unsuitable for real-world deployment because they fall short of guaranteeing accurate visual feedback due to uncontrolled bending of the wire. To ensure accurate feedback, we introduce StraightTrack, an MR navigation system designed for percutaneous wire placement in complex anatomy. StraightTrack features a marker body equipped with a rigid access cannula that mitigates wire bending due to interactions with soft tissue and a covered bony surface. Integrated with an Optical See-Through Head-Mounted Display (OST HMD) capable of tracking the cannula body, StraightTrack offers real-time 3D visualization and guidance without external trackers, which are prone to losing line-of-sight. In phantom experiments with two experienced orthopedic surgeons, StraightTrack improves wire placement accuracy, achieving the ideal trajectory within $5.26 \pm 2.29$ mm and $2.88 \pm 1.49$ degrees, compared to over 12.08 mm and 4.07 degrees for comparable methods. As MR navigation systems continue to mature, StraightTrack realizes their potential for internal fracture fixation and other percutaneous orthopedic procedures.

2407.08086 2026-03-03 cs.LG stat.CO stat.ML

The GeometricKernels Package: Heat and Matérn Kernels for Geometric Learning on Manifolds, Meshes, and Graphs

Peter Mostowsky, Vincent Dutordoir, Iskander Azangulov, Noémie Jaquier, Michael John Hutchinson, Aditya Ravuri, Leonel Rozo, Alexander Terenin, Viacheslav Borovitskiy

2405.08205 2026-03-03 cs.LG

Generative Enzyme Design Guided by Functionally Important Sites and Small-Molecule Substrates

Zhenqiao Song, Yunlong Zhao, Wenxian Shi, Wengong Jin, Yang Yang, Lei Li

2404.17931 2026-03-03 cs.LG cs.CV

Critical Review for One-class Classification: recent advances and the reality behind them

Toshitaka Hayashi, Dalibor Cimr, Hamido Fujita, Richard Cimler

2404.08480 2026-03-03 cs.LG cs.CL stat.CO

Using ChatGPT for Data Science Analyses

Ozan Evkaya, Miguel de Carvalho

Comments: 19 pages with figures and appendix

2404.06230 2026-03-03 cs.LG cs.CR cs.DC

Aggressive or Imperceptible, or Both: Network Pruning Assisted Hybrid Byzantines in Federated Learning

Emre Ozfatura, Kerem Ozfatura, Baturalp Buyukates, Mert Coskuner, Alptekin Kupcu, Deniz Gunduz

2307.14025 2026-03-03 cs.LG cs.CV eess.IV q-bio.QM stat.ML

Topological Inductive Bias fosters Multiple Instance Learning in Data-Scarce Scenarios

Salome Kazeminia, Carsten Marr, Bastian Rieck

2305.04979 2026-03-03 cs.LG cs.DC stat.ML

FedHB: Hierarchical Bayesian Federated Learning

Minyoung Kim, Timothy Hospedales

2305.02850 2026-03-03 cs.LG cs.CC cs.CG cs.DS

Impossibility of Depth Reduction in Explainable Clustering

Chengyuan Deng, Surya Teja Gavva, Karthik C. S., Parth Patel, Adarsh Srinivasan

2303.16668 2026-03-03 cs.LG cs.AI cs.CR stat.ML

Protecting Federated Learning from Extreme Model Poisoning Attacks via Multidimensional Time Series Anomaly Detection

Edoardo Gabrielli, Dimitri Belli, Zoe Matrullo, Vittorio Miori, Gabriele Tolomei

2603.01847 2026-03-03 cs.CV

GroupEnsemble: Efficient Uncertainty Estimation for DETR-based Object Detection

Yutong Yang, Katarina Popović, Julian Wiederer, Markus Braun, Vasileios Belagiannis, Bin Yang

Comments: Accepted to IEEE IV 2026. 8 pages, 5 figures

Abstract

Detection Transformer (DETR) and its variants show strong performance on object detection, a key task for autonomous systems. However, a critical limitation of these models is that their confidence scores only reflect semantic uncertainty, failing to capture the equally important spatial uncertainty. This results in an incomplete assessment of the detection reliability. On the other hand, Deep Ensembles can tackle this by providing high-quality spatial uncertainty estimates. However, their immense memory consumption makes them impractical for real-world applications. A cheaper alternative, Monte Carlo (MC) Dropout, suffers from high latency due to the need of multiple forward passes during inference to estimate uncertainty. To address these limitations, we introduce GroupEnsemble, an efficient and effective uncertainty estimation method for DETR-like models. GroupEnsemble simultaneously predicts multiple individual detection sets by feeding additional diverse groups of object queries to the transformer decoder during inference. Each query group is transformed by the shared decoder in isolation and predicts a complete detection set for the same input. An attention mask is applied to the decoder to prevent inter-group query interactions, ensuring each group detects independently to achieve reliable ensemble-based uncertainty estimation. By leveraging the decoder's inherent parallelism, GroupEnsemble efficiently estimates uncertainty in a single forward pass without sequential repetition. We validated our method under autonomous driving scenes and common daily scenes using the Cityscapes and COCO datasets, respectively. The results show that a hybrid approach combining MC-Dropout and GroupEnsemble outperforms Deep Ensembles on several metrics at a fraction of the cost. The code is available at https://github.com/yutongy98/GroupEnsemble.
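
The abstract above describes an attention mask that keeps query groups from interacting inside a shared decoder. As a rough illustration only, the sketch below builds such a block-diagonal mask in plain Python. The function name, the group count, the group size, and the convention that True means "blocked" are all assumptions made for this example; it is not taken from the authors' code.

```python
def group_attention_mask(num_groups: int, queries_per_group: int) -> list[list[bool]]:
    """Return a square mask in which True means "attention blocked".

    Query i may attend to query j only when both fall in the same group,
    so the allowed (False) region is block-diagonal.
    Assumption: groups are contiguous and equally sized.
    """
    total = num_groups * queries_per_group
    return [
        [(i // queries_per_group) != (j // queries_per_group) for j in range(total)]
        for i in range(total)
    ]


# Hypothetical sizes: 3 groups of 2 object queries each.
mask = group_attention_mask(num_groups=3, queries_per_group=2)

# Print the mask: "." = attention allowed, "x" = attention blocked.
for row in mask:
    print("".join("x" if blocked else "." for blocked in row))
```

With a mask of this shape, each group only sees its own queries, which is what would let several detection sets be produced side by side in one decoder pass.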

2603.01841 2026-03-03 cs.LG

Trivial Graph Features and Classical Learning are Enough to Detect Random Anomalies

Matthieu Latapy, Stephany Rajeh

2603.01840 2026-03-03 cs.CV eess.IV

FireRed-OCR Technical Report

Hao Wu, Haoran Lou, Xinyue Li, Zuodong Zhong, Zhaojun Sun, Phellon Chen, Xuanhe Zhou, Kai Zuo, Yibo Chen, Xu Tang, Yao Hu, Boxiang Zhou, Jian Wu, Yongji Wu, Wenxin Yu, Yingmiao Liu, Yuhao Huang, Manjie Xu, Gang Liu, Yidong Ma, Zhichao Sun, Changhao Qiao

2603.01839 2026-03-03 cs.CV cs.RO

LEAR: Learning Edge-Aware Representations for Event-to-LiDAR Localization

Kuangyi Chen, Jun Zhang, Yuxi Hu, Yi Zhou, Friedrich Fraundorfer

2603.01837 2026-03-03 cs.LG stat.ML

Constrained Particle Seeking: Solving Diffusion Inverse Problems with Just Forward Passes

Hongkun Dou, Zike Chen, Zeyu Li, Hongjue Li, Lijun Yang, Yue Deng

Comments: Accepted by AAAI 2026

2603.01836 2026-03-03 cs.CV

Affine Correspondences in Stereo Vision: Theory, Practice, and Limitations

Levente Hajder

2603.01825 2026-03-03 cs.LG cs.GT stat.ML

Uncertainty Quantification of Click and Conversion Estimates for the Autobidding

Ivan Zhigalskii, Andrey Pudovikov, Aleksandr Katrutsa, Egor Samosvat

Comments: 17 pages (10 main text + 7 appendix), 5 figures, 2 tables

2603.01824 2026-03-03 cs.CL cs.LG

OpenAutoNLU: Open Source AutoML Library for NLU

Grigory Arshinov, Aleksandr Boriskin, Sergey Senichev, Ayaz Zaripov, Daria Galimzianova, Daniil Karpov, Leonid Sanochkin

2603.01822 2026-03-03 cs.AI

Emerging Human-like Strategies for Semantic Memory Foraging in Large Language Models

Eric Lacosse, Mariana Duarte, Peter M. Todd, Daniel C. McNamee

2603.01813 2026-03-03 cs.RO

SSMG-Nav: Enhancing Lifelong Object Navigation with Semantic Skeleton Memory Graph

Haochen Niu, Lantao Zhang, Xingwu Ji, Rendong Ying, Peilin Liu, Fei Wen

Comments: Accepted by 2026 ICRA

2603.01812 2026-03-03 cs.CV cs.NA math.NA

Neural Operator-Grounded Continuous Tensor Function Representation and Its Applications

Ruoyang Su, Xi-Le Zhao, Sheng Liu, Wei-Hao Wu, Yisi Luo, Michael K. Ng

2603.01804 2026-03-03 cs.CV cs.AI

Non-verbal Real-time Human-AI Interaction in Constrained Robotic Environments

Dragos Costea, Alina Marcu, Cristina Lazar, Marius Leordeanu

2603.01801 2026-03-03 cs.AI

What Papers Don't Tell You: Recovering Tacit Knowledge for Automated Paper Reproduction

Lehui Li, Ruining Wang, Haochen Song, Yaoxin Mao, Tong Zhang, Yuyao Wang, Jiayi Fan, Yitong Zhang, Jieping Ye, Chengqi Zhang, Yongshun Gong

Comments: 32 pages (+ appendix), 8 figures. Lehui Li and Ruining Wang contributed equally. Yongshun Gong is the corresponding author

2603.01799 2026-03-03 cs.AI cs.LO

Incremental, inconsistency-resilient reasoning over Description Logic Abox streams

Cas Proost, Pieter Bonte

2603.01792 2026-03-03 cs.CL cs.AI

ALTER: Asymmetric LoRA for Token-Entropy-Guided Unlearning of LLMs

Xunlei Chen, Jinyu Guo, Yuang Li, Zhaokun Wang, Yi Gong, Jie Zou, Jiwei Wei, Wenhong Tian

Comments: Accepted at The 40th Annual AAAI Conference on Artificial Intelligence (AAAI 2026)

2603.01791 2026-03-03 cs.CL cs.IR

Semantic Novelty Trajectories in 80,000 Books: A Cross-Corpus Embedding Analysis

Fred Zimmerman

Comments: 12 pages, 4 figures, 5 tables

2603.01788 2026-03-03 cs.CL

nchellwig at SemEval-2026 Task 3: Self-Consistent Structured Generation (SCSG) for Dimensional Aspect-Based Sentiment Analysis using Large Language Models

Nils Constantin Hellwig, Jakob Fehle, Udo Kruschwitz, Christian Wolff