arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2512.15557 2026-04-03 cs.RO

OMCL: Open-vocabulary Monte Carlo Localization

Evgenii Kruzhkov, Raphael Memmesheimer, Sven Behnke

Comments Accepted to IEEE RA-L

详情

英文摘要

Robust robot localization is an important prerequisite for navigation, but it becomes challenging when the map and robot measurements are obtained from different sensors. Prior methods are often tailored to specific environments, relying on closed-set semantics or fine-tuned features. In this work, we extend Monte Carlo Localization with vision-language features, allowing OMCL to robustly compute the likelihood of visual observations given a camera pose and a 3D map created from posed RGB-D images or aligned point clouds. These open-vocabulary features enable us to associate observations and map elements from different modalities, and to natively initialize global localization through natural language descriptions of nearby objects. We evaluate our approach using Matterport3D and Replica for indoor scenes and demonstrate generalization on SemanticKITTI for outdoor scenes.

URL PDF HTML ☆

赞 0 踩 0

2512.13855 2026-04-03 cs.CV cs.AI

Improvise, Adapt, Overcome -- Telescopic Adapters for Efficient Fine-tuning of Vision Language Models in Medical Imaging

Ujjwal Mishra, Vinita Shukla, Praful Hambarde, Amit Shukla

Comments Accepted at the IEEE/CVF winter conference on applications of computer vision (WACV 2026)

2512.10822 2026-04-03 cs.AI cs.RO

V-OCBF: Learning Safety Filters from Offline Data via Value-Guided Offline Control Barrier Functions

Mumuksh Tayal, Manan Tayal, Aditya Singh, Shishir Kolathaya, Ravi Prakash

Comments 28 pages, 9 figures, 11 tables. Paper accepted at TMLR

2512.10498 2026-04-03 cs.CV

Robust Shape from Focus via Multiscale Directional Dilated Laplacian and Recurrent Network

Khurram Ashfaq, Muhammad Tariq Mahmood

Comments Accepted to IJCV

2512.08991 2026-04-03 cs.CV cs.LG

Deterministic World Models for Verification of Closed-loop Vision-based Systems

Yuang Geng, Zhuoyang Zhou, Zhongzheng Zhang, Siyuan Pan, Hoang-Dung Tran, Ivan Ruchkin

Comments Significantly revised version with additional experiments and updated results. Submitted to EMSOFT 2026

2512.05069 2026-04-03 cs.LG cs.CR quant-ph

Hybrid Quantum-Classical Autoencoders for Unsupervised Network Intrusion Detection

Mohammad Arif Rasyidi, Omar Alhussein, Sami Muhaidat, Ernesto Damiani

Comments The authors have identified limitations in the experimental evaluation, which are insufficient to fully support the paper's conclusions. The manuscript is withdrawn pending additional experiments and analysis

2512.02344 2026-04-03 cs.CV

A multi-weight self-matching visual explanation for cnns on sar images

Siyuan Sun, Yongping Zhang, Hongcheng Zeng, Yamin Wang, Wei Yang, Wanting Yang, Jie Chen

2511.22828 2026-04-03 cs.AI q-bio.NC

Fast dynamical similarity analysis

Arman Behrad, Mitchell Ostrow, Mohammad Taha Fakharian, Ila Fiete, Christian Beste, Shervin Safavi

2511.22294 2026-04-03 cs.CV cs.LG

Structure is Supervision: Multiview Masked Autoencoders for Radiology

Sonia Laguna, Andrea Agostini, Alain Ryser, Samuel Ruiperez-Campillo, Irene Cannistraci, Moritz Vandenhirtz, Stephan Mandt, Nicolas Deperrois, Farhad Nooralahzadeh, Michael Krauthammer, Thomas M. Sutter, Julia E. Vogt

2511.22048 2026-04-03 cs.CV cs.AI

ICM-SR: Image-Conditioned Manifold Regularization for Image Super-Resolution

Junoh Kang, Donghun Ryou, Bohyung Han

2511.21681 2026-04-03 cs.CV

Seeing without Pixels: Perception from Camera Trajectories

Zihui Xue, Kristen Grauman, Dima Damen, Andrew Zisserman, Tengda Han

Comments Accepted by CVPR 2026, Project website: https://sites.google.com/view/seeing-without-pixels

2511.21569 2026-04-03 cs.AI cs.HC

When Models Fabricate Credentials: Measuring How Professional Identity Suppresses Honest Self-Representation

Alex Diep

Comments Submitted to COLM; 43 pages, 12 figures, 15 tables; sharpen focus of paper and reduced length of paper

2511.20456 2026-04-03 cs.LG

Towards Trustworthy Wi-Fi CSI-based Sensing: Systematic Evaluation of Adversarial Robustness

Shreevanth Krishnaa Gopalakrishnan, Stephen Hailes

Comments 18 pages, 5 figures, 6 tables

2511.18123 2026-04-03 cs.CV cs.AI cs.CL cs.LG

Bias Is a Subspace, Not a Coordinate: A Geometric Rethinking of Post-hoc Debiasing in Vision-Language Models

Dachuan Zhao, Weiyue Li, Zhenda Shen, Yushu Qiu, Bowen Xu, Haoyu Chen, Yongchao Chen

Comments Accepted at the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2026

2511.16471 2026-04-03 cs.CV

FastSurfer-CC: A robust, accurate, and comprehensive framework for corpus callosum morphometry

Clemens Pollak, Kersten Diers, Santiago Estrada, David Kügler, Martin Reuter

2511.16145 2026-04-03 cs.LG cs.AI

Labels Matter More Than Models: Rethinking the Unsupervised Paradigm in Time Series Anomaly Detection

Zhijie Zhong, Zhiwen Yu, Kaixiang Yang, Yongheng Liu, Jun Jiang, C. L. Philip Chen

Comments 20 pages, 15 figures, 8 tables. Under review

2511.10853 2026-04-03 cs.AI cs.HC

Advanced Assistance for Traffic Crash Analysis: An AI-Driven Multi-Agent Approach to Pre-Crash Reconstruction

Gerui Xu, Boyou Chen, Huizhong Guo, Dave LeBlanc, Arpan Kusari, Efe Yarbasi, Ananna Ahmed, Zhaonan Sun, Shan Bao

Comments 36 pages, 14 figures

详情

DOI: 10.4271/09-14-01-0033
Journal ref: SAE International Journal of Transportation Safety, 14(1), 2026

英文摘要

Traffic collision reconstruction traditionally relies on human expertise and can be accurate, but pre-crash reconstruction is more challenging. This study develops a multi-agent AI framework that reconstructs pre-crash scenarios and infers vehicle behaviors from fragmented collision data. We propose a two-phase collaborative framework with reconstruction and reasoning stages. The system processes 277 rear-end lead vehicle deceleration (LVD) crashes from the Crash Investigation Sampling System (CISS, 2017 to 2022), integrating narrative reports, structured tabular variables, and scene diagrams. Phase I generates natural-language crash reconstructions from multimodal inputs. Phase II combines these reconstructions with Event Data Recorder (EDR) signals to (1) identify striking and struck vehicles and (2) isolate the EDR records most relevant to the collision moment, enabling inference of key pre-crash behaviors. For validation, we evaluated all LVD cases and emphasized 39 complex crashes where multiple EDR records per crash created ambiguity due to missing or conflicting data. Ground truth was set by consensus of two independent manual annotators, with a separate language model used only to flag potential conflicts for re-checking. The framework achieved 100% accuracy across 4,155 trials; three reasoning models produced identical outputs, indicating that performance is driven by the structured prompts rather than model choice. Research analysts without reconstruction training achieved 92.31% accuracy on the same 39 complex cases. Ablation tests showed that removing structured reasoning anchors reduced case-level accuracy from 99.7% to 96.5% and increased errors across multiple output dimensions. The system remained robust under incomplete inputs. This zero-shot evaluation, without domain-specific training or fine-tuning, suggests a scalable approach for AI-assisted pre-crash analysis.

URL PDF HTML ☆

赞 0 踩 0

2511.10841 2026-04-03 cs.LG cs.AI

FlowPath: Learning Data-Driven Manifolds with Invertible Flows for Robust Irregularly-sampled Time Series Classification

YongKyung Oh, Dong-Young Lim, Sungil Kim

Comments Published at the 40th Annual AAAI Conference on Artificial Intelligence (AAAI 2026). https://ojs.aaai.org/index.php/AAAI/article/view/39643

2511.01375 2026-04-03 cs.AI

Align to Misalign: Automatic LLM Jailbreak with Meta-Optimized LLM Judges

Hamin Koo, Minseon Kim, Jaehyung Kim

Comments ICLR 2026

2510.25147 2026-04-03 cs.LG math.OC

Machine Learning Guided Optimal Transmission Switching to Mitigate Wildfire Ignition Risk

Weimin Huang, Ryan Piansky, Bistra Dilkina, Daniel K. Molzahn

2510.25126 2026-04-03 cs.LG cs.AI

Bridging the Divide: End-to-End Sequence-Graph Learning

Yuen Chen, Yulun Wu, Samuel Sharpe, Igor Melnyk, Nam H. Nguyen, Furong Huang, C. Bayan Bruss, Rizal Fathony

2510.24379 2026-04-03 cs.CV

A Luminance-Aware Multi-Scale Network for Polarization Image Fusion with a Multi-Scene Dataset

Zhuangfan Huang, Xiaosong Li, Gao Wang, Tao Ye, Haishu Tan, Huafeng Li

2510.22855 2026-04-03 cs.LG

A Review of Neural Networks in Precipitation Prediction

Yugong Zeng, Jiayuan Wang, Jonathan Wu

详情

英文摘要

Precipitation prediction has undergone a profound transformation. A notable limitation of traditional NWP is the need for extensive statistical post-processing. To address this challenge, neural network-based approaches were developed. These approaches offer a framework that directly learns the mapping from atmospheric predictors to precipitation targets. Based on the technological development, this article first reviews the traditional precipitation forecasting methods and summarizes the development trends of precipitation forecasting based on neural networks. We then outline the training process, loss functions, and some datasets for precipitation prediction. In the main body of the article, we detail the basic artificial neural networks (ANNs), spatial feature extraction models, time feature extraction models, generative models, Transformer models, graph neural networks (GNNs), and emerging hybrid models. Finally, in the appendix, we supplement the commonly used evaluation metrics. This paper focuses on the advantages and disadvantages of various neural network models in precipitation forecasting applications, and also pays attention to the latest progress of neural network-based methods. Overall, neural networks have significantly improved the accuracy of short-term and medium-term precipitation forecasting, but still face challenges in representing extreme rainfall, handling imbalanced data, and ensuring physical consistency. The latest progress shows that future prediction systems will increasingly rely on the integration of multiple sources of data and hybrid physical-data-driven models to enhance their robustness and applicability. By compositing research covering multiple eras and paradigms, we not only depict the history of neural networks in precipitation prediction but also outline future directions in next generation forecasting systems.

URL PDF HTML ☆

赞 0 踩 0

2510.21852 2026-04-03 cs.LG physics.flu-dyn

Interpretable Diagnostics and Adaptive Data Assimilation for Neural ODEs via Discrete Empirical Interpolation

Hojin Kim, Romit Maulik

Comments 19 pages, 17 figures

2510.14523 2026-04-03 cs.LG math.ST stat.ML stat.TH

On the Identifiability of Tensor Ranks via Prior Predictive Matching

Eliezer da Silva, Arto Klami, Diego Mesquita, Iñigo Urteaga

Comments Accepted at AISTATS 2026

2510.11579 2026-04-03 cs.CV cs.LG

MS-Mix: Sentiment-Guided Adaptive Augmentation for Multimodal Sentiment Analysis

Hongyu Zhu, Lin Chen, Xin Jin, Mingsheng Shang

Comments Under Review

2510.09416 2026-04-03 cs.LG cs.SI

What Do Temporal Graph Learning Models Learn?

Abigail J. Hayes, Tobias Schumacher, Markus Strohmaier

2510.07487 2026-04-03 cs.LG

Reinforcement Learning-based Task Offloading in the Internet of Wearable Things

Waleed Bin Qaim, Aleksandr Ometov, Claudia Campolo, Antonella Molinaro, Elena Simona Lohan, Jari Nurmi

Comments Withdrawn by the authors. A revised version is under preparation

2510.07197 2026-04-03 cs.RO

COMPAct: Computational Optimization and Automated Modular design of Planetary Actuators

Aman Singh, Deepak Kapa, Suryank Joshi, Shishir Kolathaya

Comments 8 pages, 9 Figures, 2 tables; first two authors contributed equally; published in 2026 IEEE International Conference on Robotics and Automation (ICRA 2026)

2510.06339 2026-04-03 cs.RO

Vi-TacMan: Articulated Object Manipulation via Vision and Touch

Leiyao Cui, Zihang Zhao, Sirui Xie, Wenhuan Zhang, Zhi Han, Yixin Zhu

Comments ICRA 2026