arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.26541 2026-03-30 cs.CV

OVI-MAP:Open-Vocabulary Instance-Semantic Mapping

Zilong Deng, Federico Tombari, Marc Pollefeys, Johanna Wald, Daniel Barath

详情

英文摘要

Incremental open-vocabulary 3D instance-semantic mapping is essential for autonomous agents operating in complex everyday environments. However, it remains challenging due to the need for robust instance segmentation, real-time processing, and flexible open-set reasoning. Existing methods often rely on the closed-set assumption or dense per-pixel language fusion, which limits scalability and temporal consistency. We introduce OVI-MAP that decouples instance reconstruction from semantic inference. We propose to build a class-agnostic 3D instance map that is incrementally constructed from RGB-D input, while semantic features are extracted only from a small set of automatically selected views using vision-language models. This design enables stable instance tracking and zero-shot semantic labeling throughout online exploration. Our system operates in real time and outperforms state-of-the-art open-vocabulary mapping baselines on standard benchmarks.

URL PDF HTML ☆

赞 0 踩 0

2603.26516 2026-03-30 cs.CL cs.AI cs.LG

ALBA: A European Portuguese Benchmark for Evaluating Language and Linguistic Dimensions in Generative LLMs

Inês Vieira, Inês Calvo, Iago Paulo, James Furtado, Rafael Ferreira, Diogo Tavares, Diogo Glória-Silva, David Semedo, João Magalhães

Comments PROPOR 2026 - The 17th International Conference on Computational Processing of Portuguese

2603.26515 2026-03-30 cs.CL cs.AI

JAL-Turn: Joint Acoustic-Linguistic Modeling for Real-Time and Robust Turn-Taking Detection in Full-Duplex Spoken Dialogue Systems

Guangzhao Yang, Yu Pan, Shi Qiu, Ningjie Bai

Comments 8 pages, in porgress

2603.26512 2026-03-30 cs.AI

CADSmith: Multi-Agent CAD Generation with Programmatic Geometric Validation

Jesse Barkley, Rumi Loghmani, Amir Barati Farimani

Comments 8 pages, 6 figures

2603.26511 2026-03-30 cs.CL cs.AI cs.LG

AMALIA Technical Report: A Fully Open Source Large Language Model for European Portuguese

Afonso Simplício, Gonçalo Vinagre, Miguel Moura Ramos, Diogo Tavares, Rafael Ferreira, Giuseppe Attanasio, Duarte M. Alves, Inês Calvo, Inês Vieira, Rui Guerra, James Furtado, Beatriz Canaverde, Iago Paulo, Vasco Ramos, Diogo Glória-Silva, Miguel Faria, Marcos Treviso, Daniel Gomes, Pedro Gomes, David Semedo, André Martins, João Magalhães

Comments PROPOR 2026 - The 17th International Conference on Computational Processing of Portuguese

2603.26510 2026-03-30 cs.CL

Clinical named entity recognition in the Portuguese language: a benchmark of modern BERT models and LLMs

Vinicius Anjos de Almeida, Sandro Saorin da Silva, Josimar Chire, Leonardo Vicenzi, Nícolas Henrique Borges, Helena Kociolek, Sarah Miriã de Castro Rocha, Frederico Nassif Gomes, Júlia Cristina Ferreira, Oge Marques, Lucas Emanuel Silva e Oliveira

Comments Under peer review. GitHub: https://github.com/GRUPOMED4U/clinical_ner_benchmark_paper

2603.26509 2026-03-30 cs.CV

Conditional Diffusion for 3D CT Volume Reconstruction from 2D X-rays

Martin Rath, Morteza Ghahremani, Yitong Li, Ashkan Taghipour, Marcus Makowski, Christian Wachinger

2603.26486 2026-03-30 cs.CV

ClipTTT: CLIP-Guided Test-Time Training Helps LVLMs See Better

Mriganka Nath, Anurag Das, Jiahao Xie, Bernt Schiele

Comments 30 pages, 12 figures

2603.26483 2026-03-30 cs.LG

EcoFair: Trustworthy and Energy-Aware Routing for Privacy-Preserving Vertically Partitioned Medical Inference

Mostafa Anoosha, Dhavalkumar Thakker, Kuniko Paxton, Koorosh Aslansefat, Bhupesh Kumar Mishra, Baseer Ahmad, Rameez Raja Kureshi

Comments 16 pages, 4 figures, 4 tables

2603.26482 2026-03-30 cs.LG

SPECTRA: An Efficient Spectral-Informed Neural Network for Sensor-Based Activity Recognition

Deepika Gurung, Lala Shakti Swarup Ray, Mengxi Liu, Bo Zhou, Paul Lukowicz

2603.26478 2026-03-30 cs.SD stat.ME stat.ML

Probabilistic Multilabel Graphical Modelling of Motif Transformations in Symbolic Music

Ron Taieb, Yoel Greenberg, Barak Sober

Comments 23 pages (21 pages main text), 2 figures. Submitted to Journal of New Music Research (Special Issue on Computational and Cognitive Musicology)

2603.26468 2026-03-30 cs.CV

HyVIC: A Metric-Driven Spatio-Spectral Hyperspectral Image Compression Architecture Based on Variational Autoencoders

Martin Hermann Paul Fuchs, Behnood Rasti, Begüm Demir

2603.26467 2026-03-30 cs.RO

Addressing Ambiguity in Imitation Learning through Product of Experts based Negative Feedback

John Bateman, Andy M. Tyrrell, Jihong Zhu

2603.26466 2026-03-30 cs.RO

Adapt as You Say: Online Interactive Bimanual Skill Adaptation via Human Language Feedback

Zhuo Li, Dianxi Li, Tao Teng, Quentin Rouxel, Zhipeng Dong, Dennis Hong, Darwin Caldwell, Fei Chen

Comments 11 pages, 15 figures, submitted to IEEE TMECH

2603.26465 2026-03-30 cs.LG cs.AI

A Boltzmann-machine-enhanced Transformer For DNA Sequence Classification

Zhixuan Cao, Yishu Xu, Xuang WU

Comments 19 pages

2603.26464 2026-03-30 cs.LG math.DS

Automatic feature identification in least-squares policy iteration using the Koopman operator framework

Christian Mugisho Zagabe, Sebastian Peitz

Comments 6 pages

2603.26462 2026-03-30 cs.RO

DTP-Attack: A decision-based black-box adversarial attack on trajectory prediction

Jiaxiang Li, Jun Yan, Daniel Watzenig, Huilin Yin

Comments ICRA 2026

2603.26449 2026-03-30 cs.CL

ClimateCheck 2026: Scientific Fact-Checking and Disinformation Narrative Classification of Climate-related Claims

Raia Abu Ahmad, Max Upravitelev, Aida Usmanova, Veronika Solopova, Georg Rehm

Comments Accepted at NSLP@LREC 2026

2603.26447 2026-03-30 cs.CV cs.LG

Meta-Learned Adaptive Optimization for Robust Human Mesh Recovery with Uncertainty-Aware Parameter Updates

Shaurjya Mandal, Nutan Sharma, John Galeotti

2603.26444 2026-03-30 cs.CV

Image-based Quantification of Postural Deviations on Patients with Cervical Dystonia: A Machine Learning Approach Using Synthetic Training Data

Roland Stenger, Sebastian Löns, Nele Brügge, Feline Hamami, Alexander Münchau, Theresa Paulus, Anne Weissbach, Tatiana Usnich, Max Borsche, Martje G. Pauly, Lara M. Lange, Markus A. Hobert, Rebecca Herzog, Ana Luísa de Almeida Marcelino, Tina Mainka, Friederike Schumann, Lukas L. Goede, Johanna Reimer, Julienne Haas, Jos Becktepe, Alexander Baumann, Robin Wolke, Chi Wang Ip, Thorsten Odorfer, Daniel Zeller, Lisa Harder-Rauschenberger, John-Ih Lee, Philipp Albrecht, Tristan Kölsche, Joachim K. Krauss, Johanna M. Nagel, Joachim Runge, Johanna Doll-Lee, Simone Zittel, Kai Grimm, Pawel Tacik, André Lee, Tobias Bäumer, Sebastian Fudickar

2603.26441 2026-03-30 cs.RO

120 Minutes and a Laptop: Minimalist Image-goal Navigation via Unsupervised Exploration and Offline RL

Xiaoming Liu, Borong Zhang, Qingbiao Li, Steven Morad

Comments 8 pages, 8 figures, submitted to IEEE Robotics and Automation Letters (RA-L)

2603.26440 2026-03-30 cs.LG cs.CE

Interpretable long-term traffic modelling on national road networks using theory-informed deep learning

Yue Li, Shujuan Chen, Akihiro Shimoda, Ying Jin

2603.26434 2026-03-30 cs.CL

Automating Clinical Information Retrieval from Finnish Electronic Health Records Using Large Language Models

Mikko Saukkoriipi, Nicole Hernandez, Jaakko Sahlsten, Kimmo Kaski, Otso Arponen

2603.26430 2026-03-30 cs.CL cs.IR

Analysing Calls to Order in German Parliamentary Debates

Nina Smirnova, Daniel Dan, Philipp Mayr

Comments The paper is accepted to the 3rd Workshop on Natural Language Processing for Political Sciences (PoliticalNLP 2026) co-located with LREC 2026

2603.26415 2026-03-30 cs.LG cs.AI stat.AP

KMM-CP: Practical Conformal Prediction under Covariate Shift via Selective Kernel Mean Matching

Siddhartha Laghuvarapu, Rohan Deb, Jimeng Sun

2603.26412 2026-03-30 cs.RO

Generalizable task-oriented object grasping through LLM-guided ontology and similarity-based planning

Hao Chen, Takuya Kiyokawa, Weiwei Wan, Kensuke Harada

Comments Accepted by Robotics and Autonomous Systems

2603.26410 2026-03-30 cs.CL cs.AI

Why Models Know But Don't Say: Chain-of-Thought Faithfulness Divergence Between Thinking Tokens and Answers in Open-Weight Reasoning Models

Richard J. Young

Comments 19 pages, 8 figures, 4 tables

2603.26403 2026-03-30 cs.RO

T-800: An 800 Hz Data Glove for Precise Hand Gesture Tracking

Haoyang Luo, Zihang Zhao, Leiyao Cui, Saiyao Zhang, Liu Yang, Zhi Han, Xiyuan Tang, Yixin Zhu

2603.26401 2026-03-30 cs.CL

Word Alignment-Based Evaluation of Uniform Meaning Representations

Daniel Zeman, Federica Gamba

2603.26400 2026-03-30 cs.CV

SHANDS: A Multi-View Dataset and Benchmark for Surgical Hand-Gesture and Error Recognition Toward Medical Training

Le Ma, Thiago Freitas dos Santos, Nadia Magnenat-Thalmann, Katarzyna Wac