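arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.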

2509.20739 2026-03-09 cs.RO cs.CV

Decision-Driven Semantic Object Exploration for Legged Robots via Confidence-Calibrated Perception and Topological Subgoal Selection

Guoyang Zhao, Yudong Li, Weiqing Qi, Kai Zhang, Bonan Liu, Kai Chen, Haoang Li, Jun Ma

Abstract

Conventional navigation pipelines for legged robots remain largely geometry-centric, relying on dense SLAM representations that are fragile under rapid motion and offer limited support for semantic decision making in open-world exploration. In this work, we focus on decision-driven semantic object exploration, where the primary challenge is not map consistency but how noisy and heterogeneous semantic observations can be transformed into stable and executable exploration decisions. We propose a vision-based approach that explicitly addresses this problem through confidence-calibrated semantic evidence arbitration, a controlled-growth semantic topological memory, and a semantic utility-driven subgoal selection mechanism. These components enable the robot to accumulate task-relevant semantic knowledge over time and select exploration targets that balance semantic relevance, reliability, and reachability, without requiring dense geometric reconstruction. Extensive experiments in both simulation and real-world environments demonstrate that the proposed mechanisms consistently improve the quality of semantic decision inputs, subgoal selection accuracy, and overall exploration performance on legged robots.

2509.20507 2026-03-09 cs.LG

Auto-Regressive U-Net for Full-Field Prediction of Shrinkage-Induced Damage in Concrete

Liya Gaynutdinova, Petr Havlásek, Ondřej Rokoš, Fleur Hendriks, Martin Doškář

2509.19674 2026-03-09 cs.LG cs.CV

C^2Prompt: Class-aware Client Knowledge Interaction for Federated Continual Learning

Kunlun Xu, Yibo Feng, Jiangmeng Li, Yongsheng Qi, Jiahuan Zhou

Comments Accepted by NeurIPS 2025

2509.14063 2026-03-09 cs.RO

Language Conditioning Improves Accuracy of Aircraft Goal Prediction in Non-Towered Airspace

Sundhar Vinodh Sangeetha, Chih-Yuan Chiu, Sarah H. Q. Li, Shreyas Kousik

Comments The last two authors advised equally. Accepted to the 2026 IEEE International Conference on Robotics and Automation. 8 pages, 6 figures

2509.13386 2026-03-09 cs.RO cs.LG

VEGA: Electric Vehicle Navigation Agent via Physics-Informed Neural Operator and Proximal Policy Optimization

Hansol Lim, Minhyeok Im, Jonathan Boyack, Jee Won Lee, Jongseong Brad Choi

Comments This work has been submitted to the 2026 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) for possible publication

2509.11629 2026-03-09 cs.LG cs.AI

Reasoned Safety Alignment: Ensuring Jailbreak Defense via Answer-Then-Check

Chentao Cao, Xiaojun Xu, Bo Han, Hang Li

Comments ICLR 2026

2509.08105 2026-03-09 cs.CL

MERLIN: Multi-Stage Curriculum Alignment for Multilingual Encoder-LLM Integration in Cross-Lingual Reasoning

Kosei Uemura, David Guzmán, Quang Phuoc Nguyen, Jesujoba Oluwadara Alabi, En-shiun Annie Lee, David Ifeoluwa Adelani

Comments Accepted to EACL 2026 (main conference)

2509.08087 2026-03-09 cs.LG cs.AI

Performance Assessment Strategies for Language Model Applications in Healthcare

Victor Garcia, Mariia Sidulova, Aldo Badano

2509.07945 2026-03-09 cs.LG

One Model for All Tasks: Leveraging Efficient World Models in Multi-Task Planning

Yuan Pu, Yazhe Niu, Jia Tang, Junyu Xiong, Shuai Hu, Hongsheng Li

Comments 55 pages, 20 figures. Accepted as a conference paper at ICLR 2026

2509.05554 2026-03-09 cs.CV cs.IR

RED: Robust Event-Guided Motion Deblurring with Modality-Specific Disentanglement

Yihong Leng, Siming Zheng, Jinwei Chen, Bo Li, Jiaojiao Li, Peng-Tao Jiang

2509.03953 2026-03-09 cs.AI cs.SC cs.SY eess.SY

Handling Infinite Domain Parameters in Planning Through Best-First Search with Delayed Partial Expansions

Ángel Aso-Mollar, Diego Aineto, Enrico Scala, Eva Onaindia

2508.21513 2026-03-09 cs.LG cond-mat.dis-nn cs.AI

A Geometric Perspective on the Difficulties of Learning GNN-based SAT Solvers

Geri Skenderi

Comments Accepted in the Proceedings track of the GRaM Workshop @ ICLR 2026

2508.13238 2026-03-09 cs.CV

DianJin-OCR-R1: Enhancing OCR Capabilities via a Reasoning-and-Tool Interleaved Vision-Language Model

Qian Chen, Xianyin Zhang, Lifan Guo, Feng Chen, Chi Zhang

2508.10154 2026-03-09 cs.LG

Characterizing Evolution in Expectation-Maximization Estimates for Overspecified Mixed Linear Regression

Zhankun Luo, Abolfazl Hashemi

Comments This paper was accepted by Transactions on Machine Learning Research (TMLR). The code for numerical experiments is available at https://github.com/dassein/em_overspecified_mlr

Abstract

Mixture models have attracted significant attention due to practical effectiveness and comprehensive theoretical foundations. A persisting challenge is model misspecification, which occurs when the model to be fitted has more mixture components than those in the data distribution. In this paper, we develop a theoretical understanding of the Expectation-Maximization (EM) algorithm's behavior in the context of targeted model misspecification for overspecified two-component Mixed Linear Regression (2MLR) with unknown $d$-dimensional regression parameters and mixing weights. In Theorem 5.1 at the population level, with an unbalanced initial guess for mixing weights, we establish linear convergence of regression parameters in $O(\log(1/\epsilon))$ steps. Conversely, with a balanced initial guess for mixing weights, we observe sublinear convergence in $O(\epsilon^{-2})$ steps to achieve $\epsilon$-accuracy in Euclidean distance. In Theorem 6.1 at the finite-sample level, for mixtures with sufficiently unbalanced fixed mixing weights, we demonstrate a statistical accuracy of $O((d/n)^{1/2})$, whereas for those with sufficiently balanced fixed mixing weights, the accuracy is $O((d/n)^{1/4})$ given $n$ data samples. Furthermore, we underscore the connection between our population level and finite-sample level results: by setting the desired final accuracy $\epsilon$ in Theorem 5.1 to match that in Theorem 6.1 at the finite-sample level, namely letting $\epsilon = O((d/n)^{1/2})$ for sufficiently unbalanced fixed mixing weights and $\epsilon = O((d/n)^{1/4})$ for sufficiently balanced fixed mixing weights, we intuitively derive iteration complexity bounds $O(\log (1/\epsilon))=O(\log (n/d))$ and $O(\epsilon^{-2})=O((n/d)^{1/2})$ at the finite-sample level for sufficiently unbalanced and balanced initial mixing weights. We further extend our analysis in the overspecified setting to the low-SNR regime.
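
The substitution behind the two iteration-complexity bounds quoted in the abstract can be written out as a short worked step. This is only a sketch of the arithmetic: it takes the target accuracy to be exactly of the order stated in the abstract and ignores constants, and it is not taken from the paper's proofs.

```latex
% Unbalanced case: target accuracy \epsilon \asymp (d/n)^{1/2}
\log\frac{1}{\epsilon}
  = \log\left(\frac{n}{d}\right)^{1/2}
  = \tfrac{1}{2}\log\frac{n}{d}
  = O\!\left(\log\frac{n}{d}\right)

% Balanced case: target accuracy \epsilon \asymp (d/n)^{1/4}
\epsilon^{-2}
  = \left(\left(\frac{d}{n}\right)^{1/4}\right)^{-2}
  = \left(\frac{n}{d}\right)^{1/2}
```

In both cases the iteration count grows with the ratio $n/d$: logarithmically in the unbalanced case, and as a square root in the balanced case.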

2508.03351 2026-03-09 cs.CV cs.AI cs.CL

VLMQ: Token Saliency-Driven Post-Training Quantization for Vision-language Models

Yufei Xue, Yushi Huang, Jiawei Shao, Lunjie Zhu, Chi Zhang, Xuelong Li, Jun Zhang

2508.01653 2026-03-09 cs.CV cs.AI

MAP: Mitigating Hallucinations in Large Vision-Language Models with Map-Level Attention Processing

Chenxi Li, Yichen Guo, Benfang Qian, Jinhao You, Kai Tang, Yaosong Du, Zonghao Zhang, Xiande Huang

2507.23428 2026-03-09 cs.LG

Merging Memory and Space: A State Space Neural Operator

Nodens Koren, Samuel Lanthaler

2507.20230 2026-03-09 cs.AI cs.CV cs.MA

A Multi-Agent System Enables Versatile Information Extraction from the Chemical Literature

Yufan Chen, Ching Ting Leung, Bowen Yu, Jianwei Sun, Yong Huang, Linyan Li, Hao Chen, Hanyu Gao

2507.19883 2026-03-09 cs.RO

Bridging Simulation and Usability: A User-Friendly Framework for Scenario Generation in CARLA

Ahmed Abouelazm, Mohammad Mahmoud, Conrad Walter, Oleksandr Shchetsura, Erne Hussong, Helen Gremmelmaier, J. Marius Zöllner

Comments Paper is accepted in IEEE International Automated Vehicle Validation Conference (IAVVC 2025)

2507.19146 2026-03-09 cs.RO cs.LG

Diverse and Adaptive Behavior Curriculum for Autonomous Driving: A Student-Teacher Framework with Multi-Agent RL

Ahmed Abouelazm, Johannes Ratz, Philip Schörner, J. Marius Zöllner

Comments First and Second authors contributed equally; Paper accepted in IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2025)

2507.18923 2026-03-09 cs.CV

Gaussian Set Surface Reconstruction through Per-Gaussian Optimization

Zhentao Huang, Di Wu, Zhenbang He, Minglun Gong

2507.15335 2026-03-09 cs.CV cs.AI

ExDD: Explicit Dual Distribution Learning for Surface Defect Detection via Diffusion Synthesis

Muhammad Aqeel, Federico Leonardi, Francesco Setti

Comments Accepted to ICIAP 2025

2507.11279 2026-03-09 cs.CV

Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping

Yujie Zhang, Sabine Struckmeyer, Andreas Kolb, Sven Reichardt

2507.11245 2026-03-09 cs.CV

NarrLV: Towards a Comprehensive Narrative-Centric Evaluation for Long Video Generation

X. Feng, H. Yu, M. Wu, S. Hu, J. Chen, C. Zhu, J. Wu, X. Chu, K. Huang

Comments Project Page: https://amap-ml.github.io/NarrLV-Website/

2507.09095 2026-03-09 cs.LG

Temporal Misalignment Attacks against Multimodal Perception in Autonomous Driving

Md Hasan Shahriar, Md Mohaimin Al Barat, Harshavardhan Sundar, Ning Zhang, Naren Ramakrishnan, Y. Thomas Hou, Wenjing Lou

Comments 19 pages, 18 Figures

2507.06543 2026-03-09 cs.CV

Token Bottleneck: One Token to Remember Dynamics

Taekyung Kim, Dongyoon Han, Byeongho Heo, Jeongeun Park, Sangdoo Yun

Comments NeurIPS 2025, 18 pages, 9 figures, 10 tables, project page: https://token-bottleneck.github.io, code: https://github.com/naver-ai/tobo

2507.06265 2026-03-09 cs.CV cs.AI

SPARC: Concept-Aligned Sparse Autoencoders for Cross-Model and Cross-Modal Interpretability

Ali Nasiri-Sarvi, Hassan Rivaz, Mahdi S. Hosseini

Comments Accepted at TMLR 2026

2506.13082 2026-03-09 cs.AI

Discerning What Matters: A Multi-Dimensional Assessment of Moral Competence in LLMs

Daniel Kilov, Caroline Hendy, Secil Yanik Guyot, Aaron J. Snoswell, Seth Lazar

Abstract

Moral competence is the ability to act in accordance with moral principles. As large language models (LLMs) are increasingly deployed in situations demanding moral competence, there is increasing interest in evaluating this ability empirically. We review existing literature and identify three significant shortcomings: (i) Over-reliance on prepackaged moral scenarios with explicitly highlighted moral features; (ii) Focus on verdict prediction rather than moral reasoning; and (iii) Inadequate testing of models' (in)ability to recognize when additional information is needed. Grounded in philosophical research on moral skill, we then introduce a novel method for assessing moral competence in LLMs. Our approach moves beyond simple verdict comparisons to evaluate five dimensions of moral competence: identifying morally relevant features, weighting their importance, assigning moral reasons to these features, synthesizing coherent moral judgments, and recognizing information gaps. We conduct two experiments comparing six leading LLMs against non-expert humans and professional philosophers. In our first experiment using ethical vignettes standard to existing work, LLMs generally outperformed non-expert humans across multiple dimensions of moral reasoning. However, our second experiment, featuring novel scenarios designed to test moral sensitivity by embedding relevant features among irrelevant details, revealed a striking reversal: several LLMs performed significantly worse than humans. Our findings suggest that current evaluations may substantially overestimate LLMs' moral reasoning capabilities by eliminating the task of discerning moral relevance from noisy information, which we take to be a prerequisite for genuine moral skill. This work provides a more nuanced framework for assessing AI moral competence and highlights important directions for improving moral competence in advanced AI systems.

2506.07658 2026-03-09 cs.CL

From Raw Corpora to Domain Benchmarks: Automated Evaluation of LLM Domain Expertise

Nitin Sharma, Thomas Wolfers, Çağatay Yıldız

Comments 36 pages, 24 figures. Third version

2506.06727 2026-03-09 cs.AI cs.CV

VisioMath: Benchmarking Figure-based Mathematical Reasoning in LMMs

Can Li, Ying Liu, Ting Zhang, Mei Wang, Hua Huang

Comments Accepted to ICLR 2026