arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.27404 2026-03-31 cs.AI cs.CL cs.CY cs.HC cs.MA

Heterogeneous Debate Engine: Identity-Grounded Cognitive Architecture for Resilient LLM-Based Ethical Tutoring

Jakub Masłowski, Jarosław A. Chudziak

Comments 15 pages, 3 figures, 4 tables. Accepted at ACIIDS 2026

详情

英文摘要

Large Language Models (LLMs) are being increasingly used as autonomous agents in complex reasoning tasks, opening the niche for dialectical interactions. However, Multi-Agent systems implemented with systematically unconstrained systems systematically undergo semantic drift and logical deterioration and thus can hardly be used in providing ethical tutoring where a precise answer is required. Current simulation often tends to degenerate into dialectical stagnation, the agents degenerate into recursive concurrence or circular arguments. A critical challenge remains: how to enforce doctrinal fidelity without suppressing the generative flexibility required for dialectical reasoning? To address this niche, we contribute the Heterogeneous Debate Engine (HDE), a cognitive architecture that combines Identity-Grounded Retrieval-Augmented Generation (ID-RAG) for doctrinal fidelity and Heuristic Theory of Mind for strategic opponent modeling. Our evaluation shows that architectural heterogeneity is a crucial variable to stability: contrary doctrinal initializations (e.g., Deontology vs. Utilitarianism) have increased the Argument Complexity Scores of students by an order of magnitude, over baselines. These findings validate the effectiveness of ID-RAG and Heuristic ToM as architectural requirements in maintaining high-fidelity (adversarial) pedagogy.

URL PDF HTML ☆

赞 0 踩 0

2603.27403 2026-03-31 cs.LG cs.AI

Conditional Factuality Controlled LLMs with Generalization Certificates via Conformal Sampling

Kai Ye, Qingtao Pan, Shuo Li

Comments CVPR 2026

2603.27400 2026-03-31 cs.RO cs.LG

Rainbow-DemoRL: Combining Improvements in Demonstration-Augmented Reinforcement Learning

Dwait Bhatt, Shih-Chieh Chou, Nikolay Atanasov

Comments Accepted to ICRA 2026

2603.27385 2026-03-31 cs.LG

Active In-Context Learning for Tabular Foundation Models

Wilailuck Treerath, Fabrizio Pittorino

Comments 8 pages, 4 figures, 6 tables

2603.27375 2026-03-31 cs.CV

Bridging Visual Representation and Reinforcement Learning from Verifiable Rewards in Large Vision-Language Models

Yuhang Han, Yuyang Wu, Zhengbo Jiao, Yiyu Wang, Xuyang Liu, Shaobo Wang, Hanlin Xu, Xuming Hu, Linfeng Zhang

Comments Homepage: \url{https://kawhiiiileo.github.io/KAWHI_PAGE/}

2603.27371 2026-03-31 cs.CV

HMPDM: A Diffusion Model for Driving Video Prediction with Historical Motion Priors

Ke Li, Tianjia Yang, Kaidi Liang, Xianbiao Hu, Ruwen Qin

2603.27365 2026-03-31 cs.CV

Falcon Perception

Aviraj Bevli, Sofian Chaybouti, Yasser Dahou, Hakim Hacid, Ngoc Dung Huynh, Phuc H. Le Khac, Sanath Narayan, Wamiq Reyaz Para, Ankit Singh

2603.27361 2026-03-31 cs.RO

Online Inertia Tensor Identification for Non-Cooperative Spacecraft via Augmented UKF

Batu Candan, Simone Servadio

2603.27360 2026-03-31 cs.AI

Defend: Automated Rebuttals for Peer Review with Minimal Author Guidance

Jyotsana Khatri, Manasi Patwardhan

2603.27356 2026-03-31 cs.CL cs.AI cs.CY

Culturally Adaptive Explainable LLM Assessment for Multilingual Information Disorder: A Human-in-the-Loop Approach

Maziar Kianimoghadam Jouneghani

Comments 9 pages, 3 figures, 1 table. Accepted to the Information Disorder Workshop at LREC 2026

2603.27349 2026-03-31 cs.CV cs.CL

Inference-Time Structural Reasoning for Compositional Vision-Language Understanding

Amartya Bhattacharya

2603.27348 2026-03-31 cs.LG

Embedding Provenance in Computer Vision Datasets with JSON-LD

Lynn Vonderhaar, Timothy Elvira, Tyler Thomas Procko, Omar Ochoa

2603.27344 2026-03-31 cs.CV

TerraSeg: Self-Supervised Ground Segmentation for Any LiDAR

Ted Lentsch, Santiago Montiel-Marín, Holger Caesar, Dariu M. Gavrila

Comments CVPR 2026

2603.27343 2026-03-31 cs.AI

Beyond Completion: Probing Cumulative State Tracking to Predict LLM Agent Performance

Dengzhe Hou, Lingyu Jiang, Deng Li, Zirui Li, Fangzhou Lin, Kazunori D Yamada

2603.27340 2026-03-31 cs.CV

EVA: Bridging Performance and Human Alignment in Hard-Attention Vision Models for Image Classification

Pengcheng Pan, Yonekura Shogo, Kuniyoshi Yasuo

2603.27338 2026-03-31 cs.AI

CounterMoral: Editing Morals in Language Models

Michael Ripa, Jim Davies

Comments 7 pages (10 + 1 reference + 6 appendix). Honors thesis completed in June 2024, write-up completed in 2025

2603.27335 2026-03-31 cs.CL

PubMed Reasoner: Dynamic Reasoning-based Retrieval for Evidence-Grounded Biomedical Question Answering

Yiqing Zhang, Xiaozhong Liu, Fabricio Murai

Comments 20 pages; under review

2603.27332 2026-03-31 cs.CV

Unsafe by Reciprocity: How Generation-Understanding Coupling Undermines Safety in Unified Multimodal Models

Kaishen Wang, Heng Huang

Comments 7 figures, 3 tables

2603.27331 2026-03-31 cs.CL cs.MM

SACRED: A Faithful Annotated Multimedia Multimodal Multilingual Dataset for Classifying Connectedness Types in Online Spirituality

Qinghao Guan, Yuchen Pan, Donghao Li, Zishi Zhang, Yiyang Chen, Lu Li, Flaminia Canu, Emilia Volkart, Gerold Schneider

Comments Accepted by LLMs4SSH 2026 at LREC

2603.27325 2026-03-31 cs.CV cs.AI

Improving Automated Wound Assessment Using Joint Boundary Segmentation and Multi-Class Classification Models

Mehedi Hasan Tusar, Fateme Fayyazbakhsh, Igor Melnychuk, Ming C. Leu

详情

英文摘要

Accurate wound classification and boundary segmentation are essential for guiding clinical decisions in both chronic and acute wound management. However, most existing AI models are limited, focusing on a narrow set of wound types or performing only a single task (segmentation or classification), which reduces their clinical applicability. This study presents a deep learning model based on YOLOv11 that simultaneously performs wound boundary segmentation (WBS) and wound classification (WC) across five clinically relevant wound types: burn injury (BI), pressure injury (PI), diabetic foot ulcer (DFU), vascular ulcer (VU), and surgical wound (SW). A wound-type balanced dataset of 2,963 annotated images was created to train the models for both tasks, with stratified five-fold cross-validation ensuring robust and unbiased evaluation. The models trained on the original non-augmented dataset achieved consistent performance across folds, though BI detection accuracy was relatively lower. Therefore, the dataset was augmented using rotation, flipping, and variations in brightness, saturation, and exposure to help the model learn more generalized and invariant features. This augmentation significantly improved model performance, particularly in detecting visually subtle BI cases. Among tested variants, YOLOv11x achieved the highest performance with F1-scores of 0.9341 (WBS) and 0.8736 (WC), while the lightweight YOLOv11n provided comparable accuracy at lower computational cost, making it suitable for resource-constrained deployments. Supported by confusion matrices and visual detection outputs, the results confirm the model's robustness against complex backgrounds and high intra-class variability, demonstrating the potential of YOLOv11-based architectures for accurate, real-time wound analysis in both clinical and remote care settings.

URL PDF HTML ☆

赞 0 踩 0

2603.27321 2026-03-31 cs.LG cs.AI

Multimodal Forecasting for Commodity Prices Using Spectrogram-Based and Time Series Representations

Soyeon Park, Doohee Chung, Charmgil Hong

Comments AAAI 2026 Summer Symposium Series; 9 pages

2603.27314 2026-03-31 cs.AI cs.CV cs.SD

TokenDance: Token-to-Token Music-to-Dance Generation with Bidirectional Mamba

Ziyue Yang, Kaixing Yang, Xulong Tang

Comments CVPR2026 Workshop on HuMoGen

2603.27313 2026-03-31 cs.RO

MetaTune: Adjoint-based Meta-tuning via Robotic Differentiable Dynamics

Xiexin Peng, Bingheng Wang, Tao Zhang, Ying Zheng

2603.27304 2026-03-31 cs.AI cs.MA

EpochX: Building the Infrastructure for an Emergent Agent Civilization

Huacan Wang, Chaofa Yuan, Xialie Zhuang, Tu Hu, Shuo Zhang, Jun Han, Shi Wei, Daiqiang Li, Jingping Liu, Kunyi Wang, Zihan Yin, Zhenheng Tang, Andy Wang, Henry Peng Zou, Philip S. Yu, Sen Hu, Qizhen Lan, Ronghao Chen

2603.27303 2026-03-31 cs.AI cs.CL q-bio.QM

Self-evolving AI agents for protein discovery and directed evolution

Yang Tan, Lingrong Zhang, Mingchen Li, Yuanxi Yu, Bozitao Zhong, Bingxin Zhou, Nanqing Dong, Liang Hong

Comments 100 pages, 6 figures

2603.27301 2026-03-31 cs.CV

Dual-Path Learning based on Frequency Structural Decoupling and Regional-Aware Fusion for Low-Light Image Super-Resolution

Ji-Xuan He, Jia-Cheng Zhao, Feng-Qi Cui, Jinyang Huang, Yang Liu, Sirui Zhao, Meng Li, Zhi Liu

2603.27300 2026-03-31 cs.CV

Complet4R: Geometric Complete 4D Reconstruction

Weibang Wang, Kenan Li, Zhuoguang Chen, Yijun Yuan, Hang Zhao

2603.27299 2026-03-31 cs.LG

From Inference Routing to Agent Orchestration: Declarative Policy Compilation with Cross-Layer Verification

Huamin Chen, Xunzhuo Liu, Bowei He, Xue Liu

Comments Position Paper

2603.27294 2026-03-31 cs.CV

Class-Distribution Guided Active Learning for 3D Occupancy Prediction in Autonomous Driving

Wonjune Kim, In-Jae Lee, Sihwan Hwang, Sanmin Kim, Dongsuk Kum

Comments IEEE RA-L 2026

详情

DOI: 10.1109/LRA.2026.3678110
Journal ref: IEEE Robotics and Automation Letters (2026)

英文摘要

3D occupancy prediction provides dense spatial understanding critical for safe autonomous driving. However, this task suffers from a severe class imbalance due to its volumetric representation, where safety-critical objects (bicycles, traffic cones, pedestrians) occupy minimal voxels compared to dominant backgrounds. Additionally, voxel-level annotation is costly, yet dedicating effort to dominant classes is inefficient. To address these challenges, we propose a class-distribution guided active learning framework for selecting training samples to annotate in autonomous driving datasets. Our approach combines three complementary criteria to select the training samples. Inter-sample diversity prioritizes samples whose predicted class distributions differ from those of the labeled set, intra-set diversity prevents redundant sampling within each acquisition cycle, and frequency-weighted uncertainty emphasizes rare classes by reweighting voxel-level entropy with inverse per-sample class proportions. We ensure evaluation validity by using a geographically disjoint train/validation split of Occ3D-nuScenes, which reduces train-validation overlap and mitigates potential map memorization. With only 42.4% labeled data, our framework reaches 26.62 mIoU, comparable to full supervision and outperforming active learning baselines at the same budget. We further validate generality on SemanticKITTI using a different architecture, demonstrating consistent effectiveness across datasets.

URL PDF HTML ☆

赞 0 踩 0

2603.27287 2026-03-31 cs.RO cs.CV

Uni-World VLA: Interleaved World Modeling and Planning for Autonomous Driving

Qiqi Liu, Huan Xu, Jingyu Li, Bin Sun, Zhihui Hao, Dangen She, Xiatian Zhu, Li Zhang

Comments 22 pages, 8 figures. Submitted to ECCV 2026. Code will be released