arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.25267 2026-04-29 cs.RO cs.AI

Dynamic UGV-UAV Cooperative Path Planning in Uncertain Environments

Ninh Nguyen, Srinivas Akella

Comments Accepted to IEEE International Conference on Robotics and Automation (ICRA) 2026

详情

英文摘要

This paper addresses the Dynamic UGV-UAV Cooperative Path Planning (DUCPP) problem involving one unmanned ground vehicle (UGV) assisted by one or more unmanned aerial vehicles (UAVs) operating on an uncertain road network with potentially impassable edges. DUCPP is particularly relevant for scenarios such as disaster response, emergency supply transport, and rescue operations, where a UGV must reach a specified destination in the presence of partially unknown road conditions. To enable the UGV to travel safely and efficiently to its destination, the UAV(s) dynamically inspect edges in the environment to identify and prune damaged or impassable edges from consideration. We present multiple strategies, including a bidirectional approach, to optimize UGV-UAV cooperation for finding a safe path in an uncertain road network. Furthermore, we explore the impact of using multiple UAVs on reducing the UGV's travel time, and evaluate the associated computation time. The proposed strategies are implemented and evaluated on 100 urban road networks. The results demonstrate that the bidirectional strategy achieves the best performance in most instances, and using multiple UAVs further reduces UGV travel time at the expense of increased computation time. This paper presents a robust framework for DUCPP to achieve efficient UGV-UAV cooperation for path planning and inspection, offering practical solutions for navigation in challenging and uncertain conditions.

URL PDF HTML ☆

赞 0 踩 0

2604.25259 2026-04-29 cs.LG

DGLight: DQN-Guided GRPO Fine-Tuning of Large Language Models for Traffic Signal Control

Chenbo Yu

2604.25256 2026-04-29 cs.AI

AutoResearchBench: Benchmarking AI Agents on Complex Scientific Literature Discovery

Lei Xiong, Kun Luo, Ziyi Xia, Wenbo Zhang, Jin-Ge Yao, Zheng Liu, Jingying Shao, Jianlyu Chen, Hongjin Qian, Xi Yang, Qian Yu, Hao Li, Chen Yue, Xiaan Du, Yuyang Wang, Yesheng Liu, Haiyu Xu, Zhicheng Dou

2604.25255 2026-04-29 cs.CV

Personalized Cross-Modal Emotional Correlation Learning for Speech-Preserving Facial Expression Manipulation

Tianshui Chen, Yujie Zhu, Jianman Lin, Zhijing Yang, Chunmei Qing, Feng Gao, Liang Lin

2604.25249 2026-04-29 cs.CL cs.AI

Below-Chance Blindness: Prompted Underperformance in Small LLMs Produces Positional Bias Rather than Answer Avoidance

Jon-Paul Cacioli

Comments 10 pages, 2 figures, 2 tables. Pre-registered: https://osf.io/6zftv/

详情

英文摘要

Detecting sandbagging--the deliberate underperformance on capability evaluations--is an open problem in AI safety. We tested whether symptom validity testing (SVT) logic from clinical malingering detection could identify sandbagging through below-chance performance (BCB) on forced-choice items. In a pre-registered pilot at the 7-9 billion parameter instruction-tuned scale (3 models, 4 MMLU-Pro domains, 4 conditions, 500 items per cell, 24,000 total trials), the plausibility gate failed. Zero of 12 model-domain cells showed significant below-chance performance under sandbagging instruction. Exploratory analyses revealed three qualitatively distinct failure modes. Qwen-2.5-7B and Phi-3.5-mini largely ignored the sandbagging instruction, with 62-88% response identity with the honest baseline. Llama-3-8B complied substantially but implemented underperformance as a positional heuristic, collapsing its response distribution onto middle-alphabet options (E at 31.8%, F at 26.1%) regardless of where the correct answer fell. This produced accuracy boosts of up to 33 percentage points when the correct answer coincidentally occupied the model's preferred position. An explicit anti-task instruction ("pick the least likely answer") drove two of three models below chance, with accuracy as low as 0.024. The capability for answer-aware avoidance therefore exists but is not activated by "deliberately underperform." BCB did not fail as a logical marker of answer-aware avoidance. It was not observed in this regime because the model showing the largest behavioural shift exhibited behaviour consistent with a position-dominant response policy rather than content-aware answer avoidance. We propose that positional-distribution shift may be a more effective behavioural signature than below-chance accuracy for detecting prompted underperformance at this model scale.

URL PDF HTML ☆

赞 0 踩 0

2604.25241 2026-04-29 cs.LG

Categorical Optimization with Bayesian Anchored Latent Trust Regions for Structural Design under High-Dimensional Uncertainty

Zhangyong Liang, Huanhuan Gao

2604.25231 2026-04-29 cs.CV cs.AI cs.CL

DRAGON: A Benchmark for Evidence-Grounded Visual Reasoning over Diagrams

Anirudh Iyengar Kaniyar Narayana Iyengar, Tampu Ravi Kumar, Gaurav Najpande, Manan Suri, Dinesh Manocha, Puneet Mathur, Vivek Gupta

Comments 22 Pages, 14 Figures

2604.25224 2026-04-29 cs.AI q-fin.CP

ValueAlpha: Agreement-Gated Stress Testing of LLM-Judged Investment Rationales Before Returns Are Observable

Sidi Chang, Peiying Zhu, Yuxiao Chen

Comments 9 pages, Submitted to IEEE Computational Intelligence in Financial Engineering and Economics (CIFEr) 2026, Tokyo, Japan

2604.25220 2026-04-29 cs.AI

DATAREEL: Automated Data-Driven Video Story Generation with Animations

Ridwan Mahbub, Syem Aziz, Mahir Ahmed, Shadikur Rahman, Mizanur Rahman, Shafiq Joty, Enamul Hoque

Comments Under Review

2604.25213 2026-04-29 cs.CV

When the Forger Is the Judge: GPT-Image-2 Cannot Recognize Its Own Faked Documents

Jiaqi Wu, Yuchen Zhou, Dennis Tsang Ng, Xingyu Shen, Kidus Zewde, Ankit Raj, Tommy Duong, Simiao Ren

2604.25208 2026-04-29 cs.CV astro-ph.IM

Towards Seamless Lunar Mosaics: Deep Radiometric Normalization for Cross-Sensor Orbital Imagery Using Chandrayaan-2 TMC Data

Pratincha Singh, Jai Gopal Singla, Prashant Hemrajani, Nitant Dube, Amithabh, Hinal Patel

2604.25207 2026-04-29 cs.SD

Huí Sù: Co-constructing a Dual Feedback Apparatus

Yichen Wang, Charles Patrick Martin

Comments Accepted for publication at the International Conference on New Interfaces for Musical Expression (NIME) 2026 (music track)

2604.25203 2026-04-29 cs.CL cs.AI cs.LG

BARRED: Synthetic Training of Custom Policy Guardrails via Asymmetric Debate

Arnon Mazza, Elad Levi

2604.25196 2026-04-29 cs.LG

Knowledge-Data Dually Driven Paradigm for Accurate Landslide Susceptibility Prediction under Data-Scarce Conditions Using Geomorphic Priors and Tabular Foundation Model

Yuting Yang, Gang Mei, Feng Chen, Yongshuang Zhang, Jianbing Peng

2604.25188 2026-04-29 cs.CV

Image Classification via Random Dilated Convolution with Multi-Branch Feature Extraction and Context Excitation

Wentao Jiang, Yuanchan Xu, Heng Yuan

详情

英文摘要

Image classification remains a fundamental yet challenging task in computer vision, particularly when fine-grained feature extraction and background noise suppression are required simultaneously. Conventional convolutional neural networks, despite their remarkable success in hierarchical feature learning, often struggle with capturing multi-scale contextual information and are susceptible to overfitting when confronted with noisy or irrelevant image regions. In this paper, we propose RDCNet (Image Classification Network with Random Dilated Convolution), a novel architecture built upon ResNet-34 that integrates three synergistic innovations to address these limitations: (1) a Multi-Branch Random Dilated Convolution (MRDC) module that employs parallel branches with varying dilation rates combined with a stochastic masking mechanism to capture fine-grained features across multiple scales while enhancing robustness against noise and overfitting; (2) a Fine-Grained Feature Enhancement (FGFE) module embedded within MRDC that bridges global contextual information with local feature representations through adaptive pooling and bilinear interpolation, thereby amplifying sensitivity to subtle visual patterns; and (3) a Context Excitation (CE) module that leverages softmax-based spatial attention and channel recalibration to dynamically emphasize task-relevant features while suppressing background interference. Extensive experiments conducted on five benchmark datasets -- CIFAR-10, CIFAR-100, SVHN, Imagenette, and Imagewoof -- demonstrate that RDCNet consistently achieves state-of-the-art classification accuracy, outperforming the second-best competing methods by margins of 0.02\%, 1.12\%, 0.18\%, 4.73\%, and 3.56\%, respectively, thereby validating the effectiveness and generalizability of the proposed approach across diverse visual recognition scenarios.

URL PDF HTML ☆

赞 0 踩 0

2604.25181 2026-04-29 cs.LG

Shearlet Neural Operators for Anisotropic-Shock-Dominated and Multi-scale parametric partial differential equations

Fabio Pereira dos Santos, Julio de Castro Vargas Fernandes, Adriano Mauricio de Almeida Cortes

2604.25178 2026-04-29 cs.CV

Lightweight Real-Time Rendering Parameter Optimization via XGBoost-Driven Lookup Tables

Baijun Tan, Francesco Moretti

详情

英文摘要

Achieving a desirable balance between rendering quality and real-time performance is a long-standing challenge in modern game and rendering engines, particularly on resource-constrained mobile devices such as laptops, tablets, and smartphones. Existing approaches to automatic rendering parameter optimization either depend on exhaustive per-scene pre-computation that spans several days, suffer from the prohibitive inference overhead of neural networks that prevents per-frame adaptation, or lack generalizability across heterogeneous hardware and diverse scenes. In this paper, we propose \textbf{LUT-Opt}, a lightweight, general-purpose framework for adaptive per-frame rendering parameter optimization. Our method decomposes the joint optimization of rendering time and image quality into a tractable two-stage pipeline. In the offline stage, we train a pair of XGBoost regressors to predict rendering time and image quality from rendering parameters, hardware state, and scene complexity descriptors. The trained ensemble models are then distilled into compact lookup tables (LUTs) through systematic discretization and a two-phase linear search that first constrains rendering time and subsequently maximizes structural similarity (SSIM). During runtime, the pre-computed LUT is queried every frame in sub-millisecond time, enabling truly adaptive parameter selection with negligible computational overhead. We validate LUT-Opt on two representative rendering techniques -- subsurface scattering (SSS) and hybrid-pipeline ambient occlusion (AO) -- implemented within Unreal Engine 5. Extensive experiments across multiple scenes and GPU configurations demonstrate that LUT-Opt reduces subsurface scattering rendering time by approximately 40\% and ambient occlusion rendering time by roughly 70\%, while incurring only about 2\% increase in image quality error, with per-frame inference latency below 0.1\ ms.

URL PDF HTML ☆

赞 0 踩 0

2604.25176 2026-04-29 cs.CV cs.LG

Benchmarking OCR Pipelines with Adaptive Enhancement for Multi-Domain Retail Bill Digitization

Vijaysinh Gaikwad

2604.25167 2026-04-29 cs.AI

From Insight to Action: A Novel Framework for Interpretability-Guided Data Selection in Large Language Models

Ling Shi, Xinwei Wu, Xiaohu Zhao, Hao Wang, Heng Liu, Yangyang Liu, Linlong Xu, Longyue Wang, Deyi Xiong, Weihua Luo

2604.25166 2026-04-29 cs.AI

Training Transformers as a Universal Computer

Ruize Xu, Chenxiao Yang, Yanhong Li, David McAllester

Comments 20 pages, 9 figures

2604.25164 2026-04-29 cs.CV

IAM: Identity-Aware Human Motion and Shape Joint Generation

Wenqi Jia, Zekun Li, Abhay Mittal, Chengcheng Tang, Chuan Guo, Lezi Wang, James Matthew Rehg, Lingling Tao, Size An

2604.25159 2026-04-29 cs.LG

Accurate and Robust Generative Approach for Overcoming Data Sparsity and Imbalance in Landslide Modeling with A Tabular Foundation Model

Kaixuan Shao, Gang Mei, Yinghan Wu, Nengxiong Xu, Jianbing Peng

2604.25154 2026-04-29 cs.LG cs.DB

Prior-Aligned Data Cleaning for Tabular Foundation Models

Laure Berti-Equille

Comments 15 pages, 8 figures

2604.25143 2026-04-29 cs.LG cs.AI

Gradient-Direction Sensitivity Reveals Linear-Centroid Coupling Hidden by Optimizer Trajectories

Yongzhong Xu

Comments 15 pages, 5 figures

2604.25136 2026-04-29 cs.CL cs.AI cs.LG

Frictive Policy Optimization for LLMs: Epistemic Intervention, Risk-Sensitive Control, and Reflective Alignment

James Pustejovsky, Nikhil Krishnaswamy

Comments Frictive Policy Optimization; epistemic alignment; risk-sensitive control; LLM alignment; clarification and refusal; preference learning; trust regions; dialogue agents

2604.25135 2026-04-29 cs.CL

FAMA: Failure-Aware Meta-Agentic Framework for Open-Source LLMs in Interactive Tool Use Environments

Amir Saeidi, Venkatesh Mishra, Souradeep Mukhopadhyay, Gaowen Liu, Ali Payani, Jayanth Srinivasa, Chitta Baral

Comments Accepted to ACL 2026 Findings

2604.25133 2026-04-29 cs.CL cs.SD eess.AS

Korean aegyo speech shows systematic F1 increase to signal childlike qualities

Ji-eun Kim, Volker Dellwo

Comments 18 pages, 2 figures, under review

2604.25132 2026-04-29 cs.CL

What Makes Good Instruction-Tuning Data? An In-Context Learning Perspective

Guangzeng Han, Xiaolei Huang

Comments ACL 2026, main conference

2604.25130 2026-04-29 cs.CL

LongSumEval: Question-Answering Based Evaluation and Feedback-Driven Refinement for Long Document Summarization

Huyen Nguyen, Haoxuan Zhang, Yang Zhang, Haihua Chen, Junhua Ding

Comments 13 pages, 3 figures

2604.25128 2026-04-29 cs.CV

ResetEdit: Precise Text-guided Editing of Generated Image via Resettable Starting Latent

Hanyi Wang, Han Fang, Zheng Wang, Shilin Wang, Ee-Chien Chang