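arXivDaily daily academic digest: synced with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and more.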

2604.01651 2026-04-03 cs.LG

Label Shift Estimation With Incremental Prior Update

Yunrui Zhang, Gustavo Batista, Salil S. Kanhere

Comments SIAM SDM 2025

DOI: 10.1137/1.9781611978520.12
Journal ref: Proceedings of the 2025 SIAM International Conference on Data Mining (SDM) Pages 134 - 142

Abstract

An assumption often made in supervised learning is that the training and testing sets have the same label distribution. However, in real-life scenarios, this assumption rarely holds. For example, medical diagnosis result distributions change over time and across locations; fraud detection models must adapt as patterns of fraudulent activity shift; the category distribution of social media posts changes based on trending topics and user demographics. In the task of label shift estimation, the goal is to estimate the changing label distribution $p_t(y)$ in the testing set, assuming the likelihood $p(x|y)$ does not change, implying no concept drift. In this paper, we propose a new approach for post-hoc label shift estimation, unlike previous methods that perform moment matching with a confusion matrix estimated from a validation set or maximize the likelihood of the new data with an expectation-maximization algorithm. We aim to incrementally update the prior on each sample, adjusting each posterior for more accurate label shift estimation. The proposed method is based on intuitive assumptions about classifiers that are generally true for modern probabilistic classifiers, and it relies on a weaker notion of calibration than other methods. As a post-hoc approach for label shift estimation, the proposed method is versatile and can be applied to any black-box probabilistic classifier. Experiments on CIFAR-10 and MNIST show that the proposed method consistently outperforms the current state-of-the-art maximum likelihood-based methods under different calibrations and varying intensities of label shift.
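
For context, the expectation-maximization baseline that the abstract contrasts against can be sketched as follows. This is a minimal illustration of the classical EM prior re-estimation, not the paper's proposed incremental update, whose details the abstract does not give. The function name `em_label_shift` and the toy posteriors are assumptions made for this example.

```python
def em_label_shift(posteriors, source_prior, n_iter=100, tol=1e-8):
    """Estimate the test-set label prior from black-box classifier posteriors.

    posteriors:   list of per-sample probability vectors p_s(y|x) produced by a
                  classifier trained under the source prior.
    source_prior: training label distribution p_s(y).
    """
    k = len(source_prior)
    target_prior = list(source_prior)  # start from the training prior
    for _ in range(n_iter):
        sums = [0.0] * k
        for p in posteriors:
            # E-step: re-weight each posterior by the ratio p_t(y) / p_s(y)
            w = [p[c] * target_prior[c] / source_prior[c] for c in range(k)]
            z = sum(w)
            for c in range(k):
                sums[c] += w[c] / z
        # M-step: the new prior is the mean of the adjusted posteriors
        new_prior = [s / len(posteriors) for s in sums]
        delta = max(abs(a - b) for a, b in zip(new_prior, target_prior))
        target_prior = new_prior
        if delta < tol:
            break
    return target_prior


# Toy data (hypothetical): 8 samples that look like class 0, 2 like class 1.
toy_posteriors = [[0.9, 0.1]] * 8 + [[0.1, 0.9]] * 2
estimate = em_label_shift(toy_posteriors, source_prior=[0.5, 0.5])
```

On this toy input the estimate moves away from the uniform training prior toward class 0, which is the behavior label shift estimation is meant to capture. The quality of the estimate depends on how well calibrated the posteriors are.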

2604.01647 2026-04-03 cs.AI

Exploring Robust Multi-Agent Workflows for Environmental Data Management

Boyuan Guan, Jason Liu, Yanzhao Wu, Kiavash Bahreini

Comments Accepted at PEARC 2026. 12 pages, 4 figures

2604.01641 2026-04-03 cs.CV

LivingWorld: Interactive 4D World Generation with Environmental Dynamics

Hyeongju Mun, In-Hwan Jin, Sohyeong Kim, Kyeongbo Kong

2604.01639 2026-04-03 cs.CL

Fragile Reasoning: A Mechanistic Analysis of LLM Sensitivity to Meaning-Preserving Perturbations

Shou-Tzu Han, Rodrigue Rizk, KC Santosh

Comments Preprint. Under review at COLM 2026

2604.01634 2026-04-03 cs.LG cs.CL

CRIT: Graph-Based Automatic Data Synthesis to Enhance Cross-Modal Multi-Hop Reasoning

Junyoung Sung, Seungwoo Lyu, Minjun Kim, Sumin An, Arsha Nagrani, Paul Hongsuck Seo

Comments Accepted to CVPR 2026

2604.01630 2026-04-03 cs.CL

Grounding AI-in-Education Development in Teachers' Voices: Findings from a National Survey in Indonesia

Nurul Aisyah, Muhammad Dehan Al Kautsar, Arif Hidayat, Fajri Koto

2604.01622 2026-04-03 cs.LG cs.CL

Expert-Choice Routing Enables Adaptive Computation in Diffusion Language Models

Shuibai Zhang, Caspian Zhuang, Chihan Cui, Zhihan Yang, Fred Zhangzhi Peng, Yanxin Zhang, Haoyue Bai, Zack Jia, Yang Zhou, Guanhua Chen, Ming Liu

Comments 26 pages

2604.01618 2026-04-03 cs.CV cs.AI

Tex3D: Objects as Attack Surfaces via Adversarial 3D Textures for Vision-Language-Action Models

Jiawei Chen, Simin Huang, Jiawei Du, Shuaihang Chen, Yu Tian, Mingjie Wei, Chao Yu, Zhaoxia Yin

Abstract

Vision-language-action (VLA) models have shown strong performance in robotic manipulation, yet their robustness to physically realizable adversarial attacks remains underexplored. Existing studies reveal vulnerabilities through language perturbations and 2D visual attacks, but these attack surfaces are either less representative of real deployment or limited in physical realism. In contrast, adversarial 3D textures pose a more physically plausible and damaging threat, as they are naturally attached to manipulated objects and are easier to deploy in physical environments. Bringing adversarial 3D textures to VLA systems is nevertheless nontrivial. A central obstacle is that standard 3D simulators do not provide a differentiable optimization path from the VLA objective function back to object appearance, making end-to-end optimization difficult. To address this, we introduce Foreground-Background Decoupling (FBD), which enables differentiable texture optimization through dual-renderer alignment while preserving the original simulation environment. To further ensure that the attack remains effective across long horizons and diverse viewpoints in the physical world, we propose Trajectory-Aware Adversarial Optimization (TAAO), which prioritizes behaviorally critical frames and stabilizes optimization with a vertex-based parameterization. Built on these designs, we present Tex3D, the first framework for end-to-end optimization of 3D adversarial textures directly within the VLA simulation environment. Experiments in both simulation and real-robot settings show that Tex3D significantly degrades VLA performance across multiple manipulation tasks, achieving task failure rates of up to 96.7%. Our empirical results expose critical vulnerabilities of VLA systems to physically grounded 3D adversarial attacks and highlight the need for robustness-aware training.

2604.01615 2026-04-03 cs.AI cs.SE

Analysis of LLM Performance on AWS Bedrock: Receipt-item Categorisation Case Study

Gabby Sanchez, Sneha Oommen, Cassandra T. Britto, Di Wang, Jung-De Chiou, Maria Spichkova

Comments Preprint. Accepted to the 19th International Conference on Evaluation of Novel Approaches to Software Engineering (ENASE 2026). Final version to be published by SCITEPRESS, http://www.scitepress.org

2604.01613 2026-04-03 cs.LG

Pseudo-Quantized Actor-Critic Algorithm for Robustness to Noisy Temporal Difference Error

Taisuke Kobayashi

Comments 38 pages, 12 figures

2604.01612 2026-04-03 cs.CV cs.AI

NEMESIS: Noise-suppressed Efficient MAE with Enhanced Superpatch Integration Strategy

Kyeonghun Kim, Hyeonseok Jung, Youngung Han, Hyunsu Go, Eunseob Choi, Seongbin Park, Junsu Lim, Jiwon Yang, Sumin Lee, Insung Hwang, Ken Ying-Kai Liao, Nam-Joon Kim

Comments 5 pages, 5 figures, 5 tables

2604.01610 2026-04-03 cs.AI

GraphWalk: Enabling Reasoning in Large Language Models through Tool-Based Graph Navigation

Taraneh Ghandi, Hamidreza Mahyar, Shachar Klaiman

Comments 22 pages, 3 figures

Abstract

The use of knowledge graphs for grounding agents in real-world Q&A applications has become increasingly common. Answering complex queries often requires multi-hop reasoning and the ability to navigate vast relational structures. Standard approaches rely on prompting techniques that steer large language models to reason over raw graph context, or retrieval-augmented generation pipelines where relevant subgraphs are injected into the context. These, however, face severe limitations with enterprise-scale KGs that cannot fit in even the largest context windows available today. We present GraphWalk, a problem-agnostic, training-free, tool-based framework that allows off-the-shelf LLMs to reason through sequential graph navigation, dramatically increasing performance across different tasks. Unlike task-specific agent frameworks that encode domain knowledge into specialized tools, GraphWalk equips the LLM with a minimal set of orthogonal graph operations sufficient to traverse any graph structure. We evaluate whether models equipped with GraphWalk can compose these operations into correct multi-step reasoning chains, where each tool call represents a verifiable step creating a transparent execution trace. We first demonstrate our approach on maze traversal, a problem non-reasoning models are completely unable to solve, then present results on graphs resembling real-world enterprise knowledge graphs. To isolate structural reasoning from world knowledge, we evaluate on entirely synthetic graphs with random, non-semantic labels. Our benchmark spans 12 query templates from basic retrieval to compound first-order logic queries. Results show that tool-based traversal yields substantial and consistent gains over in-context baselines across all model families tested, with gains becoming more pronounced as scale increases, precisely where in-context approaches fail catastrophically.
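
To make the idea of tool-based graph navigation concrete, here is a minimal sketch. The abstract does not list GraphWalk's actual operations, so the class `GraphTools`, the tool names `list_relations` and `neighbors`, and the scripted `two_hop` caller (standing in for the LLM) are all hypothetical. The graph uses non-semantic labels, as in the synthetic benchmark the abstract describes.

```python
from dataclasses import dataclass, field


@dataclass
class GraphTools:
    """Hypothetical minimal tool set; not GraphWalk's real API."""
    edges: list  # (source, relation, target) triples
    trace: list = field(default_factory=list)  # one entry per tool call

    def list_relations(self, node):
        """Tool: relation types on edges leaving `node`."""
        result = sorted({r for s, r, _ in self.edges if s == node})
        self.trace.append(("list_relations", node, result))
        return result

    def neighbors(self, node, relation):
        """Tool: nodes reachable from `node` via `relation`."""
        result = sorted(t for s, r, t in self.edges if s == node and r == relation)
        self.trace.append(("neighbors", (node, relation), result))
        return result


def two_hop(tools, start, rel1, rel2):
    """Scripted stand-in for the model: compose tool calls into a 2-hop query."""
    answers = set()
    for mid in tools.neighbors(start, rel1):
        answers.update(tools.neighbors(mid, rel2))
    return sorted(answers)


# Synthetic graph with random-looking, non-semantic labels.
edges = [
    ("n1", "r_a", "n2"), ("n1", "r_a", "n3"),
    ("n2", "r_b", "n4"), ("n3", "r_b", "n5"), ("n3", "r_c", "n6"),
]
tools = GraphTools(edges)
result = two_hop(tools, "n1", "r_a", "r_b")
```

Each call only ever returns a small local neighborhood, so the full graph never has to fit in a context window, and `tools.trace` records every step for later verification.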

2604.01605 2026-04-03 cs.CV cs.RO

F3DGS: Federated 3D Gaussian Splatting for Decentralized Multi-Agent World Modeling

Morui Zhu, Mohammad Dehghani Tezerjani, Mátyás Szántó, Márton Vaitkus, Song Fu, Qing Yang

Comments Accepted to the CVPR 2026 SPAR-3D Workshop

2604.01603 2026-04-03 cs.CV

Towards Minimal Focal Stack in Shape from Focus

Khurram Ashfaq, Muhammad Tariq Mahmood

Comments Accepted to CVPRW 2026 (3DMV)

2604.01601 2026-04-03 cs.LG

Training In-Context and In-Weights Mixtures Via Contrastive Context Sampling

Deeptanshu Malu, Deevyanshu Malu, Aditya Nemiwal, Sunita Sarawagi

2604.01600 2026-04-03 cs.AI

MM-ReCoder: Advancing Chart-to-Code Generation with Reinforcement Learning and Self-Correction

Zitian Tang, Xu Zhang, Jianbo Yuan, Yang Zou, Varad Gunjal, Songyao Jiang, Davide Modolo

Comments CVPR 2026

2604.01599 2026-04-03 cs.AI

ByteRover: Agent-Native Memory Through LLM-Curated Hierarchical Context

Andy Nguyen, Danh Doan, Hoang Pham, Bao Ha, Dat Pham, Linh Nguyen, Hieu Nguyen, Thien Nguyen, Cuong Do, Phat Nguyen, Toan Nguyen

Comments 19 pages, 3 figures, 7 tables

2604.01598 2026-04-03 cs.CV

Riemannian and Symplectic Geometry for Hierarchical Text-Driven Place Recognition

Tianyi Shang, Zhenyu Li

Comments 9 pages

2604.01597 2026-04-03 cs.LG

Learning from the Right Rollouts: Data Attribution for PPO-based LLM Post-Training

Dong Shu, Denghui Zhang, Jessica Hullman

2604.01595 2026-04-03 cs.LG

Optimizing EEG Graph Structure for Seizure Detection: An Information Bottleneck and Self-Supervised Learning Approach

Lincan Li, Rikuto Kotoge, Xihao Piao, Zheng Chen, Yushun Dong

Comments Accepted by IEEE 14th International Conference on Healthcare Informatics (ICHI)

Abstract

Seizure detection from EEG signals is highly challenging due to complex spatiotemporal dynamics and extreme inter-patient variability. To model them, recent methods construct dynamic graphs via statistical correlations, predefined similarity measures, or implicit learning, yet rarely account for EEG's noisy nature. Consequently, these graphs usually contain redundant or task-irrelevant connections, undermining model performance even with state-of-the-art architectures. In this paper, we present a new perspective for EEG seizure detection: jointly learning denoised dynamic graph structures and informative spatial-temporal representations guided by the Information Bottleneck (IB). Unlike prior approaches, our graph constructor explicitly accounts for the noisy characteristics of EEG data, producing compact and reliable connectivity patterns that better support downstream seizure detection. To further enhance representation learning, we employ a self-supervised Graph Masked AutoEncoder that reconstructs masked EEG signals based on dynamic graph context, promoting structure-aware and compact representations aligned with the IB principle. Bringing things together, we introduce Information Bottleneck-guided EEG SeizuRE DetectioN via SElf-Supervised Learning (IRENE), which explicitly learns dynamic graph structures and interpretable spatial-temporal EEG representations. IRENE addresses three core challenges: (i) Identifying the most informative nodes and edges; (ii) Explaining seizure propagation in the brain network; and (iii) Enhancing robustness against label scarcity and inter-patient variability. Extensive experiments on benchmark EEG datasets demonstrate that our method outperforms state-of-the-art baselines in seizure detection and provides clinically meaningful insights into seizure dynamics. The source code is available at https://github.com/LabRAI/IRENE.
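
For readers unfamiliar with the Information Bottleneck, its classical objective is shown below. This is the generic formulation, not IRENE's actual loss, which the abstract does not state. Here $X$ is the input (for example, EEG signals and graph structure), $Y$ the label, $Z$ the learned representation, and $\beta > 0$ a trade-off weight; mapping these symbols onto IRENE's components is an assumption.

```latex
\min_{p(z \mid x)} \; I(X; Z) \;-\; \beta \, I(Z; Y)
```

The first term penalizes information retained about the input, which encourages compact representations, while the second rewards information that is predictive of the label.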

2604.01594 2026-04-03 cs.AI

Do Large Language Models Mentalize When They Teach?

Sevan K. Harootonian, Mark K. Ho, Thomas L. Griffiths, Yael Niv, Ilia Sucholutsky

Comments 9 pages, 5 figures. Workshop paper at ICML 2026

2604.01589 2026-04-03 cs.CV

Mitigating the ID-OOD Tradeoff in Open-Set Test-Time Adaptation

Wenjie Zhao, Jia Li, Xin Dong, Yapeng Tian, Yu Xiang, Yunhui Guo

2604.01588 2026-04-03 cs.AI

NED-Tree: Bridging the Semantic Gap with Nonlinear Element Decomposition Tree for LLM Nonlinear Optimization Modeling

Zhijing Hu, Yufan Deng, Haoyang Liu, Changjun Fan

Comments 17 pages, 7 figures, conference

2604.01587 2026-04-03 cs.LG

Variational LSTM with Augmented Inputs: Nonlinear Response History Metamodeling with Aleatoric and Epistemic Uncertainty

Manisha Sapkota, Min Li, Bowei Li

Comments 22 pages, 10 figures

2604.01586 2026-04-03 cs.CV cs.AI

SHOE: Semantic HOI Open-Vocabulary Evaluation Metric

Maja Noack, Qinqian Lei, Taipeng Tian, Bihan Dong, Robby T. Tan, Yixin Chen, John Young, Saijun Zhang, Bo Wang

Comments Accepted to GRAIL-V Workshop at CVPR 2026

2604.01579 2026-04-03 cs.CV cs.AI

Harmonized Tabular-Image Fusion via Gradient-Aligned Alternating Learning

Longfei Huang, Yang Yang

Comments ICME 26

2604.01576 2026-04-03 cs.LG

Care-Conditioned Neuromodulation for Autonomy-Preserving Supportive Dialogue Agents

Shalima Binta Manir, Tim Oates

2604.01570 2026-04-03 cs.RO

Boosting Vision-Language-Action Finetuning with Feasible Action Neighborhood Prior

Haochen Niu, Kanyu Zhang, Shuyu Yin, Qinghai Guo, Peilin Liu, Fei Wen

Comments Accepted by CVPR 2026

2604.01569 2026-04-03 cs.CV cs.MM

VideoZeroBench: Probing the Limits of Video MLLMs with Spatio-Temporal Evidence Verification

Jiahao Meng, Tan Yue, Qi Xu, Haochen Wang, Zhongwei Ren, Weisong Liu, Yuhao Wang, Renrui Zhang, Yunhai Tong, Haodong Duan

2604.01567 2026-04-03 cs.RO

AnchorVLA: Anchored Diffusion for Efficient End-to-End Mobile Manipulation

Jia Syuen Lim, Zhizhen Zhang, Peter Bohm, Brendan Tidd, Zi Huang, Yadan Luo