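arXivDaily: a daily academic digest that syncs the full arXiv dataset and provides AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems science, and other fields.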

2605.05523 2026-05-08 stat.ML cs.LG stat.CO

Permutation-preserving Functions and Neural Vecchia Covariance Kernels

Jian Cao, Nian Liu, Ying Lin

Details

Abstract

We introduce a novel framework for constructing scalable and flexible covariance kernels for Gaussian processes (GPs) by directly learning the covariance structure under a regression-type parameterization induced by Vecchia approximations, using deep neural architectures. Specifically, we model kriging coefficients and conditional standard deviations, deterministic quantities that uniquely characterize the covariance, providing stable and informative learning targets. Exploiting the permutation-equivariant structure of conditioning sets in the Vecchia factorization, we derive a universal representation for permutation-preserving functions and design neural architectures that respect this symmetry, leading to improved training stability and data efficiency. The proposed approach enables expressive, non-stationary kernel learning while maintaining computational scalability, thereby bridging classical GP methodology with modern deep learning.
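
The learning targets named in this abstract, kriging coefficients and conditional standard deviations, can be illustrated with a minimal sketch. It assumes a known exponential kernel on 1-D locations, which is an illustrative choice and not the paper's setup; the paper learns these quantities with neural networks, whereas this sketch only computes them in closed form. The names `exp_kernel`, `solve`, and `vecchia_terms` are hypothetical.

```python
import math

def exp_kernel(s, t, length_scale=1.0):
    """Exponential covariance between two 1-D locations (illustrative assumption)."""
    return math.exp(-abs(s - t) / length_scale)

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - tail) / M[i][i]
    return x

def vecchia_terms(locs, i, cond_idx, kernel=exp_kernel):
    """Return (kriging coefficients, conditional sd) of y_i given y at cond_idx.

    coefficients = K_cc^{-1} k_ci
    variance     = k_ii - k_ci^T K_cc^{-1} k_ci
    """
    k_ii = kernel(locs[i], locs[i])
    if not cond_idx:
        return [], math.sqrt(k_ii)
    K_cc = [[kernel(locs[a], locs[b]) for b in cond_idx] for a in cond_idx]
    k_ci = [kernel(locs[a], locs[i]) for a in cond_idx]
    coeffs = solve(K_cc, k_ci)
    var = k_ii - sum(c * k for c, k in zip(coeffs, k_ci))
    return coeffs, math.sqrt(var)

# Reordering the conditioning set permutes the coefficients in the same way
# and leaves the conditional sd unchanged (permutation equivariance).
locs = [0.0, 0.5, 1.0, 1.5]
coeffs_a, sd_a = vecchia_terms(locs, 3, [1, 2])
coeffs_b, sd_b = vecchia_terms(locs, 3, [2, 1])
```

Under this kernel the farther neighbour receives a coefficient of essentially zero, because the exponential kernel in one dimension is Markov; other kernels would generally give non-zero weights to every neighbour.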

2605.05522 2026-05-08 eess.IV cs.CV

Tumor-aware augmentation with task-guided attention analysis improves rectal cancer segmentation from magnetic resonance images

Aneesh Rangnekar, Joao Miranda, Natally Horvat, Stephanie Chahwan, Samir Alrayess, Aditya Apte, Aditi Iyer, Eve LoCastro, Revathi Ravella, Marc J Gollub, Iva Petkovska, Jesse Joshua Smith, Paul Romesser, Julio Garcia-Aguilar, Harini Veeraraghavan, Joseph Deasy

Details

Abstract

Pretraining on large-scale datasets has been shown to improve transformer generalizability, even for out-of-domain (OOD) modalities and tasks. However, two common assumptions often fail under OOD transfer: that downstream datasets can be adapted to the fixed input geometry of pretrained models and that pretrained representations transfer effectively across imaging modalities. We show that these assumptions break down through two interacting failure modes in CT-to-MRI transfer: inefficient token usage caused by zero-padding to match pretrained input dimensions and ineffective feature adaptation. These failures led to accuracy degradation despite extensive fine-tuning. We investigated these failure modes using two CT-pretrained hierarchical shifted-window transformer backbones, SMIT and Swin UNETR, pretrained with different objectives and datasets. Mechanistic analysis introduced an attention dilution index (ADI), an entropy-based metric quantifying attention diverted toward uninformative padding tokens, and centered kernel alignment (CKA) to measure feature reuse in MRI tasks. ADI increased with zero-padding, while high feature reuse did not necessarily correspond to improved accuracy. To mitigate these issues, we introduced two interventions: a tumor-aware augmentation strategy to improve tumor appearance heterogeneity coverage and an anisotropic cropping strategy to restore token efficiency. Fine-tuning on identical rectal MRI datasets improved detection rates to 224/247 (90.7%) for SMIT and 219/247 (88.7%) for Swin UNETR, demonstrating improved robustness under CT-to-MRI transfer. This study is among the first to examine when pretrained transformers fail to transfer effectively across imaging modalities and how simple mitigation strategies, motivated by mechanistic analysis of datasets, can reduce transfer limitations while improving robustness and MRI detection.
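
The abstract uses centered kernel alignment (CKA) to measure feature reuse. A minimal sketch of the linear variant follows, assuming two feature matrices with one row per sample; the abstract does not say which CKA variant or which layers were compared, so this is an illustration only. The function names are hypothetical, and the attention dilution index is not sketched because its formula is not given here.

```python
import math

def center_columns(X):
    """Subtract each column's mean so every feature has zero mean."""
    n = len(X)
    means = [sum(row[j] for row in X) / n for j in range(len(X[0]))]
    return [[v - m for v, m in zip(row, means)] for row in X]

def cross_gram_sq_norm(X, Y):
    """Squared Frobenius norm of X^T Y for matrices with the same row count."""
    total = 0.0
    for j in range(len(X[0])):
        for k in range(len(Y[0])):
            s = sum(x[j] * y[k] for x, y in zip(X, Y))
            total += s * s
    return total

def linear_cka(X, Y):
    """Linear CKA: ||Yc^T Xc||_F^2 / (||Xc^T Xc||_F * ||Yc^T Yc||_F)."""
    Xc, Yc = center_columns(X), center_columns(Y)
    num = cross_gram_sq_norm(Xc, Yc)
    den = math.sqrt(cross_gram_sq_norm(Xc, Xc) * cross_gram_sq_norm(Yc, Yc))
    return num / den

# Toy features: 4 samples, 2 features each.
X = [[1.0, 2.0], [3.0, 1.0], [0.0, 4.0], [2.0, 2.0]]
# Y is X rotated by 90 degrees and scaled by 2; linear CKA ignores both changes.
Y = [[2.0 * b, -2.0 * a] for a, b in X]
similarity = linear_cka(X, Y)
```

Because the score is invariant to rotation and isotropic scaling of the features, a high value says the two representations share structure, not that they are equally useful for the task, which is consistent with the abstract's observation that high feature reuse did not necessarily mean better accuracy.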

2605.05514 2026-05-08 cs.IT cs.AI cs.LG cs.NI eess.SP math.IT

When Semantic Communication Meets Queueing: Cross-Layer Latency and Task Fidelity Optimization

Yalin E. Sagduyu, Tugba Erpek

2605.05493 2026-05-08 stat.ME cond-mat.stat-mech cs.LG math.ST stat.TH

A renormalization-group inspired lattice-based framework for piecewise generalized linear models

Joshua C. Chang

Comments: Under review

2605.05472 2026-05-08 cs.CY cs.AI

The Pedagogy of AI Mistakes: Fostering Higher-Order Thinking

Hadi Hosseini

Comments: Accepted to AIED-2026; includes supplementary material

2605.05459 2026-05-08 cs.CR cs.LG

Privacy Without Losing Place: A Paradigm for Private Retrieval in Spatial RAGs

Kennedy Edemacu, Mohammad Mahdi Shokri, Vinay M. Shashidhar, Jong Wook Kim

2605.05446 2026-05-08 stat.ML cs.IT cs.LG math.IT math.OC

Convexity in Disguise: A Theoretical Framework for Nonconvex Low-Rank Matrix Estimation

Chengyu Cui, Gongjun Xu

2605.05436 2026-05-08 stat.ML cs.LG

Estimating Implicit Regularization in Deep Learning

Joseph H. Rudoler, Kevin Tan, Giles Hooker, Konrad P. Kording

2605.05432 2026-05-08 math.ST cs.LG stat.ML stat.TH

Direct Estimation of Schrödinger Bridge Time-Series Drifts: Finite-Sample, Asymptotic, and Adaptive Guarantees

Othmane Mazhar, Huyên Pham

Comments: 36 pages, 3 figures, 8 tables

2605.05400 2026-05-08 cs.SE cs.AI cs.HC

Mise en Place for Agentic Coding: Deliberate Preparation as Context Engineering Methodology

Andrew Zigler

Comments: 5 pages. Accepted at VibeX 2026, the 1st International Workshop on Vibe Coding and Vibe Researching, co-located with EASE 2026, Glasgow, June 9-12 2026. Camera-ready version. Research artifact: https://doi.org/10.5281/zenodo.19868258

2605.05382 2026-05-08 math.OC cs.LG

Meta-learning for sample-efficient Bayesian optimisation of fed-batch processes

Becky Langdon, Gabriel D. Patrón, Chrysoula D. Kappatou, Robert M. Lee, Behrang Shafei, Jixiang Qing, Ruth Misener, Mark van der Wilk, Calvin Tsay

Comments: 24 pages, 12 figures

2605.05348 2026-05-08 cs.HC cs.AI

Making AI Drafts Count: A Quality Threshold in Audio Description Workflows

Lana Do, Shasta Ihorn, Charity M. Pitcher-Cooper, Sanjay Mirani, Gio Jung, Hyunjoo Shim, Zhenzhen Qin, Kien T. Nguyen, Vassilis Athitsos, Ilmi Yoon

2605.05287 2026-05-08 cs.CR cs.AI cs.IR cs.SE

Securing the Agent: Vendor-Neutral, Multitenant Enterprise Retrieval and Tool Use

Francisco Javier Arceo, Varsha Prasad Narsing

Comments: 11 pages, 2 figures, Published in ACM Conference on AI and Agentic Systems

Details

DOI: 10.1145/3786335.3813145
Journal ref: ACM Conference on AI and Agentic Systems (ACM CAIS '26), May 26-29, 2026, San Jose, CA, USA

Abstract

Retrieval-Augmented Generation (RAG) and agentic AI systems are increasingly prevalent in enterprise AI deployments. However, real enterprise environments introduce challenges largely absent from academic treatments and consumer-facing APIs: multiple tenants with heterogeneous data, strict access-control requirements, regulatory compliance, and cost pressures that demand shared infrastructure. A fundamental problem underlies existing RAG architectures in these settings: retrieval systems rank documents by relevance--whether through semantic similarity, keyword matching, or hybrid approaches--not by authorization, so a query from one tenant can surface another tenant's confidential data simply because it scores highest. We formalize this gap and analyze additional shortcomings--including tool-mediated disclosure, context accumulation across turns, and client-side orchestration bypass--that arise when agentic systems conflate relevance with authorization. To address these challenges, we introduce a layered isolation architecture combining policy-aware ingestion, retrieval-time gating, and shared inference, enforced through server-side agentic orchestration. This approach centralizes security-critical operations--tool execution authorization, state isolation, and policy enforcement--on the server, creating natural enforcement points for multitenant isolation while allowing client-side frameworks to retain control over agent composition and latency-sensitive operations. We validate the proposed architecture through an open-source implementation in OGX, a vendor-neutral framework that implements an OpenAI-compatible, open-source Responses API with server-side multi-turn orchestration. We evaluate it empirically and show that ABAC gating eliminates cross-tenant leakage while introducing negligible overhead.
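
The gap between relevance and authorization described in this abstract can be shown with a minimal sketch. It assumes a single `tenant` attribute per document and a precomputed relevance score; real ABAC policies would evaluate richer attributes. The `Doc` type and both functions are hypothetical and do not represent the OGX API.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Doc:
    doc_id: str
    tenant: str   # owning tenant (the only attribute checked in this sketch)
    score: float  # relevance score from any retriever (semantic, keyword, hybrid)

def ungated_top_k(docs, k):
    """Rank purely by relevance: authorization is never consulted."""
    return sorted(docs, key=lambda d: d.score, reverse=True)[:k]

def gated_top_k(docs, caller_tenant, k):
    """Apply the access check first, then rank only what the caller may see."""
    allowed = [d for d in docs if d.tenant == caller_tenant]
    return sorted(allowed, key=lambda d: d.score, reverse=True)[:k]

corpus = [
    Doc("acme-pricing", tenant="acme", score=0.71),
    Doc("globex-contract", tenant="globex", score=0.93),  # most relevant, wrong tenant
    Doc("acme-faq", tenant="acme", score=0.40),
]

leaky = ungated_top_k(corpus, k=2)                      # includes globex-contract
safe = gated_top_k(corpus, caller_tenant="acme", k=2)   # acme documents only
```

Filtering before ranking, on the server, is the design choice that matters here: a client that filters after retrieval has already received the other tenant's document.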

2605.05282 2026-05-08 cs.PL cs.CL

Beyond BLEU: A Semantic Evaluation Method for Code Translation

Julius Näumann, Sven Keidel, Amir Molzam Sharifloo, Mira Mezini

2605.05271 2026-05-08 cs.CR cs.AI

Shattering the Echo Chamber: Hidden Safeguards in Manuscripts Against the AI Takeover of Peer Review

Oubo Ma, Ruixiao Lin, Jiahao Chen, Yuan Su, Yong Yang, Shouling Ji

Comments: 22 pages, 14 figures, 11 tables

Details

Abstract

As LLMs become increasingly capable, editorial boards and program committees are growing concerned about reviewers who fully outsource peer review to commercial chatbots. This concern stems from prior findings that current chatbots lack the independent critical thinking and depth of reasoning required to assess scientific novelty. One promising direction for mitigating this concern is to embed hidden instructions into manuscripts that disrupt or alter chatbot-generated reviews. However, existing methods remain intuitive and fragile, as they typically rely on homogeneous payloads injected in an inter-stream manner, rendering them susceptible to sanitization or neutralization. In this paper, we identify End-to-End Review Outsourcing as an emerging threat and propose IntraGuard, a black-box, venue-agnostic defense framework grounded in the structural--visual decoupling inherent to the PDF. Designed for committee-side deployment, IntraGuard supports both explicit strategies that trigger refusal or warning signals, and implicit strategies that embed predefined textual markers into the generated review. These strategies can be deployed via any of three intra-stream injection mechanisms, each of which seamlessly embeds heterogeneous defensive text objects within the PDF's underlying structure without altering its visual presentation. Extensive evaluations across 7 real-world commercial chatbot settings and 12 venues spanning diverse disciplines show that IntraGuard achieves a defense success rate of up to 84%, while preserving peer-review invariance for human reviewers. IntraGuard is lightweight and hardware-independent, incurring an average overhead of only one second per manuscript on a commodity personal computer. We further evaluate 11 adaptive attacks spanning manuscript sanitization and instruction interference, and discuss the implications of constructing ensemble defenses.

2605.05270 2026-05-08 stat.ML cs.LG stat.AP

Forecasting Oncology Demand Trends with Boosting-Based Bayesian Conjugate Models

Ademir Batista dos Santos Neto, Tiago Alessandro Espinola Ferreira, Paulo Renato Alves Firmino

Comments: 18 pages, 3 figures

2605.05267 2026-05-08 cs.SE cs.AI

Bridging Generation and Training: A Systematic Review of Quality Issues in LLMs for Code

Kaifeng He, Xiaojun Zhang, Peiliang Cai, Mingwei Liu, Yanlin Wang, Chong Wang, Kaifeng Huang, Bihuan Chen, Xin Peng, Zibin Zheng

2605.05266 2026-05-08 cs.CR cs.LG

Differential Privacy in the Extensive-Form Bandit Problem

Stephen Pasteris, Rahul Savani, Theodore Turocy

2605.05262 2026-05-08 stat.ML cs.AI cs.LG

Maximizing Rollout Informativeness under a Fixed Budget: A Submodular View of Tree Search for Tool-Use Agentic Reinforcement Learning

Yuelin Hu, Zhenbo Yu, Zhengxue Cheng, Wei Liu, Li Song

Comments: Preprint, 9 pages, 5 figures

Details

Abstract

We formalize Rollout Informativeness under a Fixed Budget (RIFB) as the expected non-vanishing policy-gradient mass that a tool-use rollout set injects into Group Relative Policy Optimization (GRPO). We prove that any budget-agnostic independent sampler suffers a collapse rate bounded away from zero for hard prompts regardless of the budget. Motivated by this, we recast intermediate state selection as a monotone submodular maximization problem, where a greedy one-step selector enjoys a 1 minus 1/e approximation guarantee. Our Uncertainty-aware Upper Confidence Bound (UUCB) terms arise as closed-form marginal gains of this objective. This turns the token-level entropy bonus from an empirical trick into an analytic consequence of the formulation. We present InfoTree, a training-time tree-search framework coupling UUCB with a learned Adaptive Budget Allocator (ABA) and an asynchronous Speculative Expansion scheme. ABA rescues prompts whose initial tree is wasted on uniform outcomes, lifting the mixed-outcome ratio from 58.1 percent to 76.3 percent with less than 5 percent budget overhead. Speculative Expansion reduces wall-clock overhead from 14.3 percent to 4.8 percent by tolerating bounded staleness in UUCB scores. Across nine benchmarks spanning math reasoning (AIME 2024 and 2025, MATH-500, OlympiadBench, USAMO), web-search agents (GAIA, HLE-100, BrowseComp-lite), and tool-rich coding and OS agents (APPS-verified, AgentBench-OS), InfoTree outperforms flat GRPO, DeepSearch, Tree-GRPO, AT2PO, CW-GRPO, and RC-GRPO. Head-to-head compositions with Tree-GRPO prefix sharing and CW-GRPO contribution weights deliver further gains, confirming that our selector operates orthogonally to rollout reuse and trajectory re-weighting. A 5 by 5 by 5 robustness grid reveals that over three quarters of the hyperparameter space lies on a performance plateau, confirming UUCB robustness.
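
The greedy selection rule and its 1 - 1/e guarantee mentioned in this abstract can be illustrated on a toy monotone submodular objective. The sketch below uses set coverage as a stand-in; it is not the RIFB objective or the UUCB score, whose definitions are not given here. The names `coverage` and `greedy_select` and the toy candidate sets are hypothetical.

```python
def coverage(selected, sets):
    """Number of distinct items covered: a monotone submodular set function."""
    covered = set()
    for name in selected:
        covered |= sets[name]
    return len(covered)

def greedy_select(sets, budget):
    """Pick up to `budget` candidates, each time taking the largest marginal gain."""
    selected = []
    for _ in range(budget):
        base = coverage(selected, sets)
        best, best_gain = None, 0
        for name in sorted(sets):  # sorted for a deterministic tie-break
            if name in selected:
                continue
            gain = coverage(selected + [name], sets) - base
            if gain > best_gain:
                best, best_gain = name, gain
        if best is None:  # no candidate adds anything new
            break
        selected.append(best)
    return selected

# Toy candidates: each "state" covers some outcomes; overlap lowers marginal gain.
candidates = {
    "a": {1, 2, 3, 4},
    "b": {3, 4, 5},
    "c": {5, 6},
    "d": {1, 6},
}
chosen = greedy_select(candidates, budget=2)
value = coverage(chosen, candidates)
```

For a monotone submodular objective under a cardinality constraint, the greedy value is at least (1 - 1/e) times the optimum, roughly 63 percent; on this small instance greedy happens to reach the optimum.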

2605.05259 2026-05-08 q-bio.BM cond-mat.mtrl-sci cs.AI q-bio.QM

Enhancing Cryo-EM Density Map Segmentation in Phenix for Improved Atomic Model Building

Chenwei Zhang

Comments: 10 pages, 4 figures, 2 tables

2605.05257 2026-05-08 cs.IR cs.AI cs.CL

Career-Aware Resume Tailoring via Multi-Source Retrieval-Augmented Generation with Provenance Tracking: A Case Study

Kumar Abhinav

Comments: 6 pages, 1 figure, 5 tables. Also available on SSRN

2605.05252 2026-05-08 cs.SE cs.AI

Automated Population-Level Audit Assurance via AI-Based Document Intelligence

Santosh Vasudevan, Velu Natarajan

2605.05251 2026-05-08 cs.CR cs.LG cs.SE

Identifier-Free Code Embedding Models for Scalable Search

Eric Wolos, Michael Doyle

2605.05250 2026-05-08 cs.IR cs.AI

Decision-aware User Simulation Agent for Evaluating Conversational Recommender Systems

Yuan-Chi Li, Li-Chi Chen, Sung-Yi Wu, Yu-Che Tsai, Shou-De Lin

2605.05246 2026-05-08 eess.SP cs.AI

Memory-Efficient EDA Denoising via Knowledge Distillation for Wearable IoT Under Severe Motion Artifacts and Underwater Conditions

Yongbin Lee, Andrew Peitzsch, Youngsun Kong, Jarod Zizza, Dong-hee Kang, Farnoush Baghestani, Ki H. Chon

Details

Abstract

Electrodermal activity (EDA) is widely used in wearable Internet of Medical Things (IoMT) systems for continuous health monitoring, including autonomic assessment. However, EDA signals are highly vulnerable to motion artifacts and environmental noise, limiting reliable deployment in harsh operating conditions such as underwater. This study proposes a robust, deployable EDA denoising framework that generalizes across multiple measurement locations and harsh environments. The framework integrates a hybrid CNN-Transformer teacher model with a lightweight depth-wise separable CNN student model via a knowledge distillation (KD) strategy. To further improve robustness, a realistic data augmentation scheme is introduced to simulate diverse motion artifacts and environmental distortions. The KD-based student model significantly reduces model size (7.87 MB to 0.51 MB) and computational cost (105.1M to 11.61M FLOPs) while maintaining denoising performance (MAE: 0.144, SNR improvement: 12.08 dB) using the public dataset validation. In real-world underwater conditions (UMAC dataset) testing, the proposed method substantially improves skin conductance response reconstruction, reducing mean absolute error from 2.809 to 0.215. Furthermore, on independent testing using the CNS-OT dataset, the denoised signals enhanced downstream CNS-OT prediction performance, achieving the highest AUROC (0.806) compared to prior denoising methods. The proposed method also improved the early prediction rate (sensitivity) from 0.550 to 0.767, enabling CNS-OT prediction up to a median of 6.9 minutes before symptom onset. These results demonstrate that the proposed framework not only improves EDA signal quality but also enhances clinically relevant prediction performance while remaining suitable for deployment in resource-constrained wearable Internet of Things systems operating in harsh environments.
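
The reductions reported in this abstract can be restated as ratios with a short calculation. The figures are taken directly from the abstract; the variable names are hypothetical and the derived ratios are simple arithmetic, not results reported by the authors.

```python
# Figures quoted in the abstract (teacher -> student, or before -> after denoising).
teacher_size_mb, student_size_mb = 7.87, 0.51
teacher_mflops, student_mflops = 105.1, 11.61
mae_before, mae_after = 2.809, 0.215        # UMAC underwater test
sens_before, sens_after = 0.550, 0.767      # early prediction rate

size_ratio = teacher_size_mb / student_size_mb       # model-size compression factor
flops_ratio = teacher_mflops / student_mflops        # compute reduction factor
mae_reduction = 1.0 - mae_after / mae_before         # relative MAE reduction
sens_gain = sens_after - sens_before                 # absolute sensitivity gain

print(f"size: {size_ratio:.1f}x smaller, FLOPs: {flops_ratio:.1f}x fewer")
print(f"MAE reduced by {mae_reduction:.1%}, sensitivity +{sens_gain:.3f}")
```

Read this way, the student model is about 15 times smaller and needs about 9 times fewer FLOPs than the teacher.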

2605.05244 2026-05-08 cs.IR cs.AI

Towards Dependable Retrieval-Augmented Generation Using Factual Confidence Prediction

Florian Geissler, Francesco Carella, Laura Fieback, Jakob Spiegelberg

2605.05242 2026-05-08 cs.IR cs.AI

Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction

Zhuofeng Li, Haoxiang Zhang, Cong Wei, Pan Lu, Ping Nie, Yi Lu, Yuyang Bai, Shangbin Feng, Hangxiao Zhu, Ming Zhong, Yuyu Zhang, Jianwen Xie, Yejin Choi, James Zou, Jiawei Han, Wenhu Chen, Jimmy Lin, Dongfu Jiang, Yu Zhang

2605.05240 2026-05-08 eess.SP cs.AI

PPO-Based Dynamic Positioning of HAPS-BS in Wind-Disturbed Stratospheric Maritime Networks

Azim Akhtarshenas, German Svistunov, Matteo Bernabè, Kuangyu Zheng, David López-Pérez

2605.05238 2026-05-08 cs.IR cs.LG cs.SI

Dynamic Graph with Similarity-Aware Attention Graph Neural Network for Recommender Systems

Aadarsh Senapati, Neha Kujur, Vivek Yelleti

2605.05231 2026-05-08 eess.AS cs.SD

Prompting Whisper for Joint Speech Transcription and Diarization

Mariia Zamyrova, Henk van den Heuvel

Comments: To be presented at the Joint Workshop on HSCMA and CHiME 2026