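arXivDaily: a daily academic digest synchronized with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems science, and related fields.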

2605.06315 2026-05-08 stat.ML cs.LG

End-to-End Identifiable and Consistent Recurrent Switching Dynamical Systems

Carles Balsells-Rodas, Zhengrui Xiang, Xavier Sumba, Yingzhen Li

Abstract

Learning identifiable representations in deep generative models remains a fundamental challenge, particularly for sequential data with regime-switching dynamics. Existing approaches establish identifiability under restrictive assumptions, such as stationarity or limited emission models, and typically rely on variational autoencoder (VAE) estimators, which introduce approximation gaps that limit the recovery of the latent structure. In this work, we address both the theoretical and practical limitations of this setting. First, we establish identifiability of a broad class of recurrent nonlinear switching dynamical systems under flexible assumptions, significantly extending prior results. Second, we introduce $Ω$SDS, a flow-based estimator that enables exact likelihood optimization using expectation-maximisation. Through empirical validation on both synthetic and real-world data, our results demonstrate that $Ω$SDS achieves improved disentanglement compared to VAE-based estimators and more accurate forecasting of underlying dynamics.

2605.06289 2026-05-08 stat.ML cs.AI cs.LG

Multimodal Deep Generative Model for Semi-Supervised Learning under Class Imbalance

Heegeon Yoon, Heeyoung Kim

2605.06288 2026-05-08 stat.ME cs.AI

A Topological Sorting Criterion for Random Causal Directed Acyclic Graphs

Alexander G. Reisach, Antoine Chambaz, Gilles Blanchard, Sebastian Weichwald

2605.06279 2026-05-08 cs.SE cs.AI

Correct Code, Vulnerable Dependencies: A Large Scale Measurement Study of LLM-Specified Library Versions

Chengjie Wang, Jingzheng Wu, Xiang Ling, Tianyue Luo, Chen Zhao

Comments 35 pages, 8 figures

2605.06265 2026-05-08 stat.ML cs.LG

ConquerNet: Convolution-Smoothed Quantile ReLU Neural Networks with Minimax Guarantees

Tianpai Luo, Fangwei Wu, Weichi Wu

2605.06210 2026-05-08 stat.ML cs.AI cs.LG stat.AP stat.ME

Super-Level-Set Regression: Conditional Quantiles via Volume Minimization

Sacha Braun, Michael I. Jordan, Francis Bach

2605.06204 2026-05-08 stat.ML cs.LG

When Does Trimming Help Conformal Prediction? A Retained-Law Diagnostic under Calibration Contamination

Congye Wang

2605.06189 2026-05-08 eess.AS cs.LG

Predictive-Generative Drift Decomposition for Speech Enhancement and Separation

Julius Richter, Yoshiki Masuyama, Christoph Boeddeker, Takahiro Edo, Gordon Wichern, Jonathan Le Roux

Comments Submitted to NeurIPS 2026

2605.06172 2026-05-08 stat.ML cs.LG cs.NA math.NA math.PR

Expressivity of Bi-Lipschitz Normalizing Flows: A Score-Based Diffusion Perspective

Meira Iske, Carola-Bibiane Schönlieb

2605.06153 2026-05-08 cs.CR cs.CV

Secure Seed-Based Multi-bit Watermarking for Diffusion Models from First Principles

Enoal Gesny, Eva Giboulot

2605.06136 2026-05-08 cs.SE cs.AI

BUILD-AND-FIND: An Effort-Aware Protocol for Evaluating Agent-Managed Codebases

Jhen-Ke Lin

Comments 25 pages, 8 figures, 17 tables

2605.06134 2026-05-08 hep-lat cs.LG

Diffusion model for SU(N) gauge theories

Javad Komijani, Marina K. Marinkovic, Lara Turgut

Comments 23 pages, 6 figures

2605.06111 2026-05-08 cs.SE cs.AI

Schedule-and-Calibrate: Utility-Guided Multi-Task Reinforcement Learning for Code LLMs

Yujia Chen, Yang Ye, Xiao Chu, Yuchi Ma, Cuiyun Gao

2605.06091 2026-05-08 math.ST cs.LG math.PR stat.CO stat.TH

Time-Inhomogeneous Preconditioned Langevin Dynamics

Alexander Falk, Laurenz Nagler, Andreas Habring, Thomas Pock

2605.06082 2026-05-08 cs.AR cs.LG cs.PF

PoTAcc: A Pipeline for End-to-End Acceleration of Power-of-Two Quantized DNNs

Rappy Saha, Jude Haris, Nicolas Bohm Agostini, David Kaeli, José Cano

Comments Accepted to IEEE Transactions on Circuits and Systems for Artificial Intelligence (TCASAI), 2026

2605.06059 2026-05-08 stat.AP cs.LG

Correcting heterogeneous diagnostic bias when developing clinical prediction models using causal hidden Markov models

Jose Benitez-Aurioles, Ricardo Silva, Brian McMillan, Matthew Sperrin

Comments 4 figures, 2 tables, 4 supplementaries

Abstract

In routine care, individuals identified a priori as high-risk are usually tested for conditions more frequently. Protected attributes, such as sex or ethnicity, may also determine testing frequency. Such heterogeneous detection rates across a population induce label error. This causes systematic model error for specific groups and biases performance metrics during validation. This paper proposes a method to correct for such bias in prediction models due to differential diagnostic delay. We use a causal inference framework to define our target estimand: an individual's diagnosis probability in a counterfactual scenario where their diagnosis rate matches that of a reference group. We model the longitudinal process as a hidden Markov model, in which confirmatory test results are emissions from a latent progressive disease stage. We validate our approach in simulated data and apply it to a case study of chronic kidney disease prediction using electronic health records. In simulations, our method reduces prediction bias and improves calibration-in-the-large, correcting the Observed:Expected ratio in the underdiagnosed group from 1.34 (standard deviation: 0.09) in a model developed without any correction for underdiagnosis bias to 1.02 (0.09). Violations of assumptions in the simulation affected the estimation of model parameters, but the proposed approach nonetheless remained better calibrated than the standard model. In the clinical case study, we identify diabetes as the main driver of observability, with an odds ratio of 10.36 (95% confidence interval, 9.80 - 11.02) in 6-month urine albumin-creatinine ratio testing rate. Using our approach to predict the counterfactual diagnostic rate in patients without diabetes, we improved the Observed:Expected ratio of a developed clinical prediction model from 1.55 (1.51 - 1.59) to 1.01 (0.98 - 1.04).
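For readers unfamiliar with the Observed:Expected (O:E) ratio used above as a calibration-in-the-large measure, the following is a minimal sketch of how such a ratio is commonly computed: the number of observed events divided by the sum of predicted probabilities. The function name and the toy data are assumptions made for illustration only and are not taken from the paper.

```python
def observed_expected_ratio(outcomes, predicted_probs):
    """Observed events divided by the sum of predicted probabilities.

    A value near 1 suggests good calibration-in-the-large; a value
    above 1 means the model predicts fewer events than were observed.
    """
    if len(outcomes) != len(predicted_probs):
        raise ValueError("outcomes and predicted_probs must have equal length")
    expected = sum(predicted_probs)
    if expected == 0:
        raise ValueError("sum of predicted probabilities is zero")
    return sum(outcomes) / expected


# Toy data (hypothetical): 4 events among 8 individuals.
outcomes = [1, 0, 1, 1, 0, 0, 1, 0]
underpredicting = [0.3] * 8   # expects about 2.4 events
calibrated = [0.5] * 8        # expects exactly 4 events

print(observed_expected_ratio(outcomes, underpredicting))
print(observed_expected_ratio(outcomes, calibrated))
```

In this toy example the first model yields a ratio well above 1 (it underpredicts), while the second yields a ratio of 1.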

2605.05996 2026-05-08 stat.ML cs.LG

Gaussian mixture models in Hilbert spaces via kernel methods

Daniel López-Montero, Antonio Álvarez-López, Marcos Matabuena

Comments 38 pages, 13 figures

2605.05993 2026-05-08 stat.ML cs.LG stat.ME stat.OT

TabCF: Distributional Control Function Estimation with Tabular Foundation Models

Geping Chen, Chunlin Li, Tianzhong Yang, Zhengyuan Zhu, Jing Zhou

2605.05973 2026-05-08 stat.ML cs.AI cs.LG stat.AP

Towards Reliable LLM Evaluation: Correcting the Winner's Curse in Adaptive Benchmarking

Yang Xu, Jiefu Zhang, Haixiang Sun, Zihan Zhou, Tianyu Cao, Vaneet Aggarwal

2605.04918 2026-05-08 math.AP cs.LG cs.NA math.NA

Neural Discovery of Strichartz Extremizers

Nicolás Valenzuela, Ricardo Freire, Claudio Muñoz

Comments 38 pages, 26 figures; v.2: corrected typos

2605.04510 2026-05-08 math.OC cs.AI cs.LG

Predictive and Prescriptive AI toward Optimizing Wildfire Suppression

Leonard Boussioux, Alexandre Jacquillat, Ryne Reger, Jacob Wachspress

2605.03482 2026-05-08 cs.CR cs.AI cs.LG

MEMSAD: Gradient-Coupled Anomaly Detection for Memory Poisoning in Retrieval-Augmented Agents

Ishrith Gowda

Comments 28 pages, 9 figures, 6 theorems. Submitted to NeurIPS 2026

2605.03213 2026-05-08 cs.CR cs.AI

When Agents Handle Secrets: A Survey of Confidential Computing for Agentic AI

Javad Forough, Marios Kogias, Hamed Haddadi

2604.20050 2026-05-08 econ.GN cs.AI cs.GT q-fin.EC

Information Aggregation with AI Agents

Spyros Galanis

Comments 64 pages

2604.06269 2026-05-08 q-bio.QM cs.AI

MAT-Cell: A Multi-Agent Tree-Structured Reasoning Framework for Batch-Level Single-Cell Annotation

Yehui Yang, Zelin Zang, Xienan Zheng, Yuzhe Jia, Changxi Chi, Jingbo Zhou, Chang Yu, Jinlin Wu, Fuji Yang, Jiebo Luo, Zhen Lei, Stan Z. Li

2603.23055 2026-05-08 stat.ML cs.IT cs.LG math.IT

Post-Selection Distributional Model Evaluation

Amirmohammad Farzaneh, Osvaldo Simeone

2603.12278 2026-05-08 q-bio.OT cs.AI cs.LG

Unsupervised Anomaly Detection in Wearable Foot Sensor Data: A Baseline Feasibility Study Towards Diabetic Foot Ulcer Prevention

Md Tanvir Hasan Turja

Comments 36 pages, 19 figures. Published in Biomedical Signal Processing and Control, Vol. 123, Part A, 110416, September 2026. https://doi.org/10.1016/j.bspc.2026.110416

DOI: 10.1016/j.bspc.2026.110416
Journal ref: Biomedical Signal Processing and Control, Vol. 123, Part A, 110416 (2026)

Abstract

Diabetic foot ulcers (DFUs) are a severe complication of diabetes associated with significant morbidity, amputation risk, and healthcare burden. Developing effective continuous monitoring frameworks requires first establishing reliable baseline models of normal foot biomechanics. This paper presents a feasibility study of an anomaly detection framework applied to time-series data from wearable foot sensors, specifically NTC thin-film thermocouples for temperature and FlexiForce A401 pressure sensors for plantar load monitoring. Data were collected from healthy adult subjects across 312 capture sessions on an instrumented pathway, generating 93,790 valid multi-sensor readings spanning September 2023 to June 2024. Two unsupervised algorithms, Isolation Forest and K-Nearest Neighbors using Local Outlier Factor (KNN/LOF), were applied to detect statistical deviations in foot temperature and pressure signals. Results show that Isolation Forest is more sensitive to subtle, distributed anomalies, while KNN/LOF identifies concentrated extreme deviations but flags a higher proportion of sessions not corroborated by Isolation Forest. Since no clinical ground truth is available, this difference is interpreted as lower specificity under the shared 5 percent contamination assumption rather than a confirmed false-positive rate. A mild positive correlation (0.41-0.48) between pressure and temperature features supports the case for combined multi-modal monitoring. These findings establish a validated baseline analytical pipeline and provide a methodological foundation for future clinical validation studies involving diabetic patients, where the relationship between detected anomalies and DFU-related pathophysiology can be directly assessed.
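The comparison above (two detectors run under a shared 5 percent contamination assumption, with one flagging sessions the other does not corroborate) can be illustrated with a small sketch. This is not the paper's pipeline: the anomaly scores are made-up placeholders rather than Isolation Forest or LOF outputs, and the helper names `flag_top_fraction` and `uncorroborated_fraction` are hypothetical.

```python
def flag_top_fraction(scores, contamination=0.05):
    """Flag the highest-scoring items.

    The number flagged is round(contamination * n), with a minimum of 1.
    Higher score is assumed to mean "more anomalous".
    """
    n_flag = max(1, round(contamination * len(scores)))
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return set(ranked[:n_flag])


def uncorroborated_fraction(flags_a, flags_b):
    """Fraction of items flagged by detector B that detector A did not flag."""
    if not flags_b:
        return 0.0
    return len(flags_b - flags_a) / len(flags_b)


# Placeholder scores for 40 sessions (hypothetical values).
scores_a = [0.1] * 40
scores_a[3], scores_a[17] = 0.9, 0.8
scores_b = [0.1] * 40
scores_b[3], scores_b[25] = 0.95, 0.7

flags_a = flag_top_fraction(scores_a)   # sessions 3 and 17
flags_b = flag_top_fraction(scores_b)   # sessions 3 and 25
print(uncorroborated_fraction(flags_a, flags_b))
```

Without clinical ground truth, a quantity like this only measures disagreement between detectors; it cannot by itself establish a false-positive rate, which matches the caution expressed in the abstract.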

2603.12031 2026-05-08 cs.DC cs.LG cs.MA

AGMARL-DKS: An Adaptive Graph-Enhanced Multi-Agent Reinforcement Learning for Dynamic Kubernetes Scheduling

Hamed Hamzeh

Abstract

State-of-the-art cloud-native applications require intelligent schedulers that can effectively balance system stability, resource utilisation, and associated costs. While Kubernetes provides feasibility-based placement by default, recent research efforts have explored the use of reinforcement learning (RL) for more intelligent scheduling decisions. However, current RL-based schedulers have three major limitations. First, most of these schedulers use monolithic centralised agents, which are non-scalable for large heterogeneous clusters. Second, the ones that use multi-objective reward functions assume simple, static, linear combinations of the objectives. Third, no previous work has produced a stress-aware scheduler that can react adaptively to dynamic conditions. To address these gaps in current research, we propose the Adaptive Graph-enhanced Multi-Agent Reinforcement Learning Dynamic Kubernetes Scheduler (AGMARL-DKS). AGMARL-DKS addresses these gaps by introducing three major innovations. First, we construct a scalable solution by treating the scheduling challenge as a cooperative multi-agent problem, where every cluster node operates as an agent, employing centralised training methods before decentralised execution. Second, to be context-aware and yet decentralised, we use a Graph Neural Network (GNN) to build a state representation of the global cluster context at each agent. This represents an improvement over methods that rely solely on local observations. Finally, to make trade-offs between these objectives, we use a stress-aware lexicographical ordering policy instead of a simple, static linear weighting of these objectives. The evaluations in Google Kubernetes Engine (GKE) reveal that AGMARL-DKS significantly outperforms the default scheduler in terms of fault tolerance, utilisation, and cost, especially in scheduling batch and mission-critical workloads.
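To make the contrast between lexicographic ordering and a static linear weighting concrete, here is a minimal sketch of one way a stress-dependent lexicographic choice could look. It is an illustration under assumptions, not the AGMARL-DKS policy: the objective names, the two priority orders, the stress threshold, and the function `pick_node` are all hypothetical.

```python
from typing import Dict

# Hypothetical objectives; lower is better for each of them.
NORMAL_ORDER = ("cost", "utilisation_gap", "fault_risk")
STRESSED_ORDER = ("fault_risk", "utilisation_gap", "cost")


def pick_node(candidates: Dict[str, Dict[str, float]],
              stress: float,
              threshold: float = 0.8) -> str:
    """Choose a node by comparing objectives in strict priority order.

    Under high stress the priority order changes so that fault risk is
    compared first; later objectives only break ties in earlier ones.
    """
    order = STRESSED_ORDER if stress >= threshold else NORMAL_ORDER
    return min(
        candidates,
        key=lambda name: tuple(candidates[name][obj] for obj in order),
    )


candidates = {
    "node-a": {"cost": 1.0, "utilisation_gap": 0.4, "fault_risk": 0.30},
    "node-b": {"cost": 2.0, "utilisation_gap": 0.2, "fault_risk": 0.05},
}

print(pick_node(candidates, stress=0.2))  # cost compared first
print(pick_node(candidates, stress=0.9))  # fault risk compared first
```

Unlike a weighted sum, no amount of improvement in a lower-priority objective can outweigh a worse value in a higher-priority one, which is the property a lexicographic ordering provides.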

2603.02950 2026-05-08 cs.CY cs.AI cs.GT

Path Dependence under Adaptive AI Delegation

Lingxiao Huang, Nisheeth K. Vishnoi

2603.01192 2026-05-08 stat.ML cs.LG

A Basin-Selection Perspective on Grokking via Singular Learning Theory

Ben Cullen, Sergio Estan-Ruiz, Riya Danait, Jiayi Li