arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.21596 2026-04-24 stat.ME stat.CO

Efficient Bayes Factor Sensitivity Analysis via Posterior Density Ratios

František Bartoš, Eric-Jan Wagenmakers, Maarten Marsman, Don van den Bergh

详情

英文摘要

Bayes factor sensitivity analysis examines how the evidence for one hypothesis over another depends on the prior distribution. In complex models, the standard approach refits the model at each hyper-parameter value, and the total computational cost scales linearly in the grid size. We propose a method that recovers the entire sensitivity curve from a single additional model fit. The key identity decomposes the Bayes factor at any hyper-parameter value $γ_x$ into an ``anchor'' Bayes factor at a fixed reference $γ_0$ and a Savage--Dickey density ratio in an extended model that places a hyper-prior on $γ$. Once this extended model is fit, the Bayes factor at any $γ_x$ follows from the anchor value and a ratio of two posterior density ordinates. To approximate this ratio, we employ the importance-weighted marginal density estimator (IWMDE). Because the sensitivity parameter enters the model only through the prior distribution on the model parameters, the data likelihood cancels in the IWMDE, reducing it to a simple ratio of prior density evaluations on the MCMC draws, without any additional likelihood computation. The resulting estimator is fast, remains accurate even with small MCMC samples, and substantially outperforms kernel density estimation across the full sensitivity range. The method extends naturally to simultaneous sensitivity over multiple hyper-parameters and to Bayesian model averaging. We illustrate it on a univariate Bayesian $t$-test with exact Bayes factors for validation, a bivariate informed $t$-test, and a Bayesian model-averaged meta-analysis, obtaining accurate sensitivity curves at a fraction of the brute-force cost.

URL PDF HTML ☆

赞 0 踩 0

2604.21595 2026-04-24 stat.ML cs.LG

A Kernel Nonconformity Score for Multivariate Conformal Prediction

Louis Meyer, Wenkai Xu

2604.21549 2026-04-24 cs.AI stat.ME

Unbiased Prevalence Estimation with Multicalibrated LLMs

Fridolin Linder, Thomas Leeper, Daniel Haimovich, Niek Tax, Lorenzo Perini, Milan Vojnovic

2604.21548 2026-04-24 econ.EM stat.ME

Nonparametric Point Identification of Treatment Effect Distributions via Rank Stickiness

Tengyuan Liang

Comments 25 pages, 2 figures

2604.21545 2026-04-24 stat.ME stat.AP

Informed Asymmetric Dirichlet Priors for Multivariate Bernoulli Mixture Models

Luisa Ferrari, Maria Franco Villoria, Garritt L. Page, Alex Laini

Comments 44 pages, 11 figures

2604.21538 2026-04-24 stat.CO

On a class of constrained particle filters for continuous-discrete state space models

Utku Erdogan, Gabriel J. Lord, Joaquin Miguez

Comments arXiv admin note: text overlap with arXiv:2512.11012

2604.21498 2026-04-24 stat.ME stat.AP

Analyzing directional errors in spatial orientation using nonparametric circular regression with mixed covariates

Mario Francisco-Fernández, Andrea Meilán-Vila

Comments 33 pages, 13 figures, 3 tables

2604.21491 2026-04-24 cs.CR stat.AP stat.ME

Benchmarking the Utility of Privacy-Preserving Cox Regression Under Data-Driven Clipping Bounds: A Multi-Dataset Simulation Study

Keita Fukuyama, Yukiko Mori, Tomohiro Kuroda, Hiroaki Kikuchi

Comments 11 pages, 6 figures, 5 tables. Supplementary material (5 pages, 2 figures, 3 tables) included as ancillary file. Submission to IEEE Journal of Biomedical and Health Informatics (J-BHI)

详情

英文摘要

Differential privacy (DP) is a mathematical framework that guarantees individual privacy; however, systematic evaluation of its impact on statistical utility in survival analyses remains limited. In this study, we systematically evaluated the impact of DP mechanisms (Laplace mechanism and Randomized Response) with data-driven clipping bounds on the Cox proportional hazards model, using 5 clinical datasets ($n = 168$--$6{,}524$), 15 levels of $\varepsilon$ (0.1--1000), and $B = 1{,}000$ Monte Carlo iterations. The data-driven clipping bounds used here are observed min/max and therefore do not provide formal $\varepsilon$-DP guarantees; the results represent an optimistic lower bound on utility degradation under formal DP. We compared three types of input perturbations (covariates only, all inputs, and the discrete-time model) with output perturbations (dfbeta-based sensitivity), using loss of significance rate (LSR), C-index, and coefficient bias as metrics. At standard DP levels ($\varepsilon \leq 1$), approximately 90% (90--94%) of the significant covariates lost significance, even in the largest dataset ($n = 6{,}524$), and the predictive performance approached random levels (test C-index $\approx 0.5$) under many conditions. Among the input perturbation approaches, perturbing only covariates preserved the risk-set structure and achieved the best recovery, whereas output perturbation (dfbeta-based sensitivity) maintained near-baseline performance at $\varepsilon \geq 5$. At $n \approx 3{,}000$, the significance recovered rapidly at $\varepsilon = 3$--10; however, in practice, $\varepsilon \geq 10$ (for predictive performance) to $\varepsilon \geq 30$--60 (for significance preservation) is required. In the moderate-to-high $\varepsilon$ range, false-positive rates increased for variables whose baseline $p$-values were near the significance threshold.

URL PDF HTML ☆

赞 0 踩 0

2604.21457 2026-04-24 cs.CY cs.SI stat.AP

Context-Aware Displacement Estimation from Mobile Phone Data: A Methodological Framework

Rajius Idzalika, Muhammad Rheza Muztahid, Radityo Eko Prasojo

Comments 24 pages, 4 figures, 14 tables. Case study: Super Typhoon Nando, Philippines (2025)

2604.21432 2026-04-24 stat.ML cs.LG

A single algorithm for both restless and rested rotting bandits

Julien Seznec, Pierre Ménard, Alessandro Lazaric, Michal Valko

Comments In AISTATS 2020

2604.21372 2026-04-24 stat.AP

Optimal basis risk weighting in expectile-based parametric insurance

Markus Johannes Maier, Matthias Scherer

2604.21292 2026-04-24 math.CO cs.IT math.IT stat.AP

Large values in time series and additive combinatorics

Alex Iosevich, Vishal Gupta

Comments 13 pages, 6 figures

2604.21270 2026-04-24 stat.ML cs.LG cs.SY eess.SY math.OC

CLT-Optimal Parameter Error Bounds for Linear System Identification

Yichen Zhou, Stephen Tu

Comments 36 pages

2604.21260 2026-04-24 stat.ML cs.AI cs.LG econ.EM q-bio.QM stat.ME

Calibeating Prediction-Powered Inference

Lars van der Laan, Mark Van Der Laan

Comments Paper website: https://larsvanderlaan.github.io/ppi-aipw/

2604.21235 2026-04-24 cs.LG cs.CL stat.ME

Learning Dynamic Representations and Policies from Multimodal Clinical Time-Series with Informative Missingness

Zihan Liang, Ziwen Pan, Ruoxuan Xiong

Comments Findings of ACL 2026 (30 pages)

2604.21203 2026-04-24 stat.ML cs.LG

Refining Covariance Matrix Estimation in Stochastic Gradient Descent Through Bias Reduction

Ziyang Wei, Wanrong Zhu, Jingyang Lyu, Wei Biao Wu

2604.21115 2026-04-24 eess.SP stat.AP

Complex Approximate Message Passing with Non-separable Denoising

Vishnu Teja Kunde, Alessandro Mirri, Jean-Francois Chamberland, Enrico Paolini

2604.21110 2026-04-24 stat.ME math.ST stat.TH

A goodness-of-fit test for the logistic propensity score model under nonignorable missing data

Manli Cheng, Yangjianchen Xu, Qinglong Tian, Pengfei Li

Comments 18 pages

2604.21097 2026-04-24 stat.ML cs.LG

Learning to Emulate Chaos: Adversarial Optimal Transport Regularization

Gabriel Melo, Leonardo Santiago, Peter Y. Lu

2604.21067 2026-04-24 stat.AP

The geometry of conflict : 3D Spatio-temporal patterns in fatalities prediction

Thomas Schincariol

Comments 68 Pages, 34 figures

2604.21020 2026-04-24 stat.ME

A Functional-Class Meta-Analytic Framework for Quantifying Surrogate Resilience

Emily Hsiao, Layla Parast

2604.21009 2026-04-24 stat.ME stat.CO

Revisiting Bayesian Variable Selection via Optimization

Leo L Duan

2604.20978 2026-04-24 stat.ME

ML, PL, QL in Markov chain models

Nils Lid Hjort, Cristiano Varin

Comments 34 pages, 7 figures. This is the Statistical Research Report version, Department of Mathematics, University of Oslo version, April 2005, with some more examples and material than in the published version, Scandinavian Journal of Statistics, 2008, vol. 35, pages 64-82

2604.20949 2026-04-24 cs.LG q-fin.TR stat.ME stat.ML

Early Detection of Latent Microstructure Regimes in Limit Order Books

Prakul Sunil Hiremath, Vruksha Arun Hiremath

Comments 48 pages, 7 figures. Combines theoretical guarantees (identifiability and early-detection bounds), 200-run simulation study, and preliminary real-data evaluation on BTC/USDT limit order books. Code and data available

2604.20907 2026-04-24 stat.ML cs.LG math.CO math.PR math.ST stat.TH

Achieving the Kesten-Stigum bound in the non-uniform hypergraph stochastic block model

Manuel Fernandez, Ludovic Stephan, Yizhe Zhu

Comments 67 pages, 1 figure

2604.20877 2026-04-24 q-fin.RM stat.AP stat.ME

When AAA Satisfies Nothing: Impossibility Theorems for Structured Credit Ratings

Marco Pollanen

Comments 22 pages, 7 tables, 1 figure. Methodological paper on reliability bounds and discrimination limits, with application to structured credit ratings

2604.19738 2026-04-24 math.PR cs.LG stat.ML

Phase Transitions in the Fluctuations of Functionals of Random Neural Networks

Simmaco Di Lillo, Leonardo Maini, Domenico Marinucci

2604.16645 2026-04-24 stat.ME math.ST stat.TH

Strang splitting estimator for nonlinear multivariate stochastic differential equations with Pearson-type multiplicative noise

Predrag Pilipović, Adeline Samson, Susanne Ditlevsen

Comments 27 pages of main text, 14 pages of supplementary materials, 8 figures

2604.04141 2026-04-24 stat.ME math.ST stat.AP stat.TH

On Data Thinning for Model Validation in Small Area Estimation

Sho Kawano, Paul A. Parker, Zehang Richard Li

2603.20903 2026-04-24 math.OC hep-ph stat.ML

Unfolding with a Wasserstein Loss

Katy Craig, Benjamin Faktor, Benjamin Nachman

2603.15055 2026-04-24 stat.ML cs.LG math.ST stat.TH

Spatio-temporal probabilistic forecast using MMAF-guided learning

Leonardo Bardi, Imma Valentina Curato, Lorenzo Proietti

2603.03700 2026-04-24 stat.ML cs.AI cs.LG math.ST stat.TH

Generalization Properties of Score-matching Diffusion Models for Intrinsically Low-dimensional Data

Saptarshi Chakraborty, Quentin Berthet, Peter L. Bartlett

详情

英文摘要

Despite the remarkable empirical success of score-based diffusion models, their statistical guarantees remain underdeveloped. Existing analyses often provide pessimistic convergence rates that do not reflect the intrinsic low-dimensional structure common in real data, such as that arising in natural images. In this work, we study the statistical convergence of score-based diffusion models for learning an unknown distribution $μ$ from finitely many samples. Under mild regularity conditions on the forward diffusion process and the data distribution, we derive finite-sample error bounds on the learned generative distribution, measured in the Wasserstein-$p$ distance. Unlike prior results, our guarantees hold for all $p \ge 1$ and require only a finite-moment assumption on $μ$, without compact-support, manifold, or smooth-density conditions. Specifically, given $n$ i.i.d.\ samples from $μ$ with finite $q$-th moment and appropriately chosen network architectures, hyperparameters, and discretization schemes, we show that the expected Wasserstein-$p$ error between the learned distribution $\hatμ$ and $μ$ scales as $\mathbb{E}\, \mathbb{W}_p(\hatμ,μ) = \widetilde{O}\!\left(n^{-1 / d^\ast_{p,q}(μ)}\right),$ where $d^\ast_{p,q}(μ)$ is the $(p,q)$-Wasserstein dimension of $μ$. Our results demonstrate that diffusion models naturally adapt to the intrinsic geometry of data and mitigate the curse of dimensionality, since the convergence rate depends on $d^\ast_{p,q}(μ)$ rather than the ambient dimension. Moreover, our theory conceptually bridges the analysis of diffusion models with that of GANs and the sharp minimax rates established in optimal transport. The proposed $(p,q)$-Wasserstein dimension also extends the notion of classical Wasserstein dimension to distributions with unbounded support, which may be of independent theoretical interest.

URL PDF HTML ☆

赞 0 踩 0

2602.18577 2026-04-24 stat.ME stat.CO

balnet: Pathwise Estimation of Covariate Balancing Propensity Scores

Erik Sverdrup, Trevor Hastie

2602.06262 2026-04-24 stat.ME stat.AP

Latent variation in pathogen strain-specific effects under multiple-versions-of-treatment theory

Bronner P. Gonçalves

Comments 9 pages, 1 figure

2511.14354 2026-04-24 math.ST stat.TH

Asymptotic Distribution of Constrained Nearly-Isotonic Graph Fused Lasso

Vladimir Pastukhov

Comments 11 pages, 1 figure

2510.04548 2026-04-24 cond-mat.dis-nn cs.LG stat.ML

Learning Linear Regression with Low-Rank Tasks in-Context

Kaito Takanami, Takashi Takahashi, Yoshiyuki Kabashima

Comments Accepted at AISTATS 2026

2509.25630 2026-04-24 stat.ML cs.LG cs.NA math.NA

When Langevin Monte Carlo Meets Randomization: New Sampling Algorithms with Non-asymptotic Error Bounds beyond Log-Concavity and Gradient Lipschitzness

Xiaojie Wang, Bin Yang

2509.03476 2026-04-24 stat.ME

Temporal dependence in exposure and hazard-based infectious disease interventions

Hiroyasu Ando, A. James O'Malley, Akihiro Nishi

Comments 15 pages, 3 figures

2508.10612 2026-04-24 math.ST stat.TH

Approximation rates for finite mixtures of location-scale models and fast least-squares estimators

Hien Duy Nguyen, TrungTin Nguyen, Jacob Westerhout, Xin Guo

2506.12721 2026-04-24 cs.AI cs.CL cs.LG stat.ML

Strategic Scaling of Test-Time Compute: A Bandit Learning Approach

Bowen Zuo, Yinglun Zhu

Comments To appear at ICLR 2026

2506.10374 2026-04-24 cs.IT math.IT math.ST stat.TH

Optimal Non-Adaptive Group Testing with One-Sided Error Guarantees

Daniel McMorrow, Jonathan Scarlett

2506.04292 2026-04-24 cs.SI cs.LG stat.AP

GARG-AML against Smurfing: A Scalable and Interpretable Graph-Based Framework for Anti-Money Laundering

Bruno Deprez, Bart Baesens, Tim Verdonck, Wouter Verbeke

2501.06133 2026-04-24 stat.ME math.ST stat.TH

Testing conditional independence under isotonicity

Rohan Hore, Jake A. Soloff, Rina Foygel Barber, Richard J. Samworth

Comments 79 pages, 7 figures, 2 Table

2407.13970 2026-04-24 math.ST stat.TH

Frequentist Coverage of Bayes Posteriors in Nonlinear Inverse Problems with Gaussian Priors

Youngsoo Baek, Katerina Papagiannouli

Comments 42 pages, 2 figures

2401.16407 2026-04-24 stat.ML cs.LG eess.IV eess.SP

Is K-fold cross validation the best model selection method for Machine Learning?

Juan M Gorriz, R. Martin Clemente, F Segovia, J Ramirez, A Ortiz, J. Suckling

Comments 40 pages, 24 figures

详情

DOI: 10.1016/j.inffus.2026.104404

英文摘要

As a technique that can compactly represent complex patterns, machine learning has significant potential for predictive inference. K-fold cross-validation (CV) is the most common approach to ascertaining the likelihood that a machine learning outcome is generated by chance, and it frequently outperforms conventional hypothesis testing. This improvement uses measures directly obtained from machine learning classifications, such as accuracy, that do not have a parametric description. To approach a frequentist analysis within machine learning pipelines, a permutation test or simple statistics from data partitions (i.e., folds) can be added to estimate confidence intervals. Unfortunately, neither parametric nor non-parametric tests solve the inherent problems of partitioning small sample-size datasets and learning from heterogeneous data sources. The fact that machine learning strongly depends on the learning parameters and the distribution of data across folds recapitulates familiar difficulties around excess false positives and replication. A novel statistical test based on K-fold CV and the Upper Bound of the actual risk (K-fold CUBV) is proposed, where uncertain predictions of machine learning with CV are bounded by the worst case through the evaluation of concentration inequalities. Probably Approximately Correct-Bayesian upper bounds for linear classifiers in combination with K-fold CV are derived and used to estimate the actual risk. The performance with simulated and neuroimaging datasets suggests that K-fold CUBV is a robust criterion for detecting effects and validating accuracy values obtained from machine learning and classical CV schemes, while avoiding excess false positives.

URL PDF HTML ☆

赞 0 踩 0

2309.07176 2026-04-24 cs.LG stat.ML

Mind the Gap: Optimal and Equitable Encouragement Policies

Angela Zhou

Comments Updated with major new case study on SNAP recertification benefits

2303.03237 2026-04-24 stat.ML cs.LG math.ST stat.CO stat.TH

Convergence Rates for Non-Log-Concave Sampling and Log-Partition Estimation

David Holzmüller, Francis Bach

Comments Published in JMLR. New in v4: Summary tables / sections. Plots can be reproduced using the code at https://github.com/dholzmueller/sampling_experiments