arXivDaily arXiv每日学术速递 周一至周五更新
重置
2604.16288 2026-04-20 math.AP cond-mat.stat-mech math-ph math.MP math.PR stat.ML

Phase transitions in Doi-Onsager, Noisy Transformer, and other multimodal models

Kyunghoo Mun, Matthew Rosenzweig

Comments 16 pages

详情
英文摘要

We study phase transitions for repulsive-attractive mean-field free energies on the circle. For a $\frac{1}{n+1}$-periodic interaction whose Fourier coefficients satisfy a certain decay condition, we prove that the critical coupling strength $K_c$ coincides with the linear stability threshold $K_\#$ of the uniform distribution and that the phase transition is continuous, in the sense that the uniform distribution is the unique global minimizer at criticality. The proof is based on a sharp coercivity estimate for the free energy obtained from the constrained Lebedev--Milin inequality. We apply this result to three motivating models for which the exact value of the phase transition and its (dis)continuity in terms of the model parameters was not fully known. For the two-dimensional Doi--Onsager model $W(θ)=-|\sin(2πθ)|$, we prove that the phase transition is continuous at $K_c=K_\#=3π/4$. For the noisy transformer model $W_β(θ)=(e^{β\cos(2πθ)}-1)/β$, we identify the sharp threshold $β_*$ such that $K_c(β) = K_\#(β)$ and the phase transition is continuous for $β\leq β_*$, while $K_c(β)<K_\#(β)$ and the phase transition is discontinuous for $β> β_*$. We also obtain the corresponding sharp dichotomy for the noisy Hegselmann--Krause model $W_{R}(θ) = (R-2π|θ|)_{+}^2$ .

2604.16239 2026-04-20 stat.ML cs.LG

Adaptive multi-fidelity optimization with fast learning rates

Come Fiegel, Victor Gabillon, Michal Valko

Comments Published at International Conference on Artificial Intelligence and Statistics (AISTATS) 2020

详情
Journal ref
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS), 2020
英文摘要

In multi-fidelity optimization, biased approximations of varying costs of the target function are available. This paper studies the problem of optimizing a locally smooth function with a limited budget, where the learner has to make a tradeoff between the cost and the bias of these approximations. We first prove lower bounds for the simple regret under different assumptions on the fidelities, based on a cost-to-bias function. We then present the Kometo algorithm which achieves, with additional logarithmic factors, the same rates without any knowledge of the function smoothness and fidelity assumptions, and improves previously proven guarantees. We finally empirically show that our algorithm outperforms previous multi-fidelity optimization methods without the knowledge of problem-dependent parameters.

2604.16238 2026-04-20 cs.LG physics.ao-ph stat.ML

Enhancing AI and Dynamical Subseasonal Forecasts with Probabilistic Bias Correction

Hannah Guan, Soukayna Mouatadid, Paulo Orenstein, Judah Cohen, Haiyu Dong, Zekun Ni, Jeremy Berman, Genevieve Flaspohler, Alex Lu, Jakob Schloer, Joshua Talib, Jonathan A. Weyn, Lester Mackey

详情
英文摘要

Decision-makers rely on weather forecasts to plant crops, manage wildfires, allocate water and energy, and prepare for weather extremes. Today, such forecasts enjoy unprecedented accuracy out to two weeks thanks to steady advances in physics-based dynamical models and data-driven artificial intelligence (AI) models. However, model skill drops precipitously at subseasonal timescales (2 - 6 weeks ahead), due to compounding errors and persistent biases. To counter this degradation, we introduce probabilistic bias correction (PBC), a machine learning framework that substantially reduces systematic error by learning to correct historical probabilistic forecasts. When applied to the leading dynamical and AI models from the European Centre for Medium-Range Weather Forecasts (ECMWF), PBC doubles the subseasonal skill of the AI Forecasting System and improves the skill of the operationally-debiased dynamical model for 91% of pressure, 92% of temperature, and 98% of precipitation targets. We designed PBC for operational deployment, and, in ECMWF's 2025 real-time forecasting competition, its global forecasts placed first for all weather variables and lead times, outperforming the dynamical models from six operational forecasting centers, an international dynamical multi-model ensemble, ECMWF's AI Forecasting System, and the forecasting systems of 34 teams worldwide. These probabilistic skill gains translate into more accurate prediction of extreme events and have the potential to improve agricultural planning, energy management, and disaster preparedness in vulnerable communities.

2604.16221 2026-04-20 stat.ME

Network Meta-analysis and Diffusion

Gerta Rücker, Annabel L. Davies, Guido Schwarzer

Comments 19 pages, 8 figures

详情
英文摘要

We show that the covariance matrix of the treatment effect estimates in a network meta-analysis can be obtained without matrix inversion using a geometric series of diffusion matrices. This property extends to the hat matrix and provides a connection between parameter estimation in regression analysis and random walks on the network graph. We also provide a number of visualization tools implemented in R.

2604.16219 2026-04-20 math.ST stat.ME stat.TH

Simultaneous Inference for Covariance and Precision Matrices of Long-Range Dependent Time Series

Percy S. Zhai, Mladen Kolar, Wei Biao Wu

详情
英文摘要

For time series with long-range temporal dependence, inference for covariance and precision matrices is non-trivial. We propose a Berry-Esseen type Gaussian approximation result that gives a finite-sample bound for the Kolmogorov distance between the infinity norms of the estimation error of sample covariance matrix and the corresponding Gaussian approximation. The method utilizes martingale and m-dependent approximation and relies on constructing triadic blocks. We also establish a bootstrapping result with block sampling method, which preserves validity despite strong temporal dependence. Our results on covariance allow ultra-high-dimensional settings where the dimension of time series can grow sub-exponentially with sample size. Similar results can be built for precision matrix under low-dimensional settings. No assumption is required on the structure of covariance and precision matrices.

2604.16206 2026-04-20 math.PR math.ST stat.TH

Extrapolation of max-stable random fields with Fréchet marginals

Vitalii Makogin, Evgeny Spodarev, Ilja Sukhanov

Comments 32 pages, 9 figures

详情
英文摘要

We propose a method for the prediction of stationary max--stable random fields with $α$-Fréchet marginal distribution $H_α$. The method is suitable to cope with heavy tails for $α\in(0,2)$ and is (approximately) exact in marginal distributions. It is based on a recent extrapolation approach via level sets which requires no moment assumptions. An explicit connection between the excursion metric and the Davis-Resnick distance is established. The existence of the predictor is proven. The non-uniqueness of the forecast is demonstrated on several examples. The method is tested on multiple simulated time series and random fields as well as applied to real data of annual maximum precipitation.

2604.16203 2026-04-20 stat.ME stat.AP stat.ML

A Bayesian Updating Framework for Long-term Multi-Environment Trial Data in Plant Breeding

Stephan Bark, Waqas Ahmed Malik, Maryna Prus, Hans-Peter Piepho, Volker Schmid

Comments 27 pages, 4 figures, 2 tables; includes supplementary material and reproducible code (GitHub link)

详情
英文摘要

In variety testing, multi-environment trials (MET) are essential for evaluating the genotypic performance of crop plants. A persistent challenge in the statistical analysis of MET data is the estimation of variance components, which are often still inaccurately estimated or shrunk to exactly zero when using residual (restricted) maximum likelihood (REML) approaches. At the same time, institutions conducting MET typically possess extensive historical data that can, in principle, be leveraged to improve variance component estimation. However, these data are rarely incorporated sufficiently. The purpose of this paper is to address this gap by proposing a Bayesian framework that systematically integrates historical information to stabilize variance component estimation and better quantify uncertainty. Our Bayesian linear mixed model (BLMM) reformulation uses priors and Markov chain Monte Carlo (MCMC) methods to maintain the variance components as positive, yielding more realistic distributional estimates. Furthermore, our model incorporates historical prior information by managing MET data in successive historical data windows. Variance component prior and posterior distributions are shown to be conjugate and belong to the inverse gamma and inverse Wishart families. While Bayesian methodology is increasingly being used for analyzing MET data, to the best of our knowledge, this study comprises one of the first serious attempts to objectively inform priors in the context of MET data. This refers to the proposed Bayesian updating approach. To demonstrate the framework, we consider an application where posterior variance component samples are plugged into an A-optimality experimental design criterion to determine the average optimal allocations of trials to agro-ecological zones in a sub-divided target population of environments (TPE).

2604.16111 2026-04-20 cs.LG stat.ML

Sample Complexity Bounds for Stochastic Shortest Path with a Generative Model

Jean Tarbouriech, Matteo Pirotta, Michal Valko, Alessandro Lazaric

Comments Accepted at the 32nd International Conference on Algorithmic Learning Theory (ALT 2021)

详情
英文摘要

We study the sample complexity of learning an $ε$-optimal policy in the Stochastic Shortest Path (SSP) problem. We first derive sample complexity bounds when the learner has access to a generative model. We show that there exists a worst-case SSP instance with $S$ states, $A$ actions, minimum cost $c_{\min}$, and maximum expected cost of the optimal policy over all states $B_{\star}$, where any algorithm requires at least $Ω(SAB_{\star}^3/(c_{\min}ε^2))$ samples to return an $ε$-optimal policy with high probability. Surprisingly, this implies that whenever $c_{\min} = 0$ an SSP problem may not be learnable, thus revealing that learning in SSPs is strictly harder than in the finite-horizon and discounted settings. We complement this lower bound with an algorithm that matches it, up to logarithmic factors, in the general case, and an algorithm that matches it up to logarithmic factors even when $c_{\min} = 0$, but only under the condition that the optimal policy has a bounded hitting time to the goal state.

2604.16087 2026-04-20 cs.LG stat.ML

The Harder Path: Last Iterate Convergence for Uncoupled Learning in Zero-Sum Games with Bandit Feedback

Côme Fiegel, Pierre Ménard, Tadashi Kozuno, Michal Valko, Vianney Perchet

Comments Accepted at the 42nd International Conference on Machine Learning (ICML 2025)

详情
英文摘要

We study the problem of learning in zero-sum matrix games with repeated play and bandit feedback. Specifically, we focus on developing uncoupled algorithms that guarantee, without communication between players, the convergence of the last-iterate to a Nash equilibrium. Although the non-bandit case has been studied extensively, this setting has only been explored recently, with a bound of $\mathcal{O}(T^{-1/8})$ on the exploitability gap. We show that, for uncoupled algorithms, guaranteeing convergence of the policy profiles to a Nash equilibrium is detrimental to the performance, with the best attainable rate being $Ω(T^{-1/4})$ in contrast to the usual $Ω(T^{-1/2})$ rate for convergence of the average iterates. We then propose two algorithms that achieve this optimal rate up to constant and logarithmic factors. The first algorithm leverages a straightforward trade-off between exploration and exploitation, while the second employs a regularization technique based on a two-step mirror descent approach.

2604.16086 2026-04-20 cs.CV cs.AI cs.LG stat.ML

Stylistic-STORM (ST-STORM) : Perceiving the Semantic Nature of Appearance

Hamed Ouattara, Pierre Duthon, Pascal Houssam Salmane, Frédéric Bernardin, Omar Ait Aider

Comments 20 pages, 16 figures, ICPR 2026 (28th International Conference on Pattern Recognition)

详情
英文摘要

One of the dominant paradigms in self-supervised learning (SSL), illustrated by MoCo or DINO, aims to produce robust representations by capturing features that are insensitive to certain image transformations such as illumination, or geometric changes. This strategy is appropriate when the objective is to recognize objects independently of their appearance. However, it becomes counterproductive as soon as appearance itself constitutes the discriminative signal. In weather analysis, for example, rain streaks, snow granularity, atmospheric scattering, as well as reflections and halos, are not noise: they carry the essential information. In critical applications such as autonomous driving, ignoring these cues is risky, since grip and visibility depend directly on ground conditions and atmospheric conditions. We introduce ST-STORM, a hybrid SSL framework that treats appearance (style) as a semantic modality to be disentangled from content. Our architecture explicitly separates two latent streams, regulated by gating mechanisms. The Content branch aims at a stable semantic representation through a JEPA scheme coupled with a contrastive objective, promoting invariance to appearance variations. In parallel, the Style branch is constrained to capture appearance signatures (textures, contrasts, scattering) through feature prediction and reconstruction under an adversarial constraint. We evaluate ST-STORM on several tasks, including object classification (ImageNet-1K), fine-grained weather characterization, and melanoma detection (ISIC 2024 Challenge). The results show that the Style branch effectively isolates complex appearance phenomena (F1=97% on Multi-Weather and F1=94% on ISIC 2024 with 10% labeled data), without degrading the semantic performance (F1=80% on ImageNet-1K) of the Content branch, and improves the preservation of critical appearance

2604.08809 2026-04-20 cs.LG stat.AP

Structural Evaluation Metrics for SVG Generation via Leave-One-Out Analysis

Haonan Zhu, Adrienne Deganutti, Elad Hirsch, Purvanshi Mehta

详情
英文摘要

SVG generation is typically evaluated by comparing rendered outputs to reference images, which captures visual similarity but not the structural properties that make SVG editable, decomposable, and reusable. Inspired by the classical jackknife, we introduce element-level leave-one-out (LOO) analysis. The procedure renders the SVG with and without each element, which yields element-level signals for quality assessment and structural analysis. From this single mechanism, we derive (i) per-element quality scores that enable zero-shot artifact detection; (ii) element-concept attribution via LOO footprints crossed with VLM-grounded concept heatmaps; and (iii) four structural metrics: purity, coverage, compactness, and locality, which quantify SVG modularity from complementary angles. These metrics extend SVG evaluation from image similarity to code structure, enabling element-level diagnosis and comparison of how visual concepts are represented, partitioned, and organized within SVG code. Their practical relevance is validated on over 19,000 edits (5 types) across 5 generation systems and 3 complexity tiers.

2604.08691 2026-04-20 math.ST cs.CC math.PR stat.TH

Planted clique detection and recovery from the hypergraph adjacency matrix

Kalle Alaluusua, B. R. Vinay Kumar

Comments 45 pages. This revision fixes a measurability issue in the leave--one--out proof by separating a measurable eigenvector representative from the subsequent sign choice. It also removes an unnecessary factor left over from an earlier modification, which makes the argument more transparent

详情
英文摘要

Hypergraph data are often projected onto a weighted graph by constructing an adjacency matrix whose $(i,j)$ entry counts the number of hyperedges containing both nodes $i$ and $j$. This reduction is computationally convenient, but it can lose information: distinct hypergraphs may induce the same matrix, and the matrix entries are generally dependent because each hyperedge contributes to multiple pairs. We study the planted clique problem under this matrix-only observation model. For detection, we show that a spectral norm test is asymptotically powerful at the $\sqrt{n}$ scale, with explicit dependence on the background hyperedge probability $p$. For recovery, we analyze a polynomial-time spectral method based on the leading eigenvector and prove exact recovery at the canonical $\sqrt{n}$ scale, again with explicit dependence on $p$. We also extend both results to sparse regimes in which the hyperedge probability may depend on \(n\). Our analysis adapts a leave--one--out eigenvector framework to this setting. These results provide rigorous detection and recovery guarantees when only the adjacency matrix is observed.

2603.03188 2026-04-20 stat.ML cs.LG

Scalable Posterior Uncertainty for Flexible Density-Based Clustering

Nicola Bariletto, Stephen G. Walker

详情
英文摘要

We introduce a novel framework for uncertainty quantification in clustering that combines martingale posterior distributions with density-based clustering. Unlike classical model-based approaches, which define clusters at the latent level of a mixture model, we treat clusters as explicit functionals of the data-generating density, without assuming any specific parametric form. To characterize density uncertainty, we obtain martingale posterior samples via a predictive resampling scheme driven by model score evaluations. This allows us to leverage state-of-the-art differentiable density estimators, such as normalizing flows, making density resampling efficient in large-scale settings and fully parallelizable on modern GPU hardware. Martingale posterior samples of the clustering structure are then obtained by applying density-based clustering to the density draws, enabling principled inference on any clustering-related quantity. Casting the inference target as a density functional further enables a rigorous theoretical analysis of the procedure's convergence properties. We apply our methodology to image and single-cell RNA sequencing data, demonstrating the computational efficiency afforded by its GPU compatibility as well as its ability to recover meaningful clustering structures, with associated uncertainty, across diverse domains.

2512.14504 2026-04-20 stat.ME

A flexible class of latent variable models for the analysis of antibody response data

Emanuele Giorgi, Jonas Wallin

Comments This is a working paper, and updated versions will be released in the future. For further information about this research, please contact Emanuele Giorgi at e.giorgi@bham.ac.uk

详情
英文摘要

Existing approaches to modelling antibody concentration data are mostly based on finite mixture models that rely on the assumption that individuals can be divided into two distinct groups: seronegative and seropositive. Here, we challenge this dichotomous modelling assumption and propose a latent variable modelling framework in which the immune status of each individual is represented along a continuum of latent seroreactivity, ranging from minimal to strong immune activation. This formulation provides greater flexibility in capturing age-related changes in antibody distributions while preserving the full information content of quantitative measurements. We show that the proposed class of models can accommodate a large variety of model formulations, both mechanistic and regression-based, and also includes finite mixture models as a special case. We also propose a computationally efficient $L_2$-based estimator as an alternative to maximum likelihood estimation, which substantially reduces computational cost, and we establish its consistency. Through a case study on malaria serology, we demonstrate how the flexibility of the novel framework enables joint analyses across all ages while accounting for changes in transmission patterns. We conclude by outlining extensions of the proposed modelling framework and its relevance to other omics applications.

2508.12886 2026-04-20 stat.AP

Forecasting Extreme Day and Night Heat in Paris: A Proof of Concept

Richard Berk

Comments 5 figures and 2 pseudocode tables. Revised with new technical material added. Prose edited. References updated

详情
英文摘要

As a form of "small A", quantile machine learning is used to forecast diurnal and nocturnal $Q(.90)$ air temperatures for Paris, France from late spring through the summer months of 2021. The data are provided by the Paris-Montsouris weather station. Rather than trying to directly anticipate the onset and cessation of reported heat waves, Q(.90) values are estimated. The 90th percentile is chosen so that exceedances represent relatively rare and extreme conditions. Predictors include eight routinely available indicators of weather conditions, lagged by 14 days. Using holdout data, the temperature forecasts are produced two weeks in advance. Adaptive conformal prediction regions are computed that, under exchangeability, provide provably valid finite-sample coverage of forecasting uncertainty. For both diurnal and nocturnal temperatures, forecasting accuracy in the holdout data is promising, and sound measures of uncertainty are coupled with a novel decision-making framework. Benefits for policy and practice follow.

2507.05701 2026-04-20 stat.ME

Area-based epigraph and hypograph indices for functional outlier detection

Belen Pulido, Alba M. Franco-Pereira, Rosa E. Lillo, Fabian Scheipl

Comments 24 pages

详情
英文摘要

Detecting outliers in Functional Data Analysis is challenging because curves can stray from the majority in many different ways. The Modified Epigraph Index (MEI) and Modified Hypograph Index (MHI) rank functions by the fraction of the domain on which one curve lies above or below another. While effective for spotting shape anomalies, their construction limits their ability to flag magnitude outliers. This paper introduces two new metrics, the Area-Based Epigraph Index (ABEI) and Area-Based Hypograph Index (ABHI) that quantify the area between curves, enabling simultaneous sensitivity to both magnitude and shape deviations. Building on these indices, we present EHyOut, a robust procedure that recasts functional outlier detection as a multivariate problem: for every curve, and for its first and second derivatives, we compute ABEI and ABHI and then apply multivariate outlier-detection techniques to the resulting feature vectors. Extensive simulations show that EHyOut remains stable across a wide range of contamination settings and often outperforms established benchmark methods. Moreover, applications to Spanish weather data and United Nations world population data further illustrate the practical utility and meaningfulness of this methodology.

2502.15036 2026-04-20 math.ST stat.TH

Extreme Value Analysis based on Blockwise Top-Two Order Statistics

Axel Bücher, Erik Haufs

Comments 96 pages

详情
英文摘要

Extreme value analysis for time series is often based on the block maxima method, in particular for environmental applications. In the classical univariate case, the latter is based on fitting an extreme-value distribution to the sample of (annual) block maxima. Mathematically, the target parameters of the extreme-value distribution also show up in limit results for other high order statistics, which suggests estimation based on blockwise large order statistics. It is shown that a naive approach based on maximizing an independence log-likelihood yields an estimator that is inconsistent in general. A consistent, bias-corrected estimator is proposed, and is analyzed theoretically and in finite-sample simulation studies. The new estimator is shown to be more efficient than traditional counterparts, for instance for estimating large return levels or return periods.

2412.19363 2026-04-20 cs.AI cs.LG stat.ME stat.ML

Large Language Models for Market Research: A Data-augmentation Approach

Mengxin Wang, Dennis J. Zhang, Heng Zhang

详情
英文摘要

Large Language Models (LLMs) have transformed artificial intelligence by excelling in complex natural language processing tasks. Their ability to generate human-like text has opened new possibilities for market research, particularly in conjoint analysis, where understanding consumer preferences is essential but often resource-intensive. Traditional survey-based methods face limitations in scalability and cost, making LLM-generated data a promising alternative. However, while LLMs have the potential to simulate real consumer behavior, recent studies highlight a significant gap between LLM-generated and human data, with biases introduced when substituting between the two. In this paper, we address this gap by proposing a novel statistical data augmentation approach that efficiently integrates LLM-generated data with real data in conjoint analysis. This results in statistically robust estimators with consistent and asymptotically normal properties, in contrast to naive approaches that simply substitute human data with LLM-generated data, which can exacerbate bias. We further present a finite-sample performance bound on the estimation error. We validate our framework through an empirical study on COVID-19 vaccine preferences, demonstrating its superior ability to reduce estimation error and save data and costs by 24.9% to 79.8%. In contrast, naive approaches fail to save data due to the inherent biases in LLM-generated data compared to human data. Another empirical study on sports car choices validates the robustness of our results. Our findings suggest that while LLM-generated data is not a direct substitute for human responses, it can serve as a valuable complement when used within a robust statistical framework.

2409.01794 2026-04-20 stat.ME cs.LG stat.ML

Estimating Joint Interventional Distributions from Marginal Interventional Data

Sergio Hernan Garrido Mejia, Elke Kirschbaum, Armin Kekić, Bernhard Schölkopf, Atalanti Mastakouri

Comments Accepted at the Causal Reasoning and Learning (CLeaR) conference 2026

详情
英文摘要

In this paper we show how to exploit interventional data to acquire the joint conditional distribution of all the variables using the Maximum Entropy principle. To this end, we extend the Causal Maximum Entropy method to make use of interventional data in addition to observational data. Using Lagrange duality, we prove that the solution to the Causal Maximum Entropy problem with interventional constraints lies in the exponential family, as in the Maximum Entropy solution. Our method allows us to perform two tasks of interest when marginal interventional distributions are provided for any subset of the variables. First, we show how to perform causal feature selection from a mixture of observational and single-variable interventional data, and, second, how to infer joint interventional distributions. For the former task, we show on synthetically generated data, that our proposed method outperforms the state-of-the-art method on merging datasets, and yields comparable results to the KCI-test which requires access to joint observations of all variables.

2407.14781 2026-04-20 math.ST cs.NA math.AP math.NA math.PR stat.TH

Bernstein-von Mises theorems for time evolution equations

Richard Nickl

Comments 54 pages

详情
英文摘要

We consider a class of infinite-dimensional dynamical systems driven by non-linear parabolic partial differential equations with initial condition $θ$ modelled by a Gaussian process `prior' probability measure. Given discrete samples of the state of the system evolving in space-time, one obtains updated `posterior' measures on a function space containing all possible trajectories. We give a general set of conditions under which these non-Gaussian posterior distributions are approximated, in Wasserstein distance for the supremum-norm metric, by the law of a Gaussian random function. We demonstrate the applicability of our results to periodic non-linear reaction diffusion equations \begin{align*} \frac{\partial}{\partial t} u - Δu &= f(u) \\ u(0) &= θ\end{align*} where $f$ is any smooth and compactly supported reaction function. In this case the limiting Gaussian measure can be characterised as the solution of a time-dependent Schrödinger equation with `rough' Gaussian initial conditions whose covariance operator we describe.

2604.16031 2026-04-20 stat.ME stat.AP

A Comparison of Joint and Stepwise Dynamic Cognitive Diagnostic Models

Yawen Ma, Anastasia Ushakova, Kate Cain, Gabriel Wallin

Comments 14 pages, 7 tables

详情
英文摘要

To extend cognitive diagnostic models (CDMs) to longitudinal settings, stepwise approaches that integrate a CDM model with a latent transition model and covariates are widely used due to their flexibility. Previous research has shown that stepwise estimation can yield biased results, motivating classification-error correction as a means of improving inference over uncorrected stepwise procedures. In this study, we evaluate a unified Bayesian dynamic cognitive diagnostic model that jointly estimates measurement (item parameters, latent attribute profiles) and transition components (transition parameters) in longitudinal settings with covariates. We compare this joint approach with the bias-corrected stepwise latent transition CDM through a Monte Carlo study. Results demonstrate that joint modeling provides more accurate recovery of transition parameters, particularly under limited test length and sample size, underscoring its advantages for longitudinal diagnostic analysis and offering practical guidance for applied researchers.

2604.15980 2026-04-20 math.ST stat.TH

Decompounding on Compact Symmetric Spaces

Erik Kennerland

详情
英文摘要

This paper examines a stochastic deconvolution problem on compact symmetric spaces which is referred to as decompounding. This involves estimating the step distributions of a random walk, where in addition the number of steps between observations is unknown. The harmonic analysis of symmetric spaces is used to construct an estimator to the problem which converges in mean squared error, extending and improving on the analogous problem on compact Lie groups. The rates of convergence are shown to coincide with asymptotic lower bounds of density estimation in Euclidean space. We provide proofs that while the same rates hold for general density estimation problems in compact symmetric spaces, the decompounding problem lies in a subclass of these with different lower bounds depending on the rank of the space. Consequently, the optimality of the estimator depends on the rank of the symmetric space. Decompounding is a broad problem which appears in applications ranging from mathematical finance to wave optics, and the extension to compact symmetric spaces covers manifolds that commonly appear in the statistics literature.

2604.15940 2026-04-20 cs.LG stat.AP

(Weighted) Adaptive Radius Near Neighbor Search: Evaluation for WiFi Fingerprint-based Positioning

Khang Le, Joaquín Torres-Sospedra, Philipp Müller

Comments 11 pages, 2 figures, 2 tables, submitted to IPIN 2026

详情
英文摘要

Fixed Radius Near Neighbor (FRNN) search is an alternative to the widely used k Nearest Neighbors (kNN) search. Unlike kNN, FRNN determines a label or an estimate for a test sample based on all training samples within a predefined distance. While this approach is beneficial in certain scenarios, assuming a fixed maximum distance for all training samples can decrease the accuracy of the FRNN. Therefore, in this paper we propose the Adaptive Radius Near Neighbor (ARNN) and the Weighted ARNN (WARNN), which employ adaptive distances and in latter case weights. All three methods are compared to kNN and twelve of its variants for a regression problem, namely WiFi fingerprinting indoor positioning, using 22 different datasets to provide a comprehensive analysis. While the performances of the tested FRNN and ARNN versions were amongst the worse, three of the four best methods in the test were WARNN versions, indicating that using weights together with adaptive distances achieves performance comparable or even better than kNN variants.

2604.15889 2026-04-20 stat.CO

Markov embedding of ranked unlabelled evolutionary trees and its applications

Lasse Thorup Fallesen, Simon Pauli, Elisabeth Sommer James, Lars Nørvang Andersen, Asger Hobolth

Comments 26 pages, 15 figures

详情
英文摘要

Rooted bifurcating trees are mathematical objects used to model evolutionary relationships and arise naturally in both coalescent theory and phylogenetics. Recent numerical representations of tree topologies, known as F-matrices, allow for summarizing a sample of trees via Fréchet means and provide new measures of tree balance. However, the number of ranked unlabelled trees grows super-exponentially with the number of leaves. This makes computation intensive and current methods rely on mixed integer programming and simulation-based methods. Moreover, F-matrices are difficult to interpret, and their distribution is only described in terms of first- and second-order moments under neutral branching. In this paper, we introduce a Markov chain embedding of ranked and unlabelled trees that drastically decreases the size of the state space. Leveraging this embedding, we develop an algorithm that efficiently computes all Fréchet means and use discrete phase-type theory to obtain the joint distribution of tree balance indices. We also use discrete phase-type theory to generalize previous results regarding moments of F-matrices to arbitrary order for any time homogeneous and bifurcating coalescent model. Using this framework, we construct three tests for neutrality and demonstrate their improved power compared to previous methods on simulated data.

2604.15773 2026-04-20 cond-mat.stat-mech cs.AI stat.ME

Phase Transitions as the Breakdown of Statistical Indistinguishability

Taiyo Narita, Hideyuki Miyahara

详情
英文摘要

We introduce a novel characterization of phase transitions based on hypothesis testing. In our formulation, a phase transition is defined as the breakdown of statistical indistinguishability under vanishing parameter perturbations in the thermodynamic limit. This perspective provides a general, order-parameter-free framework that does not rely on model-specific insights or learning procedures. We show that conventional approaches, such as those based on the Binder parameter, can be reinterpreted as special cases within this framework. As a concrete realization, we employ a distribution-free two-sample run test and demonstrate that the critical point of the two-dimensional Ising model is accurately identified without prior knowledge of the order parameter.

2604.15742 2026-04-20 cs.LG hep-th stat.ML

Collective Kernel EFT for Pre-activation ResNets

Hidetoshi Kawase, Toshihiro Ota

Comments 20 pages

详情
英文摘要

In finite-width deep neural networks, the empirical kernel $G$ evolves stochastically across layers. We develop a collective kernel effective field theory (EFT) for pre-activation ResNets based on a $G$-only closure hierarchy and diagnose its finite validity window. Exploiting the exact conditional Gaussianity of residual increments, we derive an exact stochastic recursion for $G$. Applying Gaussian approximations systematically yields a continuous-depth ODE system for the mean kernel $K_0$, the kernel covariance $V_4$, and the $1/n$ mean correction $K_{1,\mathrm{EFT}}$, which emerges diagrammatically as a one-loop tadpole correction. Numerically, $K_0$ remains accurate at all depths. However, the $V_4$ equation residual accumulates to an $O(1)$ error at finite time, primarily driven by approximation errors in the $G$-only transport term. Furthermore, $K_{1,\mathrm{EFT}}$ fails due to the breakdown of the source closure, which exhibits a systematic mismatch even at initialization. These findings highlight the limitations of $G$-only state-space reduction and suggest extending the state space to incorporate the sigma-kernel.

2604.15696 2026-04-20 stat.ME math.PR

Testing and estimation of the index of stability of univariate and bivariate symmetric $α-$stable distributions via modified Greenwood statistic

Katarzyna Skowronek, Marek Arendarczyk, Anna K. Panorska, Tomasz J. Kozubowski, Agnieszka Wyłomańska

详情
英文摘要

We propose a testing and estimation methodology for univariate and bivariate symmatric $α$-stable distributions using a modified version of the Greenwood statistic. Originally designed for positive-valued random variables, the Greenwood statistic, and its modified version tailored for symmetric distributions, have been predominantly applied to univariate random samples. In this paper, we extend the modified Greenwood statistic to a bivariate setting and examine its probabilistic properties within the class of $α$-stable distributions, with a focus on the sub-Gaussian case. Additionally, we introduce a novel testing approach that considers two variations of the modified Greenwood statistic as test statistics for the bivariate case. In the univariate setting, we adapt the proposed testing methodology for estimating the stability index. The simulation studies presented demonstrate that our proposed methodology outperforms classical approaches previously used in this context and serves as an effective tool for distinguishing between Gaussian and $α$-stable distributions with a stability index close to 2. The theoretical and simulation results are further illustrated with practical data examples.

2604.15544 2026-04-20 stat.AP stat.ME

Practical Process Capability Indices Workflows

Fei Jiang, Lei Yang

Comments 12 pages, 5 figures and 5 tables

详情
英文摘要

This paper presents a comprehensive review of univariate process capability indices (PCIs), which are critical metrics for assessing how effectively a manufacturing process satisfies customer specifications based on a single quality characteristic. The primary objective of this review is to develop practical procedural workflows for conducting process capability analysis under various preconditions, including those less frequently addressed scenarios in existing literature. Key analytical components, such as outlier detection, normality test, and best distribution fitting, are integrated into the proposed framework to ensure accurate and robust capability assessments. By systematically evaluating a range of methodologies, this study offers guidance for researchers and practitioners in selecting the most appropriate PCIs for specific process conditions. Ultimately, the work aims to simplify the complexity of PCI analysis while enhancing its precision and utility in quality control and process improvement efforts.

2604.15538 2026-04-20 stat.ML cs.LG

PRIM-cipal components analysis

Tianhao Liu, Daniel Andrés Díaz-Pachón, J. Sunil Rao

Comments 12 pages, 46 figures

详情
英文摘要

Supervised No Free Lunch Theorems (NFLTs) are well studied, yet unsupervised NFLTs remain underexplored. For elliptical distributions, we prove that there exist two equally optimal, scientifically meaningful bump-hunting strategies that are exact opposites, with no universal winner. Specifically, peeling $k$ orthogonal dimensions from $\mathbb{R}^d$ ($d \ge k$), retaining an inter-quantile region of probability $1-α$ per peeled dimension, maximizes total variance and Frobenius norm when the $k$ smallest principal components (called pettiest components) are selected, and minimizes them when the selected dimensions are the $k$ leading principal components. These optima inspire PRIM-based bump-hunting algorithms either by minimizing variance or by minimizing volume, thereby motivating an NFLT. We test our results on the Fashion-MNIST database, showing that peeling the largest principal components captures multiplicity, while peeling the smallest principal components isolates popular styles.

2604.15531 2026-04-20 q-fin.ST stat.ME stat.ML

Spurious Predictability in Financial Machine Learning

Sotirios D. Nikolopoulos

Comments 49 pages, 10 figures. The QuantAudit R package and full replication scripts will be made publicly available upon journal publication

详情
英文摘要

Adaptive specification search generates statistically significant backtests even under martingale-difference nulls. We introduce a falsification audit testing complete predictive workflows against synthetic reference classes, including zero-predictability environments and microstructure placebos. Workflows generating significant walk-forward evidence in these environments are falsified. For passing workflows, we quantify selection-induced performance inflation using an absolute magnitude gap linking optimized in-sample evidence to disjoint walk-forward realizations, adjusted for effective multiplicity. Simulations validate extreme-value scaling under correlated searches and demonstrate detection power under genuine structure. Empirical case studies confirm that many apparent findings represent methodological artifacts rather than genuine predictability.

2604.15504 2026-04-20 cs.SI stat.AP

A Quasi-Experiment comparing the health of unhoused people who have and have not experienced an eviction in King County, WA

Ihsan Kahveci, Timothy A. Thomas, Nathalie E. Williams, Janelle Rothfolk, Cathea Carey, Paul Hebert, Amy Hagopian, Zack W. Almquist

详情
英文摘要

Home eviction poses a significant threat to housing stability, a critical determinant of health. This study examines the relationship between eviction and health and substance use within the unhoused population of King County, Washington. Using a sample of 1,106 individuals experiencing homelessness, we employed a quasi-experimental design to compare the health outcomes of those who have experienced eviction with those who have not. Our findings reveal eviction is associated with an 8.3% point increase (SE = 0.039) in the likelihood of reporting poor general health and an 9.5% increase (SE = 0.032) in substance use disorder. No significant effect was found for mental health outcomes. While these results highlight the severe health risks linked to eviction, further research with more precise estimates is necessary to better understand long-term effects. These findings contribute to the growing evidence of how home eviction undermines the well-being of vulnerable populations.

2604.15469 2026-04-20 stat.ME

Sample continuation in Bayesian hierarchical model via variational inference

Yucong Liu, Zilai Si, Alexander Strang

详情
英文摘要

Posterior distributions arising in ill-posed Bayesian inverse problems are often both analytically intractable and highly sensitive to parameters of the chosen prior family. We aim to understand the sensitivity of intractable posterior distributions to changes in prior assumptions by tracking how a sample representation of the posterior changes as the prior parameters change. This enables sensitivity analysis for small perturbations in the prior, providing insights into the robustness of the posterior estimates under minor changes in assumptions. It also allows solution continuation when dealing with significant alterations in prior beliefs, facilitating a comprehensive understanding of how large shifts in assumptions affect the posterior distribution. We focus on a class of non-conjugate hierarchical models tailored to encourage sparsity in linear inverse problems. The specific hierarchical model of interest is chosen since it is parameterized by a small number of shape parameters, and includes most classical sparsity promoting priors as special cases. As the shape parameters change, the posterior can transition continuously from a tractable unimodal distribution to an intractable multimodal distribution. To track the change in the posterior, we adopt particle based variational inference methods, specifically Stein Variational Gradient Descent (SVGD). SVGD iteratively updates a set of samples to minimize the KL-divergence away from a desired target distribution. We augment SVGD by Birth-Death sampling, which can efficiently exchange mass between separated modes, while simultaneously optimizing the kernel bandwidth used to derive the SVGD update. This method enables the discovery of new modes by tracing the modes as they branch out of a simpler, unimodal posterior, derived within the same family of priors.

2604.15452 2026-04-20 stat.ME stat.CO

Spatially continuous modelling of aggregated outcome data

Stephen Jun Villejo, Peter Diggle, Finn Lindgren, Haavard Rue, Guangquan Li, Ella White, Matthew Wade, Marta Blangiardo

详情
英文摘要

This work develops a block aggregation approach to spatial estimation and prediction when the response is observed at a coarse spatial scale, for example as counts of events in administrative areas, or blocks, while covariates are available at a finer spatial resolution, typically as raster images. Our approach specifies a linear predictor at the finer resolution as a combination of covariate effects and a latent, spatially continuous Gaussian process. This linear predictor then determines the distribution of the response through an inverse link function and spatial integration. We use a simulation study to evaluate the performance of the proposed approach in comparison to two industry standard approaches: a traditional geostatistical model that associates each response with the centroid of its block; and a Markov random field (MRF) approach that aggregates covariate data to block-level. As expected, the differences in performance among the three approaches are small with respect to block-level prediction. The rationale for, and advantage of, the block aggregation approach lies in its delivery of reliable inferences at whatever spatial resolution is required in a particular application. We describe two applications: a linear Gaussian sampling model of wastewater virus concentrations in England, using population density as covariate; and log-linear Poisson model of cardiovascular hospitalisations in England using socio-demographic variables at fine-scale administrative units as covariates.

2604.15392 2026-04-20 cs.LG cs.AI stat.ML

Lightweight Geometric Adaptation for Training Physics-Informed Neural Networks

Kang An, Chenhao Si, Shiqian Ma, Ming Yan

Comments 22 pages, Chenhao Si and Kang An contributed equally to this work. Their authorship order was determined randomly

详情
英文摘要

Physics-Informed Neural Networks (PINNs) often suffer from slow convergence, training instability, and reduced accuracy on challenging partial differential equations due to the anisotropic and rapidly varying geometry of their loss landscapes. We propose a lightweight curvature-aware optimization framework that augments existing first-order optimizers with an adaptive predictive correction based on secant information. Consecutive gradient differences are used as a cheap proxy for local geometric change, together with a step-normalized secant curvature indicator to control the correction strength. The framework is plug-and-play, computationally efficient, and broadly compatible with existing optimizers, without explicitly forming second-order matrices. Experiments on diverse PDE benchmarks show consistent improvements in convergence speed, training stability, and solution accuracy over standard optimizers and strong baselines, including on the high-dimensional heat equation, Gray--Scott system, Belousov--Zhabotinsky system, and 2D Kuramoto--Sivashinsky system.

2604.11305 2026-04-20 cs.LG cs.IT math.IT stat.ML

Beyond Fixed False Discovery Rates: Post-Hoc Conformal Selection with E-Variables

Meiyi Zhu, Osvaldo Simeone

Comments 32 pages, 29 figures

详情
英文摘要

Conformal selection (CS) uses calibration data to identify test inputs whose unobserved outcomes are likely to satisfy a pre-specified minimal quality requirement, while controlling the false discovery rate (FDR). Existing methods fix the target FDR level before observing data, which prevents the user from adapting the balance between number of selected test inputs and FDR to downstream needs and constraints based on the available data. For example, in genomics or neuroimaging, researchers often inspect the distribution of test statistics, and decide how aggressively to pursue candidates based on observed evidence strength and available follow-up resources. To address this limitation, we introduce {post-hoc CS} (PH-CS), which generates a path of candidate selection sets, each paired with a data-driven false discovery proportion (FDP) estimate. PH-CS lets the user select any operating point on this path by maximizing a user-specified utility, arbitrarily balancing selection size and FDR. Building on conformal e-variables and the e-Benjamini-Hochberg (e-BH) procedure, PH-CS is proved to provide a finite-sample post-hoc reliability guarantee whereby the ratio between estimated FDP level and true FDP is, on average, upper bounded by $1$, so that the average estimated FDP is, to first order, a valid upper bound on the true FDR. PH-CS is extended to control quality defined in terms of a general risk. Experiments on synthetic and real-world datasets demonstrate that, unlike CS, PH-CS can consistently satisfy user-imposed utility constraints while producing reliable FDP estimates and maintaining competitive FDR control.

2604.10013 2026-04-20 stat.ME math.OC

Toward Exact Convergence in Byzantine-Robust Decentralized Learning: A Statistical Identification Approach

Siyuan Zhang, Chengde Qian, Xin Liu, Changliang Zou

Comments 52 pages, 7 figures

详情
英文摘要

To defend against Byzantine attacks in decentralized learning, most existing methods rely on robust aggregation rules to mitigate the influence of malicious machines. However, these strategies inherently introduce bias, leading to inexact convergence with non-vanishing steady-state errors. In this paper, we propose a strategic shift from passive aggregation to active identification by introducing the Decentralized Rescaled Stochastic Gradient Descent with Byzantine Machine Identification (DRSGD-ByMI) framework. The core of our approach is an identification-based ``detect-then-optimize'' pipeline, where a p-value-free detection procedure is developed to accurately prune malicious nodes from the network. By leveraging sample-splitting score statistics, this identification mechanism achieves false discovery rate control without requiring restrictive distributional assumptions. We theoretically demonstrate that this precise identification allows the decentralized network to recover sufficient connectivity among the normal nodes, thereby enabling DRSGD-ByMI to match, even in the presence of Byzantine machines, the same order-optimal convergence rate as standard decentralized stochastic first-order methods. Numerical experiments validate our theoretical results and demonstrate the effectiveness of DRSGD-ByMI for decentralized robust learning problems.

2604.07770 2026-04-20 stat.ME

Efficient Targeted Maximum Likelihood Estimation of Average Treatment Effects under Structured Outcome Models with Unknown Error Distributions

Mijeong Kim

详情
英文摘要

We study targeted maximum likelihood estimation (TMLE) of the average treatment effect in a semiparametric regression model whose mean function is indexed by a finite-dimensional parameter, while the additive error distribution is left unspecified apart from mild regularity conditions and independence from treatment and baseline covariates. The paper addresses a genuinely new causal problem: because the target depends on both the regression parameter and the unrestricted marginal law of the covariates, the regression-efficient score must be converted into a causal efficient influence function, semiparametric efficiency bound, and targeting step for the average treatment effect itself. We derive those objects, construct a cross-fitted TMLE, and establish asymptotic linearity and efficiency. In simulations, the proposed estimator is most effective when the mean is correctly structured but the error law is heavy-tailed or skewed. In these settings, it yields smaller root mean squared error and shorter intervals than Gaussian working-model inference, a standard augmented inverse-probability-weighted estimator, Bayesian additive regression trees, and a forest-based TMLE benchmark. Misspecification experiments are included to clarify the scope of the method rather than to claim universal superiority under broad mean-model failure.

2604.00843 2026-04-20 math.AP cs.NA math.NA math.PR math.ST stat.TH

Sharp local sparsity of regularized optimal transport

Alberto González-Sanz, Rishabh S. Gvalani, Lukas Koch

Comments 18 pages, no figures, fixed typo in first author's name

详情
英文摘要

In recent years, the use of entropy-regularized optimal transport with $L^p$-type entropies has become increasingly popular. In this setting, the solutions are sparse, in the sense that the support of the regularized optimal coupling, $\mathrm{supp}(π_\varepsilon)$, shrinks to the support of the original optimal transport problem as $\varepsilon \to 0$. The main open question concerns the rate of this convergence. In this paper, we obtain sharp local results away from the boundary. We prove that the supports $\mathrm{supp}(π_\varepsilon(\cdot \mid x))$ of the conditional measures, $π_\varepsilon(\cdot \mid x)$, behave like balls of radius $\varepsilon^\frac 1 {d(p-1)+2}$. This allows us to show that the regularized potentials are uniformly strongly convex and to derive the rate of convergence of these potentials toward their unregularized limit. Our results generalize the results of (González-Sanz and Nutz, SIAM J.~Math.~Anal.) and (Wiesel and Xu, Ibid.) to the multivariate case and beyond the case of self-transport.

2602.14630 2026-04-20 astro-ph.CO stat.ML

Bayesian Cosmic Void Finding with Graph Flows

Leander Thiele

Comments 8+3 pages, 9+2 figures; v2: Published in OJAp

详情
英文摘要

Cosmic voids contain higher-order cosmological information and are of interest for astroparticle physics. Finding genuine matter underdensities in sparse galaxy surveys is, however, an underconstrained problem. Traditional void finding algorithms produce deterministic void catalogs, neglecting the probabilistic nature of the problem. We present a method to sample from the stochastic mapping from galaxy catalogs to arbitrary void definitions. Our algorithm uses a deep graph neural network to evolve "test particles" according to a flow-matching objective. We demonstrate the method in a simplified example setting but outline steps to generalize it towards practically usable void finders. Trained on a deterministic teacher, the model performs well but has considerable stochasticity which we interpret as regularization. Cosmological information in the predicted void catalogs outperforms the teacher. On the one hand, our method can cheaply emulate existing void finders with apparently useful regularization. More importantly, it also allows us to find the Bayes-optimal mapping between observed galaxies and any void definition. This includes definitions operating at the level of simulated matter density and velocity fields.

2602.07006 2026-04-20 cs.CV cs.LG stat.ML

Scalable spatial point process models for forensic footwear analysis

Alokesh Manna, Neil Spencer, Dipak K. Dey

详情
英文摘要

Shoe print evidence recovered from crime scenes plays a key role in forensic investigations. By examining shoe prints, investigators can determine details of the footwear worn by suspects. However, establishing that a suspect's shoes match the make and model of a crime scene print may not be sufficient. Typically, thousands of shoes of the same size, make, and model are manufactured, any of which could be responsible for the print. Accordingly, a popular approach used by investigators is to examine the print for signs of ``accidentals,'' i.e., cuts, scrapes, and other features that accumulate on shoe soles after purchase due to wear. While some patterns of accidentals are common on certain types of shoes, others are highly distinctive, potentially distinguishing the suspect's shoe from all others. Quantifying the rarity of a pattern is thus essential to accurately measuring the strength of forensic evidence. In this study, we address this task by developing a hierarchical Bayesian model. Our improvement over existing methods primarily stems from two advancements. First, we frame our approach in terms of a latent Gaussian model, thus enabling inference to be efficiently scaled to large collections of annotated shoe prints via integrated nested Laplace approximations. Second, we incorporate spatially varying coefficients to model the relationship between shoes' tread patterns and accidental locations. We demonstrate these improvements through superior performance on held-out data, which enhances accuracy and reliability in forensic shoe print analysis.

2602.06105 2026-04-20 stat.ML cs.LG math.AG

Robustness Verification of Polynomial Neural Networks

Yulia Alexandr, Hao Duan, Guido Montúfar

详情
英文摘要

We study robustness verification of neural networks via metric algebraic geometry. For polynomial neural networks, certifying a robustness radius amounts to computing the distance to the algebraic decision boundary. We use the Euclidean distance (ED) degree as an intrinsic measure of the complexity of this problem, analyze the associated ED discriminant, and introduce a parameter discriminant that detects parameter values at which the ED degree drops. We derive formulas for the ED degree for several network architectures and characterize the expected number of real critical points in the infinite-width limit. We develop symbolic elimination methods to compute these quantities and homotopy-continuation methods for exact robustness certification. Finally, experiments on lightning self-attention modules reveal decision boundaries with strictly smaller ED degree than generic cubic hypersurfaces of the same ambient dimension.

2601.17734 2026-04-20 stat.ME

Group Permutation Testing in Linear Model: Sharp Validity, Power Improvement, and Extension Beyond Exchangeability

Zonghan Li, Hongyi Zhou, Zhiheng Zhang

Comments 74 pages, 3 figures. Includes supplementary material

详情
英文摘要

We consider finite-sample inference for a single regression coefficient in the fixed-design linear model $Y = Zβ+ bX + \varepsilon$, where $\varepsilon\in\mathbb{R}^n$ may exhibit complex dependence or heterogeneity. We develop a group permutation framework, yielding a unified and analyzable randomization structure for linear-model testing. Under exchangeable errors, we place permutation-augmented regression tests within this group-theoretic setting and show that a grouped version of PALMRT controls Type I error at level at most $2α$ for any permutation group; moreover, we provide an worst-case construction demonstrating that the factor $2$ is sharp and cannot be improved without additional assumptions. Second, we relate the Type II error to a design-dependent geometric separation. We formulate it as a combinatorial optimization problem over permutation groups and bound it under additional mild sub-Gaussian assumptions. For the Type II error upper bound control, we propose a constructive algorithm for the permutation strategy that is better (at least no worse) than the i.i.d. permutation, with simulations empirically indicating substantial power gains, especially under heavy-tailed designs. Finally, we extend group-based CPT and PALMRT beyond exchangeability by connecting rank-based randomization arguments to conformal inference. The resulting weighted group tests satisfy finite-sample Type I error bounds that degrade gracefully with a weighted average of total variation distances between $\varepsilon$ and its group-permuted versions, recovering exact validity when these discrepancies vanish and yielding quantitative robustness otherwise. Taken together, the group-permutation viewpoint provides a principled bridge from exact randomization validity to design-adaptive power and quantitative robustness under approximate symmetries.

2601.01854 2026-04-20 stat.ME

Causal inference for censored data with continuous marks

Lianqiang Qu, Long Lv, Liuquan Sun

Comments This paper is a replacement for the previous work titled "Causal inference for censored data with continuous marks." In the current version, we introduce a new definition of causal inference by considering the mark as a post-treatment variable. This approach offers a clearer causal interpretation compared to the previous version

详情
英文摘要

This paper presents a framework for causal inference in the presence of censored data,where the failure time is marked by a continuous variable referred to as a mark.The mark is observed after treatment and is not meaningful when the failure time is censored. In addition, due to the continuous nature of the marks, observations at each given mark are sparse. These facts make the identification and estimation of causality a challenging task. To address these issues, we define a new mark-specific treatment effect within the potential outcomes framework and characterize its identifying conditions. We then propose a local smoothing estimator for the causal effects and establish its asymptotic properties. We further develop testing methods to evaluate whether the treatment has an effect on the failure time when controlling the values of the mark at certain points or within a defined interval, and develop a Gaussian approximation method to obtain the critical values. We evaluate our method using simulation studies as well as a real dataset from the Antibody Mediated Prevention trials.

2510.21934 2026-04-20 cs.LG stat.ML

Joint Score-Threshold Optimization for Interpretable Risk Assessment

Fardin Ganjkhanloo, Emmett Springer, Erik H. Hoyer, Daniel L. Young, Kimia Ghobadi

详情
英文摘要

Risk assessment tools in healthcare commonly employ point-based scoring systems that map patients to ordinal risk categories via thresholds. While electronic health record (EHR) data presents opportunities for data-driven optimization of these tools, two fundamental challenges impede standard supervised learning: (1) labels are often available only for extreme risk categories due to intervention-censored outcomes, and (2) misclassification cost is asymmetric and increases with ordinal distance. We propose a mixed-integer programming (MIP) framework that jointly optimizes scoring weights and category thresholds in the face of these challenges. Our approach prevents label-scarce category collapse via threshold constraints, and utilizes an asymmetric, distance-aware objective. The MIP framework supports governance constraints, including sign restrictions, sparsity, and minimal modifications to incumbent tools, ensuring practical deployability in clinical workflows. We further develop a continuous relaxation of the MIP problem to provide warm-start solutions for more efficient MIP optimization. We apply the proposed score optimization framework to a case study of inpatient falls risk assessment using the Johns Hopkins Fall Risk Assessment Tool.

2510.12700 2026-04-20 cs.LG cs.AI cs.CG math.AT stat.ML

Topological Signatures of ReLU Neural Network Activation Patterns

Vicente Bosca, Tatum Rask, Sunia Tanweer, Andrew R. Tawfeek, Branden Stone

详情
Journal ref
Proc. 1st Conf. on Topology, Algebra, and Geometry in Data Science (TAG-DS 2025), PMLR 321:287-301, 2026
英文摘要

This paper explores the topological signatures of ReLU neural network activation patterns. We consider feedforward neural networks with ReLU activation functions and analyze the polytope decomposition of the feature space induced by the network. Mainly, we investigate how the Fiedler partition of the dual graph and show that it appears to correlate with the decision boundary -- in the case of binary classification. Additionally, we compute the homology of the cellular decomposition -- in a regression task -- to draw similar patterns in behavior between the training loss and polyhedral cell-count, as the model is trained.

2510.10959 2026-04-20 cs.LG cs.AI cs.CL stat.ML

Revisiting Entropy Regularization: Adaptive Coefficient Unlocks Its Potential for LLM Reinforcement Learning

Xiaoyun Zhang, Xiaojian Yuan, Di Huang, Wang You, Chen Hu, Jingqing Ruan, Ai Jian, Kejiang Chen, Xing Hu

Comments 16 pages, 4 figures

详情
英文摘要

Reasoning ability has become a defining capability of Large Language Models (LLMs), with Reinforcement Learning with Verifiable Rewards (RLVR) emerging as a key paradigm to enhance it. However, RLVR training often suffers from policy entropy collapse, where the policy becomes overly deterministic, hindering exploration and limiting reasoning performance. While entropy regularization is a common remedy, its effectiveness is highly sensitive to the fixed coefficient, making it unstable across tasks and models. In this work, we revisit entropy regularization in RLVR and argue that its potential has been largely underestimated. Our analysis shows that (i) tasks of varying difficulty demand distinct exploration intensities, and (ii) balanced exploration may require the policy entropy to be maintained within a moderate range below its initial level. Therefore, we propose Adaptive Entropy Regularization (AER)--a framework that dynamically balances exploration and exploitation via three components: difficulty-aware coefficient allocation, initial-anchored target entropy, and dynamic global coefficient adjustment. Experiments on multiple mathematical reasoning benchmarks show that AER consistently outperforms baselines, improving both reasoning accuracy and exploration capability.

2509.24397 2026-04-20 stat.AP

Assessing Roundabout Safety Perceptions under Heterogeneous Traffic: Socio-Demographic and Geometric Influences in Indian Urban Contexts

Abhijnan Maji, Indrajit Ghosh

详情
英文摘要

Evaluation of the safety perceptions of roundabout users is crucial for improving road safety in mixed-traffic environments. The crash- and conflict-based analyses do not incorporate the socio-demographic characteristics of the roundabout users, which can only be captured through questionnaire surveys on a larger scale. This research evaluated the relationship of roundabout safety perception with demographic factors, driving characteristics, and varying roundabout geometries using multiple correspondence analysis, cluster analysis, factor analysis, and multinomial logistic regression. The study analyzed data from 1,530 respondents across two Indian cities. The study identified three roundabout user clusters. Single-lane roundabouts were perceived as safer during entry and circulation, with a significant prominence among middle-aged users. In contrast, double- and multi-lane roundabouts presented higher perceived risks during exit maneuvers, especially among young, inexperienced, unemployed/self-employed users. Vulnerable road users reported significantly higher perceived risks, especially under suboptimal lighting conditions. Respondents with 10-20 years of driving experience, especially car users, perceived lower risk at single-lane roundabouts but acknowledged the higher risk linked to speed variations and complex maneuvers at multi-lane roundabouts. Driving experience, vehicle type, and geometric configurations were crucial in roundabout safety perception. The study highlighted the need to improve the built environment of roundabouts for vulnerable road users. The roundabout merging area was perceived as the most dangerous spot; however, exits were also perceived as dangerous for double- and multi-lane roundabouts. The findings can benefit policymakers, engineers, and urban planners by enabling them to deploy targeted safety interventions based on issues highlighted in the study.

2509.19104 2026-04-20 cs.LG stat.ML

Online Distributionally Robust LLM Alignment via Regression to Relative Reward

Sharan Sahu, Martin T. Wells

Comments 70 pages, 7 figures, 1 table

详情
英文摘要

Reinforcement Learning with Human Feedback (RLHF) has become crucial for aligning Large Language Models (LLMs) with human intent. However, existing offline RLHF approaches suffer from overoptimization, where language models degrade by overfitting inaccuracies and drifting from preferred behaviors observed during training. Distributionally robust optimization (DRO) is a natural solution, but existing DRO-DPO methods are sample-inefficient, ignore heterogeneous preferences, and lean on brittle heuristics. We introduce \emph{DRO-REBEL}, a family of robust online REBEL updates built on type-$p$ Wasserstein, Kullback-Leibler (KL), and $χ^2$ ambiguity sets. Strong duality reduces each update to a relative-reward regression, retaining REBEL's scalability without PPO-style clipping or value networks. Under linear rewards, log-linear policies, and a standard coverage condition, we prove $\widetilde{O}(\sqrt{d/n})$ bounds on squared parameter error, with sharper constants than prior DRO-DPO analyses, and give the first parametric $\widetilde{O}(d/n)$ rate for DRO-based alignment under preference shift, matching non-robust RLHF in benign regimes. Each divergence yields a tractable SGD-based algorithm: gradient regularization for Wasserstein, importance weighting for KL, and a 1-D dual solve for $χ^2$. On Emotion Alignment, the ArmoRM multi-objective benchmark, and HH-Alignment, DRO-REBEL outperforms prior robust and non-robust baselines across unseen preference mixtures, model sizes, and dataset scales.

2509.02772 2026-04-20 stat.ME stat.CO stat.ML

Inference on covariance structure in high-dimensional multi-view data

Lorenzo Mauri, David B. Dunson

Comments 22 pages including references (35 with appendix), 4 figures, 3 tables

详情
英文摘要

This article focuses on covariance estimation for multi-view data. Popular approaches rely on factor-analytic decompositions that have shared and view-specific latent factors. Posterior computation is conducted via expensive and brittle Markov chain Monte Carlo (MCMC) sampling or variational approximations that underestimate uncertainty and lack theoretical guarantees. Our proposed methodology employs spectral decompositions to estimate and align latent factors that are active in at least one view. Conditionally on these factors, we choose jointly conjugate prior distributions for factor loadings and residual variances. The resulting posterior is a simple product of normal-inverse gamma distributions for each variable, bypassing MCMC and facilitating posterior computation. We prove favorable increasing-dimension asymptotic properties, including posterior contraction and central limit theorems for point estimators. We show excellent performance in simulations, including accurate uncertainty quantification, and apply the methodology to integrate four high-dimensional views from a multi-omics dataset of cancer cell samples.

2507.04962 2026-04-20 stat.ME

Covariance test for discretely observed functional data: when and how it works?

Yang Zhou, Jin Yang, Fang Yao

Comments 35 pages, 2 figures, 1 table

详情
英文摘要

For covariance test in functional data analysis, existing methods are developed only for fully observed curves, whereas in practice, trajectories are typically observed discretely and with noise. To bridge this gap, we employ a pool-smoothing strategy to construct an FPC-based test statistic, allowing the number of estimated eigenfunctions to grow with the sample size. This yields a consistently nonparametric test, while the challenge arises from the concurrence of diverging truncation and discretized observations. Facilitated by advancing perturbation bounds of estimated eigenfunctions, we establish that the asymptotic null distribution remains valid across permissable truncation levels. Moreover, when the sampling frequency (i.e., the number of measurements per subject) reaches certain magnitude of sample size, the test behaves as if the functions were fully observed. This phase transition phenomenon differs from the well-known result of the pooling mean/covariance estimation, reflecting the elevated difficulty in covariance test due to eigen-decomposition. The numerical studies, including simulations and real data examples, yield favorable performance compared to existing methods.

2507.03759 2026-04-20 stat.ML cs.LG

Sequential Regression Learning with Randomized Algorithms

Dorival Leão, Reiko Aoki, Alberto Ohashi, Teh Led Red

详情
英文摘要

This paper presents ``randomized SINDy", a sequential machine learning algorithm designed for dynamic data that has a time-dependent structure. It employs a probabilistic approach, with its PAC learning property rigorously proven through the mathematical theory of functional analysis. The algorithm dynamically predicts using a learned probability distribution of predictors, updating weights via gradient descent and a proximal algorithm to maintain a valid probability density. Inspired by SINDy (Brunton et al. 2016), it incorporates feature augmentation and Tikhonov regularization. For multivariate normal weights, the proximal step is omitted to focus on parameter estimation. The algorithm's effectiveness is demonstrated through experimental results in regression and binary classification using real-world data.

2505.02636 2026-04-20 math.OC math.ST stat.TH

Phase retrieval and matrix sensing via benign and overparametrized nonconvex optimization

Andrew D. McRae

详情
英文摘要

We study a nonconvex optimization algorithmic approach to phase retrieval and the more general problem of semidefinite low-rank matrix sensing. Specifically, we analyze the nonconvex landscape of a quartic Burer-Monteiro factored least-squares optimization problem. We develop a new analysis framework, taking advantage of the semidefinite problem structure, to understand the properties of second-order critical points -- specifically, whether they (approximately) recover the ground truth matrix. We show that it can be helpful to (mildly) overparametrize the problem, that is, to optimize over matrices of higher rank than the ground truth. We then apply this framework to several well-studied problem instances: in addition to recovering existing state-of-the-art phase retrieval landscape guarantees (without overparametrization), we show that overparametrizing by a factor at most logarithmic in the dimension allows recovery with optimal statistical sample complexity and error for the problems of (1) phase retrieval with sub-Gaussian measurements and (2) more general semidefinite matrix sensing with rank-1 Gaussian measurements. Previously, such statistical results had been shown only for estimators based on semidefinite programming. More generally, our analysis is partially based on the powerful method of convex dual certificates, suggesting that it could be applied to a much wider class of problems.

2503.07976 2026-04-20 stat.ML cs.LG

Two-Dimensional Deep ReLU CNN Approximation for Korobov Functions: A Constructive Approach

Qin Fang, Lei Shi, Min Xu, Ding-Xuan Zhou

详情
英文摘要

This paper investigates approximation capabilities of two-dimensional (2D) deep convolutional neural networks (CNNs), with Korobov functions serving as a benchmark. We focus on 2D CNNs, comprising multi-channel convolutional layers with zero-padding and ReLU activations, followed by a fully connected layer. We propose a fully constructive approach for building 2D CNNs to approximate Korobov functions and provide a rigorous analysis of the complexity of the constructed networks. Our results demonstrate that 2D CNNs achieve near-optimal approximation rates under the continuous weight selection model, significantly alleviating the curse of dimensionality. This work provides a solid theoretical foundation for 2D CNNs and illustrates their potential for broader applications in function approximation.

2502.19312 2026-04-20 cs.LG cs.AI cs.CL cs.HC stat.ML

FSPO: Few-Shot Optimization of Synthetic Preferences Personalizes to Real Users

Anikait Singh, Sheryl Hsu, Kyle Hsu, Eric Mitchell, Stefano Ermon, Tatsunori Hashimoto, Archit Sharma, Chelsea Finn

Comments Website: https://fewshot-preference-optimization.github.io/

详情
英文摘要

Effective personalization of LLMs is critical for a broad range of user-interfacing applications such as virtual assistants and content curation. Inspired by the strong in-context capabilities of LLMs, we propose few-shot preference optimization (FSPO), an algorithm for LLM personalization that reframes reward modeling as a meta-learning problem. Under FSPO, an LLM learns to quickly infer a personalized reward function for a user via a few labeled preferences. FSPO also utilizes user description rationalization (RAT) to encourage better reward modeling and instruction following, recovering performance with the oracle user description. Since real-world preference data is challenging to collect at scale, we propose careful design choices to construct synthetic preference datasets for personalization, generating over 1M synthetic personalized preferences using publicly available LLMs. To successfully transfer from synthetic data to real users, we find it crucial for the data to exhibit both high diversity and coherent, self-consistent structure. We evaluate FSPO on personalized open-ended generation for up to 1,500 synthetic users across three domains: movie reviews, education, and open-ended question answering. We also run a controlled human study. Overall, FSPO achieves an 87% Alpaca Eval winrate in generating responses that are personalized to synthetic users and a 70% winrate with real human users in open-ended question answering.

2411.12502 2026-04-20 cs.LG cs.AI stat.ML

Transformer Neural Processes - Kernel Regression

Daniel Jenson, Jhonathan Navott, Mengyan Zhang, Makkunda Sharma, Elizaveta Semenova, Seth Flaxman

Comments This was superseded by 'Scalable Spatiotemporal Inference with Biased Scan Attention Transformer Neural Processes' (arXiv:2506.09163)

详情
英文摘要

Neural Processes (NPs) are a rapidly evolving class of models designed to directly model the posterior predictive distribution of stochastic processes. Originally developed as a scalable alternative to Gaussian Processes (GPs), which are limited by $O(n^3)$ runtime complexity, the most accurate modern NPs can often rival GPs but still suffer from an $O(n^2)$ bottleneck due to their attention mechanism. We introduce the Transformer Neural Process - Kernel Regression (TNP-KR), a scalable NP featuring: (1) a Kernel Regression Block (KRBlock), a simple, extensible, and parameter efficient transformer block with complexity $O(n_c^2 + n_c n_t)$, where $n_c$ and $n_t$ are the number of context and test points, respectively; (2) a kernel-based attention bias; and (3) two novel attention mechanisms: scan attention (SA), a memory-efficient scan-based attention that when paired with a kernel-based bias can make TNP-KR translation invariant, and deep kernel attention (DKA), a Performer-style attention that implicitly incoporates a distance bias and further reduces complexity to $O(n_c)$. These enhancements enable both TNP-KR variants to perform inference with 100K context points on over 1M test points in under a minute on a single 24GB GPU. On benchmarks spanning meta regression, Bayesian optimization, image completion, and epidemiology, TNP-KR with DKA outperforms its Performer counterpart on nearly every benchmark, while TNP-KR with SA achieves state-of-the-art results.

2411.05808 2026-04-20 math.ST math.PR stat.TH

Layered Hill estimator for extreme data in clusters

Taegyu Kang, Takashi Owada

Comments 36 pages

详情
英文摘要

A new estimator is proposed for estimating the tail exponent of a heavy-tailed distribution. This estimator, referred to as the layered Hill estimator, is a generalization of the traditional Hill estimator, building upon a layered structure formed by clusters of extreme values. We argue that the layered Hill estimator provides a robust alternative to the traditional approach, exhibiting desirable asymptotic properties such as consistency and asymptotic normality for the tail exponent. Both theoretical analysis and simulation studies demonstrate that the layered Hill estimator shows significantly better and more robust performance, particularly when a portion of the extreme data is missing.

2408.07066 2026-04-20 stat.ME

Conformal prediction after data-dependent model selection

Ruiting Liang, Wanrong Zhu, Rina Foygel Barber

详情
英文摘要

Given a family of pretrained models and a hold-out set, how can we construct a valid conformal prediction set while selecting a model that minimizes the width of the set? If we use the same hold-out data set both to select a model (the model that yields the smallest conformal prediction sets) and then to construct a conformal prediction set based on that selected model, we suffer a loss of coverage due to selection bias. Alternatively, we could further split the data to perform selection and calibration separately, but this comes at a steep cost if the size of the dataset is limited. In this paper, we address the challenge of constructing a valid prediction set after data-dependent model selection -- commonly, selecting the model that minimizes the width of the resulting prediction sets. Our novel methods can be implemented efficiently and admit finite-sample validity guarantees without invoking additional sample-splitting. We show that our methods yield prediction sets with asymptotically optimal width under certain notion of regularity for the model class. The improvement in the width of the prediction sets constructed by our methods are further demonstrated through applications to synthetic datasets in various settings and a real data example.

2303.12660 2026-04-20 cs.SI math.PR math.ST stat.TH

Structural Measures of Resilience for Supply Chains

Marios Papachristou, M. Amin Rahimian, Arash Azadegan

详情
英文摘要

Modern production systems are increasingly defined by dense networks of multi-tier sourcing dependencies, where localized upstream disruptions can cascade into system-wide collapses. While supply chain resilience has garnered significant managerial attention, we still lack theoretically-grounded, reliable, analytical metrics that can distinguish inherently resilient architectures from fragile ones. This paper addresses this gap by developing a structural resilience framework and a novel metric, defined as the maximum supplier failure rate that a network can sustain while maintaining an aggregate production level. Using node percolation theory and branching processes, we identify four critical structural determinants of resilience: the number of raw materials, the number of finished goods, sourcing requirements, and sourcing influence. Our analysis reveals two distinct regimes: "top hat" architectures, which are characterized by excessive raw materials and high centralization, making them inherently fragile; and "rolling pin" structures, which maintain controlled input/output widths and sparsity, allowing them to absorb non-trivial shocks. To operationalize these insights, we formulate resilience computation as a scalable linear program that approximates cascading failure sizes in large-scale networks with cycles, heterogeneous suppliers, and structural decoupling. Furthermore, we extend our framework to account for exogenous failure correlations, such as those arising from geographic or geopolitical factors that can undermine traditional supplier and input diversification strategies. We validate our theoretical results using multi-echelon supply chain data. These tools can inform network design, supplier diversification, and inventory planning to proactively reduce systemic risk.

2301.05660 2026-04-20 physics.data-an math.ST stat.ME stat.TH

Learn your entropy from informative data: an axiom ensuring the consistent identification of generalized entropies

Andrea Somazzi, Diego Garlaschelli

详情
Journal ref
Physical Review Research 7: 033087 (2025)
英文摘要

Shannon entropy, a cornerstone of information theory, statistical physics and inference methods, is uniquely identified by the Shannon-Khinchin or Shore-Johnson axioms. Generalizations of Shannon entropy, motivated by the study of non-extensive or non-ergodic systems, relax some of these axioms and lead to entropy families indexed by certain `entropic' parameters. In general, the selection of these parameters requires pre-knowledge of the system or encounters inconsistencies. Here we introduce a simple axiom for any entropy family: namely, that no entropic parameter can be inferred from a completely uninformative (uniform) probability distribution. When applied to the Uffink-Jizba-Korbel and Hanel-Thurner entropies, the axiom selects only Rényi entropy as viable. It also extends consistency with the Maximum Likelihood principle, which can then be generalized to estimate the entropic parameter purely from data, as we confirm numerically. Remarkably, in a generalized maximum-entropy framework the axiom implies that the maximized log-likelihood always equals minus Shannon entropy, even if the inferred probability distribution maximizes a generalized entropy and not Shannon's, solving a series of problems encountered in previous approaches.