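arXivDaily: a daily academic digest synchronized with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and related fields.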

2603.26644 2026-03-30 cs.LG astro-ph.IM stat.ME

Automatic Laplace Collapsed Sampling: Scalable Marginalisation of Latent Parameters via Automatic Differentiation

Toby Lovick, David Yallup, Will Handley

Comments 28 Pages, 7 Figures. Comments welcome

Abstract

We present Automatic Laplace Collapsed Sampling (ALCS), a general framework for marginalising latent parameters in Bayesian models using automatic differentiation, which we combine with nested sampling to explore the hyperparameter space in a robust and efficient manner. At each nested sampling likelihood evaluation, ALCS collapses the high-dimensional latent variables $z$ to a scalar contribution via maximum a posteriori (MAP) optimisation and a Laplace approximation, both computed using autodiff. This reduces the effective dimension from $d_\theta + d_z$ to just $d_\theta$, making Bayesian evidence computation tractable for high-dimensional settings without hand-derived gradients or Hessians, and with minimal model-specific engineering. The MAP optimisation and Hessian evaluation are parallelised across live points on GPU hardware, making the method practical at scale. We also show that automatic differentiation enables local approximations beyond Laplace to parametric families such as the Student-$t$, which improves evidence estimates for heavy-tailed latents. We validate ALCS on a suite of benchmarks spanning hierarchical, time-series, and discrete-likelihood models and establish where the Gaussian approximation holds. This enables a post-hoc ESS diagnostic that localises failures across hyperparameter space without expensive joint sampling.
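
For readers unfamiliar with the collapse step described in the abstract, the standard Laplace approximation to a latent-marginalised likelihood can be written as below. This is the textbook form, not necessarily the exact expression used in the paper; the data symbol $y$ and the names $\hat{z}(\theta)$ and $H(\theta)$ are notation assumed here for illustration.

```latex
\begin{aligned}
\hat{z}(\theta) &= \arg\max_{z}\, \log p(y, z \mid \theta), \\
H(\theta) &= -\nabla_z^2 \log p(y, z \mid \theta)\,\Big|_{z=\hat{z}(\theta)}, \\
\log p(y \mid \theta) &= \log \int p(y, z \mid \theta)\, dz
  \;\approx\; \log p(y, \hat{z}(\theta) \mid \theta)
  + \frac{d_z}{2}\log(2\pi)
  - \frac{1}{2}\log\det H(\theta).
\end{aligned}
```

The right-hand side depends on $\theta$ only, which is why the sampler needs to explore $d_\theta$ dimensions rather than $d_\theta + d_z$. The approximation is exact when $\log p(y, z \mid \theta)$ is quadratic in $z$.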

2603.26618 2026-03-30 math.ST stat.TH

Statistical inference for extremal directions in high-dimensional spaces

Lucas Butsch, Vicky Fasen-Hartmann

2603.26611 2026-03-30 cs.LG stat.ME stat.ML

Benchmarking Tabular Foundation Models for Conditional Density Estimation in Regression

Rafael Izbicki, Pedro L. C. Rodrigues

2603.26548 2026-03-30 stat.AP

Impact of Residential Retrofits on Gas and Electricity Consumption in France

Charly Andral, Laetitia Leduc, Guillaume Matheron, Yukihide Nakada

2603.26502 2026-03-30 stat.ME stat.AP stat.ML

Targeted learning of heterogeneous treatment effect curves for right censored or left truncated time-to-event data

Matthew Pryce, Karla Diaz-Ordaz, Ruth H. Keogh, Stijn Vansteelandt

2603.26478 2026-03-30 cs.SD stat.ME stat.ML

Probabilistic Multilabel Graphical Modelling of Motif Transformations in Symbolic Music

Ron Taieb, Yoel Greenberg, Barak Sober

Comments 23 pages (21 pages main text), 2 figures. Submitted to Journal of New Music Research (Special Issue on Computational and Cognitive Musicology)

2603.26460 2026-03-30 math.ST stat.TH

The relative value of interventional and observational samples in Bayesian Causal Linear Gaussian Models

Valentinian Lungu, Anish Dhir, Mark van der Wilk, Ioannis Kontoyiannis

2603.26418 2026-03-30 stat.ML cs.LG math.FA

Kantorovich--Kernel Neural Operators: Approximation Theory, Asymptotics, and Neural Network Interpretation

Tian-Xiao He

2603.26415 2026-03-30 cs.LG cs.AI stat.AP

KMM-CP: Practical Conformal Prediction under Covariate Shift via Selective Kernel Mean Matching

Siddhartha Laghuvarapu, Rohan Deb, Jimeng Sun

2603.26375 2026-03-30 stat.AP

Summarising mortality data with a time-dependent beta latent variable model

Pedro Menezes de Araújo, Isobel Claire Gormley, Thomas Brendan Murphy

2603.26369 2026-03-30 math.ST stat.ME stat.TH

Validating spatial-temporal separability for stationary processes

Lujia Bai, Holger Dette, Zihao Yuan

2603.26358 2026-03-30 stat.ME stat.AP

Mixed Time Series Quasi-Likelihood Models for Uncovering Covid-19 Viral Load and Mortality Dynamics

Kejin Wu, Raanju R. Sundararajan, Michel F. C. Haddad, Luiza S. C. Piancastelli, Wagner Barreto-Souza

Comments Paper submitted for publication

2603.26349 2026-03-30 stat.ML cs.AI cs.LG

Generative Score Inference for Multimodal Data

Xinyu Tian, Xiaotong Shen

Comments 25 pages, 4 figures

2603.26344 2026-03-30 stat.ML cs.LG cs.SD eess.AS eess.SP

A Power-Weighted Noncentral Complex Gaussian Distribution

Toru Nakashika

2603.26334 2026-03-30 stat.AP physics.data-an

Bayesian estimation of optical constants using mixtures of Gaussian process experts

Teemu Härkönen, Hui Chen, Erik Vartiainen

2603.26327 2026-03-30 stat.ME cs.LG

Making Multi-Axis Models Robust to Multiplicative Noise: How, and Why?

Bailey Andrew, David R. Westhead, Luisa Cutillo

Comments 9 pages (26 with supplemental), 4 figures (+2 in supplemental), preprint

2603.26309 2026-03-30 stat.AP cs.LG q-fin.RM

Semi-structured multi-state delinquency model for mortgage default

Victor Medina-Olivares, Wangzhen Xia, Stefan Lessmann, Nadja Klein

2603.26301 2026-03-30 stat.ME math.CO math.PR math.ST stat.ML stat.TH

Complete Causal Identification from Ancestral Graphs under Selection Bias

Leihao Chen, Joris M. Mooij

2603.26297 2026-03-30 stat.ME

Attribution of Spurious Factors from High-Dimensional Functional Time Series

Adam Nie, Yanrong Yang, Han Lin Shang, Yi He

Comments 35 pages, 7 figures, 1 table

2603.26296 2026-03-30 stat.AP

Adaptation and Validation of the Turkish Version of the Large Language Model Dependency Scale (LLM-D12)

Tugba Coskun Aslan, Gulser Uncular, Hasan Durmus, Yasin Kavla, Arda Borlu, Sameha Alshakhsi, Ala Yankouskaya, Raian Ali

2603.26261 2026-03-30 cs.LG stat.ML

Contrastive Conformal Sets

Yahya Alkhatib, Wee Peng Tay

2603.26225 2026-03-30 math.ST stat.TH

Dependencies in Multiplex Networks: A Motif Count Approach

Karl Sawaya, Sofia Olhede

2603.26166 2026-03-30 stat.ME

Unifying the Hoover and Gini indices: Analytical, bias, and computational aspects

Roberto Vila, Helton Saulo, Felipe Quintino

Comments 19 pages, 2 figures

2603.26097 2026-03-30 cs.LG cs.AI stat.ML

Dynamic Tokenization via Reinforcement Patching: End-to-end Training and Zero-shot Transfer

Yulun Wu, Sravan Kumar Ankireddy, Samuel Sharpe, Nikita Seleznev, Dehao Yuan, Hyeji Kim, Nam H. Nguyen

2603.26048 2026-03-30 stat.ML cs.LG math.ST stat.TH

Asymptotic Optimism for Tensor Regression Models with Applications to Neural Network Compression

Haoming Shi, Eric C. Chi, Hengrui Luo

Comments 62 pages, 11 figures

2603.26026 2026-03-30 stat.OT

Hybrid physics-data driven spectral forecasts of semisubmersible response

Ian Milne, Lachlan Astfalck, Matthew Zed, Jack Lee-Kopij, Edward Cripps

2603.24999 2026-03-30 stat.AP cs.AI

Efficient Detection of Bad Benchmark Items with Novel Scalability Coefficients

Michael Hardy, Joshua Gilbert, Benjamin Domingue

2603.24970 2026-03-30 econ.EM stat.ME

Randomization Inference For the Always-Reporter Average Treatment Effect

Haoge Chang, Zeyang Yu

2603.24771 2026-03-30 stat.ME stat.ML

Identifiable Deep Latent Variable Models for MNAR Data

Huiming Xie, Fei Xue, Xiao Wang

2603.24763 2026-03-30 math.ST cs.LG stat.ML stat.TH

Binary Expansion Group Intersection Network

Sicheng Zhou, Kai Zhang

2603.19778 2026-03-30 cs.CE math.ST stat.TH

Uniform Maximum Projection Designs for Computer Experiments

Miroslav Vořechovský, Jan Mašek

Comments Accepted in Computers and Structures

DOI: 10.1016/j.compstruc.2026.108209
Journal ref: Computers and Structures, ISSN 1879-2243 (0045-7949), 325:108209, 2026

Abstract

Space-filling experimental designs are widely used in engineering computer experiments, where only a limited number of expensive model evaluations can be afforded. Distance-based designs such as Maximin or Minimax ensure global space-filling, while Latin hypercube sampling enforces uniform one-dimensional projections, yet neither guarantees uniformity in low-dimensional subspaces. Maximum Projection (MaxPro) designs were introduced to improve uniformity in low-dimensional subspaces, yet their original formulation relies on the Euclidean distance and may induce systematic density distortions in bounded domains. We demonstrate that the standard MaxPro criterion leads to statistically non-uniform sampling, resulting in undersampling of corner regions and biased Monte Carlo estimates. To remedy this issue, we introduce a periodic variant of the criterion, termed Uniform Maximum Projection (uMaxPro), in which the Euclidean metric is replaced by a periodic distance based on the minimum image convention. The proposed uMaxPro designs preserve the projection-aware structure of MaxPro while achieving statistical uniformity of the design-generation mechanism. Numerical experiments show unbiased Monte Carlo integration with reduced variance, excellent subspace projection performance, and competitive discrepancy properties. The methodology is further validated on benchmark engineering problems, including a meso-scale finite element model of concrete, demonstrating improved accuracy in surrogate modeling and probabilistic estimation. The resulting criterion provides a simple and computationally efficient modification of MaxPro that enhances its robustness for nonadaptive computer experiments. The construction algorithm, open-source implementation, and reproducible optimized designs are provided to facilitate practical adoption of the method.
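
The idea of swapping the Euclidean difference for a minimum-image (periodic) one can be sketched in a few lines. The sketch below assumes the commonly cited MaxPro form (mean over point pairs of the inverse product of squared coordinate differences, raised to the power 1/p) on the unit hypercube, and assumes the periodic variant wraps each coordinate difference independently. The function names and the toy design are hypothetical and are not taken from the authors' implementation.

```python
import math
from itertools import combinations


def wrapped_diff(a, b):
    """Coordinate distance on [0, 1] with periodic wrap (minimum image convention)."""
    d = abs(a - b)
    return min(d, 1.0 - d)


def maxpro_criterion(design, periodic=False):
    """MaxPro-type criterion for points in the unit hypercube (lower is better).

    With periodic=True, each coordinate difference is wrapped, which is the
    assumed reading of the periodic variant described in the abstract.
    """
    n = len(design)
    p = len(design[0])
    total = 0.0
    for x, y in combinations(design, 2):
        prod = 1.0
        for a, b in zip(x, y):
            d = wrapped_diff(a, b) if periodic else abs(a - b)
            prod *= d * d  # squared per-coordinate difference
        total += 1.0 / prod
    return (total / math.comb(n, 2)) ** (1.0 / p)


# Hypothetical 4-point design in two dimensions.
design = [(0.05, 0.35), (0.35, 0.95), (0.65, 0.05), (0.95, 0.65)]
print(maxpro_criterion(design))
print(maxpro_criterion(design, periodic=True))
```

Under the wrapped distance, points near opposite faces of the domain count as neighbours, so the boundary is no longer treated as a region with fewer nearby points. Because wrapped differences are never larger than plain ones, the periodic value is never smaller than the plain value for the same design.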

2603.02460 2026-03-30 stat.ML cs.LG

Conformal Graph Prediction with Z-Gromov Wasserstein Distances

Gabriel Melo, Thibaut de Saivre, Anna Calissano, Florence d'Alché-Buc

2602.20396 2026-03-30 cs.LG stat.ME

cc-Shapley: Measuring Multivariate Feature Importance Needs Causal Context

Jörg Martin, Stefan Haufe

2601.02226 2026-03-30 stat.AP

Initial data analysis of the national German transplantation registry with a focus on kidney transplantation

Lukas Klein, Gunter Grieser, Carl-Ludwig Fischer-Fröhlich, Axel Rahmel, Henrik Stahl, Andreas Wienke, Antje Jahn-Eimermacher

Comments 31 pages, 9 figures, 1 supplementary document, Submitted to BMC Medical Research Methodology

2512.23138 2026-03-30 astro-ph.IM astro-ph.SR cs.LG stat.ML

Why Machine Learning Models Systematically Underestimate Extreme Values II: How to Fix It with LatentNN

Yuan-Sen Ting

Comments 17 pages, 7 figures. Published in the Open Journal of Astrophysics

2512.03321 2026-03-30 stat.CO

Numerical optimization for the compatibility constant of the lasso

Kei Hirose

2511.19234 2026-03-30 stat.ME stat.AP

Integrating Complex Covariate Transformations in Generalized Additive Models

Claudia Collarin, Matteo Fasiolo, Yannig Goude, Simon N. Wood

2511.04206 2026-03-30 math.ST stat.TH

Goodness-of-fit testing of the distribution of posterior classification probabilities for validating model-based clustering

Salima El Kolei, Matthieu Marbac

Abstract

We present the first method for assessing the relevance of a model-based clustering result in a general framework. Standard validation criteria, like the adjusted Rand index, rely on external labels to assess partition accuracy; consequently, they are inapplicable to real-world clustering problems where labels are missing. In contrast, our method offers an internal goodness-of-fit diagnostic, since it evaluates the validity of the clustering mechanism by testing the specification of the posterior probabilities of classification defined on the unit simplex. Because this simplex dimension is fixed by the number of clusters, the procedure naturally circumvents the curse of dimensionality, making it applicable to high-dimensional data where traditional density-based tests fail. The testing procedure requires only a consistent estimator of the parameters and the associated posterior classification probabilities for each observation, and its implementation is straightforward, as no additional model fitting is needed. Under the null hypothesis, the method exploits the fact that any functional transformation of the posterior probabilities has the same expectation under both the model being tested and the true data-generating process. The resulting goodness-of-fit test is constructed via an empirical likelihood approach with a growing number of moment conditions, allowing asymptotic detection of any alternative. A block-splitting strategy, employed to account for parameter estimation, provides a vector of test statistics that behave like a vector of independent chi-square random variables. Therefore, the goodness-of-fit of the posterior classification probabilities is assessed via the goodness-of-fit of the vector of empirical likelihood ratio test statistics. 
Hence, based on the distribution of this vector of statistics, different goodness-of-fit tests (e.g., Kolmogorov-Smirnov) can be used to investigate the distribution of the vector of test statistics with an exact asymptotic significance level.
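
One simple instance of the expectation identity mentioned in the abstract may help make it concrete. The notation below (mixture weights $\pi_k$, component densities $f_k$, posterior probabilities $\tau_k$) is assumed here for illustration, and this particular moment condition is not claimed to be one the authors use.

```latex
\begin{aligned}
\tau_k(x) &= \frac{\pi_k f_k(x)}{\sum_{j=1}^{K} \pi_j f_j(x)}, \\
\mathbb{E}_{\text{model}}\!\left[\tau_k(X)\right]
  &= \int \tau_k(x) \sum_{j=1}^{K} \pi_j f_j(x)\, dx
   = \int \pi_k f_k(x)\, dx
   = \pi_k .
\end{aligned}
```

If the data really follow the fitted mixture, the sample average of $\tau_k(X_i)$ should therefore be close to $\pi_k$, and a systematic gap is evidence against the model. Replacing $\tau_k$ by other functions of the posterior probabilities gives further conditions of the same kind.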

2510.27643 2026-03-30 stat.ML cs.LG cs.NA math.NA math.OC stat.CO

Bayesian Optimization on Networks

Wenwen Li, Daniel Sanz-Alonso, Ruiyi Yang

Comments 40 pages, 10 figures; includes appendices

2510.11239 2026-03-30 stat.CO

Spline Interpolation on Compact Riemannian Manifolds

Charlie Sire, Mike Pereira, Thomas Romary

2510.07235 2026-03-30 math.ST stat.ME stat.TH

A Bernstein polynomial approach for the estimation of cumulative distribution functions in the presence of missing data

Rihab Gharbi, Wissem Jedidi, Salah Khardani, Frédéric Ouimet

Comments 33 pages, 2 figures, 10 tables

2510.05646 2026-03-30 stat.AP math.ST stat.TH

Geographically Weighted Regression for Air Quality Low-Cost Sensor Calibration

Jean-Michel Poggi, Bruno Portier, Emma Thulliez

2510.03587 2026-03-30 stat.CO stat.ME stat.ML

Exact and Approximate MCMC for Doubly-intractable Probabilistic Graphical Models Leveraging the Underlying Independence Model

Yujie Chen, Antik Chakraborty, Anindya Bhadra

Comments To appear in Proceedings of the 29th International Conference on Artificial Intelligence and Statistics (AISTATS) 2026, Tangier, Morocco

2509.25419 2026-03-30 stat.ME stat.CO

Bias-Reduced Estimation of Structural Equation Models

Haziq Jamil, Yves Rosseel, Oliver Kemp, Ioannis Kosmidis

2507.18951 2026-03-30 math.AP math.ST stat.CO stat.TH

Elliptic Bayesian Inverse Problems on Metric Graphs

David Bolin, Wenwen Li, Daniel Sanz-Alonso

Comments 38 pages, 7 figures, including appendices

2505.17564 2026-03-30 stat.AP math.ST stat.TH

Using low-cost sensors to improve NO2 concentration maps derived from physico-chemical models

Emma Thulliez, Camille Coron

2504.06215 2026-03-30 stat.ME econ.EM

Randomization Inference in Two-Sided Market Experiments

Jizhou Liu, Azeem M. Shaikh, Panos Toulis

2502.18253 2026-03-30 econ.GN q-fin.EC stat.AP

Enhancing External Validity of Experiments with Ongoing Sampling

Chen Wang, Shichao Han, Shan Huang

2502.14950 2026-03-30 quant-ph cs.LG math.ST stat.ML stat.TH

Symmetric observations without symmetric causal explanations

Christian William, Patrick Remy, Jean-Daniel Bancal, Yu Cai, Nicolas Brunner, Alejandro Pozas-Kerstjens

Comments 8+3 pages, 4+1 figures, RevTeX 4.2. The computational appendix is available at https://www.github.com/apozas/symmetric-causal. V2: published version

2502.01557 2026-03-30 cs.LG math.DS stat.ML

How iteration order influences convergence and stability in deep learning

Benoit Dherin, Benny Avelin, Anders Karlsson, Hanna Mazzawi, Javier Gonzalvo, Michael Munn

2501.03501 2026-03-30 stat.AP

Modeling Cell Developmental Trajectory using Multinomial Unbalanced Optimal Transport

Junhao Zhu, Kevin Zhang, Zhaolei Zhang, Dehan Kong

2408.00949 2026-03-30 cs.LG math.GR math.RT stat.ML

Equivariant neural networks and piecewise linear representation theory

Joel Gibson, Daniel Tubbenhauer, Geordie Williamson

Comments 23 pages, many figures, revision, to appear in Contemp. Math., comments welcome

2405.16885 2026-03-30 stat.ME q-bio.PE

Hidden Markov modelling of spatio-temporal dynamics of measles in 1750-1850 Finland

Tiia-Maria Pasanen, Jouni Helske, Tarmo Ketola

2403.08079 2026-03-30 cs.SE stat.ME

BayesFLo: Bayesian fault localization of complex software systems

Yi Ji, Simon Mak, Ryan Lekivetz, Joseph Morgan

2311.12634 2026-03-30 math.PR math.ST stat.TH

On $q$-Order Statistics

Malvina Vamvakari

2311.02543 2026-03-30 stat.ME

Pairwise likelihood estimation and limited information goodness-of-fit test statistics for binary factor analysis models under complex survey sampling

Haziq Jamil, Irini Moustaki, Chris Skinner

2201.10300 2026-03-30 math.CA cs.NA math.NA math.ST stat.TH

The Inverse Problem for Single Trajectories of Rough Differential Equations

Thomas Morrish, Theodore Papamarkou, Anastasia Papavasiliou, Yang Zhao

Comments Final version, accepted for publication in the SIAM/ASA Journal on Uncertainty Quantification

2603.26002 2026-03-30 math.ST math.PR stat.TH

Quasi-Banach spaces of random variables and stochastic processes

Yuriy Kozachenko, Yuriy Mlavets, Oleksandr Mokliachuk

2603.25970 2026-03-30 stat.AP stat.CO

Bayesian Deep Count Regression and Anomaly Detection: Evidence from GDELT Event Panels

Hsin-Hsiung Huang, Yuh-Haur Chen, Mahlon Scott

2603.25966 2026-03-30 math.PR math.ST stat.TH

Besov-Orlicz moduli of Brownian motion and polygonal partial sum processes

Fabian Mies

2603.25964 2026-03-30 stat.AP

Assessing Reporting Delays in ACLED Conflict Event Data

Faniry A. Razakason, Daniel Racek, Paul W. Thurner, Göran Kauermann

2603.25934 2026-03-30 math.ST math.PR stat.ML stat.TH

Sharp Concentration Inequalities: Phase Transition and Mixing of Orlicz Tails with Variance

Yinan Shen, Jinchi Lv

2603.25919 2026-03-30 stat.ME

Regularized Regression by Composition: Identifiability, Structured Penalization, and Statistical Guarantees for Multi-Flow Distributional Models

Safaa K. Kadhem

2603.25916 2026-03-30 cs.LG stat.ML

Parameter-Free Dynamic Regret for Unconstrained Linear Bandits

Alberto Rumi, Andrew Jacobsen, Nicolò Cesa-Bianchi, Fabio Vitale

Comments 10 pages. v1: AISTATS 2026

2603.25911 2026-03-30 stat.ME stat.ML

Robust Tensor-on-Tensor Regression

Mehdi Hirari, Fabio Centofanti, Mia Hubert, Stefan Van Aelst

2603.25910 2026-03-30 math.ST stat.TH

Finite-Time Observability of Oscillatory Instabilities in Synchronous p-bit Dynamics

Naoya Onizawa, Shunsuke Koshita, Takahiro Hanyu

Comments Submitted to Physical Review E

2603.25869 2026-03-30 eess.IV cs.CV stat.ML

Learning to Recorrupt: Noise Distribution Agnostic Self-Supervised Image Denoising

Brayan Monroy, Jorge Bacca, Julián Tachella

2603.25854 2026-03-30 stat.ME

Modeling with Categorical Features via Exact Fusion and Sparsity Regularisation

Kayhan Behdin, Riade Benbaki, Peter Radchenko, Rahul Mazumder

Comments Journal of Royal Statistical Society, Series B (to appear)

2603.25838 2026-03-30 stat.ME

Causal Network Discovery from Interventional Count Data with Latent Linear DAGs

Yijiao Zhang, Hongzhe Li

Comments 35 pages, 5 figures

2603.25796 2026-03-30 stat.ML cs.AI cs.LG math.ST stat.TH

Beyond identifiability: Learning causal representations with few environments and finite samples

Inbeom Lee, Tongtong Jin, Bryon Aragam

2603.25776 2026-03-30 stat.ML cs.LG

SAHMM-VAE: A Source-Wise Adaptive Hidden Markov Prior Variational Autoencoder for Unsupervised Blind Source Separation

Yuan-Hao Wei

2603.25755 2026-03-30 physics.chem-ph cs.LG q-bio.QM stat.ML

KANEL: Kolmogorov-Arnold Network Ensemble Learning Enables Early Hit Enrichment in High-Throughput Virtual Screening

Pavel Koptev, Nikita Krainov, Konstantin Malkov, Alexander Tropsha

Comments 8 Pages

2602.04668 2026-03-30 math.ST stat.TH

Estimation of reliability and accuracy of models of $\varphi$-sub-Gaussian process using generating functions of polynomial expansions

Oleksandr Mokliachuk

2512.17374 2026-03-30 stat.ML math.OC

Generative modeling of conditional probability distributions on the level-sets of collective variables

Fatima-Zahrae Akhyar, Wei Zhang, Gabriel Stoltz, Christof Schütte

2511.19742 2026-03-30 stat.AP stat.ME

Anchoring Convenience Survey Samples to a Baseline Census for Vaccine Coverage Monitoring in Global Health

Nathaniel Dyrkton, Shomoita Alam, Susan Shepherd, Ibrahim Sana, Kevin Phelan, Jay JH Park

Comments 5 figures, 2 tables. Includes updates to DGM, Results, and added clarification

2511.09500 2026-03-30 stat.ML cs.LG math.ST stat.ME stat.TH

Distributional Shrinkage I: Universal Denoiser Beyond Tweedie's Formula

Tengyuan Liang

Comments 27 pages, 5 figures

2501.06360 2026-03-30 stat.ME

Borrowing Information from an Unidentifiable Model: Guaranteed Efficiency Gain with a Dichotomized Outcome in the External Data

Lu Wang, Yanyuan Ma, Jiwei Zhao

2412.04882 2026-03-30 cs.LG stat.ML

Nonmyopic Global Optimisation via Approximate Dynamic Programming

Filippo Airaldi, Bart De Schutter, Azita Dabiri

Comments 36 pages, 6 figures, 2 tables, submitted to Springer Machine Learning

Abstract

Global optimisation aims to optimise expensive-to-evaluate black-box functions without gradient information. Bayesian optimisation, one of the most well-known techniques, typically employs Gaussian processes as surrogate models, leveraging their probabilistic nature to balance exploration and exploitation. However, these processes become computationally prohibitive in high-dimensional spaces. Recent alternatives, based on inverse distance weighting (IDW) and radial basis functions (RBFs), offer competitive, computationally lighter solutions. Despite their efficiency, both traditional global and Bayesian optimisation strategies suffer from the myopic nature of their acquisition functions, which focus on immediate improvement while neglecting future implications of the sequential decision-making process. Nonmyopic acquisition functions devised for the Bayesian setting have shown promise in improving long-term performance. Yet, their combination with deterministic surrogate models remains unexplored. In this work, we introduce novel nonmyopic acquisition strategies tailored to IDW and RBF based on approximate dynamic programming paradigms, including rollout and multi-step scenario-based optimisation schemes, to enable lookahead acquisition. These methods optimise a sequence of query points over a horizon by predicting the evolution of the surrogate model, inherently managing the exploration-exploitation trade-off via optimisation techniques. The proposed approach represents a significant advance in extending nonmyopic acquisition principles, previously confined to Bayesian optimisation, to deterministic models. Empirical results on synthetic and hyperparameter tuning benchmark problems, a constrained problem, as well as on a data-driven predictive control application, demonstrate that these nonmyopic methods outperform conventional myopic approaches, leading to faster and more robust convergence.
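
To illustrate the kind of deterministic surrogate the abstract refers to, here is a minimal sketch of plain Shepard inverse distance weighting with weights 1/d^2, followed by a purely exploitative one-step (myopic) choice over a candidate grid. It does not reproduce the paper's acquisition functions or its rollout schemes. The objective, the sample points and all function names are hypothetical.

```python
def idw_predict(x, samples, eps=1e-12):
    """Shepard IDW surrogate: weighted mean of observed values, weights 1/d^2."""
    num = 0.0
    den = 0.0
    for xi, yi in samples:
        d2 = sum((a - b) ** 2 for a, b in zip(x, xi))
        if d2 < eps:
            return yi  # interpolate exactly at an already-evaluated point
        w = 1.0 / d2
        num += w * yi
        den += w
    return num / den


def objective(x):
    """Hypothetical stand-in for an expensive black-box function."""
    return (x[0] - 0.3) ** 2


# Three evaluated points (hypothetical).
samples = [((t,), objective((t,))) for t in (0.0, 0.5, 1.0)]

# Myopic, exploitation-only step: minimise the surrogate over a grid.
candidates = [(i / 20,) for i in range(21)]
best = min(candidates, key=lambda c: idw_predict(c, samples))
print(best, idw_predict(best, samples))
```

Because an IDW prediction is a weighted average of the observed values, its minimiser over the grid is the best point already sampled. This is why practical acquisition functions add an exploration term, and it is the one-step short-sightedness that lookahead schemes are meant to address.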

2307.15181 2026-03-30 econ.EM math.ST stat.ME stat.TH

On the Efficiency of Highly Stratified Experiments

Yuehao Bai, Jizhou Liu, Azeem M. Shaikh, Max Tabord-Meehan