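arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems science, and more.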

2604.28163 2026-05-01 eess.SP cs.LG stat.CO stat.ML

Sequential Inference for Gaussian Processes: A Signal Processing Perspective

Daniel Waxman, Fernando Llorente, Petar M. Djurić

Comments 53 pages, 7 figures. Accepted to IEEE Signal Processing Magazine

Details

Abstract

The proliferation of capable and efficient machine learning (ML) models marks one of the strongest methodological shifts in signal processing (SP) in its nearly 100-year history. ML models support the development of SP systems that represent complex, nonlinear relationships with high predictive accuracy. Adapting these models often requires sequential inference, which differs both theoretically and methodologically from the usual paradigm of ML, where data are often assumed independent and identically distributed. Gaussian processes (GPs) are a flexible yet principled framework for modeling random functions, and they have become increasingly relevant to SP as statistical and ML methods assume a more prominent role. We provide a self-contained, tutorial-style overview of GPs, with a particular focus on recent methodological advances in sequential, incremental, or streaming inference. We introduce these techniques from a signal-processing perspective while bridging them to recent advances in ML. Many of the developments we survey have direct applications to state-space modeling, sequential regression and forecasting, anomaly detection in time series, sequential Bayesian optimization, adaptive and active sensing, and sequential detection and decision-making. By organizing these advances from a signal-processing perspective, we intend to equip practitioners with practical tools and a coherent roadmap for deploying sequential GP models in real-world systems.

2604.28104 2026-05-01 stat.ME math.ST stat.TH

Kernel-based independence and mean independence tests for weakly dependent data

Daniel Diz-Castro, Manuel Febrero-Bande, Wenceslao González-Manteiga

2604.28047 2026-05-01 stat.ME

Data-Adaptive and Model-Robust Covariate Adjustment for Time-to-Event Outcomes in Stratified Randomized Trials

Raphael C. Kim, Brian Gilbert, Ramin Zabih, Michele Santacatterina, Ivan Diaz

2604.28027 2026-05-01 stat.ME

Response to: "A note on conditional densities, Bayes' rule, and recent criticisms of Bayesian inference" by Yan et al., 2026

Klaus Mosegaard, Andrew Curtis

Comments 10 pages, 0 figures

2604.27907 2026-05-01 stat.ME

Multivariate mixed models with model-free random effects

Angela Andreella, Livio Finos

2604.27892 2026-05-01 stat.ML cs.LG stat.AP

Prediction-powered Inference by Mixture of Experts

Yanwu Gu, Linglong Kong, Dong Xia

2604.27887 2026-05-01 stat.ME

Meta-Analysis Without Normality: Estimating the True Effect Distribution with Penalized Gaussian Mixtures

Daihe Sui, Elizabeth Tipton

Comments 38 pages, 17 figures

2604.27883 2026-05-01 math.ST cs.IT cs.LG math.IT stat.ML stat.TH

Decoupled Descent: Exact Test Error Tracking Via Approximate Message Passing

Max Lovig

Comments 43 Pages, 7 Figures

2604.27831 2026-05-01 stat.AP

Optimal allocation of trials to sub-regions in crop variety testing with multiple years and correlated genotype effects

Maryna Prus, Lenka Filová, Hans-Peter Piepho, Waqas Ahmed Malik

2604.27813 2026-05-01 math.ST stat.TH

A High Dimensional Wild Bootstrap Max-Test for Detecting the Presence of Significant Predictors

Jonathan B. Hill

2604.27791 2026-05-01 stat.ME

Reversible Jump MCMC With No Regrets: Bayesian Variable Selection Using Mixtures of Mutually Singular Distributions

Don van den Bergh, Merlise A. Clyde, Adrian E. Raftery, Maarten Marsman

2604.27742 2026-05-01 cs.LG stat.ML

Linear-Core Surrogates: Smooth Loss Functions with Linear Rates for Classification and Structured Prediction

Mehryar Mohri, Yutao Zhong

2604.20052 2026-05-01 stat.CO

Annealed Langevin Monte Carlo for Flow ODE Sampling

Hanwen Huang

Comments 25 pages, 3 figures

2604.01911 2026-05-01 stat.ME

On the uncertainty from the first-stage estimation of prognostic covariate adjustment in randomized controlled trials

Nodoka Seya, Masataka Taguri

2602.20549 2026-05-01 cs.LG cs.CV stat.ME

Sample-efficient evidence estimation of score based priors for model selection

Frederic Wang, Katherine L. Bouman

Comments ICLR 2026

2602.19483 2026-05-01 cs.LG cs.AI stat.ML

Making Conformal Predictors Robust in Healthcare Settings: a Case Study on EEG Classification

Arjun Chatterjee, Sayeed Sajjad Razin, John Wu, Siddhartha Laghuvarapu, Jathurshan Pradeepkumar, Jimeng Sun

Comments Accepted to the International Conference on Artificial Intelligence in Medicine 2026

2601.22993 2026-05-01 cs.LG stat.ML

Constrained Policy Optimization with Cantelli-Bounded Value-at-Risk

Rohan Tangri, Jan-Peter Calliess

2512.20914 2026-05-01 math.ST stat.AP stat.ML stat.TH

Invariant Feature Extraction Through Conditional Independence and the Optimal Transport Barycenter Problem: the Gaussian case

Ian Bounos, Pablo Groisman, Mariela Sued, Esteban Tabak

2510.12911 2026-05-01 econ.EM q-fin.RM stat.ME

Spot Regressions with Candlesticks

Yasin Simsek

2509.20194 2026-05-01 stat.ME econ.EM

Identification and Semiparametric Estimation of Conditional Means from Aggregate Data

Cory McCartan, Shiro Kuriwaki

Comments 20 pages, plus references and appendices

2508.05462 2026-05-01 stat.CO math.PR

Piecewise Deterministic Sampling for Constrained Distributions

Joël Tatang Demano, Paul Dobson, Konstantinos Zygalakis

Comments 44 pages, 9 figures

2506.17463 2026-05-01 math.ST stat.ME stat.TH

Testing Separability of High-Dimensional Covariance Matrices

Bongjung Sung, Peter D. Hoff

Comments 85 pages, 32 pages in the main text, new theoretical results, including the convergence of the Kronecker MLE under the partial-isotropy core, with more sophisticated results on the asymptotic distributions and consistency, are added

2505.13230 2026-05-01 cs.LG cond-mat.dis-nn stat.ML

Implicit bias produces neural scaling laws in learning curves, from perceptrons to deep networks

Francesco D'Amico, Dario Bocchi, Matteo Negri

Comments Final accepted version at ICLR26 main conference; 27 pages, 21 Figures, 5 tables

2505.12487 2026-05-01 stat.CO stat.ME stat.ML

Stereographic Multiple-Try Metropolis

Zhihao Wang, Jun Yang

Comments 53 pages, 12 figures

2503.24324 2026-05-01 stat.AP econ.GN physics.soc-ph q-fin.EC q-fin.RM

Mitigating Financial Risk from Climate-Induced Agricultural Price Volatility

Sourish Das, Sudeep Shukla, Abbinav Sankar Kailasam, Anish Rai, Sejal Garg, Anirban Chakraborti

Comments 15 pages, 11 figures

2503.04956 2026-05-01 stat.ML cs.LG

Foreclassing: A new machine learning perspective on human decision making with temporal data

Daniel Andrew Coulson, Martin T. Wells

Comments 20 pages, 1 figure, 15 tables

2502.19234 2026-05-01 physics.ao-ph physics.data-an stat.AP

Arctic teleconnection on climate and ozone pollution in the polar jet stream path of eastern US

K Shuvo Bakar, Sourish Das, Sudeep Shukla, Anirban Chakraborti

Comments 19 pages, 6 figures

2402.14532 2026-05-01 cs.LG stat.ML

A Framework for Variational Inference of Lightweight Bayesian Neural Networks with Heteroscedastic Uncertainties

David J. Schodt, Ryan Brown, Michael Merritt, Samuel Park, Delsin Menolascino, Mark A. Peot

Comments Fix equation typos

2310.18500 2026-05-01 stat.ME

Designing Randomized Experiments to Predict Unit-Specific Treatment Effects

Elizabeth Tipton, Michalis Mamakos

Comments 46 pages, 3 figures

2302.03286 2026-05-01 math.NA cs.NA stat.ML

Algorithmically Designed Artificial Neural Networks (ADANNs): Higher order deep operator learning for parametric partial differential equations

Arnulf Jentzen, Adrian Riekert, Philippe von Wurstemberger

Comments 39 pages, 17 Figures

2208.07086 2026-05-01 stat.ME math.ST stat.TH

Flexible Bayesian Multiple Comparison Adjustment Using Dirichlet Process and Beta-Binomial Model Priors

Don van den Bergh, Fabian Dablander

Comments 31 pages, 12 figures, and 2 tables

2112.13247 2026-05-01 math.ST stat.TH

Decision-making with possibilistic inferential models

Ryan Martin, Shih-Ni Prim, Jonathan Williams

2604.27733 2026-05-01 cs.LG stat.ML

Mind the Gap: Structure-Aware Consistency in Preference Learning

Mehryar Mohri, Yutao Zhong

2604.27732 2026-05-01 stat.AP q-fin.RM stat.OT

A Note on the Generalized Cape Cod Reserving Method

Ronald Richman, Mario V. Wüthrich

2604.27723 2026-05-01 cs.LG stat.ML

Optimized Deferral for Imbalanced Settings

Corinna Cortes, Anqi Mao, Mehryar Mohri, Yutao Zhong

2604.27696 2026-05-01 stat.CO stat.AP stat.ML

FoReco and FoRecoML: A Unified Toolbox for Forecast Reconciliation in R

Daniele Girolimetto, Jeroen Rombouts, Ines Wilms, Yangzhuoran Fin Yang

2604.27665 2026-05-01 math.ST math.PR stat.TH

A note on estimation of quarticity based on spot volatility

Yi Guo

2604.27603 2026-05-01 stat.CO

Martingale Posteriors for Discretely Observed Diffusions

Jingning Yao, Ajay Jasra, Sheng Jiang

2604.27409 2026-05-01 stat.ME stat.AP

Robust inference methods of diagnostic test accuracy meta-analysis for influential outlying studies via density power divergence

Kotaro Sasaki, Hisashi Noma, Theodoros Evrenoglou

Comments 20 pages with 4 figures

2604.27394 2026-05-01 stat.ML cs.LG

Bayesian X-Learner: Calibrated Posterior Inference for Heterogeneous Treatment Effects under Heavy-Tailed Outcomes

Eichi Uehara

Comments 47 pages, 7 figures, 25 tables. Code: https://github.com/EichiUehara/bayesian-X-Leaner. Prepared for submission to TMLR

2604.27338 2026-05-01 stat.AP

Estimating Population Viral Load Contextual Exposure Using GPS-Derived Activity Spaces in Rural South Africa

Zhaoxing Wu, Haoyang Wu, Thulile Mathenjwa, Elphas Okango, Khai Hoan Tram, Margot Otto, Maxime Inghels, Paul Mee, Diego Cuadros, Hae-Young Kim, Till Bärnighausen, Frank Tanser, Adrian Dobra

Comments 22 pages, 5 figures

2604.27305 2026-05-01 stat.ME

Inference on Generalized Latent Variable Models with High-Dimensional Responses and Covariates

Jing Ouyang, Chengyu Cui, Yunxiao Chen, Kean Ming Tan, Gongjun Xu

2604.27282 2026-05-01 cs.CY cs.LG stat.AP

The Likelihood Ratio Wall: Structural Limits on Accurate Risk Assessment for Rare Violence

Marco Pollanen

Comments 16 pages, 2 figures, 8 tables. Accepted to the 2026 ACM Conference on Fairness, Accountability, and Transparency (FAccT '26)

2604.27280 2026-05-01 cs.LG stat.ME

Predicting Covariate-Driven Spatial Deformation for Nonstationary Gaussian Processes

Minghao Gu, Weizhi Lin, Qiang Huang

2604.27243 2026-05-01 stat.AP

Estimating Decision Uncertainty from Preference Uncertainty: Application to Ground Vehicle Design

Chia-Ruei Liu, Yongjia Song, Qiong Zhang, Cameron Turner

2604.27242 2026-05-01 math.PR math.ST stat.TH

Statistical Inference for Homogenization Limits Driven by Wiener or Hermite Processes

Pablo Ramses Alonso-Martin

Comments 43 pages. Comments are welcome

2604.27198 2026-05-01 stat.AP stat.ME

Bayesian Nonparametric Causal Inference for Quantile Residual Life: An Application to Alzheimer's Disease

Woojung Bae, Taekwon Hong, Sang Kyu Lee, Dongrak Choi, Jong-Hyeon Jeong

2604.27196 2026-05-01 math.ST stat.TH

Technical Note on Relating Scores of Tilted Distributions

Curtis McDonald

2604.27191 2026-05-01 stat.ME cs.LG stat.ML

Linear Models, Variable Selection, Artificial Intelligence

Riyadh Alrawkan, Edward Boone, Ryad Ghanam, Anton Westveld

2604.27025 2026-05-01 stat.ML cs.LG

SCOPE-FE: Structured Control of Operator and Pairwise Exploration for Feature Engineering

Minhee Park, Seongyeon Son, Yonghyun Lee, Eunchan Kim

2604.27017 2026-05-01 eess.IV cs.LG stat.ML

Validating the Clinical Utility of CineECG 3D Reconstructions through Cross-Modal Feature Attribution

Karol Dobiczek, Maciej Mozolewski, Szymon Bobek, Michał Szafarczyk, Peter van Dam, Grzegorz J. Nalepa

Comments Accepted to the CompHealth workshop at the 26th International Conference on Computational Science

2604.26992 2026-05-01 math.ST stat.ME stat.ML stat.TH

Adaptive Robust Confidence Intervals in Efron's Gaussian Two-Groups Model

Qiaosen Wang, Shuwen Chai, Chao Gao

Details

Abstract

Robust uncertainty quantification is increasingly important in modern data analysis and is often formalized under Huber's model, which allows an $\varepsilon$-fraction of arbitrary corruptions. In many experimental sciences, however, the measurement protocol is well controlled, and contamination is more plausibly introduced upstream. Motivated by this noise-oblivious nature of adversaries, we study confidence intervals for the null location parameter $\theta$ in Efron's Gaussian two-groups model, where an unknown fraction $\varepsilon$ of observations have arbitrarily shifted means, but all samples share the same law of additive Gaussian measurement noise with variance $\sigma^2$. We characterize the minimax-optimal length among confidence intervals with a prescribed coverage level uniformly over the unknown contamination proportion and all noise-oblivious adversaries. Although prior work has shown that the minimax point estimation rate of $\theta$ does not deteriorate when $\varepsilon$ becomes unknown, our results reveal that, with a given $\sigma^2$, the minimax-optimal length of confidence intervals that are adaptive to unknown $\varepsilon$ is of order $\sigma(n^{-1/4}+\varepsilon^{1/2}/\max\{1, \log(en \varepsilon^2)\}^{1/2})$, which is polynomially worse than the optimal length when $\varepsilon$ is known. When the variance $\sigma^2$ is also unknown, we show a further degradation: no adaptive robust confidence interval can be shorter than $\Omega(\sigma n^{-1/8})$. Algorithmically, we introduce a Fourier-based certification procedure built on Carathéodory's positive-semidefiniteness constraints. By scanning candidate points and accepting those whose residual characteristic function is certifiably consistent with a Gaussian location mixture, our algorithm attains the minimax lower bound in the known-variance setting and is computable in polynomial time.

2604.26983 2026-05-01 cs.IR cs.LG stat.ML

Value-Aware Product Recommendation by Customer Segmentation using a suitable High-Dimensional Similarity Measure

María Florencia Acosta, Rodrigo García Arancibia, Pamela Llop, Mariel Lovatto, Lucas Mansilla

2604.26973 2026-05-01 cs.NE cs.LG stat.CO

MAEO: Multiobjective Animorphic Ensemble Optimization for Scalable Large-scale Engineering Applications

Omer F. Erdem, Dean Price, Paul Seurin, Majdi I. Radaideh

Comments 33 pages, 9 figures, 5 tables, under peer review

Details

Abstract

Multiobjective optimization remains challenging for many scientific and engineering problems due to the need to balance convergence, diversity, and computational efficiency across high-dimensional objective landscapes. This work presents the Multiobjective Animorphic Ensemble Optimization (MAEO) framework, a parallelizable ensemble strategy that unifies state-of-the-art evolutionary algorithms within an island-based architecture, overcoming the limitations of relying on a single optimizer, as implied by the No Free Lunch theorem. MAEO uses a parameter-free hypervolume indicator for island performance assessment and a strict Pareto-rank-based individual scoring formulation that incorporates crowding distance and nadir-point proximity to ensure consistent selection pressure within each front. The framework is initiated using four algorithms (NSGA-III, CTAEA, AGEMOEA2, SPEA2) and evaluated through extensive benchmarking on 12 DTLZ/ZDT functions under 36 dimensionality settings using Wilcoxon signed-rank tests with both hypervolume and inverse generational distance metrics. Results show that MAEO achieves balanced convergence-diversity performance, outperforming or matching some of the leading multiobjective optimization algorithms across different benchmark problems. To demonstrate practical applicability, MAEO is applied to the equilibrium-cycle optimization of a small modular nuclear reactor. Eight discrete design variables and three objectives (levelized cost of electricity, peak soluble boron concentration, fuel cycle length) are optimized under two safety constraints. The algorithm carried out roughly 40,000 evaluations using computer simulations. MAEO identifies core designs that lower both the levelized cost of electricity and the peak boron concentration, while preserving fuel cycle length and meeting all safety constraints.

2604.24587 2026-05-01 stat.AP

Bayesian inference for hidden Markov models under genuine multimodality with application to ecological time series

Marco A. Gallegos-Herrada, Vianey Leos-Barajas, Jeffrey S. Rosenthal

Comments 37 pages, 11 figures, to be submitted to Bayesian Analysis, corrected author affiliations

2604.22200 2026-05-01 astro-ph.GA stat.AP

Formalizing Galaxy Population Evolution: Drift and Mergers as Transport Processes on Manifolds

Tsutomu T. Takeuchi

Comments 31 pages, 3 figures, to be submitted

2604.11119 2026-05-01 stat.ML cs.LG

DDO-RM: Distribution-Level Policy Improvement after Reward Learning

Tiantian Zhang, Jierui Zuo, Michael Chen, Wenping Wang

Comments 8 pages, 4 figures

2604.08632 2026-05-01 cs.CR cs.NI stat.AP

Why Network Segmentation Projects Fail

Rohit Dube

2603.13566 2026-05-01 stat.ML cs.LG

EmDT: Embedding Diffusion Transformer for Tabular Data Generation in Fraud Detection

En-Ya Kuo, Sebastien Motsch

Comments Updated the first page to include the IEEE submission notice required for previously posted electronic preprint versions

2603.10252 2026-05-01 stat.ML cs.LG physics.data-an stat.ME

Bayesian Hierarchical Models and the Maximum Entropy Principle

Brendon J. Brewer

Comments 6 pages, 2 figures. To appear in the proceedings of the 44th International Workshop on Bayesian Inference and Maximum Entropy Methods in Science and Engineering (MaxEnt 2025), held in Auckland, New Zealand

2602.10125 2026-05-01 cs.SI cs.NI stat.AP

How segmented is my network?

Rohit Dube

Comments 5 Tables, 5 Figures

2602.07915 2026-05-01 cs.LG cs.AI stat.ME stat.ML

CausalCompass: Evaluating the Robustness of Time-Series Causal Discovery in Misspecified Scenarios

Huiyang Yi, Xiaojian Shen, Yonggang Wu, Duxin Chen, He Wang, Wenwu Yu

Comments Major revision from the previous version

2601.05052 2026-05-01 cs.LG stat.ML

DeepWeightFlow: Re-Basined Flow Matching for Generating Neural Network Weights

Saumya Gupta, Scott Biggs, Moritz Laber, Zohair Shafi, Robin Walters, Ayan Paul

Comments 25 pages, 20 tables, 2 figures

2511.02258 2026-05-01 stat.ML cs.LG math.PR math.ST stat.TH

Limit Theorems for Stochastic Gradient Descent in High-Dimensional Single-Layer Networks

Parsa Rangriz

2510.19110 2026-05-01 stat.ML cs.LG stat.AP

Signature Kernel Scoring Rule: A Spatio-Temporal Diagnostic for Probabilistic Weather Forecasting

Archer Dodson, Ritabrata Dutta

2509.16115 2026-05-01 econ.EM stat.AP

A Korean Macroeconomic Database for Data-Rich Policy Analysis and U.S.–Korea Dependence

Changryong Baek, Seunghyun Moon, Seunghyeon Lee

2505.24259 2026-05-01 stat.ME

Partially-shared Imaging Regression on Integrating Heterogeneous Brain-Cognition Associations across Alzheimer's Diagnoses

Yang Sui, Qi Xu, Ting Li, Yang Bai, Annie Qu

2504.19342 2026-05-01 stat.ML cs.LG stat.ME

Contextual Online Uncertainty-Aware Preference Learning for Human Feedback

Nan Lu, Ethan Lee, Ethan X. Fang, Junwei Lu

2503.03065 2026-05-01 stat.ME

Meta-analysis of median survival times with inverse-variance weighting

Sean McGrath, Cheng-Han Yang, Jonathan Kimmelman, Omer Ozturk, Russell Steele, Andrea Benedetti

Details

DOI: 10.1002/sim.70533
Journal ref: Stat. Med. 45 (2026) e70533

Abstract

We consider the problem of meta-analyzing outcome measures based on median survival times. Primary studies with time-to-event outcomes often report estimates of median survival times and confidence intervals based on the Kaplan-Meier estimator. However, outcome measures based on median survival are rarely meta-analyzed, as standard inverse-variance weighted methods require within-study standard errors that are typically not reported. In this article, we consider an inverse-variance weighted approach to meta-analyze median survival times that estimates the within-study standard errors from the reported confidence intervals. We show that this method consistently estimates the standard error of median survival when applied to confidence intervals constructed by the Brookmeyer-Crowley method. We conduct a series of simulation studies evaluating the performance of this approach at the study level (i.e., for estimating the standard error of median survival) and the meta-analytic level (i.e., for estimating the pooled median, difference of medians, and ratio of medians) for commonly used confidence intervals for median survival, including the Brookmeyer-Crowley method and nonparametric bootstrap. We find that this approach often performs comparably to a benchmark approach that uses the true within-study standard errors for meta-analyzing median-based outcome measures when within-study sample sizes are moderately large (e.g., above 50). However, when the effective sample sizes are small, the method can yield biased estimates of within-study standard errors. We illustrate an application of this approach in a meta-analysis evaluating survival benefits of being assigned to experimental arms versus comparator arms in randomized trials for non-small cell lung cancer therapies.
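The general idea in the abstract above can be sketched in a few lines of Python. This is a minimal illustration and not the authors' implementation: it assumes the reported interval is a symmetric Wald-type interval, so the standard error is recovered as the interval width divided by twice the normal quantile, and it assumes a fixed-effect inverse-variance pooled estimate. The function names `se_from_ci` and `pool_inverse_variance` are hypothetical.

```python
import math
from statistics import NormalDist


def se_from_ci(lower, upper, level=0.95):
    """Approximate a standard error from a reported confidence interval.

    Assumption (not taken from the paper): the interval behaves like a
    symmetric Wald interval, so its width is 2 * z * SE.
    """
    z = NormalDist().inv_cdf(0.5 + level / 2)
    return (upper - lower) / (2 * z)


def pool_inverse_variance(estimates, standard_errors):
    """Fixed-effect inverse-variance pooling of study-level estimates.

    Returns the pooled estimate and its standard error.
    """
    weights = [1.0 / se**2 for se in standard_errors]
    total = sum(weights)
    pooled = sum(w * m for w, m in zip(weights, estimates)) / total
    return pooled, math.sqrt(1.0 / total)
```

For example, given hypothetical study medians with reported 95% intervals, one would first call `se_from_ci` on each interval and then pass the medians and the resulting standard errors to `pool_inverse_variance`. A random-effects variant would add a between-study variance term to each weight.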

2502.14698 2026-05-01 cs.LG cs.AI stat.AP stat.ML

General Uncertainty Estimation with Delta Variances

Simon Schmitt, John Shawe-Taylor, Hado van Hasselt

2502.07189 2026-05-01 cs.LG stat.ML

Exploring Vision Neural Network Pruning via Screening Methodology

Mingyuan Wang, Yangzi Guo, Sida Liu, Yuhang Liu

2412.11136 2026-05-01 stat.ME stat.ML

Minimax Regret Estimation for Generalizing Heterogeneous Treatment Effects with Multisite Data

Yi Zhang, Melody Huang, Kosuke Imai

2412.05135 2026-05-01 stat.ML cs.LG stat.CO

The Polynomial Stein Discrepancy for Assessing Moment Convergence

Narayan Srinivasan, Matthew Sutton, Christopher Drovandi, Leah F South

Comments 17 Pages, 14 Figs

2408.02679 2026-05-01 cs.LG cs.GR cs.HC stat.ME

Visual Analysis of Multi-outcome Causal Graphs

Mengjie Fan, Jinlu Yu, Daniel Weiskopf, Nan Cao, Huai-Yu Wang, Liang Zhou

2407.16212 2026-05-01 stat.ME cs.NA math.NA stat.CO

Optimal experimental design: Formulations and computations

Xun Huan, Jayanth Jagalur, Youssef Marzouk

Comments Appears in Acta Numerica 2024. Some corrections and clarifications in this version

Details

DOI: 10.1017/S0962492924000023
Journal ref: Acta Numerica, Volume 33, July 2024, pp. 715-840

Abstract

Questions of 'how best to acquire data' are essential to modeling and prediction in the natural and social sciences, engineering applications, and beyond. Optimal experimental design (OED) formalizes these questions and creates computational methods to answer them. This article presents a systematic survey of modern OED, from its foundations in classical design theory to current research involving OED for complex models. We begin by reviewing criteria used to formulate an OED problem and thus to encode the goal of performing an experiment. We emphasize the flexibility of the Bayesian and decision-theoretic approach, which encompasses information-based criteria that are well-suited to nonlinear and non-Gaussian statistical models. We then discuss methods for estimating or bounding the values of these design criteria; this endeavor can be quite challenging due to strong nonlinearities, high parameter dimension, large per-sample costs, or settings where the model is implicit. A complementary set of computational issues involves optimization methods used to find a design; we discuss such methods in the discrete (combinatorial) setting of observation selection and in settings where an exact design can be continuously parameterized. Finally, we present emerging methods for sequential OED that build non-myopic design policies, rather than explicit designs; these methods naturally adapt to the outcomes of past experiments in proposing new experiments, while seeking coordination among all experiments to be performed. Throughout, we highlight important open questions and challenges.

2407.08668 2026-05-01 stat.ML cs.LG

Modeling Spatial Extremal Dependence of Precipitation Using Distributional Neural Networks

Christopher Bülte, Lisa Leimenstoll, Melanie Schienle

2405.15952 2026-05-01 stat.CO math.ST stat.TH

Theoretical guarantees for lifted samplers

Philippe Gagnon, Florian Maire

Details

DOI: 10.1016/j.spa.2026.104937
Journal ref: Stochastic Processes and their Applications, 199, 1-26 (2026)

Abstract

Lifted samplers form a class of Markov chain Monte Carlo methods which has drawn a lot of attention in recent years due to superior performance in challenging Bayesian applications. A canonical example of lifted samplers is the one that is derived from a random walk Metropolis algorithm for a totally-ordered state space such as the integers or the real numbers. The lifted sampler is derived by splitting the proposal distribution into two: one part in the increasing direction, and the other part in the decreasing direction. It keeps following a direction until a rejection occurs, upon which it flips the direction. In terms of asymptotic variances, it outperforms the random walk Metropolis algorithm, regardless of the target distribution, at no additional computational cost. Other studies show, however, that beyond this simple case, lifted samplers do not always outperform their Metropolis counterparts. In this paper, we leverage the celebrated work of Tierney (1998) to provide an analysis in a general framework encompassing a broad class of lifted samplers. Our finding is that, essentially, the asymptotic variances cannot increase by a factor of more than 2, regardless of the target distribution, the way the directions are induced, and the type of algorithm from which the lifted sampler is derived (be it a Metropolis-Hastings algorithm, a reversible jump algorithm, etc.). This result indicates that, while there is potentially a lot to gain from lifting a sampler, there is not much to lose.
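The canonical example described in the abstract above can be sketched for a target on the integers. This is a minimal illustration under stated assumptions and not the authors' code: the proposal is a unit step in the current direction, the target is supplied as an unnormalized log-probability, and the function name `lifted_sampler` and its parameters are hypothetical.

```python
import math
import random


def lifted_sampler(log_target, x0, n_steps, seed=0):
    """Lifted random walk Metropolis on the integers (illustrative sketch).

    The state is (x, direction). Each step proposes x + direction,
    accepts with probability min(1, target(y) / target(x)), and flips
    the direction only when the proposal is rejected.
    Assumes log_target(x0) is finite.
    """
    rng = random.Random(seed)
    x = x0
    direction = rng.choice([-1, 1])
    samples = []
    for _ in range(n_steps):
        y = x + direction
        log_ratio = log_target(y) - log_target(x)
        # Accept with probability min(1, exp(log_ratio)).
        if rng.random() < math.exp(min(0.0, log_ratio)):
            x = y  # keep moving in the same direction
        else:
            direction = -direction  # rejection: reverse direction
        samples.append(x)
    return samples
```

With a uniform target on {0, ..., 9}, for instance `lambda k: 0.0 if 0 <= k <= 9 else float("-inf")`, every interior proposal is accepted, so the chain sweeps back and forth across the support instead of diffusing as a plain random walk would.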

2207.11890 2026-05-01 econ.EM stat.ME

Misclassification in Difference-in-differences Models

Augustine Denteh, Désiré Kédagni