arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.15288 2026-04-17 math.ST stat.TH

Generalization of Pearl's Front-Door Criterion

Carol Wu, Elina Robeva

详情

英文摘要

Pearl's front-door criterion provides a set of sufficient conditions for estimating the total causal effect from observational data in the presence of latent confounding, using the functional P(y | do(x := x*)) = \sum_z P(z | x*) \sum_x P(y | x, z) P(x). An open question is whether these conditions can be generalized to be both necessary and sufficient for the validity of this functional, similar to the generalization achieved for the back-door adjustment criterion by Shpitser. In this paper, we present a new, weakened set of graph-based conditions sufficient for the front-door formula to estimate the total causal effect, expanding the scope of problems amenable to front-door identification.

URL PDF HTML ☆

赞 0 踩 0

2604.15285 2026-04-17 stat.ML cs.LG math.ST stat.TH

Structural interpretability in SVMs with truncated orthogonal polynomial kernels

Víctor Soto-Larrosa, Nuria Torrado, Edmundo J. Huertas

2604.15269 2026-04-17 quant-ph cs.LG math.ST stat.TH

Cloning is as Hard as Learning for Stabilizer States

Nikhil Bansal, Matthias C. Caro, Gaurav Mahajan

Comments 10 + 33 + 8 pages

2604.15230 2026-04-17 stat.AP

On the robustness of Mann-Kendall tests used to forecast critical transitions

Tristan Gamot, Nils Thibeau--Sutre, Tom J. M. Van Dooren

Comments 26 pages including appendices, 10 figures, 2 tables

2604.15229 2026-04-17 math.ST stat.ME stat.TH

On a Probability Inequality for Order Statistics with Applications to Bootstrap, Conformal Prediction, and more

Manit Paul, Arun Kumar Kuchibhotla

Comments 65 pages, 10 figures

2604.15217 2026-04-17 stat.ME

A Bayesian Approach to Unit-level Dependent Multi-type Survey Data

Zewei Kong, Paul A. Parker, Jonathan R. Bradley, Scott H. Holan

Comments 28 pages, 2 figures. Submitted to Journal of Survey Statistics and Methodology

2604.15114 2026-04-17 stat.ML cs.AI cs.LG

Amortized Optimal Transport from Sliced Potentials

Minh-Phuc Truong, Khai Nguyen

Comments 26 pages, 11 figures, 10 tables

2604.15107 2026-04-17 stat.ML cs.LG

MinShap: A Modified Shapley Value Approach for Feature Selection

Chenghui Zheng, Garvesh Raskutti

2604.15106 2026-04-17 stat.ME

Cellwise Robust Twoblock Dimension Reduction

Sven Serneels

2604.15104 2026-04-17 stat.ME

On the Conservativeness of Robust Variance Estimators in Propensity Score Weighted Cox Models

Hiroya Morita, Shunichiro Orihara, Fumitaka Shimizu, Masataka Taguri

Comments 19 pages, 4 table

2604.15070 2026-04-17 stat.ME

Adaptive Multi-Prior Lasso for High-Dimensional Generalized Linear Models

Fuzhi Xu, Weijuan Liang, Shuangge Ma, Qingzhao Zhang

Comments 23 pages, 3 figures, 2 tables

2604.15067 2026-04-17 stat.AP stat.ME

Capturing Aleatoric Uncertainty in Climate Models

Cornelia Gruber, Henri Funk, Magdalena Mittermeier, Helmut Küchenhoff, Göran Kauermann

2604.15064 2026-04-17 stat.ME

Ranked-choice conjoint experiments

Thomas S. Robinson, Mats Ahrenshop, Spyros Kosmidis

2604.15061 2026-04-17 math.ST stat.TH

On general weighted cumulative residual (past) extropy of extreme order statistics

Santosh Kumar Chaudhary, Sarikul Islam, Nitin Gupta

2604.14975 2026-04-17 stat.CO cs.NA math.NA stat.AP stat.ML

Theta-regularized Kriging: Modelling and Algorithms

Xuelin Xie, Xiliang Lu

2604.14971 2026-04-17 stat.AP

Mapping Subnational Vulnerability to Inadequate Micronutrient Intake using a Bayesian Small Area Estimation Framework

Sahoko Ishida, Mohammed Osman, Ziyao Cui, Uchenna Agu, Emily Becher, Gabriel Battcock, Daniel Hernandez, Duccio Piovani, Frances Knight, Seth Flaxman, Kevin Tang

详情

英文摘要

Inadequate dietary micronutrient intake is a significant risk factor for deficiency and remains a major global health challenge. Nutrition programmes and interventions are most effective when targeted to populations at greatest risk. Household Consumption and Expenditure Surveys (HCES) are a widely available source of dietary data; however, they are often not powered for estimation below the first administrative level, limiting their utility for geographically targeted interventions. To address this, we applied Bayesian Small Area Estimation (SAE) methods to estimate the prevalence of apparent inadequate intake at the second administrative level. Three approaches were considered: a cluster level Beta binomial model and two area level models (mean smoothing and joint smoothing). Models were evaluated using a Rwanda HCES survey that supports inference at this scale. All models were implemented in a fully Bayesian framework to propagate uncertainty. Simulation results in Rwanda showed that the cluster level Beta binomial model achieved the strongest performance, while the area level joint smoothing model was the most reliable alternative among models accounting for survey design. Based on these results, models were applied to Senegal and Nigeria. In Senegal, second administrative level estimates captured meaningful subnational variation, reduced uncertainty relative to direct estimates, and remained consistent with first administrative level benchmarks. In Nigeria, despite smaller sample sizes and survey design constraints, modelled estimates reduced extreme uncertainty and showed good agreement with first administrative level estimates. This study demonstrates that Bayesian SAE methods can be applied to HCES data to generate reliable fine scale estimates of inadequate micronutrient intake, supporting localised nutrition interventions.

URL PDF HTML ☆

赞 0 踩 0

2604.14908 2026-04-17 cs.LG cs.SY eess.SY stat.ML

Multi-User mmWave Beam and Rate Adaptation via Combinatorial Satisficing Bandits

Emre Özyıldırım, Barış Yaycı, Umut Eren Akturk, Cem Tekin

2603.06431 2026-04-17 math.NA cs.LG cs.NA stat.ML

Certified and accurate computation of function space norms of deep neural networks

Johannes Gründler, Moritz Maibaum, Philipp Petersen

2603.02196 2026-04-17 cs.AI cs.LG math.ST stat.ML stat.TH

Conformal Policy Control

Drew Prinster, Clara Fannjiang, Ji Won Park, Kyunghyun Cho, Anqi Liu, Suchi Saria, Samuel Stanton

2602.06930 2026-04-17 cs.LG math.OC math.ST stat.ML stat.TH

Continuous-time reinforcement learning: ellipticity enables model-free value function approximation

Wenlong Mou

Comments update from previous version: removed unnecessarily strong requirement on discount rate

2505.07427 2026-04-17 stat.AP cs.CE

Value of Information-based assessment of strain-based thickness loss monitoring in ship hull structures

Nicholas E. Silionis, Konstantinos N. Anyfantis

Comments 39 pages, 18 figures, Preprint submitted to journal

2407.05790 2026-04-17 stat.CO stat.ML

Kinetic Interacting Particle Langevin Monte Carlo

Paul Felix Valsecchi Oliva, O. Deniz Akyildiz

2604.14860 2026-04-17 stat.ML cs.LG

Best of both worlds: Stochastic & adversarial best-arm identification

Yasin Abbasi-Yadkori, Peter L. Bartlett, Victor Gabillon, Alan Malek, Michal Valko

Comments Published in Conference on Learning Theory (COLT 2018)

2604.14810 2026-04-17 stat.ML cs.LG stat.CO

Scalable Model-Based Clustering with Sequential Monte Carlo

Connie Trojan, Pavel Myshkov, Paul Fearnhead, James Hensman, Tom Minka, Christopher Nemeth

Comments Accepted at AISTATS 2026. 31 pages, 20 figures

2604.14809 2026-04-17 stat.ML cs.LG stat.AP

Expert-Guided Class-Conditional Goodness-of-Fit Scores for Interpretable Classification with Informative Missingness: An Application to Seismic Monitoring

Shahar Cohen, David M. Steinberg, Yael Radzyner, Yochai Ben Horin

Comments 50 pages, 8 figures

2604.14702 2026-04-17 cs.LG stat.ML

Gating Enables Curvature: A Geometric Expressivity Gap in Attention

Satwik Bathula, Anand A. Joshi

Comments 41 pages, 9 figures

2604.14669 2026-04-17 cs.LG math.DS math.OC stat.ML

Zeroth-Order Optimization at the Edge of Stability

Minhak Song, Liang Zhang, Bingcong Li, Niao He, Michael Muehlebach, Sewoong Oh

Comments 38 pages

2604.14657 2026-04-17 stat.AP

Evacuation destination choices during Hurricane Ian: A direct demand modeling approach

Alessandra Recalde, Luyu Liu, Xiaojian Zhang, Sangung Park, Shangkun Jiang, Xilei Zhao

2604.14649 2026-04-17 stat.ME math.ST stat.TH

Model Checking for Regressions Based on Weighted Residual Processes with Diverging Number of Predictors

Yue Hu, Haiqi Li, Xintao Xia

2604.14587 2026-04-17 cs.LG math.OC stat.ML

CLion: Efficient Cautious Lion Optimizer with Enhanced Generalization

Feihu Huang, Guanyi Zhang, Songcan Chen

Comments 30 pages

2604.14571 2026-04-17 stat.ME stat.CO

Bayesian sparse principal coordinates analysis with delta-tolerant linear approximation for microbiome data

Hsin-Hsiung Huang, Ruitao Liu, Liangliang Zhang, Shao-Hsuan Wang

2604.14534 2026-04-17 cs.LG stat.AP

An unsupervised decision-support framework for multivariate biomarker analysis in athlete monitoring

Fernando Barcelos Rosito, Sebastião De Jesus Menezes, Simone Ferreira Sturza, Adriana Seixas, Muriel Figueredo Franco

Comments 15 pages, 4 figures, 3 tables, submitted to Springer Nature Scientific Reports

2604.14517 2026-04-17 stat.ME

Bayesian Node-Level Outlier Detection for Graph Signals

Seongmin Kim, Kyusoon Kim

Comments 35 pages, 4 figures

2604.14498 2026-04-17 cs.AI cs.LG stat.ML

Improving Machine Learning Performance with Synthetic Augmentation

Mel Sohm, Charles Dezons, Sami Sellami, Oscar Ninou, Axel Pincon

2604.14497 2026-04-17 cs.CE stat.AP

Robust Optimal Experimental Design Accounting for Sensor Failure

Rebekah White, Chandler Smith, Drew Kouri, Jace Ritchie, Wilkins Aquino, Timothy Walsh

2604.14482 2026-04-17 math.NT math.CA math.ST stat.TH

Arithmetic functions and learning theory

W. Burstein, A. Iosevich, A. Sant

2604.14407 2026-04-17 stat.ME

Propensity Score Weighting to Ensure Balance in Key Subgroups or Strata: A Practical Guide

Emma K. Mackay, Amol A. Verma, Fahad Razak, Surain B. Roberts

Comments 15 pages, 1 figure

2604.14404 2026-04-17 math.ST stat.ME stat.ML stat.TH

Early-stopped aggregation: Adaptive inference with computational efficiency

Ilsang Ohn, Shitao Fan, Jungbin Jun, Lizhen Lin

2604.14394 2026-04-17 econ.EM math.ST stat.TH

Generalized Autoregressive Multivariate Models: From Binary to Poisson

Anna Bykhovskaya, Nour Meddahi

Comments 39 pages

2604.14370 2026-04-17 stat.ME cs.LG

Deployment of AI-Assisted Interventions: Capacity Constraints and Noisy Compliance

Carri W. Chan, Yi Han, Hannah Li, Benjamin L. Ranard

2604.14364 2026-04-17 stat.AP

Joint Bayesian Inference of Genetic Effect Sizes and PK Parameters in Nonlinear Mixed-Effects Models

Julien Martinelli, Ibtissem Rebai, David W. Haas, Julie Bertrand

2604.14352 2026-04-17 stat.ME cs.LG stat.AP

PROXIMA: A Reliability Scoring Framework for Proxy Metrics in Online Controlled Experiments

Avinash Amudala

Comments 14 pages. Sole-author submission. Independent research. Companion code at https://github.com/Avinash-Amudala/PROXIMA. Zenodo archive: 10.5281/zenodo.15483241. Related US provisional patent application: 63/974,569 (filed Feb 3, 2026)

2604.14338 2026-04-17 cs.LG stat.ML

Path-Sampled Integrated Gradients

Firuz Kamalov, Fadi Thabtah, R. Sivaraj, Neda Abdelhamid

2604.14331 2026-04-17 cs.LG stat.ML

Heat and Matérn Kernels on Matchings

Dmitry Eremeev, Salem Said, Viacheslav Borovitskiy

2604.14305 2026-04-17 stat.ME cs.LG q-bio.GN stat.AP

Combining Bayesian and Frequentist Inference for Laboratory-Specific Performance Guarantees in Copy Number Variation Detection

Austin Talbot, Alex V. Kotlar, Yue Ke

2604.14257 2026-04-17 econ.GN q-fin.EC stat.AP

Mapping the causal structure of price formation in Texas's transitioning electricity market

Shiva Madadkhani, Nils Sturma, Mathias Drton, Svetlana Ikonnikova

2604.14249 2026-04-17 cs.LG stat.ML

Metric-Aware Principal Component Analysis (MAPCA):A Unified Framework for Scale-Invariant Representation Learning

Michael Leznik

Comments 12 pages , one figure

2604.14230 2026-04-17 stat.AP

A Statistical Market-Design Framework for Academic Job Markets

Ali Kaazempur-Mofrad, Xiaowu Dai, Xuming He

2604.14209 2026-04-17 cs.LG cs.AI stat.ML

Towards Verified and Targeted Explanations through Formal Methods

Hanchen David Wang, Diego Manzanas Lopez, Preston K. Robinette, Ipek Oguz, Taylor T. Johnson, Meiyi Ma

Comments Paper has been accepted at JAIR

2604.14206 2026-04-17 cs.LG q-fin.PM stat.ML

Portfolio Optimization Proxies under Label Scarcity and Regime Shifts via Bayesian and Deterministic Students under Semi-Supervised Sandwich Training

Adhiraj Chattopadhyay

Comments 18 pages of main text. 10 pages of appendices. 35 references. Around 13 figures

2604.14182 2026-04-17 stat.ME stat.ML

Cellwise Outliers

Mia Hubert, Jakob Raymaekers, Peter J. Rousseeuw

Comments This is a review paper

2604.14181 2026-04-17 math.ST stat.TH

A note on kernel density estimators with optimal bandwidths

Nils Lid Hjort, Stephen G. Walker

Comments 8 pages, 0 figures. Statistical Research Report, Department of Mathematics, University of Oslo, from June 2000, but arXiv'd April 2026. The papers is pubished in essentially this form in Statistics & Probabiity Letters, 2001, vol. 54, pages 153-159, at this url: https://www.sciencedirect.com/science/article/pii/S016771520100027X

2604.14176 2026-04-17 cs.LG cs.AI stat.ML

The Devil Is in Gradient Entanglement: Energy-Aware Gradient Coordinator for Robust Generalized Category Discovery

Haiyang Zheng, Nan Pu, Yaqi Cai, Teng Long, Wenjing Li, Nicu Sebe, Zhun Zhong

Comments Accepted by CVPR26

2604.13861 2026-04-17 cs.LG stat.AP

Simulation-Based Optimisation of Batting Order and Bowling Plans in T20 Cricket

Tinniam V Ganesh

Comments Improved abstract wording and readability; minor textual edits, no change to methodology or results. Submitted to the Journal of Quantitative Analysis in Sports (JQAS), April 2026. 23 pages, 8 figures

2602.10955 2026-04-17 stat.ME stat.AP

Prior Smoothing for Multivariate Disease Mapping Models

Garazi Retegui, María Dolores Ugarte, Jaione Etxeberria, Alan E. Gelfand

2601.14147 2026-04-17 math.OC stat.CO

Gradient flow for finding E-optimal designs

Jieling Shi, Kim-Chuan Toh, Xin T. Tong, Weng Kee Wong

Comments 44 pages, 3 figures

2512.05024 2026-04-17 stat.ME cs.AI cs.LG

Model-Free Assessment of Simulator Fidelity via Quantile Curves

Garud Iyengar, Yu-Shiou Willy Lin, Kaizheng Wang

Comments 39 pages, 15 figures

2511.18107 2026-04-17 cs.LG stat.ML

Active Learning with Selective Time-Step Acquisition for PDEs

Yegon Kim, Hyunsu Kim, Gyeonghoon Ko, Juho Lee

Comments This manuscript is an improvement over the camera-ready version in ICML 2025. We have added a clearer motivation for our acquisition function. (See Sections 2.3 and 3.2)

2510.10260 2026-04-17 math.OC math.PR q-fin.MF stat.ML

Robust Exploratory Stopping under Ambiguity in Reinforcement Learning

Junyan Ye, Hoi Ying Wong, Kyunghyun Park

Comments 31 pages, 9 figures, 1 table

2508.06179 2026-04-17 math.ST stat.TH

Consistency of variational inference for Besov priors in non-linear inverse problems

Shaokang Zu, Junxiong Jia, Zhiguo Wang

Comments 37 pages. arXiv admin note: substantial text overlap with arXiv:2409.18415

2506.18994 2026-04-17 stat.ME stat.ML

Causal Decomposition Analysis with Synergistic Interventions: A Triply-Robust Machine Learning Approach to Addressing Multiple Dimensions of Social Disparities

Soojin Park, Su Yeon Kim, Xinyao Zheng, Chioun Lee

Comments The case study section contains errors due to coding issues. Therefore, I would like to withdraw the paper

2506.13763 2026-04-17 cs.LG cs.AI cs.CV stat.ML

Diagnosing and Improving Diffusion Models by Estimating the Optimal Loss Value

Yixian Xu, Shengjie Luo, Liwei Wang, Di He, Chang Liu

Comments 33 pages, 12 figures, 9 tables. ICLR 2026 Camera Ready version

2506.13139 2026-04-17 stat.ML cs.LG

Random Matrix Theory for Deep Learning: Beyond Eigenvalues of Linear Models

Zhenyu Liao, Michael W. Mahoney

Comments 30 pages, 6 figures, to appear on IEEE Signal Processing Magazine

2506.11251 2026-04-17 stat.ME cs.AI cs.LG

Measuring multi-calibration

Ido Guy, Daniel Haimovich, Fridolin Linder, Nastaran Okati, Lorenzo Perini, Niek Tax, Mark Tygert

Comments 25 pages, 12 tables

2505.07153 2026-04-17 stat.ME

Enhancing Inference for Small Cohorts via Transfer Learning and Weighted Integration of Multiple Datasets

Subharup Guha, Mengqi Xu, Yi Li

2504.20470 2026-04-17 stat.ME

The Promises of Multiple Experiments: Identifying Joint Distribution of Potential Outcomes

Peng Wu, Xiaojie Mao

2503.06538 2026-04-17 stat.ME

Association measures for two-way contingency tables based on multi-categorical proportional reduction in error

Wataru Urasaki, Kouji Tahata, Sadao Tomizawa

2502.01254 2026-04-17 math.ST stat.TH

A necessary and sufficient condition for convergence in distribution of the quantile process in $L^1(0,1)$

Brendan K. Beare, Tetsuya Kaji

Comments 22 pages

2501.11315 2026-04-17 stat.AP q-bio.QM stat.ML

High-dimensional point forecast combinations for emergency department demand

Peihong Guo, Wen Ye Loh, Kenwin Maung, Esther Li Wen Choo, Borame Lee Dickens, Kelvin Bryan Tan, John Abishgenadan, Pei Ma, Jue Tao Lim

详情

DOI: 10.1186/s12873-026-01497-9
Journal ref: BMC Emerg Med 26, 83 (2026)

英文摘要

Current work on forecasting emergency department (ED) admissions focuses on disease aggregates or singular disease types. However, given differences in the dynamics of individual diseases, it is unlikely that any single forecasting model would accurately account for each disease and for all time, leading to significant forecast model uncertainty. Yet, forecasting models for ED admissions to-date do not explore the utility of forecast combinations to improve forecast accuracy and stability. It is also unknown whether improvements in forecast accuracy can be yield from (1) incorporating a large number of environmental and anthropogenic covariates or (2) forecasting total ED causes by aggregating cause-specific ED forecasts. To address this gap, we propose high-dimensional forecast combination schemes to combine a large number of forecasting individual models for forecasting cause-specific ED admissions over multiple causes and forecast horizons. We use time series data of ED admissions with an extensive set of explanatory lagged variables at the national level, including meteorological/ambient air pollutant variables and ED admissions of all 16 causes studied. We show that the simple forecast combinations yield forecast accuracies of around 3.81%-23.54% across causes. Furthermore, forecast combinations outperform individual forecasting models, in more than 50% of scenarios (across all ED admission categories and horizons) in a statistically significant manner. Inclusion of high-dimensional covariates and aggregating cause-specific forecasts to provide all-cause ED forecasts provided modest improvements in forecast accuracy. Forecasting cause-specific ED admissions can provide fine-scale forward guidance on resource optimization and pandemic preparedness and forecast combinations can be used to hedge against model uncertainty when forecasting across a wide range of admission categories.

URL PDF HTML ☆

赞 0 踩 0

2501.09331 2026-04-17 cs.LG stat.ML

Identifying Information from Observations with Uncertainty and Novelty

Derek S. Prijatelj, Timothy J. Ireland, Walter J. Scheirer

Comments 29 pages, 4 figures, 2 table, and 2 inline algorithms

详情

英文摘要

A machine that learns a task from observations must encounter and process uncertainty and novelty, especially when it is to maintain performance when observing new information and to select the hypothesis that best fits the current observations. In this context, some key questions arise: what and how much information did the observations provide, how much information is required to identify the data-generating process, how many observations remain to get that information, and how does a predictor determine that it has observed novel information? We formalize identifying information to answer these questions and synthesize prior works. Identifying information are bits that verify or falsify a hypothesis as the data-generating process. In this formalization, we prove the information theoretic characteristics of the computation of hypothesis identification and the resulting sample complexity. We define hypothesis identification and sample complexity via the computation of an indicator function over a set of hypotheses, bridging algorithmic and probabilistic information. We detail the sample complexity and its properties for data-generating processes ranging from deterministic processes to ergodic stationary stochastic processes, which connect the notion of identifying information in finite steps with asymptotic statistics and PAC-learning. The indicator function's computation naturally formalizes novel information and its identification from observations with respect to a hypothesis set, which detects a misspecified hypothesis set. We also proved that a computable PAC-Bayes learners' sample complexity distribution is determined by its moments in terms of the prior probability distribution over a fixed finite hypothesis set, and thus an approximation of the sample complexity distribution is always computable within the desired precision that resources allow.

URL PDF HTML ☆

赞 0 踩 0

2307.02582 2026-04-17 q-fin.ST math.PR math.ST stat.TH

Estimating the roughness exponent of stochastic volatility from discrete observations of the integrated variance

Xiyue Han, Alexander Schied

Comments 50 pages, 3 figures

2304.08974 2026-04-17 econ.EM stat.ME

Doubly Robust Estimators with Weak Overlap

Yukun Ma, Pedro H. C. Sant'Anna, Yuya Sasaki, Takuya Ura

2301.07386 2026-04-17 q-bio.NC stat.AP

Hierarchical Bayesian inference for community detection and connectivity of functional brain networks

Lingbin Bian, Nizhuan Wang, Leonardo Novelli, Jonathan Keith, Adeel Razi

2104.03436 2026-04-17 math.ST stat.ME stat.TH

Synthetic likelihood in misspecified models

David T. Frazier, Christopher Drovandi, David J. Nott