arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.13207 2026-03-16 math.ST stat.TH

Estimating the Missing Mass, Partition Function or Evidence for a Case of Sampling from a Discrete Set

Bastiaan J. Braams

Comments 20 pages

详情

英文摘要

We consider the problem of estimating the missing mass, partition function or evidence and its probability distribution in the case that for each sample point in the discrete sample space its (unnormalized) probability mass is revealed. Estimating the missing mass or partition function (evidence) is a well-studied problem for which, in different contexts, the harmonic mean estimator and the Good-Turing (and related) estimators are available. For sampling on a discrete set with revealed probability masses these estimators can be Rao-Blackwellized, leading to self-consistent estimators not involving an auxiliary distribution with known total mass. For the case of sampling from a mixture distribution this offers the perspective of anchoring the estimator at both ends: at the diffuse end (high temperature in statistical physics) via an explicit expression for the total probability mass and at the peaked end (low temperature) via the feature of repeated entries in the sample. Estimation is model-free, but to provide a probability distribution for the missing mass or partition function a model is needed for the distribution of mass. We present one such model, identify sufficient reduced statistics, and analyze the model in various ways -- Bayesian, profile likelihood, maximum likelihood and moment matching -- with the objective of eliminating the mathematical (nuisance) parameters for a final expression in terms of the observed data. The most satisfactory (explicit and transparent) result is obtained by a mixed method that combines Bayesian marginalization or profile likelihood optimization for all but one of the parameters with plain maximum likelihood optimization of the final parameter.

URL PDF HTML ☆

赞 0 踩 0

2603.13156 2026-03-16 stat.ME stat.ML

When Your Model Stops Working: Anytime-Valid Calibration Monitoring

Tristan Farran

2603.13009 2026-03-16 stat.ME stat.CO

TwoTimeScales: An R-package for Smoothing Hazards with Two Time Scales

Angela Carollo, Paul H. C. Eilers, Hein Putter, Jutta Gampe

Comments 15 pages, 6 figures

2603.12920 2026-03-16 cs.CL stat.ML

HMS-BERT: Hybrid Multi-Task Self-Training for Multilingual and Multi-Label Cyberbullying Detection

Zixin Feng, Xinying Cui, Yifan Sun, Zheng Wei, Jiachen Yuan, Jiazhen Hu, Ning Xin, Md Maruf Hasan

2603.12893 2026-03-16 cs.CV cs.AI cs.LG cs.NE stat.ML

Finite Difference Flow Optimization for RL Post-Training of Text-to-Image Models

David McAllister, Miika Aittala, Tero Karras, Janne Hellsten, Angjoo Kanazawa, Timo Aila, Samuli Laine

Comments Code available at https://github.com/NVlabs/finite-difference-flow-optimization

2603.11240 2026-03-16 stat.OT

Statistical Methodology Groups in the Pharmaceutical Industry

Jenny Devenport, Tobias Mielke, Mouna Akacha, Kaspar Rufibach, Alex Ocampo, Vivian Lanius, Marc Vandemeulebroecke, Philip Hougaard, Pierre Collin, David Wright, Jurgen Hummel, Cornelia Ursula Kunz, Mike Krams

Comments 39 pages, 2 figures, 1 table

2603.07227 2026-03-16 physics.ao-ph stat.AP

Estimating changes in extreme quantiles over time, applied to desert temperatures

Callum Leach, Kevin Ewans, Philip Jonathan

2512.22587 2026-03-16 cs.LG stat.ML

Structural Incompatibility of Differentiable Sorting and Within-Vector Rank Normalization

Taeyun Kim

Comments 6 pages

2512.11946 2026-03-16 cs.LG cs.AI stat.ML

Global Sensitivity Analysis for Engineering Design Based on Individual Conditional Expectations

Pramudita Satria Palar, Paul Saves, Rommel G. Regis, Koji Shimoyama, Shigeru Obayashi, Nicolas Verstaevel, Joseph Morlier

Comments Published in Aerospace Science and Technology, 2026

详情

DOI: 10.1016/j.ast.2026.112091

英文摘要

Explainable machine learning techniques have gained increasing attention in engineering applications, especially in aerospace design and analysis, where understanding how input variables influence data-driven models is essential. Partial Dependence Plots (PDPs) are widely used for interpreting black-box models by showing the average effect of an input variable on the prediction. However, their global sensitivity metric can be misleading when strong interactions are present, as averaging tends to obscure interaction effects. To address this limitation, we propose a global sensitivity metric based on Individual Conditional Expectation (ICE) curves. The method computes the expected feature importance across ICE curves, along with their standard deviation, to more effectively capture the influence of interactions. We provide a mathematical proof demonstrating that the PDP-based sensitivity is a lower bound of the proposed ICE-based metric under truncated orthogonal polynomial expansion. In addition, we introduce an ICE-based correlation value to quantify how interactions modify the relationship between inputs and the output. Comparative evaluations were performed on three cases: a 5-variable analytical function, a 5-variable wind-turbine fatigue problem, and a 9-variable airfoil aerodynamics case, where ICE-based sensitivity was benchmarked against PDP, SHapley Additive exPlanations (SHAP), and Sobol' indices. The results show that ICE-based feature importance provides richer insights than the traditional PDP-based approach, while visual interpretations from PDP, ICE, and SHAP complement one another by offering multiple perspectives.

URL PDF HTML ☆

赞 0 踩 0

2511.13421 2026-03-16 cs.LG stat.ML

Larger Datasets Can Be Repeated More: A Theoretical Analysis of Multi-Epoch Scaling in Linear Regression

Tingkai Yan, Haodong Wen, Binghui Li, Kairong Luo, Wenguang Chen, Kaifeng Lyu

详情

英文摘要

While data scaling laws of large language models (LLMs) have been widely examined in the one-pass regime with massive corpora, their form under limited data and repeated epochs remains largely unexplored. This paper presents a theoretical analysis of how a common workaround, training for multiple epochs on the same dataset, reshapes the data scaling laws in linear regression. Concretely, we ask: to match the performance of training on a dataset of size $N$ for $K$ epochs, how much larger must a dataset be if the model is trained for only one pass? We quantify this using the \textit{effective reuse rate} of the data, $E(K, N)$, which we define as the multiplicative factor by which the dataset must grow under one-pass training to achieve the same test loss as $K$-epoch training. Our analysis precisely characterizes the scaling behavior of $E(K, N)$ for SGD in linear regression under either strong convexity or Zipf-distributed data: (1) When $K$ is small, we prove that $E(K, N) \approx K$, indicating that every new epoch yields a linear gain; (2) As $K$ increases, $E(K, N)$ plateaus at a problem-dependent value that grows with $N$ ($Θ(\log N)$ for the strongly-convex case), implying that larger datasets can be repeated more times before the marginal benefit vanishes. These theoretical findings point out a neglected factor in a recent empirical study (Muennighoff et al. (2023)), which claimed that training LLMs for up to $4$ epochs results in negligible loss differences compared to using fresh data at each step, \textit{i.e.}, $E(K, N) \approx K$ for $K \le 4$ in our notation. Supported by further empirical validation with LLMs, our results reveal that the maximum $K$ value for which $E(K, N) \approx K$ in fact depends on the data size and distribution, and underscore the need to explicitly model both factors in future studies of scaling laws with data reuse.

URL PDF HTML ☆

赞 0 踩 0

2511.04974 2026-03-16 stat.AP

Estimating Inhomogeneous Spatio-Temporal Background Intensity Functions using Graphical Dirichlet Processes

Isaías Bañales, Tomoaki Nishikawa, Yoshihiro Ito, Manuel J. Aguilar-Velázquez

2510.01930 2026-03-16 stat.ML cond-mat.dis-nn cs.LG

Precise Dynamics of Diagonal Linear Networks: A Unifying Analysis by Dynamical Mean-Field Theory

Sota Nishiyama, Masaaki Imaizumi

Comments 48 pages, accepted at AISTATS 2026 (Spotlight)

2507.14389 2026-03-16 stat.AP econ.EM math.ST stat.ME stat.TH

Spatiotemporal Autoregressive Models for Areal Compositional Data

Matthias Eckardt, Philipp Otto

2506.20021 2026-03-16 stat.ME

Speeding up the ordered allocation sampler

Maria F. Gil-Leyva, Fidel Selva, Pierpaolo De Blasi

Comments Change from v1: added acknowledgment

2502.20114 2026-03-16 stat.CO cond-mat.stat-mech cs.NA math.NA math.PR

Scalability of the second-order reliability method for stochastic differential equations with multiplicative noise

Timo Schorlepp, Tobias Grafke

Comments 59 pages, 9 figures

2501.15194 2026-03-16 cs.LG stat.CO stat.ML

Reliable Pseudo-labeling via Optimal Transport with Attention for Short Text Clustering

Zhihao Yao

Comments arXiv admin comment: This version has been removed by arXiv administrators as the submitter did not have the rights to agree to the license at the time of submission

2410.17046 2026-03-16 stat.ME stat.AP

Mesoscale two-sample testing for networks

Peter W. MacDonald, Elizaveta Levina, Ji Zhu

Comments 59 pages, 9 figures

2406.03821 2026-03-16 stat.AP stat.ME

Bayesian generalized method of moments applied to pseudo-observations in survival analysis

Léa Orsini, Caroline Brard, Emmanuel Lesaffre, Guosheng Yin, David Dejardin, Gwénaël Le Teuff

详情

DOI: 10.1007/s10985-025-09670-1

英文摘要

Bayesian inference for survival regression modeling offers numerous advantages, especially for decision-making and external data borrowing, but demands the specification of the baseline hazard function, which may be a challenging task. We propose an alternative approach that does not need the specification of this function. Our approach combines pseudo-observations to convert censored data into longitudinal data with the Generalized Methods of Moments (GMM) to estimate the parameters of interest from the survival function directly. GMM may be viewed as an extension of the Generalized Estimating Equation (GEE) currently used for frequentist pseudo-observations analysis and can be extended to the Bayesian framework using a pseudo-likelihood function. We assessed the behavior of the frequentist and Bayesian GMM in the new context of analyzing pseudo-observations. We compared their performances to the Cox, GEE, and Bayesian piecewise exponential models through a simulation study of two-arm randomized clinical trials. Frequentist and Bayesian GMM gave valid inferences with similar performances compared to the three benchmark methods, except for small sample sizes and high censoring rates. For illustration, three post-hoc efficacy analyses were performed on randomized clinical trials involving patients with Ewing Sarcoma, producing results similar to those of the benchmark methods. Through a simple application of estimating hazard ratios, these findings confirm the effectiveness of this new Bayesian approach based on pseudo-observations and the generalized method of moments. This offers new insights on using pseudo-observations for Bayesian survival analysis.

URL PDF HTML ☆

赞 0 踩 0

2311.08365 2026-03-16 math.ST stat.TH

Local asymptotics of selection models with applications in Bayesian selective inference

Daniel G. Rasines, G. Alastair Young

Comments 30 pages, 7 figures, 1 table

2311.07733 2026-03-16 stat.ME math.PR

Credible Intervals for Probability of Failure with Gaussian Processes

Aleksei G. Sorokin, Vishwas Rao

2303.07167 2026-03-16 stat.ME stat.AP stat.ML

When Respondents Don't Care Anymore: Identifying the Onset of Careless Responding

Max Welz, Andreas Alfons

2208.13701 2026-03-16 stat.ME cs.LG math.OC stat.ML

Data-Driven Influence Functions for Optimization-Based Causal Inference

Michael I. Jordan, Yixin Wang, Angela Zhou

Comments Revision

2603.12867 2026-03-16 stat.ME

Breaking the Winner's Curse with Bayesian Hybrid Shrinkage

Richard Mudd, Abbas Zaidi, Rina Friedberg, Ilya Gorbachev, Anchal Choubey, Houssam Nassif

2603.12843 2026-03-16 math.ST stat.ME stat.TH

The geometry of Stein's method of moments: A canonical decomposition via score matching

Mitsuki Nagai, Keisuke Yano

2603.12838 2026-03-16 math.OC cs.DC stat.ML

A New Kernel Regularity Condition for Distributed Mirror Descent: Broader Coverage and Simpler Analysis

Junwen Qiu, Ziyang Zeng, Leilei Mei, Junyu Zhang

Comments 25 pages, 4 figures

2603.12780 2026-03-16 math.ST math.PR stat.TH

Functional CLT for general sample covariance matrices

Jian Cui, Zhijun Liu, Jiang Hu, Zhidong Bai

2603.12753 2026-03-16 stat.ME cs.CR

Balancing the privacy-utility trade-off: How to draw reliable conclusions from private data

Raphaël de Fondeville

2603.12734 2026-03-16 stat.ML cs.LG

VecMol: Vector-Field Representations for 3D Molecule Generation

Yuchen Hua, Xingang Peng, Jianzhu Ma, Muhan Zhang

2603.12672 2026-03-16 math.ST stat.TH

Multivariate normality test based on the uniform distribution on the Stiefel manifold

Koki Shimizu, Toshiya Iwashita

2603.12627 2026-03-16 stat.ML cs.IT cs.LG math.IT

Batched Kernelized Bandits: Refinements and Extensions

Chenkai Ma, Keqin Chen, Jonathan Scarlett

2603.12562 2026-03-16 stat.ML cs.CV cs.LG

Variational Garrote for Sparse Inverse Problems

Kanghun Lee, Hyungjoon Soh, Junghyo Jo

Comments 10 pages, 4 figures

2603.12561 2026-03-16 stat.ME

Consistent and powerful CUSUM change-point test for panel data with changes in variance

Wenzhi Yang, Yueting Xu, Xiaoping Shi, Qiong Li

2603.12552 2026-03-16 cs.LG math.OC stat.ML

Asymptotic and Finite-Time Guarantees for Langevin-Based Temperature Annealing in InfoNCE

Faris Chaudhry

Comments Accepted at the Optimization for Machine Learning Workshop (NeurIPS 2025)

2603.12525 2026-03-16 stat.ML cond-mat.dis-nn cs.LG

EB-RANSAC: Random Sample Consensus based on Energy-Based Model

Muneki Yasuda, Nao Watanabe, Kaiji Sekimoto

2603.12523 2026-03-16 stat.ME math.ST stat.TH

Inference for function-on-function regression: central limit theorem and residual bootstrap

Hyemin Yeon

2603.12518 2026-03-16 math.ST stat.ME stat.TH

Gaussian and bootstrap approximations for functional principal component regression

Hyemin Yeon

2603.12448 2026-03-16 stat.CO cs.NA math.NA

Sampling through iterated approximation: Gradient-free and multi-fidelity Bayesian inference via transport

Daniel Sharp, Bart van Bloemen Waanders, Youssef Marzouk

2603.12394 2026-03-16 stat.AP

Spatio-temporal evolution of surface temperature trends in Ghana (1983-2021): a multi-station approach

John Bagiliko, David Stern, Denis Ndanguza

2603.12356 2026-03-16 stat.AP

Modeling diesel output particulate matter as the Ornstein-Uhlenbeck process

Maxwell Bolt, Alex Alberts, Akash S. Desai, Peter Meckl, Ilias Bilionis

Comments 18 pages, 8 figures

2603.12352 2026-03-16 stat.ME

Bayesian Covariate-Varying Interaction Analysis for Multivariate Count Data: Application to Microbiome Studies

Shuangjie Zhang, Michael L. Patnode, Juhee Lee

Comments 33 pages, 1o Figures

2603.12351 2026-03-16 stat.ML cs.LG q-bio.QM stat.CO stat.ME

Probabilistic Joint and Individual Variation Explained (ProJIVE) for Data Integration

Raphiel J. Murden, Ganzhong Tian, Deqiang Qiu, Benajmin B. Risk

2603.12349 2026-03-16 cs.LG cs.AI q-bio.QM stat.ML

Budget-Sensitive Discovery Scoring: A Formally Verified Framework for Evaluating AI-Guided Scientific Selection

Abhinaba Basu, Pavan Chakraborty

详情

英文摘要

Scientific discovery increasingly relies on AI systems to select candidates for expensive experimental validation, yet no principled, budget-aware evaluation framework exists for comparing selection strategies -- a gap intensified by large language models (LLMs), which generate plausible scientific proposals without reliable downstream evaluation. We introduce the Budget-Sensitive Discovery Score (BSDS), a formally verified metric -- 20 theorems machine-checked by the Lean 4 proof assistant -- that jointly penalizes false discoveries (lambda-weighted FDR) and excessive abstention (gamma-weighted coverage gap) at each budget level. Its budget-averaged form, the Discovery Quality Score (DQS), provides a single summary statistic that no proposer can inflate by performing well at a cherry-picked budget. As a case study, we apply BSDS/DQS to: do LLMs add marginal value to an existing ML pipeline for drug discovery candidate selection? We evaluate 39 proposers -- 11 mechanistic variants, 14 zero-shot LLM configurations, and 14 few-shot LLM configurations -- using SMILES representations on MoleculeNet HIV (41,127 compounds, 3.5% active, 1,000 bootstrap replicates) under both random and scaffold splits. Three findings emerge. First, the simple RF-based Greedy-ML proposer achieves the best DQS (-0.046), outperforming all MLP variants and LLM configurations. Second, no LLM surpasses the Greedy-ML baseline under zero-shot or few-shot evaluation on HIV or Tox21, establishing that LLMs provide no marginal value over an existing trained classifier. Third, the proposer hierarchy generalizes across five MoleculeNet benchmarks spanning 0.18%-46.2% prevalence, a non-drug AV safety domain, and a 9x7 grid of penalty parameters (tau >= 0.636, mean tau = 0.863). The framework applies to any setting where candidates are selected under budget constraints and asymmetric error costs.

URL PDF HTML ☆

赞 0 踩 0

2603.12297 2026-03-16 cs.IT math.IT math.PR math.ST stat.TH

Complex-Valued Probability Measures and Their Applications in Information Theory

Siang Cheng, Hejun Xu, Tianxiao Pang

Comments 23 pages, 3 tables

2603.12288 2026-03-16 cs.LG cs.AI stat.ML

From Garbage to Gold: A Data-Architectural Theory of Predictive Robustness

Terrence J. Lee-St. John, Jordan L. Lawson, Bartlomiej Piechowski-Jozwiak

Comments 120 pages, 12 figures, 3 tables. Simulation code and documentation available at: https://github.com/tjleestjohn/from-garbage-to-gold

详情

英文摘要

Tabular machine learning presents a paradox: modern models achieve state-of-the-art performance using high-dimensional (high-D), collinear, error-prone data, defying the "Garbage In, Garbage Out" mantra. To help resolve this, we synthesize principles from Information Theory, Latent Factor Models, and Psychometrics, clarifying that predictive robustness arises not solely from data cleanliness, but from the synergy between data architecture and model capacity. Partitioning predictor-space "noise" into "Predictor Error" and "Structural Uncertainty" (informational deficits from stochastic generative mappings), we prove that leveraging high-D sets of error-prone predictors asymptotically overcomes both types of noise, whereas cleaning a low-D set is fundamentally bounded by Structural Uncertainty. We demonstrate why "Informative Collinearity" (dependencies from shared latent causes) enhances reliability and convergence efficiency, and explain why increased dimensionality reduces the latent inference burden, enabling feasibility with finite samples. To address practical constraints, we propose "Proactive Data-Centric AI" to identify predictors that enable robustness efficiently. We also derive boundaries for Systematic Error Regimes and show why models that absorb "rogue" dependencies can mitigate assumption violations. Linking latent architecture to Benign Overfitting, we offer a first step towards a unified view of robustness to Outcome Error and predictor-space noise, while also delineating when traditional DCAI's focus on label cleaning remains powerful. By redefining data quality from item-level perfection to portfolio-level architecture, we provide a theoretical rationale for "Local Factories" -- learning from live, uncurated enterprise "data swamps" -- supporting a deployment paradigm shift from "Model Transfer" to "Methodology Transfer'' to overcome static generalizability limitations.

URL PDF HTML ☆

赞 0 踩 0

2603.12284 2026-03-16 stat.ME stat.ML

Bayesian Conservative Policy Optimization (BCPO): A Novel Uncertainty-Calibrated Offline Reinforcement Learning with Credible Lower Bounds

Debashis Chatterjee

2603.11829 2026-03-16 stat.ME

Robust Sequential Hypothesis Testing with Generalized Estimating Equations for Incomplete Clustered and Longitudinal Data

Nathan T. Provost, Abdus S. Wahed

Comments VERSION 2: First version accidentally used older abbreviated title, this has been corrected. 24 pages; 1 figure

2602.21130 2026-03-16 stat.ML cs.LG

An Enhanced Projection Pursuit Tree Classifier with Visual Methods for Assessing Algorithmic Improvements

Natalia da Silva, Dianne Cook, Eun-Kyung Lee

2601.02610 2026-03-16 stat.ME stat.ML

Conformal novelty detection with false discovery rate control at the boundary

Zijun Gao, Etienne Roquain, Daniel Xiang

Comments 43 pages, 17 figures, 1 table

2512.13622 2026-03-16 stat.ME stat.AP

Empirical Bayes learning from selectively reported confidence intervals

Hunter Chen, Junming Guan, Erik van Zwet, Nikolaos Ignatiadis

2510.09816 2026-03-16 q-bio.NC math.OC physics.bio-ph physics.data-an stat.ML

A mathematical theory for understanding when abstract representations emerge in neural networks

Bin Wang, W. Jeffrey Johnston, Stefano Fusi

Comments 19 pages, 8 figures

2510.09598 2026-03-16 stat.ME

Defensive Model Expansion for Robust Bayesian Inference

Antonio R. Linero

2510.05645 2026-03-16 math.ST stat.TH

Weak convergence of Bayes estimators under general loss functions

Robin Requadt, Housen Li, Axel Munk

2508.21742 2026-03-16 cs.AI stat.ME

Orientability of Causal Relations in Time Series using Summary Causal Graphs and Faithful Distributions

Timothée Loranchet, Charles K. Assaad

Comments Accepted to AISTATS 2026

2505.10628 2026-03-16 stat.ML cs.LG math.PR

Minimax learning rates for estimating binary classifiers under margin conditions

Jonathan García, Philipp Petersen

2411.12367 2026-03-16 stat.ME stat.AP

Left-truncated discrete lifespans: The AFiD enterprise panel

Eric Scholz, Rafael Weißbach

Comments 42 pages, 2 figures, 4 tables

2411.07993 2026-03-16 stat.AP

Markov Processes for Enhanced Deepfake Generation and Detection

Michael A. Kouritzin, Ian Zhang, Jyoti Bhadana, Seoyeon Park

2410.18613 2026-03-16 cs.LG cs.CV stat.ML

Rethinking Attention: Polynomial Alternatives to Softmax in Transformers

Hemanth Saratchandran, Jianqiao Zheng, Yiping Ji, Wenbo Zhang, Simon Lucey

2410.03191 2026-03-16 stat.ML cs.LG

Nested Deep Learning Model Towards A Foundation Model for Brain Signal Data

Fangyi Wei, Jiajie Mo, Kai Zhang, Haipeng Shen, Srikantan Nagarajan, Fei Jiang

Comments 56 pages; paper structure updated

2407.15693 2026-03-16 math.AP cs.LG math.FA math.ST stat.TH

Fisher-Rao Gradient Flow: Geodesic Convexity and Functional Inequalities

José A. Carrillo, Yifan Chen, Daniel Zhengyu Huang, Jiaoyang Huang, Dongyi Wei

Comments 38 pages

2401.02739 2026-03-16 cs.LG q-bio.QM stat.ML

Denoising Diffusion Variational Inference: Diffusion Models as Expressive Variational Posteriors

Wasu Top Piriyakulkij, Yingheng Wang, Volodymyr Kuleshov

Comments published at AAAI 2025; the first two authors contribute equally to this work; code available at https://github.com/topwasu/DDVI

2311.09838 2026-03-16 stat.ME q-bio.GN q-bio.PE stat.AP stat.CO

Bayesian Inference of Reproduction Number from Epidemiological and Genetic Data Using Particle MCMC

Alicia Gill, Jere Koskela, Xavier Didelot, Richard G. Everitt

Comments 24 pages, 11 figures (30 pages, 19 figures including appendices)

2303.07287 2026-03-16 stat.ML cs.LG econ.EM

Tight Non-asymptotic Inference via Sub-Gaussian Intrinsic Moment Norm

Huiming Zhang, Haoyu Wei, Guang Cheng

Comments This manuscript has been withdrawn by the authors as it is not yet ready for public release. Further improvements and revisions are required before a final version can be considered for distribution