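arXivDaily: a daily academic digest synchronized with the full arXiv feed, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems, and other areas.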

2604.11762 2026-04-14 cs.CV cs.LG eess.SP physics.med-ph stat.ML

MosaicMRI: A Diverse Dataset and Benchmark for Raw Musculoskeletal MRI

Paula Arguello, Berk Tinaz, Mohammad Shahab Sepehri, Maryam Soltanolkotabi, Mahdi Soltanolkotabi

Comments 15 pages, 6 figures, preliminary version

Abstract

Deep learning underpins a wide range of applications in MRI, including reconstruction, artifact removal, and segmentation. However, progress has been driven largely by public datasets focused on brain and knee imaging, shaping how models are trained and evaluated. As a result, careful studies of the reliability of these models across diverse anatomical settings remain limited. In this work, we introduce MosaicMRI, a large and diverse collection of fully sampled raw musculoskeletal (MSK) MR measurements designed for training and evaluating machine-learning-based methods. MosaicMRI is the largest open-source raw MSK MRI dataset to date, comprising 2,671 volumes and 80,156 slices. The dataset offers substantial diversity in volume orientation (e.g., axial, sagittal), imaging contrasts (e.g., PD, T1, T2), anatomies (e.g., spine, knee, hip, ankle, and others), and numbers of acquisition coils. Using VarNet as a baseline for the accelerated reconstruction task, we perform a comprehensive set of experiments to study scaling behavior with respect to both model capacity and dataset size. Interestingly, models trained on the combined anatomies significantly outperform anatomy-specific models in low-sample regimes, highlighting the benefits of anatomical diversity and the presence of exploitable cross-anatomical correlations. We further evaluate robustness and cross-anatomy generalization by training models on one anatomy (e.g., spine) and testing them on another (e.g., knee). Notably, we identify groups of body parts (e.g., foot and elbow) that generalize well with each other, and highlight that performance under domain shifts depends on training set size, anatomy, and protocol-specific factors.

2604.11746 2026-04-14 stat.ME math.ST stat.ML stat.TH

Inferring Change Points in Regression via Sample Weighting

Gabriel Arpino, Ramji Venkataramanan

Comments 70 pages, 11 figures

2604.11731 2026-04-14 stat.ME stat.AP stat.ML

Nested Atoms Model with Application to Clustering Big Population-Scale Single-Cell Data

Arhit Chakrabarti, Yang Ni, Yuchao Jiang, Bani K. Mallick

2604.11729 2026-04-14 math.PR cs.DS cs.LG math.ST stat.TH

Universality of first-order methods on random and deterministic matrices

Nicola Gorini, Chris Jones, Dmitriy Kunisky, Lucas Pesenti

2604.11673 2026-04-14 stat.ME cs.AI math.ST stat.CO stat.TH

NetworkNet: A Deep Neural Network Approach for Random Networks with Sparse Nodal Attributes and Complex Nodal Heterogeneity

Zhaoyu Xing, Xiufan Yu

2604.11591 2026-04-14 stat.ME

A novel reference prior for Gaussian hierarchical models with intrinsic conditional autoregressive random effects

Marco A. R. Ferreira

2604.11578 2026-04-14 quant-ph cs.AI cs.LG stat.ML

Minimizing classical resources in variational measurement-based quantum computation for generative modeling

Arunava Majumder, Hendrik Poulsen Nautrup, Hans J. Briegel

Comments 14 pages

2604.11550 2026-04-14 stat.ME

Principled Inference in Dense High-Dimensional Linear Models via Local Conditional Sparsity

Wenjun Xiong, Yan Chen, Mingya Long, Qizhai Li

2604.11507 2026-04-14 math.OC cs.AI cs.LG cs.SY eess.SY stat.ML

Deep Learning for Sequential Decision Making under Uncertainty: Foundations, Frameworks, and Frontiers

I. Esra Buyuktahtakin

2604.11491 2026-04-14 stat.ML cs.AI cs.LG math.ST stat.ME stat.TH

ADD for Multi-Bit Image Watermarking

An Luo, Jie Ding

2604.11458 2026-04-14 stat.ME stat.CO

An Empirical Comparison of Methods for Quantifying the Similarity of Categorical Datasets

Marieke Stolte, Jörg Rahnenführer, Andrea Bommert

2604.11393 2026-04-14 econ.EM math.ST stat.TH

Average Marginal Effects in One-Step Partially Linear Instrumental Regressions

Lucas Girard, Elia Lapenta

Comments 67 pages (body: pages 1-26; appendices: pages 26-67); 8 figures; 5 tables

2604.11363 2026-04-14 math.ST stat.TH

Subordinated Wright-Fisher Priors

Nathan A. Judd, Dario Spanò

2604.11343 2026-04-14 cs.DL stat.ME

Which Discoveries Are Paradigm Shifting?

Sajad Ashouri, Arash Hajikhani, Ari Hyytinen, Petri Rouvinen, Arho Suominen

2604.11335 2026-04-14 math.ST stat.ME stat.TH

Trends in tail dependence of heteroscedastic extremes

John H. J. Einmahl, Chen Zhou

2604.11311 2026-04-14 cs.LG stat.ML

Learning Discrete Diffusion of Graphs via Free-Energy Gradient Flows

Dario Rancati, Jan Maas, Francesco Locatello

2604.11300 2026-04-14 math.ST stat.ME stat.TH

Detection and Mode-Identification of Multiple Change Points in Tensor Factor Models

Yuqi Zhang, Zetai Cen, Haeran Cho

Comments 165 pages

2604.11253 2026-04-14 stat.ML cs.LG

Trustworthy Feature Importance Avoids Unrestricted Permutations

Emanuele Borgonovo, Francesco Cappelli, Xuefei Lu, Elmar Plischke, Cynthia Rudin

2604.11239 2026-04-14 stat.ME

Optimized questionnaire item selection for tracking the progression of motor symptoms in Parkinson's disease

Karl Sigfrid, Ellinor Fackle-Fornius, Frank Miller

2604.11223 2026-04-14 stat.ML cs.AI cs.LG

Regional Explanations: Bridging Local and Global Variable Importance

Salim I. Amoukou, Nicolas J-B. Brunel

Comments Accepted at the 39th Conference on Neural Information Processing Systems (NeurIPS 2025)

2604.11200 2026-04-14 cs.LG cs.AI stat.ML

ShapShift: Explaining Model Prediction Shifts with Subgroup Conditional Shapley Values

Tom Bewley, Salim I. Amoukou, Emanuele Albini, Saumitra Mishra, Manuela Veloso

2604.11199 2026-04-14 stat.CO math.PR

Extended One-Liners for the Beta, Gamma, and Dirichlet Distributions with Shape Parameters Below One

Dylan Greaves

Comments 8 pages, 1 figure, 1 table

2604.11168 2026-04-14 stat.ME

Prediction decomposition for causal analysis

Ofir Reich

Comments 22 pages, 7 figures

2604.11151 2026-04-14 cs.LG stat.ML

Gradient-Variation Regret Bounds for Unconstrained Online Learning

Yuheng Zhao, Andrew Jacobsen, Nicolò Cesa-Bianchi, Peng Zhao

2604.11127 2026-04-14 math.ST stat.TH

Empirical interpretation of the Pitman efficiency

Tadeusz Inglot

2604.11118 2026-04-14 cs.LG stat.ML

Distributionally Robust K-Means Clustering

Vikrant Malik, Taylan Kargin, Babak Hassibi

2604.10310 2026-04-14 math.PR math.ST stat.OT stat.TH

Weak convergence from projected laws on a positive-measure set of directions

Alejandro Cholaquidis, Manuel Hernandez Banadik

2604.03775 2026-04-14 cond-mat.stat-mech stat.ML

Cross-Spectral Witness for Hidden Nonequilibrium Beyond the Scalar Ceiling

Yuda Bi, Vince D Calhoun

2604.02150 2026-04-14 math.NA cs.NA math.PR math.ST stat.ML stat.TH

Samplet limits and multiwavelets

Gianluca Giacchi, Michael Multerer, Jacopo Quizi

2603.22962 2026-04-14 cs.LG stat.ML

Asymptotic Learning Curves for Diffusion Models with Random Features Score and Manifold Data

Anand Jerry George, Nicolas Macris

Comments The proof of Lemma 1 in Appendix C is incorrect

2603.15928 2026-04-14 stat.AP

Prior-Data Fitted Networks for Causal Inference: a Simulation Study with Real-World Scenarios

Francisco Mourao, David Hajage, Daria Bystrova, Bertrand Bouvarel, Nathanaël Lapidus, Fabrice Carrat, Benjamin Glemain

Comments 26 pages, 4 tables, 3 figures

2603.14305 2026-04-14 astro-ph.HE astro-ph.GA cond-mat.stat-mech stat.AP

Reconnection-driven State Transitions in Flat Spectrum Radio Quasars

Agniva Roychowdhury

Comments 17 pages, 10 figures; accepted for publication in The Astrophysical Journal

2601.21860 2026-04-14 math.OC stat.ML

Pathwise Learning of Stochastic Dynamical Systems with Partial Observations

Nicole Tianjiao Yang

2512.19691 2026-04-14 cs.AI stat.AP

Scalable Stewardship of an LLM-Assisted Clinical Benchmark with Physician Oversight

Junze Ye, Daniel Tawfik, Alex J. Goodell, Nikhil V. Kotha, Mark K. Buyyounouski, Mohsen Bayati

Comments Github codebase: https://github.com/junzeye/validate-medcalc-labels

2511.15068 2026-04-14 stat.ME

Classification Trees with Valid Inference via the Exponential Mechanism

Soham Bakshi, Snigdha Panigrahi

2510.04358 2026-04-14 physics.ao-ph stat.AP stat.ML

Score-based generative emulation of impact-relevant Earth system model outputs

Shahine Bouabid, Andre Nogueira Souza, Raffaele Ferrari

DOI: 10.1029/2025MS005558

Abstract

Policy targets evolve faster than the Coupled Model Intercomparison Project cycles, complicating adaptation and mitigation planning that must often contend with outdated projections. Climate model output emulators address this gap by offering inexpensive surrogates that can rapidly explore alternative futures while staying close to Earth System Model (ESM) behavior. The focus is on emulators designed to provide inputs to impact models. Using monthly ESM fields of near-surface temperature, precipitation, relative humidity, and wind speed, it is shown that deep generative models have the potential to model the joint distribution of variables relevant for impacts. The specific model proposed uses score-based diffusion on a spherical mesh and runs on a single mid-range graphics processing unit. A thorough suite of diagnostics is introduced to compare emulator outputs with their parent ESMs, including their probability densities, cross-variable correlations, time of emergence, and tail behavior. The emulator performance is evaluated across three distinct ESMs in both pre-industrial and forced regimes. The results show that the emulator produces distributions that closely match the ESM outputs and captures key forced responses. They also reveal important failure cases, notably for variables with a strong regime shift in the seasonal cycle. Although not a perfect match to the ESM, the inaccuracies of the emulator are small relative to the magnitude of internal variability in ESM projections. This suggests that the generative emulators can be useful in supporting impact assessment. Priorities for future development toward daily resolution, finer spatial scales, and bias-aware training are discussed. Code is made available at https://github.com/shahineb/climemu.

2509.22736 2026-04-14 eess.IV cs.AI cs.CV cs.LG physics.med-ph stat.ML

PnP-CM: Consistency Models as Plug-and-Play Priors for Inverse Problems

Merve Gülle, Junno Yun, Yaşar Utku Alçalar, Mehmet Akçakaya

Comments IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026

2509.19889 2026-04-14 stat.ME

Improving Disease Risk Estimation in Small Areas by Accounting for Spatiotemporal Local Discontinuities

G. Santafé, A. Adin, M. D. Ugarte

2509.15359 2026-04-14 stat.ME

Bayesian Mixture Models for Heterogeneous Extremes

Viviana Carcaiso, Miguel de Carvalho, Ilaria Prosdocimi, Isadora Antoniano-Villalobos

Comments Paper updated based on reviewers' comments

2506.04082 2026-04-14 stat.CO stat.AP stat.ME

Adaptive tuning of Hamiltonian Monte Carlo methods

Elena Akhmatskaya, Lorenzo Nagar, Jose Antonio Carrillo, Leonardo Gavira Balmacz, Hristo Inouzhe, Martín Parga Pazos, María Xosé Rodríguez Álvarez

Abstract

With the recently increased interest in probabilistic models, the efficiency of an underlying sampler becomes a crucial consideration. Hamiltonian Monte Carlo (HMC) is one popular option for models of this kind. Performance of the method, however, strongly relies on a choice of parameters associated with an integration for Hamiltonian equations. To date, such a choice remains mainly heuristic or introduces time complexity. We propose a novel computationally inexpensive and flexible approach (we call it Adaptive Tuning or ATune) that, by combining a theoretical analysis of the multivariate Gaussian model with simulation data generated during a burn-in stage of an HMC simulation, detects a system-specific splitting integrator with a set of reliable sampler hyperparameters, including their credible randomization intervals, to be readily used in a production simulation. The method automatically eliminates those values of simulation parameters which could cause undesired extreme scenarios, such as resonance artifacts, low accuracy or poor sampling. The new approach is implemented in the in-house software package HaiCS, with no computational overhead introduced in a production simulation, and can be easily incorporated in any package for Bayesian inference with HMC. The tests on popular statistical models reveal the superiority of adaptively tuned standard and generalized HMC methods in terms of stability, performance and accuracy over conventional HMC tuned heuristically and coupled with the well-established integrators. We also claim that the generalized HMC is preferable for achieving high sampling performance. The efficiency of the new methodology is assessed in comparison with state-of-the-art samplers, e.g. NUTS, in real-world applications, such as endocrine therapy resistance in cancer, modeling of cell-cell adhesion dynamics and influenza A epidemic outbreak.
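To make the tuning problem in this abstract concrete, here is a minimal sketch of standard HMC with a leapfrog integrator on a one-dimensional standard Gaussian target. It is not the ATune method and not the HaiCS package. The function names `leapfrog` and `hmc_sample`, the target, and the values of `step_size` and `n_steps` are illustrative assumptions chosen for this sketch. The two integrator parameters are exactly the kind of quantities that the abstract says are usually chosen heuristically.

```python
import math
import random


def leapfrog(q, p, grad_u, step_size, n_steps):
    """Integrate Hamiltonian dynamics with the leapfrog scheme (unit mass)."""
    p = p - 0.5 * step_size * grad_u(q)        # initial half step for momentum
    for i in range(n_steps):
        q = q + step_size * p                  # full step for position
        if i < n_steps - 1:
            p = p - step_size * grad_u(q)      # full step for momentum
    p = p - 0.5 * step_size * grad_u(q)        # final half step for momentum
    return q, p


def hmc_sample(u, grad_u, q0, step_size, n_steps, n_samples, seed=0):
    """Draw samples from exp(-u(q)) and return them with the acceptance rate."""
    rng = random.Random(seed)
    q = q0
    samples = []
    accepted = 0
    for _ in range(n_samples):
        p = rng.gauss(0.0, 1.0)                # resample momentum
        h_old = u(q) + 0.5 * p * p
        q_new, p_new = leapfrog(q, p, grad_u, step_size, n_steps)
        h_new = u(q_new) + 0.5 * p_new * p_new
        # Metropolis correction for the integrator's energy error
        if rng.random() < math.exp(min(0.0, h_old - h_new)):
            q = q_new
            accepted += 1
        samples.append(q)
    return samples, accepted / n_samples


if __name__ == "__main__":
    # Hypothetical target: standard normal, u(q) = q^2 / 2
    draws, rate = hmc_sample(lambda q: 0.5 * q * q, lambda q: q,
                             q0=0.0, step_size=0.2, n_steps=8, n_samples=5000)
    print("acceptance rate:", rate)
```

On this Gaussian target, a trajectory length (`step_size * n_steps`) near a multiple of 2π returns the chain close to its starting point. That is a simple instance of the resonance behaviour that a tuning procedure has to avoid.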

2503.22924 2026-04-14 stat.ME

Asymptotic Standard Errors for Reliability Coefficients in Item Response Theory

Youjin Sung, Yang Liu

2411.05869 2026-04-14 stat.ML cs.LG stat.AP stat.CO stat.ME

Compactly-supported nonstationary kernels for computing exact Gaussian processes on big data

Mark D. Risser, Marcus M. Noack, Hengrui Luo, Ronald Pandolfi

2410.12618 2026-04-14 stat.AP

Spatio-Temporal Analysis of Public Transportation Undercrowding: Leveraging APC Data for a Comprehensive Evaluation of Usage Rates

Arianna Burzacchi, Valeria Maria Urbano, Marika Arena, Giovanni Azzone, Piercesare Secchi, Simone Vantini

Comments Pre-print version

2409.06565 2026-04-14 math.PR math.FA math.ST q-bio.QM stat.ME stat.TH

Statistical inference for a multiscale stochastic model of enzyme kinetics via propagation of chaos

Arnab Ganguly, Wasiur R. KhudaBukhsh

Comments Removed functional central limit theorem and added new results to the parameter inference section

2409.06406 2026-04-14 stat.AP

Monitoring road infrastructures from satellite images in Greater Maputo

Arianna Burzacchi, Matteo Landrò, Simone Vantini

Comments Pre-print version of the published manuscript available at Statistical Methods Applications (2024)

2408.16004 2026-04-14 stat.AP

Granger causal inference for climate change attribution

Mark D. Risser, Mohammed Ombadi, Michael F. Wehner

DOI: 10.1088/2752-5295/add046

Abstract

Climate change detection and attribution (D&A) is concerned with determining the extent to which anthropogenic activities have influenced specific aspects of the global climate system. D&A fits within the broader field of causal inference, the collection of statistical methods that identify cause and effect relationships. There are a wide variety of methods for making attribution statements, each of which requires different types of input data and each of which is conditional to varying extents. Some methods are based on Pearl causality (experimental interference) while others leverage Granger (predictive) causality, and the causal framing provides important context for how the resulting attribution conclusion should be interpreted. However, while Granger-causal attribution analyses have become more common, there is no clear statement of their strengths and weaknesses and no clear consensus on where and when Granger-causal perspectives are appropriate. In this prospective paper, we provide a formal definition for Granger-based approaches to trend and event attribution and a clear comparison with more traditional methods for assessing the human influence on extreme weather and climate events. Broadly speaking, Granger-causal attribution statements can be constructed quickly from observations and do not require computationally intensive dynamical experiments. These analyses also enable rapid attribution, which is useful in the aftermath of a severe weather event, and provide multiple lines of evidence for anthropogenic climate change when paired with Pearl-causal attribution. Confidence in attribution statements is increased when different methodologies arrive at similar conclusions. Moving forward, we encourage the D&A community to embrace hybrid approaches to climate change attribution that leverage the strengths of both Granger and Pearl causality.
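As a rough illustration of what "Granger (predictive) causality" means in practice, the sketch below runs a textbook one-lag Granger test on simulated series. It is not the attribution framework defined in the paper. The helper names (`solve_linear`, `residual_sum_of_squares`, `granger_f`, `simulate`), the single-lag model, and the simulated coefficients are all assumptions made for this example. The test asks whether adding the lagged candidate cause to an autoregression of the effect reduces the residual sum of squares.

```python
import random


def solve_linear(a, b):
    """Solve a small dense system by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def residual_sum_of_squares(rows, targets):
    """Fit ordinary least squares via the normal equations and return the RSS."""
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * t for r, t in zip(rows, targets)) for i in range(k)]
    beta = solve_linear(xtx, xty)
    return sum((t - sum(b * v for b, v in zip(beta, r))) ** 2
               for r, t in zip(rows, targets))


def granger_f(cause, effect):
    """F statistic for 'cause Granger-causes effect' using a single lag."""
    targets = effect[1:]
    restricted = [[1.0, effect[t - 1]] for t in range(1, len(effect))]
    full = [[1.0, effect[t - 1], cause[t - 1]] for t in range(1, len(effect))]
    rss_r = residual_sum_of_squares(restricted, targets)
    rss_f = residual_sum_of_squares(full, targets)
    dof = len(targets) - 3                     # observations minus full-model parameters
    return (rss_r - rss_f) / (rss_f / dof)


def simulate(n=500, seed=1):
    """Hypothetical data: x is white noise and drives y with a one-step delay."""
    rng = random.Random(seed)
    x = [rng.gauss(0.0, 1.0) for _ in range(n)]
    y = [0.0]
    for t in range(1, n):
        y.append(0.5 * y[t - 1] + 0.8 * x[t - 1] + rng.gauss(0.0, 1.0))
    return x, y


if __name__ == "__main__":
    x, y = simulate()
    print("F (x -> y):", granger_f(x, y))
    print("F (y -> x):", granger_f(y, x))
```

A large F statistic in one direction only says that the lagged series improves prediction. As the abstract stresses, this is a predictive notion of causality, and conclusions drawn from it should be interpreted accordingly.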

2408.13751 2026-04-14 stat.ML cs.LG math.OC

Improved identification of breakpoints in piecewise regression and its applications

Taehyeong Kim, Hyungu Lee, Myungjin Kim, Hayoung Choi

Comments 32 pages, 6 figures

2407.19191 2026-04-14 math.ST stat.ME stat.TH

Statistical inference for subgraph counts and clustering coefficient using network sampling in a sparse Stochastic Block Model framework

Anirban Mandal, Arindam Chatterjee

Comments 120 pages, 3 figures. Major revisions have been made, and new results have been added

2404.18905 2026-04-14 stat.ME cs.LG stat.ML

Detecting critical treatment effect bias in small subgroups

Piersilvio De Bartolomeis, Javier Abad, Konstantin Donhauser, Fanny Yang

Comments Accepted for presentation at the Conference on Uncertainty in Artificial Intelligence (UAI) 2024

2402.19036 2026-04-14 math.ST stat.TH

Empirical Bayes in Bayesian learning: understanding a common practice

Stefano Rizzelli, Judith Rousseau, Sonia Petrone

2311.14867 2026-04-14 stat.ME

Disaggregating Time-Series with Many Indicators: An Overview of the DisaggregateTS Package

Luke Mosley, Kaveh Salehzadeh Nobari, Giuseppe Brandi, Alex Gibberd

2305.16272 2026-04-14 cs.LG cs.GT stat.ML

Incentivizing Honesty among Competitors in Collaborative Learning and Optimization

Florian E. Dorner, Nikola Konstantinov, Georgi Pashaliev, Martin Vechev

Comments Updated experimental results after fixing a mistake in the code. Previous version published in NeurIPS 2023; 37 pages, 5 figures

2010.15950 2026-04-14 math.ST stat.TH

All Block Maxima method for estimating the extreme value index

Jochem Oorschot, Chen Zhou

1812.00250 2026-04-14 stat.ME stat.AP

A Graphical Framework for Testing Hierarchically Structured Hypothesis Families

Zhiying Qiu, Li Yu, Wenge Guo

Comments 37 pages, 8 figures, 2 tables

2604.10986 2026-04-14 stat.ME stat.CO

Optimal multiple testing under family-wise error control: elementary symmetric polynomials and a scalable algorithm

Prasanjit Dubey, Xiaoming Huo

2604.10976 2026-04-14 stat.ML cs.LG stat.CO stat.ME

Neural Generalized Mixed-Effects Models

Yuli Slavutsky, Sebastian Salazar, David M. Blei

2604.10965 2026-04-14 stat.CO cs.LG stat.AP stat.ML

bioLeak: Leakage-Aware Modeling and Diagnostics for Machine Learning in R

Selçuk Korkmaz

Comments 35 pages, 4 figures

2604.10922 2026-04-14 cs.IT math.IT math.ST stat.TH

$\alpha$-Mutual Information for the Gaussian Noise Channel

Mohammad Milanian, Alex Dytso, Martina Cardone

2604.10899 2026-04-14 math.ST stat.TH

Characterisations of Kullback-Leibler approximation by finite Gaussian mixtures

Hien Duy Nguyen

2604.10863 2026-04-14 stat.ME stat.CO

Restricted Search Space Graph MCMC via Birth-Death Processes

Morris Greenberg, Kieran R Campbell, Radu Craiu

Comments 63 pages including 31 pages of supplement, 10 figures and 27 supplemental figures; Code to run the MCMC algorithm and reproduce simulations is available at https://github.com/morrisgreenberg/RestrictedSearchMCMC

2604.10857 2026-04-14 cs.LG cs.AI cs.DS math.ST stat.ML stat.TH

Query Lower Bounds for Diffusion Sampling

Zhiyang Xun, Eric Price

2604.10854 2026-04-14 stat.AP stat.ML

Uncertainty-Aware Sparse Identification of Dynamical Systems via Bayesian Model Averaging

Shuhei Kashiwamura, Yusuke Kato, Hiroshi Kori, Masato Okada

2604.10824 2026-04-14 stat.AP

Causal Fairness Analysis of ADHD Status and High School STEM Outcomes

Shuhan Ai

2604.10821 2026-04-14 cs.LG stat.CO stat.ML

Slithering Through Gaps: Capturing Discrete Isolated Modes via Logistic Bridging

Pinaki Mohanty, Ruqi Zhang

2604.10820 2026-04-14 math.PR econ.EM math.CO math.ST stat.TH

A Strict Gap Between Relaxed and Partition-Constrained Spectral Compression in a Six-State Lumpable Markov Chain

Oleg Kiriukhin

2604.10814 2026-04-14 cs.LG math.ST stat.TH

Online Covariance Estimation in Averaged SGD: Improved Batch-Mean Rates and Minimax Optimality via Trajectory Regression

Yijin Ni, Xiaoming Huo

2604.10808 2026-04-14 stat.AP stat.ME

Modeling Tripartite Hyperevents in Scientific Collaboration Networks

Amin Gino Fabbrucci Barbagli, Jürgen Lerner, Viviana Amati, Domenico De Stefano

2604.10792 2026-04-14 math.PR econ.EM math.CT math.ST stat.TH

Variable-Length Markov Chains on Finite Quivers: Boundary-Window Identifiability, Exact Depth, and Local Rank Comparison

Oleg Kiriukhin

Abstract

Variable-length Markov chains on finite quivers provide a natural framework for context-dependent stochastic growth under incidence constraints. I study quiver-valued variable-length Markov chains observed through finite boundary windows and develop a first-order theory of visible-depth identifiability via stationary visible one-step transition laws and their restricted differentials on prescribed tangent blocks. For visible depth $m$, the main object is the stationary one-step informative map $q_{\mathcal{Q}}^{(m)}$. In the edge-homogeneous regime, once the local visible support is fixed and the representation hypothesis holds, all admissible visible depths encode the same edge-level extension law and hence have the same first-order rank. In the exact-depth regime of context length $r$, the depth-$r$ boundary process is the canonical finite-state Markov chain, smaller visible windows are deterministic truncations, and every coarser informative map factors $C^1$-smoothly through the depth-$r$ informative map on the relevant affine transition-array neighborhood. Hence rank cannot increase beyond depth $r$. After quotienting a tangent block by directions already invisible at depth $r$, I characterize strict coarse-depth loss exactly by coarse rank deficiency, equivalently by strict rank drop from depth $r$ to depth $m$ on the original block. I also give subspace-based and global selected-coordinate criteria, a global one-coordinate branching criterion, and an explicit depth-two example. Under full fine-depth rank and strict coordinate-rank loss at every smaller depth, a global coordinate-rank theorem yields $m_*(T,\theta_0)=r$. Reduced local coordinates remove stochastic redundancies, first-order criteria are invariant under $C^1$ reparameterization, and the statistical and LAN consequences remain conditional on additional estimation and likelihood-level hypotheses.

2604.10752 2026-04-14 cs.IT econ.EM math.IT math.PR math.ST stat.TH

Entropy-Rate Selection for Partially Observed Processes

Oleg Kiriukhin

2604.10727 2026-04-14 stat.ML cs.AI cs.LG math.PR math.ST stat.TH

Tail-Aware Information-Theoretic Generalization for RLHF and SGLD

Huiming Zhang, Binghan Li, Wan Tian, Qiang Sun

Comments 65 pages, 9 figures

2604.10710 2026-04-14 stat.ME

Causal mediation in cluster-randomized trials with multiple mediators: spillover-aware decomposition, identification, and semiparametric efficient inference

Jiaqi Tong, Chao Cheng, Fan Li

2604.10706 2026-04-14 stat.ME

Multiple Imputation Diagnostics when using Electronic Health Record Data in Observational Studies: A Case Study

Nrupen A. Bhavsar, Lingyu Zhou, Samuel I. Berchuck, Matthew L. Maciejewski, Jerome P. Reiter

Comments 22 pages with title page and references, 4 figures

Abstract

Missing values in electronic health record (EHR) data pose a significant challenge for epidemiologic research. Traditional methods for handling missing data, like mean imputation, may introduce bias. Multiple imputation (MI) offers a principled solution by generating multiple plausible values based on statistical models. However, MI requires careful model specification and validation of imputations, ideally using multivariate graphical tools. We demonstrate the application of such tools to validate MI in a study of chronic kidney disease, assessing cardiovascular outcomes linked to neighborhood socioeconomic status (nSES). This study used data from Duke University Health System (DUHS) and Lincoln Community Health Center (LCHC). Eligible patients had at least one encounter within DUHS or LCHC and had two estimated glomerular filtration rate (eGFR) values <60 mL/min per 1.73 m² more than 90 days apart between January 1, 2007 and July 1, 2008. Socioeconomic status was assessed using the Agency for Healthcare Research and Quality (AHRQ) index based on census data. The main outcome was a cardiovascular disease-related hospitalization. Participants were mostly older (mean age 73 years), female (64%), and Black (43%). Participants living in lower nSES neighborhoods had higher mean systolic blood pressure (SBP: 140 mmHg) and hemoglobin A1c (HbA1c) levels (7.1%) as compared to participants living in higher nSES neighborhoods. A machine-learning-based approach, Classification and Regression Trees (CART), was the preferred approach to impute missing data. The distributions of imputed values of SBP and HbA1c were impacted by whether marginal or conditional values of SBP and HbA1c were imputed. The choice of MI had minimal impact on inference and prediction. Future research may want to extend our results and consider how results may differ when using EHR data from multiple health systems.
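For context on how the "multiple plausible values" produced by MI typically feed into a single inference, the sketch below applies Rubin's rules to pool a point estimate across imputed datasets. The abstract does not describe the study's pooling step, so this is the standard textbook procedure and not a reconstruction of the authors' analysis. The function name `pool_rubin` and the numbers in the usage example are hypothetical.

```python
from statistics import mean, variance


def pool_rubin(estimates, variances):
    """Pool per-imputation estimates and their variances with Rubin's rules.

    estimates: point estimate from each of the m imputed datasets
    variances: squared standard error of each of those estimates
    Returns the pooled estimate and its total variance.
    """
    m = len(estimates)
    pooled = mean(estimates)                   # average of the m estimates
    within = mean(variances)                   # average within-imputation variance
    between = variance(estimates)              # between-imputation variance (divisor m - 1)
    total = within + (1.0 + 1.0 / m) * between
    return pooled, total


if __name__ == "__main__":
    # Hypothetical results from m = 3 imputed datasets
    est, var = pool_rubin([1.0, 1.2, 0.8], [0.04, 0.05, 0.03])
    print("pooled estimate:", est, "total variance:", var)
```

The between-imputation term is what mean imputation ignores. Because it reflects how much the imputed values disagree across datasets, the choice of imputation model can affect the reported uncertainty.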

2604.10672 2026-04-14 stat.ML cs.LG

One-Step Score-Based Density Ratio Estimation

Wei Chen, Qibin Zhao, John Paisley, Junmei Yang, Delu Zeng

2604.10650 2026-04-14 stat.ML cs.LG

A Deep Generative Approach to Stratified Learning

Randy Martinez, Rong Tang, Lizhen Lin

Comments 79 pages, 5 figures

2604.10641 2026-04-14 cs.IT cs.IR math.IT math.PR stat.AP

On the Capacity of Distinguishable Synthetic Identity Generation under Face Verification

Behrooz Razeghi

2604.10618 2026-04-14 stat.AP

A comprehensive study on causal discovery between degradation paths

Shi-Shun Chen, Shuai Gao, Xiao-Yang Li, Enrico Zio

2604.10570 2026-04-14 econ.GN cs.CE q-fin.EC stat.AP

Unveiling contrasting impacts of heat mitigation and adaptation policies on U.S. internal migration

Chao Li, Xing Su, Chao Fan, Yang Li, Luping Li, Chunmo Zheng, Wenglong Chao, Leena Jarvi, Han Lin, Juan Tu

Comments 24 pages, 6 figures, 2 tables

2604.10555 2026-04-14 stat.OT

On Some Multivariate Extensions to Zenga Curve: Properties and Applications

Shifna P R, S. M. Sunoj

2604.10412 2026-04-14 stat.ML cs.LG stat.ME

Orthogonal machine learning for conditional odds and risk ratios

Jiacheng Ge, Iván Díaz

Abstract

Conditional effects are commonly used measures for understanding how treatment effects vary across different groups, and are often used to target treatments/interventions to groups who benefit most. In this work we review existing methods and propose novel ones, focusing on the odds ratio (OR) and the risk ratio (RR). While estimation of the conditional average treatment effect (ATE) has been widely studied, estimators for the OR and RR lag behind, and cutting-edge estimators such as those based on doubly robust transformations or orthogonal risk functions have not been generalized to these parameters. We propose such a generalization here, focusing on the DR-learner and the R-learner. We derive orthogonal risk functions for the OR and RR and show that the associated pseudo-outcomes satisfy second-order conditional-mean remainder properties analogous to the ATE case. We also evaluate estimators for the conditional ATE, OR, and RR in a comprehensive nonparametric Monte Carlo simulation study to compare them with common alternatives under hundreds of different data-generating distributions. Our numerical studies provide empirical guidance for choosing an estimator. For instance, they show that while parametric models are useful in very simple settings, the proposed nonparametric estimators significantly reduce bias and mean squared error in the more complex settings expected in the real world. We illustrate the methods in the analysis of physical activity and sleep trouble in U.S. adults using data from the National Health and Nutrition Examination Survey (NHANES). The results demonstrate that our estimators uncover substantial treatment effect heterogeneity that is obscured by traditional regression approaches and lead to improved treatment decision rules, highlighting the importance of data-adaptive methods for advancing precision health research.
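For orientation, the three conditional estimands named in the abstract, and the familiar doubly robust pseudo-outcome for the ATE case that it refers to, can be written as follows. The notation is an assumption of this note, not taken from the paper: binary treatment $A$, binary outcome $Y$, covariates $X$, outcome regressions $\mu_a(x) = E[Y \mid A = a, X = x]$, and propensity score $\pi(x) = P(A = 1 \mid X = x)$. The paper's pseudo-outcomes for the OR and RR are not reproduced here.

```latex
% Conditional estimands (identified under the usual no-unmeasured-confounding
% and positivity assumptions) and the standard DR-learner pseudo-outcome
% for the conditional ATE.
\begin{align*}
\mathrm{CATE}(x) &= \mu_1(x) - \mu_0(x), \\
\mathrm{RR}(x)   &= \frac{\mu_1(x)}{\mu_0(x)}, \\
\mathrm{OR}(x)   &= \frac{\mu_1(x) / \{1 - \mu_1(x)\}}{\mu_0(x) / \{1 - \mu_0(x)\}}, \\
\varphi(O)       &= \mu_1(X) - \mu_0(X)
                    + \frac{A \{Y - \mu_1(X)\}}{\pi(X)}
                    - \frac{(1 - A) \{Y - \mu_0(X)\}}{1 - \pi(X)}.
\end{align*}
```

Regressing $\varphi(O)$ on $X$ targets $\mathrm{CATE}(x)$. The ratio estimands are nonlinear in $\mu_0$ and $\mu_1$, which is one reason the ATE construction does not carry over to them directly.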

2604.10398 2026-04-14 stat.ME stat.ML

Estimating heterogeneous treatment effects with survival outcomes via a deep survival learner

Yuming Sun, Jian Kang, Yi Li

2604.10376 2026-04-14 math.ST stat.TH

Spectral analysis of multivariate stationary Hawkes processes

Yifu Tang, Conor Kresin, Boris Baeumer, Ting Wang

2604.10375 2026-04-14 q-fin.RM q-fin.PM stat.AP

On the Structure of Risk Contribution: A Leave-One-Out Decomposition into Inherent and Correlation Risk

Nolan Alexander, Frank Fabozzi

Comments Code: https://github.com/nolanalexander/inherent-correlation-decomposition

2604.08220 2026-04-14 stat.AP

WaST: a formalisation of the Wave model with associated statistical inference and applications

Grégoire Clarté

2604.05225 2026-04-14 stat.CO cs.LG stat.AP stat.ML

fastml: Guarded Resampling Workflows for Safer Automated Machine Learning in R

Selcuk Korkmaz, Dincer Goksuluk, Eda Karaismailoglu

Comments 36 pages, 2 figures

2604.05063 2026-04-14 math.ST stat.TH

Robust mean estimation under star-shaped constraints with heavy-tailed noise

Tuorui Peng, Akshay Prasadan, Matey Neykov

Comments 56 pages

2603.29575 2026-04-14 stat.ME

Transfer Learning for Moderate-Dimensional Ridge-Regularized Robust Linear Regression

Lingfeng Lyu, Xiao Guo, Zongqi Liu

2603.14356 2026-04-14 stat.AP

Prediction-based Inference in Electronic Health Record (EHR)-linked Biobanks with Clinically Informative Outcomes

Xingran Chen, Cheng-Han Yang, Zhenke Wu, Bhramar Mukherjee

2601.01471 2026-04-14 math.ST econ.EM stat.ME stat.ML stat.TH

Double Machine Learning of Continuous Treatment Effects with General Instrumental Variables

Shuyuan Chen, Peng Zhang, Yifan Cui

2512.20552 2026-04-14 cs.IT math.IT stat.ML

Information-theoretic signatures of causality in Bayesian networks and hypergraphs

Sung En Chiang, Zhaolu Liu, Robert L. Peach, Mauricio Barahona

Comments 21 pages, 3 figures

2511.03015 2026-04-14 cs.LG stat.ML

Discrete Bayesian Sample Inference for Graph Generation

Ole Petersen, Marcel Kollovieh, Marten Lienen, Stephan Günnemann

2510.07942 2026-04-14 math.PR math.ST stat.TH

Precise convergence rate of spectral radius of product of complex Ginibre

Yutao Ma, Xujia Meng

Comments This version makes substantial improvements over the previous one, including in the title, abstract, and content. We would therefore prefer to announce it as a completely new submission

2509.20587 2026-04-14 stat.ML cs.LG stat.ME

Unsupervised Domain Adaptation for Binary Classification with an Unobservable Source Subpopulation

Chao Ying, Jun Jin, Haotian Zhang, Qinglong Tian, Yanyuan Ma, Sharon Li, Jiwei Zhao

2509.10853 2026-04-14 stat.ML cs.LG

Variable Selection Using Relative Importance Rankings

Tien-En Chang, Argon Chen

Comments 35 pages, 9 figures

Journal ref: 10.1016/j.patcog.2026.113561

Abstract

Although conceptually related, variable selection and relative importance (RI) analysis have been treated quite differently in the literature. While RI is typically used for post-hoc model explanation, this paper explores its potential for variable or feature ranking and filter-based selection before model creation. Specifically, we anticipate strong performance from the RI measures because they incorporate both direct and combined effects of predictors, addressing a key limitation of marginal correlation, which ignores dependencies among predictors. We implement and evaluate the RI-based variable ranking and selection methods, including a newly proposed RI measure, CRI.Z, with improved computational efficiency relative to conventional RI measures. Through extensive simulations, we first demonstrate how the RI measures more accurately rank the variables than the marginal correlation, especially when there are suppressed or weak predictors. We then show that predictive models built on these rankings are highly competitive, often outperforming state-of-the-art linear-model methods such as the lasso and relaxed lasso. The proposed RI-based methods are particularly effective in challenging cases involving clusters of highly correlated predictors, a setting known to cause failures in many benchmark methods. The practical utility and efficiency of RI-based methods are further demonstrated through two high-dimensional gene expression datasets. Although lasso methods have dominated the recent literature on variable selection, our study reveals that the RI-based method is a powerful and competitive alternative. We believe these underutilized tools deserve greater attention in the statistics and machine learning communities. The code is available at: https://github.com/tien-endotchang/RI-variable-selection.

2509.03297 2026-04-14 stat.ME stat.ML

Feedback-Enhanced Online Multiple Testing with Applications to Conformal Selection

Lin Lu, Yuyang Huo, Haojie Ren, Zhaojun Wang, Changliang Zou

2507.14457 2026-04-14 stat.ME

Blurring Mean Shift for Clustering Functional Data: A Scalable Algorithm and Convergence Analysis

Toshinari Morimoto, Ting-Li Chen, Su-Yun Huang, Ruey S. Tsay

Comments Proofs are provided in the supplementary material

2503.11390 2026-04-14 math.ST math.PR stat.TH

On continuity of Chatterjee's rank correlation and related dependence measures

Jonathan Ansari, Sebastian Fuchs

Comments 23 pages, 2 figures; accepted for publication in 'Bernoulli'

2503.02983 2026-04-14 stat.ML cs.LG

BLADE: Bayesian Langevin Active Discovery with Replica Exchange for Identification of Complex Systems

Cindy Xiangrui Kong, Haoyang Zheng, Guang Lin

2502.10849 2026-04-14 stat.ME

Dynamic spectral co-clustering of directed networks to unveil latent community paths in VAR-type models

Younghoon Kim, Changryong Baek

Comments This paper is withdrawn due to an error in the model specification. Specifically, the direction of propagation of the network structure in Section 2 was incorrectly defined. This issue also affects the simulation setup and model applications to real data in Sections 5 and 6, which were constructed based on the same specification. Consequently, the results reported in the paper may not be valid

2501.18785 2026-04-14 stat.ME

Low-Rank Graphon Learning for Networks

Xinyuan Fan, Feiyan Ma, Chenlei Leng, Weichi Wu

2412.20495 2026-04-14 cs.CR cs.AI cs.LG stat.ML

A Multiparty Homomorphic Encryption Approach to Confidential Federated Kaplan Meier Survival Analysis

Narasimha Raghavan Veeraragavan, Svetlana Boudko, Jan Franz Nygård

Comments 58 pages

2412.07469 2026-04-14 stat.ML cs.LG

Score-matching-based Structure Learning for Temporal Data on Networks

Hao Chen, Kai Yi

2409.01599 2026-04-14 stat.ME math.ST stat.TH

Multivariate Inference of Network Moments by Subsampling

Mingyu Qi, Chen-Wei Hua, Tianxi Li, Wen Zhou

2402.10537 2026-04-14 stat.ME

Quantifying Individual Risk for Binary Outcomes

Peng Wu, Peng Ding, Zhi Geng, Yue Liu

2401.03893 2026-04-14 math.OC stat.ML

Finite-Time Decoupled Convergence in Nonlinear Two-Time-Scale Stochastic Approximation

Yuze Han, Xiang Li, Zhihua Zhang

2207.11825 2026-04-14 stat.ME

Fast convergence rates for dose-response estimation

Matteo Bonvini, Edward H. Kennedy

1810.07793 2026-04-14 cs.LG stat.ML

The Wasserstein transform

Kun Jin, Facundo Mémoli, Zane Smith, Zhengchao Wan

2604.10353 2026-04-14 stat.ME math.ST stat.TH

Uncertainty Quantification for Noisy Low-tubal-rank Tensor Completion

Jiuqian Shang, Jingyang Li, Yang Chen

Comments 56 pages

2604.10308 2026-04-14 stat.ME stat.AP

Considerations for the Integration of Randomized Controlled Trials and Real-World Data

Sky Qiu, Charles Barr, Lauren Dang, Issa Dahabreh, Larry Han, Kajsa Kvist, Hana Lee, Andrew Mertens, Nerissa Nance, Lei Nie, Kara Rudolph, Xu Shi, Jens Tarp, Salina P. Waddy, Kenneth Wiley, Andy Wilson, Margot Lisa Jing Yann, Zhiwei Zhang, Tianyue Zhou, Maya Petersen, Mark van der Laan

2604.10249 2026-04-14 stat.ME stat.CO

Gaussian Graphical Models for Functional Connectivity Analysis: A Statistical Review with Applications to Alzheimer's Disease

Panpan Zhang, Shiying Xiao, W. Hudson Robb, Dandan Liu, Angela L. Jefferson, Jun Yan

2604.10232 2026-04-14 econ.EM math.ST stat.TH

Gaussian approximation for maximum score and non-smooth M-estimators with multiway dependence

Harold D. Chiang, Ahnaf Rafi

2604.10205 2026-04-14 math.ST stat.TH

Normalized Likelihood Criteria for Model Selection in the Stochastic Block Model

Andressa Cerqueira, Felipe Baptistão

2604.10178 2026-04-14 stat.ME

Bayesian Distance-to-Set Models: from Latent Variable to Latent Projection

Leo L Duan, Yuexi Wang, Jason Xu

2604.10088 2026-04-14 stat.ME

Cox Model Predicting Covariate Subject to Right Censoring

Chen-Yen Lin, Susan Halabi, Taehwa Choi

2604.10018 2026-04-14 stat.ME

Inference from multivariate differential recruitment in respondent-driven sampling data

Vanesa Reinoso, Danilo Alvares, Jonathan Acosta, Isabelle S. Beaudry

2604.09953 2026-04-14 stat.ME math.ST stat.TH

Partial correlation networks of Gaussian processes

Michele Peruzzi

Details

English abstract

In Gaussian graphical models, conditional independence and partial correlations are natural inferential targets for understanding direct relationships in multivariate data. No comparable framework exists for spatial processes, where multivariate analysis defaults to modeling unconditional cross-covariance structure, even when direct relationships remain of scientific interest. We address this gap by establishing a novel characterization of process-level partial correlation for multivariate Gaussian processes that recovers a direct link with Gaussian graphical models. Our analysis proceeds through a class of stationary multivariate processes, termed spectrally inside-out, in which a precision matrix modulates the strength of conditional dependence and yields necessary and sufficient conditions for conditional independence. Within this class, partial cross-correlation functions factorize into a process-level partial correlation coefficient and an attenuation term independent of cross-process parameters. The spectrally inside-out class includes the separable coregionalization model, a process convolution construction, and the parsimonious multivariate Matérn, for which such a characterization was previously thought unavailable. We further show that a nonstationary inside-out model satisfies the same factorization and admits the same necessary and sufficient conditions. Our results clarify the limitations of existing approaches: linear coregionalization models encode conditional independence through the zero pattern of the inverse factor loading matrix and do not result in interpretable partial cross-correlation functions. Low-rank spatial factor models lack a meaningful graphical characterization. Methods that enforce network structure through auxiliary graphical layers only characterize presence or absence of graph edges. We illustrate our results through synthetic and real data.
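
For the Gaussian graphical model side of the link described above, the sketch below uses the standard first-order formula for the partial correlation of two variables given a third. It does not implement the spectrally inside-out construction from the paper. The chain X1 - X2 - X3 and the neighbor correlation 0.6 are assumptions chosen for illustration.

```python
import math

def partial_corr(r_xy, r_xz, r_yz):
    """Partial correlation of X and Y given Z, from pairwise correlations."""
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz**2) * (1 - r_yz**2))

# Assumed Markov chain X1 - X2 - X3: neighbors correlate at 0.6,
# and the end points correlate only through X2, so r13 = r12 * r23.
r12, r23 = 0.6, 0.6
r13 = r12 * r23

p13_given_2 = partial_corr(r13, r12, r23)   # direct X1-X3 link after removing X2
p12_given_3 = partial_corr(r12, r13, r23)   # direct X1-X2 link after removing X3
```

Here X1 and X3 are marginally correlated (0.36) but their partial correlation given X2 vanishes, which is the conditional independence that a Gaussian graphical model encodes as a missing edge.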

2604.09950 2026-04-14 math.ST math.PR stat.TH

On a copula product linking Wasserstein correlations and rearranged dependence measures

Jonathan Ansari

Comments 22 pages; 3 figures

2604.09913 2026-04-14 stat.ME stat.ML

Performance of weakly-supervised electronic health record-based phenotyping methods in rare-outcome settings

Yunjing Hong, Jennifer C. Nelson, Brian D. Williamson

Comments 58 pages, 4 main figures, 3 supplemental figures, 4 main tables, 17 supplemental tables

2604.09910 2026-04-14 stat.ME

Mixed Membership Models for Multilevel Functional Data

Donatello Telesca, Nicholas Marco, Emma Landry

2604.09909 2026-04-14 cs.LG cs.NA math.NA math.OC stat.ML

Last-Iterate Convergence of Randomized Kaczmarz and SGD with Greedy Step Size

Michał Dereziński, Xiaoyu Dong

2604.09902 2026-04-14 stat.ME stat.AP

crumble: A comprehensive framework for modern causal mediation analysis with intermediate confounding

Richard Liu, Nicholas T. Williams, Kara E. Rudolph, Ivan Diaz

2604.09898 2026-04-14 stat.AP

Evaluating the impact of longitudinal treatment strategies in the presence of informative monitoring and time-dependent confounding

Leah Pirondini, Karla Diaz-Ordaz, Edward Palmer, Ruth H. Keogh

2604.09895 2026-04-14 stat.AP physics.data-an

Blume-Capel model: Estimation of a three stable state network for $-\bf 1$, $\bf 0$ and $\bf +1$ data

Lourens Waldorp, Jonas Dalege, Maarten Marsman, Adam Finnemann, Irene Ferri, Han L. J. van der Maas

2604.09858 2026-04-14 econ.EM stat.ME

Coupling Designs for Randomized Experiments with Complex Treatments

Max Cytrynbaum, Fredrik Sävje

2604.09779 2026-04-14 stat.ME

Inference conditional on selection: a review

Anna Neufeld, Ronan Perry, Daniela Witten

2604.09754 2026-04-14 stat.AP

Surface temperature extremes produced by huge machine learning hindcasts of summer 2023

Mark Risser, Ankur Mahesh, Joshua North, William D. Collins, Boris Bonev, Karthik Kashinath, Thorsten Kurth, Shashank Subramanian, Michael S. Pritchard

2604.09661 2026-04-14 physics.ao-ph nlin.AO physics.data-an stat.ME

Multistability and intermingledness in complex high-dimensional data

George Datseris, Johannes Lohmann, Oisín Hamilton, Jacob Haqq-Misra

2604.09660 2026-04-14 stat.AP

Overdispersed and Markovian Children

Nils Lid Hjort

Comments 18 pages, 11 figures. Statistical Research Report, Department of Mathematics, University of Oslo, April 2026. The material is up to research level, for some of the details, but most of it can be read at Master's level statistics (and, indeed, above)

2604.09656 2026-04-14 cs.LG cs.AI stat.AP stat.ME

Fairboard: a quantitative framework for equity assessment of healthcare models

James K. Ruffle, Samia Mohinta, Chris Foulon, Mohamad Zeina, Zicheng Wang, Sebastian Brandner, Harpreet Hyare, Parashkev Nachev

Comments 30 pages, 6 figures, 109 extended data figures (ancillary file)

2604.09614 2026-04-14 cs.AI cs.IT math.IT math.ST stat.TH

The Geometry of Knowing: From Possibilistic Ignorance to Probabilistic Certainty -- A Measure-Theoretic Framework for Epistemic Convergence

Moriba Kemessia Jah

Details

English abstract

This paper develops a measure-theoretic framework establishing when and how a possibilistic representation of incomplete knowledge contracts into a probabilistic representation of intrinsic stochastic variability. Epistemic uncertainty is encoded by a possibility distribution and its dual necessity measure, defining a credal set bounding all probability measures consistent with current evidence. As evidence accumulates, the credal set contracts. The epistemic collapse condition marks the transition: the Choquet integral converges to the Lebesgue integral over the unique limiting density. We prove this rigorously (Theorem 4.5), with all assumptions explicit and a full treatment of the non-consonant case. We introduce the aggregate epistemic width W, establish its axiomatic properties, provide a canonical normalization, and give a feasible online proxy resolving a circularity in prior formulations. Section 7 develops the dynamics of epistemic contraction: evidence induces compatibility, compatibility performs falsification, posterior possibility is the min-intersection of prior possibility and compatibility, and a credibility-directed flow governs support geometry contraction. This is not belief updating. It is knowledge contraction. Probability theory is the limiting geometry of that process. The UKF and ESPF solve different problems by different mechanisms. The UKF minimizes MSE, asserts truth, and requires a valid generative model. The ESPF minimizes maximum entropy and surfaces what evidence has not ruled out. When the world is Gaussian and the model valid, both reach the same estimate by entirely different routes -- convergent optimality, not hierarchical containment. We prove this (Theorem 9.1) and compare both on a 2-day, 877-step orbital tracking scenario. Both achieve 1-meter accuracy. The UKF is accurate but epistemically silent. The ESPF is accurate and epistemically honest.
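
The possibility and necessity measures and the min-intersection update named in the abstract can be sketched on a finite set as follows. This is a minimal illustration using the standard definitions from possibility theory (possibility of an event as the maximum of the distribution over it, necessity as one minus the possibility of the complement). The hypotheses `a`, `b`, `c`, the numeric values, and the function names are hypothetical, and skipping renormalization after the update is an assumption, not something the abstract specifies.

```python
def possibility(pi, event):
    """Possibility of an event: the largest pi value among its elements."""
    return max((pi[x] for x in event), default=0.0)

def necessity(pi, event):
    """Dual necessity: 1 minus the possibility of the complement."""
    complement = set(pi) - set(event)
    return 1.0 - possibility(pi, complement)

def min_intersection(prior, compat):
    """Pointwise minimum of prior possibility and evidence compatibility."""
    return {x: min(prior[x], compat[x]) for x in prior}

# Hypothetical prior possibility and compatibility of new evidence.
prior = {"a": 1.0, "b": 0.8, "c": 0.4}
compat = {"a": 1.0, "b": 0.3, "c": 0.9}

post = min_intersection(prior, compat)
n_before = necessity(prior, {"a"})   # how strongly "a" is implied before the evidence
n_after = necessity(post, {"a"})     # and after the evidence has ruled out alternatives
```

In this toy case the evidence lowers the possibility of `b`, so the necessity of `{a}` rises from about 0.2 to about 0.6: alternatives are ruled out rather than probabilities being reweighted.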

2603.19640 2026-04-14 stat.AP

Logistic-aided Huber M-estimator for robust GNSS positioning

Zhengdao Li, Penggao Yan, Li-Ta Hsu

Comments Submitted to IEEE Transactions on Aerospace and Electronic Systems

2603.05919 2026-04-14 cs.LG math.ST stat.ML stat.TH

Design Experiments to Compare Multi-armed Bandit Algorithms

Huiling Meng, Ningyuan Chen, Xuefeng Gao

2601.20628 2026-04-14 stat.ML cs.LG

Sparse clustering via the Deterministic Information Bottleneck algorithm

Efthymios Costa, Ioanna Papatsouma, Angelos Markos

Comments Submitted to IFCS 2026 (8 pages total)

2512.01070 2026-04-14 math.DG stat.ME

Covariance Estimation for Matrix-variate Data via Fixed-rank Core Covariance Geometry

Bongjung Sung

Comments 39 pages, 22 pages in the main text, 4 figures

2511.17725 2026-04-14 stat.ME

A Unified Spatiotemporal Framework for Modeling Censored and Missing Areal Responses

Jose A. Ordoñez, Tsung-I Lin, Victor H. Lachos, Luis M. Castro

2510.02050 2026-04-14 stat.AP cs.LG

Multidata Causal Discovery for Statistical Hurricane Intensity Forecasting

Saranya Ganesh S, Frederick Iat-Hin Tam, Milton S. Gomez, Marie McGraw, Mark DeMaria, Kate Musgrave, Jakob Runge, Tom Beucler

Comments 20 pages, 8 Figures, 1 Table, SI; Manuscript following second peer review

2509.24904 2026-04-14 astro-ph.CO astro-ph.HE astro-ph.IM physics.data-an stat.CO

Graph-based Summary Statistics for Revealing the Stochastic Gravitational Wave Background in Pulsar Timing Arrays

M. Alakhras, S. M. S. Movahed

Comments 29 pages, 15 figures, 1 table. Matched with the published version. Including the revision in a part of method

Details

DOI: 10.3847/1538-4357/ae4342
Journal ref: The Astrophysical Journal 999.2 (2026): 226

English abstract

In this work, we propose a graph-based method, applied to pulsar timing residuals (PTRs), for detecting the stochastic gravitational wave background (SGWB) in the nano-Hertz frequency regime and for examining the uncertainties of its parameters. We construct a correlation graph with pulsars as its nodes and analyze graph-based summary statistics, including structural characteristics of complex networks, to identify the SGWB in real and synthetic datasets. The effect of the number of pulsars, the observation time span, and the strength of the SGWB on the graph-based feature vector is evaluated. Our results demonstrate that the Discriminative Summary Statistics for common signal detection consist of the average clustering coefficient and the edge weight fluctuation. SGWB detection, conducted after a common signal is observed and non-Hellings & Downs templates are excluded, is performed using the second cumulant of the edge weight for angular separation thresholds $\bar{\zeta}\gtrsim 40^{\circ}$. The lowest detectable SGWB strain amplitude using our graph-based measures at current PTA sensitivity is $A_{\rm SGWB}\gtrsim 1.2\times 10^{-15}$. Fisher forecasts confirm that the uncertainties of $\log_{10} A_{\rm SGWB}$ and the spectral index reach $1.5\%$ and $19.5\%$, respectively, at the $2\sigma$ confidence level. Applying our graph-based method to the NANOGrav 15-year dataset yields weak evidence for an SGWB at the $\sim 2.3\sigma$ level.
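
Two of the summary statistics named in the abstract, the average clustering coefficient and the second cumulant (variance) of the edge weights, can be sketched on a toy correlation graph as follows. This is only an illustration under assumptions: the four nodes, the correlation values, and the threshold of 0.5 are hypothetical, and the clustering coefficient here is the unweighted version on a thresholded graph, which may differ from the definition used in the paper.

```python
from itertools import combinations
from statistics import pvariance

def average_clustering(adj):
    """Mean local clustering coefficient of an undirected graph
    given as {node: set_of_neighbors}; nodes with degree < 2 count as 0."""
    coeffs = []
    for node, nbrs in adj.items():
        k = len(nbrs)
        if k < 2:
            coeffs.append(0.0)
            continue
        # Count neighbor pairs that are themselves connected.
        links = sum(1 for u, v in combinations(nbrs, 2) if v in adj[u])
        coeffs.append(2.0 * links / (k * (k - 1)))
    return sum(coeffs) / len(coeffs)

# Hypothetical pairwise correlations between four nodes (e.g. pulsars).
weights = {(0, 1): 0.9, (0, 2): 0.8, (1, 2): 0.7,
           (2, 3): 0.6, (0, 3): 0.1, (1, 3): 0.2}
threshold = 0.5  # assumed cut-off for keeping an edge

adj = {i: set() for i in range(4)}
for (i, j), w in weights.items():
    if w >= threshold:
        adj[i].add(j)
        adj[j].add(i)

avg_c = average_clustering(adj)            # structural statistic of the graph
var_w = pvariance(list(weights.values()))  # second cumulant of edge weights
```

Together these two numbers form a small feature vector for the graph; the abstract describes a larger graph-based feature vector evaluated against the number of pulsars, the time span, and the signal strength.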

2506.05014 2026-04-14 cs.LG cs.AI stat.ML

Towards Reasonable Concept Bottleneck Models

Nektarios Kalampalikis, Kavya Gupta, Georgi Vitanov, Isabel Valera

Comments 32 pages, 20 figures

2506.00444 2026-04-14 math.ST stat.TH

Detecting non-uniform patterns on high-dimensional hyperspheres

Tiefeng Jiang, Tuan Pham

Comments added results for the Watson model

2505.18344 2026-04-14 cs.LG cs.AI stat.ML

Improved Sample Complexity For Diffusion Model Training Without Empirical Risk Minimizer Access

Mudit Gaur, Prashant Trivedi, Sasidhar Kunapuli, Amrit Singh Bedi, Vaneet Aggarwal

2503.11268 2026-04-14 stat.ME stat.CO

Rank estimation for the accelerated failure time model with partially interval-censored data

Taehwa Choi, Sangbum Choi, Dipankar Bandyopadhyay

Comments Accepted in Statistica Sinica

2502.07114 2026-04-14 stat.ML cs.LG cs.NA math.NA math.OC stat.CO

Online Covariance Matrix Estimation in Sketched Newton Methods

Wei Kuang, Mihai Anitescu, Sen Na

Comments 63 pages, 4 figures, 9 tables

2410.22559 2026-04-14 cs.LG cs.AI stat.ML

Disentanglement as Identifiable Pushforward Factorisation

Carl Allen

Comments 9 pages

2410.11964 2026-04-14 cs.LG stat.ML

A Complete Decomposition of KL Error using Refined Information and Mode Interaction Selection

James Enouen, Mahito Sugiyama

2311.11487 2026-04-14 stat.ME stat.AP

Modeling Insurance Claims using Bayesian Nonparametric Regression

Mostafa Shams Esfand Abadi, Kaushik Ghosh