arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.19740 2026-04-22 cs.LG cs.AI cs.CV stat.ML

Generalization at the Edge of Stability

Mario Tuci, Caner Korkmaz, Umut Şimşekli, Tolga Birdal

Comments Project page: https://circle-group.github.io/research/GATES

详情

英文摘要

Training modern neural networks often relies on large learning rates, operating at the edge of stability, where the optimization dynamics exhibit oscillatory and chaotic behavior. Empirically, this regime often yields improved generalization performance, yet the underlying mechanism remains poorly understood. In this work, we represent stochastic optimizers as random dynamical systems, which often converge to a fractal attractor set (rather than a point) with a smaller intrinsic dimension. Building on this connection and inspired by Lyapunov dimension theory, we introduce a novel notion of dimension, coined the `sharpness dimension', and prove a generalization bound based on this dimension. Our results show that generalization in the chaotic regime depends on the complete Hessian spectrum and the structure of its partial determinants, highlighting a complexity that cannot be captured by the trace or spectral norm considered in prior work. Experiments across various MLPs and transformers validate our theory while also providing new insights into the recently observed phenomenon of grokking.

URL PDF HTML ☆

赞 0 踩 0

2604.19712 2026-04-22 cs.LG cond-mat.dis-nn cs.IT math.IT math.PR stat.ML

Ultrametric OGP - parametric RDT \emph{symmetric} binary perceptron connection

Mihailo Stojnic

2604.19698 2026-04-22 cs.LG math.ST stat.TH

On two ways to use determinantal point processes for Monte Carlo integration

Guillaume Gautier, Rémi Bardenet, Michal Valko

Comments NeurIPS 2019

2604.19694 2026-04-22 stat.ME stat.AP

A Goodness-of-Fit Test for Mixed-Effects Logistic Regression

Ariel Linden

2604.19672 2026-04-22 cs.LG stat.ML

Budgeted Online Influence Maximization

Pierre Perrault, Jennifer Healey, Zheng Wen, Michal Valko

Comments 37th International Conference on Machine Learning (ICML 2020), 28 pages

2604.19662 2026-04-22 q-bio.NC physics.bio-ph stat.AP

Modelling time-order effects in haptic perception with a Bayesian dynamical framework

Gastón Avetta, Jose Lobera, Juan José Zárate, Inés Samengo, Damián G. Hernández

Comments 21 pages, 7 figures

2604.19580 2026-04-22 q-fin.ST econ.EM q-fin.PM stat.AP

Probabilistic Forecasting for Day-ahead Electricity Prices, Battery Trading Strategies and the Economic Evaluation of Predictive Accuracy

Simon Hirsch, Florian Ziel

Comments 30 pages, 15 figures, 5 pages supplementary materials

2604.19560 2026-04-22 cs.LG math.OC stat.ML

Separating Geometry from Probability in the Analysis of Generalization

Maxim Raginsky, Benjamin Recht

Comments 19 pages

2604.19531 2026-04-22 cs.SI math.ST stat.TH

Hypergraph Mining via Proximity Matrix

Junhao Bian, Yilin Bi, Tao Zhou

2604.19517 2026-04-22 stat.ME

PRADAS: PRior-Assisted DAta Splitting for False Discovery Rate Control

Yuanchuan Guo, Buyu Lin, Jun S. Liu

Comments 61 pages, 6 figures

2604.19493 2026-04-22 stat.ME stat.CO

A Nonparametric Goodness-of-Fit Test for High-Dimensional Generalized Gaussian Distributions via Nearest-Neighbor Graphs

Mehmet Sıddık Çadırcı, Yener Ünal

Comments 22 pages, 5 pages

2604.19463 2026-04-22 astro-ph.CO stat.ME

On combining estimated and analytic covariance matrices

Alan Heavens, Lorne Whiteway, Elena Sellentin

Comments For submission to OJA

2604.19451 2026-04-22 cs.LG stat.ML

Heterogeneity-Aware Personalized Federated Learning for Industrial Predictive Analytics

Yuhan Hu, Xiaolei Fang

2604.19381 2026-04-22 math.OC math.ST stat.TH

Sharp recovery and landscape guarantees for the nonconvex matrix LASSO

Andrew D. McRae, Richard Y. Zhang

2604.19378 2026-04-22 stat.ME stat.CO

Random Reward Phase-Type Distributions with Applications in Latent Severity Modeling

Simon Pauli, Andreas Futschik

Comments 25 pages, 9 figures, submitted to Statistical Papers

2604.19352 2026-04-22 math.ST stat.TH

Stochastic Intervention

Rohit Chaudhuri

Comments Stochastic Intervention, Causal Inference, High Dimensional Treatments, High Dimensional Inference

2604.18653 2026-04-22 stat.ME physics.soc-ph

How to quantify direct correlations between variables

Shengjun Wu, Jeffery Wu

Comments 15 pages, 11 figures, 3 tables

2604.18181 2026-04-22 math.ST stat.TH

Spectral approximation for the separable covariance mixture model

Ben Deitmar

Comments 96 pages, 2 figures

2604.17094 2026-04-22 astro-ph.IM physics.data-an stat.CO

Simple approximations of some statistical functions

Zinovy Malkin

2604.16129 2026-04-22 stat.ME

Deep Ranking with Heterogeneous Effects

Yuanhang Luo, Shuxing Fang, Ruijian Han, Yiming Xu

2604.11812 2026-04-22 math.ST stat.ME stat.TH

Confidence envelopes for the false discoveries with heterogeneous data

Romain Périer, Gilles Blanchard, Sebastian Döhler, Guillermo Durand, Etienne Roquain

2604.08681 2026-04-22 stat.ME econ.EM stat.AP

Nonparametric Identification and Estimation of Causal Effects on Latent Outcomes

Jiawei Fu, Donald P. Green

2603.21623 2026-04-22 stat.ME stat.ML

Neyman-Pearson multiclass classification under label noise via empirical likelihood

Qiong Zhang, Qinglong Tian, Pengfei Li

2603.17463 2026-04-22 stat.AP econ.EM q-fin.RM q-fin.ST

Multivariate GARCH and portfolio variance prediction: A forecast reconciliation perspective

Massimiliano Caporin, Daniele Girolimetto, Emanuele Lopetuso

2603.15817 2026-04-22 stat.ME math.ST stat.TH

On the Equivalence between Neyman Orthogonality and Pathwise Differentiability

Yuxi Chen, Edward H. Kennedy, Sivaraman Balakrishnan

2512.09060 2026-04-22 stat.CO stat.ML

All Emulators are Wrong, Many are Useful, and Some are More Useful Than Others: A Reproducible Comparison of Computer Model Surrogates

Kellin N. Rumsey, Graham C. Gibson, Devin Francom, Reid Morris

2511.22535 2026-04-22 stat.ME

Bayes Factor Hypothesis Testing in Meta-Analyses: Practical Advantages and Methodological Considerations

Joris Mulder, Robbie C. M. van Aert

Comments 63 pages, 10 figures

2511.16164 2026-04-22 cs.LG stat.AP

Achieving Skilled and Reliable Daily Probabilistic Forecasts of Wind Power at Subseasonal-to-Seasonal Timescales over France

Eloi Lindas, Yannig Goude, Philippe Ciais

2510.24512 2026-04-22 eess.SP physics.geo-ph stat.AP

Heuristic Quality Coefficients for Interferometric Phase Linking

Magnus Heimpel, Irena Hajnsek, Othmar Frey

Comments 32 pages, 9 figures. Replacement is the version now published in ISPRS Journal of Photogrammetry and Remote Sensing

详情

DOI: 10.1016/j.isprsjprs.2026.04.015
Journal ref: ISPRS Journal of Photogrammetry and Remote Sensing, vol. 237, pp. 1-21 (2026)

英文摘要

In multitemporal InSAR, phase linking (PL) refers to the estimation of a single-reference interferometric phase history for distributed scatterers (DS) from the information contained in the sample coherence matrix. Because the phase information in this matrix is typically inconsistent, DS processing needs practical reliability indicators to decide whether a pixel's PL estimate is sufficiently supported by the data for subsequent deformation analysis. For maximum-likelihood estimation, uncertainty can be quantified via Fisher-information-based covariance estimates, but no analogous, generally applicable uncertainty quantification is available for the broad range of non-ML methods. We propose three heuristic quality coefficients within a unified mathematical framework that covers common PL methods: (1) a method-specific goodness-of-fit coefficient that normalizes the achieved PL objective between a method-consistent upper bound and an empirically modeled noise floor level; (2) a closure phase coefficient computed from the sample coherence matrix in advance; and (3) an ambiguity coefficient that compares the obtained PL estimate with the best alternative in its orthogonal complement in the solution space. All coefficients are normalized to the interval $[0,1]$, where 1 indicates maximum reliability and 0 matches the behavior expected under pure noise. Simulations under exponential and seasonal decorrelation models show that the goodness-of-fit coefficient tracks the normalized absolute phase error most consistently, whereas the closure phase coefficient provides an a priori indicator for pre-screening. Experiments on a TerraSAR-X stack over Visp, Switzerland, reveal plausible spatial patterns across urban and vegetated areas and show that the ambiguity coefficient provides complementary information, especially in regions with temporally varying scattering mechanisms.

URL PDF HTML ☆

赞 0 踩 0

2510.10866 2026-04-22 stat.ML cs.LG

Quantifying Data Similarity Using Cross Learning

Shudong Sun, Hao Helen Zhang, Joseph C Watkins

2508.04818 2026-04-22 cs.CV eess.IV stat.ML

Single-Step Reconstruction-Free Anomaly Detection and Segmentation via Diffusion Models

Mehrdad Moradi, Marco Grasso, Bianca Maria Colosimo, Kamran Paynabar

Comments 9 pages, 8 figures, 1 table. Accepted to 2025 International Conference on Machine Learning and Applications (ICMLA)

详情

DOI: 10.1109/ICMLA66185.2025.00095
Journal ref: Proc. 2025 International Conference on Machine Learning and Applications (ICMLA), Boca Raton, FL, USA, 2025, pp. 663-670

英文摘要

Generative models have demonstrated significant success in anomaly detection and segmentation over the past decade. Recently, diffusion models have emerged as a powerful alternative, outperforming previous approaches such as GANs and VAEs. In typical diffusion-based anomaly detection, a model is trained on normal data, and during inference, anomalous images are perturbed to a predefined intermediate step in the forward diffusion process. The corresponding normal image is then reconstructed through iterative reverse sampling. However, reconstruction-based approaches present three major challenges: (1) the reconstruction process is computationally expensive due to multiple sampling steps, making real-time applications impractical; (2) for complex or subtle patterns, the reconstructed image may correspond to a different normal pattern rather than the original input; and (3) Choosing an appropriate intermediate noise level is challenging because it is application-dependent and often assumes prior knowledge of anomalies, an assumption that does not hold in unsupervised settings. We introduce Reconstruction-free Anomaly Detection with Attention-based diffusion models in Real-time (RADAR), which overcomes the limitations of reconstruction-based anomaly detection. Unlike current SOTA methods that reconstruct the input image, RADAR directly produces anomaly maps from the diffusion model, improving both detection accuracy and computational efficiency. We evaluate RADAR on real-world 3D-printed material and the MVTec-AD dataset. Our approach surpasses state-of-the-art diffusion-based and statistical machine learning models across all key metrics, including accuracy, precision, recall, and F1 score. Specifically, RADAR improves F1 score by 7% on MVTec-AD and 13% on the 3D-printed material dataset compared to the next best model. Code available at: https://github.com/mehrdadmoradi124/RADAR

URL PDF HTML ☆

赞 0 踩 0

2507.12330 2026-04-22 stat.AP stat.ME

Forecasting sub-population mortality using credibility theory

Mathias Lindholm, Gabriele Pittarello

2503.08389 2026-04-22 stat.ME

Clustered Flexible Calibration Plots For Binary Outcomes Using Random Effects Modeling

Lasai Barreñada, Bavo D. C. Campo, Laure Wynants, Ben Van Calster

Comments 44 pages, 18 figures, 4 tables

详情

DOI: 10.1017/rsm.2025.10046
Journal ref: Res. synth. methods 17 (2026) 567-588

英文摘要

Evaluation of clinical prediction models across multiple clusters, whether centers or datasets, is becoming increasingly common. A comprehensive evaluation includes an assessment of the agreement between the estimated risks and the observed outcomes, also known as calibration. Calibration is of utmost importance for clinical decision making with prediction models and it may vary between clusters. We present three approaches to take clustering into account when evaluating calibration. (1) Clustered group calibration (CG-C), (2) two-stage meta-analysis calibration (2MA-C) and (3) mixed model calibration (MIX-C) can obtain flexible calibration plots with random effects modelling and providing confidence and prediction intervals. As a case example, we externally validate a model to estimate the risk that an ovarian tumor is malignant in multiple centers (N = 2489). We also conduct a simulation study and synthetic data study generated from a true clustered dataset to evaluate the methods. In the simulation study MIX-C and 2MA-C (splines) gave estimated curves closest to the true overall curve. In the synthetic data study MIX-C produced cluster specific curves closest to the truth. Coverage of the prediction interval across the plot was best for 2MA-C with splines. We recommend using 2MA-C with splines to estimate the overall curve and the 95% PI and MIX-C for the cluster specific curves, especially when sample size per cluster is limited. We provide ready-to-use code to construct summary flexible calibration curves with confidence and prediction intervals to assess heterogeneity in calibration across datasets or centers.

URL PDF HTML ☆

赞 0 踩 0

2411.03304 2026-04-22 stat.ME math.ST stat.TH

Bayesian Controlled FDR Variable Selection via Parameter-Expanded Latent Knockoffs

Lorenzo Focardi-Olmi, Anna Gottard, Michele Guindani, Marina Vannucci

详情

英文摘要

In many research fields, researchers aim to identify significant associations between a set of explanatory variables and a response while controlling the FDR. The Knockoff filter has been recently proposed in the frequentist paradigm to introduce controlled noise in a model by cleverly constructing copies of the predictors as auxiliary variables. We develop a fully Bayesian generalization of the classical model-X knockoff filter for normally distributed covariates. In our approach, we consider a joint model for the covariates and the response, where the conditional independence structure of the covariates is captured through a Gaussian graphical model and used to define a latent knockoff layer through a parameter-expanded representation of the response model. Estimating the covariate graph informs the knockoff construction and improves inference on the covariate effects. We use a modified spike-and-slab prior on the regression coefficients, avoiding the increase of the model dimension typical of the classical knockoff filter. We also address extensions to non-Gaussian responses. Our model performs variable selection using an upper bound on the posterior probability of non-inclusion. We show that the induced latent knockoff layer defines valid Gaussian model-X knockoffs under the proposed construction and that the resulting procedure controls the Bayesian FDR at an arbitrary level, in finite samples, if the distribution of the covariates is fully known; under an estimated graphical structure, it satisfies an asymptotic FDR guarantee. We use simulated data to demonstrate that our proposal increases the stability of the selection with respect to classical knockoff methods. With respect to Bayesian variable selection methods, our selection procedure achieves comparable or better performances, while maintaining control over the FDR. We conclude with an application to real data.

URL PDF HTML ☆

赞 0 踩 0

2407.13980 2026-04-22 stat.ME cs.LG stat.ML

Byzantine-tolerant distributed learning of finite mixture models

Qiong Zhang, Yan Shuo Tan, Jiahua Chen

2312.06098 2026-04-22 stat.ME math.ST stat.TH

Mixture Matrix-valued Autoregressive Model

Fei Wu, Kung-Sik Chan

2307.01348 2026-04-22 econ.EM stat.ME

Nonparametric Estimation of Large Spot Volatility Matrices for High-Frequency Financial Data

Ruijun Bu, Degui Li, Oliver Linton, Hanchao Wang

1110.6639 2026-04-22 physics.data-an stat.CO

On computation of a common mean

Zinovy Malkin

2604.19290 2026-04-22 q-fin.CP q-fin.MF q-fin.PR q-fin.RM stat.ME

Orthogonal reparametrization of the Nelson-Siegel-Svensson interest rate curve model: conditioning, diagnostics, and identifiability

Robert Flassig, Emrah Gülay, Daniel Guterding

Comments 28 pages, 10 figures

2604.19279 2026-04-22 stat.AP stat.OT

Early Prediction of Student Performance Using Bayesian Updating with Informative Priors Across Cohorts

Jakob Schwerter, Amer Krivosija, Tim Novak, Katja Ickstadt, Alexander Munteanu

2604.19177 2026-04-22 stat.ME

Multiscale Cochran-Mantel-Haenszel Scanning for Conditional Dependency

Gyeonghun Kang, Jialiang Mao, Li Ma

2604.19175 2026-04-22 stat.CO

Digital twin-based hybrid framework for steam generator clogging prognostics

Edgar Jaber, Emmanuel Remy, Vincent Chabridon, Morgane Garo-Sail, Mathilde Mougeot, Didier Lucor, Jerome Delplace, Maxime Lointier

2604.19165 2026-04-22 stat.ML cs.LG cs.NA math.NA

Analytical Extraction of Conditional Sobol' Indices via Basis Decomposition of Polynomial Chaos Expansions

Shijie Zhong, Jiangfeng Fu

Comments 11 pages, 2 figures

2604.19162 2026-04-22 cs.CL stat.AP

Mind the Unseen Mass: Unmasking LLM Hallucinations via Soft-Hybrid Alphabet Estimation

Hongxing Pan, Yingying Guo, Wenqing Kuang, Jiashi Lu

Comments 7 pages, 1 figure, 3 tables

2604.19153 2026-04-22 stat.AP

And Quiet Does Not Flow the Don: Statistical Analysis of a Quarrel Between Nobel Prize Laureates

Nils Lid Hjort

Comments 8 pages, 2 figures; Statistical Research Report, Department of Mathematics, University of Oslo, and Centre of Advanced Study, the Norwegian Academy of Science and Letters, 2007; published in "Consilience", Centre of Advanced Study, Norwegian Academy of Science and Letters, 2007, pp. 134-140

2604.19152 2026-04-22 stat.ME

Transfer Learning for Degree-Corrected Mixed Membership Network Models

Yong He, Kangxiang Qin, Haoran Tang

2604.19150 2026-04-22 stat.ME math.ST stat.TH

The General Formulation of Loss-Based Priors for Parameter Spaces

Cristiano Villa

2604.19091 2026-04-22 stat.ML cs.LG

Fast estimation of Gaussian mixture components via centering and singular value thresholding

Huan Qing

Comments 28 pages, 7 figures, 1 table

2604.19066 2026-04-22 cs.LG stat.AP

Age-Dependent Heterogeneity in the Association Between Physical Activity and Mental Distress: A Causal Machine Learning Analysis of 3.2 Million U.S. Adults

Yuan Shan

2604.19065 2026-04-22 cs.GT cs.SY eess.SY math.OC stat.ML

Last-Iterate Guarantees for Learning in Co-coercive Games

Siddharth Chandak, Ramanan Tamizholi, Nicholas Bambos

Comments Submitted to IEEE Conference on Decision and Control (CDC) 2026

2604.19018 2026-04-22 cs.LG cs.AI cs.SY eess.SY math.OC stat.ML

Local Linearity of LLMs Enables Activation Steering via Model-Based Linear Optimal Control

Julian Skifstad, Xinyue Annie Yang, Glen Chou

Comments Under review

2604.18973 2026-04-22 stat.AP cs.LG

Ground-Level Near Real-Time Modeling for PM2.5 Pollution Prediction

Zachary R. Fox, Janet O. Agbaje, Dakotah Maguire, Javier E. Santos, Jeremy Logan, Maggie Davis, Rima Habre, Jim VanDerslice, Heidi A. Hanson

详情

英文摘要

Air pollution is a worldwide public health threat that can cause or exacerbate many illnesses, including respiratory disease, cardiovascular disease, and some cancers. However, epidemiological studies and public health decision-making are stymied by the inability to assess pollution exposure impacts in near real time. To address this, developing accurate digital twins of environmental pollutants will enable timely data-driven analytics - a crucial step in modernizing health policy and decision-making. Although other models predict and analyze fine particulate matter exposure, they often rely on modeled input data sources and data streams that are not regularly updated. Another challenge stems from current models relying on predefined grids. In contrast, our deep-learning approach interpolates surface level PM2.5 concentrations between sparsely distributed US EPA monitoring stations in a grid-free manner. By incorporating additional, readily available datasets - including topographic, meteorological, and land-use data - we improve its ability to predict pollutant concentrations with high spatial and temporal resolution. This enables model querying at any spatial location for rapid predictions without computing over the entire grid. To ensure robustness, we randomize spatial sampling during training to enable our model to perform well in both dense and sparse monitored regions. This model is well suited for near real-time deployment because its lightweight architecture allows for fast updates in response to streaming data. Moreover, model flexibility and scalability allow it to be adapted to various geographical contexts and scales, making it a practical tool for delivering accurate and timely air quality assessments. Its capacity to rapidly evaluate multiple scenarios can be especially valuable for decision-making during public health crises.

URL PDF HTML ☆

赞 0 踩 0

2604.18912 2026-04-22 cs.LG stat.ME

Collaborative Contextual Bayesian Optimization

Chih-Yu Chang, Qiyuan Chen, Tianhan Gao, David Fenning, Chinedum Okwudire, Neil Dasgupta, Wei Lu, Raed Al Kontar

2604.18864 2026-04-22 cs.LG stat.ML

ParamBoost: Gradient Boosted Piecewise Cubic Polynomials

Nicolas Salvadé, Tim Hillel

2604.18840 2026-04-22 stat.AP stat.CO stat.ME

Spatial Extremes at Scale: A Case Study of Surface Skin Temperature and Heat Risk in the United States

Ben Seiyon Lee, Reetam Majumder, Jordan Richards, Emma S. Simpson, Likun Zhang

2604.18823 2026-04-22 stat.AP

A Non-stationary, Amortized, Transfer Learning Approach for Modeling Italian Air Quality

Alessandro Fusta Moro, Antony Sikorski, Daniel McKenzie, Alessandro Fassò, Douglas Nychka

2604.18774 2026-04-22 stat.CO stat.ME

A simulation study to resolve conflicting evidence on the error rates from MANOVA group tests

Joseph D Consiglio

Comments 19 pages, 9 tables, 0 figures

2604.18742 2026-04-22 stat.AP stat.ME

JASPER: Joint Bayesian Analysis of Spatial Expression via Regression

Pritam Dey, Rajarshi Guhaniyogi, Yang Ni, Bani K. Mallick

Comments 40 pages; 6 figures

2604.18657 2026-04-22 stat.ME

Locally parametric nonparametric density estimation

Nils Lid Hjort, M. C. Jones

Comments 30 pages, no figures. This is the Statistical Research Report version, Department of Mathematics, University of Oslo, November 1995, published in Annals of Statistics, 1996, vol. 24, pages 1619-1647

2604.18646 2026-04-22 stat.ME

Stable Transport Meta-Analysis for Heterogeneous Cardiovascular Trials: A Nuisance-Anchor Framework with a Sign-Stability Diagnostic

Ibrahim Halil Tanboga

2604.18632 2026-04-22 cs.CV stat.AP

StomaD2: An All-in-One System for Intelligent Stomatal Phenotype Analysis via Diffusion-Based Restoration Detection Network

Quanling Zhao, Meng'en Qin, Yanfeng Sun, Yuan Miao, Xiaohui Yang

2604.18609 2026-04-22 stat.AP

The Broken Shield of European Palliative Care: Evidence from Synthetic Counterfactuals on Financial Toxicity and Informal Care

Pietro Grassi, Edoardo Paperi, Chiara Seghieri, Daniele Vignoli

2604.18605 2026-04-22 q-fin.GN math.PR stat.AP

Exploring Drivers of Extreme Housing Prices in Australia

Grace Burtenshaw, Ashley Burtenshaw, Meagan Carney

2604.18599 2026-04-22 stat.AP q-bio.NC

Simulation Based Inference of a Simple Neural Network Structure

Pierre Charitat, Ségolen Geffray, Christophe Pouzat

2604.18598 2026-04-22 stat.AP cs.CE cs.NA math.NA

Bathymetry Reconstruction by Bayesian Inference

Lars Stietz, Sebastian Götschel, Peter Schleper, Daniel Ruprecht

2604.10395 2026-04-22 math.PR math.ST stat.TH

A remark on the comparison of the sum and the maximum of positive random variables

Kazuki Okamura

Comments 6 pages; results extended

2604.08404 2026-04-22 cs.LG stat.ML

Adversarial Label Invariant Graph Data Augmentations for Out-of-Distribution Generalization

Simon Zhang, Ryan P. DeMilt, Kun Jin, Cathy H. Xia

Comments 22 pages, 3 figures, accepted at ICML SCIS 2023

2603.19569 2026-04-22 stat.ME

Heterogeneous readmission prediction with hierarchical effect decomposition and regularization

Ziren Jiang, Lingfeng Huo, Jue Hou, Mary Vaughan-Sarrazin, Maureen A. Smith, Jared D. Huling

Comments 31 pages, 5 figures, 2 tables

2602.19790 2026-04-22 cs.LG stat.ML

Drift Localization using Conformal Predictions

Fabian Hinder, Valerie Vaquet, Johannes Brinkrolf, Barbara Hammer

Comments Paper is an extended version; the original was published at the 34th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN) 2026

2512.19553 2026-04-22 stat.ME

A Statistical Framework for Understanding Causal Effects that Vary by Treatment Initiation Time in EHR-based Studies

Luke Benz, Rajarshi Mukherjee, Rui Wang, David Arterburn, Heidi Fischer, Catherine Lee, Susan M. Shortreed, Alexander W. Levis, Sebastien Haneuse

2512.12448 2026-04-22 cs.LG cs.NE physics.data-an stat.ML

Optimized Architectures for Kolmogorov-Arnold Networks

James Bagrow, Josh Bongard

Comments 23 pages, 4 figures, 9 tables

2510.13763 2026-04-22 stat.ML cs.LG

PriorGuide: Test-Time Prior Adaptation for Simulation-Based Inference

Yang Yang, Severi Rissanen, Paul E. Chang, Nasrulloh Loka, Daolang Huang, Arno Solin, Markus Heinonen, Luigi Acerbi

Comments Accepted at ICLR 2026. Camera-ready version. 38 pages, 8 figures

2510.09477 2026-04-22 stat.ML cs.LG

Efficient Autoregressive Inference for Transformer Probabilistic Models

Conor Hassan, Nasrulloh Loka, Cen-You Li, Daolang Huang, Paul E. Chang, Yang Yang, Francesco Silvestrin, Samuel Kaski, Luigi Acerbi

Comments Accepted at ICLR 2026. Camera-ready version. 39 pages, 20 figures

2509.03726 2026-04-22 stat.ML cs.LG

Energy-Weighted Flow Matching: Unlocking Continuous Normalizing Flows for Efficient and Scalable Boltzmann Sampling

Niclas Dern, Lennart Redl, Sebastian Pfister, Marcel Kollovieh, David Lüdke, Stephan Günnemann

Comments 21 pages, 4 figures

2508.21184 2026-04-22 cs.CL cs.AI stat.ML

BED-LLM: Intelligent Information Gathering with LLMs and Bayesian Experimental Design

Deepro Choudhury, Sinead Williamson, Adam Goliński, Ning Miao, Freddie Bickford Smith, Michael Kirchhof, Yizhe Zhang, Tom Rainforth

Comments Published at the International Conference on Learning Representations 2026

2507.03828 2026-04-22 cs.LG stat.ML

IMPACT: Importance-Aware Activation Space Reconstruction

Md Mokarram Chowdhury, Daniel Agyei Asante, Ernie Chang, Yang Li

Comments To appear in the Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)

2507.01918 2026-04-22 q-fin.PM cs.AI math.OC physics.data-an stat.ML

End-to-End Large Portfolio Optimization for Variance Minimization with Neural Networks through Covariance Cleaning

Christian Bongiorno, Efstratios Manolakis, Rosario Nunzio Mantegna

详情

DOI: 10.1016/j.jfds.2026.100179
Journal ref: The Journal of Finance and Data Science, 12, (2026) 100179

英文摘要

We develop a rotation-invariant neural network that provides the global minimum-variance portfolio by jointly learning how to lag-transform historical returns and marginal volatilities and how to regularise the eigenvalues of large equity covariance matrices. This explicit mathematical mapping offers clear interpretability of each module's role, so the model cannot be regarded as a pure black box. The architecture mirrors the analytical form of the global minimum-variance solution yet remains agnostic to dimension, so a single model can be calibrated on panels of a few hundred stocks and applied, without retraining, to one thousand US equities, a cross-sectional jump that indicates robust generalization capability. The loss function is the future short-term realized minimum variance and is optimized end-to-end on real returns. In out-of-sample tests from January 2000 to December 2024, the estimator delivers systematically lower realized volatility, smaller maximum drawdowns, and higher Sharpe ratios than the best competitors, including state-of-the-art non-linear shrinkage, and these advantages persist across both short and long evaluation horizons despite the model's training focus is short-term. Furthermore, although the model is trained end-to-end to produce an unconstrained minimum-variance portfolio, we show that its learned covariance representation can be used in general optimizers under long-only constraints with virtually no loss in its performance advantage over competing estimators. These advantages persist when the strategy is executed under a highly realistic implementation framework that models market orders at the auctions, empirical slippage, exchange fees, and financing charges for leverage, and they remain stable during episodes of acute market stress.

URL PDF HTML ☆

赞 0 踩 0

2507.00451 2026-04-22 cs.LG cs.AI cs.DS cs.IT math.IT stat.ML

Best Agent Identification for General Game Playing

Matthew Stephenson, Alex Newcombe, Eric Piette, Dennis Soemers

2506.18186 2026-04-22 cs.LG stat.ML

Online Learning of Whittle Indices for Restless Bandits with Non-Stationary Transition Kernels

Md Kamran Chowdhury Shisher, Vishrant Tripathi, Mung Chiang, Christopher G. Brinton

2506.02524 2026-04-22 stat.ME stat.AP

Variable Selection in Functional Linear Cox Model

Yuanzhen Yue, Stella Self, Yichao Wu, Jiajia Zhang, Rahul Ghosal

2505.22811 2026-04-22 stat.ML cs.LG

Highly Efficient and Effective LLMs with Multi-Boolean Architectures

Ba-Hien Tran, Van Minh Nguyen

Comments ICLR 2026 (Main Conference)

2505.09803 2026-04-22 stat.ML cs.LG

LatticeVision: Image to Image Networks for Modeling Non-Stationary Spatial Data

Antony Sikorski, Michael Ivanitskiy, Nathan Lenssen, Douglas Nychka, Daniel McKenzie

Comments This work has been accepted at the 29th International Conference on Artificial Intelligence and Statistics (AISTATS)

2502.16156 2026-04-22 stat.ML cs.LG

A Review of Causal Decision Making

Lin Ge, Hengrui Cai, Runzhe Wan, Yang Xu, Rui Song

2502.14479 2026-04-22 q-fin.RM q-fin.ST stat.AP

Modelling the term-structure of default risk under IFRS 9 within a multistate regression framework

Arno Botha, Tanja Verster, Roland Breedt

Comments 37 pages, 10013 words, 10 figures

2502.12141 2026-04-22 econ.GN q-fin.EC stat.ME

Potato Potahto in the FAO-GAEZ Productivity Measures? Nonclassical Measurement Error with Multiple Proxies

Rafael Araujo, Vitor Possebom

2502.10605 2026-04-22 stat.ML cs.CY cs.LG econ.EM stat.ME

Batch-Adaptive Causal Annotations

Ezinne Nwankwo, Lauri Goldkind, Angela Zhou

2502.08461 2026-04-22 math.ST stat.AP stat.ME stat.TH

On the Dirichlet-kernel Gasser--Müller estimator and its competitors for fixed design regression on the simplex

Hanen Daayeb, Christian Genest, Salah Khardani, Nicolas Klutchnikoff, Frédéric Ouimet

Comments 18 pages, 2 figures, 1 table

2501.19311 2026-04-22 stat.ME

The Case for Time in Causal DAGs

Alexander G. Reisach, Alberto Suárez, Sebastian Weichwald, Antoine Chambaz

2501.14974 2026-04-22 math.ST cs.CR math.PR stat.ME stat.ML stat.TH

Private Minimum Hellinger Distance Estimation via Hellinger Distance Differential Privacy

Fengnan Deng, Anand N. Vidyashankar

2412.01763 2026-04-22 math.OC cs.LG stat.ML

The Data-Driven Censored Newsvendor Problem

Chamsi Hssaine, Sean R. Sinclair

Comments 85 pages, 11 tables, 11 figures

2407.21651 2026-04-22 math.PR stat.AP

On minimal predictable intensity of point processes

Haoming Wang

Comments Separate into two papers, the first entitled "On minimal predictable intensity of point processes" to appear in Houston Journal of Mathematics, the second arXiv:2509.06016

2402.05384 2026-04-22 stat.ME

Efficient Nonparametric Inference for Mediation Analysis with Nonignorable Missing Confounders

Jiawei Shan, Wei Li, Chunrong Ai

2310.06902 2026-04-22 math.ST stat.TH

On robustness of Spectral Rényi divergence

Tetsuya Takabatake, Keisuke Yano