arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2602.20153 2026-02-24 stat.ML cs.LG stat.ME

JUCAL: Jointly Calibrating Aleatoric and Epistemic Uncertainty in Classification Tasks

Jakob Heiss, Sören Lambrecht, Jakob Weissteiner, Hanna Wutte, Žan Žurič, Josef Teichmann, Bin Yu

Comments 11 pages + appendix. Preliminary version of an ongoing project that will be expanded with furhter evaluations

详情

英文摘要

We study post-calibration uncertainty for trained ensembles of classifiers. Specifically, we consider both aleatoric (label noise) and epistemic (model) uncertainty. Among the most popular and widely used calibration methods in classification are temperature scaling (i.e., pool-then-calibrate) and conformal methods. However, the main shortcoming of these calibration methods is that they do not balance the proportion of aleatoric and epistemic uncertainty. Not balancing these uncertainties can severely misrepresent predictive uncertainty, leading to overconfident predictions in some input regions while being underconfident in others. To address this shortcoming, we present a simple but powerful calibration algorithm Joint Uncertainty Calibration (JUCAL) that jointly calibrates aleatoric and epistemic uncertainty. JUCAL jointly calibrates two constants to weight and scale epistemic and aleatoric uncertainties by optimizing the negative log-likelihood (NLL) on the validation/calibration dataset. JUCAL can be applied to any trained ensemble of classifiers (e.g., transformers, CNNs, or tree-based methods), with minimal computational overhead, without requiring access to the models' internal parameters. We experimentally evaluate JUCAL on various text classification tasks, for ensembles of varying sizes and with different ensembling strategies. Our experiments show that JUCAL significantly outperforms SOTA calibration methods across all considered classification tasks, reducing NLL and predictive set size by up to 15% and 20%, respectively. Interestingly, even applying JUCAL to an ensemble of size 5 can outperform temperature-scaled ensembles of size up to 50 in terms of NLL and predictive set size, resulting in up to 10 times smaller inference costs. Thus, we propose JUCAL as a new go-to method for calibrating ensembles in classification.

URL PDF HTML ☆

赞 0 踩 0

2602.20152 2026-02-24 cs.LG cs.AI stat.ML

Behavior Learning (BL): Learning Hierarchical Optimization Structures from Data

Zhenyao Ma, Yue Liang, Dongxu Li

Comments ICLR 2026

2602.20151 2026-02-24 stat.ME cs.LG math.ST stat.ML stat.TH

Conformal Risk Control for Non-Monotonic Losses

Anastasios N. Angelopoulos

2602.20126 2026-02-24 cs.LG cs.IT math.IT math.ST stat.ML stat.TH

Adaptation to Intrinsic Dependence in Diffusion Language Models

Yunxiao Zhao, Changxiao Cai

2602.20118 2026-02-24 stat.ME

Improving the Power of Bonferroni Adjustments under Joint Normality and Exchangeability

Caleb Hiltunen, Yeonwoo Rho

2602.20115 2026-02-24 math.ST econ.EM stat.ME stat.TH

Compound decisions and empirical Bayes via Bayesian nonparametrics

Nikolaos Ignatiadis, Sid Kankanala

Comments 34 pages

2602.20071 2026-02-24 math.ST stat.AP stat.ME stat.TH

Estimators of different delta coefficients based on the unbiased estimator of the expected proportions of agreements

A. Martín Andrés, M. Álvarez Hernández

2602.20062 2026-02-24 cs.LG stat.ML

A Theory of How Pretraining Shapes Inductive Bias in Fine-Tuning

Nicolas Anguita, Francesco Locatello, Andrew M. Saxe, Marco Mondelli, Flavia Mancini, Samuel Lippl, Clementine Domine

2602.20029 2026-02-24 stat.ME

Covariance estimation for derivatives of functional data using an additive penalty in P-splines

Yueyun Zhu, Steven Golovkine, Norma Bargary, Andrew J. Simpkin

2602.19610 2026-02-24 cs.LG stat.CO stat.ME stat.ML

Variational Inference for Bayesian MIDAS Regression

Luigi Simeone

Comments 27 pages, 11 figures

2510.03817 2026-02-24 cs.LG stat.ML

TROLL: Trust Regions improve Reinforcement Learning for Large Language Models

Philipp Becker, Niklas Freymuth, Serge Thilges, Fabian Otto, Gerhard Neumann

Comments Published as a conference paper at ICLR 2026

2506.22975 2026-02-24 math.ST stat.TH

On the Study of Weighted Fractional Cumulative Residual Inaccuracy and its Dynamical Version with Applications

Aman Pandey, Chanchal Kundu

2505.24506 2026-02-24 stat.AP

Enhancing the Accuracy of Spatio-Temporal Models for Wind Speed Prediction by Incorporating Bias-Corrected Crowdsourced Data

Eamonn Organ, Maeve Upton, Denis Allard, Lionel Benoit, James Sweeney

Journal ref Environmetrics 37(2), e70069 (2026)

2501.10471 2026-02-24 cs.LG q-bio.QM stat.ML

VillageNet: Graph-based, Easily-interpretable, Unsupervised Clustering for Broad Biomedical Applications

Aditya Ballal, Gregory A. DePaul, Esha Datta, Asuka Hatano, Erik Carlsson, Ye Chen-Izu, Javier E. López, Leighton T. Izu

Comments Software available at https://villagenet.streamlit.app/ Github Link: https://github.com/lordareicgnon/VillageNet

2501.01129 2026-02-24 stat.AP

Compositional data analysis for modelling and forecasting mortality using the α-transformation

Han Ying Lim, Dharini Pathmanathan, Sophie Dabo-Niang

Comments 15 pages, 3 tables, 4 figures

2412.02094 2026-02-24 cs.LG cs.CY stat.AP

Crash Severity Risk Modeling Strategies under Data Imbalance

Abdullah Al Mamun, Abyad Enan, Debbie A. Indah, Judith Mwakalonge, Gurcan Comert, Mashrur Chowdhury

Comments This second revised version has been resubmitted to the Transportation Research Record: Journal of the Transportation Research Board after addressing the reviewers' comments and is currently awaiting the final decision

Journal ref Transportation Research Record (2025)

详情

DOI: 10.1177/03611981251376371

英文摘要

This study investigates crash severity risk modeling strategies for work zones involving large vehicles (i.e., trucks, buses, and vans) under crash data imbalance between low-severity (LS) and high-severity (HS) crashes. We utilized crash data involving large vehicles in South Carolina work zones from 2014 to 2018, which included four times more LS crashes than HS crashes. The objective of this study is to evaluate the crash severity prediction performance of various statistical, machine learning, and deep learning models under different feature selection and data balancing techniques. Findings highlight a disparity in LS and HS predictions, with lower accuracy for HS crashes due to class imbalance and feature overlap. Discriminative Mutual Information (DMI) yields the most effective feature set for predicting HS crashes without requiring data balancing, particularly when paired with gradient boosting models and deep neural networks such as CatBoost, NeuralNetTorch, XGBoost, and LightGBM. Data balancing techniques such as NearMiss-1 maximize HS recall when combined with DMI-selected features and certain models such as LightGBM, making them well-suited for HS crash prediction. Conversely, RandomUnderSampler, HS Class Weighting, and RandomOverSampler achieve more balanced performance, which is defined as an equitable trade-off between LS and HS metrics, especially when applied to NeuralNetTorch, NeuralNetFastAI, CatBoost, LightGBM, and Bayesian Mixed Logit (BML) using merged feature sets or models without feature selection. The insights from this study offer safety analysts guidance on selecting models, feature selection, and data balancing techniques aligned with specific safety goals, providing a robust foundation for enhancing work-zone crash severity prediction.

URL PDF HTML ☆

赞 0 踩 0

2407.16024 2026-02-24 stat.ME

Generalized dynamic functional principal component analysis

Tzung Hsuen Khoo, Issa-Mbenard Dabo, Dharini Pathmanathan, Sophie Dabo-Niang

2406.07210 2026-02-24 econ.GN physics.soc-ph q-fin.EC stat.AP

The green hydrogen ambition and implementation gap

Adrian Odenweller, Falko Ueckerdt

Journal ref Nat Energy 10, 110-123 (2025)

2405.09797 2026-02-24 stat.ME stat.ML stat.OT

Extrapolating Single-Treatment Effects Out of Factorial Experiments

Guilherme Duarte

2212.00795 2026-02-24 stat.ME

Causal Selection of Covariates in Regression Calibration for Mismeasured Continuous Exposure

Wenze Tang, Donna Spiegelman, Xiaomei Liao, Molin Wang

Comments 11 pages, 3 figures

1705.10494 2026-02-24 stat.ML cs.LG

Joint auto-encoders: a flexible multi-task learning framework

Baruch Epstein, Ron Meir, Tomer Michaeli

2602.19954 2026-02-24 stat.AP

A Two-Step Spatio-Temporal Framework for Turbine-Height Wind Estimation at Unmonitored Sites from Sparse Meteorological Data

Eamonn Organ, Maeve Upton, Denis Allard, Lionel Benoit, James Sweeney

2602.19952 2026-02-24 stat.AP stat.ME stat.ML

A Bayesian Framework for Post-disruption Travel Time Prediction in Metro Networks

Shayan Nazemi, Aurélie Labbe, Stefan Steiner, Pratheepa Jeganathan, Martin Trépanier, Léo R. Belzile

2602.19922 2026-02-24 stat.ME

Transfer Learning with Network Embeddings under Structured Missingness

Mengyan Li, Xiaoou Li, Kenneth D Mandl, Tianxi Cai

2602.19903 2026-02-24 eess.SP cs.LG stat.ML

Rethinking Chronological Causal Discovery with Signal Processing

Kurt Butler, Damian Machlanski, Panagiotis Dimitrakopoulos, Sotirios A. Tsaftaris

Comments 5 pages, 5 figures, Final version accepted to the 59th Asilomar Conference on Signals, Systems, and Computers (2025)

2602.19893 2026-02-24 cs.LG stat.ML

Generalized Random Direction Newton Algorithms for Stochastic Optimization

Soumen Pachal, Prashanth L. A., Shalabh Bhatnagar, Avinash Achar

2602.19859 2026-02-24 stat.ML cs.LG

Dirichlet Scale Mixture Priors for Bayesian Neural Networks

August Arnstad, Leiv Rønneberg, Geir Storvik

Comments 24 pages, 20 figures

2602.19851 2026-02-24 stat.ME cs.LG

Orthogonal Uplift Learning with Permutation-Invariant Representations for Combinatorial Treatments

Xinyan Su, Jiacan Gao, Mingyuan Ma, Xiao Xu, Xinrui Wan, Tianqi Gu, Enyun Yu, Jiecheng Guo, Zhiheng Zhang

2602.19839 2026-02-24 math.ST stat.TH

Addressing parity blindness of data-driven Sobolev tests on the hypersphere

Marcio Reverbel

Comments 6 pages, 1 figure, submitted to Statistics & Probability Letters

2602.19838 2026-02-24 stat.ME math.ST stat.TH

Optimality of the Half-Order Exponent in the Turing-Good Identities for Bayes Factors

Kensuke Okada

2602.19803 2026-02-24 math.ST cs.IT math.IT stat.TH

From Asymptotic to Finite-Sample Minimax Robust Hypothesis Testing

Gökhan Gül

Comments 40 pages, 6 figures. Submitted to IEEE Transactions on Information Theory

2602.19785 2026-02-24 cs.LG cs.NE stat.ML

Unsupervised Anomaly Detection in NSL-KDD Using $β$-VAE: A Latent Space and Reconstruction Error Approach

Dylan Baptiste, Ramla Saddem, Alexandre Philippot, François Foyer

Journal ref 2025 15th France-Japan \& 13th Europe-Asia Congress on Mechatronics (MECATRONICS) / 23rd International Conference on Research and Education in Mechatronics (REM), Dec 2025, Saint-Ouen-sur-Seine, France. pp.1-6

2602.19761 2026-02-24 stat.ML cs.LG stat.AP

Ensemble Machine Learning and Statistical Procedures for Dynamic Predictions of Time-to-Event Outcomes

Nina van Gerwen, Sten Willemsen, Bettina E. Hansen, Christophe Corpechot, Marco Carbone, Cynthia Levy, Maria-Carlota Londõno, Atsushi Tanaka, Palak Trivedi, Alejandra Villamil, Gideon Hirschfield, Dimitris Rizopoulos

2602.19740 2026-02-24 econ.EM stat.AP

Volatility Spillovers in China's Real Estate Crisis: A Network Approach

Julia Manso

详情

英文摘要

Sentiment towards the Chinese real estate sector has deteriorated following the introduction of financing constraints in 2020 with the ''three red lines." Forcing developers to restructure their debt, the policy triggered a cascade of financing troubles, defaults, and reduced housing demand, ultimately culminating in a prolonged real estate crisis. This paper utilizes a network approach in line with Demirer et al. (2018) and Diebold and Yilmaz (2014) to measure daily time-varying connectedness in the stock return volatilities of major Chinese real estate developers throughout the crisis. Focusing on spillover between companies as reflected by market perception, this paper examines how connectedness evolves over time across firms with different regional exposures and state-ownership statuses, filling a gap in the literature to elucidate where property demand and real estate firm trustworthiness have deteriorated most. An event-study analysis of four key moments of the crisis outlines distinct phases of market sentiment: with the introduction of the three red lines, connectedness primarily reflects shared exposure and a uniform shock to the market. Then, the early unrest surrounding Evergrande exposes strong regional differentiation, with firms concentrated in less developed regions receiving significant spillover. By one year into the crisis, previously stable regions receive higher levels of spillover, and there is evidence of a substitution effect towards private developers. Two years into the crisis, the market has much less homogeneity in effects across regions and state-ownership status: major shocks induce minimal network changes, reflecting how investors have already priced in their beliefs. This paper also offers one of the most extensive timelines of the Chinese real estate crisis to date, and a new R package, GephiForR, was created for the network visualization in this paper.

URL PDF HTML ☆

赞 0 踩 0

2602.19738 2026-02-24 stat.ME

Individualized Causal Effects under Network Interference with Combinatorial Treatments

Yunping Lu, Haoang Chi, Qirui Hu, Zhiheng Zhang

2602.19709 2026-02-24 math.ST stat.TH

On Expectation Propagation and the Probabilistic Editor in some simple mixture problems

Nils Lid Hjort, Mike Titterington

Comments 22 pages, 0 figures; Mike Titterington passed away in 2023, at the age of 77; this is the October 2010 version of a paper we collaborated on then and (still) planned to extend before submitting to a journal

2602.19663 2026-02-24 q-fin.RM stat.CO

The impact of class imbalance in logistic regression models for low-default portfolios in credit risk

Willem D. Schutte, Charl Pretorius, Neill Smit, Leandra van der Merwe, Robert Maxwell

Comments 24 pages, 9 figures

2602.19648 2026-02-24 stat.ME

Local depth-based classification of directional data

Giuseppe Gismondi, Rebecca Rivieccio, Giuseppe Pandolfo

2602.19634 2026-02-24 cs.LG cs.AI stat.ML

Compositional Planning with Jumpy World Models

Jesse Farebrother, Matteo Pirotta, Andrea Tirinzoni, Marc G. Bellemare, Alessandro Lazaric, Ahmed Touati

2602.19600 2026-02-24 stat.ML cs.LG

Manifold-Aligned Generative Transport

Xinyu Tian, Xiaotong Shen

Comments 64 pages, 5 figures

2602.19590 2026-02-24 q-fin.TR cs.CE q-fin.ST stat.CO

Metaorder modelling and identification from public data

Ezra Goliath, Tim Gebbie

Comments 12 pages, 6 figures

2602.19578 2026-02-24 stat.ML cs.LG

Goal-Oriented Influence-Maximizing Data Acquisition for Learning and Optimization

Weichi Yao, Bianca Dumitrascu, Bryan R. Goldsmith, Yixin Wang

2602.19528 2026-02-24 cs.LG stat.ML

Beyond Accuracy: A Unified Random Matrix Theory Diagnostic Framework for Crash Classification Models

Ibne Farabi Shihab, Sanjeda Akter, Anuj Sharma

2602.19520 2026-02-24 stat.AP

Decomposing Crowd Wisdom: Domain-Specific Calibration Dynamics in Prediction Markets

Nam Anh Le

2602.19513 2026-02-24 stat.AP stat.ML

Real-time Win Probability and Latent Player Ability via STATS X in Team Sports

Yasutaka Shimizu, Atsushi Yamanobe

2602.19510 2026-02-24 cs.LG math.OC stat.ML

Less is More: Convergence Benefits of Fewer Data Weight Updates over Longer Horizon

Rudrajit Das, Neel Patel, Meisam Razaviyayn, Vahab Mirrokni

2602.19486 2026-02-24 eess.SY cs.SY stat.AP stat.ME

A mixed Hinfty-Passivity approach for Leveraging District Heating Systems as Frequency Ancillary Service in Electric Power Systems

Xinyi Yi, Ioannis Lestas

2602.19481 2026-02-24 math.ST stat.TH

A Selection Premium Decomposition for the Expected Maximum of Random Walks

Victor H. de la Pena, Fangyuan Lin, Victor K. de la Pena

2602.19462 2026-02-24 stat.ME math.ST stat.AP stat.TH

Zero Variance Portfolio

Jinyuan Chang, Yi Ding, Zhentao Shi, Bo Zhang

2602.19455 2026-02-24 cs.LG cs.AI cs.CL stat.ML

SenTSR-Bench: Thinking with Injected Knowledge for Time-Series Reasoning

Zelin He, Boran Han, Xiyuan Zhang, Shuai Zhang, Haotian Lin, Qi Zhu, Haoyang Fang, Danielle C. Maddix, Abdul Fatir Ansari, Akash Chandrayan, Abhinav Pradhan, Bernie Wang, Matthew Reimherr

Comments Accepted by the 29th International Conference on Artificial Intelligence and Statistics (AISTATS 2026)

2602.19403 2026-02-24 cs.CL stat.AP

Personalized Prediction of Perceived Message Effectiveness Using Large Language Model Based Digital Twins

Jasmin Han, Janardan Devkota, Joseph Waring, Amanda Luken, Felix Naughton, Roger Vilardaga, Jonathan Bricker, Carl Latkin, Meghan Moran, Yiqun Chen, Johannes Thrul

Comments 31 pages, 5 figures, submitted to Journal of the American Medical Informatics Association (JAMIA). Drs. Chen and Thrul share last authorship

2602.19398 2026-02-24 stat.ME

Variable selection via knockoffs for clustered data

Silvia Bacci, Leonardo Grilli, Carla Rampichini

Comments 11 pages, under submission

2602.19378 2026-02-24 stat.ME

Identification and estimation of the conditional average treatment effect with nonignorable missing covariates, treatment, and outcome

Shuozhi Zuo, Yixin Wang, Fan Yang

2602.19370 2026-02-24 stat.AP stat.ME

Reliability of stochastic capacity estimates

Igor Mikolasek

Comments 9 pages, 3 figures, 3 tables, accepted for TRA 2026 conference

2602.19351 2026-02-24 stat.AP

Network-Level Travel Time Prediction Considering The Effects of Weather and Seasonality

Yufei Ai, Yao Yu, Wenjing Pu, Lu Gao, Yihao Ren

2602.19331 2026-02-24 cs.LG cs.NE stat.ML

Partial Soft-Matching Distance for Neural Representational Comparison with Partial Unit Correspondence

Chaitanya Kapoor, Alex H. Williams, Meenakshi Khosla

2602.19329 2026-02-24 stat.AP cs.LG

Dynamic Elasticity Between Forest Loss and Carbon Emissions: A Subnational Panel Analysis of the United States

Keonvin Park

2602.19295 2026-02-24 q-bio.QM stat.AP stat.ME

Time-Varying Hazard Patterns and Co-Mutation Profiles of KRAS G12C and G12D in Real-World NSCLC

Robert Amevor, Dennis Baidoo, Emmanuel Kubuafor

2602.19290 2026-02-24 stat.ME econ.EM math.ST stat.TH

Distributional Discontinuity Design

Kyle Schindl, Larry Wasserman

2602.19284 2026-02-24 stat.ME

Localized conformal model selection

Yuhao Wang, Tengyao Wang

Comments 8 pages, 1 figure

2602.19263 2026-02-24 stat.AP cs.LG

Prognostics of Multisensor Systems with Unknown and Unlabeled Failure Modes via Bayesian Nonparametric Process Mixtures

Kani Fu, Sanduni S Disanayaka Mudiyanselage, Chunli Dai, Minhee Kim

2602.19239 2026-02-24 stat.ML cs.LG

Attention Deficits in Language Models: Causal Explanations for Procedural Hallucinations

Ahmed Karim, Fatima Sheaib, Zein Khamis, Maggie Chlon, Jad Awada, Leon Chlon

2602.19236 2026-02-24 stat.ME

CoMET: A Compressed Bayesian Mixed-Effects Model for High-Dimensional Tensors

Sreya Sarkar, Kshitij Khare, Sanvesh Srivastava

Comments 50 pages, 11 figures, and 2 tables

2602.19220 2026-02-24 stat.ME

A likelihood approach to proper analysis of secondary outcomes in matched case-control studies

Shanshan Liu, Guoqing Diao

2602.19216 2026-02-24 stat.ME

Statistical Measures for Explainable Aspect-Based Sentiment Analysis: A Case Study on Environmental Discourse in Reddit

Luisa Stracqualursi, Patrizia Agati

Comments Preprint of an article accepted for publication in Statistics (Taylor & Francis). 14 pages, 2 figures, 4 tables

2602.19143 2026-02-24 cs.LG math.OC stat.ML

Incremental Learning of Sparse Attention Patterns in Transformers

Oğuz Kaan Yüksel, Rodrigo Alvarez Lucendo, Nicolas Flammarion

Comments 36 pages, 19 figures

2602.19129 2026-02-24 stat.ME

Estimation and Statistical Inference for Generalized Multilayer Latent Space Model

Zhaozhe Liu, Gongjun Xu, Haoran Zhang

2602.17960 2026-02-24 math.PR stat.AP stat.ML

Anisotropic local law for non-separable sample covariance matrices

Zhou Fan, Renyuan Ma, Elliot Paquette, Zhichao Wang

2602.14934 2026-02-24 stat.ML cs.LG

Activation-Space Uncertainty Quantification for Pretrained Networks

Richard Bergna, Stefan Depeweg, Sergio Calvo-Ordoñez, Jonathan Plenk, Alvaro Cartea, Jose Miguel Hernández-Lobato

2602.14440 2026-02-24 stat.ME cs.LG stat.ML

CAIRO: Decoupling Order from Scale in Regression

Harri Vanhems, Yue Zhao, Peng Shi, Archer Y. Yang

2602.14208 2026-02-24 cs.LG math.OC stat.ML

Fast Catch-Up, Late Switching: Optimal Batch Size Scheduling via Functional Scaling Laws

Jinbo Wang, Binghui Li, Zhanpeng Zhou, Mingze Wang, Yuxuan Sun, Jiaqi Zhang, Xunliang Cai, Lei Wu

Comments 34 pages, accepted by ICLR 2026 as a conference paper

2601.13851 2026-02-24 cs.LG stat.ML

Inverting Self-Organizing Maps: A Unified Activation-Based Framework

Alessandro Londei, Matteo Benati, Denise Lanzieri, Vittorio Loreto

2512.04861 2026-02-24 math.ST stat.ML stat.TH

Concentration bounds for intrinsic dimension estimation using Gaussian kernels

Martin Andersson

Comments 24 pages, 8 figures

2511.07588 2026-02-24 stat.ME

Weighted Asymptotically Optimal Sequential Testing

Soumyabrata Bose, Jay Bartroff

2511.07270 2026-02-24 math.ST cs.IT cs.LG math.IT math.PR stat.ML stat.TH

High-Dimensional Asymptotics of Differentially Private PCA

Youngjoo Yun, Rishabh Dudeja

2511.00958 2026-02-24 cs.LG cs.AI stat.ML

The Hidden Power of Normalization Layers in Neural Networks: Exponential Capacity Control

Khoat Than

2510.21491 2026-02-24 cs.LG cs.DC stat.ML

Benchmarking Catastrophic Forgetting Mitigation Methods in Federated Time Series Forecasting

Khaled Hallak, Oudom Kem

Comments Accepted for presentation at the FLTA 2025 Conference on Federated Learning. This version corresponds to the camera-ready author manuscript

2510.16703 2026-02-24 cs.LG cs.AI stat.ME

On the Granularity of Causal Effect Identifiability

Yizuo Chen, Adnan Darwiche

2510.11853 2026-02-24 stat.ME math.ST stat.TH

A Martingale Kernel Two-Sample Test

Anirban Chatterjee, Aaditya Ramdas

Comments Accepted for publication in the proceedings of The 37th International Conference on Algorithmic Learning Theory

2510.03734 2026-02-24 cs.LG cs.AI cs.CY stat.ML

Cost Efficient Fairness Audit Under Partial Feedback

Nirjhar Das, Mohit Sharma, Praharsh Nanavati, Kirankumar Shiragur, Amit Deshpande

Comments Accepted at NeurIPS 2025 RegML Workshop; Reliable ML Workshop

2509.12666 2026-02-24 stat.ML cs.LG cs.NA math.NA

PBPK-iPINNs: Inverse Physics-Informed Neural Networks for Physiologically Based Pharmacokinetic Brain Models

Charuka D. Wickramasinghe, Krishanthi C. Weerasinghe, Pradeep K. Ranaweera, Nelum S. S. M. Hapuhinna

Comments 28 pages, 12 figures

2508.13668 2026-02-24 physics.bio-ph stat.CO

Perspective: An outlook on fluorescence tracking

Lance W. Q. Xu, Steve Pressé

2507.11891 2026-02-24 stat.ML cs.LG math.ST stat.TH

Choosing the Better Bandit Algorithm under Data Sharing: When Do A/B Experiments Work?

Shuangning Li, Chonghuan Wang, Jingyan Wang

2507.11768 2026-02-24 stat.ML cs.LG

LLMs are Bayesian, In Expectation, Not in Realization

Leon Chlon, Zein Khamis, Maggie Chlon, Mahdi El Zein, MarcAntonio M. Awada

2506.22740 2026-02-24 cs.AI stat.ML

Explanations are a Means to an End: Decision Theoretic Explanation Evaluation

Ziyang Guo, Berk Ustun, Jessica Hullman

2506.18630 2026-02-24 stat.ML cs.LG eess.SP

Trustworthy Prediction with Gaussian Process Knowledge Scores

Kurt Butler, Guanchao Feng, Tong Chen, Petar Djuric

Comments 6 pages, 5 figures, to be published in the Proceedings of the European Signal Processing Conference (EUSIPCO)

2506.18215 2026-02-24 math.ST stat.TH

Estimating quantile treatments without strict overlap

Marco Avella-Medina, Richard Davis, Gennady Samorodnitsky

2506.10572 2026-02-24 stat.ML cs.LG

Probability Bounding: Post-Hoc Calibration via Box-Constrained Softmax

Kyohei Atarashi, Satoshi Oyama, Hiromi Arai, Hisashi Kashima

Comments 46 pages, 4 figures

2506.00486 2026-02-24 cs.LG cs.AI stat.ML

It Takes a Good Model to Train a Good Model: Generalized Gaussian Priors for Optimized LLMs

Jun Wu, Patrick Huang, Jiangtao Wen, Yuxing Han

2505.21417 2026-02-24 stat.ME stat.CO

Model averaging with mixed criteria for estimating high quantiles of extreme values: Application to heavy rainfall

Yonggwan Shin, Yire Shin, Jeong-Soo Park

Journal ref Shin, Y., Shin, Y. & Park, JS. Model averaging with mixed criteria for estimating high quantiles of extreme values: application to heavy rainfall. Stoch Environ Res Risk Assess 40(2), 47 (2026)

2504.19375 2026-02-24 cs.LG cs.SY eess.SY math.OC stat.ML

$O(1/k)$ Finite-Time Bound for Non-Linear Two-Time-Scale Stochastic Approximation

Siddharth Chandak

Comments Submitted to IEEE Transactions on Automatic Control

2504.16100 2026-02-24 eess.SP cs.AI cs.LG stat.ML

Towards Accurate Forecasting of Renewable Energy : Building Datasets and Benchmarking Machine Learning Models for Solar and Wind Power in France

Eloi Lindas, Yannig Goude, Philippe Ciais

Comments 24 pages, 4 tables, 18 figures

Journal ref Environmental Data Science , Volume 4 , 2025 , e45

详情

DOI: 10.1017/eds.2025.10021

英文摘要

Accurate prediction of non-dispatchable renewable energy sources is essential for grid stability and price prediction. Regional power supply forecasts are usually indirect through a bottom-up approach of plant-level forecasts, incorporate lagged power values, and do not use the potential of spatially resolved data. This study presents a comprehensive methodology for predicting solar and wind power production at country scale in France using machine learning models trained with spatially explicit weather data combined with spatial information about production sites capacity. A dataset is built spanning from 2012 to 2023, using daily power production data from RTE (the national grid operator) as the target variable, with daily weather data from ERA5, production sites capacity and location, and electricity prices as input features. Three modeling approaches are explored to handle spatially resolved weather data: spatial averaging over the country, dimension reduction through principal component analysis, and a computer vision architecture to exploit complex spatial relationships. The study benchmarks state-of-the-art machine learning models as well as hyperparameter tuning approaches based on cross-validation methods on daily power production data. Results indicate that cross-validation tailored to time series is best suited to reach low error. We found that neural networks tend to outperform traditional tree-based models, which face challenges in extrapolation due to the increasing renewable capacity over time. Model performance ranges from 4% to 10% in nRMSE for midterm horizon, achieving similar error metrics to local models established at a single-plant level, highlighting the potential of these methods for regional power supply forecasting.

URL PDF HTML ☆

赞 0 踩 0

2503.09287 2026-02-24 econ.EM stat.AP

On the Wisdom of Crowds (of Economists)

Francis X. Diebold, Aaron Mora, Minchul Shin

2502.09257 2026-02-24 cs.LG cs.AI stat.ML

From Contextual Combinatorial Semi-Bandits to Bandit List Classification: Improved Sample Complexity with Sparse Rewards

Liad Erez, Tomer Koren

2501.10117 2026-02-24 econ.EM stat.ME

Prediction Sets and Conformal Inference with Interval Outcomes

Weiguang Liu, Áureo de Paula, Elie Tamer

2412.15520 2026-02-24 stat.ME

Logistic Regression Model for Differentially-Private Matrix Masked Data

Linh H Nghiem, Aidong A. Ding, Samuel Wu

2411.02770 2026-02-24 cs.LG math.PR stat.CO stat.ML

A spectral mixture representation of isotropic kernels with application to random Fourier features

Nicolas Langrené, Xavier Warin, Pierre Gruet

Comments 27 pages, 12 figures

2407.06898 2026-02-24 math.OC stat.AP

Who Goes Next? Optimizing the Allocation of Adherence-Improving Interventions

Daniel Otero-Leon, Mariel Lavieri, Brian Denton, Jeremy Sussman, Rodney Hayward

2404.00888 2026-02-24 math.ST stat.TH

Two step estimations via the Dantzig selector for models of stochastic processes with high-dimensional parameters

Kou Fujimori, Koji Tsukuda

Comments 51 pages, 1 figure

Journal ref Stochastic Processes and their Applications Volume 192 (2026), 104809

2402.12122 2026-02-24 math.NA cs.NA math.PR math.ST stat.TH

Almost sure convergence rates of adaptive increasingly rare Markov chain Monte Carlo

Julian Hofstadler, Krzysztof Latuszynski, Gareth O. Roberts, Daniel Rudolf

2401.05812 2026-02-24 stat.CO

A Tidy Framework and Infrastructure to Systematically Assemble Spatio-temporal Indexes from Multivariate Data

H. Sherry Zhang, Dianne Cook, Ursula Laa, Nicolas Langrené, Patricia Menéndez

Journal ref Journal of Computational and Graphical Statistics 34(2) 642-653 (2025)

2401.02953 2026-02-24 stat.ME

Linked factor analysis

Giuseppe Vinci

Comments 42 page, 7 figures

2312.03274 2026-02-24 stat.ME stat.ML

Empirical Bayes Covariance Decomposition, and a Solution to the Multiple Tuning Problem in Sparse PCA

Joonsuk Kang, Matthew Stephens

2301.00201 2026-02-24 stat.ML cs.LG math.DG

Exploring Singularities in point clouds with the graph Laplacian: An explicit approach

Martin Andersson, Benny Avelin

Comments 28 pages, 12 figures

Journal ref Journal of Computational Mathematics and Data Science 14 (2025) 100113

2210.01844 2026-02-24 math.OC math.ST q-fin.MF stat.TH

A quickest detection problem with false negatives

Tiziano De Angelis, Jhanvi Garg, Quan Zhou

Comments 35 pages, 4 figures

2205.00259 2026-02-24 stat.CO stat.ME

cubble: An R Package for Organizing and Wrangling Multivariate Spatio-temporal Data

H. Sherry Zhang, Dianne Cook, Ursula Laa, Nicolas Langrené, Patricia Menéndez

Journal ref Journal of Statistical Software 110(7) 1-27 (2024)

2107.05956 2026-02-24 stat.CO stat.ME

IID Sampling from Intractable Distributions

Sourabh Bhattacharya

Comments This updated version will appear in Sankhya A's special issue paying tribute to Professor C. R. Rao

2103.01280 2026-02-24 econ.EM math.ST stat.ME stat.ML stat.TH

Dynamic covariate balancing: estimating treatment effects over time with potential local projections

Davide Viviano, Jelena Bradic

2008.11175 2026-02-24 stat.AP stat.ME

How Ominous is the Premonition of Future Global Warming?

Debashis Chatterjee, Sourabh Bhattacharya

Comments This updated version will appear in Sankhya B's special issue paying tribute to Professor C. R. Rao

2602.19012 2026-02-24 stat.ME stat.AP

Adaptive Weighting for Time-to-Event Continual Reassessment Method: Improving Safety in Phase I Dose-Finding Through Data-Driven Delay Distribution Estimation

Robert Amevor, Emmanuel Kubuafor, Dennis Baidoo

2602.18988 2026-02-24 stat.ME stat.AP

Latent Moment Models for Recurrent Binary Outcomes: A Bayesian and Quasi-Distributional Approach

Niloofar Ramezani, Lori P. Selby, Pascal Nitiema, Jeffrey R. Wilson

Comments 16 pages, 1 figure, 4 tables, 1 Supplementary Table

2602.18948 2026-02-24 cs.LG cs.NE hep-th stat.ML

Toward Manifest Relationality in Transformers via Symmetry Reduction

J. François, L. Ravera

Comments 12 pages

2602.18870 2026-02-24 stat.ML cs.LG

Federated Measurement of Demographic Disparities from Quantile Sketches

Arthur Charpentier, Agathe Fernandes Machado, Olivier Côté, François Hu

2602.18865 2026-02-24 stat.ME

Expected Shortfall Regression via Optimization

Yuanzhi Li, Shushu Zhang, Xuming He

Comments Yuanzhi Li and Shushu Zhang contributed equally to this work

2602.18808 2026-02-24 math.PR stat.ME

Orthogonal polynomials on path-space

Ilya Chevyrev, Emilio Ferrucci, Darrick Lee, Terry Lyons, Harald Oberhauser, Nikolas Tapia

Comments 38 pages, 4 figures

2602.18795 2026-02-24 cs.LG stat.ML

Vectorized Bayesian Inference for Latent Dirichlet-Tree Allocation

Zheng Wang, Nizar Bouguila

Comments Submitted to JMLR, under review

2602.18762 2026-02-24 stat.ML cs.LG

Bounds and Identification of Joint Probabilities of Potential Outcomes and Observed Variables under Monotonicity Assumptions

Naoya Hashimoto, Yuta Kawakami, Jin Tian

2602.18727 2026-02-24 stat.AP q-bio.QM

Statistical methods for reference-free single-molecule localisation microscopy

Jack Peyton, Benjamin Davis, Emily Gribbin, Daniel Rolfe, Hannah Mitchell

2602.18677 2026-02-24 stat.ME stat.AP

Bayesian calendar-time survival analysis with epidemic curve priors and variant-specific infection hazards

Angela M Dahl, Elizabeth R Brown

Comments 24 pages, 6 figures

2602.18660 2026-02-24 stat.ME cs.HC

Better Assumptions, Stronger Conclusions: The Case for Ordinal Regression in HCI

Brandon Victor Syiem, Eduardo Velloso

Comments 21 pages, 16 figures, to be published in the Proceedings of the 2026 ACM CHI Conference on Human Factors in Computing Systems

2602.18656 2026-02-24 stat.ME math.ST stat.TH

Minimally Discrete and Minimally Randomized p-Values

Joshua Habiger, Pratyaydipta Rudra

2602.18651 2026-02-24 stat.ME

Hybrid combinations of parametric and empirical likelihoods

Nils Lid Hjort, Ian W. McKeague, Ingrid Van Keilegom

Comments 24 pages, 4 figures. This is the July 2017 authors' manuscript, with Supplementary Material, with final paper published in Statistica Sinica, 2018, their Peter Hall issue, vol. 28, pages 2389-2407, see pmc.ncbi.nlm.nih.gov/articles/PMC6602551/

Journal ref Statistica Sinica, 2018, vol. 28, pages 2389-2407

2602.18636 2026-02-24 cs.CY stat.AP

Statistical Imaginaries, State Legitimacy: Grappling with the Arrangements Underpinning Quantification in the U.S. Census

Jayshree Sarathy, danah boyd

Journal ref Critical Sociology, 51(6), 1267-1288 (2024)

2602.18573 2026-02-24 stat.ML cs.LG stat.ME

Multiclass Calibration Assessment and Recalibration of Probability Predictions via the Linear Log Odds Calibration Function

Amy Vennos, Xin Xing, Christopher T. Franck

2602.18570 2026-02-24 stat.ME

Spatiotemporal double machine learning to estimate the impact of Cambodian land concessions on deforestation

Anika Arifin, Duncan DeProfio, Layla Lammers, Benjamin Shapiro, Brian J Reich, Henry Uddyback, Joshua M Gray

2602.18518 2026-02-24 cs.LG stat.ME stat.ML

Measuring the Prevalence of Policy Violating Content with ML Assisted Sampling and LLM Labeling

Attila Dobi, Aravindh Manickavasagam, Benjamin Thompson, Xiaohan Yang, Faisal Farooq

Comments 8 pages

2602.18486 2026-02-24 cs.LG eess.SP stat.ML

Support Vector Data Description for Radar Target Detection

Jean Pinsolle, Yadang Alexis Rouzoumka, Chengfang Ren, Chistèle Morisseau, Jean-Philippe Ovarlez

Comments 5 pages, 2 figures, to appear in Acoustics, Speech and Signal Processing (ICASSP), 2026 IEEE International Conference on, Barcelona, Spain, May 2026

2602.18465 2026-02-24 cs.LG stat.ML

Revisiting the Seasonal Trend Decomposition for Enhanced Time Series Forecasting

Sanjeev Panta, Xu Yuan, Li Chen, Nian-Feng Tzeng

Comments 5 pages, accepted at 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2026)

2602.18442 2026-02-24 stat.ME stat.AP

Ostrom-Weighted Bootstrap: A Theoretically Optimal and Provably Complete Framework for Hierarchical Imputation in Multi-Agent Systems

Hirofumi Wakimoto

Comments 7 pages, initial submission

2602.11080 2026-02-24 stat.ME math.ST stat.TH

Constrained Fiducial Inference for Gaussian Models

Hank Flury, Jan Hannig, Richard Smith

2601.18412 2026-02-24 stat.ME

Preference-based Centrality and Ranking in General Metric Spaces

Lingfeng Lyu, Doudou Zhou

2601.17160 2026-02-24 stat.ML cs.AI cs.LG stat.ME

Information-Theoretic Causal Bounds under Unmeasured Confounding

Yonghan Jung, Bogyeong Kang

2601.15500 2026-02-24 stat.ML cs.AI cs.LG math.ST stat.TH

Low-Dimensional Adaptation of Rectified Flow: A Diffusion and Stochastic Localization Perspective

Saptarshi Roy, Alessandro Rinaldo, Purnamrita Sarkar

Comments 32 pages, 7 figures

2601.06830 2026-02-24 stat.ML cs.LG cs.NA math.NA math.OC math.PR

Constrained Density Estimation via Optimal Transport

Yinan Hu, Esteban G. Tabak

2512.20363 2026-02-24 cs.LG cs.AI cs.DC stat.AP stat.ML

Clust-PSI-PFL: A Population Stability Index Approach for Clustered Non-IID Personalized Federated Learning

Daniel M. Jimenez-Gutierrez, Mehrdad Hassanzadeh, David Solans, Mohammed Elbamby, Nicolas Kourtellis, Aris Anagnostopoulos, Ioannis Chatzigiannakis, Andrea Vitaletti

Comments Accepted for publication to the 40th IEEE International Parallel & Distributed Processing Symposium (IPDPS 2026)

2512.20007 2026-02-24 stat.ML cs.LG stat.ME

Semiparametric KSD test: unifying score and distance-based approaches for goodness-of-fit testing

Zhihan Huang, Ziang Niu

2512.13123 2026-02-24 math.OC cs.LG math.ST stat.ML stat.TH

Stopping Rules for Stochastic Gradient Descent via Anytime-Valid Confidence Sequences

Liviu Aolaritei, Michael I. Jordan

2511.05640 2026-02-24 cs.LG cs.GT stat.ML

Blind Inverse Game Theory: Jointly Decoding Rewards and Rationality in Entropy-Regularized Competitive Games

Hamza Virk, Sandro Amaglobeli, Zuhayr Syed

2511.01222 2026-02-24 stat.ME

Perturbed Double Machine Learning: Nonstandard Inference Beyond the Parametric Length

Mengchu Zheng, Matteo Bonvini, Zijian Guo

2510.22792 2026-02-24 math.ST stat.TH

Composite goodness-of-fit test with the Kernel Stein Discrepancy and a bootstrap for degenerate U-statistics with estimated parameters

Florian Brück, Veronika Reimoser, Fabian Baier

2510.22664 2026-02-24 cond-mat.stat-mech cs.IT gr-qc hep-ph math.IT math.ST quant-ph stat.TH

The Gravitational Aspect of Information: The Physical Reality of Asymmetric "Distance"

Tomoi Koide, Armin van de Venn

Comments 9 pages, no figures. Typos corrected and text added

2509.21655 2026-02-24 cs.LG stat.ML

DriftLite: Lightweight Drift Control for Inference-Time Scaling of Diffusion Models

Yinuo Ren, Wenhao Gao, Lexing Ying, Grant M. Rotskoff, Jiequn Han

Comments Published at ICLR 2026 (https://openreview.net/forum?id=l01eG3Qikl)

2509.01924 2026-02-24 stat.ML cs.LG stat.AP stat.ME

Non-Linear Model-Based Sequential Decision-Making in Agriculture

Sakshi Arya, Wentao Lin

2508.14487 2026-02-24 stat.ME stat.CO

Bridge Sampling Diagnostics

Giorgio Micaletto, Aki Vehtari

Comments 19 pages

2508.01457 2026-02-24 physics.ao-ph stat.ML

NICE^k Metrics: Unified and Multidimensional Framework for Evaluating Deterministic Solar Forecasting Accuracy

Cyril Voyant, Milan Despotovic, Luis Garcia-Gutierrez, Rodrigo Amaro e Silva, Philippe Lauret, Ted Soubdhan, Nadjem Bailek

Comments 24 pages, 1 Table, 5 Figures

Journal ref Sustainable Energy Technologies and Assessments (2025), 104588

2507.11732 2026-02-24 cs.LG stat.ML

Graph Neural Networks Powered by Encoder Embedding for Improved Node Learning

Shiyu Chen, Cencheng Shen, Youngser Park, Carey E. Priebe

2507.10373 2026-02-24 math.ST stat.ME stat.TH

Post-reduction inference for confidence sets of models

Heather Battey, Daniel Garcia Rasines, Yanbo Tang

2506.09217 2026-02-24 cs.RO cs.CV stat.AP

Perception Characteristics Distance: Measuring Stability and Robustness of Perception System in Dynamic Conditions under a Certain Decision Rule

Boyu Jiang, Liang Shi, Zhengzhi Lin, Lanxin Xiang, Loren Stowe, Feng Guo

Comments This paper has been accepted to the CVPR 2026 Main Conference

2505.23546 2026-02-24 math.OC stat.ML

Going from a Representative Agent to Counterfactuals in Combinatorial Choice

Yanqiu Ruan, Karthyek Murthy, Karthik Natarajan

Comments 34 pages, 6 figures

2505.19371 2026-02-24 cs.AI cs.LG math.ST stat.TH

Foundations of Top-$k$ Decoding For Language Models

Georgy Noarov, Soham Mallick, Tao Wang, Sunay Joshi, Yan Sun, Yangxinyu Xie, Mengxin Yu, Edgar Dobriban

2505.06595 2026-02-24 stat.ML cs.AI cs.CV cs.LG math.PR

Feature Representation Transferring to Lightweight Models via Perception Coherence

Hai-Vy Nguyen, Fabrice Gamboa, Sixin Zhang, Reda Chhaibi, Serge Gratton, Thierry Giaccone

Comments Published in Transactions on Machine Learning Research (02/2026)

Journal ref Published in Transactions on Machine Learning Research (02/2026)

2503.22933 2026-02-24 stat.ME

Improving Transportability of Regression Calibration Under the Main/External Validation Study Design

Zexiang Li, Donna Spiegelman, Molin Wang, Zuoheng Wang, Xin Zhou

详情

DOI: 10.1093/biomtc/ujag019

英文摘要

In epidemiology, obtaining accurate individual exposure measurements can be costly and challenging. Thus, these measurements are often subject to error. Regression calibration with a validation study is widely employed as a study design and analysis method to correct for measurement error in the main study due to its broad applicability and simple implementation. However, relying on an external validation study to assess the measurement error process carries the risk of introducing bias into the analysis. Specifically, if the parameters of regression calibration model estimated from the external validation study are not transportable to the main study, the subsequent estimated parameter describing the exposure-disease association will be biased. In this work, we improve the regression calibration method for linear regression models using an external validation study. Unlike the original approach, our proposed method ensures that the regression calibration model is transportable by estimating the parameters in the measurement error generating process using the external validation study and obtaining the remaining parameter values in the regression calibration model directly from the main study. This guarantees that parameter values in the regression calibration model will be applicable to the main study. We derived the theoretical properties of our proposed method. The simulation results show that the proposed method effectively reduces bias and maintains nominal confidence interval coverage. We applied this method to data from the Health Professionals Follow-Up Study (main study) and the Men's Lifestyle Validation Study (external validation study) to assess the effects of dietary intake on body weight.

URL PDF HTML ☆

赞 0 踩 0

2503.14381 2026-02-24 stat.ML cs.LG math.ST stat.ME stat.TH

Optimizing High-Dimensional Oblique Splits

Chien-Ming Chi

Comments 91 pages, 13 tables

2503.11842 2026-02-24 cs.LG stat.ML

Test-Time Training Provably Improves Transformers as In-context Learners

Halil Alperen Gozeten, M. Emrullah Ildiz, Xuechen Zhang, Mahdi Soltanolkotabi, Marco Mondelli, Samet Oymak

Comments Accepted at ICML 2025

2502.04591 2026-02-24 cs.LG cs.AI stat.ML

Are We Measuring Oversmoothing in Graph Neural Networks Correctly?

Kaicheng Zhang, Piero Deidda, Desmond Higham, Francesco Tudisco

Comments Accepted into ICLR 2026

2410.19412 2026-02-24 cs.LG cs.AI cs.CE econ.EM stat.CO

Robust Time Series Causal Discovery for Agent-Based Model Validation

Gene Yu, Ce Guo, Wayne Luk

Comments A peer-reviewed version titled "VCDF: A Validated Consensus-Driven Framework for Time Series Causal Discovery" is accepted to Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD) 2026. Please cite the PAKDD version

2410.10251 2026-02-24 math.ST stat.TH

Convergence rates for estimating multivariate scale mixtures of uniform densities

Arlene K. H. Kim, Gil Kur, Adityanand Guntuboyina

Journal ref Electron. J. Statist. 19(2): 3771-3834 (2025)

2410.08958 2026-02-24 stat.ML cs.LG

The MAPS Algorithm: Fast model-agnostic and distribution-free prediction intervals for supervised learning

Daniel Salnikov, Dan Leonte, Kevin Michalewicz

Comments 28 pages, 3 algorithms, 5 figures, 3 tables

2404.12613 2026-02-24 stat.ML cs.LG eess.SP stat.ME

Model Selection and Parameter Estimation of One-Dimensional Gaussian Mixture Models

Xinyu Liu, Hai Zhang

2403.18248 2026-02-24 econ.EM stat.ML

Statistical Inference of Optimal Allocations I: Regularities and their Implications

Kai Feng, Han Hong, Denis Nekipelov

2402.11717 2026-02-24 math.RA math.ST stat.TH

A symmetric function approach to polynomial regression

Hans-Christian Herbig, Daniel Herden, Christopher Seaton

Comments 12 pages, 2 figures

Journal ref Aequationes mathematicae Volume 100, article number 21, (2026)

2402.10758 2026-02-24 stat.ML cs.LG stat.CO

Stochastic Localization via Iterative Posterior Sampling

Louis Grenioux, Maxence Noble, Marylou Gabrié, Alain Oliviero Durmus

Comments Accepted at ICML 2024, improved assumption A0 (and consequences), fixed corollary 11

2308.04825 2026-02-24 stat.ME math.PR

Repelled point processes with application to numerical integration

Diala Hawat, Gabriel Mastrilli, Rémi Bardenet, Raphaël Lachièze-Rey

2306.01485 2026-02-24 cs.LG cs.AI cs.NA math.NA stat.ML

Robust low-rank training via approximate orthonormal constraints

Dayana Savostianova, Emanuele Zangrando, Gianluca Ceruti, Francesco Tudisco

Journal ref Proceedings NeurIPS 2023

2210.04140 2026-02-24 stat.ME

Bayesian Repulsive Mixture Modeling with Matérn Point Processes

Hanxi Sun, Boqian Zhang, Minhyeok Kim, Vinayak Rao

Comments Main doc: 18 pages, 8 figures. Supp: 16 pages, 19 figures. Changes: added author (Minhyeok Kim) and section/results on setting repulsion parameters

2203.14959 2026-02-24 stat.AP physics.ao-ph physics.data-an

Benchmarks for Solar Radiation Time Series Forecasting

Cyril Voyant, Gilles Notton, Jean-Laurent Duchaud, Luis Antonio García Gutiérrez, Jamie M. Bright, Dazhi Yang

Comments 32 pages, 9 Tables and 4 Figures

Journal ref Volume 191, May 2022, Pages 747-762

1408.0705 2026-02-24 stat.ME econ.EM

Using Invalid Instruments on Purpose: Focused Moment Selection and Averaging for GMM

Francis J. DiTraglia

Journal ref Journal of Econometrics, Volume 195, Issue 2, December 2016, Pages 187-208