arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.04369 2026-03-05 math.ST stat.TH

On the singularity of the Fisher Information matrix in the sine-skewed family on the d-dimensional torus

Emily Schutte, Sophia Loizidou, Vincent Laheurte

Comments 8 pages

2603.04347 2026-03-05 math.ST stat.TH

Extreme Geometric Quantiles Under Minimal Assumptions, with a Connection to Tukey Depth

Sibsankar Singha, Marie Kratz, Sreekar Vadlamani

Comments 24 pages, 2 figures

2603.04323 2026-03-05 cs.LG cs.CR cs.DC math.AT stat.ML

PTOPOFL: Privacy-Preserving Personalised Federated Learning via Persistent Homology

Kelly L Vomo-Donfack, Adryel Hoszu, Grégory Ginot, Ian Morilla

Comments 22 pages, 6 Figures

2603.04315 2026-03-05 stat.ME math.ST stat.TH

A spectral inference method for determining the number of communities in networks

Yujia Wu, Xiucai Ding, Jingfei Zhang, Wei Lan, Chih-Ling Tsai

Comments 46 pages. This manuscript presents a significant generalization and resolves several issues in the previous submission, arXiv:2409.05276, which now appears as a special case within the current framework

2603.04306 2026-03-05 stat.CO

Theory Discovery in Social Networks: Automating ERGM Specification with Large Language Models

Yidan Sun, Mayank Kejriwal

2603.04286 2026-03-05 stat.ME

A mixture model for subtype identification in the context of disease progression modeling

Sofia Kaisaridi, Juliette Ortholand, Caglayan Tuna, Hugues Chabriat, Sophie Tezenas du Montcel

2603.04278 2026-03-05 stat.ME

Markov-Based Modelling for Reservoir Management: Assessing Reliability and Resilience

M. L. Gámiz, N. Limnios, D. Montoro-Cazorla, M. C. Segovia-García

Comments 36 pages, 5 figures

2603.04275 2026-03-05 econ.EM q-fin.RM stat.ME stat.ML

Statistical Inference for Score Decompositions

Timo Dimitriadis, Marius Puke

2603.04260 2026-03-05 stat.AP

State-dependent marginal emission factors with autoregressive components

Antonio Panico, Andrew Burlinson, Luigi Grossi

2603.04252 2026-03-05 stat.AP stat.ME

Cluster-Level Experiments using Temporal Switchback Designs: Precision Gains in Pricing A/B Tests at LATAM Airlines

Nicolás Ferrari-Ortiz, Sebastián Orellana-Montini, Timur Abbiasov, Marie Garkavenko, Rutger Lit

2603.04246 2026-03-05 stat.ME

Areal Disaggregation: A Small Area Estimation Perspective

Yunhan Wu, Finn Lindgren, Heidi A. Hanson

2603.04223 2026-03-05 stat.ML cs.LG

Semi-Supervised Generative Learning via Latent Space Distribution Matching

Kwong Yu Chong, Long Feng

2603.04204 2026-03-05 stat.ML cs.CV cs.LG math.ST stat.ME stat.TH

Beyond Mixtures and Products for Ensemble Aggregation: A Likelihood Perspective on Generalized Means

Raphaël Razafindralambo, Rémy Sun, Frédéric Precioso, Damien Garreau, Pierre-Alexandre Mattei

2603.04199 2026-03-05 math.ST cs.CR cs.LG stat.ME stat.TH

Bayesian Adversarial Privacy

Cameron Bell, Timothy Johnston, Antoine Luciano, Christian P Robert

2603.04133 2026-03-05 stat.ML cs.LG

Exploiting Subgradient Sparsity in Max-Plus Neural Networks

Ikhlas Enaieh, Olivier Fercoq

2603.04080 2026-03-05 stat.ME econ.EM

Doubly Robust Estimation of Treatment Effects in Staggered Difference-in-Differences with Time-Varying Covariates

Yuhao Deng, Le Kang

2603.04007 2026-03-05 cs.LG stat.ML

Fixed-Budget Constrained Best Arm Identification in Grouped Bandits

Raunak Mukherjee, Sharayu Moharir

Comments 25 pages, 2 Figures

2603.04003 2026-03-05 stat.ME stat.CO

Efficient Bayesian Estimation of Dynamic Structural Equation Models via State Space Marginalization

Øystein Sørensen

2603.03997 2026-03-05 econ.EM stat.AP stat.CO stat.ME

Bandwidth Selection for Spatial HAC Standard Errors

Alexander Lehner

2603.03987 2026-03-05 stat.ME

Bayesian structured additive quantile regression for inflated bounded data

Francisco F. Queiroz, Johannes Brachem, Paul F. V. Wiemann, Thomas Kneib

2603.03972 2026-03-05 math.PR math.ST stat.TH

A note on outlier eigenvectors for sparse non-Hermitian perturbations

Miltiadis Galanis, Michail Louvaris

Comments 10 pages

2603.03954 2026-03-05 stat.ME

Forecasting of Multiple Seasonal Categorical Time Series Using Fourier Series with Application to AQI Data of Kolkata

Anirban Ghosh, Raju Maiti

2603.03922 2026-03-05 cs.LG stat.ML

Hierarchical Inference and Closure Learning via Adaptive Surrogates for ODEs and PDEs

Pengyu Zhang, Arnaud Vadeboncoeur, Alex Glyn-Davies, Mark Girolami

2603.03845 2026-03-05 math.DS math.PR stat.CO

Steady State Distribution and Stability Analysis of Random Differential Equations with Uncertainties and Superpositions: Application to a Predator Prey Model

Wolfgang Hoegele

2603.03843 2026-03-05 stat.ML cs.LG

Invariance-Based Dynamic Regret Minimization

Margherita Lazzaretto, Jonas Peters, Niklas Pfister

Comments 32 pages, 7 figures

2603.03828 2026-03-05 stat.OT

Philosophical foundations of statistics

Inge G. Helland, Nils Lid Hjort, Gunnar Taraldsen

Comments 7 pages, no figures; Statistical Research Report, Department of Mathematics, University of Oslo, February 2023, but now arXiv'd March 2026. The article has appeared in International Encyclopedia of Statistical Science 2024, pages 1894-1899, Springer, at this url: https://link.springer.com/content/pdf/10.1007/978-3-662-69359-9_471.pdf

2603.03819 2026-03-05 stat.ME stat.ML

Direct Bayesian Additive Regression Trees for Conditional Average Treatment Effects in Regression Discontinuity Designs

Daisuke Kondo, Shonosuke Sugasawa

Comments 25 pages

2603.03816 2026-03-05 stat.ME math.ST stat.AP stat.TH

The projected isotropic normal distribution with applications in neuroscience

Kanti V. Mardia, Antonio Mauricio F. L. Miranda de Sa'

Comments 33 Pages 14Figures

2603.03789 2026-03-05 stat.AP

Enhancing Mortality Forecasting with Ensemble Learning: A Shapley-Based Approach

G. Bimonte, M. Russolillo, Y. Yang, H. L. Shang

Comments 45 pages, 6 figures

2603.03785 2026-03-05 stat.ML cs.LG

Observationally Informed Adaptive Causal Experimental Design

Erdun Gao, Liang Zhang, Jake Fawkes, Aoqi Zuo, Wenqin Liu, Haoxuan Li, Mingming Gong, Dino Sejdinovic

2603.03778 2026-03-05 cs.LG stat.ML

Inverse Contextual Bandits without Rewards: Learning from a Non-Stationary Learner via Suffix Imitation

Yuqi Kong, Xiao Zhang, Weiran Shen

2603.03763 2026-03-05 math.ST stat.TH

On large bandwidth matrix values kernel smoothed estimators for multi-index models

Taku Moriyama

2603.03674 2026-03-05 stat.ME

HiMAP: Hilbert Mass-Aligned Parameterization for Multivariate Barycenters and Frećhet Regression

Tao Wang, Qiannan Huang, Jun Zhu, Cheng Meng

Comments 35 pages, 14 figures

2603.03673 2026-03-05 cs.LG stat.ML

A Stein Identity for q-Gaussians with Bounded Support

Sophia Sklaviadis, Thomas Moellenhoff, Andre F. T. Martins, Mario A. T. Figueiredo, Mohammad Emtiyaz Khan

2603.03626 2026-03-05 stat.ML cs.LG cs.NA math.NA math.PR

Riemannian Langevin Dynamics: Strong Convergence of Geometric Euler-Maruyama Scheme

Zhiyuan Zhan, Masashi Sugiyama

2603.03621 2026-03-05 cs.LG cs.CV cs.NA math.NA math.OC stat.ML

Extending Neural Operators: Robust Handling of Functions Beyond the Training Set

Blaine Quackenbush, Paul J. Atzberger

Comments related open source software see https://web.atzberger.org/

2603.03613 2026-03-05 stat.ML cs.NE math.OC

Empirical Evaluation of No Free Lunch Violations in Permutation-Based Optimization

Grzegorz Sroka

2603.03569 2026-03-05 stat.ME stat.AP stat.CO

Bayesian Estimation of Variance under Fine Stratification via Mean-Variance Smoothing

Sepideh Mosaferi, Shonosuke Sugasawa

Comments arXiv admin note: text overlap with arXiv:2110.10296

2603.02029 2026-03-05 cs.AI cs.LG stat.ML

Rich Insights from Cheap Signals: Efficient Evaluations via Tensor Factorization

Felipe Maia Polo, Aida Nematzadeh, Virginia Aglietti, Adam Fisch, Isabela Albuquerque

2603.01196 2026-03-05 stat.ME

A Percentile-Focused Regression Method for Applied Data with Irregular Error Structures

Elsayed Elamir

Comments 20 pages, 2 figures

2602.21969 2026-03-05 stat.ME

Estimation of the complexity of a network under a Gaussian graphical model

Nabaneet Das, Thorsten Dickhaus

2601.12767 2026-03-05 stat.ME

Bayesian Variable Selection with the Quasi-Posterior

Beniamino Hadj-Amar, Jack Jewson

2601.03518 2026-03-05 math.PR math.ST stat.TH

Universal concentration for sums under arbitrary dependence

Cosme Louart, Sicheng Tan

Comments 1 Figures

2512.21806 2026-03-05 math.ST stat.TH

Minimum Variance Designs With Constrained Maximum Bias

Douglas P. Wiens

2511.07340 2026-03-05 stat.CO stat.ME

Smoothing Out Sticking Points: Sampling from Discrete-Continuous Mixtures with Dynamical Monte Carlo by Mapping Discrete Mass into a Latent Universe

Andrew Chin, Akihiko Nishimura

2511.01960 2026-03-05 stat.OT q-bio.OT

Towards a Unified Framework for Statistical and Mathematical Modeling

Paul N Zivich

2510.26303 2026-03-05 cs.LG cs.AI math.OC stat.ML

Implicit Bias of Per-sample Adam on Separable Data: Departure from the Full-batch Regime

Beomhan Baek, Minhak Song, Chulhee Yun

Comments Published at ICLR 2026

2510.17325 2026-03-05 math.ST stat.TH

Composite Lp-quantile regression, near quantile regression and the oracle model selection theory

Fuming Lin WEilin Mou

Comments 35 pages, 7 figures, 2 tables

2509.25135 2026-03-05 cs.LG stat.ML

Learning in an Echo Chamber: Online Learning with Replay Adversary

Daniil Dmitriev, Harald Eskelund Franck, Carolin Heinzler, Amartya Sanyal

详情

DOI: 10.1137/1.9781611978971.239
Journal ref: Proceedings of the 2026 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA)

英文摘要

As machine learning systems increasingly train on self-annotated data, they risk reinforcing errors and becoming echo chambers of their own beliefs. We model this phenomenon by introducing a learning-theoretic framework: Online Learning in the Replay Setting. In round $t$, the learner outputs a hypothesis $\hat{h}_t$; the adversary then reveals either the true label $f^\ast(x_t)$ or a replayed label $\hat{h}_i(x_t)$ from an earlier round $i < t$. A mistake is counted only when the true label is shown, yet classical algorithms such as the SOA or the halving algorithm are easily misled by the replayed errors. We introduce the Extended Threshold dimension, $\mathrm{ExThD}(\mathcal{H})$, and prove matching upper and lower bounds that make $\mathrm{ExThD}(\mathcal{H})$ the exact measure of learnability in this model. A closure-based learner makes at most $\mathrm{ExThD}(\mathcal{H})$ mistakes against any adaptive adversary, and no algorithm can perform better. For stochastic adversaries, we prove a similar bound for every intersection-closed class. The replay setting is provably harder than the classical mistake bound setting: some classes have constant Littlestone dimension but arbitrarily large $\mathrm{ExThD}(\mathcal{H})$. Proper learning exhibits an even sharper separation: a class is properly learnable under replay if and only if it is (almost) intersection-closed. Otherwise, every proper learner suffers $Ω(T)$ errors, whereas our improper algorithm still achieves the $\mathrm{ExThD}(\mathcal{H})$ bound. These results give the first tight analysis of learning against replay adversaries, based on new results for closure-type algorithms.

URL PDF HTML ☆

赞 0 踩 0

2509.21091 2026-03-05 stat.ML cs.AI cs.LG

Best-of-$\infty$ -- Asymptotic Performance of Test-Time LLM Ensembling

Junpei Komiyama, Daisuke Oba, Masafumi Oyamada

Comments To appear at ICLR2026. Our code is available at https://github.com/jkomiyama/BoInf-code-publish/. Updated the title

2509.19956 2026-03-05 stat.ME stat.AP

Multi-state Models For Disease Histories Based On Longitudinal Data

Simon Wiegrebe, Johannes Piller, Mathias Gorski, Merle Behr, Helmut Küchenhoff, Iris M. Heid, Andreas Bender

2507.12686 2026-03-05 stat.ML cs.LG math.PR math.ST stat.TH

Finite-Dimensional Gaussian Approximation for Deep Neural Networks: Universality in Random Weights

Krishnakumar Balasubramanian, Nathan Ross

Comments To appear in Bernoulli Journal

2506.13150 2026-03-05 cs.LG math.OC stat.ML

Federated ADMM from Bayesian Duality

Thomas Möllenhoff, Siddharth Swaroop, Finale Doshi-Velez, Mohammad Emtiyaz Khan

Comments First two authors contributed equally. Published at ICLR 2026. Code is at https://github.com/team-approx-bayes/bayes-admm

2506.12112 2026-03-05 math.CV math.ST stat.TH

A Unifying Integral Representation of the Gamma Function and Its Reciprocal

Peter Reinhard Hansen, Chen Tong

Comments Please note: The results in this manuscript have been entirely subsumed and extended by the more comprehensive framework in arXiv:2602.17007

2505.22554 2026-03-05 stat.ML cs.LG

A Copula Based Supervised Filter for Feature Selection in Diabetes Risk Prediction Using Machine Learning

Agnideep Aich, Md Monzur Murshed, Sameera Hewage, Amanda Mayeaux

2505.18535 2026-03-05 cs.LG math.PR stat.ML

Convergence, Sticking and Escape: Stochastic Dynamics Near Critical Points in SGD

Dmitry Dudukalov, Artem Logachov, Vladimir Lotov, Timofei Prasolov, Evgeny Prokopenko, Anton Tarasenko

Comments The introduction, Subsections 2.1 ("Suitable Time Scaling") and 2.2 ("Sticking to a Critical Point"), as well as a small portion of the proof, have been revised. Subsection 2.3 ("Leaving the Neighborhood of a Sharp Maximum") has undergone minor revisions due to the equality in the doubly exponential case

2505.15643 2026-03-05 cs.LG cs.IT math.IT stat.ML

Optimal Best-Arm Identification under Fixed Confidence with Multiple Optima

Lan V. Truong

Comments To appear in IEEE Transactions on Information Theory

2505.07669 2026-03-05 stat.ME stat.AP

Separable models for dynamic signed networks

Alberto Caimo, Isabella Gollini

Comments 20 pages, 9 figures, 3 tables

2505.01297 2026-03-05 math.ST stat.TH

On identification in ill-posed linear regression

Gianluca Finocchio, Tatyana Krivobokova

Comments 61 pages, 2 figures

2504.11279 2026-03-05 stat.CO stat.ME stat.ML

Simulation-based inference for stochastic nonlinear mixed-effects models with applications in systems biology

Henrik Häggström, Sebastian Persson, Marija Cvijovic, Umberto Picchini

Comments 42 pages, 23 figures

2503.18012 2026-03-05 physics.comp-ph stat.ML

Scalable physics-informed deep generative model for solving forward and inverse stochastic differential equations

Shaoqian Zhou, Wen You, Ling Guo, Xuhui Meng

2502.08838 2026-03-05 stat.ME math.ST stat.AP stat.TH

Statistical inference for Levy-driven graph supOU processes: From short- to long-memory in high-dimensional time series

Shreya Mehta, Almut E. D. Veraart

2501.13839 2026-03-05 stat.ME econ.EM

Detecting Sparse Cointegration

Jesus Gonzalo, Jean-Yves Pitarakis

2412.19436 2026-03-05 stat.ML cs.LG

Low-Rank Contextual Reinforcement Learning from Heterogeneous Human Feedback

Seong Jin Lee, Will Wei Sun, Yufeng Liu

2409.08773 2026-03-05 econ.EM stat.ME

Heterogeneous Responses to Continuous Treatments: A Cluster-Based Causal Framework

Augusto Cerqua, Roberta Di Stefano, Raffaele Mattera

2409.00966 2026-03-05 math.PR cs.DS cs.LG math.ST stat.TH

A computational transition for detecting correlated stochastic block models by low-degree polynomials

Guanyi Chen, Jian Ding, Shuyang Gong, Zhangsong Li

Comments 80 pages, 2 figures, added further explanations and remarks; to appear in Annals of Statistics

2408.06958 2026-03-05 cs.LG stat.ML

AuToMATo: An Out-Of-The-Box Persistence-Based Clustering Algorithm

Marius Huber, Sara Kalisnik, Patrick Schnider

Comments Code: https://doi.org/10.5281/zenodo.17279740

2406.14184 2026-03-05 stat.ME

On integral priors for multiple comparison in Bayesian model selection

Diego Salmerón, Juan Antonio Cano, Christian P. Robert

Comments Accepted for publication in International Statistical Review. DOI: 10.1111/insr.70028

2406.14059 2026-03-05 cs.GT cs.LG math.OC stat.ML

Tracking solutions of time-varying variational inequalities

Hédi Hadiji, Sarah Sachs, Cristóbal Guzmán

2405.20856 2026-03-05 stat.ME stat.ML

Parameter identification in linear non-Gaussian causal models under general confounding

Daniele Tramontano, Mathias Drton, Jalal Etesami

2403.15384 2026-03-05 stat.ME

Unifying small area estimators based on area-level and unit-level models through calibration

William Acero, Isabel Molina, J. Miguel Marín

Comments 27 pages, 9 figures, 2 tables

2402.03756 2026-03-05 math.ST cs.NA math.NA stat.TH

Uniform error bounds of the ensemble transform Kalman filter for infinite-dimensional dynamics with multiplicative covariance inflation

Kota Takeda, Takashi Sakajo

Comments 18 pages, 0 figures

2312.05645 2026-03-05 stat.ML cs.CR cs.IT cs.LG math.IT

Sample-Optimal Locally Private Hypothesis Selection and the Provable Benefits of Interactivity

Alireza F. Pour, Hassan Ashtiani, Shahab Asoodeh

2310.09701 2026-03-05 stat.ME

A robust and powerful method for assessing replicability of high dimensional data

Haochen Lei, Yan Li, Hongyuan Cao

2603.03587 2026-03-05 stat.ME cs.LG stat.ML

Controllable Generative Sandbox for Causal Inference

Qi Zhang, Harsh Parikh, Ashley Naimi, Razieh Nabi, Christopher Kim, Timothy Lash

Comments 34 pages, 15 figures. Submitted to ICML 2026. Code available at https://github.com/zhangqiecho/causalmix

2603.03445 2026-03-05 stat.ME stat.AP

The Certainty Bound: Structural Limits on Scientific Reliability

Marco Pollanen

Comments 44 pages, 2 figures, submitted to Meta-Psychology (open peer review)

2603.03411 2026-03-05 stat.ML cs.LG

Scalable Contrastive Causal Discovery under Unknown Soft Interventions

Mingxuan Zhang, Khushi Desai, Sopho Kevlishvili, Elham Azizi

2603.03405 2026-03-05 stat.ML cs.LG

Surprisal-Rényi Free Energy

Shion Matsumoto, Raul Castillo, Benjamin Prada, Ankur Arjun Mali

2603.03401 2026-03-05 stat.ML cs.LG stat.ME

Beyond Cross-Validation: Adaptive Parameter Selection for Kernel-Based Gradient Descents

Xiaotong Liu, Yunwen Lei, Xiangyu Chang, Shao-Bo Lin

2603.03387 2026-03-05 stat.ML cs.AI cs.LG

Learning Order Forest for Qualitative-Attribute Data Clustering

Mingjie Zhao, Sen Feng, Yiqun Zhang, Mengke Li, Yang Lu, Yiu-ming Cheung

Comments Accepted to ECAI2024

2603.03375 2026-03-05 stat.ML cs.LG math.CT

The Theory behind UMAP?

David Wegmann

Comments This article is derived from my masters thesis

2601.05217 2026-03-05 math.ST cs.IT math.IT math.PR stat.TH

A complete characterization of testable hypotheses

Martin Larsson, Johannes Ruf, Aaditya Ramdas

Comments 28 pages

2511.14827 2026-03-05 stat.ML cs.AI cs.LG math.AP

Implicit Bias of the JKO Scheme

Peter Halmos, Boris Hanin

2505.23783 2026-03-05 stat.ML cs.AI cs.CL cs.LG

Boosting In-Context Learning in LLMs Through the Lens of Classical Supervised Learning

Korel Gundem, Juncheng Dong, Dennis Zhang, Vahid Tarokh, Zhengling Qi

2502.05459 2026-03-05 cs.CV cs.AI q-bio.CB stat.ML

DCENWCNet: A Deep CNN Ensemble Network for White Blood Cell Classification with LIME-Based Explainability

Sibasish Dhibar

2408.02391 2026-03-05 math.ST econ.EM stat.TH

Expected Kullback-Leibler-based characterizations of score-driven updates

Ramon de Punder, Timo Dimitriadis, Rutger-Jan Lange