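arXivDaily: a daily academic digest synchronized with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems, and more.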

2510.22007 2026-01-21 math.ST cs.CL cs.CR cs.LG stat.ML stat.TH

Optimal Detection for Language Watermarks with Pseudorandom Collision

T. Tony Cai, Xiang Li, Qi Long, Weijie J. Su, Garrett G. Wen

2601.14223 2026-01-21 math.ST stat.TH

Symmetry Testing in Time Series using Ordinal Patterns: A U-Statistic Approach

Annika Betken, Giorgio Micali, Manuel Ruiz Marín

2601.14199 2026-01-21 stat.ME

Factor Analysis of Multivariate Stochastic Volatility Model

Taehee Lee, Jun S. Liu

Comments Submitted to Journal of the American Statistical Association (JASA)

2601.14173 2026-01-21 cs.LG stat.ML

Penalizing Localized Dirichlet Energies in Low Rank Tensor Products

Paris A. Karakasis, Nicholas D. Sidiropoulos

Comments 19 pages

2601.14170 2026-01-21 math.PR cond-mat.stat-mech cs.DM math.CO math.ST stat.TH

Wasserstein distances between ERGMs and Erdős-Rényi models

Vilas Winstein

Comments 33 pages

Abstract

Ferromagnetic exponential random graph models (ERGMs) are random graph models under which the presence of certain small structures (such as triangles) is encouraged; they can be constructed by tilting an Erdős-Rényi model by the exponential of a particular nonlinear Hamiltonian. These models are mixtures of metastable wells which each behave macroscopically like an Erdős-Rényi model, exhibiting the same laws of large numbers for subgraph counts [CD13]. However, on the microscopic scale these metastable wells are very different from Erdős-Rényi models, with the total variation distance between the two measures tending to 1 [MX23]. In this article we clarify this situation by providing a sharp (up to constants) bound on the Hamming-Wasserstein distance between the two models, which is the average number of edges at which they differ, under the coupling which minimizes this average. In particular, we show that this distance is $\Theta(n^{3/2})$, quantifying exactly how these models differ. An upper bound of this form has appeared in the past [RR19], but this was restricted to the subcritical (high-temperature) regime of parameters. We extend this bound, using a new proof technique, to the supercritical (low-temperature) regime, and prove a matching lower bound which has only previously appeared in the subcritical regime of special cases of ERGMs satisfying a "triangle-free" condition [DF25]. To prove the lower bound in the presence of triangles, we introduce an approximation of the discrete derivative of the Hamiltonian, which controls the dynamical properties of the ERGM, in terms of local counts of triangles and wedges (two-stars) near an edge. This approximation is the main technical and conceptual contribution of the article, and we expect it will be useful in a variety of other contexts as well. Along the way, we also prove a bound on the marginal edge probability under the ERGM via a new bootstrapping argument. Such a bound has already appeared [FLSW25], but again only in the subcritical regime and using a different proof strategy.

2601.14062 2026-01-21 q-fin.ST stat.AP stat.ML

Demystifying the trend of the healthcare index: Is historical price a key driver?

Payel Sadhukhan, Samrat Gupta, Subhasis Ghosh, Tanujit Chakraborty

2601.13998 2026-01-21 stat.ME stat.AP stat.CO

Modeling Zero-Inflated Longitudinal Circular Data Using Bayesian Methods: Application to Ophthalmology

Prajamitra Bhuyan, Soutik Halder, Jayant Jha

2601.13966 2026-01-21 math.ST stat.TH

Information-Theoretic and Computational Limits of Correlation Detection under Graph Sampling

Dong Huang, Pengkun Yang

Comments 61 pages, 7 figures

2601.13962 2026-01-21 eess.SP cs.SY eess.SY q-bio.NC stat.ME

Optimal Calibration of the endpoint-corrected Hilbert Transform

Eike Osmers, Dorothea Kolossa

2601.13955 2026-01-21 math.ST stat.TH

Uniform Consistency of Generalized Cross-Validation for Ridge Regression in High-Dimensional Misspecified Linear Models

Akira Shinkyu

2601.13946 2026-01-21 math.ST stat.TH

Topological Criteria for Hypothesis Testing with Finite-Precision Measurements

Philip Boeken, Eduardo Skapinakis, Konstantin Genin, Joris M. Mooij

2601.13784 2026-01-21 stat.ME

An Adaptive Phase II Trial Design for Dose Selection and Addition in Microfilarial Infections

Sonja Zehetmayer, Marta Bofill Roig, Fabrice Lotola Mougeni, Sabine Specht, Marc P. Hübner, Martin Posch

Abstract

We propose a frequentist adaptive phase 2 trial design to evaluate the safety and efficacy of three treatment regimens (doses) compared to placebo for four types of helminth (worm) infections. This trial will be carried out in four sub-Saharan African countries from spring 2025. Since the safety of the highest dose is not yet established, the study begins with the two lower doses and placebo. Based on safety and early efficacy results from an interim analysis, a decision will be made to either continue with the two lower doses or drop one or both and introduce the highest dose instead. This design borrows information across baskets for safety assessment, while efficacy is assessed separately for each basket. The proposed adaptive design addresses several key challenges: (1) The trial must begin with only the two lower doses because reassuring safety data from these doses is required before escalating to a higher dose. (2) Due to the expected speed of recruitment, adaptation decisions must rely on an earlier, surrogate endpoint. (3) The primary outcome is a count variable that follows a mixture distribution with an atom at 0. To control the familywise error rate in the strong sense when comparing multiple doses to the control in the adaptive design, we extend the partial conditional error approach to accommodate the inclusion of new hypotheses after the interim analysis. In a comprehensive simulation study we evaluate various design options and analysis strategies, assessing the robustness of the design under different design assumptions and parameter values. We identify scenarios where the adaptive design improves the trial's ability to identify an optimal dose. Adaptive dose selection enables resource allocation to the most promising treatment arms, increasing the likelihood of selecting the optimal dose while reducing the required overall sample size and trial duration.

2601.13776 2026-01-21 cs.LG stat.ML

Orthogonium: A Unified, Efficient Library of Orthogonal and 1-Lipschitz Building Blocks

Thibaut Boissin, Franck Mamalet, Valentin Lafargue, Mathieu Serrurier

Journal ref ICML 2025 Workshop on Championing Open-source Development in Machine Learning (CODEML '25), Jul 2025, Vancouver, Canada

2601.13755 2026-01-21 stat.ME

Building a Standardised Statistical Reporting Toolbox in an Academic Oncology Clinical Trials Unit: The grstat R Package

Dan Chaltiel, Alexis Cochard, Nusaibah Ibrahimi, Charlotte Bargain, Ikram Benchara, Anne Lourdessamy, Aldéric Fraslin, Matthieu Texier, Livia Pierotti

2601.13744 2026-01-21 math.ST stat.TH

A Note on k-NN Gating in RAG

Gérard Biau, Claire Boyer

2601.13642 2026-01-21 stat.ML cs.LG

Sample Complexity of Average-Reward Q-Learning: From Single-agent to Federated Reinforcement Learning

Yuchen Jiao, Jiin Woo, Gen Li, Gauri Joshi, Yuejie Chi

2601.13641 2026-01-21 stat.AP eess.SP

Correction of Pooling Matrix Mis-specifications in Compressed Sensing Based Group Testing

Shuvayan Banerjee, Radhendushka Srivastava, James Saunderson, Ajit Rajwade

2601.13627 2026-01-21 stat.AP

Are Large Language Models able to Predict Highly Cited Papers? Evidence from Statistical Publications

Zhanshuo Ye, Yiming Hou, Rui Pan, Tianchen Gao, Hansheng Wang

2601.13605 2026-01-21 eess.SY cs.SY stat.AP

Outage Identification from Electricity Market Data: Quickest Change Detection Approach

Milad Hoseinpour, Shubhanshu Shekhar, Vladimir Dvorkin

Comments 7 pages, 2 figures, 1 table

2601.13544 2026-01-21 physics.soc-ph econ.EM stat.AP

The Collapse of Multilayer Predation and the Emergence of a Monolithic Leviathan

Li Tuobang

Comments in Chinese language

2601.13535 2026-01-21 stat.ME stat.AP

What is Overlap Weighting, How Has it Evolved, and When to Use It for Causal Inference?

Haidong Lu, Fan Li, Laine E. Thomas, Fan Li

Comments 26 pages, 1 table, 1 figure

2601.13514 2026-01-21 stat.ME

Post-selection inference for penalized M-estimators via score thinning

Ronan Perry, Snigdha Panigrahi, Daniela Witten

2601.13474 2026-01-21 cs.LG cs.AI math.OC stat.ML

Preconditioning Benefits of Spectral Orthogonalization in Muon

Jianhao Ma, Yu Huang, Yuejie Chi, Yuxin Chen

2601.13454 2026-01-21 stat.ME

Categorical distance correlation under general encodings and its application to high-dimensional feature screening

Qingyang Zhang

Comments 39 pages, 7 figures

2601.13449 2026-01-21 stat.ME stat.AP

Identifying Causes of Test Unfairness: Manipulability and Separability

Youmi Suk, Weicong Lyu

Comments 20 pages for the main text

2601.13436 2026-01-21 stat.ML cs.LG cs.SY eess.SP eess.SY math.ST stat.TH

Distribution-Free Confidence Ellipsoids for Ridge Regression with PAC Bounds

Szabolcs Szentpéteri, Balázs Csanád Csáji

2601.13405 2026-01-21 stat.ME

Associating High-Dimensional Longitudinal Datasets through an Efficient Cross-Covariance Decomposition

Jianbin Tan, Pixu Shi

Comments 30 pages, 6 figures

2601.13396 2026-01-21 stat.AP

A Two-Stage Bayesian Framework for Multi-Fidelity Online Updating of Spatial Fragility Fields

Abdullah M. Braik, Maria Koliou

Comments 46 pages, 14 figures, 2 tables. This is a preprint and has not been peer reviewed

2601.13362 2026-01-21 stat.AP cs.LG

Improving Geopolitical Forecasts with Bayesian Networks

Matthew Martin

Comments 34 pages, 3 figures

2601.13347 2026-01-21 math.NA cs.NA math.OC math.ST stat.TH

A Scalable Sequential Framework for Dynamic Inverse Problems via Model Parameter Estimation

Aryeh Keating, Mirjeta Pasha

Comments 27 Pages, 8 Figures

2601.13281 2026-01-21 econ.EM q-fin.RM stat.AP

Spectral Dynamics and Regularization for High-Dimensional Copulas

Koos B. Gubbels, Andre Lucas

2601.13272 2026-01-21 cs.LG stat.CO stat.ML

Multi-level Monte Carlo Dropout for Efficient Uncertainty Quantification

Aaron Pim, Tristan Pryer

Comments 26 pages, 11 figures

2601.13254 2026-01-21 math.ST math.AP math.FA math.PR stat.TH

Inverting the Fisher information operator in non-linear models

Dimitri Konen

2512.13642 2026-01-21 econ.EM stat.ML

From Many Models, One: Macroeconomic Forecasting with Reservoir Ensembles

Giovanni Ballarin, Lyudmila Grigoryeva, Yui Ching Li

Comments Updated manuscript with shortened main text

2511.21603 2026-01-21 math.ST stat.TH

Uniform inference for kernel instrumental variable regression

Marvin Lob, Rahul Singh, Suhas Vijaykumar

2511.03694 2026-01-21 stat.CO

Robust Global Fréchet Regression via Weight Regularization

Hao Li, Shonosuke Sugasawa, Shota Katayama

2509.11741 2026-01-21 stat.CO

Tidy simulation: Designing robust, reproducible, and scalable Monte Carlo simulations

Erik-Jan van Kesteren

Comments 16 pages, 3 figures

2509.03515 2026-01-21 cs.RO cs.AI cs.LG cs.SY eess.SY stat.AP

Can the Waymo Open Motion Dataset Support Realistic Behavioral Modeling? A Validation Study with Naturalistic Trajectories

Yanlin Zhang, Sungyong Chung, Nachuan Li, Dana Monzer, Hani S. Mahmassani, Samer H. Hamdar, Alireza Talebpour

2508.20257 2026-01-21 cs.LG stat.ML

Discovering equations from data: symbolic regression in dynamical systems

Beatriz R. Brum, Luiza Lober, Isolde Previdelli, Francisco A. Rodrigues

2508.00770 2026-01-21 math.ST stat.ME stat.TH

On admissibility in post-hoc hypothesis testing

Ben Chugg, Tyron Lardy, Aaditya Ramdas, Peter Grünwald

Comments 58 pages. To appear in the International Journal of Approximate Reasoning

2507.01726 2026-01-21 quant-ph physics.chem-ph stat.ML

Generative flow-based warm start of the variational quantum eigensolver

Hang Zou, Martin Rahm, Anton Frisk Kockum, Simon Olsson

Comments 20 pages; 8 figures

Journal ref npj Quantum Information volume 12, 5 (2026)

2506.22324 2026-01-21 stat.ME

General measures of effect size to calculate power and sample size for Wald tests with generalized linear models

Amy L Cochran, Shijie Yuan, Paul J Rathouz

2506.15511 2026-01-21 stat.AP

A sequential ensemble approach to epidemic modeling: Combining Hawkes and SEIR models using SMC$^2$

Dhorasso Temfack, Jason Wyse

2505.18918 2026-01-21 stat.ML cs.LG eess.SP

ALPCAHUS: Subspace Clustering for Heteroscedastic Data

Javier Salazar Cavazos, Jeffrey A Fessler, Laura Balzano

Comments Manuscript submitted to IEEE Transactions on Signal Processing (TSP), revised, and pending acceptance

2502.04162 2026-01-21 stat.AP cs.LG cs.SI stat.ML

Network-Level Measures of Mobility from Aggregated Origin-Destination Data

Alisha Foster, David A. Meyer, Asif Shakeel

Comments 34 pages, 20 figures

2502.02986 2026-01-21 math.ST stat.TH

Matching Criterion for Identifiability in Sparse Factor Analysis

Nils Sturma, Miriam Kranzlmueller, Irem Portakal, Mathias Drton

2501.17512 2026-01-21 stat.ML cs.LG

A survey on Clustered Federated Learning: Taxonomy, Analysis and Applications

Michael Ben Ali, Omar El-Rifai, Imen Megdiche, André Peninou, Olivier Teste

2501.10263 2026-01-21 stat.ME math.ST stat.TH

Prior distributions for structured semi-orthogonal matrices

Michael Jauch, Marie-Christine Düker, Peter Hoff

Comments 23 pages, 5 figures

2412.19012 2026-01-21 stat.ME

Dynamic networks clustering via mirror distance

Runbing Zheng, Avanti Athreya, Marta Zlatic, Michael Clayton, Carey E. Priebe

2411.19908 2026-01-21 stat.ML cs.LG

Another look at statistical inference with machine learning-imputed data

Jessica Gronsbell, Jianhui Gao, Zachary R. McCaw, Yaqi Shi, David Cheng

2410.00858 2026-01-21 math.PR math.ST stat.CO stat.ML stat.TH

Entropy contraction of the Gibbs sampler under log-concavity

Filippo Ascolani, Hugo Lavenant, Giacomo Zanella

2409.02684 2026-01-21 q-bio.NC cs.LG stat.ML

Neural timescales from a computational perspective

Roxana Zeraati, Anna Levina, Jakob H. Macke, Richard Gao

Comments 21 pages, 5 figures, 3 boxes, 1 table

2409.00297 2026-01-21 cs.LG stat.ML

On Expressive Power of Quantized Neural Networks under Fixed-Point Arithmetic

Yeachan Park, Sejun Park, Geonho Hwang

2404.07923 2026-01-21 stat.ME

BESS: A Bayesian Estimator of Sample Size

Dehua Bi, Yuan Ji

2403.19196 2026-01-21 math.ST stat.TH

What Is a Good Imputation Under MAR Missingness?

Jeffrey Näf, Erwan Scornet, Julie Josse

2311.17797 2026-01-21 cs.LG stat.ME

Learning to Simulate: Generative Metamodeling via Quantile Regression

L. Jeff Hong, Yanxi Hou, Qingkai Zhang, Xiaowei Zhang

2309.14581 2026-01-21 stat.AP cs.CR econ.EM

Assessing Utility of Differential Privacy for RCTs

Kaitlyn R. Webb, Soumya Mukherjee, Aratrika Mustafi, Aleksandra Slavković, Lars Vilhuber

Comments Submitted

2307.02044 2026-01-21 math.ST cs.IT math.IT stat.TH

The distribution of Ridgeless least squares interpolators

Qiyang Han, Xiaocong Xu

2305.14543 2026-01-21 stat.ML cs.LG

Deep Functional Factor Models: Forecasting High-Dimensional Functional Time Series via Bayesian Nonparametric Factorization

Yirui Liu, Xinghao Qiao, Yulong Pei, Liying Wang

Journal ref Proceedings of the 41st International Conference on Machine Learning 2024

2302.09049 2026-01-21 cs.IT cs.LG math.IT math.ST stat.TH

Multiperiodic Processes: Ergodic Sources with a Sublinear Entropy

Łukasz Dębowski

Comments 30 pages; 1 figure

2302.01607 2026-01-21 stat.ME

dynamite: An R Package for Dynamic Multivariate Panel Models

Santtu Tikka, Jouni Helske

Comments This is the version published in the Journal of Statistical Software

Journal ref Journal of Statistical Software, 115(5):1-42, 2025

2204.01540 2026-01-21 cs.CY stat.OT

Teaching for large-scale Reproducibility Verification

Lars Vilhuber, Hyuk Harry Son, Meredith Welch, David N. Wasser, Michael Darisse

2110.12722 2026-01-21 econ.EM stat.ME

Functional instrumental variable regression with an application to estimating the impact of immigration on native wages

Dakyung Seong, Won-Ki Seo

Journal ref Econom. Theory 41 (2025) 1248-1283

2109.08351 2026-01-21 econ.EM stat.ME

Regression Discontinuity Design with Potentially Many Covariates

Yoichi Arai, Taisuke Otsu, Myung Hwan Seo

Journal ref Econom. Theory 41 (2025) 1416-1451

1912.07075 2026-01-21 math.NA cs.NA math.ST stat.TH

Boosted optimal weighted least-squares

Cécile Haberstich, Anthony Nouy, Guillaume Perrin

Comments This version contains a corrected version of appendix section B.1

Journal ref Math. Comp. (2022)

1709.03473 2026-01-21 math.ST econ.EM stat.ML stat.TH

Is completeness necessary? Estimation in nonidentified linear models

Andrii Babii, Jean-Pierre Florens

Journal ref Econom. Theory 41 (2025) 1284-1321

2601.13191 2026-01-21 stat.ML cs.LG

Empirical Risk Minimization with $f$-Divergence Regularization

Francisco Daunas, Iñaki Esnaola, Samir M. Perlaza, H. Vincent Poor

Comments Submitted to IEEE Transactions on Information Theory. arXiv admin note: substantial text overlap with arXiv:2502.14544, arXiv:2508.03314

2601.12957 2026-01-21 math.ST math.PR stat.TH

Random tree Besov priors: Data-driven regularisation parameter selection

Hanne Kekkonen, Andreas Tataris

2601.12931 2026-01-21 cs.LG cs.AI stat.ML

Online Continual Learning for Time Series: a Natural Score-driven Approach

Edoardo Urettini, Daniele Atzeni, Ioanna-Yvonni Tsaknaki, Antonio Carta

2601.12930 2026-01-21 stat.ME

Guidance for Addressing Individual Time Effects in Cohort Stepped Wedge Cluster Randomized Trials: A Simulation Study

Jale Basten, Katja Ickstadt, Nina Timmesfeld

2601.12864 2026-01-21 stat.AP

The impact of abnormal temperatures on crop yields in Italy: a functional quantile regression approach

Giovanni Bocchi, Alessandra Micheletti, Paolo Nota, Alessandro Olper

Comments 14 pages, 5 figures, 3 tables

2601.12633 2026-01-21 math.PR cs.NA math.NA stat.ML

New Trends in the Stability of Sinkhorn Semigroups

Pierre Del Moral, Ajay Jasra

2601.12612 2026-01-21 cs.LG stat.ML

What Trace Powers Reveal About Log-Determinants: Closed-Form Estimators, Certificates, and Failure Modes

Piyush Sao

2601.12587 2026-01-21 stat.ML cs.LG

A Theory of Diversity for Random Matrices with Applications to In-Context Learning of Schrödinger Equations

Frank Cole, Yulong Lu, Shaurya Sehgal

2601.12566 2026-01-21 econ.EM math.ST stat.ME stat.TH

Partial Identification under Stratified Randomization

Bruno Ferman, Davi Siqueira, Vitor Possebom

2601.12552 2026-01-21 stat.AP

Stop using limiting stimuli as a measure of sensitivities of energetic materials

Dennis Christensen, Geir Petter Novik

2601.12540 2026-01-21 stat.ME math.ST stat.TH

Rerandomization for quantile treatment effects

Tingxuan Han, Yuhao Wang

Comments 67 pages, 0 figures

2601.12518 2026-01-21 cs.LG cs.AI stat.ML

Cooperative Multi-agent RL with Communication Constraints

Nuoya Xiong, Aarti Singh

Comments 33 pages

2601.12515 2026-01-21 stat.CO

Bayesian Inference for Partially Observed McKean-Vlasov SDEs with Full Distribution Dependence

Ning Ning, Amin Wu

Comments 23 pages, 30 pages supplementary

2601.12478 2026-01-21 stat.AP

Assessing Interactive Causes of an Occurred Outcome Due to Two Binary Exposures

Shanshan Luo, Wei Li, Xueli Wang, Shaojie Wei, Zhi Geng

2601.12425 2026-01-21 stat.ME stat.CO

Robust semi-parametric mixtures of linear experts using the contaminated Gaussian distribution

Peterson Mambondimumwe, Sphiwe B. Skhosana, Najmeh Nakhaei Rad

2601.12380 2026-01-21 cs.LG stat.ML

Statistical-Neural Interaction Networks for Interpretable Mixed-Type Data Imputation

Ou Deng, Shoji Nishimura, Atsushi Ogihara, Qun Jin

2601.12370 2026-01-21 stat.ME

Single-index Semiparametric Transformation Cure Models with Interval-censored Data

Xiaoru Huang, Tonghui Yu, Xiaoyu Liu

2601.12343 2026-01-21 econ.EM cs.AI stat.ML

How Well Do LLMs Predict Human Behavior? A Measure of their Pretrained Knowledge

Wayne Gao, Sukjin Han, Annie Liang

2601.12321 2026-01-21 stat.AP

A Machine Learning-Based Surrogate EKMA Framework for Diagnosing Urban Ozone Formation Regimes: Evidence from Los Angeles

Sijie Zheng

Comments Preprint. Under review

2601.10993 2026-01-21 stat.ML cs.LG

Memorize Early, Then Query: Inlier-Memorization-Guided Active Outlier Detection

Minseo Kang, Seunghwan Park, Dongha Kim

2601.06371 2026-01-21 econ.EM stat.AP

The Promise of Time-Series Foundation Models for Agricultural Forecasting: Evidence from Commodity Prices

Le Wang, Boyuan Zhang

2512.22162 2026-01-21 math.ST stat.ME stat.TH

Exchangeability and randomness for infinite and finite sequences

Vladimir Vovk

Comments 18 pages, 2 figures

2512.17372 2026-01-21 math.ST stat.TH

False positive control in time series coincidence detection

Ruiting Liang, Samuel Dyson, Rina Foygel Barber, Daniel E. Holz

2512.10825 2026-01-21 math.ST cs.LG math.OC stat.TH

An Elementary Proof of the Near Optimality of LogSumExp Smoothing

Thabo Samakhoana, Benjamin Grimmer

Comments 11 pages

2512.10032 2026-01-21 cs.LG cs.AI stat.ML

Cluster-Dags as Powerful Background Knowledge For Causal Discovery

Jan Marco Ruiz de Vargas, Kirtan Padh, Niki Kilbertus

Comments 23 pages, 5 figures

2512.08601 2026-01-21 stat.ML cs.LG math.OC

Heuristics for Combinatorial Optimization via Value-based Reinforcement Learning: A Unified Framework and Analysis

Orit Davidovich, Shimrit Shtern, Segev Wasserkrug, Nimrod Megiddo

2510.13459 2026-01-21 cs.AI cs.CE cs.NI stat.AP

Mobile Coverage Analysis using Crowdsourced Data

Timothy Wong, Tom Freeman, Joseph Feehily

Comments 8 pages

2510.09895 2026-01-21 cs.LG cs.AI stat.ML

Chain-of-Influence: Tracing Interdependencies Across Time and Features in Clinical Predictive Modelings

Yubo Li, Rema Padman

2510.09616 2026-01-21 cs.CR cs.AI math.ST stat.TH

Causal Digital Twins for Cyber-Physical Security: A Framework for Robust Anomaly Detection in Industrial Control Systems

Mohammadhossein Homaei, Mehran Tarif, Pablo Garcia Rodriguez, Andres Caro, Mar Avila

Comments 22 pages, 6 figures, and 14 tables

2510.08335 2026-01-21 stat.ML cs.LG

PAC Learnability in the Presence of Performativity

Ivan Kirev, Lyuben Baltadzhiev, Nikola Konstantinov

Comments 21 pages, 3 figures; Added another assumption on the RN derivative in Section 5, to fix an incorrect bounding argument in the proof of Theorem 5.1 in the initial version; more details on page 6

2509.25482 2026-01-21 cs.AI cs.LG cs.RO cs.SY eess.SY stat.ML

Message passing-based inference in an autoregressive active inference agent

Wouter M. Kouw, Tim N. Nisslbeck, Wouter L. N. Nuijten

Comments 14 pages, 4 figures, proceedings of the International Workshop on Active Inference 2025. Erratum v1: in Eq. (50), $p(y_t, \Theta, u_t \mid y_{*}, \mathcal{D}_k)$ should have been $p(y_t, \Theta \mid u_t, y_{*}, \mathcal{D}_k)$

2508.13071 2026-01-21 stat.ME

Surrogate-based Bayesian calibration methods for chaotic systems: a comparison of traditional and non-traditional approaches

Maike F. Holthuijzen, Atlanta Chakraborty, Elizabeth Krath, Tommie Catanach

Comments 34 pages, 6 figures

2507.16340 2026-01-21 math.ST stat.ME stat.TH

Structured linear factor models for tail dependence

Alexis Boulin, Axel Bücher

Comments 42 pages

2507.12276 2026-01-21 econ.EM stat.AP

Probabilistic Forecasting of Climate Policy Uncertainty: The Role of Macro-financial Variables and Google Search Data

Donia Besher, Anirban Sengupta, Tanujit Chakraborty

Abstract

Accurately forecasting Climate Policy Uncertainty (CPU) is essential for designing climate strategies that balance economic growth with environmental objectives. Elevated CPU levels can delay regulatory implementation, hinder investment in green technologies, and amplify public resistance to policy reforms, particularly during periods of economic stress. Despite the growing literature documenting the economic relevance of CPU, forecasting its evolution and understanding the role of macro-financial drivers in shaping its fluctuations have not been explored. This study addresses this gap by presenting the first effort to forecast CPU and identify its key drivers. We employ various statistical tools to identify macro-financial exogenous drivers, alongside Google search data to capture early public attention to climate policy. Local projection impulse response analysis quantifies the dynamic effects of these variables, revealing that household financial vulnerability, housing market activity, business confidence, credit conditions, and financial market sentiment exert the most substantial impacts. These predictors are incorporated into a Bayesian Structural Time Series (BSTS) framework to produce probabilistic forecasts for both US and Global CPU indices. Extensive experiments and statistical validation demonstrate that BSTS with time-invariant regression coefficients achieves superior forecasting performance. We demonstrate that this performance stems from its variable selection mechanism, which identifies exogenous predictors that are empirically significant and theoretically grounded, as confirmed by the feature importance analysis. From a policy perspective, the findings underscore the importance of adaptive climate policies that remain effective across shifting economic conditions while supporting long-term environmental and growth objectives.

2506.21894 2026-01-21 stat.ML cs.LG

Thompson Sampling in Function Spaces via Neural Operators

Rafael Oliveira, Xuesong Wang, Kian Ming A. Chai, Edwin V. Bonilla

Comments Final revision to appear at NeurIPS 2025 proceedings, expanded proof of Proposition 2, added Remark 2 on sublinear information gain, and revised discussion at the end of Appendix C.4

2505.13809 2026-01-21 math.ST econ.EM stat.ML stat.TH

Semiparametric Off-Policy Inference for Optimal Policy Values under Possible Non-Uniqueness

Haoyu Wei

2504.20894 2026-01-21 cs.LG stat.ML

Does Feedback Help in Bandits with Arm Erasures?

Merve Karakas, Osama Hanna, Lin F. Yang, Christina Fragouli

Journal ref 2025 IEEE International Symposium on Information Theory (ISIT)

2504.09854 2026-01-21 econ.GN q-fin.EC stat.AP

Do Determinants of EV Purchase Intent vary across the Spectrum? Evidence from Bayesian Analysis of US Survey Data

Nafisa Lohawala, Mohammad Arshad Rahman

Comments 33 pages, three figures, five tables

2502.10653 2026-01-21 econ.EM stat.ME

Policy Learning with Confidence

Victor Chernozhukov, Sokbae Lee, Adam M. Rosen, Liyang Sun

Comments 40 pages, 3 figures

2501.08673 2026-01-21 stat.AP

A Spatio-Temporal Dirichlet Process Mixture Model on Linear Networks for Crime Data

Sujeong Lee, Won Chang, Jorge Mateu, Heejin Lee, Jaewoo Park

2412.16765 2026-01-21 cs.LG math.OC stat.ML

Optimization Insights into Deep Diagonal Linear Networks

Hippolyte Labarrière, Cesare Molinari, Lorenzo Rosasco, Cristian Vega, Silvia Villa

2411.05870 2026-01-21 eess.SY cs.SY math.DS math.PR physics.data-an stat.ME

An Adaptive Online Smoother with Closed-Form Solutions and Information-Theoretic Lag Selection for Conditional Gaussian Nonlinear Systems

Marios Andreou, Nan Chen, Yingda Li

Comments Latest revision. 46 pages (Main Text pp. 1-28; Appendix pp. 29-40), 9 figures (7 in Main Text, 2 in Appendix). Currently under review in Journal of Nonlinear Science (Springer Nature). Code available upon request. For further details visit https://mariosandreou.short.gy/OnlineSmootherCGNS

Abstract

Data assimilation (DA) combines partial observations with dynamical models to improve state estimation. Filter-based DA uses only past and present data and is the prerequisite for real-time forecasts. Smoother-based DA exploits both past and future observations. It aims to fill in missing data, provide more accurate estimations, and develop high-quality datasets. However, the standard smoothing procedure requires using all historical state estimations, which is storage-demanding, especially for high-dimensional systems. This paper develops an adaptive-lag online smoother for a large class of complex dynamical systems with strong nonlinear and non-Gaussian features, which has important applications to many real-world problems. The adaptive lag allows the utilization of observations only within a nearby window, thus reducing computational complexity and storage needs. Online lag adjustment is essential for tackling turbulent systems, where temporal autocorrelation varies significantly over time due to intermittency, extreme events, and nonlinearity. Based on the uncertainty reduction in the estimated state, an information criterion is developed to systematically determine the adaptive lag. Notably, the mathematical structure of these systems facilitates the use of closed analytic formulae to calculate the online smoother and adaptive lag, avoiding empirical tunings as in ensemble-based DA methods. The adaptive online smoother is applied to studying three important scientific problems. First, it helps detect online causal relationships between state variables. Second, the advantage of reduced computational storage expenditure is illustrated via Lagrangian DA, a high-dimensional nonlinear problem. Finally, the adaptive smoother advances online parameter estimation with partial observations, emphasizing the role of the observed extreme events in accelerating convergence.


2407.17225 2026-01-21 stat.ME

Asymmetry Analysis of Bilateral Shapes

Kanti V. Mardia, Xiangyu Wu, John T. Kent, Colin R. Goodall, Balvinder S. Khambay

Comments 38 pages

2407.15301 2026-01-21 stat.ML cs.LG math.ST q-bio.QM stat.TH

U-learning for Prediction Inference via Combinatory Multi-Subsampling: With Applications to LASSO and Neural Networks

Zhe Fei, Yi Li

2405.18828 2026-01-21 math.ST stat.ML stat.TH

CHANI: Correlation-based Hawkes Aggregation of Neurons with bio-Inspiration

Sophie Jaffard, Samuel Vaiter, Patricia Reynaud-Bouret

2405.15325 2026-01-21 cs.LG stat.ML

On the Identification of Temporally Causal Representation with Instantaneous Dependence

Zijian Li, Yifan Shen, Kaitao Zheng, Ruichu Cai, Xiangchen Song, Mingming Gong, Guangyi Chen, Kun Zhang

2405.11923 2026-01-21 math.ST stat.ME stat.TH

Rate Optimality and Phase Transition for User-Level Local Differential Privacy

Alexander Kent, Thomas B. Berrett, Yi Yu

Comments 98 pages, 12 figures, 6 tables

2301.10932 2026-01-21 cs.LG math.OC stat.ML

On the Global Convergence of Risk-Averse Natural Policy Gradient Methods with Expected Conditional Risk Measures

Xian Yu, Lei Ying

2207.05442 2026-01-21 stat.ML cs.LG

Wasserstein multivariate auto-regressive models for modeling distributional time series

Yiye Jiang, Jérémie Bigot

2103.13236 2026-01-21 stat.ME stat.AP

Bayesian Evidence Synthesis for the common effect model

Stavros Nikolakopoulos, Björn Alfons Edmar, Ioannis Ntzoufras

Comments 19 pages, 2 figures

2103.04086 2026-01-21 stat.ME

Non-parametric Bayesian inference via loss functions under model misspecification

Yu Luo, David A. Stephens, Daniel J. Graham, Emma J. McCoy

2601.12231 2026-01-21 cs.LG cs.CR stat.CO

Wavelet-Aware Anomaly Detection in Multi-Channel User Logs via Deviation Modulation and Resolution-Adaptive Attention

Kaichuan Kong, Dongjie Liu, Xiaobo Jin, Shijie Xu, Guanggang Geng

Comments Accepted by ICASSP 2026. Copyright 2026 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

2601.12221 2026-01-21 stat.AP

A warping function-based control chart for detecting distributional changes in damage-sensitive features for structural condition assessment

Zhicheng Chen, Wenyu Chen, Xinyi Lei

Abstract

Data-driven damage detection methods achieve damage identification by analyzing changes in damage-sensitive features (DSFs) derived from structural health monitoring (SHM) data. The core reason for their effectiveness lies in the fact that damage or structural state transition can be manifested as changes in the distribution of DSF data. This enables us to reframe the problem of damage detection as one of identifying these distributional changes. Hence, developing automated tools for detecting such changes is pivotal for automated structural health diagnosis. Control charts are extensively utilized in SHM for DSF change detection, owing to their excellent online detection and early warning capabilities. However, conventional methods are primarily designed to detect mean or variance shifts, making it challenging to identify complex shape changes in distributions. This limitation results in insufficient damage detection sensitivity. Moreover, they typically exhibit poor robustness against data contamination. This paper proposes a novel control chart to address these limitations. It employs the probability density functions (PDFs) of subgrouped DSF data as monitoring objects, with shape deformations characterized by warping functions. Furthermore, a nonparametric control chart is specifically constructed for warping function monitoring in the functional data analysis framework. Key advantages of the new method include the ability to detect both shifts and complex shape deformations in distributions, excellent online detection performance, and robustness against data contamination. Extensive simulation studies demonstrate its superiority over competing approaches. Finally, the method is applied to detecting distributional changes in DSF data for cable condition assessment in a long-span cable-stayed bridge, demonstrating its practical utility in engineering.


2601.12213 2026-01-21 cs.LG math.OC stat.ML

One-Sided Matrix Completion from Ultra-Sparse Samples

Hongyang R. Zhang, Zhenshuo Zhang, Huy L. Nguyen, Guanghui Lan

Comments 41 pages

Journal ref Trans. Mach. Learn. Res. 2026

Abstract

Matrix completion is a classical problem that has received recurring interest across a wide range of fields. In this paper, we revisit this problem in an ultra-sparse sampling regime, where each entry of an unknown, $n\times d$ matrix $M$ (with $n \ge d$) is observed independently with probability $p = C / d$, for a fixed integer $C \ge 2$. This setting is motivated by applications involving large, sparse panel datasets, where the number of rows far exceeds the number of columns. When each row contains only $C$ entries -- fewer than the rank of $M$ -- accurate imputation of $M$ is impossible. Instead, we estimate the row span of $M$ or the averaged second-moment matrix $T = M^{\top} M / n$. The empirical second-moment matrix computed from observed entries exhibits non-random and sparse missingness. We propose an unbiased estimator that normalizes each nonzero entry of the second moment by its observed frequency, followed by gradient descent to impute the missing entries of $T$. The normalization divides a weighted sum of $n$ binomial random variables by the total number of ones. We show that the estimator is unbiased for any $p$ and enjoys low variance. When the row vectors of $M$ are drawn uniformly from a rank-$r$ factor model satisfying an incoherence condition, we prove that if $n \ge O(d r^5 \epsilon^{-2} C^{-2} \log d)$, any local minimum of the gradient-descent objective is approximately global and recovers $T$ with error at most $\epsilon^2$. Experiments on both synthetic and real-world data validate our approach. On three MovieLens datasets, our algorithm reduces bias by $88\%$ relative to baseline estimators. We also empirically validate the linear sampling complexity of $n$ relative to $d$ on synthetic data. On an Amazon reviews dataset with sparsity $10^{-7}$, our method reduces the recovery error of $T$ by $59\%$ and $M$ by $38\%$ compared to baseline methods.
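The normalization step described in the abstract can be illustrated with a minimal sketch. This is one plausible reading of "normalizes each nonzero entry of the second moment by its observed frequency", not the authors' code: the function name `second_moment_estimate` and the sparse row format (a dict from column index to observed value) are assumptions made for illustration, and the subsequent gradient-descent imputation of the missing entries of $T$ is not shown.

```python
def second_moment_estimate(rows, d):
    """Frequency-normalized second-moment estimate (illustrative sketch).

    rows: list of dicts {column_index: observed_value}, one dict per matrix row
          (hypothetical input format, assumed here for illustration).
    d:    number of columns.
    Returns a d x d list of lists. Entry (i, j) is the average of
    M[r][i] * M[r][j] over the rows r in which columns i and j were both
    observed, or None if the pair was never observed together.
    """
    sums = [[0.0] * d for _ in range(d)]
    counts = [[0] * d for _ in range(d)]
    for row in rows:
        observed = list(row.items())
        for i, value_i in observed:
            for j, value_j in observed:
                sums[i][j] += value_i * value_j  # accumulate co-observed products
                counts[i][j] += 1                # how often the pair (i, j) was seen
    return [
        [sums[i][j] / counts[i][j] if counts[i][j] else None for j in range(d)]
        for i in range(d)
    ]
```

When every entry is observed, each count equals $n$ and the result reduces to $M^{\top} M / n$. Under ultra-sparse sampling most off-diagonal pairs are never co-observed and stay `None`; those are the entries the abstract says are imputed afterwards.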


2601.12178 2026-01-21 cs.LG stat.ML

Federated Learning for the Design of Parametric Insurance Indices under Heterogeneous Renewable Production Losses

Fallou Niakh

2601.12167 2026-01-21 stat.ME stat.AP stat.OT

Using Directed Acyclic Graphs to Illustrate Common Biases in Diagnostic Test Accuracy Studies

Yang Lu, Nandini Dendukuri

2601.12120 2026-01-21 stat.ME stat.ML

Lost in Aggregation: The Causal Interpretation of the IV Estimand

Danielle Tsao, Krikamol Muandet, Frederick Eberhardt, Emilija Perković

2601.12105 2026-01-21 cs.CR stat.AP stat.ME

Privacy-Preserving Cohort Analytics for Personalized Health Platforms: A Differentially Private Framework with Stochastic Risk Modeling

Richik Chakraborty, Lawrence Liu, Syed Hasnain

Comments 18 pages, 4 figures

2601.12031 2026-01-21 stat.ME

Estimations of Extreme CoVaR and CoES under Asymptotic Independence

Qingzhao Zhong

2601.12023 2026-01-21 stat.ML cs.LG

A Kernel Approach for Semi-implicit Variational Inference

Longlin Yu, Ziheng Cheng, Shiyue Zhang, Cheng Zhang

Comments 40 pages, 15 figures. arXiv admin note: substantial text overlap with arXiv:2405.18997

2601.11949 2026-01-21 stat.AP

A Deep Learning-Copula Framework for Climate-Related Home Insurance Risk

Asim K. Dey

2601.11905 2026-01-21 cs.AI cs.LG math.ST stat.TH

LIBRA: Language Model Informed Bandit Recourse Algorithm for Personalized Treatment Planning

Junyu Cao, Ruijiang Gao, Esmaeil Keyvanshokooh, Jianhao Ma

Comments 50 pages. Previous version with human-AI collaboration: arXiv:2410.14640

2601.11897 2026-01-21 cs.LG stat.ME stat.ML

Task-tailored Pre-processing: Fair Downstream Supervised Learning

Jinwon Sohn, Guang Lin, Qifan Song

2601.11790 2026-01-21 stat.ML cs.LG stat.ME

Gradient-based Active Learning with Gaussian Processes for Global Sensitivity Analysis

Guerlain Lambert, Céline Helbert, Claire Lauvernet

2601.11717 2026-01-21 math.ST math.PR stat.TH

Detecting Mutual Excitations in Non-Stationary Hawkes Processes

Elchanan Mossel, Anirudh Sridhar

Comments 12 pages

2601.11701 2026-01-21 math.ST stat.ML stat.TH

Stability and Accuracy Trade-offs in Statistical Estimation

Abhinav Chakraborty, Yuetian Luo, Rina Foygel Barber

Comments The first two authors contributed equally and are listed in alphabetical order

2601.11638 2026-01-21 cs.LG stat.ML

Verifying Physics-Informed Neural Network Fidelity using Classical Fisher Information from Differentiable Dynamical System

Josafat Ribeiro Leal Filho, Antônio Augusto Fröhlich

Comments This paper has been submitted and is currently under review at IEEE Transactions on Neural Networks and Learning Systems (TNNLS)

2601.06327 2026-01-21 cs.OH stat.AP

From Lagging to Leading: Validating Hard Braking Events as High-Density Indicators of Segment Crash Risk

Yechen Li, Shantanu Shahane, Shoshana Vasserman, Carolina Osorio, Yi-fan Chen, Ivan Kuznetsov, Kristin White, Justyna Swiatkowska, Neha Arora, Feng Guo

Abstract

Identifying high crash risk road segments and accurately predicting crash incidence is fundamental to implementing effective safety countermeasures. While collision data inherently reflects risk, the infrequency and inconsistent reporting of crashes present a major challenge to robust risk prediction models. The proliferation of connected vehicle technology offers a promising avenue to leverage high-density safety metrics for enhanced crash forecasting. A Hard-Braking Event (HBE), interpreted as an evasive maneuver, functions as a potent proxy for elevated driving risk due to its demonstrable correlation with underlying crash causal factors. Crucially, HBE data is significantly more readily available across the entire road network than conventional collision records. This study systematically evaluated the correlation at individual road segment level between police-reported collisions and aggregated and anonymized HBEs identified via the Google Android Auto platform, utilizing datasets from California and Virginia. Empirical evidence revealed that HBEs occur at a rate orders of magnitude higher than that of traffic crashes. Employing the state-of-the-practice Negative-Binomial regression models, the analysis established a statistically significant positive correlation between the HBE rate and the crash rate: road segments exhibiting a higher frequency of HBEs were consistently associated with a greater incidence of crashes. This sophisticated model incorporated and controlled for various confounding factors, including road type, speed profile, proximity to ramps, and road segment slope. The HBEs derived from connected vehicle technology thus provide a scalable, high-density safety surrogate metric for network-wide traffic safety assessment, with the potential to optimize safer routing recommendations and inform the strategic deployment of active safety countermeasures.


2601.06296 2026-01-21 stat.ME stat.ML

A Targeted Learning Framework for Estimating Restricted Mean Survival Time Difference using Pseudo-observations

Man Jin, Yixin Fang

2512.20232 2026-01-21 cs.LG stat.AP

Adaptive Multi-task Learning for Probabilistic Load Forecasting

Onintze Zaballa, Verónica Álvarez, Santiago Mazuelas

2510.25005 2026-01-21 cs.AI cs.LG math.ST stat.ML stat.TH

Cyclic Counterfactuals under Shift-Scale Interventions

Saptarshi Saha, Dhruv Vansraj Rathore, Utpal Garain

Comments Accepted at NeurIPS 2025

2510.04460 2026-01-21 math.PR cs.DS cs.LG math.ST stat.TH

Perspectives on Stochastic Localization

Bobby Shi, Kevin Tian, Matthew S. Zhang

2509.17140 2026-01-21 stat.AP stat.ME

An Italian Gender Equality Index

Lorenzo Panebianco

2508.09040 2026-01-21 stat.ME econ.EM math.ST stat.TH

Bias correction for Chatterjee's graph-based correlation coefficient

Mona Azadkia, Leihao Chen, Fang Han

Comments 45 pages; this version includes additional results demonstrating that the bias can be negligible when d<=3

2508.05259 2026-01-21 math.ST stat.TH

Nonparametric Estimation from Correlated Copies of a Drifted Process

Nicolas Marie

Comments 23 pages, 6 figures

2508.04444 2026-01-21 cs.LG cs.NA math.NA stat.ML

Matrix-Free Two-to-Infinity and One-to-Two Norms Estimation

Askar Tsyganov, Evgeny Frolov, Sergey Samsonov, Maxim Rakhuba

Comments AAAI-2026, camera-ready version

2506.22621 2026-01-21 cs.LG math.OC stat.ML

Modeling Hierarchical Spaces: A Review and Unified Framework for Surrogate-Based Architecture Design

Paul Saves, Edward Hallé-Hannan, Jasper Bussemaker, Youssef Diouane, Nathalie Bartoli

Comments Published in Structural and Multidisciplinary Optimization, Springer Nature (2026)

2505.01849 2026-01-21 stat.AP stat.CO

Applications of higher order Markov models and Pressure Index to strategize controlled run chases in Twenty20 cricket

Rhitankar Bandyopadhyay, Dibyojyoti Bhattacharjee

Comments 33 pages, 2 figures, 24 tables, 8 pseudo codes for algorithms

2503.20466 2026-01-21 physics.ao-ph cs.LG stat.ML

Data-driven Seasonal Climate Predictions via Variational Inference and Transformers

Lluís Palma, Alejandro Peraza, David Civantos, Amanda Duarte, Stefano Materia, Ángel G. Muñoz, Jesús Peña-Izquierdo, Laia Romero, Albert Soret, Markus G. Donat

DOI: 10.1038/s41612-026-01320-z

Abstract

Most operational climate services providers base their seasonal predictions on initialised general circulation models (GCMs) or statistical techniques that fit past observations. GCMs require substantial computational resources, which limits their capacity. In contrast, statistical methods often lack robustness due to short historical records. Recent works propose machine learning methods trained on climate model output, leveraging larger sample sizes and simulated scenarios. Yet, many of these studies focus on prediction tasks that might be restricted in spatial extent or temporal coverage, opening a gap with existing operational predictions. Thus, the present study evaluates the effectiveness of a methodology that combines variational inference with transformer models to predict fields of seasonal anomalies. The predictions cover all four seasons and are initialised one month before the start of each season. The model was trained on climate model output from CMIP6 and tested using ERA5 reanalysis data. We analyse the method's performance in predicting interannual anomalies beyond the climate change-induced trend. We also test the proposed methodology in a regional context with a use case focused on Europe. While climate change trends dominate the skill of temperature predictions, the method presents additional skill over the climatological forecast in regions influenced by known teleconnections. We reach similar conclusions based on the validation of precipitation predictions. Despite underperforming SEAS5 in most tropics, our model offers added value in numerous extratropical inland regions. This work demonstrates the effectiveness of training generative models on climate model output for seasonal predictions, providing skilful predictions beyond the induced climate change trend at time scales and lead times relevant for user applications.


2503.06431 2026-01-21 stat.ME cs.LG

Fairness-aware kidney exchange and kidney paired donation

Mingrui Zhang, Xiaowu Dai, Lexin Li

2502.20966 2026-01-21 stat.ML cs.LG math.ST stat.TH

Post-Hoc Uncertainty Quantification in Pre-Trained Neural Networks via Activation-Level Gaussian Processes

Richard Bergna, Stefan Depeweg, Sergio Calvo Ordonez, Jonathan Plenk, Alvaro Cartea, Jose Miguel Hernandez-Lobato

Comments 10 pages, 8 figures, 7th Symposium on Advances in Approximate Bayesian Inference

2502.16542 2026-01-21 stat.ML cs.LG

Variable transformations in consistent loss functions

Hristos Tyralis, Georgia Papacharalampous

Comments 37 pages, 4 figures, 2 tables

Journal ref Knowledge-Based Systems 336 (2026) 115202

2411.00839 2026-01-21 cs.LG cs.AI cs.CV stat.ME stat.ML

CausAdv: A Causal-based Framework for Detecting Adversarial Examples

Hichem Debbi

2410.21914 2026-01-21 stat.ME stat.CO

Bayesian Stability Selection and Inference on Selection Probabilities

Mahdi Nouraie, Connor Smith, Samuel Muller

2410.18164 2026-01-21 cs.LG cs.AI stat.ML

TabDPT: Scaling Tabular Foundation Models on Real Data

Junwei Ma, Valentin Thomas, Rasa Hosseinzadeh, Alex Labach, Hamidreza Kamkari, Jesse C. Cresswell, Keyvan Golestan, Guangwei Yu, Anthony L. Caterini, Maksims Volkovs

Comments Inference repo: github.com/layer6ai-labs/TabDPT-inference; Training repo: github.com/layer6ai-labs/TabDPT-training

Journal ref NeurIPS 2025 Proceedings

2410.15530 2026-01-21 stat.ME math.ST stat.TH

Simultaneous Inference in Multiple Matrix-Variate Graphs for High-Dimensional Neural Recordings

Zongge Liu, Heejong Bong, Zhao Ren, Matthew A. Smith, Robert E. Kass

2410.09267 2026-01-21 stat.ME

Experimentation on Endogenous Graphs

Wenshuo Wang, Edvard Bakhitov, Dominic Coey

2409.15995 2026-01-21 stat.ME q-bio.QM stat.AP

Robust Inference for Non-Linear Regression Models with Applications in Enzyme Kinetics

Suryasis Jana, Abhik Ghosh

Comments To appear in the Journal of Applied Statistics

2408.02060 2026-01-21 math.ST stat.ME stat.ML stat.TH

Winners with Confidence: Discrete Argmin Inference with an Application to Model Selection

Tianyu Zhang, Hao Lee, Jing Lei

2407.21119 2026-01-21 econ.EM stat.ME

Potential weights and implicit causal designs in linear regression

Jiafeng Chen

2404.11781 2026-01-21 stat.ME math.ST stat.AP stat.TH

Asymmetric canonical correlation analysis of Riemannian and high-dimensional data

James Buenfil, Eardi Lila

Journal ref Electron. J. Statist. 19 (2) 6077 - 6102, 2025

2402.10818 2026-01-21 cs.LG stat.ML

Trading off Consistency and Dimensionality of Convex Surrogates for the Mode

Enrique Nueve, Bo Waggoner, Dhamma Kimpara, Jessie Finocchiaro

Comments Updated error with Bregman Losses to only Square Losses

2401.12309 2026-01-21 econ.EM stat.ME

Interpreting Event-Studies from Recent Difference-in-Differences Methods

Jonathan Roth

2312.16819 2026-01-21 cs.LG math.OC stat.ML

Hidden Minima in Two-Layer ReLU Networks

Yossi Arjevani

2309.12162 2026-01-21 stat.ME cs.LG econ.EM math.ST stat.TH

Optimal Conditional Inference in Adaptive Experiments

Jiafeng Chen, Isaiah Andrews

Comments An extended abstract of this paper was presented at CODE@MIT 2021

2210.00953 2026-01-21 stat.ML cs.LG math.OC

Bias and Extrapolation in Markovian Linear Stochastic Approximation with Constant Stepsizes

Dongyan Huo, Yudong Chen, Qiaomin Xie

Comments SIGMETRICS 2023

Journal ref Mathematics of Operations Research, 2026

2111.03754 2026-01-21 stat.ME

Transformed Linear Prediction for Extremes

Jeongjin Lee, Daniel Cooley

2103.03191 2026-01-21 stat.ML cs.LG cs.NA math.NA math.OC math.PR

Generalization Bounds for Sparse Random Feature Expansions

Abolfazl Hashemi, Hayden Schaeffer, Robert Shi, Ufuk Topcu, Giang Tran, Rachel Ward

2009.13040 2026-01-21 stat.ML cs.LG math.ST stat.TH

Local Minima Structures in Gaussian Mixture Models

Yudong Chen, Dogyoon Song, Xumei Xi, Yuqian Zhang

Comments 73 pages, 6 figures, 2 tables. To appear in Transactions on Information Theory

Journal ref IEEE Transactions on Information Theory, vol. 70, no. 6, pp. 4218-4257, 2024