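arXivDaily: a daily academic digest synced with the full arXiv data set, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems, and related fields.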



2602.15008 2026-02-17 cs.LG cs.IT math.IT math.ST stat.ML stat.TH

Efficient Sampling with Discrete Diffusion Models: Sharp and Adaptive Guarantees

Daniil Dmitriev, Zhihan Huang, Yuting Wei

2602.15007 2026-02-17 stat.AP stat.ME

Hidden Markov Individual-level Models of Infectious Disease Transmission

Dirk Douwes-Schultz, Rob Deardon, Alexandra M. Schmidt

2602.14998 2026-02-17 math.PR cs.IT cs.SI math.IT math.ST stat.TH

Random geometric graphs with smooth kernels: sharp detection threshold and a spectral conjecture

Cheng Mao, Yihong Wu, Jiaming Xu

2602.14991 2026-02-17 stat.ME

Joint analysis for multivariate longitudinal and event time data with a change point anchored at interval-censored event time

Yue Zhan, Cheng Zheng, Ying Zhang

2602.14969 2026-02-17 math.ST stat.TH

Topological trivialization in non-convex empirical risk minimization

Andrea Montanari, Basil Saeed

Comments 33 pages; 16 pdf figures

2602.14952 2026-02-17 cs.LG math.OC stat.ME stat.ML

Locally Adaptive Multi-Objective Learning

Jivat Neet Kaur, Isaac Gibbs, Michael I. Jordan

Comments Code is available at https://github.com/jivatneet/adaptive-multiobjective

2602.14942 2026-02-17 stat.ME

Balanced Stochastic Block Model for Community Detection in Signed Networks

Yichao Chen, Weijing Tang, Ji Zhu

2602.14877 2026-02-17 stat.AP

When to repeat a biomarker test? Decomposing sources of variation from conditionally repeated measurements

Supun Manathunga, Mart P. Janssen, Yu Luo, W. Alton Russell, Mart Pothast

Comments 36 pages, 12 figures

2602.14872 2026-02-17 cs.LG cs.AI math.OC stat.ML

On the Learning Dynamics of RLVR at the Edge of Competence

Yu Huang, Zixin Wen, Yuejie Chi, Yuting Wei, Aarti Singh, Yingbin Liang, Yuxin Chen

2602.14869 2026-02-17 cs.AI stat.ML

Concept Influence: Leveraging Interpretability to Improve Performance and Efficiency in Training Data Attribution

Matthew Kowal, Goncalo Paulo, Louis Jaburi, Tom Tseng, Lev E McKinney, Stefan Heimersheim, Aaron David Tucker, Adam Gleave, Kellin Pelrine

2602.14791 2026-02-17 cs.LG stat.ML

Extending Multi-Source Bayesian Optimization With Causality Principles

Luuk Jacobs, Mohammad Ali Javidian

Comments An extended abstract version of this work was accepted for the Proceedings of the 25th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2026)

2602.14701 2026-02-17 cs.LG stat.ML

Unbiased Approximate Vector-Jacobian Products for Efficient Backpropagation

Killian Bakong, Laurent Massoulié, Edouard Oyallon, Kevin Scaman

2602.14692 2026-02-17 stat.CO math.PR

Weak Poincaré inequalities for Deterministic-scan Metropolis-within-Gibbs samplers

Mengxi Gao, Gareth O. Roberts, Andi Q. Wang

Comments 51 pages

2602.14678 2026-02-17 quant-ph cond-mat.dis-nn stat.CO

NISQ-compatible quantum cryptography based on Parrondo dynamics in discrete-time quantum walks

Aditi Rath, Dinesh Kumar Panda, Colin Benjamin

Comments 23 pages, 24 figures, 3 tables

2602.14642 2026-02-17 stat.ML cs.LG

GenPANIS: A Latent-Variable Generative Framework for Forward and Inverse PDE Problems in Multiphase Media

Matthaios Chatzopoulos, Phaedon-Stelios Koutsourelakis

2602.06568 2026-02-17 math.ST math.PR stat.CO stat.TH

Ergodicity of an Adaptive MCMC Sampler under a Probability Bound

Alexandre Chotard

2602.06320 2026-02-17 stat.ML cond-mat.dis-nn cs.LG

High-Dimensional Limit of Stochastic Gradient Flow via Dynamical Mean-Field Theory

Sota Nishiyama, Masaaki Imaizumi

2602.01427 2026-02-17 stat.ML cs.LG stat.AP

Robust Generalization with Adaptive Optimal Transport Priors for Decision-Focused Learning

Haixiang Sun, Andrew L. Liu

2601.21812 2026-02-17 stat.ML cs.AI cs.LG

A Decomposable Forward Process in Diffusion Models for Time-Series Forecasting

Francisco Caldas, Sahil Kumar, Cláudia Soares

Comments submitted to ICML'26

2512.19064 2026-02-17 stat.AP

Unraveling time-varying causal effects of multiple exposures: integrating Functional Data Analysis with Multivariable Mendelian Randomization

Nicole Fontana, Francesca Ieva, Luisa Zuccolo, Emanuele Di Angelantonio, Piercesare Secchi

2512.17979 2026-02-17 cs.GT cs.AI cs.MA econ.GN q-fin.EC stat.AP

Adaptive Agents in Spatial Double-Auction Markets: Modeling the Emergence of Industrial Symbiosis

Matthieu Mastio, Paul Saves, Benoit Gaudou, Nicolas Verstaevel

Comments AAMAS CC-BY 4.0 licence. Adaptive Agents in Spatial Double-Auction Markets: Modeling the Emergence of Industrial Symbiosis. Full paper. In Proc. of the 25th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2026), Paphos, Cyprus, May 25 - 29, 2026, IFAAMAS, 10 pages

2512.13228 2026-02-17 cs.LG stat.ML

ModSSC: A Modular Framework for Semi-Supervised Classification on Heterogeneous Data

Melvin Barbaux, Samia Boukir

Comments Preprint describing the open source ModSSC framework for inductive and transductive semi-supervised classification on heterogeneous data

2511.15315 2026-02-17 stat.ML cs.LG

Robust Bayesian Optimisation with Unbounded Corruptions

Abdelhamid Ezzerg, Ilija Bogunovic, Jeremias Knoblauch

2508.17622 2026-02-17 stat.ML cs.LG econ.TH math.OC

The Statistical Fairness-Accuracy Frontier

Alireza Fallah, Michael I. Jordan, Annie Ulichney

2506.13593 2026-02-17 cs.LG stat.AP stat.ML

Calibrated Predictive Lower Bounds on Time-to-Unsafe-Sampling in LLMs

Hen Davidov, Shai Feldman, Gilad Freidkin, Yaniv Romano

2505.16788 2026-02-17 stat.CO stat.AP

Interpretable contour level selection for heat maps for gridded data

Tarn Duong

2504.13781 2026-02-17 stat.ME

Addressing outliers in mixed-effects logistic regression: a more robust modeling approach

Divan A. Burger, Sean van der Merwe, Emmanuel Lesaffre

2503.21980 2026-02-17 math.ST stat.ME stat.ML stat.TH

Rolled Gaussian process models for curves on manifolds

Simon Preston, Karthik Bharath, Pablo Lopez-Custodio, Alfred Kume

2503.21715 2026-02-17 stat.ME econ.EM

A Powerful Bootstrap Test of Independence in High Dimensions

Mauricio Olivares, Tomasz Olma, Daniel Wilhelm

2503.09411 2026-02-17 cs.LG math.OC stat.ML

Learning Rate Annealing Improves Tuning Robustness in Stochastic Optimization

Amit Attia, Tomer Koren

Comments 23 pages

2602.14616 2026-02-17 stat.CO q-bio.QM stat.AP

Higher-Order Hit-&-Run Samplers for Linearly Constrained Densities

Richard D. Paul, Anton Stratmann, Johann F. Jadebeck, Martin Beyß, Hanno Scharr, David Rügamer, Katharina Nöh

2602.14607 2026-02-17 stat.ME cs.LG cs.NA math.NA stat.CO

A Bayesian Approach to Low-Discrepancy Subset Selection

Nathan Kirk

Comments 13 pages, 3 figures, mODa14

2602.14587 2026-02-17 cs.LG cs.AI math.OC math.ST stat.TH

Decoupled Continuous-Time Reinforcement Learning via Hamiltonian Flow

Minh Nguyen

2602.14580 2026-02-17 cs.LG stat.ML

Replicable Constrained Bandits

Matteo Bollini, Gianmarco Genalti, Francesco Emanuele Stradi, Matteo Castiglioni, Alberto Marchesi

2602.14543 2026-02-17 cs.LG stat.ML

Truly Adapting to Adversarial Constraints in Constrained MABs

Francesco Emanuele Stradi, Kalana Kalupahana, Matteo Castiglioni, Alberto Marchesi, Nicola Gatti

2602.14520 2026-02-17 stat.ML astro-ph.HE

Accelerating Posterior Inference from Pulsar Light Curves via Learned Latent Representations and Local Simulator-Guided Optimization

Farhana Taiyebah, Abu Bucker Siddik, Indronil Bhattacharjee, Diane Oyen, Soumi De, Greg Olmschenk, Constantinos Kalapotharakos

2602.14478 2026-02-17 stat.ML cs.DS cs.LG math.OC

Constrained and Composite Sampling via Proximal Sampler

Thanh Dang, Jiaming Liang

Comments The main paper is 13 pages; the rest are appendices

Abstract

We study two log-concave sampling problems: constrained sampling and composite sampling. First, we consider sampling from a target distribution with density proportional to $\exp(-f(x))$ supported on a convex set $K \subset \mathbb{R}^d$, where $f$ is convex. The main challenge is enforcing feasibility without degrading mixing. Using an epigraph transformation, we reduce this task to sampling from a nearly uniform distribution over a lifted convex set in $\mathbb{R}^{d+1}$. We then solve the lifted problem using a proximal sampler. Assuming only a separation oracle for $K$ and a subgradient oracle for $f$, we develop an implementation of the proximal sampler based on the cutting-plane method and rejection sampling. Unlike existing constrained samplers that rely on projection, reflection, barrier functions, or mirror maps, our approach enforces feasibility using only minimal oracle access, resulting in a practical and unbiased sampler without knowing the geometry of the constraint set. Second, we study composite sampling, where the target is proportional to $\exp(-f(x)-h(x))$ with closed and convex $f$ and $h$. This composite structure is standard in Bayesian inference with $f$ modeling data fidelity and $h$ encoding prior information. We reduce composite sampling via an epigraph lifting of $h$ to constrained sampling in $\mathbb{R}^{d+1}$, which allows direct application of the constrained sampling algorithm developed in the first part. This reduction results in a double epigraph lifting formulation in $\mathbb{R}^{d+2}$, on which we apply a proximal sampler. By keeping $f$ and $h$ separate, we further demonstrate how different combinations of oracle access (such as subgradient and proximal) can be leveraged to construct separation oracles for the lifted problem. For both sampling problems, we establish mixing time bounds measured in Rényi and $\chi^2$ divergences.
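
As a rough illustration of the oracle construction this abstract describes, the sketch below builds a separation oracle for an epigraph-lifted set $\{(x, t) : x \in K,\ f(x) \le t\}$ from a separation oracle for $K$ and a subgradient oracle for $f$. The exact form of the lifted set, the names `lifted_separation_oracle`, `sep_K`, and `subgrad_f`, and the cut convention are assumptions made for this example; it is not the authors' implementation.

```python
import numpy as np

def lifted_separation_oracle(x, t, sep_K, f, subgrad_f):
    """Separation oracle for the lifted set {(z, s): z in K, f(z) <= s}.

    Returns None if (x, t) lies in the set. Otherwise returns (a, b) with
    a @ (z, s) <= b for every point of the set and a @ (x, t) > b.
    """
    cut = sep_K(x)
    if cut is not None:
        # x violates K: reuse the cut for K and ignore the extra coordinate.
        a, b = cut
        return np.append(a, 0.0), float(b)
    fx = f(x)
    if fx <= t:
        return None
    g = subgrad_f(x)
    # Convexity gives s >= f(z) >= f(x) + g @ (z - x) on the lifted set,
    # which rearranges to g @ z - s <= g @ x - f(x).
    return np.append(g, -1.0), float(g @ x - fx)

# Hypothetical instance: K is the unit Euclidean ball, f is the l1 norm.
def sep_unit_ball(x):
    n = np.linalg.norm(x)
    return None if n <= 1.0 else (x / n, 1.0)

f = lambda x: float(np.abs(x).sum())
subgrad_f = np.sign  # a valid subgradient of the l1 norm

x = np.array([0.5, 0.0])
inside = lifted_separation_oracle(x, 1.0, sep_unit_ball, f, subgrad_f)
cut = lifted_separation_oracle(x, 0.2, sep_unit_ball, f, subgrad_f)
```

If the lifted density is taken proportional to $e^{-t}$ on this set, integrating out $t$ gives $\int_{f(x)}^{\infty} e^{-t}\,dt = e^{-f(x)}$, which recovers the original target on $K$. Whether the paper uses exactly this lifted density is not stated in the abstract.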


2602.14472 2026-02-17 math.ST cs.LG math.OC stat.ML stat.TH

Frequentist Regret Analysis of Gaussian Process Thompson Sampling via Fractional Posteriors

Somjit Roy, Prateek Jaiswal, Anirban Bhattacharya, Debdeep Pati, Bani K. Mallick

Comments 34 pages, Submitted

2602.14432 2026-02-17 cs.LG cs.AI stat.ML

S2D: Selective Spectral Decay for Quantization-Friendly Conditioning of Neural Activations

Arnav Chavan, Nahush Lele, Udbhav Bamba, Sankalp Dayal, Aditi Raghunathan, Deepak Gupta

2602.14423 2026-02-17 cs.LG cs.AI stat.ML

The geometry of invariant learning: an information-theoretic analysis of data augmentation and generalization

Abdelali Bouyahia, Frédéric LeBlanc, Mario Marchand

Abstract

Data augmentation is one of the most widely used techniques to improve generalization in modern machine learning, often justified by its ability to promote invariance to label-irrelevant transformations. However, its theoretical role remains only partially understood. In this work, we propose an information-theoretic framework that systematically accounts for the effect of augmentation on generalization and invariance learning. Our approach builds upon mutual information-based bounds, which relate the generalization gap to the amount of information a learning algorithm retains about its training data. We extend this framework by modeling the augmented distribution as a composition of the original data distribution with a distribution over transformations, which naturally induces an orbit-averaged loss function. Under mild sub-Gaussian assumptions on the loss function and the augmentation process, we derive a new generalization bound that decomposes the expected generalization gap into three interpretable terms: (1) a distributional divergence between the original and augmented data, (2) a stability term measuring the algorithm's dependence on training data, and (3) a sensitivity term capturing the effect of augmentation variability. To connect our bounds to the geometry of the augmentation group, we introduce the notion of group diameter, defined as the maximal perturbation that augmentations can induce in the input space. The group diameter provides a unified control parameter that bounds all three terms and highlights an intrinsic trade-off: small diameters preserve data fidelity but offer limited regularization, while large diameters enhance stability at the cost of increased bias and sensitivity. We validate our theoretical bounds with numerical experiments, demonstrating that they reliably track and predict the behavior of the true generalization gap.
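
The two quantities named in this abstract, the orbit-averaged loss and the group diameter, can be made concrete with a small sketch. It assumes a finite set of transformations with uniform weights and measures the diameter empirically over a sample of inputs. The function names and the toy loss are hypothetical, and the paper's formal definitions may differ.

```python
import numpy as np

def orbit_averaged_loss(loss, transforms, x, y):
    """Average the loss over all transformed copies g(x) of one input."""
    return float(np.mean([loss(g(x), y) for g in transforms]))

def empirical_group_diameter(transforms, xs):
    """Largest Euclidean perturbation ||g(x) - x|| over transforms and inputs."""
    return max(float(np.linalg.norm(g(x) - x)) for g in transforms for x in xs)

# Hypothetical augmentation set: identity plus small shifts of every coordinate.
transforms = [lambda x: x, lambda x: x + 0.1, lambda x: x - 0.1]
# Toy loss: squared error of a fixed "sum of features" predictor.
loss = lambda x, y: float((x.sum() - y) ** 2)

x, y = np.array([1.0, 2.0]), 3.0
avg = orbit_averaged_loss(loss, transforms, x, y)
diam = empirical_group_diameter(transforms, [x])
```

In this toy case, enlarging the shift increases both `diam` and the gap between `avg` and the unaugmented loss, which loosely mirrors the trade-off between regularization and bias described in the abstract.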


2602.14414 2026-02-17 stat.ME econ.EM stat.AP

The Role of Measured Covariates in Assessing Sensitivity to Unmeasured Confounding

Abhinandan Dalal, Iris Horng, Yang Feng, Dylan S. Small

2602.14387 2026-02-17 stat.ME

Automatic Variance Adjustment for Small Area Estimation

Jon Wakefield, Jitong Jiang, Yunhan Wu

2602.14349 2026-02-17 stat.AP

Same Prompt, Different Outcomes: Evaluating the Reproducibility of Data Analysis by LLMs

Jiaxin Cui, Rohan Alexander

2602.14284 2026-02-17 stat.OT cs.CY

Benchmarking AI Performance on End-to-End Data Science Projects

Evelyn Hughes, Rohan Alexander

2602.14280 2026-02-17 stat.CO cs.LG

Fast Compute for ML Optimization

Nick Polson, Vadim Sokolov

2602.14244 2026-02-17 stat.ML cs.LG

Federated Ensemble Learning with Progressive Model Personalization

Ala Emrani, Amir Najafi, Abolfazl Motahari

Comments 42 pages

Abstract

Federated Learning provides a privacy-preserving paradigm for distributed learning, but suffers from statistical heterogeneity across clients. Personalized Federated Learning (PFL) mitigates this issue by considering client-specific models. A widely adopted approach in PFL decomposes neural networks into a shared feature extractor and client-specific heads. While effective, this design induces a fundamental tradeoff: deep or expressive shared components hinder personalization, whereas large local heads exacerbate overfitting under limited per-client data. Most existing methods rely on rigid, shallow heads, and therefore fail to navigate this tradeoff in a principled manner. In this work, we propose a boosting-inspired framework that enables a smooth control of this tradeoff. Instead of training a single personalized model, we construct an ensemble of $T$ models for each client. Across boosting iterations, the depth of the personalized component is progressively increased, while its effective complexity is systematically controlled via low-rank factorization or width shrinkage. This design simultaneously limits overfitting and substantially reduces per-client bias by allowing increasingly expressive personalization. We provide theoretical analysis that establishes generalization bounds with favorable dependence on the average local sample size and the total number of clients. Specifically, we prove that the complexity of the shared layers is effectively suppressed, while the dependence on the boosting horizon $T$ is controlled through parameter reduction. Notably, we provide a novel nonlinear generalization guarantee for decoupled PFL models. Extensive experiments on benchmark and real-world datasets (e.g., EMNIST, CIFAR-10/100, and Sent140) demonstrate that the proposed framework consistently outperforms state-of-the-art PFL methods under heterogeneous data distributions.


2602.14206 2026-02-17 math.ST stat.TH

Kernel Estimation Of Chatterjee's Dependence Coefficient

Mona Azadkia, Holger Dette

2602.14203 2026-02-17 stat.AP

Evaluating the Impact of COVID-19 on Transportation Infrastructure Funding

Lu Gao, Pan Lu, Fengxiang Qiao, Joshua Qiang Li, Yunpeng Zhang, Yihao Ren

2602.14198 2026-02-17 stat.AP

Zipf-Mandelbrot Scaling in Korean Court Music: Universal Patterns in Music

Byeongchan Choi, Junwon You, Myung Ock Kim, Jae-Hun Jung

Comments 20 pages, 5 figures, 4 tables

2602.14061 2026-02-17 stat.CO cs.NA math.NA

MPL-HMC: A Tunable Parameterized Leapfrog Framework for Robust Hamiltonian Monte Carlo

Sourabh Bhattacharya

Comments Feedback welcome

2602.14053 2026-02-17 math.ST cs.NA math.NA stat.TH

Mean-Square Convergence of a New Parameterized Leapfrog Scheme for Hamiltonian Systems Driven by Gaussian Process Potentials

Sourabh Bhattacharya

Comments Feedback welcome

2602.14029 2026-02-17 stat.ML cs.LG math.ST stat.TH

Why Self-Training Helps and Hurts: Denoising vs. Signal Forgetting

Mingqi Wu, Archer Y. Yang, Qiang Sun

Comments 8 pages main, 29 pages in total

2602.14020 2026-02-17 stat.ML cs.LG

Computable Bernstein Certificates for Cross-Fitted Clipped Covariance Estimation

Even He, Zaizai Yan

2602.13942 2026-02-17 stat.ML cs.LG

A Theoretical Framework for LLM Fine-tuning Using Early Stopping for Non-random Initialization

Zexuan Sun, Garvesh Raskutti

2602.13935 2026-02-17 cs.AI cs.LG stat.ML

Statistical Early Stopping for Reasoning Models

Yangxinyu Xie, Tao Wang, Soham Mallick, Yan Sun, Georgy Noarov, Mengxin Yu, Tanwi Mallick, Weijie J. Su, Edgar Dobriban

2602.13888 2026-02-17 stat.ME stat.AP stat.CO stat.ML

Mixture-of-experts Wishart model for covariance matrices with an application to Cancer drug screening

The Tien Mai, Zhi Zhao

2602.13872 2026-02-17 stat.ME stat.ML

Predicting fixed-sample test decisions enables anytime-valid inference

Chris Holmes, Stephen Walker

2602.13871 2026-02-17 math.ST cs.IT cs.LG math.IT math.OC stat.AP stat.ML stat.TH

Ensemble-Conditional Gaussian Processes (Ens-CGP): Representation, Geometry, and Inference

Sai Ravela, Jae Deok Kim, Kenneth Gee, Xingjian Yan, Samson Mercier, Lubna Albarghouty, Anamitra Saha

Comments 20 pages. Technical manuscript on representational equivalence between conditional Gaussian inference, quadratic optimization, and RKHS geometry in finite dimensions

2602.13852 2026-02-17 cs.AI stat.AP

Experimentation Accelerator: Interpretable Insights and Creative Recommendations for A/B Testing with Content-Aware ranking

Zhengmian Hu, Lei Shi, Ritwik Sinha, Justin Grover, David Arbour

2602.13804 2026-02-17 cs.AI cs.LG stat.ML

Attention in Constant Time: Vashista Sparse Attention for Long-Context Decoding with Exponential Guarantees

Vashista Nobaub

Comments 22 pages

2602.13729 2026-02-17 math.ST stat.ME stat.TH

Semi-supervised linear regression with missing covariates

Benedict M. Risebrow, Thomas B. Berrett

2602.13722 2026-02-17 econ.EM stat.ME

The Accuracy Smoothness Dilemma in Prediction: a Novel Multivariate M-SSA Forecast Approach

Marc Wildi

2602.12027 2026-02-17 math.ST math.PR stat.TH

General-purpose post-sampling reweighting method for multimodal target measures

Pierre Monmarché

2602.11712 2026-02-17 cs.LG cs.CE nlin.CD physics.data-an stat.ME

Potential-energy gating for robust state estimation in bistable stochastic systems

Luigi Simeone

Comments 20 pages, 8 figures

Abstract

We introduce potential-energy gating, a method for robust state estimation in systems governed by double-well stochastic dynamics. The observation noise covariance of a Bayesian filter is modulated by the local value of a known or assumed potential energy function: observations are trusted when the state is near a potential minimum and progressively discounted as it approaches the barrier separating metastable wells. This physics-based mechanism differs from statistical robust filters, which treat all state-space regions identically, and from constrained filters, which bound states rather than modulating observation trust. The approach is especially relevant in non-ergodic or data-scarce settings where only a single realization is available and statistical methods alone cannot learn the noise structure. We implement gating within Extended, Unscented, Ensemble, and Adaptive Kalman filters and particle filters, requiring only two additional hyperparameters. Monte Carlo benchmarks (100 replications) on a Ginzburg-Landau double-well with 10% outlier contamination show 57-80% RMSE improvement over the standard Extended Kalman Filter, all statistically significant (p < 10^{-15}, Wilcoxon test). A naive topological baseline using only well positions achieves 57%, confirming that the continuous energy landscape adds ~21 percentage points. The method is robust to misspecification: even with 50% parameter errors, improvement never falls below 47%. Comparing externally forced and spontaneous Kramers-type transitions, gating retains 68% improvement under noise-induced transitions whereas the naive baseline degrades to 30%. As an empirical illustration, we apply the framework to Dansgaard-Oeschger events in the NGRIP delta-18O ice-core record, estimating asymmetry gamma = -0.109 (bootstrap 95% CI: [-0.220, -0.011]) and showing that outlier fraction explains 91% of the variance in filter improvement.
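
To make the gating idea in this abstract concrete, here is a minimal scalar sketch: the observation noise variance of a Kalman-style update is inflated as the predicted state moves from a well minimum toward the barrier of a double-well potential. The gating formula, the two hyperparameters `gain` and `power`, and the potential $V(x) = a x^4/4 - b x^2/2$ are assumptions chosen for illustration. The abstract does not give the paper's actual gating function.

```python
import numpy as np

def potential(x, a=1.0, b=1.0):
    """Symmetric double-well potential V(x) = a*x**4/4 - b*x**2/2."""
    return a * x**4 / 4.0 - b * x**2 / 2.0

def gated_noise_variance(x_pred, r0, gain=10.0, power=1.0, a=1.0, b=1.0):
    """Inflate the base variance r0 according to the normalized potential height."""
    v_min = potential(np.sqrt(b / a), a, b)  # wells sit at x = +/- sqrt(b/a)
    v_bar = potential(0.0, a, b)             # barrier top sits at x = 0
    # s is 0 at a well minimum and 1 at the barrier; clipping also maps
    # states far outside the wells to 1.
    s = np.clip((potential(x_pred, a, b) - v_min) / (v_bar - v_min), 0.0, 1.0)
    return float(r0 * (1.0 + gain * s**power))

def gated_update(x_pred, p_pred, y, r0, **gate_kwargs):
    """Scalar Kalman measurement update (identity observation) with gated noise."""
    r = gated_noise_variance(x_pred, r0, **gate_kwargs)
    k = p_pred / (p_pred + r)
    return x_pred + k * (y - x_pred), (1.0 - k) * p_pred, r

near_well = gated_update(1.0, 0.1, 2.0, 0.1)     # state at a well minimum
near_barrier = gated_update(0.0, 0.1, 1.0, 0.1)  # state on the barrier
```

With these settings an observation one unit away from the prediction moves the estimate by 0.5 at the well minimum but by only about 0.083 on the barrier, which is the "trust near minima, discount near the barrier" behaviour the abstract describes.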


2602.11290 2026-02-17 math.ST math.OC stat.TH

Entropic vector quantile regression: Duality and Gaussian case

Kengo Kato, Boyu Wang

Comments 30 pages

2602.07132 2026-02-17 stat.ML cs.LG

Discrete Adjoint Matching

Oswin So, Brian Karrer, Chuchu Fan, Ricky T. Q. Chen, Guan-Horng Liu

Comments ICLR 2026

2602.06797 2026-02-17 stat.ML cs.LG

Optimal Learning-Rate Schedules under Functional Scaling Laws: Power Decay and Warmup-Stable-Decay

Binghui Li, Zilin Wang, Fengling Chen, Shiyang Zhao, Ruiheng Zheng, Lei Wu

2512.18898 2026-02-17 math.ST stat.ME stat.TH

Model-Agnostic Bounds for Augmented Inverse Probability Weighted Estimators' Wald-Confidence Interval Coverage in Randomized Controlled Trials

Hongxiang Qiu

Comments substantially weakened sub-Gaussian assumptions with discussions; highlighted non-asymptotic results in the supplement for better readability

2512.00499 2026-02-17 cs.LG cs.AI stat.ML

ESPO: Entropy Importance Sampling Policy Optimization

Yuepeng Sheng, Yuwei Huang, Shuman Liu, Anxiang Zeng, Haibo Zhang

2511.19628 2026-02-17 stat.ML cs.LG stat.CO

Optimization and Regularization Under Arbitrary Objectives

Jared N. Lakhani, Etienne Pienaar

Comments 74 pages, 29 figures, 16 tables

2511.05128 2026-02-17 econ.EM stat.AP

Do Test Scores Help Teachers Give Better Track Advice to Students? A Principal Stratification Analysis

Andrea Ichino, Fabrizia Mealli, Javier Viviens

2511.00769 2026-02-17 math.PR math.OC stat.CO

Information-theoretic minimax and submodular optimization algorithms for multivariate Markov chains

Zheyuan Lai, Michael C. H. Choi

Comments 34 pages, 6 figures

2510.26046 2026-02-17 stat.ML cs.LG stat.ME

Bias-Corrected Data Synthesis for Imbalanced Learning

Pengfei Lyu, Zhengchi Ma, Linjun Zhang, Anru R. Zhang

Comments 41 pages, 4 figures, includes proofs and appendix

2510.18259 2026-02-17 stat.ML cs.AI cs.LG

Learning under Quantization for High-Dimensional Linear Regression

Dechen Zhang, Junwei Su, Difan Zou

2510.04455 2026-02-17 math.OC cs.AI cs.LG math.ST stat.ML stat.TH

Inverse Mixed-Integer Programming: Learning Constraints then Objective Functions

Akira Kitaoka

Comments 40 pages

2510.02983 2026-02-17 cs.DS cs.LG math.OC stat.ML

Oracle-based Uniform Sampling from Convex Bodies

Thanh Dang, Jiaming Liang

Comments 32 pages

2509.23437 2026-02-17 cs.LG stat.ML

Better Hessians Matter: Studying the Impact of Curvature Approximations in Influence Functions

Steve Hong, Runa Eschenhagen, Bruno Mlodozeniec, Richard Turner

2509.22794 2026-02-17 stat.ML cs.AI cs.LG econ.EM math.ST stat.TH

Differentially Private Two-Stage Gradient Descent for Instrumental Variable Regression

Haodong Liang, Yanhao Jin, Krishnakumar Balasubramanian, Lifeng Lai

Comments 37 pages, 12 figures

2509.19189 2026-02-17 cs.LG stat.ML

Functional Scaling Laws in Kernel Regression: Loss Dynamics and Learning Rate Schedules

Binghui Li, Fengling Chen, Zixun Huang, Lean Wang, Lei Wu

Comments 60 pages, accepted by NeurIPS 2025 as a spotlight paper

2509.02784 2026-02-17 stat.AP stat.ML

A Composite-Loss Graph Neural Network for the Multivariate Post-Processing of Ensemble Weather Forecasts

Mária Lakatos

Comments 30 pages, 16 figures, 3 tables

Details

DOI: 10.1002/qj.70119
Journal ref: Q. J. R. Meteorol. Soc., Early View (2026)

Abstract

Ensemble forecasting systems have advanced meteorology by providing probabilistic estimates of future states. Nonetheless, systematic biases often persist, making statistical post-processing essential. Traditional parametric post-processing techniques and machine learning-based methods can produce calibrated predictive distributions at specific locations and lead times, yet often struggle to capture dependencies across forecast dimensions. To address this, multivariate post-processing methods, such as ensemble copula coupling and the Schaake shuffle, are widely applied in a second step to restore realistic inter-variable or spatio-temporal dependencies. The aim of this study is the multivariate post-processing of ensemble forecasts using a graph neural network (dualGNN) trained with a composite loss function that combines the energy score (ES) and the variogram score (VS). The method is evaluated on two datasets: WRF-based solar irradiance forecasts over northern Chile and ECMWF visibility forecasts for Central Europe. The dualGNN consistently outperforms all empirical copula-based post-processed forecasts and shows significant improvements compared to graph neural networks trained solely on either the continuous ranked probability score or the ES, according to the evaluated multivariate verification metrics. Furthermore, for the WRF forecasts, the rank-order structure of the dualGNN forecasts captures valuable dependency information, enabling a more effective restoration of spatial relationships than either the raw numerical weather prediction ensemble or historical observational rank structures. Notably, incorporating VS into the loss function improved the univariate performance for both target variables compared to training on ES alone. Moreover, for the visibility forecasts, the ES-VS combination even outperformed the strongest calibrated reference in terms of univariate performance.
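
For readers unfamiliar with the two scores combined in this loss, the sketch below computes the usual ensemble estimators of the energy score and the variogram score with NumPy and adds them with a weight. The weight `lam`, the variogram order `p`, the uniform pair weights, and the additive form of `composite_loss` are assumptions for illustration. The abstract does not specify how the dualGNN loss combines the two scores.

```python
import numpy as np

def energy_score(ens, obs):
    """Energy score for an ensemble `ens` of shape (m, d) and observation `obs` of shape (d,)."""
    m = ens.shape[0]
    # Mean distance between ensemble members and the observation.
    to_obs = np.linalg.norm(ens - obs, axis=1).mean()
    # Sum of pairwise distances between ensemble members.
    pairwise = np.linalg.norm(ens[:, None, :] - ens[None, :, :], axis=2)
    return float(to_obs - pairwise.sum() / (2.0 * m * m))

def variogram_score(ens, obs, p=0.5):
    """Variogram score of order p with uniform weights over all component pairs."""
    obs_diff = np.abs(obs[:, None] - obs[None, :]) ** p
    ens_diff = (np.abs(ens[:, :, None] - ens[:, None, :]) ** p).mean(axis=0)
    return float(((obs_diff - ens_diff) ** 2).sum())

def composite_loss(ens, obs, lam=1.0, p=0.5):
    """Hypothetical weighted sum ES + lam * VS."""
    return energy_score(ens, obs) + lam * variogram_score(ens, obs, p)

obs = np.array([3.0, 4.0])
perfect = np.tile(obs, (5, 1))     # every member equals the observation
collapsed = np.array([[0.0, 0.0]])  # a single member at the origin

loss_perfect = composite_loss(perfect, obs)
loss_collapsed = composite_loss(collapsed, obs)
```

Both scores are negatively oriented, so lower is better. The energy score mainly penalizes errors in location and spread, while the variogram score compares pairwise differences between components and is therefore more sensitive to the dependence structure.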


2509.00955 2026-02-17 cs.LG cs.AI stat.ML

ART: Adaptive Resampling-based Training for Imbalanced Classification

Arjun Basandrai, Shourya Jain, K. Ilanthenral

Comments Submitted to MLWA

2508.10342 2026-02-17 stat.ME

Identifying Unmeasured Confounders in Panel Causal Models: A Two-Stage LM-Wald Approach

Bang Quan Zheng

2507.15471 2026-02-17 stat.ME math.ST stat.TH

Multiple Hypothesis Testing To Estimate The Number Of Communities in Stochastic Block Models

Chetkar Jha, Mingyao Li, Ian Barnett

Comments This article is a significantly improved version of the previous arXiv submission arXiv:2201.04722

2507.08838 2026-02-17 cs.LG cs.AI stat.ML

wd1: Weighted Policy Optimization for Reasoning in Diffusion Language Models

Xiaohang Tang, Rares Dolga, Sangwoong Yoon, Ilija Bogunovic

Comments Accepted to ICLR 2026

2506.04749 2026-02-17 stat.CO stat.ME stat.ML

Variational Transdimensional Inference

Laurence Davies, Dan Mackinlay, Rafael Oliveira, Scott A. Sisson

Comments 35 pages, 11 figures

2505.20478 2026-02-17 stat.ME

Iterative Exploration-Driven Sparse SDP Clustering via Thompson Sampling

Jongmin Mun, Paromita Dubey, Yingying Fan

Comments 58 pages, 2 figures, 2 tables, 4 algorithms

2505.19712 2026-02-17 cs.LG math.PR stat.ML

On the Relation between Rectified Flows and Optimal Transport

Johannes Hertrich, Antonin Chambolle, Julie Delon

Comments Accepted for NeurIPS 2025

2503.09912 2026-02-17 stat.AP

Beta-Generalized Lindley Distribution: A Novel Probability Model for Wind Speed

Tiantian Yang, Dongwei Chen

Comments 18 pages, 7 figures, 5 tables

2410.18784 2026-02-17 cs.LG cs.NA eess.SP math.NA math.ST stat.ML stat.TH

Denoising diffusion probabilistic models are optimally adaptive to unknown low dimensionality

Zhihan Huang, Yuting Wei, Yuxin Chen

Comments Accepted to Mathematics of Operations Research

2410.13420 2026-02-17 stat.ME math.ST stat.TH

Spatial Proportional Hazards Model with Differential Regularization

Lorenzo Tedesco, Francesco Finazzi

2410.03919 2026-02-17 cs.LG stat.ML

Online Posterior Sampling with a Diffusion Prior

Branislav Kveton, Boris Oreshkin, Youngsuk Park, Aniket Deshmukh, Rui Song

Comments Advances in Neural Information Processing Systems 37

2409.15307 2026-02-17 stat.CO physics.comp-ph

An ILUES-based adaptive Gaussian process method for multimodal Bayesian inverse problems

Zhihang Xu, Xiaoyu Zhu, Daoji Li, Qifeng Liao

2406.00866 2026-02-17 stat.ME math.ST stat.AP stat.TH

Planning for gold: Hypothesis screening with split samples for valid powerful testing in matched observational studies

William Bekerman, Abhinandan Dalal, Carlo del Ninno, Dylan S. Small

Comments To be published in Biometrika

2402.11219 2026-02-17 math.ST stat.ME stat.TH

Estimators for multivariate allometric regression model

Koji Tsukuda, Shun Matsuura

Comments 20 pages

2402.02644 2026-02-17 cs.LG stat.ML

Permutation-based Inference for Variational Learning of Directed Acyclic Graphs

Edwin V. Bonilla, Pantelis Elinas, He Zhao, Maurizio Filippone, Vassili Kitsios, Terry O'Kane

2310.11736 2026-02-17 math.ST math.OC stat.ML stat.TH

A Theory of Feature Learning in Kernel Models

Yunlu Chen, Yang Li, Keli Liu, Feng Ruan

2308.13036 2026-02-17 math.ST stat.TH

Robust Signal Detection with Quadratically Convex Orthosymmetric Constraints

Yikun Li, Matey Neykov

Comments 80 pages, 7 figures

2306.14297 2026-02-17 stat.ME cs.LG

Inference for relative sparsity

Samuel J. Weisenthal, Sally W. Thurston, Ashkan Ertefaie

Comments 66 pages, 3 figures

2305.13842 2026-02-17 math.ST stat.TH

Asymptotic Properties of Multi-Treatment Covariate Adaptive Randomization Procedures for Balancing Observed and Unobserved Covariates

Li-Xin Zhang

Comments 102 pages

2303.08218 2026-02-17 stat.ME

Spatial causal inference in the presence of unmeasured confounding and interference

Georgia Papadogeorgou, Srijata Samanta

2302.12093 2026-02-17 eess.SY cs.SY math.OC stat.ME

Experimenting under Stochastic Congestion

Shuangning Li, Ramesh Johari, Xu Kuang, Stefan Wager

2211.13478 2026-02-17 stat.ME

A New Spatio-Temporal Model Exploiting Hamiltonian Equations

Satyaki Mazumder, Sayantan Banerjee, Sourabh Bhattacharya

Comments Another updated version, more streamlined and establishing deep theoretical connections with our two new companion papers

2206.03166 2026-02-17 math.ST cs.DM cs.MS math.PR stat.ME stat.TH

A novel statistical approach for two-sample testing based on the overlap coefficient

Atsushi Komaba, Hisashi Johno, Kazunori Nakamoto

Comments 36 pages, 5 figures. Accepted for publication in Journal of Mathematical Sciences, the University of Tokyo

2112.06251 2026-02-17 cs.LG stat.ML

Learning with Subset Stacking

Ş. İlker Birbil, Sinan Yıldırım, Samet Çopur, M. Hakan Akyüz

Comments 26 pages, 8 figures, 4 tables. Code available

2106.04096 2026-02-17 cs.LG math.OC stat.ML

Linear Convergence of Entropy-Regularized Natural Policy Gradient with Linear Function Approximation

Semih Cayci, Niao He, R. Srikant

2103.14203 2026-02-17 stat.ML cs.LG

Deep Two-Way Matrix Reordering for Relational Data Analysis

Chihiro Watanabe, Taiji Suzuki

2602.13635 2026-02-17 stat.ME

Backward Smoothing versus Fixed-Lag Smoothing in Particle Filters

Genshiro Kitagawa

Comments 17 pages, 5 tables, 7 figures

2602.13619 2026-02-17 stat.ML cs.IT cs.LG math.IT stat.ME

Locally Private Parametric Methods for Change-Point Detection

Anuj Kumar Yadav, Cemre Cadir, Yanina Shkel, Michael Gastpar

Comments 43 pages, 20 figures

2602.13565 2026-02-17 math.ST cs.NA math.NA math.PR stat.OT stat.TH

An Improved Milstein Method for the Numerical Solution of Multidimensional Stochastic Differential Equations

Paromita Banerjee, Anirban Mondal

2602.13538 2026-02-17 stat.ME

Empirical Bayes data integration for multi-response regression

Antik Chakraborty, Fei Xue

Comments To appear in Statistica Sinica

2602.13533 2026-02-17 stat.ME math.ST stat.AP stat.TH

Estimation and Inference of the Win Ratio for Two Hierarchical Endpoints Subject to Censoring and Missing Data

Yi Liu, Huiman Barnhart, Sean O'Brien, Yuliya Lokhnygina, Roland A. Matsouaka

2602.13518 2026-02-17 stat.ME

Towards Semiparametric Bandwidth Selectors for Kernel Density Estimators

Nils Lid Hjort

Comments 26 pages, no figures; technical report from 1999, needing additional numerical work to become a full paper

2602.13475 2026-02-17 stat.ME stat.AP stat.ML

Efficient and Debiased Learning of Average Hazard Under Non-Proportional Hazards

Xiang Meng, Lu Tian, Kenneth Kehl, Hajime Uno

Comments Main paper: 24 pages and 2 figures; Reference and Supplement: 22 pages and 8 Figures

2602.13465 2026-02-17 math.ST stat.TH

Intrinsic dimension concentration inequalities for self-adjoint operators

Diego Martinez-Taboada, Aaditya Ramdas

2602.13450 2026-02-17 econ.EM stat.AP stat.ME

Inference From Random Restarts

Moeen Nehzati, Diego Cussen

Abstract

Algorithms for computing equilibria, optima, and fixed points in nonconvex problems often depend sensitively on practitioner-chosen initial conditions. When uniqueness of a solution is of interest, a common heuristic is to run such algorithms from many randomly selected initial conditions and to interpret repeated convergence to the same output as evidence of a unique solution or a dominant basin of attraction. Despite its widespread use, this practice lacks a formal inferential foundation. We provide a simple probabilistic framework for interpreting such numerical evidence. First, we give sufficient conditions under which an algorithm's terminal output is a measurable function of its initial condition, allowing probabilistic reasoning over outcomes. Second, we provide sufficient conditions ensuring that an algorithm admits only finitely many possible terminal outcomes. While these conditions may be difficult to verify on a case-by-case basis, we give simple sufficient conditions for broad classes of problems under which almost all instances admit only finitely many outcomes (in the sense of prevalence). Standard algorithms such as gradient descent and damped fixed-point iteration applied to sufficiently smooth functions satisfy these conditions. Within this framework, repeated solver runs correspond to independent samples from the induced distribution over outcomes. We adopt a Bayesian approach to infer basin sizes and the probability of solution uniqueness from repeated identical outputs, and we establish convergence rates for the resulting posterior beliefs. Finally, we apply our framework to settings in the existing industrial organization literature, where random-restart heuristics are used. Our results formalize and qualify these arguments, clarifying when repeated convergence provides meaningful evidence for uniqueness and when it does not.

2602.13442 2026-02-17 stat.ME stat.ML

Measuring Neural Network Complexity via Effective Degrees of Freedom

Jia Zhou, Douglas Landsittel

Comments 20 pages, 3 figures, 6 tables

2602.13413 2026-02-17 cs.LG cs.NA math.NA math.OC stat.ML

Why is Normalization Preferred? A Worst-Case Complexity Theory for Stochastically Preconditioned SGD under Heavy-Tailed Noise

Yuchen Fang, James Demmel, Javad Lavaei

2602.13362 2026-02-17 stat.ML cs.AI cs.LG

Nonparametric Distribution Regression Re-calibration

Ádám Jung, Domokos M. Kelen, András A. Benczúr

2602.07046 2026-02-17 q-fin.ST q-fin.CP stat.AP

Same Returns, Different Risks: How Cryptocurrency Markets Process Infrastructure vs Regulatory Shocks

Murad Farzulla

Comments 24 pages, 7 tables. JEL: C22, C58, G12, G14. Code at https://github.com/studiofarzulla/sentiment-without-structure

2510.26090 2026-02-17 stat.ME

Poisson process factorization for mutational signature analysis with genomic covariates

Alessandro Zito, Giovanni Parmigiani, Jeffrey W. Miller

2510.10854 2026-02-17 cs.LG cs.AI stat.ML

Discrete State Diffusion Models: A Sample Complexity Perspective

Aadithya Srikanth, Mudit Gaur, Vaneet Aggarwal

2509.20993 2026-02-17 cs.LG cs.DS stat.ML

Learning the Inverse Temperature of Ising Models under Hard Constraints using One Sample

Rohan Chauhan, Ioannis Panageas

Comments Accepted to Appear in ICLR '26

2502.12581 2026-02-17 stat.ML cs.AI cs.LG

The Majority Vote Paradigm Shift: When Popular Meets Optimal

Antonio Purificato, Maria Sofia Bucarelli, Anil Kumar Nelakanti, Andrea Bacciu, Fabrizio Silvestri, Amin Mantrach

Comments 33 pages, 7 figures

2502.01010 2026-02-17 stat.ME

Sequential Change Detection in Correlation Structures with Window-Limited Statistics

Jie Gao, Liyan Xie, Zhaoyuan Li

2501.16178 2026-02-17 cs.LG stat.ML

SWIFT: Mapping Sub-series with Wavelet Decomposition Improves Time Series Forecasting

Wenxuan Xie, Fanpu Cao

2501.01696 2026-02-17 stat.ML cs.IT cs.LG math.IT

Guaranteed Nonconvex Low-Rank Tensor Estimation via Scaled Gradient Descent

Tong Wu

Comments This paper has been accepted for publication in the Journal of Machine Learning Research

2411.01629 2026-02-17 stat.ML cs.LG math.OC math.ST stat.TH

Denoising Diffusions with Optimal Transport: Localization, Curvature, and Multi-Scale Complexity

Tengyuan Liang, Kulunu Dharmakeerthi, Takuya Koriyama

Comments 30 pages, 11 figures

Details

Journal ref: Transactions on Machine Learning Research, 2026

Abstract

Adding noise is easy; what about denoising? Diffusion is easy; what about reverting a diffusion? Diffusion-based generative models aim to denoise a Langevin diffusion chain, moving from a log-concave equilibrium measure $ν$, say an isotropic Gaussian, back to a complex, possibly non-log-concave initial measure $μ$. The score function performs denoising, moving backward in time, and predicting the conditional mean of the past location given the current one. We show that score denoising is the optimal backward map in transportation cost. What is its localization uncertainty? We show that the curvature function determines this localization uncertainty, measured as the conditional variance of the past location given the current. We study in this paper the effectiveness of the diffuse-then-denoise process: the contraction of the forward diffusion chain, offset by the possible expansion of the backward denoising chain, governs the denoising difficulty. For any initial measure $μ$, we prove that this offset net contraction at time $t$ is characterized by the curvature complexity of a smoothed $μ$ at a specific signal-to-noise ratio (SNR) scale $r(t)$. We discover that the multi-scale curvature complexity collectively determines the difficulty of the denoising chain. Our multi-scale complexity quantifies a fine-grained notion of average-case curvature instead of the worst-case. Curiously, it depends on an integrated tail function, measuring the relative mass of locations with positive curvature versus those with negative curvature; denoising at a specific SNR scale is easy if such an integrated tail is light. We conclude with several non-log-concave examples to demonstrate how the multi-scale complexity probes the bottleneck SNR for the diffuse-then-denoise process.

2407.13010 2026-02-17 cs.LG cs.CE physics.comp-ph stat.ML

A Resolution Independent Neural Operator

Bahador Bahmani, Somdatta Goswami, Ioannis G. Kevrekidis, Michael D. Shields

Details

DOI: 10.1016/j.jcp.2025.114233
Journal ref: Journal of Computational Physics, 539, 2025, 114233

Abstract

The Deep Operator Network (DeepONet) is a powerful neural operator architecture that uses two neural networks to map between infinite-dimensional function spaces. This architecture allows for the evaluation of the solution field at any location within the domain but requires input functions to be discretized at identical locations, limiting practical applications. We introduce a general framework for operator learning from input-output data with arbitrary sensor locations and counts. This begins by introducing a resolution-independent DeepONet (RI-DeepONet), which handles input functions discretized arbitrarily but sufficiently finely. To achieve this, we propose two dictionary learning algorithms that adaptively learn continuous basis functions, parameterized as implicit neural representations (INRs), from correlated signals on arbitrary point clouds. These basis functions project input function data onto a finite-dimensional embedding space, making it compatible with DeepONet without architectural changes. We specifically use sinusoidal representation networks (SIRENs) as trainable INR basis functions. Similarly, the dictionary learning algorithms identify basis functions for output data, defining a new neural operator architecture: the Resolution Independent Neural Operator (RINO). In RINO, the operator learning task reduces to mapping coefficients of input basis functions to output basis functions. We demonstrate RINO's robustness and applicability in handling arbitrarily sampled input and output functions during both training and inference through several numerical examples.

2209.05894 2026-02-17 math.ST stat.ME stat.TH

Nonparametric estimation of trawl processes: Theory and applications

Orimar Sauri, Almut E. D. Veraart