arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.04829 2026-04-07 stat.ME cs.LG stat.ML

A Robust SINDy Autoencoder for Noisy Dynamical System Identification

Kairui Ding

Comments 27 pages

详情

英文摘要

Sparse identification of nonlinear dynamics (SINDy) has been widely used to discover the governing equations of a dynamical system from data. It uses sparse regression techniques to identify parsimonious models of unknown systems from a library of candidate functions. Therefore, it relies on the assumption that the dynamics are sparsely represented in the coordinate system used. To address this limitation, one seeks a coordinate transformation that provides reduced coordinates capable of reconstructing the original system. Recently, SINDy autoencoders have extended this idea by combining sparse model discovery with autoencoder architectures to learn simplified latent coordinates together with parsimonious governing equations. A central challenge in this framework is robustness to measurement error. Inspired by noise-separating neural network structures, we incorporate a noise-separation module into the SINDy autoencoder architecture, thereby improving robustness and enabling more reliable identification of noisy dynamical systems. Numerical experiments on the Lorenz system show that the proposed method recovers interpretable latent dynamics and accurately estimates the measurement noise from noisy observations.

URL PDF HTML ☆

赞 0 踩 0

2604.04823 2026-04-07 math.PR math.ST stat.TH

Rapid convergence of tempering chains to multimodal Gibbs measures

Seungjae Son

2604.04807 2026-04-07 stat.ME

Rank-Based Sparse Regression in Principal Components Space under Measurement Error

Long Feng, Xiaoyi Wang, Le Zhou

2604.04802 2026-04-07 cs.IT cs.LG eess.SP math.IT math.PR stat.ML

Partially deterministic sampling for compressed sensing with denoising guarantees

Yaniv Plan, Matthew S. Scott, Ozgur Yilmaz

2604.04785 2026-04-07 math.ST stat.ME stat.TH

High Dimensional Bootstrap and Asymptotic Expansion for the $k$-th Largest Coordinate

Long Feng

2604.04755 2026-04-07 stat.ME

Active Sequential Signal Detection with Asynchronous Decisions

Yiming Xing, Georgios Fellouris

Comments 13 pages, 3 figures

2604.04726 2026-04-07 stat.ML cs.LG eess.SP

A Muon-Accelerated Algorithm for Low Separation Rank Tensor Generalized Linear Models

Xiao Liang, Shuang Li

2604.04717 2026-04-07 cs.LG cond-mat.mtrl-sci cs.AI stat.ML

The Infinite-Dimensional Nature of Spectroscopy and Why Models Succeed, Fail, and Mislead

Umberto Michelucci, Francesca Venturini

2604.04673 2026-04-07 math.ST cs.LG stat.ML stat.TH

Minimaxity and Admissibility of Bayesian Neural Networks

Daniel Andrew Coulson, Martin T. Wells

Comments 95 pages and 6 figures

2604.04638 2026-04-07 math.ST stat.TH

Joint Estimation in Potts Model

Somabha Mukherjee, Sumit Mukherjee, Sayar Karmakar

Comments 60 pages, 1 figure

2604.04588 2026-04-07 stat.ML cs.IT cs.LG math.IT math.OC math.ST stat.TH

Noisy Nonreciprocal Pairwise Comparisons: Scale Variation, Noise Calibration, and Admissible Ranking Regions

Jean-Pierre Magnot

2604.04529 2026-04-07 stat.ME econ.EM

Dynamic Factor Stochastic Volatility-in-Mean VAR for Large Macroeconomic Panels

Daichi Hiraki, Siddhartha Chib, Yasuhiro Omori

Comments 72 pages, 27 figures, 22 tables

2604.04517 2026-04-07 stat.ME econ.EM stat.CO

Unified Mixture Sampler for State-Space Models: Application to Stochastic Conditional Duration Models

Daichi Hiraki, Yasuhiro Omori

Comments 15 pages, 2 figures, 6 tables

2604.04431 2026-04-07 stat.CO

iLBA: An R package for confidentially disseminating aggregated frequency tables

Jeehyun Hwang, Dongsun Yoon, Sungkyu Jung, Min-Jeong Park, Inkwon Yeo

2604.04410 2026-04-07 cs.LG cs.AI cs.CL stat.ML

Relative Density Ratio Optimization for Stable and Statistically Consistent Model Alignment

Hiroshi Takahashi, Tomoharu Iwata, Atsutoshi Kumagai, Sekitoshi Kanai, Masanori Yamada, Kosuke Nishida, Kazutoshi Shinoda

Comments Code is available at https://github.com/takahashihiroshi/rdro

2604.04365 2026-04-07 math.ST stat.ML stat.TH

Attributed Network Alignment: Statistical Limits and Efficient Algorithm

Dong Huang, Chenyang Tian, Pengkun Yang

Comments 53 pages, 8 figures

2604.04342 2026-04-07 cs.LG stat.ML

Generative models for decision-making under distributional shift

Xiuyuan Cheng, Yunqin Zhu, Yao Xie

Comments Under review for INFORMS TutORials in Operations Research, 2026

2604.04315 2026-04-07 stat.ME stat.CO

Mean--Variance Risk-Aware Bayesian Optimal Experimental Design for Nonlinear Models

Wanggang Shen, Xun Huan

Comments 36 pages, 31 figures

2604.04302 2026-04-07 stat.ME cs.LG

CavMerge: Merging K-means Based on Local Log-Concavity

Zhili Qiao, Wangqian Ju, Peng Liu

2604.04294 2026-04-07 stat.ME stat.CO

Simulated Annealing for Model-Robust Partial Profile Choice Designs in Healthcare Preference Studies

Yicheng Mao, Roselinde Kessels

2604.04278 2026-04-07 stat.ME math.ST stat.TH

Efficient estimation of relative risk, odds ratio and their logarithms for rare events

Luis Mendo

Comments 28 pages, 9 figures

2604.04274 2026-04-07 cs.AI cs.CE stat.AP

InferenceEvolve: Towards Automated Causal Effect Estimators through Self-Evolving AI

Can Wang, Hongyu Zhao, Yiqun Chen

2604.04272 2026-04-07 math.ST stat.TH

Theoretical Foundations of Principal Manifold Estimation with Non-Euclidean Templates

Kun Meng, Christopher Perez

Comments 111 pages

2604.04264 2026-04-07 stat.ML cs.IT cs.LG eess.SP math.IT stat.AP

Avoiding Non-Integrable Beliefs in Expectation Propagation

Zilu Zhao, Jichao Chen, Dirk Slock

2604.04228 2026-04-07 math.ST cs.DS stat.ML stat.TH

Robust Regression with Adaptive Contamination in Response: Optimal Rates and Computational Barriers

Ilias Diakonikolas, Chao Gao, Daniel M. Kane, Ankit Pensia, Dong Xie

2604.04218 2026-04-07 stat.ML cs.LG math.ST stat.TH

Sharp asymptotic theory for Q-learning with LDTZ learning rate and its generalization

Soham Bonnerjee, Zhipeng Lou, Wei Biao Wu

详情

Journal ref: ICLR 2026, Main Conference Track, Poster

英文摘要

Despite the sustained popularity of Q-learning as a practical tool for policy determination, a majority of relevant theoretical literature deals with either constant ($η_{t}\equiv η$) or polynomially decaying ($η_{t} = ηt^{-α}$) learning schedules. However, it is well known that these choices suffer from either persistent bias or prohibitively slow convergence. In contrast, the recently proposed linear decay to zero (\texttt{LD2Z}: $η_{t,n}=η(1-t/n)$) schedule has shown appreciable empirical performance, but its theoretical and statistical properties remain largely unexplored, especially in the Q-learning setting. We address this gap in the literature by first considering a general class of power-law decay to zero (\texttt{PD2Z}-$ν$: $η_{t,n}=η(1-t/n)^ν$). Proceeding step-by-step, we present a sharp non-asymptotic error bound for Q-learning with \texttt{PD2Z}-$ν$ schedule, which then is used to derive a central limit theory for a new \textit{tail} Polyak-Ruppert averaging estimator. Finally, we also provide a novel time-uniform Gaussian approximation (also known as \textit{strong invariance principle}) for the partial sum process of Q-learning iterates, which facilitates bootstrap-based inference. All our theoretical results are complemented by extensive numerical experiments. Beyond being new theoretical and statistical contributions to the Q-learning literature, our results definitively establish that \texttt{LD2Z} and in general \texttt{PD2Z}-$ν$ achieve a best-of-both-worlds property: they inherit the rapid decay from initialization (characteristic of constant step-sizes) while retaining the asymptotic convergence guarantees (characteristic of polynomially decaying schedules). This dual advantage explains the empirical success of \texttt{LD2Z} while providing practical guidelines for inference through our results.

URL PDF HTML ☆

赞 0 踩 0

2604.04181 2026-04-07 stat.ME

Variance Reduction Methods for Dirichlet Expectations

Ayeong Lee

2604.04156 2026-04-07 stat.AP

Two-Sample Testing for Multivariate Cross-Correlation Functions with Applications to Gut-Brain Reward Learning

Bhaskar Ray, Tùng Bùi, William Matthew Howe, Srijan Sengupta

2604.04155 2026-04-07 cs.LG cs.IT math.IT q-bio.QM stat.ML

The Geometric Alignment Tax: Tokenization vs. Continuous Geometry in Scientific Foundation Models

Prashant C. Raju

2604.04118 2026-04-07 math.ST stat.TH

Heavy Tailed Homogeneous Structural Causal Models

Vishal Routh, Shuyang Bai

2604.00848 2026-04-07 stat.OT math.ST stat.ME stat.ML stat.TH

Debiased Estimators in High-Dimensional Regression: A Review and Replication of Javanmard and Montanari (2014)

Benjamin Smith

2604.00220 2026-04-07 stat.ME stat.AP

Two Sample Test for Eigendecompositions of Functional Data

Angel Garcia de la Garza, Britton Sauerbrei, Jeff Goldsmith

2603.28532 2026-04-07 cs.LG cs.AI stat.AP

Detecting low left ventricular ejection fraction from ECG using an interpretable and scalable predictor-driven framework

Ya Zhou, Tianxiang Hao, Ziyi Cai, Haojie Zhu, Kejun He, Jia Liu, Xiaohan Fan, Jing Yuan

Comments This version includes minor typographical corrections. The results and conclusions remain unchanged

2603.27323 2026-04-07 math.ST stat.TH

Property Of The Beta Modified Weibull Distribution With Six Parameters

Didier Alain Njamen Njomen, Fidel Djongreba Ndikwa

Comments 13 pages, 4 figures, 1 table. Accepted paper in International Journal of Applied Mathematics

2603.22594 2026-04-07 stat.ME

Making Effective Statistical Inferences: From Significance Testing to the Open Science Inference Ecosystem (2016-2026)

Aswini Kumar Patra

Comments 23 pages, 1 Figure, 3 tables

2603.09033 2026-04-07 q-bio.QM math.ST stat.TH

Sequential learning theory for Markov genealogy processes

David J Pascall

2602.11333 2026-04-07 econ.EM stat.ML

Cross-Fitting-Free Debiased Machine Learning with Multiway Dependence

Kaicheng Chen, Harold D. Chiang

Comments This paper supersedes the earlier manuscript "Maximal inequalities for separately exchangeable empirical processes" (arXiv:2502.11432) by Harold D. Chiang

2602.07841 2026-04-07 econ.EM q-fin.ST stat.AP

A Nontrivial Upper Bound on the Out-of-Sample $R^2$ in Return Forecasting

Cheng Zhang

2512.24521 2026-04-07 stat.ME cs.HC stat.AP

Power Analysis is Essential: High-Powered Tests Suggest Minimal to No Effect of Rounded Shapes on Click-Through Rates

Ron Kohavi, Jakub Linowski, Lukas Vermeer, Fabrice Boisseranc, Joachim Furuseth, Andrew Gelman, Guido Imbens, Ravikiran Rajagopal

Comments 34 pages, 9 figures

2512.10537 2026-04-07 stat.ME stat.CO

A Bayes-Motivated Quadratic-Form Test for High-Dimensional Mean Testing

Daojiang He, Suren Xu, Jing Zhou

2511.20985 2026-04-07 stat.ME

Two-stage Estimation for Causal Inference Involving a Semi-continuous Exposure

Xiaoya Wang, Richard J. Cook, Yeying Zhu, Tugba Akkaya-Hocagil, R. Colin Carter, Sandra W. Jacobson, Joseph L. Jacobson, Louise M. Ryan

2511.15453 2026-04-07 stat.ME

Testing Conditional Independence via the Spectral Generalized Covariance Measure: Beyond Euclidean Data

Ryunosuke Miyazaki, Yoshimasa Uematsu

2511.09216 2026-04-07 cs.LG q-bio.QM stat.ML

Controllable protein design with particle-based Feynman-Kac steering

Erik Hartman, Jonas Wallin, Johan Malmström, Jimmy Olsson

Comments In version 2 we added an experiment on improving designability through steering towards lower delta G

2510.23448 2026-04-07 cs.LG stat.ML

An Information-Theoretic Analysis of OOD Generalization in Meta-Reinforcement Learning

Xingtu Liu

2508.10053 2026-04-07 cs.LG stat.ML

xRFM: Accurate, scalable, and interpretable feature learning models for tabular data

Daniel Beaglehole, David Holzmüller, Adityanarayanan Radhakrishnan, Mikhail Belkin

2507.22207 2026-04-07 cond-mat.dis-nn cs.LG physics.data-an stat.ML

Better Together: Cross and Joint Covariances Enhance Signal Detectability in Undersampled Data

Arabind Swain, Sean Alexander Ridout, Ilya Nemenman

2506.21527 2026-04-07 math.ST math.PR stat.TH

Asymptotic Inference for Exchangeable Gibbs Partitions

Takuya Koriyama

Comments 40 pages, 3 figures. We have updated numerical simulations and added a rigorous proposition explaining why the uniform CI and local CI complement each other

2505.21972 2026-04-07 cs.LG cs.AI stat.ML

LLMs Judging LLMs: A Simplex Perspective

Patrick Vossler, Fan Xia, Yifan Mai, Adarsh Subbaswamy, Jean Feng

Comments Accepted at AISTATS 2026

2505.15443 2026-04-07 cs.CL stat.ML

ALIEN: Aligned Entropy Head for Improving Uncertainty Estimation of LLMs

Artem Zabolotnyi, Roman Makarov, Mile Mitrovic, Polina Proskura, Oleg Travkin, Roman Alferov, Alexey Zaytsev

Comments 16 pages, 2 figures

2504.18743 2026-04-07 cs.LG math.PR stat.ML

From Set Convergence to Pointwise Convergence: Finite-Time Guarantees for Average-Reward Q-Learning with Adaptive Stepsizes

Zaiwei Chen, Phalguni Nanda

Comments 65 pages and 6 figures

2504.17104 2026-04-07 stat.ME stat.AP

Target trial emulation without matching: a more efficient approach for evaluating vaccine effectiveness using observational data

Emily Wu, Elizabeth Rogawski McQuade, Mats Stensrud, Razieh Nabi, David Benkeser

Comments 24 pages, 5 figures

2504.14795 2026-04-07 eess.IV cs.CV cs.LG stat.ML

A Bayesian Approach to Segmentation with Noisy Labels via Spatially Correlated Distributions

Ryu Tadokoro, Tsukasa Takagi, Shin-ichi Maeda

详情

Journal ref: Transactions on Machine Learning Research (TMLR) , 2026

英文摘要

In semantic segmentation, the accuracy of models heavily depends on the high-quality annotations. However, in many practical scenarios, such as medical imaging and remote sensing, obtaining true annotations is not straightforward and usually requires significant human labor. Relying on human labor often introduces annotation errors, including mislabeling, omissions, and inconsistency between annotators. In the case of remote sensing, differences in procurement time can lead to misaligned ground-truth annotations. These label errors are not independently distributed, and instead usually appear in spatially connected regions where adjacent pixels are more likely to share the same errors. To address these issues, we propose an approximate Bayesian estimation based on a probabilistic model that assumes training data include label errors, incorporating the tendency for these errors to occur with spatial correlations between adjacent pixels. However, Bayesian inference for such spatially correlated discrete variables is notoriously intractable. To overcome this fundamental challenge, we introduce a novel class of probabilistic models, which we term the ELBO-Computable Correlated Discrete Distribution (ECCD). By representing the discrete dependencies through a continuous latent Gaussian field with a Kac-Murdock-Szegö (KMS) structured covariance, our framework enables scalable and efficient variational inference for problems previously considered computationally prohibitive. Through experiments on multiple segmentation tasks, we confirm that leveraging the spatial correlation of label errors significantly improves performance. Notably, in specific tasks such as lung segmentation, the proposed method achieves performance comparable to training with clean labels under moderate noise levels. Code is available at https://github.com/pfnet-research/Bayesian_SpatialCorr.

URL PDF HTML ☆

赞 0 踩 0

2504.14169 2026-04-07 stat.ME

Correcting nonignorable nonresponse bias in turnout estimation using callback data

Xinyu Li, Naiwen Ying, Kendrick Qijun Li, Xu Shi, Wang Miao

2502.18223 2026-04-07 stat.ME

Penalizing complexity priors for Bayesian inference of circular models

Xiang Ye, Janet Van Niekerk, Håvard Rue

Comments 20 pages, 21 figures

2502.07977 2026-04-07 cs.LG math.OC stat.ML

RESIST: Resilient Decentralized Learning Using Consensus Gradient Descent

Cheng Fang, Rishabh Dixit, Waheed U. Bajwa, Mert Gurbuzbalaban

Comments preprint of a journal paper; 110 pages, 14 figures, and 1 table

详情

英文摘要

Empirical risk minimization (ERM) is a cornerstone of modern machine learning (ML), supported by advances in optimization theory that ensure efficient solutions with provable algorithmic and statistical learning rates. Privacy, memory, computation, and communication constraints necessitate data collection, processing, and storage across network-connected devices. In many applications, networks operate in decentralized settings where a central server cannot be assumed, requiring decentralized ML algorithms that are efficient and resilient. Decentralized learning, however, faces significant challenges, including an increased attack surface. This paper focuses on the man-in-the-middle (MITM) attack, wherein adversaries exploit communication vulnerabilities to inject malicious updates during training, potentially causing models to deviate from their intended ERM solutions. To address this challenge, we propose RESIST (Resilient dEcentralized learning using conSensus gradIent deScenT), an optimization algorithm designed to be robust against adversarially compromised communication links, where transmitted information may be arbitrarily altered before being received. Unlike existing adversarially robust decentralized learning methods, which often (i) guarantee convergence only to a neighborhood of the solution, (ii) lack guarantees of linear convergence for strongly convex problems, or (iii) fail to ensure statistical consistency as sample sizes grow, RESIST overcomes all three limitations. It achieves algorithmic and statistical convergence for strongly convex, Polyak-Lojasiewicz, and nonconvex ERM problems by employing a multistep consensus gradient descent framework and robust statistics-based screening methods to mitigate the impact of MITM attacks. Experimental results demonstrate the robustness and scalability of RESIST across attack strategies, screening methods, and loss functions.

URL PDF HTML ☆

赞 0 踩 0

2410.18918 2026-04-07 stat.ML cs.LG

MissNODAG: Differentiable Cyclic Causal Graph Learning from Incomplete Data

Muralikrishnna G. Sethuraman, Razieh Nabi, Faramarz Fekri

Comments To appear in Transactions on Machine Learning Research

2410.07607 2026-04-07 math.ST stat.TH

Staleness Factors and Volatility Estimation at High Frequencies

Xinbing Kong, Bin Wu, Wuyi Ye

2410.07430 2026-04-07 cs.LG stat.ML

EventFlow: Forecasting Temporal Point Processes with Flow Matching

Gavin Kerrigan, Kai Nelson, Padhraic Smyth

Comments AISTATS 2026 Best Paper Award, camera ready version

2408.12739 2026-04-07 quant-ph cs.LG stat.ML

Quantum Convolutional Neural Networks are Effectively Classically Simulable

Pablo Bermejo, Paolo Braccia, Manuel S. Rudolph, Zoë Holmes, Lukasz Cincio, M. Cerezo

Comments 12 + 15 pages , 6 + 7 figures, 1 table, updated to published version

2405.03083 2026-04-07 stat.ME cs.LG stat.ML

Causal K-Means Clustering

Kwangho Kim, Jisu Kim, Edward H. Kennedy

2404.07457 2026-04-07 math.ST stat.CO stat.TH

From Poisson Observations to Fitted Negative Binomial Distribution

Yingying Yang, Niloufar Dousti Mousavi, Zhou Yu, Jie Yang

Comments 54 pages, 3 figures, 15 tables

2403.11343 2026-04-07 cs.LG cs.CR math.ST stat.ME stat.ML stat.TH

Federated Transfer Learning with Differential Privacy

Mengchu Li, Ye Tian, Yang Feng, Yi Yu

Comments 101 pages, 7 figures

2307.13475 2026-04-07 econ.EM math.ST stat.TH

Large sample properties of GMM estimators under second-order identification

Hugo Kruiniger

Comments 30 pages. In the third version of the paper, I have added results on the optimal weight matrices for ϕ_{1}-hat and ϕ_{p}-hat, respectively

详情

英文摘要

Dovonon and Hall (Journal of Econometrics, 2018) proposed a limiting distribution theory for GMM estimators for a p - dimensional globally identified parameter vector ϕ when local identification conditions fail at first-order but hold at second-order. They assumed that the first-order underidentification is due to the expected Jacobian having rank p-1 at the true value ϕ_{0}, i.e., having a rank deficiency of one. After reparametrizing the model such that the last column of the Jacobian vanishes, they showed that the GMM estimator of the vector comprising the first p-1 parameters, ϕ_{1}, converges at rate T^{-1/2} and the GMM estimator of the remaining parameter, ϕ_{p}, converges at rate T^{-1/4}. They also provided a limiting distribution of T^{1/4}(ϕ_{p}-hat-ϕ_{0,p}) subject to a (non-transparent) condition which they claimed to be not restrictive in general. However, as we show in this paper, their condition is in fact only satisfied when ϕ is overidentified and the limiting distribution of T^{1/4}(ϕ_{p}-hat-ϕ_{0,p}), which is non-standard, depends on whether ϕ is exactly identified or overidentified. In particular, the limiting distributions of the sign of T^{1/4}(ϕ_{p}-hat-ϕ_{0,p}) for the cases of exact and overidentification, respectively, are different and are obtained by using expansions of the GMM objective function of different orders. Unsurprisingly, we find that the limiting distribution theories of Dovonon and Hall (2018) for Indirect Inference (II) estimation under two different scenarios with second-order identification where the target function is a GMM estimator of the auxiliary parameter vector, are incomplete for similar reasons. We discuss how our results for GMM estimation can be used to complete both theories. We also derive the optimal weight matrices for ϕ_{1}-hat and ϕ_{p}-hat, respectively.

URL PDF HTML ☆

赞 0 踩 0

2302.08724 2026-04-07 stat.ML cs.LG stat.OT

Piecewise Deterministic Markov Processes for Bayesian Neural Networks

Ethan Goan, Dimitri Perrin, Kerrie Mengersen, Clinton Fookes

Comments typo fix, Includes correction to software and corrigendum note (fix supplementary references)

2604.04084 2026-04-07 stat.CO stat.ME

Meta-analysis with the glmmTMB R package

Coralie Williams, Maeve McGillycuddy, Mollie Brooks, Benjamin M. Bolker, Ayumi Mizuno, Yefeng Yang, Wolfgang Viechtbauer, David I. Warton, Shinichi Nakagawa

2604.04032 2026-04-07 stat.ME stat.AP

Bootstrap-Aggregated Method-of-Moments Estimation of the Copula Correlation Parameter for Marginal Survival Inference under Dependent Censoring

Hyun-Soo Zhang, Inkyung Jung, Chung Mo Nam

2604.03985 2026-04-07 cs.LG eess.SP stat.ML

Autoencoder-Based Parameter Estimation for Superposed Multi-Component Damped Sinusoidal Signals

Momoka Iida, Hayato Motohashi, Hirotaka Takahashi

Comments 27 pages, 16 figures, 14 tables

2604.03981 2026-04-07 cs.LG stat.CO

Multirate Stein Variational Gradient Descent for Efficient Bayesian Sampling

Arash Sarshar

2604.03970 2026-04-07 stat.ME stat.AP stat.CO

Learning association from multiple intermediate events for dynamic prediction of survival: an application to cardiovascular disease prognosis

Tonghui Yu, Liming Xiang

2604.03969 2026-04-07 stat.ML cs.LG stat.ME

Nearly Optimal Best Arm Identification for Semiparametric Bandits

Seok-Jin Kim

Comments To appear at AISTATS 2026

2604.03952 2026-04-07 stat.AP q-bio.QM

Multidimensional physical fitness is associated with reduced dementia risk through proteomic and neuroimaging pathways: a prospective cohort study of the UK Biobank

Yiqing Sun, Runyu Lin, Jiayue Qin, Feiyue Pan, Bingjie Li, Zhigang Yao

Comments 22 pages, 6 figures

2604.03948 2026-04-07 q-fin.PM stat.AP

Forecasting Tangency Portfolios and Investing in the Minimum Euclidean Distance Portfolio to Maximize Out-of-Sample Sharpe Ratios

Nolan Alexander, William Scherer

Comments Code: https://github.com/nolanalexander/efficient-frontier-coefficients

2604.03946 2026-04-07 q-fin.PM stat.AP

Asset allocation using a Markov process of clustered efficient frontier coefficients states

Nolan Alexander, William Scherer, Jamey Thompson

Comments Code: https://github.com/nolanalexander/efficient-frontier-coefficients

2604.03939 2026-04-07 stat.ME cs.LG stat.ML

Fused Multinomial Logistic Regression Utilizing Summary-Level External Machine-learning Information

Chi-Shian Dai, Jun Shao

Comments 24 pages, 2 figures

2604.03898 2026-04-07 cs.AI stat.CO

LLM-Agent-based Social Simulation for Attitude Diffusion

Deepak John Reji

2604.03863 2026-04-07 stat.ME

Estimation of treatment effect in clinical trials of continuous endpoints with retrieved dropouts

Myeongjong Kang, Sangyoon Yi

Comments 27 pages, 3 figures, 8 tables

2604.03840 2026-04-07 stat.ME cs.LG

New insights into Elo algorithm for practitioners and statisticians

Leszek Szczecinski

2604.03827 2026-04-07 stat.ME stat.AP

Confidence Intervals for Rate Estimation with Importance Sampling in Autonomous Vehicle Evaluation

Aiyou Chen, Ruixuan Rachel Zhou, Joseph J. Lee, Nicholas Chamandy, Henning Hohnhold

Comments 27 pages, 9 figures, Accepted by the Annals of Applied Statistics

2604.03810 2026-04-07 stat.ME

A test for normality based on self-similarity

Akin Anarat, Holger Schwender

2604.03772 2026-04-07 stat.ML cs.LG

Debiased Machine Learning for Conformal Prediction of Counterfactual Outcomes Under Runtime Confounding

Keith Barnatchez, Kevin P. Josey, Rachel C. Nethery, Giovanni Parmigiani

2604.03722 2026-04-07 math.PR math.ST stat.TH

Statistical Inference for Fractional Diffusions

Pablo Ramses Alonso-Martin, Horatio Boedihardjo, Anastasia Papavasiliou

Comments Contribution to an edited volume on anomalous diffusions

2604.03721 2026-04-07 stat.ML cs.LG stat.ME

The Generalised Kernel Covariance Measure

Luca Bergen, Dino Sejdinovic, Vanessa Didelez

Comments Accepted for the 5th Conference on Causal Learning and Reasoning (CLeaR 2026)

2604.03712 2026-04-07 math.PR math.ST stat.TH

Berry-Esseen Bounds for Statistics of Non-Stationary, $ϕ$-Mixing Random Variables

Brendan Williams, Yeor Hafouta

2604.03663 2026-04-07 econ.EM math.ST stat.TH

Robust Priors in Nonlinear Panel Models with Individual and Time Effects

Zizhong Yan, Zhengyu Zhang, Mingli Chen, Jingrong Li, Iván Fernández-Val

2604.03574 2026-04-07 stat.ME stat.AP

Spherically Embedded Time Series with Unknown Trend and Periodic Components

Jiazhen Xu, Han Lin Shang

2604.03566 2026-04-07 math.OC stat.ML

Fréchet Regression on the Bures-Wasserstein Manifold

Duc Toan Nguyen, César A. Uribe

2604.03544 2026-04-07 econ.EM stat.ME

Quantifying Omitted Variable Bias in Nonlinear Instrumental Variable Estimators

Yu-Min Yen

Comments 40 pages, 8 figures

2604.03535 2026-04-07 stat.ME

Multilevel Regression Discontinuity Models with Latent Variables

Monica Morell, Youngjin Han, Muwon Kwon, Youjin Sung, Yang Liu, Ji Seung Yang

2604.03502 2026-04-07 stat.ML cs.LG stat.ME

Nonparametric Regression Discontinuity Designs with Survival Outcomes

Maximilian Schuessler, Erik Sverdrup, Robert Tibshirani, Stefan Wager

2604.03437 2026-04-07 stat.AP cs.CY

Is it Cake or is it AI? A Systematic Review of Human Uncertainty in Distinguishing Generative Artificial Intelligence Content

Mark Louie F. Ramos

2604.03398 2026-04-07 stat.ME stat.AP stat.CO

Robust Standard Errors for Bayesian Posterior Functionals via the Infinitesimal Jackknife

Nanyu Luo, Feng Ji

2604.03388 2026-04-07 cs.LG stat.ML

Scalable Variational Bayesian Fine-Tuning of LLMs via Orthogonalized Low-Rank Adapters

Haotian Xiang, Bingcong Li, Qin Lu

详情

英文摘要

When deploying large language models (LLMs) to safety-critical applications, uncertainty quantification (UQ) is of utmost importance to self-assess the reliability of the LLM-based decisions. However, such decisions typically suffer from overconfidence, particularly after parameter-efficient fine-tuning (PEFT) for downstream domain-specific tasks with limited data. Existing methods to alleviate this issue either rely on Laplace approximation based post-hoc framework, which may yield suboptimal calibration depending on the training trajectory, or variational Bayesian training that requires multiple complete forward passes through the entire LLM backbone at inference time for Monte Carlo estimation, posing scalability challenges for deployment. To address these limitations, we build on the Bayesian last layer (BLL) model, where the LLM-based deterministic feature extractor is followed by random last layer parameters for uncertainty reasoning. Since existing low-rank adapters (LoRA) for PEFT have limited expressiveness due to rank collapse, we address this with Polar-decomposed Low-rank Adapter Representation (PoLAR), an orthogonalized parameterization paired with Riemannian optimization to enable more stable and expressive adaptation. Building on this PoLAR-BLL model, we leverage the variational (V) inference framework to put forth a scalable Bayesian fine-tuning approach which jointly seeks the PoLAR parameters and approximate posterior of the last layer parameters via alternating optimization. The resulting PoLAR-VBLL is a flexible framework that nicely integrates architecture-enhanced optimization with scalable Bayesian inference to endow LLMs with well-calibrated UQ. Our empirical results verify the effectiveness of PoLAR-VBLL in terms of generalization and uncertainty estimation on both in-distribution and out-of-distribution data for various common-sense reasoning tasks.

URL PDF HTML ☆

赞 0 踩 0

2604.03359 2026-04-07 physics.ao-ph stat.AP

Multidecadal Cycles Study in the Climate Indexes Series Using Wavelet Analysis in North/Northeast Brazil

Cleber Souza Corrêa, Roberto Lage Guedes, Karlmer Abel Bueno Corrêa, Felipe Gustavo Pilau

Comments 9 pages, 3 figures, published in Anuário do Instituto de Geociências (UFRJ), 42(1):66-73, 2019. DOI: 10.11137/2019_1_66_73

2604.03357 2026-04-07 physics.ao-ph stat.AP

Multidecadal Cycles of the Climatic Index: Sunspots that Affect North and Northeast of Brazil

Cleber Souza Corrêa, Roberto Lage Guedes, André Muniz Marinho da Rocha, Karlmer Abel Bueno Corrêa

Comments 10 pages, 4 figures, accepted and published in Journal of Aerospace Technology and Management, 12:e0420, 2020

2604.03355 2026-04-07 stat.AP

The Long-Range Memory and the Fractal Dimension: a Case Study for Alcântara

Cleber Souza Correa, Daniel Andrade Schuch, Antonio Paulo de Queiroz, Gilberto Fisch, Felipe do Nascimento Correa, Mariane Mendes Coutinho

Comments 8 pages, 6 figures, published in Journal of Aerospace Technology and Management (2017), DOI: 10.5028/jatm.v9i4.683

详情

DOI: 10.5028/jatm.v9i4.683
Journal ref: Journal of Aerospace Technology and Management, 9(4):461-468, 2017

英文摘要

This study aimed to analyze the time series behavior of the Southern Oscillation Index through techniques using Fast Fourier Transform, computing the autocorrelation function, and the calculation of the Hurst coefficient. The methodology of Hurst exponent calculation uses different lags, which are computed in the time series of Southern Oscillation Index. The persistent behavior in the time series can be characterized by calculating the Hurst exponent, seeking for more behavioral information, such as the existence of persistence and/or terms of long-range memory in the series. The results show a persistence of the climate in terms of long-memory Southern Oscillation Index time series, which can help to understand complex dynamic behavior in climate effects at global-scale level and specifically its influence in northeastern Brazil, in the region of the Alcântara Launch Center. The R package \texttt{tseriesChaos} was used in the analysis of the Southern Oscillation Index time series, estimating the largest Lyapunov exponent, which indicates the existence of chaotic behavior in time series. The resampling technique was used in a permutation test between the surface wind data in the São Luís airport, Maranhão State, and the Southern Oscillation Index. The permutation test results showed that the time series of monthly average wind speed in the São Luís airport is correlated with the variability of Southern Oscillation Index, statistically significant at the 5\% confidence level. The results also indicate the possibility of using autoregressive models to represent average meteorological variables in behavioral analysis, as well as trends in the climate, more specifically a possible climatic influence of El Niño--Southern Oscillation on wind strength in the Alcântara Launch Center.

URL PDF HTML ☆

赞 0 踩 0

2604.03354 2026-04-07 math.OC stat.CO

Optimal Experimental Design using Eigenvalue-Based Criteria with Pyomo.DoE

Daniel J. Laky, Shammah Lilonfe, Shawn B. Martin, Katherine A. Klise, Bethany L. Nicholson, John D. Siirola, Alexander W. Dowling

Comments 82 pages, 14 figures, 11 tables; includes supplementary information

2604.03341 2026-04-07 stat.AP physics.ao-ph stat.ML

Generative Unsupervised Downscaling of Climate Models via Domain Alignment: Application to Wind Fields

Julie Keisler, Boutheina Oueslati, Anastase Charantonis, Yannig Goude, Claire Monteleoni

详情

英文摘要

General Circulation Models (GCMs) are widely used for future climate projections, but their coarse spatial resolution and systematic biases limit their direct use for impact studies. This limitation is particularly critical for wind-related applications, such as wind energy, which require spatially coherent, multivariate, and physically plausible near-surface wind fields. Classical statistical downscaling and bias correction methods partly address this issue. Still, they struggle to preserve spatial structure, inter-variable consistency, and robustness under climate change, especially in high-dimensional settings. Recent advances in generative machine learning offer new opportunities for downscaling and bias correction, eliminating the need for explicitly paired low- and high-resolution datasets. However, many existing approaches remain difficult to interpret and challenging to deploy in operational climate impact studies. In this work, we apply SerpentFlow, an interpretable, generative, domain alignment framework, to the multivariate downscaling and bias correction of wind variables from GCM outputs. This is a method that generates low-resolution/high-resolution training data pairs by separating large-scale spatial patterns from small-scale variability. Large-scale components are aligned across climate model and observational domains. Conditional fine-scale variability is then learned using a flow-matching generative model. We apply the approach to multiple wind variables downscaling, including average and maximal wind speed, zonal and meridional components, and compare it with widely used multivariate bias correction methods. Results show improved spatial coherence, inter-variable consistency, and robustness under future climate conditions, highlighting the potential of interpretable generative models for wind and energy applications.

URL PDF HTML ☆

赞 0 踩 0

2604.03284 2026-04-07 stat.CO stat.ME

FunctionalCalibration: an R package for estimation in aggregated functional data model

Alex Rodrigo dos Santos Sousa, Vitor Ribas Perrone

2604.03271 2026-04-07 stat.CO physics.data-an

GPU-Accelerated Sequential Monte Carlo for Bayesian Spectral Analysis

Tomohiro Nabika, Yui Hayashi, Masato Okada

2604.00966 2026-04-07 math.ST cs.CC stat.TH

A Framework for Computational Lower Bounds in Nontrivial Norm Approximation

Runshi Tang, Yuefeng Han, Anru R. Zhang

2604.00672 2026-04-07 cs.CL cs.IR math.ST stat.TH

Common TF-IDF variants arise as key components in the test statistic of a penalized likelihood-ratio test for word burstiness

Zeyad Ahmed, Paul Sheridan, Michael McIsaac, Aitazaz A. Farooque

Comments 27 pages, 3 tables, 7 figures, accepted in Discover Computing 2026

2603.26029 2026-04-07 math.ST cs.CC stat.TH

Detection Is Harder Than Estimation in Certain Regimes: Inference for Moment and Cumulant Tensors

Runshi Tang, Yuefeng Han, Anru R. Zhang

2602.14303 2026-04-07 stat.ME

A Novel Three-Parameter Extended Weibull Distribution for Health Data Modelling

Isqeel Ogunsola, Nurudeen Ajadi, Gboyega Adepoju

2601.06597 2026-04-07 cs.LG stat.ML

Understanding and inverse design of implicit bias in stochastic learning: a geometric perspective

Nicola Aladrah, Emanuele Ballarin, Matteo Biagetti, Alessio Ansuini, Alberto d'Onofrio, Fabio Anselmi

Comments v2

2601.01422 2026-04-07 stat.CO stat.ME

Hamiltonian Monte Carlo for (Physics) Dummies

Arghya Mukherjee, Dootika Vats

Comments 40 pages, 12 figures, 1 table

2512.15056 2026-04-07 stat.AP

Routine Blood Biomarkers Reveal a Preclinical Continuum of Multiple Myeloma Risk

Bingjie Li, Jiadai Xu, Yiqing Sun, Feiyue Pan, Shing-Tung Yau, Peng Liu, Zhigang Yao

Comments 25 pages

2512.11919 2026-04-07 stat.ME cs.AI math.ST stat.TH

A fine-grained look at causal effects in causal spaces

Junhyung Park, Yuqing Zhou

2511.05281 2026-04-07 stat.ME

Conditioning on posterior samples for flexible frequentist goodness-of-fit testing

Ritwik Bhaduri, Aabesh Bhattacharyya, Rina Foygel Barber, Lucas Janson

Comments added sensitivity analysis

2510.22068 2026-04-07 cs.LG stat.ML

Deep Gaussian Processes for Functional Maps

Matthew Lowery, Zhitong Xu, Da Long, Keyan Chen, Daniel S. Johnson, Yang Bai, Varun Shankar, Shandian Zhe

Comments 9 pages + 9 page appendix, 7 figures

2510.20052 2026-04-07 math.OC cs.LG stat.ML

Endogenous Aggregation of Multiple Data Envelopment Analysis Scores for Large Data Sets

Hashem Omrani, Raha Imanirad, Adam Diamant, Utkarsh Verma, Amol Verma, Fahad Razak

2510.16132 2026-04-07 cs.LG math.OC stat.ML

A Minimal-Assumption Analysis of Q-Learning with Time-Varying Policies

Phalguni Nanda, Zaiwei Chen

Comments 46 pages, 4 figures

2510.06157 2026-04-07 stat.ME

Frequency-Domain Analysis of Time Series with Network-Structured Dependence: Application to Global Bank Connectedness

Cristian F. Jiménez-Varón, Marina I. Knight

2510.05454 2026-04-07 econ.EM stat.ME

Estimating Treatment Effects Under Bounded Heterogeneity

Soonwoo Kwon, Liyang Sun

Comments 45 pages, 5 figures

2510.04299 2026-04-07 stat.ME

Out-of-bag prediction balls for random forests in metric spaces

Diego Serrano, Eduardo García-Portugués

Comments 28 pages, 8 figures, 6 tables. Supplementary material: 11 pages, 4 figures, 2 tables

2509.21940 2026-04-07 stat.ML cs.IT cs.LG math.IT math.ST stat.TH

Sequential 1-bit Mean Estimation with Near-Optimal Sample Complexity

Ivan Lau, Jonathan Scarlett

Comments AISTATS 2026

2509.12981 2026-04-07 cs.LG stat.ML

Causal Discovery via Quantile Partial Effect

Yikang Chen, Xingzhe Sun, Dehui Du

Comments 29 pages, 6 figures; ICLR 2026

2509.02892 2026-04-07 cs.LG stat.ME

Improving Generative Methods for Causal Evaluation via Simulation-Based Inference

Pracheta Amaranath, Vinitra Muralikrishnan, Amit Sharma, David Jensen

Comments 13 pages main text, 68 pages total

2509.00472 2026-04-07 stat.ML cs.LG math.ST stat.TH

Partially Functional Dynamic Backdoor Diffusion-based Causal Model

Xinwen Liu, Lei Qian, Song Xi Chen, Niansheng Tang

Comments 16 pages, 2 figures

2508.19640 2026-04-07 math.ST stat.ME stat.TH

Optimal Cox regression under federated differential privacy: coefficients and cumulative hazards

Elly K. H. Hung, Yi Yu

2508.13831 2026-04-07 stat.ML cs.LG

Smooth Flow Matching for Synthesizing Functional Data

Jianbin Tan, Anru R. Zhang

2506.21744 2026-04-07 cs.LG stat.AP stat.ML

Federated Item Response Models: A Gradient-driven Privacy-preserving Framework for Distributed Psychometric Estimation

Biying Zhou, Nanyu Luo, Feng Ji

2506.07816 2026-04-07 stat.ML cs.LG math.PR

Accelerating Constrained Sampling: A Large Deviations Approach

Yingli Wang, Changwei Tu, Xiaoyu Wang, Lingjiong Zhu

Comments 59 pages, 15 figures

2506.00077 2026-04-07 cs.CL cs.LG stat.ML

Gaussian mixture models as a proxy for interacting language models

Edward L. Wang, Mohammad Sharifi Kiasari, Tianyu Wang, Hayden Helm, Avanti Athreya, Carey Priebe, Vince Lyzinski

2505.18288 2026-04-07 stat.ML cs.LG

Operator Learning for Schrödinger Equation: Unitarity, Error Bounds, and Time Generalization

Yash Patel, Unique Subedi, Ambuj Tewari

Comments 37 pages

2505.12530 2026-04-07 cs.LG math.OC stat.ML

Enforcing Fair Predicted Scores on Intervals of Percentiles by Difference-of-Convex Constraints

Yutian He, Yankun Huang, Yao Yao, Qihang Lin

Comments 45 pages, 12 figures, 4 tables. This work is published in the proceedings of AISTATS 2026

2505.11211 2026-04-07 cs.LG cs.AI stat.ME stat.ML

Bayesian Hierarchical Invariant Prediction

Francisco Madaleno, Pernille Julie Viuff Sand, Francisco C. Pereira, Sergio Hernan Garrido Mejia

2504.05297 2026-04-07 stat.ME econ.EM stat.AP stat.CO

Eigenvalue-Based Randomness Test for Residual Diagnostics in Panel Data Models

Marcell T. Kurbucz, Betsabé Pérez Garrido, Antal Jakovác

Comments 10 pages, 3 figures

2503.03206 2026-04-07 cs.LG cs.CV math.ST stat.ML stat.TH

An Analytical Theory of Spectral Bias in the Learning Dynamics of Diffusion Models

Binxu Wang, Cengiz Pehlevan

Comments 96 pages, 29 figures. Published in Advances in Neural Information Processing Systems, NeurIPS 2025 (Spotlight)

2503.03049 2026-04-07 stat.ME stat.AP

Estimating treatment effects with competing intercurrent events in randomized controlled trials

Sizhu Lu, Yanyao Yi, Yongming Qu, Huayu Karen Liu, Ting Ye, Peng Ding

详情

英文摘要

The analysis of randomized controlled trials is often complicated by intercurrent events (IEs) -- events that occur after treatment initiation and affect either the interpretation or existence of outcome measurements. Examples include treatment discontinuation or the use of additional medications. In two recent clinical trials for systemic lupus erythematosus with complications of IEs, we classify the IEs into two broad categories: effect-informative (e.g., treatment discontinuation due to adverse events or lack of efficacy) and effect-uninformative (e.g., treatment discontinuation due to external factors such as pandemics or relocation). To define a clinically meaningful estimand, we adopt tailored strategies for each category of IEs. For effect-informative IEs, which are often informative about a patient's outcome, we use the composite variable strategy that assigns an outcome value indicative of treatment failure. For effect-uninformative IEs, we apply the hypothetical strategy, assuming their timing is conditionally independent of the outcome given treatment and baseline covariates, and hypothesizing a scenario in which such events do not occur. A central yet previously overlooked challenge is the presence of competing IEs, where the first IE censors all subsequent ones. Despite its ubiquity in practice, this issue has not been explicitly recognized or addressed in previous data analyses due to the lack of rigorous statistical methodology. In this paper, we propose a principled framework to formulate the estimand, establish its nonparametric identification and semiparametric estimation theory, and introduce weighting, outcome regression, and doubly robust estimators. We apply our methods to analyze the two systemic lupus erythematosus trials, demonstrating the robustness and practical utility of the proposed framework.

URL PDF HTML ☆

赞 0 踩 0

2502.15567 2026-04-07 cs.LG stat.ML

Model Privacy: A Unified Framework for Understanding Model Stealing Attacks and Defenses

Ganghua Wang, Yuhong Yang, Jie Ding

Comments Journal of the Royal Statistical Society Series B: Statistical Methodology, 2026

2502.02020 2026-04-07 cs.LG stat.ME

Causal Bandit Over Unknown Graphs: Upper Confidence Bounds With Backdoor Adjustment

Yijia Zhao, Qing Zhou

2501.07571 2026-04-07 math.ST stat.TH

Statistical learnability of smooth boundaries via pairwise binary classification with deep ReLU networks

Hiroki Waida, Takafumi Kanamori

2412.13453 2026-04-07 stat.ME

Modeling extremal dependence in multivariate and spatial problems: a practical perspective

Boris Beranger, Simone A. Padoan

2411.13443 2026-04-07 math.NA cs.NA math.OC stat.ML

Nonlinear Assimilation via Score-based Sequential Langevin Sampling

Zhao Ding, Chenguang Duan, Yuling Jiao, Jerry Zhijian Yang, Cheng Yuan, Pingwen Zhang

2411.02225 2026-04-07 stat.ML cs.IT cs.LG math.IT math.ST stat.TH

Sparse Max-Affine Regression

Haitham Kanj, Seonho Kim, Kiryung Lee

详情

英文摘要

This paper presents Sparse Gradient Descent as a solution for variable selection in convex piecewise linear regression, where the model is given as the maximum of $k$-affine functions $ x \mapsto \max_{j \in [k]} \langle a_j^\star, x \rangle + b_j^\star$ for $j = 1,\dots,k$. Here, $\{ a_j^\star\}_{j=1}^k$ and $\{b_j^\star\}_{j=1}^k$ denote the ground-truth weight vectors and intercepts. A non-asymptotic local convergence analysis is provided for Sp-GD under sub-Gaussian noise when the covariate distribution satisfies the sub-Gaussianity and anti-concentration properties. When the model order and parameters are fixed, Sp-GD provides an $ε$-accurate estimate given $\mathcal{O}(\max(ε^{-2}σ_z^2,1)s\log(d/s))$ observations where $σ_z^2$ denotes the noise variance. This also implies the exact parameter recovery by Sp-GD from $\mathcal{O}(s\log(d/s))$ noise-free observations. The proposed initialization scheme uses sparse principal component analysis to estimate the subspace spanned by $\{ a_j^\star\}_{j=1}^k$, then applies an $r$-covering search to estimate the model parameters. A non-asymptotic analysis is presented for this initialization scheme when the covariates and noise samples follow Gaussian distributions. When the model order and parameters are fixed, this initialization scheme provides an $ε$-accurate estimate given $\mathcal{O}(ε^{-2}\max(σ_z^4,σ_z^2,1)s^2\log^4(d))$ observations. A new transformation named Real Maslov Dequantization (RMD) is proposed to transform sparse generalized polynomials into sparse max-affine models. The error decay rate of RMD is shown to be exponentially small in its temperature parameter. Furthermore, theoretical guarantees for Sp-GD are extended to the bounded noise model induced by RMD. Numerical Monte Carlo results corroborate theoretical findings for Sp-GD and the initialization scheme.

URL PDF HTML ☆

赞 0 踩 0

2410.00985 2026-04-07 stat.ME

Nonparametric tests of treatment effect homogeneity for policy-makers

Oliver Dukes, Mats J. Stensrud, Riccardo Brioschi, Aaron Hudson

2309.10284 2026-04-07 stat.ME math.ST stat.AP stat.TH

Rank-adaptive covariance testing with applications to genomics and neuroimaging

David Veitch, Yinqiu He, Jun Young Park

2307.09366 2026-04-07 cs.LG stat.ME stat.ML

Sparse Gaussian Graphical Models with Discrete Optimization: Computational and Statistical Perspectives

Kayhan Behdin, Wenyu Chen, Rahul Mazumder

Comments Operations Research (to appear)

2306.06581 2026-04-07 stat.ML cs.DS cs.LG math.OC

Importance Sparsification for Sinkhorn Algorithm

Mengyu Li, Jun Yu, Tao Li, Cheng Meng

Comments Accepted by Journal of Machine Learning Research

2306.04119 2026-04-07 stat.ME

Improving Survey Inference in Two-phase Designs Using Bayesian Machine Learning

Xinru Wang, Anyu Zhu, Lauren Kennedy, Abigail Greenleaf, Qixuan Chen

2006.04363 2026-04-07 cs.LG cs.AI stat.ML

Mitigating Value Hallucination in Dyna Planning via Multistep Predecessor Models

Farzane Aminmansour, Taher Jafferjee, Ehsan Imani, Erin Talvitie, Micheal Bowling, Martha White

Comments Published in Journal of Artificial Intelligence (JAIR) in 2024. Updated to published version, changed title to JAIR version, added a new author that led the submission