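arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.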

2603.08676 2026-03-10 stat.ML cs.LG stat.CO

Momentum SVGD-EM for Accelerated Maximum Marginal Likelihood Estimation

Adam Rozzio, Rafael Athanasiades, O. Deniz Akyildiz

Comments Accepted to AISTATS 2026

2603.08607 2026-03-10 stat.ME stat.AP

RESAPLE: An Approximate One-Step Restricted Likelihood Estimator of Spatial Dependence for Exploratory Spatial Analysis

Aditya Khan, Meredith Franklin

2603.08553 2026-03-10 stat.ML cs.LG math.OC q-fin.PM q-fin.RM

Generative Adversarial Regression (GAR): Learning Conditional Risk Scenarios

Saeed Asadi, Jonathan Yu-Meng Li

2603.08542 2026-03-10 math.ST cs.DS math.PR stat.TH

Bayesian inference of planted matchings: Local posterior approximation and infinite-volume limit

Zhou Fan, Timothy L. H. Wee, Kaylee Y. Yang

2603.08518 2026-03-10 cs.LG stat.ML

Breaking the Bias Barrier in Concave Multi-Objective Reinforcement Learning

Swetha Ganesh, Vaneet Aggarwal

2603.08495 2026-03-10 cs.LG stat.ML

Efficient Credal Prediction through Decalibration

Paul Hofman, Timo Löhr, Maximilian Muschalik, Yusuf Sale, Eyke Hüllermeier

2603.08377 2026-03-10 cs.LG stat.ML

Beyond the Markovian Assumption: Robust Optimization via Fractional Weyl Integrals in Imbalanced Data

Gustavo A. Dorrego

Comments 5 pages, 3 figures

2603.08370 2026-03-10 stat.ML cs.IR cs.LG stat.ME

Unifying On- and Off-Policy Variance Reduction Methods

Olivier Jeunen

2603.08353 2026-03-10 math.ST stat.TH

Limiting Spectral Distribution of moderately large Kendall's correlation matrix and its application

Raunak Shevade, Monika Bhattacharjee

Comments 25 pages, Submitted to journal

2603.08349 2026-03-10 cs.LG cs.AI stat.ML

Towards plausibility in time series counterfactual explanations

Marcin Kostrzewa, Krzysztof Galus, Maciej Zięba

2603.08345 2026-03-10 stat.ME q-bio.QM

Amortized Phylodynamic Inference with Neural Bayes Estimators and Recursive Neural Networks

Alexander E. Zarebski, Thomas Williams, Louis du Plessis

2603.08320 2026-03-10 math.ST math.PR stat.TH

Size-Location Correlation for Set-Valued Processes: Theory, Estimation, and Laws of Large Numbers under $ρ$-Mixing

Tuyen Luc Tri

Comments 47 pages, 2 figures

2603.08311 2026-03-10 math.ST cs.LG stat.TH

Sign Identifiability of Causal Effects in Stationary Stochastic Dynamical Systems

Gijs van Seeventer, Saber Salehkaleybar

2603.08287 2026-03-10 stat.ML cs.LG

Posterior Sampling Reinforcement Learning with Gaussian Processes for Continuous Control: Sublinear Regret Bounds for Unbounded State Spaces

Hamish Flynn, Joe Watson, Ingmar Posner, Jan Peters

Comments 37 pages, 8 figures

2603.08285 2026-03-10 stat.ME

An objective non-local prior for skew-symmetric models

F. J. Rubio

Comments R code and real data available at: https://github.com/FJRubio67/MOOMIN

2603.08257 2026-03-10 stat.ML cs.LG

Beyond ReinMax: Low-Variance Gradient Estimators for Discrete Latent Variables

Daniel Wang, Thang D. Bui

2603.08242 2026-03-10 cs.LG stat.AP

Optimising antibiotic switching via forecasting of patient physiology

Magnus Ross, Nel Swanepoel, Akish Luintel, Emma McGuire, Ingemar J. Cox, Steve Harris, Vasileios Lampos

Comments 32 pages, 8 figures

2603.06759 2026-03-10 math.ST stat.TH

A New Estimator of Kullback–Leibler Divergence via Shannon Entropy

Mehmet Siddik Cadirci, Martin Singull

Comments 20 pages, 6 figures, 2 tables

2602.20640 2026-03-10 math.ST stat.ML stat.TH

Scalable multitask Gaussian processes for complex mechanical systems with functional covariates

Razak Christophe Sabi Gninkou, Andrés F. López-Lopera, Franck Massa, Rodolphe Le Riche

2602.10760 2026-03-10 math.ST stat.TH

Covariate-Adaptive Randomization in Clinical Trials without Inflated Variances

Zhang Li-Xin

Comments 30 pages

2601.02241 2026-03-10 stat.ML cs.LG

From Mice to Trains: Amortized Bayesian Inference on Graph Data

Svenja Jedhoff, Elizaveta Semenova, Aura Raulo, Anne Meyer, Paul-Christian Bürkner

2512.24327 2026-03-10 stat.ML cs.CG cs.LG

Topological Spatial Graph Coarsening

Anna Calissano, Etienne Lasalle

2511.19905 2026-03-10 math.ST stat.ME stat.TH

Sigmoid-FTRL: Design-Based Adaptive Neyman Allocation for AIPW Estimators

Fangyi Chen, Shu Ge, Jian Qian, Christopher Harshaw

2510.04602 2026-03-10 stat.ML cs.AI cs.LG

Wasserstein Gradient Flows for Scalable and Regularized Barycenter Computation

Eduardo Fernandes Montesuma, Yassir Bendou, Mike Gartrell

Comments Under review

2510.04543 2026-03-10 cs.LG stat.ML

The Role of Feature Interactions in Graph-based Tabular Deep Learning

Elias Dubbeldam, Reza Mohammadi, Marit Schoonhoven, S. Ilker Birbil

Comments 12 pages, 5 figures, accepted at TMLR 2026

2510.01734 2026-03-10 stat.ME

Stabilizing Thompson Sampling with Null Hypothesis Bayesian Response-Adaptive Randomization

Samuel Pawel, Leonhard Held

2509.26429 2026-03-10 stat.ML cs.LG

An Orthogonal Learner for Individualized Outcomes in Markov Decision Processes

Emil Javurek, Valentyn Melnychuk, Jonas Schweisthal, Konstantin Hess, Dennis Frauen, Stefan Feuerriegel

Comments Published as a conference paper at ICLR 2026

2507.14391 2026-03-10 stat.ME econ.EM

Policy relevance of causal quantities in networks

Sahil Loomba, Dean Eckles

Comments 27 Pages, 4 figures

2505.03234 2026-03-10 stat.ME

Designing clinical trials for the comparison of single and multiple quantiles with right-censored data

Beatriz Farah, Olivier Bouaziz, Aurélien Latouche

2503.00290 2026-03-10 econ.EM math.ST stat.TH

GMM and M Estimation under Network Dependence

Yuya Sasaki

2502.03849 2026-03-10 math.ST stat.CO stat.ME stat.TH

Fast confidence bounds for the false discovery proportion over a path of hypotheses

Guillermo Durand

2409.16044 2026-03-10 stat.AP stat.ME

Stable Survival Extrapolation via Transfer Learning

Anastasios Apsemidis, Nikolaos Demiris

Comments 28 pages, 6 figures, 1 table

2409.09787 2026-03-10 cs.LG cs.AI stat.CO stat.ML

BNEM: A Boltzmann Sampler Based on Bootstrapped Noised Energy Matching

RuiKang OuYang, Bo Qiang, José Miguel Hernández-Lobato

Comments Camera-ready version for TMLR (03/2026)

2408.13143 2026-03-10 stat.ME

A Restricted Latent Class Model with Polytomous Attributes and Respondent-Level Covariates

Eric Alan Wayman, Steven Andrew Culpepper, Jeff Douglas, Jesse Bowers

Comments 42 pages, 1 figure, 11 tables. Added second simulation study, expanded explanations, added runtime information, and fixed typos. The version of record of this article, first published in Behaviormetrika, is available on the publisher's website at https://doi.org/10.1007/s41237-025-00271-8

2406.13691 2026-03-10 stat.ME stat.CO

Computationally efficient multi-level Gaussian process regression for functional data observed under completely or partially regular sampling designs

Adam Gorm Hoffmann, Claus Thorn Ekstrøm, Andreas Kryger Jensen

Comments 48 pages, 3 figures; Figure 1 corrected

2405.08290 2026-03-10 stat.CO stat.ME

MCMC using bouncy Hamiltonian dynamics: A unifying framework for Hamiltonian Monte Carlo and piecewise deterministic Markov process samplers

Andrew Chin, Akihiko Nishimura

2312.10330 2026-03-10 math.OC stat.ML

Convergence and complexity of block majorization-minimization for constrained block-Riemannian optimization

Yuchen Li, Laura Balzano, Deanna Needell, Hanbaek Lyu

Comments 54 pages, 8 figures. Related work updated

2309.03122 2026-03-10 stat.AP physics.soc-ph stat.ME

Bayesian Evidence Synthesis for Modeling SARS-CoV-2 Transmission

Anastasios Apsemidis, Nikolaos Demiris

Comments 27 pages, 6 figures

2603.08156 2026-03-10 cs.LG stat.ML

Are We Winning the Wrong Game? Revisiting Evaluation Practices for Long-Term Time Series Forecasting

Thanapol Phungtua-eng, Yoshitaka Yamamoto

Comments First draft

2603.08149 2026-03-10 math.ST math.PR stat.TH

The W-footrule coefficient: A copula-based measure of countermonotonicity

Enrique de Amo, David García-Fernández, Manuel Úbeda-Flores

2603.08130 2026-03-10 cs.LG stat.ML

Explainable Condition Monitoring via Probabilistic Anomaly Detection Applied to Helicopter Transmissions

Aurelio Raffa Ugolini, Jessica Leoni, Valentina Breschi, Damiano Paniccia, Francesco Aldo Tucci, Luigi Capone, Mara Tanelli

2603.08101 2026-03-10 stat.AP

Non-stationary GEV models for estimating design sea-states in a changing climate. Applications to offshore wind farms along the French coasts

Nicolas Raillard, Coline Poppeschi, Tessa Chevallier, Youen Kervella, Laurent Dubus

Comments This work is under review for journal "Advances in Statistical Climatology, Meteorology and Oceanography" (ASCMO)

2603.08002 2026-03-10 math.ST stat.ME stat.TH

Post-Hoc Large-Sample Statistical Inference

Ben Chugg, Etienne Gauthier, Michael I. Jordan, Aaditya Ramdas, Ian Waudby-Smith

Comments 61 pages, 7 figures

2603.07971 2026-03-10 math.ST stat.TH

Estimation of differential entropy for normal populations under prior information

Somnath Mandal, Lakshmi Kanta Patra

Comments 29 pages, 28 figures, 3 tables, 34 references

2603.07965 2026-03-10 stat.ML cs.LG

Local Constrained Bayesian Optimization

Jing Jingzhe, Fan Zheyi, Szu Hui Ng, Qingpei Hu

2603.07921 2026-03-10 stat.ML cs.LG

Robust Transfer Learning with Side Information

Akram S. Awad, Shihab Ahmed, Yue Wang, George K. Atia

2603.07899 2026-03-10 cs.LG stat.ML

Bayesian Transformer for Probabilistic Load Forecasting in Smart Grids

Sajib Debnath, Md. Uzzal Mia

Abstract

The reliable operation of modern power grids requires probabilistic load forecasts with well-calibrated uncertainty estimates. However, existing deep learning models produce overconfident point predictions that fail catastrophically under extreme weather distributional shifts. This study proposes a Bayesian Transformer (BT) framework that integrates three complementary uncertainty mechanisms into a PatchTST backbone: Monte Carlo Dropout for epistemic parameter uncertainty, variational feed-forward layers with log-uniform weight priors, and stochastic attention with learnable Gaussian noise perturbations on pre-softmax logits, representing, to the best of our knowledge, the first application of Bayesian attention to probabilistic load forecasting. A seven-level multi-quantile pinball-loss prediction head and post-training isotonic regression calibration produce sharp, near-nominally covered prediction intervals. Evaluation of five grid datasets (PJM, ERCOT, ENTSO-E Germany, France, and Great Britain) augmented with NOAA covariates across 24, 48, and 168-hour horizons demonstrates state-of-the-art performance. On the primary benchmark (PJM, H=24h), BT achieves a CRPS of 0.0289, improving 7.4% over Deep Ensembles and 29.9% over the deterministic LSTM, with 90.4% PICP at the 90% nominal level and the narrowest prediction intervals (4,960 MW) among all probabilistic baselines. During heat-wave and cold snap events, BT maintained 89.6% and 90.1% PICP respectively, versus 64.7% and 67.2% for the deterministic LSTM, confirming that Bayesian epistemic uncertainty naturally widens intervals for out-of-distribution inputs. Calibration remained stable across all horizons (89.8-90.4% PICP), while ablation confirmed that each component contributed a distinct value. The calibrated outputs directly support risk-based reserve sizing, stochastic unit commitment, and demand response activation.
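
The abstract relies on two standard quantities: the pinball (quantile) loss used to train a multi-quantile head, and the prediction interval coverage probability (PICP) used to report calibration. A minimal sketch of both in plain Python follows. The function names and toy numbers are illustrative assumptions and are not taken from the paper.

```python
from typing import Sequence


def pinball_loss(y_true: Sequence[float], y_pred: Sequence[float], q: float) -> float:
    """Mean pinball (quantile) loss of predictions y_pred at quantile level q."""
    if not 0.0 < q < 1.0:
        raise ValueError("q must lie strictly between 0 and 1")
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    total = 0.0
    for y, p in zip(y_true, y_pred):
        diff = y - p
        # Under-prediction (diff > 0) is weighted by q, over-prediction by 1 - q.
        total += max(q * diff, (q - 1.0) * diff)
    return total / len(y_true)


def picp(y_true: Sequence[float], lower: Sequence[float], upper: Sequence[float]) -> float:
    """Fraction of observations that fall inside [lower, upper]."""
    if not (len(y_true) == len(lower) == len(upper)) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    inside = sum(1 for y, lo, hi in zip(y_true, lower, upper) if lo <= y <= hi)
    return inside / len(y_true)


# Hypothetical toy data: four load observations and one fixed interval.
observed = [1.0, 2.0, 3.0, 4.0]
coverage = picp(observed, [0.0] * 4, [2.0] * 4)  # two of four points are covered
```

For a 90% nominal interval, a well-calibrated model should give a PICP close to 0.90, which is the sense in which the abstract reports 90.4%.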

2603.07887 2026-03-10 cs.LG cs.AI cs.CL math.ST stat.ML stat.TH

Reject, Resample, Repeat: Understanding Parallel Reasoning in Language Model Inference

Noah Golowich, Fan Chen, Dhruv Rohatgi, Raghav Singhal, Carles Domingo-Enrich, Dylan J. Foster, Akshay Krishnamurthy

2603.07871 2026-03-10 stat.ME

Effective and flexible depth-based inference for functional parameters

Hyemin Yeon

2603.07864 2026-03-10 stat.ML cs.LG

An Interpretable Generative Framework for Anomaly Detection in High-Dimensional Financial Time Series

Waldyn G Martinez

2603.07856 2026-03-10 stat.ME

Variational Inference for Variable Selection in Scalar-on-Function Regression

Ana Carolina da Cruz, Camila P. E. de Souza, Pedro H. T. O. Sousa

Comments 41 pages in main text and 18 pages in the Supplementary Material

2603.07842 2026-03-10 stat.ME

New results and tests for stochastic dominance between linear combinations

Tommaso Lando, Paulo Eduardo Oliveira

2603.07813 2026-03-10 econ.EM stat.AP

At-Risk Transformation for U.S. Recession Prediction

Rahul Billakanti, Minchul Shin

Comments 46 pages, 2 figures

2603.07791 2026-03-10 stat.ME stat.AP

Design Effect Ratios for Bayesian Survey Models: A Diagnostic Framework for Identifying Survey-Sensitive Parameters

JoonHo Lee

2603.07780 2026-03-10 econ.EM math.ST stat.TH

Testing for Endogeneity: A Moment-Based Bayesian Approach

Siddhartha Chib, Minchul Shin, Anna Simoni

Comments 109 pages, 4 figures

2603.07742 2026-03-10 stat.OT

A Cylindrical Galton Board at the Galton Board's 150th Anniversary

Kanti V. Mardia, Colin Goodall, John Rubbo

Comments 18 pages, 8 Figures

2603.07701 2026-03-10 cond-mat.str-el hep-th physics.app-ph quant-ph stat.CO

Fractional Topological Phases, Flat Bands, and Robust Edge States on Finite Cyclic Graphs via Single-Coin Split-Step Quantum Walks

Dinesh Kumar Panda, Colin Benjamin

Comments 18 pages, 18 figures, 2 tables

2603.07656 2026-03-10 stat.ME math.ST stat.AP stat.CO stat.TH

Group-Sparse Smoothing for Longitudinal Models with Time-Varying Coefficients

Yu Lu, Tianni Zhang, Yuyao Wang, Mengfei Ran

2603.07634 2026-03-10 stat.ME physics.data-an

Dissecting Spectral Granger Causality through Partial Information Decomposition

Luca Faes, Gorana Mijatovic, Riccardo Pernice, Daniele Marinazzo, Sebastiano Stramaglia, Yuri Antonacci

2603.07527 2026-03-10 stat.ME

An efficient method of posterior sampling for Poisson INGARCH models

Yixuan Fan, Zhengwei Liu, Fukang Zhu

2603.07522 2026-03-10 stat.ML cs.LG

Beyond Data Splitting: Full-Data Conformal Prediction by Differential Privacy

Young Hyun Cho, Jordan Awan

2603.07505 2026-03-10 stat.ME

Adapting to noise tails in private linear regression

Jinyuan Chang, Lin Yang, Mengyue Zha, Wen-Xin Zhou

2603.07479 2026-03-10 stat.ME

Mixed Effects Mixture of Experts: Modeling Double Heterogeneous Trajectories

Xinkai Yue, Xiaodong Yan, Haohui Han, Liya Fu

Comments 5 figures

2603.07478 2026-03-10 stat.ME math.OC stat.AP

Evaluating consumption effects of intelligent control algorithms for district heated buildings

Antti Solonen, Arttu Häkkinen, Sallamaari Rapo, Antti Mäkinen, Sampo Kaukonen, Felipe Uribe

2603.07467 2026-03-10 stat.ML cs.LG math.PR math.ST stat.ME stat.TH

Probabilistic Inference and Learning with Stein's Method

Qiang Liu, Lester Mackey, Chris Oates

2603.07458 2026-03-10 econ.EM stat.AP

ForeComp: An R Package for Comparing Predictive Accuracy Using Fixed-Smoothing Asymptotics

Minchul Shin, Nathan Schor

Comments 45 pages, 2 figures

2603.07447 2026-03-10 stat.ME math.ST stat.AP stat.TH

Dirichlet kernel density estimation on the simplex with missing data

Hanen Daayeb, Wissem Jedidi, Salah Khardani, Guanjie Lyu, Frédéric Ouimet

Comments 32 pages, 9 figures, 2 tables

2603.07437 2026-03-10 cs.LG cs.SY eess.SY math.OC stat.ML

Cost-Driven Representation Learning for Linear Quadratic Gaussian Control: Part II

Yi Tian, Kaiqing Zhang, Russ Tedrake, Suvrit Sra

Comments 38 pages; preliminary version appeared in IEEE CDC 2023; this is the extended journal version, with an end-to-end guarantee added

2603.07409 2026-03-10 stat.ME stat.ML

Tree-Based Predictive Models for Noisy Input Data

Kevin McCoy, Zachary Wooten, Christine B. Peterson

Comments 17 pages, 9 figures

2603.07380 2026-03-10 stat.AP

Excessive data censoring in fMRI undermines individual precision and weakens brain-behavior associations

Amanda Mejia, Joanne Hwang, Damon Pham, Stephanie Noble, Theodore D. Satterthwaite, Thomas E. Nichols, B. T. Thomas Yeo

Abstract

Censoring high-motion volumes in fMRI is common practice to reduce effects of head motion on functional connectivity (FC). Although aggressive censoring removes more noise, it causes extensive data loss, creating a tradeoff that may ultimately improve or degrade FC accuracy. Here, we evaluate how censoring affects FC estimation and downstream brain-wide association studies (BWAS). Using extensively sampled participants from the Human Connectome Project (HCP) Retest dataset, we establish individual "ground truth" FC and assess the accuracy of FC estimated from 5-30 minute scans. We find that censoring degrades FC accuracy, with more aggressive censoring being more detrimental, particularly among participants exhibiting above-average motion. In these participants, aggressive censoring reduces FC accuracy by 30% for 30-minute scans denoised with ICA-FIX, an advanced denoising method, and by 3% for scans denoised with conventional confound regression. These effects reflect substantial data loss (34%) that outweighs comparatively modest noise reductions: 7% with ICA-FIX and 18% with confound regression. Compensating for this would require substantially longer scans (62% with confound regression; 76% with ICA-FIX), inflating data collection budgets. Introducing a repeated measures framework to separate motion trait from artifact, we find that standard QC metrics are dominated by motion trait and overstate motion bias, which is effectively mitigated with less aggressive censoring. Finally, using data from nearly 1,000 HCP participants, we demonstrate that unreliable FC substantially attenuates BWAS correlations: by ~30% under optimal conditions (longer ICA-FIX scans with no censoring) but exceeding 75% in short, aggressively censored scans. Our findings support the use of advanced denoising methods, limiting censoring, and collecting longer scans to maximize fidelity of FC and BWAS.
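
The link between measurement reliability and attenuated correlations can be illustrated with the classical Spearman attenuation relation, in which the observed correlation equals the true correlation multiplied by the square root of the product of the two reliabilities. The sketch below assumes that textbook model. It is not claimed to be the estimator used in the paper, and the function names and example values are hypothetical.

```python
import math


def attenuated_correlation(true_r: float, reliability_x: float, reliability_y: float = 1.0) -> float:
    """Observed correlation under the classical attenuation model."""
    for rel in (reliability_x, reliability_y):
        if not 0.0 < rel <= 1.0:
            raise ValueError("reliabilities must lie in (0, 1]")
    return true_r * math.sqrt(reliability_x * reliability_y)


def attenuation_fraction(reliability_x: float, reliability_y: float = 1.0) -> float:
    """Fractional shrinkage of the correlation caused by unreliable measurement."""
    return 1.0 - math.sqrt(reliability_x * reliability_y)


# Hypothetical example: a true correlation of 0.5 measured with a
# connectivity reliability of 0.49 and a perfectly reliable behavioural score.
observed_r = attenuated_correlation(0.5, 0.49)
shrinkage = attenuation_fraction(0.49)
```

Under this model, a shrinkage of about 30% corresponds to a reliability product of about 0.49, which gives a sense of scale for the attenuation figures quoted in the abstract.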

2603.07351 2026-03-10 cs.RO cs.LG stat.ML

A Distributed Gaussian Process Model for Multi-Robot Mapping

Seth Nabarro, Mark van der Wilk, Andrew J. Davison

Comments ICRA 2026, 8 pages

2602.22758 2026-03-10 cs.AI stat.AP

Decomposing Physician Disagreement in HealthBench

Satya Borgohain, Roy Mariathas

2602.15319 2026-03-10 stat.ME stat.AP

Bayesian Inference for Joint Tail Risk in Paired Biomarkers via Archimedean Copulas with Restricted Jeffreys Priors

Agnideep Aich, Md. Monzur Murshed, Sameera Hewage, Ashit Baran Aich

2512.03112 2026-03-10 cs.LG cs.AI stat.ML

Beyond Additivity: Sparse Isotonic Shapley Regression toward Nonlinear Explainability

Jialai She

Abstract

Shapley values, a gold standard for feature attribution in Explainable AI, face two key challenges. First, the canonical Shapley framework assumes that the worth function is additive, yet real-world payoff constructions--driven by non-Gaussian distributions, heavy tails, feature dependence, or domain-specific loss scales--often violate this assumption, leading to distorted attributions. Second, achieving sparse explanations in high-dimensional settings by computing dense Shapley values and then applying ad hoc thresholding is costly and risks inconsistency. We introduce Sparse Isotonic Shapley Regression (SISR), a unified nonlinear explanation framework. SISR simultaneously learns a monotonic transformation to restore additivity--obviating the need for a closed-form specification--and enforces an L0 sparsity constraint on the Shapley vector, enhancing computational efficiency in large feature spaces. Its optimization algorithm leverages Pool-Adjacent-Violators for efficient isotonic regression and normalized hard-thresholding for support selection, ensuring ease in implementation and global convergence guarantees. Analysis shows that SISR recovers the true transformation in a wide range of scenarios and achieves strong support recovery even in high noise. Moreover, we are the first to demonstrate that irrelevant features and inter-feature dependencies can induce a true payoff transformation that deviates substantially from linearity. Extensive experiments demonstrate that SISR stabilizes attributions across payoff schemes and correctly filters irrelevant features; in contrast, standard Shapley values suffer severe rank and sign distortions. By unifying nonlinear transformation estimation with sparsity pursuit, SISR advances the frontier of nonlinear explainability, providing a theoretically grounded and practical attribution framework.
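
The abstract names two building blocks: Pool-Adjacent-Violators (PAV) for isotonic regression and hard-thresholding for support selection. The sketch below shows each in isolation, in plain Python. It is not the SISR algorithm: the way the paper couples and normalizes these steps is not given in the abstract, so plain top-k hard-thresholding is used here as a stand-in, and all function names are hypothetical.

```python
from typing import List, Optional, Sequence


def pava(y: Sequence[float], w: Optional[Sequence[float]] = None) -> List[float]:
    """Weighted least-squares nondecreasing fit via Pool-Adjacent-Violators."""
    weights = [1.0] * len(y) if w is None else [float(v) for v in w]
    blocks: List[List[float]] = []  # each block is [mean, total weight, count]
    for yi, wi in zip(y, weights):
        blocks.append([float(yi), wi, 1])
        # Merge backwards while the last two blocks violate monotonicity.
        while len(blocks) > 1 and blocks[-2][0] > blocks[-1][0]:
            m2, w2, c2 = blocks.pop()
            m1, w1, c1 = blocks.pop()
            total = w1 + w2
            blocks.append([(m1 * w1 + m2 * w2) / total, total, c1 + c2])
    fit: List[float] = []
    for mean, _, count in blocks:
        fit.extend([mean] * int(count))
    return fit


def hard_threshold(v: Sequence[float], k: int) -> List[float]:
    """Keep the k largest-magnitude entries of v and set the rest to zero."""
    order = sorted(range(len(v)), key=lambda i: abs(v[i]), reverse=True)
    keep = set(order[:k])
    return [float(x) if i in keep else 0.0 for i, x in enumerate(v)]


monotone_fit = pava([1.0, 3.0, 2.0, 4.0])                 # pools the violating pair 3, 2
sparse_vector = hard_threshold([0.1, -3.0, 2.0, 0.5], 2)  # keeps -3.0 and 2.0
```

PAV runs in linear time after a single pass with back-merging, which is why it is a common choice when a monotone transformation has to be re-estimated inside an iterative scheme.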

2510.16717 2026-03-10 stat.ME math.ST quant-ph stat.TH

Correlation of divergency: c-delta. Being different in a similar way or not

Johan F. Hoorn

Comments 17 pages, 1 table

2509.02171 2026-03-10 stat.ML cs.LG stat.AP

Synthetic data for ratemaking: imputation-based methods vs adversarial networks and autoencoders

Yevhen Havrylenko, Meelis Käärik, Artur Tuttar

Comments 35 pages, 2 figures, 2 tables

2508.01920 2026-03-10 q-bio.NC q-bio.QM stat.AP

CITS: Nonparametric Statistical Causal Modeling for High-Resolution Neural Time Series

Rahul Biswas, SuryaNarayana Sripada, Somabha Mukherjee, Reza Abbasi-Asl

Comments arXiv admin note: text overlap with arXiv:2312.09604

2505.09496 2026-03-10 stat.ML cs.LG

Reinforcement Learning for Individual Optimal Policy from Heterogeneous Data

Rui Miao, Babak Shahbaba, Annie Qu

2504.20527 2026-03-10 math.OC stat.ML

Adaptive Replication Strategies in Trust-Region-Based Bayesian Optimization of Stochastic Functions

Mickael Binois, Jeffrey Larson

2502.13711 2026-03-10 math.ST math.PR stat.AP stat.TH

On noncentral Wishart mixtures of noncentral Wisharts and their use for testing random effects in factorial design models

Christian Genest, Anne MacKay, Frédéric Ouimet

Comments 12 pages, 0 figures, 2 tables

2501.15163 2026-03-10 cs.LG stat.ML

The Exploration of Error Bounds in Classification with Noisy Labels

Haixia Liu, Boxiao Li, Can Yang, Yang Wang

Comments 21 pages

2501.06024 2026-03-10 stat.ME math.ST stat.TH

Doubly-Robust Functional Average Treatment Effect Estimation

Lorenzo Testa, Tobia Boschi, Francesca Chiaromonte, Edward H. Kennedy, Matthew Reimherr

Comments 19 pages, 2 figures

2501.04959 2026-03-10 econ.EM stat.CO

DisSim-FinBERT: Text Simplification for Core Message Extraction in Complex Financial Texts

Wonseong Kim, Christina Niklaus, Choong Lyol Lee, Siegfried Handschuh

Comments 28 pages, 5 figures, 2 tables

2410.21263 2026-03-10 stat.ME cs.LG math.ST stat.ML stat.TH

Adaptive Transfer Clustering: A Unified Framework

Yuqi Gu, Zhongyuan Lyu, Kaizheng Wang

Comments 72 pages

2409.08838 2026-03-10 stat.ME

Intrinsic Geometry-Based Angular Covariance: A Novel Framework for Nonparametric Changepoint Detection in Meteorological Data

Surojit Biswas, Buddhananda Banerjee, Arnab Kumar Laha

Comments arXiv admin note: text overlap with arXiv:2403.00508

2407.19602 2026-03-10 stat.ME

Metropolis–Hastings with Scalable Subsampling

Estevão Prado, Christopher Nemeth, Chris Sherlock

Comments 78 pages, 14 figures, 9 tables

2407.05110 2026-03-10 math.ST math.OC stat.AP stat.TH

Distributional stability of sparse inverse covariance matrix estimators

Renjie Chen, Huifu Xu, Henryk Zähle

2301.08056 2026-03-10 stat.ME math.PR math.ST stat.TH

Geodesic slice sampling on the sphere

Michael Habeck, Mareike Hasenpflug, Shantanu Kodgirwar, Daniel Rudolf

Comments 38 pages, 10 figures in the main text, 1 table in the appendix, appeared in Journal of Machine Learning Research, 26(297), 1-28, (2025)

2212.14857 2026-03-10 math.ST stat.ME stat.ML stat.TH

Nuisance Function Tuning and Sample Splitting for Optimally Estimating a Doubly Robust Functional

Sean McGrath, Rajarshi Mukherjee

2212.14511 2026-03-10 cs.LG cs.SY eess.SY math.OC stat.ML

Cost-Driven Representation Learning for Linear Quadratic Gaussian Control: Part I

Yi Tian, Kaiqing Zhang, Russ Tedrake, Suvrit Sra

Comments 51 pages; preliminary version appeared in L4DC 2023; this is the extended journal version, with an end-to-end guarantee added

2108.07636 2026-03-10 stat.ML cs.LG

Accounting for shared covariates in semi-parametric Bayesian additive regression trees

Estevão B. Prado, Andrew C. Parnell, Keefe Murphy, Nathan McJames, Ann O'Shea, Rafael A. Moral

Comments 48 pages, 8 tables, 10 figures

DOI: 10.1214/24-AOAS1960
Journal ref: The Annals of Applied Statistics 19 (1) 302 - 328, March 2025

Abstract

We propose some extensions to semi-parametric models based on Bayesian additive regression trees (BART). In the semi-parametric BART paradigm, the response variable is approximated by a linear predictor and a BART model, where the linear component is responsible for estimating the main effects and BART accounts for non-specified interactions and non-linearities. Previous semi-parametric models based on BART have assumed that the set of covariates in the linear predictor and the BART model are mutually exclusive in an attempt to avoid poor coverage properties and reduce bias in the estimates of the parameters in the linear predictor. The main novelty in our approach lies in the way we change the tree-generation moves in BART to deal with this bias and resolve non-identifiability issues between the parametric and non-parametric components, even when they have covariates in common. This allows us to model complex interactions involving the covariates of primary interest, both among themselves and with those in the BART component. Our novel method is developed with a view to analysing data from an international education assessment, where certain predictors of students' achievements in mathematics are of particular interpretational interest. Through additional simulation studies and another application to a well-known benchmark dataset, we also show competitive performance when compared to regression models, alternative formulations of semi-parametric BART, and other tree-based methods. The implementation of the proposed method is available at https://github.com/ebprado/CSP-BART.

1905.01358 2026-03-10 cs.MA stat.AP

Agent based decision making for Integrated Air Defense system

Sumanta Kumar Das, Sumant Mukherjee

Comments 8 pages, 9 figures, 2 tables

2603.07320 2026-03-10 stat.ME

Bayesian repulsive mixture model for multivariate functional data

Ricardo Cunha Pedroso, Fernando Andrés Quintana, Rosangela Helena Loschi

Comments 25 pages, 7 figures

2603.07310 2026-03-10 stat.CO math.PR

A note on diffusive/random-walk behaviour in Metropolis–Hastings algorithms

Yuxin Liu, Peiyi Zhou, Samuel Livingstone

Comments 12 pages, 9 pages of appendix

2603.07288 2026-03-10 stat.ME

Loglinear modelling of huge contingency tables

Veronica Vinciotti, Ernst C. Wit

2603.07276 2026-03-10 cs.CV cs.LG stat.ML

Variational Flow Maps: Make Some Noise for One-Step Conditional Generation

Abbas Mammadov, So Takao, Bohan Chen, Ricardo Baptista, Morteza Mardani, Yee Whye Teh, Julius Berner

2603.07273 2026-03-10 math.ST stat.TH

Maximal Ancillarity, Semiparametric Efficiency, and the Elimination of Nuisances

Marc Hallin, Bas J. M. Werker, Bo Zhou

2603.07247 2026-03-10 math.NA cs.NA stat.CO

Multi-parameter determination in the semilinear Helmholtz equation

Long-Ling Du, Zejun Sun, Li-Li Wang, Guang-Hui Zheng

Comments 26 pages

2603.07230 2026-03-10 stat.ME cs.LG stat.ML

Conditional Rank-Rank Regression via Deep Conditional Transformation Models

Xiaoyi Wang, Long Feng, Zhaojun Wang

2603.07169 2026-03-10 cs.LG stat.ML

Making LLMs Optimize Multi-Scenario CUDA Kernels Like Experts

Yuxuan Han, Meng-Hao Guo, Zhengning Liu, Wenguang Chen, Shi-Min Hu

2603.07132 2026-03-10 math.PR math.ST stat.TH

Quadratic form of heavy-tailed self-normalized random vector with applications in $α$-heavy Marčenko–Pastur law

Zhaorui Dong, Johannes Heiny, Jianfeng Yao

2603.07122 2026-03-10 cs.LG stat.ML

Combining Adam and its Inverse Counterpart to Enhance Generalization of Deep Learning Optimizers

Tao Shi, Liangming Chen, Long Jin, Mengchu Zhou

2603.07114 2026-03-10 physics.soc-ph stat.CO

Robustness and size-dependence of circadian rhythms in multiscale suprachiasmatic-nucleus networks

Youhao Zhuo, Yingpeng Liu, Jiao Wu, Kesheng Xu, Muhua Zheng

Comments 20 pages, 14 figures

2603.07108 2026-03-10 stat.ML cs.LG stat.ME

Deep Generative Spatiotemporal Engression for Probabilistic Forecasting of Epidemics

Rajdeep Pathak, Tanujit Chakraborty

2603.07099 2026-03-10 stat.ME stat.CO

Parametric modal regression for right-censored positive responses

Christian E. Galarza, Víctor H. Lachos

Comments 25 pages, 7 figures, 4 tables. R package available at https://github.com/chedgala/ModalCens

2603.07055 2026-03-10 stat.ME econ.EM math.ST stat.TH

Integrating Heterogeneous Information in Randomized Experiments: A Unified Calibration Framework

Wei Ma, Zeqi Wu, Zheng Zhang

2603.07014 2026-03-10 stat.ME math.ST stat.ML stat.TH

Fréchet regression of multivariate distributions with nonparanormal transport

Junyoung Park, Irina Gaynanova

Comments 62 pages, 4 figures

2603.07005 2026-03-10 cs.LG stat.ML

Combinatorial Allocation Bandits with Nonlinear Arm Utility

Yuki Shibukawa, Koichi Tanaka, Yuta Saito, Shinji Ito

Comments 32 pages

2603.06970 2026-03-10 stat.ME

Deep Probabilistic Spatial Modeling for Multivariate Mixed-Type Responses

Yeseul Jeon, Kyeong Eun Lee, Joon Jin Song

2603.06957 2026-03-10 stat.ML cs.AI cs.LG

Post-Training with Policy Gradients: Optimality and the Base Model Barrier

Alireza Mousavi-Hosseini, Murat A. Erdogdu

Comments 36 pages, 2 figures

2603.06944 2026-03-10 stat.ME

Estimating Complex Densities using Two-Stage Normalizing Flows

Roxana Darvishi, David C. Stenning, Ted von Hippel, Owen G. Ward

Abstract

In many scientific applications, the target probability distribution cannot be evaluated in closed form or sampled from directly. Instead, it can often be decomposed into multiple components, some of which are accessible only through samples generated by simulators or external datasets, while others admit tractable mathematical expressions or are specified through statistical assumptions about variable relationships. Developing inference methods that coherently integrate these heterogeneous sources of information remains an open challenge. In this paper, we propose a Two-Stage Normalizing Flows framework for approximating and sampling from such distributions. The method first learns the densities of components for which only samples are available, and then combines the outputs with the analytically specified terms to reconstruct the full target distribution in a second stage. The resulting model enables both point-wise density evaluation and efficient generation of representative samples, without requiring direct access to the full target density or joint samples from the complete model. We assess the proposed approach through simulation studies in joint density inference and Bayesian hierarchical models with inaccessible likelihoods. The proposed framework is able to accurately recover complex, highly nonlinear target structures using only partial information about the target density, providing stable and flexible approximations in settings where standard modeling assumptions do not hold (or when complete access to the target distribution is not available). Analysis of a large scale astronomy application highlights interesting differences between our method and existing approaches. Our normalizing flows procedure offers a robust and flexible approach to inference for intractable target distributions across both simulated and real-world applications.
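
The two-stage structure described in the abstract can be sketched on a toy problem: first estimate the density of a component available only through samples, then add its log-density to an analytically specified term to obtain the joint log-density. In the sketch below a one-dimensional Gaussian kernel density estimate stands in for the normalizing flow, and the linear-Gaussian conditional, the bandwidth and all names are illustrative assumptions, not the paper's model.

```python
import math
import random
from typing import Callable, Sequence


def fit_gaussian_kde(samples: Sequence[float], bandwidth: float) -> Callable[[float], float]:
    """Stage 1 stand-in: learn log p(x) from samples with a Gaussian KDE."""
    log_norm = math.log(len(samples) * bandwidth * math.sqrt(2.0 * math.pi))

    def log_density(x: float) -> float:
        exponents = [-0.5 * ((x - s) / bandwidth) ** 2 for s in samples]
        peak = max(exponents)  # log-sum-exp for numerical stability
        return peak + math.log(sum(math.exp(e - peak) for e in exponents)) - log_norm

    return log_density


def log_normal_pdf(y: float, mean: float, sd: float) -> float:
    """Analytically specified term: log density of N(mean, sd^2) at y."""
    return -0.5 * ((y - mean) / sd) ** 2 - math.log(sd * math.sqrt(2.0 * math.pi))


def make_joint_log_density(
    log_p_x: Callable[[float], float], slope: float, noise_sd: float
) -> Callable[[float, float], float]:
    """Stage 2: combine the learned log p(x) with the analytic log p(y | x)."""

    def joint(x: float, y: float) -> float:
        return log_p_x(x) + log_normal_pdf(y, slope * x, noise_sd)

    return joint


rng = random.Random(0)
x_samples = [rng.gauss(0.0, 1.0) for _ in range(500)]  # sample-only component
log_p_x = fit_gaussian_kde(x_samples, bandwidth=0.3)
joint = make_joint_log_density(log_p_x, slope=2.0, noise_sd=0.5)
```

The point of the decomposition is that `joint` can be evaluated point-wise even though joint samples of `(x, y)` were never observed; only samples of `x` and a formula for `y` given `x` were needed.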

2603.06941 2026-03-10 math.ST stat.ME stat.TH

Demonstration Experiments

Guido Imbens, Lorenzo Masoero, Alexander Rakhlin, Thomas S. Richardson, Suhas Vijaykumar

2603.06916 2026-03-10 stat.ME stat.AP

Living forwards or understanding backwards? A comparison of Inverse Probability of Treatment Weighting and G-estimation methods for targeting hypothetical full adherence estimands in longitudinal cohort studies

Xiaoran Liang, Deniz Türkmen, Jane A H Masoli, Luke C Pilling, Jack Bowden

2603.06901 2026-03-10 stat.ML cs.LG

Fairness May Backfire: When Leveling-Down Occurs in Fair Machine Learning

Yi Yang, Xiangyu Chang, Pei-yu Chen

Comments Short version of the paper (Nov 20, 2025)

2603.06872 2026-03-10 math.NA cs.NA math.DS stat.ML

Kernel Methods for Some Transport Equations with Application to Learning Kernels for the Approximation of Koopman Eigenfunctions: A Unified Approach via Variational Methods, Green's Functions and the Method of Characteristics

Boumediene Hamzi, Houman Owhadi, Umesh Vaidya

2603.06851 2026-03-10 stat.ML cs.GT cs.LG

Bilateral Trade Under Heavy-Tailed Valuations: Minimax Regret with Infinite Variance

Hangyi Zhao

Comments 9 pages

2603.06826 2026-03-10 stat.ML cs.LG stat.ME

CREDO: Epistemic-Aware Conformalized Credal Envelopes for Regression

Luben M. C. Cabezas, Sabina J. Sloman, Bruno M. Resende, Fanyi Wu, Michele Caprio, Rafael Izbicki

Comments 26 pages, 5 figures

2603.06715 2026-03-10 q-bio.PE stat.CO

Understanding and Managing Frogeye Leaf Spot through Network-Based Modeling in Soybean

Chinthaka Weerarathna, Thien-Minh Le, Jin Wang

Comments 22 pages, 7 figures, 3 tables

2603.06616 2026-03-10 cs.LG cs.AI math.ST stat.TH

RACER: Risk-Aware Calibrated Efficient Routing for Large Language Models

Sai Hao, Hao Zeng, Hongxin Wei, Bingyi Jing

2603.01198 2026-03-10 eess.SY cs.SY stat.AP

Digital Twin-Based Cooling System Optimization for Data Center

Shrenik Jadhav, Zheng Liu

Comments 30 pages, 8 figures

2603.00827 2026-03-10 math.ST stat.TH

Minimax convergence rates of a binary plug-in type classification procedure for time-homogeneous SDE paths under low-noise conditions

Eddy Michel Ella-Mintsa

Comments 55 pages

2603.00202 2026-03-10 stat.ML cs.LG math.PR

The Partition Principle Revisited: Non-Equal Volume Designs Achieve Minimal Expected Star Discrepancy

Xiaoda Xu

Comments Wrong in critical steps

2602.23355 2026-03-10 stat.ME

Robust model selection using likelihood as data

Jongwoo Choi, Neil A. Spencer, Jeffrey W. Miller

2602.20912 2026-03-10 stat.AP

A Corrected Welch Satterthwaite Equation. And: What You Always Wanted to Know About Kish's Effective Sample but Were Afraid to Ask

Matthias von Davier

Comments 16 pages

2602.00784 2026-03-10 q-fin.RM math.LO math.PR math.ST q-fin.MF stat.TH

Non-standard analysis for coherent risk estimation: hyperfinite representations, discrete Kusuoka formulae, and plug-in asymptotics

Tomasz Kania

Comments 42 pp

2601.17205 2026-03-10 stat.ME

Bayesian Inference for Discrete Markov Random Fields Through Coordinate Rescaling

Giuseppe Arena, Maarten Marsman

2601.16120 2026-03-10 stat.ML cs.LG stat.ME

Synthetic Augmentation in Imbalanced Learning: When It Helps, When It Hurts, and How Much to Add

Zhengchi Ma, Anru R. Zhang

2601.02275 2026-03-10 eess.SY cs.SY stat.AP

Machine Learning Guided Cooling System Optimization for Data Center

Shrenik Jadhav, Zheng Liu

Comments 11 pages, 11 figures

2511.20968 2026-03-10 stat.CO

SVEMnet: An R package for Self-Validated Elastic-Net Ensembles and Multi-Response Optimization in Small-Sample Mixture-Process Experiments

Andrew T. Karl

2511.19525 2026-03-10 cs.LG cs.CV stat.ML

Shortcut Invariance: Targeted Jacobian Regularization in Disentangled Latent Space

Shivam Pal, Sakshi Varshney, Piyush Rai

2511.04060 2026-03-10 math.ST stat.TH

A Unified Graphical Criterion for Characterizing a Linear Causal Interpretation of Partial Regression Coefficients

Masato Shimokawa

Comments v6: Added Theorem 3.7. v7: Corrected a typo in Definition 2.5. v8: Changed the title. Added a Discussion section. Removed Lemma 5.9. The correction does not affect the main results. v9: Focused the discussion on the main theme

2511.01040 2026-03-10 stat.OT stat.AP stat.CO

From Structural Equation Modeling to Targeted Learning: A Tutorial Introduction to Targeted Maximum Likelihood Estimation for SEM Researchers

Junjie Ma, Xiaoya Zhang, Guangye He, Yuting Han, Ting Ge, Feng Ji

2510.23745 2026-03-10 stat.ML cs.LG

Bayesian neural networks with interpretable priors from Mercer kernels

Alex Alberts, Ilias Bilionis

Comments Published in Computer Methods in Applied Mechanics and Engineering

2510.03449 2026-03-10 stat.ME stat.AP stat.CO

Bayesian Transfer Learning for High-Dimensional Linear Regression via Adaptive Shrinkage

Parsa Jamshidian, Donatello Telesca

2509.26112 2026-03-10 stat.AP

Shotgun DNA sequencing evidence: sample-specific and unknown genotyping error probabilities

Mikkel Meyer Andersen

Comments Handling multiple markers (including adding maximising profile likelihood) in Methods and reworked Results as a consequence

Details

DOI: 10.1016/j.fsigen.2026.103474

English abstract

Many forensic genetic trace samples are of too low quality to obtain short tandem repeat (STR) DNA profiles because the nuclear DNA they contain is highly degraded (e.g., telogen hairs). Instead, shotgun DNA sequencing of such samples can provide valuable information on, e.g., single nucleotide polymorphism (SNP) markers. As a result, shotgun sequencing is starting to gain more attention in forensic genetics, and statistical models that correctly interpret such evidence, including properly accounting for sequencing errors, are needed. One such model is the wgsLR model by Andersen et al. (2025), which enables evaluating the evidential strength of a comparison between the genotypes in the trace sample and the reference sample, assuming a single-source contribution to both samples. This paper extends the wgsLR model to allow for different (asymmetric) genotyping error probabilities (e.g., from a low-quality trace sample and a high-quality reference sample). The model is also extended to handle unknown genotyping error probabilities, both by maximising the profile likelihood and by using a prior distribution. The sensitivity of the wgsLR model to overdispersion was also investigated, and the model was found to be robust against it. Given a sufficient number of independent markers, both methods for handling an unknown trace sample genotyping error probability gave concordant weight of evidence (WoE) under both hypotheses (the same or different individuals being donors of the trace and reference samples). Using a trace sample genotyping error probability that is too small was found to be more conservative than using one that is too high, as the latter can explain genotype inconsistencies by errors rather than by two different individuals being the donors of the trace sample and the reference sample. The extensions of the model are implemented in the R package wgsLR.

2509.02937 2026-03-10 math.OC cs.LG stat.ML

Faster Gradient Methods for Highly-Smooth Stochastic Bilevel Optimization

Lesi Chen, Junru Li, El Mahdi Chayti, Jingzhao Zhang

Comments ICLR 2026; Add one additional author compared to v1

2506.18562 2026-03-10 stat.ME

Multi-Rank Subspace Change-Point Detection for Monitoring Robotic Swarms

Jonghyeok Lee, Yao Xie, Youngser Park, Jason Hindes, Ira Schwartz, Carey Priebe

2505.13564 2026-03-10 cs.LG stat.ML

Online Decision-Focused Learning

Aymeric Capitaine, Maxime Haddouche, Eric Moulines, Michael I. Jordan, Etienne Boursier, Alain Durmus

2505.04957 2026-03-10 math.ST stat.ME stat.TH

The Poisson tensor completion parametric estimator

Daniel M. Dunlavy, Richard B. Lehoucq, Carolyn D. Mayer, Arvind Prasadan

Comments 19 pages, 9 figures

2505.00940 2026-03-10 cs.LG math.OC stat.CO stat.ME

StablePCA: Distributionally Robust Learning of Shared Representations from Multi-Source Data

Zhenyu Wang, Molei Liu, Jing Lei, Francis Bach, Zijian Guo

2502.07937 2026-03-10 cs.LG stat.ML

Active Advantage-Aligned Online Reinforcement Learning with Offline Data

Xuefeng Liu, Hung T. C. Le, Siyu Chen, Rick Stevens, Zhuoran Yang, Matthew R. Walter, Yuxin Chen

2410.13744 2026-03-10 stat.ME q-bio.MN

Inferring the dynamics of quasi-reaction systems via nonlinear local mean-field approximations

Matteo Framba, Veronica Vinciotti, Ernst C. Wit

2408.06710 2026-03-10 cs.LG cs.AI stat.ML

Variational Learning of Gaussian Process Latent Variable Models through Stochastic Gradient Annealed Importance Sampling

Jian Xu, Shian Du, Junmei Yang, Qianli Ma, Delu Zeng, John Paisley

2408.00329 2026-03-10 cs.LG cs.AI math.OC stat.ML

OTAD: An Optimal Transport-Induced Robust Model for Agnostic Adversarial Attack

Kuo Gai, Sicong Wang, Shihua Zhang

Comments 15 pages, 2 figures

2407.16786 2026-03-10 stat.ME

Causal generalized linear models via Pearson risk invariance

Alice Polinelli, Veronica Vinciotti, Ernst C. Wit

2406.14380 2026-03-10 econ.EM cs.LG stat.ME

Estimating Treatment Effects under Algorithmic Interference: A Structured Neural Networks Approach

Ruohan Zhan, Shichao Han, Yuchen Hu, Zhenling Jiang

2406.09055 2026-03-10 stat.ME

Relational event models with global covariates

Melania Lembo, Rūta Juozaitienė, Veronica Vinciotti, Ernst C. Wit

2404.12556 2026-03-10 stat.CO

Bias- and Variance-Aware Probabilistic Rounding Error Analysis for Floating-Point Arithmetic

Sahil Bhola, Karthik Duraisamy

2302.00941 2026-03-10 cs.GT stat.ML

A Robust Multi-Item Auction Design with Statistical Learning

Jiale Han, Xiaowu Dai

2012.09828 2026-03-10 math.ST stat.TH

Nonparametric two-sample hypothesis testing for low-rank random graphs of differing sizes

Joshua Agterberg, Minh Tang, Carey Priebe

2010.01388 2026-03-10 cs.LG cs.AI stat.ML

Online Neural Networks for Change-Point Detection

Mikhail Hushchyn, Kenenbek Arzymatov, Denis Derkach

Comments This version of the article has been submitted to the journal but is not the Version of Record and does not reflect peer-review improvements, post-acceptance improvements, or any corrections. The Version of Record is available online at: https://doi.org/10.1007/s10994-026-07000-6