arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.20492 2026-04-23 stat.ML cs.IT cs.LG math.IT

Decentralized Machine Learning with Centralized Performance Guarantees via Gibbs Algorithms

Yaiza Bermudez, Samir Perlaza, Iñaki Esnaola

Comments In Proceedings of the International Symposium on Information Theory (ISIT), 2026

2604.20446 2026-04-23 cs.LG stat.ML

The Origin of Edge of Stability

Elon Litman

2604.20445 2026-04-23 stat.AP

Assessing the Shortfall Risk of GB Electricity Grid using Shifts in Winter Weather Conditions

Aninda Bhattacharya, Chris J. Dent, Amy L. Wilson, Gabriele C. Hegerl

Comments Pre-print Submitted to Applied Energy

2604.20422 2026-04-23 math.ST math.PR stat.TH

Likelihood-based inference for birth-death processes with composite birth mechanisms

Marko Lalovic, Nicos Georgiou, Istvan Z. Kiss

Comments 32 pages, 8 figures

2604.20416 2026-04-23 stat.AP

SHARELIFE Imputations

Giuseppe De Luca, Paolo Li Donni

Comments 84 pages (including 32 pages of appendices and 22 figures)

2604.20414 2026-04-23 math.ST stat.ME stat.ML stat.TH

Fast and Provably Accurate Sequential Designs using Hilbert Space Gaussian Processes

Huanyan Zhu, Cheng Li

2604.20409 2026-04-23 cs.LG stat.ML

Calibrating conditional risk

Andrey Vasilyev, Yikai Wang, Xiaocheng Li, Guanting Chen

2604.20370 2026-04-23 cs.LG stat.ML

Cold-Start Forecasting of New Product Life-Cycles via Conditional Diffusion Models

Ruihan Zhou, Zishi Zhang, Jinhui Han, Yijie Peng, Xiaowei Zhang

2604.20341 2026-04-23 physics.geo-ph stat.AP

Extrapolation from historical data cannot reliably predict the time of a potential AMOC collapse

Andreas Morr, Maya Ben-Yami, Brian Groenke, Christof Schötz, Alessandro Cotronei, Eirik Myrvoll-Nilsen, Sebastian Bathiany, Martin Rypdal, Niklas Boers

2604.20322 2026-04-23 stat.ME

Zero-Inflated Logistic Regression Models with Shared Design: Identifiability, Existence of Estimates, and a Relabeling Rule

Yui Tomo, Shinto Eguchi, Daisuke Yoneoka

2604.20301 2026-04-23 stat.ML cs.LG stat.CO stat.ME

Properties and limitations of geometric tempering for gradient flow dynamics

Francesca Romana Crucinio, Sahani Pathiraja

Comments Accepted at TMLR https://openreview.net/forum?id=IP0w5LdcxC

2604.20296 2026-04-23 stat.ML cs.LG

Online Survival Analysis: A Bandit Approach under Cox PH Model

Yang Xu, Wenbin Lu, Rui Song

2604.20285 2026-04-23 stat.AP stat.OT

Time-dependent structural equation modeling of fans' football fever using activity tracking data during the 2025 DFB Cup final

Jonas Bauer, Christiane Fuchs, Tamara Schamberger

2604.20276 2026-04-23 cs.LG stat.ML

Rethinking Intrinsic Dimension Estimation in Neural Representations

Rickmer Schulte, David Rügamer

Comments Accepted at the 29th International Conference on Artificial Intelligence and Statistics (AISTATS) 2026

2604.20238 2026-04-23 math.ST stat.TH

Bayesian approaches to non- and semiparametric density estimation [with a rejoinder to my discussants]

Nils Lid Hjort

Comments 29 pages, no figures. Statistical Research Report, Department of Mathematics, University of Oslo, 1995; invited discussion paper for the Fifth Valencia Meeting on Bayesian Statistics. Published version in "Bayesian Statistics" (1995, eds. J.M. Bernardo, J.O. Berger, A.P. Dawid, and A.F. Smith), Proceedings of the Fifth Valencia International Meeting (vol. 5), 223-254

2604.20219 2026-04-23 cs.LG cs.NA math.NA stat.ML

Geometric Layer-wise Approximation Rates for Deep Networks

Shijun Zhang, Zuowei Shen, Yuesheng Xu

2604.20161 2026-04-23 cs.LG stat.ME stat.ML

SMART: A Spectral Transfer Approach to Multi-Task Learning

Boxin Zhao, Mladen Kolar, Jinchi Lv

Comments 53 pages, 4 figures, 1 table

2604.20115 2026-04-23 cs.LG cs.AI stat.ML

On the Stability and Generalization of First-order Bilevel Minimax Optimization

Xuelin Zhang, Peipei Yuan

2604.20111 2026-04-23 cs.LG cs.AI stat.ML

Meta Additive Model: Interpretable Sparse Learning With Auto Weighting

Xuelin Zhang, Xinyue Liu, Lingjuan Wu, Hong Chen

2604.20072 2026-04-23 math.ST stat.ME stat.TH

Vertex misalignment and changepoint localization in network time series

Tianyi Chen, Mohammad Sharifi Kiasari, Sijing Yu, Youngser Park, Avanti Athreya, Vince Lyzinski, Carey E Priebe, Zachary Lubberts

Comments 52 pages, 11 figures, 3 tables

2604.20069 2026-04-23 stat.AP

Bayesian inference for disease transmission models informed by viral dynamics

Dylan J. Morris, Lauren Kennedy, Andrew J. Black

Comments 35 pages, 13 figures

2604.20045 2026-04-23 stat.ME

A general nonparametric framework for testing hypotheses about function-valued parameters

Albert Osom, Ali Shojaie, Aaron Hudson

2604.20016 2026-04-23 stat.ME math.ST stat.TH

Weighted Holm Procedures: Theory, Properties, and Recommendations

Beibei Li, Wenge Guo

Comments 35 pages, 5 figures, 2 tables

2604.19996 2026-04-23 stat.ME stat.AP

Meta-analysis of networks of diagnostic tests with binary and continuous results

Efthymia Derezea, Gabriel Rogers, Nicky J Welton, Hayley E Jones

2604.19977 2026-04-23 stat.ME

Constructing external comparator groups via transportability in mean or in effect measure

Lawson Ung, Guanbo Wang, Sebastien Haneuse, Sonia Hernandez-Diaz, Miguel A. Hernán, Issa J. Dahabreh

详情

英文摘要

Learning about causal effects in target populations and their subsets may be facilitated by combining information from multiple sources. One major class of study designs that combine information involves appending an index study with data from an external comparator, which may facilitate head-to-head comparisons of treatments initially studied in different populations. We delineate external comparator analyses under two distinct, but related, identification strategies. The first strategy relies on exchangeability (transportability) of potential outcome means, which uses information only on the treatments that are to be compared. The second strategy relies on transportability in effect measure, requiring additional use of information on a third treatment common to the populations that have been combined. In a time-fixed setting with a point treatment and non-failure time outcome, we examine identification and estimation under a basic setup where information from an index trial is combined with a second, and external to the index trial, data source. We propose estimators for identifying observed data functionals, with a particular focus on semiparametric efficient augmented weighting estimators that incorporate models for the probability of trial participation, the probability of treatment, and conditional outcome means. We derive the asymptotic properties of these augmented weighting estimators -- including robustness to model misspecification and slower rates of convergence for some nuisance function models -- and use simulation to compare their finite sample performance to estimators based only on outcome modeling or weighting. Last, we provide a practical demonstration of the proposed methods by combining the ACCEPT and PHOENIX 1 randomized trials to evaluate the effect of various biologic agents on plaque psoriasis, a chronic inflammatory disorder.

URL PDF HTML ☆

赞 0 踩 0

2604.19972 2026-04-23 stat.ME

Principal Nested Cones

Yanyan Zhan, Ian L. Dryden, Yuexuan Wu

2604.19841 2026-04-23 stat.AP cs.LG

Spatio-temporal modelling of electric vehicle charging demand

Kaoutar Bouaachra, Yvenn Amara-Ouali, Yannig Goude, Raphaël Lachieze-Rey

Comments 18 pages, 19 figures

2604.12783 2026-04-23 stat.ME econ.EM

A Bayes-Factor-Guided Approach to Post-Double Selection with Bootstrapped Multiple Imputation

Johannes Bleher, Claudia Tarantola

Comments 33 pages, 8 figures, 11 tables

2604.12694 2026-04-23 stat.CO

Adaptive Sparse Group Lasso Penalized Quantile Regression via Dual ADMM

Huayan Kou, Yuwen Gu, Yi Lian, Rui Zhang, Jun Fan

2604.02219 2026-04-23 hep-ph hep-ex physics.data-an stat.ME

Many Wrongs Make a Right: Leveraging Biased Simulations Towards Unbiased Parameter Inference

Ezequiel Alvarez, Sean Benevedes, Manuel Szewc, Jesse Thaler

Comments 29 pages, 18 figures, 1 table, code available at https://github.com/sequi76/TAMM and data products available at https://zenodo.org/records/19341120 v2: version to be submitted

2603.29316 2026-04-23 stat.AP

A Bayesian Finite Mixture Model Approach for Mixed-type Data Clustering and Variable Selection with Censored Biomarkers

Yueting Wang, Shu Wang, Jonathan G. Yabes, Chung-Chou H. Chang

Comments 55 pages (including 17-page Appendices), 8 figures (including 1 figure in Appendix B)

2602.19774 2026-04-23 stat.AP

Spatio-temporal modeling of urban extreme rainfall events at high resolution

Chloé Serre-Combe, Nicolas Meyer, Thomas Opitz, Gwladys Toulemonde

2512.12463 2026-04-23 stat.ML cs.LG math.ST stat.TH

Understanding Overparametrization in Survival Models through Interpolation

Yin Liu, Jianwen Cai, Didong Li

2512.12325 2026-04-23 cs.LG math.ST stat.ML stat.TH

Eventually LIL Regret: Almost Sure $\ln\ln T$ Regret for a sub-Gaussian Mixture on Unbounded Data

Shubhada Agrawal, Aaditya Ramdas

Comments Published at ALT 2026

2511.23156 2026-04-23 stat.AP

Design loads for wave impacts -- introducing the Probabilistic Adaptive Screening (PAS) method for predicting extreme non-linear loads on maritime structures

Sanne M. van Essen, Harleigh C. Seyffert

详情

DOI: 10.1016/j.oceaneng.2026.125440
Journal ref: van Essen, S.M. and Seyffert, H.C. (2026). Design loads for wave impacts - The Probabilistic Adaptive Screening (PAS) method for extreme non-linear hydrodynamic loads and responses of maritime structures. Ocean Eng., 357p2, 125440

英文摘要

Wave impact loads on maritime structures can cause casualties, damage, pollution and operational delays. Consequently, their extreme values should be accounted for in the design of these structures. However, this is challenging, as wave impact events are both rare and highly complex, requiring both high-fidelity simulations and long analysis durations to reliably quantify the associated design loads. Moreover, existing extreme value prediction methods are neither specifically developed nor adequately validated for wave impact phenomena. We therefore introduce the new Probabilistic Adaptive Screening (PAS) method for predicting extreme non-linear loads on maritime structures. The method integrates copula-based statistical dependence modelling with multi-fidelity screening and adaptive sampling. This framework enables efficient extreme value prediction by statistically mapping low-fidelity indicator variables to high-fidelity impact loads. The method allows for efficient linear potential flow indicators to be used in the low-fidelity stage, even for strongly non-linear cases. Its statistical framework is validated against four non-linear test cases, including non-linear waves, ship vertical bending moments, green water impact loads, and slamming loads. It is concluded that PAS with optimal settings accurately estimates both the short-term distributions and extreme values in these test cases, with most probable maximum (MPM) values within 2-15% of the reference brute-force Monte-Carlo Simulation (MCS) results. In addition, PAS achieves this performance very efficiently, requiring in the order of 1-3% of the high-fidelity simulation time needed for conventional MCS. These results demonstrate that PAS can reliably reproduce the statistics of both weakly and strongly non-linear extreme load problems, while significantly reducing the associated computational cost compared to MCS.

URL PDF HTML ☆

赞 0 踩 0

2510.13233 2026-04-23 stat.ME stat.CO

Scalable Bayesian inference for high-dimensional mixed-type multivariate spatial data

Arghya Mukherjee, Arnab Hazra, Dootika Vats

Comments 52 pages, 8 figures, 13 tables

2510.09902 2026-04-23 math.HO stat.ML

If you can distinguish, you can express: Galois theory, Stone--Weierstrass, machine learning, and linguistics

Ben Blum-Smith, Claudia Brugman, Thomas Conners, Soledad Villar

Comments Added a section that engages with relevant recent work

2510.08465 2026-04-23 stat.ML cs.LG

Accumulated Aggregated D-Optimal Designs for Estimating Main Effects in Black-Box Models

Chih-Yu Chang, Ming-Chung Chang

2510.03665 2026-04-23 stat.ME stat.CO

Efficient Log-Rank Updates for Random Survival Forests

Erik Sverdrup, James Yang, Michael LeBlanc

2509.19367 2026-04-23 eess.SP cs.LG stat.ML

Low-Cost Sensor Fusion Framework for Organic Substance Classification and Quality Control Using Classification Methods

Borhan Uddin Chowdhury, Damian Valles, Md Raf E Ul Shougat

Comments Copyright 2025 IEEE. This is the author's version of the work accepted for publication in FMLDS 2025. The final version will be published by IEEE and available via DOI (to be inserted when available). Accepted at FMLDS 2025, to appear in IEEE Xplore. 8 pages, 17 figures, 3 tables

2509.17260 2026-04-23 q-bio.NC cs.OH stat.AP

A tutorial on electrogastrography using low-cost hardware and open-source software

Evgeniya Anisimova, Sameer N. B. Alladin, Styliani Tsamaz, Edwin S. Dalmaijer

2508.18948 2026-04-23 hep-th cond-mat.dis-nn cs.LG stat.ML

Gauge-covariant stochastic neural fields: Stability and finite-width effects

Rodrigo Carmo Terin

Comments 20 pages, 2 figures, 1 table. Accepted version for publication in Scientific Reports

2508.17761 2026-04-23 cs.LG stat.ML

Evaluating the Quality of the Quantified Uncertainty for (Re)Calibration of Data-Driven Regression Models

Jelke Wibbeke, Nico Schönfisch, Sebastian Rohjans, Andreas Rauh

详情

DOI: 10.1016/j.ijar.2026.109685
Journal ref: International Journal of Approximate Reasoning, Volume 195, 2026, 109685, ISSN 0888-613X

英文摘要

In safety-critical applications data-driven models must not only be accurate but also provide reliable uncertainty estimates. This property, commonly referred to as calibration, is essential for risk-aware decision-making. In regression a wide variety of calibration metrics and recalibration methods have emerged. However, these metrics differ significantly in their definitions, assumptions and scales, making it difficult to interpret and compare results across studies. Moreover, most recalibration methods have been evaluated using only a small subset of metrics, leaving it unclear whether improvements generalize across different notions of calibration. In this work, we systematically extract and categorize regression calibration metrics from the literature and benchmark these metrics independently of specific modelling methods or recalibration approaches. Through controlled experiments with real-world, synthetic and artificially miscalibrated data, we demonstrate that calibration metrics frequently produce conflicting results. Our analysis reveals substantial inconsistencies: many metrics disagree in their evaluation of the same recalibration result, and some even indicate contradictory conclusions. This inconsistency is particularly concerning as it potentially allows cherry-picking of metrics to create misleading impressions of success. We identify the Expected Normalized Calibration Error (ENCE) and the Coverage Width-based Criterion (CWC) as the most dependable metrics in our tests. Our findings highlight the critical role of metric selection in calibration research.

URL PDF HTML ☆

赞 0 踩 0

2508.03059 2026-04-23 stat.ME stat.CO stat.ML

Two-sample comparison through additive tree models for density ratios

Naoki Awaya, Yuliang Xu, Li Ma

2508.02569 2026-04-23 stat.AP

Understanding Heterogeneity in Adaptation to Intermittent Water Supply: Clustering Household Types in Amman, Jordan

Shreyas Gadge, Vítor V. Vasconcelos, André de Roos, Elisabeth H. Krueger

2507.18279 2026-04-23 math.ST math.AP math.DS math.PR stat.TH

Data assimilation with the 2D Navier-Stokes equations: Optimal Gaussian asymptotics for the posterior measure

Dimitri Konen, Richard Nickl

2507.16433 2026-04-23 stat.ME cs.LG

Adaptive Multi-task Learning for Multi-sector Portfolio Optimization

Qingliang Fan, Ruike Wu, Yanrong Yang

2506.20910 2026-04-23 math.OC cs.LG stat.ML

Faster Fixed-Point Methods for Multichain MDPs

Matthew Zurek, Yudong Chen

2506.20904 2026-04-23 cs.LG cs.IT math.IT math.OC stat.ML

Optimal Single-Policy Sample Complexity and Transient Coverage for Average-Reward Offline RL

Matthew Zurek, Guy Zamir, Yudong Chen

2506.16658 2026-04-23 math.ST cs.LG stat.ML stat.TH

Multi-Armed Bandits With Machine Learning-Generated Surrogate Rewards

Wenlong Ji, Yihan Pan, Ruihao Zhu, Lihua Lei

详情

英文摘要

Multi-armed bandit (MAB) is a widely adopted framework for sequential decision-making under uncertainty. Traditional bandit algorithms rely solely on online data, which tends to be scarce as it must be gathered during the online phase when the arms are actively pulled. However, in many practical settings, rich auxiliary data, such as covariates of past users, is available prior to deploying any arms. We introduce a new setting for MAB where pre-trained machine learning (ML) models are applied to convert side information and historical data into \emph{surrogate rewards}. A prominent challenge of this setting is that the surrogate rewards may exhibit substantial bias, as true reward data is typically unavailable in the offline phase, forcing ML predictions to heavily rely on extrapolation. To address the issue, we propose the Machine Learning-Assisted Upper Confidence Bound (MLA-UCB) algorithm, which can be applied to any reward prediction model and any form of auxiliary data. When the predicted and true rewards are jointly Gaussian, it provably improves the cumulative regret, even in cases where the mean surrogate reward completely misaligns with the true mean rewards, and achieves the asymptotic optimality among a broad class of policies. Notably, our method requires no prior knowledge of the covariance matrix between true and surrogate rewards. We further extend the method to a batched reward MAB problem, where each arm pull yields a batch of observations and rewards may be non-Gaussian, and we derive computable confidence bounds and regret guarantees that improve upon classical UCB algorithms. Finally, extensive simulations with both Gaussian and ML-generated surrogates, together with real-world studies on language model selection and video recommendation, demonstrate consistent and often substantial regret reductions with moderate offline surrogate sample sizes and correlations.

URL PDF HTML ☆

赞 0 踩 0

2506.14103 2026-04-23 stat.ME q-bio.QM

A Robust Nonparametric Framework for Detecting Repeated Spatial Patterns

Rajitha Senanayake, Pratheepa Jeganathan

Comments 39 pages including an Appendix of 17 pages, 39 figures

2506.02276 2026-04-23 cs.LG stat.ML

Latent Stochastic Interpolants

Saurabh Singh, Dmitry Lagun

Comments Accepted at ICLR 2026 as a conference paper

2505.17803 2026-04-23 stat.ME

Anytime-valid simultaneous lower confidence bounds for the true discovery proportion

Friederike Preusse

2504.10530 2026-04-23 math.PR stat.CO

Efficient Rare-Event Simulation for Random Geometric Graphs via Importance Sampling

Sarat Moka, Christian Hirsch, Volker Schmidt, Dirk Kroese

Comments 29 Pages, 2 figures

2503.16744 2026-04-23 stat.ME stat.AP

Modeling and forecasting subnational age distribution of death counts

Han Lin Shang, Cristian F. Jiménez-Varón

Comments 45 pages, 9 figures, 7 tables

2502.06151 2026-04-23 cs.LG cs.AI stat.ML

Recency Biased Causal Attention for Time-series Forecasting

Kareem Hegazy, Michael W. Mahoney, N. Benjamin Erichson

2412.07999 2026-04-23 math.ST math.PR stat.ML stat.TH

Fast Mixing of Data Augmentation Algorithms: Bayesian Probit, Logit, and Lasso Regression

Holden Lee, Kexin Zhang

Comments 48 pages, 8 figures; Refined theorem statements and simulations

2410.18880 2026-04-23 math.ST math.PR stat.TH

Can we spot a fake?

Shahar Mendelson, Grigoris Paouris, Roman Vershynin

Comments 13 pages. A few typos corrected

2409.18198 2026-04-23 stat.AP

Estimating soil carbon sequestration potential and approximating optimal management policies

Jacob Spertus, Eric Slessarev, Whendee Silver, Philip Stark

Comments 26 pages, 6 figures, 1 table

2408.00920 2026-04-23 cs.LG stat.ML

Towards Certified Unlearning for Deep Neural Networks

Binchi Zhang, Yushun Dong, Tianhao Wang, Jundong Li

Comments ICML 2024 (errata)

2406.05262 2026-04-23 stat.AP

A Three-groups Non-local Model for Combining Heterogeneous Data Sources to Identify Genes Associated with Parkinson's Disease

Troy P. Wixson, Benjamin A. Shaby, Daisy L. Philtron, International Parkinson Disease Genomics Consortium, Leandro A. Lima, Stacia K. Wyman, Julia A. Kaye, Steven Finkbeiner

Comments 26 pages, 6 figures, 4 tables. This version includes the supplementary materials. Author version. Accepted for publication in Biometrics (04-2026)

2402.13103 2026-04-23 cs.LG math.ST stat.TH

Multivariate Functional Linear Discriminant Analysis for the Classification of Short Time Series with Missing Data

Rahul Bordoloi, Clémence Réda, Orell Trautmann, Saptarshi Bej, Olaf Wolkenhauer

2312.17015 2026-04-23 stat.ME

Regularized Exponentially Tilted Empirical Likelihood for Bayesian Inference

Eunseop Kim, Steven N. MacEachern, Mario Peruggia

2105.09232 2026-04-23 cs.LG math.ST stat.TH

Diffusion Approximations for Thompson Sampling in the Small Gap Regime

Lin Fan, Peter W. Glynn

1810.11624 2026-04-23 cs.LG stat.ML

Time series clustering based on the characterisation of segment typologies

David Guijo-Rubio, Antonio Manuel Durán-Rosal, Pedro Antonio Gutiérrez, Alicia Troncoso, César Hervás-Martínez

Comments 13 pages, 7 figures, 4 tables, 57 refs

详情

DOI: 10.1109/TCYB.2019.2962584
Journal ref: IEEE Transactions on Cybernetics ( Volume: 51, Issue: 11, November 2021)

英文摘要

Time series clustering is the process of grouping time series with respect to their similarity or characteristics. Previous approaches usually combine a specific distance measure for time series and a standard clustering method. However, these approaches do not take the similarity of the different subsequences of each time series into account, which can be used to better compare the time series objects of the dataset. In this paper, we propose a novel technique of time series clustering based on two clustering stages. In a first step, a least squares polynomial segmentation procedure is applied to each time series, which is based on a growing window technique that returns different-length segments. Then, all the segments are projected into same dimensional space, based on the coefficients of the model that approximates the segment and a set of statistical features. After mapping, a first hierarchical clustering phase is applied to all mapped segments, returning groups of segments for each time series. These clusters are used to represent all time series in the same dimensional space, after defining another specific mapping process. In a second and final clustering stage, all the time series objects are grouped. We consider internal clustering quality to automatically adjust the main parameter of the algorithm, which is an error threshold for the segmenta- tion. The results obtained on 84 datasets from the UCR Time Series Classification Archive have been compared against two state-of-the-art methods, showing that the performance of this methodology is very promising.

URL PDF HTML ☆

赞 0 踩 0

2604.20832 2026-04-23 math.OC stat.ME

Solving Minimax Problems with Bilinear Objectives with ADMM

Bob Wilson

Comments 9 pages, 1 figure (color)

2604.20788 2026-04-23 math.ST stat.TH

The E-measure

Nick W. Koning

2604.20761 2026-04-23 stat.ML stat.ME

Geometric Renyi Differential Privacy: Ricci Curvature Characterized by Heat Diffusion Mechanisms

Xiaotian Chang, Yangdi Jiang, Cyrus Mostajeran, Qirui Hu

2604.20743 2026-04-23 stat.ME stat.CO

ProfileGLMM: a R Package Extending Bayesian Profile Regression using Generalised Linear Mixed Models

Matteo Amestoy, Mark A. van de Wiel, Wessel N. van Wieringen

2602.19201 2026-04-23 econ.EM stat.ME

Panel Quantile Regression with Common Shocks

Harold D. Chiang, Antonio F. Galvao, Chia-Min Wei

2601.06674 2026-04-23 math.ST math.PR stat.TH

Reduction and classification of higher-order Markov chains

Christophe Gallesco, Caio Teodore Genovese Huss Oliveira, Daniel Yasumasa Takahashi

Comments 9 pages, 5 figures

2512.05070 2026-04-23 stat.ML cs.LG

Control Consistency Losses for Diffusion Bridges

Samuel Howard, Nikolas Nüsken, Jakiw Pidstrigach

2511.18201 2026-04-23 stat.ME

Spatial deformation in a Bayesian spatiotemporal model for incomplete matrix-variate responses

Rodrigo de Souza Bulhões, Marina Silva Paez, Dani Gamerman

Comments Submitted to Environmental and Ecological Statistics

2409.07609 2026-04-23 cs.CR cs.CV cs.LG stat.AP

Survival of the Cheapest: Cost-Aware Hardware Adaptation for Adversarial Robustness

Charles Meyers, Mohammad Reza Saleh Sedghpour, Tommy Löfstedt, Erik Elmroth

2406.03302 2026-04-23 stat.ME math.ST stat.TH

Identification strategies for combining an experimental study with external data

Lawson Ung, Guanbo Wang, Sebastien Haneuse, Miguel A. Hernán, Issa J. Dahabreh

Comments This is an update of the original submission

2404.16746 2026-04-23 stat.ME math.ST stat.ML stat.TH

Estimating the Number of Components in Finite Mixture Models via Variational Approximation

Chenyang Wang, Yun Yang

2604.20667 2026-04-23 stat.ME

Data Integration for Estimating Subgroup-Specific Conditional Average Treatment Effects (CATEs) Using Coarsened External Information in Randomized Trials

Youqi Yang, Walter Dempsey, Bhramar Mukherjee

Comments 25 pages, 4 figures

2604.20632 2026-04-23 astro-ph.IM stat.ME

Review: A new method for estimation and use of systematic errors in Poisson regression

M. Bonamente

Comments Accepted for Frontiers in Astronomy and Space Sciences - Astrostatistics. This is a review of https://ui.adsabs.harvard.edu/abs/2025ApJ...980..139B and https://ui.adsabs.harvard.edu/abs/2025ApJ...980..140B presented at the sys2025 workshop in Huntsville, AL (Nov 14-17. 2025)

2604.20630 2026-04-23 stat.ME

Double Robust Weighted Regression with Missing Confounders

Md. Shaddam Hossain Bagmar, Hua Shen

2604.20625 2026-04-23 stat.ME stat.AP

Dynamic Prediction of the Target Survival Time in Metastatic Solid Tumor Cancer Clinical Trials

Sidi Wang, Kelley Kidwell, Bo Huang, Satrajit Roychoudhury

2604.20614 2026-04-23 cs.LG math.DS math.OC stat.ML

Too Sharp, Too Sure: When Calibration Follows Curvature

Alessandro Morosini, Matea Gjika, Tomaso Poggio, Pierfrancesco Beneventano

Comments 33 pages, 23 figures

2604.20612 2026-04-23 math.ST math.PR stat.TH

E-values and sequential power-one tests for monotonicity and unimodality

Hongjian Wang, Aaditya Ramdas

2604.20611 2026-04-23 stat.AP

Bayesian Inference for Incomplete 2x2 Diagnostic Tables

Sara Antonijevic, Danielle Sitalo, Brani Vidakovic

Comments 21 pages, 10 tables. Supplementary materials and reproducible code available at https://github.com/saraantonijevic/bayesian_diagnostic_table-reconstruction

2604.20551 2026-04-23 stat.ML cs.LG

On Bayesian Softmax-Gated Mixture-of-Experts Models

Nicola Bariletto, Huy Nguyen, Nhat Ho, Alessandro Rinaldo

2604.20517 2026-04-23 math.DS math.OC stat.CO

Bounding Transient Instability in Sensor Data Injected Nonlinear Stochastic Flight Dynamics

Surya Ratna Prakash D, Soumyendu Raha

2604.20516 2026-04-23 stat.ML cs.LG

Efficient Symbolic Computations for Identifying Causal Effects

Benjamin Hollering, Pratik Misra, Nils Sturma

2510.04525 2026-04-23 cs.LG math.PR stat.ML

Demystifying MaskGIT Sampler and Beyond: Adaptive Order Selection in Masked Diffusion

Satoshi Hayakawa, Yuhta Takida, Masaaki Imaizumi, Hiromi Wakaki, Yuki Mitsufuji

Comments 23 pages, fixed cleveref-related issue

2505.13106 2026-04-23 math.OC physics.soc-ph stat.AP

How to optimise tournament draws: The case of the FIFA World Cup

László Csató

Comments 32 pages, 8 figures, 6 tables

2411.18334 2026-04-23 stat.ME

Large multi-response linear regression estimation based on low-rank pre-smoothing

Xinle Tian, Alex Gibberd, Matthew Nunes, Sandipan Roy

2407.01621 2026-04-23 cs.LG q-bio.QM stat.ME stat.ML

Deciphering interventional dynamical causality from non-intervention complex systems

Jifan Shi, Yang Li, Juan Zhao, Siyang Leng, Rui Bao, Kazuyuki Aihara, Luonan Chen, Wei Lin

详情

DOI: 10.1016/j.xinn.2026.101358

英文摘要

Detecting and quantifying causality is a focal topic in the fields of science, engineering, and interdisciplinary studies. However, causal studies on non-intervention systems attract much attention but remain extremely challenging. Delay-embedding technique provides a promising approach. In this study, we propose a framework named Interventional Dynamical Causality (IntDC) in contrast to the traditional Constructive Dynamical Causality (ConDC). ConDC, including Granger causality, transfer entropy and convergence of cross-mapping, measures the causality by constructing a dynamical model without considering interventions. A computational criterion, Interventional Embedding Entropy (IEE), is proposed to measure causal strengths in an interventional manner. IEE is an intervened causal information flow but in the delay-embedding space. Further, the IEE theoretically and numerically enables the deciphering of IntDC solely from observational (non-interventional) time-series data, without requiring any knowledge of dynamical models or real interventions in the considered system. In particular, IEE can be applied to rank causal effects according to their importance and construct causal networks from data. We conducted numerical experiments to demonstrate that IEE can find causal edges accurately, eliminate effects of confounding, and quantify causal strength robustly over traditional indices. We also applied IEE to real-world tasks. IEE performed as an accurate and robust tool for causal analyses solely from the observational data. The IntDC framework and IEE algorithm provide an efficient approach to the study of causality from time series in diverse non-intervention complex systems.

URL PDF HTML ☆

赞 0 踩 0