arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.02304 2026-04-03 stat.CO

Disentangled Deep Priors for Bayesian Inverse Problems

Arkaprabha Ganguli, Emil Constantinescu

详情

英文摘要

We propose a structured prior for high-dimensional Bayesian inverse problems based on a disentangled deep generative model whose latent space is partitioned into auxiliary variables aligned with known and interpretable physical parameters and residual variables capturing remaining unknown variability. This yields a hierarchical prior in which interpretable coordinates carry domain-relevant uncertainty while the residual coordinates retain the flexibility of deep generative models. By linearizing the generator, we characterize the induced prior covariance and derive conditions under which the posterior exhibits approximate block-diagonal structure in the latent variables, clarifying when representation-level disentanglement translates into a separation of uncertainty in the inverse problem. We formulate the resulting latent-space inverse problem and solve it using MAP estimation and Markov chain Monte Carlo (MCMC) sampling. On elliptic PDE inverse problems, such as conductivity identification and source identification, the approach matches an oracle Gaussian process prior under correct specification and provides substantial improvement under prior misspecification, while recovering interpretable physical parameters and producing spatially calibrated uncertainty estimates.

URL PDF HTML ☆

赞 0 踩 0

2604.02286 2026-04-03 stat.ME stat.AP

Bayesian covariance regression for differential network analysis of zero-inflated microbiome data

Zichun Xu, Jing Ma

2604.02250 2026-04-03 cs.LG stat.ML

Smoothing the Landscape: Causal Structure Learning via Diffusion Denoising Objectives

Hao Zhu, Di Zhou, Donna Slonim

Comments To appear in the Proceedings of the 5th Conference on Causal Learning and Reasoning (CLeaR 2026)

2604.02248 2026-04-03 stat.ML cs.LG

BVFLMSP : Bayesian Vertical Federated Learning for Multimodal Survival with Privacy

Abhilash Kar, Basisth Saha, Tanmay Sen, Biswabrata Pradhan

2604.02238 2026-04-03 cs.CY cs.AI stat.AP

Generative AI Spotlights the Human Core of Data Science: Implications for Education

Nathan Taback

2604.02227 2026-04-03 eess.SY cs.SY math.OC stat.ME

Sensitivity analysis for stopping criteria with application to organ transplantations

Xingyu Ren, Michael C. Fu, Steven I. Marcus

2604.02187 2026-04-03 stat.AP physics.ao-ph

Possible, Yes; Ignorant, Perhaps: A Scorecard for Possibilistic Forecasts

John R. Lawson

Comments 11 figures; 7 sections;19 pages on PDF as-is

2604.02179 2026-04-03 stat.ME

Irregularly and incompletely sampled random fields in the Earth sciences: Analysis and synthesis of parameterized covariance models

Olivia L. Walbert, Frederik J. Simons, Arthur P. Guillaumin, Sofia C. Olhede

2604.02116 2026-04-03 stat.ME stat.CO

A new wavelet-based variational family with copula dependence structures

Giovanni Piccirilli, Aluísio Pinheiro

2601.19016 2026-04-03 cs.CC cs.CR math.PR math.ST stat.TH

Average-Case Reductions for $k$-XOR and Tensor PCA

Guy Bresler, Alina Harbuzova

Comments 112 pages, 6 figures

2601.13507 2026-04-03 stat.ME

Two-stage Least Squares with Clustered Data under the Local Average Treatment Effect Framework

Anqi Zhao, Peng Ding, Fan Li

2601.11016 2026-04-03 stat.ML cs.AI cs.LG math.OC

Contextual Distributionally Robust Optimization with Causal and Continuous Structure: An Interpretable and Tractable Approach

Fenglin Zhang, Jie Wang

2511.07605 2026-04-03 math.ST stat.ME stat.TH

Confidence Intervals for Linear Models with Arbitrary Noise Contamination

Dong Xie, Chao Gao, John Lafferty

2510.18520 2026-04-03 cs.LG stat.ME

Partial VOROS: A Cost-aware Performance Metric for Binary Classifiers with Precision and Capacity Constraints

Christopher Ratigan, Kyle Heuton, Carissa Wang, Lenore Cowen, Michael C. Hughes

Comments In Proceedings of the International Conference of Artificial Intelligence and Statistics (AISTATS), 2026

2509.07013 2026-04-03 cs.LG q-bio.PE stat.ME

Generalized Machine Learning for Fast Calibration of Agent-Based Epidemic Models

Sima Najafzadehkhoei, George Vega Yon, Derek S. Meyer, Bernardo Modenesi

2509.03309 2026-04-03 stat.ME

A Measure of Predictive Sharpness for Probabilistic Models

Pekka Syrjänen

2507.18021 2026-04-03 math.ST cs.DS cs.LG math.FA math.PR stat.TH

Zeroth-order Logconcave Sampling

Yunbum Kook, Santosh S. Vempala

Comments v2: Fix a bug in the restart mechanism; add a lower bound on Gaussian annealing

2507.04754 2026-04-03 stat.ML cs.LG

Intervening to Learn and Compose Causally Disentangled Representations

Alex Markham, Isaac Hirsch, Jeri A. Chang, Liam Solus, Bryon Aragam

Comments 45 pages, 10 figures; accepted to the 5th conference on Causal Learning and Reasoning (CLeaR)

2506.17527 2026-04-03 math.ST math.CO math.PR stat.TH

Detection and Reconstruction of a Random Hypergraph from Noisy Graph Projection

Shuyang Gong, Zhangsong Li, Qiheng Xu

Comments 19 pages, 1 figure; Section 6 rewritten to fix a previous error

2412.11340 2026-04-03 stat.ME

Fast Bayesian Functional Principal Components Analysis

Joseph Sartini, Xinkai Zhou, Liz Selvin, Scott Zeger, Ciprian Crainiceanu

Comments 21 pages, 7 figures, 1 table

2411.12159 2026-04-03 stat.ML cs.LG cs.SY eess.SY stat.AP

Prognostics for Autonomous Deep-Space Habitat Health Management under Multiple Unknown Failure Modes

Benjamin Peters, Ayush Mohanty, Xiaolei Fang, Stephen K. Robinson, Nagi Gebraeel

Comments Manuscript under review

2405.14690 2026-04-03 q-bio.QM stat.AP

Beyond Scalar Metrics: Functional Data Analysis of Postprandial Continuous Glucose Monitoring in the AEGIS Study

Marcos Matabuena, Joe Sartini, Francisco Gude

2405.13621 2026-04-03 stat.ME

Interval identification of natural effects in the presence of outcome-related unmeasured confounding

Marco Doretti, Elena Stanghellini

Comments 14 pages, 2 figures, 2 tables

2310.19603 2026-04-03 cs.LG cs.NA cs.NE math.NA math.PR stat.ML

Transformers Can Solve Non-Linear and Non-Markovian Filtering Problems in Continuous Time For Conditionally Gaussian Signals

Blanka Horvath, Anastasis Kratsios, Yannick Limmer, Xuwei Yang

2604.02074 2026-04-03 stat.AP cs.CV

Country-wide, high-resolution monitoring of forest browning with Sentinel-2

Samantha Biegel, David Brüggemann, Francesco Grossi, Michele Volpi, Konrad Schindler, Benjamin D. Stocker

Comments 9 pages, 7 figures, to be published in the ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences (ISPRS Congress)

2604.02072 2026-04-03 stat.ME stat.CO

Sparse Probabilistic Richardson Extrapolation

Chris. J. Oates, Richard Howey, Toni Karvonen

2604.02017 2026-04-03 stat.ML cs.LG

Demographic Parity Tails for Regression

Naht Sinh Le, Christophe Denis, Mohamed Hebiri

2604.01978 2026-04-03 math.PR cs.LG stat.ML

Homogenized Transformers

Hugo Koubbi, Borjan Geshkovski, Philippe Rigollet

2604.01946 2026-04-03 cs.LG stat.ME stat.ML

PAC-Bayesian Reward-Certified Outcome Weighted Learning

Yuya Ishikawa, Shu Tamano

2604.01943 2026-04-03 stat.ML cs.LG

A Novel Theoretical Analysis for Clustering Heteroscedastic Gaussian Data without Knowledge of the Number of Clusters

Dominique Pastor, Elsa Dupraz, Ismail Hbilou, Guillaume Ansel

Comments 76 pages, submitted to JMLR

详情

英文摘要

This paper addresses the problem of clustering measurement vectors that are heteroscedastic in that they can have different covariance matrices. From the assumption that the measurement vectors within a given cluster are Gaussian distributed with possibly different and unknown covariant matrices around the cluster centroid, we introduce a novel cost function to estimate the centroids. The zeros of the gradient of this cost function turn out to be the fixed-points of a certain function. As such, the approach generalizes the methodology employed to derive the existing Mean-Shift algorithm. But as a main and novel theoretical result compared to Mean-Shift, this paper shows that the sole fixed-points of the identified function tend to be the cluster centroids if both the number of measurements per cluster and the distances between centroids are large enough. As a second contribution, this paper introduces the Wald kernel for clustering. This kernel is defined as the p-value of the Wald hypothesis test for testing the mean of a Gaussian. As such, the Wald kernel measures the plausibility that a measurement vector belongs to a given cluster and it scales better with the dimension of the measurement vectors than the usual Gaussian kernel. Finally, the proposed theoretical framework allows us to derive a new clustering algorithm called CENTRE-X that works by estimating the fixed-points of the identified function. As Mean-Shift, CENTRE-X requires no prior knowledge of the number of clusters. It relies on a Wald hypothesis test to significantly reduce the number of fixed points to calculate compared to the Mean-Shift algorithm, thus resulting in a clear gain in complexity. Simulation results on synthetic and real data sets show that CENTRE-X has comparable or better performance than standard clustering algorithms K-means and Mean-Shift, even when the covariance matrices are not perfectly known.

URL PDF HTML ☆

赞 0 踩 0

2604.01880 2026-04-03 cs.LG cs.NE stat.ML

DDCL-INCRT: A Self-Organising Transformer with Hierarchical Prototype Structure (Theoretical Foundations)

Giansalvo Cirrincione

Comments 30 pages, 5 figures. Submitted to Neural Networks (Elsevier)

2604.01789 2026-04-03 stat.ML cs.LG

Learning in Prophet Inequalities with Noisy Observations

Jung-hun Kim, Vianney Perchet

Comments ICLR 2026

2604.01734 2026-04-03 q-bio.QM stat.ME

A Novel Multi-view Mixture Model Framework for Longitudinal Clustering with Application to ANCA-Associated Vasculitis

Shen Jia, David Selby, Mark A Little, Tin Lok James Ng

2604.01689 2026-04-03 stat.ME

DeepKriging on the global Data

Hao-Yun Huang, Wen-Ting Wang, Ping-Hsun Chiang, Wei-Ying Wu

2604.01629 2026-04-03 stat.ME

Conformalized Method for Empirical Bayes Normal Mean Inference Problem with Heteroscedastic Variance

Kwangok Seo, Johan Lim

2604.01625 2026-04-03 stat.ME

Data-adaptive gene and pathway-based tests forrare-variant associations with survival outcomes

Yu Wang, Kwang Woo Ahn, Sarah L. Kerns, William Hall, Petra Seibold, Christopher J. Talbot, Ana Vega, Barry S. Rosenstein, Nawaid Usmani, Catharine M. L. West, Liv Veldeman, Paul L. Auer, Zhongyuan Chen

2604.01606 2026-04-03 stat.ML cs.LG math.OC

Random Coordinate Descent on the Wasserstein Space of Probability Measures

Yewei Xu, Qin Li

2604.01593 2026-04-03 stat.ME

Nonparametric regression of spatio-temporal data using infinite-dimensional covariates

Subhrajyoty Roy, Soudeep Deb, Sayar Karmakar, Rishideep Roy

2604.01580 2026-04-03 stat.CO math.PR

Simulation and Analysis of Multifractional Stochastic Processes with R Package Rmfrac

Andriy Olenko, Nemini Samarakoon

Comments 29 pages, 10 figures

2604.01568 2026-04-03 math.ST stat.ME stat.TH

Asymptotic theory and bias correction for the Wallace--Freeman estimator

Enes Makalic, Daniel F. Schmidt

2604.01546 2026-04-03 stat.ME

Spatially-informed Image Harmonization Results in Improved Scanner Effect Removal and Prediction

Alec Reinhardt, Yajie Liu, Suprateek Kundu

Comments 31 Pages, 5 fifures

2604.01501 2026-04-03 stat.ME stat.AP stat.ML stat.OT

Identifying and Estimating Causal Direct Effects Under Unmeasured Confounding

Philippe Boileau, Nima S. Hejazi, Ivana Malenica, Peter B. Gilbert, Sandrine Dudoit, Mark J. van der Laan

2604.01500 2026-04-03 stat.ME

Copula-Based Time Series for Non-Gaussian and Non-Markovian Stationary Processes

Sven Pappert, Harry Joe

2604.01491 2026-04-03 stat.AP

Opponent-Adjusted Evaluation of NFL Pass Blocking and Pass Rushing Performance

Jonathan Pipping-Gamón, Maximilian Gebauer, Victoria Lee, Kenny Watts, Abraham J. Wyner

Comments 14 pages, 3 figures, 5 tables. Code available at https://github.com/WhartonSABI/nfl-elo

2604.01470 2026-04-03 math.ST stat.ME stat.TH

Sharp Debiasing for Smooth Functional Estimation in Banach Spaces

Woonyoung Chang, Arun Kumar Kuchibhotla

2604.01441 2026-04-03 eess.SY cs.LG cs.OS cs.SY eess.SP stat.ML

Generative Profiling for Soft Real-Time Systems and its Applications to Resource Allocation

Georgiy A. Bondar, Abigail Eisenklam, Yifan Cai, Robert Gifford, Tushar Sial, Linh Thi Xuan Phan, Abhishek Halder

2604.01411 2026-04-03 cs.LG cs.CL stat.ML

Test-Time Scaling Makes Overtraining Compute-Optimal

Nicholas Roberts, Sungjun Cho, Zhiqi Gao, Tzu-Heng Huang, Albert Wu, Gabriel Orlanski, Avi Trost, Kelly Buchanan, Aws Albarghouthi, Frederic Sala

2604.01399 2026-04-03 math.ST math.PR stat.TH

Conditional Independence under Infinite Measures and Poisson Point Processes

Shuyang Bai, Vishal Routh

Comments 15 pages

2604.01356 2026-04-03 cs.DS stat.CO

A divide and conquer strategy for multinomial particle filter resampling

Andrey A. Popov

2604.01339 2026-04-03 cs.CV cs.AI cs.LG stat.ME stat.ML

Regularizing Attention Scores with Bootstrapping

Neo Christopher Chung, Maxim Laletin

2604.01325 2026-04-03 cs.AI stat.ME

The Digital Twin Counterfactual Framework: A Validation Architecture for Simulated Potential Outcomes

Olav Laudy

2604.01321 2026-04-03 math.OC cs.NA cs.SY eess.SY math.NA stat.CO

Risk Control of Traffic Flow Through Chance Constraints and Large Deviation Approximation

Rui Xu, Shanyin Tong, Xuan Di

2604.01267 2026-04-03 math.ST stat.ML stat.TH

Observable Geometry of Singular Statistical Models

Sean Plummer

2604.01266 2026-04-03 math.ST stat.CO stat.TH

Horseshoe Priors and MDP

Nick Polson, Vadim Sokolov, Daniel Zantedeschi

2604.01086 2026-04-03 cs.DS cs.IT math.IT math.ST stat.TH

Asymptotically Optimal Sequential Testing with Heterogeneous LLMs

Guokai Li, Alys Liang, Mo Liu, Murray Lei, Stefanus Jasin, Fenghua Yang, Preet Baxi

2603.28650 2026-04-03 cs.LG cs.AI stat.ML

Information-Theoretic Limits of Safety Verification for Self-Improving Systems

Arsenios Scrivens

Comments 27 pages, 6 figures. Companion empirical paper: doi:10.5281/zenodo.19237566

2603.23675 2026-04-03 stat.AP math.PR

Dynamical behaviors of a stochastic SIS epidemic model with mean-reverting inhomogeneous geometric brownian motion

Lahcen Khammich, Driss Kiouach

Comments It contains significant errors that require substantial revision

2603.22888 2026-04-03 math.ST stat.TH

Boundary Inference for Mixed Fractional Models under High-Frequency Observation Critical LAN and Score Tests at $H=3/4$

Chunhao Cai, Yiwu Shang, Weilin Xiao, Cong Zhang

2603.22573 2026-04-03 stat.ME

Multiple Jump MCMC: A Scalable Algorithm for Bayesian Inference on Binary Model Spaces

Lucas Vogels, Reza Mohammadi, Marit Schoonhoven, Sinan Yildirim, Ilker Birbil

2603.20359 2026-04-03 stat.ML cs.LG cs.NA math.DS math.NA

Operator Learning for Smoothing and Forecasting

Edoardo Calvello, Elizabeth Carlson, Nikola Kovachki, Michael N. Manta, Andrew M. Stuart

2603.20025 2026-04-03 stat.ML cs.LG math.ST stat.TH

Graph-Informed Adversarial Modeling: Infimal Subadditivity of Interpolative Divergences

Panagiota Birmpa, Eric Joseph Hall

Comments 34 pages, 9 figures

2603.11457 2026-04-03 stat.ME econ.EM math.ST stat.TH

Bayesian Modular Inference for Copula Models with Potentially Misspecified Marginals

Lucas Kock, David T. Frazier, Michael Stanley Smith, David J. Nott

2603.02491 2026-04-03 cs.LG cs.AI cs.RO q-bio.NC stat.ML

What Capable Agents Must Know: Selection Theorems for Robust Decision-Making under Uncertainty

Aran Nayebi

Comments 23 pages; added PSR recovery (Theorems 3 & 4), and updated related work

2602.16142 2026-04-03 math.ST cs.CG cs.LG stat.TH

Ratio Covers of Convex Sets and Optimal Mixture Density Estimation

Spencer Compton, Gábor Lugosi, Jaouad Mourtada, Jian Qian, Nikita Zhivotovskiy

Comments 47 pages

2602.08083 2026-04-03 stat.AP

A Unified Server Quality Metric for Tennis

Aiwen Li, Amrita Balajee, Harry Wieand, Jonathan Pipping-Gamón

Comments 21 pages, published in Journal of Sports Analytics. Code available at https://github.com/WhartonSABI/server-quality

2601.21462 2026-04-03 cs.LG stat.ML

Partial Feedback Online Learning

Shihao Shao, Cong Fang, Zhouchen Lin, Dacheng Tao

Comments 40 pages. Fixed some typos in the proof and improved readability

2601.18774 2026-04-03 stat.AP math.PR stat.ME

Extreme-Path Benchmarks for Sequential Probability Forecasts

Jonathan Pipping-Gamón, Abraham J. Wyner

Comments Submitted to Annals of Applied Statistics. 17 pages, 3 figures

2601.10531 2026-04-03 stat.ML cs.LG math.CO

Coarsening Causal DAG Models

Francisco Madaleno, Pratik Misra, Alex Markham

Comments 27 pages, 5 figures; accepted to the 5th conference on Causal Learning and Reasoning (CLeaR)

2510.14523 2026-04-03 cs.LG math.ST stat.ML stat.TH

On the Identifiability of Tensor Ranks via Prior Predictive Matching

Eliezer da Silva, Arto Klami, Diego Mesquita, Iñigo Urteaga

Comments Accepted at AISTATS 2026

2510.09908 2026-04-03 stat.ML cs.LG

Learning with Incomplete Context: Linear Contextual Bandits with Pretrained Imputation

Hao Yan, Heyan Zhang, Yongyi Guo

2510.04318 2026-04-03 stat.ML cs.LG

Adaptive Coverage Policies in Conformal Prediction

Etienne Gauthier, Francis Bach, Michael I. Jordan

Comments Code at: https://github.com/GauthierE/adaptive-coverage-policies

2510.00463 2026-04-03 stat.ML cs.LG eess.SP stat.ME

On the Adversarial Robustness of Learning-based Conformal Novelty Detection

Daofu Zhang, Mehrdad Pournaderi, Hanne M. Clifford, Yu Xiang, Pramod K. Varshney

2509.15379 2026-04-03 stat.AP

A Single Index Approach to Integrated Species Distribution Modeling for Fisheries Abundance Data

Quan Vu, Francis K. C. Hui, A. H. Welsh, Samuel Muller, Eva Cantoni, Christopher R. Haak

2509.12533 2026-04-03 stat.AP stat.ME

Transporting Predictions via Double Machine Learning: Predicting Partially Unobserved Students' Outcomes

Falco J. Bargagli-Stoffi, Emma Landry, Kevin P. Josey, Kenneth De Beckker, Joana E. Maldonado, Kristof De Witte

Comments arXiv admin note: substantial text overlap with arXiv:2102.04382

2509.04718 2026-04-03 stat.ME physics.data-an q-bio.QM

When correcting for regression to the mean is worse than no correction at all

José F. Fontanari, Mauro Santos

2508.20755 2026-04-03 cs.LG cs.AI stat.ML

Provable Benefits of In-Tool Learning for Large Language Models

Sam Houliston, Ambroise Odonnat, Charles Arnal, Vivien Cabannes

2508.14285 2026-04-03 cs.LG cs.AI stat.ML

Meta-Learning at Scale for Large Language Models via Low-Rank Amortized Bayesian Meta-Learning

Liyi Zhang, Jake Snell, Thomas L. Griffiths

Comments 17 pages, 2 figures

2507.20598 2026-04-03 stat.ME q-bio.GN stat.AP

Nullstrap-DE: A General Framework for Calibrating FDR and Preserving Power in DE Methods, with Applications to DESeq2 and edgeR

Chenxin Jiang, Changhu Wang, Jingyi Jessica Li

2507.17190 2026-04-03 stat.ME

Model-robust standardization in stepped wedge designs

Xi Fang, Xueqi Wang, Patrick J. Heagerty, Bingkai Wang, Fan Li

2507.11816 2026-04-03 stat.ME math.ST stat.TH

A Relativity-Based Framework for Statistical Testing Guided by the Independence of Ancillary Statistics: Methodology and Nonparametric Illustrations

Albert Vexler, Douglas Landsittel

2506.23849 2026-04-03 stat.ME stat.AP

Developing a Synthetic Socio-Economic Index through Autoencoders: Evidence from Florence's Suburban Areas

Giulio Grossi, Emilia Rocco

2506.23396 2026-04-03 stat.ML cs.LG

AICO: Feature Significance Tests for Supervised Learning

Kay Giesecke, Enguerrand Horel, Chartsiri Jirachotkulthorn

2506.12553 2026-04-03 cs.LG cs.CR stat.ML

Beyond Laplace and Gaussian: Exploring the Generalized Gaussian Mechanism for Private Machine Learning

Roy Rinberg, Ilia Shumailov, Vikrant Singhal, Rachel Cummings, Nicolas Papernot

2504.12214 2026-04-03 stat.ME

Bayesian random-effects meta-analysis of aggregate data on clinical events

Christian Röver, Qiong Wu, Anja Loos, Tim Friede

Comments 23 pages, 8 figures

2503.08881 2026-04-03 stat.ME

Bayesian local clustering of functional data via semi-Markovian random partitions

Giovanni Toto, Antonio Canale

2502.10600 2026-04-03 stat.ML cs.LG cs.NA math.NA

Weighted quantization using MMD: From mean field to mean shift via gradient flows

Ayoub Belhadji, Daniel Sharp, Youssef Marzouk

2412.10683 2026-04-03 stat.ME stat.ML

Adaptive Nonparametric Perturbations of Parametric Models with Generalized Bayes

Bohan Wu, Eli N. Weinstein, Sohrab Salehi, Yixin Wang, David M. Blei

2412.09304 2026-04-03 stat.ME

Nonparametric estimation of the total treatment effect with multiple outcomes in the presence of terminal events

Jessica Gronsbell, Zachary R. McCaw, Isabelle-Emmanuella Nogues, Xiangshan Kong, Tianxi Cai, Lu Tian, LJ Wei

2411.18942 2026-04-03 math.ST math.DG stat.TH

Robust boundary detection and density estimation using doubly stochastic scaling of the Gaussian kernel

Dhruv Kohli, Jesse He, Chester Holtz, Alexander Cloninger, Gal Mishne

2411.08778 2026-04-03 stat.ME

Causal-DRF: Conditional Kernel Treatment Effect Estimation using Distributional Random Forest

Jeffrey Näf, Junhyung Park, Herbert Susmann

2406.02402 2026-04-03 math.OC cs.GT stat.ML

Online Fair Allocation of Perishable Resources

Siddhartha Banerjee, Chamsi Hssaine, Sean R. Sinclair

Comments 57 pages, 10 figures

2405.20957 2026-04-03 stat.ME stat.AP

Causal-ICM: A Data Fusion Framework For Heterogeneous Treatment Effect Estimation With Multi-Task Gaussian Processes

Evangelos Dimitriou, Edwin Fong, Jens Magelund Tarp, Karla Diaz-Ordaz, Brieuc Lehmann

Comments Accepted at the 5th Conference on Causal Learning and Reasoning (CLeaR 2026)

2206.08817 2026-04-03 stat.ME

Species Distribution Modeling with Expert Elicitation and Bayesian Calibration

Karel Kaurila, Sanna Kuningas, Antti Lappalainen, Jarno Vanhatalo

Comments Article: 20 pages, 4 figures. Supplement: 10 pages, 8 figures

详情

DOI: 10.1002/ecog.08173

英文摘要

Species distribution models (SDMs) are key tools in ecology, conservation and management of natural resources. They are commonly trained by scientific survey data but, since surveys are expensive, there is a need for complementary sources of information to train them. To this end, several authors have proposed to use expert elicitation since local citizen and substance area experts can hold valuable information on species distributions. Expert knowledge has been incorporated within SDMs, for example, through informative priors. However, existing approaches pose challenges related to assessment of the reliability of the experts. Since expert knowledge is inherently subjective and prone to biases, we should optimally calibrate experts' assessments and make inference on their reliability. Moreover, demonstrated examples of improved species distribution predictions using expert elicitation compared to using only survey data are few as well. In this work, we propose a novel approach to use expert knowledge on species distribution within SDMs and demonstrate that it leads to significantly better predictions. First, we propose expert elicitation process where experts summarize their belief on a species occurrence proability with maps. Second, we collect survey data to calibrate the expert assessments. Third, we propose a hierarchical Bayesian model that combines the two information sources and can be used to make predictions over the study area. We apply our methods to study the distribution of spring spawning pikeperch larvae in a coastal area of the Gulf of Finland. According to our results, the expert information significantly improves species distribution predictions compared to predictions conditioned on survey data only. However, experts' reliability also varies considerably, and even generally reliable experts had spatially structured biases in their assessments.

URL PDF HTML ☆

赞 0 踩 0

1812.05741 2026-04-03 stat.ME

Posterior Projection for Inference in Constrained Spaces

Lachlan Astfalck, Deborshee Sen, Sayan Patra, Edward Cripps, David Dunson