arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2601.14542 2026-03-11 astro-ph.GA physics.data-an stat.AP

New techniques to investigate the AGN-SF connection with integral field spectroscopy

Aman Chopra, Henry R. M. Zovaro, Rebecca L. Davies

Comments 26 pages, 23 figures

详情

DOI: 10.1017/pasa.2026.10158
Journal ref: Publ. Astron. Soc. Aust. 43 (2026) e023

英文摘要

Understanding the connection between active galactic nuclei and star-formation (the AGN-SF connection) is one of the longest standing problems in modern astrophysics. In the age of large Integral Field Unit (IFU) surveys, studies of the AGN-SF connection greatly benefit from spatially resolving AGN and SF contributions to study the two processes independently. Using IFU data for 54 local active galaxies from the S7 sample, we present a new method to separate emission from AGN activity and SF using mixing sequences observed in the [NII]$λ6584$/H$α$ vs. [OIII]$λ5007$/H$β$ Baldwin-Phillips-Terlevich (BPT) diagram. We use the new decomposition method to calculate the H$α$ star-formation rate and AGN [OIII] luminosity for the galaxies. Our new method is robust to outliers in the line-ratio distribution and can be applied to large galaxy samples with little manual intervention. We infer star-formation histories (SFHs) using pPXF, conducting detailed recovery tests to determine the quantities that can be considered robust. We test the correlation between the AGN Eddington ratio, using the proxy L[OIII]/$σ_*^4$, and star-formation properties. We find a moderately strong correlation between the Eddington ratio and the star-formation rate (SFR). We also observe marginally significant correlations between the AGN Eddington ratio and the light-weighted stellar age under 100 Myr. Our results point to higher AGN accretion being associated with young nuclear star formation under 100 Myr, consistent with timelines presented in previous studies. The correlations found in this paper are relatively weak; extending our methods to larger samples, including radio-quiet galaxies, will help better constrain the physical mechanisms and timescales of the AGN-SF connection.

URL PDF HTML ☆

赞 0 踩 0

2407.18835 2026-03-11 stat.ME math.ST stat.AP stat.OT stat.TH

Robust Estimation of Polychoric Correlation

Max Welz, Patrick Mair, Andreas Alfons

Comments 78 pages (37 main text), 21 figures (9 in main text), 10 tables (5 in main text). This is the final version of this article, as accepted in Psychometrika

2603.09952 2026-03-11 cs.LG cs.NA cs.SY eess.SY math.NA math.OC stat.ML

On the Width Scaling of Neural Optimizers Under Matrix Operator Norms I: Row/Column Normalization and Hyperparameter Transfer

Ruihan Xu, Jiajin Li, Yiping Lu

详情

英文摘要

A central question in modern deep learning is how to design optimizers whose behavior remains stable as the network width $w$ increases. We address this question by interpreting several widely used neural-network optimizers, including \textrm{AdamW} and \textrm{Muon}, as instances of steepest descent under matrix operator norms. This perspective links optimizer geometry with the Lipschitz structure of the network forward map, and enables width-independent control of both Lipschitz and smoothness constants. However, steepest-descent rules induced by standard $p \to q$ operator norms lack layerwise composability and therefore cannot provide width-independent bounds in deep architectures. We overcome this limitation by introducing a family of mean-normalized operator norms, denoted $\pmean \to \qmean$, that admit layerwise composability, yield width-independent smoothness bounds, and give rise to practical optimizers such as \emph{rescaled} \textrm{AdamW}, row normalization, and column normalization. The resulting learning rate width-aware scaling rules recover $μ$P scaling~\cite{yang2021tensor} as a special case and provide a principled mechanism for cross-width learning-rate transfer across a broad class of optimizers. We further show that \textrm{Muon} can suffer an $\mathcal{O}(\sqrt{w})$ worst-case growth in the smoothness constant, whereas a new family of row-normalized optimizers we propose achieves width-independent smoothness guarantees. Based on the observations, we propose MOGA (Matrix Operator Geometry Aware), a width-aware optimizer based only on row/column-wise normalization that enables stable learning-rate transfer across model widths. Large-scale pre-training on GPT-2 and LLaMA shows that MOGA, especially with row normalization, is competitive with Muon while being notably faster in large-token and low-loss regimes.

URL PDF HTML ☆

赞 0 踩 0

2603.09842 2026-03-11 cs.LG stat.ME stat.ML

A Unified Hierarchical Multi-Task Multi-Fidelity Framework for Data-Efficient Surrogate Modeling in Manufacturing

Manan Mehta, Zhiqiao Dong, Yuhang Yang, Chenhui Shao

2603.09768 2026-03-11 math.ST stat.TH

The exact region between Chatterjee's and Blest's rank correlations

Marcus Rockel

2603.09680 2026-03-11 math.NT stat.ML

Murmurations: a case study in AI-assisted mathematics

Yang-Hui He, Kyu-Hwan Lee, Thomas Oliver, Alexey Pozdnyakov

Comments 12 pages, 15 figures

2603.09629 2026-03-11 math.ST stat.TH

On the last time and the number of times an estimator is more than epsilon from its target value

Nils Lid Hjort, Grete Fenstad

Comments 18 pages, no figures; Statistical Research Report, Department of Mathematics, University of Oslo, from April 1991, now arXiv'd March 2026. The paper has appeared in Annals of Statistics, 1992, vol. 20, pages 469-489, at this url: projecteuclid.org/journals/annals-of-statistics/volume-20/issue-1/On-the-Last-Time-and-the-Number-of-Times-an/10.1214/aos/1176348533.full

2603.09601 2026-03-11 cs.LG stat.ME stat.ML

MM-algorithms for traditional and convex NMF with Tweedie and Negative Binomial cost functions and empirical evaluation

Elisabeth Sommer James, Asger Hobolth, Marta Pelizzola

2603.09564 2026-03-11 stat.ML cs.LG

a-TMFG: Scalable Triangulated Maximally Filtered Graphs via Approximate Nearest Neighbors

Lionel Yelibi

2603.09532 2026-03-11 stat.ML cs.LG

What Do We Care About in Bandits with Noncompliance? BRACE: Bandits with Recommendations, Abstention, and Certified Effects

Nicolás Della Penna

详情

英文摘要

Bandits with noncompliance separate the learner's recommendation from the treatment actually delivered, so the learning target itself must be chosen. A platform may care about recommendation welfare in the current mediated workflow, treatment learning for a future direct-control regime, or anytime-valid uncertainty for one of those targets. These objectives need not agree. We formalize this objective-choice problem, identify the direct-control regime in which recommendation and treatment objectives collapse, and show by example that recommendation welfare can strictly exceed every learner-measurable treatment policy when downstream actors use private information. For finite-context square-IV problems we propose BRACE, a parameter-free phase-doubling algorithm that performs IV inversion only after matrix certification and otherwise returns full-range but honest structural intervals. BRACE delivers simultaneous policy-value validity, fixed-gap identification of the operationally optimal recommendation policy, and fixed-gap identification of the structurally optimal treatment policy under contextual homogeneity and invertibility. We complement the theory with a finite-context empirical benchmark spanning direct control, mediated present-versus-future tradeoffs, weak identification, homogeneity failure, and rectangular overidentification. The experiments show that safety appears as regret on easy problems, as abstention and wide valid intervals under weak identification, as a reason to prefer recommendation welfare under homogeneity failure, and as tighter structural uncertainty when extra instruments are available. For rich contexts, we also derive an orthogonal score whose conditional bias factorizes into compliance-model and outcome-model errors, clarifying what must be stabilized for anytime-valid semiparametric IV inference.

URL PDF HTML ☆

赞 0 踩 0

2603.09504 2026-03-11 math.PR math.ST stat.TH

Uniform Lorden-type bounds for overshoot moments for standard exponential families: small drift and an exponential correction

El'mira Yu. Kalimulina, Mark Ya. Kelbert

Comments 20 pages, no figure

2603.09428 2026-03-11 stat.AP

Bayesian Species Distribution Models using Hierarchical Decomposition Priors

Luisa Ferrari, Massimo Ventrucci, Alex Laini

Comments 43 pages, 8 figures

2603.09425 2026-03-11 stat.AP cs.AI

CERES: A Probabilistic Early Warning System for Acute Food Insecurity

Tom Danny S. Pedersen

Comments 12 pages, 4 tables, 2 appendices. Live system: https://ceres.northflow.no

2603.09318 2026-03-11 stat.ME stat.AP stat.OT

Anomaly detection using surprisals

Rob J Hyndman, David T. Frazier

2603.09314 2026-03-11 math.ST stat.TH

Second order asymptotics for the number of times an estimator is more than epsilon from its target value

Nils Lid Hjort, Grete Fenstad

Comments 11 pages, no figures; Statistical Research Report, Department of Mathematics, University of Oslo, September 1994, but now arXiv'd March 2026. The paper has appeared in essentially this form in Journal of Statistical Planning and Inference, 1995, vol. 48, pages 261-275, at this url: www.sciencedirect.com/science/article/pii/037837589500008W

2603.09310 2026-03-11 cs.LG math.PR stat.ML

A Gaussian Comparison Theorem for Training Dynamics in Machine Learning

Ashkan Panahi

2603.09306 2026-03-11 stat.ME

Contrastive Bayesian Inference for Unnormalized Models

Naruki Sonobe, Shonosuke Sugasawa, Daichi Mochihashi, Takeru Matsuda

2603.09257 2026-03-11 cs.LG stat.ML

Transductive Generalization via Optimal Transport and Its Application to Graph Node Classification

MoonJeong Park, Seungbeom Lee, Kyungmin Kim, Jaeseung Heo, Seunghyuk Cho, Shouheng Li, Sangdon Park, Dongwoo Kim

2603.09251 2026-03-11 stat.ML cs.LG cs.NA math.NA

A Generative Sampler for distributions with possible discrete parameter based on Reversibility

Lei Li, Zhen Wang, Lishuo Zhang

2603.09168 2026-03-11 cs.LG cs.DS stat.ML

Better Bounds for the Distributed Experts Problem

David P. Woodruff, Samson Zhou

2603.06602 2026-03-11 cs.LG stat.ML

Khatri-Rao Clustering for Data Summarization

Martino Ciaperoni, Collin Leiber, Aristides Gionis, Heikki Mannila

2603.00945 2026-03-11 math.OC cs.LG stat.ML

Non-Rectangular Average-Reward Robust MDPs: Optimal Policies and Their Transient Values

Shengbo Wang, Nian Si

2602.20007 2026-03-11 math.ST stat.ME stat.TH

Order-Induced Variance in the Moving-Range Sigma Estimator: A Total-Variance Decomposition

Andrew T. Karl

2601.14947 2026-03-11 math.ST stat.ME stat.TH

Central subspace data depth

Giacomo Francisci, Claudio Agostinelli

Comments 25+34 pages, 7+4 figures

2601.05355 2026-03-11 stat.ML cs.AI cs.LG stat.CO stat.ME

An AI-powered Bayesian Generative Modeling Approach for Arbitrary Conditional Inference

Qiao Liu, Wing Hung Wong

2512.11427 2026-03-11 stat.ME

Conditional Copula models using loss-based Bayesian Additive Regression Trees

Tathagata Basu, Fabrizio Leisen, Cristiano Villa, Kevin Wilson

Comments typos related to loss function inside the prior is fixed

2509.18978 2026-03-11 math.ST math.DG math.PR stat.TH

Refining Cramér-Rao Bound With Multivariate Parameters: An Extrinsic Geometry Perspective

Sunder Ram Krishnan

Comments Vector parameter extension of work done in arXiv:2509.17886

2509.17886 2026-03-11 math.ST math.DG math.PR stat.TH

Improving Cramér-Rao Bound And Its Variants: An Extrinsic Geometry Perspective

Sunder Ram Krishnan

Comments Improved and corrected version

2509.10325 2026-03-11 stat.ME

Using the rejection sampling for finding tests

Markku Kuismin

2509.10166 2026-03-11 stat.ML cs.LG

Repulsive Monte Carlo on the sphere for the sliced Wasserstein distance

Vladimir Petrovic, Rémi Bardenet, Agnès Desolneux

详情

英文摘要

In this paper, we consider the problem of computing the integral of a function on the unit sphere, in any dimension, using Monte Carlo methods. Although the methods we present are general, our guiding thread is the sliced Wasserstein distance between two measures on $\mathbb{R}^d$, which is precisely an integral on the $d$-dimensional sphere. The sliced Wasserstein distance (SW) has gained momentum in machine learning either as a proxy to the less computationally tractable Wasserstein distance, or as a distance in its own right, due in particular to its built-in alleviation of the curse of dimensionality. There has been recent numerical benchmarks of quadratures for the sliced Wasserstein, and our viewpoint differs in that we concentrate on quadratures where the nodes are repulsive, i.e. negatively dependent. Indeed, negative dependence can bring variance reduction when the quadrature is adapted to the integration task. Our first contribution is to extract and motivate quadratures from the recent literature on determinantal point processes (DPPs) and repelled point processes, as well as repulsive quadratures from the literature specific to the sliced Wasserstein distance. We then numerically benchmark these quadratures. Moreover, we analyze the variance of the UnifOrtho estimator, an orthogonal Monte Carlo estimator. Our analysis sheds light on UnifOrtho's success for the estimation of the sliced Wasserstein in large dimensions, as well as counterexamples from the literature. Our final recommendation for the computation of the sliced Wasserstein distance is to use randomized quasi-Monte Carlo in low dimensions and UnifOrtho in large dimensions. DPP-based quadratures only shine when quasi-Monte Carlo also does, while repelled quadratures show moderate variance reduction in general, but more theoretical effort is needed to make them robust.

URL PDF HTML ☆

赞 0 踩 0

2506.20533 2026-03-11 stat.ML cs.LG math.OC

Global Convergence of Iteratively Reweighted Least Squares for Robust Subspace Recovery

Gilad Lerman, Kang Li, Tyler Maunu, Teng Zhang

2506.12842 2026-03-11 cs.SI cs.LG stat.ML

Uncovering Social Network Activity Using Joint User and Topic Interaction

Gaspard Abel, Argyris Kalogeratos, Jean-Pierre Nadal, Julien Randon-Furling

Comments Content: 13 pages, 8 figures, 4 tables

2506.00168 2026-03-11 q-bio.QM q-bio.CB stat.ML

SSRCA: a novel machine learning pipeline to perform sensitivity analysis for agent-based models

Edward H. Rohr, John T. Nardini

2503.20940 2026-03-11 stat.ME

A Restricted Latent Class Hidden Markov Model for Polytomous Responses, Polytomous Attributes, and Covariates: Identifiability and Application

Eric Alan Wayman, Steven Andrew Culpepper, Jeff Douglas, Jesse Bowers

Comments 60 pages, 3 figures, 34 tables. Edited language for clarity, removed one table, and fixed typos. Published in the Journal of Educational and Behavioral Statistics

2502.15933 2026-03-11 stat.ME stat.AP

Empirical best prediction of poverty indicators via nested error regression with high dimensional parameters

Yuting Chen, Partha Lahiri, Nicola Salvati

2410.09067 2026-03-11 stat.AP cs.CG physics.soc-ph

Evaluating Cooling Center Coverage Using Persistent Homology of a Filtered Witness Complex

Erin O'Neil, Sarah Tymochko

2410.05861 2026-03-11 stat.ME econ.EM

Persistence-Robust Break Detection in Predictive CoVaR Regressions

Yannick Hoga

2410.02840 2026-03-11 cs.LG cs.CY math.ST stat.TH

Overcoming Representation Bias in Fairness-Aware data Repair using Optimal Transport

Abigail Langbridge, Anthony Quinn, Robert Shorten

2407.05277 2026-03-11 eess.SP math.ST stat.TH

Einstein from Noise: Statistical Analysis

Amnon Balanov, Wasim Huleihel, Tamir Bendory

2402.18741 2026-03-11 stat.ME

Spectral Graph Filtering for Modality-Specific Representation Learning

Shira Yoffe, Amit Moscovich, Ariel Jaffe

2311.15485 2026-03-11 stat.ME

Calibrated Generalized Bayesian Inference

David T. Frazier, Christopher Drovandi, Robert Kohn

Comments This paper is a substantially revised version of arXiv:2302.06031v1. This revised version has a slightly different focus, additional examples, and theoretical results, as well as different authors

2210.13687 2026-03-11 stat.AP cs.CY

Implicit Biases in Refereeing: Lessons from NBA Referees

Konstantinos Pelechrinis

1904.11060 2026-03-11 econ.EM math.ST stat.TH

Normal Approximation in Large Network Models

Michael P. Leung, Hyungsik Roger Moon

2603.09067 2026-03-11 stat.ML cond-mat.stat-mech cs.LG math-ph math.MP

Verifying Good Regulator Conditions for Hypergraph Observers: Natural Gradient Learning from Causal Invariance via Established Theorems

Max Zhuravlev

Comments 18 pages, 15 formal results. Part of a series of companion papers submitted simultaneously; cross-references updated with arXiv IDs in v2

2603.09061 2026-03-11 stat.AP stat.ME

Distribution-free screening of spatially variable genes in spatial transcriptomics

Changhu Wang, Qiyun Huang, Zihao Chen, Jin Liu, Ruibin Xi

2603.09058 2026-03-11 stat.ME cs.LG

Adaptive Active Learning for Online Reliability Prediction of Satellite Electronics

Shixiang Li, Yubin Tian, Dianpeng Wang, Piao Chen, Mengying Ren

2603.09041 2026-03-11 stat.ME stat.AP

AgroDesign: A Design-Aware Statistical Inference Framework for Agricultural Experiments in Python

Aqib Gul

Comments 21 pages, 8 figures, 8 tables

2603.09009 2026-03-11 stat.ML cs.LG

Statistical Inference via Generative Models: Flow Matching and Causal Inference

Shinto Eguchi

2603.08981 2026-03-11 stat.ME stat.AP

Uncertainty quantification for critical energy systems during compound extremes via BMW-GAM

Mitchell L. Krock, W. Neal Mann, Zhi Zhou

2603.08979 2026-03-11 math.OC cs.LG stat.ML

Data-driven robust Markov decision processes on Borel spaces: performance guarantees via an axiomatic approach

Sivaramakrishnan Ramani

2603.08963 2026-03-11 stat.ME stat.ML

Estimation of heterogeneous principal effects under principal ignorability

Rui Zhang, Charles R. Doss, Jared D. Huling

2603.08947 2026-03-11 stat.ML cs.LG

Towards Reliable Simulation-based Inference

Arnaud Delaunoy

Comments PhD thesis

详情

英文摘要

Scientific knowledge expands by observing the world, hypothesizing some theories about it, and testing them against collected data. When those theories take the form of statistical models, statistical analyses are involved in the process of testing and refining scientific hypotheses. In this thesis, we focus on statistical models that take the form of scientific simulators and provide background about how machine learning can be used for statistical analyses in this context. The first part of this thesis is about showing empirically that performing statistical analyses with machine learning involves a degree of approximation. Specifically, all statistical analyses involve a level of uncertainty in the conclusions drawn, and we show that approximations can lead to overconfident conclusions. We draw caution regarding such overconfident conclusions and introduce a criterion to diagnose overconfident approximations. In the second part, we introduce balancing, a way to regularize machine learning models to reduce overconfidence and favor calibrated or underconfident approximations. Balancing is first introduced for neural ratio estimation algorithms and then extended to other algorithms. Intuition about why balancing leads to less overconfident solutions is provided, and it is shown empirically that balanced algorithms are often either close to calibrated or underconfident. The third part shows that Bayesian neural networks can also be used to mitigate the overconfidence of approximations. Unlike balancing, no regularization is required, and this solution can then work with few training samples and, hence, computationally expensive simulators. To that end, a new Bayesian neural network prior tailored for simulation-based inference is developed, and empirical results show a reduction in overconfidence compared to similar solutions without Bayesian neural networks.

URL PDF HTML ☆

赞 0 踩 0

2603.08945 2026-03-11 math.ST cs.LG stat.ML stat.TH

Kernel Debiased Plug-in Estimation based on the Universal Least Favorable Submodel

Haiyi Chen, Yang Liu, Ivana Malenica

2603.08925 2026-03-11 math.ST stat.ML stat.TH

Functional Bias and Tangent-Space Geometry in Variational Inference

Sean Plummer

2603.08907 2026-03-11 cs.LG cs.AI stat.ML

Cross-Domain Uncertainty Quantification for Selective Prediction: A Comprehensive Bound Ablation with Transfer-Informed Betting

Abhinaba Basu

详情

英文摘要

We present a comprehensive ablation of nine finite-sample bound families for selective prediction with risk control, combining concentration inequalities (Hoeffding, Empirical Bernstein, Clopper-Pearson, Wasserstein DRO, CVaR) with multiple-testing corrections (union bound, Learn Then Test fixed-sequence) and betting-based confidence sequences (WSR). Our main theoretical contribution is Transfer-Informed Betting (TIB), which warm-starts the WSR wealth process using a source domain's risk profile, achieving tighter bounds in data-scarce settings with a formal dominance guarantee. We prove that the TIB wealth process remains a valid supermartingale under all source-target divergences, that TIB dominates standard WSR when domains match, and that no data-independent warm-start can achieve better convergence. The combination of betting-based confidence sequences, LTT monotone testing, and cross-domain transfer is, to our knowledge, a three-way novelty not present in the literature. We evaluate all nine bound families on four benchmarks-MASSIVE (n=1,102), NyayaBench (n=280), CLINC-150 (n=22.5K), and Banking77 (n=13K)-across 18 (alpha, delta) configurations. On MASSIVE at alpha=0.10, LTT eliminates the ln(K) union-bound penalty, achieving 94.0% guaranteed coverage versus 73.8% for Hoeffding-a 27% relative improvement. On NyayaBench, where the small calibration set makes Hoeffding-family bounds infeasible below alpha=0.20, Transfer-Informed Betting achieves 18.5% coverage at alpha=0.10, a 5.4x improvement over LTT + Hoeffding. We additionally compare with split-conformal prediction, showing that conformal methods produce prediction sets (avg. 1.67 classes) whereas selective prediction provides single-prediction risk guarantees. We apply these methods to agentic caching systems, formalizing a progressive trust model where the guarantee determines when cached responses can be served autonomously.

URL PDF HTML ☆

赞 0 踩 0

2603.08871 2026-03-11 stat.ME

Efficient semiparametric estimation of marginal treatment effects with genetic instrumental variables

Ashish Patel, Francis J DiTraglia, Stephen Burgess

2603.08803 2026-03-11 cs.LG stat.ML

The Temporal Markov Transition Field

Michael Leznik

Comments 13 pages, 2 figures

2603.08773 2026-03-11 cs.LG cs.AI stat.ML

Multi-level meta-reinforcement learning with skill-based curriculum

Sichen Yang, Mauro Maggioni

Comments 78 pages, 12 figures

详情

英文摘要

We consider problems in sequential decision making with natural multi-level structure, where sub-tasks are assembled together to accomplish complex goals. Systematically inferring and leveraging hierarchical structure has remained a longstanding challenge; we describe an efficient multi-level procedure for repeatedly compressing Markov decision processes (MDPs), wherein a parametric family of policies at one level is treated as single actions in the compressed MDPs at higher levels, while preserving the semantic meanings and structure of the original MDP, and mimicking the natural logic to address a complex MDP. Higher-level MDPs are themselves independent MDPs with less stochasticity, and may be solved using existing algorithms. As a byproduct, spatial or temporal scales may be coarsened at higher levels, making it more efficient to find long-term optimal policies. The multi-level representation delivered by this procedure decouples sub-tasks from each other and usually greatly reduces unnecessary stochasticity and the policy search space, leading to fewer iterations and computations when solving the MDPs. A second fundamental aspect of this work is that these multi-level decompositions plus the factorization of policies into embeddings (problem-specific) and skills (including higher-order functions) yield new transfer opportunities of skills across different problems and different levels. This whole process is framed within curriculum learning, wherein a teacher organizes the student agent's learning process in a way that gradually increases the difficulty of tasks and and promotes transfer across MDPs and levels within and across curricula. The consistency of this framework and its benefits can be guaranteed under mild assumptions. We demonstrate abstraction, transferability, and curriculum learning in examples, including MazeBase+, a more complex variant of the MazeBase example.

URL PDF HTML ☆

赞 0 踩 0

2603.08753 2026-03-11 stat.ML cs.AI cs.LG

Permutation-Equivariant 2D State Space Models: Theory and Canonical Architecture for Multivariate Time Series

Seungwoo Jeong, Heung-Il Suk

2603.08742 2026-03-11 cs.NE cs.LG cs.NA math.NA stat.ML

Robust Parameter and State Estimation in Multiscale Neuronal Systems Using Physics-Informed Neural Networks

Changliang Wei, Yangyang Wang, Xueyu Zhu

2603.06820 2026-03-11 econ.EM stat.OT

Hippocratic Utility

Tomasz Strzalecki

2603.06465 2026-03-11 stat.AP

Risk Prediction in Cancer Imaging Using Enriched Radiomics Features

Alec Reinhardt, Tsung-Hung Yao, Raven Hollis, Galia Jacobson, Millicent Roach, Mohamed Badawy, Peter Park, Laura Beretta, David Fuentes, Newsha Nikzad, Prasun Jalal, Eugene Koay, Suprateek Kundu

详情

英文摘要

Background: We aim to develop enriched radiomics features that integrate classical structural radiomics with novel functional radiomics derived from liver MRI for diagnosis and risk stratification in liver cancer. The proposed framework leverages enhancement pattern mapping (EPM) images to provide an automated and robust radiomics representation that captures intratumoral heterogeneity through pixel-level functional information. Methods: Pixel-wise EPM data reflecting blood perfusion were extracted from T1-weighted MRI scans. Classical structural radiomics features were extracted via existing software such as PyRadiomics. In addition, empirical quantiles of EPM values over all pixels within the image, and then smoothed using suitable basis. The smoothed quantiles, along with the classical structural quantiles, are used as functional radiomics features for diagnostic classification and tumor grade stratification, using L1-penalized logistic model that automatically downweights the contribution of the irrelevant features. Further, we conducted longitudinal analyses using Bayesian tensor response regression, which enables spatial smoothing and parsimonious modeling of temporally evolving imaging patterns. Results: The enriched radiomics features illustrate higher diagnostic classification performance (AUC=0.96, sensitivity> 0.8) and superior tumor grade stratification accuracy (AUC=0.87, sensitivity=0.8) compared to alternate radiomics features. Moreover, we find that the proportion of lesion pixels with significant reduction in EPM values over time is considerably higher (median = 0.12) in aggressive lesions versus stable or mildly aggressive lesions (median = 0.025). Conclusion: The enriched novel radiomics features can potentially replace classical radiomics analysis and be used for imaging biomarkers in cross-sectional and in longitudinal cancer imaging studies.

URL PDF HTML ☆

赞 0 踩 0

2602.10696 2026-03-11 stat.ML cs.LG math.OC math.ST stat.TH

Robust Assortment Optimization from Observational Data

Miao Lu, Yuxuan Han, Han Zhong, Zhengyuan Zhou, Jose Blanchet

Comments 65 pages, 9 figures

2602.04146 2026-03-11 math.ST stat.TH

Bayes, E-values and Testing

Nicholas G. Polson, Vadim Sokolov, Daniel Zantedeschi

Comments Revised submission: fixed typos, added clarifications, and compressed the exposition

2510.16232 2026-03-11 stat.ML cs.LG cs.MA cs.SY eess.SY

Personalized Collaborative Learning with Affinity-Based Variance Reduction

Chenyu Zhang, Navid Azizan

Comments Published as a conference paper at ICLR 2026

2508.20943 2026-03-11 stat.AP

DESA: An R Package for Detecting Epidemics using a School-Absenteeism Surveillance Framework

Vinay Joshy, Zeny Feng, Lorna Deeth, Kayla Vanderkruk, Justin Slater

2508.20924 2026-03-11 math.ST math.PR stat.TH

Palm distributions of superposed point processes for statistical inference

Mario Beraha, Federico Camerlenghi, Lorenzo Ghilotti

Comments This submission replaces arXiv:2409.14753

2506.04626 2026-03-11 stat.ML cs.LG

Regret-Optimal Q-Learning with Low Cost for Single-Agent and Federated Reinforcement Learning

Haochen Zhang, Zhong Zheng, Lingzhou Xue

Comments arXiv admin note: text overlap with arXiv:2502.02859

2504.04528 2026-03-11 cs.LG cs.AI stat.ME stat.ML

A Consequentialist Critique of Binary Classification Evaluation: Theory, Practice, and Tools

Gerardo Flores, Abigail Schiff, Alyssa H. Smith, Julia A Fukuyama, Ashia C. Wilson

2410.12367 2026-03-11 math.ST cs.LG stat.ME stat.TH

Adaptive and Stratified Subsampling for High-Dimensional Robust Estimation

Prateek Mittal, Joohi Chauhan

2409.13060 2026-03-11 stat.ME

Forecasting Causal Effects of Future Interventions: Confounding and Transportability Issues

Laura Forastiere, Fan Li, Michela Baccini

2405.11111 2026-03-11 stat.ME

Euclidean mirrors and first-order changepoints in network time series

Tianyi Chen, Zachary Lubberts, Avanti Athreya, Youngser Park, Carey E. Priebe

2401.15014 2026-03-11 stat.ME

Constructing Genetic Risk Scores: Robust Bayesian Approach through Projected Summary Statistics and Flexible Shrinkage

Yuzheng Dun, Nilanjan Chatterjee, Jin Jin, Akihiko Nishimura

2307.14282 2026-03-11 econ.EM econ.TH stat.ME

Causal Effects in Matching Mechanisms with Strategically Reported Preferences

Marinho Bertanha, Margaux Luflade, Ismael Mourifié

2202.00190 2026-03-11 math.ST stat.ME stat.TH

Sketching stochastic valuation functions

Milan Vojnović, Yiliu Wang