arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2605.12492 2026-05-13 cs.LG stat.ML

Pion: A Spectrum-Preserving Optimizer via Orthogonal Equivalence Transformation

Kexuan Shi, Hanxuan Li, Zeju Qiu, Yandong Wen, Simon Buchholz, Weiyang Liu

AI总结本文提出了一种基于正交等价变换的谱值保持优化器Pion，用于大语言模型的训练。与Adam等加法优化器不同，Pion通过左右正交变换更新权重矩阵，从而在训练过程中保持其奇异值不变。该方法在调整权重矩阵几何结构的同时固定其谱范数，实验表明Pion在大模型预训练和微调任务中表现出稳定且具有竞争力的性能。

2605.12461 2026-05-13 math.ST cs.DS cs.LG stat.ML stat.TH

A proximal gradient algorithm for composite log-concave sampling

Linghai Liu, Sinho Chewi

AI总结本文提出了一种用于从复合对数凹分布中采样的近端梯度算法，该分布形式为 $π \propto e^{-f - g}$，假设能够获取 $f$ 的梯度以及 $g$ 的受限高斯预言机（RGO）。该算法通过结合梯度信息和 RGO 采样，实现了高效的采样过程。研究证明，在 $f + g$ 强凸且 $f$ 光滑的条件下，该算法在总变分距离下达到 $\varepsilon$ 精度所需的迭代次数为 $\widetilde{\mathcal{O}}(κ\sqrt{d} \log^4(1/\varepsilon))$，与现有最优结果一致，并进一步扩展到非对数凹分布和非光滑 $f$ 的情形。

2605.12410 2026-05-13 stat.ML cs.LG math.OC math.ST stat.TH

Model-based Bootstrap of Controlled Markov Chains

Ziwei Su, Imon Banerjee, Diego Klabjan

AI总结本文提出并分析了一种基于模型的引导方法，用于估计有限可控马尔可夫链（CMC）中的转移核，适用于可能具有非平稳或历史依赖控制策略的情形，这在行为策略未知的离线强化学习中具有重要意义。研究通过引入新的引导大数定律和鞅中心极限定理，建立了引导转移估计器在分布上的一致性，并进一步扩展到离线策略评估和最优策略恢复任务，获得了价值函数和Q函数的渐近有效置信区间。实验表明，该方法在覆盖精度上优于现有方法，尤其在小样本和短回合场景下表现更优。

2605.12341 2026-05-13 stat.ML cs.LG

Multi-Variable Conformal Prediction: Optimizing Prediction Sets without Data Splitting

Laura Lützow, Simone Garatti, Marco C. Campi, Lars Lindemann, Matthias Althoff

AI总结该论文提出了一种多变量校准预测（MCP）框架，旨在在不进行数据划分的情况下优化预测集的形状，同时保持有限样本下的覆盖保证。MCP 扩展了传统校准预测方法，支持向量值评分函数和多个校准变量，将预测集设计与校准统一为一个优化问题。研究提出了两种高效变体 RemMCP 和 RelMCP，分别适用于不同类型的优化需求，并在实验中验证了其在保持目标覆盖的同时，能够获得更小或相当的预测集大小，并显著降低校准过程中的方差。

2605.12338 2026-05-13 cs.LG cs.AI stat.CO

Manifold Sampling via Entropy Maximization

Cornelius V. Braun, Tilman Burghoff, Marc Toussaint

AI总结该论文研究了在由平滑等式和不等式约束隐式定义的流形上进行采样的问题，特别是在可行域包含多个不连通部分的情况下。为了解决这一挑战，作者提出了基于熵最大化重采样的MASEM方法，通过k近邻密度估计最大化经验分布的熵，从而提升采样效率。实验表明，MASEM在合成数据和机器人应用中表现出优越的混合效率和可扩展性，显著优于现有方法。

2605.12301 2026-05-13 cs.LG math.ST stat.TH

Approximation of Maximally Monotone Operators : A Graph Convergence Perspective

Takashi Furuya, Yury Korolev, Takaharu Yaguchi

AI总结该论文研究了如何通过图收敛方法对极大单调算子进行逼近，这类算子在数学和机器学习中具有重要应用。传统上的均匀或 $L^p$ 近似方法在处理此类算子时存在局限，作者提出利用图收敛（Painlevé-Kuratowski 收敛）作为逼近框架，证明了任何极大单调算子均可通过编码-解码结构进行局部图收敛逼近，并构建了保持极大单调性质的结构化近似方法。这一成果为处理不连续或集值算子的算子学习提供了新的理论基础和方法。

2605.12296 2026-05-13 math.ST stat.TH

Efficiency of pattern-based independence test

L. Baringhaus, R. Grübel

AI总结本文研究了基于模式的独立性检验的效率问题，探讨了不同长度模式所对应检验的一致性及其在大样本下的渐近相对效率。通过连接离散数学与理论计算机科学中的概念，如拟随机性与模式一致性，文章详细描述了相关检验的极限分布，并提供了理论分析与数值模拟结果，为实际应用中的独立性检验提供了理论支持与指导。

2605.12248 2026-05-13 stat.ME stat.AP stat.CO

Time-variant reliability using time-dependent surrogate models

Stefano Marelli, Styfen Schär, Bruno Sudret

AI总结本文研究了具有时变特性的工程动态系统在随机激励下的可靠性分析问题，针对传统蒙特卡洛方法计算成本高的问题，提出两种新型代理模型方法——mNARX和F-NARX，分别通过降维流形和函数轨迹特征提取，有效处理高维输入和时序依赖性，实验表明这两种方法在两个典型可靠性问题中能够准确捕捉系统响应的尾部行为，从而高效估计首次穿越概率。

2605.12235 2026-05-13 stat.ML cs.LG

Optimal Policy Learning under Budget and Coverage Constraints

Giovanni Cerulli

AI总结本文研究在预算和最低覆盖约束下的最优策略学习问题，揭示了该问题具有类似于背包问题的结构，并证明最优策略可通过结合预算和覆盖影子价格的线性阈值规则来刻画。研究还表明其组合优化的线性规划松弛具有常数积分间隙，意味着离散分配与最优解在渐近情况下等价。基于此，作者提出了两种可实施的算法——贪心拉格朗日算法和排序-切割算法，并通过实验验证了它们在不同条件下的近似最优性能。

2605.12190 2026-05-13 stat.ML cs.LG

Information-Theoretic Generalization Bounds for Sequential Decision Making

Futoshi Futami, Masahiro Fujisawa

AI总结本文研究了序贯决策问题中的泛化界分析，针对在线学习、流式主动学习和多臂老虎机等场景，提出了一个序贯超样本框架。该方法通过分离学习者的过滤过程与用于幽灵坐标比较的证明扩展，引入了基于轮次选择器-损失信息项的序贯条件互信息（CMI）来控制泛化差距，并在适当方差条件下建立了伯恩斯坦型改进，提升了收敛速率。该方法适用于多种序贯决策场景，为算法依赖的泛化分析提供了新工具。

2605.12136 2026-05-13 stat.ME

Synthetic Control Method with Mixed Frequency Data

Lu Zhang, Shijin Gong, Xinyu Zhang

AI总结本文研究了如何在混合频率数据下有效应用合成控制方法（SCM），以解决经济学和金融学中常见的不同时间分辨率变量问题。作者提出了一种新的混合频率合成控制方法（MF-SCM），通过构建灵活的估计过程，在不丢失信息的前提下整合不同频率的数据，并建立了该估计方法的理论性质。该方法在数值模拟和两个实证案例中得到验证，展示了其在处理实际政策效果评估中的有效性。

2605.12103 2026-05-13 stat.ME

Informative Simultaneous Confidence Intervals for Graphical Group Sequential Test Procedures

Liane Kluge, Werner Brannath

AI总结本文研究了在多假设组序临床试验中控制族误差率的检验方法，提出了一种新的基于当前阶段重复p值的检验策略，该策略在所有假设中具有一致性且功效更高。为该检验方法提供了相应的同时置信区间，并进一步扩展了单阶段图形检验的有信息置信区间方法，引入了迭代算法以计算接近原检验功效的有信息边界，同时提供了评估数值边界精度的标准，有助于更可靠地估计多假设组序试验中的治疗效应。

2605.12099 2026-05-13 stat.ME q-fin.ST

Bayesian Dynamic Modeling of Realized Volatility in Financial Asset Price Forecasting

Patrick Woitschig, Mike West

AI总结本文提出了一类用于金融资产价格和实现波动率双变量时间序列的贝叶斯动态模型。该模型将新的动态伽马过程应用于实现波动率建模，并与传统的贝叶斯动态线性模型相结合，以捕捉价格序列中的波动率杠杆效应和反馈效应。研究通过高频数据合成，提升了对波动率变化的跟踪与预测能力，并在多个标普行业ETF的实证分析中展示了其在资产价格预测方面的优越性，为投资组合构建和风险管理提供了实用的理论支持。

2605.12092 2026-05-13 stat.ME

Laplacian-P-splines for shared Gamma frailty models applied to clustered right-censored time-to-event data

Piotr Lewczuk, Oswaldo Gressani, Steven Abrams, Christel Faes

AI总结本文研究了在存在未观测集群风险因素的右删失时间到事件数据中，如何利用共享Gamma frailty模型进行分析。作者提出了一种基于Laplacian-P-splines（LPS）的方法，通过高斯近似避免了传统的马尔可夫链蒙特卡洛采样，从而实现了高效且无需采样的参数估计。该方法提供了梯度和海森矩阵的解析表达式，提高了计算效率并能自然量化不确定性。文章通过模拟研究和三个实际生物医学数据集验证了该方法的有效性。

2605.12089 2026-05-13 stat.ME astro-ph.IM hep-ex stat.AP

Power Studies For Two-Sample and Goodness-of-Fit Methods For Multivariate Data

Wolfgang Rolke

AI总结本文通过大量模拟研究，探讨了多种多元数据 goodness-of-fit 和非参数两样本检验方法的统计功效。研究发现，不同方法在不同假设组合下的表现差异较大，因此作者推荐了一组效果较优的方法，以确保在各类案例中至少有一种方法具有良好的检验能力。研究使用了 R 包 MD2sample 和 MDgof 进行实现。

2605.12025 2026-05-13 cs.LG stat.ML

Approximation Theory of Laplacian-Based Neural Operators for Reaction-Diffusion System

Takashi Furuya, Ryo Ozawa, Jenn-Nan Wang

AI总结本文研究了基于拉普拉斯算子的神经算子在非线性反应-扩散系统中的逼近理论，以通用的Gierer-Meinhardt模型为例，分析了从初始条件到时间依赖解的映射学习问题。通过利用PDE格林函数的拉普拉斯谱表示，作者建立了神经网络深度、宽度和谱秩相关的显式逼近误差界，证明了所需参数复杂度随目标精度呈多项式增长，从而克服了传统算子学习中面临的参数复杂度指数增长问题。数值实验验证了理论结果的有效性。

2605.11987 2026-05-13 cs.AI cs.LG stat.AP stat.ML

Random-Set Graph Neural Networks

Tommy Woodley, Shireen Kudukkil Manchingal, Matteo Tolloso, Davide Bacciu, Fabio Cuzzolin

AI总结本文提出了一种新的图神经网络框架——随机集图神经网络（RS-GNN），用于更准确地量化节点层面的不确定性。该方法通过信念函数形式对节点的认识不确定性进行建模，能够同时输出精确的概率预测和不确定性度量。实验表明，RS-GNN在多个真实世界的图学习数据集上表现出优越的不确定性量化能力。

2605.11983 2026-05-13 cs.LG stat.ML

QDSB: Quantized Diffusion Schrödinger Bridges

Tobias Fuchs, Florian Kalinke, Nadja Klein

AI总结在源分布和目标分布仅通过未配对样本指定的情况下，生成模型的学习变得越来越重要。本文提出了一种名为QDSB的量化扩散Schrödinger桥方法，用于加速无模拟Schrödinger桥的训练过程。该方法通过在锚点量化后的分布上计算端点耦合，并通过单元采样将结果映射回原始数据点，从而减少计算成本并保持全局传输结构的稳定性。实验表明，QDSB在保持样本质量的同时显著提升了训练效率。

2605.11973 2026-05-13 math.ST math.PR stat.TH

Stochastic Ordering under Weaker Likelihood-Ratio Shape Conditions

Z. Derbazi

AI总结本文研究了在较弱的似然比形状条件下方，风险率和通常随机序的端点准则仍能成立的问题。通过引入单峰性及似然比减一的符号模式条件，作者证明了端点性质的保持性，并进一步提出了一种基于超水平集的直接判别准则，该准则对不连续的似然比情形尤为有效。这一成果拓展了传统序关系的适用范围，为相关统计推断提供了更灵活的理论支持。

2605.11935 2026-05-13 stat.ME stat.CO

Bayesian low-rank latent-cluster regression for mixed health outcomes

Hsin-Hsiung Huang, Suyeon Kang

AI总结该研究提出了一种贝叶斯低秩潜聚类回归模型，用于处理高维健康和监测数据中的多类型相关响应变量和潜在异质性问题。模型通过有限混合回归表面的形式，为每个潜聚类提供特定的均值偏移和低秩系数矩阵，实现了聚类、降维和可解释性分析。研究还引入了乘法伽马过程收缩以自适应调整各聚类的有效秩，并基于WAIC准则确定聚类数和最大秩，展示了模型在不同数据生成机制下的良好聚类性能和实际应用效果。

2605.11926 2026-05-13 stat.AP

An ensemble prediction method for forecasting sap flux density and water-use in temperate trees

Mengyi Gong, Rebecca Killick, Andrew Hirons

AI总结本文提出了一种基于加性模型的集成预测方法，用于预测温带树木的日水分消耗，旨在提升灌溉管理的效率。该方法结合树液流传感器、气象站数据和统计模型，通过考虑环境因子与树液流密度之间的非线性关系及个体树木在不同生长季的变化，实现了对树木水分利用的可靠预测。研究利用2022至2024年九种树木的实地数据验证了方法的有效性，并探讨了气候胁迫和树木大小对预测的影响，为农业、林业及生态保护提供了通用的预测框架。

详情

Comments: Main manuscript: 18 pages, 6 figures. Supplementary document: 11 pages, 10 figures

英文摘要

Efficient irrigation management is crucial to agriculture, forestry and horticulture, especially under climate change. Developments in novel sensors and Internet of Things technology provide an opportunity to carry out real-time monitoring of tree sap flux density, which, when coupled with advanced modelling techniques, enables online prediction of tree water-use suitable for irrigation planning. This manuscript proposes one such pipeline that integrates tree sap flow sensors, weather station sensors, and statistical models to predict tree daily water-use. In particular, an ensemble prediction approach based on additive models has been developed, using weather data as the main predictors of sap flux density. The method simultaneously considers the non-linear relationships and interactions between sap flux density and its environmental drivers, as well as the variability among individual trees over different growing seasons. Using field data collected on nine species of trees over the 2022, 2023 and 2024 growing seasons, this manuscript demonstrates the ability of the proposed ensemble prediction method in producing reliable daily water-use forecasts. The challenge of predicting tree water-use under climate stress, such as heatwaves, and the impact of tree sizes on prediction have also been discussed. Despite the complexity of the problem, the proposed method provides a general framework which can be used in a variety of settings, from commercial tree growers to conversation work. The model can be integrated into an online monitoring platform, assisting real-time decision making on irrigation management.

URL PDF HTML ☆

赞 0 踩 0

2605.11872 2026-05-13 cs.LG stat.ML

LOFT: Low-Rank Orthogonal Fine-Tuning via Task-Aware Support Selection

Lanxin Zhao, Bamdev Mishra, Pratik Jawanpuria, Lequan Lin, Dai Shi, Junbin Gao, Andi Han

AI总结该论文提出了一种名为LOFT的低秩正交微调框架，旨在解决现有正交参数高效微调方法中子空间选择与变换方式混淆的问题。LOFT通过将正交微调视为子空间旋转，统一了多种已有方法，并将支持选择作为核心设计要素，提出了基于任务信号的实用支持选择策略。实验表明，LOFT在多个任务中表现出优越的效率与性能平衡，突显了合理支持选择对提升正交微调效果的重要性。

2605.11865 2026-05-13 stat.ML cs.LG

Variance-aware Reward Modeling with Anchor Guidance

Shuxing Fang, Ruijian Han, Liangyu Zhang, Fan Zhou

AI总结本文研究了在人类偏好多样化的情况下，如何改进奖励模型以更准确地反映偏好不确定性。提出了一种基于锚点引导的方差感知奖励建模方法，通过引入两个粗粒度的响应级锚点标签，解决了高斯奖励模型在仅依赖成对偏好数据时的基本不可识别性问题。该方法在理论分析和多个实际数据集上均表现出优越的奖励建模性能和强化学习效果。

2605.11841 2026-05-13 stat.ML cs.AI cs.LG

Minimax Rates and Spectral Distillation for Tree Ensembles

Binh Duc Vu, David S. Watson

AI总结本文研究了随机森林和梯度提升机等树集成模型的理论性质，提出了基于谱方法的分析框架。通过分析诱导核算子的特征值衰减，得出了随机森林回归的最小最大收敛率，并基于这一视角开发了模型压缩方法。该方法通过学习核算子或平滑矩阵的主特征函数或奇异向量，生成预测性能优异但规模大幅缩减的蒸馏模型，适用于资源受限的计算场景。

2605.08471 2026-05-13 math.ST stat.TH

Asymptotics for likelihood ratio tests of boundary points with singular information and unidentifiable nuisance parameters

Karl Oskar Ekvall, Ola Hössjer, Matteo Bottai, J. M. Patrik Albin

AI总结本文研究了在 nuisance 参数不可识别、参数兴趣位于参数空间边界且可识别参数的信息矩阵可能奇异的情况下，似然比检验的渐近分布问题。作者提出了一种适用于此类复杂情形的理论框架，指出检验统计量的渐近分布为 $\barχ^2$-过程的上确界，并在局部备择假设下推广为非中心 $\barχ^2$-过程的上确界。该结果统一并扩展了现有的一些边界推断和奇异信息情形下的结论，适用于混合模型、遗传连锁分析等实际问题。

2605.02858 2026-05-13 math.ST stat.TH

Characterizing Schur-concave commutative copulas as the closure of associative ones

Manuel Úbeda-Flores

AI总结本文研究了关联性copula类在一致度量下的闭包性质，证明了该闭包等同于Schur-凹交换copula类。研究通过凸包的闭包操作，揭示了两类copula之间的包含关系，并建立了它们的等价性，为copula理论提供了新的结构理解。

2604.24037 2026-05-13 cs.LG math.ST stat.TH

A Limit Theory of Foundation Models: A Mathematical Approach to Understanding Emergent Intelligence and Scaling Laws

Jun Shu, Junxiong Jia, Deyu Meng, Zongben Xu

AI总结本文从极限理论的角度出发，提出了一种数学方法以形式化理解基础模型中的涌现智能现象。研究引入了一个依赖于数据量、模型规模和训练步数的性能函数，将智能行为的涌现视为从有限知识向无限知识的转变过程，并通过极限的存在性刻画这一现象。理论分析揭示了涌现智能的产生与极限架构的存在密切相关，并推导出基础模型的扩展定律，为理解智能涌现的机制提供了理论依据。

详情

Comments: There exist some typos and inaccurate expression in this version

英文摘要

Emergent intelligence have played a major role in the modern AI development. While existing studies primarily rely on empirical observations to characterize this phenomenon, a rigorous theoretical framework remains underexplored. This study attempts to develop a mathematical approach to formalize emergent intelligence from the perspective of limit theory. Specifically, we introduce a performance function E(N, P, K), dependent on data size N, model size P and training steps K, to quantify intelligence behavior. We posit that intelligence emerges as a transition from finite to effectively infinite knowledge, and thus recast emergent intelligence as existence of the limit $\lim_{N,P,K \to \infty} \mathcal{E}(N,P,K)$, with emergent abilities corresponding to the limiting behavior. This limit theory helps reveal that emergent intelligence originates from the existence of a parameter-limit architecture (referred to as the limit architecture), and that emergent intelligence rationally corresponds to the learning behavior of this limit system. By introducing tools from nonlinear Lipschitz operator theory, we prove that the necessary and sufficient conditions for existence of the limit architecture. Furthermore, we derive the scaling law of foundation models by leveraging tools of Lipschitz operator and covering number. Theoretical results show that: 1) emergent intelligence is governed by three key factors-training steps, data size and the model architecture, where the properties of basic blocks play a crucial role in constructing foundation models; 2) the critical condition Lip(T)=1 for emergent intelligence provides theoretical support for existing findings. 3) emergent intelligence is determined by an infinite-dimensional system, yet can be effectively realized in practice through a finite-dimensional architecture. Our empirical results corroborate these theoretical findings.

URL PDF HTML ☆

赞 0 踩 0

2604.23260 2026-05-13 stat.ML cs.LG

Explicit integral representations and quantitative bounds for two-layer ReLU networks

Anthony Lee

AI总结本文提出了一种为两层ReLU网络构建显式积分表示的方法，能够为任意多变量多项式提供较为简单的表达形式。通过引入调和延拓和投影的锐化ReLU积分表示，给出了定量误差界，表明函数的$L^{2}(\mathcal{D})$逼近误差仅依赖于其单项式展开的系数和分布$\mathcal{D}$，而与维度或次数无关。此外，文章还建立了该表示与指数核再生核希尔伯特空间之间的联系，并提出了一种具有更优误差界的简单积分表示形式。

2604.16642 2026-05-13 q-bio.QM q-bio.CB q-bio.GN stat.AP

Geometric coherence of single-cell CRISPR perturbations reveals regulatory architecture and predicts cellular stress

Prashant C. Raju

AI总结该研究提出了一种新的几何稳定性度量方法Shesha，用于评估单细胞CRISPR扰动响应的方向一致性，揭示了基因调控结构并预测细胞应激状态。通过分析多个CRISPR数据集，研究发现稳定性与扰动效应大小高度相关，但在某些情况下二者分离，揭示了不同调控因子的生物学特性。该方法为筛选实验中的靶点优先级排序、细胞制造中的表型质量控制以及计算扰动预测的评估提供了新视角。

详情

英文摘要

Genome engineering has achieved remarkable sequence-level precision, yet predicting the transcriptomic state that a cell will occupy after perturbation remains an open problem. Single-cell CRISPR screens measure how far cells move from their unperturbed state, but this effect magnitude ignores a fundamental question: do the cells move together? Two perturbations with identical magnitude can produce qualitatively different outcomes if one drives cells coherently along a shared trajectory while the other scatters them across expression space. We introduce a geometric stability metric, Shesha, that quantifies the directional coherence of single-cell perturbation responses as the mean cosine similarity between individual cell shift vectors and the mean perturbation direction. Across five CRISPR datasets (2,200+ perturbations spanning CRISPRa, CRISPRi, and pooled screens), stability correlates strongly with effect magnitude (Spearman $ρ=0.75-0.97$), with a calibrated cross-dataset correlation of 0.97. Crucially, discordant cases where the two metrics decouple expose regulatory architecture: pleiotropic master regulators such as CEBPA and GATA1 pay a "geometric tax," producing large but incoherent shifts, while lineage-specific factors such as KLF1 produce tightly coordinated responses. After controlling for magnitude, geometric instability is independently associated with elevated chaperone activation (HSPA5/BiP; $ρ_{partial}=-0.34$ and $-0.21$ across datasets), and the high-stability/high-stress quadrant is systematically depleted. The magnitude-stability relationship persists in scGPT foundation model embeddings, confirming it is a property of biological state space rather than linear projection. Perturbation stability provides a complementary axis for hit prioritization in screens, phenotypic quality control in cell manufacturing, and evaluation of in silico perturbation predictions.

URL PDF HTML ☆

赞 0 踩 0

2604.14322 2026-05-13 stat.ML cs.LG

Doubly Outlier-Robust Online Infinite Hidden Markov Model

Horace Yiu, Leandro Sánchez-Betancourt, Álvaro Cartea, Gerardo Duran-Martin

AI总结本文研究了在流数据中存在异常值和模型误设的情况下，如何提高在线无限隐马尔可夫模型（iHMM）的鲁棒性。通过引入后验影响函数（PIF）定义鲁棒性，并给出了在线iHMM具有有界PIF的条件，提出了一种名为BR-iHMM的方法，在适应性和鲁棒性之间引入两个可调参数进行平衡。实验表明，该方法在多个实际数据集上显著降低了一步预测误差，验证了其在预测和可解释在线学习中的有效性。

2604.14083 2026-05-13 physics.comp-ph cond-mat.mtrl-sci stat.CO

Distributional Inverse Homogenization

Arnaud Vadeboncoeur, Mark Girolami, Kaushik Bhattacharya, Andrew M. Stuart

AI总结该研究提出了一种非侵入性的方法，用于从宏观力学性能反推材料微观结构的统计特性，称为分布逆均质化。通过利用大量宏观测量数据，该方法能够在不直接观测微观结构的情况下，学习其全局统计信息，并适用于周期性和随机性均质化场景。研究还展示了如何利用微观结构的自然空间变化来实现分布逆推，并构建了用于加速计算的替代模型，为从宏观测量中学习材料微观结构变异提供了新思路。

2603.22000 2026-05-13 cs.LG stat.ML

CRPS-Optimal Binning for Univariate Conformal Regression

Paolo Toccaceli

AI总结本文提出了一种基于分箱的非参数条件分布估计方法，通过将排序后的协变量观测划分为连续区间，并使用区间内的经验CDF作为预测分布。该方法通过最小化留一法连续排名概率分数（LOO-CRPS）确定最优分箱边界，并采用动态规划以高效求解全局最优分箱数。实验表明，该方法在保持预测区间覆盖率接近名义水平的同时，能显著缩小预测区间，优于多种主流的分层确认回归方法。

2603.14094 2026-05-13 stat.ML cs.LG math.ST stat.CO stat.ME stat.TH

Maximin Robust Bayesian Experimental Design

Hany Abdulsamad, Sahel Iqbal, Christian A. Naesseth, Takuo Matsubara, Adrien Corenflos

AI总结本文研究了贝叶斯实验设计在模型误设下的鲁棒性问题，将其建模为实验者与对抗性自然之间的极大极小博弈，并引入信息论约束以提升鲁棒性。研究提出使用Sibson的α-互信息作为鲁棒目标函数，确定了α-倾斜后验作为鲁棒信念更新方式，并以Rényi散度作为条件信息增益的度量。为减少嵌套蒙特卡洛估计器的偏差和方差，作者采用PAC-Bayes框架搜索随机设计策略，从而得到具有显式有限样本误差控制的鲁棒期望信息增益下界。

2602.02406 2026-05-13 stat.ML cs.LG

Provably Data-driven Multiple Hyper-parameter Tuning with Structured Loss Function

Tung Quoc Le, Anh Tuan Nguyen, Viet Anh Nguyen

AI总结本文研究了数据驱动方法中多维超参数调优的泛化保证问题，针对现有理论仅适用于单维超参数的局限性，提出了首个适用于多维超参数调优的通用框架。该方法结合实代数几何工具，强化了半代数函数类的泛化界分析，获得了更精确且适用性更广的理论保证，并进一步拓展到验证损失下的超参数调优场景，展示了框架在数据驱动加权组lasso和加权融合lasso等新学习问题中的应用潜力。

2601.19836 2026-05-13 stat.ME stat.AP

Personalized Treatment Hierarchies in Bayesian Network Meta-Analysis

Augustine Wigle, Erica E. M. Moodie

AI总结本文研究了在贝叶斯网络Meta分析（NMA）中如何构建个性化治疗层次结构，考虑治疗-协变量交互作用（TCIs）对治疗效果的影响。作者指出，当NMA模型包含TCIs时，应针对特定协变量特征生成治疗排序。文章提出了一种基于协变量配置构建个性化治疗层次的方法，并通过抑郁症治疗的真实研究网络进行了验证，为精准医疗提供了新的分析工具。

2512.11114 2026-05-13 cs.LG cs.AI stat.ML

In-Context Multi-Objective Optimization

Xinyu Zhang, Conor Hassan, Julien Martinelli, Daolang Huang, Samuel Kaski

AI总结在多目标优化问题中，如何平衡多个竞争目标是一个普遍存在的挑战，尤其在药物设计和自主系统等领域。本文提出了一种名为TAMO的全摊销通用策略，利用Transformer架构实现对不同输入和目标维度的多目标黑盒优化，无需针对每个任务重新训练模型。通过强化学习预训练，TAMO能够在单次前向传播中快速生成优化方案，显著提升了计算效率，并在多个基准和实际任务中表现出优异的帕累托前沿质量。

2511.11412 2026-05-13 cs.CL cs.CY stat.OT

MajinBook: An open catalogue of digitally mediated world literature

Antoine Mazières, Thierry Poibeau

AI总结本文介绍了MajinBook，一个开放的数字文献目录，旨在促进对影子图书馆（如Library Genesis和Z-Library）在计算社会科学和文化分析中的应用。通过将这些众包档案的元数据与Goodreads的结构化书目数据进行关联，构建了一个包含539,000多本英文书籍的高精度语料库，并附有首次出版日期、类型和受欢迎程度等信息。该研究采用原生数字EPUB文件以确保机器可读性，同时解决了传统语料库的偏差问题，并提供了法语、德语和西班牙语的辅助数据集。

2509.21711 2026-05-13 stat.ML cs.LG

Multi-modal Bayesian Neural Network Surrogates with Conjugate Last-Layer Estimation

Ian Taylor, Juliane Mueller, Julie Bessac

AI总结本文研究了如何利用多模态数据构建高效的代理模型，以支持对昂贵目标量的建模与分析。作者提出两种基于共轭后验分布的多模态贝叶斯神经网络代理模型，并利用变分推断方法进行参数估计，特别适用于存在部分缺失观测的情况。实验表明，与单模态模型相比，该方法在标量和时序数据上均表现出更高的预测精度和不确定性量化能力。

2509.13548 2026-05-13 cs.SD eess.AS stat.ML

Mixture-of-Experts Framework for Field-of-View Enhanced Signal-Dependent Binauralization of Moving Talkers

Manan Mittal, Thomas Deppisch, Joseph Forrer, Chris Le Sueur, Zamir Ben-Hur, David Lou Alon, Daniel D. E. Wong

AI总结本文提出了一种基于专家混合框架的新型方法，用于增强移动说话人声源的视野感知双耳渲染。该方法通过隐式定位在线融合多个双耳滤波器，实现了对连续运动声源的实时追踪与增强，能够在保持自然双耳线索的同时，突出或抑制特定方向的声音。与传统依赖到达方向估计或基于Ambisonics域的方法不同，该信号依赖框架具有阵列结构无关性，适用于下一代消费音频设备中的空间音频捕获与个性化播放。

2509.09162 2026-05-13 stat.CO math.PR

Divide, Interact, Sample: The Two-System Paradigm

James Chok, Myung Won Lee, Daniel Paulin, Geoffrey M. Vasil

AI总结本文提出了一种统一的“双系统”框架，将均场采样、集合链采样和自适应采样等多种蒙特卡洛方法纳入同一理论体系。该方法通过将粒子集合分为两个相互作用的子系统，以对称交替的方式为彼此提出更新，从而保证了有限集合的不变分布特性。研究还揭示了集合链采样是均场采样的有限近似，并为离散化均场朗之万动力学提供了原理指导，同时展示了其在自适应单链方法中的应用潜力。实验表明，该框架下的采样器在有效样本量和计算效率方面优于传统方法，尤其在高维后验推断任务中表现突出。

详情

英文摘要

Mean-field, ensemble-chain, and adaptive samplers have historically been viewed as distinct approaches to Monte Carlo sampling. In this paper, we present a unifying {two-system} framework that brings all three under one roof. In our approach, an ensemble of particles is split into two interacting subsystems that propose updates for each other in a symmetric, alternating fashion. For the memoryless two-system samplers, this cross-system interaction ensures that the finite ensemble has $ρ^{\otimes 2N}$ as its invariant distribution; for finite-adaptive variants, exact stationarity applies after the adaptation phase is frozen. The two-system construction reveals that ensemble-chain samplers can be interpreted as finite-$N$ approximations to an ideal mean-field sampler; conversely, it provides a principled recipe for discretizing mean-field Langevin dynamics into tractable parallel MCMC algorithms. The framework also connects naturally to adaptive single-chain methods: by replacing particle-based statistics with time-averaged statistics from a single chain, one recovers analogous adaptive dynamics in the long-time limit without requiring a large ensemble. We derive novel two-system versions of both overdamped and underdamped Langevin MCMC samplers within this paradigm. Across synthetic benchmarks and real-world posterior inference tasks, these two-system samplers -- which use a single BCSS-2 integrator step per Metropolis--Hastings accept/reject, in contrast to the long-trajectory style of HMC/NUTS -- exhibit substantial performance gains over No-U-Turn Sampler baselines, achieving higher effective sample sizes per gradient evaluation and markedly higher wall-clock throughput. On higher-dimensional posteriors, the adaptive MAKLA-BCSS-2 methods remain stable and achieve substantially better per-gradient efficiency and wall-clock throughput than the NUTS variants in our benchmark suite.

URL PDF HTML ☆

赞 0 踩 0

2509.00931 2026-05-13 stat.ML cs.LG

Semi-Supervised Bayesian GANs with Log-Signatures for Uncertainty-Aware Credit Card Fraud Detection

David Hirnschall

AI总结本文提出了一种基于半监督贝叶斯生成对抗网络（GAN）的新型深度生成框架，用于信用卡欺诈检测，将问题建模为时间序列分类任务。该方法结合条件GAN进行目标数据增强，引入贝叶斯推理以量化预测不确定性，并利用对数符号（log-signatures）对交易历史进行鲁棒特征编码，同时设计了一种基于Wasserstein距离的损失函数以对齐生成样本与真实未标记样本。实验表明，该方法在BankSim数据集上优于现有基准，在不同标签比例下均表现出优异的统计和领域特定性能。

2508.20614 2026-05-13 stat.ML cs.LG stat.CO

Improving the Accuracy of Amortized Model Comparison with Self-Consistency

Šimon Kucharský, Aayush Mishra, Daniel Habermann, Stefan T. Radev, Paul-Christian Bürkner

AI总结该论文研究了如何提高免训练模型比较（Amortized Bayesian Model Comparison, BMC）的准确性，特别是在模拟模型存在偏差的情况下。作者提出了一种基于自一致性（self-consistency）损失的新方法，通过在未标记的真实数据上训练神经代理模型，以增强模型比较在分布偏移情况下的鲁棒性。实验表明，在开放世界场景下，结合自一致性训练的方法能显著提升BMC估计的准确性，尤其在模型严重偏差时效果更明显。

2508.06138 2026-05-13 stat.ME

Variable selection via knockoffs in missing data settings with categorical predictors

Silvia Bacci, Emanuela Dreassi, Leonardo Grilli, Carla Rampichini

AI总结该研究针对存在大量分类变量和缺失值的大规模评估数据，扩展了敲除法（knockoffs）用于变量选择。研究提出了一种基于多重插补的策略，对缺失值进行填补后，再应用敲除法筛选重要预测变量。该方法在模拟研究和实际教育数据案例中表现出良好的性能，尤其适用于包含无序分类变量和多层结构的复杂数据场景。

2506.19988 2026-05-13 stat.ME

Tipping Point Sensitivity Analysis for Missing Data in Time-to-Event Endpoints: Model-Based and Ad hoc Approaches

Ajmal Oodally, Craig Wang, Zheng Li, Tim Morris, Tobias Mütze, Arunava Chakravartty

AI总结该研究探讨了在时间至事件终点存在缺失数据时，如何评估治疗策略估计量下治疗效果的稳健性。文章对比了基于模型和两种经验性方法的“临界点分析”方法，用于评估独立删失假设的偏离对结论的影响。研究通过实际临床试验的重构案例展示了不同方法的假设及其对结果解释和临床合理性的影响，为处理缺失数据提供了更稳健的分析框架。

2506.10664 2026-05-13 stat.ML cs.LG

Sequential Off-Policy Learning with Logarithmic Smoothing

Maxime Haddouche, Otmane Sakhi

AI总结本文研究了序列离线策略学习问题，即在实际系统中不断更新和重新部署策略时，如何利用所有历史数据进行学习。作者提出了一种结合对数平滑估计与在线PAC-贝叶斯工具的简单算法，并证明在温和条件下对对数平滑方法的改进可以提升性能并加速收敛。该算法在批量设置下与当前最优离线方法相当，而在序列更新场景下则显著优于现有方法，实验验证了其有效性。

2506.02084 2026-05-13 cs.LG stat.ML

Adversarial Causal Tuning for Realistic Time-series Generation

Nikolaos Gkorgkolis, Nikolaos Kougioulis, MingXue Wang, Bora Caglayan, Andrea Tonon, Dario Simionato, Ioannis Tsamardinos

AI总结本文研究如何从真实时间序列数据中生成具有相同观测和干预分布的仿真数据，旨在构建概率因果数字孪生模型。为此，作者提出了一种对抗因果调优（ACT）方法，结合生成对抗网络和自动机器学习的思想，搜索最优的因果模型和判别器，以提升生成数据与真实数据分布的一致性，并通过置换检验控制模型复杂度。实验表明，ACT在多个数据集上表现出优越的拟合能力和泛化性能，为现实时间序列的生成提供了新的有效方法。

详情

Comments: 22 pages, 3 figures

英文摘要

We address the problem of generating simulated, yet realistic, time-series data from a causal model with the same observational and interventional distributions as a given real dataset (probabilistic causal digital twin). While non-causal models (e.g., GANs) also strive to simulate realistic data, causal models are fundamentally more powerful, able to simulate the effect of interventions (what-if scenarios), optimize decisions, perform root-cause analysis, and counterfactual causal reasoning. We introduce the Adversarial Causal Tuning (ACT) methodology, which outputs the optimal causal model that fits the data, along with a quantification of the goodness-of-fit. The returned causal model can then be employed to simulate new data or to perform other causal reasoning tasks. ACT adopts ideas from Generative Adversarial Network training and AutoML to search for optimal causal pipelines and discriminators that detect deviations between the distributions of real and simulated data. It also adapts a permutation testing procedure from established causal tuning methods to penalize models for complexity. Through extensive experiments on real, semi-synthetic, and synthetic datasets, we show that (a) employing multiple optimized discriminators is paramount for selecting the optimal causal models and quantifying goodness-of-fit, (b) ACT selects the optimal causal model in synthetic datasets while avoiding overfitting, generating data indistinguishable from the true data distribution (c) all state-of-the-art generative and causal simulation methods, exhibit room for improvement in reproducing real data distributions; generating realistic temporal data is still an open research challenge.

URL PDF HTML ☆

赞 0 踩 0

2505.20754 2026-05-13 stat.ML cs.LG

Stationary MMD Points

Zonghao Chen, Toni Karvonen, Heishiro Kanagawa, François-Xavier Briol, Chris. J. Oates

AI总结本文研究如何利用有限点集近似目标概率分布的问题，核心方法是通过最小化最大均值差异（MMD）来选择点集。由于MMD目标函数的非凸性，难以直接求得全局最优解，因此作者提出研究MMD的平稳点，这些点可以被准确计算。理论分析表明，对于相关再生核希尔伯特空间中的积分函数，平稳MMD点的数值积分误差收敛速度比MMD本身更快，并基于此提出了MMD梯度流作为计算平稳点的实用方法，同时给出了其收敛性的严格分析与误差界。

2505.17907 2026-05-13 stat.ML cs.LG

Approximating Simple ReLU Networks based on Spectral Decomposition of Fisher Information

Ka Long Keith Ho, Yoshinari Takeishi, Junichi Takeuchi

AI总结本文研究了具有随机隐藏权重的两层ReLU神经网络的费舍尔信息矩阵的性质。研究发现，其特征值分布高度集中在少数几个特征空间中，前三个特征空间的特征值之和占费舍尔信息矩阵迹的97.7%，且与参数数量无关。作者识别出对应这些主要特征空间的函数空间，发现其由阶数不超过2的球谐函数组成，该结果与神经切核的Mercer分解密切相关。

2505.16156 2026-05-13 stat.ML cs.LG

Integral Imprecise Probability Metrics

Siu Lun Chau, Michele Caprio, Krikamol Muandet

AI总结本文提出了一种基于Choquet积分的积分模糊概率度量（IIPM）框架，用于在模糊概率模型下比较概率分布之间的差异，扩展了经典概率度量的应用范围。该方法适用于包括下概率、概率区间和信念函数在内的多种模糊概率模型，能够有效衡量认识不确定性。理论分析表明IIPM满足度量空间的条件，并可用于描述模糊概率的弱收敛形式；实验验证显示其在分类任务中表现优异，尤其在类别数量较多时优于传统方法。

详情

Comments: 48 pages

英文摘要

Quantifying differences between probability distributions is fundamental to statistics and machine learning, primarily for comparing statistical uncertainty. In contrast, epistemic uncertainty -- due to incomplete knowledge -- requires richer representations than those offered by classical probability. Imprecise probability (IP) theory offers such models, capturing ambiguity and partial belief. This has driven growing interest in imprecise probabilistic machine learning (IPML), where inference and decision-making rely on broader uncertainty models -- highlighting the need for metrics beyond classical probability. This work introduces the integral imprecise probability metric framework, a Choquet integral-based generalisation of classical integral probability metrics to the setting of capacities -- a broad class of IP models encompassing many existing ones, including lower probabilities, probability intervals, belief functions, and more. Theoretically, we establish conditions under which IIPM serves as a valid metric and metrises a form of weak convergence of capacities. Practically, IIPM not only enables comparison across different IP models but also supports the quantification of epistemic uncertainty~(EU) within a single IP model. In particular, by comparing an IP model with its conjugate, IIPM gives rise to a new class of epistemic uncertainty measures -- Maximum Mean Imprecision -- which satisfy key axiomatic properties proposed in the uncertainty quantification literature. We validate MMI through selective classification experiments, demonstrating strong empirical performance against established EU measures, and outperforming them when classical methods struggle to scale to a large number of classes. Our work advances both theory and practice in Imprecise Probabilistic Machine Learning, offering a principled framework for comparing and quantifying epistemic uncertainty under imprecision.

URL PDF HTML ☆

赞 0 踩 0

2504.10428 2026-05-13 stat.ML cs.DS cs.LG math.ST stat.TH

Smoothed Analysis of Learning from Positive Samples

Jane H. Lee, Anay Mehrotra, Manolis Zampetakis

AI总结本文研究了仅从正样本中学习二分类问题的平滑分析，旨在解决传统最坏情况下学习能力有限的问题。通过假设真实分布相对于参考分布是平滑的，作者证明了所有VC类在平滑模型下均可学习，并给出了所需的样本数量和高效算法。该成果还带来了未知截断估计、截断检测和多参考分布学习等多个应用领域的改进算法。

详情

Comments: Accepted for presentation at the 58th ACM Symposium on Theory of Computing (STOC), 2026; abstract shortened for arXiv

英文摘要

Binary classification from positive-only samples is a variant of PAC learning where the learner receives i.i.d. positive samples and aims to learn a classifier with low error. Previous work by Natarajan, Gereb-Graus, and Shvaytser characterized learnability and revealed a largely negative picture: almost no interesting classes, including two-dimensional halfspaces, are learnable. This poses a challenge for applications from bioinformatics to ecology, where practitioners rely on heuristics. In this work, we initiate a smoothed analysis of positive-only learning. We assume samples from a reference distribution $D$ such that the true distribution $D^*$ is smooth with respect to it. In stark contrast to the worst-case setting, we show that all VC classes become learnable in the smoothed model, requiring $O(VC/ε^2)$ positive samples for $ε$ classification error. We also give an efficient algorithm for any class admitting $\mathrm{poly}(ε)$-approximation by degree-$k$ polynomials whose range is lower-bounded by a constant with respect to $D$ in L1-norm. It runs in time $\mathrm{poly}(d^k/ε)$, qualitatively matching L1-regression. Our results also imply faster or more general algorithms for: (1) estimation with unknown-truncation, giving the first polynomial-time algorithm for estimating exponential-family parameters from samples truncated to an unknown set approximable by non-negative polynomials in L1 norm, improving on [KTZ FOCS19; LMZ FOCS24], who required strong L2-approximation; (2) truncation detection for broad classes, including non-product distributions, improving on [DLNS STOC24]'s who required product distributions; and (3) learning from a list of reference distributions, where samples come from $O(1)$ distributions, one of which witnesses smoothness of $D^*$, as arises when list-decoding algorithms learn samplers for $D^*$ from corrupted data.

URL PDF HTML ☆

赞 0 踩 0

2503.21576 2026-05-13 math.PR cs.LO math.CT math.ST stat.TH

Empirical Measures and Strong Laws of Large Numbers in Categorical Probability

Tobias Fritz, Tomáš Gonda, Antonio Lorenzin, Paolo Perrone, Areeb Shah Mohammed

AI总结本文研究了范畴概率中经验测度的极限行为，探讨了强定律大数在分类概率中的推广。作者提出了两个公理——排列不变性和经验充分性，用于刻画从无限序列生成经验样本的态射，并指出这类“经验采样态射”存在于准马尔可夫范畴中。通过引入这些态射及相关性质，论文证明了包括德·菲内蒂定理、格列文科-坎特利定理和强大数定律在内的多个抽象定理，并在标准波莱尔空间上给出了具体的构造，实现了这些定理的统一证明。

详情

DOI: 10.46298/lmcs-22(2:13)2026
Journal ref: Logical Methods in Computer Science, Volume 22, Issue 2 (May 5, 2026) lmcs:15844

英文摘要

The Glivenko--Cantelli theorem is a uniform version of the strong law of large numbers. It states that for every IID sequence of random variables, the empirical measure converges to the underlying distribution (in the sense of uniform convergence of the CDF). In this work, we provide tools to study such limits of empirical measures in categorical probability. We propose two axioms, namely permutation invariance and empirical adequacy, that a morphism of type $X^{\mathbb{N}} \to X$ should satisfy to be interpretable as taking an infinite sequence as input and producing a sample from its empirical measure as output. Since not all sequences have a well-defined empirical measure, such \emph{empirical sampling morphisms} live in quasi-Markov categories, which, unlike Markov categories, allow for partial morphisms. Given an empirical sampling morphism and a few other properties, we prove representability as well as abstract versions of the de Finetti theorem, the Glivenko--Cantelli theorem and the strong law of large numbers. We provide several concrete constructions of empirical sampling morphisms as partially defined Markov kernels on standard Borel spaces. Instantiating our abstract results then recovers the standard Glivenko--Cantelli theorem and the strong law of large numbers for random variables with finite first moment. Our work thus provides a joint proof of these two theorems in conjunction with the de Finetti theorem from first principles.

URL PDF HTML ☆

赞 0 踩 0

2503.17606 2026-05-13 stat.ME stat.AP

Combining longitudinal cohort studies to examine cardiovascular risk factor trajectories across the adult lifespan

Zeynab Aghabazaz, Michael J Daniels, Hongyan Ning, Donald M. Lloyd-Jones, Juned Siddique

AI总结该研究提出了一种统计框架，用于整合多个大型纵向心血管队列数据，以研究从成年早期开始的长期心血管健康变化。通过引入贝叶斯分层多元模型，该方法能够同时建模多个风险因素在时间上的变化及不同队列间的关联，提高轨迹估计的精度，并填补缺失的风险因素信息。研究揭示了风险因素轨迹在不同年龄阶段、子群体和队列中的显著差异，突出了心血管预防和监测的关键时期。

2503.15821 2026-05-13 stat.AP stat.OT

Temporal Point Process Modeling of Aggressive Behavior Onset in Psychiatric Inpatient Youths with Autism

Michael Potter, Michael Everett, Ashutosh Singh, Georgios Stratis, Yuna Watanabe, Ahmet Demirkaya, Deniz Erdogmus, Tales Imbiriba, Matthew S. Goodwin

AI总结该研究旨在预测自闭症青少年住院患者攻击行为的发生时间，针对其沟通困难和情绪洞察力差的特点，提出使用时间点过程（特别是自激发霍克斯过程）对攻击行为的发作进行建模。与传统泊松模型相比，该方法能更准确地捕捉攻击行为不规则且成簇出现的特性，并提供了可解释的发作动态分析，为临床决策和预防干预提供了新的支持。

详情

DOI: 10.1038/s41598-026-46996-8
Comments: Accepted to Nature Scientific Reports. Updated results on Hawkes Process with Power Law intensity, and made stricter conditions for sampling evaluation points in the Mean Absolute Percent Error and ROC-AUC calculations. Small notation discrepancies fixed

英文摘要

Aggressive behavior, including aggression towards others and self-injury, occurs in up to 80% of children and adolescents with autism, making it a leading cause of behavioral health referrals and a major driver of healthcare costs. Predicting when autistic youth will exhibit aggression can be challenging due to their communication difficulties. Many are minimally verbal or have poor emotional insight. Recent advances in Machine Learning and wearable biosensing demonstrate the ability to predict aggression within a limited future window (typically one to three minutes) in autistic individuals. However, existing works don't estimate aggression onset probability or the expected number of aggression onsets over longer periods, nor do they provide interpretable insights into onset dynamics. To address these limitations, we apply Temporal Point Processes (TPPs) - particularly self-exciting Hawkes processes - to model the timing of aggressive behavior onsets in psychiatric inpatient autistic youth. We benchmark several TPP models by evaluating their goodness-of-fit and predictive metrics. Our results demonstrate that self-exciting TPPs more accurately captures the irregular and clustered nature of aggression onsets, especially compared to traditional Poisson models. These incipient findings suggest that TPPs can provide interpretable, probabilistic forecasts of aggression onset along a time continuum, supporting future clinical decision-making and preemptive intervention.

URL PDF HTML ☆

赞 0 踩 0

2502.01577 2026-05-13 stat.CO

plmmr: an R package to fit penalized linear mixed models for genome-wide association data with complex correlation structure

Tabitha K. Peter, Anna C. Reisetter, Yujing Lu, Oscar A. Rysavy, Patrick J. Breheny

AI总结本文介绍了一个名为 plmmr 的 R 语言开源包，用于拟合具有复杂相关结构的高维基因组关联数据的惩罚线性混合模型。该包通过估计观测间的相关性并结合最佳线性无偏预测方法，提升预测性能，同时采用内存映射技术，使得在普通计算机上即可处理超出内存规模的基因组数据。研究展示了 plmmr 的方法实现、工作流程及文件后置处理策略，并通过真实 GWAS 数据实例验证了其计算能力。

2501.16931 2026-05-13 cs.LG stat.AP

Beyond Point Estimates: Distributional Uncertainty in Machine Learning Performance Evaluation

Christoph Lehmann, Yahor Paromau

AI总结该论文提出了一种基于分布的机器学习模型评估方法，强调性能指标应被视为随机变量而非固定值，以更全面地反映训练过程中的不确定性。研究通过分析性能指标的经验分布，利用分位数和置信区间进行点估计和区间估计，尤其关注小样本情况下的统计推断可行性。该方法相比传统的基于均值的评估，能够更细致地刻画模型性能的变异性和不确定性，适用于需要可靠性的实际应用场景，并且易于实现和推广。

详情

Comments: 21 pages, 9 figures

英文摘要

Machine learning models are often evaluated using point estimates of performance metrics such as accuracy, F1 score, or mean squared error. Such summaries fail to capture the inherent variability induced by stochastic elements of the training process, including data splitting, initialization, and hyperparameter optimization. This work proposes a distributional perspective on model evaluation by treating performance metrics as random quantities rather than fixed values. Instead of focusing solely on aggregate measures, empirical distributions of performance metrics are analyzed using quantiles and corresponding confidence intervals. The study investigates point and interval estimation of quantiles based on real-data use cases for classification and regression tasks, complemented by simulation studies for validation. Special emphasis is placed on small sample sizes, reflecting practical constraints in machine learning, where repeated training is computationally expensive. The results show that meaningful statistical inference on the underlying performance distribution is feasible even with sample sizes in the range of 10-25, while standard nonparametric confidence interval remain applicable under these conditions. The proposed approach provides a more detailed characterization of variability and uncertainty compared to mean-based evaluation and enables a more differentiated comparison of models. In particular, it supports a risk-oriented interpretation of model performance, which is relevant in applications where reliability is critical. The presented methods are easy to implement and broadly applicable, making them a practical extension to standard performance evaluation procedures in machine learning.

URL PDF HTML ☆

赞 0 踩 0

2412.18594 2026-05-13 cs.LG stat.ML

Local and Mixing-Based Algorithms for Gaussian Graphical Model Selection from Glauber Dynamics

Vignesh Tirukkonda, Anirudh Rayas, Gautam Dasarathy

AI总结该论文研究了在数据来自高斯Glauber动力学的依赖样本下，如何进行高斯图模型结构学习的问题。作者提出了两种互补的方法：一种是基于相关性检验的局部边检测算法，无需等待链的混合即可并行实现；另一种是在满足Dobrushin收缩条件时，通过子采样高斯Gibbs轨迹，使其在总变分距离下接近独立同分布样本，从而可直接使用标准的独立样本图模型学习方法。研究还提供了有限样本下的恢复保证，并分析了观测时间的信息下界。

2412.11875 2026-05-13 stat.ML cs.LG

Bayesian Surrogate Training on Multiple Data Sources: A Hybrid Modeling Strategy

Philipp Reiser, Paul-Christian Bürkner, Anneli Guthke

AI总结该论文提出了一种融合仿真数据和真实测量数据的混合建模策略，用于提升代理模型的训练效果。研究通过两种概率方法，在代理模型训练过程中整合不同数据源的信息，一种是分别训练不同数据源的代理模型并融合预测分布，另一种是训练一个统一的代理模型以同时利用多源数据。这两种方法均采用了一种新颖的异构数据加权策略，能够提升预测精度与覆盖性，并有助于诊断仿真模型中的潜在问题。

2406.12017 2026-05-13 stat.ML cs.LG stat.CO

Sparsity-Constraint Optimization via Splicing Iteration

Jin Zhu, Junxian Zhu, Zezhi Wang, Borui Tang, Hongmei Lin, Xueqin Wang

AI总结本文提出了一种名为SCOPE的新型稀疏性约束优化算法，用于解决信号处理、统计和机器学习中的相关问题。该算法通过拼接迭代操作替代传统梯度步骤，无需调整连续超参数，从而实现了自然收敛。理论分析表明，SCOPE在稀疏度正确设定时能够线性收敛并准确恢复稀疏支撑集，且其理论结果不依赖于受限等距性质条件。实验表明，SCOPE在稀疏二次优化、稀疏分类器学习和稀疏马尔可夫网络恢复等任务中表现出优越的性能。

2110.01729 2026-05-13 stat.ML cs.LG

Stochastic tensor space feature theory with applications to robust machine learning

Julio Enrique Castrillon-Candas, Kaili Shi, Dingning Liu, Sicheng Yang, Xiaoling Zhang, Mark Kon, the Alzheimer's Disease Neuroimaging Initiative

AI总结本文提出了一种基于随机张量空间的多级正交子空间（MOS）Karhunen-Loeve特征理论，用于构建鲁棒的机器学习特征。通过将训练数据视为某个博赫纳空间中的随机场实例，并利用Karhunen-Loeve展开和层次化展开方法，构建多级正交子空间以检测异常信号成分，从而提取更具区分性的特征用于分类。实验表明，该方法在阿尔茨海默病血浆数据集上的分类准确率显著优于梯度提升、随机森林等主流机器学习方法。

1905.10432 2026-05-13 stat.ME

Cross validation approaches for penalized Cox regression

Biyue Dai, Patrick Breheny

AI总结本文研究了在惩罚Cox回归模型中选择调参参数的交叉验证方法。由于Cox模型的偏似然结构，其交叉验证实现较为复杂，作者提出了两种新的交叉验证方法，并与已有方法进行了比较。研究发现，对线性预测子进行交叉验证的方法在性能和数值稳定性方面表现良好，并通过模拟数据和肺癌患者的高维生存数据分析验证了其优势。

1809.05497 2026-05-13 stat.ME

Feature-specific inference for penalized regression using local false discovery rates

Ryan Miller, Patrick Breheny

AI总结本文研究了如何在正则化回归（如Lasso）中对每个变量进行特征特异性推断，以评估其选择的可靠性。作者提出了一种基于局部错误发现率（local false discovery rate）的方法，能够在不同正则化参数λ下计算每个变量的局部错误发现率，从而判断单个特征的显著性或估计模型整体的错误发现率。该方法适用于多种正则化模型，如广义线性模型和Cox模型，并通过两个高维基因数据案例验证了其有效性与实用性。

详情

DOI: 10.1002/sim.9678
Journal ref: Statistics in Medicine (2023), 42: 1412-1429
Comments: 22 pages, 5 figures

英文摘要

Penalized regression methods, most notably the lasso, are a popular approach to analyzing high-dimensional data. An attractive property of the lasso is that it naturally performs variable selection. An important area of concern, however, is the reliability of these variable selections. Motivated by local false discovery rate methodology from the large-scale hypothesis testing literature, we propose a method for calculating a local false discovery rate for each variable under consideration by the lasso model. These rates can be used to assess the reliability of an individual feature, or to estimate the model's overall false discovery rate. The method can be used for all values of $λ$. This is particularly useful for models with a few highly significant features but a high overall Fdr, which are a relatively common occurrence when using cross validation to select $λ$. It is also flexible enough to be applied to many varieties of penalized likelihoods including GLM and Cox models, and a variety of penalties, including MCP and SCAD. We demonstrate the validity of this approach and contrast it with other inferential methods for penalized regression as well as with local false discovery rates for univariate hypothesis tests. Finally, we show the practical utility of our method by applying it to two case studies involving high dimensional genetic data.

URL PDF HTML ☆

赞 0 踩 0

1704.08742 2026-05-13 stat.ML stat.CO

Hybrid safe-strong rules for efficient optimization in lasso-type problems

Yaohui Zeng, Tianbao Yang, Patrick Breheny

AI总结本文针对高维数据下的lasso型问题，提出了一种混合安全-强规则（HSSR），将安全筛选规则与序列强规则结合，以高效剔除不相关的特征，提升优化效率。研究设计了两种具体方法SSR-Dome和SSR-BEDPP，并将其扩展至弹性网和组lasso问题，展示了方法的通用性。实验表明，该混合规则在计算效率上显著优于现有先进方法。

1701.05936 2026-05-13 stat.CO stat.ML

The biglasso Package: A Memory- and Computation-Efficient Solver for Lasso Model Fitting with Big Data in R

Yaohui Zeng, Patrick Breheny

AI总结该研究提出了一款名为 biglasso 的 R 语言软件包，旨在高效处理超大规模数据集上的 Lasso 模型拟合问题。该包通过使用内存映射文件技术实现外存计算，仅在必要时将数据载入内存，从而有效克服了传统 R 包在内存限制下的不足。此外，biglasso 引入了更高效的特征筛选规则，显著提升了计算效率，并在实际测试中表现出比现有工具如 glmnet 更优越的内存和计算效率。

1607.05636 2026-05-13 math.ST stat.TH

Marginal false discovery rates for penalized regression models

Patrick Breheny

AI总结本文研究了在高维数据中使用惩罚回归模型时，如何评估所选特征的可靠性问题。作者提出通过分析边际假发现率（mFDR）来衡量特征选择的可靠性，并给出了该方法的理论分析与模拟验证，表明其在预测变量相关性较弱时具有较高的准确性。研究还通过癌症基因组和心肌基因组数据展示了该方法的实用价值和相较于其他方法的优势。

2605.11806 2026-05-13 math.ST stat.TH

Adaptive Kernel Ridge Regression with Linear Structure: Sharp Oracle Inequalities and Minimax Optimality

Xin Bing, Chao Wang

AI总结本文研究了在核岭回归（KRR）中如何更有效地处理信号中的线性与非线性结构问题。作者提出了一种改进的回归方法，在标准KRR的基础上显式引入线性成分，从而在不增加计算复杂度和调参需求的前提下，提升预测性能。理论分析表明，该方法能够自适应捕捉信号中的线性与非线性结构，达到最小最大最优的预测风险，并通过大量仿真验证了其有效性。

2605.11776 2026-05-13 stat.ME

Probability of Root Cause: A Counterfactual Definition and Its Identification

Zitong Lu, Zhi Geng, Wei Li, Min Xie

AI总结本文提出了一种根因的正式定义，并基于潜在结果框架和中介分析的思想，定义了“根因概率”（PRC），用于量化某个变量集作为给定结果的根因的可能性。该方法在标准假设下建立了PRC的可识别性，并推导了其识别公式，为因果推理中的根因归因提供了新的理论工具和计算方法。

2605.11768 2026-05-13 stat.ME

Using NonTargeted HPV Infections in Studies with Risk Compensation

Lola Etievant

AI总结该研究探讨了在观察性研究中利用非靶向HPV感染评估疫苗效果时可能存在的混淆偏差和风险补偿问题。作者提出，在未测量的性行为同时作为混杂因素和中介变量的情况下，使用非靶向HPV感染可以消除混淆偏差并分离出疫苗对靶向HPV株的直接免疫效应。然而，这种方法可能高估疫苗的实际保护效果，因为其未考虑风险补偿的影响，因此在公共卫生层面可能具有误导性。

2605.11766 2026-05-13 stat.ME

Uncovering Local Heterogeneity: Local Summary Characteristics for Spatial Point Processes with Composition-Valued Marks

Clemens Baldzuhn, Matthias Eckardt

AI总结本文提出了一种用于分析具有组成型标记的空间点过程的局部指标方法（LIMA），以揭示传统全局统计方法所掩盖的局部异质性。通过引入基于对数比变换的LIMA框架，将受约束的组成型标记映射到欧几里得空间，实现了对点特征的局部分解，有效提升了局部标记聚类的检测能力。研究通过模拟和西班牙卡斯蒂利亚-拉曼查地区的经济部门数据验证了方法的有效性，展示了其在识别区域经济聚集和局部“排水”效应方面的优势。

2605.11684 2026-05-13 cs.LG eess.SP math.PR stat.AP

Partial Model Sharing Improves Byzantine Resilience in Federated Conformal Prediction

Ehsan Lari, Reza Arablouei, Stefan Werner

AI总结本文提出了一种基于部分模型共享的拜占庭鲁棒联邦共形预测方法，通过每次仅交换部分模型参数来提升系统安全性与通信效率。该方法在训练和校准阶段均增强了鲁棒性，训练阶段通过部分共享限制攻击面并减少恶意更新的影响，校准阶段则利用直方图特征向量进行异常检测与共形分位数估计。实验表明，该方法在多种拜占庭攻击场景下能实现更接近名义值的预测覆盖率，并显著缩小预测区间，为联邦不确定性量化提供了更高效且鲁棒的解决方案。

2605.11677 2026-05-13 math.ST stat.TH

Bayesian and Empirical Bayesian Bootstrapping

Nils Lid Hjort

AI总结本文提出了一种非参数的“贝叶斯自助法”，用于估计未知分布下的参数及其置信限，通过使用狄利克雷过程先验近似参数的精确后验分布。该方法在大样本下与经典自助法、贝叶斯自助法和Delta方法等在渐近意义上等价，并扩展到了半参数回归、删失数据和更一般的风险率模型中。研究还探讨了经验贝叶斯版本，展示了经典自助法在贝叶斯框架下的合理性。

详情

Comments: 24 pages, no figures. Statistical Research Report, Department of Mathematics, University of Oslo, May 1991; this version arXiv'd May 2026, for broader visibility. Biometrika expressed positive interest, and invited a revision, but somehow the author did not use that opportunity

英文摘要

Let $X_1,\ldots,X_n$ be a random sample from an unknown probability distribution $P$ on the sample space ${\cal X}$, and let $θ=θ(P)$ be a parameter of interest. The present paper proposes a nonparametric `Bayesian bootstrap' method of obtaining Bayes estimates and Bayesian confidence limits for $θ$. It uses a simple simulation technique to numerically approximate the exact posterior distribution of $θ$ using a (non-degenerate) Dirichlet process prior for $P$. Asymptotic arguments are given which justify the use of the Bayesian bootstrap for any smooth functional $θ(P)$. When the prior is fixed and the sample size grows five approaches become first-order equivalent: the exact Bayesian, the Bayesian bootstrap, Rubin's degenerate-prior bootstrap, Efron's bootstrap, and the classical one using delta methods. The Bayesian bootstrap method is also extended to the semiparametric regression case. A separate section treats similar ideas for censored data and for more general hazard rate models, where a connection is made to a `weird bootstrap' proposed by Gill. Finally empirical Bayesian versions of the procedure are discussed, where suitable parameters of the Dirichlet process prior are inferred from data. Our results lend Bayesian support to the classic Efron bootstrap. It is the Bayesian bootstrap under a noninformative reference prior; it is a limit of natural approximations to good Bayes solutions; it is an approximation to a natural empirical Bayesian strategy; and the formally incorrect reading of a bootstrap histogram as a posterior distribution for the parameter isn't so incorrect after all.

URL PDF HTML ☆

赞 0 踩 0

2605.11652 2026-05-13 stat.ML cs.LG math.ST stat.TH

Posterior Contraction Rates for Sparse Kolmogorov-Arnold Networks in Anisotropic Besov Spaces

Jeunghun Oh, Kyeongwon Lee, Jaeyong Lee, Lizhen Lin

AI总结本文研究了稀疏贝叶斯Kolmogorov-Arnold网络（KAN）在各向异性Besov空间中的后验收缩速率，从贝叶斯角度为KAN提供了统计学基础。通过引入尖峰-平缓型稀疏先验，证明稀疏贝叶斯KAN能够达到近似最优的后验收缩率，且该速率依赖于目标函数的内在各向异性光滑性。通过在模型规模参数上设置超先验，后验还能自适应未知的各向异性光滑性并保持相应的近似最优速率。与基于稀疏MLP的模型相比，KAN的深度可保持固定，其复杂度可通过网络宽度、样条网格范围和参数稀疏性进行控制，从而有效避免维数灾难。

2605.11638 2026-05-13 stat.ML cs.LG

Learning U-Statistics with Active Inference

Xiaoning Wang, Yuyang Huo, Liuhua Peng, Changliang Zou

AI总结该论文研究了如何在标签获取成本较高的情况下，利用主动推断方法提高U统计量的估计效率。作者提出了一种基于增强逆概率加权的U统计量框架，结合采样规则和机器学习预测，设计了最优采样策略以最小化方差，并将其扩展到基于U统计量的经验风险最小化中。实验表明，该方法在保证统计推断有效性的同时，显著提升了估计效率。

2605.11614 2026-05-13 stat.AP

Fairness Testing for Algorithmic Pricing

Fei Huang, Giles Hooker

AI总结本文研究了算法定价系统中的公平性测试问题，指出当前常用的基于普通最小二乘法的审计方法在结构上存在缺陷，因为定价算法通常是确定性的，导致残差反映的是近似误差而非抽样变异。作者提出了正确的方差估计方法和交叉协方差公式，并应用于伊利诺伊州34家汽车保险公司的定价数据，发现所有公司都未能通过条件人口统计公平性测试，而传统方法未能识别出这一问题，修正后的方法则揭示了系统性的不公平现象。

2605.11540 2026-05-13 stat.ME

The design of selection experiments using a model-based approach

Brian R Cullis, Alison B Smith, David GD Hughes, David Butler

AI总结本文提出了一种基于线性混合模型的模型化方法，用于设计最优或近似最优的多环境选择实验方案，以提高植物育种中遗传增益的准确性。该方法通过引入额外步骤优化基因型重复次数的分配，从而更高效地利用资源，并在单环境和多环境实验中均展示了其优势。研究通过实际案例和模拟实验验证了该方法的有效性，为选择实验设计提供了灵活且科学的解决方案。

2605.11531 2026-05-13 physics.ao-ph cs.LG stat.AP

Generative climate downscaling enables high-resolution compound risk assessment by preserving multivariate dependencies

Takuro Kutsuna, Noriko N. Ishizaki, Norihiro Oyama, Hiroaki Yoshida

AI总结该研究提出了一种基于扩散模型的多变量生成框架，用于生成高分辨率的气候数据，以提升复合风险评估的准确性。该方法通过结合偏差校正技术，有效恢复了在分辨率提升后退化的变量间相关性，从而更精确地捕捉如干旱、热应激等复合灾害的关联特征。实验表明，该方法在提高单变量和空间精度的同时，显著降低了变量间相关性误差，为区域气候风险评估提供了更可靠的依据。

2605.11515 2026-05-13 stat.ME

Exploiting independence constraints for efficient estimation of bounds on causal effects in the presence of unmeasured confounding

Ting-Hsuan Chang, Caleb H. Miles, Ilya Shpitser, Eric J. Tchetgen Tchetgen, Daniel Malinsky

AI总结该研究旨在在存在未测量混杂因素的情况下，高效估计因果效应的上下界。作者提出了一种基于因果图的条件独立性约束的影响函数投影方法，以提升半参数估计器的效率。该方法适用于多种敏感性分析框架和因果效应估计量，将图结构知识与敏感性分析相结合，通过模拟和实际数据案例验证了其有效性。

2605.11511 2026-05-13 stat.ML cs.LG

Post-ADC Inference: Valid Inference After Active Data Collection

Shuichi Nishino, Tomohiro Shiraishi, Teruyuki Katsuoka, Ichiro Takeuchi

AI总结本文研究了在主动数据收集（ADC）后进行统计推断时的效度问题，指出传统推断方法由于数据采集过程的自适应偏差而可能失效。为此，作者提出了“后ADC推断”框架，通过结合选择性推断方法，有效校正了数据采集过程和后续数据驱动目标构建所带来的偏差，从而提供有效的p值和置信区间。该方法仅需对观测噪声做假设，适用于多种ADC过程，实验表明其在GP-UCB和TPE等方法收集的数据上具有良好的推断效度。

2605.11478 2026-05-13 cs.AI cs.IT math.IT stat.ML

FibQuant: Universal Vector Quantization for Random-Access KV-Cache Compression

Namyoon Lee, Yongjune Kim

AI总结本文提出了一种名为FibQuant的通用固定率向量量化方法，用于随机访问的键值缓存压缩，以解决长上下文推理中的内存和流量瓶颈问题。该方法在保持归一化-旋转-存储接口的同时，将传统的标量编码表替换为与标准化源匹配的共享径向-角向码本，从而保留归一化步骤所创建的几何信息并提升压缩效率。实验表明，FibQuant在保持高注意力相似度的同时实现了更高的压缩比，并在多个模型上表现出优于现有标量量化方法的性能。

详情

Comments: 15 pages

英文摘要

Long-context inference is increasingly a memory-traffic problem. The culprit is the key--value (KV) cache: it grows with context length, batch size, layers, and heads, and it is read at every decoding step. Rotation-based scalar codecs meet this systems constraint by storing a norm, applying a shared random rotation, and quantizing one coordinate at a time. They are universal and random-access, but they discard the geometry created by the normalization step. After a Haar rotation, a block of $k$ consecutive coordinates is not a product source; it is a spherical-Beta source on the unit ball. We introduce \textsc{FibQuant}, a universal fixed-rate vector quantizer that keeps the same normalize--rotate--store interface while replacing scalar tables by a shared radial--angular codebook matched to this canonical source. The codebook combines Beta-quantile radii, Fibonacci\,/\,Roberts--Kronecker quasi-uniform directions, and multi-restart Lloyd--Max refinement. We prove that the resulting vector code strictly improves on its scalar product specialization at matched rate, with a high-rate gain that separates into a cell-shaping factor and a density-matching factor. The same construction gives a dense rate axis, including fractional-bit and sub-one-bit operating points, without calibration or variable-length addresses. On GPT-2 small KV caches, \textsc{FibQuant} traces a memory--fidelity frontier from $5\times$ compression at $0.99$ attention cosine similarity to $34\times$ at $0.95$. End-to-end on TinyLlama-1.1B, it is within $0.10$ perplexity of fp16 at $4\times$ compression and has $3.6\times$ lower perplexity than scalar \textsc{TurboQuant} at $b = 2$ ($8\times$ compression), where scalar random-access quantization begins to fail.

URL PDF HTML ☆

赞 0 踩 0

2605.11476 2026-05-13 math.OC stat.ML

A Barrier-Metric First-Order Method for Linearly Constrained Bilevel Optimization

Tenglong Hong, Paul Grigas

AI总结本文研究具有固定多项式下层可行域的双层优化问题，针对其下层可行域变化导致上层目标非光滑以及现有方法计算代价高的问题，提出了一种基于对数障碍函数的光滑化方法，将约束双层目标转化为可微形式，并设计了一种仅依赖上下层目标梯度的代理梯度算法。通过引入障碍度量下的局部几何分析，算法能够在靠近下层可行域中心的区域内保持良好的收敛性质，并在确定性和随机噪声环境下分别实现了$\widetilde{O}(K^{-2/3})$和$\widetilde{O}(K^{-2/5})$的收敛速率。

2605.11473 2026-05-13 cs.AI cs.LG cs.RO stat.ML

TOPPO: Rethinking PPO for Multi-Task Reinforcement Learning with Critic Balancing

Yuanpeng Li, Gefei Lin, Annie Qu, Rui Miao

AI总结本文研究了多任务强化学习中基于策略梯度的PPO方法的优化问题，指出其在多任务环境下存在价值函数梯度条件不佳的问题，导致部分任务学习停滞。为此，作者提出TOPPO方法，通过引入批评者平衡模块改善梯度条件，提升任务间的学习均衡性。实验表明，TOPPO在参数和环境步数更少的情况下，优于现有的SAC和ARS方法，在多任务基准测试中表现出更强的平均和尾部任务性能，证明了基于策略的方法在适当优化下可以媲美甚至超越基于价值的方法。

2605.11415 2026-05-13 stat.ME

Causal inference with ordinal outcomes: copula-based identification, estimation and sensitivity analysis

Peiyu He, Fan Li

AI总结本文研究了在有序结果下的因果推断问题，提出了一种基于copula的方法，通过参数化copula关联参数将潜在结果的边缘分布联系起来，从而实现对因果效应的识别与估计。该方法在无混杂假设下推导了非参数模型中的高效影响函数，并构建了具有良好稳健性的估计器，能够在部分识别与点估计之间建立可解释的桥梁。此外，文章还提供了对copula结构和无混杂假设的全面敏感性分析，并开发了相应的R包ordinalCI。

2605.11394 2026-05-13 stat.ML cs.AI cs.LG stat.AP stat.ME

Spatial Adapter: Structured Spatial Decomposition and Closed-Form Covariance for Frozen Predictors

Wen-Ting Wang, Wei-Ying Wu, Hao-Yun Huang, Xuan-Chun Wang

AI总结本文提出了一种名为 Spatial Adapter 的参数高效模块，能够在不修改原始预测模型参数的前提下，为任意冻结的初始预测器提供结构化的空间残差表示及其闭式协方差估计。该方法通过可追踪的批量 ADMM 算法，联合学习空间正则化的正交基与样本级得分，从而在残差场中提取出具有平滑性、稀疏性和正交性的低秩空间结构。该方法不仅支持对未观测位置进行克里金插值式的空间预测，还可用于不确定性量化，实验表明其在多种数据集上均能有效恢复残差空间结构，且参数量远低于传统方法。

2605.11373 2026-05-13 cs.AI cs.LG stat.ML

Causal Algorithmic Recourse: Foundations and Methods

Drago Plecko, Collin Wang, Elias Bareinboim

AI总结本文研究如何在人工智能决策系统中为个体提供可靠的逆向决策建议，即算法性补救（algorithmic recourse）问题。作者提出了一种因果框架，将补救过程建模为干预前后的结果过程，考虑了潜在变量的重新采样和部分稳定性。文章引入了后补救稳定性条件，并开发了基于copula的算法以从观测数据中推断补救效果，同时提出了在数据不满足copula模型时的分布无关学习方法，为算法性补救提供了更稳健和实用的解决方案。

2605.11372 2026-05-13 math.ST stat.TH

The Geometry of Spectral Fluctuations: On Near-Optimal Conditions for Universal Gaussian CLTs, with Statistical Applications

Yanqing Yin, Wang Zhou

AI总结本文研究了高维样本协方差矩阵的线性谱统计量，在经验谱分布仍遵循经典样本协方差律但波动理论非经典的场景下，提出了一种分解方法，将中心化二次型协方差分解为通用高斯部分和模型相关的四阶修正项，构建了名为GHOST的抽象框架，用于处理具有结构化四阶效应的通用高斯中心极限定理。通过该框架，作者证明了线性谱统计量的高斯中心极限定理，并给出了由双线性四阶核确定的显式均值和协方差修正。研究还展示了该条件在广泛适用性上的必要性，并开发了一个块混合径向模型，使修正项具体化，揭示了波动尺度可能发生变化的相变现象，最后将该方法应用于球形检验，提出了对John检验的可行数据驱动修正。

2605.11371 2026-05-13 stat.AP

Statistical evaluation of measurement precision in linear dose-response relationships via interlaboratory studies

Jun-ichi Takeshita, Yuto Ikeuchi, Tomomichi Suzuki

AI总结本文提出了一种基于多实验室研究评估测量方法统计精度的框架，适用于以回归线形式总结剂量-反应关系的测量结果。通过引入ISO 5725中定义的重复性及实验室间方差等精度评价指标，研究建立了适用于实验室间基线和剂量-反应斜率均可能不同的线性混合效应模型的分析方法，并给出了在平衡设计下总平方和的精确分解、精度方差的方差分析估计及相应的F检验。该方法能够区分实验室间差异是由基线偏移还是灵敏度差异引起，为剂量-反应关系的测量精度评估提供了更全面的统计工具。

2605.11362 2026-05-13 cs.LG cs.AI stat.AP stat.ML

Causal Fairness for Survival Analysis

Drago Plecko

AI总结在数据驱动时代，机器学习和人工智能被广泛用于医疗、就业等高风险领域，引发了对系统公平性问题的关注。现有公平机器学习研究多聚焦于静态场景，而对生存分析等时间序列场景中的公平性研究仍较为缺乏。本文提出一种因果框架，用于生存分析中的公平性研究，能够将生存差异分解为直接、间接和虚假路径的贡献，从而提供对差异成因和演变过程的可解释分析，并应用于分析重症监护病房中种族差异随时间的变化。

2605.11324 2026-05-13 cs.LG stat.ML

$\varepsilon$-Good Action Identification in Fixed-Budget Monte Carlo Tree Search

Yinan Li, Tuan Nguyen, Kwang-Sung Jun

AI总结本文研究了在固定预算下深度为2的max-min树中识别ε-优质动作的问题，这是蒙特卡洛树搜索的一个重要特例。作者提出了一种无需输入ε值的算法，能够针对每个有意义的ε值实现实例相关的误差界，其误识别概率以指数形式衰减。此外，作者还分析了该问题与标准K臂老虎机在难度结构上的差异，并提供了相应的下界结果，这是首个针对max-min动作识别的固定预算算法保证。

2605.11311 2026-05-13 cs.LG cs.CV stat.CO stat.ML

Couple to Control: Joint Initial Noise Design in Diffusion Models

Jing Jia, Liyue Shen, Guanyang Wang

AI总结该论文研究了扩散模型中初始噪声设计的问题，指出传统方法中假设初始噪声相互独立可能限制了生成效果。作者提出通过设计噪声之间的依赖结构，保持单个噪声仍为标准高斯分布，从而在不改变模型输入分布的前提下，提升多样本生成的多样性与质量。实验表明，该方法在多个主流扩散模型中有效提升了生成多样性，同时保持了图像质量和提示对齐，并在部分指标上优于现有优化方法。

2605.11306 2026-05-13 stat.ME

Unified Operator Framework for Functional and Multivariate Regression

Mark Carpenter, Nicholas Gaubatz

AI总结本文提出了一种统一的算子框架，用于处理标量、多元和函数型回归问题，该框架基于一般测度定义的积分算子。通过选择不同的输入和输出测度，经典回归模型如标量对函数、函数对标量、函数对函数以及多元多元回归均可视为该框架下的特例。研究证明了离散表示与连续算子之间的关系，并指出离散测度下的估计可归约为标准多元回归，具有良好的统计性质。该框架有助于理解函数型回归与多元回归之间的联系，并解释了为何在某些线性场景中向量化多元回归方法能与函数型方法竞争。

2605.11284 2026-05-13 stat.ME cs.AI cs.LG

Rethinking external validation for the target population: Capturing patient-level similarity with a generative model

Mohammad Azizmalayeri, Ameen Abu-Hanna, Saskia Houterman, Marije M. Vis, Giovanni Cinà

AI总结该研究旨在解决外部验证中因目标人群与模型开发人群差异而导致的模型性能解释困难问题，提出了一种基于生成模型的框架，用于量化每个外部患者与开发数据的相似性，并在不同相似度子群中评估模型性能。通过使用自编码器等生成模型，该方法无需共享原始开发数据即可实现更灵活的相似性估计，提升了外部验证的可解释性与实用性。实验表明，该框架能够揭示传统外部验证所掩盖的模型性能差异，为模型的可迁移性评估提供了更科学的依据。

详情

英文摘要

Background: External validation is essential for assessing the transportability of predictive models. However, its interpretation is often confounded by differences between external and development populations. This study introduces a framework to distinguish model deficiencies from case-mix effects. Method: We propose a framework that quantifies each external patient's similarity to the development data and measures performance in subgroups with varying levels of alignment to the development distribution. We use generative models, specifically autoencoders, to estimate similarity, offering a more flexible alternative to traditional linear approaches and enabling validation without sharing the original development data. The utility of autoencoder-based similarity measure is demonstrated using synthetic data, and the framework's application is illustrated using data from the Netherlands Heart Registration (NHR) to predict mortality after transcatheter aortic valve implantation. Results: Our framework revealed substantial variation in model performance across similarity-defined subgroups, differences that remain hidden under conventional external validation yet can meaningfully alter conclusions. In several settings, conventional external validation suggested poor overall performance. However, after accounting for differences in patient characteristics, for some sub-groups, the model performance was consistent with internal validation results. Conversely, apparently acceptable overall performance could mask clinically relevant performance deficits in specific subgroups. Conclusion: The proposed framework enhances the interpretability of external validation by linking model performance to population alignment with the development data. This provides a more principled basis for deciding whether a model is transportable and to which patients it can be safely applied.

URL PDF HTML ☆

赞 0 踩 0

2605.11282 2026-05-13 math.ST stat.AP stat.TH

A Data-Consistent Approach to Ensemble Filtering

Rylan Spence, Troy Butler, Clint Dawson

AI总结本文提出了一种数据一致性的集合滤波方法——QPCA-EnDCF，用于处理混沌系统在部分观测条件下的状态估计问题。该方法通过在观测空间中进行谱正则化更新，替代传统的随机观测扰动，从而提升滤波精度与概率校准能力。理论分析表明，与传统随机集合卡尔曼滤波相比，QPCA-EnDCF 在保持相似计算复杂度的同时，能够有效降低估计方差，并在数值实验中表现出更优的误差扩散与技能匹配特性。

详情

Comments: 42 pages, 5 figures

英文摘要

Ensemble filtering of chaotic, partially observed systems is often performed with ensembles far smaller than the state dimension resulting in empirical covariances that are low rank. Subsequently, stochastic observation perturbations can degrade both accuracy and probabilistic calibration. We develop a data-consistent perspective on ensemble filtering and introduce the Quantity-of-Interest Principal Component Analysis Ensemble Data Consistent Filter (QPCA-EnDCF), which is a deterministic method that replaces perturbed observations with a spectrally regularized update in observation space. The method whitens forecast--observation residuals, computes an empirical eigendecomposition of the residual covariance, and restricts the correction to a rank-$κ$ subspace before mapping the increment back to state space through an empirical gain. We establish a theoretical framework that separates population and finite-ensemble objects and yields a bias--variance decomposition for the analysis mean. The analysis shows that stochastic EnKF variants incur an irreducible $\mathcal{O}(1/N)$ variance contribution from observation perturbations, whereas QPCA-EnDCF replaces this term with projector-estimation variability that is also $\mathcal{O}(1/N)$ but depends on the retained rank and the cutoff gap through eigenspace stability. Numerical experiments on the Lorenz--96 system in strongly undersampled regimes demonstrate that QPCA-EnDCF substantially improves spread--skill behavior, temporal tracking between spread and error, and rank-histogram reliability relative to sequential and four-dimensional stochastic EnKF. Under the baseline configuration, these calibration gains are accompanied by lower RMSE.

URL PDF HTML ☆

赞 0 踩 0

2605.11239 2026-05-13 cs.LG stat.ML

Extending Kernel Trick to Influence Functions

Zhenhuan Sun, Shahrokh Valaee

AI总结本文提出了一种影响函数的对偶表示方法，其计算复杂度随数据集规模增长而非模型规模，为大规模模型的影响分析提供了更高效的替代方案。该方法适用于可线性化的模型，通过构造一个与模型输出维度和数据集规模乘积相关的矩阵实现，能够在参数空间难以计算原始影响函数时有效估计参数、模型输出和损失的变化。这一成果在模型规模远大于数据集规模时具有显著优势。

2605.11238 2026-05-13 math.ST stat.CO stat.TH

Efficient Robust Constrained Signal Detection via Kolmogorov Width Approximations

Yikun Li, Matey Neykov

AI总结本文研究了在强ε污染的高斯序列模型下，具有先验约束集K的信号的鲁棒最小最大检测问题。针对传统最优检测方法需要计算复杂约束集的Kolmogorov宽度这一计算难题，作者提出了一种多项式时间检测框架，适用于多种结构约束集。通过半定规划松弛和改进的椭球法结合近似次梯度 oracle，有效逼近Kolmogorov宽度，实现了与现有上界仅相差对数因子的鲁棒检测边界，为结构化信号提供了无需先验几何复杂度知识的高效检测方案。

2605.11226 2026-05-13 math.AT math.ST stat.ML stat.TH

A Stable Distance Persistence Homology for Dynamic Bayesian Network Clustering

Will Bales, Carmen Rovi

AI总结该研究提出了一种用于动态贝叶斯网络聚类的稳定持久同调方法，旨在捕捉变量间依赖关系随时间演变的全局模式。通过构建动态贝叶斯图（DBG），并利用持久同调生成条形码，能够记录强依赖变量组的连接与消散过程。该方法具有稳定性，即网络条件概率表的小扰动只会引起条形码的微小变化，从而为动态贝叶斯网络的依赖结构演化提供了鲁棒的摘要描述。

2605.11220 2026-05-13 stat.AP

Prediction Markets Underperform Simple Baselines For Infectious Disease Forecasting

Carson Dudley, Reiden Magdaleno

AI总结该研究评估了预测市场（如Polymarket、Kalshi）在传染病预测中的表现，发现其在流感和麻疹病例预测中均未能超越简单统计基准。尽管预测市场具有实时响应新闻和非结构化信号的优势，但其参与者缺乏流行病学专业知识，且存在概率分布不合理和交易量低等问题，导致市场预测效率低下。研究结果表明，当前预测市场在传染病动态预测中不可靠，也不适合作为现有预测系统的补充特征。

2605.11191 2026-05-13 stat.ML cs.LG

Adaptive Policy Learning Under Unknown Network Interference

Aidan Gleich, Eric Laber, Alexander Volfovsky

AI总结本文研究了在未知网络干扰环境下进行自适应策略学习的问题，旨在同时学习网络中个体间的干扰动态并据此优化个体层面的干预分配以最大化累积收益。作者提出了一种基于吉布斯采样的汤普森采样算法，能够联合学习干扰网络并自适应优化干预策略，同时提供干扰网络的估计以支持后续因果分析。实验表明，该方法在多种场景下均能实现显著的累积收益提升，并具有良好的理论保证和实际效果。

2605.11181 2026-05-13 cs.LG cs.AI cs.NA math.NA math.OC stat.ML

Muon is Not That Special: Random or Inverted Spectra Work Just as Well

Zakhar Shumaylov, Nathaël Da Costa, Peter Zaika, Bálint Mucsányi, Alex Massucco, Yoav Gelberg, Carola-Bibiane Schönlieb, Yarin Gal, Philipp Hennig

AI总结本文挑战了Muon优化器在非欧几里得优化中依赖几何结构的主流观点，提出精确的几何结构并非影响优化性能的关键因素。研究引入了基于Schatten（准）范数的Freon优化器，其性能在GPT-2等任务中优于Muon，并揭示了最佳参数位于准范数区域，无法用传统LMO理论解释。进一步提出Kaon优化器，通过用随机噪声替代奇异值仍能匹配Muon性能，证明严格的几何结构并非必要。研究指出，优化性能主要由对齐度和下降潜力等局部量决定，而非全局几何结构。

2605.11179 2026-05-13 stat.ML cs.LG

Interpretable Machine Learning for Spatial Science: A Lie-Algebraic Kernel for Rotationally Anisotropic Gaussian Processes

Kane Warrior, Dalia Chakrabarty

AI总结许多三维空间场具有旋转各向异性，即变化方向不与坐标轴对齐。本文提出了一种可解释的旋转各向异性高斯过程核函数，通过三个主尺度和一个显式的SO(3)旋转参数化三维对称正定协方差度量，从而更直观地描述各向异性方向和尺度。该方法利用李代数指数映射将旋转表示为无约束的欧几里得坐标，同时保证协方差矩阵的有效性，并在合成数据和实际材料密度数据上验证了其优越性和可解释性。

2605.11120 2026-05-13 cs.IT cs.SY eess.SP eess.SY math.IT stat.ML

Sensor Design for Accuracy-Bounded Estimation via Maximum-Entropy Likelihood Synthesis

Raktim Bhattacharya

AI总结本文研究了在传感器模型不确定或不可用的情况下，如何为大规模时空系统设计满足精度要求的感知架构。核心方法是通过最大熵似然合成，在给定误差预算的前提下，构造满足精度约束且信息注入最少的测量似然函数。该方法适用于多种度量标准，提出了对应的离散粒子级优化问题及求解算法，并通过实验验证了其在不同场景下对精度约束的有效控制能力。

2605.11059 2026-05-13 stat.ML cs.LG math.PR

Uniform Scaling Limits in AdamW-Trained Transformers

William Gibson, Christoph Reisinger

AI总结本文研究了使用AdamW优化器训练的深层Transformer模型的极限行为，通过将隐藏状态的动力学建模为由注意力机制耦合的相互作用粒子系统。在适当缩放注意力头数的前提下，证明了隐藏状态和反向传播变量的联合动力学在$L^2$意义下，以$\mathcal O(L^{-1}+L^{-1/3}H^{-1/2})$的速率，统一收敛于一个前向-反向常微分方程组的解。该结果为理解Transformer在深度增加时的行为提供了理论支持，并给出了离散模型与连续模型之间差异的统一界，且该界不依赖于初始条件的紧集和token数量。

2605.11044 2026-05-13 stat.ME stat.AP

Rethinking Factor Loading Thresholds: A Case for a Strict λ >= .70 Rule

M. Murat Yaslioglu, Duygu Toplu Yaslioglu

AI总结本文质疑了在验证性因子分析中接受标准化因子载荷低至0.50的常见做法，提出应采用更严格的项目层面阈值：仅保留因子载荷λ≥0.70（即λ²≥0.50）的指标。作者认为，载荷低于0.70的指标包含的误差方差超过解释方差，会损害构念效度和因子解的稳定性。文章通过理论分析、模拟证据和结构方程模型的影响，论证了弱载荷会降低测量质量、因子得分确定性和模型拟合度，采用λ≥0.70的最低标准有助于提升潜变量模型的严谨性和可解释性。

2605.10949 2026-05-13 stat.AP cs.AI cs.CV cs.LG

AlphaEarth Satellite Embeddings for Modelling Climate Sensitive Diseases Towards Global Health Resilience

Usman Nazir, I-Han Cheng, Sara Khalid

AI总结该研究探讨了利用卫星遥感数据（AlphaEarth嵌入）预测气候敏感性疾病的潜力，以提升全球健康韧性。研究聚焦于疟疾、儿童急性呼吸道感染和发育迟缓等疾病，评估了64维卫星嵌入在不同国家和地区的预测性能。结果显示，卫星数据在疟疾和呼吸道感染预测中具有显著的预测能力，但在发育迟缓预测中受固定效应影响较大，需进一步数据支持。这一工作为利用遥感技术辅助公共卫生监测提供了新的方法和实证依据。

2605.10395 2026-05-13 stat.ML cond-mat.dis-nn cond-mat.stat-mech cs.IT cs.LG math.IT

Sharp feature-learning transitions and Bayes-optimal neural scaling laws in extensive-width networks

Minh-Toan Nguyen, Jean Barbier

AI总结本文研究了在高维环境下，从噪声查询中学习具有层次特征的单隐藏层教师网络的信息论极限，并探讨了知识向更小的学生模型迁移的问题。通过引入一种启发式的留一法解耦分析，并结合数值验证，作者推导出贝叶斯最优泛化误差和特征重叠的精确刻画，揭示了特征可学习性存在一系列尖锐的相变点。研究还提出了“有效宽度”这一概念，统一了两个不同的缩放阶段，并展示了学生模型在接近有效宽度时可通过Adam优化器实现最优的泛化误差缩放规律。

2605.09523 2026-05-13 cs.LG cs.CE cs.NA math.NA physics.comp-ph stat.ML

HS-FNO: History-Space Fourier Neural Operator for Non-Markovian Partial Differential Equations

Lennon J. Shikhman

AI总结本文提出了一种名为HS-FNO的历史空间傅里叶神经算子，用于求解非马尔可夫型偏微分方程。该方法通过引入扩展状态$u_t(θ,x)$，将历史信息纳入模型，从而更准确地捕捉系统动态。HS-FNO通过将历史状态更新分解为预测新时间片和精确移动已知部分，减少了学习维度并提升了预测精度。实验表明，HS-FNO在多个基准问题上优于现有方法，尤其在自回归预测中表现出显著的误差降低。

详情

Comments: 15 pages, 4 figures, 1 table. Code at https://github.com/lennonshikhman/hs-fno/

英文摘要

Neural operators provide fast surrogate models for time-dependent partial differential equations, but their standard autoregressive use usually assumes that the instantaneous field $u(t,\cdot)$ is a complete state. This assumption fails for delay equations, distributed-memory systems, and other non-Markovian dynamics: two trajectories may agree at time $t$ and nevertheless have different futures because their histories differ. We introduce the History-Space Fourier Neural Operator (HS-FNO), a neural operator for delay and memory-driven PDEs formulated on the lifted state $u_t(θ,x)=u(t+θ,x)$, $θ\in[-τ,0]$. The key computational step is to decompose one history-state update into a learned predictor for the newly exposed future slice and an exact shift-append transport for the portion of the history window already known from the previous state. This avoids learning deterministic history coordinates, reduces the learned output dimension, and enforces the natural discrete history update. We test HS-FNO on five benchmark families covering delayed reaction--diffusion, spatial epidemiology, nonlocal neural-field dynamics, delayed waves, and distributed-memory closures. Across ten random seeds, HS-FNO attains the lowest aggregate one-step, history-space, and rollout errors among the principal baselines. The largest gain occurs in autoregressive prediction, where aggregate rollout error decreases from $0.241$, $0.188$, and $0.185$ for current-state, lag-stack, and unconstrained history-to-history operators, respectively, to $0.094$. The same model uses fewer parameters than unconstrained history prediction. These results indicate that enforcing the discrete shift structure of history-state evolution is an effective inductive bias for non-Markovian PDE surrogate modeling.

URL PDF HTML ☆

赞 0 踩 0

2605.06873 2026-05-13 stat.ML cs.LG cs.NA math.NA

One Operator for Many Densities: Amortized Approximation of Conditioning by Neural Operators

Panos Tsimpos, Edoardo Calvello, Ayoub Belhadji, Nicholas H. Nelsen

AI总结本文研究了概率条件化问题，即在已知随机变量 $Y$ 的条件下，确定随机变量 $X$ 的分布。传统方法是针对固定联合分布学习条件分布，而本文提出了一种新思路：通过识别一个统一的算子，将任意联合密度映射到对应的条件密度，从而实现对联合-条件对的 amortization。作者证明条件算子可以用神经算子以任意精度逼近，并通过高斯混合模型的实验验证了该框架的有效性，为通用的条件化方法提供了理论基础。

2605.05798 2026-05-13 stat.ME

Dual-Homotopy Framework for Constrained EM Algorithm

Jisoo Choi, Hee-Seok Oh

AI总结本文提出了一种适用于一般约束估计问题的新约束EM算法，核心方法是基于“对偶同伦框架”，该框架结合了确定性退火EM与基于障碍函数的优化方法，从而在参数约束下实现更稳定的估计。进一步引入了一种自适应约束EM算法，能够在不同分布形式和约束结构下保持似然单调性。通过仿真研究和实际数据分析验证，该算法在稳定性与准确性方面优于现有方法，包括标准EM算法。

2605.04946 2026-05-13 cs.LG stat.ML

Training-Time Batch Normalization Reshapes Local Partition Geometry in Piecewise-Affine Networks

Xuan Qi, Yi Wei, Fanqi Yu, Furao Shen, Vittorio Murino, Cigdem Beyan

AI总结本文研究了训练过程中批量归一化（BN）在分段仿射网络中的几何影响，揭示了BN如何通过调整神经元的参考超平面，改变局部区域的划分结构。研究发现，BN在每个神经元上定义了一个以小批量中心为基准的超平面，其切换超平面的偏移量以标准化坐标表示，与原始偏置无关。这一机制提高了局部划分的精细程度，并在深度网络中具有局部传递性，为理解BN在训练阶段的函数级几何作用提供了新视角。

2605.03911 2026-05-13 stat.ME

The Multiplicative Quasi-Instrumental Variable Model

Jiewen Liu, Chan Park, David Richardson, Eric J. Tchetgen Tchetgen

AI总结本文提出了一种乘法准工具变量（MQIV）模型，用于在存在未测量混杂因素的情况下进行因果推断，该模型利用可能不完全外生的准工具变量。该模型允许准工具变量对结果有非通过处理的直接影响，从而突破了传统工具变量的排除限制条件。研究在乘法处理模型下建立了对处理组平均处理效应的非参数识别，并提出了一种修正偏差的改进沃尔德比估计量，以及一类多重稳健且半参数高效的估计方法，适用于处理效应异质性和排除限制违反的情形。

2605.02453 2026-05-13 gr-qc astro-ph.HE cs.LG physics.data-an stat.ML

Testing General Relativity Through Gravitational Wave Classification: A Convolutional Neural Network Framework

Lavinia Heisenberg, Shayan Hemmatyar, Hector Villarrubia-Rojo

AI总结本文提出了一种基于卷积神经网络（CNN）的机器学习框架，用于通过引力波信号检验广义相对论（GR）。研究利用GWTC目录中173个双黑洞并合事件的源参数生成模拟的GR和超越GR（BGR）波形，并引入响应函数形式化方法以量化观测对GR修正的响应。实验表明，使用响应函数作为CNN输入可使分类灵敏度提高约33倍，且在所有变形尺度下，CNN均优于基于单一特征的分类方法，展示了该框架在检测引力波信号中GR偏差方面的优越性和有效性。

2604.24196 2026-05-13 stat.ML cs.LG

Identifiability and Stability of Generative Drifting with Companion-Elliptic Kernel Families

HakGeun Lee, Hyonho Chun

AI总结本文研究了生成模型中漂移场的可识别性与稳定性问题，重点探讨零漂移平衡是否能唯一确定目标分布，以及近似零漂移是否意味着分布的弱收敛。为解决原漂移模型中拉普拉斯核的局限性，作者引入了伴椭圆核族，该核族包含拉普拉斯核，并精确涵盖光滑参数为 $ν\ge 1/2$ 的高斯核与马特恩核。研究证明，在该核族下任意实数空间上的博雷尔概率测度均可由漂移场唯一识别，并揭示了漂移场收敛与弱收敛之间的关系，指出即使在缺乏紧致性的情况下，也可通过单个 $C_0$-可观测量恢复弱收敛。

2604.16684 2026-05-13 cs.LG stat.ML

DARLING: Detection Augmented Reinforcement Learning with Non-Stationary Guarantees

Argyrios Gerogiannis, Yu-Han Huang, Venugopal V. Veeravalli

AI总结本文研究了在非平稳有限时间回合马尔可夫决策过程（MDPs）中的无模型强化学习问题，且不预先知道非平稳性。针对分段平稳（PS）环境，即奖励和转移动态在未知时间点发生变化的情况，提出了一个名为DARLING的模块化方法，适用于表格和线性MDPs，无需提前知道变化时间点。DARLING在理论分析中改进了已知的最佳动态遗憾界，并在多种非平稳基准测试中表现出优于现有方法的性能。

2603.24913 2026-05-13 math.OC math.DG math.DS math.PR math.ST stat.TH

Geometry-Aware Langevin Sampling for Matrix-Valued Graph Learning

Papri Dey

AI总结本文研究了在正定矩阵值参数空间中进行贝叶斯推断的问题，特别是在图学习和协方差估计中的应用。为了解决欧几里得空间采样方法在锥体边界附近混合效率低的问题，作者提出了一种基于几何结构的Metropolis调整朗之万算法——\ConeMALA，该方法利用模型的对数行列式结构诱导采样几何。实验表明，该方法在采样效率和稳定性方面优于传统方法，为正定约束下的图学习提供了有效的不确定性量化方案。

2603.23374 2026-05-13 stat.ME stat.ML

Shape-Adaptive Conditional Calibration for Conformal Prediction via Minimax Optimization

Yajie Bao, Chuchen Zhang, Zhaojun Wang, Haojie Ren, Changliang Zou

AI总结在条件预测校准中，实现有效的条件覆盖率是一个具有挑战性的问题，因为有限样本下满足逐点约束在理论上存在困难。本文提出了一种基于极大极小优化的预测推断框架MOPI，通过在校准阶段优化一个灵活的集合映射类，而非仅校准固定子水平集，从而提升了形状适应性，并保持了与最小化平均平方覆盖率误差的理论联系。该方法在理论上提供了非渐近的Oracle不等式，并证明了覆盖率误差的收敛速率在常规条件下达到最优阶数，同时支持在测试时不可观测但校准时可得的敏感属性上的有效条件推理。实验结果表明，MOPI在复杂非标准条件分布下生成的预测集比现有方法更高效。

2602.13004 2026-05-13 cs.LG stat.ML

Towards Uncertainty-Aware Federated Granger Causal Learning

Ayush Mohanty, Nazal Mohamed, Nagi Gebraeel

AI总结该研究旨在解决联邦格兰杰因果学习中缺乏不确定性感知的问题，提出了一种能够量化跨客户端因果关系不确定性的方法。通过分析联邦学习框架中不确定性传播的机制，作者推导了客户端与服务器之间协方差的闭式递推公式，并建立了基于谱半径的收敛条件，从而获得了稳态方差的解析表达式。实验表明，该方法能有效区分真实的跨客户端因果关系与虚假连接，优于现有联邦因果结构学习方法。

2602.04611 2026-05-13 stat.ML cs.LG

Targeted Synthetic Control Method

Yuxin Wang, Dennis Frauen, Emil Javurek, Konstantin Hess, Yuchen Ma, Stefan Feuerriegel

AI总结本文提出了一种新的合成控制方法——目标合成控制（TSC），用于更准确地估计单一处理单元的因果效应。TSC 采用两阶段估计策略，通过目标更新优化初始权重，提高估计稳定性，并确保合成控制结果为控制单元观测值的凸组合，从而增强解释性。该方法兼容任意机器学习模型，有效避免了现有方法中可能出现的反事实估计无界问题，在合成与真实数据实验中均表现出更优的估计精度。

2602.04042 2026-05-13 cs.LG stat.ME stat.ML

Partition Tree: Conditional Density Estimation over General Outcome Spaces

Felipe Angelim, Alessandro Leite

AI总结本文提出了一种名为 Partition Tree 的新型树状框架，用于在一般结果空间上进行条件密度估计，能够统一处理连续和分类变量。该方法通过数据自适应划分将条件分布建模为分段常数密度，并直接最小化条件负对数似然来学习树结构，提供了一种无需参数假设的可扩展非参数替代方案。此外，文章还引入了 Partition Forest，通过平均条件密度实现对 Partition Tree 的袋外扩展，并在实验中展示了其在概率预测方面的优越性和与最新方法的竞争力。

2602.01682 2026-05-13 cs.LG cs.DS stat.ML

Finite and Corruption-Robust Regret Bounds in Online Inverse Linear Optimization under M-Convex Action Sets

Taihei Oki, Shinsaku Sakaue

AI总结本文研究在线逆线性优化问题，即根据随时间变化的可行集上观测到的最优动作，推断隐藏的目标向量，并推荐符合该目标的行动。研究关注在M-凸可行集（如拟阵）下，能否获得与维度多项式相关的有限悔度界。作者通过结合M-凸集最优解的结构特性与几何体积论证，证明了悔度界为 $O(d\log d)$，部分解决了该问题的开放性疑问，并进一步拓展到对抗性噪声场景，给出了无需先验知识的悔度界 $O((C+1)d\log d)$。

2512.24768 2026-05-13 stat.ML cs.LG

Sparse Offline Reinforcement Learning with Corruption Robustness

Nam Phuong Tran, Andi Nika, Goran Radanovic, Long Tran-Thanh, Debmalya Mandal

AI总结本文研究了在高维稀疏马尔可夫决策过程下，离线稀疏强化学习中对强数据污染的鲁棒性问题。面对对手对部分轨迹的任意扰动，作者提出了一种结合稀疏性的演员-评论家方法，避免了传统方法中过于悲观的奖励估计，首次在单策略集中覆盖假设下给出了非空的理论保证。该方法在污染环境下仍保持鲁棒性，为高维稀疏环境中学习近似最优策略提供了新的可能性。

2511.17038 2026-05-13 cs.AI eess.IV stat.ML

DAPS++: Rethinking Diffusion Inverse Problems with Decoupled Posterior Annealing

Hao Chen, Renzheng Zhang, Scott S. Howard

AI总结本文提出了一种名为DAPS++的新型扩散逆问题求解方法，旨在解决传统扩散模型在逆问题中先验引导不足的问题。该方法通过将扩散初始化与似然驱动的优化过程完全解耦，使重建过程更直接地由测量一致性引导，同时保持数值稳定性。实验表明，DAPS++在减少函数评估次数和优化步骤的前提下，实现了高效的计算性能和鲁棒的图像恢复效果。

2510.20191 2026-05-13 stat.ME

Bias-Variance Tradeoff of Matching Prior to Difference-in-Differences When Parallel Trends is Violated

Mingxuan Ge, Dae Woong Ham

AI总结本文研究了在平行趋势假设被违反时，匹配先验（Matching Prior）与双重差分法（DiD）之间的偏差-方差权衡问题。作者在存在未观测时间变化混杂因素的线性结构模型下，分析了匹配对偏差和均方误差（MSE）的影响，发现匹配处理组与对照组在预处理结果上的变量总是有助于提升估计精度，而仅基于可观测协变量的匹配可能因样本量变化带来不利影响。基于此，作者建议在实际应用中应综合考虑偏差与方差，推荐使用MSE作为评估标准，并提供了匹配变量选择的理论指导与实践建议。

2510.18071 2026-05-13 stat.ML cs.LG stat.ME

Arbitrated Indirect Treatment Comparisons

Yixin Fang, Weili He

AI总结本文提出了一种新的间接治疗比较方法——仲裁间接治疗比较，旨在解决MAIC方法中出现的“MAIC悖论”问题。该悖论指不同研究者使用相同数据得出矛盾的治疗效果结论，原因在于各自隐含针对不同目标人群。新方法通过聚焦于一个共同的重叠人群，估计治疗效果，从而消除因目标人群差异导致的结论不一致问题，提升了间接比较的可靠性和一致性。

2510.16127 2026-05-13 stat.ML cs.LG

Learning density ratios in causal inference using Bregman-Riesz regression

Oliver J. Hines, Caleb H. Miles

AI总结本文提出了一种统一的框架——Bregman-Riesz回归，用于因果推断中的密度比估计问题。该方法结合了基于Bregman散度、概率分类模型以及Riesz回归的三种经典方法，提升了密度比估计的稳定性与泛化能力。研究还探讨了数据增强技术在因果问题中的应用，并通过实验分析了不同Bregman散度和数据增强策略对模型性能的影响，同时提供了相应的Python工具包以支持实际应用。

详情

Comments: Replication code is available from https://github.com/CI-NYC/densityratios

英文摘要

The ratio of two probability density functions is a fundamental quantity that appears in many areas of statistics and machine learning, including causal inference, reinforcement learning, covariate shift, outlier detection, independence testing, importance sampling, and diffusion modeling. Naively estimating the numerator and denominator densities separately using, e.g., kernel density estimators, can lead to unstable performance and suffer from the curse of dimensionality as the number of covariates increases. For this reason, several methods have been developed for estimating the density ratio directly based on (a) Bregman divergences or (b) recasting the density ratio as the odds in a probabilistic classification model that predicts whether an observation is sampled from the numerator or denominator distribution. Additionally, the density ratio can be viewed as the Riesz representer of a continuous linear map, making it amenable to estimation via (c) minimization of the so-called Riesz loss, which was developed to learn the Riesz representer in the Riesz regression procedure in causal inference. In this paper we show that all three of these methods can be unified in a common framework, which we call Bregman--Riesz regression. We further show how data augmentation techniques can be used to apply density ratio learning methods to causal problems, where the numerator distribution typically represents an unobserved intervention. We show through simulations how the choice of Bregman divergence and data augmentation strategy can affect the performance of the resulting density ratio learner. A Python package is provided for researchers to apply Bregman--Riesz regression in practice using gradient boosting, neural networks, and kernel methods.

URL PDF HTML ☆

赞 0 踩 0

2510.14285 2026-05-13 econ.EM math.ST stat.TH

Debiased Kernel Estimation of Spot Volatility in the Presence of Infinite Variation Jumps

B. Cooper Boniece, José E. Figueroa-López, Tianwei Zhou

AI总结本文研究了在存在无限变差跳跃的情况下，如何对局部波动率进行无偏核估计的问题。作者提出了一种截断核估计方法及其去偏版本，将最优速率的局部波动率估计扩展到更广的跳跃活动指数范围，并建立了相应的中心极限定理。相比以往方法，该方法通过使用更一般的核函数和最优带宽收敛速率，实现了更小的渐近方差，并在更灵活的模型假设下具有更广泛的应用性。仿真研究表明，该方法在有限样本中优于现有方法。

2510.09401 2026-05-13 stat.ME stat.CO

Uncertainty Quantification for Multi-level Models Using the Survey-Weighted Pseudo-Posterior

Matthew R. Williams, F. Hunter McGuire, Terrance D. Savitsky

AI总结本文研究了如何在复杂调查样本中对多层模型进行不确定性量化，重点解决由于样本不平衡和有效样本量差异带来的估计偏差和方差估计问题。作者提出了一种改进的自动后处理方法，用于调整局部和全局参数的不确定性估计，并通过模拟研究和国家药物使用健康调查实例验证了其有效性。该方法为多层模型在复杂抽样设计下的参数推断提供了更准确的置信区间估计。

2510.04265 2026-05-13 cs.AI cs.CL math.ST stat.ML stat.TH

Don't Pass@k: A Bayesian Framework for Large Language Model Evaluation

Mohsen Hariri, Amirhossein Samandar, Michael Hinczewski, Vipin Chaudhary

AI总结本文提出了一种基于贝叶斯框架的大语言模型评估方法，旨在解决传统Pass@k指标在样本量有限时排名不稳定、易误导的问题。该方法通过估计模型的底层成功概率及其可信区间，提供更稳定且具有统计意义的模型排名，并支持对评分标准的灵活加权。实验表明，该框架在收敛速度和排名稳定性方面优于Pass@k，且能明确区分统计显著差异与噪声，适用于二元和非二元评估场景。

详情

Journal ref: The Fourteenth International Conference on Learning Representations (ICLR), 2026
Comments: OpenReview (ICLR 2026): https://openreview.net/forum?id=PTXi3Ef4sT

英文摘要

Pass$@k$ is widely used to report the reasoning performance of LLMs, but it often produces unstable and potentially misleading rankings, especially when the number of trials (samples) is limited and computational resources are constrained. We present a principled Bayesian evaluation framework that replaces Pass$@k$ and average accuracy over $N$ trials (avg$@N$) with posterior estimates of a model's underlying success probability and credible intervals, yielding stable rankings and a transparent decision rule for differences. Evaluation outcomes are modeled as categorical (not just 0/1) with a Dirichlet prior, giving closed-form expressions for the posterior mean and uncertainty of any weighted rubric and enabling the use of prior evidence when appropriate. Theoretically, under a uniform prior, the Bayesian posterior mean is order-equivalent to average accuracy (Pass$@1$), explaining its empirical robustness while adding principled uncertainty. Empirically, in simulations with known ground-truth success rates and on AIME'24/'25, HMMT'25, and BrUMO'25, the posterior-based procedure achieves faster convergence and greater rank stability than Pass$@k$ and recent variants, enabling reliable comparisons at far smaller sample counts. The framework clarifies when observed gaps are statistically meaningful (non-overlapping credible intervals) versus noise, and it naturally extends to graded, rubric-based evaluations. Together, these results recommend replacing Pass$@k$ for LLM evaluation and ranking with a posterior-based, compute-efficient protocol that unifies binary and non-binary evaluation while making uncertainty explicit. Source code is available at https://github.com/mohsenhariri/scorio

URL PDF HTML ☆

赞 0 踩 0

2508.21260 2026-05-13 cs.RO eess.SP math.ST stat.TH

Remarks on stochastic cloning and delayed-state filtering

Tara Mina, Lindsey Marinello, John Christian

AI总结本文研究了在航空航天导航和机器人领域中处理依赖于先验状态的延迟状态测量的估计问题，重点探讨了随机克隆（SC）方法以及一种被长期忽视的替代方法——延迟状态卡尔曼滤波（DSKF）。研究发现，正确推导的DSKF能够在无需状态扩增的情况下，实现与SC相同的状态和协方差更新，并提供了两种等效的DSKF形式，从不同角度解释了如何在广义卡尔曼滤波框架中处理先验状态测量的相关性。研究还表明，DSKF在计算和存储复杂度上与SC相当，且在某些问题维度下可进一步降低计算和存储成本，澄清了卡尔曼滤波无法处理相关延迟状态测量的误解。

2508.16132 2026-05-13 q-fin.PM math.ST q-fin.RM stat.TH

On a multivariate extension for Copula-based Conditional Value at Risk

Andres Mauricio Molina Barreto

AI总结本文研究了基于Copula的条件风险价值（CCVaR）在多维（d≥2）情况下的扩展，特别针对由阿基米德Copula描述的依赖结构。作者推导出在阿基米德Copula下的CCVaR的近似闭式表达式，并探讨了该风险度量满足一致性的条件。通过基于实际数据的数值实验，验证了CCVaR的有效性，并与传统的风险价值（VaR）和条件风险价值（CVAR）进行了比较。

2508.08420 2026-05-13 cs.LG stat.ML

Regret minimization in Linear Bandits with offline data via extended D-optimal exploration

Sushant Vijayan, Arun Suggala, Karthikeyan Shanmugam, Soumyabrata Pal

AI总结本文研究了在拥有离线数据的情况下，如何在线最小化线性强盗问题的累积遗憾。提出了一种名为Offline-Online Phased Elimination (OOPE) 的算法，通过在探索阶段使用扩展的D-最优设计，有效利用离线数据以显著降低在线遗憾。该算法的在线遗憾界为 $\tilde{O}(\sqrt{\deff T \log (|\mathcal{A}|T)} + d^2)$，其中 $\deff$ 表示离线数据中未充分探索的方向数，反映了离线数据的质量。此外，本文还给出了依赖于离线数据质量的最小最大遗憾下界，并通过Frank-Wolfe近似进一步优化了算法的复杂度。

详情

Comments: Accepted to TMLR, with J2C certification, link: https://openreview.net/forum?id=4WcK8gKgCi

英文摘要

We consider the problem of online regret minimization in linear bandits with access to prior observations (offline data) from the underlying bandit model. There are numerous applications where extensive offline data is often available, such as in recommendation systems, online advertising. Consequently, this problem has been studied intensively in recent literature. Our algorithm, Offline-Online Phased Elimination (OOPE), effectively incorporates the offline data to substantially reduce the online regret compared to prior work. To leverage offline information prudently, OOPE uses an extended D-optimal design within each exploration phase. OOPE achieves an online regret is $\tilde{O}(\sqrt{\deff T \log \left(|\mathcal{A}|T\right)}+d^2)$. $\deff \leq d)$ is the effective problem dimension which measures the number of poorly explored directions in offline data and depends on the eigen-spectrum $(λ_k)_{k \in [d]}$ of the Gram matrix of the offline data. The eigen-spectrum $(λ_k)_{k \in [d]}$ is a quantitative measure of the \emph{quality} of offline data. If the offline data is poorly explored ($\deff \approx d$), we recover the established regret bounds for purely online setting while, when offline data is abundant ($\Toff >> T$) and well-explored ($\deff = o(1) $), the online regret reduces substantially. Additionally, we provide the first known minimax regret lower bounds in this setting that depend explicitly on the quality of the offline data. These lower bounds establish the optimality of our algorithm in regimes where offline data is either well-explored or poorly explored. Finally, by using a Frank-Wolfe approximation to the extended optimal design we further improve the $O(d^{2})$ term to $O\left(\frac{d^{2}}{\deff} \min \{ \deff,1\} \right)$, which can be substantial in high dimensions with moderate quality of offline data $\deff = Ω(1)$.

URL PDF HTML ☆

赞 0 踩 0

2507.09302 2026-05-13 stat.ME

The Multiplicative Instrumental Variable Model

Jiewen Liu, Chan Park, Yonghoon Lee, Yunshu Zhang, Mengxin Yu, James M. Robins, Eric J. Tchetgen Tchetgen

AI总结本文提出了一种新的工具变量模型——乘法工具变量模型（MIV），用于解决隐藏混杂因素带来的偏差问题。该模型通过在处理倾向得分模型中引入无乘法交互条件，为工具变量与未观测混杂因素对处理变量的影响提供了独立作用机制的正式表述。研究证明，MIV可以在非参数条件下识别处理效应，并提出了一类多重稳健且半参数高效的估计方法，最后通过模拟和实际应用验证了方法的有效性。

2507.03622 2026-05-13 cs.LG cs.AI stat.ML

Localising Dropout Variance in Twin Networks

Cooper Doyle

AI总结该论文研究了如何在双网络模型中定位预测不确定性来源的问题，提出了一种分层方差分解方法，将总预测方差分解为编码器部分和输出头部分。通过独立控制共享编码器和输出头的蒙特卡洛Dropout，能够区分不同来源的不确定性。实验表明，编码器方差在分布偏移时占主导，是预测误差的主要指标，而输出头方差在编码器不确定性控制后才具有信息量，该方法成本低廉，可为数据收集提供实用指导。

2506.23619 2026-05-13 q-fin.ST cs.LG econ.EM stat.ML

Overparametrized models with posterior drift

Guillaume Coqueret, Martial Laguerre

AI总结本文研究了在过度参数化的机器学习模型中，后验漂移对样本外预测准确性的影响。研究发现，当训练与测试样本的数据生成过程参数发生变化时，模型性能会显著下降，这在金融市场等易发生制度变化的场景中尤为重要。应用于股权溢价预测时，研究指出市场择时策略对子时期和模型复杂度参数高度敏感，较小的带宽参数会导致投资回报高度异质，而较大的带宽参数虽能带来更一致的结果，但风险调整后的收益较差，因此在股票市场预测中应谨慎使用大型线性模型。

2505.20761 2026-05-13 cs.LG stat.ML

Practical estimation of the optimal classification error with soft labels and calibration

Ryota Ushio, Takashi Ishida, Masashi Sugiyama

AI总结本文研究了在二分类任务中如何实用且理论严谨地估计最优分类错误率（即贝叶斯错误）。作者在原有基于软标签的方法基础上进行了两个重要扩展：一方面，他们分析了基于硬标签的估计器的偏差性质，揭示其衰减速度与两类条件分布的分离程度相关，并在每实例硬标签数量增加时可能显著优于先前结果；另一方面，他们解决了在软标签被污染的情况下进行估计的问题，指出即使使用校准后的软标签，估计结果仍可能不准确，并提出一种基于等距校准的估计方法，在更弱的假设下仍具有统计一致性。该方法无需具体实例，适用于隐私受限的实际场景。实验验证了方法的有效性。

详情

Comments: ICLR 2026 camera-ready version updated; 40 pages, 12 figures; GitHub: https://github.com/RyotaUshio/bayes-error-estimation

英文摘要

While the performance of machine learning systems has experienced significant improvement in recent years, relatively little attention has been paid to the fundamental question: to what extent can we improve our models? This paper provides a means of answering this question in the setting of binary classification, which is practical and theoretically supported. We extend a previous work that utilizes soft labels for estimating the Bayes error, the optimal error rate, in two important ways. First, we theoretically investigate the properties of the bias of the hard-label-based estimator discussed in the original work. We reveal that the decay rate of the bias is adaptive to how well the two class-conditional distributions are separated, and it can decay significantly faster than the previous result suggested as the number of hard labels per instance grows. Second, we tackle a more challenging problem setting: estimation with corrupted soft labels. One might be tempted to use calibrated soft labels instead of clean ones. However, we reveal that calibration guarantee is not enough, that is, even perfectly calibrated soft labels can result in a substantially inaccurate estimate. Then, we show that isotonic calibration can provide a statistically consistent estimator under an assumption weaker than that of the previous work. Our method is instance-free, i.e., we do not assume access to any input instances. This feature allows it to be adopted in practical scenarios where the instances are not available due to privacy issues. Experiments with synthetic and real-world datasets show the validity of our methods and theory. The code is available at https://github.com/RyotaUshio/bayes-error-estimation.

URL PDF HTML ☆

赞 0 踩 0

2505.17506 2026-05-13 stat.ML cs.LG

Offline Constrained Reinforcement Learning under Partial Data Coverage

Seokmin Ko, Ambuj Tewari, Kihyuk Hong

AI总结本文研究了在折扣约束马尔可夫决策过程下，使用通用函数逼近的离线约束强化学习问题。针对现有方法需要完整数据覆盖、缺乏oracle效率或依赖数据生成分布等限制，提出了一种基于分解线性规划的原对偶算法PDOCRL，将策略显式纳入优化变量，避免了对数据生成分布的依赖。该方法在部分策略覆盖条件下，能够在不依赖数据生成分布的情况下，以$\widetilde{\mathcal O}(ε^{-2})$的样本复杂度返回近似最优且近似可行的策略，并在实验中表现出与强基线相当的性能。

2505.13770 2026-05-13 cs.AI cs.CL cs.LG stat.ME stat.ML

Ice Cream Doesn't Cause Drowning: Benchmarking LLMs Against Statistical Pitfalls in Causal Inference

Jin Du, Li Chen, Xun Xian, An Luo, Fangqiao Tian, Ganghua Wang, Charles Doss, Xiaotong Shen, Jie Ding

AI总结本研究探讨了大型语言模型（LLMs）在因果推断中应对统计陷阱的能力，指出当前模型在处理如辛普森悖论和选择偏差等复杂统计问题时存在明显不足。为此，研究提出了一个名为CausalPitfalls的综合性基准，通过多难度级别的结构化挑战和评分标准，系统评估模型的因果推理能力与回答可靠性。实验结果揭示了现有LLMs在统计因果推理中的局限性，并为构建可信的因果推理系统提供了重要参考。

2505.11517 2026-05-13 physics.soc-ph cs.CE cs.IT math.IT stat.AP

Information-Theoretic Grid Topology Reconstruction using Low-Precision Smart Meter Data

Daniel T. Speckhard

AI总结本文研究了在低精度智能电表数据条件下重建配电网络拓扑所需的最小数据保真度。采用信息论方法，利用Chow-Liu算法基于互信息生成最大生成树，重点分析了数据位深、有效数字截断、时间窗口长度及互信息估计方法对重构精度的影响。实验表明，即使使用8位量化数据或毫伏级精度，也能有效恢复电网拓扑，但若采样间隔超过20分钟或数据持续时间过短，性能将显著下降。研究为未来评估实际噪声数据和融合工程先验的混合方法提供了理论基础。

2502.09162 2026-05-13 stat.ME

Infinitely divisible priors for multivariate survival functions

Florian Brück

AI总结本文提出了一种非参数先验框架，用于实值随机向量，可视为对“右中立先验”的多变量推广。该方法通过将最小无限可分随机向量的指数测度随机化，结合无限可分随机测度，能够自然地处理部分可交换数据和可交换随机向量。研究展示了如何从简单模块构建分层先验，并将许多贝叶斯非参数生存分析模型嵌入该框架，同时探讨了后验预测分布及其在一定正则条件下的性质，并提供了相关模拟方法，适用于生存分析中的部分可交换数据场景。

2411.00471 2026-05-13 stat.ME cs.LG

Dirichlet process mixtures of block $g$ priors for model selection and prediction in linear models

Anupreet Porwal, Abel Rodriguez

AI总结本文提出了一种用于线性模型中模型选择和预测的块 $g$ 先验的狄利克雷过程混合方法。该方法在传统 $g$ 先验混合的基础上进行扩展，允许对不同参数块进行差异化的收缩，同时充分考虑预测变量之间的相关性，连接了模型选择与连续收缩先验的研究领域。该方法在多种意义上具有一致性，能够避免条件林德利悖论，并通过一种仅需少量调参的马尔可夫链蒙特卡洛算法进行后验推断。实验证明，当存在少量非常大的效应时，该方法能够更有效地检测出较小但显著的效应，且仅带来少量的假阳性发现。

2407.11465 2026-05-13 math.ST math.PR q-fin.MF stat.ME stat.TH

Testing by Betting while Borrowing and Bargaining

Hongjian Wang, Muriel F. Pérez-Ortiz, Wouter M. Koolen, Aaditya Ramdas

AI总结本文研究了在博弈论统计学中允许借贷情况下，基于下注的假设检验方法的调整问题。传统方法要求下注金额不能超过当前财富，而本文探讨了当允许借贷时，如何调整拒绝域阈值以保持相同的显著性水平。研究发现，若采用依赖负债的阈值，需支付额外代价；而若采用依赖历史杠杆率的路径相关阈值，则无需额外代价，从而为借贷条件下的假设检验提供了新的理论支持。

2303.12834 2026-05-13 quant-ph cs.AI cs.LG stat.ML

The power and limitations of learning quantum dynamics incoherently

Sofiene Jerbi, Joe Gibbs, Manuel S. Rudolph, Matthias C. Caro, Patrick J. Coles, Hsin-Yuan Huang, Zoë Holmes

AI总结本文研究了在不依赖系统与目标直接量子交互的非相干框架下学习量子动力学的可行性与限制。通过分析模拟已有相干学习策略所需的测量次数，作者给出了学习单元过程的样本复杂度界限，并证明在允许任意测量时，任何高效可表示的单元算子均可在非相干框架中高效学习；而仅使用浅层测量时，仅能学习低纠缠单元算子。研究还通过在IBM量子设备上成功学习16量子比特单元算子，并通过数值实验验证了算法的可扩展性。