arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2606.20141 2026-06-19 stat.CO 新提交

DASH: A Dimensionality Reduction Method for Large-scale Convex MIQP with Applications in Subset Portfolio Selection

DASH: 一种用于大规模凸MIQP的降维方法及其在子集投资组合选择中的应用

Pinzhang Cheng

AI总结提出DASH降维方法，通过减少变量层次改善大规模凸MIQP求解器性能，在子集投资组合选择中显著提升Gurobi难以求解问题的初始解质量。

详情

AI中文摘要

作为MIP（混合整数规划）的子集选择问题是NP难的。对于大规模问题，在合理时间内找到全局最优解是不可行的，实践中常通过MIP求解器寻找高质量的初始解。本文提出DASH（递减活动集层次）——一种降维方法，针对可表述为MIQP（混合整数二次规划）的一类最佳子集选择问题，提高MIP求解器的性能。我们在子集投资组合选择问题中开发并评估了DASH的性能，并与商业MIP求解器Gurobi进行了比较。除了问题规模外，问题的难度还与协方差矩阵的条件数以及投资组合权重的箱约束有关。大量不同问题配置的数值实验表明，当Gurobi难以求解问题时，DASH能持续显著改进初始解。特别是，DASH改进的幅度和持续时间随问题难度增加而扩大。

英文摘要

Subset selection problems as MIPs (Mixed Integer Programs) are NP-hard. For large scale problems, it is infeasible to find global optimal solutions in a reasonable time and good-quality incumbent solutions are sought after with MIP solvers in practice. This paper proposes DASH (Decreasing Active Set Hierarchy) -- a dimensionality reduction method that improves the MIP solver performance for a subclass of best subset selection problems that can be formulated as MIQPs (Mixed Integer Quadratic Programs). We develop and evaluate the performance of DASH in the subset portfolio selection problem with comparison to Gurobi, a commercial MIP solver. In addition to the problem size, the difficulty of a problem is related to the condition number of the covariance matrix and the box constraint on portfolio weights. An extensive set of numerical experiments with varying problem configurations shows that DASH offers consistent and significant improvement of incumbent solutions when the problem is difficult to solve by Gurobi. In particular, the magnitude and duration of improvement by DASH scale with the difficulty of the problem.

URL PDF HTML ☆

赞 0 踩 0

2606.19909 2026-06-19 stat.CO math.PR stat.ME 新提交

Establishing an $Ω(\sqrt{d})$ complexity lower bound for PDMP samplers and how to break it: a sub-$\sqrt{d}$ algorithm for Gaussian-tailed targets

建立 PDMP 采样器的 $\Omega(\sqrt{d})$ 复杂度下界及如何突破：针对高斯尾目标的一个亚 $\sqrt{d}$ 算法

Augustin Chevallier

AI总结本文证明分段确定性马尔可夫过程采样器在标准设置下具有 $\Omega(\sqrt{d})$ 复杂度下界，并通过放宽目标密度连续时间不变性假设，提出一种新方案，对高斯尾目标实现 $O(d^\alpha)$（$\alpha\in[0.2,0.3]$）的经验复杂度。

2606.19655 2026-06-19 stat.CO math.ST stat.TH 新提交

A Flat Connection: The Pooling Factor and the Geometry of Centring in Hierarchical MCMC

平坦联络：分层MCMC中的汇集因子与中心化几何

Aidan D. Bindoff

AI总结研究分层MCMC中中心化/非中心化障碍的几何原因，证明Fisher信息诱导的联络是平坦的，障碍源于统计上的汇集因子π_j，并据此提出诊断方法。

Comments 39 pages, 9 figures, accompanying R package

详情

AI中文摘要

标准MCMC诊断（$\hat{R}$、有效样本量、发散计数）检测链是否混合，但不检测为何未混合。我们询问分层模型中的中心化/非中心化障碍是否具有度量之外的几何原因。联合参数空间是一个纤维丛（超参数为底，组级参数为纤维），Fisher信息度量诱导一个Ehresmann联络$A = -G_{FF}^{-1}G_{BF}$；自然假设是障碍是其曲率，采样器将其感受为和乐。我们证明这是错误的。对于任何光滑的分层后验，不仅是高斯情况，联络是平坦的，因为其水平叶是纤维得分$\partial_\alpha \log p$的水平集：度量之上没有几何障碍。剩下的障碍是统计的，而非几何的，平坦联络将其识别为一个单一量：纤维对底的条件依赖性，由每组的先验比例$\pi_j$（经典汇集因子）控制。该框架由此恢复了已有图景：先验主导的组混合缓慢，每组的非中心化最优权重有闭式解，并且一项模拟研究通过它们对分层方差的相反依赖性，将这种底-纤维耦合与漏斗（一种不同的底空间病态）区分开来。一项直接归因测试确认NUTS不运输纤维：链级足迹是先验主导组中多余的条件自相关，正如$\pi_j$所预测。真正的、甚至旋转的曲率确实出现，但仅针对由采样器工作度量（固定质量矩阵）构建的联络，此时和乐作为算法现象而非几何现象重新出现。先验比例诊断作为R包fibr分发，几何方法作为附带的复现代码。

英文摘要

Standard MCMC diagnostics ($\hat{R}$, effective sample size, divergence counts) detect whether a chain has mixed, but not why it has not. We ask whether the centring/non-centring obstruction in hierarchical models has a geometric cause beyond the metric. The joint parameter space is a fiber bundle (hyperparameters the base, group-level parameters the fibers), and the Fisher information metric induces an Ehresmann connection $A = -G_{FF}^{-1}G_{BF}$; the natural hypothesis is that the obstruction is its curvature, felt by the sampler as holonomy. We prove this false. The connection is flat for any smooth hierarchical posterior, not only the Gaussian case, because its horizontal leaves are the level sets of the fiber score $\partial_α\log p$: there is no geometric obstruction above the metric. What remains is statistical, not geometric, and the flat connection identifies it as a single quantity: the conditional dependence of fiber on base, governed per group by the prior fraction $π_j$, the classical pooling factor. From it the framework recovers the established picture, that prior-dominated groups mix slowly and that the optimal per-group non-centring weight follows in closed form, and a simulation study separates this base-fiber coupling from the funnel, a distinct base-space pathology, by their opposite dependence on the hierarchical variance. A direct attribution test confirms that NUTS does not transport the fiber: the chain-level footprint is excess conditional autocorrelation in prior-dominated groups, exactly as $π_j$ predicts. Genuine, even rotational, curvature does appear, but only for connections built from a sampler's working metric (a fixed mass matrix), where holonomy re-enters as an algorithmic rather than geometric phenomenon. The prior-fraction diagnostic is distributed as the R package fibr, with the geometric methods as accompanying reproduction code.

URL PDF HTML ☆

赞 0 踩 0

2606.20406 2026-06-19 stat.ME stat.CO 交叉投稿

Flexible modeling of bimodal distributions via skewed-$t$ mixtures

双峰分布的灵活建模：基于偏斜-t分布的混合模型

Marco Bee, Flavio Santi

AI总结提出基于Fernández和Steel (1998)偏斜-t分布的混合模型，通过EM算法进行极大似然估计，并开发似然比检验，用于拟合双峰、偏斜和厚尾数据，在标准普尔500指数中验证了双峰性。

详情

AI中文摘要

我们提出了一种位置-尺度偏斜-t分布的混合模型，用于拟合双峰、偏斜和厚尾数据。特别地，该混合模型基于Fernández和Steel (1998)的偏斜-t分布，因此模型构建过程可以轻松扩展到其他对称分布的混合。在研究了混合模型的性质后，我们通过EM算法开发了极大似然估计方法，并提出了一个似然比检验，用于检验任何给定成分中无偏斜的原假设。与最近提出的g-and-h分布混合的基于模拟的比较表明，所提出模型在良好指定设置下的估计精度和错误指定框架下的建模能力方面均表现出色。将该模型拟合到标准普尔500指数失真数据，证实了其分布的双峰性，这意味着美国股市历史上处于熊市或牛市状态，而非接近其基本面价值。

英文摘要

We propose a mixture of location-scale skewed-$t$ distributions to fit bimodal, skewed and heavy-tailed data. In particular, the mixture is based on the skewed-$t$ distribution by Fernández and Steel (1998), so that the model-building procedure can be easily extended to mixtures of other symmetric distributions. After studying the properties of the mixture, we develop a maximum likelihood estimation approach via the EM algorithm and a likelihood ratio test of the null hypothesis of no skewness in any given component. A simulation-based comparison to a recently proposed mixture of g-and-h distributions suggests that the performance of the proposed model is excellent, in terms of both estimation precision in well-specified setups and modeling capability in mis-specified frameworks. Fitting the model to the Standard & Poor's 500 distortion allows us to confirm the bimodality of its distribution, with the implication that the US stock market has historically been in bearish or bullish conditions, rather than near its fundamental value.

URL PDF HTML ☆

赞 0 踩 0

2606.20226 2026-06-19 stat.ME stat.CO 交叉投稿

Analysis of uncertain fixed-effects model for Latin square designs

拉丁方设计的不确定固定效应模型分析

Yaru Cheng, Zhiming Li

AI总结针对无频率稳定性的不确定实验数据，建立拉丁方设计的不确定固定效应模型，提出三种估计方法并构建置信区间，进行不确定齐性检验和常见检验，通过数值模拟和实例验证模型有效性。

2606.19540 2026-06-19 stat.ME stat.CO stat.ML 交叉投稿

Overfitted high-dimensional matrix factorizations via adaptive spectral shrinkage

通过自适应谱收缩的过拟合高维矩阵分解

Lorenzo Mauri, David B. Dunson

AI总结提出EigenBayes方法，通过谱估计和自适应经验贝叶斯校准超参数，实现快速且具有不确定性量化的过拟合因子模型，在数值实验和基因组学应用中优于现有方法。

详情

AI中文摘要

因子模型是分析高维数据以提取低秩信号和估计协方差的常用方法。它们将协方差矩阵分解为低秩分量和对角分量之和。一个关键问题是如何选择潜在维度$k$，当因子模型仅近似成立且信噪比较低时，这尤其具有挑战性。贝叶斯过拟合因子模型指定$k$的上界，并依赖结构化收缩先验有效去除多余分量。这类方法流行且有效，但计算成本高。我们提出了一种更快的\texttt{EigenBayes}方法，基于潜在因子的谱估计和关键超参数的自适应经验贝叶斯校准，提供有效的不确定性量化。得到的后验分布可跨结果分解且解析可处理，绕过了马尔可夫链蒙特卡洛。我们证明\texttt{EigenBayes}能适应每个结果和潜在维度的信噪比，同时将多余的潜在分量收缩至零。我们建立了良好的渐近性质，并在数值实验和基因组学应用中展示了强大的实证性能，其中EigenBayes优于最先进的替代方法。

英文摘要

Factor models are popular approaches for analyzing high-dimensional data to extract low-rank signals and estimate covariances. They decompose the covariance matrix as the sum of low-rank and diagonal components. A key issue is how to choose the latent dimension $k$, which is particularly challenging when the factor model only holds approximately and in low signal-to-noise scenarios. Bayesian overfitted factor models specify an upper bound on $k$ and rely on structured shrinkage priors to effectively remove extra components. Such approaches are popular and effective, but computationally expensive. We propose a much faster \texttt{EigenBayes} approach that provides valid uncertainty quantification, based on spectral estimation of latent factors and adaptive empirical Bayes calibration of key hyperparameters. The resulting posterior distribution factorizes across outcomes and is analytically tractable, bypassing Markov chain Monte Carlo. We show that \texttt{EigenBayes} adapts to the signal-to-noise ratio of each outcome and latent dimension, while shrinking superfluous latent components to zero. We establish favorable asymptotic properties and demonstrate strong empirical performance in numerical experiments and a genomics application, where EigenBayes outperforms state-of-the-art alternatives.

URL PDF HTML ☆

赞 0 踩 0

2606.20451 2026-06-19 stat.ML cs.LG stat.AP stat.CO 交叉投稿

SSH-Net: A Deep Neural Network for Predicting Failure Time Distribution Functions under Competing Risks with Application to GPU Data

SSH-Net: 一种用于竞争风险下预测失效时间分布函数的深度神经网络及其在GPU数据上的应用

Jie Min, Yueyao Wang, Mengkun Chen

AI总结提出结构化分段风险深度神经网络（SSH-Net），通过将网络结构与数据结构关联，允许不同协变量组通过子网络影响预测，在竞争风险框架下预测失效时间分布函数，仿真和GPU数据验证了准确性。

详情

AI中文摘要

竞争风险在工程领域常见，当应用场景复杂时会给时间事件数据建模带来挑战。近年来，深度神经网络因其灵活性和高学习能力在竞争风险预测中受到广泛关注。然而，神经网络结构的复杂性使得基于不同数据输入的超参数调优更加困难。此外，当工程系统具有多层级的复杂物理结构时，将所有结构层级视为单一输入组可能无法捕捉关键信息。为解决这些问题，我们提出了一种结构化分段风险深度神经网络（SSH-Net），用于在特定原因竞争风险框架下预测失效时间。我们的方法将神经网络结构与数据结构相关联，并允许不同的协变量组通过分离的子网络影响失效预测。神经网络基于特定原因竞争风险模型构建。SSH-Net输出特定原因风险函数，并采用惩罚对数似然作为损失函数。通过评估Brier分数、接收者操作特征曲线下面积（AUC）和预测的特定原因累积发生函数的均方根误差（RMSE），仿真研究验证了SSH-Net的预测准确性。我们进一步使用Titan GPU失效时间数据展示了模型预测失效时间分布函数的能力。

英文摘要

Competing risks are commonly observed in engineering fields and can bring challenges to time-to-event data modeling when the application scenarios are complicated. Recently, deep neural networks have received great attention for prediction with competing risks, due to their flexibility and high learning capability. However, the complexity of neural network structure brings extra difficulty in hyperparameter tuning based on different data inputs. Additionally, when an engineered system has complex physical structures with multiple hierarchical levels, treating all structural levels as a single group of inputs may fail to capture critical information. To address the issues, we propose a Structured Segmented Hazard Deep Neural Network (SSH-Net) for failure time prediction under cause-specific competing risks framework. Our approach associates neural network structure with data structures, and allows different covariate groups to impact the failure prediction through separate sub-networks. The neural network is constructed based on a cause-specific competing risks model. The SSH-Net outputs cause-specific hazard functions, and utilizes the penalized log-likelihood as the loss function. The prediction accuracy of SSH-Net is validated through simulation studies by evaluating the Brier score, the area under receiver operating characteristic curves (AUC), and the root mean square error (RMSE) of the predicted cause-specific cumulative incident function. We further demonstrate the model's ability to predict failure time distribution functions using the Titan GPU failure time data.

URL PDF HTML ☆

赞 0 踩 0

2606.19714 2026-06-19 stat.ML cs.AI cs.LG stat.CO stat.ME 交叉投稿

AURA: Adaptive Uncertainty-aware Refinement for LLM-as-a-Judge Auditing

AURA: 用于LLM作为评判审计的自适应不确定性感知精炼

Zilong Zhang, Yi-Ting Hung, Weiyi He, Junxi Zhang, Lei Ding, Chi-Kuang Yeh

AI总结提出AURA框架，通过自适应不确定性感知精炼，在少量人工验证下迭代学习人类一致性信号，优先审核不确定比较，提升LLM评判的可靠性。

详情

AI中文摘要

大型语言模型（LLM）越来越多地被用作开放式生成的评判者，因为大规模人工评估通常昂贵且难以扩展，但它们的偏好仍然是人类判断的不完美代理。现有的审计流程通常假设事先存在可靠的示例子集或干净的监督信号，例如来自人工注释、启发式过滤或强评判者的输出。在LLM评估中，这一假设是脆弱的：初始分割可能继承评判者偏差，而人工验证通常过于稀缺，无法在规模上定义稳定组。我们提出AURA，一种自适应不确定性感知精炼框架，用于在选定的人工验证下审计成对LLM作为评判的决策。AURA迭代学习人类一致性信号，传播可靠证据，并优先将不确定的比较提交人工审核。关键思想是将对评判者的信任视为一个潜在量，随着证据积累逐步精炼。我们提供了紧凑的公式、稳定的精炼过程，以及在合成和真实成对LLM答案数据上的全面评估。

英文摘要

Large language models (LLMs) are increasingly used as judges for open-ended generation, as large-scale human evaluation is often expensive and difficult to scale, yet their preferences remain imperfect proxies for human judgment. Existing auditing pipelines often assume that a reliable subset of examples or clean supervision signals are available beforehand, for example from human annotation, heuristic filtering, or the outputs of strong judges. In LLM evaluation, this assumption is fragile: the initial split may inherit judge bias, while human verification is typically too scarce to define stable groups at scale. We propose AURA, an adaptive uncertainty--aware refinement framework for auditing pairwise LLM--as--a--judge decisions under selected human verification. AURA iteratively learns a human-consistency signal, propagates reliable evidence, and prioritizes uncertain comparisons for human review. The key idea is to treat trust in a judge as a latent quantity that is progressively refined as evidence accumulates. We provide a compact formulation, a stable refinement procedure, and a comprehensive evaluation on both synthetic and real pairwise LLM-answer data.

URL PDF HTML ☆

赞 0 踩 0

2606.19361 2026-06-19 cs.LG cs.AI cs.NA math.NA stat.CO stat.ME stat.ML 交叉投稿

Computational Identifiability

计算可识别性

Lucius E. J. Bynum, Rajesh Ranganath, Kyunghyun Cho

发表机构 * New York University（纽约大学）

AI总结提出“计算可识别性”框架，通过有限计算搜索过程在指定误差容限内找到经验估计量，从而解决理论可识别性在有限样本、模糊图标准等实际场景中的不足。

详情

AI中文摘要

识别条件描述了目标查询或感兴趣参数作为可用信息类型和数量的函数的可计算性。在因果识别中，这些信息通常以因果图的形式表达，数据是针对图中某些变量子集观测或收集的。目标查询可以是单个效应，也可以是给定模型中的一类效应。识别算法的推导在数学上定义了期望中理论上唯一确定所需因果效应的过程。期望中的可识别性，即“理论可识别性”，通常假设渐近性质、无限数据或其他数学理想化条件。在本文中，我们探讨了这种理论理想化的可识别性与一种受计算限制的替代方案之间的根本区别。我们提出的框架——“计算可识别性”——而是为经验估计量定义一个有限的计算搜索过程。如果该过程在期望的误差容限内经验性地找到了估计量，则满足可识别性，条件取决于搜索的指定假设（即参数上的先验分布）以及搜索过程本身。通过多个实验，我们展示了该框架如何回答细粒度的实际识别问题，例如小有限样本下的识别、模糊图标准下的识别、混合观测-干预数据下的识别，以及跨反事实数据和估计量的识别。代码见 https://this https URL。

英文摘要

Identification conditions describe the computability of a target query or parameter of interest as a function of the type and amount of information available. In causal identification, this information is often expressed in the form of a causal graph, and data are observed or collected for some subset of variables in the graph. Target queries may be for a single effect alone or for a class of effects in a given model. The derivation of an identification algorithm then defines mathematically the process by which the desired causal effect(s) can be uniquely determined, theoretically, in expectation. Identifiability in expectation, or 'theoretical identifiability,' generally assumes asymptotic properties, infinite data, or other mathematically idealized conditions. In this paper, we explore a fundamental distinction between this theoretical, idealized notion of identifiability and a proposed alternative that is computation-bound. The framework we propose - 'computational identifiability' - is to instead define a finite computational search procedure for an empirical estimator. If this process finds an estimator empirically, within a desired error tolerance, then identifiability is satisfied, conditional on the specified assumptions of the search (i.e., a prior distribution over the parameters) and conditional on the search procedure itself. Through several experiments, we demonstrate how this framework allows us to answer fine-grained, practical identification questions, such as identification with small finite samples, with ambiguous graphical criteria, with mixed observational-interventional data, and across counterfactual data and estimands. Code is available at https://github.com/lbynum/metadentify.

URL PDF HTML ☆

赞 0 踩 0

2606.04307 2026-06-19 cs.LG stat.CO stat.ME 版本更新

Folded Transport MCMC: Eliminating Label Switching by Sampling on a Fundamental Domain

折叠传输MCMC：对称贝叶斯模型的可认证商后验计算

Jun Hu

发表机构 * Wuhan University of Technology（武汉理工大学）

AI总结针对对称贝叶斯模型中的冗余多峰性导致MCMC收敛诊断退化的问题，提出Folded Transport MCMC方法，通过在对称群的基本域上构建独立采样器直接对商后验进行推断，并利用LCNF振荡认证框架在商度量下提供可证明的认证下界。

Comments 50 pages (including supplementary material), 5 figures, 6 tables. Submitted to Journal of Computational and Graphical Statistics

详情

AI中文摘要

具有有限对称性的贝叶斯模型——如可交换分量的混合模型、具有紧密间隔模态的结构识别——定义的后验在标签置换群下不变，产生冗余的多峰性，从而降低MCMC收敛诊断的质量。我们引入折叠传输MCMC（FolT-MCMC），该方法通过在对称群的基本域上构建独立采样器，直接对商后验进行推断。商提议分布通过对群轨道上学习的归一化流进行对称化得到。我们证明了基于LCNF振荡的认证框架可以迁移到商度量，并具有稳定子修正的球质量界和改进的覆盖半径，并且当未折叠流表现出跨模态提议缺陷时，分位数核心认证下界会得到改善。在高斯混合（d=2-20）、标签切换目标（最多24个等价模态）以及标准贝叶斯三分量混合后验上，分位数核心认证改进比从2倍到145倍不等，且折叠认证经验上几乎与维度无关。在台风山竹期间超高层建筑的真实加速度计数据上，FolT-MCMC产生了非平凡的分位数核心认证，而未折叠认证是平凡的。

英文摘要

In Bayesian mixture models and other exchangeable-component models, the posterior is invariant under permutation of component labels, creating m! equivalent modes-the label-switching problem. Standard MCMC methods either mix poorly across these modes or rely on post-hoc relabelling that cannot guarantee the sampler has converged. We propose Folded Transport MCMC (FolT-MCMC), which eliminates label switching before sampling by restricting the Markov chain to a fundamental domain-a sorted or reflected subspace containing exactly one representative from each symmetric mode. The proposal is a learned normalising flow whose density is symmetrised over the group orbits, ensuring correct targeting on the reduced space. We show that this construction preserves a computable convergence diagnostic based on the oscillation of the log-density ratio, and that the diagnostic becomes sharper on the fundamental domain whenever the original-space flow under-covers one or more symmetric modes. Experiments on Gaussian mixtures (d=2-20), label-switching targets (up to 24 equivalent modes), a standard Bayesian three-component mixture posterior, and real accelerometer data from a supertall building show improvement ratios of 2x to 145x, with the folded diagnostic stable across dimensions while the unfolded diagnostic collapses.

URL PDF HTML ☆

赞 0 踩 0

2602.01929 2026-06-19 math.DS stat.CO stat.ML 版本更新

Probabilistic function-on-function nonlinear autoregressive model for emulation and reliability analysis of stochastic dynamical systems

概率函数对函数非线性自回归模型用于随机动力系统的仿真与可靠性分析

Zhouzhou Song, Marcos A. Valdebenito, Styfen Schär, Stefano Marelli, Bruno Sudret, Matthias G. R. Faes

AI总结提出F2NARX模型，从函数对函数回归角度改进NARX方法，结合PCA和高斯过程回归实现概率预测，并通过主动学习高效估计首次穿越失效概率。

详情

AI中文摘要

在许多工程领域，构建准确且计算高效的代理模型（或仿真器）用于预测动力系统响应至关重要，但由于外部激励和系统参数到系统响应的强非线性和高维映射，这仍然具有挑战性。本文引入了一种新颖的函数对函数非线性自回归外生输入模型（F2NARX），该模型从函数对函数回归的角度重新表述了最近提出的$\mathcal{F}$-NARX方法。所提出的框架在保持高精度的同时显著提高了预测效率。通过将主成分分析与高斯过程回归相结合，F2NARX进一步通过无迹变换以自回归方式实现动力响应的概率预测。这种概率预测能力进一步促进了首次穿越概率评估的主动学习。通过不同复杂度的案例研究证明了该方法的有效性。结果表明，F2NARX在效率上比最先进的NARX模型高出几个数量级，同时通常达到更高的精度。此外，主动学习方法能够仅使用少量训练时间历程准确估计动力系统的首次穿越失效概率。

英文摘要

Constructing accurate and computationally efficient surrogate models (or emulators) for predicting dynamical system responses is critical in many engineering domains, yet remains challenging due to the strongly nonlinear and high-dimensional mapping from external excitations and system parameters to system responses. This work introduces a novel Function-on-Function Nonlinear AutoRegressive model with eXogenous inputs (F2NARX), which reformulates the recently proposed $\mathcal{F}$-NARX method from a function-on-function regression perspective. The proposed framework substantially improves predictive efficiency while maintaining high accuracy. By combining principal component analysis with Gaussian process regression, F2NARX further enables probabilistic predictions of dynamical responses via the unscented transform in an autoregressive manner. Such probabilistic prediction capabilities further facilitate active learning for first-passage probability evaluation. The effectiveness of the method is demonstrated through case studies of varying complexity. Results show that F2NARX outperforms state-of-the-art NARX model by orders of magnitude in efficiency while achieving higher accuracy in general. Meanwhile, the active learning approach enables accurate estimation of first-passage failure probabilities for dynamical systems using only a small number of training time histories.

URL PDF HTML ☆

赞 0 踩 0

2503.11479 2026-06-19 stat.CO math.PR math.ST stat.ME stat.TH 版本更新

Towards practical PDMP sampling: Metropolis adjustments, locally adaptive step-sizes, and NUTS-based time lengths

走向实用的PDMP采样：Metropolis调整、局部自适应步长和基于NUTS的时间长度

Augustin Chevallier, Sam Power, Matthew Sutton

AI总结针对PDMP采样需要计算模型特定界限的难题，提出Metropolis调整近似、自适应步长机制和NUTS启发的路径长度选择，集成得到双重自适应PDMP采样器，提升鲁棒性和效率。

详情

AI中文摘要

分段确定性马尔可夫过程（PDMP）在从复杂概率分布中采样方面具有重要前景。然而，其实践应用受到需要计算模型特定界限的限制。相反，虽然哈密顿蒙特卡洛（HMC）提供了一种普遍有效的采样方法，但其无法自适应调整步长，导致在采样漏斗形等复杂分布时性能受损。为解决这些限制，我们引入了三个创新概念：(a) 一种Metropolis调整的PDMP模拟近似，无需显式界限且不破坏不变测度；(b) 一种与Metropolis校正兼容的自适应步长机制；(c) 一种受无U型转弯采样器（NUTS）启发的方案，用于动态选择PDMP中的路径长度。这三个想法可以无缝集成到一个单一的“双重自适应”PDMP采样器中，具有良好的鲁棒性和效率特性。

英文摘要

Piecewise-Deterministic Markov Processes (PDMPs) hold significant promise for sampling from complex probability distributions. However, their practical implementation is hindered by the need to compute model-specific bounds. Conversely, while Hamiltonian Monte Carlo (HMC) offers a generally efficient approach to sampling, its inability to adaptively tune step sizes impedes its performance when sampling complex distributions like funnels. To address these limitations, we introduce three innovative concepts: (a) a Metropolis-adjusted approximation for PDMP simulation that eliminates the need for explicit bounds without compromising the invariant measure, (b) an adaptive step size mechanism compatible with the Metropolis correction, and (c) a No U-Turn Sampler (NUTS)-inspired scheme for dynamically selecting path lengths in PDMPs. These three ideas can be seamlessly integrated into a single, `doubly-adaptive' PDMP sampler with favourable robustness and efficiency properties.

URL PDF HTML ☆

赞 0 踩 0