arXivDaily arXiv每日学术速递 周一至周五更新
全部学科分类 1530
专题追踪
2606.20148 2026-06-19 stat.ME 新提交

A case study of causal mediation using Bayesian nonparametrics and semiparametric corrections

使用贝叶斯非参数和半参数修正的因果中介分析案例研究

Yuhua Zhang, Michael J. Daniels

AI总结 提出截断富集狄利克雷过程混合模型估计自然直接和间接效应,结合高效MCMC算法和基于有效影响函数的一步后验修正,解决贝叶斯非参数中因果估计量的可靠推断问题。

详情
AI中文摘要

我们提出了一种贝叶斯非参数方法,使用截断富集狄利克雷过程混合(EDPM)模型来估计存在后处理混杂因素时的因果中介分析中的自然直接效应(NDE)和间接效应(NIE)。我们引入了一种高效的簇重分配Metropolis-Hasting算法,以改善阻塞吉布斯采样器中的混合。我们基于有效影响函数实现了针对我们设定的一步后验修正。这个后处理步骤解决了贝叶斯非参数中的一个关键问题:如何从为复杂联合分布设计的模型中获得特定因果估计量(NDE和NIE)的可靠估计和后验,并具有优良的频率性质,如正确的覆盖。我们进行了模拟研究以评估我们方法的性能,并将其应用于评估一项体重管理临床试验中的因果中介效应。

英文摘要

We propose a Bayesian nonparametric approach using a truncated Enriched Dirichlet Process mixture (EDPM) model to estimate natural direct (NDE) and indirect (NIE) effects in causal mediation analyses in the presence of post-treatment confounders. We introduce an efficient cluster reallocation Metropolis-Hasting algorithm to improve mixing in the blocked Gibbs sampler. We implement a one-step posterior correction based on the efficient influence function for our setting. This post-processing step solves a critical problem in Bayesian nonparametrics: how to obtain reliable estimates and posteriors for a specific causal estimand of interest (the NDE and NIE) with excellent frequentist properties, such as correct coverage, from a model designed for complex joint distributions. We conduct simulation studies to assess our method's performance and apply it to evaluate causal mediation effects in a weight management clinical trial.

2606.20141 2026-06-19 stat.CO 新提交

DASH: A Dimensionality Reduction Method for Large-scale Convex MIQP with Applications in Subset Portfolio Selection

DASH: 一种用于大规模凸MIQP的降维方法及其在子集投资组合选择中的应用

Pinzhang Cheng

AI总结 提出DASH降维方法,通过减少变量层次改善大规模凸MIQP求解器性能,在子集投资组合选择中显著提升Gurobi难以求解问题的初始解质量。

详情
AI中文摘要

作为MIP(混合整数规划)的子集选择问题是NP难的。对于大规模问题,在合理时间内找到全局最优解是不可行的,实践中常通过MIP求解器寻找高质量的初始解。本文提出DASH(递减活动集层次)——一种降维方法,针对可表述为MIQP(混合整数二次规划)的一类最佳子集选择问题,提高MIP求解器的性能。我们在子集投资组合选择问题中开发并评估了DASH的性能,并与商业MIP求解器Gurobi进行了比较。除了问题规模外,问题的难度还与协方差矩阵的条件数以及投资组合权重的箱约束有关。大量不同问题配置的数值实验表明,当Gurobi难以求解问题时,DASH能持续显著改进初始解。特别是,DASH改进的幅度和持续时间随问题难度增加而扩大。

英文摘要

Subset selection problems as MIPs (Mixed Integer Programs) are NP-hard. For large scale problems, it is infeasible to find global optimal solutions in a reasonable time and good-quality incumbent solutions are sought after with MIP solvers in practice. This paper proposes DASH (Decreasing Active Set Hierarchy) -- a dimensionality reduction method that improves the MIP solver performance for a subclass of best subset selection problems that can be formulated as MIQPs (Mixed Integer Quadratic Programs). We develop and evaluate the performance of DASH in the subset portfolio selection problem with comparison to Gurobi, a commercial MIP solver. In addition to the problem size, the difficulty of a problem is related to the condition number of the covariance matrix and the box constraint on portfolio weights. An extensive set of numerical experiments with varying problem configurations shows that DASH offers consistent and significant improvement of incumbent solutions when the problem is difficult to solve by Gurobi. In particular, the magnitude and duration of improvement by DASH scale with the difficulty of the problem.

2606.20114 2026-06-19 stat.ME stat.AP 新提交

Community detection in small-sample ordinal regimes: A benchmarking framework for Delphi data

小样本有序情境下的社区检测:德尔菲数据的基准测试框架

Yuri Calleo, Simone Di Zio, Fabrizio Maturo

AI总结 针对德尔菲数据高维小样本导致的秩亏问题,提出从变量中心协方差模型转向网络中心连接模型,利用社区检测算法识别潜在主题结构,实现结构稳定的降维。

详情
AI中文摘要

德尔菲数据共识的统计建模面临一个关键瓶颈:问卷项目的高维性与专家小组有限样本量之间的矛盾。这种秩亏导致传统潜变量模型(如主成分分析)结构不稳定且易过拟合。为弥补这一方法论空白,本研究提出从变量中心协方差模型转向网络中心连接模型。通过将项目相关性映射到加权图拓扑,我们提出了一个基于模拟的基准测试,利用社区检测算法识别潜在主题结构,有效解决了高维小样本情境下典型的谱不稳定性和秩亏问题。该研究系统评估了基于结构密度、信息流和谱划分的拓扑方法在合成数据集上的鲁棒性,这些数据集旨在复制共识数据的病理条件,包括有序量表和系统噪声。核心方法论贡献在于证明专家判断间的共线性——传统上被视为需要正则化的统计冗余——可以有效地重新解释为凝聚的拓扑信号。该框架为研究人员提供了一种结构化的自动降维程序,确保即使在标准因子分析失效的小样本情境下也能保持结构稳定性和心理测量一致性。

英文摘要

The statistical modeling of consensus in Delphi data faces a critical bottleneck: the high dimensionality of questionnaire items relative to the limited sample size of expert panels. This rank deficiency leads traditional latent variable models, such as Principal Component Analysis, to be structurally unstable and prone to overfitting. Addressing this methodological gap, this study proposes a transition from variable-centric covariance models to network-centric connectivity models. By mapping item correlations onto a weighted graph topology, we present a simulation-based benchmark that utilizes community detection algorithms to identify latent thematic structures, effectively addressing the spectral instability and rank deficiency typical of high-dimensional, low-sample-size regimes. The research systematically evaluates the robustness of topological approaches based on structural density, information flow, and spectral partitioning against synthetic datasets designed to replicate the pathological conditions of consensus data, including ordinal scales and systemic noise. The central methodological contribution lies in demonstrating that collinearity among expert judgments - traditionally treated as statistical redundancy to be regularized - can be effectively reinterpreted as a topological signal of cohesion. This framework provides researchers with a structured and automated procedure for dimensionality reduction, ensuring structural stability and psychometric consistency even in small-sample regimes where standard factor analysis breaks down.

2606.20078 2026-06-19 stat.OT 新提交

A Law of Iterated Expectation Primer for Causal Inference

因果推断中的迭代期望定律入门

Ashley I. Naimi, Razieh Nabi, Lindsay J. Collin, Paul N. Zivich, Stephen R. Cole

AI总结 本文介绍迭代期望定律及其在因果效应识别中的应用,通过g公式的两种非参数等价形式(NICE和ICE)和三个数值示例阐明其数学直觉。

详情
AI中文摘要

g公式是识别观察数据中因果效应的基础工具,它基于迭代期望定律——统计学中的一个关键数学恒等式。然而,表达迭代期望定律和g公式的符号对于统计背景不足的人来说可能难以理解。我们提供了一篇入门文章,介绍迭代期望定律、用于表达它的积分符号,以及它通过g公式在因果效应识别中的作用。在因果一致性、正性和条件可交换性假设下,迭代期望定律可以重写为因果标准化公式(g公式),有两种非参数等价形式:非迭代条件期望(NICE)形式,涉及条件结果均值的单一加权平均;以及迭代条件期望(ICE)形式,涉及嵌套期望。我们通过三个逐步复杂的数值示例说明这两种形式:一个时间固定示例,包含单个二元混杂因子;一个时间固定示例,包含离散和连续混杂因子;以及一个时间变化示例,包含两个时间点。我们阐明了迭代期望定律是什么,它与g公式的关系,以及如何在实际数据示例中理解其数学公式的直觉,这些示例可以推广到各种场景。

英文摘要

The g-formula is a foundational tool for identifying causal effects in observational data. This tool is based on the law of iterated expectation, a key mathematical identity in statistics. However, the notation with which the law of iterated expectation and the g-formula is expressed can be opaque to those with little background in statistics. We provide a primer introducing the law of iterated expectation, the integration notation used to express it, and its role for causal effect identification via the g-formula. Under the assumptions of causal consistency, positivity, and conditional exchangeability, the law of iterated expectation can be rewritten as a causal standardization formula (the g-formula) in two nonparametrically equivalent forms: a non-iterative conditional expectation (NICE) form involving a single weighted average of conditional outcome means, and an iterative conditional expectation (ICE) form involving nested expectations. We illustrate both forms using three progressively complex numerical examples: a time-fixed example with a single binary confounder, a time-fixed example with discrete and continuous confounders, and a time-varying example with two timepoints. We provide clarity on what the law of iterated expectation is, how it is related to the g-formula, and how to gain intuition of its mathematical formulations in actual data examples that can be generalized to a range of settings.

2606.20069 2026-06-19 stat.ME 新提交

A minimum-risk and cost-efficient two-sample sequential testing framework for the shifted exponential models with application to precipitation data

移位指数模型的最小风险与成本高效双序贯检验框架及其在降水数据中的应用

Ashwani Rajput, Neeraj Joshi

AI总结 提出一种双序贯抽样框架,通过控制第一类错误概率并最小化包含第二类错误和抽样成本的损失函数,检验两个移位指数模型的位置参数差异,具有一阶、二阶效率和风险效率。

详情
AI中文摘要

本文通过一种新颖的双序贯抽样框架,研究了比较两个移位指数模型位置参数的问题。所提出的假设检验过程通过将第一类错误概率控制在预设水平,同时最小化包含第二类错误概率和相应抽样成本的损失函数来开发。相应的最优固定样本量表达式依赖于未知的尺度参数,这使得在固定样本设计下,期望的检验精度在实践中无法实现。为克服这一困难,提出了一种双序贯抽样程序,用于在尺度参数未知且不等时检验位置参数之间的差异。所提出的方法具有理想的新近性质,包括一阶效率、二阶效率和二阶风险效率。广泛的模拟研究和涉及气象站强降水事件的实际数据应用证明了所提出程序的实际有效性和适用性。

英文摘要

This paper investigates the problem of comparing the location parameters of two shifted exponential models through a novel double sequential sampling framework. The proposed hypothesis testing procedure is developed by controlling the type I error probability at a preassigned level while minimizing a loss function that incorporates both the type II error probability and the associated sampling cost. The corresponding optimal fixed-sample-size expressions are shown to depend on unknown scale parameters, rendering the desired testing accuracies unattainable in practice under fixed-sample designs. To overcome this difficulty, a double sequential sampling procedure is proposed to test the difference between location parameters when the scale parameters are unknown and unequal. The proposed methodology is shown to possess desirable asymptotic properties, including first-order efficiency, second-order efficiency, and second-order risk efficiency. Extensive simulation studies and a real-data application that involves heavy precipitation episodes at meteorological stations demonstrate the practical effectiveness and applicability of the proposed procedure.

2606.19982 2026-06-19 stat.ME 新提交

Built-in Selection Bias in Proportional Hazards Models with Omitted Covariates: Simulation Evidence and Alternative Approaches

省略协变量的比例风险模型中的内置选择偏倚:模拟证据与替代方法

Ayoub Bifenzi, Helene Jacqmin-Gadda

AI总结 本文通过模拟和实际数据,证明在随机试验中,即使省略的协变量与处理独立,仍会导致Cox比例风险模型估计的处理风险比存在偏倚,并比较了脆弱模型、加速失效时间模型和Kaplan-Meier曲线等替代方法的稳健性。

详情
AI中文摘要

在时间-事件分析中,来自Cox比例风险(PH)模型的风险比(HR)是评估治疗效果最常用且广泛报告的指标。然而,由于风险比固有地依赖于每个时间点的生存条件,它们具有非可压缩性。因此,当存在因省略重要协变量导致的未测量异质性时,即使这些协变量在基线时与主要暴露独立(如随机对照试验中),风险比也会受到内置选择偏倚的影响。本文旨在概述文献中关于未观测异质性(由影响结局的省略协变量引起)如何在标准比例风险模型中偏倚治疗风险比估计的关键发现,即使在处理分配独立于这些协变量的随机试验中也是如此。通过模拟,我们评估了半参数Cox PH模型和参数PH模型在各种未测量异质性场景下的偏倚程度。然后,我们将这些标准模型与替代方法进行比较,这些方法要么解决了这一问题,要么被认为对此具有稳健性。这些替代方法包括来自脆弱模型的风险比、来自加速失效时间(AFT)模型的回归参数,以及使用Kaplan-Meier曲线非参数估计或基于具有时变暴露效应的Cox模型估计的治疗组间生存差异。我们通过一个来自放射治疗肿瘤学组(RTOG 9202)的随机对照试验的实际数据应用,说明了所探索替代方法的实际相关性。

英文摘要

In time-to-event analysis, the hazard ratio (HR) derived from the Cox proportional hazards (PH) model is the most commonly used and widely reported measure for assessing treatment effects. However, hazard ratios are non-collapsible due to their inherent conditioning on survival up to each time point. As a result, they are subject to built-in selection bias in the presence of unmeasured heterogeneity arising from omitted important covariates, even when these covariates are independent of the main exposure at baseline, as is the case in randomized controlled trials. This article aims to provide an overview of key findings from the literature on how unobserved heterogeneity, due to omitted covariates that affect the outcome, can bias the estimation of the treatment hazard ratio in standard proportional hazards models, even in randomized trials where treatment is assigned independently of such covariates. Through simulations, we evaluate the extent of bias in the semi-parametric Cox PH model and parametric PH model under various scenarios of unmeasured heterogeneity. We then compare these standard models to alternative approaches that either account for this issue or are considered robust to it. These alternatives include the hazard ratio estimated from frailty models, regression parameters from an Accelerated Failure Time (AFT) model, and survival differences between treatment groups estimated nonparametrically using Kaplan-Meier curves or based on a Cox model with time-dependent effect of the exposure. We illustrate the practical relevance of the explored alternatives through a real data application to a randomized controlled trial from the Radiation Therapy Oncology Group (RTOG 9202).

2606.19892 2026-06-19 stat.ME 新提交

The Ghosh-Lin and Fine-Gray models for a mix of administrative and random censoring

混合行政删失与随机删失下的Ghosh-Lin和Fine-Gray模型

Thomas H. Scheike, Christian Mirian, Isao Yokota, Giuliana Cortese

AI总结 针对同时存在行政删失和随机删失的数据,提出结合风险集调整和逆概率删失加权的方法,使Ghosh-Lin和Fine-Gray模型得到一致估计。

详情
AI中文摘要

复发事件或竞争风险回归模型通常应用于生物医学领域,两者都可视为边际模型。在存在右删失的情况下,需要调整这些模型以获得一致估计量。当删失是行政性时,边际回归模型特别容易估计。然而,当删失是随机作用时,通常考虑逆概率删失加权(IPCW)调整来获得参数估计。该技术通过正确的删失模型进行删失权重调整,但对于行政删失,只需修改风险集即可正确调整。在实践中,对于大型中央登记处或某些临床试验,所有受试者的行政删失时间已知,但通常也会有一定比例的受试者被随机删失。在这项工作中,我们考虑两种常用的回归方法:用于带有终止事件的复发事件的Ghosh-Lin模型和用于竞争事件的Fine-Gray模型。对于这两种情况,当同时存在行政删失和随机删失时,我们展示了如何通过处理这两种不同类型删失的组合,在最小化建模假设的基础上获得正确估计。

英文摘要

Recurrent events or competing risks regression models are often applied in the bio-medical setting and both can be considered as marginal models. In presence of right-censoring, such models need to be adjusted to give consistent estimators. When censoring is administrative, marginal regression models are particularly easy to estimate. However, when censoring is instead acting randomly, inverse probability of censoring weighting (IPCW) adjustments are typically considered to obtain parameter estimates. This technique relies on a censoring-weights adjustment via a correct censoring model, but for administrative censoring the adjustment is done correctly simply by modifying the risk-set. In practice for large central registries or some clinical trials, the administrative censoring time will be known for all subjects, but there will typically also be a proportion of subjects that are censored at random. In this work, we consider two frequently used regression approaches, the Ghosh-Lin model for recurrent events with terminal events and the Fine-Gray model for competing events. For these two settings, when both administrative and random censoring are present, we demonstrate how to obtain correct estimation by dealing with the combination of the two different types of censoring relying on a minimum of modeling assumptions.

2606.19760 2026-06-19 stat.AP 新提交

Covariate-Adjusted Functional Principal Components Analysis for Modeling Hazard Rates of Physical Activity in the US Population

协变量调整的功能主成分分析用于建模美国人口体力活动的风险率

Md Rokibul Hasan, Pratim Guha Niyogi

AI总结 提出基于风险函数的分布分析方法,利用功能主成分分析(FPCA)从腕部加速度计数据中刻画个体活动强度分布变异,优于均值摘要。

详情
AI中文摘要

体力活动在人类健康中起着至关重要的作用。其整体分布因人而异。常用的汇总指标无法描述这种分布模式。我们提出了一种基于分布的分析方法,通过从腕部加速度计数据中导出的风险函数来建模个体活动强度模式,从而描述体力活动。我们分析了2011-2012年国家健康与营养调查(NHANES)中4297名连续佩戴设备7天的成年人的分钟级独立于监测器的运动摘要(MIMS)数据。我们使用基于生存的方法为每个个体在共同强度网格上导出了非参数活动强度风险,将MIMS的风险曲线及其对数变换后的MIMS都视为功能对象。我们在MIMS的两个尺度上使用功能主成分分析(FPCA)来表征活动强度分布的主要变异模式。组均值风险函数在低强度水平上差异很小,而在高强度水平上我们观察到显著差异。我们的结果表明,基于风险的功能表示方法能够捕捉个体间体力活动强度分布的差异,提供了一种灵活且可解释的方式来表征异质性。该方法优于基于均值的摘要,并支持对人口亚组之间体力活动模式进行有原则的比较。

英文摘要

Physical activity plays a vital role in human health. Its entire distribution differs among people. Commonly used summary measures cannot describe this distributional pattern. We present a distribution-based analytical approach to describe physical activity by modeling individual-level activity-intensity patterns through hazard functions derived from wrist-worn accelerometer data. We analyzed minute-level Monitor-Independent Movement Summary (MIMS) data of 4297 adults with seven continuous days of device wear from the 2011- 2012 National Health and Nutrition Examination Survey (NHANES). We derived a nonparametric activity-intensity hazard using a survival-based approach for each individual on a common intensity grid, treating both the hazard curves from MIMS and their log-transformed MIMS as functional objects. We used functional principal component analysis (FPCA) on both scales of MIMS to characterize dominant modes of variation in activity-intensity distributions. Group-wise mean hazard functions showed little difference at lower intensity levels, while we observed a substantial difference at higher intensity levels. Our results demonstrate that hazard-based functional representations for capturing differences in physical activity intensity distributions across individuals offer a flexible and interpretable way to characterize heterogeneity. This approach works better than mean-based summaries and supports principled comparisons of physical activity patterns across population subgroups.

2606.19743 2026-06-19 stat.ME stat.AP 新提交

A Bayesian spatio-temporal nearest neighbor Gaussian process model for pooled genetic data

一种用于汇总遗传数据的贝叶斯时空最近邻高斯过程模型

Imke Botha, Tianxiao Hao, Lucinda E. Harrison, Nick Golding, Daniel J. Weiss, Jennifer A. Flegg

AI总结 提出最近邻高斯过程模型,结合序贯蒙特卡洛平方算法,高效推断汇总遗传数据中的单倍型频率,并应用于非洲抗疟药物耐药性遗传数据分析。

详情
AI中文摘要

大规模遗传数据集通常汇总不同遗传标记的总等位基因计数。从这些汇总数据中推断单倍型频率(即多标记等位基因的频率)是一个挑战。由于计算成本,先前在此背景下的时空建模仅限于3个标记。在这项工作中,我们提出了一种最近邻高斯过程(NNGP)模型,以改善随标记和观测数量扩展的规模。为了推断模型参数,我们开发了一种新颖的序贯蒙特卡洛平方算法,该算法使用带有祖先抽样的粒子吉布斯来变异NNGP函数值。后者在观测数量和NNGP数量上具有线性成本,并可应用于广泛的NNGP模型。作为案例研究,我们分析了与非洲抗疟药物耐药性相关的遗传数据,并在3和6个遗传标记数据集上实证展示了我们的扩展结果。

英文摘要

Large scale genetic datasets often aggregate the total allele counts of distinct genetic markers. Inferring haplotype frequencies (i.e.\ the frequency of multimarker alleles) from these pooled data is a challenge. Previous spatio-temporal modelling in this context has been limited to 3 markers due to the computational cost. In this work, we propose a nearest neighbor Gaussian process (NNGP) model to improve scaling with the number of markers and observations. To infer the parameters of our model, we develop a novel sequential Monte Carlo squared algorithm, which uses particle Gibbs with ancestor sampling to mutate the NNGP function values. The latter has a linear cost in the number of observations and the number of NNGPs, and can be applied to a broad range of NNGP models. As a case study, we analyse genetic data relating to antimalarial drug resistance in Africa, and show our scaling results empirically on a 3 and 6 genetic marker dataset.

2606.19737 2026-06-19 stat.ME stat.ML 新提交

Calibration without labels in multiple testing

多重检验中的无标签校准

Adway S. Wadekar, Jake A. Soloff

AI总结 针对多重检验中无法观测真实标签的难题,利用有序p值间距构造伪标签,实现局部错误发现率的校准,并揭示q值在心理学和神经科学文献中可能严重失准。

详情
AI中文摘要

大规模假设检验支持对单个假设的概率性声明,如经验贝叶斯方法估计局部错误发现率。我们研究如何将这些声明解释为原假设的近似校准预测,即使在模型误设定下也能产生可解释的错误概率。我们的方法从概率预测中汲取概念灵感,但面临不同的挑战:与预测不同(标签最终可观测),在多重检验中真实情况从未揭示,因此校准必须随机评估并间接建立。我们通过构造一组伪标签来应对这一挑战,这些伪标签源自有序$p$值的间距,并以局部错误发现率作为回归目标。我们的构造解锁了现有工具,用于评估和执行多重检验中的事后校准。值得注意的是,我们在对已发表的心理学和神经科学文献的大规模实证调查中发现,基于错误发现率的流行误差度量$q$值可能严重失准。

英文摘要

Large-scale hypothesis testing supports probability claims about individual hypotheses, as in empirical Bayes methods for estimating local false discovery rates. We study how such claims can be interpreted as approximately calibrated forecasts of the null hypothesis, yielding interpretable error probabilities even under model misspecification. Our approach draws conceptual inspiration from probabilistic forecasting but addresses a different challenge: unlike forecasting, where labels are eventually observed, in multiple testing the ground truth is never revealed, so calibration must be assessed stochastically and established indirectly. We address this challenge by constructing a set of pseudo-labels, derived from the spacings of ordered $p$-values, which have the local false discovery rate as their regression target. Our construction unlocks existing tools for assessing and performing post-hoc calibration in multiple testing. Notably, we find on a large-scale empirical survey of published psychology and neuroscience literature that the $q$-value, a popular error measure based on the false discovery rate, can be severely miscalibrated.

2606.19580 2026-06-19 stat.ME stat.ML 新提交

Machine Learning Integrated in Wavelet Shrinkage (MLShrink)

机器学习集成小波收缩 (MLShrink)

Dixon Vimalajeewa, Vijini Lakmini, Brani Vidakovic

AI总结 提出MLShrink,结合小波收缩与机器学习,通过双阈值对中间带系数进行数据自适应分类,保留经典阈值简单性,理论证明其非扩张性和oracle一致性,在非平滑信号上表现优异。

详情
AI中文摘要

实践中遇到的数据经常被加性噪声污染,小波收缩仍是非参数估计中恢复潜在信号的基本工具。经典方法如硬阈值和软阈值几乎完全根据系数的大小决定是否保留。尽管在许多情况下有效,这些规则对于幅度落在信号与噪声区分不确定的中间区域的系数可能过于僵化。我们提出MLShrink,一种将小波收缩与机器学习相结合的双阈值小波去噪过程。低于下阈值的系数被丢弃,高于上阈值的系数被保留,中间带的系数使用局部小波域特征进行分类。这样,MLShrink在远离决策边界处保留了经典阈值的简单性,同时允许对模糊系数进行数据自适应决策。本文还为此架构开发了一个理论框架。我们证明MLShrink是一个非扩张的支持选择规则,推导出一个基于oracle的风险分解,表明多余的去噪风险由未决策带上的分类误差决定,并在分类器性能的适当假设下建立了oracle一致性结果。在标准基准信号上的模拟实验表明,MLShrink与几种已建立的小波收缩方法具有竞争力,尤其适用于具有不规则、边缘丰富或非平滑结构的信号。这些发现表明,中间阈值带上的学习决策为经典小波去噪与现代统计学习之间提供了有用且可解释的联系。

英文摘要

Data encountered in practice are frequently contaminated by additive noise, and wavelet shrinkage remains a fundamental tool for recovering underlying signals in nonparametric estimation. Classical procedures such as hard and soft thresholding decide whether to retain a wavelet coefficient almost entirely from its magnitude. Although effective in many settings, these rules can be too rigid for coefficients whose magnitudes fall in an intermediate region where the distinction between signal and noise is uncertain. We propose MLShrink, a two-threshold wavelet denoising procedure that combines wavelet shrinkage with machine learning. Coefficients below a lower threshold are discarded, coefficients above an upper threshold are retained, and coefficients in the intermediate band are classified using local wavelet-domain features. In this way, MLShrink preserves the simplicity of classical thresholding away from the decision boundary while allowing data-adaptive decisions for ambiguous coefficients. The paper also develops a theoretical framework tailored to this architecture. We show that MLShrink is a nonexpansive support-selection rule, derive an oracle-based risk decomposition showing that excess denoising risk is determined by classification errors on the undecided band, and establish an oracle-consistency result under suitable assumptions on classifier performance. Simulation experiments on standard benchmark signals indicate that MLShrink is competitive with several established wavelet shrinkage methods and is especially effective for signals with irregular, edge-rich, or non-smooth structure. These findings suggest that learned decisions on the intermediate threshold band provide a useful and interpretable connection between classical wavelet denoising and modern statistical learning.

2606.19572 2026-06-19 stat.ME 新提交

SCOPE Shrinkage: A Unified Framework for Wavelet Denoising

SCOPE 收缩:小波去噪的统一框架

Dixon Vimalajeewa, Vijini Lakmini, Malith Premarathna, Fabrizio Ruggeri, Brani Vidakovic

AI总结 提出基于对称单峰分布累积分布函数的SCOPE收缩族,通过两个可解释参数分离尺度与形状效应,实现局部强收缩与渐近无偏的平衡,在小波去噪中性能与可解释性兼具。

详情
AI中文摘要

我们引入了对称CDF导向概率增强(SCOPE)收缩,这是一个由对称单峰分布的中心累积分布函数构造的保号收缩规则统一族。所提出的框架生成了一类广泛的衰减轮廓,在零点附近强局部收缩与尾部渐近无偏行为之间插值。我们开发了一个通用公式,通过两个可解释参数分离尺度与形状效应,从而能够独立控制有效的阈值位置和过渡锐度。在明确的规律性假设下,建立了SCOPE收缩的结构性质,包括奇性、单调性、连续性、收缩性以及将规则与软化阈值算子联系起来的混合表示。还发展了贝叶斯和惩罚似然解释:SCOPE规则允许偶惩罚表示,该表示在系数幅度上非递减,并且合适的子类在适当的对称单峰先验下作为精确的最大后验估计出现。基于逻辑分布、均匀分布和柯西分布的代表性例子说明了概率形状如何控制收缩行为。通过Stein型无偏风险估计讨论了光滑子类的数据驱动参数选择。在标准Donoho-Johnstone测试函数上的Oracle校准模拟研究表明,SCOPE收缩与几种已建立的小波去噪方法相比具有竞争力,同时保持了高度的可解释性和结构灵活性。结果突出了中心分布函数作为小波去噪及相关估计问题中收缩的自然且通用的设计原则。

英文摘要

We introduce Symmetric CDF Oriented Probability Enhanced (SCOPE) shrinkage, a unified family of sign-preserving shrinkage rules constructed from centered cumulative distribution functions of symmetric unimodal distributions. The proposed framework generates a broad class of attenuation profiles that interpolate between strong local shrinkage near zero and asymptotically unbiased behavior in the tails. A general formulation is developed that separates scale and shape effects through two interpretable parameters, allowing effective threshold location and transition sharpness to be controlled independently. Under explicit regularity assumptions, structural properties of SCOPE shrinkage are established, including oddness, monotonicity, continuity, contractivity, and a mixture representation that connects the rules to softened thresholding operators. A Bayesian and penalized likelihood interpretation is also developed: SCOPE rules admit even penalty representations that are nondecreasing in coefficient magnitude, and suitable subclasses arise as exact maximum a posteriori estimators under proper symmetric unimodal priors. Representative examples based on logistic, uniform, and Cauchy distributions illustrate how probabilistic shape governs shrinkage behavior. Data driven parameter selection for smooth subclasses is discussed via Stein-type unbiased risk estimation. Oracle calibrated simulation studies on standard Donoho-Johnstone test functions show that SCOPE shrinkage performs competitively with several established wavelet denoising methods, while retaining a high degree of interpretability and structural flexibility. The results highlight centered distribution functions as a natural and versatile design principle for shrinkage in wavelet denoising and related estimation problems.

2606.19540 2026-06-19 stat.ME stat.CO stat.ML 新提交

Overfitted high-dimensional matrix factorizations via adaptive spectral shrinkage

通过自适应谱收缩的过拟合高维矩阵分解

Lorenzo Mauri, David B. Dunson

AI总结 提出EigenBayes方法,通过谱估计和自适应经验贝叶斯校准超参数,实现快速且具有不确定性量化的过拟合因子模型,在数值实验和基因组学应用中优于现有方法。

详情
AI中文摘要

因子模型是分析高维数据以提取低秩信号和估计协方差的常用方法。它们将协方差矩阵分解为低秩分量和对角分量之和。一个关键问题是如何选择潜在维度$k$,当因子模型仅近似成立且信噪比较低时,这尤其具有挑战性。贝叶斯过拟合因子模型指定$k$的上界,并依赖结构化收缩先验有效去除多余分量。这类方法流行且有效,但计算成本高。我们提出了一种更快的\texttt{EigenBayes}方法,基于潜在因子的谱估计和关键超参数的自适应经验贝叶斯校准,提供有效的不确定性量化。得到的后验分布可跨结果分解且解析可处理,绕过了马尔可夫链蒙特卡洛。我们证明\texttt{EigenBayes}能适应每个结果和潜在维度的信噪比,同时将多余的潜在分量收缩至零。我们建立了良好的渐近性质,并在数值实验和基因组学应用中展示了强大的实证性能,其中EigenBayes优于最先进的替代方法。

英文摘要

Factor models are popular approaches for analyzing high-dimensional data to extract low-rank signals and estimate covariances. They decompose the covariance matrix as the sum of low-rank and diagonal components. A key issue is how to choose the latent dimension $k$, which is particularly challenging when the factor model only holds approximately and in low signal-to-noise scenarios. Bayesian overfitted factor models specify an upper bound on $k$ and rely on structured shrinkage priors to effectively remove extra components. Such approaches are popular and effective, but computationally expensive. We propose a much faster \texttt{EigenBayes} approach that provides valid uncertainty quantification, based on spectral estimation of latent factors and adaptive empirical Bayes calibration of key hyperparameters. The resulting posterior distribution factorizes across outcomes and is analytically tractable, bypassing Markov chain Monte Carlo. We show that \texttt{EigenBayes} adapts to the signal-to-noise ratio of each outcome and latent dimension, while shrinking superfluous latent components to zero. We establish favorable asymptotic properties and demonstrate strong empirical performance in numerical experiments and a genomics application, where EigenBayes outperforms state-of-the-art alternatives.

2606.20420 2026-06-19 q-fin.CP stat.AP 新提交

Advanced Calibration Analysis and Tools: Identifying Influential Observations in Stochastic Interest Rate Model Calibration

高级校准分析与工具:识别随机利率模型校准中的有影响观测值

Philipp Mahler, Peter Ruckdeschel

AI总结 将校准问题嵌入非线性回归理论,证明最小化RMSRE等价于加权最小二乘,开发诊断框架(加权帽子矩阵、影响函数、泛函Delta方法),实证发现杠杆边界主导、有效维度损失及2022年后参数稳定性转变,指出低RMSRE不足以验证校准。

Comments 47 pages, 9 figures, 1 table

详情
AI中文摘要

利率模型的准确校准对于市场一致性估值和经济情景生成器(ESGs)至关重要。多因子模型(如G2++模型)的传统校准方法通常依赖于点估计,忽略了特定市场数据的影响和估计不确定性的量化。本文开发了一个诊断框架,将校准问题嵌入非线性回归理论。研究表明,行业常见的均方根相对误差(RMSRE)最小化等价于加权最小二乘(WLS)问题。这一等价关系导出了诊断工具的相应公式,包括用于杠杆分析的加权帽子矩阵、用于局部敏感性诊断的影响函数,以及用于局部、边界置信区间的泛函Delta方法。实现中采用了高效的雅可比矩阵分解,利用了平价(ATM)上限的解析可处理性。该框架应用于2016-2025年期间的欧元ATM上限数据集。我们的实证分析揭示了边界主导的杠杆分布、由于参数约束活跃导致的重复有效维度损失,以及2022年后市场转型中局部参数稳定性的诊断机制转变。对精算模型治理的启示是:低RMSRE不足以验证校准。最后,我们讨论了该框架对一般最小二乘问题的适用性,同时指出了对于缺乏闭式梯度的工具(如互换期权)的计算挑战。

英文摘要

The accurate calibration of interest rate models is central to market-consistent valuation and Economic Scenario Generators (ESGs). Traditional calibration methods for multi-factor models such as the G2++ model often rely on point estimates, neglecting the influence of specific market data and the quantification of estimation uncertainty. This paper develops a diagnostic framework embedding the calibration problem into non-linear regression theory. It shows that the common industry practice of minimizing the Root Mean Squared Relative Error (RMSRE) is equivalent to a Weighted Least Squares (WLS) problem. This equivalence yields the corresponding formulations for diagnostic tools, including the Weighted Hat Matrix for leverage analysis, Influence Functions for local sensitivity diagnostics, and the Functional Delta Method for local, boundary-respecting confidence intervals. The implementation uses an efficient Jacobian factorization that exploits the analytical tractability of At-The-Money (ATM) caps. The framework is applied to a dataset of Euro ATM caps covering the period 2016--2025. Our empirical analysis reveals a boundary-dominated leverage profile, repeated losses of effective dimensionality due to active parameter constraints, and a diagnostic regime shift in local parameter stability around the post-2022 market transition. The resulting message for actuarial model governance is that low RMSRE is not sufficient for calibration validation. We conclude by discussing the framework's applicability to general least-squares problems while highlighting the computational challenges for instruments lacking closed-form gradients, such as swaptions.

2606.20079 2026-06-19 q-fin.RM 新提交

How to spot outliers: an Ensemble Anomaly Detection Framework

如何发现异常值:一种集成异常检测框架

Daniil Peysakhovich, Rafał Sieradzki

AI总结 针对风险估值输出中的异常问题,提出集成质量评估框架(EQAF),结合多种无监督异常检测方法,在信用衍生品数据上实现F1分数61-79%,优于最佳单一方法(6-66%),并揭示纯统计方法无法检测冻结馈送异常。

详情
AI中文摘要

由数据馈送失败、模型配置错误或系统故障引起的风险估值输出错误可能通过投资银行的风险基础设施未被检测地传播,并产生重大操作损失。利用一家全球大型投资银行涵盖129个交易日183笔交易的专有每日信用衍生品数据,我们设计、实施并实证评估了集成质量评估框架(EQAF),这是一种分层无监督架构,结合互补的异常检测方法,实时监控风险计算完整性。通过使用八种操作现实场景的受控异常注入协议,我们表明校准后的集成在四个不同风险度量数据集上实现了61-79%的F1分数,显著优于最佳单一方法(6-66%)。AUC-ROC提高4-6个百分点证实了这种优势对阈值选择具有鲁棒性。我们进一步证明,纯统计检测方法系统地无法识别冻结值异常,这是一类冻结馈送错误,其中估值输出与先前观测相同,因此与正常数据无法区分,并且领域特定的确定性规则在架构上是不可或缺的。这些发现对巴塞尔III和交易账簿基本审查(FRTB)下的模型风险管理具有直接影响,其中对内部风险模型的自动化和可审计质量控制要求日益增加。

英文摘要

Errors in risk valuation outputs arising from data-feed failures, model misconfiguration, or system malfunctions can propagate undetected through an investment bank's risk infrastructure and generate material operational losses. Using proprietary daily credit-derivatives data from a major global investment bank covering 183 trades across 129 trading days, we design, implement, and empirically evaluate the Ensemble Quality Assessment Framework (EQAF), a layered unsupervised architecture that combines complementary outlier-detection methods to monitor risk calculation integrity in real time. Using a controlled anomaly-injection protocol with eight operationally realistic scenarios, we show that the calibrated ensemble achieves F1 scores of 61-79%, substantially outperforming the best individual method (6-66%) across four distinct risk-measure datasets. Improvements of 4-6 percentage points in AUC-ROC confirm that this advantage is robust to threshold selection. We further demonstrate that purely statistical detection methods systematically fail to identify stale-value anomalies, a class of frozen-feed errors in which valuation outputs are identical to prior observations and therefore indistinguishable from normal data, and that domain-specific deterministic rules are architecturally indispensable. These findings have direct implications for model risk management under Basel III and the Fundamental Review of the Trading Book (FRTB), where automated and auditable quality controls for internal risk models are increasingly required.

2606.19846 2026-06-19 econ.GN q-fin.EC 新提交

What Capital After Labor? Forecasting the Talent ROI Transition in the Human-AI Era

劳动力之后是什么资本?预测人机时代的人才ROI转型

Kwan Soo Shin, In Seok Kang

AI总结 针对AI增强打破劳动时间与贡献的会计关联,本文构建从时间到产出的人才ROI预测框架,核心定理为ROI反转,并利用韩国52小时工作制案例验证了前期压力信号,预测产出型企业在2032年TFP增长领先1.5-2.0个百分点。

Comments 90 pages, 6 figures

详情
AI中文摘要

AI增强打破了劳动时间与生产贡献之间的会计联系,但企业仍通过基于时间的间接费用包来评估人才。本文开发了一个预测框架,用于在人机时代从基于时间的人才会计向基于产出的人才ROI转型。该框架以定理3(在τ*处的ROI反转)为实证主轴,包含四个机制定理:间接费用非加性、增强节省时间路径、创新溢价放大以及人机二元归因不确定性。韩国分阶段实施的52小时工作制规定提供了一个实证预警案例。在一个包含365家上市公司的DART面板数据(2281个公司-年观测值)中,SG&A与收入比率从2018年的18.26%上升至2020年的20.06%,在2021-2022年略有修正,并于2024年达到20.10%的峰值。在收入百分位队列代理下,双向固定效应(+1.56个百分点,p=0.049)、合并事件研究估计(t=+3时为+4.21个百分点,p=0.001)以及Callaway-Sant'Anna双重稳健交错DID估计(t=+4时为+4.51个百分点)收敛于一个正向间接费用压力特征。2015-2017年的向后扩展(224家公司,601个观测值)提供了预处理数据,提供了反对预先存在的上升趋势混杂因素的证据。我们将韩国证据解读为,据我们所知,第一个经验记录的τ*前间接费用压力制度特征,其中基于时间的会计仍占主导地位,而AI增强和劳动时间压缩共同推高了间接费用。预计到2032年,基于产出的公司在公司层面TFP增长上比基于时间的同行高出1.5-2.0个百分点。贡献在于为向AI增强的人才ROI会计转型提供了一个预测模型和管理规划工具。

英文摘要

AI augmentation breaks the accounting link between labor time and productive contribution, yet firms continue to evaluate talent through time-based overhead bundles. This paper develops a forecasting framework for the transition from time-based talent accounting to output-based talent ROI in the human-AI era. The framework centres on Theorem 3 (ROI Inversion at τ*) as the empirical spine, with four mechanism theorems: overhead non-additivity, augmentation-saved-time pathways, innovation-premium amplification, and human-AI dyad attribution uncertainty. Korea's staged 52-hour workweek mandate provides an empirical early-warning case. In a DART panel of 365 listed firms (2,281 firm-year observations), the SG&A-to-revenue ratio rose from 18.26 percent in 2018 to 20.06 percent in 2020, corrected mildly in 2021-2022, and peaked at 20.10 percent in 2024. Under the revenue-percentile cohort proxy, two-way fixed effects (+1.56 pp, p = 0.049), pooled event-study estimates (+4.21 pp at t = +3, p = 0.001), and Callaway-Sant'Anna doubly-robust staggered DiD estimates (+4.51 pp at t = +4) converge on a positive overhead-pressure signature. A 2015-2017 backward extension (224 firms, 601 observations) supplies pre-treatment data, providing evidence against pre-existing upward-trend confounds. We read the Korean evidence not as a direct τ* estimate or a point causal magnitude, but as, to our knowledge, the first empirically documented signature of the pre-τ overhead-pressure regime, where time-based accounting still dominates while AI augmentation and labor-time compression jointly raise overhead. Output-based firms are forecast to outperform time-based peers by 1.5-2.0 percentage points in firm-level TFP growth by 2032. The contribution is a forecasting model and managerial planning tool for the shift to AI-augmented talent ROI accounting.

2606.19550 2026-06-19 q-fin.GN q-fin.PR 新提交

Which Portfolios? The Construction Dependence of Factor Model Performance

哪些投资组合?因子模型表现的构建依赖性

Useong Shin

AI总结 研究发现因子模型表现高度依赖于测试资产的构建方式,如选股、初始加权、持有期和再平衡,其中买入持有策略偏好FF5和FF6,而每日恒定加权偏好FF3,且q5在因子跨度测试中夏普比率最高但定价误差较大。

详情
AI中文摘要

因子模型的表现不仅取决于模型本身,还取决于测试资产的构建方式。我们从广泛的CRSP范围内形成特征未排序的随机投资组合,并改变股票选择、初始加权、持有期和再平衡。排名发生实质性变化:买入持有策略偏好FF5和FF6,而每日恒定加权偏好FF3,这是跨设计最稳定的模型。尽管q5在因子跨度测试中达到了最高的最大夏普比率,但它对随机投资组合留下了相对较大且对构建敏感的定价误差。这些结果反映了每个模型定价误差向量的构建特定加权。因此,测试资产构建,包括动态权重管理,是模型评估中的一个设计选择。

英文摘要

Factor-model performance depends not only on the model but also on how test assets are constructed. We form characteristic-unsorted random portfolios from a broad CRSP universe and vary stock selection, initial weighting, holding, and rebalancing. Rankings shift materially: buy-and-hold favors FF5 and FF6, whereas daily constant-weighting favors FF3, the most stable model across designs. Although q5 attains the highest maximum Sharpe ratio in factor-spanning tests, it leaves comparatively large and construction-sensitive pricing errors on random portfolios. These results reflect construction-specific weighting of each model's pricing-error vector. Test-asset construction, including dynamic weight management, is therefore a design choice in model evaluation.

2606.19517 2026-06-19 q-fin.TR 新提交

Do Prediction Markets Match Option Prices? Bitcoin Threshold Evidence from Binance and Polymarket

预测市场是否与期权价格匹配?来自币安和Polymarket的比特币阈值证据

Victoria Portnaya

AI总结 本文通过比较Polymarket预测市场与币安期权隐含的比特币阈值合约价格,发现两者之间存在显著且持久的定价差距,平均约6.3个百分点,表明数字金融市场碎片化导致经济上相同的收益产生系统性定价偏差。

Comments 22 pages, 6 figures, 7 tables; JEL: G13, G14, G19

详情
AI中文摘要

金融市场的数字化产生了两类平台,它们原则上对相同的状态依存收益进行定价:中心化加密期权交易所和基于区块链的预测市场。本文首次提供了加密货币阈值合约的预测市场定价的期权隐含基准测试。在匹配样本的每个小时,我们将Polymarket的Yes价格与同一标的、行权价和到期日的上市币安看涨期权所隐含的贴现风险中性二元值进行比较,并研究两者之间的差距。在2023年9月的主要比特币合约中,平均定价差距为5.6个百分点(基于214个每小时观测值,t=6.46,p<10^{-9})。合并三个与币安兼容的比特币阈值市场,在287个观测值上得到平均差距为6.3个百分点,对HAC和块自举推断稳健。该差距是持久的——AR(1)半衰期约为四小时——但均值回归,这与分割场所之间缓慢的信息传递而非机械噪声一致。横截面回归显示,价差在期权隐含概率低和到期时间长时最大,这与预测市场合约的投机需求而非测量误差一致。在对冲套利代理中,在保守交易成本后仍保持盈利,但统计精度边际。在相同三个比特币合约上扩展至Deribit,合并差距更大,为11个百分点,而较小的以太坊练习则产生混合证据。结果表明,数字金融市场碎片化导致经济上相同的收益产生系统性、持久的定价偏差。

英文摘要

The digitization of financial markets has produced two classes of platforms that price, in principle, the same state - contingent payoffs: centralized crypto-option exchanges and blockchain-based prediction markets. This paper provides the first option-implied benchmark test of prediction-market pricing for cryptocurrency threshold contracts. For each hour in a matched sample, we compare the Polymarket Yes price with the discounted risk-neutral binary value implied by a listed Binance call option on the same underlying, strike, and maturity, and study the gap between them. In the main September 2023 Bitcoin contract, the mean pricing gap equals 5.6 percentage points across 214 hourly observations (t = 6.46, p < 10^{-9}). Pooling three Binance-compatible Bitcoin threshold markets yields a mean gap of 6.3 percentage points across 287 observations, robust to HAC and block-bootstrap inference. The gap is persistent - with an AR(1) half-life of roughly four hours - yet mean-reverting, consistent with slow information transmission between segmented venues rather than mechanical noise. Cross-sectional regressions reveal that the wedge is largest at low option-implied probabilities and long maturities, a pattern consistent with speculative demand for prediction-market contracts rather than measurement error. A delta-hedged arbitrage proxy remains profitable after conservative transaction costs, though with marginal statistical precision. A Deribit extension on the same three Bitcoin contracts produces a larger pooled gap of 11 percentage points, while a smaller Ethereum exercise yields mixed evidence. The results demonstrate that digital fragmentation of financial markets generates systematic, persistent pricing wedges even for economically identical payoffs.

2606.19762 2026-06-19 q-bio.MN 新提交

Oscillations and Spatial Patterns in Large-Scale Stochastic Gene Regulatory Networks

大规模随机基因调控网络中的振荡与空间模式

Manuel Eduardo Hernández-García, Jorge Velázquez-Castro

AI总结 研究负反馈与扩散的循环基因调控网络,通过确定性和随机方法分析其稳定性,发现随机波动可诱导图灵失稳,为理解发育中的模式形成提供新视角。

Comments 16 pages, 10 figures

详情
AI中文摘要

基因调控网络(GRNs)是细胞生长和组织形成的基础,在发育过程中协调基因表达的时空调控。这些网络固有地受到分子噪声引起的内在波动的影响,因此分析其稳定性对于理解生物体稳健的模式形成和发育动力学至关重要。在本研究中,我们分析了具有负反馈和扩散的循环GRNs的稳定性和动力学,考虑了确定性和随机方法。在确定性情况下,系统表现出稳定性与不稳定性之间的分岔,导致无扩散时的Hopf失稳和包含扩散时的Turing-Hopf失稳。观察到空间域的离散化引入了额外的不稳定模式,从而允许更广泛的模式。基于二阶矩方法的随机框架包含了内在波动,揭示了对于小系统尺寸,即使系统在无扩散时是稳定的,波动也可以主导动力学并诱导随机Turing失稳。值得注意的是,即使所有变量具有相同的扩散速率,Turing失稳也可能出现。所开发的框架提供了一种系统的方法来分析具有扩散的高维随机系统的稳定性,从而简化了Turing和Turing-Hopf失稳的预测。这些发现有助于更深入地理解GRNs中的复杂动力学和模式形成,对细胞分化和发育等生物过程具有潜在意义。

英文摘要

Gene regulatory networks (GRNs) are fundamental to cellular growth and tissue formation, orchestrating spatially and temporally regulated gene expression during development. These networks are inherently subject to intrinsic fluctuations arising from molecular noise, making the analysis of their stability essential for understanding robust pattern formation and developmental dynamics of the organism. In this study, we analyze the stability and dynamics of cyclic GRNs with negative feedback and diffusion, considering both deterministic and stochastic approaches. In the deterministic case, the system exhibits a bifurcation between stability and instability, leading to Hopf instability in the absence of diffusion and to Turing-Hopf instability when diffusion is included. It was observed that the discretization of the spatial domain introduces additional unstable modes, enabling a wider range of patterns. The stochastic framework based on the second-moment approach, which incorporates intrinsic fluctuations, reveals that for small system sizes, fluctuations can dominate the dynamics and induce stochastic Turing instability, even when the system is stable in the absence of diffusion. Notably, Turing instabilities can emerge even when all variables have the same diffusion rate. The developed framework provides a systematic method for analyzing the stability of high-dimensional stochastic systems with diffusion, thereby simplifying the prediction of Turing and Turing-Hopf instabilities. These findings contribute to a deeper understanding of the complex dynamics and pattern formation in GRNs, with potential implications for biological processes, such as cellular differentiation and development.

2606.19739 2026-06-19 q-bio.NC 新提交

Robust probabilistic measurement of structural-functional module consistency in infant brain development

婴儿大脑发育中结构-功能模块一致性的鲁棒概率测量

Lingbin Bian, Feihong Liu, Qian Wang, Han Zhang, Dinggang Shen, the UNC/UMN Baby Connectome Project Consortium

AI总结 提出基于随机模块的概率方法,鲁棒测量婴儿大脑结构-功能模块一致性,发现0-5岁间一致性下降,初级脑区一致性更高。

详情
AI中文摘要

脑网络通常被划分为模块,用于分析其在神经影像学研究的群体分析中功能分离的角色。这里,我们引入脑网络中的随机模块,用于在受试者群体中对结构-功能模块一致性(SFMC)进行鲁棒的概率测量。具体而言,随机模块可被视为一个脑区在受试者间可能被分配到群体级子网络的机会,其特征为该脑区的分配概率。这种新方法在评估脑网络中的非均匀模块方面有两个优势。首先,它可以鲁棒地评估脑结构模块与功能模块之间的一致性,而两者的群体规模不必相同;其次,它能够考虑群体中模块的个体间变异性。此外,与传统的结构-功能耦合方法相比,我们的基于随机模块的方法揭示了结构与功能之间耦合的更显著下降,表明更强的发育重组。我们使用婴儿连接组项目(BCP)数据集的结果显示,SFMC在0至5岁期间下降,并且在初级脑区(如视觉区域)较高,而在更高级的认知区域(包括与注意力、控制和默认模式网络相关的区域)较低。

英文摘要

Brain network is commonly divided into modules for analyzing their functionally segregated roles for group-level analysis in neuroimaging studies. Here, we introduce stochastic modules within brain networks for a robust probabilistic measurement of structural-functional module consistency (SFMC) in a group of subjects. Specifically, a stochastic module can be regarded as the chance of a brain region across subjects potentially being assigned to a group-level sub-network, characterized as an assignment probability for this brain region. This novel method has two advantages for evaluating inhomogeneous modules in brain networks. The first is that it can robustly evaluate the consistency between brain structural and functional modules whose population sizes are not necessary the same, and the second is that it is able to take into account the inter-individual variability of the modules for the groups. Moreover, compared with the conventional structural-functional coupling approach, our stochastic module-based method reveals a more pronounced decline in the coupling between structure and function, indicating stronger developmental reorganization. Our results using the dataset from Baby Connectome Project (BCP) show that the SFMC decreases from 0 to 5 years old, and is greater in primary brain regions, such as visual areas, while lower in more advanced cognitive regions, including those related to attention, control, and default mode network.

2606.19396 2026-06-19 q-bio.QM 新提交

BioHarness: Substrate-Aware Evidence Assembly for Biomedical Question Answering across Literature, Knowledge Bases, and Biological Atlases

BioHarness:面向生物医学问答的底物感知证据组装——跨文献、知识库和生物图谱

Meng Xiao, Chuan Qin, Jinmiao Chen, Yihang Cheng, Yuanchun Zhou, Hengshu Zhu

AI总结 提出BioHarness,通过级联控制机制在文献检索、知识库和生物图谱间选择性组装证据,提升生物医学问答准确率,在19,302个问答项上得分从65.9提升至71.0。

Comments 14 Pages, 11 Figures, Keywords: biomedical question answering; retrieval-augmented generation; large language models; evidence assembly; biomedical knowledge bases; biological atlases

详情
AI中文摘要

动机:生物医学问答通常需要超越主题检索文献的证据,包括基因别名解析、数据库标识符标准化以及来自图谱的生物测量值。然而,现有的检索增强生成(RAG)系统通常遵循固定工作流程,缺乏明确机制来决定何时检索文本足够、何时需要经过整理的生物医学知识、或何时应调用对结构化测量值的可执行证据组装。这激发了一种底物感知的大语言模型(LLM)框架,能够跨文献、知识库和生物图谱选择性地组装足够的证据。结果:我们引入BioHarness,一种用于分阶段生物医学证据组装的LLM框架,涵盖文献检索、经过整理的生物医学知识资源以及来自图谱的结构化测量值。BioHarness首先尝试根据重排序的文献证据回答问题,并通过基于接地级联控制,仅在当前证据不确定、接地不足或底物不匹配时升级到REPL风格的证据组装。在涵盖七种答案格式的19,302个生物医学问答项上,BioHarness将最强非预言基线的综合得分从65.9提升至71.0。消融实验、案例研究和骨干扩展分析表明,这些提升源于通过重排序、实体接地和结构化测量访问修复证据-底物不匹配,而非不加区分地调用更多推理步骤、检索更多文献或依赖特定答案模型规模。

英文摘要

Motivation: Biomedical question answering often requires evidence beyond topically retrieved literature, including gene alias resolution, database identifier normalization, and atlas-derived biological measurements. However, existing retrieval-augmented generation (RAG) systems typically follow a fixed workflow and lack an explicit mechanism for deciding when retrieved text is sufficient, when curated biomedical knowledge is required, or when executable evidence assembly over structured measurements should be invoked. This motivates a substrate-aware large language model (LLM) harness that selectively assembles sufficient evidence across literature, knowledge bases, and biological atlases. Results: We introduce BioHarness, an LLM harness for staged biomedical evidence assembly across literature retrieval, curated biomedical knowledge resources, and atlas-derived structured measurements. BioHarness first attempts to answer from reranked literature evidence and escalates through grounded cascade control to REPL-style evidence assembly only when the current evidence is uncertain, weakly grounded, or substrate-mismatched. Across 19,302 biomedical QA items spanning seven answer formats, BioHarness improves the pooled score from 65.9 to 71.0 over the strongest non-oracle baseline. Ablations, case studies, and backbone-scaling analyses show that these gains arise from repairing evidence-substrate mismatches through reranking, entity grounding, and structured measurement access, rather than from indiscriminately invoking more reasoning steps, retrieving additional literature, or relying on a particular answer-model scale.

2606.20315 2026-06-19 q-bio.GN cs.CR 新提交

bioETH-Beacon: A Confidential On-Chain Genomic Beacon with Encrypted Counts, Filters, and Bounded Noise over a Fully Homomorphic EVM

bioETH-Beacon: 基于全同态EVM的机密基因组信标,支持加密计数、过滤和有界噪声

Christos Galanopoulos, Kimon Antonios Provatas, Ilias Georgakopoulos-Soares

AI总结 提出基于全同态EVM的智能合约原型bioETH-Beacon,实现加密基因组信标查询,通过加密计数、有界噪声和访问控制抵御成员推理攻击,并优化查询成本。

Comments 11 pages, 6 figures, 8 tables. Research prototype for privacy-preserving genomics using Fully Homomorphic Encryption (FHE) on blockchain (fhEVM)

详情
AI中文摘要

全球基因组学与健康联盟(GA4GH)Beacon协议允许研究人员查询某个基因组变异是否在参与队列中被观察到,并返回聚合的变异级计数。随着Beacon网络的发展,两个隐私风险依然存在:宿主机构可以看到明文查询,而重复的罕见变异查询可能支持成员推理攻击。我们提出了bioETH-Beacon,一个智能合约原型,它在全同态以太坊虚拟机(fhEVM)上对加密数据执行Beacon“聚合计数”查询。医院上传加密的标记计数条目,授权研究人员提交加密的标记查询,合约返回加密答案,通过链下密钥管理服务仅释放给合约链上ACL中指定的请求者。该设计组织为一个3x4的层级-查询族网格,涵盖基因型、性别、年龄和表型查询,层级在更强的机密性和更低的查询成本之间进行权衡。对于基因型路径,原型可以添加链上有界噪声以减轻探测攻击。基于多基因评分(PGS)目录的合成面板实验显示了预期的扩展行为,并证明当公共标记存在是可接受的权衡时,预聚合可以显著降低查询gas成本。总体而言,bioETH-Beacon提供了一个无需可信计算评估者的机密Beacon式基因组查询研究原型。

英文摘要

The Global Alliance for Genomics and Health (GA4GH) Beacon protocol lets researchers ask whether a genomic variant has been observed in a participating cohort and receive aggregate variant-level counts. As Beacon networks grow, two privacy risks remain: host institutions can see plaintext queries, and repeated rare-variant queries can support membership-inference attacks. We present bioETH-Beacon, a smart-contract prototype that runs the Beacon "aggregate count" query over encrypted data on a fully homomorphic Ethereum Virtual Machine (fhEVM). Hospitals upload encrypted marker-count entries, authorized researchers submit encrypted marker queries, and the contract returns an encrypted answer that is released, via an off-chain key-management service, only to the requester named in the contract's on-chain ACL. The design is organized as a 3x4 tier-by-query-family grid spanning genotype, sex, age, and phenotype queries, with tiers that trade stronger confidentiality for lower query cost. For genotype paths, the prototype can add bounded on-chain noise to mitigate probing attacks. Experiments on synthetic panels derived from a Polygenic Score (PGS) catalog show the expected scaling behavior and demonstrate that pre-aggregation can substantially reduce query gas when public marker presence is an acceptable trade-off. Overall, bioETH-Beacon provides a research prototype for confidential Beacon-style genomic querying without a trusted compute evaluator.

2606.19794 2026-06-19 econ.GN cs.CY q-fin.EC 新提交

Forecasting AI-Era Productivity: The Intellectually Converged Human Framework and a Missing Cognitive Mediator in Production Function Theory

预测AI时代的生产率:智力融合人类框架与生产函数理论中缺失的认知中介

Kwan Soo Shin, In Seok Kang

AI总结 本文提出智力融合人类(ICH)框架,通过引入四维认知构念“融合能力”(C)作为AI与生产率之间的认知中介,解释了AI投资未能带来相应生产率增长的理论悖论,并基于20个OECD国家的数据分析验证了AI与C的交互作用对全要素生产率变异的解释力。

Comments 78 pages, 3 figures

详情
AI中文摘要

为什么大规模AI投资未能产生相应的生产率增长?我们认为这一悖论在理论上是生成的:主流生产函数框架通过将AI视为可分离的生产要素,而未建模AI产生生产性价值的认知中介,从而遇到了结构性边界。这导致投资倾向于部署,而生产率需要先发展我们称之为融合能力(C)的东西。我们提出了智力融合人类(ICH)框架,这是生产函数理论的第五阶段框架:H-hat = H[1 + phi(A,C)],其中有效生产能力等于人力资本(H)乘以一个增强因子[1 + phi],phi由AI利用强度(A)和融合能力(C)共同决定,C是一个四维认知构念,涵盖具身理解、元认知、时间整合和整合思维。生产函数Y = F(K, H-hat)为索洛的TFP残差提供了一个以人为中心的机制:A_Solow = [1 + phi(A,C)]^(1-alpha)。该框架预测了三种具有不同政策含义的增强机制。对20个OECD经济体的描述性跨国分析显示,AIxC交互作用与86%的TFP变异相关,而仅AI为31%,这是小n理论传统中模式一致的发现。韩国是国家级欠增强的例证:高H、大量A、低C导致phi=0。我们将融合能力与相邻构念——吸收能力、动态能力和人力资本——区分开来,并证明C构成了先前框架中隐含的特定认知中介。我们推导出C优先的政策建议,并提出了三个可实证检验的命题及一个可证伪的10年预测。

英文摘要

Why does massive AI investment fail to generate commensurate productivity gains? We argue the paradox is theoretically generated: prevailing production function frameworks encounter a structural boundary by treating AI as a separable factor of production without modeling the cognitive mediation through which AI generates productive value. This directs investment toward deployment when productivity requires prior development of what we term convergence capacity (C). We propose the Intellectually Converged Human (ICH) framework, a fifth-stage framework for production function theory: H-hat = H[1 + phi(A,C)], where effective productive capacity equals human capital (H) scaled by an augmentation factor [1 + phi], with phi jointly determined by AI utilization intensity (A) and convergence capacity (C), a four-dimensional cognitive construct encompassing embodied understanding, metacognition, temporal integration, and integrative thinking. The production function Y = F(K, H-hat) provides a human-centered mechanism for Solow's TFP residual: A_Solow = [1 + phi(A,C)]^(1-alpha). The framework predicts three augmentation regimes with distinct policy implications. Descriptive cross-national analysis of 20 OECD economies shows the AIxC interaction is associated with 86% of TFP variance versus 31% for AI alone, a pattern-consistent finding in the small-n theoretical tradition. South Korea exemplifies national-scale under-augmentation: high H, substantial A, low C produce phi = 0. We distinguish convergence capacity from adjacent constructs, absorptive capacity, dynamic capability, and human capital, and demonstrate that C constitutes the specific cognitive mediator that prior frameworks have left implicit. We derive C-first policy prescriptions and offer three empirically testable propositions with a falsifiable 10-year forecast.

2606.20553 2026-06-19 cs.CR 新提交

From Efficiency to Leakage -- Privacy Backdoor in Federated Language Model Fine-Tuning

从效率到泄露——联邦语言模型微调中的隐私后门

Shanghao Shi, Chaoyu Zhang, Heng Jin, Yang Xiao, Yevgeniy Vorobeychik, William Yeoh, Ning Zhang, Y. Thomas Hou, Wenjing Lou

AI总结 提出NeuroImprint攻击,恶意参数服务器在参数高效微调中植入隐私后门,通过为每个样本分配独立神经元并限制单次更新,实现高保真重建训练文本。

详情
AI中文摘要

联邦学习(FL)使多方能够协作微调语言模型以完成特定领域任务,而无需共享原始数据。由于完整模型微调对FL客户端而言通常过于昂贵,参数高效微调(PEFT)已成为实践中的事实标准,它冻结基础模型,仅训练少量适配器。在本文中,我们表明恶意参数服务器可以隐秘地将PEFT适配器破坏为隐私后门,该后门隐式记忆客户端的训练样本,作为存储在独立神经元中的隔离的每样本参数更新,而不降低模型效用。具体来说,我们的攻击NeuroImprint为每个训练样本分配一个专用的记忆神经元,并约束每个神经元在局部微调轨迹中最多更新一次。这种设计减轻了语言模型微调中由大批量和状态优化器(如Adam/AdamW)引入的跨样本碰撞和跨步混合。微调后,得到的隔离的每样本更新可以通过闭式解析逆变换恢复文本嵌入,然后确定性地映射回令牌序列。为了理解我们方法的通用性,我们在多个语言模型(BERT、GPT-2、Qwen2和Llama3.2)上实现了NeuroImprint,并在涵盖不同领域的四个微调数据集上进行了评估。结果表明,我们的攻击能够以高语义保真度重建59%至79%的所有微调样本。

英文摘要

Federated learning (FL) enables multiple parties to collaboratively fine-tune language models for domain-specific tasks without sharing raw data. Since full model fine-tuning is often prohibitively expensive for FL clients, parameter-efficient fine-tuning (PEFT) has become the de facto approach in practice, freezing the base model and training only a small set of adapters. In this paper, we show that a malicious parameter server can stealthily corrupt a PEFT adapter into a privacy backdoor that implicitly memorizes the client's training samples as isolated per-sample parameter updates stored in separate neurons, without degrading model utility. Concretely, our attack, NeuroImprint, assigns a dedicated memorization neuron to each training sample and constrains that each neuron is updated at most once along the local fine-tuning trajectory. This design mitigates both cross-sample collisions and cross-step mixing introduced by large local batches and stateful optimizers (e.g., Adam/AdamW) in language-model fine-tuning. After fine-tuning, the resulting isolated per-sample updates can be analytically inverted in closed form to recover text embeddings, which are then deterministically mapped back to token sequences. To understand the generality of our method, we implemented NeuroImprint on multiple language models (BERT, GPT-2, Qwen2, and Llama3.2) and evaluated it across four fine-tuning datasets spanning diverse domains. The results demonstrate that our attack can reconstruct 59% to 79% of all finetuning samples with high semantic fidelity.

2606.20550 2026-06-19 cs.DL cs.HC cs.IR 新提交

Easy Reads: A Python program for making Scientific Papers on arXiv more Reader Friendly and Accessible

Easy Reads: 一个使arXiv上的科学论文更易读和更易访问的Python程序

Vishal Verma

AI总结 针对科学论文排版紧凑、可读性差的问题,提出Easy Reads——一个自动化、端到端的开源Python程序,通过自定义字体大小和列数等格式,从arXiv获取论文并重新排版,提升可读性和可访问性。

Comments 9 pages. Open-source software project available at: https://github.com/Curious-flow/Easy-Reads

详情
AI中文摘要

科学论文通常排版紧凑,具有小字体、小行距、双栏文本和紧密排列的图表等特点。虽然这些特性使论文更紧凑,但会妨碍可读性,降低可访问性,并可能使读者感到疲劳。arXiv是一个跨学科的科学论文开放获取库,被包括物理学和天体物理学社区在内的研究人员广泛使用。Easy Reads是一个自动化、端到端的开源Python程序,通过使arXiv上的论文更易读和更易访问来帮助解决上述挑战。Easy Reads可以通过URL自动从arXiv获取论文,并处理源TeX文件,允许自定义论文的格式特性,主要是字体大小和使用的栏数。Easy Reads的主要目标是促进科学论文的易读性。

英文摘要

Scientific papers are frequently dense and characterized by features such as small fonts and line spacing, double columns of text, and tightly arranged figures. While these features make papers more compact, they can hinder readability, make them less accessible, and can strain the reader. arXiv is a premier open-access repository for scientific papers across different fields and is used extensively by researchers, including those in the physics and astrophysics communities. Easy Reads is an automated, end-to-end, open-source Python program that helps address the stated challenge by making papers from arXiv more reader-friendly and accessible. Easy Reads can automatically fetch a paper from arXiv via its URL and work with the source TeX file to allow custom formatting of the paper features, primarily the font size, and the number of columns used. The main goal of Easy Reads is to facilitate ease of reading of scientific papers.

2606.20539 2026-06-19 cs.DB cs.DS 新提交

Caching for Dollars, Not Hits: An Exact Offline Reference for Cloud-Egress Caching and the Crossover That Decides When It Pays

为美元缓存,而非命中率:云出口缓存的精确离线参考及决定何时值得的交叉点

Madhulatha Mandarapu, Sandeep Kunkunuru

AI总结 针对云存储出口费用而非延迟的缓存问题,提出多项式时间精确离线最优策略,发现LRU的美元后悔随成本分散度上升,而成本感知的GreedyDual可大幅降低,并给出决定何时需要成本感知缓存的闭合形式交叉点。

Comments 6 pages, 3 figures. Code, benchmarks, and full pre-registration: https://github.com/samyama-ai/cloud-egress-cache

详情
AI中文摘要

当缓存未命中从云对象存储获取数据时,计费基于每次GET请求和每字节出口流量,而非延迟。经典缓存最小化未命中率,这是错误的目标:一个很少但昂贵获取的对象可能比一个频繁但廉价获取的对象花费数千倍。广义缓存理论界定了未命中成本目标,但尚无公开基准衡量实际部署的启发式策略在真实云价格下与美元最优离线策略的差距。我们提供了该参考。对于具有异构未命中成本的统一大小页面缓存,离线美元最优可通过积分区间线性规划在多项式时间内精确求解——经暴力验证;可变大小是NP难的,因此我们将基于流的离线界从命中率目标扩展到美元(成本-FOO),误差约4%。基于此参考我们发现:(i) 异质性遗憾定律——LRU的美元遗憾随未命中成本分散度上升(Spearman 0.87),而成本感知的GreedyDual将其降至约十分之一;(ii) 竞争边界——当预算恰好覆盖昂贵工作集时,GreedyDual的残余遗憾降至接近零,否则为开放区间;(iii) 闭合形式交叉点 s* = GET费用/出口费率(S3上约4 KB,GCS上约330 B),可预测哪些部署需要成本感知缓存。在真实Twitter轨迹上,仅价格向量即可使工作负载跨越s*,按预测改变状态。该工件是一个可复现的计费忠实基准;其构建的启发式策略和界为先前工作,已致谢。

英文摘要

When a cache miss fetches from cloud object storage, the bill is per GET request and per byte of egress, not latency. Classic caching minimizes the miss rate, the wrong objective: a rarely but expensively fetched object can cost thousands of times more dollars than a frequently but cheaply fetched one. Generalized-caching theory bounds the miss-cost objective, but no reported benchmark measures how far deployed heuristics sit from the dollar-optimal offline policy on real cloud prices. We supply that reference. For uniform-size page caches with heterogeneous miss costs the offline dollar-optimum is exact in polynomial time via an integral interval linear program -- validated against brute force; variable sizes are NP-hard, so we extend the flow-based offline bound from the hit-ratio objective to dollars (cost-FOO), tight to about four percent. Against this reference we find: (i) a heterogeneity-regret law -- LRU's dollar-regret rises with miss-cost dispersion (Spearman 0.87) while cost-aware GreedyDual cuts it to roughly a tenth; (ii) a contention frontier -- GreedyDual's residual regret collapses to near zero exactly when the budget fits the expensive working set, and is the open slice otherwise; and (iii) a closed-form crossover s* = GET_fee/egress_rate (about 4 KB on S3, 330 B on GCS) that predicts which deployments need dollar-aware caching at all. On a real Twitter trace the price vector alone moves the workload across s*, shifting the regime as predicted. The artifact is a reproducible billing-faithful benchmark; heuristics and bounds it builds on are prior work, credited.

2606.20492 2026-06-19 cs.CR cs.LO 新提交

A-COMPASS: Formal Foundations for Anonymity Analysis in Microdata

A-COMPASS:微观数据匿名性分析的形式化基础

Tamara Tagliavia, Silvia Ghilezan

AI总结 本文修改COMPASS语言为A-COMPASS,使其适用于微观数据表,支持匿名条件检查与匿名化操作,并证明其语义的确定性和组合性,可用于验证k-匿名和l-多样性等属性。

详情
AI中文摘要

在信息时代,主要问题之一是如何确保个人隐私。根据考虑隐私的背景,出现了各种数据隐私模型。然而,即使对于最基本的模型,这些模型的形式化验证领域仍未得到充分探索。验证隐私需求的一种尝试是合规断言语言(COMPASS)。在COMPASS中,可以指定表需要满足的匿名条件,以及条件不满足时将修改表的操作。它设计用于对预处理后的表进行操作,形式为一条记录对应一组人。在本文中,我们修改COMPASS语言,使其以通常的一条记录对应一个人的形式对微观数据表进行操作。修改后的语言称为A-COMPASS。除了检查先前应用的匿名条件外,A-COMPASS还作为新功能支持执行匿名化操作。我们进一步提供了A-COMPASS语言的语法和语义。我们还证明了引入的语义的最重要属性,如确定性和组合性。最后,我们提供了一种验证匿名属性(如k-匿名和l-多样性)的机制。

英文摘要

In the information age, one of the leading problems is how to ensure individual's privacy. Depending on the context in which privacy is considered, various data privacy models have emerged. However, the domain of formal verification of these models is still not sufficiently explored even when it comes to the most basic models. An attempt to verify privacy requirements is the Compliance Assertion Language (COMPASS). In COMPASS, one can specify an anonymity condition that a table needs to satisfy, and an action that will modify the table if the condition is not satisfied. It is designed to operate on preprocessed tables in a form one record - one group of people. In this paper, we modify the COMPASS language in order to operate on microdata tables in their usual form of one record - one person. The modified language is called A-COMPASS. Along with checking of previously applied anonymity conditions, A-COMPASS enables the execution of anonymization actions as a new feature. We further provide the syntax and the semantics for the A-COMPASS language. We also prove the most important properties of the introduced semantics like determinism and compositionality. Finally, we provide a mechanism to verify anonymity properties, such as k-anonymity and l-diversity.

2606.20490 2026-06-19 cs.MS 新提交

Software package MaRDI Open Interfaces for improved interoperability in numerical optimization

软件包MaRDI开放接口:提升数值优化互操作性

Dmitry I. Kabanov, Stephan Rave, Mario Ohlberger

AI总结 提出MaRDI开放接口软件包,通过统一非线性优化接口减少编码与测试工作,并以物理信息神经网络求解粘性Burgers方程为例验证其互操作性。

Comments 15 pages, 1 figure, 1 table, GAMM2026

详情
AI中文摘要

为了解决计算科学中的互操作性挑战,我们介绍了软件包MaRDI Open Interfaces的最新更新。该软件包旨在减少计算科学家在编写数值求解器绑定以及将实验代码适配到同一问题类型(例如,基准测试哪个求解器更好)的不同求解器接口上所花费的时间和编码/测试工作。通过简化这些任务,该软件包帮助研究人员专注于其计算项目的实际本质。在这里,我们展示了一个最近开发的非线性优化接口,并说明了如何将其应用于优化问题的计算实验。作为此类问题的一个例子,我们考虑了训练物理信息神经网络以预测粘性Burgers方程的解。

英文摘要

To address the challenges of interoperability in computational science, we present the latest updates to the software package MaRDI Open Interfaces. This software package aims to decrease the time and coding/testing efforts spent by computational scientists on tasks such as writing bindings to numerical solvers and adapting experiment codes to the varying interfaces of solvers for the same problem type (e.g., for benchmarking, which solver is better). By streamlining these tasks, this software package helps researchers focus on the actual essence of their computational projects. Here, we demonstrate a recently developed interface for nonlinear optimization and illustrate how it can be applied for computational experiments with optimization problems. As an example of such problem, we consider training of physics-informed neural networks to predict the solutions of viscous Burgers' equation.

2606.20465 2026-06-19 cs.CY cs.SI 新提交

Farmer Connect: Improving Farmers' Access to Produce Markets

Farmer Connect:改善农民进入农产品市场的途径

Micheal Amanya, Darius Kainamura, Christine Namatovu, Lailah Kobugabe, Solomon Buwule Fortune, Adones Rukundo

AI总结 针对乌干达小农户面临的市场准入难、议价能力弱等问题,提出基于合作社的数字平台Farmer Connect,通过移动优先架构和云后端支持群体管理、市场协调和收益透明,实现约85%的用户需求。

详情
AI中文摘要

乌干达的小农户玉米种植者仍然面临有限的市场准入、薄弱的议价能力、低价格透明度以及对中间商的严重依赖。这些问题因农产品协调不善、付款延迟以及合作社交易可见性差而加剧。本文介绍了Farmer Connect,一个基于合作社的数字平台,旨在支持农民群体之间的农产品管理、市场协调和透明的收益跟踪。该系统支持四种用户角色:管理员、监督员、农民和客户。其核心功能包括农民群体管理、贡献记录和验证、市场列表、订单处理、基于先进先出的农产品分配、收益可见性、移动货币支付支持和通知服务。该平台采用移动优先架构,配备基于云的后端服务和行政网页仪表板。功能实现表明,该系统能够支持基于群体的玉米营销和合作社协调所需的主要工作流程,约85%的已识别用户需求得到实现。研究表明,以合作社为中心的数字平台可以为改善小农户的透明度、协调性和买家准入提供实用框架。

英文摘要

Smallholder maize farmers in Uganda continue to face limited market access, weak bargaining power, low price transparency, and heavy reliance on intermediaries. These challenges are compounded by poor produce coordination, delayed payments, and weak visibility into cooperative transactions. This paper presents Farmer Connect, a cooperative-based digital platform designed to support produce management, marketplace coordination, and transparent earnings tracking among farmer groups. The system supports four user roles: administrators, supervisors, farmers, and customers. Its core functions include farmer group management, contribution recording and verification, marketplace listing, order processing, First In First Out based produce allocation, earnings visibility, mobile money payment support, and notification services. The platform was implemented using a mobile-first architecture with cloud-based backend services and an administrative web dashboard. Functional implementation showed that the system was able to support the major workflows required for group-based maize marketing and cooperative coordination, with approximately 85% of identified user requirements implemented. The study shows that cooperative-centered digital platforms can provide a practical framework for improving transparency, coordination, and buyer access for smallholder farmers.

2606.20454 2026-06-19 cs.FL 新提交

Minimality of Random Moore Automata under Prefix-Dependent Congruences

随机摩尔自动机在前缀依赖同余下的极小性

Matías Carrasco, Sergio Yovine

AI总结 研究随机确定性迁移系统中前缀依赖同余的平凡性,证明在标签独立且每个标签至少有三个可接受符号时,同余高概率为平凡。

Comments 9 pages

详情
AI中文摘要

我们研究带有状态输出的随机确定性迁移系统的前缀依赖同余。在此设定下,用于比较两个状态的可接受延续可能依赖于观察到的前缀,并且只有当没有共同的可接受延续能区分它们的未来输出时,两个状态才被识别。该框架包括概率确定性有限自动机作为一个激励性的特例。我们分析随机迁移模型,其中所有迁移值是独立且均匀的。每个状态还被分配一个独立标签,该标签指定其输出及其可接受符号集。如果两个独立标签以严格小于1的概率一致,并且每个标签至少有三个可接受符号,则诱导的同余以高概率是平凡的。证明结合了配对上的剪枝过程、控制其早期演化的无碰撞探索,以及表明剩余配对无法组织成非平凡等价类的第一矩论证。

英文摘要

We study prefix-dependent congruences for random deterministic transition systems with state outputs. In this setting, the admissible continuations used to compare two states may depend on the observed prefix, and two states are identified only if no common admissible continuation distinguishes their future outputs. The framework includes probabilistic deterministic finite automata as a motivating special case. We analyze the random transition model in which all transition values are independent and uniform. Each state is also assigned an independent label that specifies both its output and its set of admissible symbols. If two independent labels agree with probability strictly less than one, and every label has at least three admissible symbols, then the induced congruence is trivial with high probability. The proof combines a pruning process on pairs, a collision-free exploration controlling its early evolution, and a first-moment argument showing that the remaining pairs cannot organize into nontrivial equivalence classes.