arXivDaily arXiv每日学术速递 周一至周五更新
重置
2605.20142 2026-05-20 stat.AP q-fin.ST

Mining Financial Data using Mixtures of Mirrored Weibull Distributions

使用镜像Weibull分布混合物挖掘金融数据

Zijun Jia, Sharon X. Lee

AI总结 本文提出了一种基于镜像Weibull分布混合物(MMW)模型,用于建模股票收益和估计风险指标,该模型能够灵活适应金融数据中常见的非正态特征,并在价值-at-风险(VaR)估计中表现出显著优势。

详情
AI中文摘要

风险管理是金融实践中的重要部分,对于保护资产和投资在现代波动市场中至关重要。本文提出了一种混合镜像Weibull(MMW)分布用于建模股票收益和估计风险指标。与通常基于正态分布的常见做法不同,MMW模型可以灵活地适应金融数据中频繁出现的非正态特征。它还具有吸引人的性质,如具有简单的密度表达式和快速的参数估计。我们通过评估其在三个S&P500股票的价值-at-风险(VaR)估计中的性能来证明该模型的有效性。MMW模型在与高斯混合模型和t混合模型相比时表现优异,在VaR估计和预测方面有显著改进。

英文摘要

Risk management is an important part of financial practice, essential for protecting assets and investments in modern-day volatile markets. This paper proposes a mixture of mirrored Weibull (MMW) distribution for modelling stock returns and estimating risk measures. Unlike common practices which are typically based on the normal distribution, the MMW model can flexibly accommodate non-normal features frequently exhibited in financial data. It also enjoys appealing properties such as having a simple density expression and fast parameter estimation. We demonstrate the effectiveness of our model by assessing its performance in Value-at-Risk (VaR) estimation of three S&P500 stocks. The MMW model compares favourably to Gaussian mixture model and t-mixture model, with significant improvements in VaR estimation and prediction.

2605.17299 2026-05-20 econ.GN cond-mat.stat-mech q-fin.EC q-fin.RM

Geometric Brownian motion with intermittent entries and exits

具有间歇进入和退出的几何布朗运动

Suvam Pal, Viktor Stojkoski, Arnab Pal, Trifce Sandev

AI总结 本文研究了一种扩展的几何布朗运动框架,结合了新单位的进入和当前人口的退出机制,扩展了早期的随机重置模型,其中这些速率被视为相同。该模型捕捉了许多经济可观测特征,可以解释为市场驱动的企业进入/退出、工人流入/流出以及收入增长/损失。该模型非保守,尽管进入和退出速率存在不对称性,但系统最终会趋于平稳分布。此外,我们的分析揭示了分布矩的三个不同动态制度,源于波动性、漂移、进入和退出速率之间的相互作用。我们进一步推导了生存概率和与观察变量达到特定阈值相关的平均首次通过时间,与竞争的进入-退出过程相关。有趣的是,我们发现了一个最优的退出速率,该速率最小化了平均首次通过时间,为如何通过进入和退出政策影响系统结果提供了见解。这些结果应有助于理解其中增长、波动性、进入和退出共同塑造异质单位演变的经济系统的长期行为。

详情
Comments
15 pages, 8 figures
AI中文摘要

我们研究了一种扩展的几何布朗运动框架,该框架结合了新单位的进入和当前人口的退出机制,扩展了早期的随机重置模型,其中这些速率被视为相同。该模型捕捉了许多经济可观测特征,可以解释为市场驱动的企业进入/退出、工人流入/流出以及收入增长/损失。该模型非保守,尽管进入和退出速率存在不对称性,但系统最终会趋于平稳分布。此外,我们的分析揭示了分布矩的三个不同动态制度,源于波动性、漂移、进入和退出速率之间的相互作用。我们进一步推导了生存概率和与观察变量达到特定阈值相关的平均首次通过时间,与竞争的进入-退出过程相关。有趣的是,我们发现了一个最优的退出速率,该速率最小化了平均首次通过时间,为如何通过进入和退出政策影响系统结果提供了见解。这些结果应有助于理解其中增长、波动性、进入和退出共同塑造异质单位演变的经济系统的长期行为。

英文摘要

We study a generalized geometric Brownian motion framework that incorporates both entries of new units and exit mechanisms for the current population, extending earlier stochastic resetting models where these rates are treated as identical. The model captures realistic features observed in many economic observables, which can be explained as market-driven firm entries/exits, worker inflow/outflow, and income growth/loss. This model is not conservative and, despite the asymmetry in the entry and exit rates, we find that the system eventually relaxes to a stationary distribution. Moreover, our analysis reveals three distinct dynamical regimes in the moments of the distribution, arising from the interplay between volatility, drift, entry, and exit rates. We further derive the survival probability and the mean first-passage time associated with the observed variable reaching certain threshold under the competing entry-exit processes. Interestingly, we identify an optimal exit rate that minimizes the mean first-passage time, providing insights into how entry and exit policies can influence the outcome of the system. These results should be useful for understanding the long-run behavior of economic systems in which growth, volatility, entry, and exit jointly shape the evolution of heterogeneous units.

2603.02456 2026-05-20 econ.TH econ.EM econ.GN q-fin.EC

When Does Static Willingness to Pay Mislead? A Framework for Dynamic Hedonic Valuation

当静态支付意愿误导何时?动态享乐估值的框架

Josephine Auer

AI总结 本文研究了静态支付意愿何时会误导,提出了一种动态享乐估值框架,通过家庭扫描数据展示了享乐表示对价格的现实限制,并说明习惯性形成如何在该表示条件下提高行为一致性。

详情
AI中文摘要

许多政策反事实依赖于消费者对产品属性(如糖、钠、咖啡因、酒精或排放)的价值评估。标准的享乐和差异化产品模型将这些评估静态化。当属性具有习惯性时,这种解释是受限的:观察到的价格反映了同时期边际价值和当前消费产生的持续价值。我开发了一个非参数揭示偏好框架用于动态享乐估值,推导出合理化观察价格和选择的必要和充分条件。利用家庭扫描数据对谷物购买的分析,我表明享乐表示对价格施加了现实限制,而习惯性形成在该表示条件下提高了行为一致性。结果提供了一个诊断工具,用于判断静态属性估值何时是合理的,以及价格如何揭示超过同时期边际价值。

英文摘要

Many policy counterfactuals depend on how consumers value product attributes such as sugar, sodium, caffeine, alcohol, or emissions. Standard hedonic and differentiated-products models interpret these valuations statically. That interpretation is restrictive when attributes are habit forming: observed prices then reflect both contemporaneous marginal value and the continuation value generated by current consumption. I develop a nonparametric revealed-preference framework for dynamic hedonic valuation, deriving necessary and sufficient conditions for rationalising observed prices and choices. Using household scanner data on cereal purchases, I show that the hedonic representation places real restrictions on prices, while habit formation improves behavioural coherence conditional on that representation. The results provide a diagnostic for when static attribute valuation is justified and when prices reveal more than contemporaneous marginal values.

2601.05085 2026-05-20 q-fin.TR

Trading Electrons: Predicting DART Spread Spikes in ISO Electricity Markets

交易电子:预测ISO电力市场中的DART价差尖峰

Emma Hubert, Dimitrios Lolas, Ronnie Sircar

AI总结 本文研究了预测和优化日间与实时(DART)价差的策略,通过扩展单区域到多区域的尖峰预测模型,并基于日间投标堆栈开发了一个结构化和市场一致的价格影响模型,以提高风险回报比。

详情
Comments
40 pages
AI中文摘要

我们研究了预测和优化日间与实时(DART)价差的问题,在美国批发电力市场中。基于Galarneau-Vincent等人框架,我们扩展了从单区域到多区域的尖峰预测,并在统一的统计模型中处理正负DART尖峰。为将方向信号转化为经济上有意义的头寸,我们开发了一个基于日间投标堆栈的结构化和市场一致的价格影响模型。这产生了最优区域INC/DEC数量向量的闭式表达式,捕捉了不对称的买/卖影响和跨区域拥堵效应。当应用于NYISO时,所得到的影响意识策略相对于单位大小交易显著提高了风险回报比,并突显了市场和季节间的显著异质性。

英文摘要

We study the problem of forecasting and optimally trading day-ahead versus real-time (DART) price spreads in U.S. wholesale electricity markets. Building on the framework of Galarneau-Vincent et al., we extend spike prediction from a single zone to a multi-zone setting and treat both positive and negative DART spikes within a unified statistical model. To translate directional signals into economically meaningful positions, we develop a structural and market-consistent price impact model based on day-ahead bid stacks. This yields closed-form expressions for the optimal vector of zonal INC/DEC quantities, capturing asymmetric buy/sell impacts and cross-zone congestion effects. When applied to NYISO, the resulting impact-aware strategy significantly improves the risk-return profile relative to unit-size trading and highlights substantial heterogeneity across markets and seasons.

2508.04003 2026-05-20 q-fin.TR

The Marginal Effects of Ethereum Network MEV Transaction Re-Ordering

以太坊网络MEV交易重排的边际效应

Bruce Mizrach, Nathaniel Yoshida

AI总结 研究以太坊网络中MEV构建者通过重排交易对参与者造成的影响,发现其每月需支付约1400万美元以保持交易在区块前四分之一,且沙丁鱼攻击频繁,导致交易费用分配不均,提出可能的改革措施。

详情
AI中文摘要

目前,两个MEV构建者几乎生产了以太坊80%的区块。区块构建者有能力重排区块链上的交易,这种方式可能对参与者有害。我们估计,他们每月需支付近1400万美元以确保交易位于区块前四分之一。沙丁鱼攻击(即抢先交易)频繁,平均每块超过一个。这些交易的燃气费用支付了MEV支付给验证器的近15%。这些攻击具有显著的边际效应并扭曲了分布。改革如燃气费用优先或私人交易池可能有所帮助。

英文摘要

Two MEV builders now produce nearly 80\% of Ethereum blocks. Block builders have the ability to reorder transactions on the blockchain in a way that can be harmful to participants. We estimate they would pay in the aggregate nearly \$14 million per month to ensure that they remained in the first quartile of the block. Sandwich attacks, in which a transaction is front-run, are frequent, averaging more than one per block. Gas fees on these transactions pay for nearly 15\% of the MEV payments to the validator. These attacks have especially large marginal effects and skew the distribution. Reforms such as gas fee priority or private transaction pools might be helpful.

2407.18687 2026-05-20 q-fin.MF q-fin.RM

Set risk measures

集风险度量

Marcelo Righi, Eduardo Horta, Marlon Moresco

AI总结 本文引入了集风险度量(SRMs),一种定义在本质上有界随机变量非空闭合集族上的实值映射。SRMs通过为整个位置集分配单一资本要求扩展了传统标量风险度量。本文发展了SRMs的公理框架,将单调性、翻译不变性、凸性和正齐次性等经典属性适应到集算术中。主要技术贡献是通过严格拓扑和正则τ-可加单位质量测度对凸SRMs进行对偶表示。还刻画了最坏情况SRMs,并给出了与系统性风险、Knightian不确定性及偏好表示相关的例子。

详情
AI中文摘要

我们引入了集风险度量(SRMs),即定义在非空闭合集族上的实值映射,该族由本质有界随机变量组成。SRMs通过为整个位置集分配单一资本要求扩展了传统标量风险度量。我们发展了SRMs的公理框架,将单调性、翻译不变性、凸性和正齐次性等经典属性适应到集算术中。主要技术贡献是通过严格拓扑和正则τ-可加单位质量测度对凸SRMs进行对偶表示。我们还刻画了最坏情况SRMs,并给出了与系统性风险、Knightian不确定性及偏好表示相关的例子。

英文摘要

We introduce set risk measures (SRMs), real-valued maps defined on the family of non-empty closed bounded sets of essentially bounded random variables. SRMs extend traditional scalar risk measures by assigning a single capital requirement to an entire set of positions. We develop an axiomatic framework for SRMs, adapting classical properties such as monotonicity, translation invariance, convexity, and positive homogeneity to set arithmetic. The main technical contribution is a dual representation of convex SRMs through the \strict{} topology and regular $τ$-additive unit-mass measures. We also characterize worst-case SRMs and present examples related to systemic risk, Knightian uncertainty, and preference representations.

2605.19401 2026-05-20 econ.GN q-fin.EC q-fin.GN stat.AP

External Demand, Domestic Monetary Conditions, and Remittance Dynamics in Nepal

外部需求、国内货币政策与尼泊尔汇款动态

Sahaj Raj Malla

AI总结 本文研究了尼泊尔国内汇款占GDP比重的宏观经济决定因素和动态行为,重点分析主要目的地国家的外部需求和国内货币政策。通过1993-2024年的年度数据,构建了多国外部需求的综合指数和国内货币条件指数,采用ARDL界限检验、Engle-Granger共整合、动态OLS和两步误差修正模型等小样本计量方法,发现外部需求对汇款有显著正向影响,而紧缩的国内货币政策则有显著负向影响。

详情
Comments
16 pages, 1 figure, 7 tables
AI中文摘要

本文研究了尼泊尔国内汇款占GDP比重的宏观经济决定因素和动态行为,重点分析主要目的地国家的外部需求和国内货币政策。通过1993-2024年的年度数据,构建了多国外部需求的综合指数和国内货币条件指数,采用ARDL界限检验、Engle-Granger共整合、动态OLS和两步误差修正模型等小样本计量方法,发现外部需求对汇款有显著正向影响,而紧缩的国内货币政策则有显著负向影响。误差修正模型确认了稳定的共整合关系,每年纠正约26%的不平衡。中期预测表明,汇款将在结构上保持重要地位,到2030年在基准条件下将达到GDP的28.3%,同时对外部需求冲击高度敏感。本文通过将PCA衍生的外部需求和货币条件指数整合到统一的ARDL-ECM框架中,推动了文献发展。聚焦于全球最依赖汇款的经济体之一,为货币政策校准、移民多样化和汇款流入的生产性利用提供了可操作的见解。

英文摘要

This study investigates the macroeconomic determinants and dynamic behaviour of personal remittances as a share of Gross Domestic Product (GDP) in Nepal, emphasizing external demand in major destination countries and domestic monetary policy. Using annual data (1993-2024), we construct composite indices via Principal Component Analysis (PCA) for multi-country external demand and a domestic Monetary Conditions Index (MCI). Our small-sample econometric pipeline includes Autoregressive Distributed Lag (ARDL) bounds testing, Engle-Granger cointegration, Dynamic OLS (DOLS), and a two-step Error Correction Model (ECM). We also employ Granger causality tests and multi-model forecasting using machine learning and ECM scenarios. The analysis reveals a strong positive long-run effect of external demand on remittances and a significant negative impact of tighter domestic monetary conditions. The ECM confirms a stable cointegrating relationship, correcting approximately 26% of disequilibria annually. Medium-term projections indicate remittances will remain structurally important, reaching around 28.3% of GDP by 2030 under baseline conditions, while exhibiting high sensitivity to external demand shocks. This study advances the literature by integrating PCA-derived external demand and monetary conditions indices within a unified ARDL-ECM framework for small samples. Focusing on one of the world's most remittance-dependent economies, it offers actionable insights for monetary policy calibration, migration diversification, and the productive utilization of remittance inflows.

2605.19154 2026-05-20 econ.GN q-fin.EC

Indirect Estimators of Intergenerational Mobility

代际流动性间接估计器

Andrea Del Pizzo, Martin Nybom, Jan Stuhler

AI总结 本文研究了代际流动性间接估计方法,通过家庭关联推断收入数据不完整时的父母-子女关系,综合了工具变量、可观测特征插补、姓氏估计器和多代关联方法,并提出一个统一框架来解释不同方法的权重分配。

详情
Comments
In preparation for the Handbook of the Economics of Intergenerational Mobility
AI中文摘要

本章回顾了代际流动性的间接估计方法,重点探讨在直接收入数据不完整或不可用时,如何通过父母-子女或其他家庭关联推断。我们综合了基于工具变量、利用教育和职业等可观测特征进行插补、基于姓氏的估计器以及多代关联方法。为统一这些方法,我们引入了一个简化框架,其中经济状况通过多种路径传递,具有异质性的持续率。在该框架下,直接和间接估计器均可视为这些底层传递渠道的加权平均。核心发现是,选择工具或插补策略决定了这些权重,使不同方法捕捉到传递过程的不同方面。我们强调了解释上的影响,表明间接估计器不必恢复传统父母-子女相关性,而是可以提供关于长期持续性和持久不平等机制的补充证据。

英文摘要

This chapter reviews indirect estimators of intergenerational mobility, focusing on approaches that infer parent-child or other family associations when direct income data are incomplete or unavailable. We synthesize methods based on instrumental variables, imputation using observable characteristics such as education and occupation, surname-based estimators, and multigenerational linkages. To unify these approaches, we introduce a stylized framework in which socioeconomic status is transmitted through multiple pathways with heterogeneous persistence rates. Within this framework, both direct and indirect estimators can be interpreted as weighted averages of these underlying transmission channels. A central insight is that the choice of instrument or imputation strategy determines these weights, leading different methods to capture distinct aspects of the transmission process. We highlight implications for interpretation, showing that indirect estimators need not recover conventional parent-child correlations but can instead provide complementary evidence on long-run persistence and the mechanisms underlying persistent inequalities.

2605.19146 2026-05-20 q-fin.MF cs.CE

Designing On-Chain Options: Amortizing Perpetual Options

设计链上期权:摊还永续期权

Maxim Bichuch, Zachary Feinstein

AI总结 本文提出了一种适用于区块链环境的摊还永续期权设计,通过最小化一致性要求的去中心化市场框架,为DeFi提供基础风险原语,实现去中心化清算和尾部风险共担。

详情
AI中文摘要

金融期权是传统市场中的基本工具,使策略范围从对冲到投机。然而,尽管自动化市场maker范式已革新去中心化现货市场,但链上期权尚无等效标准。典型设计试图复制集中化交易所机制,需要高频预言机和稳健的清算引擎,在压力事件中可能失效。本文提出一种适用于区块链环境的操作和对抗约束的摊还永续期权设计。利用这一基础构件,我们引入一个去中心化市场框架,具有最小的一致性要求。我们证明该合同作为DeFi的基础风险原语,支持如内生抵押和显式定价去锚定保险的应用,从而表明该设计提供了一层跨协议共担尾部风险的层,而无需依赖集中化清算机构。

英文摘要

Financial options are fundamental to traditional markets, enabling strategies ranging from hedging to speculating. Yet, while the Automated Market Maker paradigm has revolutionized decentralized spot markets, no equivalent standard has emerged for on-chain options. Typical designs attempt to replicate centralized exchange mechanics, requiring high-frequency oracles and robust liquidation engines which may fail during stress events. This paper presents a design for amortizing perpetual options tailored to the operational and adversarial constraints of blockchain environments. Leveraging this primitive, we introduce a decentralized market framework with minimal consistency requirements. We demonstrate that this contract functions as a foundational risk primitive for DeFi, enabling applications such as endogenous collateralization and explicitly priced de-peg insurance, thereby showing that this design provides a layer for mutualizing tail risk across protocols without reliance on centralized clearing institutions.

2604.02064 2026-05-20 quant-ph cs.NA math.NA q-fin.PR

Quantitative Universal Approximation for Noisy Quantum Neural Networks

量子神经网络在噪声环境下的定量通用逼近

Lukas Gonon, Antoine Jacquier, Marcel Mordarski

AI总结 本文提出一个具有精确误差界的一般逼近定理,用于噪声量子神经网络,特别应用于量化金融领域,通过在实际噪声量子硬件上进行数值分析来验证结果。

详情
Comments
30 pages, 17 figures
AI中文摘要

本文提供了一个具有精确误差界的通用逼近定理,用于噪声量子神经网络。我们专注于量化金融应用,其中目标函数通常以期望值给出。我们进一步提供了详细的数值分析,测试我们的结果在实际噪声量子硬件上的表现。

英文摘要

We provide here a universal approximation theorem with precise quantitative error bounds for noisy quantum neural networks. We focus on applications to Quantitative Finance, where target functions are often given as expectations. We further provide a detailed numerical analysis, testing our results on actual noisy quantum hardware.

2512.13562 2026-05-20 q-fin.RM math.PR

Disability insurance with collective health claims: A mean-field approach

残疾保险与集体健康索赔:一种均场方法

Christian Furrer, Philipp C. Hornung

AI总结 本文提出了一种结合个体和集体健康索赔的残疾保险模型,通过均场方法将复杂的多体问题简化为非线性单体问题,从而提高预测能力。

详情
AI中文摘要

经典的半马尔可夫残疾模型通过引入个体和集体健康索赔来增强其解释和预测能力,特别是在团体经验定价的背景下。集体健康索赔的引入导致了计算上具有挑战性的多体问题。通过采用均场方法,该多体问题可以被近似为非线性单体问题,从而得到一种基于低维非线性前向积分微分方程系统的透明定价方法。在面向实践的模拟研究中,均场近似在与朴素蒙特卡洛方法的比较中表现良好。

英文摘要

The classic semi-Markov disability model is expanded with individual and collective health claims to improve its explanatory and predictive power -- in particular in the context of group experience rating. The inclusion of collective health claims leads to a computationally challenging many-body problem. By adopting a mean-field approach, this many-body problem can be approximated by a non-linear one-body problem, which in turn leads to a transparent pricing method for disability coverages based on a lower-dimensional system of non-linear forward integro-differential equations. In a practice-oriented simulation study, the mean-field approximation clearly stands its ground in comparison to naïve Monte Carlo methods.

2509.25055 2026-05-20 q-fin.CP

AlphaSAGE: Structure-Aware Alpha Mining via GFlowNets for Robust Exploration

AlphaSAGE: 通过GFlowNets进行结构感知的Alpha挖掘以实现稳健探索

Binqi Chen, Hongjun Ding, Ning Shen, Jinsheng Huang, Taian Guo, Luchen Liu, Ming Zhang

AI总结 本研究提出AlphaSAGE框架,通过结构感知编码器、生成流网络和密集多维奖励结构,解决量化金融中Alpha挖掘的奖励稀疏性、表达不足和单一最优模式问题,实现更多样化的Alpha挖掘。

详情
AI中文摘要

自动挖掘预测信号或Alpha是量化金融中的核心挑战。尽管强化学习(RL)已 emerge 作为生成公式化Alpha的有前景范式,但现有框架本质上受到三个相互关联问题的限制。首先,它们面临奖励稀疏性问题,其中有意义的反馈仅在完成完整公式后才出现,导致探索效率低下且不稳定。其次,它们依赖于语义不充分的数学表达式序列表示,无法捕捉决定Alpha行为的结构。第三,标准RL目标最大化预期回报本质上会推动策略朝单一最优模式发展,直接与实际对非相关Alpha多样化投资的需要相矛盾。为克服这些挑战,我们引入AlphaSAGE(通过生成流网络进行结构感知的Alpha挖掘以实现稳健探索)框架,该框架基于三个核心创新:(1)基于关系图卷积网络(RGCN)的结构感知编码器;(2)生成流网络(GFlowNets)的新框架;(3)密集、多维奖励结构。实验结果表明,AlphaSAGE在挖掘更多样、新颖且高度预测性的Alpha组合方面优于现有基线,从而提出了一种新的自动化Alpha挖掘范式。我们的代码可在https://github.com/BerkinChen/AlphaSAGE上获取。

英文摘要

The automated mining of predictive signals, or alphas, is a central challenge in quantitative finance. While Reinforcement Learning (RL) has emerged as a promising paradigm for generating formulaic alphas, existing frameworks are fundamentally hampered by a triad of interconnected issues. First, they suffer from reward sparsity, where meaningful feedback is only available upon the completion of a full formula, leading to inefficient and unstable exploration. Second, they rely on semantically inadequate sequential representations of mathematical expressions, failing to capture the structure that determine an alpha's behavior. Third, the standard RL objective of maximizing expected returns inherently drives policies towards a single optimal mode, directly contradicting the practical need for a diverse portfolio of non-correlated alphas. To overcome these challenges, we introduce AlphaSAGE (Structure-Aware Alpha Mining via Generative Flow Networks for Robust Exploration), a novel framework is built upon three cornerstone innovations: (1) a structure-aware encoder based on Relational Graph Convolutional Network (RGCN); (2) a new framework with Generative Flow Networks (GFlowNets); and (3) a dense, multi-faceted reward structure. Empirical results demonstrate that AlphaSAGE outperforms existing baselines in mining a more diverse, novel, and highly predictive portfolio of alphas, thereby proposing a new paradigm for automated alpha mining. Our code is available at https://github.com/BerkinChen/AlphaSAGE.

2605.18887 2026-05-20 econ.EM econ.GN q-fin.EC stat.AP

Valuing Winners: When and How to Correct for Selection Bias in Randomized Experiments

估值赢家:何时以及如何在随机实验中纠正选择偏差

Ron Berman, Walter W. Zhang, Hangcheng Zhao

AI总结 本文研究了在随机实验中如何纠正选择偏差,区分了全局和选择性赢家诅咒两种形式,并探讨了如何根据管理目标选择合适的方法。

详情
Comments
68 pages
AI中文摘要

决策者经常选择随机实验中表现最好的处理方法,从而产生赢家诅咒:选择倾向于那些观察到的结果较高的处理,部分原因是统计噪声,因此对赢家的简单估计存在向上偏差。我们区分了两种形式的赢家诅咒,即相对于真实最佳处理的偏差(全局)和相对于所选处理真实均值的偏差(选择性),并将它们与部署次优处理的遗憾联系起来。该框架定义了七个决策相关的评估目标:全局和选择性赢家诅咒的均值偏差、均方误差和置信区间覆盖率,以及均值遗憾。然后我们显示,在一种目标上表现良好的方法可能在其他目标上表现不佳,因此纠正措施应与管理目标相匹配。在具有不同效应大小、多臂设置和校准到在线A/B测试平台的数据模拟中,没有方法在所有情况下都占优:插值估计器在处理差异较大的情况下表现最佳,交叉拟合在处理相似时表现最佳,而重采样方法在中等差异时通常能实现较低的均方误差。我们还介绍了一种自适应经验似然程序,该程序在各种情况下都能提供渐近有效的置信区间,而无需重采样方法的调参敏感性。

英文摘要

Decision-makers often deploy the best-performing treatment from a randomized experiment, creating a winner's curse: selection favors treatments whose observed outcomes are high partly because of statistical noise, so the naïve estimate of the winner is upward biased. We distinguish two forms of winner's curse, bias relative to the true best treatment (global) and bias relative to the selected treatment's true mean (selective), and link them to regret from deploying a suboptimal treatment. This framework defines seven decision-relevant evaluation targets: mean bias, mean squared error, and confidence interval coverage for the global and selective winner's curse, and mean regret. We then show that methods that perform well on one target can perform poorly on others, so corrections should be matched to the manager's objective. Across simulations with varying effect sizes, multiple-arm settings, and data calibrated to an online A/B testing platform, no method dominates uniformly: the plug-in estimator performs best when treatment differences are large, cross-fitting performs best when treatments are similar, and resampling methods often achieve low mean squared error for moderate differences. We also introduce an adaptive empirical likelihood procedure that delivers asymptotically valid confidence intervals across settings without the tuning sensitivity of resampling-based methods.

2605.18784 2026-05-20 q-fin.RM cs.AI cs.CR cs.CY econ.GN q-fin.EC

The Insurability Frontier of AI Risk: Mapping Threats to Affirmative Coverage, Silent Exposures, and Exclusions

AI风险的可保险边界:将威胁映射到积极保险、沉默暴露和排除

Alex Leung, Rex Zhang, Ervin Ling, Kentaroh Toyoda, SiewMei Loh

AI总结 本文研究了AI风险在商业保险中的可保险性边界,通过分析55类AI威胁与26种保险产品和排除制度,揭示了四个层次的可保险性前沿:积极保险的风险、沉默AI暴露、主动排除的风险以及传统私人保险结构之外的风险。

详情
AI中文摘要

代理AI的快速扩散为商业保险创造了一个新的覆盖问题:一些AI中介的损失现在被积极保险,一些在传统网络安全、技术错误与遗漏(E&O)、董事与高管(D&O)、雇佣实践责任(EPLI)、犯罪和媒体政策下产生沉默AI暴露,而其他则被积极排除。本文通过编码55类AI威胁与26种保险产品、保证和排除制度,利用公开承运商材料和OWASP/MITRE威胁目录,确定了四个层次的可保险性前沿:积极保险的风险、沉默AI暴露、主动排除的风险以及传统私人保险结构之外的风险。我们的编码测量公开声明的定位,而非执行合同的措辞;头条统计数据描述承运商公开声明的覆盖情况,而非任何具体索赔将支付什么。三个模式显现。首先,积极AI覆盖开始通过主要风险重点进行区分:公开材料通常将慕尼黑再保险定位在模型性能和漂移,Armilla和 Lloyd's 市场部分围绕幻觉和更广泛的AI责任,Tokio Marine Kiln和CFC围绕知识产权和技术E&O关注,Apollo ibott围绕新兴自主系统责任,Coalition围绕深度伪造和AI增强的网络安全响应。其次,传统业务线在AI作为工具而非损失法律原因的情况下保留沉默AI暴露。第三,基础模型集中是清晰的真正新型可保险性前沿,因为上游模型失败可以一次关联多个被保险人损失;相关市场设计问题是每个候选结构放松了哪些可保险性约束,而不是仅仅存在哪种系统性风险模板。

英文摘要

The rapid diffusion of agentic AI has created a new coverage problem for commercial insurance: some AI-mediated losses are now affirmatively insured, some create silent-AI exposure under legacy cyber, technology errors-and-omissions (E&O), directors-and-officers (D&O), employment practices liability (EPLI), crime, and media policies, and others are being actively excluded. This paper maps that emerging boundary by coding 55 AI threat classes against 26 insurance products, endorsements, and exclusion regimes using public carrier materials and OWASP/MITRE threat catalogs. We identify a four-tier insurability frontier: affirmatively insured perils, silent-AI exposures, actively excluded perils, and perils outside conventional private insurance structures. Our coding measures publicly claimed positioning rather than executed contract wording; the headline statistics describe what carriers publicly state about coverage, not what would be paid in any specific claim. Three patterns emerge. First, affirmative AI coverage is beginning to differentiate by primary risk emphasis: public materials often position Munich Re around model performance and drift, Armilla and parts of the Lloyd's market around hallucination and broader AI liability, Tokio Marine Kiln and CFC around IP and technology E&O concerns, Apollo ibott around emerging autonomous system liability, and Coalition around deepfake and AI-enabled cyber response. Second, legacy lines retain silent-AI exposure where AI is an instrumentality rather than the legal cause of loss. Third, foundation model concentration is the clearest genuinely novel insurability frontier because upstream model failure can correlate losses across many cedents at once; the relevant market design question is which insurability constraint each candidate structure relaxes, not merely which systemic risk template exists.

2506.19958 2026-05-20 stat.ME econ.GN q-fin.EC stat.AP stat.CO

RobustiPy: An efficient next generation multiversal library with model selection, averaging, resampling, and explainable artificial intelligence

RobustiPy: 一个高效的下一代多宇宙库,包含模型选择、平均、重采样和可解释人工智能

Daniel Valdenegro, Jiani Yan, Duiyi Dai, Charles Rahal

AI总结 本文提出RobustiPy,一个高效的多宇宙分析库,通过统一的模块化框架整合了重采样推断、组合规范搜索、模型选择与平均、联合推断和可解释AI方法,以提高实证研究的透明度和可重复性。

详情
AI中文摘要

科学推断常常受到广泛但很少探索的“多宇宙”可辩护建模选择的影响,这些选择可以产生与研究现象一样多变的结果。我们介绍了RobustiPy,一个开源的Python库,它系统化地在大规模上进行多宇宙分析和模型不确定性量化。RobustiPy在一个模块化、可重复的框架中统一了基于重采样的推断、组合规范搜索、模型选择和平均、联合推断程序以及可解释的人工智能方法。除了详尽的规范曲线外,它还支持严格的离样验证,并量化每个协变量的边际贡献。我们展示了其在五个模拟设计和十个涵盖经济学、社会学、心理学和医学的实证案例研究中的实用性,包括对广泛引用但有记录差异的发现的重新分析。在约6.72亿次模拟回归上进行基准测试表明,RobustiPy在提高实证研究透明度的同时,实现了最先进的计算效率。通过标准化和加速稳健性分析,RobustiPy改变了研究人员在分析多宇宙中的敏感性探究方式,为更可重复和可解释的计算科学提供了实用基础。

英文摘要

Scientific inference is often undermined by the vast but rarely explored "multiverse" of defensible modelling choices, which can generate results as variable as the phenomena under study. We introduce RobustiPy, an open-source Python library that systematizes multiverse analysis and model-uncertainty quantification at scale. RobustiPy unifies bootstrap-based inference, combinatorial specification search, model selection and averaging, joint-inference routines, and explainable AI methods within a modular, reproducible framework. Beyond exhaustive specification curves, it supports rigorous out-of-sample validation and quantifies the marginal contribution of each covariate. We demonstrate its utility across five simulation designs and ten empirical case studies spanning economics, sociology, psychology, and medicine, including a re-analysis of widely cited findings with documented discrepancies. Benchmarking on ~672 million simulated regressions shows that RobustiPy delivers state-of-the-art computational efficiency while expanding transparency in empirical research. By standardizing and accelerating robustness analysis, RobustiPy transforms how researchers interrogate sensitivity across the analytical multiverse, offering a practical foundation for more reproducible and interpretable computational science.