arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2606.06089 2026-06-05 q-fin.MF econ.GN q-fin.EC q-fin.RM

Leveraging LLMs for Unstructured Claims Data Analysis

利用大语言模型进行非结构化索赔数据分析

Robert D. Lieberthal, Richard Tran, Vietbao Phan, Jawand Singh, Elizabeth Sottung

AI总结提出一个两阶段处理框架，利用大语言模型从非结构化索赔数据中提取结构化精算变量，并通过链梯法准备金验证其实际价值。

详情

Comments: 41 pages, 6 figures, 3 tables. Code available at https://github.com/mdsight/llm-claims-analysis . Technical Specification Requirement included as Appendix D. Funded by the Casualty Actuarial Society Artificial Intelligence Working Group

AI中文摘要

精算师主要依赖结构化数值数据进行准备金和费率制定，而非结构化文本（包括医疗记录、理赔员笔记和通话记录）中包含的有价值预测信息大多未被使用。手动处理这些文档耗时、跨审查员不一致且不可扩展。我们提出了一个概念验证框架，使用大语言模型（LLMs）从非结构化索赔数据中提取结构化精算变量。我们实现了一个两阶段处理架构，将文档级提取（阶段1）与索赔级综合（阶段2）分开。一个模块化的四脚本Python管道处理基于FHIR的合成索赔数据和真实索赔文档，提取了涵盖准备金、费率制定和索赔管理类别的36个精算变量。我们使用两名独立临床专家审查员对20个合成索赔进行五点Likert评分，验证了14个核心变量，平均得分超过4.0，加权kappa为0.53。与链梯法准备金的集成展示了实际精算价值：严重程度分段分析将准备金估计误差从6.5%降低到4.0%。开源实现包括审计轨迹和置信度评分，为财产险中基于LLM的精算变量提取提供了可复现的基础。

英文摘要

Actuaries rely primarily on structured numerical data for reserving and ratemaking, while valuable predictive information in unstructured text including medical records, adjuster notes, and call transcripts remains largely unused. Manual processing of these documents is time-consuming, inconsistent across reviewers, and unscalable. We present a proof-of-concept framework using large language models (LLMs) to extract structured actuarial variables from unstructured claims data. We implement a two-stage processing architecture separating document-level extraction (Stage 1) from claim-level synthesis (Stage 2). A modular four-script Python pipeline processes synthetic FHIR-based claims data and real claims documents, extracting 36 actuarial variables across reserving, ratemaking, and claims management categories. We validate 14 core variables using two independent clinical expert reviewers scoring 20 synthetic claims on a five-point Likert rubric, achieving mean scores above 4.0 and a weighted kappa of 0.53. Integration with chain ladder reserving demonstrates practical actuarial value: severity-segmented analysis reduced reserve estimation error from 6.5% to 4.0%. The open-source implementation includes audit trails and confidence scoring, providing a replicable foundation for LLM-based actuarial variable extraction in property-casualty insurance.

URL PDF HTML ☆

赞 0 踩 0

2606.06059 2026-06-05 econ.TH

Fair Division of a Heterogeneous Good Between Two Agents: An Ordinal Approach

异质商品在两个代理人之间的公平分配：一种序数方法

Mihir Bhattacharya, Ojasvi Khare

AI总结本文在纯序数框架下研究异质商品在两个代理人之间的连续束分配，引入单调偏好类，刻画帕累托有效且无嫉妒的分配条件，并证明不存在同时满足帕累托效率、无嫉妒和策略防护的规则。

2606.05991 2026-06-05 q-fin.GN econ.EM

Forecasting of volatility and risk premia in electricity markets

电力市场波动率与风险溢价的预测

Thomas K. Kloster, Fred Espen Benth

AI总结研究电力市场已实现协方差的预测，通过构建简约矩阵HAR模型，发现纳入更长的时间跨度和可再生能源发电信息能显著提升预测能力，并利用方差预测改进远期市场价差风险溢价的预测。

2606.05705 2026-06-05 econ.EM cs.SE

Econstellar: An Open-Source AI-Augmented Research Engine for Computational Financial Econometrics

Econstellar: 一个用于计算金融计量经济学的开源AI增强研究引擎

Avishek Bhandari

AI总结提出一个开源AI增强研究引擎Econstellar，通过浏览器运行可复现的金融计量分析，并利用AI助手解释结果，旨在缩短研究声明与独立验证之间的距离。

详情

Comments: 13 pages, 1 figure, 3 tables. Open-source code and live demonstration: https://avishekb9.github.io/econstellar/ . JEL: C58, C63, C88, G15

AI中文摘要

将一个有前景的经济学想法转化为可信的实证发现在实践中代价高昂：它需要大量的专业计算，而结果很少以他人可以检查或构建的形式发布。Econstellar是我们的回应。它是一个开放的、公开服务的研究引擎，可以在普通网络浏览器中运行出版级别的金融计量经济学，并解释结果的含义，这样读者不仅可以阅读发现，还可以重新运行它、改变输入，并精确追踪其产生过程。三个选择赋予了该系统特色。繁重的计算被放置在适合的处理器上，而不是强加于不匹配的硬件，这很大程度上是这类分析很少向公众提供的原因。一个人工智能助手选择并解释分析，但从不产生数字，因此它报告的每个数量都是读者可以复现的真实计算。访问者使用的引擎与生成我们已发表研究中的图表的代码相同。我们公开了十七种计量经济学方法，每种方法都报告了经过验证的实时值，并可在公共端点上复现，这些计算在单一纪律下进行：价格被视为非平稳的，所有方法都应用于收益率。该系统还按需重新生成一项关于金融传染的附带研究的头条结果，来自生成该结果的软件包。该平台是一个活跃研究计划的工作核心，涵盖三个软件版本和三个预印本，现在即可免费开源获取，位于一个实时公共地址。我们的目标很简单：缩短研究声明与另一个人能够独立验证它之间的时间。

英文摘要

Turning a promising economic idea into a credible empirical finding is, in practice, an expensive undertaking: it demands a great deal of specialised computation, and the results are seldom released in a form that others can check or build upon. Econstellar is our response. It is an open, publicly served research engine that runs publication-grade financial econometrics from an ordinary web browser and explains what the results mean, so that a reader does not merely read a finding but can re-run it, vary its inputs, and trace exactly how it was produced. Three choices give the system its character. The heavy computation is placed on the processor that suits it, rather than forced onto hardware ill-matched to the task, which is much of the reason analysis of this kind is so rarely served to the public. An artificial-intelligence assistant selects and interprets the analyses but never originates a number, so every quantity it reports is a real computation the reader can reproduce. And the engine a visitor exercises is the same code that produced the figures in our published research. We expose seventeen econometric methods, each reported with a verified live value and reproducible at the public endpoint, computed under a single discipline: prices are treated as non-stationary and all methods are applied to returns. The system also regenerates, on demand, the headline result of an accompanying study of financial contagion, from the package that generated it. The platform is the working core of an active research programme spanning three software releases and three preprints, and it is available now, free and open-source, at a live public address. Our aim is a simple one: to shorten the distance between a research claim and the moment another person can independently verify it.

URL PDF HTML ☆

赞 0 踩 0

2606.05655 2026-06-05 econ.TH cs.GT

Measuring Concentration of Power in Approval Voting Games

衡量批准投票游戏中的权力集中度

Takaaki Abe

AI总结本文提出并刻画了一个衡量单调批准投票游戏中权力集中度的函数，该函数与Deegan-Packel权力指数的平方和成正比，并应用于联合国安理会。

2606.05631 2026-06-05 q-fin.MF econ.GN q-fin.EC

Stress Amplified Resilience: ESG and Joint Fragility in Equity Markets

压力放大韧性：ESG与股票市场的联合脆弱性

Minxuan Hu, Jiayu Yi, Ziheng Chen, Wenxi Sun, Qishi Zhan

AI总结本文通过分析2014-2025年标普500成分股数据，研究ESG是否与较低的市场联合脆弱性（下行收益、波动性、非流动性同时发生）相关，发现ESG在压力时期通过多通道放大韧性，而非提供无条件溢价。

详情

AI中文摘要

市场压力很少通过单一渠道损害投资者。损失、波动性飙升和交易性恶化往往同时发生。我们检验ESG是否与较低的股票市场聚类脆弱性暴露相关。使用2014年至2025年标普500成分股的月度数据，我们研究下行收益、波动性、非流动性以及一个捕捉它们在同一公司月份内同时发生的共脆弱性状态。证据支持压力放大韧性的解释，而非无条件的ESG回报溢价。在回报渠道中，ESG关联集中在压力月份的极端下行尾部。在波动性渠道中，较高的ESG与整体条件疲弱时较小的风险飙升相关。在非流动性渠道中，关联更为持久，表明流动性质量成分的相关性在市场整体交易条件恶化时增加。核心证据来自联合分析：ESG增加一个标准差，将压力时期严重共脆弱性的概率降低0.92个百分点，相对于基线约9%。双机器学习在灵活调整可观测公司特征后显示类似的负ESG关联。支柱证据表明，环境得分具有更强的基线韧性，而社会得分具有更清晰的压力放大效应。总体而言，这些发现将ESG描述为用于尾部风险监控、压力分析和支柱级ESG评估的多通道脆弱性信号。

英文摘要

Market stress rarely harms investors through one channel alone. Losses, volatility spikes, and deteriorating tradability often arrive together. We examine whether ESG is associated with lower exposure to clustered fragility in equity markets. Using monthly data on S&P 500 constituents from 2014 to 2025, we study downside returns, volatility, illiquidity, and a cofragility state that captures their joint occurrence within the same firm month. The evidence supports a stress-amplified resilience interpretation rather than an unconditional ESG return premium. In the return channel, the ESG association is concentrated in the extreme downside tail during stress months. In the volatility channel, higher ESG is associated with smaller risk spikes when aggregate conditions are weak. In the illiquidity channel, the association is more persistent, suggesting a liquidity-quality component whose relevance increases when market-wide trading conditions deteriorate. The central evidence comes from the joint analysis: a one-standard-deviation increase in ESG lowers the stress-period probability of severe cofragility by 0.92 percentage points, about 9% relative to the baseline. Double Machine Learning shows a similar negative ESG association after flexible adjustment for observable firm characteristics. Pillar evidence suggests stronger baseline resilience for Environmental scores and clearer stress amplification for Social scores. Overall, the findings characterize ESG as a multi-channel fragility signal for tail-risk monitoring, stress analysis, and pillar-level ESG assessment.

URL PDF HTML ☆

赞 0 踩 0

2606.05582 2026-06-05 econ.TH

Three characterizations of the weighted center of imputations value

加权中心分配值的三种刻画

Shan Erfang, Liying Kang

AI总结本文通过推广个体理性和子博弈序保持公理，给出了加权中心分配值的三种公理化刻画，统一并扩展了平均分配和标准CIS值作为特例。

2606.05554 2026-06-05 econ.TH

An Irrelevance Theorem for Risk Aversion and Time-Varying Risk

风险厌恶与时变风险的不相关性定理

Andrew Chen, Francisco Palomino

AI总结本文提出一个定理，在跨期与风险偏好分离、一阶与高阶矩驱动因素分离以及约束近似线性的条件下，证明风险厌恶和时变风险对某些状态变量的弹性无影响，并讨论了模型如何通过违反假设来增强风险的作用。

2606.05449 2026-06-05 cs.AI cs.GT econ.EM

Insurance of Agentic AI

代理型人工智能的保险

Quanyan Zhu

AI总结本文分析了代理型AI带来的新型风险，提出了承保、定价、再保险和产品设计的框架，并构建了整合多种保险覆盖的协调架构。

详情

AI中文摘要

代理型人工智能系统通过超越信息生成，扩展到自主规划、工具调用、决策执行以及对数字和物理环境的持续修改，正在改变风险格局。这些能力引入了新的风险敞口，这些敞口并不完全适合传统的保险类别，如网络、职业责任、产品责任或董事及高管责任保险。本文考察了新兴的代理型AI保险市场，并开发了一个框架来理解其承保、定价、再保险和产品设计的影响。我们将代理型AI描述为自主性和授权委托的连续体，强调信息输出与能够通过外部行动独立产生保险事件的系统之间的区别。我们分析了主要风险路径，包括幻觉、提示注入攻击、自主决策错误、模型漂移、依赖故障和网络物理伤害，并评估了现有保险产品如何适应这些风险敞口。本文进一步提出了一个基于风险暴露评估、情景分析、依赖映射和累积风险管理的精算框架，借鉴了网络保险的发展历程。最后，我们提出了一个协调的保险架构，通过明确的分配机制和专门的AI总限额，整合了网络、技术错误与遗漏、产品责任、性能保证以及明确的AI责任保险。分析表明，代理型AI保险的未来不在于单一的单线产品，而在于一个由改进的治理、透明度、遥测和监管清晰度支持的互补覆盖分层生态系统。

英文摘要

Agentic artificial intelligence (AI) systems are transforming the risk landscape by extending beyond information generation to autonomous planning, tool invocation, decision execution, and persistent modification of digital and physical environments. These capabilities introduce novel exposures that do not fit neatly within traditional insurance categories such as cyber, professional liability, product liability, or directors and officers coverage. This paper examines the emerging insurance market for agentic AI and develops a framework for understanding its underwriting, pricing, reinsurance, and product-design implications. We characterize agentic AI as a continuum of autonomy and delegated authority, emphasizing the distinction between informational outputs and systems capable of independently generating insured events through external actions. We analyze major risk pathways, including hallucinations, prompt-injection attacks, autonomous decision errors, model drift, dependency failures, and cyber-physical harms, and evaluate how existing insurance products are adapting to address these exposures. The paper further proposes an actuarial framework based on exposure assessment, scenario analysis, dependency mapping, and accumulation-risk management, drawing parallels to the evolution of cyber insurance. Finally, we present a coordinated insurance architecture that integrates cyber, technology errors and omissions, product liability, performance-warranty, and affirmative AI-liability coverages through explicit allocation mechanisms and dedicated AI aggregates. The analysis suggests that the future of agentic-AI insurance lies not in a single monoline product but in a layered ecosystem of complementary coverages supported by improved governance, transparency, telemetry, and regulatory clarity.

URL PDF HTML ☆

赞 0 踩 0

2606.05383 2026-06-05 econ.GN cs.AI econ.TH q-fin.EC

Can AI Refute Economic Theory? Evidence from Beyond the Knowledge Cutoff

AI能否反驳经济理论？来自知识截止日期之外的证据

Alexis Akira Toda

AI总结本文通过实验测试多个AI模型（Gemini、Refine、Claude和ChatGPT）检查四篇包含错误的经济理论论文，发现ChatGPT Pro表现最佳但无法独立发现错误，表明AI尚不能自主反驳经济理论。

2604.26634 2026-06-05 cs.LG econ.GN q-fin.EC stat.AP

Electricity price forecasting across Norway's five bidding zones in the post-crisis era

在危机后时代跨挪威五个竞价区的电力价格预测

My Thi Diem Phan, Trung Tuyen Truong, Hoai Phuong Ha, Dat Thanh Nguyen

AI总结本文研究了挪威五个竞价区在能源危机后电力价格预测的问题，通过构建多模态数据集并评估了八种预测模型，发现LightGBM在所有区域表现最佳，同时强调了外部特征在不同市场状况下的重要性。

详情

Comments: This version removes variables unavailable at prediction time to eliminate look-ahead leakage, clarifies the forecasting task definition, and updates the results and discussion accordingly. All tables and figures have been recomputed

AI中文摘要

挪威的电力市场长期以来由水电主导，但2021-2022年的能源危机和与欧洲大陆的更强整合已从根本上改变了价格形成机制，降低了基于历史数据校准的预测模型的可靠性。尽管需要更新的模型，但缺乏一个统一的基准来评估所有结构各异的挪威竞价区的特征贡献。本文提出了对Nord Pool市场在所有五个挪威竞价区的一步预测的全面评估。我们构建了一个覆盖2019-2025年的多模态小时数据集，并使用严格因果测试集评估了八种预测模型家族，包括Light Gradient Boosting Machine（LightGBM）、带有外生变量的自回归模型和先进的深度学习架构。我们实现了稳健的滚动起源回测、留一组法特征消融和条件制度分析来分解模型性能和特征效用。我们的结果表明，LightGBM在每个区域都表现最佳，平均绝对误差范围为1.60至5.58欧元每兆瓦时，而一个带有外生变量的岭正则化自回归模型在北部区域仍然是一个高度有竞争力的线性基准。特征消融揭示了仅依赖滞后价格和日历变量的模型能够获得高精度，通常与完整的多模态模型的性能相匹配或接近。然而，条件制度分析显示，外部特征如水库水位和天然气价格在分层预测误差方面至关重要，这些误差在压力市场制度下持续增加。这突显了模型可解释性和制度意识在决策者面对市场动态结构性变化时的实用价值。

英文摘要

Norway's electricity market is heavily dominated by hydropower, but the 2021-2022 energy crisis and stronger integration with Continental Europe have fundamentally altered price formation, reducing the reliability of forecasting models calibrated on historical data. Despite the critical need for updated models, a unified benchmark evaluating feature contributions across all structurally diverse Norwegian bidding zones remains lacking. Here we present a comprehensive evaluation of one-step-ahead forecasting of the Nord Pool market across all five Norwegian bidding zones. We constructed a multimodal hourly dataset spanning 2019-2025 and evaluated eight forecasting model families, including Light Gradient Boosting Machine (LightGBM), autoregressive models with exogenous variables, and advanced deep learning architectures, using a strictly causal test set. We implemented robust rolling-origin backtesting, leave-one-group-out feature ablation, and conditional regime analysis to dissect model performance and feature utility. Our results show that LightGBM achieves the best performance in every zone, with mean absolute error ranging from 1.60 to 5.58 euros per megawatt-hour, while a ridge-regularized autoregressive model with exogenous variables remains a highly competitive linear benchmark in northern zones. Feature ablation reveals that models relying solely on lagged prices and calendar variables achieve high accuracy and often match or closely approach the performance of the full multimodal model. However, conditional regime analysis demonstrates that external features like reservoir levels and gas prices remain crucial to stratify forecast errors, which consistently increase under stressed market regimes. This highlights the practical value of model interpretability and regime awareness for decision makers facing structural changes in market dynamics.

URL PDF HTML ☆

赞 0 踩 0

2511.15427 2026-06-05 econ.EM stat.ME

Tractable Estimation of Nonlinear Panels with Interactive Fixed Effects

可计算的非线性面板模型交互固定效应估计

Andrei Zeleneev, Weisheng Zhang

AI总结本文提出了一种计算高效的方法，用于估计非线性面板模型中的交互固定效应，该方法在理论上等价于Chen等人(2021)提出的估计器，避免了高维非凸优化问题，适用于大规模非线性面板数据。

详情

AI中文摘要

在线性面板模型中，交互固定效应通常被控制。尽管文献中已存在非线性模型的类似固定效应(FE)估计器(Chen, Fernandez-Val和Weidner, 2021)，但其在应用研究中使用有限，因为其实施涉及解决高维非凸问题。本文通过提供一个计算效率高的新估计器来补充Chen等人(2021)的理论分析，该估计器在渐近上等价于他们的估计器。与之前提出的FE估计器不同，我们的估计器避免了解高维非凸优化问题，并且可以在大规模非线性面板中可行计算。我们提出的方法包括两个步骤。第一步是使用核范数正则化(NNR)凸化优化问题，获得参数的初步估计，包括固定效应。然后，我们使用标准梯度下降法在这些初步估计上找到原始优化问题的全局解。为了使我们的方法在实践中易于应用，我们还提出了特定的数值算法来解决涉及的优化问题，建立了其收敛性，并在我们的R包NNRPanel中提供了高效的实现。

英文摘要

Interactive fixed effects are routinely controlled for in linear panel models. While an analogous fixed effects (FE) estimator for nonlinear models has been available in the literature (Chen, Fernandez-Val and Weidner, 2021), it sees much more limited use in applied research because its implementation involves solving a high-dimensional non-convex problem. In this paper, we complement the theoretical analysis of Chen, Fernandez-Val and Weidner (2021) by providing a new computationally efficient estimator that is asymptotically equivalent to their estimator. Unlike the previously proposed FE estimator, our estimator avoids solving a high-dimensional non-convex optimization problem and can be feasibly computed in large nonlinear panels. Our proposed method involves two steps. In the first step, we convexify the optimization problem using nuclear norm regularization (NNR) and obtain preliminary NNR estimators of the parameters, including the fixed effects. Then, we find the global solution of the original optimization problem using a standard gradient descent method initialized at these preliminary estimates. To make our method readily applicable in practice, we also propose specific numerical algorithms for solving the involved optimization problems, establish their convergence, and provide their efficient implementation in our R package NNRPanel.

URL PDF HTML ☆

赞 0 踩 0

2603.20617 2026-06-05 econ.TH

The AI Layoff Trap

AI裁员陷阱

Brett Hemenway Falk, Gerry Tsoukalas

AI总结本文通过竞争性任务模型，研究AI自动化导致的企业间需求外部性如何引发过度裁员，并提出庇古税作为解决方案。

详情

Comments: 63 pages, 4 figures, 2 tables

AI中文摘要

如果AI取代人类工人的速度快于经济重新吸收他们的速度，就有可能侵蚀企业赖以生存的消费者需求。我们表明，仅仅知道这一点并不足以让企业停止这一过程。在一个转型经济的竞争性任务模型中，每个企业获得自动化的全部成本节约，但只承担其在产品市场上创造的需求损失的一小部分；其余部分落在竞争对手身上。这种需求外部性使理性企业陷入自动化军备竞赛，取代工人的程度远超集体最优水平。由此造成的损失既损害工人也损害企业所有者。更多的竞争和“更好”的AI会放大这种过度；工资调整和自由进入无法消除它。资本所得税、员工股权、全民基本收入、技能提升或科斯谈判也无法消除。而庇古式的自动化税可以。结果表明，政策不仅应应对AI劳动力替代的后果，还应解决驱动它的竞争性激励。

英文摘要

If AI displaces human workers faster than the economy can reabsorb them, it risks eroding the very consumer demand firms depend on. We show that knowing this is not enough for firms to stop it. In a competitive task-based model of a transitioning economy, each firm captures the full cost saving from automation but bears only a fraction of the demand loss it creates in the product market; the rest falls on rivals. This demand externality traps rational firms in an automation arms race, displacing workers well beyond what is collectively optimal. The resulting loss harms both workers and firm owners. More competition and ``better'' AI amplify the excess; wage adjustments and free entry cannot eliminate it. Neither can capital income taxes, worker equity, universal basic income, upskilling, or Coasean bargaining. A Pigouvian automation tax can. The results suggest that policy should address not only the aftermath of AI labor displacement but also the competitive incentives that drive it.

URL PDF HTML ☆

赞 0 踩 0

2601.01421 2026-06-05 econ.TH

A measure of choice irrationality based on opposite judgements

基于对立判断的选择非理性度量

Angelo Enrico Petralia

AI总结本文提出一种基于偏好与其对立面之间折衷的选择非理性度量，刻画决策者将偏好中前几个备选方案逆序移至末尾的程度，并给出该度量的充要条件及极端折衷行为的刻画。

详情

Comments: arXiv admin note: substantial text overlap with arXiv:2408.01317

AI中文摘要

在许多选择情境中，决策者采用的标准是其偏好与对立面之间的折衷。根据这种折衷，决策者偏好中排名靠前的i个备选方案被逆序移至底部。这一模式允许定义基于折衷的非理性程度，量化决策者在选择中所采纳的折衷程度。本文给出了刻画该指数的充要条件。我研究了显示最低非理性程度的不可理性化选择，并完全识别了通过最小化对立判断之间的折衷来解释决策者选择的偏好。这些数据集涵盖了一些已知的选择偏差，如次优程序和避免劣势选项。我提供了表现出最严重折衷的选择行为的简单刻画，并证明该子类几乎包含所有选择。最后，刻画了其他几种折衷度量，并与先前确定的得分进行了比较。

英文摘要

In many choice settings the decision maker (DM) adopts a criterion which is a mediation between her preference, and its opposite. According to such compromise, the first i alternatives on top of the DM's taste are moved, in reverse order, to the bottom. This pattern allows to define the compromise-based degree of irrationality, which quantifies the extent of the mediation embraced by the DM in her choice. Necessary and sufficient conditions characterizing this index are singled out. I investigate non rationalizable choices that display the lowest degree of irrationality, and I fully identify the preferences that explain the DM's picks by a minimal mediation between opposite judgments. These datasets account for some well known selection biases, such as second-best procedures, and the handicapped avoidance. I offer a simple characterization of the choice behavior that exhibits the most severe compromise, and I show that this subclass comprises almost all choices. Finally, some alternative measures of compromise are characterized, and compared with the score previously determined.

URL PDF HTML ☆

赞 0 踩 0

2509.11381 2026-06-05 math.ST econ.EM stat.ME stat.ML stat.TH

Accuracy Limits of Causal Trees for Individualized Treatment Effects

因果树在个体化治疗效果中的精度极限

Matias D. Cattaneo, Jason M. Klusowski, Ruiqi Rae Yu

AI总结本文研究了基于自适应递归划分的因果树估计器，推导了其估计精度的下界，指出即使在随机分配的常数效应基准下，标准CART型划分规则构建的因果树在样本空间上的一致误差仍可能以比样本规模的任何幂更慢的速度下降，且样本划分无法消除这一限制。

详情

AI中文摘要

递归决策树被广泛用于估计实验和观察研究中的异质因果治疗效应。这些方法通常使用CART型递归划分实现，其划分标准旨在识别在协变量定义的子组中治疗效应的变化。我们研究了基于自适应递归划分的因果树估计器，并建立了其估计精度的下界。我们分析的类别包括基于常见治疗效应和平方误差划分标准的版本，有和无样本划分。即使在常数效应基准下，使用标准CART型划分规则构建的因果树的一致误差仍可能比样本规模的任何幂更慢地下降。其根本机制是贪心递归划分选择具有非消失概率的高度不平衡划分，产生包含非常少观测的终端节点，导致估计方差较大。我们进一步表明，样本划分（通常称为“诚实”）无法消除这一限制。因此，因果树估计器可能在协变量空间上以任意慢的速度收敛。同时，这些估计器可以具有小的积分均方误差，表明平均准确性可以掩盖局部不准确性。我们的结果也澄清了现有因果森林及相关集成方法的理论保证中平衡划分假设的作用。

英文摘要

Recursive decision trees are widely used to estimate heterogeneous causal treatment effects in experimental and observational studies. These methods are typically implemented using CART-type recursive partitioning, with splitting criteria designed to identify variation in treatment effects across covariate-defined subgroups. We study causal tree estimators based on adaptive recursive partitioning and establish lower bounds on their estimation accuracy. The class we analyze includes versions with and without sample splitting, based on common treatment effect and squared-error splitting criteria. Even in a constant-effect benchmark with randomized treatment assignment, causal trees constructed via standard CART-type splitting rules can have uniform-norm errors that decrease more slowly than any power of the sample size. The underlying mechanism is that greedy recursive partitioning selects highly imbalanced splits with nonvanishing probability, producing terminal nodes containing very few observations and leading to large estimation variance. We further show that sample splitting, often called ``honesty,'' does not remove this limitation. As a consequence, causal tree estimators may converge arbitrarily slowly uniformly over the covariate space. At the same time, these estimators can have small integrated mean squared error, showing that average accuracy can mask local inaccuracy. Our results also clarify the role of balanced partition assumptions in existing theoretical guarantees for causal forests and related ensemble methods.

URL PDF HTML ☆

赞 0 踩 0

2502.14131 2026-06-05 cs.LG cs.AI econ.EM

An Empirical Risk Minimization Approach for Offline Inverse RL and Dynamic Discrete Choice Model

一种用于离线逆强化学习和动态离散选择模型的经验风险最小化方法

Enoch H. Kang, Hema Yoganarasimhan, Lalit Jain

AI总结本文提出了一种基于经验风险最小化（ERM）的逆强化学习/动态离散选择模型框架，该方法无需显式估计贝尔曼方程中的状态转移概率，适用于高维和无限状态空间，并在理论上有Polyak-Lojasiewicz条件的支持，从而保证了快速的全局收敛性。

详情

AI中文摘要

我们研究了估计动态离散选择（DDC）模型的问题，也称为机器学习中的离线最大熵正则化逆强化学习（离线MaxEnt-IRL）。目标是从离线行为数据中恢复支配代理行为的奖励或Q*函数。在本文中，我们提出了一种全局收敛的基于梯度的方法来解决这些问题，而无需线性参数化的奖励假设。我们的方法的创新之处在于引入了基于经验风险最小化（ERM）的IRL/DDC框架，该框架避免了在贝尔曼方程中显式估计状态转移概率的需要。此外，我们的方法与非参数估计技术如神经网络兼容。因此，所提出的方法有潜力扩展到高维、无限状态空间。我们方法的一个关键理论洞察是贝尔曼残差满足Polyak-Lojasiewicz（PL）条件--一个属性，虽然比强凸性弱，但足以保证快速的全局收敛保证。通过一系列合成实验，我们证明我们的方法在性能上始终优于基准方法和最先进的替代方法。

英文摘要

We study the problem of estimating Dynamic Discrete Choice (DDC) models, also known as offline Maximum Entropy-Regularized Inverse Reinforcement Learning (offline MaxEnt-IRL) in machine learning. The objective is to recover reward or $Q^*$ functions that govern agent behavior from offline behavior data. In this paper, we propose a globally convergent gradient-based method for solving these problems without the restrictive assumption of linearly parameterized rewards. The novelty of our approach lies in introducing the Empirical Risk Minimization (ERM) based IRL/DDC framework, which circumvents the need for explicit state transition probability estimation in the Bellman equation. Furthermore, our method is compatible with non-parametric estimation techniques such as neural networks. Therefore, the proposed method has the potential to be scaled to high-dimensional, infinite state spaces. A key theoretical insight underlying our approach is that the Bellman residual satisfies the Polyak-Lojasiewicz (PL) condition -- a property that, while weaker than strong convexity, is sufficient to ensure fast global convergence guarantees. Through a series of synthetic experiments, we demonstrate that our approach consistently outperforms benchmark methods and state-of-the-art alternatives.

URL PDF HTML ☆

赞 0 踩 0

2511.15891 2026-06-05 econ.EM

Heterogeneity in peer effects for binary outcomes

二元结果中同伴效应的异质性

Mathieu Lambotte

AI总结本文基于不完全信息的同时网络博弈结构模型，探讨了同伴效应中异质性的存在，证明了均衡的唯一性和异质同伴效应参数的可识别性，并通过中学吸烟和饮酒数据展示了假设同质偏好会导致同伴效应估计偏差和政策评估偏差。

2410.23587 2026-06-05 econ.EM q-fin.CP stat.CO

Moments by Integrating the Moment-Generating Function

通过积分动差生成函数计算动差

Peter Reinhard Hansen, Chen Tong

AI总结本文提出了一种通用积分框架，用于在满足显式正则条件的情况下，从动差生成函数计算分数、复数、绝对和对数动差。通过沿垂直轮廓评估复数扩展的动差生成函数，获得精确的积分表达式，从而避免了显式概率密度和高阶导数的需要。

2504.03766 2026-06-05 econ.TH

Renewable Natural Resources, Regime Shift and Hysteresis

可再生自然资源、制度变迁与滞后效应

Ted To

AI总结本文研究存在临界点的可再生资源最优收获问题，发现高繁殖力稳态可能不稳定，且低繁殖力稳态存在时会出现内生临界点。

详情

Comments: added more expansive discussion of existing literature

AI中文摘要

世界上许多可再生资源正在减少。关于平滑补充的最优收获已有充分研究，但近年来生态学家得出结论，补充中的临界点很常见。具有临界点的补充在临界点以下繁殖力低，在临界点以上繁殖力高。当高繁殖力的折现值足够高时，存在一个高繁殖力稳态。该稳态是稳定的，但在某些情况下，小的扰动可能导致补充和收获的大幅暂时减少。在临界点以下，低繁殖力稳态不一定存在。当低繁殖力稳态确实存在时，存在一个内生临界点（Skiba点）：低于该点，收获收敛到低繁殖力稳态；高于该点，严格的收获政策将可再生资源过渡到高繁殖力补充。如果补充存在滞后效应，则高稳态可能不稳定。此外，如果高/低繁殖力差异较大，则在向下扰动后，繁殖力最优地保持较低水平。

英文摘要

Many of the world's renewable resources are in decline. Optimal harvests with smooth recruitment is well studied but in recent years, ecologists have concluded that tipping points in recruitment are common. Recruitment with a tipping point has low-fecundity below the tipping point and high-fecundity above. When the discounted value of high-fecundity is sufficiently high, there is a high-fecundity steady-state. This steady-state is stable but in some cases, small perturbations may result in large, temporary reductions in recruitment and harvests. Below the tipping point, a low-fecundity steady-state need not exist. When a low-fecundity steady-state does exist, there is an endogenous tipping (Skiba) point: below, harvests converge to the low-fecundity steady-state and above, an austere harvest policy transitions the renewable resource to high-fecundity recruitment. If there is hysteresis in recruitment, the high steady-state may not be stable. Moreover, if the high-/low-fecundity differential is large then following a downward perturbation, fecundity optimally remains low.

URL PDF HTML ☆

赞 0 踩 0

2204.00473 2026-06-05 econ.EM

Finite Sample Inference in Incomplete Models

不完整模型中的有限样本推断

Lixiong Li, Marc Henry

AI总结本文提出了一种在有限样本中对不完整模型参数进行推断的方法，通过逆向检验来构造置信区域，该方法将蒙特卡洛检验推广到不完整模型，并利用离散最优传输公式来构造检验统计量，同时通过模拟数据和线性规划问题计算临界值。

详情

Comments: JEL codes: C15, C57, C61

AI中文摘要

我们提出了一种置信区域，用于在有限样本中对不完整模型的参数进行推断，确保真实参数的精确覆盖。我们的置信区域通过逆向检验构造，该检验将蒙特卡洛检验推广到不完整模型。检验统计量是新结构模型最优传输公式的一种离散类比。检验统计量和临界值依赖于对潜在变量分布的模拟抽样，并通过解决离散最优传输问题（即线性规划问题）来计算。我们还提出了一种快速的参数空间初步搜索方法，基于一个参数无关的保守但一致的检验。我们通过回归模型中的区间值回归器的模拟比较了我们方法的大小和功效，并最终将我们的方法应用于Ciliberto等人[2021]中的航空业进入和价格竞争模型。

英文摘要

We propose confidence regions for the parameters of incomplete models with exact coverage of the true parameter in finite samples. Our confidence region inverts a test, which generalizes Monte Carlo tests to incomplete models. The test statistic is a discrete analogue of a new optimal-transport formulation of the structural model. Both test statistic and critical values rely on simulation draws from the distribution of latent variables and are computed using solutions to discrete optimal transport, hence linear programming problems. We also propose a fast preliminary search in the parameter space with an alternative, more conservative yet consistent test, based on a parameter-free critical value. We compare size and power of our procedure with competing proposals in simulations based on a regression with interval valued regressors. Finally, we apply our methodology to the model of airline entry and price competition in Ciliberto et al. [2021].

URL PDF HTML ☆

赞 0 踩 0

2508.19006 2026-06-05 q-fin.PR cs.LG econ.EM q-fin.CP

Is attention truly all we need? An empirical study of asset pricing in pretrained RNN sparse and global attention models

注意力真的全部我们需要吗？对预训练RNN稀疏和全局注意力模型在资产定价中的实证研究

Shanyan Lai

AI总结本文研究了预训练RNN注意力模型在资产定价中的应用，探讨了注意力机制在捕捉时间依赖性和长期记忆方面的改进，以及在不同市场条件下的稳定性。

详情

Comments: 72 pages including appendix

AI中文摘要

本研究探讨了主流注意力机制，如加权注意力、Luong的三种注意力、全局自注意力和滑动窗口稀疏注意力，在顶级420只大型美国股票上的实证资产定价研究。这是首次将大规模最先进的（SOTA）注意力机制应用于资产定价领域。这些模型克服了传统机器学习资产定价方法的局限性，如误捕时间依赖性和短期记忆。此外，注意力机制中的强制因果掩码解决了未来数据泄漏问题，而这一问题被更先进的注意力模型如经典Transformer所忽视。所提出的注意力模型还考虑了资产定价数据的时间稀疏性，并通过部署简化模型结构来缓解潜在的过拟合问题。本文为未来实证经济研究提供了某些见解。所有模型均在三个时期内进行测试，涵盖新冠前、新冠期间和新冠后一年，以测试这些模型在极端市场条件下的稳定性。研究发现，在价值加权投资组合回测中，全局自注意力模型和滑动窗口稀疏注意力模型在获得绝对收益和对冲下行风险方面表现出色，在新冠期间静态交易成本情景下，它们分别实现了2.0和1.80的年化Sortino比率。此外，从绝对投资组合收益的角度来看，滑动窗口稀疏注意力模型在股票市值大小方面比全局自注意力模型表现更加稳定。

英文摘要

This study investigates the pre-trained RNN attention models with the mainstream attention mechanisms, such as additive attention, Luong's three attentions, global self-attention and sliding window sparse attention, for the empirical asset pricing research on the top 420 large-cap US stocks. This is the first paper on the large-scale state-of-the-art (SOTA) attention mechanisms applied in the asset pricing context. They overcome the limitations of the traditional machine learning-based asset pricing, such as mis-capturing the temporal dependency and short memory. Moreover, the enforced causal masks in the attention mechanisms address the future data leaking issue ignored by the more advanced attention-based models, such as the classic Transformer. The proposed attention models also consider the temporal sparsity characteristic of asset pricing data and mitigate potential overfitting issues by deploying the simplified model structures. This provides some insights for future empirical economic research. All models are examined in three periods, which cover pre-COVID-19, COVID-19 and one year post-COVID-19, for testing the stability of these models under extreme market conditions. The study finds that in value-weighted portfolio back testing, the global self-attention model and the sliding window sparse attention model exhibit excellent capabilities in deriving the absolute returns and hedging downside risks, while they achieve an annualized Sortino ratio of 2.0 and 1.80 respectively in the period with COVID-19 in the static transaction cost scenario. Moreover, the sliding window sparse attention model performs more stably than the global self-attention model from the perspective of absolute portfolio returns with respect to the size of stocks' market capitalization.

URL PDF HTML ☆

赞 0 踩 0

2412.04605 2026-06-05 econ.EM stat.ML

Semiparametric Bayesian Difference-in-Differences

半参数贝叶斯差分-in-差分

Christoph Breunig, Ruixuan Liu, Zhengfei Yu

AI总结本文研究了在差分-in-差分研究设计中半参数贝叶斯推断平均处理效应（ATT）的方法，提出了两种新的贝叶斯方法并证明了其频率有效性。

2411.19431 2026-06-05 econ.TH

Money Burning Improves Mediated Communication

货币燃烧改善中介通信

Yi Liu, Yang Yu

AI总结本文研究了通过货币燃烧改善战略通信的可能性，发现只有在中间承诺下才能实现，中介可通过成本性信息约束偏差并使信息可信，当预算充足时，增加燃烧预算能提高发件人收益，除非中介通信崩溃为廉价交谈，框架还澄清了通过智能合约和Web 3.0实现的承诺。