arXivDaily arXiv每日学术速递 周一至周五更新
重置
2605.23706 2026-05-25 econ.EM

Algorithm or Creative? A Three-Arm Experimental Design for Decomposing Algorithmic Bias in Platform A/B Tests

算法还是创意?用于分解平台A/B测试中算法偏差的三臂实验设计

Pallavi Pal, Anjana Susarla

AI总结 本文研究了在线广告平台A/B测试中算法偏差的分解问题,提出了一种三组实验设计,以区分算法推荐和创意内容对受众触达的影响。该方法通过控制创意内容不变,仅改变算法的处理信息,从而识别出算法的间接效应和创意的直接效应,避免了传统两组实验中因受众选择而产生的偏差。实证研究表明,算法对受众触达的影响远大于创意内容,传统两组实验可能低估算法效应达两倍之多。

详情
AI中文摘要

在线广告平台托管着数十万个A/B测试,但平台的投放算法将每个创意路由到它预测会互动的受众。因此,每个两臂测试都混淆了创意的效果与算法的定向响应,而针对实际受众进行调整是有偏的,因为受众是处理后中介变量。我们提出了一种三臂设计,添加一个臂,在保持用户侧创意与对照组相同的同时,向算法暴露处理元数据,从而在不依赖序贯可忽略性的情况下点识别出自然间接(算法)和直接(创意)效应。在一个针对女性的文本片段的Meta广告活动中,算法渠道将女性曝光份额提高了+2.07个百分点,而创意渠道使其变动了-0.68个百分点;大约四分之三的绝对重新分配来自算法,而传统的两臂测试将算法渠道低估了约两倍。该设计隔离了平台算法对结果的影响,该影响与创意内容可分离。

英文摘要

Online advertising platforms host hundreds of thousands of A/B tests, but the platform's delivery algorithm routes each creative to the audience it predicts will engage. Every two-arm test therefore conflates the creative's effect with the algorithm's targeting response, and adjusting for the realized audience is biased because audience is a post-treatment mediator. We propose a three-arm design that adds an arm exposing the algorithm to the treatment metadata while holding the user-facing creative identical to control, point-identifying the natural indirect (algorithmic) and direct (creative) effects without sequential ignorability. In a live Meta campaign with a women-targeted text fragment, the algorithmic channel raises female impression share by +2.07 ppt while the creative channel moves it by -0.68 ppt; roughly three-quarters of the absolute reallocation is algorithmic, and a conventional two-arm test understates the algorithmic channel by a factor of two. The design isolates the contribution of platform's algorithm to the outcome which is separable from creative content.

2605.23703 2026-05-25 econ.EM

Dynamic Consumer Demand at Large Scale

大规模动态消费者需求

Daniel Brunner, Florian Heiss, Anna B. Schmidt

AI总结 本文研究了大规模零售环境中消费者在多品类、多产品情况下的动态需求,尤其关注重复购买行为。为解决传统离散选择模型在高维环境下计算困难的问题,作者提出了一种基于共享潜在因子结构的动态产品级因子模型,能够捕捉消费者在基准偏好、价格敏感性和惯性方面的异质性。该模型通过因子分解个体-产品系数,实现了跨个体和品类的信息共享,并采用贝叶斯变分推断进行估计,具有良好的可扩展性。实验表明,该模型在预测性能上显著优于静态因子模型和混合逻辑模型,尤其在个体购买历史稀疏时表现更优,同时强调了惯性对需求弹性估计的重要影响。

详情
AI中文摘要

我们研究了大规模零售环境中的消费者需求,其中涉及众多产品、多个类别和重复购买行为。尽管惯性和品牌忠诚度已被充分记录,但现有的离散选择模型通常专注于单一类别,或在高维环境中变得计算上不可行。我们提出了一个动态产品级因子模型,通过共享的潜在因子结构捕捉基线偏好、价格敏感性和惯性的异质性。通过对个体-产品系数进行因子分解,该模型汇集了跨个体和类别的信息,并允许相关异质性。我们使用贝叶斯变分推断估计模型,实现了具有数万个参数的可扩展估计。在基于真实零售数据校准的模拟研究中,我们展示了动态因子模型相对于静态因子模型和混合Logit基准在预测性能上的显著提升,尤其是在个体购买历史稀疏的情况下。考虑惯性还会导致更富有弹性的需求估计,强调了动态性对于衡量消费者响应的重要性。我们的结果突显了动态因子模型作为现代高维零售市场中需求估计的可扩展且灵活的方法。

英文摘要

We study consumer demand in large-scale retail settings with many products, multiple categories and repeated purchase behavior. While inertia and brand loyalty are well documented, existing discrete choice models typically focus on single categories or become computationally infeasible in high-dimensional environments. We propose a dynamic product-level factor model that captures heterogeneity in baseline preferences, price sensitivity and inertia through a shared latent factor structure. By factorizing individual-product coefficients, the model pools information across individuals and categories and allows for correlated heterogeneity. We estimate the model using Bayesian variational inference, enabling scalable estimation with tens of thousands of parameters. In a simulation study calibrated to realistic retail data, we show that the dynamic factor model substantially improves predictive performance relative to static factor models and mixed logit benchmarks, particularly when individual purchase histories are sparse. Accounting for inertia also leads to more elastic demand estimates, underscoring the importance of dynamics for measuring consumer responsiveness. Our results highlight dynamic factor models as a scalable and flexible approach for demand estimation in modern, high-dimensional retail markets.

2410.22443 2026-05-25 econ.GN q-fin.EC

What Do Bitcoin Premiums Measure? Evidence from Global P2P Markets

比特币溢价衡量什么?来自全球P2P市场的证据

Yanan Niu

AI总结 本文研究了比特币在点对点(P2P)市场中的溢价所反映的经济含义。利用LocalBitcoins的交易数据,作者构建了80种货币相对于美元的比特币溢价,并将其与区块链交易条件、中心化加密货币市场状况、跨境支付摩擦以及外汇市场进行关联分析。研究发现,这些溢价既反映了加密货币市场内部的交易摩擦,也体现了本地跨境支付渠道的限制,尤其在资本管制严格或汇率制度不灵活的国家更为显著。此外,比特币溢价还包含了对未来货币贬值的预测信息,表明其在一定程度上反映了市场对汇率走势的预期。

详情
Comments
Major revision with new title; substantially revised framing, empirical analysis, and results. Replaces the previous version
AI中文摘要

本文研究比特币(BTC)在点对点(P2P)市场中的溢价衡量什么。利用LocalBitcoins的交易级数据,我们构建了80种货币相对于美元的BTC溢价,并将其与区块链交易条件、中心化加密市场(CEX)条件、跨境支付摩擦以及外汇(FX)市场联系起来。我们表明,这些溢价既反映了加密市场内部的交易摩擦,也反映了本地在跨境支付准入方面的摩擦。它们随区块链条件和更广泛的加密市场条件(包括BTC收益和波动性)系统性地变化,并且在面临传统跨境支付渠道更大摩擦的国家中更大。这种模式在具有约束性制度约束(即严格的资本管制和非浮动汇率制度)的经济体中尤为明显,与更依赖P2P加密市场作为替代跨境支付渠道的情况一致。我们进一步表明,上升的外汇压力主要通过价格而非交易量吸收,并且P2P BTC溢价预测随后的官方汇率贬值。尽管不同国家的溢价水平不同,但其预测内容大致相似。总体而言,P2P BTC溢价反映了加密交易场所之间的套利限制,特别是在正式跨境支付渠道受到更多限制的地方,并且它们还嵌入了关于货币贬值的远期信息。

英文摘要

This paper studies what Bitcoin (BTC) premiums in peer-to-peer (P2P) markets measure. Using transaction-level data from LocalBitcoins, we construct BTC premiums for 80 currencies relative to the U.S. dollar and relate them to blockchain transaction conditions, centralized crypto market (CEX) conditions, cross-border payment frictions, and foreign exchange (FX) markets. We show that these premiums reflect both trading frictions within crypto markets and local frictions in access to cross-border payments. They vary systematically with blockchain conditions and broader crypto market conditions, including BTC returns and volatility, and they are larger in countries facing greater frictions in conventional cross-border payment channels. This pattern is especially pronounced in economies with binding institutional constraints, i.e., tight capital controls and non-floating exchange-rate regimes, consistent with greater reliance on P2P crypto markets as an alternative cross-border payment channel. We further show that rising FX pressure is absorbed mainly through prices rather than trading volumes, and that P2P BTC premiums predict subsequent official exchange rate depreciation. Although premium levels differ across countries, their predictive content remains broadly similar. Overall, P2P BTC premiums reflect limits to arbitrage across crypto trading venues, especially where formal cross-border payment channels are more constrained, and they also embed forward-looking information about currency depreciation.

2605.23616 2026-05-25 econ.GN q-fin.EC

Value-focused modelling to generate alternatives -- Coupling multi-criteria decision analysis and optimisation models to support strategic decisions

价值导向的替代方案生成——耦合多准则决策分析与优化模型以支持战略决策

Emily Bergup, Jonas Finke, Sebastian Schär, Valentin Bertsch

AI总结 该研究提出了一种名为“价值导向型生成替代方案建模”(VF-MGA)的新方法,旨在解决能源系统规划中复杂战略决策的支持问题。该方法将多准则决策分析(MCDA)与生成替代方案建模(MGA)双向耦合,通过整合利益相关者的价值偏好,引导替代方案的生成,并基于这些偏好对方案进行评估,从而提升决策的参与性和合理性。研究通过大学校园低碳能源供应案例验证了该方法的有效性,展示了其在生成多样化方案、识别利益相关者偏好及支持系统选项选择方面的优势。

详情
AI中文摘要

运筹学中的决策支持方法被广泛用于支持复杂的规划决策。在能源领域,应用建模生成替代方案(MGA)的能源系统模型(ESM)能够生成大量近优且不同的系统配置。然而,它们通常在模型变量空间中生成和分析替代方案,未能确保利益相关者的相关性。相比之下,多准则决策分析(MCDA)提供了一种结构化的方法来处理相互冲突的目标和异质的利益相关者利益,但通常依赖于有限的预定义替代方案,这些方案可能无法充分代表可行的解空间。为解决这些局限性,本文提出了价值导向的建模生成替代方案(VF-MGA),这是一种双向耦合MGA和MCDA的新方法。在MCDA中引出的利益相关者目标为MGA算法提供信息,从而实现替代方案的利益相关者导向多样化,随后根据引出的利益相关者偏好,在MCDA中对这些替代方案进行评估,从而提供全面的决策基础。应用于一个大型大学校园脱碳能源供应的案例研究,涉及代表不同机构群体的11位利益相关者,VF-MGA(i)系统地将利益相关者目标整合到生成691个反映利益相关者相关利益的替代方案中,(ii)通过基于MCDA的评估,从这一大型集合中识别出利益相关者相关的替代方案,以及(iii)通过评估大量多样化的替代方案,提供更差异化的利益相关者偏好信息,从而揭示系统选项的可接受范围。由此,VF-MGA为复杂规划决策提供了一种可推广的方法,将定量建模与参与式决策分析相结合。

英文摘要

Decision support methods from operations research are widely used to support complex planning decisions. Within the energy sector, energy system models (ESMs) applying modelling to generate alternatives (MGA) generate large sets of near-optimal, different system configurations. However, they typically generate and analyse alternatives in the model variable space without ensuring stakeholder relevance. Multi-criteria decision analysis (MCDA), in contrast, provides a structured means to account for conflicting objectives and heterogeneous stakeholder interests but often relies on a limited set of pre-defined alternatives that may not appropriately represent the feasible solution space. To address these limitations, this work proposes value-focused modelling to generate alternatives (VF-MGA), a novel methodology that bidirectionally couples MGA and MCDA. Stakeholder objectives elicited within the MCDA inform the MGA-algorithm, enabling a stakeholder-orientated diversification of the alternatives, which are subsequently evaluated within the MCDA based on elicited stakeholder preferences, thereby providing a comprehensive decision basis. Applied to a case study on the decarbonised energy supply of a large university campus, involving eleven stakeholders representing diverse institutional groups, VF-MGA (i) systematically integrates stakeholder objectives into the generation of 691 alternatives reflecting stakeholder-relevant interests, (ii) enables the identification of stakeholder-relevant alternatives from this large set through MCDA-based evaluation, and (iii) provides more differentiated stakeholder preference information by evaluating a large and diverse set of alternatives, thereby revealing acceptable ranges for system options. With this, VF-MGA provides a generalisable methodology for complex planning decision integrating quantitative modelling with participatory decision analysis.

2605.23539 2026-05-25 econ.EM

Process Utility in High-Stakes Competition

高风险竞争中的过程效用

Arnaud Dupuy

AI总结 本文研究了在高风险战略决策中,个体如何权衡结果(“什么”)与过程(“如何”)的效用,以职业网球比赛为例。通过最优条件和第二发规则,作者推导出过程效用权重的非参数下界为正的充分条件,并发现大多数选手对过程效用持正向评价。进一步构建了结构模型,揭示了选手在提升过程效用时会系统性地牺牲获胜概率,这对比赛结果和预期收益具有显著的经济影响。

详情
AI中文摘要

我们研究个体在高风险战略决策(即职业网球)中如何权衡结果(“什么”)和过程(“如何”)效用。利用最优性条件和二阶发球规则,我们推导出过程效用权重为正的非参数下界的充分条件。在温和的形状限制下,高频数据表明大多数球员可能正向评价过程效用。然后,我们开发了一个结构模型,恢复球员对结果和过程的特定偏好。估计显示,球员系统性地牺牲成功概率以增加过程效用,这对比赛结果和预期收入具有经济意义上的重要影响。

英文摘要

We study how individuals trade off outcome ("what") and process ("how") utility in high-stakes strategic decisions, namely professional tennis. Using optimality conditions and the second-service rule, we derive a sufficient condition for the nonparametric lower bound on the weight of process utility to be positive. Under mild shape restrictions, the high-frequency data indicate that most players likely value process utility positively. We then develop a structural model that recovers player-specific preferences over outcomes and processes. Estimates show that players systematically sacrifice success probabilities to increase process utility, with economically meaningful consequences for match outcomes and expected earnings.

2605.23400 2026-05-25 econ.GN q-fin.EC

De-risking renewable energy investments: Assessing contract design and project finance using operational wind park data

降低可再生能源投资风险:利用运营风电场数据评估合同设计与项目融资

Jorge Sánchez Canales, Lion Hirth

AI总结 该研究探讨了如何通过设计公共合同来降低可再生能源项目投资的风险,并分析其对融资效果的影响。研究利用德国63个陆上风电场的高精度发电数据,模拟了不同类型的合同差价(CfD)对项目现金流、债务能力及电力平准化成本的影响。结果表明,金融型CfD在风险对冲效果上与传统双边CfD相当,揭示了合同设计在平衡收益稳定与市场整合效率中的关键作用,为可再生能源政策制定提供了重要参考。

详情
Comments
Submitted to Journal of Corporate Finance. 28 pages. 9 figures
AI中文摘要

可再生能源发电投资高度资本密集,因此强烈依赖于融资条件。在欧洲,大部分此类投资发生在类似于长期公共合同的公共支持计划下,例如上网电价(FiTs)和差价合约(CfDs)。这些合同不仅补贴可再生能源发电,还通过减少对电价波动的暴露来稳定项目现金流,从而提高债务能力并降低融资成本。同时,它们可能因削弱对批发市场价格信号的暴露而扭曲运营和投资激励。本文研究替代性公共合同设计如何降低收入风险,以及这如何转化为融资结果。利用2013-2024年覆盖德国63个陆上风电场的每小时涡轮级发电的新数据集,我们模拟了双向CfD、单向CfD和金融CfD下的项目现金流。然后,我们基于保守的偿债覆盖率(DSCR)约束的项目融资模型,评估它们对现金流波动性、债务能力和平准化电力成本的影响。我们发现金融CfD提供的对冲性能与传统的双向CfD相当。结果表明,通常假设的收入稳定与高效市场整合之间的权衡并非固有,而是取决于合同设计。更广泛地说,公共合同可以替代缺失的长期对冲市场。这些结果对可再生能源支持计划的设计具有直接的政策意义。

英文摘要

Investment in renewable electricity generation is highly capital intensive and therefore strongly dependent on financing conditions. In Europe, much of this investment has occurred under public support schemes that resemble long-term public contracts such as feed-in tariffs (FiTs) and contracts-for-differences (CfDs). These contracts not only subsidize renewable generation but also stabilize project cash flows by reducing exposure to electricity price volatility, thereby improving debt capacity and lowering financing costs. At the same time, they may distort operational and investment incentives by weakening exposure to wholesale market price signals. This paper studies how alternative public contract designs reduce revenue risk and how this translates into financing outcomes. Using a novel dataset of hourly turbine-level generation covering 63 German onshore wind parks over the period 2013-2024, we simulate project cash flows under two-sided CfDs, one-sided CfDs, and financial CfDs. We then evaluate their implications for cash-flow volatility, debt capacity, and the levelized cost of electricity using a project finance model based on a conservative debt-service coverage ratio (DSCR) constraint. We find that financial CfDs provide hedging performance comparable to conventional two-sided CfDs. The results suggest that the commonly assumed trade-off between revenue stabilization and efficient market integration is not inherent but depends on contract design. More broadly, public contracts can substitute missing long-term hedging markets. These results have direct policy implications for the design of renewable energy support schemes.

2605.23159 2026-05-25 econ.GN cs.AI q-fin.EC

Generative AI and the Reorganization of Labor Demand

生成式AI与劳动力需求的重组

Fangyan Wang, Zaiyan Wei, Yang Wang

AI总结 本文研究生成式人工智能(AI)对劳动力需求的重塑影响,探讨企业在技术扩散过程中如何调整招聘岗位和岗位任务结构。通过构建基于美国全行业招聘广告数据的动态暴露度指标,研究发现,生成式AI的暴露程度随时间变化,并非固定不变;企业主要通过岗位间的招聘调整(占52%)和岗位内部任务重构(占39.5%)来适应AI技术,且不同层级岗位的调整路径存在差异。研究揭示了劳动力市场对生成式AI的适应过程是组织结构和任务架构的重新配置。

详情
AI中文摘要

生成式人工智能(AI)预计将改变工作方式,但关于随着技术扩散,企业如何重组劳动力需求的研究尚不充分。现有研究主要关注哪些职业暴露于AI或暴露的工作是否减少。我们通过考察企业是否通过改变招聘地点、工作内容或两者兼而有之来调整,扩展了这一讨论。利用覆盖美国经济所有部门的全国职位发布数据集,我们通过两阶段大语言模型管道构建了一个动态的、职位级别的生成式AI暴露度度量。该管道识别每个职位发布中描述的任务,并分类生成式AI能够执行或辅助这些任务的程度。然后,我们将总暴露度的变化分解为两个边际:跨职位需求重新分配和职位内任务重新设计。我们记录了三个主要发现。首先,生成式AI暴露度是动态而非固定的,随时间显著变化。其次,劳动力需求通过两个边际进行调整。招聘重新分配解释了总暴露度下降的最大份额,平均占52%,而职位内重新设计变得越来越重要,占39.5%。补充的Oaxaca-Blinder分解显示,职业构成的变化解释了可归因于可观察职位特征的暴露度变化的约90%。第三,调整在职业阶梯上有所不同。高级职位调整更早,主要通过重新分配,而初级职位则通过重新分配、重新设计及其相互作用的更广泛组合进行调整。这些发现表明,劳动力市场对生成式AI的调整是一个组织重构的过程,在此过程中,企业重塑了招聘需求和工作的任务架构。

英文摘要

Generative artificial intelligence (AI) is expected to transform work, but less is known about how firms reorganize labor demand as the technology diffuses. Existing research has largely focused on which occupations are exposed to AI or whether exposed jobs decline. We extend this debate by examining whether firms adjust by changing where they hire, what jobs contain, or both. Using a nationwide dataset of job postings in the United States, covering all sectors of the economy, we construct a dynamic, posting-level measure of generative AI exposure with a two-stage large language model pipeline. The pipeline identifies the tasks described in each posting and classifies the extent to which generative AI can perform or assist them. We then decompose changes in aggregate exposure into two margins: reallocation of demand across jobs and redesign of tasks within jobs. We document three main findings. First, generative AI exposure is dynamic rather than fixed, changing substantially over time. Second, labor demand adjusts through both margins. Hiring reallocation explains the largest share of the aggregate decline in exposure, accounting for 52% on average, while within-job redesign becomes increasingly important, accounting for 39.5%. A complementary Oaxaca-Blinder decomposition shows that shifts in occupational composition account for about 90% of the exposure change attributable to observable job characteristics. Third, adjustment differs across the job ladder. Senior jobs adjust earlier and mainly through reallocation, whereas junior jobs adjust through a broader mix of reallocation, redesign, and their interaction. These findings suggest that labor-market adjustment to generative AI is a process of organizational reconfiguration, in which firms reshape both hiring demand and the task architecture of work.

2605.23053 2026-05-25 econ.EM

A Comparative Multi-Hazard Risk Assessment of the US High-Voltage Transmission Network

美国高压输电网络的综合多灾害风险评估

D. Bor, E. J. Oughton, R. S. Weigel, R. Yang, T. Clower, A. Newman, A. R. Valle

AI总结 本研究针对美国高压输电网络面临的多种自然灾害风险,构建了一个综合评估框架,将灾害特征、脆弱性建模与宏观经济影响传播相结合,填补了以往单一灾害评估无法比较整体风险的空白。研究基于全国灾害数据和详尽的电网数据,评估了九类主要灾害及复合冰灾与强风情景对电网的破坏概率、经济损失及影响人口,发现强风和冰灾复合情景造成的下游经济损失最大。该框架为提升电网韧性提供了统一的评估基准。

详情
Comments
38 pages, 8 figures and 2 tables
AI中文摘要

现代经济严重依赖高压电力传输网络。然而,这种基础设施经常受到从地震、洪水到龙卷风和地磁风暴等自然灾害的干扰。历史上的风险评估通常孤立地处理每种灾害,缺乏比较整个灾害组合中经济影响的共同基础。本研究通过开发一个集成的框架来填补这一空白,该框架将灾害特征描述、脆弱性建模和宏观经济影响传播联系起来。该框架一致地应用于九种主要灾害和一种复合冻雨和阵风灾害。利用国家灾害数据集和包含超过13,000条线路段和10,000个变电站的美国高压输电网络,我们推导出失效概率、预期损害、受影响人口和下游经济产出损失。在单个灾害中,热带气旋风产生的预期日损害最大,为1.37亿美元/天,其次是闪电8700万美元/天、地震4700万美元/天、洪水4600万美元/天、龙卷风4200万美元/天和滑坡3400万美元/天。下游经济产出损失最大的是龙卷风,为49.3亿美元/天,其次是洪水35.9亿美元/天和地震30.2亿美元/天。250年一遇的地磁风暴产生20.7亿美元/天,使空间天气处于主要陆地灾害的范围内。复合冻雨和阵风情景产生了最大的压力测试中断,影响2.374亿人,并导致模型预测的下游产出损失为851.6亿美元/天。这些结果应被解释为一阶边界估计,其中复合情景代表上限压力测试。总体而言,该框架为优先投资输电网络韧性建立了一致的基线。

英文摘要

Modern economies depend critically on high-voltage power transmission networks. Yet this infrastructure is routinely disrupted by natural hazards ranging from earthquakes and floods to tornadoes and geomagnetic storms. Risk assessments have historically addressed hazards in isolation, leaving no common basis for comparing economic impacts across the full hazard portfolio. This study addresses this gap by developing an integrated framework linking hazard characterization, fragility modeling, and macroeconomic impact propagation. The framework is applied consistently across nine primary hazards and one compound freezing rain and wind gust hazard. Using national hazard datasets and a US high-voltage transmission network of over 13,000 line segments and 10,000 substations, we derive failure probabilities, expected damage, affected population, and downstream economic output losses. Among individual hazards, tropical cyclone wind produces the largest expected daily damage at $137 M/day, followed by lightning at $87 M/day, earthquake at $47 M/day, flood at $46 M/day, tornado at $42 M/day, and landslide at $34 M/day. Downstream economic output losses are largest for tornado at $4.93 B/day, followed by flood at $3.59 B/day and earthquake at $3.02 B/day. A 250-year geomagnetic storm produces $2.07 B/day, placing space weather within the range of major terrestrial hazards. The compound freezing rain and wind gust scenario produces the largest stress-test disruption, affecting 237.4 M people and yielding a modeled downstream output loss of $85.16 B/day. These results should be interpreted as first-order bounding estimates, with the compound scenario representing an upper-bound stress test. Overall, the framework establishes a consistent baseline for prioritizing investments in transmission network resilience.

2605.22994 2026-05-25 econ.EM q-fin.GN

Dynamic Evolution of Corporate Emissions Determinants

企业排放决定因素的动态演化

George Kapetanios, Steven Ongena, Alexia Ventouri, Huiyan Xiao

AI总结 本文研究了企业在适应环境法规、经济条件和组织约束过程中,其工业排放决定因素如何随时间演变。通过分析1992年至2023年间美国204家工业设施的面板数据,将设施排放数据与企业财务特征、管理层属性、本地劳动力市场状况及宏观经济指标相联系,采用时间变化的均值组估计方法,允许平均关系随时间平滑变化并容纳设施间的持续异质性。研究发现,多个变量在不同阶段与排放增长表现出间歇性关联,揭示了排放决定因素具有明显的阶段性动态特征,企业特征和宏观条件在不同时期占据主导地位,突显了企业对环境政策的响应具有时间依赖性并受其适应能力的影响。

详情
AI中文摘要

本文考察了企业层面的工业排放决定因素如何随时间演变,因为企业适应环境规制、经济条件和组织约束。利用1992年至2023年观察到的204个美国工业设施的面板数据,我们将设施层面的排放(来自有毒物质释放清单)与企业财务特征、管理属性、当地劳动力市场状况以及总体宏观经济指标联系起来。我们采用时变均值组估计器,允许平均关系随时间平滑变化,同时容纳设施间的持续异质性。我们发现几个协变量与排放增长呈现阶段性关联。结果揭示了排放决定因素中显著的阶段式动态,企业层面特征和总体条件在不同时期占主导地位。从创新政策的角度来看,研究结果强调企业对环境规制的反应具有时间依赖性,并受其适应能力的塑造。

英文摘要

This paper examines how firm-level determinants of industrial emissions evolve over time as firms adapt to environmental regulation, economic conditions, and organisational constraints. Using a panel of 204 U.S. industrial facilities observed from 1992 to 2023, we link facility-level emissions from the Toxics Release Inventory to firm financial characteristics, managerial attributes, local labour-market conditions, and aggregate macroeconomic indicators. We employ a time-varying mean-group estimator that allows average relationships to change smoothly over time while accommodating persistent heterogeneity across facilities. We find several covariates display episodic associations with emissions growth. The results reveal pronounced stage-like dynamics in emissions determinants, with firm-level characteristics and aggregate conditions dominating in different periods. From an innovation-policy perspective, the findings highlight that firms' responses to environmental regulation are time-dependent and shaped by their adaptive capacity.

2602.11442 2026-05-25 econ.GN q-fin.EC

Ecosystem service demand relationship and trade-off patterns in urban parks across China

中国城市公园生态系统服务需求关系与权衡模式

Shuyao Wu, Delong Li, Ximan Sun, Kai-di Liu, Wentao Zhang, Binbin V. Li, Shuangcheng Li, Lumeng Liu, Fangjin Xu, Jinwei Dong, Laibao Liu, Weili Duan, Zhonghao Zhang

AI总结 本研究旨在探讨中国城市公园生态系统服务需求之间的关系及权衡模式,采用全国范围内的20,075份调查数据和点分配实验,首次直接量化了九类城市公园生态系统服务需求之间的关联与权衡。研究发现,中国城市居民对空气净化和休闲服务有较强偏好,形成了三种典型的需求组合,并揭示了社会经济与环境因素对服务需求权衡强度的重要影响。该研究为城市公园的差异化设计提供了科学依据,以满足多样化生态服务需求,提升城市生活质量。

详情
AI中文摘要

理解公众对城市生态系统服务(ES)的需求对于有效的绿地管理至关重要,然而这些多样化需求之间错综复杂的关系和潜在权衡仍知之甚少。以往研究结果不一致,通常受限于小样本或依赖间接代理变量。在此,我们提供了首个全国范围内对九种城市公园ES需求之间关系的直接评估,使用包含中国20,075份回复的调查数据集和直接量化服务需求间权衡模式的点分配实验。我们发现中国城市居民对空气净化和休闲服务有特别强烈的偏好,以牺牲其他服务为代价。这些偏好进一步反映在三种不同的需求束中:空气净化主导型、休闲主导型和平衡型需求,每种需求束描绘了一组具有不同代表性特征的典型人群。社会经济和环境因素,如年龄、环境兴趣和年平均降水量,显著影响服务需求间的权衡强度。我们的研究开创了对生态系统服务需求之间关系的直接定量分析,结果强调需要定制化的城市公园设计,以满足多样化的服务需求,从而可持续地提升中国及其他地区城市生活质量。

英文摘要

Understanding public demand for urban ecosystem services (ES) is crucial for effective green space management, yet the intricate relationships and potential trade-offs among these diverse demands remain poorly understood. Previous studies have yielded inconsistent findings, often limited by small samples or reliance on indirect proxies. Here, we provide the first national-scale, direct assessment of the relationship among demands for nine urban park ES using a survey dataset comprising 20,075 responses across China and a point-allotment experiment that directly quantifies the trade-off patterns among service demands. We found particularly strong preferences among urban residents in China for air purification and recreation services, at the expense of other services. These preferences were further reflected in three distinct demand bundles: air purification-dominated, recreation-dominated, and balanced demands, each delineating a typical group of people with distinct representative characteristics. Socio-economic and environmental factors, such as age, environmental interest, and mean annual precipitation, significantly influence the trade-off intensity among service demands. Our study pioneers the direct, quantitative analysis of relationships among ecosystem service demands, and the results underscore the need for tailored urban park designs that address diverse service demands to sustainably enhance the quality of city life in China and beyond.

2512.21429 2026-05-25 econ.EM

US labor market conditions and migration: a reassessment of Bahar (2025)

美国劳动力市场状况与移民:对Bahar(2025)的重新评估

Francisco Rodriguez, Giancarlo Bravo

AI总结 本文重新评估了Bahar(2025)关于美国就业空缺与西南边境越境人数之间存在长期协整关系的结论,指出其研究基于对一阶差分数据误用Engle-Granger检验,导致结论不可靠。当正确应用该检验于原始数据时,协整关系的证据消失,从而否定了该文对短期和长期弹性进行估计的方法。因此,Bahar的研究未能有效揭示美国劳动力市场状况与移民之间的关系。

详情
AI中文摘要

Bahar(2025)认为美国职位空缺与西南边境越境人数之间存在长期协整关系。我们表明,这一结论基于对一阶差分错误应用的Engle-Granger检验。一旦正确地对水平值应用Engle-Granger检验,协整关系的证据消失,从而使得该论文估计短期和长期弹性的方法无效。因此,Bahar的方法无法揭示美国劳动力市场状况与移民之间的关系。

英文摘要

Bahar (2025) argues that there is a long-term cointegrating relationship between US job vacancies and southwest border crossings. We show that this conclusion is based on a misspecified Engle-Granger test applied to first differences. Once the Engle-Granger test is correctly applied to levels, evidence for a cointegrating relationship vanishes, invalidating the paper's approach to estimating short- and long-run elasticities. Bahar's approach is therefore uninformative about the relationship between US labor market conditions and migration.

2512.15244 2026-05-25 stat.ME econ.EM

Non-parametric Causal Inference in Dynamic Thresholding Designs

动态阈值设计中的非参数因果推断

Aditya Ghosh, Stefan Wager

AI总结 本文研究了在动态阈值设定下进行因果推断的问题,其中处理分配基于随时间变化的状态变量阈值。不同于静态设定,动态环境中过去的处理可能影响当前状态和未来的处理,使得传统回归不连续方法不再适用。作者提出了一种新的局部线性回归估计方法,能够一致估计动态阈值设计下的边际政策效应,并在连续血糖监测的仿真实验中验证了该方法的有效性。

详情
AI中文摘要

我们考虑动态设置中的因果推断,其中治疗通过随时间变化的状态变量阈值分配。关于回归间断方法的大量文献基于这样一个事实:在静态设置中,通过阈值跨越的治疗分配产生准实验设计,从而实现实用的因果推断。但动态设置涉及静态设置中不存在的挑战,例如,过去的治疗可能影响当前状态,从而影响未来的治疗,因此现有的回归间断方法不适用。在这里,我们证明动态阈值设计识别出一个边际政策效应,该效应在静态设置中嵌套了经典的回归间断参数;并提出了一个定制的局部线性回归估计量,该估计量对此边际政策效应具有一致性。我们使用一个实验来演示我们的方法,该实验使用FDA批准的模拟器生成的数据,模拟了连续血糖监测阈值的真实世界优化。

英文摘要

We consider causal inference in dynamic settings where treatment is assigned by thresholding a state variable that can change over time. There is a large literature on regression-discontinuity methods building on the fact that, in the static setting, treatment assignment via threshold crossing induces a quasi-experimental design that enables pragmatic causal inference. But dynamic settings involve challenges not present in the static setting, e.g., past treatments may affect current state and thus future treatments, and so existing regression-discontinuity methods do not apply. Here, we show that dynamic thresholding designs identify a marginal policy effect that nests the classical regression-discontinuity parameter in the static setting; and propose a tailored local linear regression estimator that is consistent for this marginal policy effect. We demonstrate our approach using an experiment that emulates real-world optimization of thresholds for continuous glucose monitoring using data generated from an FDA-approved simulator.

2506.12337 2026-05-25 econ.TH

Artificial Intelligence in Team Dynamics: Who Gets Replaced and Why?

团队动态中的人工智能:谁被取代以及为什么?

Xienan Cheng, Mustafa Dogan, Pinar Yildirim

AI总结 本文研究了组织中引入人工智能(AI)对团队动态的影响,探讨了雇主如何最优地部署有限的AI资源以替代团队成员、哪些岗位更容易被AI取代、是否应充分利用所有AI资源以及AI部署对工资水平和团队内收入不平等的影响。研究构建了一个序列团队生产模型,发现最优策略是随机替代工人而非固定某个岗位,且通常不会替代中间岗位,因为其对信息传递至关重要;此外,雇主可能有意保留部分AI资源,并且AI的引入有助于提高平均工资并缩小团队内部的收入差距。

详情
AI中文摘要

本研究调查了组织中采用人工智能(AI)的影响。我们提出:第一,委托人应如何最优地分配有限的AI资源以取代团队中的工人?第二,在序贯工作流程中,哪些工人面临最高的AI替代风险?第三,委托人是否总是倾向于充分利用所有可用的AI资源,还是保留一些闲置的AI容量有好处?第四,最优AI部署对工资水平和团队内部工资不平等有何影响?我们开发了一个序贯团队生产模型,其中委托人可以使用同伴监督——每个工人观察其前任努力程度的信号——来约束团队成员。委托人可以用AI代理取代一些工人,AI代理的行为不受道德风险影响。我们的分析得出四个关键结果。第一,最优AI策略是随机取代工人,而不是固定于某个职位。第二,委托人取代工作流程开始和结束处的工人,但不取代中间工人,因为该工人对于维持同伴监督所获得的信息流至关重要。第三,委托人可能最优地未充分利用可用的AI容量。第四,最优AI采用提高了平均工资并减少了团队内部的工资不平等。

英文摘要

This study investigates the effects of artificial intelligence (AI) adoption in organizations. We ask: First, how should a principal optimally deploy limited AI resources to replace workers in a team? Second, in a sequential workflow, which workers face the highest risk of AI replacement? Third, would the principal always prefer to fully utilize all available AI resources, or are there any benefits to keeping some slack AI capacity? Fourth, what are the effects of optimal AI deployment on the wage level and intra-team wage inequality? We develop a sequential team production model in which a principal can use peer monitoring--where each worker observes a signal of their predecessor's effort--to discipline team members. The principal may replace some workers with AI agents, whose actions are not subject to moral hazard. Our analysis yields four key results. First, the optimal AI strategy stochastically replaces workers rather than fixating on a single position. Second, the principal replaces workers at the beginning and at the end of the workflow, but does not replace the middle worker, since this worker is crucial for sustaining the flow of information obtained by peer monitoring. Third, the principal may optimally underutilize available AI capacity. Fourth, the optimal AI adoption increases average wages and reduces intra-team wage inequality.

2407.14635 2026-05-25 econ.EM

Predicting the Distribution of Treatment Effects: A Covariate-Adjustment Approach

预测处理效应的分布:一种协变量调整方法

Bruno Fava

AI总结 该研究关注于评估政策或干预措施对个体影响的分布,而不仅仅是平均效应。作者提出了一种基于协变量调整的预测反事实方法,用于推断处理效应分布的特定点。该方法在有限样本下通过样本分割实现有效推断,在渐近条件下通过交叉拟合保证有效性。研究重新分析了五个关于小额信贷的随机对照试验,发现尽管平均效应不显著,但处理效应在个体层面上存在明显差异,部分人受益,部分人受损。

详情
AI中文摘要

影响评估的重要问题不仅需要了解平均效应,还需要了解处理效应的分布。无法观察个体反事实使得回答这些实证问题具有挑战性。我提出了一种通过协变量调整预测反事实来推断处理效应分布点的方法。我在较弱的条件下,使用样本分割提供有限样本有效推断,并使用交叉拟合提供渐近有效推断。重新审视五项关于小额信贷的随机对照试验,这些试验报告了零平均效应,我发现重要的分布影响,一些个体因信贷增加而受益,另一些则受到损害。

英文摘要

Important questions for impact evaluation require knowledge not only of average effects, but of the distribution of treatment effects. The inability to observe individual counterfactuals makes answering these empirical questions challenging. I propose an inference approach for points of the distribution of treatment effects that uses predicted counterfactuals through covariate adjustment. I provide finite-sample valid inference using sample-splitting and asymptotically valid inference using cross-fitting under arguably weak conditions. Revisiting five randomized controlled trials on microcredit that reported null average effects, I find important distributional impacts, with some individuals helped and others harmed by the increased credit access.

2405.18089 2026-05-25 econ.EM

Semi-nonparametric models of multidimensional matching: an optimal transport approach

多维匹配的半非参数模型:一种最优传输方法

Dongwoo Kim, Young Jun Lee

AI总结 本文研究多维匹配模型中的半非参数估计问题,以工人与职位的匹配为例,提出了一种基于最优运输理论的灵活且可操作的筛估计方法。该方法突破了传统参数模型对特征变量联合正态分布的限制,实现了对生产技术、均衡工资和匹配函数的识别,并提出了高效、一致且渐近正态的估计方法。实证分析表明,美国经济在1990至2010年间经历了比以往研究认为更大的有利于认知能力的技术进步,且新模型对工资不平等演变的拟合效果显著提升。

详情
AI中文摘要

本文针对可转移效用的半非参数多维匹配模型(聚焦于工人-工作匹配),开发了一套经验上易于处理且灵活的筛子估计量。我们推广了Bojilov和Galichon(2016)以及Lindenlaub(2017)采用的参数二次高斯框架,该框架依赖于观测特征的联合正态性。我们允许特征分布不受限制,并利用最优传输理论证明了生产技术、均衡工资和匹配函数的可识别性。在识别的基础上,我们提出了有效、一致且渐近正态的筛子估计量。我们重新审视了Lindenlaub的实证应用,并表明,在1990年至2010年间,美国经济经历了比原始发现所暗示的更大的偏向认知能力的技术进步。此外,我们的灵活模型规范为工资不平等演变模式提供了显著更好的拟合。

英文摘要

This paper develops a set of empirically tractable and flexible sieve estimators for semi-nonparametric multidimensional matching models with transferable utility, focusing on worker-job matching. We generalize the parametric quadratic-Gaussian framework employed by Bojilov and Galichon (2016) and Lindenlaub (2017), which relies on joint normality of observed characteristics. We allow unrestricted distributions of characteristics and show identification of the production technology and the equilibrium wage and matching functions using optimal transport theory. Given identification, we propose efficient, consistent, and asymptotically normal sieve estimators. We revisit Lindenlaub's empirical application and show that, between 1990 and 2010, the U.S. economy experienced much larger technological progress favoring cognitive abilities than the original findings suggest. Furthermore, our flexible model specifications provide a significantly better fit for patterns in the evolution of wage inequality.

2605.22877 2026-05-25 econ.EM

Modeling Commuter Mobility in Stockholm: A Spatial Panel Approach Using Mobile Phone Data

建模斯德哥尔摩通勤流动性:使用手机数据的空间面板方法

Marina Toger, Umut Türk, John Östh, Manfred M. Fischer

AI总结 本文利用移动电话数据构建异方差空间杜宾面板模型,研究斯德哥尔摩大区通勤流动性的社会人口和经济决定因素。通过贝叶斯马尔可夫链蒙特卡洛方法估计模型,分析了2018至2023年间675个区域的平衡面板数据,揭示了教育水平和汽车拥有率是影响通勤流动性的主要因素,而年龄结构影响相对较小。研究强调,传统非空间方法忽视空间相互依赖性可能导致估计偏差,而本文方法能够准确识别直接和间接空间效应,为评估通勤政策的社会效益提供了更可靠的依据。

详情
Comments
18 pages, 1 figure, 2 tables, 1 appendix
AI中文摘要

本文使用通过贝叶斯马尔可夫链蒙特卡洛方法估计的异方差空间杜宾面板数据模型,研究大斯德哥尔摩地区区域通勤流动性的社会人口和社会经济决定因素。利用来自MIND数据库的手机衍生起点-终点流量,分析利用了2018-2023年期间675个区域平衡面板的异常精细的空间和时间粒度。通过贝叶斯模型比较选择的k近邻空间权重矩阵(k=18)捕捉了区域间连通性的拓扑结构。通过对因变量和自变量进行空间滞后建模,该框架能够从偏导数矩阵的标量汇总度量中明确恢复直接(本区域)和间接(溢出)效应——为社会人口和社会经济条件如何通过空间传播提供稳健的后验推断。这种方法解决了传统非空间方法的一个关键局限性,即忽略空间相互依赖可能导致有偏估计。实证结果证实,空间溢出效应主导直接效应,教育程度和汽车拥有量成为通勤流动性的主要决定因素,而年龄构成的作用相对较小。这些发现强调,孤立评估直接效应会系统性地低估增强流动性的区域政策的更广泛社会回报。

英文摘要

This paper examines the sociodemographic and socioeconomic determinants of regional commuter mobility in the Greater Stockholm Area using a heteroscedastic spatial Durbin panel data model estimated via Bayesian Markov Chain Monte Carlo methods. Drawing on mobile phone-derived origin-destination flows from the MIND database, the analysis exploits unusually fine spatial and temporal granularity across a balanced panel of 675 regions over the period 2018-2023. A k-nearest neighbor spatial weight matrix (k = 18), selected via Bayesian model comparison, captures the topological structure of interregional connectivity. By modeling spatial lags in both the dependent and independent variables, the framework enables explicit recovery of direct (own-region) and indirect (spillover) effects from scalar summary measures of the matrix of partial derivatives -- providing robust posterior inference on how sociodemographic and socioeconomic conditions propagate through space. This approach addresses a key limitation of conventional non-spatial methods, which risk producing biased estimates by ignoring spatial interdependence. Empirical results confirm that spatial spillovers predominate over direct effects, with educational attainment and car ownership emerging as the principal determinants of commuter mobility, while age composition plays a comparatively modest role. These findings underscore that evaluating direct effects in isolation systematically underestimates the broader societal returns to mobility-enhancing regional policies.

2605.22865 2026-05-25 cs.GT cs.ET cs.MA econ.EM econ.TH

Multi-Dimensional Matching in Market Design

市场设计中的多维匹配

Irene Aldridge

AI总结 本文研究了多维匹配市场中的机制设计问题,提出了一种计算高效的匹配方法,该方法允许代理基于物品特征而非完整效用评估进行偏好报告。通过奇异值分解(SVD)识别特征空间中的主要变化方向,并沿该方向进行匹配,将复杂的多维问题简化为近似的一维问题,时间复杂度仅为 $O(N \log N)$。该机制在数据具有低有效维度时,能够近似最大化纳什社会福利,满足分布性诚实性,并具有对称性,同时建立了纳什社会福利与几何分布鲁棒优化之间的新联系,实验表明其效率远高于直接优化方法,适用于学校选择、劳动力市场和课程分配等实际场景。

详情
Comments
27 pages
AI中文摘要

本文提出了一种计算高效的多维匹配市场机制,其中代理报告对物品特征的偏好,而非完整的效用评估。我们使用奇异值分解(SVD)识别特征空间中的主要变化方向,并沿该维度将代理匹配到物品,从而将复杂的多维问题简化为可在 $O(N \log N)$ 时间内求解的有效单维问题。我们证明,当数据呈现低有效维度时,我们的机制近似最大化纳什社会福利,满足分布真实性,并实现对称性。我们建立了纳什社会福利与几何分布鲁棒优化之间的新联系,提供了鲁棒性保证。数值实验表明,我们的方法实现了99%的最优福利,同时运行速度比直接优化快三个数量级。该框架自然适用于学校选择、劳动力市场和课程分配,其中基于特征的偏好表达减轻了代理的认知负担。

英文摘要

This paper proposes a computationally efficient mechanism for multi-dimensional matching markets where agents report preferences over object features rather than complete utility assessments. We use Singular Value Decomposition (SVD) to identify the principal direction of variation in feature space and match agents to objects along this dimension, reducing a complex multi-dimensional problem to an effectively one-dimensional problem solvable in $O(N \log N)$ time. We show that when data exhibit low effective dimensionality, our mechanism approximately maximizes Nash Social Welfare, satisfies distributional truthfulness, and achieves symmetry. We establish a novel connection between Nash Social Welfare and Geometric Distributionally Robust Optimization, providing robustness guaranties. Numerical experiments demonstrate that our approach achieves 99\% optimal welfare while running three orders of magnitude faster than direct optimization. The framework applies naturally to school choice, labor markets, and course allocation, where feature-based elicitation reduces the cognitive burden on agents.

2605.22847 2026-05-25 cs.GT econ.TH math.CO

The Geometry of Cooperative Game Solutions: Stratified Egalitarian Shapley Values

合作博弈解的空间几何:分层平等Shapley值

Frank M. V. Feys

AI总结 本文研究了有限玩家合作博弈中线性价值映射的几何结构,引入了基于Harsanyi分红分解的内积,揭示了该内积的内在性质,并在此基础上对高效、对称、线性价值映射子空间进行了结构分析。研究提出了分层平等夏普利值的概念,将每个高效对称线性价值唯一分解为不同规模联盟对应的分层参数,并通过正交投影得到最优参数及拟合优度指标,为博弈论中的价值分配提供了新的统计解释与分类方法。

详情
Comments
34 pages
AI中文摘要

有限玩家合作博弈G^N上的线性值映射空间L是有限维的,并具有由G^N的Harsanyi红利分解诱导的典范内积。我们证明这个内积是内在的:对于Harsanyi内积,从G^N的任何标准正交基得到相同的值。在这个几何中,有效、对称、线性值映射的子空间L^{ESL}具有一个简洁的结构定理。由联盟大小诱导的L的正交分层产生一个典范线性同构L^{ESL} = R^{n-1},在此同构下,每个有效对称线性值映射唯一地分解为n-1个分层epsilon,每个联盟大小一个。Joosten (1996)的经典平等Shapley族正是这个R^{n-1}的对角切片。L^{ESL}中任何Psi到该对角线的正交投影产生一个最优参数eps*(Psi),等于在联盟大小上的显式概率分布{w_a}下分层epsilon的加权平均,拟合优度R^2(Psi)等于1减去这些epsilon的相对加权方差。该框架是决定系数的字面回归统计类比。当n=4时,它产生了Shapley值的标准替代方案的清晰三分法:Banzhaf值几乎正交于平等Shapley轴(R^2 ~ 1%);等剩余分配值中等对齐(R^2 ~ 38%);团结值几乎完全对齐(R^2 ~ 99.6%)。渐近地,R^2(ESD) -> 1,R^2(So) -> 1,R^2(Bz) -> 1/2,最后一点反映了Banzhaf值在每个联盟大小上的效率缺陷与平等Shapley偏差之间的结构恒等式。

英文摘要

The space L of linear value maps on a finite-player cooperative game G^N is finite-dimensional, and admits a canonical inner product induced by the Harsanyi-dividend decomposition of G^N. We show that this inner product is intrinsic: the same value arises from any orthonormal basis of G^N with respect to the Harsanyi inner product. Within this geometry, the subspace L^{ESL} of efficient, symmetric, linear value maps admits a clean structure theorem. The induced orthogonal stratification of L by coalition size yields a canonical linear isomorphism L^{ESL} = R^{n-1}, under which every efficient symmetric linear value map decomposes uniquely into n-1 stratified epsilons, one per coalition size. The classical egalitarian Shapley family of Joosten (1996) is precisely the diagonal slice of this R^{n-1}. The orthogonal projection of any Psi in L^{ESL} onto this diagonal yields an optimal parameter eps*(Psi) equal to the weighted mean of the stratified epsilons under an explicit probability distribution {w_a} over coalition sizes, and the goodness-of-fit R^2(Psi) equals one minus the relative weighted variance of those epsilons. The framework is a literal regression-statistics analogue of the coefficient of determination. At n=4 it produces a clean three-way classification of the standard alternatives to the Shapley value: the Banzhaf value is nearly orthogonal to the egalitarian Shapley axis (R^2 ~ 1%); the equal-surplus-division value is moderately aligned (R^2 ~ 38%); the solidarity value is almost entirely aligned (R^2 ~ 99.6%). Asymptotically R^2(ESD) -> 1, R^2(So) -> 1, and R^2(Bz) -> 1/2, the last reflecting a structural identity between the efficiency defect and the egalitarian-Shapley deviation of the Banzhaf value at every coalition size.

2605.22846 2026-05-25 cs.GT econ.TH math.CO

An Axiomatic Theory of Tie-Breaking: Impossibility, Characterization, and Decomposition

打破平局的公理化理论:不可能性、刻画与分解

Frank M. V. Feys

AI总结 本文建立了一个抽象的公理化理论来研究平局处理规则。研究证明了在存在内在对称情况时,任何产生严格线性排序的平局处理规则都无法满足匿名性;当允许输出为玩家的划分而非严格排名时,唯一满足两个自然公理的规则是输入对称群稳定子的轨道划分;此外,任何合理的严格平局处理规则都可以唯一分解为轨道划分与任意补充的组合。该框架统一描述了多种实际场景中的平局处理机制,如棋类比赛、体育联赛、投票和网络中心性排序等。

详情
Comments
31 pages, 1 figure. Submitted to Social Choice and Welfare
AI中文摘要

我们发展了一个抽象的打破平局的公理化理论。打破平局的输入包括一个有限玩家集合N,一个表示待细化排名的N上的弱序,以及一个来自对称群Sym(N)作用集合的辅助信息项。在这个最小框架内,我们证明了三个定理。首先,如果输入空间包含至少一个内在对称的情形(这在几乎所有实际应用中均满足),则不存在能够产生严格线性序且满足匿名性的打破平局规则。其次,当我们允许规则输出N的一个划分(而非严格排序)时,存在唯一满足两个自然公理的规则:它是输入联合稳定子群在N上的轨道划分。第三,每个合理的严格打破平局规则可唯一分解为规范轨道划分后接任意补全。该分解精确刻画了非正式观察:真实的打破平局系统在被迫任意之前是诚实的。该框架足够广泛,能够统一涵盖国际象棋比赛打破平局、体育联赛规则、投票打破平局、合作博弈中对称玩家的打破平局以及基于网络中心性度量的排名。

英文摘要

We develop an abstract axiomatic theory of tie-breaking. A tie-breaking input consists of a finite set N of players, a weak order on N representing the standings to be refined, and an auxiliary information item drawn from a set on which the symmetric group Sym(N) acts. Within this minimal framework we prove three theorems. First, no tie-breaking rule producing a strict linear order can be anonymous, provided the input space contains even one intrinsically symmetric situation, a condition met in essentially every realistic application. Second, when we allow the rule to output a partition of N (rather than a strict ranking), there is a unique rule satisfying two natural axioms: it is the partition of N into orbits of the joint stabilizer of the input. Third, every reasonable strict tie-breaking rule decomposes uniquely as the canonical orbit partition followed by an arbitrary completion. The decomposition makes precise the informal observation that real tie-breaking systems are honest until forced to be arbitrary. The framework is broad enough to capture chess tournament tie-breakers, sports league regulations, voting tie-breakers, tie-breaking among symmetric players in cooperative games, and ranking by network centrality measures, all within a single uniform formalism.

2605.22841 2026-05-25 physics.soc-ph cs.AI cs.CL cs.GT cs.MA econ.GN q-fin.EC

Strategic Coercion Within Alliances: The Greenland Sovereignty Game as an AI Stress Test

联盟内的战略胁迫:格陵兰主权博弈作为人工智能压力测试

Rommin Adl, Peyton Williams

AI总结 本文以2019-2026年美国试图从丹麦手中获得格陵兰主权的事件为案例,研究联盟内部强权对弱权的策略性施压问题,构建了多个博弈模型并通过八种前沿大语言模型进行多智能体模拟实验。研究揭示了在战略控制与联盟规范执行等集体行动难题下,不同模型在权力权重、行为策略和冲突升级等方面表现出显著差异,尤其指出中国来源模型在扮演美国角色时具有不同于西方模型的特征,并发现仅有少数模型能够实现和平的美国获取格陵兰的情景。

详情
Comments
78 pages, 17 figures, 18 tables. Multi-agent LLM simulation recovering structural utility parameters across 8 frontier models in the Greenland sovereignty crisis. v3: typo pass, fixes phantom action names (REQUEST_MULTILATERAL, INDEPENDENT) and a Blunden date mismatch. v2 added Section V safety findings (legitimacy-laundered escalation, signal decoupling) and Appendix H
AI中文摘要

当最强大的联盟成员在领土和战略控制问题上向较弱的成员施压时会发生什么?我们将格陵兰主权危机作为大语言模型地缘政治的压力测试,聚焦于2019-2026年美国推动从丹麦王国获取格陵兰的努力。该危机嵌套了两个集体行动问题:北极战略控制以及北约能否对主导成员执行联盟规范。我们开发了三个博弈(非对称胁迫;具有临界点转折的北约保证博弈;具有社会偏好的三元扩展式博弈),并通过多智能体模拟进行测试,其中八个前沿大语言模型扮演六个地缘政治角色(美国、丹麦、格陵兰、北约、俄罗斯、加拿大),共完成3604场博弈和108120个行动观测。利用逆向博弈论,我们恢复了每个模型的结构性效用参数(alpha、beta、gamma、delta、eta),分别对应物质自利、互惠、不平等厌恶、规范尊重和承诺一致性。三个发现突出:第一,所有八个模型在胁迫框架下变得更加升级(四步升级从10.7%上升至28.6%);第二,中国来源模型在扮演美国角色时显示出与西方来源模型系统性不同的权力权重分布;第三,和平的美国获取仅在1.9%的干净博弈中出现,且8个前沿模型中只有3个实现了这一点,最突出的是DeepSeek V3.2,它通过宗主国执行了稳定的五轮策略。强调强制法和自决的提示在仅英语的确认样本中将升级降低回基线附近;多语言对比作为探索性敏感性检验报告。我们将此定位为大语言模型地缘政治行为的结构性基准,补充行动频率基准。

英文摘要

What happens when the strongest alliance member pressures a weaker member over territory and strategic control? We examine the Greenland sovereignty crisis as a stress test for LLM geopolitics, centered on the 2019-2026 U.S. push to acquire Greenland from the Kingdom of Denmark. The crisis nests two collective-action problems: Arctic strategic control and whether NATO can enforce alliance norms against the dominant member. We develop three games (asymmetric coercion; a NATO assurance game with a critical-mass tipping point; a triadic extensive-form game with social preferences) and test them with a multi-agent simulation in which eight frontier LLMs play six geopolitical roles (United States, Denmark, Greenland, NATO, Russia, Canada) across 3,604 completed games and 108,120 action observations. Using inverse game theory, we recover each model's structural utility parameters (alpha, beta, gamma, delta, eta) for material self-interest, reciprocity, inequality aversion, norm respect, and commitment consistency. Three findings stand out. First, all eight models become more escalatory under coercion framing (four-action escalation rises from 10.7% to 28.6%). Second, Chinese-origin models show systematically different power-weight profiles from Western-origin models when playing the U.S. role. Third, peaceful US acquisition emerges in only 1.9% of clean games and only 3 of 8 frontier models ever achieve it, most prominently DeepSeek V3.2, which executes a stable five-round playbook through the metropole. Prompts emphasizing jus cogens and self-determination reduce escalation back near baseline in the English-only confirmatory sample; multilingual contrasts are reported as exploratory sensitivity checks. We position this as a structural benchmark for LLM geopolitical behavior, complementing action-frequency benchmarks.

2509.05076 2026-05-25 econ.TH

Randomization and ambiguity perception

随机化与模糊感知

Yutaro Akita, Kensei Nakamura

AI总结 本文研究了厌恶模糊性的决策者在面对不确定性时的行为特征,指出这类决策者不仅不喜欢模糊事件的存在,还特别排斥模糊性的增加,这与标准模糊模型的预测不同。通过公理化分析,作者提出该类决策者会避免事前对前景进行随机化,因为这会增加相关模糊事件的数量而无法对冲不确定性。研究得出了一种行为表示,表明决策者仿佛在以一定成本优化其对模糊性的感知,并证明了该表示的唯一性,同时比较了对模糊性及其增加的态度。

详情
AI中文摘要

厌恶模糊的决策者通常不仅不喜欢模糊事件的存在,也不喜欢其增加,这与标准模糊模型预测的相反。我们公理化地研究这样的决策者。她避免对前景进行事前随机化,因为这只会增加相关模糊事件的数量,而不提供对不确定性的对冲。我们的公理导致一种表示,其中决策者表现得好像在优化其模糊感知成本。我们证明了表示的唯一性,并对模糊态度及其增加进行了比较分析。这种识别在没有考虑事前随机化的情况下是无法实现的。

英文摘要

Ambiguity-averse decision makers typically dislike not only the presence of ambiguous events but also their increase, contrary to what standard ambiguity models predict. We axiomatically study such a decision maker. She avoids ex ante randomization over prospects since it only increases the number of relevant ambiguous events without providing a hedge against uncertainty. Our axioms lead to a representation in which the decision maker behaves as if optimizing her ambiguity perception at a cost. We show the uniqueness of the representation, and conduct comparatives of attitudes toward ambiguity and its increase. This identification is not achieved without considering ex ante randomization.