arXivDaily arXiv每日学术速递 周一至周五更新
重置
2605.31443 2026-06-01 stat.ME cs.LG econ.EM math.ST stat.TH

Modeling Covariate Transition for Efficient Estimation of Longitudinal Treatment Effects in Randomized Experiments

建模协变量转移以高效估计随机实验中的纵向处理效应

Naoki Chihara, Tatsushi Oka, Yasuko Matsubara, Yasushi Sakurai, Shota Yasui

AI总结 提出一种回归调整框架,通过建模协变量转移来估计随机实验中的纵向处理效应,并实现渐近正态性和半参数有效性。

详情
Journal ref
The 43rd International Conference on Machine Learning, 2026
Comments
Accepted by ICML'26
AI中文摘要

我们提出一个回归调整框架,用于在静态制度下估计随机实验中的纵向处理效应。虽然回归调整方法通过使用预处理协变量有助于随机实验中的方差减少,但它们通常只关注平均效应,从中我们无法获得关于效应何时出现以及持续多久的有价值见解。为了解决这个问题,我们考虑随时间变化的中间结果和事后协变量,并使用转移核表示这些动态轨迹。此外,我们建立了估计量的渐近正态性和半参数效率界,从而实现更强大的统计推断。使用日本某流媒体平台的A/B测试数据进行的模拟研究和实证分析显示了我们的方法的实际优势。

英文摘要

We present a regression-adjustment framework designed for the estimation of longitudinal treatment effects in randomized experiments under static regimes. While regression-adjustment methods are useful for variance reduction in randomized experiments by using pre-treatment covariates, they usually focus only on average effects, from which we cannot obtain valuable insights into when the effects appear and how long they continue. To address this issue, we consider intermediate outcomes and evolving post-treatment covariates over time, and we represent such dynamic trajectories using transition kernels. Furthermore, we establish the asymptotic normality and the semiparametric efficiency bound for our estimator, enabling more powerful statistical inference. Simulation studies and empirical analysis using A/B test data from a streaming platform in Japan show the practical advantages of our method.

2605.31306 2026-06-01 math.OC cs.SY econ.EM eess.SY stat.ME

Posterior and Likelihood Sensitivity in Bayesian Distributionally Robust Optimization

贝叶斯分布鲁棒优化中的后验和似然敏感性

Jun-ya Gotoh, Andrew E. B. Lim, Michael Jong Kim

AI总结 本文提出最坏情况后验和似然敏感性的概念,用于量化贝叶斯模型对后验和似然扰动的鲁棒性,并证明分布鲁棒优化可实现性能与鲁棒性的近似帕累托最优权衡。

详情
AI中文摘要

我们引入了最坏情况后验和似然敏感性的概念。这些分别衡量期望成本对后验分布最坏情况扰动和贝叶斯模型似然最坏情况扰动的敏感性。每个都定义了鲁棒性的定量度量。关心样本外期望成本对其假设偏差敏感性的决策者将希望两个敏感性都较小的决策。我们推导了由偏差度量定义的不确定性集的后验和似然敏感性。当后验方差缩小到零时,后验敏感性消失,这发生在参数不确定性通过学习消除时。参数学习不能消除似然敏感性。贝叶斯优化问题的分布鲁棒公式在性能(期望成本)和鲁棒性(后验和似然敏感性)之间实现了近似帕累托最优的权衡。

英文摘要

We introduce the notion of worst-case posterior and worst-case likelihood sensitivity. These measure, respectively, the sensitivity of the expected cost to worst-case perturbations of the posterior distribution and worst-case perturbations of the likelihood of a Bayesian model. Each defines a quantitative measure of robustness. A decision maker concerned about the sensitivity of the out-of-sample expected cost to deviations from her assumptions will want a decision for which both sensitivities are small. We derive posterior and likelihood sensitivities for uncertainty sets defined in terms of deviation measures. Posterior sensitivity vanishes when the posterior variance shrinks to zero, which occurs when parameter uncertainty is eliminated from learning. Parameter learning does not eliminate likelihood sensitivity. A distributionally robust formulation of a Bayesian optimization problem makes a near-Pareto-optimal tradeoff between performance (expected cost) and robustness (posterior and likelihood sensitivity).

2605.31072 2026-06-01 econ.TH cs.GT cs.MA econ.EM

Comparing Market Mechanism Efficiencies

比较市场机制效率

Irene Aldridge

AI总结 通过博弈论框架比较连续双向拍卖(透明订单簿与不透明订单簿)和定期批量拍卖的福利效率,证明在中等到达率和有限逆向选择下,暗池优于其他机制。

详情
Comments
79 pages
AI中文摘要

我们开发了一个博弈论框架,比较三种市场机制的福利效率:具有透明订单簿的连续双向拍卖(公开交易所)、不透明订单簿(暗池)和定期批量拍卖。每种机制都被建模为一个排队系统,其中异质交易者在执行价格、等待成本和交易成本之间进行权衡。我们的主要结果表明,在中等到达率和有限逆向选择下,暗池在事前总福利上优于其他两种替代方案。可观察的订单簿产生了代价高昂的战略性时机博弈,交易者延迟或提前提交以优化其在队列中的位置,从而产生浪费的社会等待成本。不透明的订单簿通过信息设计消除了这些时机博弈。我们正式刻画了每种机制中的均衡策略,并证明了福利排序 $W^{DARK} > W^{LIT} > W^{BATCH}$。扩展部分包含了信息不对称和内生场所选择。结果展示了信息结构和服务的纪律如何共同决定战略匹配环境中的效率。

英文摘要

We develop a game-theoretic framework that compares welfare efficiency across three market mechanisms: continuous double auctions with transparent order books (lit exchanges), opaque order books (dark pools), and periodic batch auctions. Each mechanism is modeled as a queuing system where heterogeneous traders face trade-offs between the execution price, waiting costs, and transaction costs. Our main result establishes that under moderate arrival rates and bounded adverse selection, dark pools dominate both alternatives in aggregate ex-ante welfare. Observable order books create costly strategic timing games in which traders delay or rush submissions to optimize their position in the queue, generating wasteful social waiting costs. Opaque order books eliminate these timing games through information design. We formally characterize the equilibrium strategies in each mechanism and prove the welfare ranking $W^{DARK} > W^{LIT} > W^{BATCH}$. Extensions incorporate asymmetric information and endogenous venue choice. The results demonstrate how the information structure and the discipline of the service jointly determine efficiency in strategic matching environments.

2605.30916 2026-06-01 cs.LG cs.GT econ.TH

Welfare, Improvability, and Variance: A Principal-Agent Approach to Optimal Benchmark Item Aggregation

福利、可改进性与方差:最优基准测试项聚合的主-代理方法

Andreas Haupt, Justin Hartenstein, Anka Reuel, Mykel Kochenderfer, Sanmi Koyejo

AI总结 提出将基准测试建模为多任务主-代理博弈,通过福利、可改进性和方差三个维度评估项目,并应用于OLMES数据集识别帕累托劣势项目。

详情
AI中文摘要

AI基准测试存在记录完善的局限性,先前研究探讨了污染、饱和以及构造不明确等问题。聚合受到的关注要少得多:基准测试通常通过统一平均项目级分数来总结,隐含地将每个测试项目视为同等重要。我们将基准测试建模为多任务主-代理博弈,并表明基准测试的福利损失由三个项目级原始要素共同决定:与规范性福利优先级的一致性、边际可改进性和性能方差。我们将该理论转化为一个审计框架,沿这三个轴对项目进行排序,并使用WORKBank(福利)、EvoLM 4B套件(可改进性)和PolyPythias 410M面板(方差)将其应用于OLMES项目。该框架揭示了在OLMES中,在亲工人福利操作化下帕累托劣势的项目。所有代码可在 https://github.com/stair-lab/principal-agent-benchmarks 获取。

英文摘要

AI benchmarks have well-documented limitations, with prior work examining contamination, saturation, and construct underspecification. Aggregation has received far less attention: benchmarks are typically summarized by uniformly averaging item-level scores, implicitly treating every test item as equally valuable. We model benchmarking as a multitask principal-agent game and show that the welfare loss from a benchmark is determined jointly by three item-level primitives: alignment with normative welfare priorities, marginal improvability, and performance variance. We translate the theory into an audit framework that ranks items along each of these three axes, and apply it to OLMES items using WORKBank for welfare, the EvoLM 4B suite for improvability, and the PolyPythias 410M panel for variance. The framework surfaces items that are Pareto-inferior within OLMES subject to a pro-worker welfare operationalization. All code is available at https://github.com/stair-lab/principal-agent-benchmarks.

2605.30890 2026-06-01 econ.TH

A Geometric Approach to the Transformation Problem of Values

价值转化问题的几何方法

Jiyuan Lyu

AI总结 针对复杂劳动还原为简单劳动的难题,提出两步求解框架:先证明存在有界“价值可行域”,再构造线性映射方法从该区域系统推导还原系数,并基于中国2017年投入产出表验证其优于传统方法。

详情
AI中文摘要

复杂劳动还原为简单劳动是马克思劳动价值论中一个未解决的难题,也是阻碍转化问题最终解决的关键障碍。本文提出了一个两步求解框架。首先,我们证明只要宏观经济产生物质剩余,尊重劳动力再生产底线的还原系数就构成一个有界的“价值可行域”;在该区域内,两个宏观总量等式可以在合理的利润率范围内同时成立。其次,我们提出一种线性映射方法,利用名义工资的可观测结构和再生产底线约束,从价值可行域中系统性地构造隐含的还原系数。我们证明该映射是价格可行域与价值可行域之间的同胚,并且保持边界结构。基于中国2017年包含1272个部门的省际投入产出表的实证校准表明,通过映射方法得到的还原系数在匹配宏观利润份额方面显著优于同质劳动法和工资代理法。

英文摘要

The reduction of complex labour to simple labour is an unresolved difficulty in Marx's labour theory of value, and a key obstacle that has prevented the transformation problem from being settled definitively. This paper proposes a two-step solution framework. First, we prove that as long as the macroeconomy generates a physical surplus, the reduction coefficients that respect the floor of labour-power reproduction form a bounded ``value feasible region''; within this region the two macro aggregate equalities can hold simultaneously for a reasonable range of the profit rate. Second, we propose a linear mapping method that exploits the observable structure of nominal wages and the reproduction floor constraint to systematically construct the implicit reduction coefficients from the value feasible region. We show that this mapping is a homeomorphism between the price feasible region and the value feasible region, and that it preserves the boundary structure. An empirical calibration based on China's 2017 inter-provincial input--output table with 1272 sectors shows that the reduction coefficients obtained by the mapping method substantially outperform the homogeneous labour method and the wage-proxy method in matching the macro profit share.

2605.30879 2026-06-01 econ.TH

Competitive Many-to-One Matching: Sorting vs. Equality

竞争性多对一匹配:排序 vs. 平等

Anton Kolotilin, Alexander Wolitzky

AI总结 研究具有转移支付和同伴效应的多对一匹配(如工人与公司、学生与学校),分析竞争均衡的存在性、效率以及劳动力技能隔离与压缩的特征,并比较灵活定价与统一定价下的结果差异。

详情
AI中文摘要

我们研究具有转移支付和同伴效应的多对一匹配,例如工人与公司、学生与学校、居民与社区或消费者与地位商品的匹配。在灵活价格(如劳动力市场)下,竞争均衡在一般条件下存在且有效。我们刻画了劳动力按技能隔离并以正选型方式与公司匹配的条件。通常,均衡呈现劳动力隔离与压缩(混合)交替的区间。比较静态分析揭示了劳动力何时更隔离或更压缩,以及利润和工资何时更不平等或更平等。在统一价格(如学校或社区选择)下,同伴效应产生的价值归学校而非学生所有,且均衡可能过度隔离。我们的模型推广了分配模型(最优运输)和贝叶斯说服。

英文摘要

We study many-to-one matching with transfers and peer effects, such as matching workers to firms, students to schools, residents to neighborhoods, or consumers to status goods. With flexible prices (as in the labor market), competitive equilibrium exists and is efficient under general conditions. We characterize when workforces are segregated by skill and matched to firms in a positively assortative manner. In general, equilibrium features alternating intervals of workforce segregation and compression (mixing). Comparative statics characterize when workforces are more segregated or more compressed, and when profits and wages are more or less unequal. With uniform prices (as in school or neighborhood choice), the value generated by peer effects accrues to schools rather than students, and equilibrium can be excessively segregated. Our model generalizes both assignment models (optimal transport) and Bayesian persuasion.

2605.30843 2026-06-01 cs.LG econ.EM

A Lecture Note on Offline RL and IRL, Part II: Foundations of Inverse Reinforcement Learning and Dynamic Discrete Choice Models

离线强化学习与逆强化学习讲义,第二部分:逆强化学习与动态离散选择模型的基础

Enoch Hyunwook Kang

AI总结 本文证明了逆强化学习(IRL)与动态离散选择(DDC)模型的等价性,回顾了经典识别结果和计算范式,并介绍了现代机器学习方法及其识别特性。

详情
AI中文摘要

在前向强化学习问题中,奖励是固定且已知的;学习者被要求找到一个好的策略或价值函数。这里我们反过来提问:给定由专家生成的离线数据,我们能否恢复专家所优化的奖励?这就是逆强化学习问题,值得注意的是,两个社区——研究动态离散选择(DDC)的结构计量经济学家和研究熵正则化IRL的机器学习者——一直在以不同的名称研究完全相同的概率模型。我们首先证明它们的等价性。然后,我们发展Magnac和Thesmar的经典识别结果以及由此产生的经典计算范式:Rust的嵌套不动点算法、Hotz和Miller的条件选择概率方法,以及Adusumilli和Eckardt的两种时间差分方法:线性半梯度TD和近似价值迭代。每种方法都有其局限性:维度、转移核估计、致命三元组或投影不动点偏差。接着,我们回顾现代ML/IRL分支:对抗性IRL、占用匹配、IQ-Learn和离线ML-IRL,推导每种方法的实际目标,并精确说明它识别了什么和没有识别什么。最后,我们介绍Kang等人的经验风险最小化框架,该框架为离线IRL/DDC提供了基于梯度的估计器。

英文摘要

In the forward reinforcement-learning problem, the reward is fixed and known; the learner is asked to find a good policy or value function. Here we turn the question around. Given offline data generated by an expert, can we recover the reward the expert was optimizing? This is the inverse reinforcement learning problem, and remarkably, two communities, structural econometricians studying dynamic discrete choice (DDC) and machine learners studying entropy-regularized IRL, have been working on exactly the same probabilistic model under different names. We begin by proving their equivalence. We then develop the classical identification result of Magnac and Thesmar and the classical computational paradigms that grew out of it: Rust's nested fixed-point algorithm, the conditional-choice-probability approach of Hotz and Miller, and the two temporal-difference approaches of Adusumilli and Eckardt: linear semi-gradient TD and approximate value iteration. Each route has its limits: dimensionality, transition-kernel estimation, the deadly triad, or projected fixed-point bias. We then walk through the modern ML/IRL strand: adversarial IRL, occupancy matching, IQ-Learn, and offline ML-IRL, deriving each method's actual objective and stating precisely what it does and does not identify. We close with the empirical-risk-minimization framework of Kang et al., which yields a gradient-based estimator for offline IRL/DDC.

2605.30720 2026-06-01 cs.LG cs.AI econ.GN q-fin.EC stat.ML

Kalimati Vegetable Price Index Forecasting with a Momentum Corrected Online Stacking Ensemble

Kalimati蔬菜价格指数预测:基于动量校正的在线堆叠集成方法

Sahaj Raj Malla

AI总结 针对新兴经济体农产品价格高波动性问题,提出动量校正在线堆叠集成模型,通过构建逆波动率加权综合指数和64个因果特征,在90天预测期实现RMSE=1.771、MAPE=0.68%、R²=0.845的优异性能。

详情
Comments
21 pages, 8 figures, 2 tables
AI中文摘要

由于高波动性、频繁的供应中断以及强烈的文化需求影响,新兴经济体的农产品价格预测十分困难。本研究引入了Kalimati蔬菜价格指数(KVPI),这是一个新的逆波动率加权综合指数,汇总了加德满都十年(2013-2023年)的135种日度批发商品。通过创建稳定的宏观信号,KVPI减少了单个作物建模固有的噪声。我们开发了包含64个因果有效特征的丰富特征集,包括节日领先滞后效应、滚动统计量和日历变量。对涵盖统计、树基、深度学习、混合和Transformer架构的14种预测模型,在短期(7天)、中期(14天和30天)和长期(90天)预测期上进行了严格评估。树基集成方法表现出显著的鲁棒性,而经典统计模型和复杂Transformer在处理噪声数据集时表现不佳。提出的动量校正在线堆叠集成模型取得了最强性能,在90天预测期上均方根误差(RMSE)为1.771,平均绝对百分比误差(MAPE)低至0.68%,并解释了84.5%的方差(R²=0.845)。这一开源流程为尼泊尔及类似市场的政策制定者和供应链参与者提供了实用、可靠的工具,以预测价格波动并加强粮食安全。

英文摘要

Forecasting agricultural commodity prices in emerging economies is difficult due to high volatility, frequent supply disruptions, and strong cultural influences on demand. This study introduces the Kalimati Vegetable Price Index (KVPI), a new inverse-volatility weighted composite index that aggregates 135 daily wholesale commodities from Kathmandu over ten years (2013-2023). By creating a stable macro-level signal, the KVPI reduces the noise inherent in modelling individual crops. A rich set of 64 causally valid features was developed, including festival lead-lag effects, rolling statistics, and calendar variables. Fourteen forecasting models spanning statistical, tree-based, deep learning, hybrid, and transformer architectures were rigorously evaluated across short (7-day), medium (14- and 30-day), and long-term (90-day) horizons. Tree-based ensembles proved notably robust, while classical statistical models and complex transformers struggled with the noisy dataset. The proposed Momentum-Corrected Online Stacking Ensemble achieved the strongest performance, yielding a Root Mean Square Error (RMSE) of 1.771, an exceptionally low Mean Absolute Percentage Error (MAPE) of 0.68%, and explaining 84.5% of the variance (R-squared = 0.845) at the 90-day horizon. This open-source pipeline provides policymakers and supply chain actors in Nepal and similar markets with a practical, reliable tool for anticipating price movements and strengthening food security.

2605.30718 2026-06-01 econ.EM stat.ME stat.ML

Moment-Based Inference for Regression with Latent Dirichlet Covariates

基于矩的潜狄利克雷协变量回归推断

Ziyu Jiang

AI总结 针对回归前使用主题模型降维导致的推断困难,提出一种基于校正谱矩的方法,直接识别回归系数β,避免估计文档级主题份额,并通过可交换性条件估计未知总浓度α0,实现有效推断。

详情
AI中文摘要

主题模型常被用作回归前的降维工具,将估计的文档级主题份额视为观测协变量。这种插件式工作流程产生了两个推断困难:有效推断需要规则的第一阶段到第二阶段展开以传播主题估计不确定性,并且在固定文档长度下,即使已知总体主题矩阵,文档的主题混合也无法从其自身词汇中一致恢复。潜狄利克雷分配(LDA)的校正谱矩方法提供了一个起点:当总狄利克雷浓度已知时,低阶词矩可被校正以得到在潜主题基上对角的算子。我们将其扩展到下游回归。在有限LDA模型下,当响应残差与用于识别的低阶词矩正交时,响应加权词矩允许相同的校正,由此得到的监督算子直接识别回归系数β,无需估计文档级主题份额。主要障碍在于校正依赖于未知总浓度α0。我们证明,对于k≥3个主题且在一般有限探针条件下,α0通过可交换性识别:在真实值处,一族校正词矩算子可交换,而在偏离时通常不可交换。这产生了可行的估计量,并让α̂0的不确定性传播到β的推断中。该估计量在文档数量增长而文档长度固定时是渐近线性的,其标准误差来自文档级矩贡献的夹心估计。模拟显示,在插件式主题份额回归可能覆盖不足的情况下,该方法具有接近名义水平的覆盖率;对顶级经济学期刊的应用说明了潜主题效应的对比推断。

英文摘要

Topic models are often used as dimension-reduction tools before regression, with estimated document-level topic shares treated as observed covariates. This plug-in workflow creates two inferential difficulties: valid inference requires a regular first-stage-to-second-stage expansion that propagates topic-estimation uncertainty, and, at fixed document length, a document's topic mixture cannot be consistently recovered from its own words even when the population topic matrix is known. Corrected spectral moment methods for latent Dirichlet allocation (LDA) offer a starting point: when the total Dirichlet concentration is known, low-order word moments can be corrected to yield operators diagonal in the latent topic basis. We extend this to downstream regression. Under a finite LDA model with response residuals orthogonal to the low-order token moments used for identification, response-weighted word moments admit the same correction, and the resulting supervised operator identifies the regression coefficient $β$ directly, without estimating document-level topic shares. The main obstacle is that the correction depends on the unknown total concentration $α_0$. We show that, for $k\ge3$ topics and under a generic finite-probe condition, $α_0$ is identified by commutativity: at the true value a family of corrected word-moment operators commute, whereas away from it they generically do not. This yields a feasible estimator and lets uncertainty in $\hatα_0$ propagate into inference for $β$. The estimator is asymptotically linear as the number of documents grows with fixed document length, with sandwich standard errors from document-level moment contributions. Simulations show near-nominal coverage where plug-in topic-share regressions can undercover, and an application to top economics journals illustrates contrast inference for latent topic effects.

2605.30683 2026-06-01 econ.GN q-fin.EC

Towards an Ideometrics-Based General Theory of Human Progress

迈向基于观念计量学的人类进步一般理论

Igor Rudan, Steven Kerr

AI总结 本文提出以观念计量学为基础,构建可检验的人类进步与文明进步一般理论,通过观念生命周期动态过程重新定义进步,并引入人类进步观念计量指数(IIHP)和文明进步观念计量指数(IICP)进行量化评估。

详情
Comments
27 pages, 1 table, 48 references
AI中文摘要

本文提出观念计量学作为人类进步和文明进步的一般化且可检验理论的基础,从而将观念计量学与经济学和历史研究联系起来。基于先前将人脑概念化为观念传感器的工作,人类进步主要不是通过财富、健康或技术进步等结果来理解,而是通过塑造未来状态的“观念生命周期”动态过程来理解。本文提出了人类进步的形式化定义:在给定可用信息和不确定性,以及人类能力、能量、时间和资源稀缺的条件下,个人和社会生成、评估、优先排序和实施观念的能力的可测量改进,这种改进使得优先排序的观念越来越与那些真正导致更优未来状态的观念相一致。它引入了人类进步观念计量指数(IIHP),该指数捕捉观念生成的质量(G)、评估的准确性(E)、优先排序的效率(P)以及实施的有效性(Ie)。研究表明,如果观念感知的未来价值与其真实实现的未来价值之间良好对齐(通过结果监测O评估),则未来进步将得以实现。这一表述将分析焦点从静态结果转移到评估观念的质量,从而为理解进步与倒退提供了新的视角。该概念还可通过文明进步观念计量指数(IICP)扩展到漫长的历史时期,其中增加了成功记录结果(D)和成功代际传递积累知识(T)的额外参数。通过将观念转化为可测量的分析单位,观念计量学为理解人类进步提供了一种潜在的变革性方法。

英文摘要

This paper proposes ideometrics as the foundation for a generalised and potentially testable theory of human progress and civilisational progress, thus linking ideometrics to studies in economics and history. Building on prior work that conceptualises the human brain as a sensor of ideas, human progress is understood not primarily through outcomes such as wealth, health, or technological advancement, but through the dynamic process of the "idea life cycle" that shapes future states. The paper advances a formal definition of human progress as a measurable improvement in the ability of individuals and societies to generate, evaluate, prioritise, and implement ideas in a way that increasingly aligns prioritised ideas with those that truly lead to preferred future states, given available information and uncertainty, and under scarcity of human capacity, energy, time and resources. It introduces the Ideometric Index of Human Progress (IIHP) that captures the quality of idea generation (G), accuracy of their evaluation (E), efficiency of their prioritisation (P), and effectiveness of their implementation (Ie). It shows that the future progress will be realised if there is good alignment between the perceived future value of ideas and their true, realised future value, assessed as outcome monitoring (O). This formulation shifts the analytical focus from static outcomes to the quality of evaluating ideas, thereby offering a novel lens for understanding progress and regress. The concept can also be extended to long periods of history through the Ideometric Index of Civilisational Progress (IICP), where additional parameters of successful documentation of outcomes (D) and successful intergenerational transmission of gathered knowledge (T) are added. By transforming ideas into measurable units of analysis, ideometrics offers a potentially transformative approach to understanding human progress.

2605.30672 2026-06-01 q-fin.GN econ.GN q-fin.EC

Residual Supply and the Price of Risk Absorption

剩余供给与风险吸收的价格

Ziyao Wang

AI总结 本文通过连续时间市场出清模型,研究开放式基金赎回时有限资本投资者吸收剩余供给所需的预期回报,并利用2003-2024年美国共同基金数据实证检验了剩余供给价格的影响因素及其对资产价格的影响。

详情
AI中文摘要

当开放式基金赎回时,如果自然买家没有立即介入,一些资本有限的投资者必须充当对手方并持有库存,直到价格恢复。本文探讨了这些投资者所需的回报。一个连续时间市场出清模型给出了一个预期收益约束,其中剩余供给的价格取决于库存风险、交易成本、融资摩擦以及可用于吸收的资产负债表的稀缺性。通过将2003-2024年美国共同基金流量映射到预定持仓上,我们衡量了这种剩余供给的一个可观测组成部分。强制卖出压力预测了基金的实际卖出、同期价格下跌以及随后1至6个月的正回报。当市场整体吸收能力紧张时,该溢价大约翻倍,并且集中在投资者基础薄弱和交易能力有限的股票中——这正是清除失衡成本最高的横截面,而机械性的收益反转无法产生这种模式。

英文摘要

When redeeming open-end funds sell and natural buyers do not step in at once, some limited-capital investor must take the other side and carry the inventory until prices recover. This paper asks what return that investor requires. A continuous-time market-clearing model delivers an expected-return restriction in which the price of residual supply depends on inventory risk, trading costs, funding frictions, and the scarcity of balance sheet available to absorb it. Mapping U.S. mutual fund flows through predetermined holdings over 2003--2024, we measure one observable component of this residual supply. Forced-sale pressure predicts actual fund selling, contemporaneous price declines, and positive returns over the following one to six months. The premium roughly doubles when market-wide absorption capacity is tight, and it concentrates in stocks with thin investor bases and limited trading capacity -- precisely the cross section in which clearing the imbalance should be most costly, and a pattern that mechanical return reversal does not generate.

2605.30609 2026-06-01 econ.EM math.ST stat.AP stat.ME stat.TH

Rectified Linear Unit Regression

修正线性单元回归

Tatsushi Oka

AI总结 提出一种名为修正线性单元(ReLU)回归的方法,通过将ReLU变换后的结果投影到协变量上,直接估计条件结果分布的积分泛函,并建立其渐近分布和推断方法,扩展了经验研究中可用的分布参数集。

详情
AI中文摘要

本文开发了一个用于直接估计条件结果分布积分泛函的回归框架。所提出的方法称为修正线性单元(ReLU)回归,它将ReLU变换后的结果投影到协变量上,并得到一个闭式估计量。其总体回归函数与结果的积分条件分布函数一致,而通过Legendre-Fenchel变换得到的凸共轭则恢复了积分条件分位数函数。回归及其共轭都只需要温和的分布假设,并适用于非连续结果。我们建立了估计量的均匀渐近分布,并通过Hadamard方向可微映射的delta方法对共轭泛函进行推断。基于这些结果,我们建立了概率水平任意子区间上平均分位数处理效应的识别和推断。这拓宽了经验工作中可用的分布参数集。

英文摘要

This paper develops a regression framework for the direct estimation of integrated functionals of conditional outcome distributions. The proposed method, termed rectified linear unit (ReLU) regression, projects the ReLU-transformed outcome onto covariates and admits a closed-form estimator. Its population regression function coincides with the integrated conditional distribution function of the outcome, and its convex conjugate, obtained via the Legendre-Fenchel transformation, recovers the integrated conditional quantile function. Both the regression and its conjugate require only mild distributional assumptions and accommodate non-continuous outcomes. We establish the uniform asymptotic distribution of the estimator and develop inference for the conjugate functional via the delta method for Hadamard directionally differentiable maps. Building on these results, we establish identification and inference for average quantile treatment effects over arbitrary subintervals of probability levels. This broadens the set of distributional parameters available to empirical work.

2605.30566 2026-06-01 physics.soc-ph econ.TH q-bio.PE

Participation Costs Narrow Democratic Cooperation

参与成本缩小民主合作

Mohammad Salahshour, Fjolle Shabani, Urs Fischbacher, Iain D. Couzin

AI总结 通过进化模型和在线实验,研究投票成本如何影响民主分配公共品回报的自我维持合作,发现投票成本会减少活跃参与者并导致民主搭便车。

详情
Comments
32 page, 6 figures
AI中文摘要

集体行动通常需要使合作对个体有价值的制度。我们探讨公共品回报的民主分配能否将重复公共品转化为自我维持的合作制度,以及参与成本如何重塑这一过程。一个简单的进化模型表明,投票再分配可以支持亲社会的分配秩序,但也可能维持反社会的分配秩序或民主搭便车,即个体通过他人维持的制度获益而避免参与成本。模型预测了投票成本的竞争效应:在强选择下,成本可能抑制使用制度来奖励低贡献者,但也可能削弱活跃选民群体并侵蚀对贡献奖励的支持。我们在一个预注册的在线实验中测试了这些预测,实验包含\NIncludedGroupsVone{}个五人小组。内生的民主再分配相对于等额公共品控制提高了贡献,零成本投票产生了最强的时间改善。投票成本并未主要使活跃选民转向奖励低贡献者的分配,而是促使行为转向弃权和民主搭便车,使弃权在局部有利,并扩大了任务后对民主参与的感知与行为记录之间的差距。因此,民主分配可以稳定合作,但参与成本会减少积极维持制度的人数,并使这种侵蚀对参与者自身不那么明显。

英文摘要

Collective action often requires institutions that make cooperation individually worthwhile. We ask whether democratic allocation of public-good return can transform a repeated public good into a self-sustaining cooperative institution, and how participation costs reshape that process. A simple evolutionary model shows that voted redistribution can support a prosocial allocation order, but can also sustain an antisocial allocation order or democratic free riding, in which individuals benefit from an institution maintained by others while avoiding the cost of participation. The model predicts competing effects of voting cost. Cost can suppress use of the institution to reward low contributors under strong selection, but can also thin the active electorate and erode contributor-rewarding support. We test these predictions in a preregistered online experiment with \NIncludedGroupsVone{} five-person groups. Endogenous democratic redistribution increased contributions relative to an equal-share public-goods control, with zero-cost voting producing the strongest temporal improvement. Voting costs did not mainly turn active voters toward low-contributor-rewarding allocation. Instead, they shifted behavior toward abstention and democratic free riding, made abstention locally rewarding, and widened the gap between post-task perceptions of democratic participation and the behavioral record. Democratic allocation can therefore stabilize cooperation, but participation costs can reduce the number of people actively sustaining the institution and can make that erosion less visible to participants themselves.

2605.30562 2026-06-01 q-fin.PR econ.EM q-fin.MF

Option Pricing under Stochastic Volatility and Jumps:A PIDE Framework with Empirical Evidence

随机波动率与跳跃下的期权定价:一个带有实证证据的PIDE框架

Abigail Anokyewaa Mensah, Ayush Jha, Hongwei Mei, Rui Wang, Svetlozar T. Rachev, Frank J. Fabozzi

AI总结 本文提出了一个联合随机波动率和跳跃动力学的偏积分微分方程(PIDE)框架用于期权定价,并通过S&P500指数期权数据实证表明随机波动率主导定价改进,而跳跃仅在短期和深度虚值区域有边际贡献。

详情
AI中文摘要

我们开发了一个偏积分微分方程(PIDE)框架,用于在联合随机波动率和跳跃动力学下进行期权定价,并使用三个到期日的S&P500指数期权合约评估其实证内容。该框架源自仿射Lévy型过程的无穷小生成元,并通过有限差分离散化和基于FFT的非局部跳跃算子处理实现。通过GMM校准发现,随机波动率占定价改进的主导份额,相对于Black-Scholes,Heston规范将隐含波动率RMSE降低了39%。通过Merton或CGMY规范进行的跳跃增强仅在短期和深度虚值区域产生边际改进。校准后的CGMY活动指数支持复合泊松结构,与S&P500指数收益的高频证据一致。

英文摘要

We develop a partial integro-differential equation (PIDE) framework for option pricing under joint stochastic volatility and jump dynamics, and evaluate its empirical content using the S&P500 index option contracts across three maturities. The framework is derived from the infinitesimal generator of an affine Lévy-type process and implemented via finite-difference discretization with FFT-based treatment of the nonlocal jump operator. Calibration via GMM reveals that stochastic volatility accounts for the dominant share of pricing improvement, where relative to Black-Scholes, the Heston specification reduces implied-volatility RMSE by 39%. Jump augmentation via either Merton or CGMY specifications yields marginal improvements concentrated at short maturities and in the deep out-of-the-money region. The calibrated CGMY activity index supports a compound-Poisson structure, consistent with high-frequency evidence on S&P500 index returns.

2605.30515 2026-06-01 econ.TH

Obviously Strategy-proof Choice of Social Acts

显然策略证明的社会行为选择

Abinash Panda, Anup Pramanik

AI总结 本文在Bahel和Sprumont(2020)提出的社会行为选择框架下,研究显然策略证明实施,并刻画了可通过显然策略证明机制实施的一致社会选择函数类,主要结果表明一致社会选择函数是显然策略证明可实施的当且仅当它是独裁的。

详情
AI中文摘要

我们在Bahel和Sprumont(2020)引入的社会行为选择框架下研究显然策略证明的实施。我们刻画了可通过显然策略证明机制实施的一致社会选择函数类。我们的主要结果表明,一致社会选择函数是显然策略证明可实施的当且仅当它是独裁的。

英文摘要

We study obviously strategy-proof implementation in the framework of social choice over acts introduced by Bahel and Sprumont (2020). We characterize the class of unanimous social choice functions that are implementable via obviously strategy-proof mechanisms. Our main result shows that a unanimous social choice function is obviously strategy-proof implementable if and only if it is dictatorial.

2501.02672 2026-06-01 stat.ML cs.LG econ.EM stat.ME

Re-examining Granger Causality with Causal Bayesian Networks and Reichenbachs Principles

重新审视格兰杰因果关系:基于因果贝叶斯网络和赖兴巴赫原理

S. A. Adedayo

AI总结 本文通过赖兴巴赫原理和因果贝叶斯网络重新解释格兰杰因果关系,提出因果化格兰杰因果关系(c-GC)算法,赋予其稳健的因果解释,并在合成数据上取得满意结果。

详情
AI中文摘要

表征复杂系统中的因果关系是理解其潜在机制的基础。格兰杰因果关系(GC)仍然是识别时间序列数据中因果关系的广泛使用的计算工具。然而,与其他因果发现方法一样,GC存在局限性,并因缺乏严格的因果基础而受到批评。在这项工作中,我们通过赖兴巴赫原理和因果贝叶斯网络的视角重新解释GC,从而解决了这一批评。这种重新解释被实现为一种算法,我们称之为因果化格兰杰因果关系(c-GC)。我们在理论上和图形上证明,这种重新表述在特定假设下赋予GC稳健的因果解释。c-GC在合成数据上取得了令人满意的结果,为观测数据集中的因果发现提供了一个更有原则的框架。

英文摘要

Characterising cause-effect relationships in complex systems is fundamental to understanding their underlying mechanisms. Granger causality (GC) remains a widely used computational tool for identifying causal relationships in time series data. However, like other causal discovery methods, GC has limitations and has been criticised for lacking a rigorous causal foundation. In this work, we present a fix to this criticism by reinterpreting GC through the lenses of Reichenbach's principles and causal Bayesian networks. This reinterpretation was implemented as an algorithm we call causalized Granger causality (c-GC). We demonstrate, both theoretically and graphically, that this reformulation endows GC with a robust causal interpretation under specific assumptions. c-GC yields satisfactory results on synthetic data, offering a more principled framework for causal discovery in observational datasets.

2412.09430 2026-06-01 econ.EM stat.AP

A Kernel Score Perspective on Forecast Disagreement and the Linear Pool

预测分歧与线性池的核分数视角

Fabian Krüger

AI总结 本文通过将线性池的结果从平方误差损失推广到所有核分数,揭示了预测分歧(组件分布的平均成对散度)对线性池性能的重要影响,并提出了在给定核评分规则下等权重最优的新条件。

详情
AI中文摘要

本文将线性池的几个结果从平方误差损失推广到所有核分数。后者是一类丰富的评分规则,涵盖了单变量和多变量、离散和连续设置的点预测和分布预测。其成员包括用于单变量分布预测的连续排名概率分数和用于多变量分布预测的能量分数。我们的结果表明,预测分歧(所有组件分布的平均成对散度)对线性池的性能有重要影响。这些结果对于理解和设计一般组合设置中的线性池很有用。特别是,它们激励使用线性池(相对于其他组合公式),并提出了在给定核评分规则下等组合权重最优的新条件。

英文摘要

This paper generalizes several results on linear pooling from squared error loss to all kernel scores. The latter are a rich family of scoring rules that covers point and distribution forecasts for univariate and multivariate, discrete and continuous settings. Its members include the Continuous Ranked Probability Score for univariate distribution forecasting and the Energy Score for multivariate distribution forecasting. Our results indicate that forecast disagreement (measured as the average pairwise divergence of all component distributions) has important implications for the linear pool's performance. The results are useful for understanding and designing linear pools in general combination settings. In particular, they motivate using the linear pool (as opposed to other combination formulas) and yield a novel condition under which equal combination weights are optimal under a given kernel scoring rule.

2510.15617 2026-06-01 econ.GN q-fin.EC

Price Pass-Through of Austria's Single-Use Plastics Producer Charges: Evidence from Retail Offer Spells

奥地利一次性塑料生产者收费的价格传导:来自零售报价序列的证据

Felix Reichel

AI总结 利用2020-2024年奥地利零售报价面板数据,通过双向固定效应模型估计一次性塑料合规成本对在线价格的传导效应,发现平均价格上涨约4.1%,且预期性传导早于实际缴费。

详情
Journal ref
Reg. Sci. Env. Econ. 3, 2026
Comments
56 pages
AI中文摘要

一次性塑料(SUP)带来巨大的环境成本。根据欧盟指令2019/904,奥地利引入了生产者收费并强制要求参与收集和回收系统。本文利用从价格比较平台提取的月度零售报价序列面板数据,估计合规成本在多大程度上传导至奥地利的在线标价。处理样本包括关键词匹配的SUP产品,如气球、外带杯、湿巾、塑料袋、食品容器、烟草过滤嘴产品、饮料瓶和塑料包装,同时观察2020-2024年期间非SUP列表作为对照组。双向固定效应(TWFE)模型显示,处理后的平均价格上涨约4.1%。将2023年3月的行政报告阶段和2024年3月的付款到期阶段分离的序贯TWFE模型显示,较大的调整发生在较早的报告阶段,仅报告效应约为8.1%,而增量付款阶段效应为5.6%。对于气球这一受监管费用影响显著的类别,事件研究估计在初始付款日期后立即超过50%,并在处理后的多数窗口期内保持高位。这些发现表明,奥地利在线零售商在费用支付截止日期之前调整了价格,这与预期合规成本的预期性传导而非对实际支付的离散反应一致。由于数据包含价格观测值而非数量数据,分析涉及价格归宿而非消费或环境结果。

英文摘要

Single use plastics (SUPs) impose substantial environmental costs. Following Directive (EU) 2019/904, Austria introduced producer charges and mandatory participation in collection and recycling systems. This paper exploits a monthly panel of retail offer spells drawn from a price comparison platform to estimate the extent to which compliance costs pass through to posted online prices in Austria. The treated sample comprises keyword matched SUP products including balloons, to go cups, wet wipes, plastic bags, food containers, tobacco filter items, beverage bottles, and plastic wraps observed alongside a control group of non SUP listings over 2020-2024. A two way fixed effects (TWFE) specification places the average post treatment price increase at approximately 4.1 percent. A sequential TWFE model separating the administrative reporting phase from March 2023 and the payment due phase from March 2024 reveals that the larger adjustment occurred during the earlier reporting stage, with a reporting only effect of approximately 8.1 percent and an incremental payment phase effect of 5.6 percent. For balloons, a category subject to pronounced regulatory fee exposure, event study estimates exceed 50 percent immediately following the initial payment date and remain elevated throughout most of the post treatment window. These findings indicate that Austrian online retailers adjusted prices in advance of fee payment deadlines, consistent with anticipatory pass through of expected compliance costs rather than a discrete response to realized payments. As the data contain price observations but not quantity data, the analysis speaks to price incidence and not to consumption or environmental outcomes.

2603.29972 2026-06-01 stat.ME cs.LG econ.EM stat.ML

Do covariates explain why these groups differ? The choice of reference group can reverse conclusions in the Oaxaca-Blinder decomposition

协变量能否解释这些群体差异?参考组的选择可能逆转Oaxaca-Blinder分解的结论

Manuel Quintero, Advik Shreekumar, William T. Stephenson, Tamara Broderick

AI总结 本文通过理论和实证证明,在Oaxaca-Blinder分解中,参考组的选择可能导致实质性不同的结论,且该问题在复杂回归模型中更为常见,建议研究者报告两种方向的分解结果。

详情
Comments
28 pages, 4 figures
AI中文摘要

科学家们常常试图解释为什么两个群体的结果存在差异。例如,两家医院患者死亡率的差异可能源于患者本身的差异(协变量)或医疗护理的差异(给定协变量下的结果)。Oaxaca-Blinder分解(OBD)是区分这些因素的标准工具。众所周知,OBD需要选择其中一个群体作为参考,且数值答案可能因参考组而异。据我们所知,目前尚无系统研究探讨OBD参考组的选择是否会导致不同的实质性结论以及该问题的普遍性。在本文中,我们通过真实数据和模拟数据给出了存在性证明,表明OBD参考组确实可能导致实质性不同的结论。我们的实证研究发现,当OBD扩展到更复杂的回归模型(包括预训练变换器)时,这种敏感性更为常见。我们的理论和实证结果共同表明,这些结论逆转并非完全由模型误设、小数据或对抗性参数选择导致。我们的结果表明,实践者应始终报告OBD的两个方向;现代机器学习和大数据集并不能自动解决结论逆转问题;且需要进一步研究这一问题。

英文摘要

Scientists often want to explain why an outcome is different in two groups. For instance, differences in patient mortality rates across two hospitals could be due to differences in the patients themselves (covariates) or differences in medical care (outcomes given covariates). The Oaxaca--Blinder decomposition (OBD) is a standard tool to tease apart these factors. It is well known that the OBD requires choosing one of the groups as a reference, and the numerical answer can vary with the reference. To the best of our knowledge, there has been no systematic investigation into whether the choice of OBD reference can yield different substantive conclusions and how common this issue is. In the present paper, we give existence proofs in real and simulated data that the OBD references can in fact yield substantively different conclusions. Our empirical exercises find that this sensitivity is more common when the OBD is extended to more complex regression models, including a pretrained transformer. Our theoretical and empirical results together establish that these conclusion reversals are not entirely driven by model misspecification, small data, or adversarial parameter choices. Our results suggest that practitioners should always report both directions of the OBD; that modern machine learning and large datasets do not automatically resolve the conclusion reversal problem; and that further work on this problem is needed.

2603.29317 2026-06-01 econ.GN q-fin.EC

Should I State or Should I Show? Aligning AI with Human Preferences

我应该陈述还是展示?使AI与人类偏好对齐

Keaton Ellis, Wanying Huang

AI总结 通过在线实验比较AI代理从陈述偏好(文字提示)和显示偏好(选择数据)中学习人类偏好的效果,发现显示偏好数据预测更准确,但用户常选择信息量较少的方式,且AI在冲突时更倾向于遵循提示。

详情
AI中文摘要

随着AI代理变得更加自主,将其目标与人类偏好正确对齐变得越来越重要。我们研究了AI代理在风险选择中通过陈述偏好与显示偏好学习人类委托人偏好的有效性。我们进行了一项在线实验,受试者通过书面指令(“提示”)陈述其偏好,并通过一系列二元彩票问题中的选择(“数据”)显示其偏好。我们发现,平均而言,给定显示偏好数据的AI代理预测受试者选择的准确性高于给定陈述偏好提示的AI代理。进一步分析表明,这种差距源于受试者难以将自己的偏好转化为书面指令。当被选择向AI代理提供哪种信息源时,大部分受试者未能选择信息量更大的那个。此外,当两个来源的预测冲突时,我们发现AI代理更频繁地与提示对齐,尽管其准确性较低。总体而言,这些结果突出了显示偏好方法作为向AI代理传达人类偏好的强大机制,但其成功取决于谨慎的实施。

英文摘要

As AI agents become more autonomous, properly aligning their objectives with human preferences becomes increasingly important. We study how effectively an AI agent learns a human principal's preference in choice under risk via stated versus revealed preferences. We conduct an online experiment in which subjects state their preferences through written instructions ("prompts") and reveal them through choices in a series of binary lottery questions ("data"). We find that on average, an AI agent given revealed-preference data predicts subjects' choices more accurately than an AI agent given stated-preference prompts. Further analysis suggests that the gap is driven by subjects' difficulty in translating their own preferences into written instructions. When given a choice between which information source to give to an AI agent, a large portion of subjects fail to select the more informative one. Moreover, when predictions from the two sources conflict, we find that the AI agent aligns more frequently with the prompt, despite its lower accuracy. Overall, these results highlight the revealed preference approach as a powerful mechanism for communicating human preferences to AI agents, but its success depends on careful implementation.

2602.07486 2026-06-01 econ.EM

Identification of Child Penalties

儿童惩罚的识别

Dor Leventer

AI总结 本文形式化了常用儿童惩罚三重差分估计量的识别框架,提出归一化三重差分(NTD)方法,并证明在平行趋势假设不成立时传统估计量存在偏误,进而提出新的目标参数——父母身份对性别收入比的影响,并在以色列行政数据中应用新估计量发现父母身份对性别收入不平等的贡献在不同处理组间存在异质性。

详情
AI中文摘要

本文形式化了常用儿童惩罚三重差分估计量背后的识别框架,该估计量通过反事实收入进行归一化。我从应用实践中使用的验证测试逆向推导出识别假设,并将其称为归一化三重差分(NTD)。我证明,即使NTD成立,如果平行趋势假设被违反,传统估计量对其目标因果估计量也是有偏的。我提出了一个新的目标——父母身份对性别收入比的影响,并证明它在NTD下是点识别的。将该框架应用于以色列行政数据,偏差边界练习表明,对于早期处理组,传统估计量存在显著偏差。使用新估计量,我发现父母身份对性别收入不平等的贡献在不同处理组间存在异质性。

英文摘要

This paper formalizes the identification framework underlying common child penalty triple difference estimators that normalize by counterfactual earnings. I reverse-engineer the identification assumptions from the validation tests used in applied practice and term this framework Normalized Triple Differences (NTD). I show that the conventional estimator is biased for its target causal estimand, even when NTD holds, if the parallel trends assumption is violated. I propose a new target, the effect of parenthood on the gender earnings ratio, and show it is point identified under NTD. Applying the framework to Israeli administrative data, a bias-bounding exercise suggests the conventional estimator is substantially biased for early treatment groups. Using the new estimator, I find that the contribution of parenthood to gender earnings inequality is heterogeneous across treatment groups.

2601.14150 2026-06-01 econ.GN q-fin.EC

Trade relationships during and after a crisis

危机期间及之后的贸易关系

Alejandra Martinez

AI总结 利用哥伦比亚2010-11年拉尼娜事件期间的道路中断作为外生冲击,研究临时供应中断如何重塑国际贸易中的企业关系组合,发现关系层面和公司层面存在相反效应。

详情
AI中文摘要

本文提供了临时供应中断重塑国际贸易中企业关系组合的因果证据。利用哥伦比亚2010-11年拉尼娜事件期间的外生道路中断,我在买方-卖方关系层面识别暴露程度,利用进口商供应商组合内的变化。当进口商有其他非暴露供应商时,暴露关系更不可能终止。然而,组合暴露范围更广的公司逐渐减少其关系网络,并可能最终退出市场。一个将关系剩余与组合构成联系起来的框架解释了这些对比反应,并展示了同一冲击如何在关系层面和公司层面产生相反效应。

英文摘要

This paper provides causal evidence that temporary supply disruptions reshape firms' relationship portfolios in international trade. Using exogenous road disruptions during Colombia's 2010-11 La Niña episode, I identify exposure at the buyer-seller relationship level, exploiting variation within importers' supplier portfolios. Exposed relationships are less likely to terminate when importers have alternative non-exposed suppliers. However, firms with broader portfolio exposure gradually reduce their relationship networks and may eventually exit the market. A framework linking relationship surplus to portfolio composition explains these contrasting responses and shows how the same shock generates opposite effects at relationship and firm levels.

2508.17671 2026-06-01 cs.GT cs.AI cs.MA econ.TH

Consistent Opponent Modeling in Imperfect-Information Games

不完全信息博弈中的一致对手建模

Sam Ganzfried

AI总结 针对不完全信息博弈中现有对手建模方法无法保证收敛到对手真实策略的问题,提出一种基于序列形式博弈表示和投影梯度下降的凸优化算法,实现高效且一致的对手建模。

详情
AI中文摘要

多智能体环境中智能体的目标是在与对手交互时最大化总收益。遵循博弈论解概念(如纳什均衡)在某些场景下可能获得强性能;然而,这类方法未能利用与对手重复交互中的历史和观测数据。对手建模算法整合机器学习技术,利用可用数据来利用次优对手;然而,这类方法在不完全信息博弈中的有效性至今相当有限。我们表明,即使面对来自已知先验分布的静态对手,现有对手建模方法也无法满足一个简单的理想性质;即,即使博弈迭代次数趋近无穷,它们也不能保证模型趋近对手的真实策略。我们开发了一种新算法,能够实现这一性质,并通过基于序列形式博弈表示和投影梯度下降求解凸最小化问题来高效运行。在标准贝叶斯可辨识性和访问假设下,该算法保证从游戏过程的观测以及可能可用的额外历史数据中高效收敛到对手的真实策略。

英文摘要

The goal of agents in multi-agent environments is to maximize total reward against the opposing agents that are encountered. Following a game-theoretic solution concept, such as Nash equilibrium, may obtain a strong performance in some settings; however, such approaches fail to capitalize on historical and observed data from repeated interactions against our opponents. Opponent modeling algorithms integrate machine learning techniques to exploit suboptimal opponents utilizing available data; however, the effectiveness of such approaches in imperfect-information games to date is quite limited. We show that existing opponent modeling approaches fail to satisfy a simple desirable property even against static opponents drawn from a known prior distribution; namely, they do not guarantee that the model approaches the opponent's true strategy even in the limit as the number of game iterations approaches infinity. We develop a new algorithm that is able to achieve this property and runs efficiently by solving a convex minimization problem based on the sequence-form game representation using projected gradient descent. The algorithm is guaranteed to efficiently converge to the opponent's true strategy under standard Bayesian identifiability and visitation assumptions, given observations from gameplay and possibly additional historical data if it is available.

2509.13323 2026-06-01 cs.HC econ.GN q-fin.EC

AI Behavioral Science

AI 行为科学

Matthew O. Jackson, Qiaozhu Me, Stephanie W. Wang, Yutong Xie, Walter Yuan, Seth Benzell, Erik Brynjolfsson, Colin F. Camerer, James Evans, Brian Jabarian, Jon Kleinberg, Juanjuan Meng, Sendhil Mullainathan, Asuman Ozdaglar, Thomas Pfeiffer, Moshe Tennenholtz, Robb Willer, Diyi Yang, Teng Ye

AI总结 本文提出“AI 行为科学”新领域,从三个视角探讨:利用社会科学工具评估AI行为、AI改变人类行为研究方法、以及人机交互对经济政治的影响。

详情
AI中文摘要

我们概述了“AI 行为科学”这一新领域的基础,涵盖三个视角。首先,随着AI变得无处不在且日益专有和不透明,开发评估AI行为的技术变得至关重要。我们概述了社会科学家用于评估人类行为的工具如何可用于评估和推断AI的行为偏差、倾向和启发式。其次,我们还讨论了AI如何改变我们了解人类行为的方式。除了其计算能力,AI还提供了模拟、推断和预测人类行为的新技术,我们对此进行了概述和讨论。第三,随着人类和AI在日益复杂和交织的系统中互动,我们需要理解由此产生的经济和政治结果的影响。我们概述了关于人机交互未来以及可能随之而来的变化和破坏日益紧迫的问题。

英文摘要

We outline a foundation for a new field of ``AI Behavioral Science,'' covering three perspectives. First, as AI becomes ubiquitous and is increasingly proprietary and opaque, it becomes vital to develop techniques for assessing AI behavior. We outline how tools developed to assess people's behaviors by social scientists can be used to assess and infer AI's behaviors biases, tendencies, and heuristics. Second, we also discuss how AI can change the ways in which we learn about human behavior. Beyond its computational power, AI offers new techniques for simulating, inferring, and predicting human behaviors that we outline and discuss. Third, as humans and AI are interacting in increasingly complex and intertwined systems, we need to understand the implications for the resulting economic and political outcomes. We outline issues that are increasingly pressing concerning the future of human-AI interactions and potential changes and disruptions that can ensue.

2404.19707 2026-06-01 econ.EM math.ST stat.ME stat.TH

Identification by non-Gaussianity in structural threshold and smooth transition vector autoregressive models

结构阈值和平滑转换向量自回归模型中的非高斯性识别

Savi Virolainen

AI总结 通过假设冲击相互独立且至多一个为高斯分布,证明了结构平滑转换向量自回归模型的统计可识别性,并提出了估计方法和混合识别策略。

详情
AI中文摘要

我们证明了如果冲击相互独立且至多一个为高斯分布,则结构平滑转换向量自回归模型在统计上是可识别的。这将对线性结构向量自回归的已知识别结果扩展到时变影响矩阵。我们还提出了一种估计方法,展示了如何采用混合识别策略来解决弱识别问题,并建立了遍历平稳性的充分条件。所引入的方法在随附的R包sstvars中实现。我们的实证应用发现,在低经济政策不确定性和高经济政策不确定性下,正向气候政策不确定性冲击都会减少产出并提高通胀,但其影响,尤其是对通胀的影响,在高不确定性时期更强。

英文摘要

We show that structural smooth transition vector autoregressive models are statistically identified if the shocks are mutually independent and at most one of them is Gaussian. This extends a known identification result for linear structural vector autoregressions to a time-varying impact matrix. We also propose an estimation method, show how a blended identification strategy can be adopted to address weak identification, and establish a sufficient condition for ergodic stationarity. The introduced methods are implemented in the accompanying R package sstvars. Our empirical application finds that a positive climate policy uncertainty shock reduces production and raises inflation under both low and high economic policy uncertainty, but its effects, particularly on inflation, are stronger during the latter.

2507.04545 2026-06-01 econ.GN cs.SI q-fin.EC

Measuring Social Media Network Effects

测量社交媒体网络效应

Sinan Aral, Seth G Benzell, Avinash Collis, Christos Nicolaides

AI总结 通过实验测量美国四大社交媒体平台的本地网络效应,发现其解释8.1-23.7%的平台价值,并揭示连接价值的异质性。

详情
AI中文摘要

网络效应——额外消费者带来的效用增益——被广泛认为对数字经济至关重要。然而,近期的理论和证据表明,本地网络效应——特定社交网络连接创造的经济价值——驱动着网络化在线平台的价值。通过对美国19,923名Facebook、Instagram、LinkedIn和X用户进行激励相容的在线选择实验,我们首次对数字经济中的本地网络效应进行了大规模实证测量,并测量了不同平台间连接价值的异质性。平台价值范围为每位消费者每月78至101美元,其中8.1-23.7%由本地网络效应解释。我们发现:1)在Facebook和Instagram上,强关系更有价值,而在LinkedIn和X上,弱关系更有价值;2)工作连接在LinkedIn上最有价值,在Facebook上最无价值,求职者显著更看重LinkedIn,而显著不看重Facebook;3)男性对女性连接的估值显著高于对其他男性的连接,尤其在Instagram、Facebook和X上,而女性对男性和女性的连接估值在各平台上相当;4)如果消费者在其他平台上也相连,则他们在任何平台上对连接的估值更高,表明平台是互补品而非替代品;5)白人消费者在Facebook上不成比例地看重同种族连接,而在Instagram上,对18岁及以下年龄连接者的估值显著高于其他任何年龄组——这两种模式在其他平台上未见。每个平台在美国每年产生530亿至2150亿美元的消费者剩余。这些结果表明,社交媒体产生巨大价值,其中本地网络效应驱动了相当一部分,且这些效应的来源和轮廓因平台、消费者和连接而异。

英文摘要

Network effects -- the utility gains from additional consumers of a good -- are widely regarded as critical to the digital economy. Yet recent theory and evidence suggest that local network effects -- the economic value created by specific social network connections -- drive value in networked online platforms. Using incentive-compatible online choice experiments with 19,923 Facebook, Instagram, LinkedIn, and X users in the United States, we provide the first large-scale empirical measurement of local network effects in the digital economy and measure heterogeneity in connection value across platforms. Platform value ranges from \$78 to \$101 per consumer per month, with 8.1-23.7% explained by local network effects. We find that 1) stronger ties are more valuable on Facebook and Instagram, while weaker ties are more valuable on LinkedIn and X; 2) work connections are most valuable on LinkedIn and least on Facebook, and job-seekers value LinkedIn significantly more and Facebook significantly less; 3) men value connections to women significantly more than to other men, particularly on Instagram, Facebook, and X, while women value connections to men and women equally across platforms; 4) consumers value connections on any platform more if they are also connected on other platforms, suggesting that platforms are complements, not substitutes; 5) white consumers disproportionately value same-race connections on Facebook while, on Instagram, connections to alters eighteen or younger are valued significantly more than any other age group -- two patterns not seen on other platforms. Each platform generates between \$53B and \$215B in annual US consumer surplus. These results suggest that social media generates significant value, that local network effects drive a substantial fraction of it, and that the sources and contours of these effects vary across platforms, consumers, and connections.

2504.20429 2026-06-01 econ.GN q-fin.EC

Estimating the housing production function with unobserved land heterogeneity

估计存在未观测土地异质性的住房生产函数

Yusuke Adachi

AI总结 本文提出一种方法,利用重复截面建筑数据和马尔可夫矩条件,在未观测局部条件影响资本选择时估计基于收入的住房生产函数,并应用于东京23区的新建住房数据。

详情
AI中文摘要

密集城市的住房供应取决于建筑商用资本替代稀缺土地的能力。由于建筑商在观察到研究者仅能部分观测的微观地理条件后才选择资本,这一替代弹性难以估计。本文开发了一种在此情境下估计基于收入的住房生产函数的方法。由于观测到的资本变化既反映了技术替代,也反映了对潜在局部条件的内生反应,现有估计量可能将未观测异质性传递到估计的生产函数中。该方法将影响资本选择的未观测局部条件视为标量马尔可夫状态,并将资本份额方程与使用重复截面建筑数据实现的马尔可夫矩相结合。蒙特卡洛模拟表明,当资本选择响应潜在局部条件时,该估计量能在灵活的生产技术下恢复资本和土地弹性。对东京23区新建住房的应用说明了如何在密集的单城市情境中实施该方法。结果表明,明确建模潜在局部异质性对估计的资本-土地弹性有重要影响,并暗示规模报酬接近1。

英文摘要

Housing supply in dense cities depends on the ability of builders to substitute capital for scarce land. This margin is difficult to estimate because builders choose capital after observing microgeographic conditions that are only imperfectly observed by researchers. This paper develops a method for estimating revenue-based housing production functions in this setting. Because observed capital variation reflects both technological substitution and endogenous responses to latent local conditions, existing estimators can transmit unobserved heterogeneity into the estimated production function. The method treats the unobserved local conditions that affect capital choice as a scalar Markov state and combines the capital share equation with Markov moments implemented using repeated cross-sectional construction data. Monte Carlo simulations show that the estimator recovers capital and land elasticities under flexible production technologies when capital choices respond to latent local conditions. An application to newly constructed housing in Tokyo's 23 special wards illustrates how the method can be implemented in a dense single-city setting. The results show that explicitly modeling latent local heterogeneity matters for estimated capital-land elasticities and implies returns to scale close to one.

2403.11240 2026-06-01 econ.TH

Speed, Accuracy, and Complexity

速度、准确性与复杂性

Duarte Gonçalves

AI总结 研究决策者根据问题复杂性调整信息获取时机时,反应时间与复杂性的倒U型关系,以及能力与反应时间的模糊关系。

详情
AI中文摘要

本文研究反应时间何时能揭示问题复杂性。它重新审视了一个经典的序贯抽样模型,其中决策者选择何时停止获取昂贵信息。问题复杂性由证据过程的噪声信号比衡量。在外生停止规则下——即决策者未根据问题复杂性进行最优调整时——反应时间随复杂性增加而增加。相比之下,当决策者事先观察到问题复杂性并对其进行最优调整时,这种单调性被打破。此时,期望停止时间与复杂性呈倒U型关系,因此非常简单和非常复杂的问题中决策都很快。能力与反应时间同样存在模糊关系:能力更强的决策者在简单问题上更快,但在复杂问题上更慢。最后,本文表明,复杂性和能力可以从决策对补贴的敏感性中推断出来,这种敏感性在更复杂的问题中和能力较弱的决策者中更大。

英文摘要

This paper studies when response time is informative about problem complexity. It revisits a canonical sequential-sampling model in which a decision-maker chooses when to stop acquiring costly information. Problem complexity is measured by the noise-to-signal ratio of the evidence process. Under exogenous stopping rules -- as when the decision-maker does not optimally adjust to problem complexity -- response time increases with complexity. By contrast, this monotonicity breaks down when the decision-maker observes problem complexity ex ante and optimally adjusts to it. Expected stopping time is then inverse-U-shaped in complexity, so choices are fast in both very simple and very complex problems. Ability and response time are similarly ambiguously related: more able decision-makers are faster on simple problems but slower on complex ones. Finally, this paper shows that complexity and ability can be inferred from the sensitivity of choices to subsidies, which is greater in more complex problems and for less able decision-makers.