arXivDaily arXiv每日学术速递 周一至周五更新
重置
econ.EM计量经济12
2606.12324 2026-06-11 econ.EM 新提交

Assumption-Lean Shrinkage and Model Averaging for Spatial Parameters

空间参数的假设稀疏收缩与模型平均

Harvey Barnhard

AI总结 针对空间相关单元的参数估计噪声问题,提出基于SURE的收缩估计器选择与平均方法,在应用中将均方误差降低约27%。

详情
Comments
104 pages
AI中文摘要

经济决策通常依赖于许多关于邻里效应、学校质量和医院绩效的噪声估计。收缩估计可以通过跨相关单元汇集信息来减少这种噪声。当单元通过地理、邻接或共享特征相关联时,主要挑战不仅在于收缩多少,还在于哪些关系应指导汇集。我们使用Stein无偏风险估计(SURE)来选择和平均灵活的收缩估计器,允许研究人员比较相关性的候选定义,而不将任何先验、协方差模型或邻接规则视为潜在参数的真实模型。在直接对估计量映射施加的正则条件下,SURE选择的表现几乎与候选类中的最佳规则一样好。SURE选择的加权平均同样几乎与训练候选者的最佳固定加权平均一样好,包括其拟合值使用完整噪声估计向量的非线性收缩规则。在应用于20个通勤区的机会图谱经济流动性数据时,最佳个体空间规范因区域而异,而SURE选择的平均将报告的SURE估计均方误差相对于表现最佳的非空间经验贝叶斯基准降低了约27%。

英文摘要

Economic decisions often depend on many noisy estimates of neighborhood effects, school quality, and hospital performance. Shrinkage estimation can reduce this noise by pooling information across related units. When units are related through geography, adjacency, or shared characteristics, the main challenge is not only how much to shrink, but which relationships should guide pooling. We use Stein's Unbiased Risk Estimate (SURE) to select among and average over flexible shrinkage estimators, allowing researchers to compare candidate definitions of relatedness without treating any one prior, covariance model, or adjacency rule as the true model for the latent parameters. Under regularity conditions stated directly on the estimator maps, SURE selection performs nearly as well as the best rule in a candidate class. The SURE-chosen weighted average likewise performs nearly as well as the best fixed weighted average of trained candidates, including nonlinear shrinkage rules whose fitted values use the full vector of noisy estimates. In an application to Opportunity Atlas economic mobility data from 20 commuting zones, the best individual spatial specification varies across zones, and the SURE-chosen average reduces reported SURE-estimated mean squared error by about 27% relative to the best-performing non-spatial empirical Bayes benchmark.

2606.12261 2026-06-11 econ.EM 新提交

Rbreak: An R Package for Estimating Structural Breaks under Linear Restrictions with Application to Linear Model Tree

Rbreak: 一个用于在线性约束下估计结构断点的R包及其在线性模型树中的应用

Cheolju Kim, Zhongjun Qu

AI总结 提出R包rbreak,实现在系数向量一般线性约束下检测线性回归模型结构断点并估计断点位置,支持断点置信区间、约束sup-F检验、蒙特卡洛临界值模拟及自举重启过程,并扩展至线性模型树。

详情
AI中文摘要

包\ exttt{rbreak}实现了在系数向量的一般线性约束下,检测线性多元回归模型的结构断点并估计断点位置的方法。约束可以是同一机制内、跨机制或两者兼有,并支持两种形式:仿射参数化(形式A:\ exttt{delta = S*theta + s})和显式线性约束(形式B:\ exttt{R*delta = r})。它提供了带置信区间的断点日期估计、无结构变化原假设的约束sup-F检验、通过蒙特卡罗模拟临界值,以及一种自举重启过程以降低收敛到虚假局部最优的风险。它还实现了一种广义回归树(线性模型树)过程,其中每个叶子包含线性回归而非局部平均值。本文解释了这些方法并通过应用加以说明。

英文摘要

The package \texttt{rbreak} implements methods for detecting structural breaks and estimating break locations for linear multiple regression models under general linear restrictions on the coefficient vector. Restrictions can be within regimes, across regimes, or both, and are supported in two forms: an affine parameterization (Form A: \texttt{delta = S*theta + s}) and explicit linear constraints (Form B: \texttt{R*delta = r}). It provides break date estimation with confidence interval, a restricted sup-F test for the null of no structural change, simulation of critical values by Monte Carlo, and a bootstrap restart procedure to reduce the risk of convergence to spurious local optima. It also implements a generalized regression tree (linear model tree) procedure where each leaf contains a linear regression rather than a local average. This note explains the methods and illustrates them with applications.

2606.12185 2026-06-11 econ.EM math.ST 新提交

Pivotal and identification-robust nonparametric inference in linear IV models

线性IV模型中的关键与识别鲁棒非参数推断

Bertille Antoine, Pascal Lavergne

AI总结 针对线性工具变量模型,提出对识别强度与异方差鲁棒且第一阶段非参数的新推断方法,包括渐近关键统计量、子向量推断和设定检验。

详情
AI中文摘要

我们为线性IV模型开发了新的推断程序,这些程序对识别强度和未知形式的异方差具有鲁棒性,并且对第一阶段方程是非参数的。我们的第一个检验专门用于内生解释变量的参数推断。我们的新统计量修改了Antoine和Lavergne(2003)的统计量,直接考虑了未知形式的异方差。因此,它是渐近关键的,从而在实践中大大简化了推断。我们还开发了(i)一个识别鲁棒的子向量推断程序,该程序不依赖于剩余参数的识别强度知识,以及(ii)一个纯设定检验。在这两种情况下,检验是保守但有效的。我们通过模拟和实际应用表明,我们的程序计算友好且与现有方法相比具有竞争力。

英文摘要

We develop new inference procedures for a linear IV model that are robust to identification strength and heteroskedasticity of unknown form, and nonparametric with respect to the first-stage equation. Our first test is tailored for inference on parameters of endogenous explanatory variables. Our new statistic modifies that of Antoine and Lavergne (2003) to directly account for heteroskedasticity of unknown form. As a result, it is asymptotically pivotal, so that inference is greatly facilitated in practice. We also develop (i) an identification-robust subvector inference procedure that does not rely on the knowledge of identification strength for the remaining parameters, and (ii) a pure specification test. In both cases, the tests are conservative but powerful. We show that our procedures are computationally friendly and competitive with existing ones in simulations and an application.

2606.12184 2026-06-11 econ.EM 新提交

Threshold Regression for Fixed-T Panel Data with Interactive Fixed Effects

具有交互固定效应的固定T面板数据阈值回归

Jan Ditzen (1), Yiannis Karavias (2 and 3), Joakim Westerlund (4 and 5) ((1) Free University of Bozen-Bolzano, (2) Brunel University of London, (3) University of Birmingham, (4) Lund University, (5) Deakin University)

AI总结 针对固定时间期数T的面板数据阈值回归模型,提出一种基于最小二乘的简单估计方法及推断工具,并应用于研究通胀对经济增长的影响。

详情
AI中文摘要

本文为具有交互固定效应和固定时间期数T的面板数据阈值回归模型开发了一套新的估计与推断工具箱。该工具箱设计简单、准确且计算高效,基于模型参数的简单最小二乘风格估计量,并包含多种推断程序,用于检验关于阈值及其他参数的假设。该新工具箱被应用于研究通货膨胀对经济增长的影响。

英文摘要

This paper develops a new toolbox for estimation and inference in panel data threshold regression models with interactive fixed effects and a fixed number of time periods, T. The toolbox is designed to be simple, accurate and computationally efficient. It is based on a simple least squares style estimator of the model parameters, and includes a number of inferential procedures for testing hypotheses regarding not only the threshold but also other parameters. The new toolbox is applied to study the impact of inflation on economic growth.

2606.11526 2026-06-11 stat.ME econ.EM 新提交

What is the Long-Term Value of Reliability?

可靠性的长期价值是什么?

Chenyu Qiu, Xu Kuang, Inessa Liskovich, Ali Rauh, Stefan Wager

AI总结 提出Chronos LTV系统,利用马尔可夫决策过程建模客户交互,通过协变量平衡算法估计延迟率对业务指标的长期影响。

详情
AI中文摘要

我们描述了Chronos LTV,一个用于衡量延迟和其他服务缺陷对关键业务指标长期影响的系统。我们使用马尔可夫决策过程对客户随时间推移的交互进行建模,并将我们的目标估计量形式化为相对于移动平均延迟率的边际政策效应。在此设定下,我们表明,在给定观察到的订单特征的情况下,延迟在顺序无混淆假设下(即延迟近似随机)可以识别长期效应;并且可以使用简单的协变量平衡算法来估计这些效应。

英文摘要

We describe Chronos LTV, a system to measure the long-term impact of delays and other service defects on key business metrics. We use Markov decision processes to model customer interactions over time, and formalize our target estimand as the marginal policy effect with respect to moving the average delay rate. Given this setup, we show that we can identify long-term effects under a sequential unconfoundedness assumption where delays are as good as random given observed order characteristics; and can estimate these effects using a simple covariate-balancing algorithm.

2605.01923 2026-06-11 econ.EM math.ST 版本更新

Estimation and Inference for the $τ$-Quantile of Individual Heterogeneous Coefficient

个体异质性系数的 $\tau$-分位数估计与推断

Antonio F. Galvao, Ulrich Hounyo, Jiahao Lin

AI总结 针对面板数据中个体异质性斜率系数的分位数,提出两步分位数估计框架,并建立渐近理论和自助法推断。

详情
AI中文摘要

本文提出了面板数据中个体异质性斜率系数分位数的估计与推断方法。我们开发了一个两步分位数估计框架,用于分析个体系数的异质性。与关注结果异质性的传统面板分位数回归不同,我们的方法针对个体特定斜率的横截面分布的 $\tau$-分位数。我们在随机设计和确定性设计下建立了渐近理论,收敛速度分别为 $\sqrt{N}$ 和 $\sqrt{N\sqrt{T}}$。我们还开发了两种相应的自助法程序用于实际推断,并正式建立了其有效性。所建议的方法具有实际意义,因为它们需要的样本量增长条件比标准固定效应分位数回归更弱,并且适用于大 $N$ 设置。数值模拟和对共同基金绩效的应用说明了所提出的方法及其在不同分位数上揭示的异质性模式。

英文摘要

This paper proposes estimation and inference procedures for quantiles of the heterogeneous individual-specific coefficients in panel data. Unlike conventional panel quantile regression, which focuses on outcome heterogeneity, our approach targets the $\tau$-quantile of the cross-sectional distribution of individual-specific slopes. We establish the asymptotic theory under both stochastic and deterministic designs, with convergence rates $\sqrt{N}$ and $\sqrt{N\sqrt{T}}$, respectively. We also develop two corresponding bootstrap procedures for practical inference, and formally establish their validity. The suggested methods are of practical interest since they require weaker sample size growth conditions than standard fixed-effect quantile regression, and accommodate large $N$ settings. Numerical simulations and an empirical application illustrate the empirical effectiveness of the methods under both designs.

2411.10959 2026-06-11 econ.EM cs.LG math.ST stat.AP stat.ME stat.ML 版本更新

Program Evaluation with Remotely Sensed Outcomes

利用遥感结果的程序评估

Ashesh Rambachan, Rahul Singh, Davide Viviano

AI总结 本文研究了在实验和准实验中,由于遥感变量不完全测量经济结果而引起的因果推断问题,提出了一种非参数识别因果参数的方法,结合实验和观测数据进行n^{-1/2}推断。

详情
AI中文摘要

我们研究了在实验和准实验中,经济结果由遥感变量不完全测量的因果推断问题。遥感变量是低成本、可扩展且在观测数据中预测经济结果的变量,例如卫星图像和移动电话活动。我们将遥感变量视为后结果:经济结果的变化导致遥感变量的变化。例如,环境质量的变化导致卫星图像的变化,而不是相反。在这一假设下,我们提出了一种结合实验和观测数据的公式,以非参数方式识别因果参数。我们开发了一种n^{-1/2}推断方法,该方法对规格不正确具有鲁棒性,并且不限制用于处理遥感变量的算法。

英文摘要

We study causal inference in experiments and quasi-experiments, where the economic outcome is imperfectly measured by a remotely sensed variable. The remotely sensed variable is low-cost, scalable, and predictive of the economic outcome in observational data; examples include satellite imagery and mobile phone activity. We model the remotely sensed variable as post-outcome: variation in the economic outcome causes variation in the remotely sensed variable. For example, changes in environmental quality cause changes in satellite imagery, not vice versa. Under this assumption, we propose a formula to nonparametrically identify the causal parameter by combining experimental and observational data. We develop a method for n^{-1/2} inference that is robust to misspecification and that does not restrict the algorithms used to process remotely sensed variables.

2511.11862 2026-06-11 econ.EM math.ST stat.ME 版本更新

Compound Selection Decisions: An Almost SURE Approach

复合选择决策:一种几乎无偏的SURE方法

Jiafeng Chen, Lihua Lei, Timothy Sudijono, Liyang Sun, Tian Xie

AI总结 针对高斯序列模型中的复合选择问题,提出基于SURE的几乎无偏估计量ASSURE,通过优化期望效用选择最优决策规则,并证明其渐近最优性。

详情
Comments
V2: Additional Results and Simulations. 110 pages. Comments welcome
AI中文摘要

本文提出了在高斯序列模型中生成复合选择决策的方法。给定未知的固定参数 $\mu_ {1:n}$ 和已知的 $\sigma_{1:n}$,观测值 $Y_i \sim \textsf{N}(\mu_i, \sigma_i^2)$,决策者希望选择一个子集 $S$ 以最大化效用 $\frac{1}{n}\sum_{i\in S} (\mu_i - K_i)$,其中 $K_i$ 为已知成本。受Stein无偏风险估计(SURE)启发,我们引入了一种几乎无偏的估计量,称为ASSURE,用于估计给定决策规则的期望效用。ASSURE允许用户通过优化估计福利,从预先指定的类别中选择福利最大化的规则,从而产生能够跨噪声估计借用强度的选择决策。我们证明,ASSURE产生的决策规则在渐近意义上不劣于预指定类别中最优但不可行的决策规则。我们将ASSURE应用于经济机会的人口普查区选择、歧视性企业的识别以及A/B测试中 $p$ 值决策程序的分析。

英文摘要

This paper proposes methods for producing compound selection decisions in a Gaussian sequence model. Given unknown, fixed parameters $\mu_ {1:n}$ and known $\sigma_{1:n}$ with observations $Y_i \sim \textsf{N}(\mu_i, \sigma_i^2)$, the decision maker would like to select a subset of indices $S$ so as to maximize utility $\frac{1}{n}\sum_{i\in S} (\mu_i - K_i)$, for known costs $K_i$. Inspired by Stein's unbiased risk estimate (SURE), we introduce an almost unbiased estimator, called ASSURE, for the expected utility of a proposed decision rule. ASSURE allows a user to choose a welfare-maximizing rule from a pre-specified class by optimizing the estimated welfare, thereby producing selection decisions that borrow strength across noisy estimates. We show that ASSURE produces decision rules that are asymptotically no worse than the optimal but infeasible decision rule in the pre-specified class. We apply ASSURE to the selection of Census tracts for economic opportunity, the identification of discriminating firms, and the analysis of $p$-value decision procedures in A/B testing.

2205.05002 2026-06-11 econ.EM

Estimating Discrete Games of Complete Information: Bringing Logit Back in the Game

Paul S. Koh

详情
英文摘要

Discrete games are central tools for empirical analysis of strategic interaction, but equilibrium multiplicity and partial identification often make them computationally difficult to estimate. This paper develops tractable methods for estimation and inference in complete-information discrete games. The key idea is to construct an outer set by comparing observed frequencies of action profiles with singleton-class generalized likelihoods: model-implied probabilities that those profiles can arise as equilibria. The resulting conditional moment inequalities avoid computationally expensive equilibrium enumeration, numerical simulation, and grid search. Under standard empirical assumptions used in discrete-game models, including logit payoff shocks, these restrictions have closed-form expressions and are convex in a subvector of structural parameters. I develop the approach for both unordered and ordered action spaces. Monte Carlo experiments and empirical applications show that the methods deliver informative outer sets and can reduce computation time by several orders of magnitude relative to existing approaches.

2403.16673 2026-06-11 stat.ME econ.EM 版本更新

Quasi-randomization tests for network interference: a random graph approach

网络干扰的准随机化检验:一种随机图方法

Supriya Tiwari, Pallavi Basu

AI总结 提出将网络视为随机变量,利用随机图零模型构建无溢出效应的零分布,克服了现有条件随机化检验的计算难题,在有限样本下精确有效,显著提升检验功效。

详情
AI中文摘要

当一个单元的处理状态影响其他单元的潜在结果时,就会产生网络干扰,导致难以检验的溢出效应。我们提出将网络视为随机变量而非固定量来应对这一挑战。这克服了原假设下潜在结果不可插补的关键难题,并避免了现有条件随机化检验的计算复杂性。我们的准随机化检验利用随机图零模型构建无溢出效应的零分布,在网络生成过程的温和假设下,在有限样本中精确有效,并且比现有方法(尤其是在整群随机试验中)提供显著更高的检验功效。我们通过模拟验证了该方法,并通过在中国农村的一项天气保险采纳实验中检验干扰效应进行了说明。

英文摘要

Network interference occurs when the treatment status of one unit affects the potential outcomes of other units, giving rise to spillover effects that are difficult to test for. We propose treating the network as a random variable rather than a fixed quantity to address this challenge. This overcomes a key challenge of non-imputability of potential outcomes under the null and avoids the computational intractability of existing conditional randomization tests. Our quasi-randomization test builds the null distribution of no spillover effects using random graph null models, is exactly valid in finite samples under mild assumptions on the network-generating process, and offers substantially improved power over existing methods, particularly in cluster-randomized trials. We validate our approach via simulation and illustrate it by testing for interference in a weather insurance adoption experiment in rural China.

2503.23569 2026-06-11 econ.GN econ.EM q-fin.EC

Where the Trees Fall: Macroeconomic Forecasts for Forest-Reliant States

Andrew Crawley, Adam Daigneault, Jonathan Gendron

详情
Journal ref
Forest Policy and Economics 186 (2026) 103739
英文摘要

Several key states in various regions of the U.S. have experienced recent sawtimber as well as pulp and paper mill closures, which raises an important policy question: how have and will key macroeconomic and industry specific indicators within the U.S. forest sector likely to change over time? This study provides empirical evidence to support forest-sector policy design by using a vector error correction (VEC) model to forecast economic trends in three major industries - forestry and logging, wood manufacturing, and paper manufacturing - across six of the most forest-dependent states found by the location quotient (LQ) measure: Alabama, Arkansas, Maine, Mississippi, Oregon, and Wisconsin. Overall, the results suggest a general decline in employment and the number of firms in the forestry and logging industry as well as the paper manufacturing industry, while wood manufacturing is projected to see modest employment gains. These results also offer key insights for regional policymakers, industry leaders, and local economic development officials: communities dependent on timber-based manufacturing may be more resilient than other forestry-based industries in the face of economic disruptions. Our findings can help prioritize targeted policy interventions and inform regional economic resilience strategies. We show distinct differences across forest-dependent industries and/or state sectors and geographies, highlighting that policies may have to be specific to each sector and/or geographical area. Finally, our VEC modeling framework is adaptable to other resource-dependent industries that serve as regional economic pillars such as mining, agriculture, and energy production offering a transferable tool for policy analysis in regions with similar economic structures.

2310.14983 2026-06-11 econ.EM math.ST stat.ME 版本更新

Causal clustering: design of cluster experiments under network interference

因果聚类:网络干扰下的聚类实验设计

Davide Viviano, Lihua Lei, Guido Imbens, Brian Karrer, Okke Schrijvers, Liang Shi

AI总结 研究网络干扰下估计全局处理效应的聚类实验设计,提出通过惩罚最小割优化选择聚类以最小化最坏情况均方误差,并给出选择聚类设计的简单条件。

详情
AI中文摘要

本文研究在存在网络溢出效应的情况下,用于估计全局处理效应的聚类实验设计。我们提供了一个框架来选择聚类,以最小化估计全局效应的最坏情况均方误差。我们证明最优聚类解决了一个新颖的惩罚最小割优化问题,可通过现成的半定规划算法计算。我们的分析还刻画了在任何两个聚类设计之间进行选择的简单条件,包括在聚类或个体水平随机化之间进行选择。我们使用来自Facebook用户宇宙的独特网络数据和现有现场实验数据来说明该方法的性质。

英文摘要

This paper studies the design of cluster experiments to estimate the global treatment effect in the presence of network spillovers. We provide a framework to choose the clustering that minimizes the worst-case mean-squared error of the estimated global effect. We show that optimal clustering solves a novel penalized min-cut optimization problem computed via off-the-shelf semi-definite programming algorithms. Our analysis also characterizes simple conditions to choose between any two cluster designs, including choosing between a cluster or individual-level randomization. We illustrate the method's properties using unique network data from the universe of Facebook's users and existing data from a field experiment.