arXivDaily arXiv每日学术速递 周一至周五更新
重置
2605.13679 2026-05-14 econ.GN q-fin.EC

How many parents does it take? Parental time allocation and the effectiveness of fertility subsidies

Jackie Dajin Young, Marwil J. Davila-Fernandez

AI总结 本文研究了父亲育儿时间增加对母亲育儿时间及家庭生育决策的影响,挑战了传统观点认为父亲参与能减轻母亲时间约束的共识。作者构建了一个重叠世代增长模型,指出育儿技术中父母时间的替代性或互补性决定了政策效果,当父母时间互补时,父亲参与反而会增加母亲育儿时间,降低生育率。研究还表明,生育补贴政策可能因忽视社会规范和育儿技能差异,产生适得其反的效果。

详情
英文摘要

There has long been an apparent consensus in the literature on intra-household allocation and fertility that greater paternal involvement in childcare relaxes maternal time constraints, enabling mothers to increase their labor supply or leisure. Recent evidence, particularly from South Korea, challenges this view: increases in fathers' childcare time have coincided with a further increase in mothers' time dedicated to child-rearing. This paper develops an Overlapping Generations (OLG) growth model to address such a puzzle. The central mechanism and our main innovation hinge on the functional form of the childcare technology. When maternal and paternal time are substitutes, the conventional result holds. However, when they are complements, greater paternal involvement necessarily raises maternal childcare time, depressing fertility and redirecting household resources toward child quality. We further argue that the elasticity of substitution should not be interpreted as a pure preference parameter, as it also reflects the social and institutional norms, the skills each parent brings to child-rearing and their intergenerational transmission. The model is extended to study the effectiveness of pro-natalist subsidies, suggesting that such policies may generate an unintended anti-fertility bias. Numerical simulations calibrated loosely to South Korean data confirm that the model is consistent with the observed quantity-quality trade-off and the persistence of low fertility despite active pro-natalist policy.

2602.21971 2026-05-14 econ.GN q-fin.EC

Modelling the Index of Sustainable Economic Welfare (ISEW) and its response to policies

Luzie Dallinger, Reo Van Eynde, Jefim Vogel, Lorenzo Di Domenico, Seán Fearon, Tina Beigi, Cédric Crofils, Kevin J. Dillman, Daniel W. O'Neill

AI总结 本文研究了可持续经济福利指数(ISEW)及其对政策的响应,旨在提供一种更全面衡量社会福利的替代方法。作者将ISEW纳入动态宏观生态模型COMPASS,分析碳税、收入再分配和减少工作时间三种政策对ISEW的影响,发现综合实施这些政策能显著提升ISEW,而单独实施也大多有积极效果,唯独减少工作时间可能降低ISEW。研究还指出,相比GDP,ISEW更能反映政策的社会环境影响,但未能完全体现经济增长的环境成本,而“甜甜圈”模型则在指导可持续福祉政策方面更具优势。

详情
Comments
31 pages, 6 figures
英文摘要

Given the challenge of achieving societal welfare in an environmentally sustainable way, the Index of Sustainable Economic Welfare (ISEW) has emerged as an alternative indicator of progress in response to critiques of Gross Domestic Product (GDP). The ISEW compares the benefits of economic activity with its social and environmental costs. So far, most studies empirically analyse the ISEW for past developments, while no studies have simulated the ISEW using a dynamic macroeconomic model. We address this important gap by incorporating the ISEW into COMPASS, an ecological macroeconomic model that features the Doughnut of biophysical boundaries and social thresholds. First, we analyse how the ISEW is affected by three social and environmental policies: a carbon tax, income redistribution, and working-time reduction. We find that the ISEW grows in all scenarios. The strongest improvement over business-as-usual arises when all policies are combined, while the individual policies mostly affect the ISEW positively. Only in the case of working-time reduction, the ISEW decreases. Our study underscores the benefit of dynamically modelling the ISEW for anticipating the net effect of multiple impulses and their interconnections on the indicator. Second, we explore how the ISEW compares to GDP and the Doughnut when evaluating social and environmental policies. Our results suggest that the ISEW is better than GDP at capturing their effects, but it omits the full environmental costs of growth. We argue that the Doughnut, with its comprehensive picture of biophysical boundaries and social thresholds, provides better guidance for policymakers striving for sustainable wellbeing.

2510.04698 2026-05-14 q-bio.NC cs.AI econ.TH

The Bayesian Origin of the Probability Weighting Function in Human Representation of Probabilities

Xin Tong, Thi Thu Uyen Hoang, Xue-Xin Wei, Michael Hahn

AI总结 人类在感知概率时普遍存在系统性的扭曲,表现为典型的反S型权重模式,但其成因长期未明。本文提出一种基于贝叶斯编码-解码的解释框架,认为概率通过带有噪声的内部信号进行编码,并通过最小化贝叶斯风险进行解码。研究发现,这种编码过程中的扭曲可分解为边界回归、似然排斥和先验吸引,从而预测出反S型权重模式源于编码精度的U型分布,即在概率接近0和1时更为敏感。实验结果表明,该框架能够从数据中自然恢复出U型编码结构,并在多个任务中优于传统确定性权重函数和其它模型。

详情
英文摘要

Humans systematically misrepresent probability in a stereotyped inverse-S pattern. It has been documented for decades, but its origin remains unexplained. We propose a Bayesian encoding-decoding account in which probabilities are represented by noisy internal signals and decoded by Bayes-risk minimization. For bounded probability stimuli, we show that distortion decomposes into boundary regression, likelihood repulsion, and prior attraction, yielding a key prediction: the classic inverse-S-shaped weighting pattern implies a U-shaped allocation of encoding precision with greater sensitivity near 0 and 1. Across judgment of relative frequency, lottery pricing, and risky choice, this U-shape is recovered from data without imposing any functional form on the encoding, and our framework outperforms deterministic weighting functions, bounded log-odds models, uniform-encoding Bayesian accounts, and matched efficient-coding models on held-out data. In a new dot probability estimation experiment with bimodal stimulus statistics, the recovered prior tracks the new distribution while the recovered encoding remains U-shaped. Together, these results identify the inverse-S-shaped probability weighting function as the joint product of a stable U-shaped encoding and a flexible prior, integrated by optimal Bayesian decoding.

2405.07860 2026-05-14 econ.EM math.ST stat.ML stat.TH

Order-Explicit Linearization of High-Dimensional $U$-Statistics

David M. Ritzwoller, Vasilis Syrgkanis

AI总结 本文研究了高维 $U$-统计量与其Hájek投影之间的偏差,并给出了一个与阶数显式相关的大型偏差界。通过发展新的高阶Hoeffding分量的显式矩不等式,作者证明了对于具有特定条件的$d$维核函数的$b$阶$U$-统计量,其最大偏差为$O_p(ϕb n^{-1}\log^2(dn))$,并表明这一速率在对数项的多项式因子内不可改进。研究结果进一步用于建立基于重采样的非参数回归估计器的同时置信区间的一致性,适用于包括广义随机森林在内的多种随机森林回归方法。

详情
英文摘要

We give an order-explicit large deviation bound for the difference between a high-dimensional $U$-statistic and its Hájek projection. In particular, we show that any $U$-statistic of order $b$ on $n$ observations, with a $d$-dimensional kernel whose coordinates have $ψ_1$-Orlicz norm at most $ϕ$, has a maximum deviation from its Hájek projection of order $O_p(ϕb n^{-1}\log^2(dn))$. The proof relies on the development of novel order-explicit moment inequalities for higher-order Hoeffding components. We show that this rate is unimprovable, up to the polynomial factor on the logarithmic term. As corollaries, we obtain new Bernstein-type concentration and Gaussian approximation results for high-dimensional $U$-statistics. We apply these results to establish the consistency of a set of resampling-based simultaneous confidence intervals built around a class of nonparametric regression estimators constructed with subsampled kernels. This class encompasses several forms of random forest regression, including Generalized Random Forests.

2311.02299 2026-05-14 econ.EM stat.ME

The Fragility of Sparsity

Michal Kolesár, Ulrich K. Müller, Sebastian T. Roelsgaard

AI总结 本文通过三个实证应用,揭示了基于稀疏性假设的线性回归估计存在两种脆弱性。首先,不同选择的回归矩阵(如分类变量的基线类别)虽不影响普通最小二乘(OLS)估计,却可能导致稀疏性估计发生两倍标准误以上的变动。其次,作者通过将稀疏性估计与OLS进行比较,提出了两种检验稀疏性假设的方法,结果在所有三个应用中均拒绝了稀疏性假设。除非解释变量数量接近或超过样本量,否则OLS在保持较高效率的同时能提供更稳健的推断。

详情
Comments
48 pages, including appendices
英文摘要

We show, using three empirical applications, that linear regression estimates predicated on the assumption of sparsity are fragile in two ways. First, we document that different choices of the regressor matrix which do not impact ordinary least squares (OLS) estimates, such as the choice of baseline category with categorical controls, can move sparsity-based estimates by two standard errors or more. Second, we develop two tests of the sparsity assumption by comparing sparsity-based estimators with OLS. The tests tend to reject the sparsity assumption in all three applications. Unless the number of regressors is comparable to or exceeds the sample size, OLS yields more robust inference at little efficiency cost.

2302.14234 2026-05-14 cs.GT econ.TH

Bicriteria Multidimensional Mechanism Design with Side Information

Maria-Florina Balcan, Siddharth Prasad, Tuomas Sandholm

AI总结 本文研究了在多维机制设计中如何利用辅助信息同时实现高社会福利和高收入的问题。作者提出了一种可调节的机制,该机制结合了基于最弱类型(即福利最低的代理类型)改进的VCG类机制与辅助信息,如专家建议、机器学习预测等。该机制在高质量辅助信息下能够达到接近无先验总社会福利的性能,并且在辅助信息质量下降时仍能保持良好的性能表现。

详情
Comments
Mathematics of Operations Research Special Issue on Market Design; supersedes the NeurIPS 2023 paper of the same title
英文摘要

We develop a versatile methodology for multidimensional mechanism design that incorporates side information about agents to generate high welfare and high revenue simultaneously. Side information sources include advice from domain experts, predictions from machine learning models, and even the mechanism designer's gut instinct. We design a tunable mechanism that integrates side information with an improved VCG-like mechanism based on weakest types, which are agent types that generate the least welfare. We show that our mechanism, when its side information is of high quality, generates welfare and revenue competitive with the prior-free total social surplus, and its performance decays gracefully as the side information quality decreases. We consider a number of side information formats including distribution-free predictions, predictions that express uncertainty, agent types constrained to low-dimensional subspaces of the ambient type space, and the traditional setting with known priors over agent types. In each setting we design mechanisms based on weakest types and prove performance guarantees.

2605.13320 2026-05-14 q-fin.GN econ.EM

The fine structure of electricity price volatility

Thomas K. Kloster, Fred Espen Benth

AI总结 本文首次对欧洲三个发电区域的电力价格波动率进行了严谨研究,通过将日前电价视为由随机偏微分方程驱动的潜在价格过程的局部平均,构建了周度波动率估计方法。研究揭示了不同区域价格波动的驱动因素存在显著差异,并指出在考虑适当状态变量后,杠杆效应消失,表明电价波动对价格冲击的响应并不具有普遍的不对称性。

详情
英文摘要

We conduct the first rigorous study of electricity price volatility for the full panel of electricity prices across three European generation zones. By interpreting the observed day-ahead prices as local averages of a latent price process governed by a stochastic partial differential equation, we develop estimators of the weekly integrated variance. The inherently infinite dimensional setting introduce several complications that are not relevant in the conventional finite dimensional semimartingale setting, and we spend considerable effort in dealing with these. In particular, we must account for both mean-reversion in prices and semigroup-smoothing in the estimated variance. We provide a detailed decomposition and interpretation of the empirical estimates across three vastly different European generation zones, namely Germany, Norway, and Spain. Our findings indicate that each zone has very different drivers of volatility, and that the impact of generation variables differs considerably. We document that leverage effects appear to be present at first sight, but disappear once we condition on suitable state variables, thereby showing that electricity price volatility does not generally exhibit asymmetric responses to price shocks.

2605.13222 2026-05-14 econ.TH cs.GT

Extended Scenario Bundle Analysis: A Formal Framework for Strategic Scenario Modeling

Thomas Pitz, Vinícius Ferraz

AI总结 本文提出了一种扩展的场景束分析框架,用于支持战略危机分析中的定性判断与动态更新。该框架引入了两层架构,将静态场景数据库与动态场景树系统分离,并扩展了态度词汇以更准确地表达决策者的信念、意图和承诺等要素。研究的主要贡献在于提供了更精确的数学定义,支持计算实现,并增强了对场景空间的结构化描述与多标准评估能力。

详情
Comments
129 pages, 20 figures, 17 tables, 137 references. Preprint; comments welcome
英文摘要

Strategic crisis analysis needs representations that combine qualitative expert judgement, explicit interdependence, and auditable update rules without requiring fully specified payoffs or probabilities. Scenario Bundle Analysis (SBA), developed by Amos Perlmutter and Reinhard Selten, provides such a starting point, but the original formulation leaves several database, topology, and update interfaces implicit. This paper presents a formal refinement and extension of the original SBA framework, introducing a two-layer architecture that separates a static scenario database from a dynamic scenario tree system. The extended framework incorporates a richer attitude vocabulary: beliefs, desires, intentions, fears, and coalitional commitments, with expectations treated as doxastic attitudes. It also adds a domain/modifier layer for contextual framing, a topology on admissible scenario spaces\index{Scenario space}, typed assessment-state updates, and multi-criteria evaluation. Mathematical definitions are stated with sufficient precision to support computational implementation.

2605.13039 2026-05-14 econ.TH

Pitfall of Precision in Noisy Signaling

Shuhua Si, Yangfan Zhou

AI总结 本文研究了在存在噪声信号的情况下,委托人如何根据代理人的信号(如考试成绩)决定是否批准其申请的问题。研究发现,当信号精度已经较高时,进一步提升精度反而会降低筛选效果和委托人福利,这是因为更高的精度会激励更多低质量代理人进行策略性信号传递。这一“精度陷阱”现象对统计歧视也具有影响,噪声较大的群体可能面临更低的批准率,但在事前却可能被更倾向于选择。文章还探讨了承诺权在缓解该问题中的作用。

详情
英文摘要

A principal decides whether to approve an agent based on a noisy signal (e.g., test scores) generated by the agent. High-quality agents can produce high signals on average at lower cost, but the realizations are subject to noise that depends on the screening technology's precision. We uncover a paradoxical "pitfall of precision": when precision is already high, further improvements reduce screening accuracy and lower the principal's welfare. This occurs because greater precision incentivizes strategic signaling from more low-quality agents, outweighing the direct benefit from improved precision. The pitfall of precision also has implications for statistical discrimination: groups with noisier technologies face lower approval rates yet may be favored ex ante -- a reversal of discrimination. We also examine how commitment power helps mitigate the pitfall.

2605.12802 2026-05-14 econ.TH econ.GN q-fin.EC

Strategically Analogous Mechanisms

Joseph Feffer, Filip Tokarski

AI总结 本文研究了在一种机制中获得的战略理解能否转移到另一种机制中。作者提出了一种框架,将代理人的知识表示为他们能够进行的收益比较,并据此形式化了对均衡策略理解的含义。研究发现,当机制在战略上等价或具有战略类比关系时,只要代理人了解行动或类型之间的对应关系,其对均衡的理解即可跨机制转移。该成果可应用于单物品拍卖、评分拍卖和具有容量约束的非线性定价等场景。

详情
英文摘要

This paper studies when strategic understanding acquired in one mechanism can be transferred to another. We introduce a framework in which agents' knowledge is represented as a set of payoff comparisons they can make, and use it to formalize what it means to understand that a strategy profile is an equilibrium. We first apply this framework to mechanisms that are strategically equivalent-that is, share the same game form up to relabeling of actions-and show that agents' understanding of equilibrium transfers across such mechanisms once the relevant action correspondences are explained to them. We then define strategic analogy, a weaker notion that allows not only actions but also types to be remapped, and show that understanding of equilibrium transfers across strategically analogous mechanisms once agents recognize how actions and types correspond. Applications include single-item auctions, scoring auctions, and nonlinear pricing with capacity constraints.

2605.12676 2026-05-14 econ.GN q-fin.EC

Ballot Exhaustion in Multiwinner Single Transferable Vote Elections

David McCune, E. E. Naber

AI总结 本文研究多席位排序投票制(STV)选举中的选票耗尽现象,基于苏格兰地方选举的1070场选情数据,涵盖540万张选票。作者提出了多种选票耗尽的正式定义,区分了不同类型的耗尽情况,并指出多数耗尽选票实际上已对候选人当选产生贡献,表明原始耗尽率可能高估了选民影响力的丧失。研究还发现,在比例选票补全模型下,选票耗尽对选举结果的影响有限,仅导致3.5%的席位变化,揭示了STV选举中选票耗尽的实际与规范意义。

详情
英文摘要

We study ballot exhaustion in multiwinner single transferable vote (STV) elections using a dataset of 1,070 Scottish local government elections comprising over 5.4 million ballots. While ballot exhaustion has been studied extensively in single-winner elections, comparatively little work examines exhaustion in the multiwinner setting. We introduce formal definitions of several types of exhaustion in STV elections, distinguishing between exhausted ballots, non-first-choice exhausted ballots, unrepresented exhausted ballots, and weight exhaustion. These definitions clarify important conceptual differences between ballots that cease to transfer and ballots that fail to contribute meaningfully to representation. Our empirical analysis shows that 27.9\% of ballots are exhausted by the final round of counting, although the corresponding weight exhaustion rate is only 7.1\%, indicating that many exhausted ballots have already contributed to the election of a candidate. Moreover, most exhausted ballots correspond to voters who achieve some form of representation, either because their first-ranked candidate wins or because a candidate ranked among their top choices is elected. These results suggest that raw exhaustion rates alone substantially overstate the extent to which voters lose their influence or fail to obtain representation under STV. We also investigate whether exhaustion can affect electoral outcomes by extending partial ballots under several completion models. Under extreme assumptions, exhaustion can potentially alter a substantial number of outcomes, but under a proportional ballot-completion model only 3.5\% of seats change. Finally, we show that a substantial number of winners fail to reach quota, even after the elimination of all losing candidates. These results help clarify the practical and normative significance of ballot exhaustion in real-world STV elections.

2603.20609 2026-05-14 econ.TH

Strategy-proof Market Segmentation against Price Discrimination

Zhonghong Kuang, Sanxi Li, Yi Liu, Yang Yu

AI总结 本文研究了在数据监管背景下,消费者可自由切换市场细分的情况下,如何实现防止价格歧视的市场细分。作者构建了一个模型,其中消费者在垄断者定价前选择细分市场,并定义了“策略证明”的细分概念,即任何具有正测度的消费者都无法通过单方面偏离获利。研究发现,在所有策略证明的细分中,生产者剩余固定为统一垄断利润,消费者剩余则介于统一垄断水平与买方最优水平之间,且所有消费者至少不变得更差。作者还构造了能够实现该范围内所有可行结果的策略证明细分。

详情
英文摘要

Data regulations increasingly enable consumers to switch among market segments, making segmentation an endogenous outcome of strategic interaction. We study a model in which consumers choose segments before a monopolist sets segment-specific prices, and define a segmentation as strategy-proof when no consumer with positive measure can profitably deviate. Our main result provides a complete welfare characterization: in every strategy-proof segmentation, producer surplus is pinned at the uniform monopoly profit, consumer surplus ranges from the uniform monopoly level to the buyer-optimal level, and every consumer is weakly better off. We construct strategy-proof segmentations attaining every feasible outcome in this range. A finite-consumer model microfounds our solution concept, with equilibrium outcomes converging to our characterization as the population grows large.

2506.15310 2026-05-14 econ.TH

Self-Equivalent Voting Rules

Héctor Hermida-Rivera

AI总结 本文提出了一种新的稳定性公理——自等价性,用于随机投票规则,表明一个社会在考虑是否用自身来替代当前投票规则时,会选择不这么做。研究证明,在无限制严格偏好域下,唯一同时满足匿名性、最优性、单调性和中立性等民主原则以及自等价性稳定原则的投票规则,是每个投票者成为独裁者的概率相等的均匀随机独裁制。因此,任何追求稳定并遵循上述民主原则的社会,要么必须采用均匀随机独裁制,要么需用不同于自身的投票规则来决定是否更换规则。

详情
英文摘要

In this paper, I introduce a novel stability axiom for stochastic voting rules, called self-equivalence, by which a society considering whether to replace its voting rule using itself will choose not to do so. I then show that under the unrestricted strict preference domain, the unique voting rule satisfying the democratic principles of anonymity, optimality, monotonicity, and neutrality as well as the stability principle of self-equivalence must assign to every voter equal probability of being a dictator (i.e., uniform random dictatorship). Thus, any society that desires stability and adheres to the aforementioned democratic principles is bound to either employ the uniform random dictatorship or decide whether to change its voting rule using a voting rule other than itself.

2406.14174 2026-05-14 econ.TH

Redistribution Through Market Segmentation

Victor Augias, Alexis Ghersengorin, Daniel M. A. Barreto

AI总结 本文研究了如何在具有再分配目标的垄断市场中实现最优市场细分。研究发现,最优的再分配市场细分会使卖方对较富裕的消费者收取更高的价格,从而实现财富从富人向穷人的再分配。此外,这种细分方式可能不会最大化消费者剩余,而是为垄断者带来额外利润,并且可以通过价格管制来实现。

详情
英文摘要

We study how to optimally segment monopolistic markets with a redistributive objective. We characterize optimal redistributive segmentations and show that they (i) induce the seller to price progressively, i.e., charge richer consumers higher prices than poorer ones, and (ii) may not maximize consumer surplus, instead granting extra profits to the monopolist. We further show that optimal redistributive segmentations are implementable via price-based regulation.

2605.12559 2026-05-14 econ.TH

Coordination Failures and Stackelberg Leadership in Housing Development with Network Effects

Vaibhav Rangan

AI总结 本文研究了具有网络效应的住房开发市场中的协调失效问题,指出当网络效应足够强且凸时,市场可能出现多个均衡,包括低供给和高供给两种情况。作者引入了一个先行的大型开发商,在Stackelberg博弈中率先承诺住房供给,从而消除协调失效,确保市场达到高供给均衡。研究还表明,这种先行承诺可以提升社会福利,并分析了开发商在何种情况下会超出高均衡进入垄断区域,揭示了市场住房供给通常低于社会最优水平。

详情
Comments
43 pages, 7 figures
英文摘要

I study coordination failures in housing development markets with network effects, where the value of building depends on aggregate supply. When network effects are sufficiently strong and convex, multiple equilibria arise: a low-supply coordination failure and a high-supply outcome. Without a coordination mechanism, equilibrium is indeterminate. I introduce a large developer who moves first in a Stackelberg game, committing to housing supply before atomistic developers make entry decisions. The main result is that the large developer always commits at least to the high-supply equilibrium, eliminating the coordination failure by pushing past the unstable threshold that separates the low and high outcomes. The result is unconditional; it holds for general demand functions and cost distributions, and does not depend on which stable continuation equilibrium materializes. The leader's commitment inverts standard monopoly intuition: first-mover commitment can improve welfare by resolving a coordination problem that atomistic markets cannot solve on their own. I also characterize when the developer builds beyond the high equilibrium into a monopoly region, and show that the market underprovides housing relative to the social optimum.

2605.12551 2026-05-14 econ.EM

Analyzing the Impact of Release Season and Production Budget on Movie Revenue and Profitability

Mohammad Jalili Torkamani, Pedro Gomes, Amirmohammad Sadeghnejad, Jason Le

AI总结 该研究分析了电影上映季节和制作预算对票房收入及盈利的影响,利用完整的TMDB电影数据集,结合关联规则挖掘、聚类、机器学习和SHAP分析等方法,揭示了影响电影财务表现的关键因素。研究发现,制作预算、影片受欢迎程度和观众评分对收入和投资回报率具有显著影响,而上映季节的作用有限。研究结果表明,电影产业的财务成功更依赖于制作投入和市场关注度,而非上映时间策略,为电影预算制定、发行规划和风险管理提供了重要参考。

详情
Comments
17 pages, 18 figures, preprint
英文摘要

The film industry is characterized by significant financial uncertainty, where large production investments do not always guarantee commercial success. This study analyzes the relationship between release season, production budget, and movie financial performance using the Full TMDB Movies Dataset 2024. A data mining framework incorporating association rule mining, clustering, machine learning, and SHAP analysis was applied to identify key drivers of revenue and profitability. The results show that release season has limited predictive influence on revenue and return on investment (ROI). In contrast, production budget, popularity, and audience ratings are significantly more influential. Association rule mining revealed that high-budget films with poor ratings are strongly associated with negative ROI outcomes. Random Forest regression achieved substantially stronger predictive performance than Decision Tree regression, with an $R^2$ value of 0.652. SHAP analysis further confirmed that budget and popularity are the dominant predictors of box office revenue, while timing-related variables contribute minimally. These findings suggest that financial success in the film industry is driven more by production investment and market attention than by seasonal release strategies, providing practical insights for budgeting, release planning, and financial risk management.

2506.20035 2026-05-14 econ.EM

When is p-hacking detectable?

Stefan Faridani

AI总结 本文研究了p值操纵(p-hacking)在统计检验中的可检测性问题,指出基于t统计量或p值直方图的传统检测方法无法识别某些形式的p值操纵。为此,作者提出了一种新的检验方法,通过比较平滑后的经验t分布与理论诚实分布之间的距离,能够检测所有可通过t分布识别的报告选择行为。该方法在应用于Brodeur等(2020)的元数据集时,发现RCT和IV研究中的t分布扭曲程度显著高于随机或近似误差所能解释的范围。

详情
英文摘要

We show that some forms of p-hacking cannot be detected by examining the histogram of t-statistics or their p-values. Even when p-hacking is detectable, standard tests may lack power. We propose a novel test that detects every form of selective reporting that is detectable from the distribution of reported t-statistics. Our test statistic is the distance between the smoothed empirical t-curve and the set of possible honest distributions. This projection test is sharp and can only be evaded by selective reporting that also evades all other valid tests of restrictions on the t-curve. We also show how to avoid spurious rejections caused by some benign distortions in the t-curve. Applying the test to the Brodeur et al. (2020) meta-dataset, we find that the t-curves for RCTs and IVs are more distorted than could arise by chance, (de)rounding, or the Student-t approximation.