arXivDaily arXiv每日学术速递 周一至周五更新
重置
2605.27320 2026-05-27 cs.AI cs.CY econ.GN q-fin.EC

Modeling Agentic Technical Debt and Stochastic Tax: A Standalone Framework for Measurement, Simulation, and Dashboarding

建模代理技术债务与随机税:一个用于测量、模拟和仪表盘展示的独立框架

Muhammad Zia Hydari, Raja Iqbal, Narayan Ramasubbu

AI总结 本文提出一个形式化且可管理的框架,区分代理技术债务(累积的设计与治理负债存量)与随机税(使用随机代理时产生的运营负担流),并通过应付账款模拟和电子表格说明其应用。

详情
AI中文摘要

代理AI系统将概率推理与通过工具、上下文、记忆、编排和外部工作流集成进行的委托行动相结合。本文开发了一个形式化且可管理的模型,区分了代理技术债务与随机税。代理技术债务是累积的设计与治理负债存量。随机税是在业务流程中使用随机代理时产生的重复性运营负担流。这两个概念相关但不同:债务可能放大税收,而即使债务最小化,税收仍可能为正。本文从一个紧凑的仪表盘表达式开始,将其扩展为更完整的结构模型,定义所有变量和参数,展示如何从运营数据中估算每个成本类别,并通过应付账款模拟和配套电子表格说明该框架。

英文摘要

Agentic AI systems combine probabilistic reasoning with delegated action through tools, context, memory, orchestration, and external workflow integration. This note develops a formal and managerially usable model that distinguishes Agentic Technical Debt from Stochastic Tax. Agentic Technical Debt is a stock of accumulated design and governance liability. Stochastic Tax is a recurring flow of operating burden that arises when stochastic agents are used in business workflows. The two constructs are related, but they are not the same: debt can amplify the tax, while the tax can remain positive even when debt is minimized. The note starts from a compact dashboard expression, expands it into a fuller structural model, defines all variables and parameters, shows how each cost category can be estimated from operational data, and illustrates the framework with an accounts-payable simulation and companion spreadsheet.

2605.26948 2026-05-27 econ.TH

Integrating Proportionality and Egalitarianism in Claims Problems

整合比例性与平等主义:索赔问题中的分配规则

Anisha Bandyopadhyay, Sinan Ertemel, Rajnish Kumar, Saptarshi Mukherjee

AI总结 本文提出P-CEA妥协规则族,通过基线参数在比例规则与约束平等奖励规则之间插值,并基于无优势再分配和可持续下界两个阈值依赖原则进行公理化刻画。

详情
Comments
JEL Classifications: C71; D63; D61; H23. Keywords: Claims problem; Fair division; Resource allocation; Compromise rules; Proportionality; Constrained equal awards; Axiomatic foundations; Inequality measures
AI中文摘要

我们研究在总索赔超过可用资源时如何在代理人之间分配有限遗产的问题,这是索赔问题理论中的标准框架。两种典型规则体现了相互竞争的公平理想:比例规则按索赔比例分配,而约束平等奖励(CEA)规则在受索赔约束的前提下尽可能平等化奖励。我们引入P-CEA妥协规则族,该规则为每个代理人分配一个固定的基线奖励(以其索赔为上限),并将剩余遗产按剩余索赔比例分配。通过改变基线参数,该族生成一个连续的分配规则谱,在比例规则和CEA基准之间插值。我们基于两个阈值依赖原则提供公理化刻画:无优势再分配,防止索赔超过阈值的代理人通过保持阈值条件的协调索赔再分配获益;可持续下界,保证每个代理人至少获得其索赔与阈值中的较小值。我们进一步开发了重新分配损失而非奖励的对偶分析,并使用我们公理的对偶版本刻画相应的对偶族。

英文摘要

We study the problem of allocating a finite estate among agents whose total claims exceed the available resources, a standard framework in the theory of claims problems. Two canonical rules embody competing fairness ideals: the Proportional rule allocates in proportion to claims, while the Constrained Equal Awards (CEA) rule equalizes awards as much as possible subject to claim-boundedness. We introduce the P-CEA family of compromise rules, which assigns each agent a fixed baseline award, capped by her claim, and distributes the remaining estate proportionally to residual claims. By varying the baseline parameter, this family generates a continuum of allocation rules that interpolates between the Proportional and CEA benchmarks. We provide an axiomatic characterization based on two threshold-dependent principles: No Advantageous Reallocation, which prevents agents with claims above the threshold from benefiting through coordinated claim redistribution that preserves the threshold condition, and Sustainable Lower Bound, which guarantees each agent at least the minimum of her claim and the threshold. We further develop a dual analysis that reallocates losses instead of awards and characterize the corresponding dual family using the dual versions of our axioms.

2605.23162 2026-05-27 cs.CY cs.CR cs.DC cs.ET econ.GN q-fin.EC

SolarChain: Bridging Physical Law, Verifiable Trust, and Sustainable Markets for Urban Energy Resilience

SolarChain:连接物理定律、可验证信任与可持续市场的城市能源韧性

Shilin Ou, Yifan Xu, Zhenshan Zhang, Luyao Zhang, Ming-Chun Huang

AI总结 提出SolarChain平台,通过基于热力学极限的物理验证和点对点市场机制,解决城市太阳能数据篡改和投机问题,实现可信交易与可持续投资。

详情
AI中文摘要

城市脱碳需要在数百万分散的生产者中推广屋顶太阳能,但城市面临一个根本矛盾:能源数据容易被篡改,经济激励往往奖励投机而非实际基础设施部署。我们提出SolarChain,一个通过将数字问责制锚定在太阳能转换的热力学极限来解决这两个问题的平台。利用实时气象数据、地理坐标和太阳能产量的第一性原理计算,系统为每个面板的最大可能输出设定一个严格的物理边界;任何超过此限制的报告发电量在进入共享账本前自动被拒绝。这种无需信任的验证实现了一个点对点市场,具有程序化奖励结构,持续将价值再投资于设备维护和市场流动性,防止通常破坏基于区块链的市场稳定的投机囤积。当电力被消耗时,相应的数字信用按物理能量耗散的比例永久退役,在城市消费与碳核算之间创建可审计的一一映射。部署在异构城市节点上,该原型展示了抵御数据注入攻击的韧性,同时降低了社区级太阳能扩展的资本门槛。超越能源领域,该框架为任何分布式基础设施需要数据完整性和可持续投资的领域,提供了一个将经济活动与物理定律协调的通用模型。我们在GitHub上以开放获取方式发布数据和代码。

英文摘要

Urban decarbonization requires scaling rooftop solar across millions of fragmented producers, yet cities face a fundamental tension: energy data is easily manipulated, and economic incentives often reward speculation rather than actual infrastructure deployment. We present SolarChain, a platform that resolves both problems by anchoring digital accountability to the thermodynamic limits of solar energy conversion. Using real-time meteorological data, geospatial coordinates, and first-principles calculations of solar yield, the system establishes a hard physical boundary for every panel's maximum possible output; any reported generation exceeding this limit is automatically rejected before entering the shared ledger. This trustless verification enables a peer-to-peer marketplace with programmatic reward structures that continuously reinvest value into equipment maintenance and market liquidity, preventing the speculative hoarding that typically destabilizes blockchain-based marketplaces. When electricity is consumed, the corresponding digital credits are permanently retired in direct proportion to physical energy dissipation, creating an auditable one-to-one mapping between urban consumption and carbon accounting. Deployed across heterogeneous city nodes, the prototype demonstrates resilience against data injection attacks while lowering capital barriers for community-level solar expansion. Beyond energy, the framework offers a general model for coordinating economic activity with physical law in any domain where distributed infrastructure demands both data integrity and sustainable investment. We release the data and code as open-access on GitHub.

2605.26662 2026-05-27 cs.CL cs.AI econ.GN q-fin.EC

AI evaluation may bias perceptions: The importance of context in interpreting academic writing

AI评估可能扭曲认知:语境在解读学术写作中的重要性

Shang Wu, Randol Yao

AI总结 本文通过构建AI相似度基准,发现忽略国家和领域差异的评估方法会系统性高估或低估某些群体中的AI使用,提出基于具体语境的基准以更准确评估科学写作中的AI使用。

详情
AI中文摘要

本文研究了当评估方法忽略国家和领域的语境差异时,科学写作中AI使用估计可能产生的偏差。利用Dimensions中期刊论文的大规模数据,我们基于人类撰写和LLM重写的摘要之间的差异构建了AI相似度基准。我们表明,合并基准可能混淆已有的风格差异与AI生成的文本,即使在LLM之前的出版物中也会在跨国家-领域组中产生显著扭曲。相比之下,特定国家-领域的基准减轻了这种扭曲,并提供了更可信的比较基线。将这些方法应用于2025年的出版物,结果显示合并基准系统性高估了某些国家和领域的AI使用,同时低估了其他国家和领域的AI使用。这些发现强调了语境感知测量对于准确和公平评估科学中AI使用的重要性。

英文摘要

This paper examines how estimates of AI use in scientific writing can be biased when evaluation methods ignore contextual differences across countries and fields. Using large-scale data on journal publications from Dimensions, we construct AI-likeness benchmarks based on differences between human-written and LLM-rephrased abstracts. We show that a pooled benchmark may confound pre-existing stylistic variation with AI-generated text, producing substantial distortions across country-field groups even in pre-LLM publications. In contrast, country-field-specific benchmarks attenuate such distortions and provide a more credible baseline for comparison. Applying these methods to publications in 2025 reveals that the pooled benchmark systematically overestimates AI use in certain countries and fields while underestimating it in others. These findings highlight the importance of context-aware measurement for accurate and equitable evaluation of AI use in science.

2605.26639 2026-05-27 econ.TH

Suppression and Empowerment in Contests

竞赛中的压制与赋能

Alexander Matros, Constantine Sorokin

AI总结 基于截断三次竞赛成功函数研究两选手竞赛,通过战略反馈参数的正负决定领先者努力对落后者努力边际效率的压制或赋能效应,发现不确定性在压制下降低努力而在赋能下提高努力,且信息披露策略呈非对称性。

详情
AI中文摘要

我们研究了一个基于截断三次竞赛成功函数的可处理的两选手竞赛。其定义特征是一个战略反馈参数,其符号决定了领先选手的努力是降低(压制)还是提高(赋能)落后选手努力的边际效率;标准彩票竞赛通过构造施加了压制。基准模型在完全信息下产生闭式混合均衡,在独立同分布私人信息下产生唯一的仿射贝叶斯纳什均衡。期望努力在反馈参数上通常呈单峰。不确定性在压制下降低努力,但在赋能下提高努力,同样的非对称性支配着信息披露:努力最大化的设计者在压制下隐瞒信息,在赋能下完全披露。竞赛理论的几个熟悉结论实际上反映了压制性基准,而非竞赛本身。

英文摘要

We study a tractable two-player contest built on a truncated cubic contest success function. Its defining feature is a strategic-feedback parameter whose sign determines whether a leading player's effort lowers (suppression) or raises (empowerment) the marginal effectiveness of the trailing player's effort; standard lottery contests impose suppression by construction. The benchmark yields closed-form mixed equilibria under complete information and a unique affine Bayesian Nash equilibrium under IID private information. Expected effort is typically single-peaked in the feedback parameter. Uncertainty lowers effort under suppression but raises it under empowerment, and the same asymmetry governs information disclosure: an effort-maximizing designer withholds information under suppression and discloses fully under empowerment. Several familiar conclusions of contest theory turn out to reflect suppressive benchmarks rather than contests as such.

2605.26604 2026-05-27 cs.GT cs.DC cs.NI econ.TH

Credibility Trilemma in Polymatroidal Service Markets

多拟阵服务市场中的可信度三难困境

Lauri Lovén, Sujit Gujar, Kalle Timperi, Hassan Mehmood, Praveen Kumar Donta, Sasu Tarkoma, Schahram Dustdar

AI总结 本文研究多拟阵服务市场中市场运营者的策略行为,证明在非模多拟阵上不存在同时满足收益最优、激励兼容和可信的静态密封投标机制,并引入不可信成本度量该困境的福利损失。

详情
Comments
75 pages, 3 figures. Prepared for submission to the ACM Transactions on Economics and Computation (TEAC)
AI中文摘要

具有多拟阵可行性的机制中介服务市场允许高效、占优策略激励兼容(DSIC)分配,但这些保证隐含地假设市场运营者诚实执行。将运营者建模为策略性参与者,我们建立了一个可信度三难困境:对于非模多拟阵上的单参数代理,没有静态密封投标机制能同时实现收益最优、对代理DSIC以及对运营者可信。我们引入不可信成本(CoNC)作为类似无政府状态价格的福利损失度量,并在五个拓扑类别(单边、串联、并联、树、串并联)上获得紧的$Θ$界,以及在一般DAG上通过SP增广子族上的$Ω(|\mathcal{S}|)$见证实现匹配的上界$O(|\mathcal{S}|)$,将三难困境转化为一个结构性量。随后给出三种结构上不同的解决方案:公共广播或延迟揭示承诺、结算分离和四个附加条件下的管理域分离、以及不相干参与者下独立于机制执行的集成商竞争。在Amin等人的边定价市场上的实例级验证确认了三难困境在仲裁外部设置上的稳健性。该结果将市场中立性确立为多拟阵服务市场的一阶设计约束而非实现细节:当运营者是策略性参与者时,可信度与收益最优性和代理激励兼容性沿着结构性特征线进行权衡。

英文摘要

Mechanism-mediated service markets with polymatroidal feasibility admit efficient, dominant-strategy incentive-compatible (DSIC) allocation, but these guarantees implicitly assume truthful execution by the marketplace operator. Modelling the operator as a strategic player, we establish a credibility trilemma: for single-parameter agents on a non-modular polymatroid, no static sealed-bid mechanism is simultaneously revenue-optimal, DSIC for agents, and credible for the operator. We introduce the Cost of Non-Credibility (CoNC) as a price-of-anarchy-style welfare-loss measure and obtain tight $Θ$-bounds across five topology classes (single-edge, series, parallel, tree, series-parallel), plus a matching upper bound $O(|\mathcal{S}|)$ on general DAGs realised by an $Ω(|\mathcal{S}|)$ witness on the SP-augmented sub-family, turning the trilemma into a structural quantity. Three structurally distinct resolutions follow: public broadcast or deferred-revelation commitment, administrative domain separation under settlement separation and four side conditions, and integrator competition orthogonal to mechanism execution under disjoint actors. An instance-level grounding over the edge-pricing market of Amin et al. confirms the trilemma's robustness on a refereed external setting. The result establishes marketplace neutrality as a first-order design constraint on polymatroidal service markets rather than an implementation detail: where the operator is a strategic player, credibility trades off against revenue optimality and agent incentive compatibility along structurally characterised lines.

2605.26559 2026-05-27 cs.LG cs.AI econ.EM

Auditing and Fixing Economic Validity in Tabular Foundation Models for Discrete Choice

审计与修复离散选择中表格基础模型的经济有效性

Yingshuo Wang, Xian Sun, Yanhang Li, Zhichao Fan, Zexin Zhuang

AI总结 提出两阶段适配器,将表格基础模型预测嵌入效用最大化框架,在保证经济一致性的同时提升选择预测精度。

详情
Comments
5 pages, 1 table. Accepted at the FMSD Workshop, ICML 2026
AI中文摘要

表格基础模型在选择预测任务上取得了很高的准确率,但其预测常常违反这些任务所需的经济逻辑:提高价格有时会增加预测需求,隐含的支付意愿估计经常为负或不合理。我们提出了一种两阶段适配器,将基础模型预测嵌入效用最大化框架。在第一阶段,我们估计一个标准选择模型,其参数受经济理论约束。在第二阶段,我们冻结这些参数,并训练一个校正项,将基础模型的预测作为附加信息纳入。结果模型继承了基础模型的精度提升,同时保证了政策扰动下价格-需求的单调关系,并产生可解析计算的权衡指标。在两个交通数据集上,适配器在保持完美经济一致性的同时,相比标准logit模型恢复了高达13个百分点的准确率,这是原始基础模型或传统蒸馏都无法实现的。

英文摘要

Tabular foundation models achieve strong accuracy on choice prediction tasks, but their predictions often violate the economic logic those tasks require: raising a price sometimes increases predicted demand, and implied willingness-to-pay estimates are frequently negative or implausible. We propose a two-stage adapter that embeds foundation model predictions within a utility-maximization framework. In the first stage, we estimate a standard choice model whose parameters are constrained to obey economic theory. In the second stage, we freeze those parameters and train a correction term that incorporates the foundation model's predictions as additional information. The result is a model that inherits the foundation model's accuracy gains while guaranteeing monotonic price-demand relationships under policy perturbation and producing analytically computable trade-off measures. On two transportation datasets, the adapter recovers up to 13 percentage points of accuracy over a standard logit model while maintaining perfect economic consistency, something neither the raw foundation models nor conventional distillation achieve.

2605.26516 2026-05-27 econ.TH

State-Robust Nash Predictions In Population Games

群体博弈中的状态鲁棒纳什预测

Rui Sun, Junfei Guo

AI总结 本文引入状态鲁棒均衡(SRE)概念,用于在有限策略群体博弈中检验纳什预测对支付相关聚合状态误设的局部有效性,并证明其等价于局部最优反应不变性、无结构性暴露以及沿任意内点聚合状态误差的局部有效性。

详情
AI中文摘要

本文引入状态鲁棒均衡(SRE),这是对有限策略群体博弈中纳什预测的一种局部有效性检验,当支付相关的聚合状态可能被误设时。报告的策略和支付映射保持不变;仅用于评估支付比较的状态发生变化。SRE等价于局部最优反应不变性、无结构性暴露以及沿任意消失的内点聚合状态误差的有效性。在仿射博弈中,切锥、法锥和线性规划检验刻画了暴露特征,并识别出暴露群体、纯策略和聚合状态方向。主要含义是一个尖锐的负结果:鲁棒混合要求支持集上的局部支付恒等性;在一般仿射博弈中,SRE退化为严格纯纳什均衡,尽管弱边界均衡可以通过可行集保护存活。在具有多面体局部不确定性区域的仿射博弈中,相同的不等式为报告状态的有效性提供了确定性的有限诊断。

英文摘要

This paper introduces state-robust equilibrium (SRE), a local validity test for Nash predictions in finite-strategy population games when the payoff-relevant aggregate state may be misspecified. The reported prescription and payoff map are held fixed; only the state used to evaluate payoff comparisons varies. SRE is equivalent to local best-response invariance, absence of structural exposure, and validity along every vanishing interior aggregate-state error. In affine games, the tangent-cone, normal-cone, and linear-program tests characterize exposure and identify the exposing population, the pure strategy, and the aggregate-state direction. The main implication is a sharp negative result: robust mixing requires local payoff identity on the support; in generic affine games, SRE reduce to strict pure Nash equilibria, although weak boundary equilibria can survive through feasible-set protection. In affine games with polyhedral local uncertainty regions, the same inequalities yield a deterministic finite diagnostic for reported-state validity.

2605.26437 2026-05-27 econ.GN q-fin.EC

Divergent Minds, Convergent Baselines: A Bounded-Rationality Account of LLM-Human Strategic Behaviour

分歧的思维,趋同的基线:LLM与人类战略行为的有界理性解释

Po Han Teo

AI总结 本文提出有界理性框架,将人类与LLM在战略博弈中的行为差异归因于计算约束的不同,并给出四个操作测试来区分两者的偏差项δ。

详情
Comments
12 pages, 1 table, no figures. Theoretical prequel paper
AI中文摘要

研究人员已开始使用LLM代理替代人类受试者进行行为和政治科学实验,通常作为实验室样本池的更廉价替代品。然而,这种替代在战略环境中并不成立:人类和LLM可靠地做出不同选择,无论是基于人类响应数据的微调还是角色条件化都无法弥合这一差距。自Simon引入有界理性以来,行为经济学文献将人类战略行为建模为经典基线加上一个加性修正项δ。本文提出的框架将δ解读为有界计算的数学特征:无界理性代理会计算的结果与计算有界代理实际产生的结果之间的差距。对于标准训练语料库中存在解的标准博弈,LLM检索并重组语料库材料,绕过了在人类中产生δ的界限。该框架通过认知层次理论扩展到推理蒸馏模型:它们可访问的k级战略推理受计算预算和上下文长度限制,而非约束人类的认知限制,并且它们产生的δ(如果有)带有不同的结构特征。提出了四个操作测试(条件依赖性、分布不对称性、重复下的路径依赖性和释义鲁棒性)来区分人类形状的δ和LLM形状的δ。一个调节预测是,|δ|随决策环境中同伴信号的个性化程度而缩放,在命名对手和聚合对手设置之间具有Cohen's d ≥ 0.5的定量界限。

英文摘要

Researchers have started using LLM agents in place of human subjects in behavioural and political-science experiments, often as a cheaper substitute for laboratory pools. The substitution does not hold up in strategic settings: humans and LLMs reliably make different choices, and neither fine-tuning on human response data nor persona conditioning has closed the gap. The behavioural-economics literature has, since Simon's introduction of bounded rationality, modelled human strategic behaviour as a classical baseline plus an additive correction term $δ$. The framework proposed here reads $δ$ as the mathematical signature of bounded computation: the gap between what an unboundedly-rational agent would compute and what a computationally bounded agent actually produces. For canonical games whose solutions are present in standard training corpora, LLMs retrieve and recombine corpus material, bypassing the bound that produces $δ$ in humans. The framing extends to reasoning-distilled models through cognitive-hierarchy theory: their accessible level-$k$ strategic reasoning is bounded by compute budget and context length rather than by the cognitive constraints that bound humans, and the $δ$ they produce, if any, carries different structural signatures. Four operational tests (conditional dependence, distributional asymmetry, path-dependence under repetition, and paraphrase-robustness) are proposed to discriminate human-shaped $δ$ from LLM-shaped $δ$. A moderator prediction is that $|δ|$ scales with peer-signal individuation in the decision environment, with a quantitative bound of Cohen's $d \geq 0.5$ between named-opponent and aggregate-opponent settings.

2605.26367 2026-05-27 econ.TH

Random Matching with Minimums

具有最小值的随机匹配

Will Sandholtz, Andrew Tai

AI总结 针对具有最小和最大需求的对象随机分配问题(如课程上下限选课),提出最小概率序列机制(MPS),推广了Bogomolnaia和Moulin(2001)的概率序列机制,并证明其帕累托有效、无嫉妒和弱防策略。

详情
AI中文摘要

我们研究随机对象分配问题,其中对象可能有最小和最大需求,例如具有上下限招生人数的课程。我们构建了一种新的随机分配机制——最小概率序列(MPS)机制,它推广了Bogomolnaia和Moulin(2001)的概率序列机制。MPS产生的随机分配保证是帕累托有效的;即,不存在其他可实施的分配使得所有代理通过一阶随机占优偏好它。我们还证明了MPS是i)无嫉妒的,即没有代理会严格偏好另一个代理的分配,以及ii)弱防策略的,即代理不能通过虚报偏好来获得更好的分配。

英文摘要

We study stochastic object assignment problems in which objects may have minimum and maximum requirements, such as with classes with upper and lower enrollment bounds. We construct a new random assignment mechanism, the minimums probabilistic serial (MPS) mechanism, which generalizes the Probabilistic Serial mechanism of Bogomolnaia and Moulin (2001). The random allocation produced by MPS is guaranteed to be Pareto efficient; that is, there is no other implementable allocation that all agents prefer via first order stochastic dominance. We also show that MPS is i) envy-free, in that no agent will strictly prefer another agent's assignment, and ii) weak strategyproof, in that agents cannot achieve a better assignment by misreporting their preferences.

2605.26271 2026-05-27 stat.ML cs.LG econ.EM

Learning Nonlinear Factor Models with Unknown Monotone Links from Incomplete and Noisy Data

从不完整和含噪数据中学习具有未知单调链接的非线性因子模型

Yutong Chao, Resat Gökhan, Jalal Etesami, Ali Habibnia

AI总结 研究从含噪和不完整数据中联合恢复低秩因子、载荷和未知单调链接函数的问题,提出投影块坐标下降算法并建立收敛保证。

详情
AI中文摘要

我们研究了一个非线性因子模型,其中观测响应通过未知的单调链接函数依赖于低秩潜在因子。由于严重的非凸性和可识别性问题,这一设置具有挑战性且在很大程度上未被充分探索。链接函数假设位于再生核希尔伯特空间(RKHS)中,从而在保持可识别性的同时实现灵活的非参数建模。我们将问题表述为从可能不完整和含噪的观测中联合恢复低秩因子、载荷和非线性链接函数,并提出一种带有显式正则化的投影块坐标下降(BCD)算法以解决尺度和旋转模糊性。在因子的弱不相干性和标准采样条件下,我们建立了无噪声和有噪声情况下的收敛保证,以及链接函数更新的次线性遗憾界。我们的结果将经典线性因子模型推广到广泛的非线性领域,并为学习非线性潜在结构提供了一个原则性框架。我们通过受控的合成实验评估了所提出的方法,显示出有希望的性能。

英文摘要

We study a nonlinear factor model in which observed responses depend on low-rank latent factors through an unknown monotone link function. This setting is challenging and largely underexplored due to severe nonconvexity and identifiability issues. The link function is assumed to lie in a reproducing kernel Hilbert space (RKHS), enabling flexible nonparametric modeling while preserving identifiability. We formulate the problem as the joint recovery of the low-rank factors, loadings, and the nonlinear link function from possibly incomplete and noisy observations and propose a projected block coordinate descent (BCD) algorithm with explicit regularization to address scale and rotational ambiguities. Under mild incoherence of factors and standard sampling conditions, we establish convergence guarantees in both noiseless and noisy regimes, along with sublinear regret bounds for the link-function updates. Our results extend classical linear factor models to a broad nonlinear regime and provide a principled framework for learning nonlinear latent structures. We evaluate the proposed approach using controlled synthetic experiments, indicating promising performance.

2605.19884 2026-05-27 econ.TH

Contracting with Imperfect Commitment: Minimal Canonical Contracts

不完全承诺下的契约:最小规范契约

Seungjin Han, Siyang Xiong

AI总结 研究当委托人不能完全承诺时,何种情况下不完全承诺等价于完全承诺,并刻画最优单一委托人契约及竞争下的简单报价均衡。

详情
AI中文摘要

契约理论通常假设委托人具有完全承诺能力,但许多契约固定了某些与支付相关的决策,而将其他决策留作自由裁量。我们探讨不完全承诺何时等价于完全承诺。对于承诺的基准后跟随有界自由裁量调整的契约,如商业保险的费率评级或民事处罚,有界自由裁量是中性的。当可契约和不可契约决策是不同的工具时,等价性失效。我们刻画了最优单一委托人契约,并表明在竞争性委托人下简单报价均衡是稳健的。方法论贡献在于扩展的税收原理,使这些分析更易处理。

英文摘要

Contract theory typically assumes full commitment by the principal, but many contracts fix some payoff-relevant decisions while leaving others discretionary. We ask when imperfect commitment is equivalent to full commitment. For contracts in which a committed baseline is followed by a bounded discretionary adjustment, as in commercial-insurance schedule rating or civil penalties, bounded discretion is allocation-neutral. When contractible and non-contractible decisions are distinct instruments, the equivalence fails. We characterize optimal single-principal contracts and show that simple-offer equilibria are robust under competing principals. The methodological contribution is an extended taxation principle that makes these analyses more tractable.

2604.24660 2026-05-27 stat.ML econ.EM math.ST stat.ME stat.TH

Nonparametric Instrumental Variable Analysis Without Structural Equations: Debiased Inference on Functionals of Inverse Problems with No Solutions

无结构方程的非参数工具变量分析:无解反问题泛函的去偏推断

Zikai Shen, Nathan Kallus, Dimitri Meunier, Houssam Zenati, Arthur Gretton, Aurélien Bibaut

AI总结 针对无精确解的反问题,提出对有限维泛函进行去偏推断的方法,避免假设结构方程精确成立,确保推断在模型不成立时仍有效。

详情
AI中文摘要

我们考虑对反问题中无穷维最小二乘解的有限维泛函进行去偏推断,以避免必须假设精确解存在。这种假设是实质性的且并非无害,当我们将它们强加于统计模型时,其失败可能会危及推断。我们的方法允许我们对一个无论解是否存在都定义的量进行推断,并且当解存在时,该量与通常的估计量一致。对于工具变量的情况,这意味着我们可以用结构模型来激励分析,但这些模型不需要精确成立,半参数推断程序仍然有效。

英文摘要

We consider debiased inference on finite-dimensional functionals of infinite-dimensional least-squares solutions to inverse problems as a way to avoid having to assume exact solutions exist. Such assumptions are substantive and not innocuous, and their failure may imperil inference when we impose them on the statistical model. Our approach instead allows us to conduct inference on a quantity that is defined regardless of solutions existing and coincides with the usual estimands when they do. For the case of instrumental variables, this means we can motivate the analysis with structural models but these do not need to hold exactly for the semiparametric inferential procedure to remain valid.

2605.15902 2026-05-27 econ.EM stat.ME

Tweedie's Formula and Score-Driven Updating

Tweedie公式与得分驱动更新

Peter Reinhard Hansen, Chen Tong

AI总结 本文通过Tweedie公式为得分驱动模型提供了贝叶斯解释,证明了在自然指数族和一般条件密度下,得分更新要么是精确的贝叶斯滤波,要么是局部近似。

详情
AI中文摘要

得分驱动模型使用条件似然得分更新时变参数。本文通过Tweedie公式为这类更新提供了贝叶斯解释,该公式将后验均值修正与边际得分联系起来。在高斯信号提取中,这给出了精确的后验修正恒等式。对于自然指数族,相关恒等式刻画了自然参数空间和期望参数空间中的后验均值。基于这些恒等式,我们证明在局部精度折扣下,期望空间中的共轭贝叶斯滤波恰好与逆Fisher缩放的条件得分更新一致。对于一般条件密度,精确的贝叶斯修正涉及通常不可用的预测边际得分。局部高斯近似表明,条件似然得分提供了该后验修正的主要近似;在局部精度折扣下,预测协方差与逆Fisher信息成正比,从而得到熟悉的逆Fisher缩放得分递归。结果澄清了何时得分驱动更新是精确的贝叶斯滤波,以及何时应将其视为易处理的局部近似。

英文摘要

Score-driven models update time-varying parameters using conditional likelihood scores. This paper develops a Bayesian interpretation of such updates through Tweedie's formula, which connects posterior mean corrections with marginal scores. In Gaussian signal extraction, this gives an exact posterior-correction identity. For natural exponential families, related identities characterize posterior means in natural- and expectation-parameter spaces. Building on these identities, we show that conjugate Bayesian filtering in expectation space coincides exactly with an inverse-Fisher-scaled conditional score update under local precision discounting. For general conditional densities, the exact Bayesian correction involves a generally unavailable predictive-marginal score. A local Gaussian approximation shows that the conditional likelihood score provides the leading approximation to this posterior correction; under local precision discounting, the predictive covariance becomes proportional to inverse Fisher information, yielding the familiar inverse-Fisher-scaled score recursion. The results clarify when score-driven updates are exact Bayesian filters and when they should instead be viewed as tractable local approximations.

2501.00863 2026-05-27 econ.GN q-fin.EC

Paternalism and Deliberation: An Experiment on Making Formal Rules

家长主义与深思熟虑:关于制定正式规则的实验

Max R. P. Grossmann

AI总结 通过炸弹风险诱发任务实验,研究强制性等待期作为软家长主义政策是否替代硬性限制,以及延迟决策是否更受尊重,发现等待期是附加限制而非替代。

详情
AI中文摘要

强制性等待期已被用于医疗程序、枪支购买和其他高风险决策。这些软家长主义政策是否是更严格限制的替代品?延迟决策是否更受尊重?在一项一般人群调查实验中,选择架构师为面临高风险炸弹风险诱发任务的决策者制定规则。实验处理变量是决策时间:当场或一天后,以及初始决策是否可以修改。选择架构师设定决策者风险承担的上限;在一个处理中,他们还可以实施强制性等待期。外生深思熟虑对上限于无影响;等价检验(TOST)和贝叶斯分析($\text{BF}_{01} \approx 12$)为无效应提供了强有力的正面证据。内生规定的等待期是附加限制,并不替代上限。选择架构师相信,随着时间的推移,平均决策者将承担更少的风险,并且——由于选择架构师理想点的分布——更接近选择架构师的主观理想选择;由此导致的预测误差减少很小。软和硬的家长主义工具并非替代品:等待期被用作附加限制。

英文摘要

Mandatory waiting periods have been instituted for medical procedures, gun purchases, and other high-stakes decisions. Are these softly paternalistic policies substitutes for harder restrictions, and are delayed decisions more respected? In a general population survey experiment, Choice Architects make rules for decision-makers facing a high-stakes Bomb Risk Elicitation Task. Treatments vary when the decision is made: on the spot or after one day, and whether the initial decision can be revised. Choice Architects set a cap on the decision-maker's risk taking; in one treatment, they can additionally implement a mandatory waiting period. Exogenous deliberation has no effect on the cap; equivalence testing (TOST) and Bayesian analysis ($\text{BF}_{01} \approx 12$) provide strong positive evidence for the absence of an effect. Endogenously prescribed waiting periods are add-on restrictions that do not substitute for the cap. Choice Architects believe that, with time, the average decision-maker will take less risk and -- because of the distribution of Choice Architects' bliss points -- come closer to Choice Architects' subjective ideal choice; the resulting reduction in forecasted errors is small. Soft and hard paternalistic instruments are not substitutes: waiting periods are deployed as add-on restrictions.

2407.13204 2026-05-27 econ.GN q-fin.EC

The Pay and Non-Pay Content of Job Ads

招聘广告中的薪酬与非薪酬内容

Richard Audoly, Manudeep Bhuller, Tore Adam Reiremo

AI总结 通过挪威雇主的招聘广告数据,开发系统分类方法,验证广告中薪酬与非薪酬属性对雇主质量的信号作用,并量化其对劳动力市场不平等的影响。

详情
AI中文摘要

招聘广告对雇主实际提供的薪酬和非薪酬属性的信息量有多大?利用挪威雇主发布的招聘广告综合数据库,我们开发了一种系统分类方法,对空缺文本中广告的薪酬和非薪酬工作属性进行分类。约60%的招聘广告提供薪酬相关信息,几乎所有广告都包含非薪酬属性信息。我们将这些广告属性与发布广告的雇主联系起来,并针对雇主质量的显示性偏好度量、实现属性以及调查实验中的选择来验证这些信息。所有三种策略都证实,招聘广告提供了雇主质量的可靠信号。然后,我们将详细的工作属性纳入一个垄断框架,并量化它们对劳动力市场不平等的贡献。

英文摘要

How informative are job ads about the actual pay and non-pay attributes offered by employers? Using a comprehensive database of job ads posted by Norwegian employers, we develop a methodology to systematically classify the pay and non-pay job attributes advertised in vacancy texts. About 60% of job ads provide pay-related information and nearly all ads feature information on non-pay attributes. We link these advertised attributes to the employers posting the ads and validate this information against revealed-preference measures of employer quality, realized attributes, and choices from a survey experiment. All three strategies confirm that job ads provide reliable signals of employer quality. We then incorporate the detailed job attributes in a monopsony framework and quantify their contribution to labor market inequality.

2112.09259 2026-05-27 econ.EM

A Simple Measure of Robustness for External Validity under Covariate Shifts

协变量偏移下外部有效性的简单鲁棒性度量

Pietro Emilio Spini

AI总结 本文提出一种新的鲁棒性度量δ*,通过去偏GMM估计在协变量分布变化下评估政策效应的外部有效性,并利用基准测试和校准解释其大小。

详情
Comments
31 pages. Revised title and manuscript
AI中文摘要

本文研究了估计的政策效应对协变量分布变化的鲁棒性,这是(准)实验结果外部有效性的关键决定因素。我提出了一种新的鲁棒性度量δ*,它衡量使关于政策效应的经验主张(例如,ATE > 0)无效所需的最小协变量偏移。我通过去偏GMM估计δ*,实现了参数收敛速度,同时适应了处理效应异质性的机器学习估计器(例如,LASSO、随机森林、神经网络)。我开发了基准测试和校准练习来解释δ*的大小。我在俄勒冈健康保险实验的应用中说明了这些工具。研究人员可以将δ*与点估计和标准误差一起报告,作为衡量协变量偏移下外部有效性的第三个数字。

英文摘要

This paper studies the robustness of estimated policy effects to changes in the distribution of covariates, a key determinant of the external validity of (quasi)-experimental results. I propose a novel robustness metric $δ^*$ which measures the smallest covariate shift needed to invalidate an empirical claim about the policy effect (e.g., $ATE > 0$). I estimate $δ^*$ via de-biased GMM, achieving a parametric rate of convergence while accommodating machine-learning estimators of treatment-effect heterogeneity (e.g., LASSO, random forests, neural networks). I develop benchmarking and calibration exercises to interpret the magnitude of $δ^*$. I illustrate these tools in an application to the Oregon Health Insurance Experiment. Researchers can report $δ^*$ alongside the point estimate and standard error as a third number gauging external validity under covariate shifts.