arXivDaily arXiv每日学术速递 周一至周五更新
重置
2606.13519 2026-06-12 econ.EM 新提交

Semiparametric Local Projections

半参数局部投影

Silvia Goncalves, Ana Maria Herrera, Lutz Kilian, Elena Peavento, Iones Kelanemer Holban

AI总结 提出一种半参数局部投影估计量,用于非线性脉冲响应函数,基于双稳健矩条件结合交叉拟合,实现√T一致性和渐近正态性。

详情
AI中文摘要

我们提出了一种半参数局部投影估计量,用于估计一类广泛的结构动态模型的非线性脉冲响应函数,这些模型与应用宏观经济学相关,包括具有非线性变换回归变量、状态依赖系数以及冲击与状态变量之间非线性相互作用的模型。该估计量基于一个双稳健矩条件,该条件将平均响应函数识别为非参数条件均值的线性泛函,并辅以一个密度比来捕捉移动感兴趣冲击的效果。我们将此矩条件与处理序列依赖的交叉拟合相结合。得到的估计量是$\sqrt{T}$一致且渐近正态的。我们在一系列非线性数据生成过程中检验了该估计量的有限样本性能,并通过两个实证示例说明了其应用。

英文摘要

We propose a semiparametric local projection estimator of nonlinear impulse response functions for a broad class of structural dynamic models relevant for applied macroeconomics, including models with nonlinearly transformed regressors, state dependent coefficients, and nonlinear interactions between shocks and state variables. The estimator is based on a doubly robust moment condition that identifies the average response function as a linear functional of a nonparametric conditional mean, augmented by a density ratio that captures the effect of shifting the shock of interest. We combine this moment condition with cross-fitting that handles serial dependence. The resulting estimator is $\sqrt{T}$-consistent and asymptotically normal. We examine the finite-sample performance of the estimator across a range of nonlinear data generating processes and illustrate its use in two empirical examples.

2606.12739 2026-06-12 econ.EM 新提交

Estimating Semiparametric and Nonparametric Fixed Effects Panel Data Models with mgcv

使用 mgcv 估计半参数和非参数固定效应面板数据模型

Ivan Korolev

AI总结 本文介绍如何使用 R 包 mgcv 估计半参数和非参数固定效应面板数据模型,重点讨论实现方法、平滑项指定和聚类稳健推断,并通过蒙特卡洛实验验证惩罚样条估计的准确性。

详情
AI中文摘要

本文提供了使用 R 中的 mgcv 包估计半参数和非参数固定效应面板数据模型的实用指南。重点在于实现:使用单位指示变量、一阶差分或惩罚单位效应处理固定效应;指定平滑项;以及进行聚类稳健推断。蒙特卡洛实验比较了 mgcv::bam 估计量与线性固定序列样条估计量。模拟表明,惩罚样条适应未知平滑度,并在本文研究的设计中准确估计函数。惩罚调整的聚类稳健协方差估计量对有限维参数产生接近名义水平的检验,置信带对中心化的未知函数提供准确的覆盖。

英文摘要

This paper provides a practical guide to estimating semiparametric and nonparametric fixed-effects panel data models using the mgcv package in R. The focus is implementation: handling fixed effects with unit indicators, first differencing, or penalized unit effects; specifying smooth terms; and conducting cluster-robust inference. Monte Carlo experiments compare \code{mgcv::bam} estimators with linear and fixed-series spline estimators. Simulations suggest that penalized splines adapt to unknown smoothness and estimate functions accurately in the designs studied here. A penalty-adjusted cluster-robust covariance estimator yields tests with near-nominal size for finite-dimensional parameters, and confidence bands provide accurate coverage for centered unknown functions.

2606.12571 2026-06-12 econ.TH 新提交

Cross-Validation Equilibrium

交叉验证均衡

Ran Spiegler, Stephan Waizmann

AI总结 研究玩家将信念形成委托给预测性机器学习时的策略互动,提出交叉验证均衡概念,分析其性质并应用于陪审团投票、投机性赌博和线性二次支付博弈。

详情
AI中文摘要

我们研究当玩家将信念形成委托给预测性机器学习(ML)时的策略互动。在一个静态贝叶斯博弈中,每个玩家的ML代理预测一个与收益相关的结果变量,作为玩家类型的函数。ML代理的训练样本是内生的:它来自于由玩家ML引导行为产生的结果分布。在交叉验证均衡(CVE)中,每个玩家的ML代理根据其实现的训练样本,选择预测模型以最小化期望的样本外平方误差,并且每个玩家对其ML代理选择的模型所产生的信念做出最优反应。我们分析CVE并将其与其他均衡概念联系起来。我们将CVE应用于陪审团投票、投机性赌博以及具有线性二次支付的博弈。例如,在团队努力博弈中,内生模型选择可能导致多重均衡。

英文摘要

We study strategic interaction when players delegate belief formation to predictive machine learning (ML). In a static Bayesian game, each player's ML agent predicts a payoff-relevant outcome variable as a function of the player's type. The ML agent's training sample is endogenous: it is drawn from the outcome distribution generated by players' ML-guided behavior. In Cross-Validation Equilibrium (CVE), each player's ML agent selects a predictive model to minimize expected out-of-sample squared error, given its realized training sample, and each player best-replies to the belief generated by the model her ML agent selected. We analyze CVE and relate it to other equilibrium concepts. We apply CVE to jury voting, speculative betting, and games with linear-quadratic payoffs. E.g., in a team-effort game, endogenous model selection can give rise to multiple equilibria.

2606.12492 2026-06-12 econ.TH 新提交

Continuity of equilibria in spaces of Bochner and Gel'fand economies

Bochner与Gel'fand经济空间中均衡的连续性

Matías Fuentes

AI总结 本文在商品空间为Banach格的无穷维框架下,证明均衡对应在允许均衡的经济域稠密子集上关于Polish拓扑是连续的,统一处理了多种经济模型,无需可微性假设。

详情
AI中文摘要

我们研究了无穷维环境(商品空间为Banach格)中均衡对应的连续性。经济被建模为特征空间上的Borel概率测度,总禀赋通过Bochner或Gel'fand积分定义。在此框架下,我们证明了均衡对应在允许均衡的经济域稠密子集上(赋予适当的Polish拓扑)是连续的。这些结果通过提供适用于更广泛局部凸空间类的统一分析处理,扩展了经典和近期的连续性定理,并涵盖了无限规划期限、垄断竞争、新古典经济、金融均衡和非对称信息等模型。重要的是,本研究证明了无需施加正则经济中通常需要的可微性假设来研究均衡连续性。

英文摘要

We examine the continuity of equilibrium correspondences in infinite-dimensional settings where the commodity spaces are Banach lattices. Economies are modeled as Borel probability measures on a space of characteristics, with aggregate endowments defined via Bochner or Gel'fand integrals. Within this framework, we prove that the equilibrium correspondence is continuous on a dense subset of the domain of economies admitting equilibria, endowed with a suitable Polish topology. These results extend both classical and recent continuity theorems by providing a unified analytical treatment applicable to a substantially broader class of locally convex spaces and encompass models with infinite planning horizons, monopolistic competition, neoclassical economies, financial equilibria, and asymmetric information. Importantly, this study demonstrates that there is no necessity to impose differentiability assumptions that are typically required in regular economies to study equilibrium continuity.

2606.13506 2026-06-12 econ.GN 新提交

Skill vs Education Types of Labour Mismatch and Their Association with Earnings

技能与教育类型的劳动错配及其与收入的关系

Vsevolod Iakovlev

AI总结 利用26国PIAAC数据,通过教育-技能指标和误差成分模型,揭示教育错配与技能错配对收入的不同影响,并控制国家异质性后证实过度教育与过度技能导致工资惩罚,不足教育与不足技能带来工资溢价。

详情
AI中文摘要

本文分析了教育类型和技能类型的劳动错配之间的区别及其与收入的关系。利用来自OECD(2012)成人技能调查(PIAAC)第一周期的26个国家横截面数据,我使用一套全面的基于教育和技能的指标考察了教育和技能错配,探索了工人特征之间的异质性,并通过误差成分模型调查了与国家层面收入相关性冲突的来源。结果表明,国家层面的未观测异质性导致内生性偏差,其方向和大小因错配指标而异。一旦控制了未观测异质性,过度教育和过度技能与工资惩罚相关,而教育不足和技能不足则与工资溢价相关。这些发现强调了教育错配与技能错配之间的概念和实证区别,并证明了指标选择在分析中的重要性。

英文摘要

This paper analyses the distinction between educational and skill types of labour mismatch and their association with earnings. Drawing on cross-sectional data for 26 countries from the 1st Cycle of the OECD (2012) Survey of Adult Skills (PIAAC), I examine educational and skill mismatch using a comprehensive set of education- and skill-based indicators, explore heterogeneity across worker characteristics, and investigate the sources of conflicting country-level correlations with earnings through an error components model. The results show that country-level unobserved heterogeneity induces endogeneity bias, with both its direction and magnitude varying across mismatch measures. Once unobserved heterogeneity is controlled for, over-education and over-skilling are associated with wage penalties, whereas under-education and under-skilling are linked to wage premiums. These findings highlight both conceptual and empirical distinctions between educational and skill mismatch and demonstrate the importance of indicator choice in the analysis.

2606.13314 2026-06-12 econ.GN 新提交

The Privilege of Exposure: Caste and Generative AI in India's Graduate Labour Market

暴露的特权:种姓与生成式AI在印度毕业生劳动力市场

Kaibalyapati Mishra

AI总结 研究利用印度最新劳动力调查数据,发现种姓影响毕业生对生成式AI的暴露程度,低种姓毕业生暴露度显著低于高种姓,且该差距通过职业分布和工资溢价加剧种姓收入不平等。

详情
AI中文摘要

在发展中国家劳动力市场中,谁暴露于生成式AI?我们将三个职业AI暴露指数映射到印度重新设计的定期劳动力调查(2025年),并记录了83,000名就业毕业生中显著的种姓梯度:在同一地区内,来自在册种姓和在册部落的毕业生比高种姓毕业生的暴露度低0.24-0.37个标准差。两个渠道导致了这一差距:四分之一的在册种姓和三分之一的在册部落毕业生从事不受AI影响的农业或初级职业,而那些从事白领工作的人在管理、软件和金融职业中的代表性不足。由于暴露度带来高达20%的工资溢价,生成式AI可能会扩大而非缩小印度的种姓收入差距。

英文摘要

Who is exposed to generative AI in a developing-country labour market? We map three occupational AI-exposure indices to India's redesigned Periodic Labour Force Survey (2025) and document a steep caste gradient among 83,000 employed graduates: graduates from the Scheduled Castes and the Scheduled Tribes are 0.24--0.37 standard deviations less exposed than upper-caste graduates within the same district. Two channels drive the gap: one in four SC and one in three ST graduates work in farm or elementary occupations untouched by AI, and those in white-collar work are underrepresented in managerial, software, and finance occupations. Because exposure commands a wage premium of up to 20 per cent, generative AI stands to widen, not narrow, India's caste earnings gap.

2606.12893 2026-06-12 econ.GN 新提交

Technology Shocks, Relative Performance Measures, and Outcomes: Evidence from Classical Chess

技术冲击、相对绩效度量与结果:来自经典国际象棋的证据

Dan Ben-Moshe, David Genesove

AI总结 利用390万局经典国际象棋比赛数据,发现2020年神经网络引擎普及后和棋率上升约4个百分点,而基于相对绩效的等级分变化不大,表明技术冲击被广泛吸收。

详情
AI中文摘要

2020年秋季,神经网络方法使得国际象棋引擎的性能大幅提升,并免费广泛可用。到2021年底,经典国际象棋的月度和棋率上升了约四个百分点,但通常被视为棋力指标的棋手等级分分布变化不大。然而,等级分是一种相对度量,基于与其他有等级分棋手对弈的结果构建,而非绝对棋力尺度,因此广泛共享的进步不必改变等级分。利用2015年3月至2023年11月的390万局有等级分的经典比赛,我们记录到和棋率上升在控制双方等级分后仍然存在,在重复同色对局中成立,并非先前趋势的延续,并持续到样本期末。一个线性变换将疫情后等级分映射到更高的疫情前等效值,且在低等级分处差距更大,解释了拟合的和棋、白胜和黑胜概率的疫情后减疫情前偏移的90%以上。相比之下,棋手的等级分和排名没有显示出额外的排名重新洗牌,也没有相对于疫情前基准的组内离散度普遍扩大。我们将这些发现解释为与各等级分水平的采用一致,且低等级分棋手获得了更大的等级分等效增益。

英文摘要

In the fall of 2020, neural-network methods produced a large improvement in chess engines that became freely and widely available. By the end of 2021, the monthly draw rate in classical chess had risen by about four percentage points, but the distribution of player ratings, which are commonly read as measures of playing strength, had changed little. Ratings, however, are a relative measure, built from results against other rated players rather than from an absolute scale of play quality, so an improvement shared broadly across players need not change their ratings. Using 3.9 million rated classical games from March 2015 to November 2023, we document that the increased draw rate remains after conditioning on both players' ratings, holds within repeated same-color matchups, is not a continuation of a pre-existing trend, and persists through the end of the sample. A linear transformation that maps post-Covid ratings to higher pre-Covid equivalents, with a larger gap at lower ratings, accounts for more than 90 percent of the post-minus-pre shift in the fitted draw, White-win, and Black-win probabilities. Players' ratings and ranks, by contrast, show no additional rank reshuffling and no general widening of within-group dispersion relative to the pre-Covid benchmark. We interpret these findings as consistent with adoption across rating levels, with larger rating-equivalent gains for lower-rated players.

2606.13555 2026-06-12 econ.EM cs.GT 新提交

Price Elasticity of Gas Demand on L1 and L2: Evidence from Ethereum and Arbitrum

L1和L2上天然气需求的价格弹性:来自以太坊和Arbitrum的证据

Pranay Anchuri, Akaki Mamageishvili

AI总结 利用工具变量法估计以太坊主网和Arbitrum One上天然气需求的价格弹性,发现两者总体缺乏弹性,但L2弹性更大,且不同资源类型和用户集群弹性差异显著。

详情
AI中文摘要

我们估计了以太坊主网(L1)和Arbitrum One(L2)上天然气需求的因果价格弹性,这是校准费用机制模拟、评估资源定价改革和解释观察到的使用模式所必需的数量。通过每个钱包自身的滞后基础费用进行工具变量的双向固定效应面板回归,消除了拥堵驱动的内生性,这种内生性导致朴素回归大幅低估需求敏感性。在以太坊主网(2025全年),合并IV弹性为-0.006***,接近无弹性:费用增加10%会使总天然气需求减少约0.06%。在Arbitrum One(2025年10月至2026年4月),合并IV弹性为-0.036**。两条链总体均缺乏弹性,且L2的响应性明显高于L1。对L2需求的按资源分解显示,弹性范围从适度弹性的计算(-0.027*)到退款的-0.27***,存储增长(-0.15***)和调用数据(-0.06*)介于两者之间。行为聚类识别出始终在线的协议钱包接近无弹性,而高容量运营商的响应性显著更高,集群级弹性高达合并估计的约6倍。这些结果为下游模拟和评估费用机制设计建立了实证基础。

英文摘要

We estimate the causal price elasticity of gas demand on Ethereum mainnet (L1) and Arbitrum One (L2), a quantity necessary for calibrating fee mechanism simulations, evaluating resource pricing reforms, and explaining observed usage patterns. A two-way fixed effects panel regression instrumented by each wallet's own lagged base fee removes the congestion-driven endogeneity that causes naive regressions to substantially underestimate demand sensitivity. On Ethereum mainnet (full year 2025), the pooled IV elasticity is -0.006***, near-inelastic: a 10% fee increase reduces total gas demand by approximately 0.06%. On Arbitrum One (October 2025--April 2026), the pooled IV elasticity is -0.036**. Both chains are inelastic in the aggregate, with L2 measurably more responsive than L1. A per-resource decomposition of L2 demand reveals elasticities ranging from modestly elastic computation (-0.027*) to -0.27*** for refunds, with storage growth (-0.15***) and calldata (-0.06*) in between. Behavioral clustering identifies always-on protocol wallets as near-inelastic and high-volume operators as substantially more responsive, with cluster-level elasticities up to roughly 6x the pooled estimate. These results establish an empirical foundation for downstream simulations and for evaluating fee mechanism designs.

2606.12585 2026-06-12 econ.GN cs.HC 新提交

Revisiting the ABCs of Working with AI: A Replication with Radiologists

重新审视与AI合作的ABC:一项针对放射科医生的复制研究

Daniel Martin

AI总结 本研究在放射科医生分析胸部X光片的场景中,复制了Caplin等人关于能力和信念校准影响AI辅助收益的发现,验证了其外部有效性。

详情
AI中文摘要

人工智能(AI)系统越来越多地协助人类专家,但AI辅助对生产力的影响可能具有异质性。Caplin、Deming、S. Li、Martin、Marx、Weidmann和Ye(2025b)提供的证据表明,两个特征——能力和信念校准——有助于确定AI辅助的回报。本文表明,他们的结果在专业放射科医生利用最先进的机器学习预测分析胸部X光片的场景中得到了复制。我利用了Moehring、Kutwal、Huang、Banerjee、Jacobi、Eber、Mendoza、Chung、Dayan、Gupta、Bui、Truong、Pareek、Langlotz、Lungren、Agarwal、Rajpurkar和Salz(2025)描述的公共Collab-CXR数据存储库,该数据首先由Agarwal、Moehring、Rajpurkar和Salz(2023)用于人机协作分析。为了忠实再现Caplin、Deming、S. Li、Martin、Marx、Weidmann和Ye(2025b)的分析,我使用了重复病例设计中的放射科医生评估,包括68名放射科医生和11,420个配对的放射科医生-患者-病理观察结果。本复制结果支持其核心发现的外部有效性:较低的基础能力和较高的校准预测了AI带来的更大增量价值。

英文摘要

Artificial intelligence (AI) systems increasingly assist human experts, but the consequences of AI assistance on productivity can be heterogeneous. Caplin, Deming, S. Li, Martin, Marx, Weidmann, and Ye (2025b) provide evidence that two characteristics, ability and belief calibration, help to determine the returns to AI assistance. This note shows that their results replicate to a setting where professional radiologists analyze chest X-rays with access to state-of-the-art machine learning predictions. I leverage the public Collab-CXR data repository described by Moehring, Kutwal, Huang, Banerjee, Jacobi, Eber, Mendoza, Chung, Dayan, Gupta, Bui, Truong, Pareek, Langlotz, Lungren, Agarwal, Rajpurkar, and Salz (2025) and first analyzed for human-AI collaboration by Agarwal, Moehring, Rajpurkar, and Salz (2023). To faithfully reproduce the analysis in Caplin, Deming, S. Li, Martin, Marx, Weidmann, and Ye (2025b), I use the radiologist assessments from the repeated-case designs, which include 68 radiologists and 11,420 paired radiologist-patient-pathology observations. The results of this replication support the external validity of their core findings: lower baseline ability and higher calibration predict larger incremental value from AI.

2606.12848 2026-06-12 cs.AI econ.GN 新提交

(Human) Attention Is (Still) All You Need: Human oversight makes AI-assisted social science reliable

(人类的)注意力(仍然)就是一切:人类监督使AI辅助的社会科学变得可靠

Chen Zhu, Xiaolu Wang, Weilong Zhang

发表机构 * China Agricultural University(中国农业大学) University of Cambridge(剑桥大学)

AI总结 提出人机协同决策架构HLER,通过预承诺、决策排序、问责和注意力分配,将AI辅助研究的失败率从72%降至16%。

详情
AI中文摘要

大型语言模型(LLMs)越来越多地被用于曾经只有训练有素的研究人员才能完成的任务,包括假设生成、规范选择和结论起草。我们认为,AI辅助研究的可靠性不仅取决于模型能力,还取决于认知劳动在人与机器之间的分配方式。我们通过人机协同经济研究(HLER)来研究这个问题,这是一种基于预承诺、决策排序、问责和注意力分配的决策架构。在一个预先指定的2*4因子实验中,涉及四个数据集的280个完整研究运行,无约束的多智能体基线在72%的运行中产生了关键失败。使用相同的底层模型、相同的智能体分解以及共享推理智能体的相同提示,HLER通过施加三个架构承诺将失败率降低到16%:LLMs进行推理但不执行数据工作,数据和估计以确定性方式处理,以及三个人类决策门约束工作流程。Fisher精确检验在p<0.001水平上拒绝失败率相等的假设。可靠性增益在公开代表性最低的数据集(一份清代人口登记册)上最大,这与基于任务的产出质量服从弗雷歇分布的生产模型一致。一项80次运行的消融研究表明,确定性计算和人类决策门独立贡献,并存在互补性的探索性证据。我们将HLER解释为一种研究框架而非自主的AI科学家:它大幅减少失败,使残留的弱点更加可见,并防止不可靠的主张作为可发表的成果被提出。

英文摘要

Large language models (LLMs) are increasingly used for tasks once reserved for trained researchers, including hypothesis generation, specification choice, and drafting conclusions. We argue that the reliability of AI-assisted research depends not only on model capability, but also on how cognitive labour is structured between humans and machines. We study this problem through Human-in-the-Loop Economic Research (HLER), a decision architecture based on pre-commitment, decision sequencing, accountability, and attention allocation. In a pre-specified 2*4 factorial experiment with 280 complete research runs across four datasets, an unconstrained multi-agent baseline produced critical failures in 72% of runs. Using the same underlying model, the same agent decomposition, and identical prompts for the shared reasoning agents, HLER reduced the failure rate to 16% by imposing three architectural commitments: LLMs reason but do not execute data work, data and estimation are handled deterministically, and three human decision gates bind the workflow. Fisher's exact test rejects equality of failure rates at p<0.001. Reliability gains were largest on the least publicly represented dataset, a Qing-dynasty population register, consistent with a task-based production model with Frechet-distributed output quality. An 80-run ablation suggests that deterministic computation and human gates contribute independently, with exploratory evidence of complementarity. We interpret HLER as a research harness rather than an autonomous AI scientist: it sharply reduces failures, makes residual weaknesses more visible, and prevents unreliable claims from being advanced as publication-ready outputs.

2606.12788 2026-06-12 cs.SI cs.CY cs.DC econ.GN eess.SY 新提交

To Share or Not to Share: Orchestrating Trustworthy Data in Global Value Chains

共享还是不共享:协调全球价值链中的可信数据

Han-Teng Liao, Chang-Yi Kao

AI总结 针对欧盟CBAM带来的监管透明与数据主权矛盾,提出基于IDSA框架的RegTech参考架构,通过主权数据交换实现数字产品护照,驱动全球商业服务能力需求,并集成Agentic AI与绿色金融,为全球产业集群提供可扩展蓝图。

详情
AI中文摘要

随着欧盟碳边境调节机制(CBAM)的临近,全球半导体价值链在监管透明度和数据主权之间面临日益增长的结构性紧张。本文提出了一种使用国际数据空间(IDSA)框架的RegTech参考架构,以在半导体-石化关联领域协调可信的环境遥测。该架构区分了强制性CBAM要求和自愿性科学碳目标倡议(SBTi)框架,同时解决了安全与可持续设计(SSbD)框架的附加复杂性。超越标准线性技术栈,我们引入了一种前瞻性路线图方法,将上游物理脆弱性转化为循环的负反馈循环。聚焦台北和槟城技术走廊,本文详细说明了主权数据交换如何使数字产品护照(DPP)能够驱动全球商业服务(GBS)能力需求。最后,我们讨论了集成Agentic AI以实现自主合规以及金融科技绿色融资,为全球产业集群实现主权、可持续和透明的价值链提供了可扩展蓝图。

英文摘要

As the EU Carbon Border Adjustment Mechanism (CBAM) approaches, the global semiconductor value chain faces growing structural tensions between regulatory transparency and data sovereignty. This article proposes a RegTech reference architecture using the International Data Spaces (IDSA) framework to orchestrate trustworthy environmental telemetry across the semiconductor-petrochemical nexus. The framework distinguishes the mandatory CBAM requirements from voluntary Science Based Targets initiative (SBTi) frameworks, while addressing the additive complexities of the Safe-and-Sustainable-by-Design (SSbD) framework. Moving beyond standard linear technology stacks, we introduce a prospective roadmapping methodology that transforms upstream physical vulnerabilities into circular, negative feedback loops. Focusing on the Taipei and Penang technology corridor, the article details how sovereign data exchange enables Digital Product Passports (DPPs) to drive Global Business Services (GBSs) capability demands. Finally, we discuss the integration of Agentic AI for autonomous compliance and FinTech green financing, providing a scalable blueprint for global industrial clusters to achieve sovereign, sustainable, and transparent value chains.

2606.12787 2026-06-12 cs.SI cs.CY econ.GN eess.SY q-fin.RM 新提交

Orchestrating the Twin Transition in Multinational Corporations: Technology Roadmapping for Green and Digital Global Business Services

跨国企业中的双重转型编排:面向绿色与数字全球商业服务的技术路线图

Han-Teng Liao, Karen Ang

AI总结 本文综合技术路线图与ITU创新生态系统工具,提出社会技术框架,分析跨国企业全球商业服务如何通过“可持续智能”演进,协调绿色与数字双重转型,并识别关键枢纽国家的作用。

详情
Comments
9 pages, 6 figures
AI中文摘要

全球商业服务(GBS)已成为绿色与数字双重转型的“活实验室”,因为跨国企业(MNCs)面临协调数字效率与环境管理的日益增长的压力。为推导出一个社会技术框架,本文将技术路线图(TRM)与国际电信联盟(ITU)以ICT为中心的创新生态系统工具包相结合。对研究集群的文献计量分析揭示了从基本流程自动化向“可持续智能”的演进转变,将GBS单元识别为中央“操作气闸”,在景观压力(如欧盟双重指令和碳边境调节机制)与AI原生工作流中的利基创新之间进行调解。研究进一步将这些集群映射到利益相关者参与画布上,突出显示波兰、葡萄牙和马来西亚的韧性“中等强国”枢纽如何绕过中等收入陷阱,在地缘政治分裂的云环境中为全球价值链提供“第三条道路”。结果为领导者及创业支持网络提供了数据驱动的设计方法,以编排人才和供应链流动,从而丰富对工业5.0的概念理解以及GBS作为在动荡、多极数字经济中导航的主要机制的作用。

英文摘要

Global Business Services (GBS) have emerged as a "living laboratory" for the Twin Transition of Green and Digital Transformation, as multinational corporations (MNCs) face increasing pressure to harmonize digital efficiency with environmental stewardship. Aiming to derive a socio-technical framework, this paper synthesizes Technology Roadmapping (TRM) with the International Telecommunication Union (ITU) ICT-centric innovation ecosystem toolkit. A bibliometric analysis of research clusters reveals an evolutionary shift from basic process automation toward "Sustainable Intelligence," identifying the GBS unit as a central "operational airlock" that mediates between landscape pressures -- such as the EU's dual mandate and Carbon Border Adjustment Mechanisms -- and niche innovations in AI-native workflows. The study further maps these clusters onto a stakeholder engagement canvas, highlighting how resilient "Middle Power" hubs in Poland, Portugal, and Malaysia are bypassing the middle-income trap to provide a "third way" for global value chains amidst a bifurcated geopolitical cloud. The results offer a data-driven design approach for leaders and entrepreneurial support networks to orchestrate talent and supply chain flows, thereby enriching the conceptual understanding of Industry 5.0 and the role of GBS as a primary mechanism for navigating a volatile, multipolar digital economy.

2606.12892 2026-06-12 stat.ML cs.LG econ.EM math.ST stat.ME 新提交

Prediction-Powered Causal Inference by Automatic Debiased Machine Learning and Semi-Supervised Riesz Regression

预测驱动的因果推断:自动去偏机器学习与半监督Riesz回归

Masahiro Kato

AI总结 研究半监督设置下因果参数的半参数有效估计,通过结合去偏机器学习和半监督Riesz回归,提出DML-PPCI和TMLE-PPCI方法,实现比仅用标注数据更小的渐近方差。

详情
AI中文摘要

本研究探讨了在半监督设置下因果和结构参数的半参数有效估计。在我们的设置中,除了由结果和回归变量组成的标注观测数据外,还有未标记的辅助回归变量可用。我们的目标是构建因果和结构参数的估计量,其渐近方差小于仅使用标注数据构建的估计量。我们将此框架称为预测驱动的因果推断(PPCI)。我们首先推导了有效影响函数和效率界,这表明使用辅助回归变量可以获得比仅从标注观测数据可达到的效率界更小的渐近方差。然后,通过将有效影响函数与去偏机器学习(DML)框架相结合,我们提出了称为DML-PPCI的方法。如果我们构建一个估计方程估计量,我们称之为EE-DML-PPCI;如果我们构建一个目标学习估计量,我们称之为TMLE-DML-PPCI。两种估计量的渐近方差都与我们推导的效率界相匹配。在构建估计量时,有效影响函数的估计起着重要作用。在我们的研究中,有效影响函数也是一个Neyman正交分数,它依赖于Riesz表示子和回归函数。对于Riesz表示子估计,我们开发了具有收敛速度保证的半监督广义Riesz回归。

英文摘要

This study investigates semiparametric efficient estimation of causal and structural parameters in a semi-supervised setting. In our setting, unlabeled auxiliary regressors are available in addition to labeled observations consisting of outcomes and regressors. Our goal is to construct estimators of causal and structural parameters whose asymptotic variances are smaller than those of estimators constructed using only labeled data. We refer to this framework as prediction-powered causal inference (PPCI). We first derive the efficient influence function and the efficiency bound, which imply that the use of auxiliary regressors can attain a smaller asymptotic variance than the efficiency bound attainable from labeled observations alone. Then, by combining the efficient influence function with the debiased machine learning (DML) framework, we propose methods that we call DML-PPCI. If we construct an estimating-equation estimator, we refer to the method as EE-DML-PPCI; if we construct a targeted-learning estimator, we refer to the method as TMLE-DML-PPCI. The asymptotic variances of both estimators match our derived efficiency bound. In the construction of the estimators, estimation of the efficient influence function plays an important role. In our study, the efficient influence function is also a Neyman orthogonal score, which depends on the Riesz representer and the regression function. For Riesz representer estimation, we develop semi-supervised generalized Riesz regression with convergence rate guarantees.

2606.12460 2026-06-12 physics.soc-ph econ.EM 新提交

Sovereign Stress Avalanches and Network Amplification in Latin America

拉丁美洲主权压力雪崩与网络放大效应

Diego Vallarino

AI总结 利用J.P.摩根EMBI全球多元化利差数据,通过幂律诊断、网络分析和安慰剂检验,发现拉丁美洲主权压力事件具有重尾分布(指数1.77),且同步性显著高于随机水平,但放大效应源于共同因子而非区域传播。

详情
AI中文摘要

本文利用2007-2026年间11个主权国家的J.P.摩根EMBI全球多元化月度利差数据,研究拉丁美洲信贷市场中的主权压力雪崩与网络放大效应。国家压力事件定义为正对数利差创新超过国家特定波动率阈值,区域雪崩统计每月压力国家数量。实证设计结合有限样本幂律诊断、阈值稳健性检验、国家级重排安慰剂以及滚动相关、偏相关和最小生成树网络。雪崩规模呈重尾分布,估计指数为1.77,而利差变化和事件间隔时间处于重尾边界区域。安慰剂显示同步性远高于独立压力时间,p值低于0.001。大型雪崩与更密集且频谱放大更强的原始相关网络同时出现,但经偏相关过滤后不再显著,表明是共同因子联动而非条件区域传播。网络指标描述同期压力状态而非早期预警信号。结果为监测新兴市场主权脆弱性提供了有限规模临界性框架。

英文摘要

This paper studies sovereign stress avalanches and network amplification in Latin American credit markets using monthly J.P. Morgan EMBI Global Diversified spreads for eleven sovereigns over 2007-2026. Country stress events are defined as positive log-spread innovations exceeding country-specific volatility thresholds, and regional avalanches count the number of stressed countries in each month. The empirical design combines finite-sample power-law diagnostics, threshold robustness checks, a country-level reshuffling placebo, and rolling correlation, partial-correlation, and minimum-spanning-tree networks. Avalanche sizes are heavy-tailed, with an estimated exponent of 1.77, while spread changes and inter-event times lie in a heavy-tail boundary regime. The placebo shows synchronization far above independent stress timing, with p-values below 0.001. Large avalanches coincide with denser and more spectrally amplifying raw-correlation networks, but not after partial-correlation filtering, indicating common-factor co-movement rather than conditional regional propagation. Network metrics describe contemporaneous stress regimes rather than early-warning signals. The results provide a finite-size criticality framework for monitoring sovereign fragility in emerging markets.

2606.07489 2026-06-12 cs.AI econ.GN 新提交

How AI Agents Reshape Knowledge Work: Autonomy, Efficiency, and Scope

AI代理如何重塑知识工作:自主性、效率与范围

Jeremy Yang, Kate Zyskowski, Noah Yonack, Jerry Ma

发表机构 * Harvard Business School Perplexity AI

AI总结 基于Perplexity产品数据,研究发现AI代理通过端到端任务执行,将自主工作时间从33秒提升至26分钟,完成时间缩短87%,成本降低94%,并扩展了工作范围与认知层次。

详情
AI中文摘要

前沿AI系统正从对话式助手转向端到端执行任务的自主代理,弥合智能与实用性之间的差距。利用Perplexity的Search和Computer产品的生产数据,我们通过研究AI代理如何加速和重塑知识工作来考察这一转变。三个关键实证发现出现。首先,使用具有几乎相同初始查询对的会话作为同一底层任务的自然实验,Computer每个用户会话执行26分钟的自主工作,而Search为33秒。Computer自动化了Search用户可能手动编排和实现的任务分解与执行。因此,Computer将后续查询分布转向更高层次的工作,如验证和扩展。自主性也提高了执行质量,Computer上每次查询的不满意率比Search低55%。其次,由于其自主性优势,Computer在匹配任务上将完成时间从269分钟减少到36分钟,与仅配备Search的人类相比,估计时间和成本分别降低87%和94%。第三,Computer改变了用户尝试的工作范围:Computer查询更常跨越职业边界,需要更高层次的认知,利用更广泛的专业知识,采取将相互依赖的子任务捆绑到单个查询中的复合任务形式,并解锁了同一用户在Search使用中基本不存在的工作活动。综合来看,证据表明AI代理加速工作流程、提高输出质量、降低成本,并扩展自动化工作的广度和深度。

英文摘要

Frontier AI systems are bridging the gap between intelligence and utility by shifting from conversational assistants to autonomous agents that execute tasks end to end. Using production data from Perplexity's Search and Computer products, we study this transition by examining how AI agents accelerate and reshape knowledge work. Three key empirical findings emerge. First, using sessions with near-identical initial query pairs as natural experiments for the same underlying task attempted with both products, Computer performs 26 minutes of autonomous work per user session, versus 33 seconds for Search. Computer automates task decomposition and execution that Search users might otherwise manually orchestrate and implement. As a result, Computer shifts follow-up query distribution toward higher-order work such as verification and extension. Autonomy also increases execution quality, with per-query dissatisfaction rates 55% lower on Computer than on Search. Second, due to its autonomy advantage, Computer reduces completion time from 269 to 36 minutes on matched tasks, lowering estimated time and cost by 87% and 94%, respectively, compared to humans equipped with Search alone. Third, Computer changes the scope of work that users attempt: Computer queries more often cross occupational boundaries, require higher-order cognition, draw on broader expertise, take the form of composite tasks that bundle interdependent subtasks into a single query, and unlock work activities that are essentially absent from Search usage among the same users. Together, the evidence indicates that AI agents accelerate workflows, enhance output quality, reduce costs, and expand the breadth and depth of automated work.