arXivDaily arXiv每日学术速递 周一至周五更新
重置
Q-FIN定量金融54
2606.09820 2026-06-09 math.FA cs.LG math.PR q-fin.MF stat.ML 新提交

Weighted universal approximation of differentiable maps on infinite-dimensional manifolds

无限维流形上可微映射的加权通用逼近

Philipp Schmocker, Josef Teichmann

AI总结 通过加权Nachbin定理,将函数输入神经网络的通用逼近定理推广到可微映射,包括导数逼近,并应用于非预期泛函和路径空间泛函的逼近。

详情
Comments
77 pages, 3 figures
AI中文摘要

我们将函数输入神经网络(FNN)的通用逼近定理推广到可微映射,包括导数的逼近。FNN将输入从可能无限维的加权流形映射到实值隐藏层,在该层上应用非线性标量激活函数,然后通过一些线性读出将输出返回到Banach空间。通过证明加权Nachbin定理,我们建立了可微映射的通用逼近定理(UAT),该定理超越了紧集上的通常表述,并且还包括导数的逼近。这导致了非预期泛函(包括水平和垂直导数)的逼近结果。作为进一步的应用,我们证明了签名的线性函数能够逼近路径空间泛函,包括它们的方向导数。

英文摘要

We generalize the universal approximation theorem for functional input neural networks (FNN) to differentiable maps by including the approximation of the derivatives. A FNN maps the input from a possibly infinite-dimensional weighted manifold to the real-valued hidden layer, on which a non-linear scalar activation function is applied, and then returns the output into a Banach space via some linear readouts. By proving a weighted Nachbin theorem, we establish a universal approximation theorem (UAT) for differentiable maps, which goes beyond the usual formulation on compact sets and also includes the approximation of the derivatives. This leads us to approximation results for non-anticipative functionals including the horizontal and vertical derivatives. As a further application, we show that linear functions of the signature are able to approximate path space functionals including their directional derivatives.

2606.09642 2026-06-09 econ.GN q-fin.EC 新提交

The Dispossessed: Large-Scale Land Acquisitions, Elite Capture, and Dissent in Africa

被剥夺者:非洲的大规模土地收购、精英俘获与异议

Jonathan Dries

AI总结 利用随机失败的交易作为对照组,估计大规模土地收购对当地异议的因果影响,发现其导致公民骚乱持续增加158%,且国内投资者收购社区或国有土地用于粮食生产时抗议反应最强,指向地方剥夺和国内精英俘获。

详情
AI中文摘要

在过去二十年中,非洲数百万公顷的土地已转移给投资者,引发了人们对流离失所和冲突的担忧。本文通过将成功实施的项目与一组外生失败的交易作为对照组,估计了大规模土地收购(LSLAs)对当地异议的因果影响。使用跨1,391个地理编码交易的交错双重差分估计器,我发现LSLAs导致公民骚乱相对于处理前均值持续增加158%。抗议反应在国内投资者收购社区或国有土地用于粮食生产时最为强烈,指向地方剥夺和国内精英俘获。结合媒体、调查和选举数据,与这一假设一致,我记录了受影响选区中产权媒体话语的平行转变、传统权威的侵蚀以及更广泛的选举动员。

英文摘要

Over the past two decades, millions of hectares of land in Africa have been transferred to investors, raising fears of displacement and conflict. This paper estimates the causal impact of large-scale land acquisitions (LSLAs) on local dissent by comparing successfully implemented projects to a control group of exogenously failed deals. Using staggered difference-in-differences estimators across 1,391 geocoded deals, I find that LSLAs cause a sustained increase in civic unrest of 158% relative to the pre-treatment mean. Protest responses are strongest among domestic investors acquiring community or state land for food-crop production, pointing to local dispossession and domestic elite capture. Integrating media, survey, and electoral data consistent with this hypothesis, I document parallel shifts in property-rights media discourse, an erosion of traditional authority, and broader electoral mobilization in affected constituencies.

2606.09564 2026-06-09 q-fin.PR q-fin.MF q-fin.TR 新提交

Option prices from operational-time reaction-boundary lattices

来自操作时间反应边界格点的期权定价

Chris Angstmann, Tim Gebbie

AI总结 提出操作时间马尔可夫格点模型,推导期权定价方程,将局部波动率与风险中性买卖反应边界方差关联,分离操作核、日历时间投影和定价测度选择,阐明未跨度时钟、跳跃或更新风险导致的不完全市场定价。

详情
Comments
Working paper:18 pages, 1 Table, 2 Figures, a short outlined calculation in an appendix
AI中文摘要

我们考虑连续操作时间 u 及其到日历时间 t 的映射,以及它们如何与期权定价问题中的事件时间相关联。我们从操作时间马尔可夫格点而非日历时间扩散推导期权定价方程。原始模型是最近邻对数价格格点,具有状态和时间依赖的转移概率。其Chapman-Kolmogorov分解产生离散前向和后向方程,在局部有限方差标度下收敛到通常的连续伴随对。在价格变量中,后向方程给出广义欧式定价PDE,并在风险中性漂移限制和常数波动率下简化为Black-Scholes-Merton。解释为限价订单簿中间价格的反应边界模型,该构造将局部波动率识别为活动重新标度的风险中性买卖反应边界方差。该框架分离操作核、日历时间投影和定价测度选择,以阐明未跨度时钟、跳跃或更新风险如何导致不完全市场定价。

英文摘要

We consider the role of a continuum operational time u and its mapping to calendar time t and how these relate to event time for option pricing problems. We derive option-pricing equations from an operational-time Markov lattice rather than from a calendar-time diffusion. The primitive model is a nearest-neighbour log-price lattice with state- and time-dependent transition probabilities. Its Chapman-Kolmogorov decomposition yields discrete forward and backward equations, which converge under local finite-variance scaling to the usual continuum adjoint pair. In price variables, the backward equation gives a generalized European pricing PDE and reduces to Black-Scholes-Merton under the risk-neutral drift restriction and constant volatility. Interpreted as a reaction-boundary model for limit-order-book mid-prices, the construction identifies local volatility with an activity-rescaled risk-neutral bid-ask reaction-boundary variance. The framework separates the operational kernel, calendar-time projection, and pricing-measure choice, to clarify how unspanned clock, jump, or renewal risks can lead to incomplete-market pricing.

2606.09478 2026-06-09 q-fin.TR q-fin.CP q-fin.MF 新提交

Volatility Forecasting and Return Prediction under Market Regimes: Evidence from High-Frequency Chinese Equity Data

市场机制下的波动率预测与收益预测:来自中国高频股票数据的证据

Xinyue Fang, Robert Ślepaczuk

AI总结 研究将机制依赖的波动率预测与基于机器学习的收益预测相结合,使用中国高频股指数据构建两阶段框架,发现机制感知的波动率预测优于基准模型,但收益预测仅在低波动机制下有效,且需结合波动率缩放等策略才能提升经济表现。

详情
Comments
41 pages, 16 figures, 21 tables
AI中文摘要

本研究探讨了机制依赖的波动率预测与基于机器学习的收益预测是否可以联合集成,以改善股票市场的统计预测性能和经济策略结果。使用2005年至2023年的高频沪深300指数数据,开发了一个顺序两阶段框架。在第一阶段,使用机制增强的HARQ规格结合马尔可夫切换GJR-GARCH滤波对已实现波动率进行建模,以捕捉长记忆动态、非对称性和结构性市场机制。在第二阶段,将波动率预测、机制指标和收益相关预测因子纳入XGBoost收益预测模型,该模型通过严格的滚动外推样本外程序进行估计。实证结果表明,机制感知的波动率预测在预测评估指标上持续优于基准HARQ模型,并得到正式预测比较检验的支持。相比之下,收益可预测性仍然较弱、依赖于状态,并且主要集中在低波动机制中。尽管在考虑现实交易成本后,朴素预测交易策略通常失败,但精心设计的实施(包括波动率缩放、低波动门控、阈值校准和换手率控制)可以改善防御性经济表现。研究结果表明,金融市场上预测系统的实际价值可能较少依赖于生成强的无条件收益预测,而更多依赖于将弱的、依赖于状态的信号转化为经济上稳健的投资组合分配规则。总体而言,本研究通过将计量经济波动率建模、机制分类、机器学习收益预测和实施现实性整合在一个统一框架内做出了贡献。

英文摘要

This study investigates whether regime-dependent volatility forecasting and machine-learning-based return prediction can be jointly integrated to improve both statistical forecasting performance and economic strategy outcomes in equity markets. Using high-frequency CSI 300 Index data from 2005 to 2023, a sequential twostage framework is developed. In the first stage, realized volatility is modeled using regime-augmented HARQ specifications combined with Markov-switching GJR-GARCH filtering to capture long-memory dynamics, asymmetry, and structural market regimes. In the second stage, volatility forecasts, regime indicators, and return-related predictors are incorporated into an XGBoost return-prediction model estimated through a strictly walk-forward out-of-sample procedure. The empirical results demonstrate that regime-aware volatility forecasting consistently outperforms baseline HARQ models across forecast evaluation metrics and is generally supported by formal forecast comparison tests. In contrast, return predictability remains weak, state-dependent, and concentrated primarily in low-volatility regimes. Although naive predictive trading strategies generally fail after accounting for realistic transaction costs, carefully designed implementations incorporating volatility scaling, low-volatility gating, threshold calibration, and turnover controls can improve defensive economic performance. The findings suggest that the practical value of predictive systems in financial markets may depend less on generating strong unconditional return forecasts and more on transforming weak state-dependent signals into economically robust portfolio allocation rules. Overall, the study contributes by integrating econometric volatility modeling, regime classification, machine-learning return prediction, and implementation realism within a unified framework.

2606.09472 2026-06-09 econ.GN q-fin.EC 新提交

Parameter Sensitivity Analysis of Hierarchical Spatial Economy: Trade Strategy around Brexit

分层空间经济的参数敏感性分析:围绕英国脱欧的贸易策略

Kiyohiro Ikeda, Yosuke Kogure, Hiroki Aizawa, Yuki Takayama

AI总结 提出分层约简方法,将区域级方程压缩为国家及联盟级方程,分析经济参数敏感性,应用于英法德围绕英国脱欧的贸易竞争,发现降低国内运输成本是关键,关税是双刃剑。

详情
Comments
37 pages, 10 figures, 3 tables
AI中文摘要

本文提出了一个系统框架,用于分析经济地理模型中分层空间经济的参数敏感性。通过本研究提出的分层约简方法,原始的区域级控制方程被压缩为国家级和联盟级方程。基于约简后的控制方程,我们制定了经济变量对每个国家人口的敏感性。该方法应用于分析英国、法国和德国之间的国际贸易竞争——涵盖英国脱欧前后的贸易自由化和保护主义。我们发现,英国和欧盟都应专注于降低国内运输成本,而关税和报复性关税是一把双刃剑,可能加强或削弱其贸易地位。

英文摘要

This paper presents a systematic framework for analyzing the economic parameter sensitivity of a hierarchical spatial economy within economic geography models. Through the hierarchical reduction approach proposed in this study, the original region-level governing equation is condensed into country-level and alliance-level equations. Based on the reduced governing equation, we formulate the sensitivity of economic variables on each country's population. This approach is applied to the analysis of international trade competition -- covering both trade liberalization and protectionism around Brexit -- among the UK, France, and Germany. We find that both the UK and the EU should focus on reducing domestic transportation costs, whereas tariffs and retaliatory tariffs act as a double-edged sword that can either strengthen or weaken their trade positions.

2606.09463 2026-06-09 cs.SI econ.GN q-fin.EC 新提交

The Changing Global Division of Labor in Software: Emergence and Diffusion of New Programming Skills across IT Hubs

软件行业全球分工的变迁:IT中心新编程技能的出现与扩散

Johannes Wachs, Xiangnan Feng, Simone Daniotti, Frank Neffke

AI总结 利用6000万条软件问答数据,研究237种编程技能在城市的出现与扩散规律,发现软件行业遵循与传统行业相似的空间模式:新技能先在大型多样化IT中心出现,再向小城市扩散。

详情
AI中文摘要

随着新产业的兴起,往往会出现新的工作岗位。演化经济地理学,特别是产业生命周期视角预测,这些活动首先在有限数量的城市出现,然后随着工作描述的标准化扩散到其他地点。本文聚焦于一个特别重要的新产业:软件开发,这一活动在经济上重要、变化迅速,并且在少数全球IT中心具有显著的空间集中性。我们使用一个包含超过6000万个关于软件开发问题的问答在线数据库,生成了一个包含237种软件技能的纵向数据集。通过定期对300万发帖用户进行地理定位,我们将这些技能与全球城市联系起来。我们发现,尽管软件产业具有数字性质,但它表现出与传统行业类似的空间规律性。首先,城市会向与其现有技能相关的技能领域多元化。其次,新技能首先在拥有大型和多样化软件部门的城市出现,随后——基本不受地理距离阻碍——扩散到专门从事密切相关技能的小城市。我们发现有限但支持区位机会窗口假说的证据:尽管即使是全新的技能也首先出现在对相关技能有较强前期专业化的城市,但相关活动的集中对新技能出现的影响小于对现有技能扩散的影响。

英文摘要

With the rise of new industries, often new jobs emerge. Evolutionary Economic Geography and in particular Industry Life Cycle perspectives predict that these activities first emerge in a limited number of cities to then diffuse to other locations as job descriptions become more standardized. Here, we focus on a particularly important new industry: software development, an activity that is economically important, quickly changing, and has a pronounced spatial concentration in a small number of global IT hubs. We use an online database of over 60 million questions and answers about problems in software development that yields a longitudinal dataset of 237 software skills. By geo-locating 3 million posting users at regular intervals, we link these skills to cities worldwide. We find that, in spite of its digital nature, the software industry exhibits similar spatial regularities as previously observed in more traditional sectors. First, cities diversify into skills that are related to their existing ones. Second, new skills first emerge in cities with large and diversified software sectors, and later diffuse -- mostly unhindered by geographical distance -- to smaller cities specialized in closely related skills. We find suggestive but limited support for a windows of locational opportunity account: although even brand-new skills still emerge first in cities with strong prior specialization in related skills, concentrations of related activities impact less the emergence of new skills than the diffusion of existing ones.

2606.09454 2026-06-09 q-fin.MF econ.TH q-fin.TR 新提交

Axiomatic Market Making

公理化做市

Frank M. V. Feys

AI总结 本文通过八个公理和六个环境假设,唯一确定了一个三参数报价规则族,其中中间价与库存线性相关,价差分解为库存和逆向选择成分,并证明了参数的可识别性和结构推论。

详情
Comments
66 pages, 3 figures
AI中文摘要

本文对买卖价差做市商的报价规则进行了公理化。报价规则将做市商的状态(即库存、信念、方差、交易强度和知情交易者比例)映射到一个买卖价差对。八个自然公理,连同关于做市商库存成本的六个环境假设,强制得到一个唯一的三参数族:中间价与库存线性相关,价差可加性地分解为库存和逆向选择成分。每个参数从可观测报价规则的不同矩中识别,且三个识别相互解耦。八个公理分为一个四公理的核心、一个结构选择和三个模块化扩展。两个结构推论随之而来:潜在库存成本函数可从限价订单簿恢复,以及一个尖锐的相变将运行机制与冻结机制分开。一个结尾的元定理识别出公理系统内所有允许的结构原语中不变的四个特征。据我们所知,这是报价规则的第一个强制唯一性公理化。

英文摘要

This paper axiomatizes the bid-ask market maker's quoting rule. A quoting rule maps the maker's state, namely inventory, belief, variance, trade intensity, and informed-trader fraction, to a bid-ask pair. Eight natural axioms, together with six environmental assumptions on the maker's inventory cost, force a unique three-parameter family: the mid-quote is linear in inventory, and the spread decomposes additively into inventory and adverse-selection components. Each of the three parameters is identified from a distinct moment of the observable quoting rule, with the three identifications mutually decoupled. The eight axioms partition into a four-axiom indispensable core, one structural choice, and three modularity extensions. Two structural corollaries follow: the latent inventory cost function is recoverable from the limit order book, and a sharp phase transition separates a functioning regime from a frozen one. A closing meta-theorem identifies four features invariant across all admissible structural primitives within the axiom system. To our knowledge, this is the first forced-uniqueness axiomatization of the quoting rule.

2606.09420 2026-06-09 math.OC q-fin.PM 新提交

Benchmarking Deep Time Series Models for Equity Portfolios

深度时间序列模型在股票组合中的基准测试

Aoxin Zhang, Yuhan Cheng, Kwanting Leung

AI总结 构建2018-2024年CRSP日频股票基准,结合多准则可接受性分析与约束投资组合层,评估15种深度和统计时间序列架构,发现无模型占优,TransEnc-8排名第一可接受性0.352。

详情
Comments
51 pages, 28 figures, 43 tables; includes appendices
AI中文摘要

对日频股票组合的预测架构进行基准测试不仅仅是一个预测练习。它还询问在施加偏好、成本和投资组合约束后,哪些模型仍然可用。我们为2018-2024年期间的15种深度和统计时间序列架构构建了一个CRSP日频股票基准。该协议结合了共同窗口十分位投资组合、随机多准则可接受性分析、部署调整可接受性指数以及一个具有容量、贝塔、行业、风险、杠杆和换手率控制的约束二次投资组合层。该指数从SMAA排名可接受性分布出发,并降低那些准则层面胜利导致高投资组合遗憾的模型的权重;其Gibbs形式被刻画为从SMAA先验的熵更新。经验上,没有架构在原始基准中占主导地位:TransEnc-8具有最大的排名1可接受性0.352,且没有模型超过约0.36。排名随偏好、市场状态、特征空间和交易成本而变化。在推广的五模型约束投资组合比较中,TransEnc-8始终被选中,而面向收益的原始排名可能偏好TS-RIDGE。广泛空间的十分位信号可以承受成本,但基线约束QP在20个基点时的净夏普比率对每个推广模型均为负。该基准支持模型选择和诊断,而非独立的交易策略声明。

英文摘要

Benchmarking forecasting architectures for daily equity portfolios is not just a prediction exercise. It also asks which model remains usable after preferences, costs, and portfolio constraints are imposed. We build a CRSP daily-stock benchmark for 15 deep and statistical time-series architectures over 2018--2024. The protocol combines common-window decile portfolios, stochastic multi-criteria acceptability analysis, a deployment-adjusted acceptability index, and a constrained quadratic portfolio layer with capacity, beta, industry, risk, leverage, and turnover controls. The index starts from the SMAA rank-acceptability distribution and downweights models whose criteria-level wins produce high portfolio regret; its Gibbs form is characterized as an entropic update from the SMAA prior. Empirically, no architecture dominates the raw benchmark: TransEnc-8 has the largest rank-1 acceptability, 0.352, and no model exceeds about 0.36. Rankings vary with preferences, market state, feature universe, and transaction costs. In the promoted five-model constrained-portfolio comparison, TransEnc-8 is selected throughout, while return-oriented raw rankings can favor TS-RIDGE. Broad-universe decile signals can survive costs, but the baseline constrained-QP net Sharpe at 20 bps is negative for every promoted model. The benchmark supports model selection and diagnosis rather than a standalone trading-strategy claim.

2606.09274 2026-06-09 q-fin.RM q-fin.ST stat.ME 新提交

Reverse Stress Testing for Multivariate Scenarios: A Conditional Framework for Stressed Time Series

多变量场景的反向压力测试:压力时间序列的条件框架

Michele Sparviero, Lorenzo Viola

AI总结 提出一种反向压力测试方法,从单一资产类别的外生冲击出发,重建与市场经验依赖结构一致的多变量压力场景,并通过三种分布假设下的条件密度最大化求解。

详情
Comments
26 pages, 5 figures, 2 tables
AI中文摘要

本文开发了一种反向压力测试(RST)的方法框架,其中从单一资产类别上施加的外生冲击出发,重建与市场经验依赖结构一致的多变量压力场景。该问题被表述为在给定冲击下条件密度的最大化,并在三种逐渐减弱的分布假设下求解。在参数设置中,收益的联合高斯性产生了一个封闭形式的模态场景,该场景与非冲击分量的条件均值一致。在半参数设置中,通过经验似然方法非参数地估计模态场景,并通过高斯或学生t局部采样方案生成周围的压力轨迹。在完全非参数设置中,通过在估计场景的马氏邻域内对历史观测进行逆距离重采样来获得压力轨迹。这三种变体在实际市场数据上得到了验证。模拟的场景在经济上是一致的,并且能够再现压力市场体制中观察到的标准风险-回报不对称性。

英文摘要

This paper develops a methodological framework for reverse stress testing (RST) in which a multivariate stress scenario, coherent with the empirical dependence structure of a market, is reconstructed from a single exogenous shock prescribed on one asset class. The problem is formulated as the maximisation of the conditional density given the imposed shock, and is solved under three progressively weaker distributional assumptions. In the parametric setting, joint Gaussianity of the returns yields a closed-form modal scenario coinciding with the conditional mean of the non-shocked components. In the semiparametric setting, the modal scenario is estimated nonparametrically through the empirical likelihood methodology and the surrounding stressed trajectories are generated via a Gaussian or Student-t local sampling scheme. In the fully nonparametric setting, stressed trajectories are obtained by inverse-distance resampling of the historical observations within a Mahalanobis neighbourhood of the estimated scenario. The three variants are validated on real market data. The simulated scenarios prove to be economically coherent and capable of reproducing the standard risk-reward asymmetry observed in stressed market regimes.

2606.09190 2026-06-09 econ.GN math.OC q-fin.EC 新提交

Planning resilient hydrogen supply chains under disruption risk

规划中断风险下的弹性氢供应链

Silvian M. Radke, Philipp C. Verpoort, Falko Ueckerdt, Felix Müsgens

AI总结 采用随机优化模型研究欧盟氢进口,发现考虑供应中断的风险感知规划可减少12%福利损失,并通过多样化进口走廊和战略超额投资实现弹性。

详情
Comments
41 pages, 10 figures
AI中文摘要

尽管对能源安全的担忧日益加剧,新兴绿色燃料供应链的基础设施规划与建模常常忽视供应中断的风险。利用欧盟氢进口的随机优化模型,我们表明,与预期供应中断的风险感知规划相比,“天真”的基础设施规划会导致12%(240亿欧元)的福利损失。尽管需要更高的前期投资,预期规划实现的福利水平接近没有中断的理想系统,但基础设施配置明显不同。出现了两种互补的弹性策略:进口走廊多样化和战略超额投资。这导致欧洲内部运输能力增加、进口管道范围扩大,以及对氢载体昂贵航运终端的投资。我们的结果表明,将供应风险考虑纳入基础设施规划有助于在设计未来氢供应链时防止化石燃料系统中出现的结构性脆弱性。

英文摘要

Despite growing concerns over energy security, infrastructure planning and modelling for emerging green fuel supply chains often neglect risks from supply disruptions. Using a stochastic optimisation model of EU hydrogen imports, we show that 'naive' infrastructure planning results in welfare losses of 12 % (24 billion EUR) compared to risk-aware planning that anticipates supply disruptions. Despite requiring higher upfront investments, anticipatory planning achieves welfare levels close to those of an idealised system without disruptions, but entails a markedly different infrastructure configuration. Two complementary resilience strategies emerge: diversification across import corridors and strategic over-investment. This leads to increased intra-European transport capacity, a broader set of import pipelines, and investments in costly shipping terminals for hydrogen carriers. Our results show that incorporating supply risk considerations into infrastructure planning helps prevent the structural vulnerabilities seen in fossil fuel systems when designing future hydrogen supply chains.

2606.09104 2026-06-09 cs.LG cs.AI q-fin.PM 新提交

Addressing Market Regime Changes and Heavy-Tailed Returns in Portfolio Optimization via Bayesian VAR and Elliptical Black-Litterman

通过贝叶斯VAR和椭圆Black-Litterman解决投资组合优化中的市场机制变化和重尾收益问题

Daniil Mikriukov, Ruoyu Sun, Angelos Stefanidis, Jionglong Su, Zhengyong Jiang

发表机构 * University of Liverpool(利物浦大学) Xi'an Jiaotong-Liverpool University(西交利物浦大学)

AI总结 提出BAVAR-BLED算法,结合贝叶斯平均向量自回归和椭圆分布Black-Litterman模型,在TD3架构下自适应分配资产,在道琼斯工业平均指数成分股上实现夏普比率1.72和总收益57.26%。

详情
Comments
9 pages, 3 figures, 4 tables. Extends our prior work [Mikriukov et al., ICIC 2025] on Black-Litterman under Elliptical Distributions (BLED). Manuscript under review
AI中文摘要

用于投资组合优化的深度强化学习框架因其能够从市场数据中动态学习分配规则而显示出前景。然而,这些模型未能考虑肥尾收益,而肥尾收益以更频繁的极端事件为特征,描述了实际市场行为。此外,历史数据被同质化处理,未考虑时间重要性,导致模型在机制变化时失效。我们提出了一种新的BAVAR-BLED算法,该算法在TD3架构内结合了源自贝叶斯平均向量自回归(BAVAR)和使用椭圆分布的Black-Litterman模型(BLED)的方法。BAVAR捕获一组考虑多尺度时间特征的向量自回归表示,从而基于对收益预期和离散矩阵的机制感知估计实现自适应分配决策。这些估计作为BLED的先验输入,BLED使用学生t分布,允许更现实的肥尾收益估计。BAVAR-BLED算法使用Transformer网络进行观点构建,使用CNN进行风险厌恶估计,根据市场条件修改动态分配决策。对道琼斯工业平均指数29只成分股在十年市场周期内的评估表明,BAVAR-BLED显著优于最先进的方法,实现了1.72的夏普比率和2.70的索提诺比率,总收益为57.26%。

英文摘要

Deep reinforcement learning (DRL) frameworks for portfolio optimization have shown promise for their ability to learn allocation rules dynamically from market data. However, these models fail to account for fat-tailed returns, which characterize actual market behavior with more frequent extreme events. Furthermore, historical data is treated homogeneously, without accounting for temporal importance, leading models to fail during regime changes. We propose a new BAVAR-BLED algorithm that combines methods derived from Bayesian-Averaging Vector Autoregressive (BAVAR) and the Black-Litterman model using Elliptical Distributions (BLED) within a TD3 architecture. BAVAR captures a set of vector autoregressive representations that consider multi-scale temporal features, enabling adaptive allocation decisions based on regime-aware estimates of return expectations and dispersion matrices. These estimates serve as prior inputs to BLED, a model that uses Student's t-distributions, allowing for more realistic fat tail return estimates. The BAVAR-BLED algorithm uses transformer networks for view construction and CNNs for risk-aversion estimates, which modify dynamic allocation decisions based on market conditions. An evaluation of 29 Dow Jones Industrial Average constituents over a decade-long market period shows that BAVAR-BLED significantly outperforms state-of-the-art methods, achieving Sharpe and Sortino ratios of 1.72 and 2.70, respectively, and total returns of 57.26%.

2606.09025 2026-06-09 q-fin.PM 新提交

Continuous Cash-Overlay Filters for a Static Growth--Defensive Risk Sleeve: Slow-Tail Compensation, V-Shape Crash Brakes, Walk-Forward Validation, and Max-Cash Combination

静态增长-防御风险组合的连续现金覆盖过滤器:慢尾补偿、V型崩溃刹车、滚动前向验证和最大现金组合

Zheli Xiong

AI总结 本文针对静态增长-防御风险组合与现金之间的分配问题,开发了两种连续现金覆盖过滤器(慢尾补偿和V型崩溃刹车),并通过最大现金规则组合,显著提升收益并降低最大回撤。

详情
Comments
dynamic asset allocation; cash overlay; crash protection; VIX; interest rates; credit stress; walk-forward validation; drawdown control
AI中文摘要

本文研究了一个静态增长-防御风险组合与带息现金之间的现金覆盖分配问题。风险组合固定为等权增长和防御ETF篮子各50%的组合,因此现金覆盖独立于任何动态增长-防御风格择时策略。目标是未来风险组合相对于现金的超额收益,其中现金部分采用同期现金利率衡量。\n 我开发了两种连续过滤器。慢尾补偿过滤器针对风险组合补偿的持续恶化,特别是现金收益率上升而风险资产不稳定的时期。V型崩溃刹车过滤器针对快速回撤事件及其后的重新入场。两种过滤器通过固定的最大现金规则组合,即组合每日使用两者中较大的现金权重。\n 在2017-2026年共同窗口上,选定权重的最大现金组合实现了20.45%的年化复合增长率,而静态风险组合为16.62%,最大回撤从-33.59%改善至-16.77%。更严格的版本结合了各组件自身的滚动前向样本外权重。在主要样本外窗口中,扩展的最大现金组合实现了18.05%的收益率,而静态风险组合为16.09%,最大回撤为-22.05%对比-33.59%。证据支持模块化连续现金覆盖作为回撤控制工具,而多重检验调整推断和实时变量重新筛选留待未来工作。

英文摘要

This paper studies a cash-overlay allocation problem between a static growth-defensive risky sleeve and interest-bearing cash. The risky sleeve is fixed as a 50/50 combination of equal-weight growth and defensive ETF baskets, so the cash overlay is evaluated independently of any dynamic growth-defensive style-timing policy. The target is future risky-sleeve return over cash, with the cash leg measured using the contemporaneous cash rate. I develop two continuous filters. The slow-tail compensation filter targets persistent deterioration in risky-sleeve compensation, especially regimes in which cash yield rises and risky assets remain unstable. The V-shape crash-brake filter targets fast drawdown episodes and subsequent re-entry. The two filters are combined using a fixed max-cash rule, under which the portfolio uses the larger of the two cash weights each day. On the common 2017-2026 window, the selected-weight max-cash combination earns a 20.45 percent CAGR versus 16.62 percent for the static risky sleeve, and improves maximum drawdown from -33.59 percent to -16.77 percent. A stricter version combines each component's own walk-forward out-of-sample weights. In the main OOS window, the expanding max-cash combination earns 18.05 percent versus 16.09 percent for the static risky sleeve, with maximum drawdown of -22.05 percent versus -33.59 percent. The evidence supports modular continuous cash overlays as drawdown-control tools, while leaving multiple-testing-adjusted inference and real-time variable re-screening for future work.

2606.09003 2026-06-09 q-fin.MF math.OC q-fin.TR 新提交

Proof of Stake economy under centralized exchanges--a mean field model

中心化交易所下的权益证明经济——一个平均场模型

Wenpin Tang

AI总结 本文通过连续时间平均场模型,研究中心化交易所交易活动对PoS区块链的质押行为、代币分配和去中心化的影响,发现中心化交易可能增强质押参与并促进去中心化。

详情
Comments
18 pages
AI中文摘要

我们考虑中心化交易与去中心化权益证明(PoS)区块链生态系统之间的相互作用。受中心化交易所日益占据主导地位以及加密市场机构化的推动,我们研究了中心化交易所上的交易活动如何影响PoS区块链内的质押行为、代币分配和去中心化。我们构建了一个连续时间平均场模型,其中矿工同时充当PoS协议中的验证者和具有价格影响的中心化市场中的交易者。在适当假设下,我们建立了平均场系统的局部适定性,并推导出均衡交易策略的半显式刻画。数值结果表明,中心化交易活动可能增强质押参与,并通过市场激励促进质押分布的去中心化。我们还研究了交易成本和代币供应机制对均衡质押比率和集中度的影响。这些结果说明了市场微观结构和中心化流动性提供如何对去中心化区块链协议产生显著影响。

英文摘要

We consider the interaction between centralized trading and decentralized Proof of Stake (PoS) blockchain ecosystems. Motivated by the increasing dominance of centralized exchanges and the institutionalization of crypto markets, we study how trading activities on centralized exchanges affect staking behavior, token allocation, and decentralization within a PoS blockchain. We formulate a continuous-time mean field model, where the miners simultaneously act as validators in the PoS protocol and traders in a centralized market with price impact. Under suitable assumptions, we establish the local well-posedness of the mean field system, and derive a semi-explicit characterization of the equilibrium trading strategy. Numerical results suggest that centralized trading activities may enhance staking participation, and promote decentralization of the staking distribution through market incentives. We also study the effects of transaction costs and token supply mechanisms on the equilibrium staking ratio and concentration profile. These results illustrate how market microstructure and centralized liquidity provision can exert significant influence on decentralized blockchain protocols.

2606.08998 2026-06-09 cs.AI cs.CY econ.GN q-fin.EC 新提交

The Token Not Taken: Sampling, State, and the Variability of AI Agent Outputs

未被选取的令牌:采样、状态与AI智能体输出的变异性

Muhammad Zia Hydari, Raja Iqbal

发表机构 * University of Pittsburgh(匹兹堡大学) Ejento.ai

AI总结 本文分析AI智能体系统输出变异性的来源,区分令牌采样的内在随机性与环境、数据等外在因素,并讨论在匹配条件下变异性的可复现性及确定性执行在部署中未必导致相同行为的原因。

详情
AI中文摘要

智能体AI系统在不同运行中可能表现出不同的行为:相同的请求可能产生不同的计划、不同的工具调用、不同的代码编辑或不同的最终答案。这种变异性源于多个常被混淆的层面。基础模型是一个大型预训练模型,通常可适应许多下游任务,将输入上下文映射到输出的预测。在当前许多智能体中,该模型嵌入在一个编排循环中,该循环进行规划、调用工具、观察结果并更新状态。此类系统中一个明确的内在变异性来源是令牌生成:模型计算可能的下一个令牌的分数,分数被转换为概率,解码器可能使用伪随机数生成器采样令牌。一个微小的采样令牌差异随后可能向上传播为不同的工具调用、代码路径、搜索查询或智能体状态。其他变异性来源是令牌采样的外在因素,包括变化的环境、实时数据、服务基础设施、批次效应和数值细节。通过分离这些层面,本文阐明了将智能体AI系统称为随机系统的含义、在匹配条件下这种变异性何时可复现,以及为什么确定性执行在部署环境中不一定意味着相同的行为。

英文摘要

Agentic AI systems can behave differently across runs: the same request may produce a different plan, a different tool call, a different code edit, or a different final answer. Such variability arises from several layers that are often conflated. A foundation model is a large pretrained model, usually adaptable to many downstream tasks, that maps an input context to predictions over outputs. In many current agents, that model is embedded in an orchestration loop that plans, calls tools, observes results, and updates state. One explicit intrinsic source of variability in such systems is token generation: the model computes scores over possible next tokens, the scores are converted into probabilities, and a decoder may sample tokens using a pseudo-random number generator. A small sampled token difference can then propagate upward into a different tool call, code path, search query, or agent state. Other sources of variability are extrinsic to token sampling, including changing environments, live data, serving infrastructure, batch effects, and numerical details. By separating these layers, the manuscript clarifies what it means to call agentic AI systems stochastic, when such variability can be reproduced under matched conditions, and why deterministic execution need not imply identical behavior in deployed settings.

2606.08791 2026-06-09 econ.EM cs.AI q-fin.PM q-fin.RM q-fin.ST 新提交

Evaluating AI Investment Strategies

评估AI投资策略

Irene Aldridge

AI总结 研究通过可观测输入输出审计黑箱算法决策者,提出动态策略累积遗憾的精确分解,扩展至多期随机动态规划,并给出偏差修正与轨迹估计器。

详情
Comments
33 pages
AI中文摘要

我们研究仅从可观测输入和输出审计黑箱算法决策者的问题。主要结果是一个精确分解:在精确刻画条件下,动态策略的累积遗憾等于成本向量与策略决策之间每期协方差之和。这扩展了Aldridge (2026)的单期恒等式到随机动态规划的完整多期设置。我们证明了该恒等式在独立同分布成本和均值无偏马尔可夫策略下精确成立,推导了非平稳和时变情况下的闭式偏差修正,并建立了折现期模拟。协方差遗憾泛函的贝尔曼递归将该结果与标准强化学习算法联系起来;对于滚动窗口策略,估计误差偏差为$O(d/w)$。该分解对战略环境中的算法审计有直接影响:在平台机制设计中,它提供了基于福利的审计指标,无需访问代理的私人类型;在重复博弈中,协方差减少是策略改进的充分条件;在采购和广告拍卖中,偏差修正量化了战略误报导致的福利损失。相关的轨迹估计器是一致的、渐近正态的(具有HAC方差),并且可在$O(T \cdot nd)$时间内计算。这使得所提出的方法成为平台机制、算法投资策略以及任何受外部绩效审查的序列决策系统的可处理、无模型审计工具。

英文摘要

We study the problem of auditing a black-box algorithmic decision-maker from observable inputs and outputs alone. Our main result is an exact decomposition: under precisely characterized conditions, the cumulative \emph{regret} of a dynamic policy equals the sum of per-period covariances between the cost vector and the policy's decision. This extends the single-period identity of Aldridge~(2026) to the full multi-period setting of stochastic dynamic programming. We prove the identity holds exactly under i.i.d. costs and mean-unbiased Markov policies, derive closed-form bias corrections for non-stationary and time-varying cases, and establish the discounted-horizon analog. A Bellman recursion for the covariance regret functional connects the result to standard reinforcement learning algorithms; for rolling-window policies, the estimation-error bias is $O(d/w)$. The decomposition has direct implications for algorithmic auditing in strategic environments: in platform mechanism design, it provides a welfare-based audit metric without access to the agent's private type; in repeated games, covariance reduction is a sufficient condition for policy improvement; in procurement and ad auctions, the bias correction quantifies welfare loss from strategic misreporting. The associated trajectory estimator is consistent, asymptotically normal with HAC variance, and computable in $O(T \cdot nd)$ time. This makes the proposed approach a tractable, model-free audit tool for platform mechanisms, algorithmic portfolio strategies, and any sequential decision system subject to external performance review.

2606.08586 2026-06-09 q-fin.ST 新提交

Cross-sectional topological anomaly scores and intraday return predictability in the S&P 500: A BallMapper, decoder-conditional VAE, and Function-on-Function regression approach

标普500中的横截面拓扑异常分数与日内收益可预测性:BallMapper、解码器条件变分自编码器和函数对函数回归方法

Krzysztof Ozimek

AI总结 本文构建基于市场拓扑结构和横截面同行背景的股票级拓扑异常分数,通过函数对函数回归验证其对标普500成分股日内收益曲线的预测能力,发现异常分数具有累积、反转和分布式的预测特征。

详情
Comments
25 pages, 1 figure, 8 tables
AI中文摘要

金融时间序列中的异常检测方法通常对可观测数据中的统计异常观测值进行评分,而非对共同运动潜在结构中拓扑上不符合预期的持续偏差进行评分。本研究构建了一个股票级拓扑异常分数,该分数同时以市场级拓扑结构和横截面同行背景为条件,并检验其历史是否包含对收益曲线的预测内容。使用Takens延迟嵌入对十只流动性强的标普500成分股(2025年4月至2026年3月)的日内数据进行嵌入,通过BallMapper进行图构建,并由三种解码器条件变分自编码器变体进行评分。预测内容通过惩罚函数对函数回归进行评估,并在所有资产、日内柱频率和评分变体中得到确认,揭示了一致的时间特征——收益影响的逐渐累积、其方向的频繁早期反转,以及广泛分布的、偏向近期异常历史的预测内容。反转发生的时间取决于市场状态;异常历史对预测的贡献均匀程度取决于柱频率。

英文摘要

Anomaly detection methods in financial time series score statistically unusual observations in observable data, not topologically misexpected persistent deviations in the latent structure of co-movement. This study constructs a stock-level topological anomaly score jointly conditioned on market-level topological structure and cross-sectional peer context, and tests whether its history carries predictive content for return curves. Intraday data for ten liquid S&P 500 constituents (April 2025--March 2026) are embedded via Takens delay embedding, graphed by BallMapper, and scored by three decoder-conditional variational autoencoder variants. Predictive content is assessed by penalised function-on-function regression and confirmed across all assets, intraday bar frequencies, and scoring variants, revealing a consistent temporal fingerprint -- gradual accumulation of return impact, a frequent early reversal of its direction, and broadly distributed predictive content weighted toward recent anomaly history. When the reversal occurs depends on market regime; how evenly the anomaly history contributes to prediction depends on bar frequency.

2606.08569 2026-06-09 q-fin.PM 新提交

Stock Investment: The p-index Approach

股票投资:p-指数方法

Xinzhao Xie, Bopei Nie, Kuo-Ping Chang

AI总结 利用欧式看跌期权构建p-指数风险度量,评估中国上证50指数和美国标普500指数在2018-2023年间的投资策略表现,发现p-指数策略在不同市场条件下优于传统方法。

详情
Comments
arXiv admin note: text overlap with arXiv:2510.11074
AI中文摘要

本文利用欧式看跌期权构建了p-指数风险度量,以评估2018-2023年间中国上证50指数和美国标普500指数不同投资策略的表现。p-指数衡量每保险一美元所需的保险费,以保证资产在指定未来日期至少达到δ收益率。研究发现,采用公平价格策略时,一周和一个月持有期能获得更高收益;在七个经济板块中,材料板块股票的年化收益率最高:一周持有期11.04%,两周持有期11.93%,一个月持有期10.18%。采用一周持有期的动量和反转策略中,p-比率有效反转策略产生了最高的年化收益率(9.97%),其次是p-指数无效动量策略(9.01%)和p-指数有效反转策略(6.48%);使用p-指数的MCIRS方法始终比基于beta的方法获得更高收益;有效(跑赢)股票未能维持其动量,而无效(跑输)股票未表现出均值回归。还发现,p-指数有效反转策略在低情绪(低成交量)状态下表现更优,而p-指数无效动量策略在高情绪(高成交量)时期表现更优。对于2018-2023年间美国标普500指数的500只股票,发现有效股票维持了动量,而无效股票表现出均值回归。p-指数有效动量策略产生了最高的年化收益率(3.69%),其次是p-比率无效反转策略(3.67%)和beta有效动量策略(3.48%)。

英文摘要

This paper has used European put option to construct the p-index risk measure to evaluate the performance of different investment strategies in China's SSE 50 index and the US SP500 index during 2018-2023. The p-index measures the insurance fee for each insured dollar to guarantee that the asset achieves at least a delta rate of return on a specified future date. It is found that with the fair price strategy, one-week and one-month holding periods can earn more, and among seven economic sectors, materials sector stocks generated highest annualized rates of return: 11.04% (one-week period), 11.93% (two-week period) and 10.18% (one-month period). With momentum and contrarian strategies of one-week holding period, the p-ratio-efficient-contrarian strategy produced the highest annualized rate of return (9.97%), followed by the p-index-inefficient-momentum strategy (9.01%) and the p-index-efficient-contrarian strategy (6.48%), the MCIRS method employing the p-index consistently delivered higher returns than its beta-based approach, and efficient (outperforming) stocks failed to sustain their momentum while inefficient (underperforming) stocks exhibited no mean reversion. It is also found that the p-index-efficient-contrarian strategy outperformed in low-sentiment (low-volume) regimes, while the p-index-inefficient-momentum strategy outperformed during high-sentiment (high-volume) periods. For the five hundred stocks of the US S&P 500 index during 2018-2023, it is found that efficient stocks sustained their momentum while inefficient stocks exhibited mean reversion. The p-index-efficient-momentum strategy produced the highest annualized rate of return (3.69%), followed by the p-ratio-inefficient-contrarian strategy (3.67%) and the beta-efficient-momentum strategy (3.48%).

2606.08419 2026-06-09 econ.GN econ.TH q-fin.EC 新提交

The Winner's Bliss in Common-Value Auctions under Horizontal Differentiation

水平差异化下共同价值拍卖中的赢家幸福

Jiawei Chen, Anh Nguyen, Matthew Shum

AI总结 研究水平差异化偏好下共同价值拍卖中的赢家幸福现象,发现信息披露降低卖方收益,且有利选择维持不对称信息下的双边贸易。

详情
AI中文摘要

我们研究了投标人具有水平差异化偏好的共同价值拍卖。在一个特定的两投标人参数化中,获胜向赢家传递了关于物品价值的利好消息,我们称这种现象为赢家幸福,以区别于传统的赢家诅咒。其他含义也与传统分析不同。当投标人的偏好是水平差异化时,信息披露会降低卖方收益,并且有利选择在不对称信息下维持双边贸易。

英文摘要

We study common-value auctions in which bidders have horizontally differentiated preferences. In a specific two-bidder parameterization, winning conveys good news about the object's value to the winner, a phenomenon we call the winner's bliss in contrast to the conventional winner's curse. Additional implications also differ from the conventional analysis. When bidders' preferences are horizontally differentiated, seller revenue is reduced with information disclosure, and advantageous selection sustains bilateral trade under asymmetric information.

2606.08379 2026-06-09 cs.AI cs.CE cs.LG q-fin.CP q-fin.TR 新提交

TT-DAC-PS: Twin-Target Deterministic Actor-Critic with Policy Smoothing for Optimal Trade Execution

TT-DAC-PS:用于最优交易执行的双目标确定性演员-评论家与策略平滑

Ilia Zaznov, Atta Badii, Julian Kunkel, Alfonso Dufour

发表机构 * University of Reading(雷丁大学) University of Göttingen(哥廷根大学) GWDG(哥廷根数据处理中心) Henley Business School(亨利商学院)

AI总结 提出TT-DAC-PS算法,结合双指数移动平均评论家目标、悲观最小备份、TD3风格策略平滑噪声、延迟演员更新和保守Q正则化,以抑制过高估计,并在限价订单簿数据上优于经典和强化学习基线。

详情
Comments
21 pages, 1 figure, 3 tables
AI中文摘要

本研究通过引入TT-DAC-PS(双目标确定性演员-评论家与策略平滑),解决了大规模股票卖单的最优执行问题。该确定性演员-评论家架构结合了双指数移动平均评论家目标与悲观最小备份、TD3风格的目标策略平滑噪声、延迟演员更新以及保守Q正则化,以抑制过高估计。探索使用Ornstein-Uhlenbeck(OU)噪声,并采用混合调度:确定性回合衰减、基于近期奖励离散度的方差引导调整,以及一个可学习并映射到噪声尺度的Soft Actor-Critic(SAC)风格温度。环境整合了Almgren-Chriss(AC)交易影响与限价订单簿(LOB)价格和成交量、归一化状态特征、每步成交量参与上限以及基于效用的奖励。该交易执行算法应用于十只美国股票的LOB数据。性能评估针对强化学习基线算法,包括近端策略优化(PPO)、软演员-评论家(SAC)和优势演员-评论家(A2C),以及替代交易执行算法,包括时间加权平均价格(TWAP)、成交量加权平均价格(VWAP)和AC。所提出的模型持续降低平均实现缺口百分比,并具有竞争性的方差,优于经典基线和标准强化学习基准模型。

英文摘要

This study addresses the optimal execution of large stock sell programs by introducing TT-DAC-PS (Twin-Target Deterministic Actor-Critic with Policy Smoothing), a deterministic actor-critic architecture that combines twin exponential-moving-average critic targets with pessimistic min backup, TD3-style target policy smoothing noise, delayed actor updates, and conservative Q regularisation to curb overestimation. Exploration uses Ornstein-Uhlenbeck (OU) noise with a hybrid schedule: deterministic episode-wise decay, variance-guided adjustment based on recent reward dispersion, and a Soft Actor-Critic (SAC)-style temperature that is learned and mapped to the noise scale. The environment integrates Almgren-Chriss (AC) trade impact with Limit Order Book (LOB) prices and volumes, normalised state features, per-step volume participation caps, and a utility-based reward. The trade execution algorithm is applied to LOB data for ten U.S. stocks. Performance is assessed against reinforcement-learning baseline algorithms, including Proximal Policy Optimisation (PPO), Soft Actor-Critic (SAC), and Advantage Actor-Critic (A2C), as well as alternative trade execution algorithms, including Time-Weighted Average Price (TWAP), Volume-Weighted Average Price (VWAP), and AC. The proposed model consistently reduces mean implementation shortfall percentage with competitive variance, outperforming classical baselines and standard reinforcement-learning benchmark models.

2606.08285 2026-06-09 cs.AI cs.CE q-fin.CP q-fin.TR 新提交

Beyond Agent Architecture: Execution Assumptions and Reproducibility in LLM-Based Trading Systems

超越智能体架构:基于LLM的交易系统中的执行假设与可复现性

Junyi Yao, Zihao Zheng

AI总结 本文通过审计30项相关研究,发现LLM交易研究中执行假设报告不足,导致结果难以比较,提出需建立执行现实性、可复现性和评估可比性的报告标准。

详情
AI中文摘要

大型语言模型(LLM)和智能体系统越来越多地被用于金融交易,但其报告的性能仍然难以比较,因为研究在数据来源、时间分割纪律、执行时机、周转处理和交易成本建模方面存在差异。本文对基于LLM的交易研究中的执行现实性进行了有针对性的主题回顾和可复现性审计。一个包含30项交易相关主要研究的编码证据矩阵用于评估时点控制、分割透明度、保留评估、成本和周转处理、执行语义、宇宙定义和工件发布。在审计样本中,架构报告通常比判断交易结果是否经济可解释或可复现所需的评估假设更清晰。一个包含10只股票的工作示例仅作为方法学框架,以说明明确的摩擦和时机选择如何实质性地压缩主动策略结果。主要结论是,LLM交易研究的下一步有用进展不仅是更好的智能体设计,还包括更清晰的执行现实性、可复现性和评估可比性的报告标准。

英文摘要

Large language models (LLMs) and agentic systems are increasingly proposed for financial trading, yet their reported performance remains difficult to compare because studies vary in data provenance, temporal split discipline, execution timing, turnover treatment, and transaction-cost modeling. This article presents a targeted topical review and reproducibility audit of execution realism in LLM-based trading research. A coded evidence matrix covering 30 trade-relevant primary studies is used to assess point-in-time controls, split transparency, held-out evaluation, cost and turnover treatment, execution semantics, universe definition, and artifact release. Across the audited sample, architecture reporting is generally clearer than the evaluation assumptions needed to judge whether a trading result is economically interpretable or reproducible. A 10-equity worked example is included only as a methodological scaffold to illustrate how explicit friction and timing choices can materially compress active-strategy results. The main conclusion is that the next useful step for LLM trading research is not only better agent design, but also clearer reporting standards for execution realism, reproducibility, and evaluation comparability.

2606.08283 2026-06-09 q-fin.PM q-fin.TR 新提交

Macro Economists in the Machine: A Multi-Agent LLM Framework for Commodity-Related ETF Portfolio Construction

机器中的宏观经济学家:用于商品相关ETF投资组合构建的多智能体LLM框架

Yiqing Wang, Dehao Dai, Ding Ma, Kerui Geng

AI总结 研究固定信息集和规则下,LLM在商品投资组合构建中能否增加价值。三种LLM策略(鹰派、鸽派、辩论)在夏普比率上优于规则代理,鹰派和辩论代理增益显著,但辩论代理未超越最佳单一代理,其贡献在于偏差校正。

详情
Comments
45 pages, 4 figures
AI中文摘要

我们测试了当信息集和执行规则在各策略间固定时,大型语言模型(LLM)是否能在商品投资组合构建中增加价值。一个鹰派代理(通胀紧缩先验)、一个鸽派代理(增长宽松先验)、一个辩论代理以及一个确定性z-score规则代理各自接收相同的FRED宏观z-score,并通过相同的投资组合引擎传递其倾斜信号。在跨越2023年美国利率峰值和2024-2025年软着陆的124个每周再平衡日期中,所有三种LLM策略在夏普比率上均优于规则代理;鹰派和辩论代理获得了最大的增益(Δ夏普比率 = +0.044和+0.040,在块自助法下p < 0.10),并在单向交易成本高达30个基点时保持相对于被动逆波动率基准的净成本优势,而规则代理相对于被动的微薄优势在大约5个基点时消失。辩论代理并未优于最佳单一代理(Δ夏普比率 = -0.004,p = 0.769);其贡献在于偏差校正——平均掉鸽派代理校准错误的先验——而非讨论产生的回报。性能优势集中在软着陆子时期,评估窗口跨越单个利率周期,报告的p值未针对多重比较进行调整。在这些限制内,结果表明,作为受约束的宏观解释函数的LLM可以在透明的规则层之上增加适度但具有经济意义的价值,尽管这一幅度很小,且其在该样本之外的持续性未知。

英文摘要

We test whether large language models (LLMs) add value in commodity portfolio construction when the information set and implementation rules are held fixed across strategies. A Hawkish Agent (inflation-tightening prior), a Dovish Agent (growth-easing prior), a Debate Agent, and a deterministic z-score Rule Agent each receive identical FRED macro z-scores and route their tilt signals through the same portfolio engine. Across 124 weekly rebalancing dates spanning the 2023 U.S. rate peak and the 2024-2025 soft landing, all three LLM strategies outperform the Rule Agent in Sharpe terms; the Hawkish and Debate Agents record the largest gains (ΔSharpe = +0.044 and +0.040, both p < 0.10 under a block bootstrap) and preserve a net-of-cost advantage over the passive inverse-volatility benchmark at one-way trading costs up to 30 basis points, while the Rule Agent's thin margin over passive disappears at approximately 5 basis points.The Debate Agent does not outperform the best single agent (ΔSharpe = -0.004, p = 0.769); its contribution is bias correction -- averaging out the Dovish Agent's miscalibrated prior -- rather than deliberation-generated return. The performance advantage is concentrated in the soft-landing sub-period, the evaluation window spans a single rate cycle, and the reported $p$-values are unadjusted for multiple comparisons. Within these limits, the results suggest that an LLM acting as a constrained macro-interpretation function can add modest but economically meaningful value over a transparent rule layer, though the margin is small and its persistence beyond this sample is unknown.

2606.08232 2026-06-09 q-fin.TR q-fin.CP q-fin.RM q-fin.ST 新提交

Hour-Aware Adaptive Risk Management for Autonomous Memecoin Trading: A Multi-Layer Intelligence Framework

小时感知的自适应风险管理用于自主Memecoin交易:一个多层智能框架

Arati Uday Kamat

AI总结 本文在Solana去中心化交易所上部署自主memecoin交易系统,通过190笔交易样本(40.5%胜率,累计+117.7%)评估小时效应、过滤器精度、脆弱性和已实现收益,发现盈利能力结构脆弱。

详情
Comments
15 pages, 4 figures. Companion paper to RED-2400 (arXiv:2605.12151) and PRFS methodology (arXiv submit/7684836). SSRN abstract ID 6564803. Zenodo concept DOI 10.5281/zenodo.20043302
AI中文摘要

本文在Solana去中心化交易所上对一个自主memecoin交易系统进行了为期15天的纸交易部署,测量了小时效应、过滤器精度、脆弱性和已实现收益。190笔交易样本(2026年3月29日至4月12日)显示胜率40.5%,每笔交易平均回报+0.62%,累计+117.7%(净SOL +0.039),偏度-1.21,超额峰度6.61。对三个表现最差的UTC小时(2、13、23)与其他小时进行Mann-Whitney U检验,得到U = 1,274,p = 0.22;在n=190时具有方向性但不显著。这三个小时是在样本内选择的,因此比较是探索性的,而非验证性的。一个并行的反事实拒绝追踪系统收集了184个不同拒绝事件中的4,874个前向样本观测值。在这些事件中,17.9%在24小时内从参考点回撤达到50%;26.0%的前向样本记录被拒绝的代币低于半参考点。过滤器堆栈避免了这些已实现回撤:证据表明拒绝标准相对于前向市场结果是净正向的。脆弱性是主要的警告。移除前三笔交易(样本的1.6%)会使累计回报变为亏损。盈利能力依赖于少数大赢家,并且结构上脆弱。数据集和审计脚本已根据CC-BY-4.0存放(Zenodo DOI 10.5281/zenodo.20043302)。

英文摘要

This paper measures hour-of-day effects, filter precision, fragility, and realised yield in a 15-day paper-traded deployment of an autonomous memecoin trading system on Solana decentralised exchanges. The 190-trade sample (March 29 to April 12, 2026) shows a 40.5 percent win rate, mean per-trade return of +0.62 percent, cumulative +117.7 percent (net SOL +0.039), skewness -1.21, excess kurtosis 6.61. A Mann-Whitney U test of three poorest-performing UTC hours (2, 13, 23) against the others yields U = 1,274, p = 0.22; directional but not significant at n = 190. The three hours were selected in-sample, so the comparison is exploratory, not confirmatory. A parallel counterfactual rejection-tracking system collected 4,874 forward-sample observations across 184 distinct rejection events. Of those events, 17.9 percent reached a 50 percent drawdown from reference within 24 hours; 26.0 percent of forward samples recorded the rejected token below half-reference. The filter stack avoided these realised drawdowns: evidence that the rejection criteria are net-positive against forward-market outcomes. Fragility is the principal caveat. Removing the top three trades (1.6 percent of sample) flips cumulative return unprofitable. Profitability rests on a small number of large winners and is structurally fragile. The dataset and audit script are deposited under CC-BY-4.0 (Zenodo DOI 10.5281/zenodo.20043302).

2606.08228 2026-06-09 q-fin.TR cs.LG q-fin.CP q-fin.ST 新提交

Post-Rejection Follow-up Sampling: A Methodology for Counterfactual Outcome Measurement in Algorithmic DEX Trading

拒绝后跟踪采样:算法DEX交易中反事实结果测量的一种方法

Arati Uday Kamat

AI总结 提出拒绝后跟踪采样(PRFS)方法,通过独立跟踪子系统采样被拒绝代币的价格和流动性,以评估过滤器精度,数据集包含2997个拒绝事件的67000条观测记录。

详情
Comments
12 pages. Companion methodology paper to RED-2400 (arXiv:2605.12151). Currently under review at Ledger. SSRN abstract ID 6607301. Zenodo concept DOI 10.5281/zenodo.20043516
AI中文摘要

去中心化交易所(DEX)上的算法交易系统拒绝了它们评估的大多数候选代币。被拒绝候选代币的反事实结果(如果系统进入会发生什么)很少被测量。本文介绍了拒绝后跟踪采样(PRFS)。一个独立的跟踪子系统以可配置的频率对每个被拒绝代币的价格和流动性进行采样,时间跨度长达二十四小时。PRFS提供了评估过滤器精度所需的数据,这些数据基于被拒绝候选代币的实际市场结果,而不是基于合成的回测重建。方法论、数据架构和存款格式在第三节中描述。配套数据集包含2997个拒绝事件的67000个前向结果观测行,涵盖457个独特的铸币厂,在连续八天的时间窗口内收集(2026-04-10至2026-04-19,UTC)。大约55%的拒绝事件至少有一个前向观测;铸币厂级别的覆盖是完整的。下游分类的主要约束是每个事件的时间密度,而不是事件级别的覆盖。PRFS是数据集无关的。它适用于任何拒绝次数大大超过执行次数的算法决策系统。

英文摘要

Algorithmic trading systems on decentralised exchanges (DEXs) reject most candidate tokens they evaluate. The counterfactual outcome of rejected candidates (what would have happened had the system entered) is rarely measured. This paper introduces Post-Rejection Follow-up Sampling (PRFS). A separate tracking subsystem samples each rejected token's price and liquidity at a configurable cadence, over a horizon of up to twenty-four hours. PRFS produces the data needed to evaluate filter precision against actual market outcomes of rejected candidates, not against synthetic backtest reconstructions. The methodology, data architecture, and deposit format are described in Section III. The companion dataset contains 67,000 forward-outcome observation rows across 2,997 rejection events spanning 457 unique mints, collected over a continuous eight-day window (2026-04-10 to 2026-04-19, UTC). Approximately 55 percent of rejection events receive at least one forward observation; coverage at the mint level is complete. The principal binding constraint on downstream classification is per-event horizon density, not event-level coverage. PRFS is dataset-independent. It generalises to any algorithmic decision system in which rejections substantially outnumber executions.

2606.08209 2026-06-09 q-fin.MF 新提交

Markets Are Not Random, They Are Hard to Predict

市场并非随机,它们难以预测

Miquel Noguer i Alonso

AI总结 本文论证金融市场并非本体论意义上的随机,而是因果经济系统,其预测困难源于隐藏原因、策略反馈、容量约束和规律变化,通过分离无套利、信息效率和可剥削性,利用Doob分解、熵度量等工具统一解释预测难度。

详情
AI中文摘要

金融收益常被称为“随机”,但这个词混淆了本体论机会、认知无知、策略反馈和模型不稳定性。本文认为,金融市场并非像量子测量那样在本体论意义上是随机的。它们是因果经济系统,其未来难以预测,因为相关原因被隐藏、观察成本高、被策略性使用、容量受限,有时还受变化规律支配。金融的形式语言已经编码了这一区别。价格存在于过滤概率空间,因为代理人拥有部分信息;衍生品在风险中性测度 $\Q \ne \Prob$ 下定价,因为定价是工具性的测度变换,而非关于真实数据生成规律的陈述;无套利给出等价定价测度下的鞅性,而非在每一个真实世界信息集下的完全预测失败。本文分离了无套利、信息效率和净可剥削性;使用Doob分解将风险补偿的可预测漂移与鞅创新分开;添加了容量和生存层,解释为什么正信号不一定可扩展;将 $\Prob$-$\Q$ 楔子与随机贴现因子几何和相对熵联系起来;形式化了过滤充分性、模型选择景观风险和干预稳定因果性;并将反身性、微观结构和奈特不确定性连接到统一的熵账本。因此,严谨的论点并非市场不可知,也非它们字面意义上的随机。市场难以预测,而最难预测之处正是预测成本高、竞争激烈、自我挫败、容量有限或被制度变迁所否定之处。

英文摘要

Financial returns are often called ``random,'' but the word conflates ontic chance, epistemic ignorance, strategic feedback, and model instability. This essay argues that financial markets are not random in the ontic sense in which a quantum measurement is random. They are causal economic systems whose future is hard to predict because relevant causes are hidden, costly to observe, strategically used, capacity constrained, and sometimes governed by a changing law. The formal language of finance already encodes this distinction. Prices live on filtered probability spaces because agents have partial information; derivatives are priced under a risk-neutral measure $\Q\ne\Prob$ because pricing is an instrumental change of measure rather than a statement about the real data-generating law; and no-arbitrage gives martingality under an equivalent pricing measure, not full predictability failure under every real-world information set. The paper separates no-arbitrage, informational efficiency, and net exploitability; uses the Doob decomposition to isolate risk-compensated predictable drift from martingale innovation; adds a capacity-and-survival layer explaining why positive signals need not be scalable; relates the $\Prob$--$\Q$ wedge to stochastic-discount-factor geometry and relative entropy; formalises filtration sufficiency, model-selection landscape risk, and intervention-stable causality; and connects reflexivity, microstructure, and Knightian ambiguity to a unified entropy ledger. The disciplined thesis is therefore not that markets are unknowable, nor that they are literally random. Markets are hard to predict, and hardest exactly where prediction is costly, competitive, self-defeating, capacity limited, or invalidated by regime change.

2606.08207 2026-06-09 econ.GN q-fin.EC 新提交

Opportunity-Normalized Residence-Workplace Matching and the Scale-Sensitive Structure of Urban Commuting

机会归一化的居住-工作匹配与城市通勤的尺度敏感结构

Mingzhi Xiao, Yuki Takayama

AI总结 通过机会归一化方法比较实际通勤距离分布与基于城市机会结构的分布,发现匹配强度随距离衰减,并呈现尺度敏感的城市特定规律。

详情
AI中文摘要

城市空间结构通常通过家庭和工作地点的空间分布或总体通勤结果来评估。然而,这些方法并未揭示城市形态创造的机会如何被选择性地转化为实际的居住-工作连接。本研究引入了机会归一化的居住-工作匹配,通过将观察到的通勤距离分布与基于所有城市内居住-工作对(按居住和就业质量加权)构建的机会分布进行比较。利用英国九个城市的输出区域级数据,我们表明实际配对在较短距离上比城市机会结构单独预测的更集中。归一化后,匹配强度随距离衰减,呈现重复但异质的模式,在许多城市中近似对数-对数空间中的线性关系,并可通过城市特定的距离衰减系数概括。伦敦进一步揭示了这种规律是尺度敏感的:一个相对平坦的城市范围模式在就业中心子系统间分离为一致负向但异质的关系。来自纽约和芝加哥的补充证据显示了类似的衰减模式。这些发现将实际的居住-工作匹配识别为城市结构的一个独特层次,并表明在复杂的大都市系统中,有意义的空间规律可能存在于连贯的匹配场中,而非总体城市边界内。

英文摘要

Urban spatial structure is commonly evaluated through the spatial distribution of homes and jobs or through aggregate commuting outcomes. Yet these approaches do not reveal how the opportunities created by urban form are selectively transformed into actual residence-workplace connections. This study introduces opportunity-normalized residence-workplace matching by comparing observed commuting distance distributions with opportunity-based distributions constructed from all within-city residence-workplace pairs weighted by residential and employment mass. Using Output Area-level data for nine British cities, we show that realized pairings are systematically more concentrated at shorter distances than the urban opportunity structure alone would predict. After normalization, matching intensity declines with distance in a recurrent but heterogeneous pattern that is approximately linear in log-log space in many cities and can be summarized by a city-specific distance-decay coefficient. London further reveals that this regularity is scale-sensitive: a comparatively flattened citywide pattern separates into consistently negative but heterogeneous relationships across employment-centered subsystems. Supplementary evidence from New York and Chicago shows similar attenuation patterns. These findings identify realized residence-workplace matching as a distinct layer of urban structure and suggest that, in complex metropolitan systems, meaningful spatial regularities may reside in coherent matching fields rather than in aggregate city boundaries.

2606.08141 2026-06-09 econ.EM q-fin.GN 新提交

A Structural Matrix Autoregressive Model for the Joint Dynamics of Volume, Volatility, and Returns

成交量、波动率与收益联合动态的结构矩阵自回归模型

Andrea Bucci, Giulio Palomba, Eduardo Rossi

AI总结 提出结构矩阵自回归模型,在大维度下联合分析资产收益、已实现波动率和交易量,通过参数化简约性捕捉动态溢出和截面依赖,实证发现波动率驱动交易活动,长期跨资产溢出解释超50%成交量变化。

详情
AI中文摘要

本文提出了一种结构矩阵自回归(SMAR)模型,用于在大维度环境下联合分析资产收益、已实现波动率和交易量。该框架同时捕捉金融变量之间的动态溢出效应和资产之间的截面依赖性,同时相对于传统向量自回归模型保持了简约的参数化。该模型基于道琼斯工业平均指数成分股在2021-2025年期间的日数据进行估计,并通过与混合分布假设和有效市场理论一致的约束进行结构识别。实证结果表明,波动率是交易活动的主要驱动因素,表明信息冲击主要通过价格波动纳入市场。预测误差方差分解进一步揭示,尽管内部冲击主导短期成交量动态,但在更长时期内,跨资产溢出效应解释了超过50%的交易量变化。最后,围绕FOMC公告的事件研究分析支持了所提出的分解,识别出公告日交易活动的信息成分显著增加,随后快速均值回归。

英文摘要

This paper proposes a Structural Matrix Autoregressive (SMAR) model for the joint analysis of asset returns, realized volatility, and trading volume in a large-dimensional setting. This framework simultaneously captures dynamic spillovers across financial variables and cross-sectional dependence across assets while preserving a parsimonious parameterization relative to conventional vector autoregressive models. The model is estimated on daily data for the constituents of the Dow Jones Industrial Average over the period 2021-2025 and is structurally identified through restrictions consistent with the Mixture of Distributions Hypothesis and efficient market theory. The empirical findings indicate that volatility is the primary driver of trading activity, suggesting that informational shocks are predominantly incorporated into markets through price variability. Forecast error variance decompositions further reveal that, although internal shocks dominate short-term volume dynamics, cross-asset spillovers account for more than 50% of trading volume variation at longer horizons. Finally, an event-study analysis around FOMC announcements supports the proposed decomposition by identifying significant increases in the informative component of trading activity on announcement days followed by rapid mean reversion.

2606.07727 2026-06-09 quant-ph cs.CL math.OC q-fin.PM 新提交

Benchmarking Quantum Algorithmic Resilience for CVaR Portfolio Optimization: The Expressibility-Coherence Trade-off

面向CVaR投资组合优化的量子算法韧性基准测试:可表达性-相干性权衡

Prashik N. Somkuwar, K. Srinivasan, G. Raghavan

AI总结 针对混合均值方差与条件风险价值投资组合优化,对比硬件高效变分量子神经网络与热启动量子近似优化算法,揭示NISQ设备上算法可表达性与硬件相干性之间的关键权衡。

详情
Comments
10 pages, 11 figures. Master's thesis research conducted at the School of Quantum Technology, Defence Institute of Advanced Technology (DIAT), Pune
AI中文摘要

量子组合优化为复杂金融建模提供了理论优势,但在噪声中等规模量子(NISQ)设备上的物理实现受到硬件拓扑的严重限制。本研究针对混合均值方差与条件风险价值(CVaR)投资组合目标,对硬件高效变分量子神经网络(HE-VQNN)和热启动量子近似优化算法(WS-QAOA)进行了硬件基准测试分析。通过实现一种新颖的经典量子混合代理矩阵来绕过CVaR辅助量子比特瓶颈,我们将NIFTY 50指数中多达16个资产映射到IBM heavy hex处理器上。我们系统地量化了算法对路由过程中产生的“SWAP代价”的韧性。实证结果揭示了一个关键的操作权衡:WS-QAOA提供了精确的理论映射,但由于指数级的非局部门开销而遭受灾难性的硬件退相干。相反,HE-VQNN保持了硬件相干性,但缺乏捕捉密集尾部风险资产相关性的数学可表达性。本研究揭示了当前架构下密集金融优化的局限性,迫使在算法不可表达性与硬件退相干之间做出不可行的选择。这指示了在缺乏全连接性的NISQ计算机上能做什么和不能做什么的更深层限制。

英文摘要

Quantum combinatorial optimization offers theoretical advantages for complex financial modeling, but physical implementation on Noisy Intermediate Scale Quantum (NISQ) devices is severely constrained by hardware topology. This study presents a hardware benchmarking analysis between a Hardware Efficient Variational Quantum Neural Network (HE-VQNN) and the Warm Start Quantum Approximate Optimization Algorithm (WS-QAOA) for a hybrid Mean Variance and Conditional Value at Risk (CVaR) portfolio objective. By implementing a novel classical quantum hybrid proxy matrix to bypass the CVaR auxiliary qubit bottleneck, we map up to 16 assets from the NIFTY 50 index onto an IBM heavy hex processor. We systematically quantify algorithmic resilience to the "SWAP tax" incurred during routing. Empirical results reveal a critical operational trade-off: WS-QAOA provides exact theoretical mapping but suffers catastrophic hardware decoherence due to exponential nonlocal gate overhead. Conversely, HE-VQNN preserves hardware coherence but lacks the mathematical expressibility to capture dense tail risk asset correlations. This study exposes the limitations of dense financial optimization on current architectures forces an nonviable choice between algorithmic inexpressibility and hardware decoherence. This is indicative of a deeper limitation as to what can and cannot be done with NISQ computers lacking in all-to-all connectivity.

2606.07575 2026-06-09 q-fin.RM cs.LG 新提交

Forward-Looking Stress Testing Under Macro Scenarios: Stable SVaR Estimation Using a Hybrid GPR-HS Framework with SACS

宏观情景下的前瞻性压力测试:基于混合GPR-HS框架与SACS的稳定SVaR估计

Ujjwala Vadrevu

AI总结 本文扩展混合高斯过程回归历史模拟框架至前瞻性压力情景,提出情景平均协方差稳定方法,在三种宏观情景下实现稳定的压力在险价值估计,满足监管要求。

详情
Comments
15 pages, 3 figures. Extension of a hybrid GPR-HS framework to forward-looking stress testing with scenario-based SVaR and covariance stabilization (SACS)
AI中文摘要

监管压力测试框架,包括全面资本分析与审查(CCAR)和内部资本充足评估程序(ICAAP),要求在前瞻性宏观经济情景下进行稳健的压力在险价值(SVaR)估计。传统的参数化方法在极端冲击下常表现出数值不稳定性,降低了资本预测的可靠性。\n本文将Vadrevu(2026)的混合高斯过程回归历史模拟(GPR-HS)框架扩展到前瞻性压力情景,展示了在三种情景(西亚战争、气候风险和AI泡沫/监管)下的稳定性。\n一个关键贡献是情景平均协方差稳定(SACS)框架,它将压力协方差构建为历史危机情景的加权聚合,提供稳定且可解释的依赖结构。压力收益路径通过确定性漂移和随机残差在252天的时间跨度内生成,而波动率通过具有激进噪声初始化(ANI)的高斯过程回归建模。\n该框架在所有资产和情景下表现出一致的收敛性。SVaR范围从-2.1020%到-2.2231%,并且保持了|SES| > |SVaR|的一致性属性。结果支持GPR-HS与SACS作为CCAR和ICAAP应用中前瞻性SVaR和SES估计的稳定且符合监管要求的方法。

英文摘要

Regulatory stress testing frameworks, including the Comprehensive Capital Analysis and Review (CCAR) and the Internal Capital Adequacy Assessment Process (ICAAP), require robust Stressed Value-at-Risk (SVaR) estimation under forward-looking macroeconomic scenarios. Traditional parametric approaches often exhibit numerical instability under extreme shocks, reducing the reliability of capital projections. This paper extends the Hybrid Gaussian Process Regression Historical Simulation (GPR-HS) framework of Vadrevu (2026) to forward-looking stress scenarios, demonstrating stability across three regimes: West Asia War, Climate Risk, and AI Bubble/Regulation. A key contribution is the Scenario-Averaged Covariance Stabilization (SACS) framework, which constructs stress covariance as a weighted aggregation of historical crisis regimes, providing stable and interpretable dependence structures. Stressed return paths are generated over a 252-day horizon using deterministic drift and stochastic residuals, while volatility is modeled via Gaussian Process Regression with Aggressive Noise Initialization (ANI). The framework exhibits consistent convergence across all assets and scenarios. SVaR ranges from -2.1020% to -2.2231%, with the coherence property |SES| > |SVaR| preserved. The results support GPR-HS with SACS as a stable and regulator-aligned approach for forward-looking SVaR and SES estimation in CCAR and ICAAP applications.

2606.06651 2026-06-09 econ.GN q-fin.EC 版本更新

Temporal Dynamics of Development Aid in Africa: Evidence from a Staggered Difference-in-Differences Study of China and World Bank Projects

非洲发展援助的时间动态:基于中国和世界银行在非洲项目的交错双重差分研究证据

Mattias Antar, Adel Daoud, Connor T. Jerzak

AI总结 利用2002-2013年35个非洲国家2166个DHS集群的面板数据,采用交错处理设计下的switcher-stayer估计器,发现项目选址具有选择性,传统TWFE高估了效果,而世界银行和中国项目的积极影响集中在特定部门。

详情
Comments
57 pages
AI中文摘要

关于援助有效性的次国家级研究通常依赖重复横截面或夜间灯光数据,这使得难以将局部处理效应与基线差异区分开来,并可能偏向于基础设施密集型项目。我们通过研究世界银行和中国在非洲的发展项目来解决这些局限性,使用了2002年至2013年间35个国家的2166个DHS集群的平衡面板数据。地理编码的AidData项目与卫星推算的国际财富指数估计值(一种以家庭为中心的物质生活水平衡量指标)相关联。我们比较了传统的双向固定效应事件研究与de Chaisemartin和D'Haultfoeuille提出的switcher-stayer估计量,后者避免了交错处理时间下的污染比较。处理前诊断显示,项目选址经常具有选择性:后来接受项目的集群在处理开始前往往处于较弱的相对位置。因此,TWFE通常暗示比首选交错处理设计所支持的更大的处理后收益。在dCdH下,证据变得更加选择性和部门特定。对于世界银行,积极的证据在卫生部门最强,而教育部门显示出积极但识别不够清晰的收益。对于中国,供水和卫生以及其他社会基础设施和服务与当地财富呈正相关,尽管仍存在残留的选择性问题。相比之下,中国的能源发电和供应在TWFE下显示出强烈的正面效果,但在dCdH下几乎为零。总体而言,结果不支持任何援助方普遍改善当地财富的说法。相反,估计的效果集中在有限的捐赠者-部门面板中,并且强烈依赖于如何处理处理时间、选择和结果测量。

英文摘要

Subnational studies of aid effectiveness often rely on repeated cross-sections or nighttime lights, making it difficult to separate local treatment effects from baseline differences and potentially favoring infrastructure-heavy projects. We address these limitations by studying World Bank and Chinese development projects in Africa with a balanced panel of 2,166 DHS clusters across 35 countries from 2002 to 2013. Geocoded AidData projects are linked to satellite-imputed International Wealth Index estimates, a household-centered measure of material living standards. We compare a conventional two-way fixed effects (TWFE) event-study with the switcher--stayer estimator of de Chaisemartin and D'Haultfoeuille (dCdH), which avoids contaminated comparisons under staggered treatment timing. Pre-treatment diagnostics show that project placement is frequently selective: clusters that later receive projects often begin from weaker relative positions before treatment onset. Consequently, TWFE often implies larger post-treatment gains than the preferred staggered-treatment design supports. Under dCdH, the evidence becomes more selective and sector-specific. For the World Bank, positive evidence is strongest in Health, while Education shows positive but less cleanly identified gains. For China, Water Supply and Sanitation and Other Social Infrastructure and Services show positive associations with local wealth, although residual selection concerns remain. By contrast, Chinese Energy Generation and Supply appears strongly positive under TWFE but falls close to zero under dCdH. Overall, the results do not support a donor-wide claim that either the World Bank or China uniformly improves local wealth. Instead, estimated effects are concentrated in a limited set of donor--sector panels and depend strongly on how treatment timing, selection, and outcome measurement are handled.

2606.04217 2026-06-09 cs.CE q-fin.ST q-fin.TR 版本更新

Polymarket-v1 Database

Polymarket-v1 数据库

Boka Qin, Rui Yang

AI总结 本文介绍 Polymarket-v1 数据库,包含 Polymarket 第一代 CTF 交易所的完整链上交易记录,利用区块链结算层提供的真实攻击方向,评估标准微观结构工具并揭示分类误差对下游指标的影响。

详情
Comments
35 pages, 17 figures, 15 tables. Dataset available at https://huggingface.co/datasets/TimeSeventeen/Polymarket-v1
AI中文摘要

我们介绍了 Polymarket-v1 数据库:Polymarket 在 Polygon 上的第一代 CTF 交易所的完整链上交易档案,时间跨度从 2022-11-21 到 2026-04-28,涵盖从首次结算到自然终止的完整合约生命周期。该数据集包含 13 亿笔交易记录,覆盖 130 万个市场,名义交易量达 610 亿美元。其显著特征是 100% 真实攻击方向来源于区块链结算层,这是现有预测市场档案(依赖启发式推断)所不具备的特性。我们利用这个与真实对齐的档案来基准测试标准微观结构工具,并记录三个发现。首先,Tick 规则和批量交易量分类实现了接近随机的总体准确率(49.83% 和 50.51%),但这掩盖了由正向交易方向自相关和集中做市驱动的系统性、可纠正的价格水平梯度——预测市场的这两个结构性特征违反了经典分类器中的均值回归假设。其次,这些分类误差会传播到下游指标:推断的 VPIN 与真实 VPIN 存在显著差异,OFI 估计存在方向性偏差,对交易成本分析产生实质性影响。第三,真实微观结构质量以分类代理无法恢复的方式预测预测性能:真实 VPIN 正向预测 Brier 分数,而 Gibbs 价差负向预测 Brier 分数——这反映了一个选择效应,即高利差利基市场吸引的是知情专家而非噪声交易者。用分类代理替换真实指标会减弱这两种关系,说明交易层面的测量准确性是可靠推断预测市场设计和概率校准的前提。

英文摘要

We introduce the Polymarket-v1 Database: the complete on-chain trade archive of Polymarket's first-generation CTF Exchange on Polygon, spanning 2022-11-21 to 2026-04-28 and covering the full contract lifecycle from first settlement to natural termination. The dataset comprises 1.20 billion trade records across 1.30 million markets with $61 billion in nominal volume. Its defining feature is 100% ground-truth aggressor direction derived from the blockchain settlement layer, a property unavailable in existing prediction market archives, which rely on heuristic inference. We use this truth-aligned archive to benchmark standard microstructure tools and document three findings. First, the tick rule and bulk volume classification achieve near-random aggregate accuracy (49.83% and 50.51%), but this masks a systematic, correctable price-level gradient driven by positive trade direction autocorrelation and concentrated market-making -- two structural features of prediction markets that violate the mean-reversion assumption embedded in classical classifiers. Second, these classification errors propagate into downstream metrics: inferred VPIN diverges substantially from ground-truth VPIN, and OFI estimates are directionally biased, with material consequences for Transaction Cost Analysis. Third, ground-truth microstructure quality predicts forecasting performance in ways that classification-based proxies cannot recover: True VPIN positively predicts Brier scores, while Gibbs spread negatively predicts them -- a selection effect reflecting that high-spread niche markets attract informed specialists rather than noise traders. Replacing ground-truth metrics with classified proxies attenuates both relationships, illustrating that measurement accuracy at the transaction level is a prerequisite for reliable inference about prediction market design and probability calibration.