arXivDaily arXiv每日学术速递 周一至周五更新
重置
2606.02528 2026-06-02 q-fin.GN cs.CY cs.LG

Auditing Asset-Specific Preferences in Financial Large Language Models: Evidence from Bitcoin Representations and Portfolio Allocation

审计金融大语言模型中的资产特定偏好:来自比特币表征与投资组合配置的证据

Wenbin Wu

AI总结 本研究通过三级审计协议,发现大型语言模型对比特币存在框架依赖的偏好,并识别出模型内部一个可因果干预的比特币选择性特征,该特征能显著影响下游投资组合配置。

详情
Comments
28 pages, 5 figures, 18 tables
AI中文摘要

大型语言模型现已驱动机器人顾问和交易代理,但它们是否对特定资产存在固有偏见尚未得到充分检验。我们提出三个问题:LLMs是否系统性地偏好某些金融工具;能否识别出对这些偏好具有因果杠杆作用的内部表征;以及该表征是否影响下游金融决策。我们开发了一个三级审计协议并将其应用于比特币。首先,对八个前沿LLMs的行为审计显示,比特币在货币类工具中的排名具有框架依赖性:模型将其置于“可靠货币”的第5位(共8位),但在危机和自主代理框架下接近榜首,且属性交换实验确认排名追踪功能属性而非名称。其次,我们打开模型内部:在Gemma 3中搜索数千个稀疏自编码器特征,识别出一个主导的比特币选择性特征。放大该特征会使模型偏向该资产,抑制则使其远离,即使提示中从未出现“比特币”。第三,我们测试金融后果:放大使比特币在投资组合中的份额提高5.2个百分点,而抑制降低4.6个百分点,放大在加密资产内重新分配,抑制则削减总加密敞口。我们将此描述为有界行为杠杆(杠杆指对输出的因果影响,而非金融杠杆):一个可识别的内部特征可被扰动以改变金融选择,但仅在可测量的限度内。该框架将内部表征与外部建议联系起来,并通过随机对照和机制边界进行验证。随着LLMs成为自主金融代理,这是迈向新兴“了解你的代理”(KYA)标准的行为层的第一步:了解代理偏好什么,以及该偏好可被移动多远。

英文摘要

Large language models now power robo-advisors and trading agents, yet whether they carry built-in biases toward specific assets is largely untested. We ask three questions: do LLMs systematically prefer certain financial instruments; can an internal representation with causal leverage over those preferences be identified; and does that representation affect downstream financial decisions? We develop a three-level audit protocol and apply it to Bitcoin. First, a behavioral audit of eight frontier LLMs shows that Bitcoin's ranking among money-like instruments is frame-dependent: models place it around rank 5 of 8 as "reliable money" but near the top under crisis and autonomous-agent frames, and an attribute-swap experiment confirms rankings track functional properties, not names. Second, we open a model's internals: a search across thousands of sparse-autoencoder features in Gemma 3 identifies a dominant Bitcoin-selective feature. Amplifying it shifts the model toward the asset and suppressing it shifts the model away, even when "Bitcoin" never appears in the prompt. Third, we test financial consequences: amplification raises Bitcoin's portfolio share by 5.2 percentage points while suppression lowers it by 4.6 pp, with amplification reallocating within crypto and suppression cutting total crypto exposure. We characterize this as bounded behavioral leverage (leverage meaning causal influence over outputs, not financial leverage): an identifiable internal feature can be perturbed to move financial choices, but only within measurable limits. The framework links internal representations to external recommendations, validated with random controls and mechanism boundaries. As LLMs become autonomous financial agents, this is a first step toward a behavioral layer for emerging know-your-agent (KYA) standards: knowing what an agent prefers, and how far that preference can be moved.

2606.02362 2026-06-02 econ.GN q-fin.EC

Endogenous Fertility Waves and the Dynamics of Utility in an Overlapping Generations Model

内生生育率波动与重叠世代模型中效用的动态

Wolfgang Kuhle

AI总结 本文在新古典重叠世代模型中研究Easterlin假设成立的条件,通过将经济转型映射到效用空间,证明当生育周期出现且子女为正常品时,小规模群体的效用严格高于大规模群体,且该福利不对称性由生育偏好驱动,与黄金律无关。

详情
Comments
14 pages, 1 figure
AI中文摘要

本文研究了在内生资本积累、工资、利率和生育率的新古典重叠世代模型中,Easterlin假设成立的条件。我们开发了一个易于处理的分析框架,通过一个连续可微的一阶差分方程将经济转型映射到群体终身效用的效用空间。这种重新表述允许对非稳态路径进行透明的规范评估,而无需显式求解底层非线性系统。在此框架内,我们证明当生育周期出现且子女为正常品时,小规模群体的效用严格高于大规模群体。关键在于,这种群体福利不对称性是由生育偏好驱动的,且与经济体相对于黄金律的位置无关。

英文摘要

This paper investigates the conditions under which the Easterlin hypothesis holds within a neoclassical overlapping generations model with endogenous capital accumulation, wages, interest rates, and fertility. We develop a tractable analytical framework that maps economic transitions into utility space via a continuously differentiable first-order difference equation for cohort lifetime utilities. This reformulation allows for a transparent normative evaluation of non-steady-state paths without requiring explicit solutions to the underlying nonlinear system. Within this framework, we show that when fertility cycles emerge and children are normal goods, the utility of small cohorts strictly exceeds that of large cohorts. Crucially, this cohort-welfare asymmetry is driven by fertility preferences and is independent of the economy's position relative to the golden rule.

2606.02336 2026-06-02 q-fin.PR

VIX options in Bergomi models

Bergomi模型中的VIX期权

Desen Guo, Dan Pirjol, Lingjiong Zhu

AI总结 研究Bergomi模型下VIX期权在短到期和小波动率波动率条件下的领先阶渐近行为,并给出封闭形式的渐近公式。

详情
Comments
46 pages, 3 figures
AI中文摘要

我们研究了Bergomi模型中VIX期权价格在短到期和小波动率波动率条件下的领先阶渐近行为。考虑了单因子、双因子Bergomi以及N因子模型中的虚值和平值渐近。领先阶渐近以封闭形式获得,并转化为VIX隐含波动率的小到期渐近预测。提供了数值示例以说明封闭形式渐近公式的效率。

英文摘要

We present a study of the leading-order asymptotics for VIX option prices in Bergomi models in the short-maturity and small volatility-of-volatility regimes. Both out-of-the-money (OTM) and at-the-money (ATM) asymptotics are considered for one-factor, two-factor Bergomi and $N$-factor models. The leading-order asymptotics are obtained in closed-form, which are translated into predictions for the small-maturity asymptotics of the VIX implied volatility. Numerical illustrations are provided to illustrate the efficiency of the closed-form asymptotic formulas.

2606.01575 2026-06-02 q-fin.MF

Boom, Bubble, or Buildout? A Multi-Method Evaluation of Whether Artificial Intelligence Is in an Ongoing Financial Bubble

繁荣、泡沫还是建设?——人工智能是否处于持续金融泡沫的多方法评估

Qianan Wang, Zen Chen

AI总结 本文提出混合审查与诊断框架,结合基本面估值、泡沫检测统计检验和叙事分析,评估截至2026年5月人工智能是否处于金融泡沫,结论认为AI是带有局部泡沫动态的真实技术革命。

详情
AI中文摘要

人工智能投资的快速扩张重新引发了金融经济学中的一个反复出现的问题:AI相关资产是否正在经历泡沫,还是市场正在为一种持久的通用技术定价?本文开发了一个混合审查与诊断框架,用于评估截至2026年5月人工智能是否处于持续的金融泡沫。分析从状态价格、随机贴现因子、鞅估值和定价核的资产定价基础开始,然后将这些基础与理性泡沫、行为泡沫、技术狂热和现代计量经济学泡沫检测方法联系起来。当前证据既显示了真实的基本面,也显示了类似泡沫的脆弱性。在基本面方面,已实现的收入增长、企业采用率和生产力证据支持AI估值的相当大一部分。在脆弱性方面,资本支出在某些层面加速超过了观察到的货币化,私人市场估值集中在少数公司,投资者叙事往往在现金流出现之前就将未来生产力收益资本化。本文提出了一个五支柱诊断框架,结合了基本面估值、残余狂热检验、SADF/GSADF爆炸根过程、LPPL/HLPPL价格模式诊断、情绪和发行指标以及资本支出回收期分析。中心结论是,AI最好被理解为一场带有局部泡沫动态的真实技术革命,而不是纯粹的投机狂热或无泡沫的生产力奇迹。

英文摘要

The rapid expansion of artificial intelligence (AI) investment has revived a recurrent question in financial economics: are AI-related assets experiencing a bubble, or is the market capitaliz- ing a durable general-purpose technology? This paper develops a hybrid review and diagnostic framework for evaluating whether AI is in an ongoing financial bubble as of May 2026. The analysis begins from asset-pricing foundations in state prices, stochastic discount factors, martingale valuation, and pricing kernels, then connects these foundations to rational bubbles, behavioral bubbles, technology manias, and modern econometric bubble-detection methods. Current evidence shows both genuine fundamentals and bubble-like fragilities. On the fundamental side, realized revenue growth, enterprise adoption, and productivity evidence support a nontrivial share of AI valuations. On the fragile side, capital expenditure has accelerated faster than observed monetization in some layers, private- market valuations are concentrated in a small number of firms, and investor narratives often capitalize future productivity gains before they have appeared in cash flows. The paper proposes a five-pillar diagnostic framework that combines fundamental valuation, residual-exuberance tests, SADF/GSADF explosive-root procedures, LPPL/HLPPL price-pattern diagnostics, sen- timent and issuance measures, and capex-payback analysis. The central conclusion is that AI is best understood as a real technological revolution with localized bubble dynamics rather than as either a pure speculative mania or a bubble-free productivity miracle.

2606.01356 2026-06-02 q-fin.MF q-fin.CP

A Formally Verified Library of Mathematical Finance in Lean 4

Lean 4 中形式化验证的金融数学库

Raphael Coelho

AI总结 本文在 Lean 4 证明助手中构建了一个涵盖连续时间随机微积分、衍生品定价、风险、投资组合和固定收益理论的广泛金融数学库,通过忠实性审计确保每个定理的声明与实际证明一致。

详情
Comments
7 pages. Lean 4 artifact (Apache-2.0): https://github.com/raphaelrrcoelho/formal-mathfin ; archived at doi:10.5281/zenodo.20477782
AI中文摘要

我们描述了一个在 Lean 4 证明助手中构建的金融数学库,它基于 Mathlib 和 BrownianMotion 包。该库范围广泛:涵盖从连续时间随机微积分的测度论基础到衍生品定价,再到应用风险、投资组合和固定收益理论的十一个领域,包含超过两百个无抱歉定理,据我们所知,这是迄今为止最全面的机器验证的金融数学发展。广度是背景,而非重点。有两件事使其不仅仅是目录。它深入连续理论,足以将 L2 Itô 积分构造为有界线性等距,并推导出风险中性定价测度,而非假设。它还审计自身的忠实性:每个结果都根据其 Lean 陈述与所声称的数学之间的关系进行分类,并且一个构建强制门固定了每个证明实际使用的公理,因此读者可以精确地看到哪些已被证明,哪些仅在附加假设下被证明。我们以一个坦诚的发现结束:经典金融数学的形式化基础产生了已知结果的认证统一,而非新的金融理论。因此,贡献是方法论和基础设施层面的:可重用的验证过的金融数学基础,以及忠实性审计。

英文摘要

We describe a library of mathematical finance built in the Lean 4 proof assistant, on top of Mathlib and the BrownianMotion package. It is broad: more than two hundred sorry-free theorems across eleven areas, from the measure-theoretic foundations of continuous-time stochastic calculus through derivative pricing to applied risk, portfolio, and fixed-income theory, and, to our knowledge, the most comprehensive machine-checked development of mathematical finance to date. Breadth is the setting, not the point. Two things make it more than a catalogue. It reaches into the continuous theory far enough to construct the L2 Itô integral as a bounded linear isometry and to derive, rather than assume, the risk-neutral pricing measure. And it audits its own faithfulness: every result is classified by how its Lean statement relates to the mathematics it claims, and a build-enforced gate pins the axioms each proof actually uses, so a reader can see precisely what has been proved and what has only been proved under added hypotheses. We close with a candid finding: a formal base over classical financial mathematics yields certified unification of known results rather than new financial theory. The contribution is therefore methodological and infrastructural, reusable verified foundations for mathematical finance, together with the faithfulness audit.

2606.01307 2026-06-02 econ.GN q-fin.EC

Tracking the Economy through Firm Creation:Evidence from Real-Time Administrative Data

通过企业创建追踪经济:来自实时行政数据的证据

Anthony Savagar, Yannis Galanakis

AI总结 利用英国公司注册处实时数据(CHRT)捕捉企业每日创建与解散活动,证明企业注册活动领先应税企业诞生并包含就业与产出增长的预测信息,通过结构向量自回归模型(SVAR)发现企业进入的正向冲击会持续增加就业与产出。

详情
AI中文摘要

我们引入了一个新颖的实时数据集——公司注册处实时数据(CHRT),该数据集捕捉了英国注册公司总体的每日创建与解散活动。CHRT提供了企业形成的及时度量,比官方商业人口统计数据早数月可用。我们表明,注册活动领先应税企业诞生,并包含关于就业和产出增长的前瞻性信息。与此一致,结构向量自回归模型(SVAR)表明,企业进入的正向冲击会持续增加就业和产出。

英文摘要

We introduce a novel real-time dataset, Companies House Real-Time (CHRT), that captures daily firm creation and dissolution activity for the full population of UK-registered companies. CHRT provides a timely measure of business formation, becoming available months before official business demography statistics. We show that incorporation activity leads taxable business births and contains forward-looking information about employment and output growth. Consistent with this, a structural vector autoregression (SVAR) indicates that positive shocks to firm entry generate persistent increases in employment and output.

2606.01274 2026-06-02 q-fin.TR cs.CR

Strategic Users in a Priority Queue with Bulk Service on Blockchains

区块链上批量服务的优先队列中的战略用户

Donghwa Seo, Kyoung-Kuk Kim

AI总结 本文通过M/G^K/1优先队列模型分析区块链交易费用,研究用户博弈行为,推导稳态半闭式表达式,并应用于比特币网络模拟及跨链比较。

详情
AI中文摘要

本文通过考虑交易形成优先队列且用户参与排队博弈,分析了区块链上的交易费用。利用M/G^K/1优先队列模型,我们提供了关于交易费用动态及其对用户行为影响的新见解。我们推导了稳态量的半闭式表达式,并将用户延迟成本与交易费用之间的关系推广到一般的区块生成时间。我们将该模型应用于比特币网络,并模拟了各种场景下的用户反应。对比特币、狗狗币和莱特币的跨链分析揭示了归一化成本结构的相似性。

英文摘要

This paper analyzes transaction fees on blockchains by considering that they form a priority queue and users play a queueing game. Using an M/G^K/1 priority queue model, we provide new insights into the dynamics governing transaction fees and their impact on user behavior. We derive semi-closed form expressions for steady-state quantities and extend the relationship between user delay costs and transaction fees to general block generation times. We apply the model to the Bitcoin network and simulate user responses under various scenarios. Cross-chain analysis across Bitcoin, Dogecoin, and Litecoin reveals similarities in normalized cost structures.

2606.01234 2026-06-02 econ.GN cs.CE cs.CV cs.GT cs.LG physics.soc-ph q-fin.EC

Differing Roles of Leisure and Productivity in GDP - A Machine Learning based comparative analysis of Germany and USA

休闲与生产力在GDP中的不同作用——基于机器学习的德国与美国比较分析

Achintya Ranjan, Uma Ranjan

AI总结 本研究通过随机森林模型分析工作时间和全要素生产率对GDP的影响,并利用Gini重要性、SHAP图和部分依赖图揭示德国与美国社会结构差异在GDP贡献中的体现。

详情
Comments
International Conference on Emerging Techniques in Computational Intelligence 2025
AI中文摘要

一个国家的GDP被建模为两个因素之间的相对相互作用——工作时间,反映人口的社会选择,以及全要素生产率,反映对生产力提升因素的集体投资。研究表明,随机森林模型可以从这两个因素准确预测GDP。通过Gini重要性、SHAP图和部分依赖图分析了德国和美国所做的选择差异。结果表明,国家社会结构的差异反映在工作时间和生产率对GDP的相对贡献中。

英文摘要

The GDP of a country is modelled as the relative interaction between two agents - working hours, reflecting the social choice of a population, and Total Factor Productivity, reflecting the collective investment in productivity enhancers. It is shown that a Random Forest model can accu- rately predict the GDP from these two factors. The differences in the choices made by Germany and USA are analysed though Gini importance, SHAP plots and partial dependency. It is shown that the differences in the social structure of the countries are reflected in the relative contribution of working hours and productivity to the GDP.

2606.01131 2026-06-02 cs.CE q-fin.CP

Tokenized but Illiquid? Evidence from Real-World Asset Markets

代币化但缺乏流动性?来自现实世界资产市场的证据

Rischan Mafrur

AI总结 本文通过构建以太坊上非稳定币现实世界资产的月度面板数据,使用换手率、活跃地址等指标衡量流动性,发现代币化与流动性是不同结果,黄金支持代币表现出更广泛的持有者基础和更持久的链上活动。

详情
AI中文摘要

现实世界资产代币化常被视为改善传统非流动性资产流动性的机制。然而,链上表示和二级市场流动性是不同的结果。本文考察代币化的现实世界资产是否表现出有意义的可观察流动性,并识别与更高市场活动相关的代币特征。利用来自RWA.xyz的代币级数据和来自Etherscan的补充合约级观察,本研究构建了基于以太坊的非稳定币现实世界资产的月度面板数据,涵盖三个主要类别:美国国债支持代币、黄金支持商品代币和私人信贷相关代币。流动性使用换手率、活跃地址和活跃月份指标衡量。实证设计结合了描述性统计、非参数组检验和适用于短而稀疏代币历史的探索性面板回归。结果显示各资产类别之间存在显著异质性。黄金支持代币表现出比许多国债和私人信贷相关产品更广泛的持有者基础和更持久的链上活动,而仅凭未偿资产价值无法可靠预测可观察的流动性。本文通过开发更清晰的现实世界资产流动性实证测量框架,并表明代币化和流动性应作为不同结果进行分析,为文献做出贡献。

英文摘要

Real-world asset tokenization is often presented as a mechanism for improving the liquidity of traditionally illiquid assets. However, on-chain representation and secondary-market liquidity are distinct outcomes. This paper examines whether tokenized real-world assets exhibit meaningful observed liquidity and identifies the token characteristics associated with higher market activity. Using token-level data from RWA.xyz and supplemental contract-level observations from Etherscan, the study constructs an Ethereum-based monthly panel of non-stablecoin real-world assets across three prominent categories: U.S. Treasury-backed tokens, gold-backed commodity tokens, and private-credit-related tokens. Liquidity is measured using turnover, active addresses, and an active-month indicator. The empirical design combines descriptive statistics, non-parametric group tests, and exploratory panel regressions suited to short and sparse token histories. The results show substantial heterogeneity across asset categories. Gold-backed tokens exhibit broader holder bases and more persistent on-chain activity than many Treasury and private-credit-related products, while outstanding asset value alone does not reliably predict observed liquidity. The paper contributes to the literature by developing a clearer empirical measurement framework for real-world-asset liquidity and showing that tokenization and liquidity should be analyzed as distinct outcomes.

2606.01122 2026-06-02 cs.LG q-fin.CP

A Per-Component Diagnostic Protocol for Neural HJB-PIDE Solvers under Control-Dependent Lévy Jumps

控制依赖 Lévy 跳跃的神经 HJB-PIDE 求解器的逐分量诊断协议

R. Drissi

AI总结 提出一个五步诊断协议,用于检测残差训练的神经 HJB-PIDE 求解器在控制依赖 Lévy 跳跃下的算子计算错误,并通过 CRRA-Merton-Variance-Gamma 基准案例验证其有效性。

详情
AI中文摘要

我们针对具有控制依赖 Lévy 跳跃的残差训练神经 HJB-PIDE 求解器,提出一个五步诊断协议,旨在解决神经 PDE 方法的一种常见失效模式:学习到的解可能匹配标量诊断指标,但错误计算了其训练损失内部的算子。该协议将每个神经求解与至少一个从零开始的独立参考配对,将哈密顿量分解为漂移、扩散、补偿器和非局部积分分量(在 u 网格上),并在 (t,x) 网格上比较值函数及其低阶导数,然后进行任何 argmax 比较。应用于标准 CRRA-Merton-Variance-Gamma 基准,它隔离了神经方法重要性提议密度中缺失的 1/2 混合因子,该因子将非局部积分恰好缩放了一半——这是常数提议尺度误差的教科书式特征,而更长的训练、网格细化和截断扫描均无法发现。修正该错误后,四个参考解——两个具有不连续离散化的有限差分求解器、神经求解器以及通过 CRRA 齐次性获得的半解析标量基线——在最优控制上达成约 2% 以内的一致。常数系数 CRRA 基准通过齐次性简化为标量最大化,因此标量基线是此处的高效方法;贡献在于该协议,原则上可应用于真正需要神经 HJB-PIDE 求解器的非齐次和高维场景。该案例是更广泛的神经 PDE 验证失效的具体实例:学习到的值或控制的逐点一致可能与系统性错误的非局部算子共存,因此在信任 argmax 策略之前,需要进行逐分量和表面层次的检查。

英文摘要

We propose a five-step diagnostic protocol for residual-trained neural HJB-PIDE solvers with control-dependent Lévy jumps, targeting a general failure mode of neural PDE methods: a learned solution can match headline scalar diagnostics while miscomputing an operator inside its training loss. The protocol pairs each neural solve with at least one from-scratch independent reference, decomposes the Hamiltonian into drift, diffusion, compensator, and nonlocal-integral components across a u-grid, and compares the value function and its low-order derivatives over a (t,x) grid before any argmax comparison. Applied to a standard CRRA-Merton-Variance-Gamma benchmark, it isolates a missing 1/2-mixture factor in the neural method's importance-proposal density that scaled the nonlocal integral by exactly half - a textbook signature of a constant proposal scale error, invisible to longer training, grid refinement, and truncation sweeps. With the bug corrected, four references - two finite-difference solvers with disjoint discretizations, the neural solver, and a semi-analytic scalar baseline obtained from CRRA homogeneity - agree on the optimal control to within ~2%. The constant-coefficient CRRA benchmark collapses by homogeneity to a scalar maximization, so the scalar baseline is the efficient method here; the contribution is the protocol, applicable in principle to non-homogeneous and higher-dimensional settings where neural HJB-PIDE solvers are genuinely needed. The episode is a concrete instance of a broader neural-PDE verification failure: pointwise agreement of a learned value or control can coexist with a systematically wrong nonlocal operator, so per-component and surface-level checks are needed before trusting the argmax policy.

2606.00989 2026-06-02 econ.GN q-fin.EC

Recession Detection Using Real Time GDP Data

使用实时GDP数据进行衰退检测

Neha Sikand, Rongjin Zhang

AI总结 本文利用1947-2021年美国实时GDP数据构建多种衰退指标,通过组合阈值生成完美分类器,证明实时GDP公告能可靠识别经济周期转折点。

详情
AI中文摘要

本文检验了实时GDP公告能否可靠地识别经济周期转折点。利用1947年至2021年美国实时GDP数据,我们基于不同的平滑方法和缩放变化构建了4,356个衰退指标。然后,我们将这些指标与替代阈值相结合,生成了137,457个完美衰退分类器。选定的分类器识别了所有12次历史衰退,且没有产生误报或漏报。将注意力限制在高精度部分,得到了两个检测误差标准差低于三个月的分类器,而选定的集成模型在其官方开始后平均3.04个月发出衰退信号。该框架在不同数据版本中准确识别衰退事件,表明先前工作中的差异可能反映了传统定年方法的局限性以及数据修订的影响。总体而言,结果表明实时GDP公告为NBER风格的衰退定年提供了实用的代理指标。

英文摘要

This paper examines whether real-time GDP announcements can reliably identify business-cycle turning points. Using U.S. real-time GDP vintages from 1947 to 2021, we construct 4,356 recession indicators based on alternative smoothing methods and scaling variations. We then combine these indicators with alternative thresholds to generate 137,457 perfect recession classifiers. The selected classifiers identify all 12 historical recessions without generating false positives or false negatives. Restricting attention to the high-precision segment yields two classifiers with a standard deviation of detection errors below three months, while the selected ensemble signals recessions, on average, 3.04 months after their official onset. The framework accurately identifies recession episodes across vintages, suggesting that discrepancies in prior work may reflect limitations of traditional dating methods in addition to data revisions. Overall, the results indicate that real-time GDP announcements provide a practical proxy for NBER-style recession dating.

2606.00948 2026-06-02 econ.GN q-fin.EC

Recession Detection in Japan using Labor Market Data

使用劳动力市场数据检测日本经济衰退

Neha Sikand, Rongjin Zhang

AI总结 通过校准阈值和平滑参数,将基于劳动力市场的Sahm规则和Michez规则应用于日本数据,构建大量衰退指标,并选出统计上完美的分类器,实现实时衰退检测。

详情
AI中文摘要

衰退指标通常被视为美国特有的,这引出了一个问题:基于劳动力市场的规则(如Sahm规则和Michez规则)能否可靠地检测其他国家的衰退。为了回答这个问题,我们通过将阈值和平滑参数校准到日本劳动力市场数据,评估这些规则是否适用于日本。我们构建了一个包含95,832个衰退指标的大型集合,结合了失业和职位空缺数据。选定的分类器在统计上是完美的,因为它们识别了1970-2021年训练期间的所有11次历史衰退,且没有产生任何误报。其中,193个分类器位于预期-精度前沿。将注意力限制在高精度段,得到了六个检测误差标准差低于3个月的分类器。选定的分类器集成平均在衰退实际开始后0.06个月发出信号。总体而言,这些发现表明,基于劳动力市场松弛的规则为改善各国实时衰退检测提供了一个通用框架。

英文摘要

Recession indicators are often viewed as U.S. specific, raising the question of whether labor market-based rules such as the Sahm Rule and the Michez Rule can reliably detect recessions in other countries. To answer this, we evaluate whether such rules can be adapted to Japan by calibrating thresholds and smoothing parameters to Japanese labor market data. We construct a large set of 95,832 recession indicators combining unemployment and vacancy data. The selected classifiers are statistically perfect as they identify all 11 historical recessions in the 1970-2021 training period without generating any false positives. Among these, 193 classifiers lie on the anticipation-precision frontier. Restricting attention to the high-precision segment yields six classifiers with a standard deviation of detection errors below 3 months. The selected classifier ensemble signals recessions, on average, 0.06 months after their true onset. Overall, these findings suggest that slack-based labor market rules provide a general framework for improving real-time recession detection across countries.

2606.00800 2026-06-02 q-fin.PR q-fin.ST

Multiplicative Langevin Process for Volatilities Produces Observed Q-Variance Regularities

波动率的乘法朗之万过程产生观测到的Q方差规律

William H. Press, Alex Dannenberg

AI总结 本文证明Q方差关系等价于波动率服从逆伽马分布,并表明该分布可由具有任意可调相干时间的乘法朗之万过程精确生成。

详情
Comments
7 pages
AI中文摘要

Q方差(所谓的)提出了一种统计关系 $\mathbf{E}(σ^2 | z) = σ_0^2 + frac{1}{2}z^2$,其中资产波动率 $σ^2$ 在时间区间 $T$ 内观测,$z$ 是同一区间内(适当缩放的)收益率。我们在此证明,这一关系与假设 $σ^2$ 本身服从逆伽马概率分布是{\em 完全等价的}。然后我们证明,这样的分布恰好由具有任意可调相干时间 $τ_c$ 的乘法朗之万过程生成,因此对于所有 $T \ll τ_c$,几乎相同的Q方差关系都将成立。

英文摘要

Q-variance (so-called) posits a statistical relationship $\mathbf{E}(σ^2 | z) = σ_0^2 + \tfrac{1}{2}z^2$ between an asset's volatility $σ^2$, as observed in a time interval $T$, and its (suitably scaled) return $z$ in the same interval. We here show that this relationship is {\em exactly equivalent} to to positing an Inverse Gamma probability distribution for $σ^2$ itself. We then show that such a distribution is exactly generated by a multiplicative Langevin process with an arbitrary, settable coherence time $τ_c$, so that very nearly the same Q-variance relationship will hold for all $T \ll τ_c$.

2606.00624 2026-06-02 q-fin.ST q-fin.CP

Macro-aware time series forecasting via hierarchical mixed-frequency attention models

宏观感知的时间序列预测:基于层次混合频率注意力模型

Daniel Cunha Oliveira, Kieran Wood, Stefan Zohren, Mihai Cucuringu, André Fujita

AI总结 提出HANET(层次注意力网络),通过层次交叉注意力机制将低频宏观状态与高频资产收益结合,在多个资产类别上提升预测性能和风险调整收益。

详情
AI中文摘要

深度学习模型在金融预测中展现出潜力,但其泛化能力常受限于小数据集、噪声信号和非平稳性。虽然元学习和相关技术缓解了部分问题,但它们通常未考虑宏观金融预测的核心限制:驱动资产收益的独特宏观经济状态的稀缺性。我们提出HANET(层次注意力网络),一种基于LSTM的混合架构,通过关注长期宏观背景的注意力机制整合宏观经济领域知识,同时保留高频市场动态。HANET以层次混合频率结构组织信息,将每日资产收益信号嵌套在月度宏观经济窗口内,并引入层次交叉注意力机制,在不丢弃细粒度日信息的情况下协调低频宏观信号与高频收益。通过将状态选择建模为对宏观背景的注意力,该模型适应稀缺且变化的状态。实证上,在涵盖多个资产类别的55种流动性期货中,HANET持续优于忽略宏观经济信息的神经预测器,尤其是在动荡时期,改善了风险调整收益并减少了损失。消融研究表明,这些收益依赖于结构化的宏观条件而非简单的特征增强:具有相同宏观表示的LSTM表现较差,而打乱宏观背景会显著降低性能。最后,HANET通过注意力权重提供可解释性,突出每个预测中最具影响力的历史状态,并将宏观条件与投资组合结果联系起来。这些结果确立了HANET作为将宏观经济信息整合到基于注意力的深度学习金融预测中的系统方法。

英文摘要

Deep learning models show promise in financial forecasting, yet their generalization is often undermined by small datasets, noisy signals, and non-stationarity. While meta-learning and related techniques mitigate some of these issues, they typically do not account for a core limitation in macro-financial prediction: the scarcity of distinct macroeconomic regimes that drive asset returns. We introduce HANET (Hierarchical Attention Network), a hybrid LSTM-based architecture that integrates macroeconomic domain knowledge through attention over long-run macro contexts while preserving high-frequency market dynamics. HANET organizes information in a hierarchical mixed-frequency structure, with daily asset-return signals nested within monthly macroeconomic windows, and introduces a Hierarchical Cross-Attention mechanism that reconciles low-frequency macro signals with high-frequency returns without discarding granular daily information. By framing regime selection as attention over macroeconomic contexts, the model adapts to scarce and shifting regimes. Empirically, across 55 liquid futures spanning multiple asset classes, HANET consistently outperforms neural forecasters that ignore macroeconomic information, particularly during turbulent periods, improving risk-adjusted returns and mitigating losses. Ablation studies show that these gains rely on structured macro conditioning rather than naive feature augmentation: an LSTM with the same macro representation performs poorly, and shuffling macro contexts substantially degrades performance. Finally, HANET provides interpretability through attention weights, highlighting which historical regimes are most influential for each forecast and linking macro conditions to portfolio outcomes. These results establish HANET as a systematic approach to integrating macroeconomic information into attention-based deep learning for financial forecasting.

2606.00614 2026-06-02 econ.GN q-fin.EC

Mitigation of spatial economic impact propagation of highway disruptions by redundant networks

高速公路中断的空间经济影响传播的冗余网络缓解

Tomoki Ishikura

AI总结 本研究结合区域间道路网络连通性与空间可计算一般均衡模型,评估冗余交通网络在灾害中缓解经济脆弱性的有效性,并以日本中国地区平行高速公路中断场景为例,发现经济脆弱性降低效应比交通影响更深远。

详情
Comments
15 pages, 12 figures
AI中文摘要

灾害对交通基础设施造成的损害可能通过经济相互依存间接导致经济损失,即使在未直接受影响的地区也是如此。然而,即使交通路线因灾害中断,如果确保有替代路线,损害也可以得到缓解。交通网络密度低的农村地区在灾害中更容易受到交通中断的影响。本研究开发了一种评估冗余交通网络在灾害中缓解经济脆弱性有效性的方法。我们的方法将区域间道路网络连通性与空间可计算一般均衡(SCGE)模型相结合。我们将该方法应用于日本中国地区的道路中断情景,该地区拥有平行高速公路系统。受影响地区在地理上靠近许多农村地区,并与它们有很强的经济相互依存关系。几个反事实模拟描绘了没有替代道路和没有灾害的情况。我们分别评估了交通影响(以旅行时间变化衡量)和经济影响(以负效益衡量)。结果表明,经济脆弱性降低效应比交通影响更为深远。

英文摘要

The damage to transportation infrastructure caused by disasters can indirectly lead to economic damage through economic interdependence, even in areas that are not directly affected. However, even when transportation routes are interrupted by a disaster, the damage can be mitigated if alternative routes are secured. Rural areas with low-density transportation networks are more vulnerable to traffic disruptions in a disaster. This study develops a method for evaluating the effectiveness of redundant transportation networks in mitigating economic vulnerability in the event of a disaster. Our methodology combines inter-regional road network connectivity with a spatial computable general equilibrium (SCGE) model. We apply the method to road disruption scenarios in the Chugoku region of Japan, which has a system of parallel highways. The affected areas are in close geographical proximity to many rural areas and have strong economic interdependencies with them. Several counterfactual simulations depicted the situation without the alternative road and the disaster. We evaluate the transportation impacts, measured by changes in travel time, and the economic impacts, measured by negative benefits, respectively. The results suggest that the economic vulnerability reduction effect is more far-reaching than the transportation impacts.

2606.00143 2026-06-02 q-fin.PM cs.AI

Regime-Adaptive Continual Learning for Portfolio Management

Regime-Adaptive Continual Learning for Portfolio Management

Chaofan Pan, Lingfei Ren, Linbo Xiong, Yonghao Li, Wei Wei, Xin Yang

AI总结 提出ReCAP框架,通过自适应制度检测和持续学习实现投资组合管理的快速适应与长期优异回报。

详情
Comments
Accepted by KDD 2026
AI中文摘要

金融市场本质上是不稳定的,频繁出现制度转变和结构性变化,使得传统的投资组合管理方法失效。现有的补救措施,如滚动窗口重新训练和朴素在线微调,分别受到高计算成本和知识利用不足的困扰,导致低回报和有限的适应性。持续学习通过使交易代理能够跨顺序任务积累和转移知识,提供了一种有前景的范式。在本文中,我们提出了 extbf{Re}gime-aware extbf{C}ontinual extbf{A}daptive extbf{P}ortfolio management ( extbf{ReCAP}),一个将CL集成到PM中以应对动态金融环境挑战的新框架。ReCAP采用自适应制度检测模块将历史市场数据分割成可变长度的制度,实现制度特定的策略向量学习和策略库构建。在持续交易过程中,制度门控模块根据当前市场状态自适应地组合策略库中的策略向量,促进对新检测到的制度的快速适应。只有制度门控和当前制度的策略向量被持续更新,以有效保留有用知识。在五个真实世界数据集上的广泛实验表明,ReCAP持续优于流行的基线,在长期投资视野中实现卓越回报,并快速适应制度转变。

英文摘要

Financial markets are inherently non-stationary, exhibiting frequent regime shifts and structural changes that render traditional Portfolio Management (PM) approaches ineffective. Existing remedies, such as rolling-window retraining and naive online fine-tuning, are hindered by high computational costs and insufficient knowledge utilization, respectively, resulting in low returns and limited adaptability. Continual learning (CL) offers a promising paradigm by enabling trading agents to accumulate and transfer knowledge across sequential tasks. In this paper, we propose \textbf{Re}gime-aware \textbf{C}ontinual \textbf{A}daptive \textbf{P}ortfolio management (\textbf{ReCAP}), a novel framework that integrates CL into PM to address the challenges of dynamic financial environments. ReCAP employs an adaptive regime detection module to segment historical market data into variable-length regimes, enabling regime-specific learning of policy vectors and the construction of a policy library. During continual trading, a regime-gate module adaptively combines policy vectors from the library based on the current market state, facilitating rapid adaptation to newly detected regimes. Only the regime-gate and the current regime's policy vector are continually updated to preserve useful knowledge effectively. Extensive experiments on five real-world datasets demonstrate that ReCAP consistently outperforms popular baselines, achieving superior returns in long-term investment horizons and rapid adaptation to regime shifts.

2606.00071 2026-06-02 q-fin.GN cs.CE cs.DC econ.GN q-fin.EC

Bitcoin Price Prediction: Peer-Reviewed Evidence and Social Media Discourse

比特币价格预测:同行评审证据与社交媒体讨论

Carlos Baquero

AI总结 本文通过调查同行评审文献和社交媒体讨论,发现比特币价格预测在1-6个月时间跨度上尚无模型能稳健超越朴素基准,并提出了改进评估方法的具体建议。

详情
AI中文摘要

比特币价格预测已吸引了数百篇学术论文和持续的社交媒体辩论,然而该领域甚至在基本问题上缺乏共识:任何模型能否在1至6个月的时间跨度上击败“今日价格”的朴素基准?我们调查了同行评审领域,按评估方法对论文进行分类,并将学术发现与X/Twitter上非正式但实质性的讨论进行对比。得出的图景令人警醒。在短期至中期跨度上,没有同行评审研究显示出在多个市场制度下对朴素基准的稳健优越性。日度可预测性是存在的,但不会延伸到小时或月度跨度,并且可能无法承受交易成本。存量-流量模型在正式的样本外测试中失败,而梅特卡夫定律估值被质疑为虚假相关。比特币价格幂律虽然经验上引人注目,但尚未经过正式分布检验。与此同时,社交媒体从业者提出了有效的统计批评——普通最小二乘法(OLS)违反、回测过拟合、虚假回归——而学术文献尚未将其形式化。我们识别了开放的研究方向,并为未来工作提出了具体的方法论标准——滚动窗口评估、多制度保留窗口、朴素基准比较、在超参数网格中包含零值以及Diebold-Mariano显著性检验——认为该领域的主要需求不是更多模型,而是更好的评估。

英文摘要

Bitcoin price prediction has attracted hundreds of academic papers and continuous social media debate, yet the field lacks consensus on even basic questions: can any model beat a naive "today's price" baseline at horizons of one to six months? We survey the peer-reviewed landscape, categorize papers by evaluation methodology, and contrast academic findings with informal but substantive discourse on X/Twitter. The picture that emerges is sobering. At short-to-medium horizons, no peer-reviewed study has shown robust superiority over the naive baseline across multiple market regimes. Daily predictability is real but does not extend to hourly or monthly horizons, and may not survive transaction costs. The stock-to-flow model has failed formal out-of-sample testing, and Metcalfe's Law valuations have been challenged as spurious. The Bitcoin price power law, while empirically compelling, has not been subjected to formal distributional tests. Meanwhile, social media practitioners raise valid statistical critiques -- ordinary least squares (OLS) violations, backtest overfitting, spurious regressions -- that the academic literature has not formalized. We identify open research directions and propose concrete methodological standards for future work -- walk-forward evaluation, multi-regime holdout windows, naive baseline comparison, inclusion of zero in hyperparameter grids, and Diebold-Mariano significance testing -- arguing that the field's primary need is not more models but better evaluation.

2606.00061 2026-06-02 q-fin.ST q-fin.TR

Reflexivity as Prompt: Does Awareness of Self-Reinforcing Market Dynamics Improve LLMs as Financial Market Forecasters?

反身性作为提示:对自我强化市场动态的认知是否提高了LLM作为金融市场预测者的表现?

Eugene Park

AI总结 研究在繁荣-萧条市场周期中,逐步引入索罗斯反身性理论时,前沿大语言模型作为金融预测者的行为变化,通过方向性预测准确率和夏普比率评估,发现反身性认知在不同模型和上下文窗口中产生不同的预测改进。

详情
AI中文摘要

我们研究了在繁荣-萧条市场周期中,当逐步意识到索罗斯的反身性理论时,前沿大语言模型(LLMs)作为金融预测者的行为。标准的人工智能辅助预测将市场视为外生系统。反身性理论则持不同观点:价格塑造基本面,每个预测者都是其所分析循环中的参与性主体。我们在两个历史上不同的时期——互联网泡沫(1996-2001年)和全球金融危机(2004-2009年)——评估了三种前沿模型(GPT5、Claude Sonnet 4.6和Gemini 3 Pro),在四种累积的零样本条件下。主要指标是方向性预测准确率;我们还报告了隐含的多头/现金策略的夏普比率,以捕捉预测的风险调整后经济价值。所有输入均经过匿名化和归一化处理,以防止记忆。我们发现,包含反身性认知的条件在不同模型和上下文窗口中提高了预测准确性,揭示了相同的理论认知在不同前沿LLM中可能产生定性的不同预测行为。

英文摘要

We study how frontier large language models (LLMs) behave as financial forecasters during boom-bust market cycles when made progressively aware of Soros's theory of reflexivity. Standard AI-assisted forecasting treats the market as an exogenous system. Reflexivity theory holds otherwise: prices shape fundamentals, and every forecaster is a participative agent in the loop it analyzes. We evaluate three frontier models - GPT5, Claude Sonnet 4.6, and Gemini 3 Pro - under four accumulating zero-shot conditions across two historically distinct episodes: the dot-com bubble (1996-2001) and the global financial crisis (2004-2009). The primary metric is directional forecasting accuracy; we also report the Sharpe ratio of an implied long/cash strategy to capture the risk-adjusted economic value of the forecasts. All inputs are anonymized and normalized to guard against memorization. We find that conditions incorporating reflexivity awareness improve forecasting accuracy differently across models and context windows, revealing that the same theoretical awareness can produce qualitatively different forecasting behavior across frontier LLMs.

2606.00060 2026-06-02 q-fin.TR cs.CE cs.LG

Machine Learning-Based Bitcoin Trading Under Transaction Costs: Evidence From Walk-Forward Forecasting

基于机器学习的比特币交易:考虑交易成本的滚动前向预测证据

Andrei Bysik, Robert Ślepaczuk

AI总结 研究在交易成本下,利用XGBoost、LSTM和iTransformer等机器学习模型预测BTC-USDT小时收益率,并通过成本感知执行过滤器将预测转化为盈利交易策略。

详情
Comments
42 pages,
AI中文摘要

本文研究机器学习对BTC-USDT小时收益率的预测能否在扣除交易成本后转化为具有经济意义的交易表现。使用2018-2026年间约70,000个小时观测值,在27折滚动前向协议中评估XGBoost、LSTM和iTransformer。所有三种模型在选定配置下均产生正的毛交易表现,但一旦施加十个基点的交易成本,基于符号的朴素策略便失效。一种成本感知的执行过滤器(仅当预测幅度超过基于交易成本的阈值时才阻止交易)显著降低了换手率,并在选定配置下恢复了盈利能力。最强的纯多头XGBoost策略年化收益率超过65%,夏普比率高于1。额外测试表明,技术指标在选定情况下提升了表现,EGARCH导出的特征并未提供一致的稳健收益,且XGBoost在描述性上优于神经替代模型,尽管自助法证据不支持正式的统计优势。损失函数和模型选择效应是次要的且统计上脆弱。结果表明,小时级加密货币交易的主要障碍不仅在于弱可预测性,还在于将预测转化为交易的方式。

英文摘要

This paper investigates whether machine learning forecasts of hourly BTC-USDT returns can be converted into economically meaningful trading performance after transaction costs. Using approximately 70,000 hourly observations from 2018-2026, XGBoost, LSTM, and iTransformer are evaluated in a 27-fold walk-forward protocol. All three models produce positive gross trading performance in selected configurations, but naive sign-based strategies fail once transaction costs of ten basis points are imposed. A cost-aware execution filter, which prevents trades only when the forecast magnitude exceeds a transaction-cost-based threshold, sharply reduces turnover and restores profitability in selected configurations. The strongest long-only XGBoost strategy produces annualised returns above 65% with a Sharpe ratio above one. Additional tests show that technical indicators improve performance in selected cases, EGARCH-derived features do not provide uniformly robust gains, and XGBoost is descriptively stronger than the neural alternatives, although bootstrap evidence does not support formal statistical dominance. Loss-function and model-selection effects are secondary and statistically fragile. The results show that the main obstacle in hourly cryptocurrency trading is not only weak predictability, but also the way forecasts are converted into trades.

2605.30567 2026-06-02 q-fin.PR

Valuation of GLWB-LTC Annuities with Lévy Equity Dynamics, Stochastic Interest Rates and Health-State Transitions

具有Lévy权益动态、随机利率和健康状态转换的GLWB-LTC年金估值

Andrea Molent

AI总结 本文针对具有长期护理特征的最低提取利益保证合同,提出结合Lévy跳扩散权益动态、Hull-White随机利率和七状态健康模型的估值框架,并采用混合树-IMEX有限差分方法进行数值求解。

详情
AI中文摘要

本文开发了一个估值框架,用于当参考基金遵循指数Lévy动态且短期利率遵循Hull-White模型时,具有长期护理(LTC)特征的保证终身提取利益(GLWB)合同。该合同结合了金融担保、长寿保护、健康相关的LTC支付以及退保期权,需要联合处理跳跃风险、随机贴现和残疾风险。数值方法将可重组Hull-White三叉树与隐式-显式(IMEX)有限差分方案相结合。该框架包含一个七状态健康模型、年费、LTC支付、保证提取和bang-bang保单持有人行为,并以蒙特卡洛模拟为基准。数值结果表明,混合树-IMEX方法能够提供与模拟基准一致且稳定的长期限价格。它们还表明,Lévy权益动态和随机利率对公平费用和退保激励有实质性影响,并影响合同价值的分解。这些发现强调了在为具有LTC或有利益的长期保险担保定价时,联合建模金融尾部风险和利率风险的重要性。

英文摘要

This paper develops a valuation framework for guaranteed lifetime withdrawal benefit (GLWB) contracts with long-term care (LTC) features when the reference fund follows exponential Levy dynamics and the short rate follows the Hull-White model. The contract combines financial guarantees, longevity protection, health-contingent LTC payments, and surrender optionality, requiring the joint treatment of jump risk, stochastic discounting, and disability risk. The numerical method couples a recombining Hull-White trinomial tree with an implicit-explicit (IMEX) finite difference scheme. The framework incorporates a seven-state health model, annual fees, LTC payments, guaranteed withdrawals, and bang-bang policyholder actions, and is benchmarked against Monte Carlo simulation. Numerical results show that the hybrid tree-IMEX method delivers stable long-maturity prices consistent with simulation benchmarks. They also show that Levy equity dynamics and stochastic interest rates have a material impact on fair fees and surrender incentives, and affect the decomposition of contract value. The findings highlight the importance of modelling financial tail risk and interest-rate risk jointly when pricing long-term insurance guarantees with LTC-contingent benefits.

2605.30435 2026-06-02 econ.GN q-fin.EC

Global Science Sustains U.S. Innovation

全球科学支撑美国创新

Christopher R. Esposito

AI总结 通过追踪多代引用路径揭示美国创新的全球科学知识供应链,并模拟跨境障碍对创新生产力的负面影响。

详情
AI中文摘要

与实体产品一样,新技术是利用全球采购的投入开发的。然而,尽管实体商品背后的供应链已得到充分理解,我们对支撑美国创新的科学知识的国际供应链,以及它可能受到的干扰知之甚少。在这里,我通过追踪连接NSF资助的研究与下游专利的多代引用路径来揭示这一供应链,并通过模拟美国边境科学知识流动的障碍对其进行压力测试。美国的知识供应链延伸至全球,阻碍思想跨越美国边境的摩擦会降低其连通性、延长其长度并降低创新生产力。这些影响延伸到美国国会认为对国家优先事项至关重要的技术领域,包括半导体、量子科学和人工智能。

英文摘要

Like physical products, new technologies are developed using globally sourced inputs. Yet while the supply chains behind physical goods are well understood, we know far less about the international supply chain of scientific knowledge that powers U.S. innovation, or how vulnerable it may be to disruption. Here, I uncover this supply chain by tracing multi-generational citation paths connecting NSF-funded research to downstream patents, and stress-test it by simulating barriers to scientific knowledge flows across the U.S. border. The U.S. knowledge supply chain extends globally, and frictions impeding the movement of ideas across the U.S. border reduce its connectivity, extend its length, and lower innovation productivity. These impacts extend to technology areas deemed critical to national priorities by U.S. Congress, including Semiconductors, Quantum Science, and AI.

2605.25631 2026-06-02 cs.GT cs.CR math.PR q-fin.TR

The Privacy Subsidy in Continuous-Time Kyle: Cumulative Welfare under Noise-Perturbed Order-Flow Observation

连续时间Kyle模型中的隐私补贴:噪声扰动订单流观测下的累积福利

Yuki Nakamura

AI总结 将单周期Kyle模型的隐私补贴闭式解扩展到连续时间,推导出价格影响系数和累积预期转移的解析形式,并建立与损失-再平衡(LVR)的结构对应关系。

详情
Comments
v4: Framing reconciliation (competition-robust coarse-signal subsidy); explicit per-increment covariance; LVR stated as a structural correspondence (not identity). 14 pages. Third paper in a privacy-subsidy cluster (companions: arXiv:2605.15746, arXiv:2605.19742)
AI中文摘要

我们将Nakamura(2026,arXiv:2605.15746)的单周期Kyle模型隐私补贴闭式解扩展到连续时间。一个承诺的贝叶斯自动做市商观察到被独立布朗隐私通道(扩散强度$σ_\varepsilon$)扰动的总订单流。在马尔可夫线性均衡下,价格影响系数为$λ= σ_v / \sqrt{σ_u^2 + σ_\varepsilon^2}$(时间恒定),并且从协议流动性池到交易者在$[0,1]$上的累积预期转移为$|Π_M| = σ_v σ_\varepsilon^2 / \sqrt{σ_u^2 + σ_\varepsilon^2}$。然后,我们建立了该累积隐私补贴与损失-再平衡(Milionis等人,2022)之间的结构对应关系,将隐私噪声福利识别为订单流观测中LVR价格观测差距的模拟。该结果完成了在隐私聚合信息环境下量化承诺AMM交易所盈亏平衡费用的连续时间Kyle模型部分。

英文摘要

We extend the closed-form privacy-subsidy result of Nakamura~(2026, arXiv:2605.15746) from the single-period Kyle model to continuous-time. A committed Bayesian automated market maker observes the aggregate order flow perturbed by an independent Brownian privacy channel of diffusion intensity $σ_\varepsilon$. Under the Markovian linear equilibrium, the price-impact coefficient is $λ= σ_v / \sqrt{σ_u^2 + σ_\varepsilon^2}$ -- constant in time -- and the cumulative expected transfer from the protocol's liquidity pool to traders over $[0,1]$ is $|Π_M| = σ_v σ_\varepsilon^2 / \sqrt{σ_u^2 + σ_\varepsilon^2}$. We then establish a structural correspondence between this cumulative privacy subsidy and Loss-Versus-Rebalancing (Milionis et al.~2022), identifying privacy-noise welfare as the order-flow observation analog of LVR's price observation gap. The result completes the continuous-time Kyle leg of the program of quantifying break-even fees for committed-AMM exchanges under privacy-aggregated information environments.

2508.15237 2026-06-02 q-fin.MF

Option pricing under non-Markovian stochastic volatility models: A deep signature approach

非马尔可夫随机波动率模型下的期权定价:一种深度签名方法

Jingtang Ma, Xianglin Wu, Wenyuan Li

AI总结 针对非马尔可夫随机波动率模型,提出基于深度签名的期权定价方法,通过将粗糙随机微分方程转化为经典随机微分方程,并证明算法收敛性,数值实验验证了其有效性。

详情
AI中文摘要

本文研究了标的资产遵循非马尔可夫随机波动率模型的定价问题。经典偏微分方程方法在此背景下面临重大挑战,因为期权价格不仅取决于当前状态,还取决于过程的整个历史路径。为了克服这些困难,我们将资产动态重新表述为粗糙随机微分方程,然后通过时间扩展布朗运动签名的线性或非线性组合来表示粗糙路径。这种表示将粗糙随机微分方程转化为经典随机微分方程,从而允许应用标准分析工具。我们针对线性和非线性表示提出了一种深度签名方法,并严格证明了算法的收敛性。数值例子证明了我们的方法在马尔可夫和非马尔可夫波动率模型下的有效性,为期权定价提供了一个有理论依据且计算高效的框架。

英文摘要

This paper studies the pricing problem in which the underlying asset follows a non-Markovian stochastic volatility model. Classical partial differential equation methods face significant challenges in this context, as the option prices depend not only on the current state, but also on the entire historical path of the process. To overcome these difficulties, we reformulate the asset dynamics as a rough stochastic differential equation and then represent the rough paths via linear or non-linear combinations of time-extended Brownian motion signatures. This representation transforms a rough stochastic differential equation to a classical stochastic differential equation, allowing the application of standard analytical tools. We propose a deep signature approach for both linear and nonlinear representations and rigorously prove the convergence of the algorithm. Numerical examples demonstrate the effectiveness of our approach for both Markovian and non-Markovian volatility models, offering a theoretically grounded and computationally efficient framework for option pricing.

2605.28850 2026-06-02 cs.LG q-fin.CP

Representation Signatures and Risk-Feedback Alignment in LLM Trading Agents

表示签名与LLM交易智能体中的风险反馈对齐

Weicheng Xue

AI总结 通过TradeArena测试平台研究LLM交易智能体在金融决策中的行为对齐与表示动态,发现故障前表示签名(规划嵌入漂移、流形有效秩收缩)并验证风险反馈作为外部对齐信号的有效性。

详情
AI中文摘要

我们研究了大型语言模型(LLM)智能体在金融决策环境中的行为对齐与表示动态。TradeArena是一个可审计的交易智能体测试平台,提供风险报告、执行模拟、记忆和可重放轨迹,使我们能够分析在市场压力下推理、持仓和干预的演变。代码和数据工件可通过TradeArena仓库获取。我们发现了故障前签名:规划嵌入偏离正常质心,融合的计划-风险表示将正常状态与预回撤状态分离,局部流形表现出有效秩收缩。在80个滚动故障锚点和8条LLM轨迹中,这一模式在哈希、LSA、Transformer和白盒隐藏状态探针中持续存在。使用无CoT目标权重、词汇控制、OHLCV噪声和虚假审计的压力测试表明,无推理时推理级收缩消失,而意图空间和融合签名仍具有信息性。结构化风险反馈可以在不微调的情况下作为外部对齐信号,但并非通用性能增强器:真实审计反馈改善了一些模型的校准,另一些模型的收益,并暴露出安慰剂或隐藏反馈在短周期内收益更高但对齐诊断较弱的情况。一项51只股票的日内实验揭示了相关性盲点:LLM推理为风险层会削减的相关资产敞口提供理由。最后,一个金融审计任务套件将比较从“哪个模型交易最好”转向模型能否审计轨迹、尊重执行边界、重现工件并避免过度声明。这些结果支持研究主张而非盈利主张:可审计的风险反馈和表示轨迹揭示了LLM金融推理何时对齐、漂移或失败。

英文摘要

We study behavioral alignment and representation dynamics of large language model (LLM) agents in financial decision environments. TradeArena, an auditable trading-agent testbed with risk reports, execution simulation, memory, and replayable trajectories, lets us analyze how rationales, positions, and interventions evolve under market stress. Code and data artifacts are available through the \href{https://github.com/weich97/TradeArena.git}{TradeArena repository}. We find pre-failure signatures: planning embeddings drift from normal centroids, fused plan-risk representations separate normal from pre-drawdown states, and local manifolds exhibit effective-rank contraction. Across 80 rolling failure anchors and eight LLM trajectories, this pattern persists across hash, LSA, Transformer, and white-box hidden-state probes. Stress tests with CoT-free target weights, lexical controls, OHLCV noise, and false audits show that rationale-level contraction can vanish without rationales, while intent-space and fused signatures remain informative. Structured risk feedback can act as an external alignment signal without fine-tuning, but not as a universal performance enhancer: true audit feedback improves calibration for some models, returns for others, and exposes cases where placebo or hidden feedback has higher short-horizon return but weaker alignment diagnostics. A 51-stock intraday experiment reveals a correlation blind spot: LLM rationales justify exposure to coupled assets that the risk layer clips. Finally, a financial-audit task suite shifts comparison from ``which model trades best'' to whether models can audit trajectories, respect execution boundaries, reproduce artifacts, and avoid claim overreach. These results support a research claim, not a profitability claim: auditable risk feedback and representation trajectories reveal when LLM financial reasoning is aligning, drifting, or failing.

2212.07944 2026-06-02 cs.LG math.OC q-fin.CP q-fin.PM q-fin.ST

Variable Clustering via Distributionally Robust Nodewise Regression

基于分布鲁棒节点回归的变量聚类

Kaizheng Wang, Xiao Xu, Xun Yu Zhou

AI总结 本文提出一种分布鲁棒节点回归方法,通过凸松弛、数据驱动鲁棒区域选择和ADMM算法,实现多因子块模型下的变量聚类,并在数值实验中展示其优越性能。

详情
Comments
ICML 2026
AI中文摘要

我们研究了一个用于变量聚类的多因子块模型,并通过分布鲁棒版本的节点回归将其与正则化子空间聚类联系起来。为了解决后一个问题,我们推导了一个凸松弛,提供了一种数据驱动的方法来选择鲁棒区域的大小,并开发了一种ADMM算法以实现高效实现。我们在广泛的数值研究中验证了我们的方法,并展示了其优越的性能。

英文摘要

We study a multi-factor block model for variable clustering and connect it to regularized subspace clustering through a distributionally robust version of nodewise regression. To solve the latter problem, we derive a convex relaxation, provide a data-driven approach for selecting the size of the robust region, and develop an ADMM algorithm for efficient implementation. We validate our method in extensive numerical studies and demonstrate its superior performance.

2605.19742 2026-06-02 cs.GT cs.CR math.PR q-fin.TR

The Privacy Subsidy in Glosten-Milgrom: Bid-Ask Spread and Welfare under Flip-Noise Direction Observation

Glosten-Milgrom中的隐私补贴:在翻转噪声方向观测下的买卖价差与福利

Yuki Nakamura

AI总结 本文在Glosten-Milgrom序贯交易模型中引入二进制翻转信道噪声,推导出买卖价差的闭式解和福利分解,揭示了隐私补贴机制,并将其从连续高斯模型扩展到离散两状态微观结构。

详情
Comments
10 pages. Companion to arXiv:2605.15746
AI中文摘要

我们推导了Glosten-Milgrom 1985序贯交易模型在噪声方向观测下的买卖价差和福利分解的闭式解,其中市场做市商观察到由概率为$\eta$的二进制翻转信道扰动的交易方向——这是对方向信号施加隐私机制的自然信息论模型。在承诺的贝叶斯市场做市商定价规则下,均衡价差为$\mu(1-2\eta)\Delta$,其中$\mu$是知情交易者比例,$\Delta = v_H - v_L$是价值范围。福利分解识别出从协议流动性池向交易者的每笔交易转移$\mu\eta\Delta$——即“隐私补贴”,这与先前工作中建立的高斯-凯尔类比相对应。该结果将隐私补贴概念从连续高斯模型扩展到离散两状态微观结构,展示了在两个经典模型中的稳健性。主要应用:基于MPC的匹配引擎,具有$\varepsilon$-差分隐私的方向披露,其中引擎基于噪声方向信号定价。

英文摘要

We derive a closed-form bid-ask spread and welfare decomposition for the Glosten-Milgrom 1985 sequential-trading model when the market maker observes the trade direction perturbed by a binary flip channel of probability $η$ -- a natural information-theoretic model of privacy mechanisms acting on the direction signal. Under a committed Bayesian market-maker pricing rule, the equilibrium spread is $μ(1-2η)Δ$, where $μ$ is the informed-trader fraction and $Δ= v_H - v_L$ the value range. The welfare decomposition identifies a per-trade transfer $μηΔ$ from the protocol's liquidity pool to traders -- the "privacy subsidy", mirroring the Gaussian-Kyle analog established in prior work. The result extends the privacy-subsidy concept from continuous Gaussian to discrete two-state microstructure, demonstrating robustness across both classical models. Primary application: MPC-based matching engines with $\varepsilon$-differentially-private direction disclosure, where the engine prices on a noisy direction signal.

2605.15746 2026-06-02 cs.GT cs.CR math.PR q-fin.TR

The Privacy Subsidy: Kyle's $λ$ under Noise-Perturbed Order-Flow Observation

隐私补贴:噪声扰动订单流观测下的 Kyle's λ

Yuki Nakamura

AI总结 研究隐私保护交易所中,市场做市商观测到被高斯噪声扰动的订单流时的线性Kyle均衡,发现价格影响系数与知情交易者策略按隐私参数的倒数缩放,并推导出隐私补贴的闭式表达式。

详情
Comments
v5: Framing reconciliation (the privacy subsidy is the generic, competition-robust cost of pricing on a signal coarser than the settled flow); reference corrections; manuscript body refactored. 17 pages, 1 figure
AI中文摘要

隐私保护的加密货币交易所改变了定价机制对订单流的观测方式。我们推导了当承诺的贝叶斯市场做市商观测到被独立高斯隐私噪声扰动的订单流时,唯一的线性Kyle均衡。价格影响系数和知情交易者策略按隐私参数的倒数因子重新缩放(一个下降,一个上升),因此它们的乘积保持不变。然后,福利分解识别出从协议的LP池到交易者的每期转移的闭式形式——即“隐私补贴”,任何隐私聚合交易所必须收取的盈亏平衡费用。该结果是单期闭式隐私噪声版本的Loss-Versus-Rebalancing(Milionis等人,2022)。主要应用是带有显式加性噪声注入的屏蔽AMM(例如,差分隐私);相关设计(批量交换、密封投标拍卖、预言机挂钩交叉)需要单独的框架,我们将其留给未来工作。

英文摘要

Privacy-preserving cryptocurrency exchanges alter what the pricing mechanism observes about order flow. We derive the unique linear Kyle equilibrium when a committed Bayesian market maker observes order flow perturbed by independent Gaussian privacy noise. The price-impact coefficient and informed-trader strategy rescale by reciprocal factors of the privacy parameter (one down, one up), so their product is invariant. A welfare decomposition then identifies a closed-form per-period transfer from the protocol's LP pool to traders -- the "privacy subsidy", the break-even fee any privacy-aggregated exchange must charge. The result is the single-period closed-form privacy-noise analog of Loss-Versus-Rebalancing (Milionis et al. 2022). The primary application is shielded AMMs with explicit additive-noise injection (e.g., differential privacy); related designs (batched swaps, sealed-bid auctions, oracle-pegged crossings) require separate frameworks that we leave to future work.

2512.07526 2026-06-02 q-fin.RM econ.GN q-fin.EC q-fin.GN

Strategic Preemption Under Shared Catastrophic Risk: The Suicide Region and the Race to Artificial General Intelligence

共享灾难性风险下的战略先发制人:自杀区域与通用人工智能竞赛

David Tan

AI总结 本文通过连续时间先发制人博弈模型,分析了共享灾难性外部性下竞争压力迫使理性主体在负风险调整净现值下部署的“自杀区域”,并应用于通用人工智能竞赛,展示了系统性毁灭成本越高该区域越大的现象,以及私人责任和奖金共享两种机制如何消除该区域。

详情
Comments
41 pages, 1 figure
AI中文摘要

我们分析了一个具有共享灾难性外部性的连续时间先发制人博弈。当灾难成本嵌入双方收益时,风险项在均衡无差异条件中抵消。这创造了一个“自杀区域”,其中竞争压力迫使理性主体在负风险调整净现值下部署。我们将此框架应用于通用人工智能(AGI)竞赛。我们表明,随着系统性毁灭成本的增加,这个自杀区域扩大:更高的灾难性风险不会阻止竞赛,反而扩大了理性行为者在负社会价值下部署的条件集合。我们描述了相对于社会规划者基准的福利扭曲,并展示了两种互补机制——私人责任和奖金共享——如何关闭自杀区域。私人责任提高了不安全部署的成本,而奖金共享减少了先发制人的战略必要性。“警告射击”(次存在性灾难)将无法阻止AGI加速,因为竞赛的赢家通吃性质保持不变。

英文摘要

We analyze a continuous-time preemption game with shared catastrophic externalities. When the cost of catastrophe is embedded in both players' payoffs, the risk term cancels out in the equilibrium indifference condition. This creates a "suicide region" where competitive pressures force rational agents to deploy despite negative risk-adjusted net present values. We apply this framework to the race for artificial general intelligence (AGI). We show that this suicide region widens as the cost of systemic ruin grows: higher catastrophic risk does not deter the race but instead enlarges the set of conditions under which rational actors deploy despite negative social value. We characterize the resulting welfare distortion against a social planner's benchmark and demonstrate how two complementary mechanisms - private liability and prize-sharing - can close the suicide region. Private liability raises the cost of unsafe deployment while prize-sharing reduces the strategic imperative to deploy first. "Warning shots" (sub-existential disasters) will fail to deter AGI acceleration, as the winner-takes-all nature of the race remains intact.

2605.17628 2026-06-02 quant-ph math.OC q-fin.PM

A Penalty-Free Pipeline for Direct Quantum-Annealer Portfolio Optimization

无惩罚流水线:直接量子退火器投资组合优化

Luis Lozano

AI总结 提出一种无惩罚流水线,通过移除基数约束惩罚项,直接采样仅含目标函数的QUBO,并采用经典确定性投影器强制基数约束,从而将链断裂率从71%-92%降至0.04%以下,解决了当前直接QPU执行中惩罚编码导致的不可行问题。

详情
AI中文摘要

基数约束的投资组合选择通常被表述为二次无约束二元优化(QUBO)并提交给量子处理单元(QPU)进行直接退火。我们证明,这种标准惩罚编码是当前D-Wave Pegasus和Zephyr硬件上直接QPU执行的限制因素。展开精确的基数惩罚会贡献一个密集的秩一项,使得逻辑交互图变得完全,无论协方差如何,在小型宇宙中产生83%的链断裂分数,在完整的四十九行业Fama-French宇宙中高达92%,并且在每个测试规模下零可行原始样本。拓扑感知稀疏化将链断裂降至接近零,但任何移除非对角条目的稀疏化器也会稀释基数约束;消融实验表明,这种稀疏化-投影流水线主要由经典投影器主导,而非QPU。我们建议完全移除惩罚:在硬件上采样一个仅包含目标函数的QUBO,该QUBO由期望收益和风险缩放协方差构建,并通过确定性可行性投影器经典地强制执行基数约束。在实时Pegasus和Zephyr硬件上的4,468个保存的嵌入记录中,涵盖最多四十九个资产的股票和最多四十八个的足球博彩实例,这种无惩罚流水线将平均链断裂分数从71%-92%降低至最多0.04%,并且后处理遗憾相对于每个测试规模下的贪婪经典参考最多为0.03%。我们不声称量子优势;惩罚编码,而非稀疏硬件拓扑,是当前可访问规模下直接QPU投资组合优化的限制因素。

英文摘要

Cardinality-constrained portfolio selection is routinely cast as a quadratic unconstrained binary optimization (QUBO) and submitted to a quantum processing unit (QPU) for direct annealing. We show that this standard penalty encoding is the binding constraint for direct-QPU execution on current D-Wave Pegasus and Zephyr hardware. Expanding the exact cardinality penalty contributes a dense rank-one term that makes the logical interaction graph complete regardless of the covariance, producing chain-break fractions from 83% at small universes up to 92% at the full forty-nine-industry Fama--French universe, and zero feasible raw samples at every tested scale. Topology-aware sparsification reduces chain breaks to near zero, but any sparsifier that removes off-diagonal entries also dilutes the cardinality constraint; an ablation reveals that this sparsify-and-project pipeline is dominated by the classical projector, not the QPU. We propose removing the penalty entirely: sample an objective-only QUBO built from expected returns and the risk-scaled covariance on hardware, and enforce cardinality classically through a deterministic feasibility projector. Across 4,468 saved embedding records on live Pegasus and Zephyr hardware, spanning equities up to forty-nine assets and football-betting instances up to forty-eight, this penalty-free pipeline reduces mean chain-break fractions from 71%--92% down to at most 0.04%, and post-processed regret is at most 0.03% relative to greedy classical references at every tested scale. We do not claim quantum advantage; the penalty encoding, not the sparse hardware topology, is the limiting factor for direct-QPU portfolio optimization at currently accessible scales.

2603.05917 2026-06-02 cs.LG cs.AI q-fin.ST

Stock Market Prediction Using Node Transformer Architecture Integrated with BERT Sentiment Analysis

结合BERT情感分析的节点Transformer架构用于股票市场预测

Mohammad Al Ridhawi, Mahtab Haj Ali, Hussein Al Osman

AI总结 提出一种将节点Transformer与BERT情感分析相结合的框架,通过图结构建模股票间依赖关系并融合社交媒体情感,在S&P 500股票上实现0.80%的MAPE,显著优于传统方法。

详情
Journal ref
IEEE Access, vol. 14, pp. 72613-72631, 2026
Comments
18 pages, 5 figures, 12 tables. Accepted for publication in IEEE Access
AI中文摘要

股票市场预测对在噪声、非平稳和行为动态的复杂市场环境中操作的投资者、金融机构和政策制定者提出了相当大的挑战。传统的预测方法,包括基本面分析和技术指标,往往无法捕捉金融市场中固有的复杂模式和横截面依赖性。本文提出了一种结合节点Transformer架构与基于BERT的情感分析的集成框架,用于股票价格预测。该模型将股票市场表示为图结构,其中个股构成节点,边捕捉关系,包括行业隶属关系、相关价格变动和供应链连接。一个微调的BERT模型从社交媒体帖子中提取情感信息,并通过基于注意力的融合机制将其与定量市场特征相结合。节点Transformer处理历史市场数据,同时捕捉股票间的时间演变和横截面依赖性。在1982年1月至2025年3月期间20只S&P 500股票上进行的实验表明,集成模型在一天前预测中实现了0.80%的平均绝对百分比误差(MAPE),而ARIMA为1.20%,LSTM为1.00%。情感分析的加入使预测误差总体降低10%,在财报公告期间降低25%,而基于图的架构通过捕捉股票间依赖性额外贡献了15%的改进。方向准确率在一天预测中达到65%。通过配对t检验的统计验证确认了这些改进的显著性(所有比较p < 0.05)。该模型在高波动期保持较低的误差,MAPE为1.50%,而基线模型范围为1.60%至2.10%。

英文摘要

Stock market prediction presents considerable challenges for investors, financial institutions, and policymakers operating in complex market environments characterized by noise, non-stationarity, and behavioral dynamics. Traditional forecasting methods, including fundamental analysis and technical indicators, often fail to capture the intricate patterns and cross-sectional dependencies inherent in financial markets. This paper presents an integrated framework combining a node transformer architecture with BERT-based sentiment analysis for stock price forecasting. The proposed model represents the stock market as a graph structure where individual stocks form nodes and edges capture relationships including sectoral affiliations, correlated price movements, and supply chain connections. A fine-tuned BERT model extracts sentiment information from social media posts and combines it with quantitative market features through attention-based fusion mechanisms. The node transformer processes historical market data while capturing both temporal evolution and cross-sectional dependencies among stocks. Experiments conducted on 20 S&P 500 stocks spanning January 1982 to March 2025 demonstrate that the integrated model achieves a mean absolute percentage error (MAPE) of 0.80% for one-day-ahead predictions, compared to 1.20% for ARIMA and 1.00% for LSTM. The inclusion of sentiment analysis reduces prediction error by 10% overall and 25% during earnings announcements, while the graph-based architecture contributes an additional 15% improvement by capturing inter-stock dependencies. Directional accuracy reaches 65% for one-day forecasts. Statistical validation through paired t-tests confirms the significance of these improvements (p < 0.05 for all comparisons). The model maintains lower error during high-volatility periods, achieving MAPE of 1.50% while baseline models range from 1.60% to 2.10%.