arXivDaily arXiv每日学术速递 周一至周五更新
重置
Q-FIN定量金融17
2606.11962 2026-06-11 stat.ME q-fin.ST stat.CO 新提交

Composite likelihood inference of fractional Gaussian processes with sequentially optimal subset selection

具有顺序最优子集选择的分数高斯过程的复合似然推断

Mathis Fourreau, Matthieu Garcin

AI总结 针对分数高斯过程,提出通过顺序最大化Godambe信息来选择子集,以平衡估计精度与计算成本,并推导了Fisher信息和Godambe信息的理论表达式。

详情
AI中文摘要

复合似然方法通过考虑观测的几个子集而非全部来降低时间序列参数估计的计算成本。该方法的渐近性质与Godambe信息相关,Godambe信息是Fisher信息的扩展,考虑了观测子集之间的依赖性。我们旨在将该方法应用于线性高斯模型,特别是分数布朗运动和分数高斯噪声。我们推导了其Fisher信息和Godambe信息的理论表达式,并推导出一种顺序最大化Godambe信息的子集选择设计。子集的大小使我们能够控制估计精度与计算成本之间的权衡。通过模拟,我们将该方法与矩方法和最大似然估计进行比较,并将其应用于真实数据,即股票指数的波动率序列和风速时间序列。

英文摘要

The composite likelihood method reduces the computational cost of parameter estimation in time series by considering several subsets of observations instead of all observations at once. The asymptotic properties of this method are related to the Godambe information, an extension of the Fisher information that accounts for the dependence between subsets of observations. We aim to apply this method to linear Gaussian models, in particular fractional Brownian motion and fractional Gaussian noise. We derive theoretical expressions for their Fisher information and their Godambe information and deduce a subset selection design that sequentially maximizes the Godambe information. The size of the subsets then allows us to control the trade-off between estimation accuracy and computational cost. Through simulations, we compare this method with the method of moments and maximum likelihood estimation, and we apply it to real data, namely volatility series of a stock index and a wind speed time series.

2606.11859 2026-06-11 q-fin.ST q-fin.RM 新提交

Scenario Generation for Time Series and Curves: A Comparison of Nonparametric and Semiparametric Bootstrap

时间序列与曲线的场景生成:非参数与半参数自助法的比较

Nicola Baldoni, Michele Sparviero, Lorenzo Viola

AI总结 针对金融时间序列场景生成中非参数自举法产生不现实路径的问题,本文综述了结合参数结构与残差重采样的半参数方法,并在利率与收益率曲线模拟中验证其有效性。

详情
Comments
25 pages, 6 figures, 11 tables
AI中文摘要

为资产类别生成随机轨迹是量化金融中日益重要的任务。传统方法(如平稳自举法)通过构造保留了资产类别收益的经验分布,但不能确保每个模拟路径在经济上现实:场景可能在分布上有效,而单个轨迹未能代表世界的合理状态。为解决这一局限性,我们回顾了半参数模拟方法,这些方法将强制实现现实动态的参数结构与模型残差的重采样相结合,从而保留历史数据中观察到的随机成分。对于利率而言,这一问题尤为突出,因为对利率变化的直接重采样可能产生不合理的收益率曲线演变,尽管分布性质正确。我们的实证分析显示了基于自回归或均值回复设定的半参数自举法的有效性。在固定收益环境中,将这些方法与完全参数化的期限结构模型相结合,可以产生更一致且现实的收益率曲线动态模拟。

英文摘要

Generating stochastic trajectories for asset classes is an increasingly relevant task in quantitative finance. Traditional approaches, such as the stationary bootstrap, preserve by construction the empirical distribution of asset-class returns, but do not ensure that each individual simulated path is economically realistic: scenarios may be valid in distribution while single trajectories fail to represent plausible states of the world. To address this limitation, we review semiparametric simulation methodologies that combine a parametric structure, which enforces realistic dynamics, with the resampling of model residuals, which preserves the stochastic component observed in historical data. The issue is particularly acute for interest rates, where direct resampling of rate changes may produce implausible yield-curve evolutions despite correct distributional properties. Our empirical analysis shows the effectiveness of semiparametric bootstrap methods based on autoregressive or mean-reverting specifications. In the fixed-income setting, combining these methods with fully parametric term-structure models yields more coherent and realistic simulations of yield-curve dynamics.

2606.11798 2026-06-11 q-fin.CP cs.LG math.OC 新提交

Deterministic Policy Gradient for Learning Equilibrium in Time-Inconsistent Control Problems

时间不一致控制问题中学习均衡的确定性策略梯度

Xin Guo, Yijie Huang, Xiang Yu

AI总结 提出一种连续时间无模型强化学习算法,通过确定性策略梯度和内定点迭代学习时间不一致控制问题的均衡策略,并在均值-方差投资组合和非指数贴现跟踪投资组合中验证有效性。

详情
Comments
Keywords: Time-inconsistent control, two-stage reformulation, model-free continuous-time reinforcement learning, deterministic policy gradient, fixed point iteration
AI中文摘要

在本文中,我们开发了一种连续时间无模型强化学习算法,用于学习一般时间不一致控制问题中的确定性均衡策略。利用扩展的Hamilton-Jacobi-Bellman系统,我们将原始时间不一致问题转化为一个等价的两阶段问题。在第一阶段,对于给定的辅助函数,我们采用确定性策略梯度方法在辅助的时间一致控制问题中学习最优策略。在第二阶段,给定更新后的策略,我们利用内定点迭代和某些鞅特征来学习辅助函数。作为理论贡献,我们提供了一些温和的模型假设,并建立了内定点迭代的收敛性。通过在两阶段之间重复这种演员-评论家风格的迭代,我们的算法旨在以统一的方式学习不同时间不一致性来源下的均衡。该算法在两种经典的时间不一致金融应用中的优越有效性得到了说明:均值-方差投资组合管理和非指数贴现下的最优跟踪投资组合。

英文摘要

In this paper, we develop a continuous-time model-free reinforcement learning algorithm to learn deterministic equilibrium policies in general time-inconsistent control problems. Utilizing the extended Hamilton-Jacobi-Bellman system, we recast the original time-inconsistent problem into an equivalent two-stage problem. In the first stage, for given auxiliary functions, we employ the deterministic policy gradient approach to learn an optimal policy in an auxiliary time-consistent control problem. In the second stage, given the updated policy, we exploit the inner fixed point iterations and some martingale characterizations to learn the auxiliary functions. As a theoretical contribution, we provide some mild model assumptions and establish the convergence of inner fixed point iterations. By repeating this actor-critic style of iterations across two stages, our algorithm aims to learn the equilibrium under different sources of time-inconsistency in a unified manner. The superior effectiveness of the proposed algorithm are illustrated in two classical financial applications with time-inconsistency: mean-variance portfolio management and optimal tracking portfolio under non-exponential discounting.

2606.11318 2026-06-11 q-fin.PM 新提交

Mean-Variance Optimization in Ambiguous Financial Markets with Learning

带学习的模糊金融市场中的均值-方差优化

Nicole Bäuerle, Anne MacKay

AI总结 针对多资产Black-Scholes市场中漂移未知且存在模型模糊性的问题,提出一种考虑平滑模糊厌恶的均值-方差准则,通过新颖方法得到允许学习的动态最优投资策略,并数值分析了参数影响。

详情
AI中文摘要

我们考虑一个多资产Black-Scholes市场中的连续时间投资问题,具有以下特征:资产的漂移未知,构成模型模糊性的来源。然而,存在关于可能漂移的先验分布(知识)。我们的投资者是模糊厌恶的,并希望最大化终端财富的均值-方差准则,其中模糊厌恶以平滑的方式纳入。我们考虑Maccheroni等人2013年引入的准则,其中方差被分解,每个部分被赋予不同的权重,以考虑不同水平的市场风险和模型模糊厌恶。我们使用一种新颖的方法,在允许学习的适应策略类中寻找最优动态投资策略。我们还提供了一些数值结果,有助于理解模型参数如何影响最优投资策略。总的来说,结果表明模糊厌恶的投资者在风险资产上的投资较少。

英文摘要

We consider a continuous time investment problem in a multi-asset Black-Scholes market with the following features: The assets' drifts are not known and constitute a source of model ambiguity. However, there is a prior distribution (knowledge) on the possible drifts. Our investor is ambiguity averse and wants to maximize a mean-variance criterion for the terminal wealth where ambiguity aversion is incorporated in a smooth way. We consider here the criterion introduced in Maccheroni et al. 2013 where the variance is decomposed and each part is weighted differently to account for different levels of market risk and model ambiguity aversion. We use a novel approach to find the optimal dynamic investment strategy within the class of all adapted strategies which allow for learning. We also present a number of numerical results which help to understand how the model parameters affect the optimal investment strategy. In general it turns out that ambiguity averse investors invest less in the risky assets.

2606.11238 2026-06-11 q-fin.GN cs.AI 新提交

Artificial Intelligence in Ship Finance: Applications, Opportunities, and a Case Study in AI-Augmented Loan Origination

人工智能在船舶金融中的应用:机遇与AI增强贷款发起的案例研究

Lasse Dierich, Orestis Schinas

AI总结 本文探讨AI在船舶金融中的应用,提出基于大语言模型的模块化架构,用于文档理解、信息提取和工作流自动化,以支持贷款申请流程。

详情
Comments
9 pages, 1 figure
AI中文摘要

船舶金融是资产担保贷款中数据密集且文档繁重的领域,需要整合来自异构且高度非结构化来源的财务、技术、合同和监管信息。日益严格的环境法规和ESG报告要求进一步增加了承销和贷款发起流程的复杂性。人工智能(AI)的最新进展,特别是大语言模型(LLMs),为处理和分析此类信息创造了新的机遇。本文回顾了AI在船舶金融中的潜在应用,特别关注基于LLM的系统用于文档理解、信息提取和工作流自动化。我们提出了this http URL,一个模块化代理架构,用于支持船舶金融中的贷款申请工作流。所提出的系统结合了基于LLM的提取模块、财务分析组件、外部海事数据服务以及带有聊天机器人界面的受控文档生成模块,以支持标准化融资申请的准备工作。本文讨论了在生产中使用此类模型的关键挑战。我们认为,AI辅助系统可以支持海事金融专业人士管理日益复杂的信息和报告要求。

英文摘要

Ship finance is a data-intensive and document-heavy segment of asset-based lending, requiring the integration of financial, technical, contractual, and regulatory information from heterogeneous and largely unstructured sources. Increasing environmental regulation and ESG reporting requirements are adding further complexity to underwriting and loan-origination processes. Recent advances in artificial intelligence (AI), particularly large language models (LLMs), create new opportunities for processing and analysing such information. This paper reviews potential applications of AI in ship finance, with a particular focus on LLM-based systems for document comprehension, information extraction, and workflow automation. We present this http URL, a modular agentic architecture to support loan application workflows in ship finance. The proposed system combines an LLM-based extraction module, financial analysis components, external maritime data services, and a controlled document-generation module with a chatbot interface to support the preparation of standardized financing applications. The paper discusses the key challenges for using such models in production. We argue that AI-assisted systems can support maritime finance professionals in managing increasingly complex information and reporting requirements.

2606.11237 2026-06-11 q-fin.PR math.PR 新提交

A Hybrid LSMC-PDE Method for Bermudan Option Pricing under the Gatheral Double Mean-Reverting Model

Gatheral双均值回复模型下百慕大期权定价的混合LSMC-PDE方法

Mara Kalicanin Dimitrov, Ying Ni

AI总结 针对Gatheral双均值回复随机波动率模型,提出混合最小二乘蒙特卡洛-偏微分方程方法,通过条件模拟和傅里叶变换降维,实现百慕大期权高效定价。

详情
AI中文摘要

我们研究了Gatheral双均值回复(GDMR)随机波动率模型下的百慕大期权定价。该模型包含一个方差过程以及一个随机长期均值方差过程,并在扩散系数中允许常弹性方差(CEV)型指数。该模型具有吸引力,因为它为波动率动态提供了灵活的规范。然而,文献中关于GDMR模型下早期行权衍生品定价的研究仍基本空白。为应对这一挑战,我们将混合最小二乘蒙特卡洛-偏微分方程(LSMC-PDE)框架应用于GDMR模型,并提供了详细的模型特定实现。在模拟方差路径的条件下,定价问题简化为资产价格的一维问题,通过基于傅里叶的方法求解,而对方差变量的剩余依赖通过最小二乘回归近似。我们的数值实验表明,混合LSMC-PDE方法能产生准确的定价估计,并且通常比普通LSMC具有更低的定价误差,特别是在低和中等模拟路径数下,显示了在早期行权期权定价中利用模型结构的好处。

英文摘要

We study Bermudan option pricing under the Gatheral Double Mean-Reverting (GDMR) stochastic volatility model. The model features a variance process together with a stochastic long-run mean variance process and allows Constant Elasticity of Variance (CEV)-type exponents in the diffusion coefficients. This model is attractive since it provides a flexible specification for volatility dynamics. However, the pricing of early-exercise derivatives under the GDMR model remains largely unexplored in the literature. To address this challenge, we adapt a Hybrid Least-Squares Monte Carlo-Partial Differential Equation (LSMC-PDE) framework to the GDMR model and provide a detailed model-specific implementation. Conditioning on simulated variance paths, the pricing problem reduces to a one-dimensional problem in the asset price, which is solved by a Fourier-based approach, while the remaining dependence on the variance variables is approximated by least-squares regression. Our numerical experiments demonstrate that the Hybrid LSMC-PDE approach yields accurate pricing estimates and often lower pricing errors than plain LSMC, particularly for low and moderate numbers of simulation paths, showing the benefit of using the model structure in early-exercise option pricing.

2606.11223 2026-06-11 q-fin.CP cs.FL 新提交

Scenario Constraints with Memory: A Finite-State Approach to Quantitative Financial Analysis

带记忆的场景约束:一种面向定量金融分析的有限状态方法

Vitaly Nürnberg

AI总结 提出基于事件历史自动机(EHA)和加权金融有限自动机(WFFA)的定量框架,通过同步乘积计算极端收益边界,并提取可解释的见证事件序列,用于金融系统的精确极值分析。

详情
AI中文摘要

在复杂市场场景下量化最坏和最佳性能是金融风险管理及路径依赖金融工具(如奇异期权和结构化产品)验证中的持续挑战。基于模拟的方法适用于概率估计,但无法直接对所有可行场景提供穷尽保证或显式给出极端结果的见证。为解决这一问题,我们引入了一种基于定量自动机的框架,用于在声明性场景约束下对金融系统进行精确极值分析。该框架的核心是事件历史自动机(EHAs),一种新的形式化模型,它将正则表达式事件模式与可行数值区间相结合,以表示带记忆的受约束事件历史。定量收益由加权金融有限自动机(WFFAs)表示,其转移权重依赖于观测到的市场价值。通过计算EHAs和WFFAs的同步乘积,我们的框架能够精确计算收益的上界和下界。此外,该方法自动提取可解释的见证事件历史,这些历史实现了这些极端结果。我们通过一个具有路径依赖机制的自动赎回结构化产品的案例研究,展示了该方法的实际可行性。该案例研究分析了不同场景约束如何影响票息累积、提前赎回和保护损失结果。可扩展性实验表明,对于实际合同期限和非平凡约束配置,该框架的执行在计算上是可行的。总体而言,该方法为标准金融模拟方法提供了数学上严格的补充。

英文摘要

Quantifying worst-case and best-case performance under complex market scenarios is a persistent challenge in financial risk management and the verification of path-dependent financial instruments, such as exotic options and structured products. Simulation-based methods are well suited for probabilistic estimation, but they do not directly provide exhaustive guarantees over all admissible scenarios or explicit witnesses for extremal outcomes. To address this, we introduce a quantitative automata-based framework for the exact extremal analysis of financial systems under declarative scenario constraints. At the core of our approach are event history automata (EHAs), a new formal model that integrates regular-expression event patterns with admissible numerical intervals to represent constrained event histories with memory. Quantitative payoffs are represented by weighted finance finite automata (WFFAs), which allow transition weights to depend on observed market values. By computing the synchronized product of EHAs and WFFAs, our framework enables the exact calculation of upper and lower payoff bounds. Furthermore, the method automatically extracts interpretable witness event histories that realize these extremal outcomes. We demonstrate the practical viability of the approach through a case study of an autocallable structured product with path-dependent mechanisms. The case study analyzes how different scenario constraints affect coupon accumulation, early redemption, and protection-loss outcomes. Scalability experiments indicate that the framework's execution remains computationally feasible for practical contract horizons and nontrivial constraint configurations. Overall, this approach provides a mathematically rigorous complement to standard financial simulation methods.

2606.01650 2026-06-11 q-fin.PM q-fin.TR stat.AP stat.ME 版本更新

Post Selection Estimation of Sharpe Ratios

夏普比率的事后选择估计

Steven E. Pav

AI总结 针对从众多资产中选择具有最高样本内夏普比率的资产,研究基于多面体引理、James-Stein收缩、期望最大夏普比率去偏、阈值法和经验贝叶斯的估计器,并通过模拟评估其偏差、均方根误差和秩相关性。

详情
AI中文摘要

我们考虑估计一个资产的真实夏普比率的问题,该资产因在众多资产中具有最高的样本内夏普比率而被选中。我们讨论了基于多面体引理、James-Stein收缩、期望最大夏普比率去偏、阈值法和经验贝叶斯的估计器。我们在模拟中测试了这些估计器,计算了不同样本量、资产数量以及总体夏普比率的分布范围和形状下的偏差和均方根误差。我们还计算了估计器与潜在真实值的秩相关性,模拟了这些估计器如何用于比较或排序执行此选择过程的不同团队的结果。我们发现James-Stein估计器在相关参数的许多不同实际值下提供了最佳性能,其次是Jiang和Zhang的GMLEB估计器。这些结果对资产收益的相关性相当稳健,但有一些注意事项。

英文摘要

We consider the problem of estimating the true Sharpe ratio of an asset selected for having the highest observed in-sample Sharpe ratio among many assets. We discuss estimators based on the polyhedral lemma, James Stein shrinkage, debiasing the expected maximum Sharpe ratio, thresholding and empirical Bayes. We test these estimators in simulations, computing bias and root mean square error across different values of sample size, number of assets, and spread and shape of population Sharpe ratios. We also compute rank correlation of the estimators against the underlying quantity, simulating how these estimators might be used to compare or rank the output of different teams which perform this selection process. We find that the James Stein estimator provides the best performance across many different realistic values of the relevant parameters, followed by the GMLEB estimator of Jiang and Zhang. These results are fairly robust to correlation of asset returns, with some caveats.

2605.18343 2026-06-11 q-fin.CP q-fin.PR 版本更新

Explicit Rational Formulae for Bachelier (Normal) Implied Volatility

Bachelier(正态)隐含波动率的显式有理公式

Fabien Le Floc'h

AI总结 提出两个无需迭代的显式有理公式,通过期权价格、远期、行权价和到期时间直接计算Bachelier隐含波动率,精度接近机器精度。

详情
AI中文摘要

我们提出了两个用于Bachelier(或正态)隐含波动率的显式有理公式。这些公式以期权价格、远期、行权价和到期时间为输入,无需迭代即可返回隐含正态波动率。它们遵循LFK-4的分支结构,但在近价区域使用了更简单的变量,即远期-行权价绝对差除以尾部时间价值,避免了该区域的对数和小参数泰勒分支。LFK-2026是面向精度的公式,在远尾区域直接近似倒数绝对标准化货币度。LFK-2026C保持相同的平移虚值有理尾近似,但将近价分支拆分为一个非常小的低u有理分支和一个中程有理分支。在双精度测试中,两者均保持接近机器精度,而LFK-2026C在当前基准混合上是更快的标量实现。

英文摘要

We present two explicit rational formulae for Bachelier, or normal, implied volatility. The formulae take the option price, forward, strike, and expiry as inputs and return the implied normal volatility without iteration. They follow the branch structure of LFK-4, but use the simpler near-the-money variable given by the absolute forward-strike difference divided by the tail time value, avoiding a logarithm and a small-argument Taylor branch in that region. LFK-2026 is the accuracy-oriented formula and approximates reciprocal absolute standardized moneyness directly in the far tail. LFK-2026C keeps the same shifted out-of-the-money rational tail approximation, but splits the near-the-money branch two low degree rationals. In double precision tests both remain close to machine accuracy, while LFK-2026C is the faster scalar implementation on the current benchmark mix.

2411.19444 2026-06-11 q-fin.MF math.PR q-fin.ST 版本更新

Capital Asset Pricing Model with Size Factor and Normalizing by Volatility Index

包含规模因子的资本资产定价模型及波动率指数归一化

Abraham Atsiwo, Andrey Sarantsev

AI总结 本文在CAPM中引入规模效应,并用波动率指数归一化收益率,构建包含波动率、相对规模和CAPM的离散时间模型,通过实际数据拟合证明长期稳定性,并与随机投资组合理论关联。

详情
Comments
18 pages, 2 tables, 4 figures. Keywords: Capital Asset Pricing Model, stochastic volatility, ergodic Markov process, stationary distribution, size effect, autoregression, capital distribution curve
AI中文摘要

资本资产定价模型(CAPM)将一个充分分散的股票投资组合与一个基准投资组合联系起来。我们在CAPM中引入规模效应,捕捉到小盘股平均而言比大盘股具有更高风险和收益的观察结果。对于某些基于规模的股票投资组合,将其收益率除以波动率指数可使它们更接近独立正态分布。在本文中,我们结合这些想法创建了一个新的离散时间模型,该模型包含波动率、相对规模和CAPM。我们使用真实世界数据拟合该模型,证明其长期稳定性,并将这项研究与随机投资组合理论联系起来。我们填补了之前关于包含规模因子的CAPM文章中的重要空白。

英文摘要

The Capital Asset Pricing Model (CAPM) relates a well-diversified stock portfolio to a benchmark portfolio. We insert size effect in CAPM, capturing the observation that small stocks have higher risk and return than large stocks, on average. For some size-based stock portfolios, dividing their returns by the Volatility Index makes them closer to independent and normal. In this article, we combine these ideas to create a new discrete-time model, which includes volatility, relative size, and CAPM. We fit this model using real-world data, prove the long-term stability, and connect this research to Stochastic Portfolio Theory. We fill important gaps in our previous article on CAPM with the size factor.

2603.19225 2026-06-11 cs.CE cs.AI cs.CL cs.IR q-fin.CP 版本更新

FinTradeBench: A Financial Reasoning Benchmark for LLMs

FinTradeBench: 面向LLM的金融推理基准

Yogesh Agrawal, Aniruddha Dutta, Md Mahadi Hasan, Santu Karmaker, Aritra Dutta

AI总结 提出FinTradeBench基准,通过结合公司基本面与交易信号,评估大语言模型在金融推理中的表现,发现检索增强对数值和时间序列推理帮助有限。

详情
Comments
9 pages main text, 31 pages total (including references and appendix). 5 figures, 16 tables. Preprint under review. Code and data will be made available upon publication
AI中文摘要

现实世界的金融决策是一个具有挑战性的问题,需要对异构信号进行推理,包括从监管文件中提取的公司基本面和从价格动态计算出的交易信号。最近,随着大语言模型(LLM)的进步,金融分析师开始将它们用于金融决策任务。然而,现有的用于测试这些模型的金融问答基准主要关注公司资产负债表数据,很少评估关于公司股票如何在市场中交易或它们与基本面相互作用的推理。为了利用这两种方法的优势,我们引入了FinTradeBench,这是一个评估金融推理的基准,它整合了公司基本面和交易信号。FinTradeBench包含1400个问题,这些问题基于纳斯达克-100公司十年历史窗口的数据。该基准分为三个推理类别:基本面聚焦、交易信号聚焦以及需要跨信号推理的混合问题。为了确保大规模可靠性,我们采用了一个校准然后扩展的框架,该框架结合了专家种子问题、多模型响应生成、模型内自过滤、数值审计以及人类-LLM判断对齐。我们在零样本提示和检索增强设置下评估了14个LLM,并观察到了明显的性能差距。检索显著改善了对文本基本面的推理,但对交易信号推理的益处有限。这些发现突显了当前LLM在数值和时间序列推理方面的根本性挑战,并激励了未来在金融智能方面的研究。

英文摘要

Real-world financial decision-making is a challenging problem that requires reasoning over heterogeneous signals, including company fundamentals derived from regulatory filings and trading signals computed from price dynamics. Recently, with advances in Large Language Models (LLMs), financial analysts have begun to use them for financial decision-making tasks. However, existing financial question-answering benchmarks for testing these models primarily focus on company balance sheet data and rarely evaluate reasoning about how company stocks trade in the market or their interactions with fundamentals. To leverage the strengths of both approaches, we introduce FinTradeBench, a benchmark for evaluating financial reasoning that integrates company fundamentals and trading signals. FinTradeBench contains 1,400 questions grounded in NASDAQ-100 companies over a ten-year historical window. The benchmark is organized into three reasoning categories: fundamentals-focused, trading-signal-focused, and hybrid questions requiring cross-signal reasoning. To ensure reliability at scale, we adopt a calibration-then-scaling framework that combines expert seed questions, multi-model response generation, intra-model self-filtering, numerical auditing, and human-LLM judge alignment. We evaluate 14 LLMs under zero-shot prompting and retrieval-augmented settings and witness a clear performance gap. Retrieval substantially improves reasoning over textual fundamentals, but provides limited benefit for trading-signal reasoning. These findings highlight fundamental challenges in the numerical and time-series reasoning for current LLMs and motivate future research in financial intelligence.

1911.04090 2026-06-11 stat.ME q-fin.PM 版本更新

A post hoc test on the Sharpe ratio

夏普比率的事后检验

Steven E. Pav

AI总结 提出一种夏普比率的事后检验方法,类似于Tukey检验,用于在拒绝所有总体信噪比相等的假设后,比较资产夏普比率的差异。

详情
AI中文摘要

我们描述了一种针对夏普比率的事后检验,类似于Tukey检验用于均值的两两相等性检验。该检验可以在拒绝所有总体信噪比相等的假设后应用。该检验适用于资产收益间具有简单相关结构的情形。模拟表明,该检验在广泛条件下维持名义第一类错误率,并在合理备择假设下具有中等功效。

英文摘要

We describe a post hoc test for the Sharpe ratio, analogous to Tukey's test for pairwise equality of means. The test can be applied after rejection of the hypothesis that all population Signal-Noise ratios are equal. The test is applicable under a simple correlation structure among asset returns. Simulations indicate the test maintains nominal type I rate under a wide range of conditions and is moderately powerful under reasonable alternatives.

2504.06717 2026-06-11 q-fin.TR 版本更新

Optimal Execution and Macroscopic Market Making

最优执行与宏观市场做市

Ivan Guo, Shijia Jin

AI总结 提出一个随机博弈模型,刻画做市商与交易者之间的策略互动,通过FBSDEs刻画纳什均衡,并在特定模型中建立全局适定性,模拟揭示报价与策略订单负相关。

详情
Comments
42 pages
AI中文摘要

我们提出了一个随机博弈模型,模拟做市商与交易者之间的策略互动。从交易者的角度来看,传统的外生永久价格影响被做市商的内生报价策略所取代。相反,从做市商的角度来看,订单流不再假设为外生,而是由策略交易者内生驱动。通过前向-后向随机微分方程(FBSDEs)刻画纳什均衡,我们建立了该一般博弈的局部适定性结果。对于特定的“Almgren-Chriss-Avellaneda-Stoikov”模型,解耦方法通过将其简化为具有$M_+$-矩阵系数的后向随机Riccati方程,保证了FBSDEs的全局适定性。最后,通过将小扩散项引入库存过程作为一般博弈的近似,我们建立了其全局适定性。模拟揭示了报价与策略订单之间的负相关,这与报价与噪声订单之间的正相关形成对比。

英文摘要

We propose a stochastic game modelling the strategic interaction between market makers and traders. From the trader's perspective, the conventional exogenous permanent price impact is replaced by the endogenous quoting strategies of the market makers. Conversely, from the market maker's perspective, order flows are no longer assumed to be exogenous, but are driven endogenously by the strategic traders. Characterizing the Nash equilibria via forward-backward stochastic differential equations (FBSDEs), we establish a local well-posedness result for the general game. For the specific `Almgren-Chriss-Avellaneda-Stoikov' model, the decoupling approach guarantees the global well-posedness of the FBSDEs by reducing it to a backward stochastic Riccati equation with $M_+$-matrix coefficients. Finally, by introducing small diffusion terms into the inventory processes as an approximation to the general game, we establish its global well-posedness. Simulations reveal a negative correlation between quotes and strategic orders, in contrast to the positive correlation observed between quotes and noise orders.

2411.13579 2026-06-11 q-fin.MF math.OC q-fin.PM 版本更新

Optimal portfolio under ratio-type periodic evaluation in stochastic factor models under convex trading constraints

凸交易约束下随机因子模型中基于比率型周期性评估的最优投资组合

Wenyuan Wang, Kaixin Yan, Xiang Yu

AI总结 研究凸交易约束下不完全随机因子模型中基于相邻财富比率周期性评估的无限期最优投资组合问题,通过辅助问题和对偶方法推导并验证了最优策略。

详情
Comments
Keywords: Periodic evaluation, relative portfolio performance, incomplete market, stochastic factor model, convex trading constraints, convex duality approach. This manuscript combines two previous preprints arXiv:2311.12517 and arXiv:2401.14672 into one paper with more general and improved results
AI中文摘要

本文研究了在具有凸交易约束的不完全随机因子模型中,一类周期性效用最大化问题在投资组合管理中的应用。投资组合的表现通过无限时间范围内相邻两个财富水平的相对比率进行周期性评估,体现了根据过去业绩动态调整投资决策的特点。在幂效用函数下,我们将原始无限期最优控制问题转化为一个修正效用函数下的辅助终端财富优化问题。为应对凸交易约束,我们进一步引入修正市场模型中的辅助无约束优化问题,并发展鞅对偶方法以建立对偶最小化子的存在性,从而通过其对偶表示获得最优无约束财富过程。借助辅助问题中的对偶结果、约束与无约束模型之间的关系以及一些不动点论证,我们推导并验证了原始问题在无限期上的最优约束投资组合过程。

英文摘要

This paper studies a type of periodic utility maximization problem for portfolio management in incomplete stochastic factor models with convex trading constraints. The portfolio performance is periodically evaluated on the relative ratio of two adjacent wealth levels over an infinite horizon, featuring the dynamic adjustments in portfolio decision according to past achievements. Under power utility, we transform the original infinite horizon optimal control problem into an auxiliary terminal wealth optimization problem under a modified utility function. To cope with the convex trading constraints, we further introduce an auxiliary unconstrained optimization problem in a modified market model and develop the martingale duality approach to establish the existence of the dual minimizer such that the optimal unconstrained wealth process can be obtained using the dual representation. With the help of the duality results in the auxiliary problems, the relationship between the constrained and unconstrained models as well as some fixed point arguments, we derive and verify the optimal constrained portfolio process for the original problem over an infinite horizon.

2509.24508 2026-06-11 econ.GN q-fin.EC

Identifying the post-pandemic determinants of low performing students in Latin America through Interpretable Machine Learning methods

通过可解释机器学习方法识别拉丁美洲后疫情时代低表现学生的决定因素

Marcos Delprato

AI总结 基于2022年PISA数据,使用堆叠模型和SHAP分析,识别拉丁美洲低表现学生的关键决定因素,发现少数语言、留级、无数字设备、贫困家庭、兼职工作及学校劣势是主要风险因素。

详情
Journal ref
Engineering Applications of Artificial Intelligence, 2026
Comments
48 pages, 13 figures
AI中文摘要

引言。拉丁美洲(LAC)学生未达到基本学习能力的比例很高,考虑到该地区深层次的结构性不平等和更大的疫情后学习损失,这令人担忧。在此背景下,本文旨在帮助识别低表现和表现不佳学生(低于2级)的决定因素。方法。基于2022年国际学生评估项目(PISA)中10个LAC国家的数据,使用集成二元分类模型的堆叠模型,并应用Shapley加法解释(SHAP)分析以实现可解释性,我们识别了影响低表现群体学生表现的关键因素。结果。我们发现,最有可能成为未达标学生的学生讲少数语言且曾留级,家中没有数字设备,来自贫困家庭,每周有一半时间打工赚钱,且其所在学校存在广泛劣势,如学校氛围差、信息和通信技术(ICT)基础设施薄弱以及教学质量差(仅三分之一的教师持有资格证书)。对于各国估计,我们发现排名靠前的因素的贡献模式相当一致,其中小学留级、家庭财富和教育ICT投入在10个国家中至少有8个进入前十名协变量。讨论。本文的研究结果有助于广泛研究识别和瞄准拉丁美洲教育系统中被落在后面的学生的策略。

英文摘要

Introduction. The high prevalence of students not achieving basic learning competencies in Latin America (LAC) is concerning, even more so considering the region's deep structural inequalities and the larger post-pandemic learning losses. Within this scenario, the paper aims to contribute to the identification of the determinants of bottom and low performers (below level 2). Methodology. Based on 2022 data from the Programme for International Student Assessment (PISA) for 10 LAC countries, and using a stacking model integrating binary classification models as well as by applying Shapley Additive Explanations (SHAP) analysis for interpretability, we identify critical factors impacting on the student performance across low performers groups. Results. We find that a student with the highest probability of being a not achiever speaks a minority language and had repeated, has no digital devices at home, comes from a poor family and works for payment half of the week, and the school the student attends has wide disadvantages such as bad school climate, weak Information and Communication Technology (ICT) infrastructure and poor teaching quality (only a third of teachers being certified). For countries' estimates, we find quite homogeneous patterns regarding the contribution of top ranked factors, with repetition at primary, household wealth, and educational ICT inputs being top ten ranked covariates in at least 8 out of the 10 total countries. Discussions. The paper findings contribute to the broad literature on strategies to identify and to target those most left behind in Latin American education systems.

2509.25353 2026-06-11 econ.GN q-fin.EC

Cognitive and non-cognitive efficiency gaps between private and public schools in the Latin America region-a hybrid DEA and machine learning approach based on PISA 2022

Marcos Delprato

详情
Journal ref
Cogent Education, 2026
Comments
30 pages, 7 figures
英文摘要

Latin America's education systems are fragmented and segregated, with substantial differences by school type. The concept of school efficiency (the ability of school to produce the maximum level of outputs given available resources) is policy relevant due to scarcity of resources in the region. Knowing whether private and public schools are making an efficient use of resources --and which are the leading drivers of efficiency-- is critical, even more so after the learning crisis brought by the COVID-19 pandemic. In this paper, relying on data of 2,034 schools and nine Latin American countries from PISA 2022, I offer new evidence on school efficiency (both on cognitive and non-cognitive dimensions) using Data Envelopment Analysis (DEA) by school type and, then, I estimate efficiency leading determinants through interpretable machine learning methods (IML). This hybrid DEA-IML approach allows to accommodate the issue of big data (jointly assessing several determinants of school efficiency). I find a cognitive efficiency gap of nearly 0.10 favouring private schools and of 0.045 for non-cognitive outcomes, and with a lower heterogeneity in private than public schools. For cognitive efficiency, leading determinants for the chance of a private school of being highly efficient are higher stock of books and PCs at home, lack of engagement in paid work and school's high autonomy; whereas low-efficient public schools are shaped by poor school climate, large rates of repetition, truancy and intensity of paid work, few books at home and increasing barriers for homework during the pandemic.

2503.23569 2026-06-11 econ.GN econ.EM q-fin.EC

Where the Trees Fall: Macroeconomic Forecasts for Forest-Reliant States

Andrew Crawley, Adam Daigneault, Jonathan Gendron

详情
Journal ref
Forest Policy and Economics 186 (2026) 103739
英文摘要

Several key states in various regions of the U.S. have experienced recent sawtimber as well as pulp and paper mill closures, which raises an important policy question: how have and will key macroeconomic and industry specific indicators within the U.S. forest sector likely to change over time? This study provides empirical evidence to support forest-sector policy design by using a vector error correction (VEC) model to forecast economic trends in three major industries - forestry and logging, wood manufacturing, and paper manufacturing - across six of the most forest-dependent states found by the location quotient (LQ) measure: Alabama, Arkansas, Maine, Mississippi, Oregon, and Wisconsin. Overall, the results suggest a general decline in employment and the number of firms in the forestry and logging industry as well as the paper manufacturing industry, while wood manufacturing is projected to see modest employment gains. These results also offer key insights for regional policymakers, industry leaders, and local economic development officials: communities dependent on timber-based manufacturing may be more resilient than other forestry-based industries in the face of economic disruptions. Our findings can help prioritize targeted policy interventions and inform regional economic resilience strategies. We show distinct differences across forest-dependent industries and/or state sectors and geographies, highlighting that policies may have to be specific to each sector and/or geographical area. Finally, our VEC modeling framework is adaptable to other resource-dependent industries that serve as regional economic pillars such as mining, agriculture, and energy production offering a transferable tool for policy analysis in regions with similar economic structures.