Generating Financial Time Series by Matching Random Convolutional Features
通过匹配随机卷积特征生成金融时间序列
Konrad J. Mueller, Nikita Zozoulenko, Ben Wood, Thomas Cass, Lukas Gonon
AI总结 提出SOCK(软竞争核)可微随机卷积特征图,通过匹配真实与生成时间序列的随机卷积特征来训练生成器,在小样本金融数据集上优于签名和扩散基线方法。
详情
生成逼真的金融时间序列具有挑战性,因为训练数据通常仅限于单个历史路径。在如此稀缺的数据下,过拟合难以避免,尤其是在对抗训练中,训练好的判别器可能记忆训练样本。为了缓解这一问题,近期的方法训练生成器以最小化真实与生成时间序列的未训练特征表示之间的差异。在这些工作中,特征图基于路径签名,而路径签名在可处理的截断深度下可能无法捕捉相关的时间序列属性。在本工作中,我们通过匹配真实与生成时间序列的随机卷积特征来训练生成器。现有的随机卷积特征图,如Rocket和Hydra,已被证明能为真实世界的时间序列提供信息丰富的表示,但由于不可微,无法监督生成模型。我们引入了SOCK(软竞争核),一种完全可微的随机卷积特征图,适用于训练生成时间序列模型。我们表明,通过匹配随机SOCK特征训练的生成器在多种小样本金融数据集上始终优于签名和扩散基线。我们进一步在双样本假设检验和时间序列分类任务中展示了SOCK的表达能力,在这些任务中SOCK匹配或超越了现有的无监督特征图。
Generating realistic financial time series is challenging as training data is often limited to a single historical path. With such scarce data, overfitting is hard to avoid, especially under adversarial training where a trained discriminator can memorize the training samples. To mitigate this, recent approaches train generators to minimize the discrepancy between untrained feature representations of real and generated time series. In these works, the feature maps are based on path signatures, which can fail to capture relevant time series properties at tractable truncation depths. In this work, we instead train generators by matching random convolutional features of real and generated time series. Existing random convolutional feature maps, such as Rocket and Hydra, have been shown to provide informative representations of real-world time series, but cannot supervise generative models because they are non-differentiable. We introduce SOCK (SOft Competing Kernels), a fully differentiable random convolutional feature map, suited to train generative time series models. We show that generators trained by matching random SOCK features consistently outperform signature and diffusion baselines across a wide range of small-sample financial datasets. We further demonstrate SOCK's expressiveness on two-sample hypothesis testing and time series classification tasks, where SOCK matches or outperforms existing unsupervised feature maps.