How to spot outliers: an Ensemble Anomaly Detection Framework
如何发现异常值:一种集成异常检测框架
Daniil Peysakhovich, Rafał Sieradzki
AI总结 针对风险估值输出中的异常问题,提出集成质量评估框架(EQAF),结合多种无监督异常检测方法,在信用衍生品数据上实现F1分数61-79%,优于最佳单一方法(6-66%),并揭示纯统计方法无法检测冻结馈送异常。
详情
由数据馈送失败、模型配置错误或系统故障引起的风险估值输出错误可能通过投资银行的风险基础设施未被检测地传播,并产生重大操作损失。利用一家全球大型投资银行涵盖129个交易日183笔交易的专有每日信用衍生品数据,我们设计、实施并实证评估了集成质量评估框架(EQAF),这是一种分层无监督架构,结合互补的异常检测方法,实时监控风险计算完整性。通过使用八种操作现实场景的受控异常注入协议,我们表明校准后的集成在四个不同风险度量数据集上实现了61-79%的F1分数,显著优于最佳单一方法(6-66%)。AUC-ROC提高4-6个百分点证实了这种优势对阈值选择具有鲁棒性。我们进一步证明,纯统计检测方法系统地无法识别冻结值异常,这是一类冻结馈送错误,其中估值输出与先前观测相同,因此与正常数据无法区分,并且领域特定的确定性规则在架构上是不可或缺的。这些发现对巴塞尔III和交易账簿基本审查(FRTB)下的模型风险管理具有直接影响,其中对内部风险模型的自动化和可审计质量控制要求日益增加。
Errors in risk valuation outputs arising from data-feed failures, model misconfiguration, or system malfunctions can propagate undetected through an investment bank's risk infrastructure and generate material operational losses. Using proprietary daily credit-derivatives data from a major global investment bank covering 183 trades across 129 trading days, we design, implement, and empirically evaluate the Ensemble Quality Assessment Framework (EQAF), a layered unsupervised architecture that combines complementary outlier-detection methods to monitor risk calculation integrity in real time. Using a controlled anomaly-injection protocol with eight operationally realistic scenarios, we show that the calibrated ensemble achieves F1 scores of 61-79%, substantially outperforming the best individual method (6-66%) across four distinct risk-measure datasets. Improvements of 4-6 percentage points in AUC-ROC confirm that this advantage is robust to threshold selection. We further demonstrate that purely statistical detection methods systematically fail to identify stale-value anomalies, a class of frozen-feed errors in which valuation outputs are identical to prior observations and therefore indistinguishable from normal data, and that domain-specific deterministic rules are architecturally indispensable. These findings have direct implications for model risk management under Basel III and the Fundamental Review of the Trading Book (FRTB), where automated and auditable quality controls for internal risk models are increasingly required.