CIG-Bench: A Comprehensive Survey and Benchmark for AI-Driven Subsurface Imaging Understanding
CIG-Bench:AI驱动的地下成像理解的综合调查与基准
Yimin Dou, Xinming Wu, Hui Gao, Mingliang Liu, Tao Zhao, Zhi Zhong, Haibin Di, Min Jun Park, Robert G. Clapp, Zhixiang Guo, Long Han, Sergey Fomel
AI总结 本文系统回顾2015-2025年间652篇文献,将地下成像理解归纳为四大任务,总结三大挑战,并提出包含统一评估协议、预训练模型和混合数据集的社区基准CIG-Bench。
详情
地下成像理解连接观测地球物理数据与定量地质模型,支持油气勘探、CO2储存场地评估和地质灾害监测。过去十年,深度学习显著重塑了解释工作流程。为总结这一进展,我们系统分析了2015年至2025年的652篇出版物,并将该领域组织为四大主要任务:构造解释、地质体识别、地震相分析和属性估计。然而,地下成像解释与其他AI驱动任务根本不同,面临模糊信号、显著的解释非唯一性、稀疏语义、不确定的目标位置和稀缺的可靠标注。基于所综述的文献,我们总结了定义其前沿的三个相互关联的挑战:复杂地质条件下的解释、低信息密度下的跨测线语义泛化以及缺乏可靠基准。应对这些挑战将依赖于将人类专业知识、物理约束和地质先验整合到模型训练或推理中,并将不确定性量化视为模型的内在输出。其中,统一基准的缺乏尤其重要,它使得公平比较困难,阻碍了可重复性,并分裂了社区的努力。因此,我们提出了一个社区基准,涵盖断层分割、相对地质时间估计、地质体分割和属性建模。它整合了统一评估协议、预训练模型以及结合用于定量评估的合成数据和用于定性评估的真实测线的数据集。通过将跨越十年的综述与不断发展的基准相结合,这项工作为加速未来研究和部署提供了及时的参考和可重复的基础。
Subsurface imaging understanding bridges observed geophysical data and quantitative geological models, supporting hydrocarbon exploration, CO2 storage site assessment, and geohazard monitoring. Over the past decade, deep learning has substantially reshaped interpretation workflows. To take stock of this progress, we systematically analyze 652 publications from 2015 to 2025 and organize the field into four major tasks: structural interpretation, geobody identification, seismic facies analysis, and property estimation. Yet subsurface imaging interpretation differs fundamentally from other AI-driven tasks, facing ambiguous signals, pronounced interpretive non-uniqueness, sparse semantics, unfixed target locations, and scarce reliable annotations. Building on the reviewed literature, we summarize three interrelated challenges that define its frontier: interpretation under complex geological conditions, cross-survey semantic generalization under low information density, and the absence of reliable benchmarks. Addressing them will hinge on integrating human expertise, physical constraints, and geological priors into model training or inference, and on treating uncertainty quantification as an intrinsic model output. Among these, the lack of unified benchmarks has been particularly consequential, making fair comparison difficult, hindering reproducibility, and fragmenting community efforts. We therefore propose a community benchmark spanning fault segmentation, relative geologic time estimation, geobody segmentation, and property modeling. It integrates unified evaluation protocols, pretrained models, and datasets that combine synthetic data for quantitative evaluation with real surveys for qualitative assessment. By coupling a decade-spanning review with an evolving benchmark, this work offers a timely reference and a reproducible foundation to accelerate future research and deployment.