AI中文摘要
数据驱动建模正成为多相输运、电子冷却、声学诊断和热流体数字孪生的核心,但进展受到数据集碎片化和原始仪器文件难以解码、重用或基准测试的限制。本文介绍了由纳米能源与数据驱动发现(NED3)实验室开发的开放多模态数据集和开源软件包生态系统,用于可复现的AI赋能热流体研究。我们提出了一个空间加时间维度框架,记为S+TD,用于按测量或模拟场的维度对数据集进行分类,包括0+0D点值、0+1D时间序列、1+0D剖面、2+0D图像、2+1D视频、3+0D体积场以及多模态组合。我们整理了公开的NED3数据集,涵盖沸腾图像、声学和热测量、高速视频、红外热成像、热阻测量、CFD生成场、设计文件和声发射数据。我们还描述了配套的软件包,包括BubbleID、SeqReg、CFDTwin、IRISApp、decode-wfs、AELab和FlowLab,这些软件支持计算机视觉、序列回归、代理建模、红外分析、波形解码、声发射分析和多模态诊断。特别强调了SeqReg,这是一个用于0+1D、1+1D和2+1D数据的通用序列回归库,应用包括非侵入式热通量估计。最后,我们讨论了未来社区努力构建可互操作的热流体数据库和精选的AI/ML工具库,以连接数据集、元数据、解码器、基线、基准和物理可解释模型。
英文摘要
Data-driven modeling is becoming central to multiphase transport, electronics cooling, acoustic diagnostics, and thermal-fluid digital twins, but progress is limited by fragmented datasets and raw instrument files that are difficult to decode, reuse, or benchmark. This paper presents an open ecosystem of multimodal datasets and open-source software packages developed by the Nano Energy and Data-Driven Discovery (NED3) Laboratory for reproducible AI-enabled thermal-fluid research. We introduce a spatial-plus-temporal dimensionality framework, denoted S+TD, to classify datasets by the dimensionality of measured or simulated fields, including 0+0D point values, 0+1D time series, 1+0D profiles, 2+0D images, 2+1D videos, 3+0D volumetric fields, and multimodal combinations. We organize public NED3 datasets spanning boiling images, acoustic and thermal measurements, high-speed videos, infrared thermography, thermal-resistance measurements, CFD-generated fields, design files, and acoustic-emission data. We also describe complementary software packages, including BubbleID, SeqReg, CFDTwin, IRISApp, decode-wfs, AELab, and FlowLab, which support computer vision, sequence regression, surrogate modeling, infrared analysis, waveform decoding, acoustic-emission analysis, and multimodal diagnostics. Particular emphasis is placed on SeqReg, a general sequence-regression library for 0+1D, 1+1D, and 2+1D data, with applications such as nonintrusive heat-flux estimation. Finally, we discuss future community efforts to build interoperable thermal-fluid databanks and curated AI/ML tool libraries that connect datasets, metadata, decoders, baselines, benchmarks, and physically interpretable models.