Genome-Factory: A Library for Tuning, Deploying, and Interpreting Genomic Foundation Models
发表机构 * Center for Foundation Models ; Generative AI \& Department of Computer Science, Northwestern University, USA ; School of Natural Sciences, University of California at Merced, USA ; Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, USA ; Systems Biology Division, Lawrence Berkeley National Laboratory, USA ; Department of Statistics ; Data Science, Northwestern University, USA
AI总结 本文介绍了 Genome-Factory,一个用于调优、部署和解释基因组基础模型的首个集成 Python 库。该库通过统一数据收集、模型调优、推理、基准测试和可解释性分析的流程,简化了基因组模型的开发工作。其核心贡献包括自动化数据预处理、支持多种模型调优方式、提供嵌入提取与序列生成功能,并引入基于稀疏自编码器的生物解释器,显著提升了基因组模型在实际分析中的实用价值。