When Does Model Collapse Occur in Structured Interactive Learning?
在结构互动学习中模型崩溃何时发生?
Yuchen Wu, Kangjie Zhou, Weijie Su
AI总结 研究探讨了在结构互动学习环境中,生成模型性能下降(模型崩溃)的发生条件,通过分析交互图拓扑结构,推导出模型崩溃的必要和充分条件,并通过数值实验验证理论结果。
详情
- Comments
- 57 pages, 12 figures
生成式人工智能的普及催生了交互学习环境,其中模型参数通过自然过程生成的数据和由其他模型产生的合成输出不断更新。这种范式引入了两大挑战:(1)训练数据不再仅来自目标群体,破坏了经典统计学习的核心假设;(2)模型训练过程变得内在相关,因为模型通过反复接触彼此的合成输出进行交互,方式可能复杂。在这样的结构互动学习环境中建立可靠的统计推断仍然是一个重要开放问题。特别是,人们对模型崩溃现象日益关注,该现象是指生成模型在训练于早期模型生成的合成数据时性能逐步下降。先前关于模型崩溃的研究主要集中在单个模型训练其自身输出的情况,未能捕捉多模型交互环境中的模型性能。在本文中,我们填补了这一空白,通过研究具有通用交互模式的交互学习环境中的生成模型性能。特别是,我们利用有向图形式化模型交互,并证明模型崩溃的发生严重依赖于交互图的拓扑结构。我们进一步推导出一个显式的必要和充分条件,以表征模型崩溃何时发生,并为线性回归建立有限样本结果,为一般M估计量建立渐近保证。我们通过广泛的数值实验支持我们的理论发现。
The proliferation of generative artificial intelligence has given rise to an interactive learning environment, where model parameters are continuously updated using not only data generated by natural processes, but also synthetic outputs produced by other models. This paradigm introduces two major challenges: (1) training data are no longer drawn exclusively from the target population, undermining a core assumption of classical statistical learning, and (2) model training processes become inherently correlated, as models interact with one another through repeated exposure to each other's synthetic outputs in a potentially complex manner. Establishing reliable statistical inference in such structured interactive learning environments therefore remains an important open problem. In particular, there is growing concern about model collapse, a phenomenon in which the performance of generative models progressively degrades as they are trained on synthetic data produced by earlier model generations. Prior work on model collapse primarily focuses on a single model trained on its own output, failing to capture model performance in multi-model interactive settings. In this work, we fill this gap by investigating the performance of generative models in an interactive learning environment with general interaction patterns. In particular, we formalize model interactions using directed graphs and show that the occurrence of model collapse depends critically on the topology of the interaction graph. We further derive an explicit necessary and sufficient condition characterizing when model collapse occurs, and establish finite-sample results for linear regression and asymptotic guarantees for general M-estimators. We support our theoretical findings through extensive numerical experiments.