3DEditSafe: Defending 3D Editing Pipelines from Unsafe Generation
Nicole Meng, Zheyuan Liu, Meng Jiang, Yingjie Lao
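AI summary: This paper proposes 3DEditSafe, a framework that uses safety regularization to constrain the propagation of unsafe semantics, reducing unsafe content generation in 3D editing and revealing a tradeoff between safety and quality.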
Recent advances in 3D generative editing, particularly pipelines based on 3D Gaussian Splatting (3DGS), have achieved high-fidelity, multi-view-consistent scene manipulation from text prompts. However, we find that these pipelines also introduce new safety risks when unsafe prompts produce edits that are propagated and optimized across views. In this work, we study unsafe generation in 3D editing pipelines and show that such behavior can lead to coherent, undesirable Not-Safe-For-Work (NSFW) content in the final 3D representation. To address this, we propose 3DEditSafe, a safety-regularized 3D editing framework that constrains unsafe semantic propagation during optimization. 3DEditSafe combines generation-stage safety guidance with rendered-view 3D safety regularization, safe semantic projection, residue suppression, and mask-aware preservation to steer optimization away from unsafe editing directions. We evaluate our approach on EditSplat scenes using an object-compatible unsafe prompt benchmark and show that 2D safety guidance alone is not consistently sufficient to prevent unsafe 3D edits. 3DEditSafe reduces unsafe semantic alignment and view-level attack success rates, while revealing a safety-quality tradeoff in which stronger unsafe suppression can introduce artifacts or reduce unsafe-prompt fidelity. To our knowledge, this work is the first attempt to study and defend against unsafe generation in text-driven 3D editing pipelines, highlighting the need for safety mechanisms that operate directly on optimized 3D representations.
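The abstract names two ideas that a small example can make concrete: projecting an unsafe semantic direction out of an embedding, and measuring a view-level attack success rate. The sketch below is illustrative only and is not the authors' implementation. The function names `project_out` and `view_level_asr`, the single unsafe direction vector, the per-view unsafe scores, and the 0.5 threshold are all assumptions introduced here; the abstract does not specify how 3DEditSafe computes either quantity.

```python
from typing import List, Sequence


def project_out(embedding: Sequence[float], unsafe_dir: Sequence[float]) -> List[float]:
    """Remove the component of `embedding` that lies along `unsafe_dir`.

    Hypothetical stand-in for a "safe semantic projection": a plain
    orthogonal projection onto the complement of one unsafe direction.
    """
    norm_sq = sum(u * u for u in unsafe_dir)
    if norm_sq == 0.0:
        # A zero direction carries no information; return the input unchanged.
        return list(embedding)
    coeff = sum(e * u for e, u in zip(embedding, unsafe_dir)) / norm_sq
    return [e - coeff * u for e, u in zip(embedding, unsafe_dir)]


def view_level_asr(unsafe_scores: Sequence[float], threshold: float = 0.5) -> float:
    """Fraction of rendered views whose unsafe score meets the threshold.

    `unsafe_scores` is assumed to come from some per-view safety classifier;
    both the classifier and the threshold are assumptions of this sketch.
    """
    if not unsafe_scores:
        raise ValueError("at least one view score is required")
    flagged = sum(1 for s in unsafe_scores if s >= threshold)
    return flagged / len(unsafe_scores)


if __name__ == "__main__":
    safe_embedding = project_out([3.0, 4.0], [1.0, 0.0])
    print(safe_embedding)                          # [0.0, 4.0]
    print(view_level_asr([0.9, 0.2, 0.7, 0.1]))    # 0.5
```

Under these assumptions, a lower `view_level_asr` after editing would indicate that fewer rendered views of the edited scene are flagged as unsafe, which is the direction of improvement the abstract reports.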