MLP Splatting: Object-Centric Neural Fields
MLP Splatting: 以对象为中心的神经场
Shinjeong Kim, Yuzhou Cheng, Xin Kong, Paul H. J. Kelly, Andrew J. Davison
AI总结 提出MLP-Splatting方法,通过少量紧凑MLP原语实现场景分解和新视角合成,支持对象级编辑且内存和渲染效率优于现有方法。
详情
3D表示对于场景渲染、理解和交互至关重要。最近的方法,如3D高斯泼溅和神经辐射场,实现了令人印象深刻的光照真实感新视角合成,但缺乏将场景元素轻松分解为少数原语的能力,需要额外的分割或分组才能进行对象级操作。我们提出了MLP-Splatting,一种通过少量富有表现力的光场原语实现场景分解,同时提供光照真实感新视角合成的方法。MLP-Splatting将每个原语建模为一个独立的紧凑MLP,具有局部空间支持,预测辐射度和不透明度。与低级高斯原语或单个全局辐射场相比,我们的神经原语提供了更大的表达能力,同时保持空间局部性。通过高效的光线-原语交互稀疏体积合成进行渲染。我们的原语仅使用RGB监督进行训练,这产生了代表局部场景区域(通常对应于对象或对象部分)的原语,通过选择少量原语即可实现无需分割掩码的交互式对象级编辑。我们的方法辅以可选的语义特征蒸馏,支持开放词汇场景交互和开放集实例分割。与最先进的方法相比,我们在实验中表明,与语义3DGS方法相比,我们实现了显著更低的内存使用(1/15倍)和更快的渲染(3倍)。项目页面:此https URL
3D representations are fundamental to scene rendering, understanding, and interaction. Recent approaches, such as 3D Gaussian Splatting and Neural Radiance Fields, achieve impressive photorealistic novel-view synthesis, but lack the ability to easily decompose scene elements into a few primitives, requiring additional segmentation or grouping for object-level manipulation. We present MLP-Splatting, a method that enables scene decomposition via a few expressive light-field primitives while providing photorealistic novel-view synthesis. MLP-Splatting models each primitive as an independent compact MLP with localized spatial support that predicts radiance and opacity. In contrast to low-level Gaussian primitives or a single global radiance field, our neural primitives provide greater expressive capacity while remaining spatially localized. Rendering is performed through efficient sparse volumetric compositing over ray-primitive interactions. Our primitives are supervised using RGB supervision alone, which yields primitives that represent local scene regions often corresponding to objects or object parts, enabling interactive object-level editing without segmentation masks by selecting a handful of primitives. Our method, augmented with optional semantic feature distillation, enables open-vocabulary scene interaction and open-set instant segmentation. Compared to state-of-the-art methods, we achieve substantially lower memory usage (1/15$\times$) and faster rendering (3$\times$), as we show in our experiments compared to semantic 3DGS methods. Project Page: https://shinjeongkim.com/mlp-splatting