arXivDaily arXiv每日学术速递 周一至周五更新

视觉与机器人

3D 视觉

三维重建、NeRF、Gaussian Splatting、点云和空间智能。

今日/当前日期收录 3 信号源:cs.CV, cs.GR, cs.RO
2606.18588 2026-06-18 cs.DC cs.CV 新提交 专题 95

Splaxel: Efficient Distributed Training of 3D Gaussian Splatting for Large-scale Scene Reconstruction via Pixel-level Communication

Splaxel:通过像素级通信实现大规模场景重建的高效分布式3D高斯泼溅训练

Wenqi Jia, Zhewen Hu, Ying Huang, Yu Gong, Stavros Kalafatis, Yuke Wang, Wei Niu, Chengming Zhang, Ang Li, Sheng Di, Yuede Ji, Bo Fang, Miao Yin

发表机构 * Independent Researcher(独立研究者) Rice University(里士满大学) University of Georgia(佐治亚大学) University of Houston(休斯顿大学) University of Washington(华盛顿大学) Argonne National Labs(阿贡国家实验室)

专题命中 Gaussian Splatting :Splaxel框架高效分布式训练3DGS

AI总结 提出Splaxel框架,通过像素级局部渲染与全局组合替代高斯同步,在保持数学一致性的同时稳定通信开销,结合可见性预测和冲突消除策略,实现大规模3DGS分布式训练加速7.6倍。

Comments 17 pages, 25 figures

详情
AI中文摘要

3D高斯泼溅(3DGS)能够实现高保真、实时的3D场景重建,但将训练扩展到大规模场景需要跨多个GPU优化数亿个高斯体。现有的分布式方法要么将场景划分为孤立区域,导致全局不一致,要么依赖全局高斯级交换,导致GPU间通信量大幅增长并迅速主导迭代时间。我们提出Splaxel,一种基于像素级局部渲染和全局组合的通信高效分布式3DGS训练框架。每个GPU渲染其局部子集并仅交换部分像素值,而非同步高斯体,从而在保持数学一致性的同时,使通信成本随场景规模增长保持稳定。Splaxel通过几何和透射率可见性预测进一步减少像素级冗余,并通过无冲突的相机视图整合提高GPU利用率。在包含多达1.2亿个高斯体的大规模数据集上评估,Splaxel相比最先进的分布式3DGS框架实现了高达7.6倍的加速,同时保持高重建质量。

英文摘要

3D Gaussian Splatting (3DGS) enables high-fidelity and real-time 3D scene reconstruction, but scaling training to large-scale scenes requires optimizing hundreds of millions of Gaussians across multiple GPUs. Existing distributed approaches either partition scenes into isolated regions, causing global inconsistency, or rely on global Gaussian-level exchanges, which lead to substantial growth in inter-GPU communication and quickly dominate iteration time. We propose Splaxel, a communication-efficient distributed 3DGS training framework based on pixel-level local rendering and global composition. Instead of synchronizing Gaussians, each GPU renders its local subset and exchanges only partial pixel values, maintaining mathematical consistency while keeping communication cost stable as the scene size increases. Splaxel further reduces pixel-level redundancy through geometric and transmittance visibility prediction and improves GPU utilization via conflict-free camera-view consolidation. Evaluated on large-scale datasets with up to 120M Gaussians, Splaxel achieves up to 7.6$\times$ speedup over the state-of-the-art distributed 3DGS framework while preserving high reconstruction quality.

2606.18623 2026-06-18 cs.CV eess.IV 新提交 专题 85

Intrinsic 4D Gaussian Segmentation from Scene Cues

内在4D高斯分割:基于场景线索

Hasan Yazar, Mohamed Rayan Barhdadi, Erchin Serpedin, Mehmet Tuncel, Hasan Kurban

发表机构 * Istanbul Technical University(伊斯坦布尔理工大学) Texas A&M University(德克萨斯农工大学) Hamad Bin Khalifa University(哈马德·本·哈利法大学)

专题命中 Gaussian Splatting :无需训练和掩码的4D高斯分割方法

AI总结 提出Intrinsic-GS方法,无需训练和掩码,通过构建高斯原语的亲和图并利用社区检测实现4D场景分割,在Neu3D和HyperNeRF上达到与掩码监督方法相当的精度,且速度提升12.5倍。

Comments 15 pages, 4 figures, 7 tables. Includes supplementary material. Preprint

详情
AI中文摘要

动态4D高斯泼溅以高保真度重建变形场景,并越来越多地被用作动态3D场景的表示。要利用此类场景进行编辑、操作或运动分析,首先需要对其进行分割:将高斯原语分组为连贯的对象。当前流程通过从基础模型(如SAM)导入2D掩码,并将其提升或蒸馏到高斯表示中来获得这种分组。在动态场景中,这些掩码必须在多个帧和视角中生成,成本高昂,并且所得分割可能强烈依赖于这些外部掩码的质量和一致性。我们探究能否从高斯本身恢复更多的对象级结构,并提出Intrinsic-GS,一种无需训练、无需掩码的方法,该方法根据外观、方向、尺度、变形轨迹和非学习渲染边界线索,在高斯原语上构建稀疏亲和图。该图通过Leiden社区检测进行划分,无需基础模型,也无需学习特征场。在标准的4D高斯分割基准Neu3D和HyperNeRF上,Intrinsic-GS在没有掩码监督的情况下恢复了大量的对象结构,在Neu3D上达到0.746 mIoU,在HyperNeRF上达到0.575;在Neu3D上,仅几何变体达到0.902 mIoU,与SAM监督的TRASE相当。在HyperNeRF上,Intrinsic-GS的运行速度比掩码监督流程中使用的掩码生成和特征渲染阶段快12.5倍。这些结果表明,大部分分割信号已经编码在高斯本身中,为3D和4D高斯分割提供了一种快速、无需掩码的方向,也可能指向在外部掩码不可靠或昂贵的情况下更可泛化、更鲁棒的分割。

英文摘要

Dynamic 4D Gaussian Splatting reconstructs deforming scenes with high fidelity and is increasingly adopted as a representation for dynamic 3D scenes. Putting such a scene to use, for editing, manipulation or motion analysis, first requires segmenting it: grouping the Gaussian primitives into coherent objects. Current pipelines obtain this grouping by importing 2D masks from foundation models such as SAM and lifting or distilling them into the Gaussian representation. In dynamic scenes these masks must be generated across many frames and views, which is costly, and the resulting segmentation can depend strongly on the quality and consistency of those external masks. We ask how much object-level structure can instead be recovered from the Gaussians themselves, and propose Intrinsic-GS, a training-free, mask-free method that builds a sparse affinity graph over Gaussian primitives from appearance, orientation, scale, deformation-trajectory and non-learned rendered-boundary cues. The graph is partitioned with Leiden community detection, requiring no foundation model and no learned feature field. On the standard 4D Gaussian segmentation benchmarks, Neu3D and HyperNeRF, Intrinsic-GS recovers substantial object structure without mask supervision, reaching 0.746 mIoU on Neu3D and 0.575 on HyperNeRF; on Neu3D, a geometry-only variant reaches 0.902 mIoU, matching SAM-supervised TRASE. On HyperNeRF, Intrinsic-GS runs 12.5x faster than the mask-generation and feature-rendering stages used by mask-supervised pipelines. These results suggest that much of the segmentation signal is already encoded in the Gaussians themselves, offering a fast, mask-free direction for 3D and 4D Gaussian segmentation that may also point toward more generalizable, robust segmentation in settings where external masks are unreliable or expensive.

2606.18734 2026-06-18 eess.SP cs.LG 新提交 专题 80

Point-Cloud-Assistant Localized Statistical Channel Prediction by Tangent Gaussian Splatting

点云辅助的切线高斯溅射局部统计信道预测

Ye Xue, Yiheng Wang, Xinhua Shao, Qi Yan, Shutao Zhang, Tsung-Hui Chang

发表机构 * China Telecom(中国电信)

专题命中 Gaussian Splatting :使用高斯溅射进行信道预测

AI总结 提出点云辅助切线高斯溅射(PC-TGS)框架,通过融合稀疏无线电测量与密集LiDAR几何数据,将角功率谱外推到未测量网格,实现大规模无线数字孪生中的高效信道预测。

详情
AI中文摘要

准确、特定地点的信道信息对于优化下一代无线网络至关重要。在各种方法中,局部统计信道建模(LSCM)通过从参考信号接收功率(RSRP)测量中建模信道多径角功率谱(APS),已成为一种针对高效网络优化的最先进方法。然而,尽管其有效性,LSCM无法在绝大多数没有测量值的位置预测APS,这严重限制了其在大规模真实场景中的适用性。为了解决这一挑战,我们提出了\emph{点云辅助切线高斯溅射}(PC-TGS),这是第一个通过将稀疏无线电测量与密集的基于LiDAR的几何信息相结合,将APS\emph{外推}到未测量室外网格的框架。PC-TGS将环境散射体表示为各向异性的3D高斯分布,通过原始点云的松弛均值重新参数化进行初始化和细化。切线平面投影将每个高斯分布精确映射到局部角度域,而深度感知的电磁溅射过程聚合它们的贡献。为了确保实际部署,我们推导了用于APS bin积分的闭式高斯加权平均(GWA),并提供了可证明的误差界。在LiDAR扫描的城市规模数据集(500万个点,6310个RSRP样本)上的评估表明,与最先进的基线相比,PC-TGS在APS和RSRP预测性能上更优,并且在外推APS任务中推理时间更快。这些结果突显了PC-TGS在大规模无线数字孪生中实现几何感知和数据高效信道预测的潜力。

英文摘要

Accurate, site-specific channel information is crucial for optimizing next-generation wireless networks. Among various approaches, localized statistical channel modeling (LSCM), which models the channel multipath angular power spectrum (APS) from the reference signal received power (RSRP) measurement, has emerged as a state-of-the-art method tailored for efficient network optimization. However, despite its effectiveness, LSCM cannot predict APS at the vast majority of locations where no measurements are available, which significantly restricts its applicability in large-scale, real-world scenarios. To address this challenge, we present \emph{point-cloud-assisted tangent Gaussian splatting} (PC-TGS), the first framework to \emph{extrapolate} APS to unmeasured outdoor grids by integrating sparse radio measurements with dense LiDAR-based geometry. PC-TGS represents environmental scatterers as anisotropic 3D Gaussians, initialized and refined through a relaxed-mean reparameterization of the raw point cloud. A tangent-plane projection accurately maps each Gaussian into the local angular domain, while a depth-aware electromagnetic splatting process aggregates their contributions. To ensure practical deployment, we derive a closed-form Gaussian-weighted average (GWA) for APS bin integration and provide a provable error bound. { Evaluations on a LiDAR-scanned city-scale dataset (5M points, 6,310 RSRP samples) demonstrate that PC-TGS achieves better APS and RSRP prediction performance compared to state-of-the-art baselines and faster inference time for APS extrapolation task. These results highlight the potential of PC-TGS to enable geometry-aware and data-efficient channel prediction in large-scale wireless digital twins.