Feature-Optimized Vision for Adaptive 3D Scene Reconstruction
面向自适应3D场景重建的特征优化视觉
Eric Liang
AI总结 提出一种自适应特征优化视觉前端,通过评分纹理、可重复性、独特性、预期三角化角度和空间覆盖来分配每视图特征预算,以最大化有效轨迹并降低重建RMSE。
详情
三维场景重建依赖于局部图像证据,这些证据既要在视觉上具有判别性,又要在几何上有用。固定的特征阈值和均匀的特征预算易于部署,但可能会在重复纹理、低视差区域或不稳定点上浪费计算。本文提出了一种用于3D重建的自适应特征优化视觉前端。该方法通过纹理、可重复性、独特性、预期三角化角度和空间覆盖对候选特征进行评分,然后在固定重建流程下分配每视图特征预算以最大化有效轨迹。一个小型合成多视图原型在走廊、立面、物体桌面和杂乱场景中评估了四种选择策略。与随机、仅纹理和均匀网格基线相比,自适应策略在保持广泛图像覆盖的同时,获得了最佳的质量感知完整性和最低的聚合重建RMSE。结果并非替代现代学习匹配或神经重建系统;它是一个模块化的前端策略,可以使经典和学习的3D流程更审慎地决定将计算花费在哪些视觉证据上。
Three-dimensional scene reconstruction depends on local image evidence that is both visually discriminative and geometrically useful. Fixed feature thresholds and uniform feature budgets are easy to deploy, but they can waste computation on repeated texture, low-parallax regions, or unstable points. This paper proposes an adaptive feature-optimized vision front end for 3D reconstruction. The method scores candidate features by texture, repeatability, distinctiveness, expected triangulation angle, and spatial coverage, then allocates a per-view feature budget to maximize useful tracks under a fixed reconstruction pipeline. A small synthetic multi-view prototype evaluates four selection policies across corridor, facade, object-table, and cluttered scenes. Compared with random, texture-only, and uniform-grid baselines, the adaptive policy obtains the best quality-aware completeness and the lowest aggregate reconstruction RMSE while preserving broad image coverage. The result is not a replacement for modern learned matching or neural reconstruction systems; it is a modular front-end policy that can make classical and learned 3D pipelines more deliberate about which visual evidence they spend compute on.