This study evaluates the performance of three image-based 3D reconstruction methods—Structure-from-Motion with Multi-View Stereo (SfM-MVS), Neural Radiance Fields (NeRF), and Gaussian Splatting (GS)—for documenting Architectural Heritage (AH). Data acquisition was conducted using low-cost sensors: a drone to capture the stone portal of the external façade and a 360° camera to document the interior spaces. NeRF and GS significantly outperformed SfM-MVS in processing time, with NeRF excelling in reconstruction completeness. However, GS faced challenges with point number control, and NeRF reconstructions exhibited artifacts and noise, particularly on flat surfaces. Accuracy assessments, using TLS point clouds as a benchmark, revealed that SfM-MVS provided the highest geometric precision for the external façade, despite minor gaps in reconstruction. In contrast, NeRF and GS fell short of the accuracy required for precise geometric documentation, with NeRF exhibiting prominent artifacts in flat or poorly detailed regions. Interior reconstructions were further limited by the higher Ground Sampling Distance (GSD) caused by the technical constraints of the 360° camera and the increased capture distance for elevated areas. In conclusion, we can affirm that while NeRF and GS demonstrate strong potential for visualization due to their rendering quality and efficiency, SfM-MVS remains the most reliable method for achieving accurate geometric documentation of AH.
3D representation of Architectural Heritage: a comparative analysis of NeRF, Gaussian Splatting, and SfM-MVS reconstructions using low-cost sensors / Clini, Paolo; Nespeca, Romina; Angeloni, Renato; Coppetta, Laura. - In: INTERNATIONAL ARCHIVES OF THE PHOTOGRAMMETRY, REMOTE SENSING AND SPATIAL INFORMATION SCIENCES. - ISSN 2194-9034. - XLVIII-2/W8-2024:(2024), pp. 93-99. [10.5194/isprs-archives-xlviii-2-w8-2024-93-2024]
3D representation of Architectural Heritage: a comparative analysis of NeRF, Gaussian Splatting, and SfM-MVS reconstructions using low-cost sensors
Clini, Paolo;Nespeca, Romina
;Angeloni, Renato;Coppetta, Laura
2024-01-01
Abstract
This study evaluates the performance of three image-based 3D reconstruction methods—Structure-from-Motion with Multi-View Stereo (SfM-MVS), Neural Radiance Fields (NeRF), and Gaussian Splatting (GS)—for documenting Architectural Heritage (AH). Data acquisition was conducted using low-cost sensors: a drone to capture the stone portal of the external façade and a 360° camera to document the interior spaces. NeRF and GS significantly outperformed SfM-MVS in processing time, with NeRF excelling in reconstruction completeness. However, GS faced challenges with point number control, and NeRF reconstructions exhibited artifacts and noise, particularly on flat surfaces. Accuracy assessments, using TLS point clouds as a benchmark, revealed that SfM-MVS provided the highest geometric precision for the external façade, despite minor gaps in reconstruction. In contrast, NeRF and GS fell short of the accuracy required for precise geometric documentation, with NeRF exhibiting prominent artifacts in flat or poorly detailed regions. Interior reconstructions were further limited by the higher Ground Sampling Distance (GSD) caused by the technical constraints of the 360° camera and the increased capture distance for elevated areas. In conclusion, we can affirm that while NeRF and GS demonstrate strong potential for visualization due to their rendering quality and efficiency, SfM-MVS remains the most reliable method for achieving accurate geometric documentation of AH.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.