Recent advances in Novel-View Synthesis (NVS) and 3D Generation (3DGen) from 2D images have marked significant progress in various domains. While the Structure-from-Motion (SfM) and Multi-View Stereo (MVS) pipelines remain prevalent, their limitations have driven the exploration of Deep Learning (DL)-based methods. Among these, Neural Radiance Fields (NeRFs) stand out for their exceptional capabilities in novel view synthesis and 3D reconstruction. However, their reliance on large, diverse 2D images for training, which capture the same scene from different perspectives, poses challenges. To address these challenges, our research proposes a module that introduces innovative data-centric strategies to improve the fidelity of novel view synthesis and reconstruction of NeRFs. In particular, the adopted strategy relies on depth priors, RGB masks, geometrical warping, and deep learning-based image restoration to improve the training and performance of NeRF models, following a human-in-the-loop approach. This module paves the way for a novel data-centric and DL-driven, to improve performances in NeRFs, which is adaptable across different NeRF architectures. Through a comprehensive quantitative-qualitative analysis of such a framework, on a challenging NeRF benchmark dataset, we demonstrate the effectiveness and versatility of our approach.

A Data-Centric Module for Neural Rendering / Balloni, E., Stacchio, L., Gorgoglione, L., Paolanti, M., Pierdicca, R., Mancini, A., Frontoni, E., Zingaretti, P.. - 15628:(2025), pp. 312-329. (Workshops that were held in conjunction with the 18th European Conference on Computer Vision, ECCV 2024 ita 2024) [10.1007/978-3-031-91572-7_19].

A Data-Centric Module for Neural Rendering

Balloni, Emanuele
;
Gorgoglione, Lucrezia;Paolanti, Marina;Pierdicca, Roberto;Mancini, Adriano;Frontoni, Emanuele;Zingaretti, Primo
2025-01-01

Abstract

Recent advances in Novel-View Synthesis (NVS) and 3D Generation (3DGen) from 2D images have marked significant progress in various domains. While the Structure-from-Motion (SfM) and Multi-View Stereo (MVS) pipelines remain prevalent, their limitations have driven the exploration of Deep Learning (DL)-based methods. Among these, Neural Radiance Fields (NeRFs) stand out for their exceptional capabilities in novel view synthesis and 3D reconstruction. However, their reliance on large, diverse 2D images for training, which capture the same scene from different perspectives, poses challenges. To address these challenges, our research proposes a module that introduces innovative data-centric strategies to improve the fidelity of novel view synthesis and reconstruction of NeRFs. In particular, the adopted strategy relies on depth priors, RGB masks, geometrical warping, and deep learning-based image restoration to improve the training and performance of NeRF models, following a human-in-the-loop approach. This module paves the way for a novel data-centric and DL-driven, to improve performances in NeRFs, which is adaptable across different NeRF architectures. Through a comprehensive quantitative-qualitative analysis of such a framework, on a challenging NeRF benchmark dataset, we demonstrate the effectiveness and versatility of our approach.
2025
9783031915710
9783031915727
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11566/358172
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact