CNR Institutional Research Information System

Reconstructing large-scale outdoor environments is essential for advancing XR applications but is hindered by the high cost and limitations of traditional methods like LiDAR, depth sensors, and photogrammetry. We propose generative neural architectures to address these issues. Our initial Spatio-Temporal Diffusion model combines temporal image sequences and coarse spatial data with a novel SDF_MIP representation for efficient training. Building on this, we introduce Neural-Clipmap, a scalable framework using an enhanced octree structure and Triplane representations to refine 3D reconstructions iteratively. Additionally, we leverage monocular RGB image sequences with 2D diffusion priors via Score Distillation Sampling (SDS) to reconstruct missing data, addressing challenges like initialization coherence and color accuracy through a multi-phase inpainting process. These approaches reduce resource requirements while enabling efficient, high-quality reconstructions.

Beyond human imagination: the art of creating prompt-driven 3D scenes with Generative AI

Federico G.;Carrara F.;Amato G.;Di Benedetto M.

2024

Abstract

Reconstructing large-scale outdoor environments is essential for advancing XR applications but is hindered by the high cost and limitations of traditional methods like LiDAR, depth sensors, and photogrammetry. We propose generative neural architectures to address these issues. Our initial Spatio-Temporal Diffusion model combines temporal image sequences and coarse spatial data with a novel SDF_MIP representation for efficient training. Building on this, we introduce Neural-Clipmap, a scalable framework using an enhanced octree structure and Triplane representations to refine 3D reconstructions iteratively. Additionally, we leverage monocular RGB image sequences with 2D diffusion priors via Score Distillation Sampling (SDS) to reconstruct missing data, addressing challenges like initialization coherence and color accuracy through a multi-phase inpainting process. These approaches reduce resource requirements while enabling efficient, high-quality reconstructions.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2024
			
	Strutture organizzative
	
				Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
			
	Codice ISBN
	
				9789513887964
			
	Parole chiave
	
				Generative AI, Computer Graphics, Denoising Diffusion Probabilistic Model, Gaussian  Splatting, NeRF, Signed Distance Field, Video Reconstruction, Deep Learning, Machine  Learning, Artificial Intelligence, Text-to-3D, Image-to-3D, Urban Environment, Score  Distillation Sampling
			
	Appare nelle tipologie:
	
				04.01 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
published.pdf accesso aperto Descrizione: Beyond human imagination: The art of creating prompt-driven 3D scenes with Generative AI Tipologia: Versione Editoriale (PDF) Licenza: Altro tipo di licenza Dimensione 909.91 kB Formato Adobe PDF Visualizza/Apri	909.91 kB	Adobe PDF	Visualizza/Apri
POSTER_EUROXR.pdf accesso aperto Descrizione: Beyond human imagination: The art of creating prompt-driven 3D scenes with Generative AI Tipologia: Altro materiale allegato Licenza: Altro tipo di licenza Dimensione 6.61 MB Formato Adobe PDF Visualizza/Apri	6.61 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/514771

Citazioni

ND

ND

ND

social impact