GaussianSpace: Text guide 3D Gaussian Splatting Large Spatial Manipulations

Shengyu Meng1, Ximing Zhong2, Fujia Yu3,

1 MicroFeel   2 University of Innsbruck   3 Aalto University   4 Tampere University  

Paper & Code is under construction.

Abstract (WIP)

Integrating 2D diffusion models with 3D Gaussian splatting for text-guided generation of individual 3D objects and editing Gaussian scenes are recently receiving increasing attention. However, current methods for single object generation normally require additional constraints, such as masks and normal maps, which limit their applicability in handling complex spaces. Furthermore, current techniques for text-guided 3D Gaussian editing typically rely on Iterative Dataset Update (IDU) methods based on the instruct nerf2nerf, which are also unsuitable for large spatial manipulations. In response to these challenges, we propose a new method, GaussianSpace, which enables effective text-guided editing of large space in 3D Gaussian Splatting. The key innovation of this method lies in its consideration of both RGB loss from the ground-true images and Score Distillation Sampling (SDS) loss based on the diffusion model during the iterative process. Additionally, we have introduced an automatic weighted loss technique to steadily descend the overall loss, ensuring that the edited Gaussian scene adapts to text instructions while retaining its original structural features. This marks the first successful implementation of text-guided 3D Gaussian large spatial manipulations.

Results (Preview)

Original Space
Clik to view interactive gaussian demo
Fantasy toys Museum
Clik to view interactive gaussian demo
Cyberpunk Arsenal
Clik to view interactive gaussian demo
Abandoned Biological Museum
Clik to view interactive gaussian demo

Source of Original Space: 3D gaussain scene trained from 502 frames extracted from video taken in Landeszeughaus Graz Museum by iphone 14.
Prompt of Fantasy toys Museum: "anime painting of fantasy toys museum in the style of Studio Ghibli, surreal, maximalism, trending on artstation, 8K, HD, ultra details"
Prompt of Cyberpunk Arsenal: "An robotic arsenal of future weapons in the style of neon cyberpunk, cgsociety, trending on artstation, 8K, HD"
Prompt of Abandoned Biological Museum: "An abandoned futurism biological museum reclaimed by tropical rainforest with beautiful flowers, surreal, trending on artstation, 8K, HD, ultra details"

Optimization Progress


Optimization process (Video in 100X acceleration): Resume from the trained 3D Gaussian scene of the original space (30000 iterations), and re-trained for 20000 iterations on a singel RTX 3090 GPU for one hour.

Citation

@article{Meng2024gaussianspace,
  title={GaussianSpace: Text guide 3D Gaussian Splatting Large Spatial Manipulations},
  author={Meng, Shengyu and Zhong, Ximing and Yu, Fujia},
  year={2024}
}