SceneDreamer360: Text-Driven 3D-Consistent Scene Generation with Panoramic Gaussian Splatting

Wenrui Li, Yapeng Mi, Fucheng Cai, Zhe Yang, Wangmeng Zuo, Xingtao Wang, Xiaopeng Fan

Introduction

SceneDreamer360 leverages a text-driven panoramic image generation model as a prior for 3D scene generation and employs 3D Gaussian Splatting (3DGS) to ensure consistency across multi-view panoramic images. Specifically, SceneDreamer360 enhances the fine-tuned Panfusion generator with a three-stage panoramic enhancement, enabling the generation of high-resolution, detail-rich panoramic images. During the 3D scene construction, a novel point cloud fusion initialization method is used, producing higher quality and spatially consistent point clouds. Our extensive experiments demonstrate that compared to other methods, SceneDreamer360 with its panoramic image generation and 3DGS can produce higher quality, spatially consistent, and visually appealing 3D scenes from any text prompt.

Visualization

Environment Setup

Follow these steps to set up the required environment:

conda env create -f environment_strict.yaml
conda activate scenedreamer360

pip install -r Enhance_img/requirements.txt

cd PanoSpaceDreamer/submodules/depth-diff-gaussian-rasterization-min
python setup.py install
cd ../simple-knn
python setup.py install
cd ../../..

Checkpoints

Download PanFusion checkpoints and move to logs/4142dlo4/checkpoints. Also, download the panoramic optimization model from Baidu Cloud, and extract models.zip into Enhance_img/.

Running

Modify the parameters in the config.json file to suit your test cases.

Update the text field to point to the prompt file you wish to test. campath_gen(default: fullscan) supports the following options: fullscan, layerscan, lowerscan, rotate360, lookaround, moveright, moveback, arc, lockdown, hemisphere. campath_render(default: 1440) supports the following options: 1440, 360, rotate360, headbanging, llff, back_and_forth. To run the test, use this command:

WANDB_MODE=offline WANDB_RUN_ID=4142dlo4 python main.py predict --data=Matterport3D --model=PanFusion --ckpt_path=last

Batch Processing

For batch testing, list the test prompts in the data/prompt.txt file (one prompt per line), and update the file name in the data/Matterport3D/mp3d_skybox/e9zR4mvMWw7/blip3_stitched directory to test.txt.

Run the following command to start batch processing:

python run.py

Results

The results of the runs will be saved in the logs/4142dlo4/predict directory.

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
Enhance_img		Enhance_img
PanFusion		PanFusion
PanoSpaceDreamer		PanoSpaceDreamer
data		data
logs/4142dlo4/checkpoints		logs/4142dlo4/checkpoints
.DS_Store		.DS_Store
README.md		README.md
config.json		config.json
image.png		image.png
main.py		main.py
multi_view_img.py		multi_view_img.py
run.py		run.py
visualization.png		visualization.png
visualization_sup.png		visualization_sup.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SceneDreamer360: Text-Driven 3D-Consistent Scene Generation with Panoramic Gaussian Splatting

Introduction

Visualization

Environment Setup

Checkpoints

Running

Batch Processing

Results

About

Releases

Packages

Contributors 3

Languages

liwrui/SceneDreamer360

Folders and files

Latest commit

History

Repository files navigation

SceneDreamer360: Text-Driven 3D-Consistent Scene Generation with Panoramic Gaussian Splatting

Introduction

Visualization

Environment Setup

Checkpoints

Running

Batch Processing

Results

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages