MTN (Multi-Scale Triplane Network)

This repository contains the official implementation of Progressive Text-to-3D Generation for Automatic 3D Prototyping (https://arxiv.org/abs/2309.14600).

Paper

Video results

video_1.mp4

video_2.mp4

For more videos, please refer to https://www.youtube.com/watch?v=LH6-wKg30FQ

Instructions:

Install the requirements:

pip install -r requirements.txt

To use DeepFloyd-IF, you need to accept the usage conditions on Hugging Face and log in with huggingface-cli login on the command line.

Start training!

# choose stable-diffusion version
python main.py --text "a hamburger" --workspace trial -O --sd_version 2.1

# use DeepFloyd-IF for guidance:

python main.py --text "a hamburger" --workspace trial -O --IF
python main.py --text "a hamburger" --workspace trial -O --IF --vram_O # requires ~24G GPU memory
python main.py -O --text "a tiger cub" --workspace trial_perpneg_if_tiger --iters 6000 --IF --batch_size 1 --perpneg
python main.py -O --text "a shiba dog wearing sunglasses" --workspace trial_perpneg_if_shiba --iters 6000 --IF --batch_size 1 --perpneg
python main.py -O --text "a octopus toy" --workspace trial_perpneg_if_octopus --iters 6000 --IF --batch_size 1 --perpneg

# a larger absolute value of negative_w is used for the following command because the default negative weight of -2 is not enough to make the diffusion model produce the desired views
python main.py -O --text "a shiba dog wearing sunglasses" --workspace trial_perpneg_if_shiba --iters 6000 --IF --batch_size 1 --perpneg --negative_w -3.0

# after the training is finished:
# test (exporting 360 degree video)
python main.py --workspace trial -O --test
# also save a mesh (with obj, mtl, and png texture)
python main.py --workspace trial -O --test --save_mesh
# test with a GUI (free view control!)
python main.py --workspace trial -O --test --gui
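The Perp-Neg training commands above differ only in the prompt, the workspace name, and the optional --negative_w value, so they can be generated from a small table. The following is a minimal sketch, not part of the repository: the helper name build_command and the RUNS list are hypothetical, and it uses only the flags that appear in the commands above.

```python
import shlex

# Prompt/workspace pairs copied from the training commands above.
RUNS = [
    ("a tiger cub", "trial_perpneg_if_tiger"),
    ("a shiba dog wearing sunglasses", "trial_perpneg_if_shiba"),
    ("a octopus toy", "trial_perpneg_if_octopus"),
]


def build_command(text, workspace, negative_w=None, iters=6000):
    """Return the argv list for one DeepFloyd-IF + Perp-Neg training run.

    Hypothetical helper: it only assembles flags shown in this README.
    """
    cmd = [
        "python", "main.py", "-O",
        "--text", text,
        "--workspace", workspace,
        "--iters", str(iters),
        "--IF", "--batch_size", "1", "--perpneg",
    ]
    # Only pass --negative_w when overriding the default weight.
    if negative_w is not None:
        cmd += ["--negative_w", str(negative_w)]
    return cmd


if __name__ == "__main__":
    # Print the commands instead of running them, so they can be reviewed first.
    for text, workspace in RUNS:
        print(shlex.join(build_command(text, workspace)))
```

To launch a run rather than print it, the returned list could be passed to subprocess.run(cmd, check=True) from the repository root.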

Tested environments

torch 1.13 & CUDA 11.5 on a V100.
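To compare a local setup against the tested one, a short check like the sketch below may help. It is an illustration only: the function names matches_tested and report are hypothetical, and other versions may well work even though they have not been tested here.

```python
def matches_tested(torch_version, cuda_version):
    """Return True if the versions match the tested torch 1.13 / CUDA 11.5."""
    # Drop any local build suffix such as "+cu117" before comparing.
    base = torch_version.split("+")[0]
    torch_ok = base.split(".")[:2] == ["1", "13"]
    # torch.version.cuda is None on CPU-only builds.
    cuda_ok = cuda_version is not None and cuda_version.split(".")[:2] == ["11", "5"]
    return torch_ok and cuda_ok


def report():
    """Describe the installed torch/CUDA versions relative to the tested ones."""
    try:
        import torch
    except ImportError:
        return "torch is not installed"
    tv, cv = str(torch.__version__), torch.version.cuda
    status = "matches" if matches_tested(tv, cv) else "differs from"
    return f"torch {tv}, CUDA {cv}: {status} the tested environment"


if __name__ == "__main__":
    print(report())
```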

Citation

If you find this work useful, please consider citing:

@article{yi2023progressive,
  title={Progressive Text-to-3D Generation for Automatic 3D Prototyping},
  author={Yi, Han and Zheng, Zhedong and Xu, Xiangyu and Chua, Tat-seng},
  journal={arXiv preprint arXiv:2309.14600},
  year={2023}
}

Acknowledgement

This code base is built upon the following awesome open-source projects: Stable DreamFusion and threestudio.

Thanks to the authors for their remarkable work!
