[ECCV 2024] STAG4D: Spatial-Temporal Anchored Generative 4D Gaussians

Yifei Zeng¹ Yanqin Jiang*² Siyu Zhu³ Yuanxun Lu¹ Youtian Lin¹ Hao Zhu¹ Weiming Hu² Xun Cao¹ Yao Yao¹⁺

¹Nanjing University ²CASIA ³Fudan University

^*equal contribution ⁺corresponding author

[Project Page] •

Update

7.4 Our paper has been accepted by ECCV 2024. Congrats!

6.20: IMPORTANT. Fix the bug caused by new version of diff_gauss. Newest version of diff_gauss use color, depth, norm, alpha, radii, extra as an output. However, previous version use color, depth, alpha, radii as an output. Using older version of this code will cause mismatch error and may misuse normal for the alpha loss, resulting in bad results.

5.26: Update Text/Image to 4D data below.

5.21: Fix RGB loss into the batch loop. Add visualize code.

⚙️ Installation

pip install -r requirements.txt

git clone --recursive https://github.com/slothfulxtx/diff-gaussian-rasterization.git
pip install ./diff-gaussian-rasterization

pip install ./simple-knn

Video-to-4D

To generate the examples in the project page, you can download the dataset from google drive. Place them in the dataset folder, and run:

python main.py --config configs/stag4d.yaml path=dataset/minions save_path=minions

#use --gui=True to turn on the visualizer (recommend)
python main.py --config configs/stag4d.yaml path=dataset/minions save_path=minions gui=True

To generate the spatial-temporal consistent data from stratch, your should place your rgba data in the form of

├── dataset
│   | your_data 
│     ├── 0_rgba.png
│     ├── 1_rgba.png
│     ├── 2_rgba.png
│     ├── ...

and then run

python scripts/gen_mv.py --path dataset/your_data --pipeline_path xxx/guidance/zero123pp

python main.py --config configs/stag4d.yaml path=data_path save_path=saving_path gui=True

To visualize the result, use you can replace the main.py with visualize.py, and the result will be saved to the valid/xxx path, e.g.:

python visualize.py --config configs/stag4d.yaml path=dataset/minions save_path=minions

Text-to-4D

For Text to 4D generation, we recommend using SDXL and SVD to generate a reasonable video. Then, after matting the video, use the command above to generate a good 4D result. (This pipeline contains many independent parts and is kind of complex, so we may upload the whole workflow after integration if possible.)

If you want generate the examples in the paper, I also updated the corresponding data here in google drive. Remember to set size to 26 in config or use size=26 in the command:

python main.py --config configs/stag4d.yaml path=dataset/xxx save_path=xxx size=26

Tips for better quality

If you want sacrifice time for better quality, here is some tips you can try to further improve the generated quality.

1, Use larger batch size.

2, Run for more steps.

Citation

If you find our work useful for your research, please consider citing our paper as well as Consistent4D:

@article{zeng2024stag4d,
      title={STAG4D: Spatial-Temporal Anchored Generative 4D Gaussians}, 
      author={Yifei Zeng and Yanqin Jiang and Siyu Zhu and Yuanxun Lu and Youtian Lin and Hao Zhu and Weiming Hu and Xun Cao and Yao Yao},
      year={2024},
      eprint={2403.14939},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

@article{jiang2023consistent4d,
      title={Consistent4D: Consistent 360{\deg} Dynamic Object Generation from Monocular Video}, 
      author={Yanqin Jiang and Li Zhang and Jin Gao and Weimin Hu and Yao Yao},
      year={2023},
      eprint={2311.02848},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Acknowledgment

This repo is built on DreamGaussian and Zero123plus. Thank all the authors for their great work.

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
assets		assets
configs		configs
dataset/minions		dataset/minions
guidance		guidance
scripts		scripts
simple-knn		simple-knn
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
cam_utils.py		cam_utils.py
dataset_4d.py		dataset_4d.py
deform.py		deform.py
gs_renderer_4d.py		gs_renderer_4d.py
main.py		main.py
mini_trainer.ipynb		mini_trainer.ipynb
requirements.txt		requirements.txt
sh_utils.py		sh_utils.py
visualize.py		visualize.py
zero123.py		zero123.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

[ECCV 2024] STAG4D: Spatial-Temporal Anchored Generative 4D Gaussians

[Project Page] •

Update

⚙️ Installation

Video-to-4D

Text-to-4D

Tips for better quality

Citation

Acknowledgment

About

Releases

Packages

Languages

zeng-yifei/STAG4D

Folders and files

Latest commit

History

Repository files navigation

[ECCV 2024] STAG4D: Spatial-Temporal Anchored Generative 4D Gaussians

[Project Page] •

Update

⚙️ Installation

Video-to-4D

Text-to-4D

Tips for better quality

Citation

Acknowledgment

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages