GitHub - techthiyanes/VideoCrafter: A Toolkit for Text-to-Video Generation and Editing

VideoCrafter1: Open Diffusion Models for High-Quality Video Generation

🔥🔥 The VideoCrafter1 for high-quality video generation are now released! Please Join us and create your own film on Discord/Floor33.

Floor33 | Film

🔆 Introduction

🤗🤗🤗 VideoCrafter is an open-source video generation and editing toolbox for crafting video content.
It currently includes the Text2Video and Image2Video models:

1. Generic Text-to-video Generation

Click the GIF to access the high-resolution video.


"A girl is looking at the camera smiling. High Definition."	"an astronaut running away from a dust storm on the surface of the moon, the astronaut is running towards the camera, cinematic"


"A giant spaceship is landing on mars in the sunset. High Definition."	"A blue unicorn flying over a mystical land"

2. Generic Image-to-video Generation



"a black swan swims on the pond"	"a girl is riding a horse fast on grassland"	"a boy sits on a chair facing the sea"	"two galleons moving in the wind at sunset"

📝 Changelog

[2023.10.30]: Release VideoCrafter1 Technical Report!
[2023.10.19]: Release the 320x512 Text2Video Model, and HuggingFace demo.
[2023.10.13]: 🔥🔥 Release the VideoCrafter1, High Quality Video Generation!
[2023.08.14]: Release a new version of VideoCrafter on Discord/Floor33. Please join us to create your own film!
[2023.04.18]: Release a VideoControl model with most of the watermarks removed!
[2023.04.05]: Release pretrained Text-to-Video models, VideoLora models, and inference code.

⏳ Models

Models	Resolution	Checkpoints
Text2Video	576x1024	Hugging Face
Text2Video	320x512	Hugging Face
Image2Video	320x512	Hugging Face

⚙️ Setup

1. Install Environment via Anaconda (Recommended)

conda create -n videocrafter python=3.8.5
conda activate videocrafter
pip install -r requirements.txt

💫 Inference

1. Text-to-Video

Download pretrained T2V models via Hugging Face, and put the model.ckpt in checkpoints/base_1024_v1/model.ckpt.
Input the following commands in terminal.

  sh scripts/run_text2video.sh

2. Image-to-Video

Download pretrained I2V models via Hugging Face, and put the model.ckpt in checkpoints/i2v_512_v1/model.ckpt.
Input the following commands in terminal.

  sh scripts/run_image2video.sh

📋 Techinical Report

😉 Tech report: VideoCrafter1: Open Diffusion Models for High-Quality Video Generation

😉 Citation

The technical report is currently unavailable as it is still in preparation. You can cite the paper of our image-to-video model and related base model.

@misc{chen2023videocrafter1,
      title={VideoCrafter1: Open Diffusion Models for High-Quality Video Generation}, 
      author={Haoxin Chen and Menghan Xia and Yingqing He and Yong Zhang and Xiaodong Cun and Shaoshu Yang and Jinbo Xing and Yaofang Liu and Qifeng Chen and Xintao Wang and Chao Weng and Ying Shan},
      year={2023},
      eprint={2310.19512},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

@article{xing2023dynamicrafter,
      title={DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors}, 
      author={Jinbo Xing and Menghan Xia and Yong Zhang and Haoxin Chen and Xintao Wang and Tien-Tsin Wong and Ying Shan},
      year={2023},
      eprint={2310.12190},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

@article{he2022lvdm,
      title={Latent Video Diffusion Models for High-Fidelity Long Video Generation}, 
      author={Yingqing He and Tianyu Yang and Yong Zhang and Ying Shan and Qifeng Chen},
      year={2022},
      eprint={2211.13221},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

🤗 Acknowledgements

Our codebase builds on Stable Diffusion. Thanks the authors for sharing their awesome codebases!

📢 Disclaimer

We develop this repository for RESEARCH purposes, so it can only be used for personal/research/non-commercial purposes.

Name		Name	Last commit message	Last commit date
Latest commit History 100 Commits
assets		assets
configs		configs
lvdm		lvdm
prompts		prompts
scripts		scripts
utils		utils
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VideoCrafter1: Open Diffusion Models for High-Quality Video Generation

🔥🔥 The VideoCrafter1 for high-quality video generation are now released! Please Join us and create your own film on Discord/Floor33.

Floor33 | Film

🔆 Introduction

1. Generic Text-to-video Generation

2. Generic Image-to-video Generation

📝 Changelog

⏳ Models

⚙️ Setup

1. Install Environment via Anaconda (Recommended)

💫 Inference

1. Text-to-Video

2. Image-to-Video

📋 Techinical Report

😉 Citation

🤗 Acknowledgements

📢 Disclaimer

About

Releases

Packages

Languages

techthiyanes/VideoCrafter

Folders and files

Latest commit

History

Repository files navigation

VideoCrafter1: Open Diffusion Models for High-Quality Video Generation

🔥🔥 The VideoCrafter1 for high-quality video generation are now released! Please Join us and create your own film on Discord/Floor33.

Floor33 | Film

🔆 Introduction

1. Generic Text-to-video Generation

2. Generic Image-to-video Generation

📝 Changelog

⏳ Models

⚙️ Setup

1. Install Environment via Anaconda (Recommended)

💫 Inference

1. Text-to-Video

2. Image-to-Video

📋 Techinical Report

😉 Citation

🤗 Acknowledgements

📢 Disclaimer

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages