SimVP: Simpler yet Better Video Prediction

This repository contains the implementation code for paper:

SimVP: Simpler yet Better Video Prediction
Zhangyang Gao, Cheng Tan, Lirong Wu, Stan Z. Li. In CVPR, 2022.

Introduction

From CNN, RNN, to ViT, we have witnessed remarkable advancements in video prediction, incorporating auxiliary inputs, elaborate neural architectures, and sophisticated training strategies. We admire these progresses but are confused about the necessity: is there a simple method that can perform comparably well? This paper proposes SimVP, a simple video prediction model that is completely built upon CNN and trained by MSE loss in an end-to-end fashion. Without introducing any additional tricks and complicated strategies, we can achieve state-of-the-art performance on five benchmark datasets. Through extended experiments, we demonstrate that SimVP has strong generalization and extensibility on real-world datasets. The significant reduction of training cost makes it easier to scale to complex scenarios. We believe SimVP can serve as a solid baseline to stimulate the further development of video prediction.

Dependencies

torch
scikit-image=0.16.2
numpy
argparse
tqdm

Overview

API/ contains dataloaders and metrics.
main.py is the executable python file with possible arguments.
model.py contains the SimVP model.
exp.py is the core file for training, validating, and testing pipelines.

Install

This project has provided an environment setting file of conda, users can easily reproduce the environment by the following commands:

  conda env create -f environment.yml
  conda activate SimVP

Moving MNIST dataset

  cd ./data/moving_mnist
  bash download_mmnist.sh

TaxiBJ dataset

We provide a Google Drive link to download TaxiBJ dataset. Users can download this dataset and put it into ./data/taxibj.

Citation

If you are interested in our repository and our paper, please cite the following paper:

@InProceedings{Gao_2022_CVPR,
    author    = {Gao, Zhangyang and Tan, Cheng and Wu, Lirong and Li, Stan Z.},
    title     = {SimVP: Simpler Yet Better Video Prediction},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2022},
    pages     = {3170-3180}
}

Contact

If you have any questions, feel free to contact us through email ([email protected], [email protected]). Enjoy!

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
API		API
data/moving_mnist		data/moving_mnist
readme_figures		readme_figures
.gitignore		.gitignore
README.md		README.md
environment.yml		environment.yml
exp.py		exp.py
main.py		main.py
model.py		model.py
modules.py		modules.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SimVP: Simpler yet Better Video Prediction

Introduction

Dependencies

Overview

Install

Moving MNIST dataset

TaxiBJ dataset

Citation

Contact

About

Releases

Packages

Contributors 2

Languages

ryok/SimVP-Simpler-yet-Better-Video-Prediction

Folders and files

Latest commit

History

Repository files navigation

SimVP: Simpler yet Better Video Prediction

Introduction

Dependencies

Overview

Install

Moving MNIST dataset

TaxiBJ dataset

Citation

Contact

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages