Are All Frames Equal? Active Sparse Labeling for Video Action Detection

Video action detection requires annotations at every frame, which drastically increases the labeling cost. In this work, we focus on efficient labeling of videos for action detection to minimize this cost. We propose active sparse labeling (ASL), a novel active learning strategy for video action detection.

Project page

Visit the project page HERE for more details.

Description

This is an implementation for the NeurIPS 2022 paper titled: Are All Frames Equal? Active Sparse Labeling for Video Action Detection.

Pre-requisites

python >= 3.6
pytorch >= 1.6
numpy >= 1.19
scipy >= 1.5
opencv >= 3.4
scikit-image >= 0.17
scikit-learn >= 0.23
tensorboard >= 2.3

We developed our code base on Ubuntu 18.04 using anaconda3. We suggest to clone our anaconda environment using the following code:

$ conda create --name <env> --file spec-file.txt

Folder structure

The code expects UCF101 dataset in data/UCF101 folder (same format as direct download from source).

To use pretrained weights, please download charades pretrained i3d weights into weights folder from given link: https://github.com/piergiaj/pytorch-i3d/blob/master/models/rgb_charades.pt

The trained models will be saved under trained/active_learning/checkpoints_ucf101_capsules_i3d folder

The labels/annotations for ucf101 is saved as pickle files for easier processing.

Training step

To train, place the data and weights in appropriate folder. Then run as
python3 train_ucf101_capsules.py <percent>

APU step

This will use the APU algorithm to select frames and create new annotation pickle file. Run as:
python3 APU.py

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
data/UCF101/UCF101_Videos		data/UCF101/UCF101_Videos
trained/active_learning/checkpoints_ucf101_capsules_i3d		trained/active_learning/checkpoints_ucf101_capsules_i3d
weights		weights
APU.py		APU.py
LICENSE		LICENSE
README.md		README.md
capsules_ucf101.py		capsules_ucf101.py
cust_losses.py		cust_losses.py
load_ucf101_pytorch_gaus.py		load_ucf101_pytorch_gaus.py
pytorch_i3d.py		pytorch_i3d.py
spec-file.txt		spec-file.txt
testing_annots.pkl		testing_annots.pkl
train_ucf101_capsules.py		train_ucf101_capsules.py
training_annots.pkl		training_annots.pkl
utils_caps.py		utils_caps.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Are All Frames Equal? Active Sparse Labeling for Video Action Detection

Project page

Description

Pre-requisites

Folder structure

Training step

APU step

About

Releases

Packages

Languages

License

aayushjr/ASL-video

Folders and files

Latest commit

History

Repository files navigation

Are All Frames Equal? Active Sparse Labeling for Video Action Detection

Project page

Description

Pre-requisites

Folder structure

Training step

APU step

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages