Skip to content

Latest commit

 

History

History
33 lines (24 loc) · 1.55 KB

README.md

File metadata and controls

33 lines (24 loc) · 1.55 KB

Spatio-Temporal Action Detection with Occlusion

Link: Spatio-Temporal Action Detection with Occlusion.


Overview

architecture

STADO can be decomposed into one module and three branches:

  1. Mask-Guided Attention Module

    Produces a spatial attention mask to modulate features generated by the backbone to focus on non-occlusion patterns.

  2. Multi-Task Branches

    (2.1) Center Branch for center localization and action recognition.

    (2.2) Movement Branch for movement estimation at adjacent frames to form moving point trajectories.

    (2.3) Box Branch for spatial extent detection by directly regressing bounding box size at the estimated center point of each frame.


Instructions

  1. Installation
  2. Dataset
  3. Train
  4. Evaluation

References