
Fusion Types and Cross Connection Study

Official implementation used for training MATNet variants in our CVPR 2022 work. We provide the best models on MoCA, reaching 70.6% mean IoU.

MoCA Results

Our paper uses threshold 0.2, while the state-of-the-art (SOA) MoCA comparison uses threshold 0.1. For better reporting on MoCA, we recommend computing the area under the curve over different thresholds, but this is outside the scope of our current work.
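For illustration, here is a minimal sketch of such an area-under-the-curve report. It assumes the per-threshold mIoU values have already been evaluated; the numbers below are placeholders, not results from this repository.

```python
# Minimal sketch (assumption: per-threshold mIoU values are already computed)
# of reporting the area under the mIoU-vs-threshold curve instead of a single
# threshold, as suggested above.
import numpy as np

thresholds = np.linspace(0.1, 0.9, 9)          # binarization thresholds to sweep
miou_per_threshold = np.array(                  # placeholder mIoU at each threshold
    [0.70, 0.69, 0.67, 0.64, 0.60, 0.55, 0.48, 0.38, 0.25])

# Normalize by the threshold range so the AUC stays on the same 0-1 scale as mIoU
auc = np.trapz(miou_per_threshold, thresholds) / (thresholds[-1] - thresholds[0])
print(f"mIoU AUC over thresholds: {auc:.3f}")
```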

  • MATNet variant results without additional YouTube-VOS training data (NoYTB) and without the Boundary Aware Refinement module (NoBAR):
| Method | Th | Flip | mIoU | SR_0.5 | SR_0.6 | SR_0.7 | SR_0.8 | SR_0.9 | mSR |
|---|---|---|---|---|---|---|---|---|---|
| FusionSeg Modified | 0.2 | No | 42.3 | 47.9 | 43.6 | 35.9 | 24.2 | 9.4 | 39.2 |
| RTNet | 0.2 | No | 60.7 | 67.9 | 62.4 | 53.6 | 43.4 | 23.9 | 50.2 |
| MATNet reproduced | 0.2 | No | 67.3 | 75.9 | 70.8 | 61.9 | 48.6 | 26.0 | 56.6 |
| MATNet NoBAR | 0.2 | No | 65.1 | 73.6 | 68.0 | 58.9 | 44.7 | 21.5 | 53.3 |
| MATNet NoYTB | 0.2 | No | 54.7 | 59.9 | 53.5 | 44.0 | 31.0 | 13.4 | 40.3 |
  • In the main submission we found RTNet with reciprocal cross connections to be heavily static biased. Our additional experiments here show that reciprocal connections (motion-to-appearance and appearance-to-motion) can encourage dynamics if trained with the proper fusion and training data, without pretraining towards saliency. Training reciprocal cross connections (similar to RTNet) with gated fusion (similar to MATNet) achieves the best performance on MoCA and shows an increase in dynamic bias, unlike the original RTNet. RTNet's convex-combination gated fusion, on the other hand, has been shown to cause accuracy degradation. (A minimal sketch of combining reciprocal cross connections with gated fusion follows the table below.)
| Method | Th | Flip | mIoU | SR_0.5 | SR_0.6 | SR_0.7 | SR_0.8 | SR_0.9 | mSR |
|---|---|---|---|---|---|---|---|---|---|
| NonRecip CC + Gated Fusion | 0.1 | Yes | 70.2 | 79.4 | 74.1 | 64.6 | 49.0 | 23.8 | 58.2 |
| NonRecip CC + Gated Fusion | 0.2 | Yes | 68.5 | 77.3 | 72.2 | 63.5 | 50.6 | 27.0 | 58.1 |
| Recip CC + Gated Fusion | 0.2 | Yes | 70.6 | 81.2 | 75.5 | 65.0 | 48.1 | 23.0 | 58.6 |
| Recip CC + Gated Fusion | 0.1 | Yes | 67.6 | 77.9 | 70.1 | 59.1 | 40.7 | 16.8 | 52.9 |
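For illustration, here is a minimal PyTorch sketch of reciprocal cross connections combined with a gated fusion. This is not the repository's implementation; the module, the 1x1-convolution gates, and the tensor shapes are assumptions for illustration only.

```python
# Minimal sketch (not the repository's exact modules) of reciprocal cross
# connections combined with gated fusion, assuming 4-D feature maps
# (B, C, H, W) from the appearance and motion streams.
import torch
import torch.nn as nn


class ReciprocalGatedFusion(nn.Module):
    """Hypothetical illustration: each stream gates the other (reciprocal
    cross connections), then the two streams are fused with learned gates."""

    def __init__(self, channels: int):
        super().__init__()
        # 1x1 convolutions producing per-pixel gates in [0, 1]
        self.gate_m2a = nn.Sequential(nn.Conv2d(channels, channels, 1), nn.Sigmoid())
        self.gate_a2m = nn.Sequential(nn.Conv2d(channels, channels, 1), nn.Sigmoid())
        self.gate_fuse_a = nn.Sequential(nn.Conv2d(channels, channels, 1), nn.Sigmoid())
        self.gate_fuse_m = nn.Sequential(nn.Conv2d(channels, channels, 1), nn.Sigmoid())

    def forward(self, app: torch.Tensor, mot: torch.Tensor) -> torch.Tensor:
        # Reciprocal cross connections: motion modulates appearance and vice versa
        app_cc = app * self.gate_m2a(mot)   # motion-to-appearance
        mot_cc = mot * self.gate_a2m(app)   # appearance-to-motion
        # Gated fusion of the two cross-connected streams
        return self.gate_fuse_a(app_cc) * app_cc + self.gate_fuse_m(mot_cc) * mot_cc


if __name__ == "__main__":
    fusion = ReciprocalGatedFusion(channels=256)
    app = torch.randn(2, 256, 30, 56)   # appearance features
    mot = torch.randn(2, 256, 30, 56)   # motion (optical flow) features
    print(fusion(app, mot).shape)       # torch.Size([2, 256, 30, 56])
```

The sketch only conveys the idea described above: cross connections in both directions (as in RTNet) feeding a gated fusion (as in MATNet); the repository's actual layers follow those works.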
  • Results showing the static-dynamic bias of the new Recip CC + Gated Fusion at the final fusion layer w.r.t. the other models.


  • Results showing the mIoU on MoCA when masking the top-K units per factor. This aligns with the previous results: fusion layer 2 is dynamic biased, while fusion layers 3, 4 and 5 are static biased. When sampling random units, we take the (K+5)% of units that are least biased towards the significant factor (i.e. dynamic in fusion layer 2 and static in the rest) and then randomly select within these. We do this to ensure that random selection, especially at higher percentages, does not sample units biased towards the corresponding significant factor, whether static or dynamic. (A minimal sketch of this selection procedure is shown below.)
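A minimal sketch, assuming per-unit bias scores are already available, of how the random selection from the (K+5)% least biased units could be implemented; the function name and inputs are hypothetical.

```python
# Minimal sketch (assumed, not the repository's code) of selecting K% random
# units from the (K+5)% units that are least biased towards the significant
# factor, so the random baseline avoids the most biased units.
import numpy as np


def sample_random_units(bias_scores: np.ndarray, k_percent: float, seed: int = 0) -> np.ndarray:
    """bias_scores[i] is unit i's bias towards the significant factor
    (e.g. dynamic in fusion layer 2, static elsewhere). Returns indices of
    K% randomly chosen units drawn from the (K+5)% least biased units."""
    rng = np.random.default_rng(seed)
    n_units = len(bias_scores)
    n_pool = int(round((k_percent + 5) / 100 * n_units))   # (K+5)% candidate pool
    n_select = int(round(k_percent / 100 * n_units))       # K% units to mask
    pool = np.argsort(bias_scores)[:n_pool]                # least biased units first
    return rng.choice(pool, size=n_select, replace=False)


if __name__ == "__main__":
    scores = np.random.rand(512)            # hypothetical per-unit bias scores
    units = sample_random_units(scores, k_percent=20)
    print(units.shape)                      # (102,) -> 20% of 512 units
```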


Installation

The training and testing experiments are conducted using Python 3.7 and PyTorch 1.9 with multi-GPU support. Other minor Python modules can be installed by running:

pip install -r requirements.txt

Datasets

Download Datasets

We follow MATNet and use the following two publicly available datasets for training. Here are the steps to prepare the data:

  • DAVIS-17: we use all the data in the train subset of DAVIS-16. However, please download DAVIS-17 to fit the code; it will automatically choose the DAVIS-16 subset for training.
  • YoutubeVOS-2018: we sample the training data every 10 frames in YoutubeVOS-2018. We use the dataset version with 6 fps rather than 30 fps (a frame-sampling sketch is shown after this list).
  • Create soft links: cd data; ln -s your/davis17/path DAVIS2017; ln -s your/youtubevos/path YouTubeVOS_2018;
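A minimal sketch of the every-10th-frame sampling for YouTubeVOS-2018 mentioned above; the directory layout and destination path are assumptions, not part of the repository.

```python
# Minimal sketch (hypothetical paths and layout) of keeping every 10th frame
# of each YouTubeVOS-2018 training sequence, as described above.
import os
import shutil

SRC = "data/YouTubeVOS_2018/train/JPEGImages"        # assumed source layout
DST = "data/YouTubeVOS_2018/train_sampled/JPEGImages"  # assumed destination

for seq in sorted(os.listdir(SRC)):
    frames = sorted(os.listdir(os.path.join(SRC, seq)))
    os.makedirs(os.path.join(DST, seq), exist_ok=True)
    for frame in frames[::10]:                        # keep every 10th frame
        shutil.copy(os.path.join(SRC, seq, frame),
                    os.path.join(DST, seq, frame))
```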

Prepare Edge Annotations, HED Results and Optical Flow

Use MATNet instructions from here

Prepare MoCA for Evaluation

Follow motiongrouping instructions from here

Train

  • Choose the right config:

    • Original MATNet: configs/two_stream.yaml
    • Reciprocal version with gated fusion: configs/two_stream_coatt_gating_recip.yaml
  • Set the correct checkpoint path in the CONFIG you choose

  • Run the training command below; it runs on two 1080 Ti GPUs.

CUDA_VISIBLE_DEVICES=2,3 python train_MATNet.py -cfg_file CONFIG -gpu_id 0 1 -wandb_run_name WANDB_RUN

Test

  • If no ckpt_path is provided in the config, the path passed on the command line is used; if one is set in the config file, it takes higher priority (a minimal sketch of this priority logic follows the command below).
python test_MATNet.py -ckpt_epoch BEST_EPOCH -ckpt_path CKPT_PATH -result_dir RESULT_DIR
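A minimal sketch of the checkpoint-path priority described above, with assumed names; the config key and function are hypothetical, not the repository's API.

```python
# Minimal sketch (assumed names) of the checkpoint-path priority described
# above: a path set in the config file wins over the command-line argument.
def resolve_ckpt_path(cli_ckpt_path: str, config: dict) -> str:
    cfg_ckpt_path = config.get("ckpt_path")    # hypothetical config key
    return cfg_ckpt_path if cfg_ckpt_path else cli_ckpt_path


if __name__ == "__main__":
    print(resolve_ckpt_path("runs/exp1/ckpt.pth", {}))                          # CLI path used
    print(resolve_ckpt_path("runs/exp1/ckpt.pth", {"ckpt_path": "best.pth"}))   # config wins
```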

Inference and Evaluation on MoCA

bash scripts/eval_MoCA.sh CFG CKPT BEST_EPOCH MASK_RESULT_DIR GPU_ID CSV_RESULT_DIR

Trained Models

For the original MATNet, use their provided models; for the reciprocal version with gated fusion that achieved the best MoCA results, use this model.

BibTeX

If you find this repository useful, please consider citing our work 🦖

@InProceedings{kowal2022deeper,
  title={A Deeper Dive Into What Deep Spatiotemporal Networks Encode: Quantifying Static vs. Dynamic Information},
  author={Kowal, Matthew and Siam, Mennatullah and Islam, Md Amirul and Bruce, Neil and Wildes, Richard P. and Derpanis, Konstantinos G.},
  booktitle={Conference on Computer Vision and Pattern Recognition},
  year={2022}
}

References

  • This repository heavily relies on the MATNet repo.
