Prismer

arXiv · Hugging Face Space

This directory is based on the original Prismer repository (NVlabs/prismer), the implementation of "Prismer: A Vision-Language Model with An Ensemble of Experts", which can be found here. Links to the paper and the authors' Hugging Face page are above.

It contains the Prismer code with updates to support fine-tuning for classification on the Hateful Memes dataset from the Facebook challenge.
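For orientation, here is a minimal sketch of the idea behind the classification fine-tuning: a small binary head on top of pooled Prismer features, trained with cross-entropy on the hateful/not-hateful label. All names below (MemeClassifier, the backbone's pooled output) are hypothetical and do not match the actual classes in train_classification.py.

```python
import torch.nn as nn

class MemeClassifier(nn.Module):
    """Binary head over pooled vision-language features (hateful vs. not)."""

    def __init__(self, backbone: nn.Module, feat_dim: int = 768):
        super().__init__()
        self.backbone = backbone            # frozen or fine-tuned Prismer trunk
        self.head = nn.Linear(feat_dim, 2)  # 2 logits: not-hateful / hateful

    def forward(self, image, text):
        # Assumes the backbone returns a pooled (batch, feat_dim) feature.
        feats = self.backbone(image, text)
        return self.head(feats)

# Training then reduces to standard cross-entropy on the jsonl `label` field:
# loss = nn.functional.cross_entropy(logits, labels)
```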

How to Run

  1. Download the dataset from here. Note that the .jsonl files and the images within hm_data are the actual data needed.
    • Take note of where you download the files, as configs/experts.yaml and configs/classification.yaml will need to be updated with the location.
    • The deduplicated combination of the _seen and _unseen splits for dev and test can be created using hm_data/combine_sets.py (see the sketch after this list). The train set is used for training, and the dev set is used for validation. The test set is used for the final metrics saved in the report and is not touched until the very end.
  2. Install all package dependencies by running pip install -r requirements.txt
  3. Follow the setup steps in the original repo to set up the accelerate config, download the expert pre-trained models, and download the Prismer pre-trained models.
  4. Run accelerate launch experts/generate_{EXPERT_NAME}.py for each expert to generate the transformed images for Prismer.
  5. Fine-tuning can be run with accelerate launch train_classification.py --exp_name {pre-trained model}
    • If using PrismerZ, update configs/classification.yaml to not use any experts; otherwise populate all experts.
    • Outputs of training are saved under logging. Some sample runs are saved there, but this is not an exhaustive record of every run.
  6. Validation can be run with accelerate launch demo_caption.py --exp_name {trained model} --from_checkpoint.
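Conceptually, hm_data/combine_sets.py performs a deduplicated merge of the _seen and _unseen JSONL splits keyed on the sample id. A minimal sketch of that idea, assuming the challenge's standard file layout (the output file name here is an assumption):

```python
import json

def combine(paths, out_path):
    """Merge several Hateful Memes JSONL splits, dropping duplicate ids."""
    seen_ids, merged = set(), []
    for path in paths:
        with open(path) as f:
            for line in f:
                row = json.loads(line)  # each line: id, img, label, text
                if row["id"] not in seen_ids:
                    seen_ids.add(row["id"])
                    merged.append(row)
    with open(out_path, "w") as f:
        for row in merged:
            f.write(json.dumps(row) + "\n")

# e.g. combine(["dev_seen.jsonl", "dev_unseen.jsonl"], "dev_all.jsonl")
```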
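The challenge's headline metric is AUROC, usually reported alongside accuracy. Below is a small sketch of scoring saved predictions against ground-truth labels; the {id, proba} prediction format is an assumption for illustration, not necessarily what this repo writes out.

```python
import json
from sklearn.metrics import roc_auc_score, accuracy_score

def score(pred_path, label_path):
    """AUROC + accuracy given JSONL files of {id, proba} and {id, label}."""
    probs = {r["id"]: r["proba"] for r in map(json.loads, open(pred_path))}
    labels = {r["id"]: r["label"] for r in map(json.loads, open(label_path))}
    ids = sorted(labels)
    y_true = [labels[i] for i in ids]
    y_score = [probs[i] for i in ids]
    y_pred = [int(p >= 0.5) for p in y_score]  # threshold at 0.5 for accuracy
    return roc_auc_score(y_true, y_score), accuracy_score(y_true, y_pred)
```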
