LayoutLMv2 Setup

Environment Setup

Create new conda env with python==3.10.4
Install all the dependencies from requirements.txt pip3 install -r requirements.txt
Activate created conda environment before further process.
Remove the existing detectron2 folder in the repository and run the following commands for installing detectron2

git clone https://github.com/facebookresearch/detectron2.git and

python -m pip install -e detectron2

Add required inputs in train_json.py

data_files= path to create json file

pretrained_model = use previously trained .pt model or else keep it as ""

model_to_save = desired trained model name ( ex: "layout_train.pt" )

num_train_epochs = As per requirement ( default 100 )

batch_size = As per requirement based on available data( default 16 ).
run python3 train_json.py
After completion model will be saved in Models/ folder and a text file will be saved in your data folder in Prepare/ this file has dictionary of ids and labelled mapped. This is used while testing to predict the correct label according to training labels.

Download the pretrained model from the drive link below and add it in Models/ folder

https://drive.google.com/file/d/1z78obbtNrn-enWRscWehlltw4LY_o-gy/view?usp=sharing

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Prepare		Prepare
Results		Results
Readme.md		Readme.md
requirements.txt		requirements.txt
train_json.py		train_json.py