Three Method Fine-tune on CLIP for Vehicle Counting Task

Approach

1. General

2. Adapter

3. VPT (shallow)

Hardware

CPU: AMD EPYC 7742 64-Core Processor
RAM: 512GB
GPU: Nvidia A100 (40GB VRAM)
Disk Space Available: 1TB

Install the Required Packages

$ conda install pytorch==1.11.0 torchvision==0.12.0 torchaudio==0.11.0 cudatoolkit=11.3 -c pytorch
$ pip install ftfy regex tqdm torchinfo
$ pip install git+https://github.com/openai/CLIP.git

Prepare KITTI Dataset

Dataset:
Dataset Link
*Note: only need left color images of object data set (12 GB) and training labels of object data set (5 MB).

# And you must organize files into the following structure:

kitti_dataset
     ├── testing
     |      └── image_2 #Only including testing img files
     └── training
            ├── image_2 #Only including training img files
            └── label_2 #Only including txt files

Preprocessing Label

# You should modify the path of your training image_2 folder by yourself in the script (Line 4: kitti_label_file_path).
python text_generation.py

Fine-tune

# Replace "../KITTI_DATASET_ROOT/training/image_2/" into the path of your training image_2 folder.

# General fine tune on whole model
python train.py --kitti_image_file_path "../KITTI_DATASET_ROOT/training/image_2/"

# Using adapter to fine tune
python train.py --adapter --kitti_image_file_path "../KITTI_DATASET_ROOT/training/image_2/"


# Using vpt to fine tune
python train.py --prompt --vpt_version 1or2 --kitti_image_file_path "../KITTI_DATASET_ROOT/training/image_2/"

test

# Replace "../KITTI_DATASET_ROOT/training/image_2/" into the path of your training image_2 folder.

# General fine tune on whole model
python test.py --kitti_image_file_path "../KITTI_DATASET_ROOT/training/image_2/"

# Using adapter to fine tune
python test.py --adapter --kitti_image_file_path "../KITTI_DATASET_ROOT/training/image_2/"


# Using vpt to fine tune
python test.py --prompt --vpt_version 1or2 --kitti_image_file_path "../KITTI_DATASET_ROOT/training/image_2/"

Acknowledgement

This repo benefits from CLIP, AIM, and VPT. Thanks for their wonderful works.

Name		Name	Last commit message	Last commit date
Latest commit History 95 Commits
.github/workflows		.github/workflows
clip		clip
data		data
image		image
notebooks		notebooks
tests		tests
.gitignore		.gitignore
CLIP.png		CLIP.png
CLIP_ORIGINAL_OPEN_SOURCE_README.md		CLIP_ORIGINAL_OPEN_SOURCE_README.md
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
adapter_log.txt		adapter_log.txt
findbest.py.py		findbest.py.py
kitti_text_generation.py		kitti_text_generation.py
label_2_sentence.csv		label_2_sentence.csv
model-card.md		model-card.md
plot.py		plot.py
requirements.txt		requirements.txt
setup.py		setup.py
standard_log.txt		standard_log.txt
test.py		test.py
text_generation.py		text_generation.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Three Method Fine-tune on CLIP for Vehicle Counting Task

Approach

1. General

2. Adapter

3. VPT (shallow)

Hardware

CPU: AMD EPYC 7742 64-Core Processor

RAM: 512GB

GPU: Nvidia A100 (40GB VRAM)

Disk Space Available: 1TB

Install the Required Packages

Prepare KITTI Dataset

Preprocessing Label

Fine-tune

test

Acknowledgement

About

Releases

Packages

Languages

License

arthurleelee/Three-Method-Fine-Tune-On-CLIP-For-Vehicle-Counting-Task

Folders and files

Latest commit

History

Repository files navigation

Three Method Fine-tune on CLIP for Vehicle Counting Task

Approach

1. General

2. Adapter

3. VPT (shallow)

Hardware

CPU: AMD EPYC 7742 64-Core Processor

RAM: 512GB

GPU: Nvidia A100 (40GB VRAM)

Disk Space Available: 1TB

Install the Required Packages

Prepare KITTI Dataset

Preprocessing Label

Fine-tune

test

Acknowledgement

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages