GitHub - shgidi/stupid_cv: Computer vision pipeline

TLDR

1 line script for training an image classification model (including data acquisition)

Intro

Frameworks such as keras and fast ai allow “easy” training. Why not combine?

Introducing: stupid_cv.

All technologists are automation enthusiasts and - lets face it - script kiddies. And since according to Eric Schmidt computer vision is a solved problem, I’ve decided to assemble a small script that will be kind of auto solver for computer vision.

How does it work

Open images v4 allows to easilly and programmitacly download many images according classes (unlike imagenet, there may be numerous objects in every image)

Install

To install, clone this repo, and install the requirements. You'd better have GPU on your machine.

Usage

First, get the open images data frames:

Dataset type is one of train, validation, test https://storage.googleapis.com/openimages/2018_04/{dataset_type}/{dataset_type}-annotations-bbox.csv dataset_type = train (1.1MB), validation (16MB), test (50MB) (train is very big)

classes specification can be loaded from here:

https://storage.googleapis.com/openimages/2018_04/class-descriptions-boxable.csv

List all classes

python list_classes.py

Run all pipeline

Use it as follows:

python main.py --data_root <some_dir> --classes Apple Banana Orange
Wait for it..
Find the model in `<some_dir>/models
Profit!

Arguments:

data_root - the folder where the images should will be downloaded to, and the models will be saved. Open Images data frames should be placed in this folder
data_type - select the data type form open images, as stated above
classes - select classes to download and train a model on, from open images 600 classes
cut_image - should the training script "cut" the relevant objects from the images

Config

Config model in config.yaml

Todo:

Credit for open image downloader: https://github.com/qfgaohao/pytorch-ssd

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
LICENSE.txt		LICENSE.txt
README.md		README.md
__init__.py		__init__.py
config.yaml		config.yaml
dl_open_images_data.py		dl_open_images_data.py
list_classes.py		list_classes.py
main.py		main.py
mlflow_tracker.py		mlflow_tracker.py
requirements.txt		requirements.txt
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TLDR

Intro

How does it work

Install

Usage

List all classes

Run all pipeline

Config

Todo:

About

Releases

Packages

Languages

License

shgidi/stupid_cv

Folders and files

Latest commit

History

Repository files navigation

TLDR

Intro

How does it work

Install

Usage

List all classes

Run all pipeline

Config

Todo:

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages