SVHN-deep-cnn-digit-detector

This project implements a deep CNN digit detector (and recognizer) for natural scene images. I used the Keras framework and the OpenCV library to build it. The detector uses a CNN classifier to decide whether each region proposed by the MSER algorithm contains a digit.

Prerequisites

python 2.7
keras 1.2.2
opencv 2.4.11
tensorflow-gpu 1.0.1
Etc.

A list of all the packages needed to run this project can be found in digit_detector.yml.

Anaconda Env

I recommend creating a dedicated anaconda env for this project so that its dependencies stay independent of your other projects. You can create it by following these simple steps.

Create the anaconda env with the following command:
- $ conda env create -f digit_detector.yml
Activate the env:
- $ source activate digit_detector
Run the project in this env.

Usage

The procedure to build the digit detector is as follows:

0. Download Dataset

Download train.tar.gz from https://ufldl.stanford.edu/housenumbers/ and extract the archive.

1. Load training samples (1_sample_loader.py)

SVHN provides cropped training samples in MATLAB format. However, those samples are not suitable for training a bounding-box detector, because each crop includes distracting digits on either side of the digit of interest. I therefore collected the training samples directly from the full-number images and their annotation file.

Train samples : (457723, 32, 32, 3)
Validation samples : (113430, 32, 32, 3)
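As a rough illustration of collecting samples from the full images, the sketch below clips an annotated box to the image bounds and crops it. It is not the code in 1_sample_loader.py: the helper names are hypothetical, the image is a plain list of rows, and resizing the crop to 32x32 is left out. It assumes the usual SVHN annotation fields (left, top, width, height, label), where the digit 0 is stored as label 10.

```python
def clamp_box(left, top, width, height, img_w, img_h):
    """Clip a (left, top, width, height) box to the image bounds.

    Returns (x0, y0, x1, y1) with x1 and y1 exclusive.
    Annotated boxes may extend past the image border, so clipping is needed.
    """
    x0 = max(0, int(left))
    y0 = max(0, int(top))
    x1 = min(img_w, int(left + width))
    y1 = min(img_h, int(top + height))
    return x0, y0, x1, y1


def crop(image, box):
    """Crop a row-major image (a list of rows) to box = (x0, y0, x1, y1)."""
    x0, y0, x1, y1 = box
    return [row[x0:x1] for row in image[y0:y1]]


def svhn_label_to_digit(label):
    """SVHN stores the digit 0 as label 10; map it back to 0."""
    return 0 if int(label) == 10 else int(label)


# Tiny 4-row x 6-column "image" whose pixel value encodes (row, column).
image = [[r * 10 + c for c in range(6)] for r in range(4)]
box = clamp_box(left=-2, top=1, width=5, height=10, img_w=6, img_h=4)
patch = crop(image, box)
```

In a real loader the crop would then be resized to 32x32 before being stacked into the sample array.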

2. Train classifier (2_train.py)

2.1. Classifier used for detection

I designed a Convolutional Neural Network architecture for detecting characters. This network classifies each input patch as text or non-text.

The architecture is as follows:

INPUT: [32x32x1]
CONV3-32: [32x32x32]
CONV3-32: [32x32x32]
POOL2: [16x16x32]
CONV3-64: [16x16x64]
CONV3-64: [16x16x64]
POOL2: [8x8x64]
FC: [1x1x1024]
- Dropout is applied to this layer.
FC: [1x1x2]
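The layer sizes listed above can be checked with a small shape tracer. This is only a sketch: it assumes the 3x3 convolutions use stride 1 with 'same' padding (so they preserve height and width) and that POOL2 is non-overlapping 2x2 pooling, which is what the listed shapes imply. The function and constant names are made up for illustration.

```python
def trace_shapes(layers, input_shape=(32, 32, 1)):
    """Return the output shape after each layer as a list of (name, (h, w, c))."""
    h, w, c = input_shape
    shapes = [("INPUT", (h, w, c))]
    for kind, arg in layers:
        if kind == "conv":
            # Assumed: 3x3 kernel, stride 1, 'same' padding, so only channels change.
            c = arg
        elif kind == "pool":
            # Assumed: non-overlapping arg x arg pooling.
            h, w = h // arg, w // arg
        elif kind == "fc":
            # A fully connected layer flattens everything into `arg` units.
            h, w, c = 1, 1, arg
        else:
            raise ValueError("unknown layer kind: %r" % (kind,))
        shapes.append((kind, (h, w, c)))
    return shapes


DETECTOR_LAYERS = [
    ("conv", 32), ("conv", 32), ("pool", 2),
    ("conv", 64), ("conv", 64), ("pool", 2),
    ("fc", 1024), ("fc", 2),
]

detector_shapes = trace_shapes(DETECTOR_LAYERS)
for name, shape in detector_shapes:
    print(name, shape)
```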

The accuracy of the classifier is as follows:

Training Accuracy : 97.91%
Test Accuracy : 96.98%

2.2. Classifier used for recognition

This Convolutional Neural Network recognizes digits. The architecture is the same as the detection network except for the number of output classes.

The architecture is as follows:

INPUT: [32x32x1]
CONV3-32: [32x32x32]
CONV3-32: [32x32x32]
POOL2: [16x16x32]
CONV3-64: [16x16x64]
CONV3-64: [16x16x64]
POOL2: [8x8x64]
FC: [1x1x1024]
- Dropout is applied to this layer.
FC: [1x1x10]
- The number of classes is 10.
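Because the two networks differ only in the final layer, their sizes are nearly identical. The sketch below counts trainable parameters for either head, assuming 3x3 kernels, a bias term in every layer, and an 8x8x64 feature map feeding the first FC layer; the function name is hypothetical and the counts follow from these assumptions rather than from the trained models.

```python
def count_parameters(num_classes, in_channels=1):
    """Count weights and biases for the listed architecture.

    Assumes 3x3 convolutions with biases and an 8x8 feature map
    after the second pooling layer.
    """
    total = 0
    prev = in_channels
    for out in (32, 32, 64, 64):
        total += 3 * 3 * prev * out + out   # kernel weights + biases
        prev = out
    flat = 8 * 8 * prev                      # flattened POOL2 output
    total += flat * 1024 + 1024              # FC-1024
    total += 1024 * num_classes + num_classes  # output layer
    return total


detector_params = count_parameters(num_classes=2)
recognizer_params = count_parameters(num_classes=10)
print(detector_params, recognizer_params, recognizer_params - detector_params)
```

Under these assumptions almost all parameters sit in the first fully connected layer, which is consistent with dropout being applied there.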

The accuracy of the classifier is as follows:

Training Accuracy : 95.41%
Test Accuracy : 94.52%

3. Run the detector (3_detect.py)

At run time, the detector operates in two steps.

First, the MSER algorithm proposes candidate regions.

Then, the classifier determines whether each proposed region contains a digit.
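The two steps can be sketched as a small function that takes the proposer and the classifier as plain callables. This is not the code in 3_detect.py: the function names, the box format, and the 0.5 score threshold are assumptions, and the stand-in proposer and classifier below simply return fixed values where the real project would use MSER and the trained CNN.

```python
def detect_digits(image, propose_regions, classify_patch, threshold=0.5):
    """Run the propose-then-classify pipeline.

    propose_regions(image) -> iterable of boxes (x0, y0, x1, y1)
    classify_patch(image, box) -> score in [0, 1] that the box holds a digit
    Returns the kept (box, score) pairs, highest score first.
    """
    detections = []
    for box in propose_regions(image):
        score = classify_patch(image, box)
        if score >= threshold:           # threshold value is an assumption
            detections.append((box, score))
    detections.sort(key=lambda d: d[1], reverse=True)
    return detections


# Stand-ins for the MSER proposer and the CNN classifier (illustration only).
def fake_proposer(image):
    return [(0, 0, 10, 10), (5, 5, 20, 20), (30, 30, 40, 40)]


def fake_classifier(image, box):
    scores = {(0, 0, 10, 10): 0.9, (5, 5, 20, 20): 0.2, (30, 30, 40, 40): 0.7}
    return scores[box]


result = detect_digits(None, fake_proposer, fake_classifier)
```

Passing the two stages in as callables keeps the sketch independent of any particular OpenCV or Keras version.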

4. Evaluate the detector (4_evaluate.py)

4.1. Performance of the MSER proposer

recall value : 0.630
precision value : 0.045
f1_score : 0.084
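The README does not state how a proposal is matched to a ground-truth box. A common choice, assumed here purely for illustration, is to count a match when the intersection over union (IoU) is at least 0.5. The sketch below computes precision and recall under that assumption; all names are hypothetical.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x0, y0, x1, y1)."""
    iw = max(0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / float(union) if union > 0 else 0.0


def precision_recall(proposals, truths, min_iou=0.5):
    """Precision: share of proposals that overlap some ground-truth box.
    Recall: share of ground-truth boxes covered by some proposal."""
    good = sum(1 for p in proposals if any(iou(p, t) >= min_iou for t in truths))
    found = sum(1 for t in truths if any(iou(p, t) >= min_iou for p in proposals))
    precision = good / float(len(proposals)) if proposals else 0.0
    recall = found / float(len(truths)) if truths else 0.0
    return precision, recall


truths = [(0, 0, 10, 10), (20, 20, 30, 30)]
proposals = [(0, 0, 10, 10), (1, 1, 11, 11), (50, 50, 60, 60), (100, 100, 110, 110)]
precision, recall = precision_recall(proposals, truths)
```

A region proposer such as MSER typically emits many boxes per image, most of which are not digits, which is why its precision is low even when its recall is reasonable.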

4.2. Performance of the MSER+CNN detector

recall value : 0.513
precision value : 0.714
f1_score : 0.597
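The reported F1 scores are consistent with the standard definition, the harmonic mean of precision and recall. The short check below recomputes them from the precision and recall values listed in this section; the function name is chosen for illustration.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall (0.0 when both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2.0 * precision * recall / (precision + recall)


mser_f1 = f1_score(precision=0.045, recall=0.630)
mser_cnn_f1 = f1_score(precision=0.714, recall=0.513)
print(round(mser_f1, 3), round(mser_cnn_f1, 3))
```

Adding the CNN classifier lowers recall from 0.630 to 0.513 but raises precision from 0.045 to 0.714, which is what drives the F1 improvement.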
