PP-OCR

1. Introduction
2. Features
3. Benchmark
4. Visualization
5. Tutorial
- 5.1 Quick start
- 5.2 Model training / compression / deployment
6. Model zoo

1. Introduction

PP-OCR is a self-developed practical ultra-lightweight OCR system, which is slimed and optimized based on the reimplemented academic algorithms, considering the balance between accuracy and speed.

PP-OCR is a two-stage OCR system, in which the text detection algorithm is DB, and the text recognition algorithm is CRNN. Besides, a text direction classifier is added between the detection and recognition modules to deal with text in different directions.

PP-OCR pipeline is as follows:

PP-OCR system is in continuous optimization. At present, PP-OCR and PP-OCRv2 have been released:

[1] PP-OCR adopts 19 effective strategies from 8 aspects including backbone network selection and adjustment, prediction head design, data augmentation, learning rate transformation strategy, regularization parameter selection, pre-training model use, and automatic model tailoring and quantization to optimize and slim down the models of each module (as shown in the green box above). The final results are an ultra-lightweight Chinese and English OCR model with an overall size of 3.5M and a 2.8M English digital OCR model. For more details, please refer to the PP-OCR technical article (https://arxiv.org/abs/2009.09941).

[2] On the basis of PP-OCR, PP-OCRv2 is further optimized in five aspects. The detection model adopts CML(Collaborative Mutual Learning) knowledge distillation strategy and CopyPaste data expansion strategy. The recognition model adopts LCNet lightweight backbone network, U-DML knowledge distillation strategy and enhanced CTC loss function improvement (as shown in the red box above), which further improves the inference speed and prediction effect. For more details, please refer to the technical report of PP-OCRv2 (https://arxiv.org/abs/2109.03144).

2. Features

Ultra lightweight PP-OCRv2 series models: detection (3.1M) + direction classifier (1.4M) + recognition 8.5M) = 13.0M
Ultra lightweight PP-OCR mobile series models: detection (3.0M) + direction classifier (1.4M) + recognition (5.0M) = 9.4M
General PP-OCR server series models: detection (47.1M) + direction classifier (1.4M) + recognition (94.9M) = 143.4M
Support Chinese, English, and digit recognition, vertical text recognition, and long text recognition
Support multi-lingual recognition: about 80 languages like Korean, Japanese, German, French, etc

3. benchmark

For the performance comparison between PP-OCR series models, please check the benchmark documentation.

4. Visualization more

PP-OCRv2 English model

PP-OCRv2 Chinese model

PP-OCRv2 Multilingual model

5. Tutorial

5.1 Quick start

You can also quickly experience the ultra-lightweight OCR : Online Experience
Mobile DEMO experience (based on EasyEdge and Paddle-Lite, supports iOS and Android systems): Sign in to the website to obtain the QR code for installing the App
One line of code quick use: Quick Start

5.2 Model training / compression / deployment

For more tutorials, including model training, model compression, deployment, etc., please refer to tutorials。

6. Model zoo

PP-OCR Series Model List（Update on September 8th）

Model introduction	Model name	Recommended scene	Detection model	Direction classifier	Recognition model
Chinese and English ultra-lightweight PP-OCRv2 model（11.6M）	ch_PP-OCRv2_xx	Mobile & Server	inference model / trained model	inference model / trained model	inference model / trained model
Chinese and English ultra-lightweight PP-OCR model (9.4M)	ch_ppocr_mobile_v2.0_xx	Mobile & server	inference model / trained model	inference model / trained model	inference model / trained model
Chinese and English general PP-OCR model (143.4M)	ch_ppocr_server_v2.0_xx	Server	inference model / trained model	inference model / trained model	inference model / trained model

For more model downloads (including multiple languages), please refer to PP-OCR series model downloads.

For a new language request, please refer to Guideline for new language_requests.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ppocr_introduction_en.md

ppocr_introduction_en.md

PP-OCR

1. Introduction

2. Features

3. benchmark

4. Visualization more

5. Tutorial

5.1 Quick start

5.2 Model training / compression / deployment

6. Model zoo

PP-OCR Series Model List（Update on September 8th）

Files

ppocr_introduction_en.md

Latest commit

History

ppocr_introduction_en.md

File metadata and controls

PP-OCR

1. Introduction

2. Features

3. benchmark

4. Visualization more

5. Tutorial

5.1 Quick start

5.2 Model training / compression / deployment

6. Model zoo

PP-OCR Series Model List（Update on September 8th）