scene_text

Text Detection

Text Recognition

End-to-End & Text Spotting

ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting -ustc, PAMI2023,code
Language Matters: A Weakly Supervised Pre-training Approach for Scene Text Detection and Spotting -bytedance, arxiv2022
DEER: Detection-agnostic End-to-End Recognizer for Scene Text Spotting -naver, arxiv2022, code
SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition -jinlianwen, CVPR2022, code
End-to-End Video Text Spotting with Transformer -shenchunhua, arxiv2022, code
Text Spotting Transformers -intel, CVPR2022
PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System -baidu, arxiv2022
[light]PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System -baidu, arxiv2021, code
[icdar competition]1st Place Solution to ICDAR 2021 RRC-ICTEXT End-to-end Text Spotting and Aesthetic Assessment on Integrated Circuit -hikvision, arxiv2021
ABCNet v2: Adaptive Bezier-Curve Network for Real-time End-to-end Text Spotting -jinlianwen, arxiv2021, code
[light]PP-OCR: A Practical Ultra Lightweight OCR System -baidu, arxiv2020, code
Character Region Attention For Text Spotting -ECCV2020
Mask TextSpotter v3: Segmentation Proposal Network for Robust Scene Text Spotting -baixiang, ECCV2020, code
Text Detection and Recognition in the Wild: A Review -arxiv2020
Text Perceptron: Towards End-to-End Arbitrary-Shaped Text Spotting -Liang Qiao, AAAI2020
All You Need Is Boundary: Toward Arbitrary-Shaped Text Spotting -baixiang, AAAI2020
ABCNet: Real-time Scene Text Spotting with Adaptive Bezier-Curve Network -jin lianwen, CVPR2020
Convolutional Character Networks -Linjie Xing, Zhi Tian, Weilin Huang, Matthew R. Scott, ICCV2019
TextDragon: An End-to-End Framework for Arbitrary Shaped Text Spotting -Chenglin Liu, CVPR2019
Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes -baixiang, TPAMI2019
Towards Unconstrained End-to-End Text Spotting -google ai, arxiv2019
Towards End-to-End Text Spotting in Natural Scenes -Hui Li, Peng Wang, Chunhua Shen, arxiv2019
Weakly supervised precise segmentation for historical document images -JIn Lianwen, Neurocomputing2019
A Novel Integrated Framework for Learning both Text Detection and Recognition -alibaba, arxiv2018
TextNet: Irregular Text Reading from Images with an End-to-End Trainable Network -baidu, arxiv2018
Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes -Pengyuan Lyu, Minghui Liao, Cong Yao, Wenhao Wu, Xiang Bai, arxiv2018
FOTS: Fast Oriented Text Spotting with a Unified Network -Xuebo Liu, Ding Liang, Shi Yan, Dagui Chen, Yu Qiao, Junjie Yan, CVPR2018
E2E-MLT - an Unconstrained End-to-End Method for Multi-Language Scene Text -Yash Patel, et al, arxiv2018
SEE: Towards Semi-Supervised End-to-End Scene Text Recognition -Christian Bartz, Haojin Yang, Christoph Meinel, AAAI2018
An end-to-end TextSpotter with Explicit Alignment and Attention -Tong He, Zhi Tian, Weilin Huang, Chunhua Shen, Yu Qiao, Changming Sun, CVPR2018
Towards End-to-end Text Spotting with Convolutional Recurrent Neural Networks -Hui Li, et al, ICCV2017
Deep TextSpotter: An End-to-End Trainable Scene Text Localization and Recognition Framework -Michal Busta, et al, ICCV2017, code
Reading Text in the Wild with Convolutional Neural Networks -Max Jaderberg, et al, IJCV2016

Text Retrieval

Thinking Fast and Slow: Efficient Text-to-Visual Retrieval with Transformers -zisserman, CVPR2021
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval -zisserman, arxiv2021
Scene Text Retrieval via Joint Text Detection and Similarity Learning -baixiang, CVPR2021, code/CSVTR database

Other

AnyText: Multilingual Visual Text Generation And Editing -alibaba, arxiv2023, code
Stroke-Based Scene Text Erasing Using Synthetic Data for Training -TIP2021
Page Layout Analysis System for Unconstrained Historic Documents -ICDAR2021
EraseNet: End-to-End Text Removal in the Wild -Jinlianwen, TIP2020, code
SwapText: Image Based Texts Transfer in Scenes -Qiangpeng Yang, CVPR2020
UnrealText: Synthesizing Realistic Scene Text Images from the Unreal World -Cong Yao, CVPR2020
EnsNet: Ensconce Text in the Wild -JinLianwen, AAAI2019, code
TextSR: Content-Aware Text Super-Resolution Guided by Recognition -forevision, arxiv2019
Editing Text in the Wild -baixiang, ACM MM2019
MTRNet: A Generic Scene Text Eraser -ICDAR2019
Scene Text Detection and Recognition: The Deep Learning Era -face++, arxiv2018
Text/non-text image classification in the wild with convolutional neural networks -X Bai, B Shi, C Zhang, X Cai, L Qi, PR2017
Scene text script identification with convolutional recurrent neural networks -J Mei, L Dai, B Shi, X Bai, ICPR2016

Seq2Seq

Convolutional Sequence to Sequence Learning -FAIR, ICML2017
Sequence Level Training with Recurrent Neural Networks -FAIR, ICLR2016
A Convolutional Encoder Model for Neural Machine Translation -FAIR, arxiv2016

Reading Order

LayoutReader: Pre-training of Text and Layout for Reading Order Detection -MSRA, EMNLP2021, code/database

Database & Generation

chinese

TRW15: ICDAR 2015 Text Reading in the Wild Competition
RCTW-17: ICDAR2017-Reading Chinese Text in the Wild
STV2k: A New Benchmark for Scene Text Detection and Recognition
CTW: Chinese Text in the Wild
PAL10K
COCO TS Dataset
ICPR MTWI 2018 挑战赛一：网络图像的文本识别

other

Comprehensive Benchmark Datasets for Amharic Scene Text Detection and Recognition
Textual Visual Semantic Dataset for Text Spotting
RoadText-1K: Text Detection & Recognition Dataset for Driving Videos DDI-100: Dataset for Text Detection and Recognition Verisimilar Image Synthesis for Accurate Detection and Recognition of Texts in Scenes -Fangneng Zhan, Shijian Lu, and Chuhui Xue, arxiv2018
Total-Text -1555 images
SCUT-CTW1500 -Curved text in the wild
MLT: Multi-lingual scene text detection and script identification -Multi-lingual text: 18,000 images, 9 different languages representing 6 different scripts
Synthetic Word Dataset, Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition
Total-text: A comprehensive dataset for scene text detection and recognition - -Chee Kheng Ch'ng, Chee Seng Chan
Street View Text(SVT)
IIIT 5k-words
MSRA-TD500
KAIST Scene_Text Database
ICDAR2011, ICDAR2013, ICDAR2015, ICDAR2017, robust reading-Focused Scene Text
ICDAR2017-ICDAR 2017 Robust Reading Challenge on Omnidirectional Video(DOST)
COCO-Text
Google French Street Name Signs (FSNS) dataset
ICDAR2017-ICDAR2017 Competition on Multi-lingual scene text detection and script identification(MLT)
ICDAR2017-Born-Digital Images (Web and Email)
Detecting Curve Text in the Wild: New Dataset and New Solution
Synthetic Word
Synthetic Data for Text Localisation in Natural Images -Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR2016

vietnamese

VinText

Competition

ICDAR2017 Competition on Reading Chinese Text in the Wild (RCTW-17) -B Shi, C Yao, M Liao, M Yang, P Xu, L Cui, arxiv2017
ICDAR 2015 competition on robust reading
Incidental Scene Text Understanding: Recent Progresses on ICDAR 2015 Robust Reading Competition Challenge 4 -Cong Yao, Jianan Wu, Xinyu Zhou, Chi Zhang, Shuchang Zhou, Zhimin Cao, Qi Yin

Link

awesome-deep-text-detection-recognition
Awesome-Scene-Text-Recognition
Scene Text Detection

Name		Name	Last commit message	Last commit date
Latest commit History 161 Commits
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

scene_text

Text Detection

Text Recognition

End-to-End & Text Spotting

Text Retrieval

Synthesis

Evaluation

Script identification

Super Resolution

Other

Seq2Seq

Reading Order

Database & Generation

chinese

other

vietnamese

Competition

Link

About

Releases

Packages

yflv-yanxia/scene_text

Folders and files

Latest commit

History

Repository files navigation

scene_text

Text Detection

Text Recognition

End-to-End & Text Spotting

Text Retrieval

Synthesis

Evaluation

Script identification

Super Resolution

Other

Seq2Seq

Reading Order

Database & Generation

chinese

other

vietnamese

Competition

Link

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages