yflv-yanxia/scene_text

scene_text

Text Detection

DETRs Beat YOLOs on Real-time Object Detection -baidu, arxiv2023, code
Real-time Scene Text Detection Based on Global Level and Word Level Features -arxiv2022
Kernel Proposal Network for Arbitrary Shape Text Detection -yinxucheng, TNNLS2022, code
Real-Time Scene Text Detection with Differentiable Binarization and Adaptive Scale Fusion -baixiang, PAMI2022, code
Towards End-to-End Unified Scene Text Detection and Layout Analysis -CVPR2022, google, code
Arbitrary Shape Text Detection using Transformers -arxiv2022
Few Could Be Better Than All: Feature Sampling and Grouping for Scene Text Detection -baixiang, CVPR2022
Vision-Language Pre-Training for Boosting Scene Text Detectors -baixiang, CVPR2022
UNITS: Unsupervised Intermediate Training Stage for Scene Text Detection -guoyouhui, ICME2022
Arbitrary Shape Text Detection via Boundary Transformer -yinxucheng, arxiv2022
Arbitrary Shape Text Detection via Segmentation with Probability Maps -yinxucheng, PAMI2022, code
HRRegionNet: Chinese Character Segmentation in Historical Documents with Regional Awareness -ICDAR2021
[Real-Time]Real-Time Scene Text Detection with Differentiable Binarization -baixiang, AAAI2020, code
Deep relational reasoning graph network for arbitrary shape text detection -yinxucheng, CVPR2020, code
All you need is boundary: Toward arbitrary-shaped text spotting -baixiang, AAAI2020
All you need is a second look: Towards Tighter Arbitrary shape text detection -arxiv2020
Self-Training for Domain Adaptive Scene Text Detection -arxiv2020
NENET: An Edge Learnable Network for Link Prediction in Scene Text -arxiv2020
Efficient Scene Text Detection with Textual Attention Tower -Liang Zhang, ICASSP2020
Scale-Invariant Multi-Oriented Text Detection in Wild Scene Images -Kinjal Dasgupta, arxiv2020
PuzzleNet: Scene Text Detection by Segment Context Graph Learning -Hao Liu, arxiv2020
Refined Gate: A Simple and Effective Gating Mechanism for Recurrent Units -Yu Qiao, arxiv2020
HRCenterNet: An Anchorless Approach to Chinese Character Segmentation in Historical Documents -BigData2020, code
Look more than once: An accurate detector for text of arbitrary shapes -baidu, CVPR2019
Gliding vertex on the horizontal bounding box for multi-oriented object detection -Xiang Bai, arxiv2019, code
Exploring the Capacity of Sequential-free Box Discretization Network for Omnidirectional Scene Text Detection -jinlianwen, arxiv2019
Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network -face++, ICCV 2019, code
A Single-Shot Arbitrarily-Shaped Text Detector based on Context Attended Multi-Task Learning -Pengfei Wang, arxiv2019
It's All About The Scale -- Efficient Text Detection Using Adaptive Scaling -Elad Richardson, arxiv2019
FaSTExt: Fast and Small Text Extractor -Alexander Filonenko, arxiv2019
Curved Text Detection in Natural Scene Images with Semi- and Weakly-Supervised Learning -Xugong Qin, arxiv2019
Learning Shape-Aware Embedding for Scene Text Detection -CUHK, Tencent, CVPR2019
Shape Robust Text Detection with Progressive Scale Expansion Network -megi++, CVPR2019
Arbitrary Shape Scene Text Detection with Adaptive Text Region Representation -Xiaobing Wang, Yingying Jiang, Zhenbo Luo, Cheng-Lin Liu, Hyunsoo Choi, Sungjin Kim, CVPR2019
Character Region Awareness for Text Detection -Youngmin Baek, Bado Lee, Dongyoon Han, Sangdoo Yun, Hwalsuk Lee, CVPR2019
Towards Robust Curve Text Detection with Conditional Spatial Expansion -Zichuan Liu, Guosheng Lin, Sheng Yang, Fayao Liu, Weisi Lin, Wang Ling Goh, CVPR2019
Pyramid Mask Text Detector -sensetime, arxiv2019
Look More Than Once: An Accurate Detector for Text of Arbitrary Shapes -baidu, CVPR2019
Character Region Awareness for Text Detection -Clova, CVPR2019
Detecting Text in the Wild with Deep Character Embedding Network -baidu, arxiv2019
TextField: Learning A Deep Direction Field for Irregular Scene Text Detection -Yongchao Xu, Yukang Wang, Wei Zhou, Yongpan Wang, Zhibo Yang, Xiang Bai, arxiv2018
TextMountain: Accurate Scene Text Detection via Instance Segmentation -Yixing Zhu, Jun Du, arxiv2018
Mask R-CNN with Pyramid Attention Network for Scene Text Detection -MSRA, arxiv2018
Scene Text Detection with Supervised Pyramid Context Network -face++, AAAI2019
Pixel-Anchor: A Fast Oriented Scene Text Detector with Combined Networks -cloudwalk, arxiv2018
Improving Rotated Text Detection with Rotation Region Proposal Networks -facebook, arxiv2018
IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection -Alibaba, IJCAI2018
TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes -peking, face++, arxiv2018
PSENET: Shape Robust Text Detection with Progressive Scale Expansion Network -deepinsight, CVPR2019
Arbitrary-Oriented Scene Text Detection via Rotation Proposals -J Ma, W Shao, H Ye, L Wang, H Wang, TMM2018
TextBoxes++: A Single-Shot Oriented Scene Text Detector -Minghui Liao, Baoguang Shi, Xiang Bai, arxiv2018, code
Dense and Tight Detection of Chinese Characters in Historical Documents: Datasets and a Recognition Guided Detector -JinLianwen, IEEEaccess2018
R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection -Samsung, arxiv2018
Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation -Pengyuan Lyu, Cong Yao, Wenhao Wu, Shuicheng Yan, Xiang Bai, arxiv2018
PixelLink: Detecting Scene Text via Instance Segmentation -Dan Deng, Haifeng Liu, Xuelong Li, Deng Cai, AAAI2018
EAST: an efficient and accurate scene text detector -Megvii, CVPR2017, code
Scene text detection and segmentation based on cascaded convolution neural networks -Y Tang, X Wu, TIP2017
TextBoxes: A Fast Text Detector with a Single Deep Neural Network -M Liao, B Shi, X Bai, X Wang, W Liu, AAAI2017, code
Deep direct regression for multi-oriented scene text detection -W He, XY Zhang, F Yin, CL Liu, ICCV2017
Detecting oriented text in natural images by linking segments -B Shi, X Bai, S Belongie, CVPR2017, code
Deep Matching Prior Network: Toward Tighter Multi-oriented Text Detection -Yuliang Liu, Lianwen Jin, CVPR2017
Feature Enhancement Network: A Refined Scene Text Detector -Sheng Zhang, Yuliang Liu, Lianwen Jin, Canjie Luo, arxiv2017
Single Shot Text Detector with Regional Attention -Pan He, Weilin Huang, Tong He, Qile Zhu, Yu Qiao, and Xiaolin Li, ICCV2017
A Convolutional Neural Network-Based Chinese Text Detection Algorithm via Text Structure Modeling -Xiaohang Ren, Yi Zhou, Jianhua He, Kai Chen, Xiaokang Yang, Jun Sun, TMM2017
Fused Text Segmentation Networks for Multi-oriented Scene Text Detection -Yuchen Dai, et al, arxiv2017
Scene Text Detection with Novel Superpixel Based Character Candidate Extraction -Cong Wang, Fei Yin, Cheng-Lin Liu, ICDAR2017
WeText: Scene Text Detection under Weak Supervision -Shangxuan Tian, Shijian Lu, Chongshou Li, ICCV2017
WordSup: Exploiting Word Annotations for Character based Text Detection -MSRA, IDL, ICCV2017
Deep Residual Text Detection Network for Scene Text -Xiangyu Zhu, et al, arxiv2017
Cascaded Segmentation-Detection Networks for Word-Level Text Spotting -Siyang Qin, Roberto Manduchi, arxiv2017
Arbitrary-Oriented Scene Text Detection via Rotation Proposals -Jianqi Ma, et al, TMM2017
Multi-oriented text detection with fully convolutional networks -Z Zhang, C Zhang, W Shen, C Yao, CVPR2016
Scene text detection via holistic, multi-channel prediction -C Yao, X Bai, N Sang, X Zhou, S Zhou, arxiv2016

Text Recognition

Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer -bytedance, CVPR2024, code
Revisiting Scene Text Recognition: A Data Perspective -jinlianwen, ICCV2023, code
Context Perception Parallel Decoder for Scene Text Recognition -baidu, arxiv2023
CDistNet: Perceiving multi-domain character distance for robust text recognition -fudan, IJCV2023, code
TrOCR: Transformer-based optical character recognition with pre-trained models -Microsoft, AAAI2023, code
Context-Based Contrastive Learning for Scene Text Recognition -AAAI2022
SVTR: Scene Text Recognition with a Single Visual Model -baidu, IJCAI2022, code
Multi-modal Text Recognition Networks: Interactive enhancements between visual and semantic features -ECCV2022
Reciprocal Feature Learning via Explicit and Implicit Tasks in Scene Text Recognition -hikvision, ICDAR2021, code
Dictionary-Guided Scene Text Recognition -CVPR2021, code
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models -beihang, arxiv2021, code
RecycleNet: An Overlapped Text Instance Recovery Approach -tencent, MMM21
Vision Transformer for Fast and Efficient Scene Text Recognition -Rowel, ICDAR2021
Visual-semantic transformer for scene text recognition -pingan, arxiv2021
PGNet: Real-time Arbitrarily-Shaped Text Spotting with Point Gathering Network -baidu, AAAI2021, code
What If We Only Use Real Datasets for Scene Text Recognition? Toward Scene Text Recognition With Fewer Labels -tokyo, CVPR2021, code
Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition -ShanchengFang, CVPR2021, code
[light]Hamming OCR: A Locality Sensitive Hashing Neural Network for Scene Text Recognition -pingan, arxiv2020
Gaussian Constrained Attention Network for Scene Text Recognition -qiaozhi, ICPR2020, code
Adaptive Text Recognition through Visual Matching -zisserman, ECCV2020
On Vocabulary Reliance in Scene Text Recognition -megvii, CVPR2020
Joint Layout Analysis, Character Detection and Recognition for Historical Document Digitization -JinLianwen, ICFHR2020
Text Recognition in Real Scenarios with a Few Labeled Samples -arxiv2020
RobustScanner: Dynamically Enhancing Positional Clues for Robust Text Recognition -ECCV2020
On Recognizing Texts of Arbitrary Shapes With 2D Self-Attention -CVPRW2020
SEED: Semantics Enhanced Encoder-Decoder Framework for Scene Text Recognition -zhi qiao, CVPR2020
Text Recognition in the Wild: A Survey -jinlianwen, arxiv2020
GTC: Guided Training of CTC Towards Efficient and Accurate Scene Text Recognition -Wenyang Hu, AAAI2020
A New Perspective for Flexible Feature Gathering in Scene Text Recognition Via Character Anchor Pooling -yao cong, ICASSP2020
SCATTER: Selective Context Attentional Scene Text Recognizer -Ron Litman, CVPR2020
Scene Text Recognition via Transformer -Xinjie Feng, arxiv2020
Efficient Backbone Search for Scene Text Recognition -baixiang, arxiv2020
Towards Accurate Scene Text Recognition with Semantic Reasoning Networks -Baidu, CVPR2020
Learn to Augment: Joint Data Augmentation and Network Optimization for Text Recognition -jinlianwen, CVPR2020, code
Decoupled Attention Network for Text Recognition -jinlianwen, AAAI2020
Fast Dense Residual Network: Enhancing Global Dense Feature Flow for Text Recognition -Zhao Zhang, arxiv2020
Separating Content from Style Using Adversarial Learning for Recognizing Text in the Wild -jin lianwen, arxiv2020
TextScanner: Reading Characters in Order for Robust Scene Text Recognition -yao cong, AAAI2020
What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and Model Analysis -clova, ICCV2019, code
A Feasible Framework for Arbitrary-Shaped Scene Text Recognition -champion in ICDAR2019, arxiv2019, code
Deep Neural Network for Semantic-based Text Recognition in Images -Yi Zheng, arxiv2019
Symmetry-constrained Rectification Network for Scene Text Recognition -baixiang, ICCV2019
Adaptive Embedding Gate for Attention-Based Scene Text Recognition -Lianwen Jin, arxiv2019
Focus-Enhanced Scene Text Recognition with Deformable Convolutions -Yanxiang Gong, arxiv2019, code
Rethinking Irregular Scene Text Recognition -yao cong, ICDAR19 art champion, code
Aggregation Cross-Entropy for Sequence Recognition -Zecheng Xie, Yaoxiong Huang, Yuanzhi Zhu, Lianwen Jin, Yuliang Liu, Lele Xie, CVPR2019, code
Sequence-to-Sequence Domain Adaptation Network for Robust Text Image Recognition -CASIA, CVPR2019
Towards End-to-End Text Spotting in Natural Scenes -LiHui, et al, arxiv2019
2D Attentional Irregular Scene Text Recognizer -Tencent, arxiv2019
ESIR: End-to-end Scene Text Recognition via Iterative Image Rectification -Fangneng Zhan, Shijian Lu, CVPR2019
FACLSTM: ConvLSTM with Focused Attention for Scene Text Recognition -Qingqing Wang, et al, arxiv2019
A Multi-Object Rectified Attention Network for Scene Text Recognition -Canjie Luo, Lianwen Jin, Zenghui Sun, PR2019
Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition -Hui Li, Peng Wang, Chunhua Shen, Guyu Zhang, AAAI2019, code
Scene Text Recognition from Two-Dimensional Perspective -Minghui Liao, Cong Yao, Xiang Bai, et al, AAAI2019
Recurrent Calibration Network for Irregular Text Recognition -Hanqing Lu, arxiv2018
ESIR: End-to-end Scene Text Recognition via Iterative Image Rectification -Fangneng Zhan, Shijian Lu, arxiv2018
Synthetically Supervised Feature Learning for Scene Text Recognition -Adobe, ECCV2018
Connectionist Temporal Classification with Maximum Entropy Regularization -Tsinghua, NeurIPS2018, code
ASTER: An Attentional Scene Text Recognizer with Flexible Rectification -Baixiang, PAMI2018, code
Edit Probability for Scene Text Recognition -Fudan, Hikvision, CVPR2018
SqueezedText: A Real-time Scene Text Recognition by Binary Convolutional Encoder-decoder Network -Zichuan Liu, et al, AAAI2018
State of the Art Optical Character Recognition of 19th Century Fraktur Scripts using Open Source Engines -arxiv2018
SCAN: Sliding Convolutional Attention Network for Scene Text Recognition -Yichao Wu, et al, arxiv2018
NRTR: A No-Recurrence Sequence-to-Sequence Model For Scene Text Recognition -Fenfen Sheng, et al, arxiv2018
AON: Towards Arbitrarily-Oriented Text Recognition -Hikvision, et al, CVPR2018
An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition -B Shi, X Bai, C Yao, TPAMI2017, code
Scene Text Recognition with Sliding Convolutional Character Models -fei yin, et al, arxiv2017
Focusing Attention: Towards Accurate Text Recognition in Natural Images -Hikvision, et al, ICCV2017
AdaDNNs: Adaptive Ensemble of Deep Neural Networks for Scene Text Recognition -Chun Yang, Xu-Cheng Yin, arxiv2017
Strokelets: A learned multi-scale mid-level representation for scene text recognition -X Bai, C Yao, W Liu, TIP2016
Reading Scene Text in Deep Convolutional Sequences -P He, W Huang, Y Qiao, CC Loy, X Tang, AAAI2016
Text-Attentional Convolutional Neural Network for Scene Text Detection -Tong He, Weilin Huang, Yu Qiao, Jian Yao, TIP2016
Robust Scene Text Recognition with Automatic Rectification -Baoguang Shi, Xinggang Wang, Pengyuan Lyu, Cong Yao, Xiang Bai, CVPR2016
DeepText: A Unified Framework for Text Proposal Generation and Text Detection in Natural Images -Zhuoyao Zhong, Lianwen Jin, Shuye Zhang, Ziyong Feng, arxiv2016
Recursive Recurrent Nets with Attention Modeling for OCR in the Wild -Yahoo, CVPR2016

End-to-End & Text Spotting

ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting -ustc, PAMI2023, code
Language Matters: A Weakly Supervised Pre-training Approach for Scene Text Detection and Spotting -bytedance, arxiv2022
DEER: Detection-agnostic End-to-End Recognizer for Scene Text Spotting -naver, arxiv2022, code
SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition -jinlianwen, CVPR2022, code
End-to-End Video Text Spotting with Transformer -shenchunhua, arxiv2022, code
Text Spotting Transformers -intel, CVPR2022
PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System -baidu, arxiv2022
[light]PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System -baidu, arxiv2021, code
[icdar competition]1st Place Solution to ICDAR 2021 RRC-ICTEXT End-to-end Text Spotting and Aesthetic Assessment on Integrated Circuit -hikvision, arxiv2021
ABCNet v2: Adaptive Bezier-Curve Network for Real-time End-to-end Text Spotting -jinlianwen, arxiv2021, code
[light]PP-OCR: A Practical Ultra Lightweight OCR System -baidu, arxiv2020, code
Character Region Attention For Text Spotting -ECCV2020
Mask TextSpotter v3: Segmentation Proposal Network for Robust Scene Text Spotting -baixiang, ECCV2020, code
Text Detection and Recognition in the Wild: A Review -arxiv2020
Text Perceptron: Towards End-to-End Arbitrary-Shaped Text Spotting -Liang Qiao, AAAI2020
All You Need Is Boundary: Toward Arbitrary-Shaped Text Spotting -baixiang, AAAI2020
ABCNet: Real-time Scene Text Spotting with Adaptive Bezier-Curve Network -jin lianwen, CVPR2020
Convolutional Character Networks -Linjie Xing, Zhi Tian, Weilin Huang, Matthew R. Scott, ICCV2019
TextDragon: An End-to-End Framework for Arbitrary Shaped Text Spotting -Chenglin Liu, CVPR2019
Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes -baixiang, TPAMI2019
Towards Unconstrained End-to-End Text Spotting -google ai, arxiv2019
Towards End-to-End Text Spotting in Natural Scenes -Hui Li, Peng Wang, Chunhua Shen, arxiv2019
Weakly supervised precise segmentation for historical document images -Jin Lianwen, Neurocomputing2019
A Novel Integrated Framework for Learning both Text Detection and Recognition -alibaba, arxiv2018
TextNet: Irregular Text Reading from Images with an End-to-End Trainable Network -baidu, arxiv2018
Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes -Pengyuan Lyu, Minghui Liao, Cong Yao, Wenhao Wu, Xiang Bai, arxiv2018
FOTS: Fast Oriented Text Spotting with a Unified Network -Xuebo Liu, Ding Liang, Shi Yan, Dagui Chen, Yu Qiao, Junjie Yan, CVPR2018
E2E-MLT - an Unconstrained End-to-End Method for Multi-Language Scene Text -Yash Patel, et al, arxiv2018
SEE: Towards Semi-Supervised End-to-End Scene Text Recognition -Christian Bartz, Haojin Yang, Christoph Meinel, AAAI2018
An end-to-end TextSpotter with Explicit Alignment and Attention -Tong He, Zhi Tian, Weilin Huang, Chunhua Shen, Yu Qiao, Changming Sun, CVPR2018
Towards End-to-end Text Spotting with Convolutional Recurrent Neural Networks -Hui Li, et al, ICCV2017
Deep TextSpotter: An End-to-End Trainable Scene Text Localization and Recognition Framework -Michal Busta, et al, ICCV2017, code
Reading Text in the Wild with Convolutional Neural Networks -Max Jaderberg, et al, IJCV2016

Text Retrieval

Thinking Fast and Slow: Efficient Text-to-Visual Retrieval with Transformers -zisserman, CVPR2021
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval -zisserman, arxiv2021
Scene Text Retrieval via Joint Text Detection and Similarity Learning -baixiang, CVPR2021, code/CSVTR database

Synthesis

https://github.com/clovaai/synthtiger
Editing Text in the Wild -baidu, ACM MM 2019
Data Augmentation for Scene Text Recognition -ICCV2021 workshop, code
text_renderer
SynthText
SynthText
TextRecognitionDataGenerator
UnrealText
ScrabbleGAN: Semi-Supervised Varying Length Handwritten Text Generation -Sharon Fogel, CVPR2020
SynthText3D: Synthesizing Scene Text Images from 3D Virtual Worlds -Minghui Liao, Boyu Song, Minghang He, Shangbang Long, Cong Yao, Xiang Bai, arxiv2019, code
Spatial Fusion GAN for Image Synthesis -Fangneng Zhan, Hongyuan Zhu, Shijian Lu, CVPR2019, code
Verisimilar Image Synthesis for Accurate Detection and Recognition of Texts in Scenes -Fangneng Zhan, Shijian Lu, Chuhui Xue, ECCV2018

Evaluation

CLEval: Character-Level Evaluation for Text Detection and Recognition Tasks -arxiv2020
End-To-End Measure for Text Recognition -ICDAR2019
Tightness-aware Evaluation Protocol for Scene Text Detection -jinlianwen, CVPR2019

Script identification

Patch Aggregator for Scene Text Script Identification -baixiang, arxiv2019

Super Resolution

Text Gestalt: Stroke-Aware Scene Text Image Super-Resolution -fudan, AAAI2022, code
Restormer: Efficient Transformer for High-Resolution Image Restoration -google, CVPR2022, code
Scene Text Telescope: Text-Focused Scene Image Super-Resolution -fudan, CVPR2021
Text Prior Guided Scene Text Image Super-resolution -arxiv2021, code
Scene Text Image Super-Resolution in the Wild -baixiang, ECCV2020

Other

AnyText: Multilingual Visual Text Generation And Editing -alibaba, arxiv2023, code
Stroke-Based Scene Text Erasing Using Synthetic Data for Training -TIP2021
Page Layout Analysis System for Unconstrained Historic Documents -ICDAR2021
EraseNet: End-to-End Text Removal in the Wild -Jinlianwen, TIP2020, code
SwapText: Image Based Texts Transfer in Scenes -Qiangpeng Yang, CVPR2020
UnrealText: Synthesizing Realistic Scene Text Images from the Unreal World -Cong Yao, CVPR2020
EnsNet: Ensconce Text in the Wild -JinLianwen, AAAI2019, code
TextSR: Content-Aware Text Super-Resolution Guided by Recognition -forevision, arxiv2019
Editing Text in the Wild -baixiang, ACM MM2019
MTRNet: A Generic Scene Text Eraser -ICDAR2019
Scene Text Detection and Recognition: The Deep Learning Era -face++, arxiv2018
Text/non-text image classification in the wild with convolutional neural networks -X Bai, B Shi, C Zhang, X Cai, L Qi, PR2017
Scene text script identification with convolutional recurrent neural networks -J Mei, L Dai, B Shi, X Bai, ICPR2016

Seq2Seq

Convolutional Sequence to Sequence Learning -FAIR, ICML2017
Sequence Level Training with Recurrent Neural Networks -FAIR, ICLR2016
A Convolutional Encoder Model for Neural Machine Translation -FAIR, arxiv2016

Reading Order

LayoutReader: Pre-training of Text and Layout for Reading Order Detection -MSRA, EMNLP2021, code/database

Database & Generation

chinese

TRW15: ICDAR 2015 Text Reading in the Wild Competition
RCTW-17: ICDAR2017-Reading Chinese Text in the Wild
STV2k: A New Benchmark for Scene Text Detection and Recognition
CTW: Chinese Text in the Wild
PAL10K
COCO TS Dataset
ICPR MTWI 2018 Challenge 1: Text Recognition in Web Images

other

Comprehensive Benchmark Datasets for Amharic Scene Text Detection and Recognition
Textual Visual Semantic Dataset for Text Spotting
RoadText-1K: Text Detection & Recognition Dataset for Driving Videos
DDI-100: Dataset for Text Detection and Recognition
Verisimilar Image Synthesis for Accurate Detection and Recognition of Texts in Scenes -Fangneng Zhan, Shijian Lu, and Chuhui Xue, arxiv2018
Total-Text -1555 images
SCUT-CTW1500 -Curved text in the wild
MLT: Multi-lingual scene text detection and script identification -Multi-lingual text: 18,000 images, 9 different languages representing 6 different scripts
Synthetic Word Dataset, Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition
Total-text: A comprehensive dataset for scene text detection and recognition -Chee Kheng Ch'ng, Chee Seng Chan
Street View Text(SVT)
IIIT 5k-words
MSRA-TD500
KAIST Scene_Text Database
ICDAR2011, ICDAR2013, ICDAR2015, ICDAR2017, robust reading-Focused Scene Text
ICDAR2017-ICDAR 2017 Robust Reading Challenge on Omnidirectional Video(DOST)
COCO-Text
Google French Street Name Signs (FSNS) dataset
ICDAR2017-ICDAR2017 Competition on Multi-lingual scene text detection and script identification(MLT)
ICDAR2017-Born-Digital Images (Web and Email)
Detecting Curve Text in the Wild: New Dataset and New Solution
Synthetic Word
Synthetic Data for Text Localisation in Natural Images -Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR2016

vietnamese

VinText

Competition

ICDAR2017 Competition on Reading Chinese Text in the Wild (RCTW-17) -B Shi, C Yao, M Liao, M Yang, P Xu, L Cui, arxiv2017
ICDAR 2015 competition on robust reading
Incidental Scene Text Understanding: Recent Progresses on ICDAR 2015 Robust Reading Competition Challenge 4 -Cong Yao, Jianan Wu, Xinyu Zhou, Chi Zhang, Shuchang Zhou, Zhimin Cao, Qi Yin

Link

awesome-deep-text-detection-recognition
Awesome-Scene-Text-Recognition
Scene Text Detection
