Recursive Recurrent Nets with Attention Modeling for OCR in the Wild

Lee, Chen-Yu; Osindero, Simon

arXiv:1603.03101v1 (cs)

[Submitted on 9 Mar 2016]

Abstract: We present recursive recurrent neural networks with attention modeling (R$^2$AM) for lexicon-free optical character recognition in natural scene images. The primary advantages of the proposed method are: (1) use of recursive convolutional neural networks (CNNs), which allow for parametrically efficient and effective image feature extraction; (2) an implicitly learned character-level language model, embodied in a recurrent neural network, which avoids the need to use N-grams; and (3) the use of a soft-attention mechanism, allowing the model to selectively exploit image features in a coordinated way, and allowing for end-to-end training within a standard backpropagation framework. We validate our method with state-of-the-art performance on challenging benchmark datasets: Street View Text, IIIT5k, ICDAR and Synth90k.
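To make point (3) concrete, here is a minimal sketch of a generic soft-attention step in plain Python. It is not the authors' implementation: the dot-product scoring function, the names `soft_attention`, `features`, and `query`, and the toy inputs are all assumptions chosen for illustration. It only shows the general idea that the attention weights are a softmax over image locations and the context vector is the weighted sum of their features, so every step is differentiable and can be trained by backpropagation.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def soft_attention(features, query):
    """Generic soft attention (illustrative, not the paper's exact form).

    features: list of feature vectors, one per image location.
    query:    vector standing in for the current decoder (RNN) state.
    Returns (weights, context).
    """
    # Assumption: dot-product scoring; the paper may use a different scorer.
    scores = [sum(f_d * q_d for f_d, q_d in zip(f, query)) for f in features]
    weights = softmax(scores)
    dim = len(features[0])
    # Context is the attention-weighted average of the location features.
    context = [
        sum(w * f[d] for w, f in zip(weights, features)) for d in range(dim)
    ]
    return weights, context


# Toy usage: two image locations with 2-dimensional features.
weights, context = soft_attention([[1.0, 0.0], [0.0, 1.0]], [2.0, 0.0])
```

In this toy call the query aligns with the first location, so that location receives the larger weight and dominates the context vector. In a full model, the context would be fed to the recurrent decoder at each character step.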

Comments: Accepted at CVPR 2016
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:1603.03101 [cs.CV] (or arXiv:1603.03101v1 [cs.CV] for this version)
DOI: https://doi.org/10.48550/arXiv.1603.03101

Submission history

From: Chen-Yu Lee
[v1] Wed, 9 Mar 2016 23:49:51 UTC (1,245 KB)
