The hOCR Embedded OCR Workflow and Output Format
This repository contains a Markdown version of the hOCR format specification edited by Thomas Breuel, converted from the May 2010 edition hosted on Google Docs.
The goal of this project is to make the hOCR specification more accessible and easier to maintain:
- cross-reference other specs
- harmonize style
- track changes without the spam of a world-editable Google Doc
- structured improvements with Github tools
- add samples