OCR Solutions

OCR Solutions brings together three OCR (Optical Character Recognition) engines, Tesseract OCR, EasyOCR, and Google Cloud Vision, and shows how to use each one to convert images into readable, editable text.

Features

Multiple OCR Technologies: Leverages Tesseract OCR, EasyOCR, and Google Cloud Vision for text extraction.
Easy Integration: Each OCR engine is wrapped in a single image_to_text function that takes an image path and returns the extracted text.
Text Extraction: Extracts text from images; the EasyOCR and Google Cloud Vision examples also save it to a file.
Error Handling: The Tesseract example catches exceptions raised during OCR processing.

Installation

To get started with OCR Solutions, follow these steps:

Clone the repository:

git clone https://github.com/yourusername/OCR-Solutions.git
cd OCR-Solutions

Install the required dependencies:

pip install pytesseract Pillow easyocr google-cloud-vision
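
After installing, it can help to confirm that every package is importable before running the examples. The following is a minimal sketch, not part of the repository; the helper name `missing_modules` is hypothetical. Note that the import names differ from the pip names for two of the packages (Pillow is imported as `PIL`, google-cloud-vision as `google.cloud.vision`).

```python
from importlib.util import find_spec

# Import names, not pip names: Pillow -> PIL, google-cloud-vision -> google.cloud.vision.
REQUIRED_MODULES = ["pytesseract", "PIL", "easyocr", "google.cloud.vision"]


def missing_modules(module_names):
    """Return the module names that cannot be found in this environment."""
    missing = []
    for name in module_names:
        try:
            found = find_spec(name) is not None
        except (ImportError, ValueError):
            # A missing parent package (e.g. "google") raises instead of returning None.
            found = False
        if not found:
            missing.append(name)
    return missing


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print(f"Missing: {', '.join(missing)}")
    else:
        print("All dependencies found.")
```

If anything is reported missing, re-run the pip command above in the same Python environment you use to run the examples.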

Set Up Google Cloud Vision API

To use Google Cloud Vision, you need to set up the API and get the credentials JSON file. Follow these steps:

Create a Google Cloud Project:
- Go to the Google Cloud Console.
- Click on the project drop-down and select New Project.
- Enter a project name and click Create.
Enable the Vision API:
- In the Google Cloud Console, navigate to APIs & Services > Library.
- Search for "Vision API" and click on Google Cloud Vision API.
- Click the Enable button.
Set Up Service Account:
- Navigate to APIs & Services > Credentials.
- Click on Create Credentials and select Service Account.
- Enter a name and description for the service account and click Create.
- In the Role dropdown, select Project > Owner, then click Continue.
- Click Done.
Create and Download the JSON Key:
- After creating the service account, click on it to open its details.
- Go to the Keys tab and click Add Key > Create New Key.
- Select JSON and click Create. This will download the JSON key file to your computer.
- Rename the downloaded JSON file to ocr-cloud-vision-424222-0778b1f83e3e.json and place it in your project directory.
Set the Environment Variable:
- Set the environment variable for the Google Cloud Vision API key:
```
export GOOGLE_APPLICATION_CREDENTIALS="ocr-cloud-vision-424222-0778b1f83e3e.json"
```
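
Because the export above uses a relative path, it only resolves when scripts are run from the project directory. A quick check like the one below can catch a missing or misplaced key file before the first API call. This is a sketch under assumptions: the helper name `check_credentials` is hypothetical, and the only content it inspects is the `"type": "service_account"` field found in service account key files.

```python
import json
import os


def check_credentials(env_var="GOOGLE_APPLICATION_CREDENTIALS"):
    """Return (ok, message) describing whether the credentials file looks usable."""
    path = os.environ.get(env_var)
    if not path:
        return False, f"{env_var} is not set"
    if not os.path.isfile(path):
        return False, f"{env_var} points to a missing file: {path}"
    try:
        with open(path, encoding="utf-8") as f:
            data = json.load(f)
    except (OSError, json.JSONDecodeError) as exc:
        return False, f"Could not read {path}: {exc}"
    # Service account keys downloaded from the console carry this "type" field.
    if not isinstance(data, dict) or data.get("type") != "service_account":
        return False, f"{path} does not look like a service account key"
    return True, f"Using service account key at {path}"


if __name__ == "__main__":
    ok, message = check_credentials()
    print(message)
```

This only checks that the file exists and has the expected shape; it does not verify that the key is valid or that the Vision API is enabled for the project.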

Usage

Tesseract OCR

To extract text using Tesseract OCR:

Install Tesseract on your system:
- On Windows: Download the installer from here.
- On macOS: Use Homebrew: brew install tesseract.
- On Linux: Use your package manager, e.g., sudo apt-get install tesseract-ocr.
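
pytesseract is a wrapper that calls the Tesseract executable, so the system install above has to be reachable from Python. The sketch below checks whether the binary is on the PATH; the helper name `find_tesseract` is hypothetical and not part of the project.

```python
import shutil


def find_tesseract(binary_name="tesseract"):
    """Return the full path of the Tesseract executable, or None if it is not on PATH."""
    return shutil.which(binary_name)


if __name__ == "__main__":
    path = find_tesseract()
    if path:
        print(f"Tesseract found at {path}")
    else:
        print(
            "Tesseract not found on PATH; install it or set "
            "pytesseract.pytesseract.tesseract_cmd to the executable's full path."
        )
```

Setting `pytesseract.pytesseract.tesseract_cmd` is mainly useful on Windows, where the installer does not always add Tesseract to the PATH.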

Use the following code:

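import sys

import pytesseract
from PIL import Image

def image_to_text(image_path):
    """Return the text Tesseract finds in the image, or None if OCR fails."""
    try:
        # Close the image file as soon as OCR is done.
        with Image.open(image_path) as img:
            return pytesseract.image_to_string(img)
    except Exception as e:
        # Report the failure on stderr instead of returning it as if it were OCR output.
        print(f"OCR failed for {image_path}: {e}", file=sys.stderr)
        return None

if __name__ == "__main__":
    image_path = 'image.jpeg'
    extracted_text = image_to_text(image_path)
    if extracted_text is not None:
        print(extracted_text)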
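
Raw OCR output usually needs a little tidying before it is saved or compared across engines: stray spaces, blank lines, and (with Tesseract, in many cases) a trailing form-feed character. The helper below is a hedged sketch of one way to normalize it; `clean_ocr_text` is a hypothetical name and is not part of the repository.

```python
import re


def clean_ocr_text(text):
    """Collapse runs of spaces and tabs, trim each line, and drop empty lines."""
    lines = []
    for line in text.splitlines():
        # Replace any run of spaces/tabs with a single space, then trim the ends.
        line = re.sub(r"[ \t]+", " ", line).strip()
        if line:
            lines.append(line)
    return "\n".join(lines)


if __name__ == "__main__":
    raw = "  Hello   world \n\n\n  second\tline  \n\x0c"
    print(clean_ocr_text(raw))
```

The function works on any string, so it can be applied to the return value of any of the image_to_text functions in this README.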

EasyOCR

To extract text using EasyOCR:

Use the following code:

import easyocr

def image_to_text(image_path):
    # Build an English-only reader; the first run downloads the model weights.
    reader = easyocr.Reader(['en'])
    # detail=0 returns only the recognized strings, without boxes or confidences.
    result = reader.readtext(image_path, detail=0)
    text = "\n".join(result)
    return text

def save_text_to_file(text, file_path):
    # Write as UTF-8 so non-ASCII characters in the OCR output are saved correctly.
    with open(file_path, 'w', encoding='utf-8') as file:
        file.write(text)

if __name__ == "__main__":
    image_path = 'image.jpeg'
    output_file = 'output_easyocr.txt'
    raw_text = image_to_text(image_path)
    print("Raw text extracted from image:")
    print(raw_text)
    save_text_to_file(raw_text, output_file)
    print(f"\nText saved to {output_file}")
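
The example above passes detail=0, which discards the bounding boxes and confidence scores. With the default detail=1, readtext returns a list of (bounding_box, text, confidence) entries, which makes it possible to drop low-confidence detections. The sketch below shows that filtering step on hand-written sample data; `filter_by_confidence` is a hypothetical helper and the 0.5 threshold is an arbitrary assumption to tune for your images.

```python
def filter_by_confidence(results, min_confidence=0.5):
    """Keep the text of detections whose confidence is at least min_confidence.

    `results` is expected in the shape returned by reader.readtext(image_path)
    with the default detail=1: a list of (bounding_box, text, confidence).
    """
    return [text for _bbox, text, confidence in results if confidence >= min_confidence]


if __name__ == "__main__":
    # Hand-written sample data standing in for real EasyOCR output.
    sample = [
        ([[0, 0], [50, 0], [50, 10], [0, 10]], "Invoice", 0.98),
        ([[0, 20], [50, 20], [50, 30], [0, 30]], "t0ta1", 0.21),
    ]
    print("\n".join(filter_by_confidence(sample)))
```

To use it with real output, call `reader.readtext(image_path)` without detail=0 and pass the result to the helper.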

Google Cloud Vision

To extract text using Google Cloud Vision:

Use the following code:

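from google.cloud import vision

def image_to_text(image_path):
    # The client reads credentials from GOOGLE_APPLICATION_CREDENTIALS.
    client = vision.ImageAnnotatorClient()
    with open(image_path, 'rb') as image_file:
        content = image_file.read()
    image = vision.Image(content=content)
    response = client.text_detection(image=image)
    # Per-image failures are reported in the response, so check before reading results.
    if response.error.message:
        raise RuntimeError(response.error.message)
    texts = response.text_annotations
    # The first annotation holds the full detected text; the rest are individual words.
    if texts:
        return texts[0].description
    else:
        return "No text detected"

def save_text_to_file(text, file_path):
    # Write as UTF-8 so non-ASCII characters in the OCR output are saved correctly.
    with open(file_path, 'w', encoding='utf-8') as file:
        file.write(text)

if __name__ == "__main__":
    image_path = 'image6.jpeg'
    output_file = 'output6.txt'
    raw_text = image_to_text(image_path)
    print("Raw text extracted from image:")
    print(raw_text)
    save_text_to_file(raw_text, output_file)
    print(f"\nText saved to {output_file}")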
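
Since all three examples expose a function that takes an image path and returns text, they can be run side by side on the same image. The sketch below shows one way to do that while keeping one engine's failure from stopping the others. `run_engines` is a hypothetical helper, and the two engines in the demo are stand-ins rather than the real OCR functions.

```python
def run_engines(image_path, engines):
    """Run each OCR callable on image_path and collect its text or error message.

    `engines` maps a label to a function that takes an image path and returns text.
    """
    results = {}
    for name, extract in engines.items():
        try:
            results[name] = {"ok": True, "text": extract(image_path)}
        except Exception as exc:  # one failing engine should not stop the others
            results[name] = {"ok": False, "error": str(exc)}
    return results


if __name__ == "__main__":
    # Stand-in engines for illustration; replace them with the real
    # image_to_text functions from the sections above.
    def fake_engine(path):
        return f"text from {path}"

    def broken_engine(path):
        raise RuntimeError("engine not installed")

    outcome = run_engines("image.jpeg", {"fake": fake_engine, "broken": broken_engine})
    for name, result in outcome.items():
        print(name, result)
```

Returning a dictionary per engine keeps error messages separate from extracted text, so a failure is never mistaken for OCR output.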

Contributing

Contributions are welcome! Please fork this repository and submit a pull request for any enhancements or bug fixes.

Fork the Project
Create your Feature Branch (git checkout -b feature/AmazingFeature)
Commit your Changes (git commit -m 'Add some AmazingFeature')
Push to the Branch (git push origin feature/AmazingFeature)
Open a Pull Request

License

Distributed under the MIT License. See LICENSE for more information.

Feel free to reach out if you have any questions or need further assistance with OCR Solutions!

fatimaazfar/OCR-Solutions
