TextScan Summary Generator

This project combines image-to-text conversion, text summarization, and digitalize documents This system enables users to extract insights, generate concise summaries, and digitize documents for enhanced productivity and information retrieval.

The image showcases the Visual Studio Code (VSCode) interface, connected to a Jetson Nano device via SSH. In the VSCode window, three tabs are open:

The first tab displays an image, captured using a webcam.
In the second tab, there is a text-recognition file and it was responsible for extracting text from the captured image.
The third tab shows a summary generated using the contents of the text recognition file.

The Algorithm

The first file, capture_image.py utilizes the OpenCV library to interact with a USB webcam and capture images.

The second file, text_recognition.py involves reading the image, preprocessing it to optimize OCR accuracy, and then utilizing the Pytesseract library to interact with the Tesseract OCR engine to perform the text recognition process. The extracted text is finally saved in a text file.

The third file, generate_summary.py uses a pre-trained model from the Transformers library to perform text summarization. It reads input text from a file, processes it through the summarization model, and saves the resulting summary in another file.

Running this project

Steps to run:

Run capture_image.py and input 'y' to capture a frame from the webcam. The frame will be saved in captured_image.jpg.
Run text_recognition.py. This will run text recognition on captured_image.jpg and save the text to captured_text.txt.
Run generate_summary.py. This will use the transformers module to summarize the text in captured_text.txt and upload the summary to text_summary.txt.

Required libraries: CV2, Pytesseract, Transformers

[View a video explanation here](video link)]

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
README.md		README.md
capture_image.py		capture_image.py
generate_summary.py		generate_summary.py
text_recognition.py		text_recognition.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TextScan Summary Generator

The Algorithm

Running this project

About

Releases

Packages

Languages

Rahuldeb5/nvidia-project

Folders and files

Latest commit

History

Repository files navigation

TextScan Summary Generator

The Algorithm

Running this project

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages