text-extraction

A simple web application built with React which allows to upload images containing text, select the language of the text for recognition, and extract the text from the image. As quick as a finger snap - SnapText.

react reactjs web-application text-extraction text-recognition copy-to-clipboard multi-language-support simple-app copy-text-to-clipboard text-extraction-from-image copy-result

Updated Dec 10, 2023
HTML

AndyTheFactory / article-extraction-dataset

Star

Article title, authors, date and body extraction dataset.

text-mining news html-to-markdown scraping corpus news-aggregator text-extraction dataset web-scraping readability datasets scraping-websites html2text news-crawler corpus-builder corpus-tools article-extractor text-cleaning text-preprocessing

Updated Mar 26, 2024
HTML

importcjj / go-readability

Star

Go package that cleans a HTML page for better readability.

go html golang text extractor text-extraction readability html2text html-extractor

Updated Aug 1, 2023
HTML

Jaha96 / tesseract-quick-implementation

Star

Tesseract-OCR quick implementation. Linked with stack-overflow question

tesseract text-extraction tesseract-ocr pyinstaller tesseract-4 tesseract-python

Updated Nov 26, 2019
HTML

sharmaroshan / Text-Classification

Star

This is a Project Assignment where I have Learned to Classify the Different Texts Using Clustering Techniques. Natural Language Processing and Clustering both of these Concepts are Being Used. I have Used K-means Clustering Techniques to Implement the Problem.

python natural-language-processing text-mining numpy text-analysis pandas text-extraction nltk bag-of-words tf-idf text-processing jupyter-notebooks text-cleaning

Updated Aug 18, 2019
HTML

MaarkNassef / GraduationProject

Star

HR Assistant: Web application for efficient HR recruitment and resume management. Utilizes OCR for text extraction and similarity analysis to rearrange resumes based on job descriptions. Simplifies the hiring process for HR recruiters and enhances candidate selection.

python resume flask text-extraction hr similarity-measures recruitment pyotp ocr-python

Updated Jul 11, 2023
HTML

sanidhyajadaun / MediLink

Star

MediLink is a web application that revolutionizes health record management by seamlessly integrating NLP techniques for handwritten text extraction on prescriptions and blockchain technology for secure data storage.

nlp blockchain text-extraction health-record-management