Skip to content
View KiLJ4EdeN's full-sized avatar
😄
😄
Block or Report

Block or report KiLJ4EdeN

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".

Python 93 12 Updated May 3, 2024

Aggregation Cross-Entropy for Sequence Recognition. CVPR 2019.

Python 303 60 Updated Dec 9, 2021

MORAN: A Multi-Object Rectified Attention Network for Scene Text Recognition

Python 627 152 Updated Dec 6, 2022

Unofficial implementation of CVPR 2020 paper "SCATTER: Selective Context Attentional Scene Text Recognizer"

Python 66 9 Updated Mar 3, 2022

ParsBench provides toolkits for benchmarking LLMs based on the Persian language tasks.

Python 31 1 Updated Jul 19, 2024

A toolbox of ocr models and algorithms based on MindSpore

Python 181 44 Updated Jul 23, 2024

A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".

Python 2,054 472 Updated Mar 11, 2024

This is a pytorch implementation of CTPN(Detecting Text in Natural Image with Connectionist Text Proposal Network). You may want to finetune from: https://drive.google.com/open?id=1JHhI4sEIXfs5gDa1…

Python 291 124 Updated Apr 23, 2019

Image classification on Sentinel-2 satellite imagery.

Python 29 5 Updated Jul 6, 2023

the AI-native open-source embedding database

Rust 13,744 1,160 Updated Jul 23, 2024

A cloud-native vector database, storage for next generation AI applications

Go 28,436 2,739 Updated Jul 23, 2024

Implementation of Stable Diffusion with PyTorch

Jupyter Notebook 243 13 Updated Jul 23, 2024

Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.

Python 12,432 2,847 Updated Jul 23, 2024

Tesseract 5.0 trainer Docker image

Shell 1 Updated Nov 5, 2021

Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The official github repository for Leptonica is: danbloomberg/le…

C 1,723 385 Updated Jul 21, 2024

Text page dewarping using a "cubic sheet" model

Python 1,419 239 Updated Mar 2, 2023

OCR engine for all the languages

Python 690 125 Updated Jul 10, 2024

Rust library and CLI tool for OCR (extracting text from images)

Rust 1,017 41 Updated Jul 17, 2024

A Unified Toolkit for Deep Learning Based Document Image Analysis

Python 4,674 449 Updated Mar 7, 2024

A Python wrapper for Google Tesseract

Python 5,686 714 Updated Jul 5, 2024

A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files

Python 7,812 1,370 Updated Jul 22, 2024

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …

Go 10,358 715 Updated Jul 23, 2024

Understanding Deep Learning - Simon J.D. Prince

Jupyter Notebook 5,617 1,176 Updated Jul 22, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 11,015 2,299 Updated Jul 23, 2024

A PyTorch-based Speech Toolkit

Python 8,290 1,333 Updated Jul 22, 2024

Audio waveform player

TypeScript 8,449 1,597 Updated Jul 22, 2024

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 5,577 733 Updated Jul 21, 2024

Google Drive CLI Client

Rust 1,350 75 Updated Mar 15, 2024

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Python 3,382 407 Updated Jul 22, 2024
Next