![docker logo](https://raw.githubusercontent.com/github/explore/80688e429a7d4ef2fca1e82350fe8e3517d3494d/topics/docker/docker.png)
-
Ariyadis
- Tehran
- www.google.com/search?q=abdolkarim+saeedi
Block or Report
Block or report KiLJ4EdeN
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLanguage
Sort by: Recently starred
Starred repositories
An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".
Aggregation Cross-Entropy for Sequence Recognition. CVPR 2019.
MORAN: A Multi-Object Rectified Attention Network for Scene Text Recognition
Unofficial implementation of CVPR 2020 paper "SCATTER: Selective Context Attentional Scene Text Recognizer"
ParsBench provides toolkits for benchmarking LLMs based on the Persian language tasks.
A toolbox of ocr models and algorithms based on MindSpore
A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".
This is a pytorch implementation of CTPN(Detecting Text in Natural Image with Connectionist Text Proposal Network). You may want to finetune from: https://drive.google.com/open?id=1JHhI4sEIXfs5gDa1…
Image classification on Sentinel-2 satellite imagery.
the AI-native open-source embedding database
A cloud-native vector database, storage for next generation AI applications
Implementation of Stable Diffusion with PyTorch
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The official github repository for Leptonica is: danbloomberg/le…
Text page dewarping using a "cubic sheet" model
Rust library and CLI tool for OCR (extracting text from images)
A Unified Toolkit for Deep Learning Based Document Image Analysis
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …
Understanding Deep Learning - Simon J.D. Prince
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.