Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Wo…

Python 3,539 242 Updated Aug 11, 2024

Stirling-Tools / Stirling-PDF

#1 Locally hosted web application that allows you to perform various operations on PDF files

Java 36,519 2,722 Updated Aug 9, 2024

autodistill / autodistill

Images to inference with no labeling (use foundation models to train supervised models).

Python 1,775 139 Updated Jul 30, 2024

facebookresearch / dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 8,573 732 Updated Aug 7, 2024

comfyanonymous / ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 45,576 4,841 Updated Aug 11, 2024

facebookresearch / detectron2

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Python 29,682 7,375 Updated Aug 9, 2024

opendatalab / MinerU

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具，支持PDF/网页/多格式电子书提取。

Python 7,412 562 Updated Aug 10, 2024

hiroi-sora / GapTree_Sort_Algorithm

【间隙·树·排序算法】对OCR结果或PDF提取的文本进行版面分析，按人类阅读顺序进行排序。

Python 76 12 Updated Feb 28, 2024

NielsRogge / Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 8,777 1,367 Updated Aug 8, 2024

tstanislawek / awesome-document-understanding

A curated list of resources for Document Understanding (DU) topic

1,222 140 Updated Jun 2, 2023

deepdoctection / deepdoctection

A Repo For Document AI

Python 2,419 123 Updated Jul 29, 2024

tesseract-ocr / tesseract

Tesseract Open Source OCR Engine (main repository)

C++ 60,226 9,300 Updated Aug 10, 2024

labelmeai / labelme

Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).

Python 12,948 3,354 Updated Aug 4, 2024

microsoft / graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 14,441 1,279 Updated Aug 9, 2024

chineseocr / trocr-chinese

transformers ocr for chinese

Python 333 50 Updated Jan 13, 2023

RapidAI / RapidLaTeXOCR

Formula recognition based on LaTeX-OCR and ONNXRuntime.

Python 261 25 Updated Jul 11, 2024

modelscope / dash-infer

DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including x86 and ARMv9.

C++ 121 13 Updated Jul 29, 2024