A Repo For Document AI
Updated
Oct 1, 2024 - Python
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
[MM'2024] Official implementation of "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction."
📄 Anonymize and redact uploaded text document files.
Document AI Toolbox is an SDK for Python that provides utility functions for managing, manipulating, and extracting information from the document response. It creates a "wrapped" document object from JSON files in Cloud Storage, local JSON files, or output directly from the Document AI API.
Transcription project consisting of Python scripting and usage of ML text extraction models.
Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.
This Flask application uses Google Cloud Document AI to extract name, IPK (GPA), university details, etc.
ReadingBank: A Benchmark Dataset for Reading Order Detection
Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction"
OCR Runner - Command Line Application for processing image files using Google Cloud Vision API and Google Cloud Document AI.
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
Table detection (TD) and table structure recognition (TSR) using YOLOv5/YOLOv8; you can get the same (or even better) results compared with Table Transformer (TATR) using smaller models.
This repository includes all computer vision, audio, document AI, and multimodal projects.
spaCy for Key:Value pairs
FastAPI application for document classification using a multimodal LayoutLM model, designed to classify PDF documents into RVL-CDIP categories.
AI & Data, Google Cloud Skills Boost
An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents. ICDAR, 2021"
[Paper] Code for the EMNLP2023 (Findings) paper "Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document"