Sparrow

Data extraction with ML and LLM

The Principle

Sparrow is an innovative open-source solution designed for efficient data extraction and processing from various documents and images. It seamlessly handles forms, invoices, receipts, and other unstructured data sources. Sparrow stands out with its modular architecture, offering independent services such as OCR, Donut fine-tuning/inference, and a data labeling UI, all optimized for robust performance. Our current development efforts are focused on enhancing the LLM pipeline, promising exciting new features and capabilities. Our vision for Sparrow is to become the leading tool in data extraction, catering to diverse business domains. With a strong commitment to local data processing, we aim to empower customers with secure, cutting-edge technology. Join us in this journey to redefine data handling in the enterprise world.

Services

sparrow-data-donut - This service focuses on data preparation specifically for the Donut ML model, including fine-tuning and OCR integration.
sparrow-data-ocr - A standalone OCR service, providing robust optical character recognition as part of the Sparrow suite.
sparrow-ml-donut - Dedicated to the Donut ML model, this service handles both fine-tuning and inference, streamlining the machine learning workflow.
sparrow-ml-lemming - A specialized service for the LLM RAG pipeline, enhancing the capabilities of language model processing.
sparrow-ui-donut - A user-friendly interface for managing Donut ML model data labeling services and a dashboard.

Summary:

For LLM RAG Enthusiasts - Opt for the lemming service, specifically designed to cater to your needs in LLM RAG applications.
For Traditional ML Implementations - The donut service is available for those seeking machine learning solutions independent of LLM.

Sparrow offers a diverse range of services, as outlined previously. Our current developmental focus is primarily on enhancing and expanding the capabilities of the lemming service.

Installation

You have the flexibility to install either the Lemming or the Donut service independently. Each service is designed to operate as a standalone entity, without any dependencies on the other. This modular approach ensures that you can select the service that best meets your specific needs.

Lemming

Install Weaviate local DB with Docker:

docker compose up -d

Install the requirements:

pip install -r requirements.txt

Install Ollama and pull LLM model specified in config.yml

Donut

Follow the install steps outlined here:

Donut Data install steps
Donut ML install steps
Donut UI install steps

OCR

Follow the install steps outlined here:

Sparrow OCR services install steps

Usage

Lemming

Copy text PDF files to the data folder or use the sample data provided in the data folder.
Run the script, to convert text to vector embeddings and save in Weaviate:

./sparrow.sh ingest

Run the script, to process data with LLM RAG and return the answer:

./sparrow.sh "invoice_number, invoice_date, client_name, client_address, client_tax_id, seller_name, seller_address,
seller_tax_id, iban, names_of_invoice_items, gross_worth_of_invoice_items, total_gross_worth" "int, str, str, str, str,
str, str, str, str, List[str], List[float], str"

Answer:

{
    "invoice_number": 61356291,
    "invoice_date": "09/06/2012",
    "client_name": "Rodriguez-Stevens",
    "client_address": "2280 Angela Plain, Hortonshire, MS 93248",
    "client_tax_id": "939-98-8477",
    "seller_name": "Chapman, Kim and Green",
    "seller_address": "64731 James Branch, Smithmouth, NC 26872",
    "seller_tax_id": "949-84-9105",
    "iban": "GB50ACIE59715038217063",
    "names_of_invoice_items": [
        "Wine Glasses Goblets Pair Clear Glass",
        "With Hooks Stemware Storage Multiple Uses Iron Wine Rack Hanging Glass",
        "Replacement Corkscrew Parts Spiral Worm Wine Opener Bottle Houdini",
        "HOME ESSENTIALS GRADIENT STEMLESS WINE GLASSES SET OF 4 20 FL OZ (591 ml) NEW"
    ],
    "gross_worth_of_invoice_items": [
        66.0,
        123.55,
        8.25,
        14.29
    ],
    "total_gross_worth": "$212,09"
}

FastAPI Endpoint for Local LLM RAG

Sparrow enables you to run a local LLM RAG as an API using FastAPI, providing a convenient and efficient way to interact with our services.

To set this up:

Start the Endpoint

Launch the endpoint by executing the following command in your terminal:

python api.py

Access the Endpoint Documentation

You can view detailed documentation for the API by navigating to:

http:https://127.0.0.1:8000/api/v1/sparrow-llm/docs

For visual reference, a screenshot of the FastAPI endpoint

Donut

Follow the steps outlined here:

Donut Data usage steps
Donut ML usage steps
Donut UI usage steps

OCR

Follow the steps outlined here:

Sparrow OCR services usage steps

Examples

Inference with local LLM RAG

Inference with Donut ML model

Author

Katana ML, Andrej Baranovskij

Name		Name	Last commit message	Last commit date
Latest commit History 524 Commits
sparrow-data		sparrow-data
sparrow-ml		sparrow-ml
sparrow-ui/donut		sparrow-ui/donut
.gitattributes		.gitattributes
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.MD		README.MD

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sparrow

The Principle

Services

Installation

Lemming

Donut

OCR

Usage

Lemming

Donut

OCR

Examples

Inference with local LLM RAG

Inference with Donut ML model

Author

License

About

Releases

Packages

Languages

License

elmarouani/sparrow

Folders and files

Latest commit

History

Repository files navigation

Sparrow

The Principle

Services

Installation

Lemming

Donut

OCR

Usage

Lemming

Donut

OCR

Examples

Inference with local LLM RAG

Inference with Donut ML model

Author

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages