Stars
📚 Process PDFs, Word documents and more with spaCy
Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) into Markdown. With support for both CPU and GPU processing, it…
LlamaIndex is a data framework for your LLM applications
A script that helps generate a rich GitHub Contribution Graph for your account 🤖
Granite Code Models: A Family of Open Foundation Models for Code Intelligence
CLI Tool for converting pydantic models into typescript definitions
The framework for building scalable agentic applications.
The missing star history graph of GitHub repos - https://star-history.com
🦀⚙️ Sudoless performance monitoring for Apple Silicon processors. CPU / GPU / RAM usage, power consumption & temperature 🌡️
Create and modify Word documents with Python
An open-source RAG-based tool for chatting with your documents.
Open source project for data preparation of LLM application builders
Simple package to extract text with coordinates from programmatic PDFs
A Comprehensive Toolkit for High-Quality PDF Content Extraction
InstructLab Command-Line Interface. Use this to chat with a model and execute the InstructLab workflow to train a model using custom taxonomy data.
Taxonomy tree that will allow you to create models tuned with your data
Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.
DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis
Interact with the Deep Search platform for new knowledge explorations and discoveries