Skip to content
View cau-git's full-sized avatar

Organizations

@IBM

Block or report cau-git

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

📚 Process PDFs, Word documents and more with spaCy

Python 115 5 Updated Nov 23, 2024

Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) into Markdown. With support for both CPU and GPU processing, it…

Python 24 4 Updated Nov 5, 2024

LlamaIndex is a data framework for your LLM applications

Python 36,907 5,286 Updated Nov 23, 2024

A script that helps generate a rich GitHub Contribution Graph for your account 🤖

Python 2,628 210 Updated Nov 7, 2024

Granite Code Models: A Family of Open Foundation Models for Code Intelligence

1,151 82 Updated Nov 13, 2024

CLI Tool for converting pydantic models into typescript definitions

Python 287 48 Updated Nov 22, 2024

The framework for building scalable agentic applications.

TypeScript 1,094 107 Updated Nov 22, 2024

The missing star history graph of GitHub repos - https://star-history.com

TypeScript 6,628 263 Updated Nov 20, 2024

🦀⚙️ Sudoless performance monitoring for Apple Silicon processors. CPU / GPU / RAM usage, power consumption & temperature 🌡️

Rust 308 9 Updated Nov 17, 2024

Get your documents ready for gen AI

Python 10,605 511 Updated Nov 22, 2024

Create and modify Word documents with Python

Python 4,649 1,133 Updated Aug 20, 2024

Virtual machines for iOS and macOS

Swift 27,150 1,344 Updated Nov 23, 2024

Running Docling as an API service

Makefile 16 3 Updated Oct 11, 2024

An open-source RAG-based tool for chatting with your documents.

Python 17,498 1,353 Updated Nov 20, 2024

Open source project for data preparation of LLM application builders

Jupyter Notebook 309 134 Updated Nov 23, 2024

Simple package to extract text with coordinates from programmatic PDFs

C++ 28 8 Updated Nov 22, 2024

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 5,822 384 Updated Oct 24, 2024

Build document-native LLM applications

Python 51 1 Updated Sep 11, 2024

InstructLab Command-Line Interface. Use this to chat with a model and execute the InstructLab workflow to train a model using custom taxonomy data.

Python 986 338 Updated Nov 22, 2024

Taxonomy tree that will allow you to create models tuned with your data

Python 202 839 Updated Nov 19, 2024
Python 42 10 Updated Nov 20, 2024

Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.

C++ 24 4 Updated Oct 23, 2024

DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis

276 15 Updated Feb 1, 2023

Interact with the Deep Search platform for new knowledge explorations and discoveries

Python 135 19 Updated Oct 17, 2024