Stars
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.
Generative AI extensions for onnxruntime
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Efficient visual programming for AI language models
The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease.
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
Hands-On Graph Neural Networks Using Python, published by Packt
Tools for merging pretrained large language models.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Build your own ChatPDF and run them locally
the AI-native open-source embedding database
Python package for easily interfacing with chat apps, with robust features and minimal code complexity.
Seamlessly integrate LLMs as Python functions
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
An OAI compatible exllamav2 API that's both lightweight and fast
A fast inference library for running LLMs locally on modern consumer-class GPUs
LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progres…
Ghost ESP is a ESP32 Firmware that Revolutionizes the way we use ESP32 devices in a Pen Testing aspect
The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework Join our Community: https://discord.com/servers/agora-999382051935506503
High-performance In-browser LLM Inference Engine
Stable Diffusion web UI
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
Low-code framework for building custom LLMs, neural networks, and other AI models