Language: C++
Sort by: Most stars
Starred repositories
- An Open Source Machine Learning Framework for Everyone
- GPT4All: Chat with Local LLMs on Any Device
- Protocol Buffers - Google's data interchange format
- 🔍 A Hex Editor for Reverse Engineers, Programmers and people who value their retinas when working at 3 AM.
- GoogleTest - Google Testing and Mocking Framework
- Port of OpenAI's Whisper model in C/C++
- A library for efficient similarity search and clustering of dense vectors.
- 🤖 The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformer…
- Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences
- Distribute and run LLMs with a single file.
- Seamless operability between C++11 and Python
- A high-performance, zero-overhead, extensible Python compiler using LLVM
- Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
- Dear PyGui: A fast and powerful Graphical User Interface Toolkit for Python with minimal dependencies
- Development repository for the Triton language and compiler
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
- Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive lea…
- High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
- TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
- Lightweight, standalone C++ inference engine for Google's Gemma models.
- Transformer related optimization, including BERT, GPT
- A C++ standalone library for machine learning
- Bear is a tool that generates a compilation database for clang tooling.
- Turbopilot is an open source large-language-model based code completion engine that runs locally on CPU