- Woodinville, WA
Lists (2)
Sort Oldest
Stars
A large-scale simulation framework for LLM inference
The simplest, fastest repository for training/finetuning medium-sized GPTs.
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
The complete set of tools for energy consumption analysis of programming languages, using Computer Language Benchmark Game
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
A fast multi-producer, multi-consumer lock-free concurrent queue for C++11
A collection of lock-free data structures written in standard C++11
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
A complement to pgvector for high performance, cost efficient vector search on large workloads.
A Bulletproof Way to Generate Structured JSON from Language Models
Constrained Decoding for LLMs against JSON Schema
MemoryCache is an experimental development project to turn a local desktop environment into an on-device AI agent
Interact with your documents using the power of GPT, 100% privately, no data leaks
Supercharge Your LLM Application Evaluations 🚀
Retrieval and Retrieval-augmented LLMs
A distributed approximate nearest neighborhood search (ANN) library which provides a high quality vector index build, search and distributed online serving toolkits for large scale vector search sc…
Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search
TypeChat is a library that makes it easy to build natural language interfaces using types.
A guidance language for controlling large language models.
Excel spreadsheet crawler and table parser for data extraction and querying
Graph-based method for end-to-end code completion with context awareness on repository
Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"
The open source Firebase alternative. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.
A comprehensive deep dive into the world of tokens