Skip to content
View gaziqbal's full-sized avatar

Block or report gaziqbal

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A large-scale simulation framework for LLM inference

Python 275 42 Updated Oct 10, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 37,349 5,941 Updated Aug 19, 2024

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 2,606 205 Updated Nov 15, 2024

The complete set of tools for energy consumption analysis of programming languages, using Computer Language Benchmark Game

C 693 114 Updated Oct 12, 2023

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

9,486 725 Updated May 31, 2024

A fast multi-producer, multi-consumer lock-free concurrent queue for C++11

C++ 9,995 1,688 Updated Jun 19, 2023

A collection of lock-free data structures written in standard C++11

C++ 799 49 Updated Jul 22, 2024

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 32,738 3,919 Updated Nov 14, 2024

A complement to pgvector for high performance, cost efficient vector search on large workloads.

Rust 1,322 56 Updated Nov 15, 2024

LLM Analytics

TypeScript 614 23 Updated Oct 19, 2024

A Bulletproof Way to Generate Structured JSON from Language Models

Jupyter Notebook 4,461 156 Updated Feb 24, 2024

Today I Learned

HTML 1,120 93 Updated Nov 12, 2024

Constrained Decoding for LLMs against JSON Schema

Python 322 8 Updated May 16, 2023

MemoryCache is an experimental development project to turn a local desktop environment into an on-device AI agent

JavaScript 557 25 Updated Apr 19, 2024

Interact with your documents using the power of GPT, 100% privately, no data leaks

Python 54,172 7,290 Updated Nov 13, 2024

Supercharge Your LLM Application Evaluations 🚀

Python 7,222 737 Updated Nov 14, 2024

Retrieval and Retrieval-augmented LLMs

Python 7,575 551 Updated Nov 15, 2024

A distributed approximate nearest neighborhood search (ANN) library which provides a high quality vector index build, search and distributed online serving toolkits for large scale vector search sc…

C++ 4,828 582 Updated Sep 4, 2024

Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search

C++ 1,135 221 Updated Nov 15, 2024

Inference Llama 2 in one file of pure 🔥

Mojo 2,099 142 Updated May 21, 2024

Blazingly fast LLM inference.

Rust 4,456 309 Updated Nov 14, 2024

TypeChat is a library that makes it easy to build natural language interfaces using types.

TypeScript 8,239 391 Updated Sep 21, 2024

A guidance language for controlling large language models.

Jupyter Notebook 19,098 1,042 Updated Nov 11, 2024

Excel spreadsheet crawler and table parser for data extraction and querying

Python 115 9 Updated Oct 16, 2024

Prompt engineering for developers

Python 673 23 Updated Feb 13, 2024

Graph-based method for end-to-end code completion with context awareness on repository

Jupyter Notebook 47 10 Updated Sep 3, 2024

Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"

Python 447 29 Updated Mar 19, 2024

The open source Firebase alternative. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.

TypeScript 73,745 7,129 Updated Nov 15, 2024

A comprehensive deep dive into the world of tokens

Python 214 8 Updated Jun 24, 2024
Next