gaziqbal
31 stars written in Python

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 83,965 22,630 Updated Nov 16, 2024

Interact with your documents using the power of GPT, 100% privately, no data leaks

Python 54,172 7,290 Updated Nov 13, 2024

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Python 50,885 16,383 Updated Nov 8, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 37,350 5,941 Updated Aug 19, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 35,464 4,119 Updated Nov 15, 2024

Let us control diffusion models!

Python 30,385 2,730 Updated Feb 25, 2024

The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

Python 21,063 3,683 Updated Jul 4, 2024

Convert PDF to markdown quickly with high accuracy

Python 17,745 1,020 Updated Nov 15, 2024

Static Type Checker for Python

Python 13,426 1,470 Updated Nov 15, 2024

a small, expressive orm -- supports postgresql, mysql, sqlite and cockroachdb

Python 11,201 1,370 Updated Nov 12, 2024

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 8,330 1,483 Updated Nov 16, 2024

Retrieval and Retrieval-augmented LLMs

Python 7,575 551 Updated Nov 15, 2024

Supercharge Your LLM Application Evaluations 🚀

Python 7,222 737 Updated Nov 14, 2024

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Python 4,559 370 Updated Oct 23, 2024

min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch

Python 3,483 257 Updated Nov 21, 2022

🔮 A refreshing functional take on deep learning, compatible with your favorite libraries

Python 2,821 275 Updated Oct 1, 2024

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 2,606 205 Updated Nov 15, 2024

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 1,897 175 Updated Nov 8, 2024

Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀

Python 1,656 151 Updated Oct 23, 2024

A fast, efficient universal vector embedding utility package.

Python 1,627 120 Updated Aug 3, 2023

🐟 Python profile viewer

Python 1,382 33 Updated Nov 14, 2024

Prompt engineering for developers

Python 673 23 Updated Feb 13, 2024

Collective Knowledge (CK, CM, CM4MLOps and CMX) is an educational project to learn how to run AI, ML and other emerging workloads in the most efficient and cost-effective way across diverse models,…

Python 608 114 Updated Nov 11, 2024

Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"

Python 447 29 Updated Mar 19, 2024

Constrained Decoding for LLMs against JSON Schema

Python 322 8 Updated May 16, 2023

A large-scale simulation framework for LLM inference

Python 275 42 Updated Oct 10, 2024

Common utilities for ONNX converters

Python 251 66 Updated Jun 20, 2024

A comprehensive deep dive into the world of tokens

Python 214 8 Updated Jun 24, 2024

A Python Search Engine for Humans 🥸

Python 185 22 Updated Apr 22, 2024

Excel spreadsheet crawler and table parser for data extraction and querying

Python 115 9 Updated Oct 16, 2024