Block or Report
Block or report jojo1899
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLists (2)
Sort Name ascending (A-Z)
Stars
Language
Sort by: Recently starred
A one-of-a-kind resume builder that keeps your privacy in mind. Completely secure, customizable, portable, open-source and free forever. Try it out today!
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
Video, Image and GIF upscale/enlarge(Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IF…
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) avai…
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
Fast and memory-efficient exact attention
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Concepts and examples on using and training LLMs
Generative AI extensions for onnxruntime
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Neural Networks: Zero to Hero
Build your own Custom RAG Chatbot using Gradio, Langchain and Llama2
A cloud-native vector database, storage for next generation AI applications
Llama2 transformer walkthrough with code examples
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
A library for efficient similarity search and clustering of dense vectors.
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
A Gradio web UI for Large Language Models.
An LLM playground you can run on your laptop
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs