Skip to content
View jojo1899's full-sized avatar
Block or Report

Block or report jojo1899

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

A one-of-a-kind resume builder that keeps your privacy in mind. Completely secure, customizable, portable, open-source and free forever. Try it out today!

TypeScript 20,967 2,243 Updated Jul 31, 2024

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 23,745 2,473 Updated Jul 30, 2024

Video, Image and GIF upscale/enlarge(Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IF…

C++ 12,556 856 Updated Jul 16, 2024

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

Python 2,104 251 Updated Jul 31, 2024

Play with neural networks!

TypeScript 11,875 2,524 Updated Jul 25, 2024

The LLM Evaluation Framework

Python 2,547 183 Updated Jul 30, 2024

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 8,757 759 Updated Jul 30, 2024

This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) avai…

Jupyter Notebook 1,411 128 Updated Jul 26, 2024

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 6,392 356 Updated Jul 11, 2024

Fast and memory-efficient exact attention

Python 12,666 1,133 Updated Jul 30, 2024

Kolmogorov Arnold Networks

Jupyter Notebook 13,945 1,261 Updated Jul 31, 2024

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 9,136 913 Updated Jul 30, 2024

Topic Modelling for Humans

Python 15,479 4,370 Updated Jul 23, 2024

Concepts and examples on using and training LLMs

Jupyter Notebook 37 4 Updated May 27, 2024

Numbers every LLM developer should know

4,010 139 Updated Jan 16, 2024

Generative AI extensions for onnxruntime

C++ 351 80 Updated Jul 31, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 35,245 5,450 Updated Jul 19, 2024

Neural Networks: Zero to Hero

Jupyter Notebook 11,193 1,376 Updated Jul 6, 2024

Build your own Custom RAG Chatbot using Gradio, Langchain and Llama2

Python 45 11 Updated Jan 26, 2024

A cloud-native vector database, storage for next generation AI applications

Go 28,605 2,753 Updated Jul 31, 2024

Llama2 transformer walkthrough with code examples

C 28 4 Updated Nov 9, 2023

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 34,927 3,668 Updated Jul 28, 2024

A library for efficient similarity search and clustering of dense vectors.

C++ 29,750 3,503 Updated Jul 31, 2024

A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM

Python 2,577 291 Updated Jun 22, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 7,731 842 Updated Jul 30, 2024

A Gradio web UI for Large Language Models.

Python 38,728 5,100 Updated Jul 29, 2024

An LLM playground you can run on your laptop

TypeScript 6,174 478 Updated Jul 10, 2024

LLM inference in C/C++

C++ 62,779 9,008 Updated Jul 31, 2024

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C++ 7,710 395 Updated Jul 15, 2024

Solve puzzles. Learn CUDA.

Jupyter Notebook 5,455 317 Updated Jul 5, 2024
Next