jojo1899

jojo1899

0 followers · 3 following

Block or Report

Block or report jojo1899

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Lists (2)

Sort

Inference engines

3 repositories

LLMs

5 repositories

Beta Lists are currently in beta. Share feedback and report bugs.

Stars

AmruthPillai / Reactive-Resume

A one-of-a-kind resume builder that keeps your privacy in mind. Completely secure, customizable, portable, open-source and free forever. Try it out today!

TypeScript 20,967 2,243 Updated Jul 31, 2024

rasbt / LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 23,745 2,473 Updated Jul 30, 2024

AaronFeng753 / Waifu2x-Extension-GUI

Video, Image and GIF upscale/enlarge(Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IF…

C++ 12,556 856 Updated Jul 16, 2024

intel / neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

Python 2,104 251 Updated Jul 31, 2024

tensorflow / playground

Play with neural networks!

TypeScript 11,875 2,524 Updated Jul 25, 2024

confident-ai / deepeval

The LLM Evaluation Framework

Python 2,547 183 Updated Jul 30, 2024

huggingface / tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 8,757 759 Updated Jul 30, 2024

microsoft / Phi-3CookBook

This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) avai…

Jupyter Notebook 1,411 128 Updated Jul 26, 2024

mit-han-lab / streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 6,392 356 Updated Jul 11, 2024

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 12,666 1,133 Updated Jul 30, 2024

KindXiaoming / pykan

Kolmogorov Arnold Networks

Jupyter Notebook 13,945 1,261 Updated Jul 31, 2024

Lightning-AI / litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 9,136 913 Updated Jul 30, 2024

piskvorky / gensim

Topic Modelling for Humans

Python 15,479 4,370 Updated Jul 23, 2024

danielsobrado / llm_notebooks

Concepts and examples on using and training LLMs

Jupyter Notebook 37 4 Updated May 27, 2024

ray-project / llm-numbers

Numbers every LLM developer should know

4,010 139 Updated Jan 16, 2024

microsoft / onnxruntime-genai

Generative AI extensions for onnxruntime

C++ 351 80 Updated Jul 31, 2024

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 35,245 5,450 Updated Jul 19, 2024

karpathy / nn-zero-to-hero

Neural Networks: Zero to Hero

Jupyter Notebook 11,193 1,376 Updated Jul 6, 2024

Niez-Gharbi / PDF-RAG-with-Llama2-and-Gradio

Build your own Custom RAG Chatbot using Gradio, Langchain and Llama2

Python 45 11 Updated Jan 26, 2024

milvus-io / milvus

A cloud-native vector database, storage for next generation AI applications

Go 28,605 2,753 Updated Jul 31, 2024

bdzwillo / llama_walkthrough

Llama2 transformer walkthrough with code examples

C 28 4 Updated Nov 9, 2023

mlabonne / llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 34,927 3,668 Updated Jul 28, 2024

facebookresearch / faiss

A library for efficient similarity search and clustering of dense vectors.

C++ 29,750 3,503 Updated Jul 31, 2024

NVIDIA / ChatRTX

A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM

Python 2,577 291 Updated Jun 22, 2024

NVIDIA / TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 7,731 842 Updated Jul 30, 2024

oobabooga / text-generation-webui

A Gradio web UI for Large Language Models.

Python 38,728 5,100 Updated Jul 29, 2024

nat / openplayground

An LLM playground you can run on your laptop

TypeScript 6,174 478 Updated Jul 10, 2024

ggerganov / llama.cpp

LLM inference in C/C++

C++ 62,779 9,008 Updated Jul 31, 2024

SJTU-IPADS / PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C++ 7,710 395 Updated Jul 15, 2024

srush / GPU-Puzzles

Solve puzzles. Learn CUDA.

Jupyter Notebook 5,455 317 Updated Jul 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly