Stars

LLM

36 repositories

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,175 862 Updated Jul 1, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 12,356 840 Updated Oct 3, 2024

LlamaIndex is a data framework for your LLM applications

Python 36,640 5,246 Updated Nov 12, 2024

lightweight, standalone C++ inference engine for Google's Gemma models.

C++ 5,987 509 Updated Nov 11, 2024

The official PyTorch implementation of Google's Gemma models

Python 5,284 507 Updated Jul 31, 2024

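A repository for pretraining from scratch and then SFT-ing a small-parameter Chinese LLaMa2; a single 24 GB GPU is enough to run it and obtain a chat-llama2 with basic Chinese question-answering ability.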

Python 2,528 310 Updated May 21, 2024

Modeling, training, eval, and inference code for OLMo

Python 4,619 470 Updated Nov 12, 2024

A universal local knowledge base solution based on a vector database and GPT3.5

Python 3,631 318 Updated May 12, 2023

a fast cross platform AI inference engine 🤖 using Rust 🦀 and WebGPU 🎮

Rust 401 35 Updated Oct 21, 2024

From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)

Jupyter Notebook 594 62 Updated Oct 30, 2024

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

Go 97,327 7,745 Updated Nov 12, 2024

An unnecessarily tiny implementation of GPT-2 in NumPy.

Python 3,247 415 Updated Apr 24, 2023

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 37,262 5,927 Updated Aug 19, 2024

Python package for easily interfacing with chat apps, with robust features and minimal code complexity.

Python 3,489 229 Updated Jul 3, 2024

Grok open release

Python 49,535 8,319 Updated Aug 30, 2024

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

2,800 191 Updated Nov 1, 2024

LLM Finetuning with peft

Jupyter Notebook 2,150 597 Updated Jul 8, 2024

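<Beat AI>, also known as <零生万物>, is an introductory AI bible written specifically for software engineers that walks you through writing AI hands-on. It goes from neural networks to large models, from high-level design to low-level principles, and from engineering implementation to algorithms. Once you finish it, you will find that AI is not as unreachable or unbeatable as you imagined. Just beat it!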

Handlebars 3,459 203 Updated Apr 22, 2024

Chinese NLP solutions (large models, data, models, training, inference)

Jupyter Notebook 2,956 362 Updated Oct 29, 2024

LLM training in simple, raw C/CUDA

Cuda 24,367 2,753 Updated Oct 2, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,206 277 Updated May 4, 2024

tiny vision language model

Jupyter Notebook 5,627 463 Updated Nov 11, 2024

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

3,579 150 Updated Sep 25, 2024

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 1,554 159 Updated Aug 17, 2024

Implementing a small-parameter Chinese large language model from scratch.

Python 257 30 Updated Aug 22, 2024

Video+code lecture on building nanoGPT from scratch

Python 3,584 497 Updated Aug 13, 2024

Implementation for MatMul-free LM.

Python 2,918 183 Updated Nov 5, 2024

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

2,176 107 Updated Sep 24, 2024

LLM101n: Let's build a Storyteller

30,074 1,641 Updated Aug 1, 2024