🌬️
Breezing

Highlights

  • Pro

Organizations

@bjut-swift
Stars

LLM

32 repositories

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 8,819 806 Updated Jul 1, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 11,337 769 Updated Jul 10, 2024

LlamaIndex is a data framework for your LLM applications

Python 34,046 4,796 Updated Jul 30, 2024

lightweight, standalone C++ inference engine for Google's Gemma models.

C++ 5,830 496 Updated Jul 30, 2024

The official PyTorch implementation of Google's Gemma models

Python 5,190 493 Updated Jul 11, 2024

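A repository for pretraining from scratch plus SFT of a small-parameter Chinese LLaMa2; a single 24G GPU is enough to run it and obtain a chat-llama2 with basic Chinese question-answering ability.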

Python 2,368 290 Updated May 21, 2024

Modeling, training, eval, and inference code for OLMo

Python 4,257 401 Updated Jul 30, 2024

A universal local knowledge base solution based on a vector database and GPT3.5

Python 3,599 313 Updated May 12, 2023

a fast cross platform AI inference engine 🤖 using Rust 🦀 and WebGPU 🎮

Rust 383 33 Updated Jun 17, 2024

From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)

Jupyter Notebook 563 55 Updated Apr 4, 2024

Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.

Go 82,220 6,283 Updated Jul 30, 2024

An unnecessarily tiny implementation of GPT-2 in NumPy.

Python 3,140 401 Updated Apr 24, 2023

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 35,234 5,450 Updated Jul 19, 2024

Python package for easily interfacing with chat apps, with robust features and minimal code complexity.

Python 3,443 228 Updated Jul 3, 2024

Daily updated LLM papers. Subscriptions welcome 👏 If you like it, please give it a star 🌟

803 31 Updated Jul 25, 2024

Grok open release

Python 49,218 8,312 Updated May 29, 2024

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

2,092 143 Updated Jul 29, 2024

LLM Finetuning with peft

Jupyter Notebook 1,933 537 Updated Jul 8, 2024

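<Beat AI>, also known as <零生万物>, is an introductory AI bible written specifically for software engineers that walks you through writing AI hands-on. It covers neural networks through large models, high-level design through low-level principles, and engineering implementation through algorithms. After finishing it, you will find that AI is not as unreachable or unbeatable as you imagined. Just beat it!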

Handlebars 3,255 190 Updated Apr 22, 2024

Chinese NLP solutions (large models, data, models, training, inference)

Jupyter Notebook 2,729 351 Updated Jul 18, 2024

LLM training in simple, raw C/CUDA

Cuda 22,381 2,484 Updated Jul 30, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,111 277 Updated May 4, 2024

tiny vision language model

Jupyter Notebook 4,635 413 Updated Jul 30, 2024

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

3,160 119 Updated Jun 26, 2024

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 1,354 142 Updated Jul 19, 2024

Implementing a small-parameter Chinese large language model from scratch.

Python 112 11 Updated Jul 20, 2024

Video+code lecture on building nanoGPT from scratch

Python 3,136 400 Updated Jul 26, 2024

Implementation for MatMul-free LM.

Python 2,774 169 Updated Jun 27, 2024

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

1,545 75 Updated Jul 3, 2024