Starred repositories
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
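For illustration, a minimal sketch of wrapping a plain PyTorch model with DeepSpeed's engine; the config values below are illustrative assumptions, not a tuned recipe:

```python
# Minimal DeepSpeed sketch: deepspeed.initialize wraps a plain PyTorch model
# in a distributed training engine (typically launched via the `deepspeed` CLI).
import torch
import deepspeed

model = torch.nn.Linear(512, 512)  # any torch.nn.Module

ds_config = {
    "train_batch_size": 32,
    "fp16": {"enabled": True},                # assumes a CUDA GPU
    "zero_optimization": {"stage": 2},        # ZeRO-2: shard optimizer state + gradients
    "optimizer": {"type": "Adam", "params": {"lr": 1e-4}},
}

# Returns the wrapped engine plus the optimizer DeepSpeed built from the config.
model_engine, optimizer, _, _ = deepspeed.initialize(
    model=model, model_parameters=model.parameters(), config=ds_config
)

# Training step: the engine handles loss scaling, gradient accumulation, etc.
# x = batch.to(model_engine.device)
# loss = model_engine(x).pow(2).mean()
# model_engine.backward(loss)
# model_engine.step()
```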
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
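As a sketch of the idea (not the repo's actual code): BPE repeatedly merges the most frequent adjacent pair of tokens into a new token.

```python
# Toy BPE sketch: start from raw bytes, then repeatedly merge the most
# frequent adjacent pair of ids into a fresh id.
from collections import Counter

def get_pair_counts(ids):
    """Count occurrences of each adjacent pair in the id sequence."""
    return Counter(zip(ids, ids[1:]))

def merge(ids, pair, new_id):
    """Replace every occurrence of `pair` in `ids` with `new_id`."""
    out, i = [], 0
    while i < len(ids):
        if i < len(ids) - 1 and (ids[i], ids[i + 1]) == pair:
            out.append(new_id)
            i += 2
        else:
            out.append(ids[i])
            i += 1
    return out

ids = list("aaabdaaabac".encode("utf-8"))  # byte-level start, as LLM tokenizers do
for new_id in range(256, 256 + 3):         # 3 merges for the toy example
    counts = get_pair_counts(ids)
    pair = max(counts, key=counts.get)
    ids = merge(ids, pair, new_id)
```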
A llama3 implementation, one matrix multiplication at a time
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
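A minimal Accelerate sketch: `prepare` moves the model, optimizer, and data to whatever device/distributed configuration `accelerate launch` set up, so the loop itself stays plain PyTorch.

```python
# Minimal 🤗 Accelerate training loop sketch.
import torch
from accelerate import Accelerator

accelerator = Accelerator()  # picks up device / DDP / mixed-precision config

model = torch.nn.Linear(10, 2)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)
dataloader = torch.utils.data.DataLoader(
    torch.utils.data.TensorDataset(torch.randn(64, 10), torch.randint(0, 2, (64,))),
    batch_size=8,
)

model, optimizer, dataloader = accelerator.prepare(model, optimizer, dataloader)

for x, y in dataloader:
    optimizer.zero_grad()
    loss = torch.nn.functional.cross_entropy(model(x), y)
    accelerator.backward(loss)  # replaces loss.backward() so AMP/DDP work
    optimizer.step()
```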
Fast and memory-efficient exact attention
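A minimal call sketch, assuming the flash-attn 2.x Python API on a CUDA GPU:

```python
# Sketch of calling FlashAttention directly. Inputs must be fp16/bf16 tensors
# of shape [batch, seqlen, nheads, headdim] on a supported CUDA GPU.
import torch
from flash_attn import flash_attn_func

batch, seqlen, nheads, headdim = 2, 1024, 8, 64
q = torch.randn(batch, seqlen, nheads, headdim, device="cuda", dtype=torch.float16)
k = torch.randn_like(q)
v = torch.randn_like(q)

# Exact attention, computed without materializing the seqlen x seqlen matrix.
out = flash_attn_func(q, k, v, causal=True)  # [batch, seqlen, nheads, headdim]
```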
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
An easy-to-use library for skin tone classification
Utilities intended for use with Llama models.
Firefly: a training tool for large language models, supporting Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
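A tiktoken round trip: encode text to BPE token ids and decode it back.

```python
# tiktoken round trip with a named encoding.
import tiktoken

enc = tiktoken.get_encoding("cl100k_base")  # encoding used by gpt-4 / gpt-3.5-turbo
ids = enc.encode("tiktoken is a fast BPE tokeniser")
assert enc.decode(ids) == "tiktoken is a fast BPE tokeniser"
print(len(ids), ids)
```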
Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
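A sketch of training a byte-pair tokenizer from scratch with 🤗 Tokenizers; `corpus.txt` is a placeholder for your own corpus files.

```python
# Train a BPE tokenizer from scratch, then encode a sentence.
from tokenizers import Tokenizer
from tokenizers.models import BPE
from tokenizers.trainers import BpeTrainer
from tokenizers.pre_tokenizers import Whitespace

tokenizer = Tokenizer(BPE(unk_token="[UNK]"))
tokenizer.pre_tokenizer = Whitespace()

trainer = BpeTrainer(vocab_size=30000, special_tokens=["[UNK]", "[PAD]"])
tokenizer.train(files=["corpus.txt"], trainer=trainer)  # hypothetical corpus file

encoding = tokenizer.encode("Fast state-of-the-art tokenizers")
print(encoding.tokens, encoding.ids)
```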
An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks
GLM-4 series: Open Multilingual Multimodal Chat LMs
ChatGLM3 series: Open Bilingual Chat LLMs
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o performance.
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Fine-tuning of ChatGLM-6B, ChatGLM2-6B, and ChatGLM3-6B for specific downstream tasks, covering Freeze, LoRA, P-tuning, full-parameter fine-tuning, and more
AI Native Data App Development framework with AWEL (Agentic Workflow Expression Language) and Agents
The official Python client for the Hugging Face Hub.
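For example, downloading a single file from a repo on the Hub; the repo and filename here are just examples.

```python
# huggingface_hub sketch: fetch (and cache) one file from a Hub repo.
from huggingface_hub import hf_hub_download

path = hf_hub_download(repo_id="gpt2", filename="config.json")
print(path)  # local path inside the Hugging Face cache
```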
YOLOv10: Real-Time End-to-End Object Detection
The official GitHub page for the survey paper "A Survey of Large Language Models".
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models