Skip to content
View github-luffy's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.
  • 浙江杭州

Block or report github-luffy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

391 results for source starred repositories
Clear filter

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,555 4,030 Updated Aug 30, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 8,992 823 Updated Jul 1, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 12,929 1,032 Updated May 23, 2024

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 7,606 917 Updated Aug 28, 2024

Fast and memory-efficient exact attention

Python 13,198 1,190 Updated Aug 29, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 10,249 777 Updated Aug 21, 2024

Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 336 23 Updated Aug 11, 2024

An easy-to-use library for skin tone classification

Python 89 13 Updated Mar 14, 2024
Jupyter Notebook 74 12 Updated Jul 15, 2024

Utilities intended for use with Llama models.

Python 3,651 619 Updated Aug 30, 2024

Firefly: 大模型训练工具,支持训练Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 5,585 502 Updated Jul 16, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 11,651 793 Updated Aug 15, 2024

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 14,979 1,006 Updated Sep 1, 2024

RoFormer V1 & V2 pytorch

Python 459 39 Updated May 18, 2022

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 8,859 763 Updated Aug 30, 2024

The official Meta Llama 3 GitHub site

Python 25,856 2,884 Updated Aug 12, 2024

An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks

Python 1,955 196 Updated Nov 16, 2023

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 4,441 339 Updated Aug 17, 2024

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型

Python 13,295 1,541 Updated Jul 10, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 5,325 419 Updated Aug 20, 2024

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Python 40,335 5,173 Updated Jun 27, 2024

基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型,进行下游具体任务微调,涉及Freeze、Lora、P-tuning、全参微调等

Python 2,617 290 Updated Dec 12, 2023

AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents

Python 13,102 1,723 Updated Sep 2, 2024

📚 HuggingFace 中文文档

JavaScript 10 1 Updated Feb 16, 2024

The official Python client for the Huggingface Hub.

Python 1,953 509 Updated Aug 30, 2024

YOLOv10: Real-Time End-to-End Object Detection

Python 9,080 829 Updated Aug 8, 2024

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 9,908 780 Updated Aug 20, 2024

Tile primitives for speedy kernels

Cuda 1,466 55 Updated Aug 31, 2024

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 956 45 Updated Jan 16, 2024
Next