Skip to content
View github-luffy's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.
  • 浙江杭州
Block or Report

Block or report github-luffy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
Jupyter Notebook 55 8 Updated Jul 15, 2024

Utilities intended for use with Llama models.

Python 2,685 328 Updated Jul 28, 2024

Firefly: 大模型训练工具,支持训练Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 5,348 482 Updated Jul 16, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 11,308 766 Updated Jul 10, 2024

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 13,195 868 Updated Jul 28, 2024

RoFormer V1 & V2 pytorch

Python 448 38 Updated May 18, 2022

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 8,746 759 Updated Jul 27, 2024

The official Meta Llama 3 GitHub site

Python 24,760 2,694 Updated Jul 27, 2024

An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks

Python 1,941 196 Updated Nov 16, 2023

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 3,928 286 Updated Jul 27, 2024

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型

Python 13,171 1,518 Updated Jul 10, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的可商用开源多模态对话模型

Python 4,425 341 Updated Jul 27, 2024

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Python 40,128 5,162 Updated Jun 27, 2024

基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型,进行下游具体任务微调,涉及Freeze、Lora、P-tuning、全参微调等

Python 2,581 286 Updated Dec 12, 2023

AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents

Python 12,703 1,658 Updated Jul 26, 2024

📚 HuggingFace 中文文档

JavaScript 9 1 Updated Feb 16, 2024

The official Python client for the Huggingface Hub.

Python 1,871 487 Updated Jul 26, 2024

YOLOv10: Real-Time End-to-End Object Detection

Python 8,553 748 Updated Jul 18, 2024

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 9,697 750 Updated May 19, 2024

Tile primitives for speedy kernels

Cuda 1,414 51 Updated Jul 27, 2024

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 933 43 Updated Jan 16, 2024

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

3,145 119 Updated Jun 26, 2024

Efficient Lane Detection

Python 32 Updated Jun 1, 2024

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Python 8,657 1,323 Updated Jul 21, 2024

[CVPR2024] OneFormer3D: One Transformer for Unified Point Cloud Segmentation

Python 269 27 Updated Jul 22, 2024

Official implementation of SCTNet (AAAI2024)

Python 152 11 Updated Jan 17, 2024

[arXiv preprint] The official code of paper "Open-Vocabulary SAM".

Python 775 25 Updated Jul 22, 2024
Python 38 2 Updated Jan 10, 2024
Next