Skip to content
View x22x22's full-sized avatar
👑
GitHub SVIP
👑
GitHub SVIP

Organizations

@apache
Block or Report

Block or report x22x22

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用

Python 51 8 Updated Mar 16, 2024

A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.

Python 507 66 Updated Jul 17, 2024

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Python 494 38 Updated Mar 4, 2024

Go ahead and axolotl questions

Python 6,971 765 Updated Jul 21, 2024

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C++ 7,672 407 Updated Jul 15, 2024

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

Python 2,992 623 Updated Jan 22, 2024

Awesome machine learning model compression research papers, quantization, tools, and learning material.

455 62 Updated May 8, 2024

A coding-free framework built on PyTorch for reproducible deep learning studies. 🏆25 knowledge distillation methods presented at CVPR, ICLR, ECCV, NeurIPS, ICCV, etc are implemented so far. 🎁 Train…

Python 1,326 128 Updated May 28, 2024

🪢 Open source LLM engineering platform: Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

TypeScript 4,854 443 Updated Jul 21, 2024

The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease.

TypeScript 1,099 185 Updated Jul 5, 2024

《动手学大模型Dive into LLMs》系列编程实践教程

2,665 220 Updated Jul 3, 2024

Intelligence for Kubernetes. World's most promising Kubernetes Visualization Tool for Developer and Platform Engineering teams.

Go 325 35 Updated Jul 18, 2024

Must-read Papers on Large Language Model (LLM) Continual Learning

122 10 Updated Nov 14, 2023

Continual Learning of Large Language Models: A Comprehensive Survey

170 11 Updated Jul 2, 2024

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

Python 962 77 Updated Aug 16, 2023

An Open-Source Framework for Prompt-Learning.

Python 4,249 436 Updated Jul 16, 2024

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 1,793 109 Updated Jul 20, 2024

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 1,770 166 Updated Jul 10, 2024

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, Phi, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and…

Python 6,311 1,225 Updated Jul 22, 2024

中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3

Python 1,309 116 Updated May 31, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 8,776 800 Updated Jul 1, 2024

[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models

Python 234 17 Updated Jun 22, 2024
Python 99 9 Updated Apr 16, 2024

unified embedding model

Python 793 60 Updated Sep 1, 2023

Python 3.9+ installers that support Windows 7 SP1 and Windows Server 2008 R2

505 61 Updated Jun 9, 2024

DSPy: The framework for programming—not prompting—foundation models

Python 14,647 1,125 Updated Jul 21, 2024

This is an implementation of the paper Attention is all you need.

Jupyter Notebook 9 2 Updated Oct 4, 2023

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 9,864 1,138 Updated Jul 10, 2024

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 3,392 274 Updated Jul 19, 2024
Next