Skip to content
View ymjiang's full-sized avatar

Organizations

@bytedance @dmlc
Block or Report

Block or report ymjiang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userā€™s behavior. Learn more about reporting abuse.

Report abuse
Showing results

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Python 1,666 87 Updated Jan 21, 2024

Microsoft Automatic Mixed Precision Library

Python 498 37 Updated Aug 15, 2024

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,129 139 Updated Jun 25, 2024

Fast and memory-efficient exact attention

Python 12,952 1,166 Updated Aug 15, 2024

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,423 471 Updated Jan 8, 2024

šŸ™ Guides, papers, lecture, notebooks and resources for prompt engineering

MDX 47,064 4,562 Updated Aug 12, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 35,635 5,520 Updated Aug 2, 2024
Python 209 28 Updated Nov 9, 2022

Pipeline Parallelism for PyTorch

Python 685 84 Updated Aug 7, 2024
Python 202 23 Updated Aug 17, 2023

IntelĀ® Performance Counter Monitor (IntelĀ® PCM)

C++ 2,708 461 Updated Aug 14, 2024

A multi-party collaborative machine learning framework

Python 890 173 Updated Jul 8, 2024

Ongoing research training transformer models at scale

Python 9,690 2,180 Updated Aug 15, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,323 4,011 Updated Aug 16, 2024

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python 8,676 1,959 Updated Apr 16, 2024

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

C++ 4 4 Updated Nov 9, 2019

A lightweight parameter server interface

C++ 70 24 Updated Jan 13, 2023

A high performance and generic framework for distributed DNN training

Python 3,605 490 Updated Oct 3, 2023

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Python 14,107 2,221 Updated Aug 1, 2024