jeffra

Jeff Rasley jeffra

@snowflakedb AI Research Team, DeepSpeed co-founder.

274 followers · 0 following

Achievements

x4 x4 x3

Achievements

x4 x4 x3

Highlights

Organizations

Stars

muellerzr / minimal-trainer-zoo

Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines

Python 195 13 Updated May 6, 2024

stas00 / ml-engineering

Machine Learning Engineering Open Book

Python 10,614 641 Updated Sep 2, 2024

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 15,677 1,507 Updated Sep 3, 2024

microsoft / DeepSpeed-MII

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 1,837 172 Updated Aug 28, 2024

yandex / YaLM-100B

Pretrained language model with 100B parameters

Python 3,732 296 Updated Jul 10, 2023

aqlaboratory / openfold

Trainable, memory-efficient, and GPU-friendly PyTorch reproduction of AlphaFold 2

Python 2,693 500 Updated Jul 23, 2024

facebookresearch / SLIP

Code release for SLIP Self-supervision meets Language-Image Pre-training

Python 737 67 Updated Feb 9, 2023

Azure / azhpc-images

Azure HPC/AI VM Images

Shell 95 77 Updated Aug 27, 2024

facebookresearch / bitsandbytes

Library for 8-bit optimizers and quantization routines.

714 38 Updated Aug 18, 2022

bigscience-workshop / Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 1,305 211 Updated Mar 20, 2024

logicalclocks / maggy

Distribution transparent Machine Learning experiments on Apache Spark

Python 89 14 Updated Feb 21, 2024

EleutherAI / gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 6,790 984 Updated Aug 27, 2024

Xirider / finetune-gpt2xl

Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpeed

Python 427 73 Updated Jun 14, 2023

microsoft / archai

Accelerate your Neural Architecture Search (NAS) through fast, reproducible and modular research.

Python 465 91 Updated Dec 22, 2023

Mellanox / nccl-rdma-sharp-plugins

RDMA and SHARP plugins for nccl library

C 153 31 Updated Aug 15, 2024

microsoft / DeepSpeedExamples

Example models using DeepSpeed

Python 5,975 1,010 Updated Aug 28, 2024

microsoft / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,584 4,035 Updated Sep 3, 2024

Pseudomanifold / latex-mimosis

A minimal & modern LaTeX template for your (bachelor's | master's | doctoral) thesis

TeX 1,151 128 Updated Nov 16, 2023

bestephe / topo_opt

Find the smallest number of switches necessary to build topologies of a given number of hosts and bisection bandwidth for the EGFT, HyperX, and Jellyfish topologies.

Python 2 Updated Jul 24, 2013

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jeff Rasley jeffra

Achievements

Achievements

Highlights

Organizations

Block or report jeffra

Stars

muellerzr / minimal-trainer-zoo

stas00 / ml-engineering

huggingface / peft

microsoft / DeepSpeed-MII

yandex / YaLM-100B

aqlaboratory / openfold

facebookresearch / SLIP

Azure / azhpc-images

facebookresearch / bitsandbytes

bigscience-workshop / Megatron-DeepSpeed

logicalclocks / maggy

EleutherAI / gpt-neox

Xirider / finetune-gpt2xl

microsoft / archai

Mellanox / nccl-rdma-sharp-plugins

microsoft / DeepSpeedExamples

microsoft / DeepSpeed

Pseudomanifold / latex-mimosis

bestephe / topo_opt