Skip to content
View jeffra's full-sized avatar

Highlights

  • Pro

Organizations

@brownsys

Block or report jeffra

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines

Python 195 13 Updated May 6, 2024

Machine Learning Engineering Open Book

Python 10,614 641 Updated Sep 2, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 15,677 1,507 Updated Sep 3, 2024

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 1,837 172 Updated Aug 28, 2024

Pretrained language model with 100B parameters

Python 3,732 296 Updated Jul 10, 2023

Trainable, memory-efficient, and GPU-friendly PyTorch reproduction of AlphaFold 2

Python 2,693 500 Updated Jul 23, 2024

Code release for SLIP Self-supervision meets Language-Image Pre-training

Python 737 67 Updated Feb 9, 2023

Azure HPC/AI VM Images

Shell 95 77 Updated Aug 27, 2024

Library for 8-bit optimizers and quantization routines.

714 38 Updated Aug 18, 2022

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 1,305 211 Updated Mar 20, 2024

Distribution transparent Machine Learning experiments on Apache Spark

Python 89 14 Updated Feb 21, 2024

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 6,790 984 Updated Aug 27, 2024

Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpeed

Python 427 73 Updated Jun 14, 2023

Accelerate your Neural Architecture Search (NAS) through fast, reproducible and modular research.

Python 465 91 Updated Dec 22, 2023

RDMA and SHARP plugins for nccl library

C 153 31 Updated Aug 15, 2024

Example models using DeepSpeed

Python 5,975 1,010 Updated Aug 28, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,584 4,035 Updated Sep 3, 2024

A minimal & modern LaTeX template for your (bachelor's | master's | doctoral) thesis

TeX 1,151 128 Updated Nov 16, 2023

Find the smallest number of switches necessary to build topologies of a given number of hosts and bisection bandwidth for the EGFT, HyperX, and Jellyfish topologies.

Python 2 Updated Jul 24, 2013