Skip to content
View jeffra's full-sized avatar

Highlights

  • Pro

Organizations

@brownsys
Block or Report

Block or report jeffra

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Stars

Showing results

Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines

Python 196 13 Updated May 6, 2024

Machine Learning Engineering Open Book

Python 10,305 618 Updated Jul 27, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 15,240 1,464 Updated Jul 29, 2024

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 1,788 168 Updated Jul 25, 2024

Pretrained language model with 100B parameters

Python 3,731 297 Updated Jul 10, 2023

Trainable, memory-efficient, and GPU-friendly PyTorch reproduction of AlphaFold 2

Python 2,648 488 Updated Jul 23, 2024

Code release for SLIP Self-supervision meets Language-Image Pre-training

Python 735 67 Updated Feb 9, 2023

Azure HPC/AI VM Images

Shell 92 75 Updated Jul 29, 2024

Library for 8-bit optimizers and quantization routines.

715 38 Updated Aug 18, 2022

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 1,289 210 Updated Mar 20, 2024

Distribution transparent Machine Learning experiments on Apache Spark

Python 89 14 Updated Feb 21, 2024

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 6,733 982 Updated Jul 26, 2024

Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpeed

Python 426 73 Updated Jun 14, 2023

Accelerate your Neural Architecture Search (NAS) through fast, reproducible and modular research.

Python 462 90 Updated Dec 22, 2023

RDMA and SHARP plugins for nccl library

C 150 32 Updated Jun 12, 2024

Example models using DeepSpeed

Python 5,891 997 Updated Jul 26, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,052 3,985 Updated Jul 28, 2024

A minimal & modern LaTeX template for your (bachelor's | master's | doctoral) thesis

TeX 1,147 125 Updated Nov 16, 2023

Find the smallest number of switches necessary to build topologies of a given number of hosts and bisection bandwidth for the EGFT, HyperX, and Jellyfish topologies.

Python 2 Updated Jul 24, 2013