pranaybaldev / gpt-neox Public

forked from EleutherAI/gpt-neox

Notifications You must be signed in to change notification settings
Fork 0
Star 1

An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 64 Commits
configs		configs
gpt_neox		gpt_neox
scripts		scripts
.gitignore		.gitignore
CODEOWNERS		CODEOWNERS
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
train.py		train.py
train_enwik8.py		train_enwik8.py

Repository files navigation

GPT-NeoX

An implementation of model parallel GPT-3-like models on GPUs, based on the DeepSpeed library. Designed to be able to train models in the hundreds of billions of parameters or larger.

Requirements

$ pip install -r requirements.txt

Test deepspeed locally

$ deepspeed train_enwik8.py \
	--deepspeed \
	--deepspeed_config ./configs/base_deepspeed.json

About

An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.

Readme

Apache-2.0 license

Activity

1 star

1 watching

0 forks

Report repository

Releases

No releases published

Packages

No packages published

Languages

Python 84.4%
C++ 12.8%
Cuda 1.2%
C 0.9%
Dockerfile 0.6%
Shell 0.1%

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GPT-NeoX

Requirements

About

Releases

Packages

Languages

License

pranaybaldev/gpt-neox

Folders and files

Latest commit

History

Repository files navigation

GPT-NeoX

Requirements

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages