GPT-NeoX

This repository records EleutherAI's work-in-progress for training large-scale language models on GPUs. Our current framework is based on NVIDIA's Megatron Language Model and has been augmented with techniques from DeepSpeed as well as some novel optimizations. If you are looking for our TPU codebase, see GPT-Neo.
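
As a rough illustration of the DeepSpeed pattern this codebase builds on, the sketch below wraps a toy PyTorch module with deepspeed.initialize and runs one training step. It assumes a recent DeepSpeed release that accepts a config= keyword and a hypothetical ds_config.json; the toy model and training loop are assumptions for illustration only, not this repository's actual entry point.

```python
# Illustrative sketch only -- not this repository's training entry point.
# Assumes DeepSpeed is installed, a GPU is available, and a hypothetical
# ds_config.json (batch size, optimizer, fp16/ZeRO settings) sits alongside it.
import torch
import torch.nn as nn
import deepspeed


class TinyAutoregressiveLM(nn.Module):
    """Toy stand-in for the real Megatron-style transformer."""

    def __init__(self, vocab_size=1000, hidden=128):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden)
        self.proj = nn.Linear(hidden, vocab_size)

    def forward(self, tokens):
        return self.proj(self.embed(tokens))


model = TinyAutoregressiveLM()

# deepspeed.initialize wraps the module in an engine that owns the optimizer,
# gradient accumulation, and any fp16/ZeRO behaviour declared in the config.
model_engine, optimizer, _, _ = deepspeed.initialize(
    model=model,
    model_parameters=model.parameters(),
    config="ds_config.json",  # hypothetical config path
)

# One toy training step: next-token prediction on random token ids.
tokens = torch.randint(0, 1000, (8, 64), device=model_engine.device)
logits = model_engine(tokens)                        # (batch, seq, vocab)
loss = nn.functional.cross_entropy(
    logits[:, :-1, :].reshape(-1, logits.size(-1)),  # predictions for t+1
    tokens[:, 1:].reshape(-1),                       # shifted targets
)
model_engine.backward(loss)  # engine-managed backward (handles loss scaling)
model_engine.step()          # optimizer step + gradient zeroing
```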

Getting Started

TO DO

Training

TO DO

Datasets

TO DO

Pretrained Models

TO DO

Downloading Checkpoints

TO DO

Inference

TO DO

Fine-Tuning

TO DO

Licensing

This repository hosts code that is part of EleutherAI's GPT-NeoX project. Copyright 2021 Stella Biderman, Sid Black, Josh Levy-Kramer, and Shivanshu Purohit.

GPT-NeoX is free software: you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation, either version 3 of the License, or
(at your option) any later version.

This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
GNU General Public License for more details.

You should have received a copy of the GNU General Public License
along with this program. If not, see <https://www.gnu.org/licenses/>.

This repository is based on code written by NVIDIA that is licensed under the Apache License, Version 2.0. In accordance with the Apache License, all files that are modifications of code originally written by NVIDIA maintain an NVIDIA copyright header. All files that do not contain such a header are original to EleutherAI. When the NVIDIA code has been modified from its original version, that fact is noted in the copyright header. All derivative works of this repository must preserve these headers under the terms of the Apache License.
