-
Notifications
You must be signed in to change notification settings - Fork 976
Issues: EleutherAI/gpt-neox
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
There is no train_script.py in this project
documentation
Improvements or additions to documentation
#104
by Carolingliang
was closed Feb 2, 2021
NameError: name 'RepeatingLoader' is not defined
bug
Something isn't working
#112
by Carolingliang
was closed Feb 4, 2021
requests.exceptions.HTTPError: 404 Client Error: Not Found for url: https://api.wandb.ai/graphql
#114
by Carolingliang
was closed Feb 4, 2021
Version conflict on colab
bug
Something isn't working
#116
by ShivanshuPurohit
was closed Feb 17, 2021
Figure out why 1-bit Adam pretends to run
bug
Something isn't working
documentation
Improvements or additions to documentation
experiments
Experiments we wish to perform on the codebase
#128
by StellaAthena
was closed Mar 4, 2021
train scripts for reproducing 3d-parallel results with megatron-gpt.
documentation
Improvements or additions to documentation
#135
by gongjingcs
was closed Mar 2, 2021
Ensure Checkpoint Saving / Loading works correctly
bug
Something isn't working
#151
by sdtblck
was closed Mar 6, 2021
Change corpora to use lm-dataformat
feature request
New feature or request
good first issue
Good for newcomers
#165
by leogao2
was closed Apr 11, 2021
Number of GPUs not automatically detected on a single node instance
#194
by sdtblck
was closed Apr 5, 2021
Update documentation
documentation
Improvements or additions to documentation
#27
by StellaAthena
was closed Jan 6, 2021
Are there any hadrdware requirements to implement this model on GPUs?
#201
by bpm246
was closed Mar 30, 2021
Why doesn't 1-bit adam improve pipeline parallel speed?
bug
Something isn't working
#206
by sdtblck
was closed Dec 30, 2021
Integrate DeepSpeed
feature request
New feature or request
#4
by StellaAthena
was closed Dec 26, 2020
2
Build a Tensorboard
feature request
New feature or request
#5
by StellaAthena
was closed Dec 27, 2020
Allow for alternative architectures
feature request
New feature or request
#6
by StellaAthena
was closed Jan 4, 2021
Integrate the full power of ZeRo into the code
feature request
New feature or request
#19
by StellaAthena
was closed Jan 5, 2021
Integrate ZeRO-Powered Data Parallelism
feature request
New feature or request
#20
by StellaAthena
was closed Jan 5, 2021
Hardcoded paths in gpt3_small.json
bug
Something isn't working
#26
by anthony-dipofi
was closed Jan 5, 2021
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.