-
Notifications
You must be signed in to change notification settings - Fork 982
Issues: EleutherAI/gpt-neox
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Integrate DeepSpeed
feature request
New feature or request
#4
by StellaAthena
was closed Dec 26, 2020
2
Build a Tensorboard
feature request
New feature or request
#5
by StellaAthena
was closed Dec 27, 2020
Allow for alternative architectures
feature request
New feature or request
#6
by StellaAthena
was closed Jan 4, 2021
Hardcoded paths in gpt3_small.json
bug
Something isn't working
#26
by anthony-dipofi
was closed Jan 5, 2021
Dataset downloads <number of GPUs> times when running deepspeed train.py
#37
by sdtblck
was closed Jan 5, 2021
Version conflict on colab
bug
Something isn't working
#116
by ShivanshuPurohit
was closed Feb 17, 2021
Figure out why 1-bit Adam pretends to run
bug
Something isn't working
documentation
Improvements or additions to documentation
experiments
Experiments we wish to perform on the codebase
#128
by StellaAthena
was closed Mar 4, 2021
Change corpora to use lm-dataformat
feature request
New feature or request
good first issue
Good for newcomers
#165
by leogao2
was closed Apr 11, 2021
Why doesn't 1-bit adam improve pipeline parallel speed?
bug
Something isn't working
#206
by sdtblck
was closed Dec 30, 2021
Timer logging innacurate if pp=0
bug
Something isn't working
#238
by sdtblck
was closed Apr 30, 2021
Get rid of codepath where pp = 0
feature request
New feature or request
#243
by sdtblck
was closed Apr 30, 2021
Write Sampling Documentation
documentation
Improvements or additions to documentation
#252
by sdtblck
was closed Aug 21, 2021
Implement Bf16
feature request
New feature or request
#302
by sdtblck
was closed Jun 22, 2021
1 task
Add ReLU / other activation fns
feature request
New feature or request
#303
by sdtblck
was closed May 7, 2021
Eval Harness doesn't log during training
feature request
New feature or request
#366
by StellaAthena
was closed Aug 31, 2021
ImportError: cannot import name 'LocalSlidingWindowSparsityConfig' from 'deepspeed.ops.sparse_attention.sparsity_config
bug
Something isn't working
#431
by rokosbasilisk
was closed Oct 18, 2021
Ensure NeoX is compatible with HF
bug
Something isn't working
#485
by StellaAthena
was closed Feb 7, 2022
HF Equivalent Pretrained Models
feature request
New feature or request
#489
by sameeravithana
was closed Sep 25, 2022
Previous Next
ProTip!
Follow long discussions with comments:>50.