Issues: EleutherAI/gpt-neox
Write dataset class that tokenizes on the fly
feature request
New feature or request
#40
by sdtblck
was closed Feb 15, 2021
Build a Tensorboard
feature request
New feature or request
#5
by StellaAthena
was closed Dec 27, 2020
Allow for alternative architectures
feature request
New feature or request
#6
by StellaAthena
was closed Jan 4, 2021
Create experiment runners
feature request
New feature or request
good first issue
Good for newcomers
#7
by StellaAthena
was closed Feb 17, 2021
2 tasks
Integrate the full power of ZeRO into the code
feature request
New feature or request
#19
by StellaAthena
was closed Jan 5, 2021
Integrate ZeRO-Powered Data Parallelism
feature request
New feature or request
#20
by StellaAthena
was closed Jan 5, 2021
Hardcoded paths in gpt3_small.json
bug
Something isn't working
#26
by anthony-dipofi
was closed Jan 5, 2021
Update documentation
documentation
Improvements or additions to documentation
#27
by StellaAthena
was closed Jan 6, 2021
ftfy used in create_tfrecords.py but not listed in requirements.txt
bug
Something isn't working
#28
by anthony-dipofi
was closed Jan 4, 2021
Dataset downloads <number of GPUs> times when running deepspeed train.py
#37
by sdtblck
was closed Jan 5, 2021
Fix tfrecord dataset to load less files into memory
bug
Something isn't working
#41
by sdtblck
was closed Mar 2, 2021
Add Deepspeed Transformer Kernel
feature request
New feature or request
good first issue
Good for newcomers
#43
by sdtblck
was closed Jan 10, 2021
Ensure learning rate scheduler is functioning correctly
bug
Something isn't working
documentation
Improvements or additions to documentation
#44
by sdtblck
was closed Jan 23, 2021
Implement Pipeline Parallelism
feature request
New feature or request
#45
by sdtblck
was closed Jan 14, 2021
Implement Gradient Checkpointing
feature request
New feature or request
good first issue
Good for newcomers
#55
by StellaAthena
was closed Jan 12, 2021
Implement Generation / Eval with deepspeed model engine
feature request
New feature or request
#58
by sdtblck
was closed Feb 15, 2021
Pipeline parallelism and gradient checkpointing (edit: and ZeRO 2!) don’t work together
bug
Something isn't working
#62
by StellaAthena
was closed Jan 28, 2021
(T5) Relative positional encodings?
feature request
New feature or request
#66
by CRG2K
was closed Mar 4, 2021
Fix DeepSpeed (ZeRO2 + Pipeline Parallel)
bug
Something isn't working
help wanted
This issue needs assistance
#67
by StellaAthena
was closed Jan 16, 2021
Expand to all 8 CoreWeave Machines
feature request
New feature or request
#68
by StellaAthena
was closed Jan 24, 2021
Cannot perform inference, be it unconditional, input-file or interactive
bug
Something isn't working
#1228
by srivassid
was closed May 30, 2024