Issues: EleutherAI/gpt-neox

Issues list

Integrate DeepSpeed [feature request]
#4 by StellaAthena was closed Dec 26, 2020 updated Dec 26, 2020
Build a Tensorboard [feature request]
#5 by StellaAthena was closed Dec 27, 2020 updated Dec 27, 2020
Can't install Triton [bug]
#22 by StellaAthena was closed Jan 3, 2021 updated Jan 3, 2021
Data loading [feature request]
#3 by StellaAthena was closed Jan 4, 2021 updated Jan 4, 2021
ftfy used in create_tfrecords.py but not listed in requirements.txt [bug]
#28 by anthony-dipofi was closed Jan 4, 2021 updated Jan 4, 2021
Allow for alternative architectures [feature request]
#6 by StellaAthena was closed Jan 4, 2021 updated Jan 4, 2021
Integrate ZeRO-Powered Data Parallelism [feature request]
#20 by StellaAthena was closed Jan 5, 2021 updated Jan 5, 2021
Integrate the full power of ZeRo into the code [feature request]
#19 by StellaAthena was closed Jan 5, 2021 updated Jan 5, 2021
Fix depreciated code [bug]
#32 by StellaAthena was closed Jan 5, 2021 updated Jan 5, 2021
Hardcoded paths in gpt3_small.json [bug]
#26 by anthony-dipofi was closed Jan 5, 2021 updated Jan 5, 2021
Update documentation [documentation]
#27 by StellaAthena was closed Jan 6, 2021 updated Jan 6, 2021
Add Deepspeed Transformer Kernel [feature request] [good first issue]
#43 by sdtblck was closed Jan 10, 2021 updated Jan 10, 2021
Implement Gradient Checkpointing [feature request] [good first issue]
#55 by StellaAthena was closed Jan 12, 2021 updated Jan 12, 2021
Implement Pipeline Parallelism [feature request]
#45 by sdtblck was closed Jan 14, 2021 updated Jan 14, 2021
gpt3small is broken [bug]
#71 by StellaAthena was closed Jan 21, 2021 updated Jan 21, 2021
Implement 1-Bit Adam [feature request] [good first issue]
#69 by StellaAthena was closed Jan 23, 2021 updated Jan 23, 2021
Ensure learning rate scheduler is functioning correctly [bug] [documentation]
#44 by sdtblck was closed Jan 23, 2021 updated Jan 23, 2021
AttributeError: module 'torch.utils' has no attribute 'checkpoint' in gpt-neox/gpt-neox [bug]
#80 by kinoc was closed Jan 23, 2021 updated Jan 23, 2021
Expand to all 8 CoreWeave Machines [feature request]
#68 by StellaAthena was closed Jan 24, 2021 updated Jan 24, 2021
parameters
#94 by 1660678083Alice was closed Jan 26, 2021 updated Jan 26, 2021
How to change parameters
#95 by Carolingliang was closed Jan 26, 2021 updated Jan 26, 2021
How to calculate parameters
#100 by Carolingliang was closed Jan 28, 2021 updated Jan 28, 2021
Pipeline parallelism and gradient checkpointing (edit: and ZeRO 2!) don’t work together [bug]
#62 by StellaAthena was closed Jan 28, 2021 updated Jan 28, 2021