Issues: EleutherAI/gpt-neox

Issues list

Data loading [feature request]
#3 by StellaAthena was closed Jan 4, 2021
Integrate DeepSpeed [feature request]
#4 by StellaAthena was closed Dec 26, 2020
Build a Tensorboard [feature request]
#5 by StellaAthena was closed Dec 27, 2020
Allow for alternative architectures [feature request]
#6 by StellaAthena was closed Jan 4, 2021
Integrate the full power of ZeRo into the code [feature request]
#19 by StellaAthena was closed Jan 5, 2021
Integrate ZeRO-Powered Data Parallelism [feature request]
#20 by StellaAthena was closed Jan 5, 2021
Hardcoded paths in gpt3_small.json [bug]
#26 by anthony-dipofi was closed Jan 5, 2021
Update documentation [documentation]
#27 by StellaAthena was closed Jan 6, 2021
Fix depreciated code [bug]
#32 by StellaAthena was closed Jan 5, 2021
Write dataset class that tokenizes on the fly [feature request]
#40 by sdtblck was closed Feb 15, 2021
Add Deepspeed Transformer Kernel [feature request] [good first issue]
#43 by sdtblck was closed Jan 10, 2021
Ensure learning rate scheduler is functioning correctly [bug] [documentation]
#44 by sdtblck was closed Jan 23, 2021
Implement Gradient Checkpointing [feature request] [good first issue]
#55 by StellaAthena was closed Jan 12, 2021
Implement Generation / Eval with deepspeed model engine [feature request]
#58 by sdtblck was closed Feb 15, 2021
parameters
#94 by 1660678083Alice was closed Jan 26, 2021
How to change parameters
#95 by Carolingliang was closed Jan 26, 2021
How to calculate parameters
#100 by Carolingliang was closed Jan 28, 2021
There is no train_script.py in this project [documentation]
#104 by Carolingliang was closed Feb 2, 2021
NameError: name 'RepeatingLoader' is not defined [bug]
#112 by Carolingliang was closed Feb 4, 2021
Version conflict on colab [bug]
#116 by ShivanshuPurohit was closed Feb 17, 2021