Skip to content

Issues: EleutherAI/gpt-neox

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

[Broken] Generation with Sequential Model bug Something isn't working help wanted This issue needs assistance
#854 by satpalsr was closed May 15, 2023
Add FLAN and T0 finetuning data feature request New feature or request
#486 by StellaAthena was closed Apr 23, 2023
Add support for mutransfer feature request New feature or request
#679 by Quentin-Anthony was closed Dec 11, 2022
5 of 7 tasks
The class with the same name was imported twice bug Something isn't working
#999 by D-X-Y was closed Sep 25, 2023
MoE loss variable not defined in gpt j residual code path bug Something isn't working
#1174 by tf-nv was closed Mar 8, 2024
Build a Tensorboard feature request New feature or request
#5 by StellaAthena was closed Dec 27, 2020
Allow for alternative architectures feature request New feature or request
#6 by StellaAthena was closed Jan 4, 2021
Create experiment runners feature request New feature or request good first issue Good for newcomers
#7 by StellaAthena was closed Feb 17, 2021
2 tasks
Integrate the full power of ZeRo into the code feature request New feature or request
#19 by StellaAthena was closed Jan 5, 2021
Can't install Triton bug Something isn't working
#22 by StellaAthena was closed Jan 3, 2021
Hardcoded paths in gpt3_small.json bug Something isn't working
#26 by anthony-dipofi was closed Jan 5, 2021
Fix depreciated code bug Something isn't working
#32 by StellaAthena was closed Jan 5, 2021
Write dataset class that tokenizes on the fly feature request New feature or request
#40 by sdtblck was closed Feb 15, 2021
Fix tfrecord dataset to load less files into memory bug Something isn't working
#41 by sdtblck was closed Mar 2, 2021
Ensure learning rate scheduler is functioning correctly bug Something isn't working documentation Improvements or additions to documentation
#44 by sdtblck was closed Jan 23, 2021
Implement Pipeline Parallelism feature request New feature or request
#45 by sdtblck was closed Jan 14, 2021
Implement Gradient Checkpointing feature request New feature or request good first issue Good for newcomers
#55 by StellaAthena was closed Jan 12, 2021
Implement Generation / Eval with deepspeed model engine feature request New feature or request
#58 by sdtblck was closed Feb 15, 2021
(T5) Relative positional encodings? feature request New feature or request
#66 by CRG2K was closed Mar 4, 2021
Fix DeepSpeed (ZeRO2 + Pipeline Parallel) bug Something isn't working help wanted This issue needs assistance
#67 by StellaAthena was closed Jan 16, 2021
Expand to all 8 CoreWeave Machines feature request New feature or request
#68 by StellaAthena was closed Jan 24, 2021
ProTip! no:milestone will show everything without a milestone.