-
Notifications
You must be signed in to change notification settings - Fork 971
Issues: EleutherAI/gpt-neox
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Officially Support AMD GPUs
feature request
New feature or request
#954
by Quentin-Anthony
was closed Apr 21, 2024
4 tasks done
can you provide pre-built images for main branch
feature request
New feature or request
#1019
by xu-song
was closed Mar 17, 2024
Pretrained weights for smaller models
feature request
New feature or request
#598
by malteos
was closed May 5, 2022
Code Cleanup
documentation
Improvements or additions to documentation
#208
by sdtblck
was closed Apr 30, 2021
3 tasks done
Restore ability to get logits from generation?
feature request
New feature or request
help wanted
This issue needs assistance
#588
by moyix
was closed Sep 8, 2022
Pipeline parallelism and gradient checkpointing (edit: and ZeRO 2!) don’t work together
bug
Something isn't working
#62
by StellaAthena
was closed Jan 28, 2021
Error on interactive generation
bug
Something isn't working
good first issue
Good for newcomers
#555
by tonigi
was closed Sep 25, 2022
Wrong rotary embedding result between transformers structure and Megatron structure
bug
Something isn't working
#873
by GGGGGGXY
was closed Apr 18, 2023
Add Mixture of Experts
feature request
New feature or request
#479
by sdtblck
was closed Mar 7, 2024
lm_dataformat
is outdated
bug
#552
by 65536william
was closed Sep 18, 2022
Create experiment runners
feature request
New feature or request
good first issue
Good for newcomers
#7
by StellaAthena
was closed Feb 17, 2021
2 tasks
Integrate the full power of ZeRo into the code
feature request
New feature or request
#19
by StellaAthena
was closed Jan 5, 2021
Cannot perform inference, be it unconditional. input-file or interactive
bug
Something isn't working
#1228
by srivassid
was closed May 30, 2024
ftfy used in create_tfrecords.py but not listed in requirements.txt
bug
Something isn't working
#28
by anthony-dipofi
was closed Jan 4, 2021
Dataset downloads <number of GPUs> times when running deepspeed train.py
#37
by sdtblck
was closed Jan 5, 2021
Expand to all 8 CoreWeave Machines
feature request
New feature or request
#68
by StellaAthena
was closed Jan 24, 2021
Fix DeepSpeed (ZeRO2 + Pipeline Parallel)
bug
Something isn't working
help wanted
This issue needs assistance
#67
by StellaAthena
was closed Jan 16, 2021
Implement Pipeline Parallelism
feature request
New feature or request
#45
by sdtblck
was closed Jan 14, 2021
(T5) Relative positional encodings?
feature request
New feature or request
#66
by CRG2K
was closed Mar 4, 2021
Previous Next
ProTip!
Follow long discussions with comments:>50.