Issues: EleutherAI/gpt-neox
Issues list
Distributed training with multiple nodes.
bug
Something isn't working
#734
by cdj0311
was closed Dec 28, 2022
misindexing when converting llama weights to gpt-neox format
bug
Something isn't working
#971
by CRSilkworth
was closed Feb 28, 2024
Unable to install dependencies: No matching distribution found for triton==0.4.2
bug
Something isn't working
#628
by tsndr
was closed Sep 19, 2022
Add support for sequence parallelism
feature request
New feature or request
help wanted
This issue needs assistance
#812
by Quentin-Anthony
was closed Aug 23, 2024
Pipeline parallelism and gradient checkpointing (edit: and ZeRO 2!) don’t work together
bug
Something isn't working
#62
by StellaAthena
was closed Jan 28, 2021
Configs are not broadcasted properly in multi-node training
bug
Something isn't working
#925
by silverriver
was closed May 23, 2023
Adding data to continue training failed.
feature request
New feature or request
#860
by SefaZeng
was closed May 18, 2023
Running through Dockerfile broken
bug
Something isn't working
#419
by VHellendoorn
was closed Oct 12, 2021
Create experiment runners
feature request
New feature or request
good first issue
Good for newcomers
#7
by StellaAthena
was closed Feb 17, 2021
2 tasks
Apply new fused rotary embedding
feature request
New feature or request
#1077
by Quentin-Anthony
was closed Jan 5, 2024
Error on interactive generation
bug
Something isn't working
good first issue
Good for newcomers
#555
by tonigi
was closed Sep 25, 2022
ModuleNotFoundError: No module named 'deepspeed.ops.op_builder' on import deepspeed
bug
Something isn't working
#425
by shankyemcee
was closed Sep 25, 2022
Running with bf16 error
bug
Something isn't working
good first issue
Good for newcomers
#939
by Life-0-1
was closed May 18, 2023
[Broken] Generation with Sequential Model
bug
Something isn't working
help wanted
This issue needs assistance
#854
by satpalsr
was closed May 15, 2023
Support Mistral Models
feature request
New feature or request
#1050
by Quentin-Anthony
was closed Feb 26, 2024
RotaryEmbedding computation is wrong for certain position/feature pairs in reduced precision (both fp16 and bfloat)
bug
Something isn't working
#1003
by cbcase
was closed Sep 25, 2023
bf16 is incompatible with pipe parallelism
bug
Something isn't working
#963
by Life-0-1
was closed Sep 18, 2023
Robust testing suite
feature request
New feature or request
good first issue
Good for newcomers
help wanted
This issue needs assistance
#957
by StellaAthena
was closed Dec 4, 2023
25 tasks
Make Configs Consistent
bug
Something isn't working
good first issue
Good for newcomers
#920
by StellaAthena
was closed May 9, 2023
Gibberish text generation after converting to Huggingface.
#712
by kanwatchara-k
was closed Nov 15, 2022