-
Notifications
You must be signed in to change notification settings - Fork 982
Issues: EleutherAI/gpt-neox
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Large model instantiation using New feature or request
DeepSpeed.zero.Init
under ZeRO-3
feature request
#1189
by R0n12
was closed Mar 19, 2024
MoE loss variable not defined in gpt j residual code path
bug
Something isn't working
#1174
by tf-nv
was closed Mar 8, 2024
Dockerfile installation fails to run pythia 14M
bug
Something isn't working
#1165
by tf-nv
was closed Mar 4, 2024
Converting Pythia checkpoint from HF to NeoX fails
bug
Something isn't working
#1161
by malteos
was closed Mar 4, 2024
Add PyTorch Memory Profiler
feature request
New feature or request
#1152
by Quentin-Anthony
was closed Feb 21, 2024
Add basic Mamba block
feature request
New feature or request
#1148
by Quentin-Anthony
was closed Mar 10, 2024
3 of 4 tasks
Update to current versions of python and pytorch
feature request
New feature or request
#1143
by segyges
was closed Feb 23, 2024
Port NVIDIA Nsight profiling to gpt-neox
feature request
New feature or request
#1134
by Quentin-Anthony
was closed Feb 23, 2024
1 of 2 tasks
Convert HF format or raw weights of Llama2 to NEOX format
feature request
New feature or request
#1112
by fmh1art
was closed Feb 8, 2024
Add a Contributor Guide
feature request
New feature or request
good first issue
Good for newcomers
help wanted
This issue needs assistance
#1110
by Quentin-Anthony
was closed Jan 29, 2024
Apply new fused rotary embedding
feature request
New feature or request
#1077
by Quentin-Anthony
was closed Jan 5, 2024
Recent LR Scheduler change does not account for inference/evaluation
bug
Something isn't working
#1059
by dashstander
was closed Oct 17, 2023
Add Instructions for Loading Llama2 Models
feature request
New feature or request
#1051
by Quentin-Anthony
was closed Feb 8, 2024
resume from checkpoint doesn't continue decaying the learning rate - it stays constant
bug
Something isn't working
#1029
by exnx
was closed Sep 27, 2023
CPU Tests CI task is failing
bug
Something isn't working
#1025
by dashstander
was closed Nov 8, 2023
Bug: nvcc does not exists in runtime version of nvidia base image used in Dockerfile
bug
Something isn't working
#1021
by changingivan
was closed Jan 4, 2024
'attention.bias' and 'attention.masked_bias' not in Something isn't working
hf_layer.state_dict()
when converting gpt-neox model to huggingface
bug
#1013
by johntzwei
was closed Sep 13, 2023
RotaryEmbedding computation is wrong for certain position/feature pairs in reduced precision (both fp16 and bfloat)
bug
Something isn't working
#1003
by cbcase
was closed Sep 25, 2023
The class with the same name was imported twice
bug
Something isn't working
#999
by D-X-Y
was closed Sep 25, 2023
how to use when --mask-before-token have values
feature request
New feature or request
#995
by xealml
was closed Oct 4, 2023
WARNING: shuffle index length is not equal to sample index length
bug
Something isn't working
#972
by 1ittlesnow
was closed Jun 22, 2023
bf16 is incompatible with pipe parallelism
bug
Something isn't working
#963
by Life-0-1
was closed Sep 18, 2023
Robust testing suite
feature request
New feature or request
good first issue
Good for newcomers
help wanted
This issue needs assistance
#957
by StellaAthena
was closed Dec 4, 2023
25 tasks
Previous Next
ProTip!
What’s not been updated in a month: updated:<2024-06-28.