Skip to content

Issues: EleutherAI/gpt-neox

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Officially Support AMD GPUs feature request New feature or request
#954 by Quentin-Anthony was closed Apr 21, 2024 updated Apr 21, 2024
4 tasks done
Large model instantiation using DeepSpeed.zero.Init under ZeRO-3 feature request New feature or request
#1189 by R0n12 was closed Mar 19, 2024 updated Mar 19, 2024
Add basic Mamba block feature request New feature or request
#1148 by Quentin-Anthony was closed Mar 10, 2024 updated Mar 10, 2024
3 of 4 tasks
MoE loss variable not defined in gpt j residual code path bug Something isn't working
#1174 by tf-nv was closed Mar 8, 2024 updated Mar 8, 2024
Add Mixture of Experts feature request New feature or request
#479 by sdtblck was closed Mar 7, 2024 updated Mar 7, 2024
3
Converting Pythia checkpoint from HF to NeoX fails bug Something isn't working
#1161 by malteos was closed Mar 4, 2024 updated Mar 4, 2024
Dockerfile installation fails to run pythia 14M bug Something isn't working
#1165 by tf-nv was closed Mar 4, 2024 updated Mar 4, 2024
Update to current versions of python and pytorch feature request New feature or request
#1143 by segyges was closed Feb 23, 2024 updated Feb 23, 2024
Port NVIDIA Nsight profiling to gpt-neox feature request New feature or request
#1134 by Quentin-Anthony was closed Feb 23, 2024 updated Feb 23, 2024
1 of 2 tasks
Add PyTorch Memory Profiler feature request New feature or request
#1152 by Quentin-Anthony was closed Feb 21, 2024 updated Feb 21, 2024
Add Instructions for Loading Llama2 Models feature request New feature or request
#1051 by Quentin-Anthony was closed Feb 8, 2024 updated Feb 8, 2024
Convert HF format or raw weights of Llama2 to NEOX format feature request New feature or request
#1112 by fmh1art was closed Feb 8, 2024 updated Feb 8, 2024
Add a Contributor Guide feature request New feature or request good first issue Good for newcomers help wanted This issue needs assistance
#1110 by Quentin-Anthony was closed Jan 29, 2024 updated Jan 29, 2024
Apply new fused rotary embedding feature request New feature or request
#1077 by Quentin-Anthony was closed Jan 5, 2024 updated Jan 5, 2024
Bug: nvcc does not exists in runtime version of nvidia base image used in Dockerfile bug Something isn't working
#1021 by changingivan was closed Jan 4, 2024 updated Jan 4, 2024
Error in FLOPS Calculation bug Something isn't working
#1093 by passaglia was closed Dec 6, 2023 updated Dec 6, 2023
Robust testing suite feature request New feature or request good first issue Good for newcomers help wanted This issue needs assistance
#957 by StellaAthena was closed Dec 4, 2023 updated Dec 4, 2023
25 tasks
CPU Tests CI task is failing bug Something isn't working
#1025 by dashstander was closed Nov 8, 2023 updated Nov 8, 2023
Implement Bf16 feature request New feature or request
#302 by sdtblck was closed Jun 22, 2021 updated Nov 7, 2023
1 task
Incorporation of LION Optimizer in GPT-NeoX feature request New feature or request good first issue Good for newcomers help wanted This issue needs assistance
#950 by withwsf was closed Oct 20, 2023 updated Oct 20, 2023
Recent LR Scheduler change does not account for inference/evaluation bug Something isn't working
#1059 by dashstander was closed Oct 17, 2023 updated Oct 17, 2023
how to use when --mask-before-token have values feature request New feature or request
#995 by xealml was closed Oct 4, 2023 updated Oct 4, 2023
Organize the tools feature request New feature or request help wanted This issue needs assistance
#856 by Quentin-Anthony was closed Oct 2, 2023 updated Oct 2, 2023
Better Document Distributed Jobs feature request New feature or request good first issue Good for newcomers
#953 by Quentin-Anthony was closed Sep 29, 2023 updated Sep 29, 2023
1 task
A question about flops calculation
#981 by CSlearnerZM was closed Sep 27, 2023 updated Sep 27, 2023
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.