Skip to content

Issues: EleutherAI/gpt-neox

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Finetune feature request New feature or request
#1088 by liuxinxin123 was closed Dec 5, 2023 updated Jun 28, 2024
Cannot perform inference, be it unconditional. input-file or interactive bug Something isn't working
#1228 by srivassid was closed May 30, 2024 updated May 30, 2024
'intermediate_size' not set in tools/ckpts/convert_neox_to_hf.py for neox model architecture bug Something isn't working
#1208 by jvendrow was closed May 4, 2024 updated May 4, 2024
When llama uses bf16 training, there is an abnormal loss bug Something isn't working
#947 by suolyer was closed Apr 21, 2024 updated Apr 21, 2024
Is there a way to disable data sampling? feature request New feature or request
#1005 by haozhouamzn was closed Apr 21, 2024 updated Apr 21, 2024
FileNotFoundError thrown when training bug Something isn't working
#1127 by obicons was closed Apr 21, 2024 updated Apr 21, 2024
How to convert gpt-neox to llama architecture..?
#1151 by yuri-son was closed Apr 21, 2024 updated Apr 21, 2024
is there any ignore_index ability in the loss calculation? feature request New feature or request
#1193 by exnx was closed Apr 21, 2024 updated Apr 21, 2024
بهترین تعمیرگاه موبایل در مشهد مقدس bug Something isn't working
#1173 by rezaarefi was closed Apr 19, 2024 updated Apr 19, 2024
Large model instantiation using DeepSpeed.zero.Init under ZeRO-3 feature request New feature or request
#1189 by R0n12 was closed Mar 19, 2024 updated Mar 19, 2024
can you provide pre-built images for main branch feature request New feature or request
#1019 by xu-song was closed Mar 17, 2024 updated Mar 17, 2024
continue training from a checkpoint with different number of gpu/node
#1158 by mackmake was closed Mar 15, 2024 updated Mar 15, 2024
Add basic Mamba block feature request New feature or request
#1148 by Quentin-Anthony was closed Mar 10, 2024 updated Mar 10, 2024
3 of 4 tasks
MoE loss variable not defined in gpt j residual code path bug Something isn't working
#1174 by tf-nv was closed Mar 8, 2024 updated Mar 8, 2024
misindexing when converting llama weights to gpt-neox format bug Something isn't working
#971 by CRSilkworth was closed Feb 28, 2024 updated Mar 6, 2024
pipe_parallel_size = 1 using DeepSpeed PipelineEngine bug Something isn't working
#1172 by DayOfThePenguin was closed Mar 6, 2024 updated Mar 6, 2024
Dockerfile installation fails to run pythia 14M bug Something isn't working
#1165 by tf-nv was closed Mar 4, 2024 updated Mar 4, 2024
files in multi-node training
#1146 by mackmake was closed Feb 26, 2024 updated Feb 26, 2024
Port NVIDIA Nsight profiling to gpt-neox feature request New feature or request
#1134 by Quentin-Anthony was closed Feb 23, 2024 updated Feb 23, 2024
1 of 2 tasks
Support for custom model architecture
#1117 by itsnamgyu was closed Feb 20, 2024 updated Feb 20, 2024
some Datasets are not available bug Something isn't working
#1071 by vangogh0318 was closed Dec 22, 2023 updated Feb 6, 2024
Error on inference of huggingface
#1142 by mackmake was closed Feb 4, 2024 updated Feb 4, 2024
calculate epoch
#1140 by mackmake was closed Feb 3, 2024 updated Feb 3, 2024
ProTip! Updated in the last three days: updated:>2024-07-02.