EleutherAI / gpt-neox Public

Notifications You must be signed in to change notification settings
Fork 971
Star 6.7k

Code
Issues 54
Pull requests 25
Actions
Projects 2
Wiki
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Wiki
Security
Insights

Issues: EleutherAI/gpt-neox

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Clear current search query, filters, and sorts

54 Open 373 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

Add Basic RWKV Block to GPT-NeoX feature request

New feature or request

#1167 by Quentin-Anthony was closed Jun 19, 2024 updated Jun 19, 2024

4 tasks done

Cannot perform inference, be it unconditional. input-file or interactive bug

Something isn't working

#1228 by srivassid was closed May 30, 2024 updated May 30, 2024

'intermediate_size' not set in tools/ckpts/convert_neox_to_hf.py for neox model architecture bug

Something isn't working

#1208 by jvendrow was closed May 4, 2024 updated May 4, 2024

distributed training with multipy nodes. bug

Something isn't working

#734 by cdj0311 was closed Dec 28, 2022 updated Apr 22, 2024

When llama uses bf16 training, there is an abnormal loss bug

Something isn't working

#947 by suolyer was closed Apr 21, 2024 updated Apr 21, 2024

Officially Support AMD GPUs feature request

New feature or request

#954 by Quentin-Anthony was closed Apr 21, 2024 updated Apr 21, 2024

4 tasks done

Is there a way to disable data sampling? feature request

New feature or request

#1005 by haozhouamzn was closed Apr 21, 2024 updated Apr 21, 2024

FileNotFoundError thrown when training bug

Something isn't working

#1127 by obicons was closed Apr 21, 2024 updated Apr 21, 2024

NCCL error in: ProcessGroupNCCL.cpp:1269, internal error, NCCL version 2.14.3

#1147 by mackmake was closed Apr 21, 2024 updated Apr 21, 2024

How to convert gpt-neox to llama architecture..?

#1151 by yuri-son was closed Apr 21, 2024 updated Apr 21, 2024

is there any ignore_index ability in the loss calculation? feature request

New feature or request

#1193 by exnx was closed Apr 21, 2024 updated Apr 21, 2024

بهترین تعمیرگاه موبایل در مشهد مقدس bug

Something isn't working

#1173 by rezaarefi was closed Apr 19, 2024 updated Apr 19, 2024

Large model instantiation using DeepSpeed.zero.Init under ZeRO-3 feature request

New feature or request

#1189 by R0n12 was closed Mar 19, 2024 updated Mar 19, 2024

Is there a way to train on the entire dataset for N epochs without specifying train-iters?

#1164 by javirandor was closed Mar 18, 2024 updated Mar 18, 2024

can you provide pre-built images for main branch feature request

New feature or request

#1019 by xu-song was closed Mar 17, 2024 updated Mar 17, 2024

continue training from a checkpoint with different number of gpu/node

#1158 by mackmake was closed Mar 15, 2024 updated Mar 15, 2024

Add basic Mamba block feature request

New feature or request

#1148 by Quentin-Anthony was closed Mar 10, 2024 updated Mar 10, 2024

3 of 4 tasks

MoE loss variable not defined in gpt j residual code path bug

Something isn't working

#1174 by tf-nv was closed Mar 8, 2024 updated Mar 8, 2024

Add Mixture of Experts feature request

New feature or request

#479 by sdtblck was closed Mar 7, 2024 updated Mar 7, 2024

misindexing when converting llama weights to gpt-neox format bug

Something isn't working

#971 by CRSilkworth was closed Feb 28, 2024 updated Mar 6, 2024

pipe_parallel_size = 1 using DeepSpeed PipelineEngine bug

Something isn't working

#1172 by DayOfThePenguin was closed Mar 6, 2024 updated Mar 6, 2024

Converting Pythia checkpoint from HF to NeoX fails bug

Something isn't working

#1161 by malteos was closed Mar 4, 2024 updated Mar 4, 2024

Dockerfile installation fails to run pythia 14M bug

Something isn't working

#1165 by tf-nv was closed Mar 4, 2024 updated Mar 4, 2024

[BUG] Setting Finetune=True causes checkpoint loading to not work correctly bug

Something isn't working

#1121 by exnx was closed Mar 1, 2024 updated Mar 1, 2024

ImportError: /media/h/nvme/gpt-neox/.venv/lib/python3.8/site-packages/flash_attn_2_cuda.cpython-38-x86_64-linux-gnu.so: undefined symbol: bug

Something isn't working

#1079 by Drzhivago264 was closed Nov 14, 2023 updated Mar 1, 2024

Previous 1 2 3 4 5 … 14 15 Next

Previous Next

ProTip! Find all open issues with in progress development work with linked:pr.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly