Issues: EleutherAI/gpt-neox

Issues list

How to set the ffn hidden size parameter in gpt neox (labels: feature request)
#1230 opened May 28, 2024 by IronMan-WangJinxi updated Jun 19, 2024
Cannot convert neox model to HF (labels: bug)
#1231 opened May 28, 2024 by srivassid updated Jun 6, 2024
LoRA Support (labels: feature request)
#1204 opened Apr 23, 2024 by Quentin-Anthony updated Apr 23, 2024
4 tasks
Integrate TransformerEngine (labels: feature request)
#1098 opened Dec 21, 2023 by Quentin-Anthony updated Mar 13, 2024
Problems on generating with llama model
#921 opened May 4, 2023 by wiio12 updated Mar 4, 2024
PyTorch Lightning Fused optimizer step (labels: feature request)
#1160 opened Feb 29, 2024 by jahatef updated Feb 29, 2024
Tests fail when run with pytest --forked (labels: bug)
#1132 opened Jan 25, 2024 by segyges updated Feb 21, 2024
[BUG?] Higher "gradient_accumulation_steps" still increases memory usage a lot (labels: bug)
#1123 opened Jan 15, 2024 by exnx updated Feb 1, 2024
Investigate DeepSpeed Inference (labels: feature request, good first issue)
#845 opened Mar 21, 2023 by Quentin-Anthony updated Jan 25, 2024
Create Singularity Container (labels: feature request, good first issue, help wanted)
#1119 opened Jan 11, 2024 by Quentin-Anthony updated Jan 19, 2024
AssertionError: zero stage 1 requires an optimizer (labels: bug, good first issue, help wanted)
#987 opened Jul 4, 2023 by yonglianglan updated Nov 27, 2023
Port DeepSpeed Ulysses (labels: feature request)
#1078 opened Nov 12, 2023 by Quentin-Anthony updated Nov 26, 2023
Add support for sequence parallelism (labels: feature request, help wanted)
#812 opened Mar 7, 2023 by Quentin-Anthony updated Nov 26, 2023
Interoperability and GPT-NeoX (labels: documentation, question)
#1058 opened Oct 12, 2023 by StellaAthena updated Nov 12, 2023
Fine-tuning gpt-neox on 8 A100s (labels: feature request)
#892 opened Apr 20, 2023 by rajhans updated Nov 7, 2023
Convert HF Llama Checkpoints to Neox Checkpoints (labels: feature request)
#994 opened Jul 10, 2023 by sxthunder updated Oct 12, 2023
Support for Mosaic Models (labels: feature request)
#1057 opened Oct 6, 2023 by rajveer43 updated Oct 11, 2023
Add StableLM as an example to the README (labels: documentation)
#896 opened Apr 22, 2023 by StellaAthena updated Oct 4, 2023
RuntimeError: Error(s) in loading state_dict for EmbeddingPipe: size mismatch for word_embeddings.weight (labels: bug, good first issue, help wanted)
#645 opened Jul 7, 2022 by mcao516 updated Oct 3, 2023
DeepSpeed Sparse Attention is Broken (labels: bug)
#863 opened Mar 29, 2023 by dashstander updated Oct 3, 2023
block-sparse flash attention support (labels: feature request, good first issue)
#851 opened Mar 22, 2023 by jordiclive updated Sep 25, 2023
[BUG] Inconsistent loss between overlap_comm=true and overlap_comm=false (labels: bug)
#1004 opened Jul 27, 2023 by 0x6b64 updated Sep 15, 2023