Issues: EleutherAI/gpt-neox
AttributeError: 'NoneType' object has no attribute 'dp_process_group' at evaluating medium gpt-2 model
bug
Something isn't working
#474
by sameeravithana
was closed Dec 12, 2021
Negative document indices caused by 64 bit integer stored in a 32 bit integer array.
bug
#493
by pwstegman
was closed Apr 3, 2023
HF Equivalent Pretrained Models
feature request
New feature or request
#489
by sameeravithana
was closed Sep 25, 2022
Add FLAN and T0 finetuning data
feature request
#486
by StellaAthena
was closed Apr 23, 2023
Ensure NeoX is compatible with HF
bug
#485
by StellaAthena
was closed Feb 7, 2022
Update QuickStart to Something Usable
feature request
#484
by StellaAthena
was closed Sep 18, 2022
Skipped 50 iterations in a row due to Overflow - Exiting training.
bug
#482
by ScTof
was closed Dec 17, 2021
Add Mixture of Experts
feature request
#479
by sdtblck
was closed Mar 7, 2024
ZeRO 2 cpu_offload causes RuntimeError: expected input to be on cuda
bug
#478
by pwstegman
was closed Sep 18, 2022
Ways to load GPT-NeoX checkpoints in GPT-Neo for TPU training?
#475
by frankxu2004
was closed Dec 4, 2021
Running through Dockerfile broken
bug
#419
by VHellendoorn
was closed Oct 12, 2021
Hangs up when finishing up a medium model training
bug
#473
by sameeravithana
was closed Jan 7, 2022
Sparse attention map::at triton error
bug
#472
by pwstegman
was closed Nov 29, 2021
Add NeoXArgs Documentation Generation to CI
feature request
#462
by sdtblck
was closed Mar 22, 2022
Add black code formatting to CI
feature request
#461
by sdtblck
was closed Feb 12, 2022
Fix checkpoint / config file management
feature request
#459
by sdtblck
was closed Nov 26, 2021
Handling multiple fields of the custom input data in the preprocess_data.py
bug
#455
by sameeravithana
was closed Sep 18, 2022
ImportError: cannot import name 'LocalSlidingWindowSparsityConfig' from 'deepspeed.ops.sparse_attention.sparsity_config
bug
#431
by rokosbasilisk
was closed Oct 18, 2021
ModuleNotFoundError: No module named 'deepspeed.ops.op_builder' on import deepspeed
bug
#425
by shankyemcee
was closed Sep 25, 2022