Skip to content

Issues: EleutherAI/gpt-neox

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Fine-tuning 20B model doesn't seem to work bug Something isn't working deprioritized Issues that are not closed, but are low priority and unlikely to be solved soon
#767 opened Jan 10, 2023 by abar-75
Add support for sequence parallelism feature request New feature or request help wanted This issue needs assistance
#812 opened Mar 7, 2023 by Quentin-Anthony
Investigate DeepSpeed Inference feature request New feature or request good first issue Good for newcomers
#845 opened Mar 21, 2023 by Quentin-Anthony
Migrate tensor parallelism code to use OSLO feature request New feature or request oslo issues relating to refactoring NeoX to use OSLO
#578 opened Mar 1, 2022 by sdtblck
3 tasks
Integrate TransformerEngine feature request New feature or request
#1098 opened Dec 21, 2023 by Quentin-Anthony
Fine-tuning gpt-neox on 8 A100s feature request New feature or request
#892 opened Apr 20, 2023 by rajhans
Introduce improvements from OSLO feature request New feature or request
#571 opened Feb 23, 2022 by hyunwoongko
Cannot load the checkpoint bug Something isn't working
#782 opened Feb 6, 2023 by jmlongriver12
Hosted Github Runners for CI feature request New feature or request
#531 opened Feb 9, 2022 by Mistobaan
2 tasks
AssertionError: zero stage 1 requires an optimizer bug Something isn't working good first issue Good for newcomers help wanted This issue needs assistance
#987 opened Jul 4, 2023 by yonglianglan
20B pretrained model inference OOM on 8xA100 40GB bug Something isn't working good first issue Good for newcomers
#901 opened Apr 23, 2023 by Mutinifni
Add StableLM as an example to the README documentation Improvements or additions to documentation
#896 opened Apr 22, 2023 by StellaAthena
block-sparse flash attention support feature request New feature or request good first issue Good for newcomers
#851 opened Mar 22, 2023 by jordiclive
Cannot convert neox model to HF bug Something isn't working
#1231 opened May 28, 2024 by srivassid
ProTip! Updated in the last three days: updated:>2024-06-27.