EleutherAI / gpt-neox Public

Notifications You must be signed in to change notification settings
Fork 996
Star 6.9k

Code
Issues 55
Pull requests 21
Actions
Projects
Wiki
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Wiki
Security
Insights

Issues: EleutherAI/gpt-neox

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Clear current search query, filters, and sorts

55 Open 379 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

instruction finetune feature request

New feature or request

#1091 by liuxinxin123 was closed Dec 5, 2023 updated Dec 5, 2023

Error in FLOPS Calculation bug

Something isn't working

#1093 by passaglia was closed Dec 6, 2023 updated Dec 6, 2023

Help with: No such file or directory: '/fsx/hailey/math-lm/gpt-neox/megatron/fused_kernels'

#1083 by andrewarrow was closed Dec 19, 2023 updated Dec 19, 2023

Bug: nvcc does not exists in runtime version of nvidia base image used in Dockerfile bug

Something isn't working

#1021 by changingivan was closed Jan 4, 2024 updated Jan 4, 2024

Apply new fused rotary embedding feature request

New feature or request

#1077 by Quentin-Anthony was closed Jan 5, 2024 updated Jan 5, 2024

Support for lm_eval 0.4.0 feature request

New feature or request

#1114 by ZhiYuanZeng was closed Jan 8, 2024 updated Jan 8, 2024

convert_hf_to_module（pipeline_parallel>1） feature request

New feature or request

#1092 by liuxinxin123 was closed Jan 15, 2024 updated Jan 15, 2024

Support Megatron LayerNorm kernel feature request

New feature or request

good first issue

Good for newcomers

#952 by Quentin-Anthony was closed Jan 26, 2024 updated Jan 26, 2024

2 tasks

Add a Contributor Guide feature request

New feature or request

good first issue

Good for newcomers

help wanted

This issue needs assistance

#1110 by Quentin-Anthony was closed Jan 29, 2024 updated Jan 29, 2024

calculate epoch

#1140 by mackmake was closed Feb 3, 2024 updated Feb 3, 2024

Error on inference of huggingface

#1142 by mackmake was closed Feb 4, 2024 updated Feb 4, 2024

some Datasets are not available bug

Something isn't working

#1071 by vangogh0318 was closed Dec 22, 2023 updated Feb 6, 2024

Add Instructions for Loading Llama2 Models feature request

New feature or request

#1051 by Quentin-Anthony was closed Feb 8, 2024 updated Feb 8, 2024

Convert HF format or raw weights of Llama2 to NEOX format feature request

New feature or request

#1112 by fmh1art was closed Feb 8, 2024 updated Feb 8, 2024

Support for custom model architecture

#1117 by itsnamgyu was closed Feb 20, 2024 updated Feb 20, 2024

Add PyTorch Memory Profiler feature request

New feature or request

#1152 by Quentin-Anthony was closed Feb 21, 2024 updated Feb 21, 2024

Port NVIDIA Nsight profiling to gpt-neox feature request

New feature or request

#1134 by Quentin-Anthony was closed Feb 23, 2024 updated Feb 23, 2024

1 of 2 tasks

Update to current versions of python and pytorch feature request

New feature or request

#1143 by segyges was closed Feb 23, 2024 updated Feb 23, 2024

Argument List too long error bug

Something isn't working

#1076 by kavlekar101 was closed Feb 23, 2024 updated Feb 23, 2024

Support Mistral Models feature request

New feature or request

#1050 by Quentin-Anthony was closed Feb 26, 2024 updated Feb 26, 2024

files in multi-node training

#1146 by mackmake was closed Feb 26, 2024 updated Feb 26, 2024

ImportError: /media/h/nvme/gpt-neox/.venv/lib/python3.8/site-packages/flash_attn_2_cuda.cpython-38-x86_64-linux-gnu.so: undefined symbol: bug

Something isn't working

#1079 by Drzhivago264 was closed Nov 14, 2023 updated Mar 1, 2024

[BUG] Setting Finetune=True causes checkpoint loading to not work correctly bug

Something isn't working

#1121 by exnx was closed Mar 1, 2024 updated Mar 1, 2024

Dockerfile installation fails to run pythia 14M bug

Something isn't working

#1165 by tf-nv was closed Mar 4, 2024 updated Mar 4, 2024

Converting Pythia checkpoint from HF to NeoX fails bug

Something isn't working

#1161 by malteos was closed Mar 4, 2024 updated Mar 4, 2024

Previous 1 2 … 12 13 14 15 16 Next

Previous Next

ProTip! Add no:assignee to see everything that’s not assigned.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly