Issues: EleutherAI/gpt-neox
instruction finetune
feature request
New feature or request
#1091
by liuxinxin123
was closed Dec 5, 2023
updated Dec 5, 2023
Error in FLOPS Calculation
bug
Something isn't working
#1093
by passaglia
was closed Dec 6, 2023
updated Dec 6, 2023
Help with: No such file or directory: '/fsx/hailey/math-lm/gpt-neox/megatron/fused_kernels'
#1083
by andrewarrow
was closed Dec 19, 2023
updated Dec 19, 2023
Bug: nvcc does not exists in runtime version of nvidia base image used in Dockerfile
bug
Something isn't working
#1021
by changingivan
was closed Jan 4, 2024
updated Jan 4, 2024
Apply new fused rotary embedding
feature request
New feature or request
#1077
by Quentin-Anthony
was closed Jan 5, 2024
updated Jan 5, 2024
Support for lm_eval 0.4.0
feature request
New feature or request
#1114
by ZhiYuanZeng
was closed Jan 8, 2024
updated Jan 8, 2024
convert_hf_to_module(pipeline_parallel>1)
feature request
New feature or request
#1092
by liuxinxin123
was closed Jan 15, 2024
updated Jan 15, 2024
Support Megatron LayerNorm kernel
feature request
New feature or request
good first issue
Good for newcomers
#952
by Quentin-Anthony
was closed Jan 26, 2024
updated Jan 26, 2024
2 tasks
Add a Contributor Guide
feature request
New feature or request
good first issue
Good for newcomers
help wanted
This issue needs assistance
#1110
by Quentin-Anthony
was closed Jan 29, 2024
updated Jan 29, 2024
some Datasets are not available
bug
Something isn't working
#1071
by vangogh0318
was closed Dec 22, 2023
updated Feb 6, 2024
Add Instructions for Loading Llama2 Models
feature request
New feature or request
#1051
by Quentin-Anthony
was closed Feb 8, 2024
updated Feb 8, 2024
Convert HF format or raw weights of Llama2 to NEOX format
feature request
New feature or request
#1112
by fmh1art
was closed Feb 8, 2024
updated Feb 8, 2024
Support for custom model architecture
#1117
by itsnamgyu
was closed Feb 20, 2024
updated Feb 20, 2024
Add PyTorch Memory Profiler
feature request
New feature or request
#1152
by Quentin-Anthony
was closed Feb 21, 2024
updated Feb 21, 2024
Port NVIDIA Nsight profiling to gpt-neox
feature request
New feature or request
#1134
by Quentin-Anthony
was closed Feb 23, 2024
updated Feb 23, 2024
1 of 2 tasks
Update to current versions of python and pytorch
feature request
New feature or request
#1143
by segyges
was closed Feb 23, 2024
updated Feb 23, 2024
Argument List too long error
bug
Something isn't working
#1076
by kavlekar101
was closed Feb 23, 2024
updated Feb 23, 2024
Support Mistral Models
feature request
New feature or request
#1050
by Quentin-Anthony
was closed Feb 26, 2024
updated Feb 26, 2024
ImportError: /media/h/nvme/gpt-neox/.venv/lib/python3.8/site-packages/flash_attn_2_cuda.cpython-38-x86_64-linux-gnu.so: undefined symbol:
bug
Something isn't working
#1079
by Drzhivago264
was closed Nov 14, 2023
updated Mar 1, 2024
[BUG] Setting Finetune=True causes checkpoint loading to not work correctly
bug
Something isn't working
#1121
by exnx
was closed Mar 1, 2024
updated Mar 1, 2024
Dockerfile installation fails to run pythia 14M
bug
Something isn't working
#1165
by tf-nv
was closed Mar 4, 2024
updated Mar 4, 2024
Converting Pythia checkpoint from HF to NeoX fails
bug
Something isn't working
#1161
by malteos
was closed Mar 4, 2024
updated Mar 4, 2024