-
Notifications
You must be signed in to change notification settings - Fork 976
Issues: EleutherAI/gpt-neox
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
NCCL error in: ProcessGroupNCCL.cpp:1269, internal error, NCCL version 2.14.3
#1147
by mackmake
was closed Apr 21, 2024
Port NVIDIA Nsight profiling to gpt-neox
feature request
New feature or request
#1134
by Quentin-Anthony
was closed Feb 23, 2024
1 of 2 tasks
FileNotFoundError thrown when training
bug
Something isn't working
#1127
by obicons
was closed Apr 21, 2024
Support for lm_eval 0.4.0
feature request
New feature or request
#1114
by ZhiYuanZeng
was closed Jan 8, 2024
Add a Contributor Guide
feature request
New feature or request
good first issue
Good for newcomers
help wanted
This issue needs assistance
#1110
by Quentin-Anthony
was closed Jan 29, 2024
convert_hf_to_module(pipeline_parallel>1)
feature request
New feature or request
#1092
by liuxinxin123
was closed Jan 15, 2024
instruction finetune
feature request
New feature or request
#1091
by liuxinxin123
was closed Dec 5, 2023
Help with: No such file or directory: '/fsx/hailey/math-lm/gpt-neox/megatron/fused_kernels'
#1083
by andrewarrow
was closed Dec 19, 2023
ImportError: /media/h/nvme/gpt-neox/.venv/lib/python3.8/site-packages/flash_attn_2_cuda.cpython-38-x86_64-linux-gnu.so: undefined symbol:
bug
Something isn't working
#1079
by Drzhivago264
was closed Nov 14, 2023
some Datasets are not available
bug
Something isn't working
#1071
by vangogh0318
was closed Dec 22, 2023
ImportError: cannot import name 'helpers' from 'megatron.data'
bug
Something isn't working
#1045
by shaunstoltz
was closed Sep 26, 2023
AssertionError: Not sure how to proceed, we were given deepspeed configs in the deepspeed arguments and deepspeed.initialize() function call
bug
Something isn't working
#1043
by shaunstoltz
was closed Sep 26, 2023
Turkish Language Support
feature request
New feature or request
#1034
by samitugal
was closed Sep 20, 2023
CPU Tests CI task is failing
bug
Something isn't working
#1025
by dashstander
was closed Nov 8, 2023
Bug: nvcc does not exists in runtime version of nvidia base image used in Dockerfile
bug
Something isn't working
#1021
by changingivan
was closed Jan 4, 2024
RuntimeError: Input tensor data type is not supported for NCCL process group: BFloat16
bug
Something isn't working
#1020
by xu-song
was closed Sep 12, 2023
can you provide pre-built images for main branch
feature request
New feature or request
#1019
by xu-song
was closed Mar 17, 2024
ProTip!
Find all open issues with in progress development work with linked:pr.