-
Notifications
You must be signed in to change notification settings - Fork 977
Issues: EleutherAI/gpt-neox
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Fix output_layer_parallelism = "row"
bug
Something isn't working
help wanted
This issue needs assistance
#916
by Quentin-Anthony
was closed May 19, 2023
[question] How were the default values of
checkpoint-factor
selected?
#914
by obicons
was closed May 1, 2023
Do pythia untied embedding and unembedding matrics?
bug
Something isn't working
#913
by Life-0-1
was closed May 1, 2023
Default of Something isn't working
output_layer_parallelism = "row"
is broken for model-parallel training
bug
#905
by cbcase
was closed May 12, 2023
cant preprocess_data using custom tokinzer.
bug
Something isn't working
#898
by snirbenyosef
was closed Apr 22, 2023
Can I pretrain a 65B or 175B model use this code?
feature request
New feature or request
#890
by lc222
was closed Apr 20, 2023
Unable to run generate text
bug
Something isn't working
#885
by hax4usincupad
was closed Apr 21, 2023
tuple index out of range in _exec_send_grads p2p.send
bug
Something isn't working
#884
by drcege
was closed Apr 22, 2023
How to convert a model parallel model to hugging face model?
feature request
New feature or request
#879
by guozhiyao
was closed Apr 13, 2023
Got an empty gpt2-tokenizer while pretraining with THE-PILE dataset
bug
Something isn't working
#876
by LostSpirit1307
was closed May 11, 2023
RuntimeError: probability tensor contains either inf, nan or element < 0
bug
Something isn't working
#871
by believeland23
was closed Apr 22, 2023
RuntimeError: The expanded size of the tensor (1) must match the existing size (10) at non-singleton dimension 2
bug
Something isn't working
#870
by crazyofapple
was closed Apr 13, 2023
read in/have a additional column in the training data
feature request
New feature or request
#865
by davidvblumenthal
was closed Apr 5, 2023
Adding data to continue training failed.
feature request
New feature or request
#860
by SefaZeng
was closed May 18, 2023
cannot import name '_ALL_VERSIONS' from 'regex._regex_core'
bug
Something isn't working
#852
by DaoD
was closed Mar 23, 2023
Does the model support the task in Chinese?
feature request
New feature or request
#844
by wsh2836741
was closed Mar 20, 2023
Instruction tuned version of GPT-Neo-X
feature request
New feature or request
#838
by djaym7
was closed Mar 16, 2023
Support fp16 scale tolerance
feature request
New feature or request
help wanted
This issue needs assistance
#829
by Quentin-Anthony
was closed May 18, 2023
how much GPU memory at least ?
feature request
New feature or request
#827
by XuJianzhi
was closed Mar 13, 2023
ProTip!
Find all open issues with in progress development work with linked:pr.