pythia-13b size mismatch #41
Comments
Does this occur in any 13b checkpoints other than 143000? I looked at a subset of checkpoints and saw the right config filesize for all the others I looked at. This seems to just be the wrong |
When I run the following code to load up pythia-13b, I get a bunch of size mismatch errors.
Errors:
These continue for every layer of the model. When I use ignore_mismatched_sizes=True in GPTNeoXForCausalLM.from_pretrained, I get this error instead:
I imagine that some config just needs to be updated to reflect the actual model sizes? I do not get this error with any of the smaller models.
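One way to narrow down a mismatch like this is to compare the parameter shapes the config implies against the shapes stored in the checkpoint. The snippet below is only a minimal sketch of that comparison: find_shape_mismatches is a hypothetical helper (not part of transformers), and the shapes are toy values chosen for illustration, not the real pythia-13b dimensions.

```python
from typing import Dict, List, Tuple

Shape = Tuple[int, ...]


def find_shape_mismatches(
    expected: Dict[str, Shape], actual: Dict[str, Shape]
) -> List[Tuple[str, Shape, Shape]]:
    """Return (name, expected_shape, actual_shape) for each parameter
    present in both maps whose shapes differ (hypothetical helper)."""
    mismatches = []
    # Only compare names that exist on both sides; sort for stable output.
    for name in sorted(expected.keys() & actual.keys()):
        want, got = tuple(expected[name]), tuple(actual[name])
        if want != got:
            mismatches.append((name, want, got))
    return mismatches


# Toy shapes for illustration only; these are NOT the real pythia-13b sizes.
config_shapes = {
    "gpt_neox.layers.0.attention.dense.weight": (8, 8),
    "gpt_neox.layers.0.mlp.dense_h_to_4h.weight": (32, 8),
    "gpt_neox.final_layer_norm.weight": (8,),
}
checkpoint_shapes = {
    "gpt_neox.layers.0.attention.dense.weight": (16, 16),
    "gpt_neox.layers.0.mlp.dense_h_to_4h.weight": (64, 16),
    "gpt_neox.final_layer_norm.weight": (16,),
}

for name, want, got in find_shape_mismatches(config_shapes, checkpoint_shapes):
    print(f"{name}: config expects {want}, checkpoint has {got}")
```

With a real model, the two maps could be built from something like {k: tuple(v.shape) for k, v in model.state_dict().items()} on each side. If every layer differs by the same factor, that would point at a single wrong config value rather than corrupted weights.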