convert fine tuned model to ggml? #280
Comments
This seems to be an error with the tokenizer you are using. I encountered similar issues and decided to refactor and combine many of the conversion scripts for our |
So something very interesting happened. Just for fun, I copied over all of our fine-tuned files but deleted the .bin, then copied the GGML .bin into the same directory. With all of the files from gpt-cmd in place and only the converted GGML .bin, the model loaded. But it acted strange. Test prompt: Prompt = my birth date is august 8, 1942.\n Any ideas? 8-) |
The included GGML tokenizer is very lossy, in |
cool will do! |
We are trying to convert a fine-tuned GPT-J model to GGML,
but the conversion always fails with this error:
python convert-h5-to-ggml.py /home/silvacarl/Desktop/models/gpt-cmd 1
[2023-06-23 15:54:20,338] [INFO] [real_accelerator.py:110:get_accelerator] Setting ds_accelerator to cuda (auto detect)
╭─────────────────────────────── Traceback (most recent call last) ────────────────────────────────╮
│ /home/silvacarl/Desktop/ggml/examples/gpt-j/convert-h5-to-ggml.py:113 in │
│ │
│ 110 │ fout.write(text) │
│ 111 │
│ 112 for key in encoder_added: │
│ ❱ 113 │ text = bytearray([byte_decoder[c] for c in key]) │
│ 114 │ fout.write(struct.pack("i", len(text))) │
│ 115 │ fout.write(text) │
│ 116 │
│ │
│ /home/silvacarl/Desktop/ggml/examples/gpt-j/convert-h5-to-ggml.py:113 in │
│ │
│ 110 │ fout.write(text) │
│ 111 │
│ 112 for key in encoder_added: │
│ ❱ 113 │ text = bytearray([byte_decoder[c] for c in key]) │
│ 114 │ fout.write(struct.pack("i", len(text))) │
│ 115 │ fout.write(text) │
│ 116 │
╰──────────────────────────────────────────────────────────────────────────────────────────────────╯
KeyError: ' '
Any ideas what we are missing?
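For anyone hitting the same traceback, here is a minimal sketch of why line 113 can raise KeyError: ' '. It assumes the script builds byte_decoder by inverting the GPT-2 style bytes_to_unicode table, which the traceback does not show, and that encoder_added holds tokens added during fine-tuning whose text contains a literal space. In that table the space byte is stored as 'Ġ', so a raw ' ' has no entry. The helper token_to_bytes is hypothetical and not part of the ggml script; whether the GGML loader treats the resulting bytes as intended has not been verified here.

```python
def bytes_to_unicode():
    """GPT-2 style table mapping every byte value to a printable unicode character."""
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    cs = bs[:]
    n = 0
    for b in range(2**8):
        if b not in bs:
            # Bytes without a printable form (including space) are shifted past 255.
            bs.append(b)
            cs.append(2**8 + n)
            n += 1
    return dict(zip(bs, [chr(c) for c in cs]))


# Assumption: the conversion script inverts the table in the same way.
byte_encoder = bytes_to_unicode()
byte_decoder = {v: k for k, v in byte_encoder.items()}


def token_to_bytes(token, decoder):
    """Hypothetical helper: decode a token through the byte table, and fall
    back to plain UTF-8 when the token contains a character the table lacks."""
    try:
        return bytes(decoder[c] for c in token)
    except KeyError:
        return token.encode("utf-8")


print(" " in byte_decoder)   # False: a raw space is not a key in the table
print(byte_decoder["Ġ"])     # 32: the space byte is represented as 'Ġ'
print(token_to_bytes("Ġhello", byte_decoder))      # b' hello'
print(token_to_bytes("<my token>", byte_decoder))  # b'<my token>' via the UTF-8 fallback
```

If this assumption matches your setup, checking the added tokens of the fine-tuned tokenizer for spaces or other characters outside the table would be a reasonable first step before patching the script.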