Inference broken with starcoderplus-guanaco-gpt4.ggmlv1.q8_0.bin since 43ffec5 #378
Comments
FYI, I only uploaded the quants of Starcoderplus Guanaco 8 hours ago, and the unquantised model was only released 17 hours ago, so you couldn't have been testing it 2 days ago :) Are your earlier comparisons with a different model, like WizardCoder Guanaco? https://huggingface.co/TheBloke/WizardCoder-Guanaco-15B-V1.0-GGML
I know, I downloaded the model a few hours ago from your huggingface account, thanks for providing it. By "a couple days ago", I'm referring to an old ggml build at 965568d that I used to test Starcoderplus-Guanaco-GPT4-15B-V1.0-GGML (which worked), before I pulled today's ggml, which is broken. Thus the bisect. Here are the attempts I made with Starcoderplus-Guanaco-GPT4-15B-V1.0-GGML in more detail:
Please try the latest master and let me know if the issue persists. I think I just fixed a bug that could have caused this.
I just tested the
$ ▶ make -j && ./bin/starcoder -t 8 -m models/starcoder/starcoderplus-guanaco-gpt4.ggmlv1.q8_0.bin -n 4096 --top_p 0.3 --temp 1 --top_k 9999 -f p-prompt.txt
main: temp = 1.000
Assistant: Sure, here's the function:
This function is a state machine that checks each byte of a string for a valid UTF-8 encoding. It is not a complete implementation of the UTF-8 standard, but it will catch most of the common errors. The function returns
It is important to note that this function is not foolproof. It will not catch all invalid UTF-8 encodings. For instance, it will not check for overlong encodings, or encodings that use the 0xFE and 0xFF bytes.```<|endoftext|>
main: mem per token = 462024 bytes
Please reopen if the issue persists
Confirmed, current master 9e3c293 works as expected.
A couple days ago, starcoder with starcoderplus-guanaco-gpt4 was perfectly capable of generating a C++ function that validates UTF-8 strings. That is no longer the case: inference now gives answers that do not fit the prompt. Most often it says that the question is unclear, or it references the civil war, toxic words, etc. I've bisected this, starting from July 02:
965568d from 2023-07-02. Generates isUTF8() just fine, GOOD.
d5c4ce0 from 2023-07-04. Generates isUTF8() just fine. Last GOOD version.
bfc6d42 Aborts with GGML_ASSERT, skipping in bisect
d8fbf15 Aborts with GGML_ASSERT, skipping in bisect
43ffec5 from 2023-07-05. Fails, first BAD version. Generates e.g. "Your question is a bit unclear" or starts talking about some random website.
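For anyone repeating a bisect like this, the good/bad/skip decisions above could be automated with `git bisect run`, which treats exit status 0 as good, 125 as "cannot test, skip", and other statuses from 1 to 127 as bad. The following is only a rough sketch under stated assumptions: the script name `bisect_check.py`, the `--bisect` switch, the `isUTF8` and `GGML_ASSERT` marker strings, and the rule that any crash means "skip" are all illustrative choices, not part of ggml. The build and inference commands are copied from the command line quoted in this thread and assume the script is run from the build directory. Since sampling runs at `--temp 1`, a single run per commit may misclassify.

```python
import subprocess
import sys

# Hypothetical markers: text that a good answer should contain, and text
# printed when the binary aborts. Adjust both to your own prompt and build.
GOOD_MARKER = "isUTF8"
ASSERT_MARKER = "GGML_ASSERT"

# Commands as quoted in this thread; paths are assumptions about your checkout.
BUILD_CMD = ["make", "-j"]
RUN_CMD = [
    "./bin/starcoder", "-t", "8",
    "-m", "models/starcoder/starcoderplus-guanaco-gpt4.ggmlv1.q8_0.bin",
    "-n", "4096", "--top_p", "0.3", "--temp", "1", "--top_k", "9999",
    "-f", "p-prompt.txt",
]


def classify(returncode: int, output: str) -> int:
    """Map one inference run to an exit status for `git bisect run`."""
    if ASSERT_MARKER in output or returncode != 0:
        return 125  # aborted or crashed: this commit cannot be tested, skip it
    if GOOD_MARKER in output:
        return 0    # answer fits the prompt: good
    return 1        # ran fine but the answer is off-topic: bad


def main() -> int:
    build = subprocess.run(BUILD_CMD, capture_output=True, text=True)
    if build.returncode != 0:
        return 125  # commit does not build: skip
    run = subprocess.run(RUN_CMD, capture_output=True, text=True)
    return classify(run.returncode, run.stdout + run.stderr)


# Only run the real build when explicitly asked, so importing is side-effect free.
if __name__ == "__main__" and sys.argv[1:] == ["--bisect"]:
    sys.exit(main())
```

With this saved as `bisect_check.py`, a session might look like `git bisect start <bad-commit> 965568d` followed by `git bisect run python3 bisect_check.py --bisect`; commits that abort, such as bfc6d42 and d8fbf15 above, would then be skipped automatically.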
Here's how to reproduce it, prompt (with teacher forcing):
And command line:
Expected output is along the lines of: