GGML GPT-2 consistently dies at around 825 tokens with: ggml_new_object: not enough space in the context's memory pool #480

Closed
rmc135 opened this issue Aug 26, 2023 · 1 comment · Fixed by #486

rmc135 commented Aug 26, 2023

Firstly, thanks to GG and contributors for a great library/utility.

When generating with gpt-2, ggml aborts at around 824 or 825 tokens, reporting an error and then dumping core.

I would expect a problem (hopefully not a fatal error and core dump) when the total token count reaches the context size, but failing at 824 or 825 tokens seems an odd threshold.

The same error is referenced in the llama.cpp repo, but possibly for a different reason: ggerganov/llama.cpp#2404

REPRODUCE:

Clean build, CPU only, Ubuntu 22: git pull && rm -Rf build && mkdir build && cd build && cmake .. && make

with ggml-model-f16.bin (gpt2-xl), e.g. bin/gpt-2 -m ~/gpt-2/models/1558M/ggml-model-f16.bin -n ...
-n 823: ok (run completes without error)
-n 824: ggml_new_object: not enough space in the context's memory pool (needed 268457104, available 268435456)
-n 825: ggml_new_object: not enough space in the context's memory pool (needed 268457104, available 268435456)

with ggml-model-f32.bin (gpt2-xl):
-n 823: ok
-n 824: ok
-n 825: ggml_new_object: not enough space in the context's memory pool (needed 268457104, available 268435456)

Note: I had to repeat some runs several times as ggml will stop prematurely if an <|endoftext|> token is generated. Getting to 823+ tokens can take a few tries.

slaren (Collaborator) commented Aug 26, 2023

You can try increasing the buffer size here:

static size_t buf_size = 256u*1024*1024;

If you want a more reliable solution for different context sizes, you can use the allocator in ggml-alloc.h instead. @ggerganov should we update the examples to use the allocator?
