mpt : fix `n_ctx` #165

marella · 2023-05-18T13:22:59Z

Also max_seq_len doesn't seem to be used anywhere. Is that expected?

ggerganov · 2023-05-20T14:05:14Z

Hm, I'm confused. It looks like max_seq_len corresponds to n_ctx.
Looking at HF repo, it seems that it should be 2048, but here we have it defined as 4096:

https://huggingface.co/mosaicml/mpt-7b/blob/main/config.json#L39

Need to download the model and test it

marella · 2023-05-20T14:11:28Z

Yeah I also think max_seq_len should be n_ctx.
I also mentioned it to the author who created it in #168 (comment)

marella · 2023-05-20T15:27:14Z

Should I change max_seq_len to n_ctx? Replit example is using max_seq_len as n_ctx everywhere.

lukasmoellerch · 2023-05-20T15:32:17Z

The main reason was that we always allocate max seq length which might require too much ram for kv cache on low-ram machines

ggerganov · 2023-05-20T15:48:02Z

Ah yes, I remember now. Let me think about this some more then.

ggerganov · 2023-05-20T15:50:45Z

The main reason was that we always allocate max seq length which might require too much ram for kv cache on low-ram machines

Ok, but the way it is implemented now, we allocate with n_ctx = 4096:

ggml/examples/mpt/main.cpp

Line 21 in c2fab8a

int n_ctx = 4096;

This is 2x more than max_seq_len.
What is the reasoning for line 21 ?

lukasmoellerch · 2023-05-21T17:29:11Z

The main reason was that we always allocate max seq length which might require too much ram for kv cache on low-ram machines

Ok, but the way it is implemented now, we allocate with n_ctx = 4096:

ggml/examples/mpt/main.cpp

Line 21 in c2fab8a

int n_ctx = 4096;

This is 2x more than max_seq_len. What is the reasoning for line 21 ?

It was mainly intended as a fix for running the storywriter model which has a much higher seq length

marella · 2023-05-21T17:54:43Z

Should we do something like n_ctx = min(max_seq_len, 4096)?

klosax · 2023-05-21T18:49:43Z

I suggest adding a command line parameter for setting n_ctx. Default value could be 512 as in llama. Maximum value could be max_seq_len.

ggerganov · 2023-05-22T13:19:54Z

Yes, we can do both of these suggestions

marella · 2023-05-22T15:33:10Z

Added the min() condition.

I can add the command line parameter later in a separate PR. Can it be added to all models not just MPT?

lukasmoellerch · 2023-05-22T17:28:44Z

I suggest adding a command line parameter for setting n_ctx. Default value could be 512 as in llama. Maximum value could be max_seq_len.

Yes, I like the suggestion, many new models don’t have an inherent max seq length either, thus setting it to that was a bit arbitrary in the first place.

mpt : move global variable n_ctx to mpt_hparams

fcf734c

marella mentioned this pull request May 20, 2023

MPT quantize does not include quantization version #168

Closed

ggerganov approved these changes May 20, 2023

View reviewed changes

Resolve conflicts

c501a90

ggerganov self-requested a review May 20, 2023 15:48

marella added 2 commits May 21, 2023 02:05

Merge branch 'master' of https://github.com/ggerganov/ggml into mpt

fffa6e3

mpt : fix warnings

eec32d1

marella added 2 commits May 22, 2023 20:22

Merge branch 'master' of https://github.com/ggerganov/ggml into mpt

6c223b2

mpt : fix n_ctx

b717fae

marella changed the title ~~mpt : move global variable n_ctx to mpt_hparams~~ mpt : fix n_ctx May 22, 2023

ggerganov closed this in a651d58 May 24, 2023

marella mentioned this pull request May 28, 2023

ctx limiter. marella/ctransformers#16

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mpt : fix `n_ctx` #165

mpt : fix `n_ctx` #165

marella commented May 18, 2023

ggerganov commented May 20, 2023

marella commented May 20, 2023

marella commented May 20, 2023

lukasmoellerch commented May 20, 2023

ggerganov commented May 20, 2023

ggerganov commented May 20, 2023 •

edited

Loading

lukasmoellerch commented May 21, 2023

marella commented May 21, 2023

klosax commented May 21, 2023

ggerganov commented May 22, 2023

marella commented May 22, 2023

lukasmoellerch commented May 22, 2023

mpt : fix n_ctx #165

mpt : fix n_ctx #165

Conversation

marella commented May 18, 2023

ggerganov commented May 20, 2023

marella commented May 20, 2023

marella commented May 20, 2023

lukasmoellerch commented May 20, 2023

ggerganov commented May 20, 2023

ggerganov commented May 20, 2023 • edited Loading

lukasmoellerch commented May 21, 2023

marella commented May 21, 2023

klosax commented May 21, 2023

ggerganov commented May 22, 2023

marella commented May 22, 2023

lukasmoellerch commented May 22, 2023

mpt : fix `n_ctx` #165

mpt : fix `n_ctx` #165

ggerganov commented May 20, 2023 •

edited

Loading