
Mistral token counting/batching #46

Open
pbeart opened this issue Jan 26, 2024 · 0 comments

Comments


pbeart commented Jan 26, 2024

Hi, I'm interested in making a PR so that the embedding methods chunk data to ensure each request stays under Mistral's maximum token limit (16384). Unfortunately, there doesn't seem to be a lightweight way to count tokens for Mistral, comparable to tiktoken for OpenAI models. I guess most of the time it would be safe to use tiktoken with a safety margin, but is there a way to count tokens accurately without a heavy library?
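For reference, here is a minimal sketch of the "tiktoken with a safety margin" idea described above. The `SAFETY_MARGIN` value, the helper names, and the choice of the `cl100k_base` encoding are all illustrative assumptions; tiktoken does not implement Mistral's actual tokenizer, so the counts are estimates only:

```python
import tiktoken

# Per-request token limit for Mistral's embedding endpoint (from the issue).
MAX_TOKENS_PER_REQUEST = 16384
# tiktoken is not Mistral's tokenizer, so its counts are only estimates;
# the 0.8 margin is an assumed value, not a published figure.
SAFETY_MARGIN = 0.8

_enc = tiktoken.get_encoding("cl100k_base")  # OpenAI encoding, used as a proxy


def estimate_tokens(text: str) -> int:
    """Roughly estimate Mistral token usage via tiktoken."""
    return len(_enc.encode(text))


def chunk_for_embedding(texts: list[str]) -> list[list[str]]:
    """Greedily pack texts into batches whose estimated token totals
    stay under the margin-adjusted per-request limit."""
    budget = int(MAX_TOKENS_PER_REQUEST * SAFETY_MARGIN)
    batches: list[list[str]] = []
    current: list[str] = []
    current_tokens = 0
    for text in texts:
        n = estimate_tokens(text)
        # A single text longer than the budget still gets its own batch
        # here; splitting or truncating it is left out for brevity.
        if current and current_tokens + n > budget:
            batches.append(current)
            current, current_tokens = [], 0
        current.append(text)
        current_tokens += n
    if current:
        batches.append(current)
    return batches
```

Each resulting batch could then be sent as one embedding request; the greedy packing keeps request counts low while the margin absorbs the mismatch between tiktoken's counts and Mistral's real tokenization.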
