llama.cpp ./embedding #17

Open · Philipp-Sc opened this issue Oct 25, 2023 · 4 comments
Labels: question (Further information is requested)

Comments

@Philipp-Sc

Is there no rust binding to get the embeddings?

Using llama.cpp one would use:

./embedding -m ./path/to/model --log-disable -p "Hello World!" 2>/dev/null
@mdrokz (Owner) commented Oct 25, 2023

> Is there no rust binding to get the embeddings?
>
> Using llama.cpp one would use:
>
> ./embedding -m ./path/to/model --log-disable -p "Hello World!" 2>/dev/null

Have you tried the `pub fn embeddings(...)` method? It will get you the embeddings for the prompt.
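
Something along these lines, as a minimal sketch. I'm assuming here that `embeddings` takes the prompt together with a `PredictOptions` and returns a `Vec<f32>`; check the signature in the crate for the exact types:

```rust
use llama_cpp_rs::{
    options::{ModelOptions, PredictOptions},
    LLama,
};

fn main() {
    // embeddings must be enabled on the model options, otherwise
    // the embeddings() call has nothing to return.
    let model_options = ModelOptions {
        embeddings: true,
        ..Default::default()
    };

    let llama = LLama::new("./path/to/model".into(), &model_options).unwrap();

    // Assumed signature: prompt plus PredictOptions, returning Vec<f32>.
    let embd = llama
        .embeddings("Hello World!".into(), PredictOptions::default())
        .unwrap();

    println!("{:?}", embd);
}
```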

@mdrokz added the question label on Oct 25, 2023
@Philipp-Sc (Author)

@mdrokz thank you for your response.

I tried it before falling back to the llama.cpp ./embedding executable directly.

The function would always return an empty vector:

[]

I tried multiple configurations but could not fix the issue.

@mdrokz (Owner) commented Oct 26, 2023

> @mdrokz thank you for your response.
>
> I tried it before falling back to the llama.cpp ./embedding executable directly.
>
> The function would always return an empty vector:
>
> []
>
> I tried multiple configurations but could not fix the issue.

Alright, I will test on my end and see what's happening. Thanks.

@Philipp-Sc (Author) commented Oct 27, 2023

I was using zephyr-7B-alpha-GGUF with:

context_size: 8192 
n_batch: 512 
embeddings: true

without any GPU assistance.
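
In code, that setup looks roughly like this (a sketch; I'm assuming the names above map 1:1 onto the crate's `ModelOptions` fields):

```rust
use llama_cpp_rs::options::ModelOptions;

fn main() {
    // Options as listed above; everything else stays at its default,
    // including n_gpu_layers, so nothing is offloaded to the GPU.
    let model_options = ModelOptions {
        context_size: 8192,
        n_batch: 512,
        embeddings: true,
        ..Default::default()
    };
    let _ = model_options; // passed to LLama::new(...) in the real code
}
```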


Note:
There was also some strange behavior involving n_batch and n_token, where a longer prompt (still well below the context length) led to an unexpected error:

GGML_ASSERT: n_token <= n_batch

Presumably the whole prompt is evaluated as a single batch, so any prompt that tokenizes to more than n_batch tokens trips this assert.

Right now my workaround is a Rust wrapper (Command::new) around the ./embedding binary that reads stdout into a string and parses the float values into a vector. The only parameters I set are --ctx-size 8192 and --mlock.
I imagine this is less efficient, as the model needs to be reloaded for each call.
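
The wrapper looks roughly like this (a sketch with placeholder paths, not my exact code):

```rust
use std::process::Command;

// Shell out to the llama.cpp embedding binary and parse the
// whitespace-separated floats it prints to stdout.
fn embed(prompt: &str) -> Result<Vec<f32>, Box<dyn std::error::Error>> {
    let output = Command::new("./embedding")
        .args([
            "-m", "./path/to/model",
            "--log-disable",
            "--ctx-size", "8192",
            "--mlock",
            "-p", prompt,
        ])
        .output()?;

    let stdout = String::from_utf8(output.stdout)?;

    // Keep only the tokens that parse as f32; anything else on
    // stdout is ignored.
    Ok(stdout
        .split_whitespace()
        .filter_map(|tok| tok.parse::<f32>().ok())
        .collect())
}

fn main() {
    let embd = embed("Hello World!").unwrap();
    println!("dim = {}", embd.len());
}
```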
