add way to compile ggml with intel mkl #804
Conversation
Signed-off-by: Kevin Hannon <[email protected]>
The CMake BLAS find module is generic and supports different libraries, like MKL. Edit: you can use the vendor option to select "Intel".
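As a minimal sketch of the vendor option mentioned above: CMake's `FindBLAS` module reads the `BLA_VENDOR` variable to pick a specific implementation, and `Intel10_64lp` selects the threaded, LP64 interface of MKL. The target name `ggml` here is an assumption for illustration.

```cmake
# Sketch: selecting Intel MKL through CMake's generic FindBLAS module.
# BLA_VENDOR is documented by CMake; "Intel10_64lp" = 64-bit MKL, LP64, threaded.
set(BLA_VENDOR Intel10_64lp)
find_package(BLAS)

if(BLAS_FOUND)
    # BLAS_LIBRARIES and BLAS_LINKER_FLAGS are set by FindBLAS.
    # The target name "ggml" is assumed for illustration.
    target_link_libraries(ggml PRIVATE ${BLAS_LIBRARIES})
endif()
```

The same mechanism can be driven from the command line with `-DBLA_VENDOR=Intel10_64lp`, so no per-vendor branches are needed in the CMakeLists itself.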
Yeah, I use that code for llama.cpp, but when I was looking at this repo I noticed that BLAS was not set up in the same way. Maybe I am confused because I don't exactly follow the relationship between llama.cpp and ggml. In this repo's CMakeLists there was explicit support for each BLAS option, while llama.cpp has a smarter way to detect it. I wasn't sure if refactoring ggml to match was necessary, so I tried to keep the logic similar to what is already present in this repo.
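For reference, the "smarter" llama.cpp-style detection boils down to one generic `find_package(BLAS)` call parameterized by a vendor cache variable, instead of one explicit branch per library. The option names `GGML_BLAS` and `GGML_BLAS_VENDOR` below are assumptions, mirroring llama.cpp's `LLAMA_BLAS` / `LLAMA_BLAS_VENDOR`:

```cmake
# Sketch of llama.cpp-style generic BLAS detection, adapted to this repo.
# GGML_BLAS / GGML_BLAS_VENDOR are hypothetical names for illustration.
option(GGML_BLAS "ggml: use BLAS" OFF)
set(GGML_BLAS_VENDOR "Generic" CACHE STRING "ggml: BLAS library vendor")

if(GGML_BLAS)
    # Forward the chosen vendor to FindBLAS, e.g. "Intel10_64lp" for MKL
    # or "OpenBLAS"; "Generic" lets CMake pick any available BLAS.
    set(BLA_VENDOR ${GGML_BLAS_VENDOR})
    find_package(BLAS)

    if(BLAS_FOUND)
        message(STATUS "BLAS found: ${BLAS_LIBRARIES}")
        target_link_libraries(ggml PRIVATE ${BLAS_LIBRARIES})
    else()
        message(WARNING "BLAS not found; check BLA_VENDOR and your install")
    endif()
endif()
```

This keeps the per-vendor knowledge inside CMake's own `FindBLAS` module rather than duplicating it in the project's build script.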
Once upon a time, ggml consisted of two files, `ggml.h` and `ggml.c`.
Is the eventual goal that ggml is used as the tensor library for whisper.cpp and llama.cpp? It wasn't clear to me whether this code is used in llama.cpp. I see that syncing is done manually, but I didn't see any linking or compilation of this library.
ggml is already used as the tensor library of whisper.cpp, llama.cpp, and other projects. Changes to ggml usually happen in llama.cpp, but they are regularly synced back to this repository. However, each project has its own build scripts; there isn't a unified way to build ggml, and that's a problem.
This is already done in the kantv project: a build script for llama.cpp, whisper.cpp, and stablediffusion.cpp, but that script only works for Android.
I have been playing around with ollama and llama.cpp. I have an Intel laptop and I wanted to use BLAS.
llama.cpp supports Intel MKL and it seems to perform very well. I wanted to see if I could get this library to also compile with MKL.
I think I got it working: in a GNU build, `test-vec1` runs in about 10 seconds, while the Intel + MKL build runs in about 2 seconds.