
[Question] What is the status of Vulkan backend? #542

Open
DanielMazurkiewicz opened this issue Sep 27, 2023 · 12 comments

Comments

@DanielMazurkiewicz

DanielMazurkiewicz commented Sep 27, 2023

Vulkan may not be the best/fastest/easiest solution for inference, but it is probably the most portable GPU acceleration approach.

Is anyone actively working on adding support for it? If so, what is the status/progress? If not, is it planned?

@Green-Sky
Contributor

there are ggerganov/llama.cpp#2059 and ggerganov/llama.cpp#2039

@0cc4m
Contributor

0cc4m commented Sep 29, 2023

Yeah, I'm working on it. Let me know if you have any questions. It's a big project, but I'm making progress.

@Calandiel

> Yeah, I'm working on it. Let me know if you have any questions. It's a big project, but I'm making progress.

Is there any low hanging fruit a newcomer to the project could help with?

@0cc4m
Contributor

0cc4m commented Nov 16, 2023

> Is there any low hanging fruit a newcomer to the project could help with?

@Calandiel If you have experience with Vulkan, maybe. Otherwise probably not.

@Calandiel

> @Calandiel If you have experience with Vulkan, maybe. Otherwise probably not.

I have. I've written Vulkan-based render pipelines professionally and made toy neural networks in Vulkan trained with SGD. Been working with it at least in some capacity for the last 4 years or so.

@0cc4m
Contributor

0cc4m commented Nov 16, 2023

> I have. I've written Vulkan-based render pipelines professionally and made toy neural networks in Vulkan trained with SGD. Been working with it at least in some capacity for the last 4 years or so.

Oh cool, I'd be glad to work something out. If you have Discord, send me a message (_occam); otherwise send me an email and we'll find another way.

@Calandiel

> Oh cool, I'd be glad to work something out. If you have Discord, send me a message (_occam); otherwise send me an email and we'll find another way.

Will do, see you on Discord!

@sorasoras

I think nomic-ai has a functional Kompute fork of llama.cpp right now:
https://github.com/nomic-ai/llama.cpp
GPT4All is plenty fast on my 7900XTX via Vulkan,
but I am not sure how to integrate this into ggml, as I am not a programmer.

@Green-Sky
Contributor

@sorasoras ggerganov/llama.cpp#4456

CCLDArjun pushed a commit to CCLDArjun/ggml that referenced this issue Dec 18, 2023
* Revert 7e53955 (ggerganov#542)

Still needs to be fixed properly

* Fix linking on mingw32
@slaren
Collaborator

slaren commented Jan 29, 2024

The Vulkan and Kompute backends have been merged in llama.cpp; all that is left is to update the CMake build files so they can be used in other ggml projects.
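For context, a minimal sketch of what building llama.cpp with the Vulkan backend looks like (the exact CMake option name is an assumption and has varied between versions: around the time of this thread it was `LLAMA_VULKAN`, later renamed `GGML_VULKAN`):

```shell
# Hypothetical build of llama.cpp with the Vulkan backend enabled.
# Requires the Vulkan SDK (headers and glslc shader compiler) to be installed.
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
cmake -B build -DGGML_VULKAN=ON   # older versions: -DLLAMA_VULKAN=ON
cmake --build build --config Release
```

Other ggml-based projects would need equivalent options wired into their own CMake files, which is the remaining work mentioned above.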

@Kreijstal

> The Vulkan and Kompute backends have been merged in llama.cpp; all that is left is to update the CMake build files so they can be used in other ggml projects.

Does this affect anything using ggml? whisper.cpp, stable-diffusion.cpp, etc.?

@Green-Sky
Contributor
