sync : llama.cpp (fused soft max, gpu cpy ops, etc.) #640

ggerganov · 2023-12-07T11:57:04Z

No description provided.

ggml-ci

ggerganov · 2023-12-07T19:13:53Z

All 3 repos should be synced now

examples/whisper/whisper.cpp

ggerganov · 2023-12-07T19:32:08Z

src/ggml.c

@@ -8446,7 +8469,7 @@ static void ggml_compute_forward_concat_f32(
     GGML_ASSERT(nb10 == sizeof(float));

     for (int i3 = 0; i3 < ne3; i3++) {
-        for (int i2 = ith; i2 < ne2; i2++) {
+        for (int i2 = ith; i2 < ne2; i2 += nth) {


@FSSRepo FYI there was a bug in ggml_concat

FSSRepo · 2023-12-07

This PR already solves it anyway. Right?

ggerganov · 2023-12-07

Yes - waiting for CI to pass and will merge
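
To illustrate why the one-line change in the hunk above matters, here is a minimal sketch that simulates the two loop forms. It assumes, as the names in the diff suggest, that `ith` is the index of the current worker thread and `nth` is the number of threads. The helpers `count_row_visits` and `each_row_once` and the constant `MAX_ROWS` are hypothetical and are not part of ggml. With `i2++`, each thread starts at its own index but then walks every remaining row, so most rows are processed by several threads. With `i2 += nth`, the rows are split across threads without overlap.

```c
#include <assert.h>
#include <string.h>

#define MAX_ROWS 64 /* hypothetical upper bound, only for this sketch */

/* Hypothetical helper (not part of ggml): simulate nth worker threads walking
 * the i2 dimension and count how often each row is visited.
 * stride_by_nth == 0 reproduces the old loop (i2++),
 * stride_by_nth != 0 reproduces the fixed loop (i2 += nth). */
static void count_row_visits(int ne2, int nth, int stride_by_nth, int visits[MAX_ROWS]) {
    assert(nth > 0 && ne2 >= 0 && ne2 <= MAX_ROWS);
    memset(visits, 0, MAX_ROWS * sizeof(int));

    const int step = stride_by_nth ? nth : 1;
    for (int ith = 0; ith < nth; ith++) {
        /* same loop header shape as in the diff */
        for (int i2 = ith; i2 < ne2; i2 += step) {
            visits[i2]++;
        }
    }
}

/* Returns 1 if every row in [0, ne2) was visited exactly once, else 0. */
static int each_row_once(const int visits[MAX_ROWS], int ne2) {
    for (int i2 = 0; i2 < ne2; i2++) {
        if (visits[i2] != 1) {
            return 0;
        }
    }
    return 1;
}
```

For example, with `ne2 = 8` and `nth = 4`, the old loop visits row 0 once but rows 3 through 7 four times each, while the fixed loop visits every row exactly once.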

ggerganov and others added 11 commits December 7, 2023 13:56

sync : llama.cpp (fused soft max, gpu cpy ops, etc.)

63f00d5

ggml-ci

cuda : restore accidentally deleted changes

62ec008

ggml-ci

cuda : fix rope + disable device-side dequantize

26433ea

ggml-ci

test-backend-ops : enable stablelm rope test

9322c9f

cuda : remove rope assert

4c39e94

sync.sh : add test-backend-ops

4473554

ggml : fix ggml_concat + ggml_get_n_tasks logic

c646115

Merge branch 'master' into sync

b422db6

ggml-ci

sync : whisper.cpp

870b2b8

ggml-ci

metal : fix assert

8459363

ci : fix Metal path to shaders

6b6dcae

ggml-ci

ggerganov force-pushed the sync branch from cdf4250 to 6b6dcae Compare December 7, 2023 19:11

slaren reviewed Dec 7, 2023

examples/whisper/whisper.cpp (outdated, resolved)

whisper : fix bug if metal init fails

aa7241b

slaren approved these changes Dec 7, 2023

ggerganov commented Dec 7, 2023

ggerganov merged commit c57aa8e into master Dec 7, 2023
4 checks passed
