Skip to content

Commit

Permalink
Vulkan Mixture of Experts (MoE) support (llama/7628)
Browse files Browse the repository at this point in the history
* Finish Vulkan mul_mat_id implementation

* Add Vulkan sum_rows and div ops

* Fix MUL_MAT_ID matrix matrix shader

* Fix MUL_MAT_ID matrix vector shader dispatch size

* Fix MUL_MAT_ID matrix vector shader and dispatch code

* Update Vulkan CPU offload for MUL_MAT_ID

* Fix crash when using split mode none and setting a main GPU
  • Loading branch information
0cc4m authored and ggerganov committed Jun 15, 2024
1 parent 5ed1871 commit 1981d5b
Showing 1 changed file with 448 additions and 315 deletions.
Loading

0 comments on commit 1981d5b

Please sign in to comment.