Sparsity - The next performance frontier. #93
The process for converting a model into a SparseML-compatible model doesn't seem all that complicated, and sparsity has a lot to offer for inference. As I understand it, quantizing a model to the GGML format reduces its size and complexity, whereas making a model sparse involves both quantization and pruning away unimportant weights, is that right?
Here is a good explanation if anyone is interested: https://neuralmagic.com/blog/sparsegpt-remove-100-billion-parameters-for-free/
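To make the quantize-vs-prune distinction concrete, here is a minimal sketch of unstructured magnitude pruning (zeroing the smallest-magnitude weights), which is one common way the "irrelevant parts" get removed. This is an illustrative toy, not the SparseML or SparseGPT algorithm; the function name and the NumPy-based setup are my own assumptions.

```python
import numpy as np

def magnitude_prune(weights, sparsity=0.5):
    """Illustrative unstructured pruning: zero out the smallest-magnitude
    fraction (`sparsity`) of the entries in a weight matrix.

    Note: a toy example, not SparseML's actual implementation."""
    w = weights.copy()
    k = int(w.size * sparsity)  # number of entries to zero out
    if k == 0:
        return w
    # k-th smallest absolute value becomes the pruning threshold
    threshold = np.partition(np.abs(w), k - 1, axis=None)[k - 1]
    w[np.abs(w) <= threshold] = 0.0
    return w

rng = np.random.default_rng(0)
w = rng.standard_normal((8, 8))
pruned = magnitude_prune(w, sparsity=0.5)
print(np.mean(pruned == 0.0))  # fraction of zeroed entries, roughly equal to `sparsity`
```

Runtimes like DeepSparse then exploit those zeros by skipping the corresponding multiply-accumulates, which is where the CPU speedup comes from; quantization to GGML shrinks each remaining weight instead of removing any.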
Great work going on with GGML. Bravo to so many contributors. You are champions!
Maybe more performance (on CPU) can be had by bringing sparsity into the workflow. Here is one of the many efforts out there at the moment:
https://github.com/neuralmagic/deepsparse
What are people's thoughts on this?