Added GGML Type 3 dequantization capability in transformers #32062

kskd1804 · 2024-07-18T18:17:54Z

What does this PR do?

This PR introduces the dequantize_q4_1 function, which lets users load Q4_1-quantized models (GGML type 3) with transformers.
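For readers unfamiliar with the format, here is a minimal sketch of what Q4_1 dequantization involves. It is not the code from this PR. It assumes the usual ggml block layout for Q4_1: 32 weights per block, stored as an fp16 scale d, an fp16 minimum m, and 16 bytes of packed 4-bit values, with low nibbles holding the first 16 weights and high nibbles the last 16. The helper names and the pure-Python approach are illustrative only; a real implementation would most likely be vectorized with NumPy.

```python
import struct

QK4_1 = 32                          # weights per Q4_1 block (assumed ggml layout)
BLOCK_BYTES = 2 + 2 + QK4_1 // 2    # fp16 scale + fp16 min + 16 packed bytes = 20


def dequantize_q4_1_block(block: bytes) -> list:
    """Expand one 20-byte Q4_1 block into 32 floats (illustrative helper)."""
    if len(block) != BLOCK_BYTES:
        raise ValueError(f"expected {BLOCK_BYTES} bytes, got {len(block)}")
    # Two little-endian half-precision floats: scale d, then minimum m.
    d, m = struct.unpack_from("<2e", block, 0)
    qs = block[4:]
    # Low nibbles hold weights 0..15, high nibbles hold weights 16..31.
    low = [(q & 0x0F) * d + m for q in qs]
    high = [(q >> 4) * d + m for q in qs]
    return low + high


def dequantize_q4_1(data: bytes) -> list:
    """Dequantize a buffer made of consecutive Q4_1 blocks."""
    if len(data) % BLOCK_BYTES:
        raise ValueError("buffer length is not a multiple of the block size")
    out = []
    for off in range(0, len(data), BLOCK_BYTES):
        out.extend(dequantize_q4_1_block(data[off:off + BLOCK_BYTES]))
    return out


# Hand-built block: d = 0.5, m = -1.0, byte j packs low nibble j and high nibble 15 - j.
packed = bytes(((15 - j) << 4) | j for j in range(16))
block = struct.pack("<2e", 0.5, -1.0) + packed
values = dequantize_q4_1(block)
```

Each weight is reconstructed as q * d + m, so the hand-built block above decodes to values running from -1.0 up to 6.5 and back down to -1.0.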

Fixes #31847

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline, Pull Request section?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.
@SunMarc

amyeroberts · 2024-08-19T11:45:39Z

Gentle ping @SunMarc

SunMarc · 2024-08-19T13:09:17Z

Thanks for adding this, @kskd1804! Going forward, I think it will be better to use the dequantize function introduced in the gguf package. This PR should unlock your feature! Thanks again for contributing.

SunMarc · 2024-09-03T13:36:42Z

The dequantize function PR has been merged!

kskd1804 added 4 commits July 17, 2024 17:57

Added dequantize_q4_1 function to ggml

69739f9

Added ggml type for q4_1

b0e94fc

Added calls to dequantize_q4_1 function

31ec19c

Merge branch 'huggingface:main' into dequantize_q4_1_fn

3def5bc

amyeroberts added the Quantization label Jul 18, 2024

huggingface deleted a comment from github-actions bot Aug 19, 2024

SunMarc closed this Sep 3, 2024
