Fix marlin model loading compat with autogptq #290

Liurl21 · 2024-03-13T04:11:05Z

Fix Issue #289
Autogptq uses is_marlin_format for Marlin. So depending on how/tool used to quantize, it can be quant_method=="marlin" or quant_method=="gptq" and is_marlin_format==True. Technically, Marlin is a format, not new quant method so autogptq way is more correct. But regardless, this PR will allow both methods to work.

Qubitium · 2024-03-13T04:14:27Z

I have reviewed this PR. @merrymercy @hnyls2002 Ready for review/merge.

AutoGPTQ main actually has broken Marlin direct quantize support. Use my pending PR AutoGPTQ/AutoGPTQ#586 to quant Marlin.

Fix marlin model loading compat with autogptq.

4415cee

Qubitium mentioned this pull request Mar 13, 2024

[BUG] Marlin model quantized with AutoGPTQ is not loadable #289

Closed

hnyls2002 linked an issue Mar 13, 2024 that may be closed by this pull request

[BUG] Marlin model quantized with AutoGPTQ is not loadable #289

Closed

hnyls2002 merged commit ed31579 into sgl-project:main Mar 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix marlin model loading compat with autogptq #290

Fix marlin model loading compat with autogptq #290

Liurl21 commented Mar 13, 2024

Qubitium commented Mar 13, 2024 •

edited

Loading

Fix marlin model loading compat with autogptq #290

Fix marlin model loading compat with autogptq #290

Conversation

Liurl21 commented Mar 13, 2024

Qubitium commented Mar 13, 2024 • edited Loading

Qubitium commented Mar 13, 2024 •

edited

Loading