Popular repositories Loading
-
-
-
AutoAWQ
AutoAWQ PublicForked from casper-hansen/AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
Python
-
text-generation-inference
text-generation-inference PublicForked from huggingface/text-generation-inference
Large Language Model Text Generation Inference
Python
-
peft
peft PublicForked from huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Python
-
AutoGPTQ
AutoGPTQ PublicForked from AutoGPTQ/AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.