mmnga

Follow

momonga mmnga

Follow

8 followers · 0 following

Popular repositories Loading

llama.cpp llama.cpp Public

Forked from ggerganov/llama.cpp

Port of Facebook's LLaMA model in C/C++

C++ 3 1
ggml ggml Public

Forked from ggerganov/ggml

Tensor library for machine learning

C++
AutoAWQ AutoAWQ Public

Forked from casper-hansen/AutoAWQ

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.

Python
text-generation-inference text-generation-inference Public

Forked from huggingface/text-generation-inference

Large Language Model Text Generation Inference

Python
peft peft Public

Forked from huggingface/peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python
AutoGPTQ AutoGPTQ Public

Forked from AutoGPTQ/AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python