flash-attention Public
Forked from Dao-AILab/flash-attention
Fast and memory-efficient exact attention - Windows wheels
-
Wheels for llama-cpp-python compiled with cuBLAS support
-
ctransformers-cuBLAS-wheels Public
ctransformers wheels with pre-built CUDA binaries for additional CUDA and AVX versions.
-
AutoGPTQ Public
Forked from AutoGPTQ/AutoGPTQ
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
-
bitsandbytes-windows-webui Public
Windows compile of bitsandbytes for use in text-generation-webui.
-
bitsandbytes Public
Forked from acpopescu/bitsandbytes
8-bit CUDA functions for PyTorch
-
text-generation-webui Public
Forked from oobabooga/text-generation-webui
A gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICA.
-
exllamav2 Public
Forked from turboderp/exllamav2
A fast inference library for running LLMs locally on modern consumer-class GPUs
-
exllama Public
Forked from turboderp/exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
-
GPTQ-for-LLaMa-CUDA Public
A combination of Oobabooga's fork and the main cuda branch of GPTQ-for-LLaMa in a package format.
-
scikit-build-core Public
Forked from scikit-build/scikit-build-core
A next generation Python CMake adaptor and Python API for plugins
Python, Apache License 2.0, Updated Sep 20, 2023
-
one-click-installers Public
Forked from oobabooga/one-click-installers
Simplified installers for oobabooga/text-generation-webui.
-
safetensors Public
Forked from huggingface/safetensors
Simple, safe way to store and distribute tensors
Python, Apache License 2.0, Updated Aug 23, 2023
-
ctransformers Public
Forked from marella/ctransformers
Python bindings for the Transformer models implemented in C/C++ using GGML library.
C, MIT License, Updated Aug 7, 2023
-
SillyTavern Public
Forked from SillyTavern/SillyTavern
LLM Frontend for Power Users.
JavaScript, GNU Affero General Public License v3.0, Updated Jul 30, 2023
-
GPTQ-for-LLaMa-Wheels Public
Precompiled Wheels for GPTQ-for-LLaMa
-
h2ogpt Public
Forked from h2oai/h2ogpt
Private Q&A and summarization of documents+images or chat with local GPT, 100% private, no data leaks, Apache 2.0. Demo: https://gpt.h2o.ai/
-
GPTQ-for-LLaMa Public
Forked from 0cc4m/GPTQ-for-LLaMa
4 bits quantization of LLMs using GPTQ
Python, Apache License 2.0, Updated May 19, 2023
-
windows-venv-installers Public
Standalone, dependency-less scripts for automatically setting up a virtual environment for easy project installation on Windows.