Stars
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
llama3 implementation one matrix multiplication at a time
Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vectors, now in oobabooga text generation webui!
Memoir+ a persona extension for Text Gen Web UI. That includes memory, emotions, command handling and more.
Web page with political compass quiz results for open LLMs
Official PyTorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf
Tools for merging pretrained large language models.
🤗 A specialized library for integrating context-free grammars (CFG) in EBNF with the Hugging Face Transformers
Official implementation of Half-Quadratic Quantization (HQQ)
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, D…
Distribute and run LLMs with a single file.
A web search extension for Oobabooga's text-generation-webui (now with nougat)
jllllll / flash-attention
Forked from Dao-AILab/flash-attention. Fast and memory-efficient exact attention - Windows wheels
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Simple extension for text-generation-webui that injects recent conversation history into the negative prompt with the goal of minimizing the LLM's tendency to fixate on a single word, phrase, or se…
Integrate image generation capabilities to text-generation-webui using Stable Diffusion.
Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.
A natural language interface for computers
Rigorous evaluation of LLM-synthesized code - NeurIPS 2023
Merge Transformers language models by use of gradient parameters.
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
Fast and memory-efficient exact attention
Compare the performance of different LLMs that can be deployed locally on consumer hardware. Run it yourself with Colab WebUI.