
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 7,526 822 Updated Jul 18, 2024

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Python 1,486 70 Updated Jul 6, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 11,299 852 Updated May 23, 2024

Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vectors, now in oobabooga text generation webui!

Python 38 2 Updated Mar 21, 2024

Memoir+, a persona extension for Text Gen Web UI that includes memory, emotions, command handling, and more.

Python 150 17 Updated Jul 16, 2024

Web page with political compass quiz results for open LLMs

HTML 36 Updated Jan 31, 2024

Official PyTorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf

Python 913 158 Updated Jul 18, 2024

Tools for merging pretrained large language models.

Python 4,124 359 Updated Jul 19, 2024
Python 128 5 Updated Jun 25, 2024

🤗 A specialized library for integrating context-free grammars (CFG) in EBNF with Hugging Face Transformers

Python 65 9 Updated Jul 9, 2024

Science-driven chatbot development

Python 50 7 Updated May 5, 2024

Official implementation of Half-Quadratic Quantization (HQQ)

Python 573 53 Updated Jul 16, 2024

AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, but supports a variety of advanced features, such as a settings page, low VRAM support, D…

HTML 688 73 Updated Jul 17, 2024

Distribute and run LLMs with a single file.

C++ 17,184 858 Updated Jul 18, 2024

A web search extension for Oobabooga's text-generation-webui (now with nougat)

Python 61 6 Updated Jul 7, 2024

Fast and memory-efficient exact attention - Windows wheels

Python 23 3 Updated Mar 3, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 32,084 3,860 Updated Jul 8, 2024

Simple extension for text-generation-webui that injects recent conversation history into the negative prompt with the goal of minimizing the LLM's tendency to fixate on a single word, phrase, or se…

Python 33 2 Updated Nov 20, 2023

Integrate image generation capabilities into text-generation-webui using Stable Diffusion.

Python 51 5 Updated May 18, 2024

Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.

Python 266 24 Updated Jul 12, 2024

A natural language interface for computers

Python 50,951 4,444 Updated Jul 18, 2024

Rigorous evaluation of LLM-synthesized code - NeurIPS 2023

Python 1,060 87 Updated Jun 21, 2024

Merge Transformers language models by use of gradient parameters.

Python 189 20 Updated Oct 19, 2023

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 1,457 167 Updated Jul 15, 2024
Python 65 3 Updated Jul 6, 2024

Fast and memory-efficient exact attention

Python 12,458 1,108 Updated Jul 19, 2024

ChatGPT CSS style

CSS 11 1 Updated Apr 28, 2024

Compare the performance of different LLMs that can be deployed locally on consumer hardware. Run it yourself with Colab WebUI.

Jupyter Notebook 931 139 Updated Jul 17, 2024