Python 198 19 Updated Sep 1, 2024

A desktop application for viewing and analyzing tabular data

TypeScript 3,132 117 Updated Aug 21, 2024

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)

Jupyter Notebook 1,638 92 Updated Jul 31, 2024
Python 30 1 Updated Jul 5, 2024

ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization

Python 62 3 Updated Jun 17, 2024

ComfyUI custom nodes kit

JavaScript 76 4 Updated Aug 15, 2024

Wrapper to use DynamiCrafter models in ComfyUI

Python 583 20 Updated Aug 15, 2024

Basic Stable Diffusion Workflows for ComfyUI using minimal custom nodes

95 3 Updated Jun 8, 2024

Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation

Python 425 31 Updated Jul 3, 2024

Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative

Python 3,578 371 Updated Jun 24, 2024

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Python 1,579 76 Updated Aug 5, 2024

Your image is almost there!

Python 7,158 415 Updated Jul 26, 2024

MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation

Python 2,074 148 Updated Aug 7, 2024

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 1,898 116 Updated Sep 1, 2024

More relighting!

Python 4,800 324 Updated Jun 27, 2024

Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.

Go 87,645 6,817 Updated Sep 2, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 19,052 2,089 Updated Aug 12, 2024

Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation

Python 352 28 Updated Jul 31, 2024

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

C++ 68,906 7,559 Updated Aug 30, 2024

Mixture-of-Experts for Large Vision-Language Models

Python 1,893 121 Updated May 15, 2024

[WIP] Layer Diffusion for WebUI (via Forge)

Python 3,775 325 Updated Aug 30, 2024

A project aiming to detect artstyles from images. It queries Wikimedia Commons to collect images for the training set.

Python 3 2 Updated Jul 15, 2023

The Prodigy optimizer and its variants for training neural networks.

Python 291 17 Updated Jul 7, 2024

Command-line program to download image galleries and collections from several image hosting sites

Python 11,241 918 Updated Sep 1, 2024

Efficient Train Data Collector for Anime Waifu

Python 249 19 Updated Aug 24, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 4,690 354 Updated Aug 7, 2024

Official Code for Stable Cascade

Jupyter Notebook 6,507 530 Updated Jul 25, 2024

OneTrainer is a one-stop solution for all your stable diffusion training needs.

Python 1,590 128 Updated Sep 2, 2024