- UCAS/iie
- beijing
Starred repositories
An extension for stable-diffusion-webui to remove any object.
Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step
Overview of Japanese LLMs
Demo Programs for the "Talking Head(?) Anime from a Single Image 3: Now the Body Too" Project
Create transparent image with Diffusers!
collection of diffusion model papers categorized by their subareas
HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models
[ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!
MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (adds multilingual support to any diffusion model without additional training)
SigLIP-based Aesthetic Score Predictor
[CVPR 2024 Highlight] Style Injection in Diffusion: A Training-free Approach for Adapting Large-scale Diffusion Models for Style Transfer
A simple standalone viewer for reading prompts from Stable Diffusion generated image outside the webui.
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combining with Dreambooth, achieving stunning videos. PIA, your personalized image animator, uses text prompts to turn images into wonderful animations.
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.
Official code for 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
Unofficial implementation of Layer Diffuse in diffusers
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
Transparent Image Layer Diffusion using Latent Transparency
Official Implementation of Rectified Flow (ICLR2023 Spotlight)
Open-Sora: Democratizing Efficient Video Production for All
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.