Stars
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
An open-source, Vercel-like deployment platform for ComfyUI
AuraSR: GAN-based Super-Resolution for real-world
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
Inference and training library for high-quality TTS models.
TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS)
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
A ComfyUI plugin for generating word cloud images
G-code generator for 3D printers (Bambu, Prusa, Voron, VzBot, RatRig, Creality, etc.)
Official Code for Stable Cascade
🤖 Build voice-based LLM agents. Modular + open source.
Code and dataset for photorealistic Codec Avatars driven from audio
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild
WebUI extension for ControlNet
Unofficial implementation of AnyText for ComfyUI (EXP)
[ECCV2024] Pixel-Aware Stable Diffusion for Realistic Image Super-Resolution and Personalized Stylization
Implementation of MeshGPT, SOTA Mesh generation using Attention, in Pytorch
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
AI powered speech denoising and enhancement
VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
[arXiv 2023] Sketch Video Synthesis
[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction