Skip to content
View rikabi89's full-sized avatar

Block or report rikabi89

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🤢 LipSick: Fast, High Quality, Low Resource Lipsync Tool 🤮

Python 172 26 Updated Jul 16, 2024

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 7,610 744 Updated Jun 24, 2024

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

Python 2,945 262 Updated Oct 22, 2024

VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design

Jupyter Notebook 487 52 Updated Sep 11, 2023

Focus on prompting and generating

Python 41,074 5,769 Updated Aug 21, 2024

An unofficial PyTorch implementation of VALL-E

Python 73 6 Updated Oct 26, 2024
Python 284 46 Updated Oct 23, 2024

Versatile audio super resolution (any -> 48kHz) with AudioSR.

Python 1,138 110 Updated May 10, 2024

An optimized pipeline for DINet reducing inference latency for up to 60% 🚀. Kudos for the authors of the original repo for this amazing work.

Python 100 17 Updated Aug 26, 2023

Evolved Fork of roop with Web Server and lots of additions

Python 2,215 514 Updated Oct 9, 2024

Easy tool that splits given audio based on speaker.

Python 11 1 Updated Jan 8, 2024

(discontinued) AudioSlicer (Editor) for ai-voice-cloning by mrq

Python 5 Updated Jun 15, 2023

Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance.

562 31 Updated Jun 19, 2023

Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.

Python 1,125 103 Updated Aug 9, 2024

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,693 3,451 Updated May 18, 2024

Full GUI Version

Jupyter Notebook 30 1 Updated May 5, 2023

Auto1111 extension implementing text2video diffusion models (like ModelScope or VideoCrafter) using only Auto1111 webui dependencies

Python 1,285 107 Updated Jul 14, 2024

[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model

Python 1,758 192 Updated Mar 15, 2024
Python 507 50 Updated Dec 26, 2023

Audio datasets, easier.

Python 82 24 Updated Aug 19, 2023

TorToiSe fine-tuning with DLAS

Python 215 103 Updated Aug 1, 2024

[CVPR 2023] MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation

Python 535 41 Updated May 21, 2023

Bringing Old Photo Back to Life (CVPR 2020 oral)

Python 15,075 1,990 Updated Oct 26, 2023

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Python 10,631 2,269 Updated Sep 24, 2024

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Python 28,211 3,543 Updated Aug 6, 2024

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Python 15,708 3,305 Updated Oct 9, 2024
Jupyter Notebook 2,426 451 Updated Dec 16, 2023
Next