Skip to content
View visionjo's full-sized avatar

Highlights

  • Pro
Block or Report

Block or report visionjo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results

A best practice for deep learning project template architecture.

Python 1,243 292 Updated Apr 12, 2019

SOTA Re-identification Methods and Toolbox

Python 3,346 827 Updated Jul 15, 2024

Official implementation of AnimateDiff.

Python 9,916 811 Updated Jul 20, 2024

This is the official repository for DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer

Python 123 15 Updated Mar 31, 2024

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

7,250 872 Updated Jun 17, 2024

Code and dataset for photorealistic Codec Avatars driven from audio

Python 2,627 245 Updated Jun 24, 2024

PantoMatrix: Co-Speech Talking Head and Gestures Generation

Python 878 159 Updated Jul 7, 2024

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Python 6,154 911 Updated Apr 1, 2024

[WACV 2024] "CVTHead: One-shot Controllable Head Avatar with Vertex-feature Transformer"

Python 81 7 Updated Feb 16, 2024

Demo programs for the Talking Head Anime from a Single Image 2: More Expressive project.

Python 1,118 154 Updated Jun 29, 2022

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 9,907 845 Updated Jul 6, 2024

Python bindings for llama.cpp

Python 7,275 866 Updated Jul 24, 2024

Real time transcription with OpenAI Whisper.

Python 2,112 363 Updated Jun 1, 2024

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

Python 1,305 115 Updated Jul 19, 2024

Whisper realtime streaming for long speech-to-text transcription and translation

Python 1,505 187 Updated Jun 6, 2024

Demo Programs for the "Talking Head(?) Anime from a Single Image 3: Now the Body Too" Project

Python 913 97 Updated Aug 29, 2023

A Flow-based Generative Network for Speech Synthesis

Python 2,250 528 Updated Oct 19, 2023

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Jupyter Notebook 4,988 1,374 Updated Jun 12, 2024

Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, Video Generation. Open Source, Local & Free.

Python 793 91 Updated Jul 16, 2024

Real-Time Lip Sync for Live 2D Animation

126 16 Updated Nov 7, 2019

A lip-syncing program written in python

Python 42 7 Updated Jan 10, 2014

Demo for the "Talking Head Anime from a Single Image."

Python 1,987 286 Updated Jun 29, 2022
Jupyter Notebook 946 215 Updated Mar 20, 2024

Rhubarb Lip Sync is a command-line tool that automatically creates 2D mouth animation from voice recordings. You can use it for characters in computer games, in animated cartoons, or in any other p…

C++ 1,742 209 Updated Jan 11, 2024

Papagayo is a lip-syncing program designed to help you line up phonemes (mouth shapes) with the actual recorded sound of actors speaking. Papagayo makes it easy to lip sync animated characters by m…

Python 233 51 Updated Apr 25, 2023

A curated list of references for MLOps

12,309 1,857 Updated Jun 10, 2024

Test your prompts, agents, and RAGs. Redteaming, pentesting, vulnerability scanning for LLMs. Improve your app's quality and catch problems. Compare performance of GPT, Claude, Gemini, Llama, and m…

TypeScript 3,680 261 Updated Jul 24, 2024

The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."

Python 911 168 Updated Sep 25, 2023

Repo for our Paper: Explainable Model-Agnostic Similarity and Confidence in Face Verification

Python 16 2 Updated Sep 28, 2023

Template for model cards

17 8 Updated Jan 24, 2023
Next