![json logo](https://raw.githubusercontent.com/github/explore/80688e429a7d4ef2fca1e82350fe8e3517d3494d/topics/json/json.png)
Highlights
- Pro
Block or Report
Block or report visionjo
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLists (2)
Sort Name ascending (A-Z)
Language
Sort by: Recently starred
Starred repositories
A best practice for deep learning project template architecture.
SOTA Re-identification Methods and Toolbox
This is the official repository for DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Code and dataset for photorealistic Codec Avatars driven from audio
PantoMatrix: Co-Speech Talking Head and Gestures Generation
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
[WACV 2024] "CVTHead: One-shot Controllable Head Avatar with Vertex-feature Transformer"
Demo programs for the Talking Head Anime from a Single Image 2: More Expressive project.
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Real time transcription with OpenAI Whisper.
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
Whisper realtime streaming for long speech-to-text transcription and translation
Demo Programs for the "Talking Head(?) Anime from a Single Image 3: Now the Body Too" Project
A Flow-based Generative Network for Speech Synthesis
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, Video Generation. Open Source, Local & Free.
Demo for the "Talking Head Anime from a Single Image."
yzhou359 / MakeItTalk
Forked from adobe-research/MakeItTalkRhubarb Lip Sync is a command-line tool that automatically creates 2D mouth animation from voice recordings. You can use it for characters in computer games, in animated cartoons, or in any other p…
Papagayo is a lip-syncing program designed to help you line up phonemes (mouth shapes) with the actual recorded sound of actors speaking. Papagayo makes it easy to lip sync animated characters by m…
Test your prompts, agents, and RAGs. Redteaming, pentesting, vulnerability scanning for LLMs. Improve your app's quality and catch problems. Compare performance of GPT, Claude, Gemini, Llama, and m…
The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."
Repo for our Paper: Explainable Model-Agnostic Similarity and Confidence in Face Verification
Template for model cards