Stars
MooER: Open-sourced LLM for audio understanding trained on 80,000 hours of data
A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。
UI-Lovelace-Minimalist is a "theme" for HomeAssistant
citruz / haos-rockpi
Forked from home-assistant/operating-systemHome Assistant OS for Rock Pi 4
This codebase demonstrates how to synthesize realistic 3D character animations given an arbitrary speech signal and a static character mesh.
Play ChatGPT and other LLM with Xiaomi AI Speaker
A multi-voice TTS system trained with an emphasis on quality
Image-to-Image Translation in PyTorch
This github contains the network architectures of NeuralVoicePuppetry.
CVPR2023 talking face implementation for Identity-Preserving Talking Face Generation With Landmark and Appearance Priors
PyTorch implementation of "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"
PyTorch Implementation for Paper "Emotionally Enhanced Talking Face Generation" (ICCVW'23 and ACM-MMW'23)
This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".
FACIAL: Synthesizing Dynamic Talking Face With Implicit Attribute Learning. ICCV, 2021.
Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)
Official project repo for paper "Speech Driven Video Editing via an Audio-Conditioned Diffusion Model"
Code for "Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose" (Arxiv 2020) and "Predicting Personalized Head Movement From Short Video and Speech Signal" (TMM 2022)
GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code
📖 A curated list of resources dedicated to talking face.
A curated list of resources of audio-driven talking face generation
Basic GAN frameworks and approaches for face swap, reenactment, and stylizing.
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
The FastLED library for colored LED animation on Arduino. Please direct questions/requests for help to the FastLED Reddit community: https://fastled.io/r We'd like to use github "issues" just for tr…
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
Command line utility for forced alignment using Kaldi
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
Production First and Production Ready End-to-End Text-to-Speech Toolkit