Official code for DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior

Python 3,384 289 Updated Jul 3, 2024

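[Three Years of Interviews, Five Years of Mock Exams] An algorithm engineer's handbook. Covers interview and written-test experience and practical knowledge from across the AI industry, including AIGC, traditional deep learning, autonomous driving, machine learning, computer vision, natural language processing, SLAM, embodied intelligence, the metaverse, and AGI.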

794 118 Updated Nov 15, 2024

Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation

Python 4,078 582 Updated Nov 6, 2024

An ultra-lightweight digital human model that can run in real time on mobile devices

Python 954 151 Updated Nov 13, 2024

Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities.

Python 1,554 185 Updated Nov 6, 2024

SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling

Python 787 43 Updated Oct 23, 2024

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,558 172 Updated Nov 14, 2024

GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

Python 2,551 295 Updated Oct 18, 2024

[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"

Python 1,330 161 Updated Aug 28, 2024

Open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.

Python 3,084 277 Updated Nov 5, 2024

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code

Python 947 109 Updated Oct 18, 2024

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 10,024 862 Updated Jul 6, 2024

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

7,500 913 Updated Aug 21, 2024

Strong and Open Vision Language Assistant for Mobile Devices

Python 1,039 66 Updated Apr 15, 2024

Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)

Python 925 168 Updated Jan 6, 2024

[CVPR2023] The implementation for "DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation"

Python 454 42 Updated Jul 15, 2024

Official repository for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation

Python 468 33 Updated Apr 15, 2024

ModelScope: bring the notion of Model-as-a-Service to life.

Python 7,009 719 Updated Nov 14, 2024

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

Jupyter Notebook 9,034 850 Updated Nov 4, 2024

MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting

Python 2,832 354 Updated Nov 15, 2024

HeadGAN - Official PyTorch Implementation (ICCV 2021)

Python 70 6 Updated Aug 4, 2023

This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For an HD commercial model, please try out Sync Labs.

Python 10,790 2,293 Updated Oct 30, 2024

MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation [ECCV2020]

Python 246 28 Updated Jul 7, 2024

The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."

Python 992 175 Updated Sep 25, 2023

CVPR2023 talking face implementation for Identity-Preserving Talking Face Generation With Landmark and Appearance Priors

Python 693 79 Updated Jan 6, 2024

A digital human that everyone can use

Python 693 151 Updated Nov 6, 2024

EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Python 2,933 341 Updated Nov 14, 2024

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Python 6,648 977 Updated Aug 5, 2024

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Python 9,483 1,302 Updated Sep 14, 2024

Official implementation of AnimateDiff.

Python 10,586 872 Updated Jul 31, 2024