songcheng

songcheng

7 followers · 192 following

Achievements

Lists (5)

Sort

diffusion

face

llm

speech

tools

Beta Lists are currently in beta. Share feedback and report bugs.

Stars

xiaozhuai / imageinfo

Free Palestine🇵🇸🇵🇸🇵🇸Cross platform super fast single header c++ library to get image size and format without loading/decoding. Support avif, bmp, cur, dds, gif, hdr (pic), heic (heif), icns, ico, j…

C++ 109 27 Updated Aug 5, 2024

Sanster / seemore

Python 5 Updated Oct 31, 2024

lucidrains / transfusion-pytorch

Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 676 24 Updated Oct 31, 2024

qTipTip / Pylette

A Python library for extracting color palettes from supplied images.

Python 106 11 Updated Oct 8, 2024

yisol / IDM-VTON

[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild

Python 3,876 608 Updated Jul 30, 2024

cure-lab / MotionCraft

Official repo for paper "MotionCraft: Crafting Whole-Body Motion with Plug-and-Play Multimodal Controls"

Python 41 1 Updated Sep 20, 2024

snap-research / GenAU

Jupyter Notebook 9 Updated Oct 25, 2024

KelianB / SPARK

Official implementation for the SIGGRAPH Asia 2024 paper SPARK: Self-supervised Personalized Real-time Monocular Face Capture

276 7 Updated Sep 13, 2024

anliyuan / Ultralight-Digital-Human

一个超轻量级、可以在移动端实时运行的数字人模型

Python 764 124 Updated Nov 4, 2024

wangxuanx / TalkingStyle

The official pytorch code for TalkingStyle: Personalized Speech-Driven Facial Animation with Style Preservation

Python 15 3 Updated Jul 3, 2024

yerfor / MimicTalk

MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes; NeurIPS 2024; Official code

Python 331 32 Updated Oct 16, 2024

rongakowang / MMDMC

[ECCV 2024] Towards High-Quality 3D Motion Transfer with Realistic Apparel Animation - MMDMC Dataset

Python 51 4 Updated Oct 21, 2024

XingliangJin / MCM-LDM

[CVPR 2024] Arbitrary Motion Style Transfer with Multi-condition Motion Latent Diffusion Model

Python 37 7 Updated Oct 30, 2024

FunAudioLLM / SenseVoice

Multilingual Voice Understanding Model

Python 3,305 305 Updated Oct 18, 2024

Boeun-Kim / MoST

Official implementation of "MoST: Motion Style Transformer between Diverse Action Contents"

Python 28 2 Updated Jun 26, 2024

cheind / py-bilateral-filter

Vectorized Bilateral Filter in Python using Numpy

Python 1 Updated Oct 20, 2024

getomni-ai / zerox

Zero shot pdf OCR with gpt-4o-mini

Python 5,794 308 Updated Nov 4, 2024

fudan-generative-vision / hallo2

Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation

Python 3,382 469 Updated Oct 28, 2024

NVlabs / Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

515 10 Updated Nov 4, 2024

linyqh / NarratoAI

利用AI大模型，一键解说并剪辑视频； Using AI models to automatically provide commentary and edit videos with a single click.

Python 1,790 211 Updated Nov 3, 2024

tvaranka / fineface

Towards Localized Fine-Grained Control for Facial Expression Generation

Python 57 1 Updated Aug 18, 2024

usefulsensors / moonshine

Fast and accurate automatic speech recognition (ASR) for edge devices

Python 1,937 72 Updated Nov 2, 2024

yunik1004 / SAiD

SAiD: Blendshape-based Audio-Driven Speech Animation with Diffusion

Python 89 17 Updated Jan 25, 2024

Coral79 / Unimotion

Pytorch implementation of Unimotion: Unifying 3D Human Motion Synthesis and Understanding.

26 Updated Oct 9, 2024

MahmoudAshraf97 / whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook 3,650 323 Updated Oct 27, 2024

NVlabs / ProtoMotions

Python 321 22 Updated Nov 3, 2024

hpc203 / liveportrait-onnxrun

使用onnxruntime部署LivePortrait人像动画生成，包含C++和Python两个版本的程序

C++ 18 4 Updated Aug 5, 2024

modelscope / ms-swift

Use PEFT or Full-parameter to finetune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-V…

Python 4,085 363 Updated Nov 4, 2024

kyutai-labs / moshi

Python 6,615 504 Updated Oct 31, 2024

LordLiang / DrawingSpinUp

(SIGGRAPH Asia 2024) This is the official PyTorch implementation of SIGGRAPH Asia 2024 paper: DrawingSpinUp: 3D Animation from Single Character Drawings

Python 545 50 Updated Oct 30, 2024