Lists (5)
Sort Name ascending (A-Z)
Stars
Free Palestine🇵🇸🇵🇸🇵🇸Cross platform super fast single header c++ library to get image size and format without loading/decoding. Support avif, bmp, cur, dds, gif, hdr (pic), heic (heif), icns, ico, j…
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
A Python library for extracting color palettes from supplied images.
[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
Official repo for paper "MotionCraft: Crafting Whole-Body Motion with Plug-and-Play Multimodal Controls"
Official implementation for the SIGGRAPH Asia 2024 paper SPARK: Self-supervised Personalized Real-time Monocular Face Capture
The official pytorch code for TalkingStyle: Personalized Speech-Driven Facial Animation with Style Preservation
MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes; NeurIPS 2024; Official code
[ECCV 2024] Towards High-Quality 3D Motion Transfer with Realistic Apparel Animation - MMDMC Dataset
[CVPR 2024] Arbitrary Motion Style Transfer with Multi-condition Motion Latent Diffusion Model
Multilingual Voice Understanding Model
Official implementation of "MoST: Motion Style Transformer between Diverse Action Contents"
Vectorized Bilateral Filter in Python using Numpy
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
利用AI大模型,一键解说并剪辑视频; Using AI models to automatically provide commentary and edit videos with a single click.
Towards Localized Fine-Grained Control for Facial Expression Generation
Fast and accurate automatic speech recognition (ASR) for edge devices
SAiD: Blendshape-based Audio-Driven Speech Animation with Diffusion
Pytorch implementation of Unimotion: Unifying 3D Human Motion Synthesis and Understanding.
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
使用onnxruntime部署LivePortrait人像动画生成,包含C++和Python两个版本的程序
Use PEFT or Full-parameter to finetune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-V…
(SIGGRAPH Asia 2024) This is the official PyTorch implementation of SIGGRAPH Asia 2024 paper: DrawingSpinUp: 3D Animation from Single Character Drawings