Stars
✨✨Latest Advances on Multimodal Large Language Models
DiffPoseTalk: Speech-Driven Stylistic 3D Facial Animation and Head Pose Generation via Diffusion Models
[AAAI 2024] O2-Recon: Completing 3D Reconstruction of Occluded Objects in the Scene with a Pre-trained 2D Diffusion Model
T3Bench: Benchmarking Current Progress in Text-to-3D Generation
Official code for "HumanRF: High-Fidelity Neural Radiance Fields for Humans in Motion"
Official implementation of arxiv paper "4K-NeRF: High Fidelity Neural Radiance Fields at Ultra High Resolutions"
Code for paper 'EAMM: One-Shot Emotional Talking Face via Audio-Based Emotion-Aware Motion Model'
This is the PyTorch implementation of the Siggraph 2023 paper "Efficient Video Portrait Reenactment via Grid-based Codebook"
This is the official source for our ICCV 2023 paper "EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation"
UnifiedGesture: A Unified Gesture Synthesis Model for Multiple Skeletons (ACM MM 2023 Oral)
[ICCV-2023] Official code for work "HumanMAC: Masked Motion Completion for Human Motion Prediction".
DiffuseStyleGesture: Stylized Audio-Driven Co-Speech Gesture Generation with Diffusion Models (IJCAI 2023) | The DiffuseStyleGesture+ entry to the GENEA Challenge 2023 (ICMI 2023, Reproducibility A…
The personal repository of the work: *DanceNet3D: Music Based Dance Generation with Parametric Motion Transformer*.
Official PyTorch Implementation of EDGE (CVPR 2023)
Official PyTorch implementation of the paper "A Brand New Dance Partner:Music-Conditioned Pluralistic Dancing Synthesized by Multiple Dance Genres", CVPR 2022
Code for CVPR 2022 paper "Bailando: 3D dance generation via Actor-Critic GPT with Choreographic Memory"
[CVPR 2022 Oral] Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry
ICRA 2021 "Towards Precise and Efficient Image Guided Depth Completion"
SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes
Dense Depth Priors for Neural Radiance Fields from Sparse Input Views
Official code for CVPR 2023 Paper, HexPlane: A Fast Representation for Dynamic Scenes
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Volume rendering based surface reconstruction using Unsigned Distance Fields
Freeform Body Motion Generation from Speech
SGToolkit: An Interactive Gesture Authoring Toolkit for Embodied Conversational Agents (UIST 2021)
HumanML3D: A large and diverse 3d human motion-language dataset.
Vid2Avatar: 3D Avatar Reconstruction from Videos in the Wild via Self-supervised Scene Decomposition (CVPR2023)