Skip to content
View fcchit's full-sized avatar
  • Zhejiang University, Harbin Institute of Technology
  • Shanghai

Block or report fcchit

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
JavaScript 2,388 846 Updated Jun 21, 2024

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

Python 2,509 306 Updated Sep 23, 2024

[CVPR'24] DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation

Python 114 10 Updated Apr 30, 2024

[NeurIPS 2024] Official implementation of MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection.

Python 86 Updated Sep 26, 2024

[ICRA23] Efficient Implicit Neural Reconstruction Using LiDAR

Python 76 7 Updated Aug 28, 2023

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python 13,668 2,039 Updated Jul 24, 2024

[NeurIPS 2023] FineMoGen: Fine-Grained Spatio-Temporal Motion Generation and Editing

Python 105 8 Updated Dec 17, 2023
Python 2 Updated Jan 13, 2024

Large Motion Model for Unified Multi-Modal Motion Generation

164 1 Updated Apr 2, 2024

Towards Variable and Coordinated Holistic Co-Speech Motion Generation, CVPR 2024

Python 44 1 Updated Jun 27, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,189 278 Updated May 4, 2024

A deep neural network that directly reconstructs the motion of a 3D human skeleton from monocular video [ToG 2020]

Python 562 84 Updated Apr 28, 2022

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 2,868 185 Updated Sep 19, 2024

[CVPR 2022] Code for "Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation"

Python 127 9 Updated Mar 16, 2023

MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model

Python 838 74 Updated Jul 19, 2024

Freetalker: Controllable Speech and Text-Driven Gesture Generation Based on Diffusion Models for Enhanced Speaker Naturalness (ICASSP 2024)

Python 61 9 Updated Feb 20, 2024

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019)

Python 2,182 444 Updated Jan 4, 2024

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

Jupyter Notebook 8,920 842 Updated Aug 14, 2024

We present MocapNET, a real-time method that estimates the 3D human pose directly in the popular Bio Vision Hierarchy (BVH) format, given estimations of the 2D body joints originating from monocula…

C++ 846 135 Updated Sep 26, 2024

[CVPR 2024] Official Implementation of "Seamless Human Motion Composition with Blended Positional Encodings".

Python 200 8 Updated May 22, 2024

This repository contains an example script to convert from a SMPL model to a bvh file.

Python 138 14 Updated Jun 9, 2023

SMPL-X

Python 1,807 306 Updated Aug 12, 2024

Pytorch implementation of our paper MaxQ: Multi-Axis Query for N:M Sparsity Network accepted by CVPR 2024.

Python 33 Updated Mar 12, 2024

Denoising Diffusion Probabilistic Models

Python 3,700 363 Updated Aug 29, 2023

[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs

Python 1,462 91 Updated Apr 3, 2024

Official implementation of "TM2T: Stochastic and Tokenized Modeling for the Reciprocal Generation of 3D Human Motions and Texts (ECCV2022)"

Python 104 14 Updated Aug 18, 2024

Resource, Evaluation and Detection Papers for ChatGPT

452 24 Updated Mar 21, 2024

[ECCV2024] Event-Based Motion Magnification

Python 45 2 Updated Jul 4, 2024

Speech Gesture Generation from the Trimodal Context of Text, Audio, and Speaker Identity (SIGGRAPH Asia 2020)

Python 244 35 Updated Dec 14, 2021

Erase specific content from the video that you don't wanna see

Python 267 70 Updated Apr 12, 2023