Skip to content
View enjoybo's full-sized avatar

Block or report enjoybo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

High-resolution models for human tasks.

Python 4,277 229 Updated Oct 15, 2024

Code of Pyramidal Flow Matching for Efficient Video Generative Modeling

Python 1,840 155 Updated Oct 17, 2024

📹 A more flexible CogVideoX that can generate videos at any resolution and creates videos from images.

Python 384 26 Updated Oct 18, 2024

PantoMatrix: Co-Speech Talking Head and Gestures Generation

Python 963 168 Updated Jul 7, 2024
34 3 Updated Jan 4, 2024

[CVPR 2024] SiTH: Single-view Textured Human Reconstruction with Image-Conditioned Diffusion

Python 128 10 Updated Aug 15, 2024

Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation

Python 433 23 Updated Sep 16, 2024

[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

Python 900 64 Updated Jan 17, 2024

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 8,117 765 Updated Oct 18, 2024

Towards Variable and Coordinated Holistic Co-Speech Motion Generation, CVPR 2024

Python 45 1 Updated Jun 27, 2024

基于AI的图片/视频硬字幕去除、文本水印去除,无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API,本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.

Python 4,211 546 Updated Oct 13, 2024

This is the official repository for TalkSHOW: Generating Holistic 3D Human Motion from Speech [CVPR2023].

Python 300 26 Updated Nov 1, 2023

[CVPR'24] DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation

Python 122 10 Updated Apr 30, 2024

Video-Infinity generates long videos quickly using multiple GPUs without extra training.

Python 163 15 Updated Aug 4, 2024

MiniSora: A community aims to explore the implementation path and future development direction of Sora.

Python 1,193 149 Updated Oct 8, 2024

Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

Python 6,142 756 Updated Sep 20, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 25,430 3,267 Updated Jul 23, 2024

VideoSys: An easy and efficient system for video generation

Python 1,714 115 Updated Oct 16, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 21,914 2,134 Updated Aug 9, 2024

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Python 4,569 574 Updated Jul 10, 2024

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Python 1,247 96 Updated Oct 18, 2024

Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Python 2,689 315 Updated Aug 15, 2024

3D Gaussian Splat Editor

TypeScript 1,359 129 Updated Oct 17, 2024

HaMeR: Reconstructing Hands in 3D with Transformers

Python 381 37 Updated Oct 11, 2024

4DHumans: Reconstructing and Tracking Humans with Transformers

Python 1,219 117 Updated May 17, 2024

[3DV'24] GAN-Avatar: Controllable Personalized GAN-based Human Head Avatar

Python 69 2 Updated May 18, 2024

This is the code for siggrapha paper "An Implicit Neural Representation for the Image Stack: Depth, All in Focus, and High Dynamic Range"

Python 8 1 Updated Mar 19, 2024

TiNO-Edit: Timestep and Noise Optimization for Robust Diffusion-Based Image Editing (CVPR 2024)

Python 19 Updated Jul 7, 2024
Python 176 4 Updated Jul 15, 2024

Understand Human Behavior to Align True Needs

Python 3,373 297 Updated Jul 20, 2024
Next