  • Nanyang Technological University
  • Singapore


✨✨Latest Advances on Multimodal Large Language Models

12,571 803 Updated Nov 10, 2024
Python 138 12 Updated Jul 9, 2024

A visual editor for manually annotating facial landmarks in images of human faces.

C++ 213 56 Updated Nov 7, 2018

🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022

Jupyter Notebook 8,070 856 Updated Jul 26, 2024

📖 A curated list of resources dedicated to talking face.

1,325 111 Updated Nov 3, 2024

TRACER: Extreme Attention Guided Salient Object Tracing Network (AAAI 2022) implementation in PyTorch

Python 195 41 Updated Sep 11, 2024

A collection of datasets for the purpose of emotion recognition/detection in speech.

HTML 292 40 Updated Sep 30, 2024

Code for Cross-Modality and Within-Modality Regularization for Audio-Visual DeepFake Detection

Python 20 2 Updated Apr 6, 2024

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 10,921 1,055 Updated Aug 15, 2024

Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).

Python 13,502 3,402 Updated Sep 20, 2024

Keypoint annotation tool | Landmark-Annotation

Python 13 2 Updated Jan 2, 2024

Labeling tool with SAM (Segment Anything Model); supports SAM, SAM2, sam-hq, MobileSAM, EdgeSAM, etc. An interactive semi-automatic image annotation tool.

Python 1,290 136 Updated Nov 8, 2024

Download and preprocess voxceleb datasets.

Python 20 4 Updated May 28, 2024

[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 11,954 2,225 Updated Jun 26, 2024

Code for UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning (ACL 2023)

Python 32 4 Updated May 31, 2024

A self-supervised learning framework for audio-visual speech

Python 847 136 Updated Dec 7, 2023

PyTorch implementation of the paper "One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing"

Python 794 143 Updated Apr 19, 2022

iCartoonFace dataset, and baseline approaches, the project is supported by iQIYI

276 18 Updated Jun 25, 2021

Source code for the paper "Landmark Detection and 3D Face Reconstruction for Caricature using a Nonlinear Parametric Model".

Python 578 109 Updated Oct 3, 2023

Papers, repository and other data about anime or manga research. Please let me know if you have information that the list does not include.

1,073 68 Updated Oct 7, 2024

Official implementation of "AnimeCeleb: Large-Scale Animation CelebHeads Dataset for Head Reenactment" (ECCV 2022)

Python 120 3 Updated Sep 20, 2024

GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

Python 2,546 295 Updated Oct 18, 2024

An unofficial inversion code of eg3d.

Jupyter Notebook 109 12 Updated Apr 21, 2023

[CVPR 2023] 3D-Aware Face Swapping

Python 75 5 Updated Aug 24, 2023

[CVPR 2023 Highlight] Next3D: Generative Neural Texture Rasterization for 3D-Aware Head Avatars

Python 471 29 Updated Oct 13, 2024
Python 3,233 362 Updated Jun 10, 2023

Code Repository for CVPR 2023 Paper "PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360 degree"

Python 1,914 238 Updated Feb 5, 2024

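A repository for pretraining from scratch and SFT of a small-parameter Chinese LLaMa2; a single 24 GB GPU is enough to obtain a chat-llama2 with basic Chinese question-answering ability.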

Python 2,528 310 Updated May 21, 2024

[arXiv22] Disentangled Representation Learning for Text-Video Retrieval

Python 91 5 Updated Apr 7, 2022