
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook 14,101 1,304 Updated May 23, 2024

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 5,596 590 Updated Jun 28, 2024

Collection of AWESOME vision-language models for vision tasks

1,975 180 Updated May 27, 2024
Python 2 1 Updated May 28, 2024

A collection of resources and papers on Diffusion Models

HTML 10,400 906 Updated Jun 29, 2024

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

Python 1,108 48 Updated Jun 26, 2024

EVA Series: Visual Representation Fantasies from BAAI

Python 2,087 148 Updated Jun 4, 2024

MiniCPM-2B: An end-side LLM outperforming Llama2-13B.

Python 4,396 319 Updated Jul 4, 2024

[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

Python 478 56 Updated Jun 7, 2024

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 2,566 161 Updated Jun 25, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,076 274 Updated May 4, 2024

The official Meta Llama 3 GitHub site

Python 22,889 2,411 Updated Jul 3, 2024
Python 8,196 478 Updated Jan 27, 2024

Grok open release

Python 49,149 8,310 Updated May 29, 2024

Open-source and strong foundation image recognition models.

Jupyter Notebook 2,575 244 Updated Jun 12, 2024

[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Python 1,983 120 Updated Jun 25, 2024

🚀 AI voice cloning: clone your voice within 5 seconds and generate arbitrary speech content. Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 34,455 5,142 Updated Jul 6, 2024

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 51,492 8,636 Updated Jul 5, 2024

Next generation face swapper and enhancer

Python 16,447 2,408 Updated Jul 6, 2024

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Python 9,731 2,108 Updated May 29, 2024

Character Animation (AnimateAnyone, Face Reenactment)

Python 2,918 228 Updated May 31, 2024

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

7,178 844 Updated Jun 17, 2024

[CVPR2024] DisCo: Referring Human Dance Generation in Real World

Python 997 108 Updated Apr 10, 2024

Fine-Grained Open Domain Image Animation with Motion Guidance

Python 634 52 Updated Jul 4, 2024

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Python 4,296 318 Updated May 17, 2024

[ECCV 2024] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Python 2,063 161 Updated Jul 2, 2024

[ICLR 22] Latent Image Animator: Learning to Animate Images via Latent Space Navigation

Python 578 64 Updated Nov 10, 2023

Animating Arbitrary Objects via Deep Motion Transfer

Python 465 83 Updated Nov 22, 2022

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Python 2,549 258 Updated Jun 2, 2024

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 41,458 4,943 Updated Jul 6, 2024