zhang0jhon

zhang0jhon

39 followers · 9 following

Achievements

Block or Report

Block or report zhang0jhon

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Stars

IDEA-Research / Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 14,101 1,304 Updated May 23, 2024

IDEA-Research / GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 5,596 590 Updated Jun 28, 2024

jingyi0000 / VLM_survey

Collection of AWESOME vision-language models for vision tasks

1,975 180 Updated May 27, 2024

zhang0jhon / otamatch

Python 2 1 Updated May 28, 2024

diff-usion / Awesome-Diffusion-Models

A collection of resources and papers on Diffusion Models

HTML 10,400 906 Updated Jun 29, 2024

facebookresearch / MetaCLIP

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

Python 1,108 48 Updated Jun 26, 2024

baaivision / EVA

EVA Series: Visual Representation Fantasies from BAAI

Python 2,087 148 Updated Jun 4, 2024

OpenBMB / MiniCPM

MiniCPM-2B: An end-side LLM outperforming Llama2-13B.

Python 4,396 319 Updated Jul 4, 2024

FoundationVision / Groma

[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

Python 478 56 Updated Jun 7, 2024

hustvl / Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 2,566 161 Updated Jun 25, 2024

dvlab-research / MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,076 274 Updated May 4, 2024

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 22,889 2,411 Updated Jul 3, 2024

apple / ml-ferret

Python 8,196 478 Updated Jan 27, 2024

xai-org / grok-1

Grok open release

Python 49,149 8,310 Updated May 29, 2024

xinyu1205 / recognize-anything

Open-source and strong foundation image recognition models.

Jupyter Notebook 2,575 244 Updated Jun 12, 2024

IDEA-Research / T-Rex

[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Python 1,983 120 Updated Jun 25, 2024

babysor / MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 34,455 5,142 Updated Jul 6, 2024

CorentinJ / Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 51,492 8,636 Updated Jul 5, 2024

facefusion / facefusion

Next generation face swapper and enhancer

Python 16,447 2,408 Updated Jul 6, 2024

Rudrabha / Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Python 9,731 2,108 Updated May 29, 2024

MooreThreads / Moore-AnimateAnyone

Character Animation (AnimateAnyone, Face Reenactment)

Python 2,918 228 Updated May 31, 2024

HumanAIGC / EMO

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

7,178 844 Updated Jun 17, 2024

Wangt-CN / DisCo

[CVPR2024] DisCo: Referring Human Dance Generation in Real World

Python 997 108 Updated Apr 10, 2024

alibaba / animate-anything

Fine-Grained Open Domain Image Animation with Motion Guidance

Python 634 52 Updated Jul 4, 2024

AILab-CVC / VideoCrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Python 4,296 318 Updated May 17, 2024

Doubiiu / DynamiCrafter

[ECCV 2024] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Python 2,063 161 Updated Jul 2, 2024

wyhsirius / LIA

[ICLR 22] Latent Image Animator: Learning to Animate Images via Latent Space Navigation

Python 578 64 Updated Nov 10, 2023

AliaksandrSiarohin / monkey-net

Animating Arbitrary Objects via Deep Motion Transfer

Python 465 83 Updated Nov 22, 2022

dvlab-research / LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Python 2,549 258 Updated Jun 2, 2024

geekan / MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 41,458 4,943 Updated Jul 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

zhang0jhon

Achievements

Achievements

Block or report zhang0jhon

Stars

IDEA-Research / Grounded-Segment-Anything

IDEA-Research / GroundingDINO

jingyi0000 / VLM_survey

zhang0jhon / otamatch

diff-usion / Awesome-Diffusion-Models

facebookresearch / MetaCLIP

baaivision / EVA

OpenBMB / MiniCPM

FoundationVision / Groma

hustvl / Vim

dvlab-research / MGM

meta-llama / llama3

apple / ml-ferret

xai-org / grok-1

xinyu1205 / recognize-anything

IDEA-Research / T-Rex

babysor / MockingBird

CorentinJ / Real-Time-Voice-Cloning

facefusion / facefusion

Rudrabha / Wav2Lip

MooreThreads / Moore-AnimateAnyone

HumanAIGC / EMO

Wangt-CN / DisCo

alibaba / animate-anything

AILab-CVC / VideoCrafter

Doubiiu / DynamiCrafter

wyhsirius / LIA

AliaksandrSiarohin / monkey-net

dvlab-research / LongLoRA

geekan / MetaGPT