Stars
Official inference repo for FLUX.1 models
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Open-Sora: Democratizing Efficient Video Production for All
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Utilities intended for use with Llama models.
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Collection of AWESOME vision-language models for vision tasks
A collection of resources and papers on Diffusion Models
ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering
EVA Series: Visual Representation Fantasies from BAAI
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
Strong, open-source foundation models for image recognition.
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
🚀 AI voice cloning: Clone a voice in 5 seconds to generate arbitrary speech in real-time
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Industry leading face manipulation platform
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For an HD commercial model, please try out Sync Labs.
Character Animation (AnimateAnyone, Face Reenactment)
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions