Skip to content
View zhang0jhon's full-sized avatar

Block or report zhang0jhon

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
317 results for source starred repositories
Clear filter

Official inference repo for FLUX.1 models

Python 14,563 1,049 Updated Oct 8, 2024

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Python 1,627 77 Updated Aug 5, 2024

SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.

Python 4,275 377 Updated Jul 30, 2024

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Python 4,307 224 Updated Jun 14, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 21,773 2,111 Updated Aug 9, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 6,086 540 Updated May 31, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 11,398 976 Updated Oct 5, 2024

Utilities intended for use with Llama models.

Python 4,354 768 Updated Oct 4, 2024

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 14,888 1,379 Updated Sep 5, 2024

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 6,449 663 Updated Aug 12, 2024

Collection of AWESOME vision-language models for vision tasks

2,289 206 Updated Oct 8, 2024
Python 2 1 Updated May 28, 2024

A collection of resources and papers on Diffusion Models

HTML 10,850 934 Updated Aug 1, 2024

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

Python 1,193 50 Updated Oct 7, 2024

EVA Series: Visual Representation Fantasies from BAAI

Python 2,244 165 Updated Aug 1, 2024

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 6,983 439 Updated Sep 28, 2024

[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

Python 545 57 Updated Jun 7, 2024

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 2,873 188 Updated Sep 19, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,188 277 Updated May 4, 2024

The official Meta Llama 3 GitHub site

Python 26,513 2,997 Updated Aug 12, 2024
Python 8,342 486 Updated Jan 27, 2024

Grok open release

Python 49,464 8,323 Updated Aug 30, 2024

Open-source and strong foundation image recognition models.

Jupyter Notebook 2,778 271 Updated Aug 1, 2024

[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Python 2,172 130 Updated Aug 29, 2024

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 35,043 5,195 Updated Aug 29, 2024

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 52,329 8,753 Updated Aug 14, 2024

Industry leading face manipulation platform

Python 18,599 2,805 Updated Oct 7, 2024

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Python 10,403 2,231 Updated Sep 24, 2024

Character Animation (AnimateAnyone, Face Reenactment)

Python 3,106 241 Updated May 31, 2024

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

7,444 901 Updated Aug 21, 2024
Next