View xingyizhou's full-sized avatar

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 6,847 338 Updated Aug 1, 2024

Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 310 22 Updated Jul 25, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,617 99 Updated Jul 26, 2024

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion

Jupyter Notebook 7,538 788 Updated Dec 8, 2022
Python 1,414 78 Updated Jul 29, 2024

[CVPR 2024 Oral] MemSAM: Taming Segment Anything Model for Echocardiography Video Segmentation.

Python 105 9 Updated Aug 1, 2024
Python 1,715 53 Updated Jun 28, 2024

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Python 1,170 72 Updated Jul 30, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 18,475 2,027 Updated Jul 31, 2024

[BSQ-ViT] Image and Video Tokenization with Binary Spherical Quantization

Python 68 Updated Jun 12, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,100 39 Updated Jul 14, 2024
Python 368 21 Updated Jul 10, 2024

[CVPR 2024] Official implementation of "VRP-SAM: SAM with Visual Reference Prompt"

Python 64 7 Updated Jul 20, 2024

[ECCV 2024] Tokenize Anything via Prompting

Jupyter Notebook 487 19 Updated Jul 4, 2024

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 3,491 329 Updated Jun 16, 2024

The official Meta Llama 3 GitHub site

Python 25,081 2,756 Updated Jul 31, 2024

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 13,471 890 Updated Aug 1, 2024

[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting

Python 5,239 620 Updated Apr 17, 2024

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …

Python 3,897 295 Updated Jul 16, 2024

[CVPR 2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale

Python 989 80 Updated Jul 26, 2024

Inpaint anything using Segment Anything and inpainting models.

Jupyter Notebook 6,010 503 Updated Feb 29, 2024

A family of lightweight multimodal models.

Python 830 64 Updated Jul 31, 2024

Open weights LLM from Google DeepMind.

Python 2,290 282 Updated Jul 30, 2024

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Python 916 44 Updated Jan 17, 2024

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Python 6,560 504 Updated Jul 17, 2024

OMG-LLaVA and OMG-Seg codebase

Python 1,174 45 Updated Jul 29, 2024

[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model

Python 1,682 184 Updated Mar 15, 2024

A simple, performant and scalable Jax LLM!

Python 1,390 251 Updated Aug 1, 2024

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 2,079 142 Updated Jul 12, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. A commercially usable open-source multimodal chat model approaching GPT-4o performance.

Python 4,544 352 Updated Aug 1, 2024