Xingyi Zhou xingyizhou

🕊️

Research Scientist at Google Research

1.9k followers · 119 following

Google
Seattle
xingyizhou.xyz

Stars

facebookresearch / segment-anything-2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 6,847 338 Updated Aug 1, 2024

test-time-training / ttt-lm-jax

Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 310 22 Updated Jul 25, 2024

cambrian-mllm / cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,617 99 Updated Jul 26, 2024

XavierXiao / Dreambooth-Stable-Diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion

Jupyter Notebook 7,538 788 Updated Dec 8, 2022

LLaVA-VL / LLaVA-NeXT

Python 1,414 78 Updated Jul 29, 2024

dengxl0520 / MemSAM

[CVPR 2024 Oral] MemSAM: Taming Segment Anything Model for Echocardiography Video Segmentation.

Python 105 9 Updated Aug 1, 2024

ytongbai / LVM

Python 1,715 53 Updated Jun 28, 2024

EvolvingLMMs-Lab / lmms-eval

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Python 1,170 72 Updated Jul 30, 2024

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 18,475 2,027 Updated Jul 31, 2024

zhaoyue-zephyrus / bsq-vit

[BSQ-ViT] Image and Video Tokenization with Binary Spherical Quantization

Python 68 Updated Jun 12, 2024

FoundationVision / LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,100 39 Updated Jul 14, 2024

tianweiy / DMD2

Python 368 21 Updated Jul 10, 2024

syp2ysy / VRP-SAM

[CVPR 2024] Official implementation of "VRP-SAM: SAM with Visual Reference Prompt"

Python 64 7 Updated Jul 20, 2024

baaivision / tokenize-anything

[ECCV 2024] Tokenize Anything via Prompting

Jupyter Notebook 487 19 Updated Jul 4, 2024

rom1504 / img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 3,491 329 Updated Jun 16, 2024

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 25,081 2,756 Updated Jul 31, 2024

unslothai / unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 13,471 890 Updated Aug 1, 2024

sczhou / ProPainter

[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting

Python 5,239 620 Updated Apr 17, 2024

FoundationVision / VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …

Python 3,897 295 Updated Jul 16, 2024

FoundationVision / GLEE

[CVPR 2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale

Python 989 80 Updated Jul 26, 2024

geekyutao / Inpaint-Anything

Inpaint anything using Segment Anything and inpainting models.

Jupyter Notebook 6,010 503 Updated Feb 29, 2024

BAAI-DCAI / Bunny

A family of lightweight multimodal models.

Python 830 64 Updated Jul 31, 2024

google-deepmind / gemma

Open weights LLM from Google DeepMind.

Python 2,290 282 Updated Jul 30, 2024

google-research / magvit

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Python 916 44 Updated Jan 17, 2024

LiheYoung / Depth-Anything

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Python 6,560 504 Updated Jul 17, 2024

lxtGH / OMG-Seg

OMG-LLaVA and OMG-Seg codebase

Python 1,174 45 Updated Jul 29, 2024

hkchengrex / XMem

[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model

Python 1,682 184 Updated Mar 15, 2024

google / maxtext

A simple, performant and scalable Jax LLM!

Python 1,390 251 Updated Aug 1, 2024

google-research / big_vision

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 2,079 142 Updated Jul 12, 2024

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. A commercially usable open-source multimodal chat model approaching GPT-4o performance.

Python 4,544 352 Updated Aug 1, 2024
