Kunchang Li Andy1621

😇

Paper Reading

Ph.D. student at UCAS, intern @OpenGVLab

209 followers · 38 following

UCAS
Shanghai
https://andy1621.github.io/
@likunchang1998

Stars

facebookresearch / segment-anything-2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 8,136 452 Updated Aug 4, 2024

Kwai-Kolors / Kolors

Kolors Team

Python 2,910 166 Updated Aug 1, 2024

facebookresearch / chameleon

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,622 102 Updated Jul 29, 2024

FoundationVision / LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,104 39 Updated Jul 14, 2024

lllyasviel / Omost

Your image is almost there!

Python 7,027 411 Updated Jul 26, 2024

maitrix-org / Pandora

Pandora: Towards General World Model with Natural Language Actions and Video States

Python 438 29 Updated May 27, 2024

yuweihao / MambaOut

MambaOut: Do We Really Need Mamba for Vision?

Python 1,910 30 Updated Jun 6, 2024

imagegridworth / IG-VLM

Python 93 4 Updated Apr 15, 2024

TRI-ML / prismatic-vlms

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

Python 374 139 Updated Jul 4, 2024

FoundationVision / VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …

Python 3,902 295 Updated Jul 16, 2024

zh460045050 / V2L-Tokenizer

Python 102 7 Updated Jun 21, 2024

FutureXiang / edm2

Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models"

Python 23 Updated Mar 5, 2024

dvlab-research / MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,111 277 Updated May 4, 2024

OpenGVLab / InternVideo2

186 1 Updated Apr 15, 2024

willisma / SiT

Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"

Python 565 25 Updated Mar 12, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 21,057 2,002 Updated Aug 4, 2024

sail-sg / MDT

Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)

Python 490 35 Updated Apr 23, 2024

state-spaces / mamba

Mamba SSM architecture

Python 12,017 1,008 Updated Aug 3, 2024

OpenGVLab / VideoMamba

[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding

Python 726 57 Updated Jul 6, 2024

lllyasviel / LayerDiffuse

Transparent Image Layer Diffusion using Latent Transparency

1,940 24 Updated Jun 16, 2024

LargeWorldModel / LWM

Python 7,041 545 Updated Jul 25, 2024

VITA-Group / LiGO

[ICLR 2023] "Learning to Grow Pretrained Models for Efficient Transformer Training" by Peihao Wang, Rameswar Panda, Lucas Torroba Hennigen, Philip Greengard, Leonid Karlinsky, Rogerio Feris, David …

Python 80 8 Updated Feb 26, 2024

OscarXZQ / weight-selection

Python 161 11 Updated Jan 16, 2024

hkproj / mamba-notes

Notes on the Mamba and the S4 model (Mamba: Linear-Time Sequence Modeling with Selective State Spaces)

135 9 Updated Jan 7, 2024

Vchitect / Vlogger

[CVPR2024] Make Your Dream A Vlog

Python 394 40 Updated Mar 19, 2024

HarborYuan / ovsam

[ECCV 2024] The official code of paper "Open-Vocabulary SAM".

Python 861 27 Updated Jul 31, 2024

allenai / unified-io-2

Python 547 25 Updated Feb 15, 2024

PRIS-CV / DemoFusion

Let us democratise high-resolution generation! (CVPR 2024)

Jupyter Notebook 1,948 228 Updated Apr 15, 2024

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Python 24,403 5,039 Updated Aug 4, 2024

LTH14 / rcg

PyTorch implementation of RCG https://arxiv.org/abs/2312.03701

Python 775 35 Updated Mar 25, 2024
