-
UCAS
- Shanghai
-
07:12
(UTC +08:00) - https://andy1621.github.io/
- @likunchang1998
Block or Report
Block or report Andy1621
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Pandora: Towards General World Model with Natural Language Actions and Video States
MambaOut: Do We Really Need Mamba for Vision?
A flexible and efficient codebase for training visually-conditioned language models (VLMs)
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …
Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models"
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
Open-Sora: Democratizing Efficient Video Production for All
Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)
[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding
Transparent Image Layer Diffusion using Latent Transparency
[ICLR 2023] "Learning to Grow Pretrained Models for Efficient Transformer Training" by Peihao Wang, Rameswar Panda, Lucas Torroba Hennigen, Philip Greengard, Leonid Karlinsky, Rogerio Feris, David …
Notes on the Mamba and the S4 model (Mamba: Linear-Time Sequence Modeling with Selective State Spaces)
[ECCV 2024] The official code of paper "Open-Vocabulary SAM".
Let us democratise high-resolution generation! (CVPR 2024)
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
PyTorch implementation of RCG https://arxiv.org/abs/2312.03701