Accelerating the development of large multimodal models (LMMs) with lmms-eval
Depth Anything V2: A More Capable Foundation Model for Monocular Depth Estimation
Generate interleaved text and image content in a structured format you can directly pass to downstream APIs.
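As a rough illustration of what "interleaved text and image content in a structured format" might look like, here is a minimal Python sketch. The class and field names (`TextPart`, `ImagePart`, `to_api_payload`) are hypothetical, not taken from any of the listed repositories; the serialized shape mirrors the content-parts convention used by several multimodal chat APIs.

```python
from dataclasses import dataclass
from typing import List, Union

@dataclass
class TextPart:
    text: str

@dataclass
class ImagePart:
    url: str  # hypothetical: an image reference; could equally be base64 data

@dataclass
class InterleavedMessage:
    # Ordered mix of text and image segments, preserving interleaving.
    parts: List[Union[TextPart, ImagePart]]

    def to_api_payload(self) -> list:
        """Serialize to a generic list-of-dicts format resembling the
        content-parts structure many multimodal APIs accept."""
        payload = []
        for p in self.parts:
            if isinstance(p, TextPart):
                payload.append({"type": "text", "text": p.text})
            else:
                payload.append({"type": "image_url",
                                "image_url": {"url": p.url}})
        return payload

msg = InterleavedMessage(parts=[
    TextPart("Here is the chart:"),
    ImagePart("https://example.com/chart.png"),
    TextPart("Note the upward trend."),
])
print(msg.to_api_payload())
```

Because the interleaving order is preserved in `parts`, the payload can be passed to a downstream API without re-assembling text around images.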
MINT-1T: A one trillion token multimodal interleaved dataset.
[ECCV 2024] UMBRAE: Unified Multimodal Brain Decoding | Unveiling the 'Dark Side' of Brain Modality
Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
This repo contains the code for our paper An Image is Worth 32 Tokens for Reconstruction and Generation
Public code repo for paper "A Single Transformer for Scalable Vision-Language Modeling"
Code for Fast Training of Diffusion Models with Masked Transformers
Code for the paper DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents, ICML 2024
Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models
Anole: An Open, Autoregressive, and Native Multimodal Model for Interleaved Image-Text Generation
The official PyTorch implementation of Google's Gemma models
The open-source tool for building high-quality datasets and computer vision models
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis"
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
🔥🔥🔥 Latest papers, code, and datasets on Vid-LLMs.
Latte: Latent Diffusion Transformer for Video Generation.
Official implementation of FIFO-Diffusion: Generating Infinite Videos from Text without Training
A massively parallel, high-level programming language
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
PyTorch implementation of "Brain Decodes Deep Nets"