Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 959 70 Updated Jul 30, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 7,860 420 Updated Aug 2, 2024

The PyTorch implementation for "DEAL: Disentangle and Localize Concept-level Explanations for VLMs" (ECCV 2024)

Jupyter Notebook 4 1 Updated Jul 5, 2024

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 351 15 Updated Jul 31, 2024

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 2,080 142 Updated Jul 12, 2024

Point cloud diffusion for 3D model synthesis

Python 6,435 747 Updated Jul 4, 2024

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Python 570 29 Updated Jul 15, 2024

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,619 101 Updated Jul 29, 2024

Multimodal Models in Real World

Jupyter Notebook 347 17 Updated Jul 12, 2024

SEED-Story: Multimodal Long Story Generation with Large Language Model

Python 626 47 Updated Jul 29, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,621 99 Updated Jul 26, 2024

ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral)

Python 495 28 Updated Jan 8, 2024

[ECCV 2024] ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction

Python 24 3 Updated Jul 25, 2024

This is a Phi-3 book for getting started with Phi-3. Phi-3 is a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) avai…

Jupyter Notebook 1,450 137 Updated Aug 2, 2024
Python 199 14 Updated Apr 10, 2024

PartCraft: Crafting Creative Objects by Parts (ECCV2024)

Python 69 1 Updated Jul 13, 2024

[ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google

12 Updated Jul 24, 2024
Python 368 21 Updated Jul 10, 2024

Official code for 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching

Python 115 3 Updated Apr 30, 2024

Open-MAGVIT2: Democratizing Autoregressive Visual Generation

Python 351 10 Updated Jul 10, 2024

Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral)

Jupyter Notebook 90 11 Updated Apr 8, 2024
Python 1,421 79 Updated Jul 29, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,104 39 Updated Jul 14, 2024

EVA Series: Visual Representation Fantasies from BAAI

Python 2,150 157 Updated Aug 1, 2024

Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step

Python 120 2 Updated Jul 10, 2024

[ECCV2024] This is the official inference code for the papers "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Mu…

Jupyter Notebook 455 19 Updated Jul 13, 2024

Your image is almost there!

Python 7,017 411 Updated Jul 26, 2024

NSGA2, NSGA3, R-NSGA3, MOEAD, Genetic Algorithms (GA), Differential Evolution (DE), CMAES, PSO

Python 2,150 378 Updated Aug 1, 2024