- Hangzhou, China
Stars
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
The PyTorch implementation for "DEAL: Disentangle and Localize Concept-level Explanations for VLMs" (ECCV 2024)
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
SEED-Story: Multimodal Long Story Generation with Large Language Model
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral)
[ECCV 2024] ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction
This is a Phi-3 book for getting started with Phi-3. Phi-3 is a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) avai…
PartCraft: Crafting Creative Objects by Parts (ECCV2024)
[ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google
Official code for 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
Open-MAGVIT2: Democratizing Autoregressive Visual Generation
Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral)
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
EVA Series: Visual Representation Fantasies from BAAI
Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step
[ECCV2024] This is the official inference code for the papers "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Mu…
NSGA2, NSGA3, R-NSGA3, MOEAD, Genetic Algorithms (GA), Differential Evolution (DE), CMAES, PSO