Stars
[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"
Train ImageNet *fast* in 500 lines of code with FFCV
FFCV: Fast Forward Computer Vision (and other ML workloads!)
Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
A high-throughput and memory-efficient inference and serving engine for LLMs
ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral)
When do we not need larger vision models?
Paper reproduction of Google's SCoRE (Training Language Models to Self-Correct via Reinforcement Learning)
😸 Soothing pastel theme for the high-spirited!
[ECCV 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
GPU Accelerated t-SNE for CUDA with Python bindings
This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3?"
A fork of https://jonbarron.info/ for use in Jekyll builds with Markdown page updates
A terminal workspace with batteries included
[ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval
Official PyTorch implementation of "Improving Intervention Efficacy via Concept Realignment in Concept Bottleneck Models" (ECCV 2024)
[ECCV 2024] Official repository for "DataDream: Few-shot Guided Dataset Generation"
[ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"
ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization
Making large AI models cheaper, faster and more accessible
DataComp: In search of the next generation of multimodal datasets
Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)
[ICCV 2023] Code for "Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement"