Stars
[NeurIPS2023] DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models
Run SOTA Vision-Language Model Florence-2 on your data!
Code for the paper Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models @ CVPR 2024
Code to scrape CVPR website for list of accepted papers, find their arXiv links, extract metadata, and download pdfs
Hugging Face Plugins for FiftyOne
Code for the CVPR 2024 paper: DiffMOT: A Real-time Diffusion-based Multiple Object Tracker with Non-linear Prediction
FiftyOne Plugin for Stable Diffusion Data Augmentation
Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral)
🤩 An AWESOME Curated List of Papers, Workshops, Datasets, and Challenges from CVPR 2024
A repository for the FiftyOne Plugin Outlier Detection
Official code for "FeatUp: A Model-Agnostic Framework for Features at Any Resolution" ICLR 2024
Track model training experiments with MLflow and FiftyOne!
A repo that demos an MLflow and FiftyOne integration
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Testbed for multimodal retrieval augmented generation techniques with FiftyOne, LlamaIndex, and Milvus
This is an Audio Loader Plugin for FiftyOne.
Caption images across your datasets with state-of-the-art models from Hugging Face and Replicate!
PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.
[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
Convert datasets from Hugging Face to FiftyOne for Visualization
Albumentations Data Augmentation Plugin for FiftyOne!
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
jacobmarks / CompressionVAE
Forked from maxfrenzel/CompressionVAE
General-purpose dimensionality reduction and manifold learning tool based on Variational Autoencoder, implemented in TensorFlow.
OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]
Easily train or fine-tune SOTA computer vision models with one open source training library. The home of YOLO-NAS.