Labeling
Easily compute CLIP embeddings and build a CLIP retrieval system with them
Next-generation video instance recognition framework on top of Detectron2 which supports InstMove (CVPR 2023), SeqFormer (ECCV Oral), and IDOL (ECCV Oral)
FaRL for Facial Representation Learning [Official, CVPR 2022]
Face Parsing from RGB and Depth Using Cross-Domain Mutual Learning (CVPRW 2021) - IEEE AMFG Acceptance Rate ≈ 27%
Image to prompt with BLIP and CLIP
FaceXlib aims at providing ready-to-use face-related functions based on current SOTA open-source methods.
CLIP+MLP Aesthetic Score Predictor
Dataset of prompts, synthetic AI-generated images, and aesthetic ratings.
A linear estimator on top of CLIP to predict the aesthetic quality of pictures
Easily compute CLIP embeddings from video frames
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Using pretrained encoder and language models to generate captions from multimedia inputs.
Very customizable imageboard/booru downloader with powerful filenaming features.