computer vision
Javascript/WebGL lightweight face tracking library designed for augmented reality webcam filters. Features : multiple faces detection, rotation, mouth opening. Various integration examples are provโฆ
OpenMMLab Pose Estimation Toolbox and Benchmark.
๐๐ค๐ AI web app and API to analyze basketball shots and shooting pose.
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training andโฆ
Obsidian OCR allows you to search for text in your images and pdfs
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
[ICCV'23 Workshop] SAM3D: Segment Anything in 3D Scenes
FaceAPI: AI-powered Face Detection & Rotation Tracking, Face Description & Recognition, Age & Gender & Emotion Prediction for Browser and NodeJS using TensorFlow/JS
The open-source tool for building high-quality datasets and computer vision models
We write your reusable computer vision tools. ๐
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
Effortless data labeling with AI support from Segment Anything and other awesome models.
Segment-Anything + 3D. Let's lift anything to 3D.
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Inpaint anything using Segment Anything and inpainting models.
Simple static web-based mask drawer, supporting semantic segmentation and video segmentation with interactive Segment Anything Model 2 (SAM2).
Magic Copy is a Chrome extension that uses Meta's Segment Anything Model to extract a foreground object from an image and copy it to the clipboard.
A latent text-to-image diffusion model
Segment Anything for Stable Diffusion WebUI