Stars
Agentic components of the Llama Stack APIs
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Generative Models by Stability AI
Open Multiple View Geometry library. Basis for 3D computer vision and Structure from Motion.
An open-source framework for training large multimodal models.
Collection of Summer 2025 tech internships!
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
Scaled-YOLOv4: Scaling Cross Stage Partial Network
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
PyTorch implementation of convolutional neural network visualization techniques
📊 Benchmark multiple object trackers (MOT) in Python
A paper list of object detection using deep learning.
A Simple and Versatile Framework for Object Detection and Instance Recognition
AlignedReID++: Dynamically Matching Local Information for Person Re-Identification.
Simple Online Realtime Tracking with a Deep Association Metric
Paper list and source code for multi-object tracking
Collection of papers, datasets, code and other resources for object tracking and detection using deep learning
Interactive Revit RFA and RVT project database exploration tool to view and navigate BIM element parameters, properties and relationships.