- Scale-Invariant Feature Transform (SIFT)
- Speeded Up Robust Features (SURF)
- Histogram of Oriented Gradients (HOG)
- Support Vector Machine (SVM)
- Random Forest (RF)
- Adaboost
- K-Nearest Neighbors (KNN)
- K-Means Clustering
While these techniques might be relevant for learning purposes, they are not used in the current state-of-the-art methods for image understanding. The following techniques are more relevant for the current state-of-the-art methods.
- Convolutional Neural Networks (CNN)
- ResNet
- EfficientNet
- Region-based CNN (R-CNN)
- Fast R-CNN
- Faster R-CNN
- Mask R-CNN
- You Only Look Once (YOLO)
- YOLOv1, ..., YOLOv8
- Transformer-based
- Vision Transformer (ViT)
- Data-efficient Vision Transformer (DeiT)
- End-to-End Object Detection with Transformers (DETR)
- Swin Transformer: Hierarchical Vision Transformer
- Fully Transformer-based Object Detector (ViDT)
- Point Cloud Transformer (PCT)
- Vision Permutator: MLP-Like Architecture
- GAN
- Diffusion
- Tracking
- Key Points
- CoTracker