Because detection-models (Yolo, Mask-RCNN, etc) are often developed by the different frameworks and evaluated the performance on different hardware, it's hard to evaluate the performance (i.e, fps) and fine-tune these on your customized datasets. This project builds all detection-models with pytorch and provides the general API for training, fine-tuning and detection in supervised and unsupervised settings.
- Pytorch-1.0
- python-3.6
- CUDA-9.0
- 2 TiTAN XP with 12 GPU Memory
- Installation
- Supporting Models
- Loss visualization and analysis
- Fine-tuning
- Usage
- Mask-RCNN (in progress)
- RetinaNet (in progress)
- Yolov3
- M2Det (in progress)
- References
$ git clone https://github.com/jacksonly/Detection-Fine-tuning-API.git
$ cd Detection-Fine-tuning-API/
$ sudo pip3 install -r requirements.txt
$ cd weights/
$ bash ./yolov3/weights/download_weights.sh
Pretrained pytorch models (.pt):
- Two-stage Detection:
- Mask RCNN (CVPR'17) inspired by wkentaro
- One-stage Detection:
- RetinaNet (ICCV'17) inspired by yhenon
- Yolov3 (arXiv'18) inspired by ultralytics and TencentYoutuResearch
- M2Det (AAAI'19, the latest SSD) inspired by qijiezhao