ClearML experiment tracking integration #8620

thepycoder · 2022-07-18T12:28:36Z

This PR adds integration with the open-source experiment tracker ClearML. Installing the package with pip install clearml enables the integration and lets users track every training run in ClearML. Users can then keep track of different experiments, compare them to see the differences, and even run an experiment remotely (using the instructions in the ClearML README).

Features

Experiment Tracking and Comparison

After installing and initializing ClearML (with the clearml-init CLI command), one can run train.py with any desired configuration and the ClearML integration will be enabled automatically.

What will be captured:

Source code + uncommitted changes
Installed packages
(Hyper)parameters
Model files (use --save-period n to save a checkpoint every n epochs)
Console output
Scalars (mAP_0.5, mAP_0.5:0.95, precision, recall, losses, learning rates, ...)
General info such as machine details, runtime, creation date etc.
All produced plots such as label correlogram and confusion matrix
Images with bounding boxes per epoch
Mosaic per epoch
Validation images per epoch
...

Each of these metrics can easily be compared between multiple experiments using the ClearML web console.
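
As a rough illustration of how per-epoch scalars could reach ClearML, the sketch below splits metric keys of the form title/series and forwards each value to a logger. The key format (for example metrics/mAP_0.5) and the helper names split_metric_key, report_metrics and PrintLogger are assumptions made for this example and are not taken from the PR. The only ClearML call it relies on is Logger.report_scalar(title, series, value, iteration).

```python
from typing import Dict, Tuple


def split_metric_key(key: str) -> Tuple[str, str]:
    """Split 'title/series' into its two parts; keys without '/' use the key for both."""
    title, sep, series = key.partition("/")
    return (title, series) if sep else (key, key)


def report_metrics(logger, metrics: Dict[str, float], epoch: int) -> int:
    """Send every metric to logger.report_scalar and return how many were sent.

    `logger` is any object exposing report_scalar(title, series, value, iteration),
    for example the logger obtained from a ClearML task via task.get_logger().
    """
    for key, value in metrics.items():
        title, series = split_metric_key(key)
        logger.report_scalar(title=title, series=series, value=float(value), iteration=epoch)
    return len(metrics)


class PrintLogger:
    """Hypothetical stand-in logger so the sketch runs without a ClearML server."""

    def __init__(self):
        self.calls = []

    def report_scalar(self, title, series, value, iteration):
        # Record the call and echo it, mimicking what a real logger would receive.
        self.calls.append((title, series, value, iteration))
        print(f"[{iteration}] {title}/{series} = {value}")


if __name__ == "__main__":
    demo = PrintLogger()
    report_metrics(demo, {"metrics/mAP_0.5": 0.61, "train/box_loss": 0.043}, epoch=1)
```

With ClearML installed and configured, the logger returned by Task.init(project_name=..., task_name=...).get_logger() could presumably be passed in place of PrintLogger. The integration in this PR does this automatically, so the sketch is only meant to show the shape of the data.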

Versioned Dataset Support

Users can pass a ClearML dataset version to the YOLOv5 command-line interface for training. The dataset is downloaded, or taken from the cache, and then used for training.
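
The download-or-cache behaviour can be sketched roughly as follows. This assumes the dataset is referenced with a string of the form clearml://<dataset_id>. The helper names parse_clearml_dataset_id and resolve_dataset_path are hypothetical and not the PR's actual functions. The ClearML calls used are Dataset.get(dataset_id=...) and get_local_copy().

```python
CLEARML_PREFIX = "clearml://"  # assumed prefix for ClearML dataset references


def parse_clearml_dataset_id(data_arg: str):
    """Return the dataset ID if data_arg looks like 'clearml://<id>', else None."""
    if not data_arg.startswith(CLEARML_PREFIX):
        return None
    dataset_id = data_arg[len(CLEARML_PREFIX):].strip()
    if not dataset_id:
        raise ValueError("expected a dataset ID after 'clearml://'")
    return dataset_id


def resolve_dataset_path(data_arg: str) -> str:
    """Return a local path for data_arg, fetching from ClearML when needed."""
    dataset_id = parse_clearml_dataset_id(data_arg)
    if dataset_id is None:
        return data_arg  # ordinary local path or YAML file, nothing to download
    # Imported lazily so plain local paths work without clearml installed.
    from clearml import Dataset

    # get_local_copy() downloads the dataset once and reuses the cached copy afterwards.
    return Dataset.get(dataset_id=dataset_id).get_local_copy()
```

Keeping the parsing separate from the download makes the string handling easy to check without a ClearML server.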

Hyperparameter Optimization

A standalone script is provided that lets users run hyperparameter optimization (HPO) on YOLOv5 either locally or in the cloud.
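
A minimal sketch of what such an HPO setup might look like with ClearML's automation module is shown below. It is not the script shipped in this PR. The parameter names (Hyperparameters/lr0 and so on), their ranges, the objective metric title and series, and the helper build_optimizer are all assumptions chosen for illustration. RandomSearch is used here only to keep the example dependency-free.

```python
# Hypothetical search space: parameter name -> (low, high) for uniform sampling.
SEARCH_SPACE = {
    "Hyperparameters/lr0": (1e-5, 1e-1),
    "Hyperparameters/momentum": (0.6, 0.98),
    "Hyperparameters/weight_decay": (0.0, 0.001),
}


def build_optimizer(base_task_id: str, search_space=SEARCH_SPACE, total_jobs: int = 10):
    """Create a ClearML HyperParameterOptimizer that clones and tunes base_task_id."""
    # Imported lazily so the module can be inspected without clearml installed.
    from clearml.automation import HyperParameterOptimizer, RandomSearch, UniformParameterRange

    return HyperParameterOptimizer(
        base_task_id=base_task_id,  # ID of an existing training task used as the template
        hyper_parameters=[
            UniformParameterRange(name, min_value=low, max_value=high)
            for name, (low, high) in search_space.items()
        ],
        objective_metric_title="metrics",  # assumed scalar title
        objective_metric_series="mAP_0.5",  # assumed scalar series
        objective_metric_sign="max",  # a higher mAP is better
        optimizer_class=RandomSearch,
        max_number_of_concurrent_tasks=1,
        total_max_jobs=total_jobs,
    )
```

Given a task ID from a previous training run, one would then presumably call start_locally() on the returned optimizer to run trials on the current machine, or start() to enqueue them for remote agents, followed by wait() and stop().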

🛠️ PR Summary

Made with ❤️ by Ultralytics Actions

🌟 Summary

Enhanced YOLOv5 with ClearML integration for advanced ML experiment tracking and management.

📊 Key Changes

Added recommendation for using Weights & Biases Logging.
Integrated ClearML tool for automatic experiment tracking and dataset versioning.
Modified README to introduce ClearML integration with its features and links.
Updated requirements to suggest the installation of ClearML.
Tweaked train.py to support data logging for ClearML.
Enhanced Jupyter notebook tutorial to include ClearML setup and usage guide.
Updated general.py to recognize 'clearml://' dataset IDs for dataset versioning.
Introduced ClearML logger in utils/loggers/__init__.py.
Provided a detailed README for ClearML integration at utils/loggers/clearml/README.md.
Crafted a separate ClearML utility module at utils/loggers/clearml/clearml_utils.py.
Included a standalone script for hyperparameter optimization (HPO) using ClearML at utils/loggers/clearml/hpo.py.
Adjusted wandb_utils.py to harmonize with ClearML dataset structures.
Modified metrics.py and plots.py to add plot titles for clarity.

🎯 Purpose & Impact

🚀 Purpose: Incorporate ClearML for better experiment tracking, dataset versioning, and model management, offering an alternative to Weights & Biases.
🎛 Impact:
- Users gain the ability to trace training runs comprehensively with live updates and detailed metrics.
- Offers additional capabilities like uncommitted code tracking and reproducibility across machines.
- Expands dataset management by allowing users to version their datasets using ClearML dataset IDs.
- Enhances the existing benchmarking system with an exhaustive hyperparameter optimization script.
- Facilitates easier overview and analysis of experiments through well-organized dashboards and logging systems within the ClearML environment.


github-actions

👋 Hello @thepycoder, thank you for submitting a YOLOv5 🚀 PR! To allow your work to be integrated as seamlessly as possible, we advise you to:

✅ Verify your PR is up-to-date with upstream/master. If your PR is behind upstream/master an automatic GitHub Actions merge may be attempted by writing /rebase in a new comment, or by running the following code, replacing 'feature' with the name of your local branch:

git remote add upstream https://github.com/ultralytics/yolov5.git
git fetch upstream
# git checkout feature  # <--- replace 'feature' with local branch name
git merge upstream/master
git push -u origin -f

✅ Verify all Continuous Integration (CI) checks are passing.
✅ Reduce changes to the absolute minimum required for your bug fix or feature addition. "It is not daily increase but daily decrease, hack away the unessential. The closer to the source, the less wastage there is." -Bruce Lee


glenn-jocher · 2022-08-05T18:38:51Z

@thepycoder I've reviewed and updated the PR as best I could. I'll merge now, but please review and verify I haven't broken any functionality, and please work on updates to streamline ClearML ops as mentioned over email so we can improve the user experience for ClearML logging.

FYI inserted links for improved analytics for us:

glenn-jocher · 2022-08-05T18:50:21Z

@thepycoder I've removed the plot titles from labels.png and labels_correlogram.png as they were applied on the last subplot only. All other plot titles are nice additions, thanks!

glenn-jocher · 2022-08-05T18:51:10Z

@thepycoder PR is merged. Thank you for your contributions to YOLOv5 🚀 and Vision AI ⭐

glenn-jocher · 2022-08-05T23:47:13Z

@thepycoder I noticed the light/dark mode graphics look good in GitHub (very clever), but unfortunately do not extend well to other platforms with automatic README views that we use like Docker Hub and Paperspace Gradient (see links below). Can you please provide a single graphic for use here? Maybe the light mode graphic with an opaque white inside the circle would work.

* Add titles to matplotlib plots
* Add ClearML Experiment Tracking integration.
* Add ClearML Data Version Management automatic download when requested
* Add ClearML Hyperparameter Optimization
* ClearML save period integration
* Fix wandb breaking when used with ClearML dataset
* Fix wandb breaking when used with ClearML resume and dataset
* Add ClearML documentation
* fixed small bug in clearml integration that misreports epoch number
* Final ClearMl additions before refactor
* Add correct epoch reporting
* Add remote execution and autoscaling docs for ClearML integration
* Added images to clearml integration docs
* fixed logo alignment bug and added hpo screenshot clearml
* Fixed small epoch number bug in clearml integration
* Remove saved model flush clearml
* Cleanup clearml readme section
* Cleaned up clearml logger docstring
* Remove resume readme section clearml
* Clearml integration cleanup
* Updated ClearML documentation
* Added dark vs light icons ClearML Readme
* Clearml Readme styling
* Add better gifs
* Fixed gif file size
* Add better images in tutorial notebook
* [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci
* Addressed comments in PR ultralytics#8620
* Fixed circular import
* Fixed circular import
* Update tutorial.ipynb
* Update tutorial.ipynb
* Inline comment
* Restructured tutorial notebook
* Add correct ClearML link to README
* Update tutorial.ipynb
* Update general.py
* Update __init__.py
* Update __init__.py
* [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci
* Update __init__.py
* [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci
* Update __init__.py
* Update README.md
* Update __init__.py
* [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci
* spelling
* Update tutorial.ipynb
* notebook cutt.ly links
* Update README.md
* Update README.md
* cutt.ly links in tutorial
* Removed labels as they show up on last subplot only

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Glenn Jocher <[email protected]>

Robotatron · 2023-01-08T03:22:30Z

Will ClearML be supported for segmentation as well? If so, any ETA on this? @thepycoder @glenn-jocher

thepycoder · 2023-01-11T16:10:35Z

Hey @Robotatron! What features would you like to see specifically? I can def take a look :)

Robotatron · 2023-01-11T16:21:26Z

> Hey @Robotatron! What features would you like to see specifically? I can def take a look :)

Hey @thepycoder, thanks for showing interest. The features would be all the common stuff you can think of (probably the same stuff you log with object detection for ClearML already), in order of importance:

Current YOLOv5 segmentation TensorBoard scalar metrics that can be visualized in a graph with ClearML, so losses and segmentation mAPs (AP50:95, AP50, etc.). These are the ones YOLOv5 logs into the "results.csv" file
opt.yaml, the config of the run
If possible, per class mAP metrics (we get those printed in the console when running inference, but not during training sadly)
Inference images on the test set (the one we get with YOLO automatically in the experiment folder)
Train batch images (the one we get with YOLO automatically in the experiment folder)
(this will be probably too much, but prediction masks on the test set in the COCO format)

thepycoder · 2023-01-12T13:39:21Z

@Robotatron , please check out #10752 and let me know if it works for you! :)

Robotatron · 2023-01-13T17:39:27Z

@thepycoder Everything works great, thank you!
Would you know if I have to configure ClearML or edit your logger to also log the best model under "artefacts"? I am new to ClearML so this could be a stupid question :)

thepycoder · 2023-01-18T13:28:42Z

@Robotatron The best model should always be logged under the artifacts tab. If not, there's a bug somewhere.
To get the latest model too, you'll have to set a save interval using the YOLO arguments themselves :)

thepycoder and others added 27 commits July 18, 2022 13:27

Add titles to matplotlib plots

ec7986a

Add ClearML Experiment Tracking integration.

11da722

Add ClearML Data Version Management automatic download when requested

d395a28

Add ClearML Hyperparameter Optimization

a160dfc

ClearML save period integration

16a1a48

Fix wandb breaking when used with ClearML dataset

8e957c9

Fix wandb breaking when used with ClearML resume and dataset

027ca12

Add ClearML documentation

a5ae4bb

fixed small bug in clearml integration that misreports epoch number

29a2686

Final ClearMl additions before refactor

be45d1b

Add correct epoch reporting

bd20628

Add remote execution and autoscaling docs for ClearML integration

c69d56f

Added images to clearml integration docs

358354d

fixed logo alignment bug and added hpo screenshot clearml

fd0b10d

Fixed small epoch number bug in clearml integration

1cbe74b

Remove saved model flush clearml

51f051d

Cleanup clearml readme section

2557ace

Cleaned up clearml logger docstring

3c9403b

Remove resume readme section clearml

85ac912

Clearml integration cleanup

a9bb3be

Updated ClearML documentation

a29eb1c

Added dark vs light icons ClearML Readme

806c22a

Clearml Readme styling

194cf62

Add better gifs

421eb19

Fixed gif file size

36ce901

Add better images in tutorial notebook

aa36080

[pre-commit.ci] auto fixes from pre-commit.com hooks

ba99667

for more information, see https://pre-commit.ci

github-actions bot reviewed Jul 18, 2022


glenn-jocher assigned thepycoder Jul 18, 2022

glenn-jocher added the TODO label Jul 18, 2022

pre-commit-ci bot and others added 13 commits August 5, 2022 16:44

[pre-commit.ci] auto fixes from pre-commit.com hooks

9c0eab3

for more information, see https://pre-commit.ci

Update __init__.py

fd0cff6

[pre-commit.ci] auto fixes from pre-commit.com hooks

c97a2ad

for more information, see https://pre-commit.ci

Update __init__.py

8bbf04e

Update README.md

0930d59

Update __init__.py

68f9a9c

[pre-commit.ci] auto fixes from pre-commit.com hooks

b618614

for more information, see https://pre-commit.ci

spelling

918d861

Update tutorial.ipynb

092fe34

notebook cutt.ly links

e6eca5f

Update README.md

b45cac4

Update README.md

40d3b95

cutt.ly links in tutorial

5ed6620

Removed labels as they show up on last subplot only

00eda91

glenn-jocher merged commit 378bde4 into ultralytics:master Aug 5, 2022

glenn-jocher removed the TODO label Aug 5, 2022

Hojland mentioned this pull request Oct 17, 2022

feat/bump Go-Autonomous/yolov5#15

Merged

Robotatron mentioned this pull request Jan 11, 2023

Integration of v8 segmentation ultralytics/ultralytics#107

Merged

thepycoder mentioned this pull request Jan 12, 2023

Add segmentation and classification support for ClearML #10752

Merged

6 tasks
