miscalculation of scores #321

Closed

alaineiturria opened this issue Jul 4, 2018 · 1 comment

Comments

@alaineiturria
When calculating the size of the anomaly windows, is the number of anomalies per data file taken from the combined_labels.json file or from the combined_windows.json file?

I have observed that the scores in the results/null/null_standard_scores.csv file are obtained using the combined_windows.json file; however, it seems to me that the anomalies listed in the combined_labels.json file are used to calculate the rest of the scores.

@subutai
Member

subutai commented Jul 5, 2018

The window size calculation is described in the paper: "we define anomaly window length to be 10% the length of a data file, divided by the number of anomalies in the given file."
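
Roughly, that rule looks like this (a minimal sketch, not NAB's actual code; the function name and parameters are just illustrative):

```python
# Illustrative sketch of the rule quoted above: the window length is 10% of
# the data file's length, divided by the number of anomalies labeled in it.
def anomaly_window_length(num_rows, num_anomalies, window_fraction=0.10):
    if num_anomalies == 0:
        return 0
    return int(window_fraction * num_rows / num_anomalies)

# e.g. a 4,000-row file with 2 labeled anomalies gets 200-row windows
print(anomaly_window_length(4000, 2))  # 200
```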

For each data file, the number of anomalies and their centers are given in the combined_labels.json file. The resulting windows are then put in combined_windows.json. This whole process is run by the combine_labels.py script.
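
As a rough sketch of that labels-to-windows step (this mirrors the idea only, it is not combine_labels.py itself; the file layout, the `timestamp` column name, and the helper names are assumptions):

```python
import json
import pandas as pd

def build_windows(labels_path, data_dir, out_path, window_fraction=0.10):
    """Write a combined_windows-style JSON mapping each data file to its
    [start, end] anomaly windows. Illustrative sketch only."""
    with open(labels_path) as f:
        labels = json.load(f)  # {relative_path: [anomaly_center_timestamp, ...]}

    windows = {}
    for rel_path, centers in labels.items():
        df = pd.read_csv(f"{data_dir}/{rel_path}", parse_dates=["timestamp"])
        if not centers:
            windows[rel_path] = []
            continue
        # Window length in rows: 10% of the file, divided by the anomaly count.
        length = int(window_fraction * len(df) / len(centers))
        half = length // 2
        file_windows = []
        for center in centers:
            # Find the row closest to the labeled anomaly center and take
            # `half` rows on either side, clipped to the file boundaries.
            idx = (df["timestamp"] - pd.Timestamp(center)).abs().idxmin()
            start = df["timestamp"].iloc[max(idx - half, 0)]
            end = df["timestamp"].iloc[min(idx + half, len(df) - 1)]
            file_windows.append([str(start), str(end)])
        windows[rel_path] = file_windows

    with open(out_path, "w") as f:
        json.dump(windows, f, indent=2)
```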

Some more details are in the NAB whitepaper.

For questions like these it might be better to ask on the HTM NAB Forum unless you see a specific bug we need to address.

Hope this helps!

subutai closed this as completed Jul 5, 2018