test_model should give the user ability to define date and time of testing interval #34

MichaelClifford · 2019-07-25T12:55:43Z

test_model.py should include options that allow a user to dictate the exact start and end time of their testing window.

Current implementation takes rolling data window size in days and the current time automatically. This is an OK default, but we should also give the user the ability to specify specific time periods where they know an anomaly has occurred for testing purposes.

The text was updated successfully, but these errors were encountered:

MichaelClifford · 2019-07-25T15:01:49Z

To address @4n4nd's request in #35, below are the specific inputs we should add in our .env to achieve a user defined time range.

FLT_TEST_START_TIME = " "
FLT_TEST_END_TIME = " "

These two values will dictate the start_time and end_time for the data downloaded when running test_model.py :

prometheus-anomaly-detector/test_model.py

Lines 70 to 94 in 4f94267

 start_time=metric_start_time, 

 end_time=rolling_data_window, 

 chunk_size=None, 

 ) 

 ) 

 # If the training data downloaded is empty 

 if not train_data: 

 _LOGGER.error("No Metric data received, please check the data window size") 

 raise ValueError 

 # If more than one time-series match the given metric, raise an error 

 if len(train_data) > 1: 

 _LOGGER.error("Multiple timeseries matching %s were found") 

 _LOGGER.error("The timeseries matched were: ") 

 for timeseries in train_data: 

 print(timeseries.metric_name, timeseries.label_config) 

 _LOGGER.error("One metric should be specific to a single time-series") 

 raise ValueError 

 # Download test data 

 test_data_list = pc.get_metric_range_data( 

 metric_name=metric, 

 start_time=rolling_data_window, 

 chunk_size=str(Configuration.retraining_interval_minutes) + "m",

4n4nd · 2019-07-25T15:18:14Z

Does it test on all this test data once? or do we specify a training interval?

MichaelClifford · 2019-07-25T15:25:08Z

So I think we still specify the training interval. It should be the same value as FLT_ROLLING_DATA_WINDOW. maybe this could be renamed for clarity? maybe FLT_ROLLING_TRAINING_WINDOW?

4n4nd · 2019-07-25T16:18:07Z

okay FLT_ROLLING_TRAINING_WINDOW sounds good.

FLT_TEST_START_TIME and FLT_TEST_END_TIME specify the total test data
and FLT_RETRAINING_INTERVAL is the interval for training?

and maybe FLT_TRAIN_START_TIME and FLT_TRAIN_END_TIME as well?

MichaelClifford · 2019-07-25T16:48:16Z

Yes, FLT_TEST_START_TIME and FLT_TEST_END_TIME should specify the total data used by model_test.py.

FLT_RETRAINING_INTERVAL_MINUTES should specify the prediction range for the test (how far into the future we will forecast after each retraining) , as it represents how frequently we will retrain the model on the FLT_ROLLING_TRAINING_WINDOW timeframe and then make our forecast up to the next retraining.

I don't think we need FLT_TRAIN_START_TIME and FLT_TRAIN_END_TIME.

4n4nd · 2019-07-25T18:12:50Z

Okay I will add,

FLT_DATA_START_TIME: Data start time
FLT_DATA_END_TIME: Data End time
FLT_ROLLING_TRAINING_WINDOW_SIZE: Training data window size
FLT_RETRAINING_INTERVAL_MINUTES: retraining interval/ forecasting duration

Are these var names good?

goern · 2020-03-09T20:47:53Z

heya, what needs to be done to close this?

sesheta · 2021-07-01T16:30:46Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

/lifecycle stale

sesheta · 2021-10-12T11:36:15Z

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

/lifecycle rotten

sesheta · 2021-11-11T12:09:31Z

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

/close

sesheta · 2021-11-11T12:09:42Z

@sesheta: Closing this issue.

In response to this:

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

4n4nd mentioned this issue Jul 25, 2019

Change how data range input is taken #35

Closed

4n4nd self-assigned this Jul 25, 2019

sesheta added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jul 1, 2021

sesheta added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Oct 12, 2021

sesheta closed this as completed Nov 11, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test_model should give the user ability to define date and time of testing interval #34

test_model should give the user ability to define date and time of testing interval #34

MichaelClifford commented Jul 25, 2019

MichaelClifford commented Jul 25, 2019

4n4nd commented Jul 25, 2019

MichaelClifford commented Jul 25, 2019

4n4nd commented Jul 25, 2019 •

edited

Loading

MichaelClifford commented Jul 25, 2019

4n4nd commented Jul 25, 2019 •

edited

Loading

goern commented Mar 9, 2020

sesheta commented Jul 1, 2021

sesheta commented Oct 12, 2021

sesheta commented Nov 11, 2021

sesheta commented Nov 11, 2021

test_model should give the user ability to define date and time of testing interval #34

test_model should give the user ability to define date and time of testing interval #34

Comments

MichaelClifford commented Jul 25, 2019

MichaelClifford commented Jul 25, 2019

4n4nd commented Jul 25, 2019

MichaelClifford commented Jul 25, 2019

4n4nd commented Jul 25, 2019 • edited Loading

MichaelClifford commented Jul 25, 2019

4n4nd commented Jul 25, 2019 • edited Loading

goern commented Mar 9, 2020

sesheta commented Jul 1, 2021

sesheta commented Oct 12, 2021

sesheta commented Nov 11, 2021

sesheta commented Nov 11, 2021

4n4nd commented Jul 25, 2019 •

edited

Loading

4n4nd commented Jul 25, 2019 •

edited

Loading