timeseries

Time-series analysis is adventurous, I'd say, and forecasting with time-series data can be challenging.

Some points to keep in mind while dealing with time-series:

The uncertainty of a forecast is just as important as the (point) forecast itself.
Model serving (deploying and scoring) is really tricky.
Cross-validation (with a sliding or expanding window as the testing strategy) is also tricky.
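
To make the sliding versus expanding window idea concrete, here is a minimal sketch in plain Python. The function name `window_splits` and its parameters are made up for illustration; libraries such as scikit-learn offer their own splitters. The key property is that every training index comes before every test index, so no future data leaks into training.

```python
def window_splits(n_samples, n_splits, test_size, train_size=None):
    """Yield (train_indices, test_indices) pairs in time order.

    train_size=None gives an expanding window (training always starts at 0);
    an integer gives a sliding window of at most that many observations.
    """
    first_test_start = n_samples - n_splits * test_size
    if first_test_start <= 0:
        raise ValueError("not enough samples for the requested splits")
    for k in range(n_splits):
        test_start = first_test_start + k * test_size
        if train_size is None:
            train_start = 0
        else:
            train_start = max(0, test_start - train_size)
        train = list(range(train_start, test_start))
        test = list(range(test_start, test_start + test_size))
        yield train, test


# Hypothetical series of 10 observations, 3 folds, 2 test points per fold
expanding = list(window_splits(n_samples=10, n_splits=3, test_size=2))
sliding = list(window_splits(n_samples=10, n_splits=3, test_size=2, train_size=4))

for train, test in expanding:
    print("expanding", train, test)
for train, test in sliding:
    print("sliding  ", train, test)
```

With the expanding window the training set grows with each fold; with the sliding window it keeps a fixed length and moves forward in time.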

Some time-series exhibit ill-behaved uncertainty: the forecast errors do not follow any known distribution. Such information is useful for making judgmental decisions, but it cannot be modeled and used for forecasting. This is coconut uncertainty, the uncertainty of unknown unknowns, which leads to unpredictability.

Other time-series exhibit well-behaved uncertainty: the forecast errors follow known distributions (Normal, Poisson, etc.). Such information is useful for modeling and for predictions bound within a certain range. This window of uncertainty is subway uncertainty, the uncertainty of known unknowns.
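
When the errors are well-behaved, the "certain range" can be sketched as a prediction interval. The snippet below assumes past forecast errors (actual minus forecast) are roughly Normal; the residual values and the helper name `normal_interval` are made up for illustration, and the interval ignores the extra uncertainty from estimating the mean and standard deviation on a small sample.

```python
from statistics import NormalDist, mean, stdev


def normal_interval(point_forecast, residuals, coverage=0.95):
    """Return (lower, upper), assuming residuals are Normal(bias, sigma)."""
    z = NormalDist().inv_cdf(0.5 + coverage / 2)  # about 1.96 for 95%
    bias = mean(residuals)    # systematic over/under-forecasting
    sigma = stdev(residuals)  # spread of past errors
    centre = point_forecast + bias
    return centre - z * sigma, centre + z * sigma


# Made-up past forecast errors (actual - forecast)
residuals = [-1.2, 0.4, 0.9, -0.3, 0.2, -0.6, 1.1, -0.5]
lower, upper = normal_interval(100.0, residuals)
print(f"95% interval around 100.0: [{lower:.2f}, {upper:.2f}]")
```

If the residuals are clearly not Normal (heavy tails, skew), this kind of interval will be misleading, which is exactly the ill-behaved case described earlier.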

A forecast of level + trend is a baseline forecast. Baseline forecasts with the persistence model (using the observation at the previous time step as the prediction for the next time step) quickly indicate whether you can do significantly better. If you can’t, you’re probably dealing with a random walk. The human mind is hardwired to look for patterns everywhere, and we must be vigilant that we're not fooling ourselves and wasting time by developing elaborate models for random walk processes.
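
The persistence baseline is only a few lines of code. This is a minimal sketch with made-up numbers; the error it produces is the score any more elaborate model should beat.

```python
def persistence_forecast(series):
    """One-step-ahead forecasts: the prediction for time t is the value at t-1."""
    return series[:-1]


def mean_absolute_error(actual, predicted):
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


series = [10, 12, 11, 13, 14, 13, 15, 16]  # made-up observations
predicted = persistence_forecast(series)   # forecasts for t = 1 .. n-1
actual = series[1:]                        # what actually happened at t = 1 .. n-1
baseline_mae = mean_absolute_error(actual, predicted)
print(f"persistence MAE: {baseline_mae:.3f}")
```

If a candidate model cannot beat `baseline_mae` on held-out data, that is a hint the series may behave like a random walk.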

Approaches to smoothing a time-series: Baseline models

Holt's method - uses a level smoothing constant (alpha) and a trend smoothing constant (beta).
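
A minimal sketch of Holt's method in plain Python, to show where alpha and beta enter. The initialisation (first value as level, first difference as trend) is one simple choice among several, and the data are made up; a library such as statsmodels would normally be used and would also fit alpha and beta.

```python
def holt_forecast(series, alpha, beta, horizon):
    """Holt's linear-trend method; returns `horizon` forecasts past the end.

    Needs at least two observations for the trend initialisation.
    """
    level = series[0]
    trend = series[1] - series[0]  # simple initialisation (an assumption)
    for y in series[1:]:
        previous_level = level
        # alpha blends the new observation with the previous one-step forecast
        level = alpha * y + (1 - alpha) * (level + trend)
        # beta blends the latest change in level with the previous trend
        trend = beta * (level - previous_level) + (1 - beta) * trend
    return [level + h * trend for h in range(1, horizon + 1)]


series = [3.0, 5.0, 7.0, 9.0, 11.0]  # made-up, perfectly linear data
forecasts = holt_forecast(series, alpha=0.8, beta=0.2, horizon=3)
print(forecasts)  # continues the straight line: roughly 13, 15, 17
```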

Holt-Winters method - adds a seasonal smoothing constant (delta) and models a seasonal baseline: a regularly recurring pattern (day, week, month, quarter, etc.) in which the baseline rises and falls at regular intervals. The deviation of each season from the baseline’s long-term (annual) average is used for forecasts.
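
The seasonal-deviation idea can be illustrated on its own, without the smoothing constants. The sketch below is not the full Holt-Winters method: it assumes a series with no trend, uses made-up quarterly data, and simply averages each season's deviation from the overall mean.

```python
def seasonal_deviations(series, period):
    """Average deviation of each position in the cycle from the overall mean."""
    overall_mean = sum(series) / len(series)
    deviations = []
    for position in range(period):
        values = series[position::period]  # every observation for this season
        deviations.append(sum(values) / len(values) - overall_mean)
    return overall_mean, deviations


# Two made-up "years" of quarterly data with no trend
series = [8.0, 12.0, 14.0, 6.0, 8.0, 12.0, 14.0, 6.0]
baseline, deviations = seasonal_deviations(series, period=4)
next_year = [baseline + d for d in deviations]
print(baseline, deviations, next_year)
```

Holt-Winters goes further by updating level, trend, and these seasonal terms recursively with their own smoothing constants.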

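Exponential smoothing - weights recent observations more heavily than older ones, with weights that decay exponentially. In the trend-corrected variants above, the trend is estimated from the difference between consecutive (in time) smoothed levels.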

Smoothing models remove noise. Moving averages are commonly used for this, and they can be simple, exponential, or cumulative. Examples: https://www.kaggle.com/code/ranja7/energy-consumption-forecast-baseline-models
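
For reference, the three kinds of moving average can be sketched in plain Python as follows. The function names and the tiny series are illustrative only; pandas offers `rolling`, `expanding`, and `ewm` for real work.

```python
def simple_moving_average(series, window):
    """Mean of the last `window` values, starting once a full window exists."""
    return [sum(series[i - window + 1:i + 1]) / window
            for i in range(window - 1, len(series))]


def cumulative_moving_average(series):
    """Mean of all values seen so far."""
    averages, running_total = [], 0.0
    for count, value in enumerate(series, start=1):
        running_total += value
        averages.append(running_total / count)
    return averages


def exponential_moving_average(series, alpha):
    """Recursive average that weights recent values by alpha."""
    averages = [series[0]]
    for value in series[1:]:
        averages.append(alpha * value + (1 - alpha) * averages[-1])
    return averages


series = [2.0, 4.0, 6.0, 8.0]  # made-up data
sma = simple_moving_average(series, window=2)        # [3.0, 5.0, 7.0]
cma = cumulative_moving_average(series)              # [2.0, 3.0, 4.0, 5.0]
ema = exponential_moving_average(series, alpha=0.5)  # [2.0, 3.0, 4.5, 6.25]
print(sma, cma, ema)
```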

Forecasting approach

ARIMA handles data with a trend; SARIMA also handles data with a seasonal component. An ARIMA model is specified by the parameter set (p,d,q), also called the order: p is the auto-regressive order, d is the degree of differencing, and q is the moving-average order. SARIMA adds a seasonal order (P,D,Q) and the length of the seasonal period.

Trend is the long-term change in the mean level of observations. Seasonality is a pattern that repeats periodically, and noise is the random variation in the data. A time series is additive when the 'trend' is linear (changes occur at a constant rate) and the 'seasonality' is constant over time. A time series is multiplicative when the 'trend' is non-linear or the seasonal swings grow or shrink with the level. A stationary time series has a constant mean and variance over time and does not exhibit a trend.

Additive: Y(t) = Level + Trend + Seasonality + Noise

Multiplicative: Y(t) = Level × Trend × Seasonality × Noise
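
The difference parameter d is the easiest of the three to see in action: differencing a series d times removes a polynomial trend of degree d. A minimal sketch with a made-up quadratic series:

```python
def difference(series, d=1):
    """Apply first differencing d times: y[t] - y[t-1]."""
    for _ in range(d):
        series = [b - a for a, b in zip(series, series[1:])]
    return series


trend_series = [t * t for t in range(6)]  # 0, 1, 4, 9, 16, 25
first = difference(trend_series, d=1)     # [1, 3, 5, 7, 9]: still trending
second = difference(trend_series, d=2)    # [2, 2, 2, 2]: constant mean
print(first, second)
```

Each round of differencing shortens the series by one observation, which is worth remembering when aligning forecasts back to the original scale.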

Example: https://www.kaggle.com/code/ranja7/sarima-forecasts-auto-arima

Other libraries used for forecasting

https://unit8.com/resources/darts-time-series-made-easy-in-python/

https://pypi.org/project/statsforecast/

There can be outliers in time-series data, often called anomalies because they deviate from 'normal'. Anomalies can be point anomalies or collective (subsequence) anomalies.

https://www.kaggle.com/code/ranja7/anomaly-detection-in-timeseries-isolation-forest
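
For point anomalies, a much simpler check than the Isolation Forest used in the notebook above is a robust z-score based on the median and the median absolute deviation (MAD). This is a swapped-in technique for illustration, not the notebook's method; the data are made up, and 0.6745 with a cut-off of 3.5 are the commonly quoted constants for the modified z-score.

```python
from statistics import median


def mad_anomalies(series, threshold=3.5):
    """Indices whose modified z-score (median/MAD based) exceeds the threshold."""
    centre = median(series)
    mad = median(abs(x - centre) for x in series)
    if mad == 0:
        return []  # no spread around the median; the score is undefined
    return [i for i, x in enumerate(series)
            if 0.6745 * abs(x - centre) / mad > threshold]


series = [10, 11, 10, 12, 11, 50, 10, 11, 12, 10]  # made-up data with one spike
anomalies = mad_anomalies(series)
print(anomalies)  # [5]
```

This treats the series as a bag of values, so it will not catch collective anomalies or points that are only unusual for their season; applying it to model residuals instead of raw values is one way around that.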

One can also use Prophet, the package developed by Facebook/Meta, for anomaly detection and forecasting: https://facebook.github.io/prophet/docs/outliers.html

Multivariate Time-series

For multivariate time-series data, one can follow the VAR (vector autoregression) approach. One can also use deep learning methods such as LSTM: https://www.kaggle.com/code/ranja7/forecasting-with-lstm-tensorflow The VAR model is a baseline and can be trained to benchmark advanced models such as LSTM. Please refer to the VAR directory for a sample/reference.
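
To show what a VAR model estimates, here is a rough sketch of a two-variable VAR(1) fitted by ordinary least squares in plain Python. It is not the code in the VAR directory: the coefficient matrix, the simulated data, and the function names are all made up for illustration, and there is no intercept or lag selection.

```python
import random


def simulate_var1(coefficients, n_steps, seed=0):
    """Simulate z_t = A z_{t-1} + noise for a 2-variable system."""
    rng = random.Random(seed)
    (a11, a12), (a21, a22) = coefficients
    x, y = 0.0, 0.0
    path = [(x, y)]
    for _ in range(n_steps):
        # The right-hand side uses the previous x and y for both equations
        x, y = (a11 * x + a12 * y + rng.gauss(0, 1),
                a21 * x + a22 * y + rng.gauss(0, 1))
        path.append((x, y))
    return path


def fit_var1(path):
    """OLS estimate of A: (sum of current*lagged') times inverse(sum of lagged*lagged')."""
    sxx = sxy = syy = 0.0          # lagged * lagged sums
    cxx = cxy = cyx = cyy = 0.0    # current * lagged sums
    for (px, py), (cx, cy) in zip(path, path[1:]):
        sxx += px * px
        sxy += px * py
        syy += py * py
        cxx += cx * px
        cxy += cx * py
        cyx += cy * px
        cyy += cy * py
    det = sxx * syy - sxy * sxy
    return [[(cxx * syy - cxy * sxy) / det, (cxy * sxx - cxx * sxy) / det],
            [(cyx * syy - cyy * sxy) / det, (cyy * sxx - cyx * sxy) / det]]


true_coefficients = [[0.5, 0.1], [0.2, 0.3]]  # hypothetical, stationary
path = simulate_var1(true_coefficients, n_steps=3000)
estimated = fit_var1(path)

# One-step-ahead forecast from the last observation
last_x, last_y = path[-1]
forecast = (estimated[0][0] * last_x + estimated[0][1] * last_y,
            estimated[1][0] * last_x + estimated[1][1] * last_y)
print(estimated, forecast)
```

Each variable is regressed on the previous values of both variables, which is what lets a VAR capture cross-series effects that a univariate model misses.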

ranja-sarkar/Time_series

Folders and files

Latest commit

History

Repository files navigation

timeseries

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages