Intraday Volatility Forecasting with Deep Learning

Background

This repo documents my efforts to forecast volatility using deep learning. While working on proprietary deep learning models to forecast volatility, I found this paper by a group of researchers at Oxford. The paper was fascinating: the researchers discovered commonality between securities in their intraday volatility patterns, and leveraged that commonality to train forecasting models on a subset of securities.

Academic research serves to introduce novel ideas and methodologies, typically in the absence of a specific application or use. My interest in forecasting volatility is driven by the fact that, in theory, one who can accurately forecast volatility can leverage options strategies to generate significant returns.

Replication

The first author, Chao Zhang, was kind enough to share the original code, which served as the starting point for this replication. The replication now includes a custom dataset and fully refactored code for readability and reusability. The original dataset, Lobster, was more expensive than I liked, so I wrote a custom client to pull large amounts of minute data from Polygon. That was a project in and of itself, and can be found in this repo. Then, I reverse engineered the original code to understand the data processing and model training, and refactored it to be more modular and extensible.

I was able to produce results similar to the original paper using my new dataset and refactored code. The dnn-refactored-v1.py script contains the best-performing model (an MLP).

Data

I have not included my dataset in this repo, as it is too large. If you want it, shoot me an email and I'll send it to you.

processed-5yr-93-minute

This includes 5 years of raw minute data from Polygon, from 2018-10-11 to 2023-10-09. The data is processed in 3 ways:

65min aggregates (open, high, low, close, volume)
   1. includes a 9:30 period, but not a 16:00 period
65min Realized Volatility (RV)
   1. includes a 16:00 period instead of a 9:30 period
   2. preprocess-5y-93-v2.ipynb
65min log returns
   1. calculated as log(last trade in 65min bin) - log(first trade in 65min bin)
   2. preprocess-5y-93-v4.ipynb
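
The binning described above can be sketched in plain Python. This is only an illustration, not the code from the notebooks: the function names (`bin_start`, `summarize_bins`) are hypothetical, the input is assumed to be regular-session minute prices stamped at the start of each minute, and RV is assumed to be the square root of the sum of squared one-minute log returns within the bin, which is a common definition but may differ from what the notebooks compute. Each bin carries both a start label (the 9:30-style labelling used for the aggregates) and an end label (the 16:00-style labelling used for RV).

```python
import math
from collections import defaultdict
from datetime import datetime, timedelta

BIN_MINUTES = 65  # a 390-minute session (9:30-16:00) splits into 6 such bins


def bin_start(ts):
    """Start time of the 65-minute bin containing a minute bar stamped `ts`.

    Assumes `ts` falls inside the regular session (9:30 to 15:59).
    """
    session_open = ts.replace(hour=9, minute=30, second=0, microsecond=0)
    minutes_in = int((ts - session_open).total_seconds() // 60)
    return session_open + timedelta(minutes=(minutes_in // BIN_MINUTES) * BIN_MINUTES)


def summarize_bins(minute_prices):
    """minute_prices: time-sorted list of (timestamp, price) for one session.

    Returns one dict per bin with a start label, an end label,
    the bin log return, and the bin RV (assumed definition, see above).
    """
    bins = defaultdict(list)
    for ts, price in minute_prices:
        bins[bin_start(ts)].append(price)

    rows = []
    for start in sorted(bins):
        prices = bins[start]
        # One-minute log returns inside the bin.
        rets = [math.log(b) - math.log(a) for a, b in zip(prices, prices[1:])]
        rows.append({
            "start": start,                                   # 9:30, 10:35, ..., 14:55
            "end": start + timedelta(minutes=BIN_MINUTES),    # 10:35, ..., 16:00
            "log_return": math.log(prices[-1]) - math.log(prices[0]),
            "rv": math.sqrt(sum(r * r for r in rets)),
        })
    return rows


# Synthetic session: 390 one-minute prices starting at 9:30 (made-up values).
open_time = datetime(2023, 10, 9, 9, 30)
session = [(open_time + timedelta(minutes=i), 100.0 + 0.01 * i) for i in range(390)]
rows = summarize_bins(session)
```

Labelling by `start` gives a 9:30 period and no 16:00 period, while labelling by `end` gives a 16:00 period and no 9:30 period, which matches the difference noted between the aggregates and the RV series.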

Completed

TODO

add the LASSO and HARD model classes
add the MLP model class
implement the training loop for the MLP
run a naive report on new rv dataset
start comparing the different models by changing 1 thing at a time and observing the difference
get an LSTM running

