ClimateGDP

The project was made as part as the Big Data Infastructure and Technologies course in ITMO University.

Aim: to develop ML models to estimate GDPs of 34 European countries using meteorological and agricultural data.

Predictors:

2 m air temperature (t2m; ERA5 on single level);
total precipitations (tp; ERA5 on single level);
agricultural land area (agr; The World Bank);
cereal yield (yld; The World Bank).

Target:

gross domestic product (gdp; The World Bank).

1st Stage. According to a list of European countries all names were unified to one notation before data agreggation. Aftewards, the economy data were processed with drop out of states with NAN values. To gather climate data with spatial resolution of 0.25° it was necessary to slice the data accroding to the official borders (.shp file comprises this information) and average hourly data over years with following aggregation.

2d Statge. On this stage a k-means clustering was applied to split the countries on some groups (it was decided to have 3). Then heatmaps, lineplots and scatterpots were created to investigate the collected data.

3d Stage. To prove that meteorological data have an impact on GDP two datasets were obtained. Finally, XGBoost and RF models were built and trained with flollowing evaluation (R^2). For the best model (XGBoost) a sensetivity analysis was applied to see how the std of meteo variables can change the output of the model using Monte-Carlo technique.

Conclusions:

GDPs of 34 European countries were successfully modelled using two ML algortithms + k-means clustering and evaluated;
Climate (2 m air temperature and total precipitations) is one of the key components for well-being of all, even post-industrial economies;
It’s possible to forecast GDPs of European countries using different warming scenarios to lead a proper decision-making policy.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
2000.dat		2000.dat
2001.dat		2001.dat
2002.dat		2002.dat
2003.dat		2003.dat
2004.dat		2004.dat
2005.dat		2005.dat
2006.dat		2006.dat
2007.dat		2007.dat
2008.dat		2008.dat
2009.dat		2009.dat
2010.dat		2010.dat
2011.dat		2011.dat
2012.dat		2012.dat
2013.dat		2013.dat
2014.dat		2014.dat
2015.dat		2015.dat
2016.dat		2016.dat
2017.dat		2017.dat
2018.dat		2018.dat
2019.dat		2019.dat
2020.dat		2020.dat
API_AG.YLD.CREL.KG_DS2_en_csv_v2_5455770.csv		API_AG.YLD.CREL.KG_DS2_en_csv_v2_5455770.csv
API_NV.AGR.TOTL.ZS_DS2_en_csv_v2_5456183.csv		API_NV.AGR.TOTL.ZS_DS2_en_csv_v2_5456183.csv
API_NY.GDP.MKTP.CD_DS2_en_csv_v2_5454986.csv		API_NY.GDP.MKTP.CD_DS2_en_csv_v2_5454986.csv
Countries-Europe.csv		Countries-Europe.csv
Data Aggregation.ipynb		Data Aggregation.ipynb
ML_phase.ipynb		ML_phase.ipynb
README.md		README.md
World_Countries__Generalized_.shp		World_Countries__Generalized_.shp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ClimateGDP

About

Releases

Packages

Languages

alexxxroz/ClimateGDP

Folders and files

Latest commit

History

Repository files navigation

ClimateGDP

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages