Explorations into processing geographic data
The data has all been downloaded to my local PC but it can be found in the locations below. It is all UK data with an open government licence, or equivalent. I've found other shapefiles online but was unsure on their licence requirements so have not used them
-
National statistics Geoportal
- National Statistics Postcode Lookup File NSPL
- shape files for Counties, National Parks, Countries, Local Authorities, LSOA 2011
-
Land Registry Price Paid data PP_data
-
Ordnance Survey OpenData
- Boundary-Line
- OS Open Greenspace
- OS Open Rivers
- OS Open Roads
- OS Open UDPRN (Not used)
- [Strategi] (As at 2016 and now no longer on the OS website)
- Used for coastline, railways, ferries...
-
National Public Transport Access Nodes (NaPTAN)
-
- Listed Buildings
- Conservation Areas
-
Not yet used, but will hopefully add
- Ordnance Survey OpenData
- BGS Data
- 2021 census data
- This will take some thought as there are lots of individual files
- 01_GetData.ipynb
- Imports & Processes all raw data into a standard format
- 02_CreatePostcodeFile.ipynb
- From the raw gdf made in part 1 it creates a dataframe for all UK postcodes with lots of calculated variables based on our inputs
- KeyFunctions.py
- The module containing the functions used by 01 and 02 and other items
This folder contains some of the tests done to create the code in the main folder
- Maps.ipynb
- An investigative piece of code containing different map outputs