- Created a chart to visualize the impact on emissions from reduced mobility due to Covid-19 quarantine in USA.
- Cleaned and joined Google mobility data with Air Quality Index data at the county-level using Pandas in Jupyter Notebook.
- Utilized Spark to distribute data processing on Amazon EMR cluster to import data from S3 and perform data analysis.
The mobility data was acquired from Google's mobility report
Lockdown dates data was taken from kaggle
County-level AQI data from the EPA