The purpose of this project is to extract, transform, and load (hence, ETL) data related to crowdfunding using both Python and SQL. Pandas and Python are used to extract and transform raw data, and PostgreSQL and pgAdmin were used to load clean data that is ready for analysis.
A quick snapshot of the crowdfunding.xlsx data:
Python and Pandas were used to tranform the raw data to a clean data. The python code to this work can be found here.
A quick snapshot of the contacts_clean.csv data:
A quick snapshot of the category.csv data:
A quick snapshot of the subcategory.csv data:
A quick snapshot of the campaign.csv data:
QuickDBD was used to model the data into an Entity Relationship Diagram. The table schema for the Entity Relationship Diagram can be found here.
Note:
One-to-one relationship: A straight line with a short, perpendicular line.
One-to-many relationship: A straight line with a short, perpendicular line with three short branches.
PostgreSQL and pgAdmin were used to create a database for the project and four tables corresponding to each of the cleaned up data using queries. These data were then imported using pgAdmin into the tables ready for use:
Below are the screenshots of the completed tables: