Movies-ETL

Project Overview

Overview

In this project I will create an automated pipeline that takes in scraped data from Wikipedia and IMDB, transforms it, and loads it into an existing PostgreSQL database.

Workflow

Read three data files (IMDB, Wikipedia, and ratings).
Extract and transform the data.
Load the data into a PostgreSQL movie database.
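
The extract and transform steps above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the sample records, the field names (`imdb_link`, `imdb_id`, `budget`), and the helper functions are all assumptions made for the example, and it uses only the Python standard library so it runs without any data files.

```python
import csv
import io
import json
import re

# Hypothetical sample inputs standing in for the real Wikipedia JSON
# and metadata CSV files (field names are assumptions).
WIKI_JSON = """[
  {"title": "Movie A", "imdb_link": "https://www.imdb.com/title/tt0000001/", "year": 1990},
  {"title": "Movie B", "imdb_link": "https://www.imdb.com/title/tt0000002/", "year": 1995},
  {"title": "No Link", "year": 2000}
]"""

METADATA_CSV = """imdb_id,title,budget
tt0000001,Movie A,1000000
tt0000003,Movie C,500000
"""


def extract_imdb_id(link):
    """Return the IMDb ID (e.g. 'tt0000001') found in a URL, or None."""
    match = re.search(r"tt\d{7,}", link or "")
    return match.group(0) if match else None


def extract(wiki_text, metadata_text):
    """Parse the raw JSON and CSV text into lists of dictionaries."""
    wiki_movies = json.loads(wiki_text)
    metadata_rows = list(csv.DictReader(io.StringIO(metadata_text)))
    return wiki_movies, metadata_rows


def transform(wiki_movies, metadata_rows):
    """Keep only movies present in both sources, joined on IMDb ID."""
    metadata_by_id = {row["imdb_id"]: row for row in metadata_rows}
    merged = []
    for movie in wiki_movies:
        imdb_id = extract_imdb_id(movie.get("imdb_link"))
        if imdb_id in metadata_by_id:
            merged.append({
                "imdb_id": imdb_id,
                "title": movie["title"],
                "year": movie["year"],
                "budget": int(metadata_by_id[imdb_id]["budget"]),
            })
    return merged


wiki_movies, metadata_rows = extract(WIKI_JSON, METADATA_CSV)
movies = transform(wiki_movies, metadata_rows)
print(movies)
```

Joining on the IMDb ID rather than the title avoids mismatches between films that share a name; records without a usable ID are dropped in this sketch.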

Prerequisites

Software: Python, Anaconda Navigator, Conda, Jupyter Notebook, PostgreSQL, pgAdmin 4.

Loading data in the PostgreSQL Movie Database

Summary

The ETL Jupyter notebook collects and cleans movie data from different sources (Wikipedia JSON, plus Kaggle and ratings CSV files). It transforms and merges the data and loads it into two updatable PostgreSQL database tables.
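
One way to keep the two tables updatable is to replace their contents on every run, so that re-running the pipeline does not duplicate rows. The sketch below illustrates that idea under stated assumptions: the table names (`movies`, `ratings`), the columns, and the `load_table` helper are hypothetical, and the standard-library `sqlite3` module stands in for PostgreSQL so the example runs without a server. A PostgreSQL driver would need its own connection setup and placeholder style.

```python
import sqlite3


def load_table(conn, table, columns, rows):
    """Replace the contents of `table` with `rows` (a list of tuples).

    Table and column names are interpolated into the SQL text, so they
    must come from trusted code, never from user input.
    """
    placeholders = ", ".join("?" for _ in columns)
    column_list = ", ".join(columns)
    with conn:  # commits on success, rolls back if an error is raised
        conn.execute(f"DELETE FROM {table}")
        conn.executemany(
            f"INSERT INTO {table} ({column_list}) VALUES ({placeholders})",
            rows,
        )


# In-memory database used as a stand-in for the PostgreSQL movie database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE movies (imdb_id TEXT PRIMARY KEY, title TEXT)")
conn.execute("CREATE TABLE ratings (imdb_id TEXT, rating REAL)")

load_table(conn, "movies", ["imdb_id", "title"], [("tt0000001", "Movie A")])
load_table(
    conn,
    "ratings",
    ["imdb_id", "rating"],
    [("tt0000001", 4.0), ("tt0000001", 3.5)],
)

# Running the load again replaces the rows instead of duplicating them.
load_table(conn, "movies", ["imdb_id", "title"], [("tt0000001", "Movie A")])

movie_count = conn.execute("SELECT COUNT(*) FROM movies").fetchone()[0]
rating_count = conn.execute("SELECT COUNT(*) FROM ratings").fetchone()[0]
print(movie_count, rating_count)
```

Wrapping the delete and insert in a single transaction means a failed load leaves the previous table contents in place.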
