Skip to content

Data warehousing project from identifying business opportunities, researching for data, designing, data modeling, performing data Extraction, Transformation, Loading using Python, building analytics in Tableau and made recommendations to optimize business strategies. Netflix is not only a successful Service But it is completely a Data-Driven

Notifications You must be signed in to change notification settings

AsfandAliMemon25/Data-Warehousing-Netflix-Data-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Data-warehousing-Netflix-project

The Business/Organization & Opportunity:

Netflix is a subscription-based streaming service that allows its members to watch tv shows and movies without commercials on an internet-connected device (ref. netflix.com). There is an opportunity to optimize the selection of tv shows and movies for the business to increase member subscriptions and/or stock prices by analyzing the current total number of TV shows/movies, top 20 director of movie and Tv shows etc.

Problem:

The system helps us to find out what the most popular genre of movies/shows are in the current Netflix shows/movies selection and in Which country has highest no of movie and also by each year

Dimensional Model:

![image](Screenshot 2024-05-09 131218 )

Detailed Design:

The datasets we found from Kaggle.com allow us to analyze the Netflix’s movies and shows selection. For ETL steps, we chose to use python because it is one of the popular general-purpose programming languages that is capable of performing the tasks and the team is familiar with. We chose to run our data warehouse on MySQL server because it is a popular relational database management system that provides secure and fast data storage, fast query speed and I am also familiar with it. Finally, I chose Tableau to create visualizations because it is a powerful visualization tool that offers ease of use and many visualization possibilities. using these technologies I can easily pick up how to use them because there are many instructions or documentations available for these technologies which will significantly reduce the training time and cost.

ETL:

image

Final Schema:

![image](Screenshot 2024-05-09 215504 )

The final schema for our project is a Star schema which contains two fact tables and four dimensional tables.

Dashboard Application:

![image](Screenshot 2024-05-18 162319 )

Use MySQL workbench to monitor performance

Connect Tableau to MySQL and Tableau analysis:

![image](Screenshot 2024-05-18 162654 )

The analytics/visualization in the dashboard help answer the questions in the Problem section.

What are total no of movies and Tv shows on Netflix [image](Screenshot 2024-05-18 162808 )

what are top 20 director of movies and Tv shows [image](Screenshot 2024-05-18 162917 )

In Which country has most no of Movie In which year most movie and Tv show released [image](Screenshot 2024-05-18 163014 )

Conclusion:

The analytics built in Tableau using data transformed by the data source helped answer the problems we identified during the planning section and learned from data analysis that there are room to improve the movie/show selections in genres that received high IMDB rating, produce more high quality Netflix original movies/shows and increase movies/shows from countries that have higher than average IMDB ratings to attract more and retain existing subscriptions.

During this project , I got the opportunity to work on tasks that are practical and identical to real life project and it was a tremendous and valuable experience. I have learned about ETL, MYSQL and tableau to Data Visualization and also Data Visualization using Python how to figure out what directions to seek answers when I was stuck, effectively ask and follow-up with questions, perform researches (mostly online) using keywords and quickly acquired the necessary skillsets to solve various problems I ran into. I hope it will be beneficial for me as a Data Analyst or any other positions in the future.

About

Data warehousing project from identifying business opportunities, researching for data, designing, data modeling, performing data Extraction, Transformation, Loading using Python, building analytics in Tableau and made recommendations to optimize business strategies. Netflix is not only a successful Service But it is completely a Data-Driven

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published