nikdav3126/Seinfeld_etl

Seinfeld_etl

Sometimes the hardest question in data collection is "What are you passionate about?" Unfortunately, when asked a question like that, your mind goes blank and all you can think is "nothing." Out of that "nothing" came the inspiration to do a Seinfeld ETL (extract, transform, load). This project shows how web scraping data, transforming it, and loading it into a database can serve many applications. We used Seinfeld, but the same approach has practical uses, such as a sentiment analysis to gauge whether a movie sequel would be well received, or whether a new product launch is likely to go well based on previous reviews.

Python, pandas, and SQL were used to put this project together. We built the code using Jupyter Notebook and pgAdmin 4.
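
The scrape, transform, and load steps can be sketched roughly as follows. This is a minimal illustration, not the project's actual notebook code. To stay self-contained it makes several assumptions: it parses a small inline HTML table (`SAMPLE_HTML`, a hypothetical stand-in for a scraped episode list) instead of fetching a live page, it uses the standard library's `html.parser` instead of pandas, and it loads into an in-memory SQLite database instead of the PostgreSQL database managed through pgAdmin. The table name `episodes` and the helpers `extract`, `transform`, and `load` are illustrative names only.

```python
import sqlite3
from html.parser import HTMLParser

# Hypothetical stand-in for a scraped episode table (assumption, not project data).
SAMPLE_HTML = """
<table>
  <tr><th>Season</th><th>Episode</th><th>Title</th></tr>
  <tr><td>1</td><td>1</td><td> The Seinfeld Chronicles </td></tr>
  <tr><td>1</td><td>2</td><td>The Stakeout</td></tr>
</table>
"""


class TableParser(HTMLParser):
    """Collects the text of every table cell, grouped by row."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None   # cells of the row currently being read
        self._cell = None  # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


def extract(html):
    """Extract: pull raw rows (lists of strings) out of the HTML."""
    parser = TableParser()
    parser.feed(html)
    return parser.rows


def transform(rows):
    """Transform: drop the header row, trim whitespace, cast numbers."""
    _header, *body = rows
    return [(int(season), int(episode), title.strip())
            for season, episode, title in body]


def load(conn, records):
    """Load: write the cleaned records into a SQL table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS episodes "
        "(season INTEGER, episode INTEGER, title TEXT)"
    )
    conn.executemany("INSERT INTO episodes VALUES (?, ?, ?)", records)
    conn.commit()


conn = sqlite3.connect(":memory:")  # assumption: SQLite in place of PostgreSQL
load(conn, transform(extract(SAMPLE_HTML)))
titles = [row[0] for row in
          conn.execute("SELECT title FROM episodes ORDER BY season, episode")]
print(titles)
```

Keeping the three stages as separate functions means the extract step could later be pointed at one of the sources listed below, or the load step at a different database, without touching the rest.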

Sources

  1. https://github.com/4m4n5/the-seinfeld-chronicles

  2. https://en.wikipedia.org/wiki/List_of_Seinfeld_episodes#Season_1_1989-90

  3. https://www.imdb.com/name/nm0911320/?ref_=fn_al_nm_1#actor (example)

  4. https://www.imdb.com/title/tt0098904/fullcredits/?ref_=tt_cl_sm
