Skip to content

A Web scrapping and Exploratory data analysis project using the HackerNews API.

License

Notifications You must be signed in to change notification settings

david-adewoyin/HackerNews

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

HackerNews ⭐

This is a data analysis project consisting of:

  • Web scrapping using Scrapy and Postgres.
  • And exploratory data analysis.

Web Scrapper:

Post items and usernames were scrapped from the publicly available Hacker News API and stored on a postgres database running on the cloud.More information about the scrapper can be found in the README in the hackernews_scrapper folder.

In total: 3 months of posts comprising:

  • 1.2 million posts
  • and 77k users were scrapped from the api.

Data Analysis:

  • Data cleaning was performed to tidy up data.
  • Exploratory Analysis and Named Entity Recognition were done on the data to answer questions such has:
    • Best day and hour to post.
    • What topics have the highest form of engagement during the period.

About

A Web scrapping and Exploratory data analysis project using the HackerNews API.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published