Online Contents Popularity Analysis with New York Times data

Why analyze online contents popularity?

In the digital era, everyone is competing for attention. Our goal is to analyze the relationship between online contents (description of the article on Twitter, NYT article content) and their popularity (number of Retweets and Likes on Twitter).
Methodology

We primarily use the GetOldTweets3 package to scrape data from the @nytimes Twitter acount, and regular expression and BeautifulSoup4 to extract features from each article.

Data scope

Every NYT article that was published from 2016/01/01 to 2018/12/31

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
2016-07-01_to_2016-12-01_nyt_twitter.p		2016-07-01_to_2016-12-01_nyt_twitter.p
README.md		README.md
업로드 시간, 비디오 및 사진 링크, 썸네일 링크 추출.ipynb		업로드 시간, 비디오 및 사진 링크, 썸네일 링크 추출.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Online Contents Popularity Analysis with New York Times data

About

Releases

Packages

Languages

kwonsunbin/nyt_analysis

Folders and files

Latest commit

History

Repository files navigation

Online Contents Popularity Analysis with New York Times data

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages