TEXT MINING FOR INDONESIAN ONLINE NEWS ARTICLES ABOUT CORONA

Hi! In the notebook, we will start our text mining journey by scraping a list of news articles from tirto.id and detik.com about the Coronavirus using BeautifulSoup package. The contents will be saved to an individual .tsv (tab seperated value) files, which will be loaded again for further analysis. From there, we analyze the posting pattern for each sites and train a Word2Vec model using gensim package in order to analyze the semantic and syntactic similarity between each preprocessed words.

REFERENCES

Article Contents

Stopwords List

About Word2Vec

External Media

china-map.png: https://upload.wikimedia.org/wikipedia/commons/9/97/Flag_map_of_China_%26_Taiwan.png

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.ipynb_checkpoints		.ipynb_checkpoints
results		results
scrape-results		scrape-results
stopwords-list		stopwords-list
README.md		README.md
TEXT MINING FOR INDONESIAN ONLINE NEWS ARTICLES ABOUT CORONA.ipynb		TEXT MINING FOR INDONESIAN ONLINE NEWS ARTICLES ABOUT CORONA.ipynb
china-map.png		china-map.png
word2vec_corona.model		word2vec_corona.model

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TEXT MINING FOR INDONESIAN ONLINE NEWS ARTICLES ABOUT CORONA

REFERENCES

Article Contents

Stopwords List

About Word2Vec

External Media

About

Releases

Packages

Languages

tomytjandra/text-mining-corona-articles

Folders and files

Latest commit

History

Repository files navigation

TEXT MINING FOR INDONESIAN ONLINE NEWS ARTICLES ABOUT CORONA

REFERENCES

Article Contents

Stopwords List

About Word2Vec

External Media

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages