A dataset analysis about spam messages.
Paper about the analysis.
Challenge description.
This project runs on Python 3 and the analysis are stored in the files below:
- 1 - Most frequent words
- 2 - Messages count
- 3 - Words max, min, mean, median, std, var
- 4 - Days with more messages
- Naive Bayes Classification using Scikit-learn
Just open the *.ipynb file on GitHub!
- Install Python 3
- Install Jupyter Notebook
pip install notebook
- Install depedencies
pip install -r requirements.txt
- Run Jupyter server
jupyter notebook