Created a classifier that can distinguish spam emails from ham (non-spam) emails

cheahannah/spam-vs-ham-project


Spam vs Ham Email Classifier (2019)

Tools: Python (numpy, pandas, seaborn, matplotlib), Jupyter Notebooks

  • Created a logistic regression classification model for spam email detection
  • Improved accuracy through feature engineering using regular expressions and pandas
  • Validated model performance and minimized overfitting through cross-validation
  • Participated in a class-wide Kaggle competition and achieved a test accuracy of 0.90857
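
The workflow in the bullets above (regex-based features, a logistic regression model, cross-validation) can be sketched roughly as follows. This is a minimal illustration, not the project's actual notebook code: the toy emails, the feature choices, and every function name here are hypothetical, and it uses only the Python standard library so it runs without extra dependencies, whereas the project itself used numpy and pandas.

```python
import math
import re

# Hypothetical toy corpus (not the project's data): four spam (1), then four ham (0).
EMAILS = [
    ("WIN a FREE prize now!!! Click http://example.com/claim", 1),
    ("Free money!!! Reply NOW to claim your prize", 1),
    ("URGENT: you WIN! Visit https://example.org today!", 1),
    ("FREE FREE FREE!!! Limited offer, act now", 1),
    ("Hi team, the meeting moved to 3pm tomorrow.", 0),
    ("Can you review my pull request when you get a chance?", 0),
    ("Lunch on Friday? Let me know what works.", 0),
    ("Attached are the notes from today's lecture.", 0),
]


def extract_features(text):
    """Turn one email into a numeric feature vector using regular expressions."""
    n_chars = max(len(text), 1)
    return [
        1.0,                                            # bias term
        float(len(re.findall(r"!", text))),             # exclamation mark count
        float(bool(re.search(r"https?://", text))),     # contains a URL
        float(bool(re.search(r"\b(free|win|prize|money)\b", text, re.IGNORECASE))),
        sum(c.isupper() for c in text) / n_chars,       # share of uppercase characters
    ]


def sigmoid(z):
    z = max(-30, min(30, z))  # clip to avoid overflow in exp
    return 1.0 / (1.0 + math.exp(-z))


def dot(w, x):
    return sum(wj * xj for wj, xj in zip(w, x))


def fit(X, y, lr=0.1, epochs=500):
    """Fit logistic regression weights with batch gradient descent."""
    w = [0.0] * len(X[0])
    for _ in range(epochs):
        grad = [0.0] * len(w)
        for xi, yi in zip(X, y):
            err = sigmoid(dot(w, xi)) - yi
            for j, xij in enumerate(xi):
                grad[j] += err * xij
        w = [wj - lr * gj / len(X) for wj, gj in zip(w, grad)]
    return w


def predict(w, X):
    return [1 if sigmoid(dot(w, xi)) >= 0.5 else 0 for xi in X]


def cross_val_accuracy(X, y, k=4):
    """Return the held-out accuracy of each of k folds (fold i holds rows i, i+k, ...)."""
    scores = []
    for fold in range(k):
        held_out = list(range(fold, len(X), k))
        held = set(held_out)
        train = [i for i in range(len(X)) if i not in held]
        w = fit([X[i] for i in train], [y[i] for i in train])
        preds = predict(w, [X[i] for i in held_out])
        correct = sum(p == y[i] for p, i in zip(preds, held_out))
        scores.append(correct / len(held_out))
    return scores


X = [extract_features(text) for text, _ in EMAILS]
y = [label for _, label in EMAILS]
scores = cross_val_accuracy(X, y)
print("fold accuracies:", scores)
print("mean accuracy:", sum(scores) / len(scores))
```

Comparing the per-fold accuracies with the accuracy on the training rows is one way to spot overfitting: a large gap suggests the features are memorizing the training emails rather than generalizing.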