Skip to content

Features are extracted from text data using the bag of words model and a Naive Bayes classifier is trained and used to predict email spams.

Notifications You must be signed in to change notification settings

mekhod/spam-email

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Spam-Emails practice project

Project Overview:

In this practice project, a labeled (spam or ham) email text dataset is processed using the Bag of Word (BoW) method and a Naive Bayes classifier is trained with the processed dataset.

Steps:

  • The dataset is loaded and preprocessed using the BoW method available in sklearn (sklearn.feature_extraction.text.CountVectorizer).
  • A Naive Bayes model is built.
  • The model is trained with the processed dataset.

Results:

  • An F1 score of 0.95 is achieved.

End

About

Features are extracted from text data using the bag of words model and a Naive Bayes classifier is trained and used to predict email spams.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages