GitHub - avcourt/spamfilter-py: A naïve Bayesian spam filter in Python

Spamfilter

A Naive Bayesian spam classifier in Python

This is a reimplementation of a spam filter that I previously wrote in Ruby. The original Ruby implementation can be found here and contains more details regarding its design and accuracy.

Once again, I have included small training and testing directories to illustrate usage. To get reasonable accuracy, larger training sets must be used, although the accuracy was surprisingly high even with these minuscule datasets. The included emails have been preprocessed to remove the headers and footers. You can experiment and see what provides higher accuracy.
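For readers unfamiliar with the technique, here is a minimal sketch of how a naive Bayesian classifier can label an email after training on a handful of examples. It is not the code in spamfilter.py: the `tokenize` function, the `NaiveBayes` class, the `"spam"`/`"ham"` labels, and the add-one (Laplace) smoothing are all assumptions chosen for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word-like tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


class NaiveBayes:
    """Illustrative two-class classifier; names and smoothing are assumptions."""

    LABELS = ("spam", "ham")

    def __init__(self):
        self.word_counts = {label: Counter() for label in self.LABELS}
        self.doc_counts = {label: 0 for label in self.LABELS}

    def train(self, text, label):
        """Add one labelled email to the model."""
        self.word_counts[label].update(tokenize(text))
        self.doc_counts[label] += 1

    def log_score(self, text, label):
        """Log prior plus summed log likelihoods of the tokens for one label.

        Assumes at least one email has been trained for every label;
        otherwise the prior would be log(0).
        """
        vocab = set()
        for counts in self.word_counts.values():
            vocab.update(counts)
        total_words = sum(self.word_counts[label].values())
        total_docs = sum(self.doc_counts.values())
        score = math.log(self.doc_counts[label] / total_docs)
        for tok in tokenize(text):
            # Add-one smoothing so unseen words do not zero out the product.
            count = self.word_counts[label][tok]
            score += math.log((count + 1) / (total_words + len(vocab)))
        return score

    def classify(self, text):
        """Return the label with the highest log score."""
        return max(self.LABELS, key=lambda label: self.log_score(text, label))


if __name__ == "__main__":
    nb = NaiveBayes()
    nb.train("win free money now", "spam")
    nb.train("free prize claim now", "spam")
    nb.train("meeting at noon tomorrow", "ham")
    nb.train("lunch tomorrow with the team", "ham")
    print(nb.classify("free money"))
    print(nb.classify("lunch meeting tomorrow"))
```

Working in log space avoids numeric underflow when many small probabilities are multiplied, which matters once emails grow beyond a few words.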

At the moment, the email directory is hardcoded and uses *nix pathnames.
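One possible way to lift this limitation is sketched below, using `pathlib` so that path separators follow the host operating system. The helper names `resolve_email_dir` and `list_email_files`, the `SPAMFILTER_EMAIL_DIR` environment variable, and the subdirectory names in the usage note are hypothetical; none of them come from the repository.

```python
import os
from pathlib import Path


def resolve_email_dir(cli_value=None, env_var="SPAMFILTER_EMAIL_DIR", default="emails"):
    """Pick the email directory: explicit value, then environment variable, then default.

    The environment variable name is an assumption made for this sketch.
    """
    raw = cli_value or os.environ.get(env_var) or default
    return Path(raw).expanduser()


def list_email_files(email_dir, *parts):
    """Return the files under email_dir/parts..., sorted by path.

    Path.joinpath builds the path with the separator of the current OS,
    so the same call works on both *nix and Windows.
    """
    target = Path(email_dir).joinpath(*parts)
    return sorted(p for p in target.iterdir() if p.is_file())
```

A caller might then write `list_email_files(resolve_email_dir(), "train", "spam")`, where `train` and `spam` stand in for whatever layout the emails directory actually uses.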

Files in this repository:

emails
.gitignore
readme.md
spamfilter.py
test_spamfilter.py
token.py
