SPAMICITY

What?

SPAMICITY is an implementation of a naive Bayesian spam filter, that displays spam messages and marks words in the spam that are considered high spamicity (words that the computer uses to identify the email as spam). From these high spamicity words identified, SPAMICITY creates a visualization overlayed on the spam message. If the you want to help SPAMICITY with identifying spam, feel free to click on words to toggle their spamicity between low and hi.

Why?

We receive a lot of spam every day with the only sign of its existence the spam folder's perpetually increasing counter. I wanted to create something that made people interested in what they discard and ignore. In the process I've also managed to create a window into how Bayesian spam filtering works, and into what the computer is seeing when it considers spam.

How?

I created this project using and modifying code from Daniel Shiffman's lesson on Bayesian Filtering. Additionally this project is built with Processing and RiTa. Lastly, thank you for all of the help and ideas from my class!

Goals and Problems