Podcast: Data Skeptic
Episode: Spam Filtering with Naive Bayes
Description: Today's spam filters are advanced, data-driven tools. They rely on a variety of techniques to effectively, and often seamlessly, filter out junk email from good email. Whitelists, blacklists, traffic analysis, network analysis, and a variety of other tools are probably employed by most major players in this area. Naturally, content analysis can be an especially powerful tool for detecting spam. Given the binary nature of the problem (spam or not spam), it's clear that this is a great problem to use machine learning to solve. In order to apply machine learning, you first need a labelled training se...