Spam filtering is a very practical assignment with broad real-world application. It also represents a certain class of problems that we have to contend with in machine learning.
In this assignment, your main task is not to create a perfect spam filter. You do not yet know the methods that would allow you to do that. Your task is:
With this assignment, we want to show the following:
You are given two sets of data to work with. Although the final evaluation of your work will be done using a different set of data, your spam filter should work on both of the provided sets.