Search
Spam filtering is a very practical assignment with a large real world application. It also represents certain class of problems, we have to contend with in machine learning.
In this assignment, your main task is not to create a perfect spam filter. You do not know the methods that would allow you to do that yet. Your task is:
Using this assignment we want to show the following:
We provide you with 2 sets of data to work with. While the final evaluation of your work will be done using different set of data, your spam filter should work on both. It is also important that you understand the format of the data that we will use; it is described on the page linked above.