Spam filter

This assignment will probably be introduced during week 4 or 5. Spam filtering is a very practical task with wide real-world application. It is also representative of a certain class of problems that we have to contend with in machine learning.

What will you learn?

Objectives

With this assignment we want to show the following:

  1. For some problems, the program's ability to adapt is essential.
  2. Automatic learning also has certain pitfalls that need to be avoided.
  3. There are tasks for which it is hard to judge the quality of a solution.

The problem

In this assignment, your main task is not to create a perfect spam filter. You do not yet know the methods that would allow you to do that. Your task is:

Data

You are given two sets of data to work with. Although the final evaluation of your work will use a different set of data, your spam filter should work on both of the given sets.
