CourseWare Wiki
Switch Term
Winter 2023 / 2024
Winter 2022 / 2023
Winter 2021 / 2022
Winter 2020 / 2021
Winter 2019 / 2020
Winter 2018 / 2019
Older
Search
Log In
old
courses
be5b33prg
homeworks
spam
data
Warning
This page is located in archive. Go to the latest version of this
course pages
.
Differences
This shows you the differences between two versions of the page.
View differences:
Side by Side
Inline
Go
Link to this comparison view
Both sides previous revision
Previous revision
2015/11/24 16:12 xposik
2015/11/24 16:06 xposik
2015/11/24 16:04 xposik
2015/11/24 15:54 xposik created
Go
2015/11/24 16:12 xposik
2015/11/24 16:06 xposik
2015/11/24 16:04 xposik
2015/11/24 15:54 xposik created
Go
Last revision
Both sides next revision
courses:be5b33prg:homeworks:spam:data [2015/11/24 16:04]
xposik
courses:be5b33prg:homeworks:spam:data [2015/11/24 16:06]
xposik
Line 1:
Line 1:
======Data format======
======Data format======
-
During this assignment you will work with data sets of emails, which will also contain meta-data about the emails. Such a set of data is usually called [[wp>Text_corpus|corpus]]. In our case, the meta-data
for our emails may
contain
the
information whether
it
is a spam or not and/or
what the decision of the
spam filter is.
+
During this assignment you will work with data sets of emails, which will also contain meta-data about the emails. Such a set of data is usually called
a
[[wp>Text_corpus|corpus]]. In our case, the meta-data
will
contain information whether
a particular email
is a spam or not
,
and/or
whether a
spam filter
thinks that the email
is
spam or not
.
You are given two sets of data to work with, they both come from the same source.
You are given two sets of data to work with, they both come from the same source.
courses/be5b33prg/homeworks/spam/data.txt
· Last modified: 2015/11/24 16:12 by
xposik