Search
On the previous tutorial, we looked at the process of collection and assembly of gene expression data. In this tutorial we will reproduce a certain breakthrough experiment [1] (in a simplified scenario, of course) regarding the analysis of such data. This way, we will also learn about the popular dimensionality reduction method PCA.
PCA (principal component analysis) is a dimensionality reduction method exploiting the correlations among features in the data. Here, our parameters and variables are the following:
We assume you have an installation of Matlab. If you don't there is a free license to university students. First, download and extract the file: ge_assignment.zip
Taken from [1].
Construct a decision model to differentiate these types of cancer. Just complete the code in the script attached ge_cv.m (or ge_cv_matlab2015.m).
ge_cv.m
ge_cv_matlab2015.m
fitctree
ClassificationTree
fit
pca.m
mineGenes
Science, 1999
Old slides