= Protocol = * A to Z protocol to create an EnrichmentMap from gene expression data and using GSEA (Gene Set Enrichment Analysis) * How to create a rank file * the rank file contains only 2 columns. The gene names as the first column and the differential expression values for each gene as the second column. In this protocol, we will use the t value from a moderated Student's t-test. Headers (column names) should be removed. The format should be tab delimited (meaning that the columns are separated by tabs) and the file extension should be .rnk. * the rank file is a format described in the GSEA documentation: http://www.broadinstitute.org/cancer/software/gsea/wiki/index.php/Data_formats * the rank file will be used to run the gene set enrichment analysis * How to create an expression file * How to run GSEA * How to create a map * What is the next step, how to use the map * How to preprocess the data using R * How to preprocess the data using Excel = FIRST EXAMPLE WITH AFFYMETRIX MICROARRAY DATA = * description of the data == Download the data from GEO == == Installation == * 1) install R (http://www.r-project.org/) * 2) install RStudio (http://www.rstudio.com/) * 3) Go through on online R tutorial (e.g. this one: http://www.cyclismo.org/tutorial/R/) == How to preprocess the data (normalization, QC, differential expression) == == How to update the annotations == == How to create a rank file == == How to create an expression file == == How to run GSEA == == How to create a map == == What is the next step, how to use the map ==