Protocol
- A to Z protocol to create an EnrichmentMap from gene expression data using GSEA (Gene Set Enrichment Analysis)
- How to create a rank file
- The rank file contains only two columns: the gene names in the first column and the differential expression value for each gene in the second column. In this protocol, we will use the t value from a moderated Student's t-test. Headers (column names) should be removed. The file should be tab-delimited (the columns are separated by tabs) and the file extension should be .rnk.
- The rank file format is described in the GSEA documentation: http://www.broadinstitute.org/cancer/software/gsea/wiki/index.php/Data_formats
- The rank file will be used to run the gene set enrichment analysis.
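
The rank file layout described above (two tab-separated columns, no header line, .rnk extension) can be illustrated with a minimal sketch. The protocol itself prepares the data in R or Excel; this Python version only demonstrates the file layout. The gene names, t values, the function name `write_rank_file` and the file name `example.rnk` are all hypothetical, and sorting by score is done for readability only, since the text above does not require it.

```python
import csv
import io


def write_rank_file(scores, out):
    """Write (gene, score) pairs as tab-delimited rows with no header line."""
    writer = csv.writer(out, delimiter="\t", lineterminator="\n")
    # Sort from the most positive to the most negative score (optional).
    ordered = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    for gene, score in ordered:
        writer.writerow([gene, f"{score:.4f}"])


# Hypothetical gene names and t values, for illustration only.
example_scores = {"GENE_A": 5.2134, "GENE_B": -3.8721, "GENE_C": 0.4519}

# Write to an in-memory buffer so the result can be inspected directly.
buffer = io.StringIO()
write_rank_file(example_scores, buffer)
rank_text = buffer.getvalue()
print(rank_text)

# To save to disk instead, use a file name ending in .rnk, e.g.:
# with open("example.rnk", "w", newline="") as handle:
#     write_rank_file(example_scores, handle)
```

Each output line holds one gene name, a tab, and its score, which matches the two-column layout described in the bullets above.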
 
- How to get the pathway database file
- How to run GSEA
- How to create an expression file
- How to create a map
- What is the next step, how to use the map
- How to preprocess the data using R
- How to preprocess the data using Excel
FIRST EXAMPLE WITH AFFYMETRIX MICROARRAY DATA
- description of the data
Download the data from GEO
Installation
- 1) Install R (http://www.r-project.org/)
- 2) Install RStudio (http://www.rstudio.com/)
- 3) Go through an online R tutorial (e.g. this one: http://www.cyclismo.org/tutorial/R/)
How to preprocess the data (normalization, QC, differential expression)
How to update the annotations
How to create a rank file
How to create an expression file
How to run GSEA
How to create a map
