| Size: 412 Comment:  | Size: 1727 Comment:  | 
| Deletions are marked like this. | Additions are marked like this. | 
| Line 1: | Line 1: | 
| = A to Z protocol to create an EnrichmentMap from gene expression data and using GSEA (Gene Set Enrichment Analysis) = | |
| Line 2: | Line 3: | 
| = Protocol = * A to Z protocol to create an EnrichmentMap from gene expression data and using GSEA (Gene Set Enrichment Map) | * How to create a rank file * the rank file contains only 2 columns. The gene names as the first column and the differential expression values for each gene as the second column. In this protocol, we will use the t value from a moderated Student's t-test. Headers (column names) should be removed. The format should be tab delimited (meaning that the columns are separated by tabs) and the file extension should be .rnk. * the rank file is a format described in the GSEA documentation: http://www.broadinstitute.org/cancer/software/gsea/wiki/index.php/Data_formats * the rank file will be used to run the gene set enrichment analysis | 
| Line 5: | Line 8: | 
| == Installation == == How to preprocess the data using R == == How to preprocess the data using Excel == | * How to get the pathway database file * How to run GSEA * How to create an expression file * How to create a map * What is the next step, how to use the map * How to create a figure * How to interpret the results * What next * (How to preprocess the data using R) * (How to preprocess the data using Excel) = FIRST EXAMPLE WITH AFFYMETRIX MICROARRAY DATA = * description of the data == Download the data from GEO == == Installation == * 1) install R (http://www.r-project.org/) * 2) install RStudio (http://www.rstudio.com/) * 3) Go through on online R tutorial (e.g. this one: http://www.cyclismo.org/tutorial/R/) == How to preprocess the data (normalization, QC, differential expression) == == How to update the annotations == | 
A to Z protocol to create an EnrichmentMap from gene expression data and using GSEA (Gene Set Enrichment Analysis)
- How to create a rank file - the rank file contains only 2 columns. The gene names as the first column and the differential expression values for each gene as the second column. In this protocol, we will use the t value from a moderated Student's t-test. Headers (column names) should be removed. The format should be tab delimited (meaning that the columns are separated by tabs) and the file extension should be .rnk.
- the rank file is a format described in the GSEA documentation: http://www.broadinstitute.org/cancer/software/gsea/wiki/index.php/Data_formats 
- the rank file will be used to run the gene set enrichment analysis
 
- How to get the pathway database file
- How to run GSEA
- How to create an expression file
- How to create a map
- What is the next step, how to use the map - How to create a figure
- How to interpret the results
- What next
 
- (How to preprocess the data using R)
- (How to preprocess the data using Excel)
FIRST EXAMPLE WITH AFFYMETRIX MICROARRAY DATA
- description of the data
Download the data from GEO
Installation
- 1) install R (http://www.r-project.org/) 
- 2) install RStudio (http://www.rstudio.com/) 
- 3) Go through on online R tutorial (e.g. this one: http://www.cyclismo.org/tutorial/R/) 
How to preprocess the data (normalization, QC, differential expression)
How to update the annotations
How to create a rank file
How to create an expression file
How to run GSEA
How to create a map
