Enrichment Map Tutorial Direct from GSEA Interface

Contents

Enrichment Map Tutorial Direct from GSEA Interface

Outline

This quick tutorial will guide you through the generation of an Enrichment Map for an analysis performed using GSEA Gene Set Enrichment Analysis directly from the GSEA interface.

To run this tutorial

You need to have Cytoscape installed : minimally 2.6.3 must be installed but preferable to have the latest version of Cytoscap 2 (e.g. 2.8.3)
It does not work with Cytoscape 3
You need a version of GSEA that supports the direct creation of Enrichment Maps. (FILL IN VERSION NUMBER WHEN IT GETS RELEASED)
You need to download the test data: GSEATutorial.zip

Description of the tutorial files contained in the GSEATutorial folder

ES_NT.cls : phenotype definition for expression file required by GSEA.
MCF_ExpMX_v2_names.gct : Expression File - Estrogen treatment, Official Gene Name as key. - Data for 12hr,24hr and 48hr.
Human_GO_AllPathways_no_GO_iea_April_15_2013_symbol.gmt: Gene set definition file.

Instructions

Step 1: Run GSEA

Screenshot GSEA Input Panel

GO to GSEA website - http://www.broadinstitute.org/gsea/
Click on Downloads in the page header.
- From the javaGSEA Desktop Application right click on Launch with 1 Gb memory.
- Click on “Save Target as…” and save shortcut to your desktop or your folder of choice so you can launch GSEA for your analysis without having to navigate to it through your web browser.
Double click on GSEA icon you created.
Click on Load data in left panel.
Click on Browse for files… in newly opened Load data pane.
Navigate to directory where you stored tutorial test set files. Select raw expression (.gct) file, sample class file(.cls) and gene set (.gmt) file. Click on Open.
Wait until confirmation box appears indicating that all files loaded successfully. Click on Ok.
Click on Run GSEA in left panel.
Select the Expression dataset:
- Click on the arrow next to the Expression dataset text box.
- Select the expression set you wish to run the analysis on (MCF7_ExprMx_v2_names.gct).
Select the Gene Set Database:
- Click on … next to the text box of Gene Set Database.
- Click on Gene Matrix (local gmx/gmt) tab.
- Select gmt file Human_GO_AllPathways_no_GO_iea_April_15_2013_symbo.gmt and click on Ok.
Select the Phenotype labels file
- Click on … next to the text box of Phenotype labels.
- Make sure Select source file is set to ES_NT.cls.
- Select ES12_versus_NT12 and click on Ok.
Click on the down arrow next to the text box for Collapse dataset to gene symbols. Select false.
Click on the down arrow next to the text box for Permutation type. Select gene_set.
Click on Show next to Basic fields.
Click in text box next to Analysis name and rename (example:estrogen_treatment_12hr_gsea_enrichment_results).
Click on … next to “Save results in this folder text box. Navigate to the folder where you wish to save the results (preferably the same directory where all the input files have been saved).
Click on Run in the bottom right corner.

Step 2: Generate Enrichment Map

Screenshot GSEA-EM Input Panel

Once GSEA has completed in the GSEA reports panel next to the GSEA analysis you wish to create an Enrichment map for click on Visualize in Cytoscape.
Tune Parameters
- P-value cut-off 0.001
- Q-value cut-off 0.05
- Check Overlap Coefficient
  - Overlap coefficient cut-off 0.5
If you have conducted an GSEA analysis on a Preranked list of genes but wish to see the original expression file associated with your enrichment map update the path to the expression file next to Expression.
Click on Build Enrichment Map
Cytoscape should launch and create your Enrichment map.
Go to View, and activate Show Graphics Details

Step 3: Examining Results

Example EM session - Estrogen treatment vs no treatment at 12hr ES12_EM_example.cys

GSEA EM Result
Legend:

Node (inner circle) size corresponds to the number of genes in dataset 1 within the geneset
Colour of the node (inner circle) corresponds to the significance of the geneset for dataset 1.
Edge size corresponds to the number of genes that overlap between the two connected genesets. Green edges correspond to both datasets when it is the only colour edge. When there are two different edge colours, green corresponds to dataset 1 and blue corresponds to dataset 2.

GSEA Leading Edge Information: GSEA EM leading edge

To access GSEA leading edge information click on an individual Node. Leading edge information is currently only available when looking at a single gene set.
In the Data Panel the expression profile for the selected gene set should appear in the EM GenesetExpression viewer tab.
Change the Normalization to your desired metric.
Change the Sorting method to GSEARanking.
Genes part of the leading edge are highlighted in Yellow.

For more detailed tutorials check out:

-  ⇤ ← Revision 4 as of 2013-07-04 19:22:41 → 
  Size: 6760
  Editor: RuthIsserlin
  Comment:
+   ← Revision 5 as of 2013-07-04 19:23:17 → ⇥
  Size: 6546
  Editor: RuthIsserlin
  Comment:
-Deletions are marked like this.
+Additions are marked like this.
 Line 56:
-'''Note''': repeat steps for the 24hrs time-point but use ES24_versus_NT24 phenotype labels in step 11 instead and in step 15 change the Analysis name (example:estrogen_treatment_24hr_gsea_enrichment_results)..