...
Numbered figure captions |
---|
SubtitleText | Gene Set Enrichment Analysis task in the toolbox |
---|
AnchorName | GSEA_task |
---|
|
Image RemovedImage Added |
Use the first dialog (Figure 2) to specify gene sets. You can run GSEA on pathways (currently based on Kyoto Encyclopedia of Genes and Genomse Genomes (KEGG) pathways) or on other gene set databases. When using the KEGG option, the KEGG database (i.e. the species) is automatically set, based on the upstream nodes. The Gene set size option allows you to restrict your analysis on gene sets of certain size (i.e. number of genes).
...
Numbered figure captions |
---|
SubtitleText | Select gene set for analysis dialog is used to specify gene sets |
---|
AnchorName | select_gene_set_for_analysis |
---|
|
Image Modified |
If you select Gene set database, two additional options will appear. Genome build will be detected automatically, based on the upstream nodes. The gene sets that are available for that build are listed in the drop down list (Figure 3). Custom databases will be labeled by their name as specified in the Library file management, while GO database will be labeled by the release date (as seen in Figure 3).
...
Numbered figure captions |
---|
SubtitleText | Specifying gene set database for GSEA analysis |
---|
AnchorName | specify_gene_set_database |
---|
|
Image Modified |
Once your choices are made, push Next to proceed.
...
Numbered figure captions |
---|
SubtitleText | Select factor for analysis dialog. Only factors with two categories are enabled. The warning message indicates that some of the factors in the current project have either a single category, or more than two categories |
---|
AnchorName | select_factor_for_analysis |
---|
|
Image Modified |
If the warning message is displayed, click on the details link to learn more about unavailable factors (an example is shown in Figure 5).
...
Numbered figure captions |
---|
SubtitleText | Excluded attributes details page listing experimental factors which are not available for the GSEA analysis |
---|
AnchorName | excluded_attributes_details |
---|
|
Image Modified |
Select the experimental factor that you want to run GSEA on and push Next.
...
Numbered figure captions |
---|
SubtitleText | Use the dialog to specify the reference category (Denominator) and the test category (Numerator). In the example, the Dependent group is the Numerator and is compared relative to the Control group (Denominator) |
---|
AnchorName | define_comparison |
---|
|
Image Modified |
Push Finish to launch GSEA with the default settings.
Alternatively, click on the Configure icon to access the advanced options (Figure 7). Number of data permutations (needed to calculate the normalised enrichment scores) can be controlled using the Permutations option. Low value filter is turned on by default and will remove all the genes with the lowest average coverage of 1.0 or below; if a filter feature task was performed before this task, the default low-value filter is set to None (for details please see the GSA chapter) . Finally, if you start your project by importing a count matrix (i.e. as opposed to generating the count matrix using Partek Flow), you need to specify whether the expression values were log transformed before the import (use the Data has been log transformed with base drop down).
...
Numbered figure captions |
---|
SubtitleText | Advanced GSEA options. Default settings are shown |
---|
AnchorName | advanced_GSEA_options |
---|
|
Image Modified |
GSEA Results
...
Numbered figure captions |
---|
SubtitleText | GSEA task node. Double click to view the report |
---|
AnchorName | GSEA_task_node |
---|
|
Image RemovedImage Added |
The report consists of two parts: the GSEA result table on the right and the filter panel on the left (Figure 9).
...
Numbered figure captions |
---|
SubtitleText | GSEA report table with the filter panel on the left |
---|
AnchorName | GSEA_report |
---|
|
Image Modified |
The comparison (i.e. Denominator vs. Numerator) is given at the top of the GSEA table. To download the table to your local computer as a text file, use the Download link in the bottom right. Each row of the table corresponds to one gene set and the gene sets are ranked by the P-value, ascending (lowest values at the top). The carrets icon () in the column headers are used for sorting. The columns of the table are as follows.
...
Numbered figure captions |
---|
SubtitleText | GSEA table: filter fine tuning options. The Results show the number of gene sets currently filtered in |
---|
AnchorName | GSEA_filter_settings |
---|
|
Image Modified |
Click on the View enrichment report icon () to open a new Data viewer session with the per-gene set report. The current gene set is in the title, at the top of the canvas (Enrichment profile). To quickly switch to another gene set, use the Configuration > Content > Data card in the menu (on the left). The individual plots are as follows (Figure 11; from top to bottom).
...
Numbered figure captions |
---|
SubtitleText | Enrichment plots. Three plots are generated for each gene set: enrichment score, gene set hits, and rank metric |
---|
AnchorName | view_enrichment_report |
---|
|
Image Modified |
Click on the View extra details plot () to open a gene set-specific report page (Figure 12).
...
Numbered figure captions |
---|
SubtitleText | Extra details report with key metrics for each gene set. The figure shows an example set (rRNA binding) |
---|
AnchorName | view_extra_details |
---|
|
Image RemovedImage Added |
References
- Subramanian A, Tamayo P, Mootha VK, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005;102(43):15545-15550. doi:10.1073/pnas.0506580102
- Mootha VK, Lindgren CM, Eriksson KF, et al. PGC-1alpha-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nat Genet. 2003;34(3):267-273. doi:10.1038/ng1180