Partek Flow Documentation

Page tree

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Enrichment analysis is a technique commonly used to add biological context to a list of genes, such as list of significant genes filtered from differential analysis report. The procedure is based on assigning genes to groups and then finding overrepresented groups in filtered gene lists using a Fisher's exact test.

Running Gene set

...

Enrichment

Gene set enrichment task can be invoked on a differential analysis output (or filtered differential analysis output) data node or filtered count matrix data node. Since the data node including all the features will server as serve as background, to get a meaningful result, always use a data node containing subset of features to invoke this task. Only gene names will be used in the computation. 

...

  • Gene set database is user defined database, see more details in the Adding a Gene Set chapter. The gene sets available for the current Assembly are listed under the Gene set  database drop-down list (Figure 2). The assembly is automatically selected, if possible. If the assembly cannot be detected, you can specify it using a the drop-down menu.

Partek distribute distributes Gene Ontology (GO) for human and mouse genomes, a bioinformatics initiative to unify the representation of gene and gene product attributes across various species [1, 2].

Numbered figure captions
SubtitleTextSelecting user defined gene set database
AnchorNameCreate gene list dialog

Image Removed

...

Image Added

  • Select feature identifier (optional) can be used to specify the feature format (e.g. Gene name, Gene ID, Feature ID). 
  • Specify the background gene list (optional) can be used for a feature list. Select the list using the drop-down. Click here for more information on List management.

The background gene list is used as the list of possible genes. By default, this is the genes included in the selected gene set database. If your assay limits the genes that could be detected, you may want to specify a background list.

...

Numbered figure captions
SubtitleTextGo enrichment report (truncated). Gene set column contains Gene Ontology identifiers (hyperlinks). Category labels are in the Description column. Enrichment score: negative natural logarithm of the enrichment P-value derived from the Fisher's exact test. Genes in list: number of genes that are present both in the list of significant genes and the gene set (GO category). Genes not in list: number of genes that are present in the gene set, but are not present in the list of significant genes. The column on the right contains links to gene breakdown chart and extra details
AnchorNameGo enrichment report

Image RemovedImage Added

The contingency table (Figure 4) can be displayed by selecting the View gene breakdown chart icon on the right (). The term "list" refers to the list of significant genes, while the term "set" refers to the respective GO category. The first row of the contingency table is also seen in the report, namely the Genes in list and Genes not in list columns.

Numbered figure captions
SubtitleTextContingency table used to calculate the enrichment p-value. List refers to the list of significant genes, set refers to the gene ontology category
AnchorNameContingency table

Image RemovedImage Added

The View extra details () button provides additional information on the GO category (Figure 5). In addition to the details already given in the report, a full list of Genes in list and Genes not in list can be inspected and downloaded (Download data) to the local computer as a text file. Use the arrow to expand these sections. 

Numbered figure captions
SubtitleTextGene ontology enrichment extra details
AnchorNameExtra enrichment details

Image RemovedImage Added

As previously mentioned, if you are using the GO gene sets distributed by Partek, the GO identifiers in the first column are hyperlinks to the Gene Ontology web-site entries (an example shown in Figure 6).

...

Numbered figure captions
SubtitleTextIf the gene ontology table has more than 100 rows, visualization of results is not possible
AnchorNamego_warning

Image RemovedImage Added


If needed, filter down the number results, for instance by using a cut-off based on the enrichment score. Type in the cut-off value in the text box beneath the Enrichment score and hit enter (an example is shown in Figure 11). Once the number or results falls below 100, a link to the Data Viewer will be displayed (Figure 8). Click on the View plots in Data Viewer link to open a new Data Viewer session.

Numbered figure captions
SubtitleTextUse the View plots in Data Viewer link to visualize the gene ontology enrichment results. The link is not visible if the table contains more than 100 rows
AnchorNameview_plots_data_viewer_link

Image RemovedImage Added


Two plots are loaded into Data Viewer (Figure 12). Both plots show enrichment score on the horizontal axis and gene ontology categories (i.e. the ones present in the gene enrichment table) on the vertical axis. The plots show enrichments scores (Enrichment score column of the gene ontology table) and - in addition - the plot on the left uses color range to depict enrichment P-value (green = low, red = high P-value). 

...