PGS Documentation

Page tree

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Table of Contents
maxLevel2
minLevel2
excludeAdditional Assistance

During import, you created a categorical attribute called Tissue and assigned the 4 samples to either the muscle or not muscle groups. This step was to create replicates within a group, albeit this grouping is somewhat artificial and is only used in this tutorial because we want to illustrate ANOVA with a small data set. Replicates are a prerequisite for differential expression analysis using ANOVA. 

  • Select Differential Expression Analysis from the Analyze Known Genes section of the RNA-Seq workflow

The Differential Expression Analysis dialog offers the choice of analyzing at Gene-,Transcript-, or Exon-level. 

  • Select Gene-level 
  • Specify the 1/gene_rpkm (RNA-Seq_results.gene.rpkm) spreadsheet from the Spreadsheet drop-down menu (Figure 1)

Numbered figure captions
SubtitleTextChoosing the type of differential expression analysis
AnchorNameDifferential Expression Analysis dialog

  • Select OK to open the ANOVA dialog 

Available factors are listed in the Experimental Factor(s) panel on the left-hand side of the dialog. 

  • Select Tissue, then select Add Factor > to move Tissue to the ANOVA Factor(s) panel on the right-hand side of the dialog (Figure 2)

Numbered figure captions
SubtitleTextThe ANOVA dialog
AnchorNameANOVA initial configuration

If the ANOVA were now performed (without contrasts), a p-value for differential expression would be calculated, but it would only indicate if there are differences within the factor Tissue; it would not inform you which groups are different or give any information on the magnitude of the difference between groups (fold-change or ratio). To get this more specific information, you need to define linear contrasts.

  • Select Contrasts... to open the Configure dialog
  • For Select Factor/Interaction, Tissue will be the only factor available as it was the only factor included in the ANOVA model in the previous step; if multiple factors were included, they could be selected in the Select Factor/Interaction: drop-down menu. The levels in this factor are listed on the Candidate Level(s) panel on the left side of the dialog
  • For this data set, verify that No is selected for Data is already log transformed?
  • Left click to select muscle from the Candidate Level(s) panel and move it to the Group 1 panel (renamed muscle) by selecting Add Contrast Level > in the top half of the dialog. Label 1 will be changed to the subgroup name automatically, but you can also manually specify the label name 
  • Select not muscle from the Candidate Level(s) panel and move it to the Group 2 panel (renamed not muscle) 
  • The Add Contrast button can now be selected (Figure 3)
  • Select OK to return to the ANOVA dialog

Numbered figure captions
SubtitleTextDefining linear contrasts
AnchorNameConfiguring the ANOVA

 

  • Select OK to perform the ANOVA as configured (Figure 4)

Numbered figure captions
SubtitleTextFully configured ANOVA
AnchorNameFinal ANOVA Configuration

Once the ANOVA has been performed on each gene in the dataset, an ANOVA child spreadsheet ANOVA-1way (ANOVAResults) will appear under the gene_rpkm spreadsheet (Figure 5). The format of the ANOVA spreadsheet is similar for all workflows. Mouse over each column title for a description of the column contents. 

 

Numbered figure captions
SubtitleTextViewing ANOVA results
AnchorNameANOVA Results

 

In this tutorial, the overall p-value for the factor (column 4) is the same as the p-value for the linear contrast (column 5) as there are only two levels within Tissue. If we had more than two groups, the overall p-value and the linear contrast p-values would most likely differ. You can also see the ? symbol in the ratio/fold-change columns (6 and 7) for several genes that also have a low p-value because there are zero reads in one of the groups, thus making it impossible to calculate ratios and fold-changes between groups.

For using ANOVA with more complicated experimental designs, including multiple factors and linear contrasts, please refer to Identifying differentially expressed genes using ANOVA in the Gene Expression Analysis tutorial.

 

Page Turner
button-linkstrue

 

Additional assistance

 

Rate Macro
allowUsersfalse