View Source

Because different samples have different numbers of total reads, it would be misleading to calculate differential expression by comparing read count numbers for genes across samples without normalization.

Select the Gene counts data node
Select Normalization and scaling from the task menu
Select Normalize counts from the Normalization and scaling section of the task menu (Figure 1)

Flow Documentation > Normalizing counts > 2017-09-01 16_46_39-RNA-Seq Tutorial - Partek Flow.png

The Read count normalization menu will open (Figure 2).

Flow Documentation > Normalizing counts > 2017-09-01 16_47_42-Task setup - Partek Flow.png

Normalization can be performed by sample or by feature. By sample is selected by default; this is appropriate for the tutorial data set.

Available normalization methods are listed in the left-hand panel. For more information about these options, please see the Normalize Counts user guide.

For this tutorial, we will use the recommended default normalization settings.

Select

This adds Total count and Add 0.0001 to the Normalization order panel (Figure 3). Normalization steps are performed in descending order

Flow Documentation > Normalizing counts > 2017-09-01 16_49_30-Task setup - Partek Flow.png

Total Count normalizes read counts for each gene by the total count of the sample. This accounts for differences in total read counts between samples.

Add 0.0001 adds 0.0001 to the normalized read count of every gene. This prevents the read count data from having any 0 values. Values of 0 would prevent the gene specific analysis algorithm we will use for differential expression analysis from performing the necessary log transformation.

Select Finish to perform normalization

A Normalize counts task node and a Normalized counts data node are added to the pipeline (Figure 4)

Flow Documentation > Normalizing counts > 2017-09-01 16_43_51-RNA-Seq Tutorial - Partek Flow.png