
...

Absolute value
TX_sf = | X_sf |
Add
TX_sf = X_sf + C
A constant value C needs to be specified
Antilog
TX_sf = b^(X_sf)
A log base value b needs to be specified from the drop-down list; any positive number can be specified when Custom value is chosen
Divided by
When mean, median, Q1, Q3, std dev, or sum is selected, the corresponding statistic is calculated per sample or per feature, depending on whether transform on Samples or transform on Features is selected
Example: If transform on Samples is selected, Divide by mean is calculated as:
TX_sf = X_sf/M_s
where M_s is the mean of the sample.
Example: If transform on Features is selected, Divide by mean is calculated as:
TX_sf = X_sf/M_f
where M_f is the mean of the feature.
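The two Divide by mean examples above can be sketched in a few lines of Python. This is only an illustration, not Partek Flow's implementation: the function name `divide_by_mean`, the `transform_on` argument, and the layout of the matrix (one row per sample, one column per feature) are assumptions made for the example.

```python
from statistics import mean

def divide_by_mean(X, transform_on="samples"):
    """Divide every value by the mean of its sample or of its feature.

    Assumed layout: X is a list of rows, one row per sample,
    one column per feature, so X[s][f] corresponds to X_sf.
    """
    if transform_on == "samples":
        # M_s: one mean per sample (row)
        return [[x / mean(row) for x in row] for row in X]
    if transform_on == "features":
        # M_f: one mean per feature (column)
        col_means = [mean(col) for col in zip(*X)]
        return [[x / m for x, m in zip(row, col_means)] for row in X]
    raise ValueError("transform_on must be 'samples' or 'features'")

X = [[1.0, 3.0],
     [2.0, 6.0]]
by_sample = divide_by_mean(X, "samples")    # each row divided by its own mean
by_feature = divide_by_mean(X, "features")  # each column divided by its own mean
```

A sample or feature whose mean is zero raises a `ZeroDivisionError` in this sketch; how the product handles that case is not stated here.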
Log
TX_sf = log_b(X_sf)
A log base value b needs to be specified from the drop-down list; any positive number can be specified when Custom value is chosen
Logit
TX_sf = log_b(X_sf/(1-X_sf))
A log base value b needs to be specified from the drop-down list; any positive number can be specified when Custom value is chosen
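As a rough illustration of how the Log, Antilog, and Logit formulas relate, the sketch below applies them to single values. The function names and the default base of 2 are assumptions made for the example, not settings taken from the product.

```python
import math

def log_transform(x, b=2.0):
    # TX_sf = log_b(X_sf); x must be positive
    return math.log(x, b)

def antilog_transform(x, b=2.0):
    # TX_sf = b^(X_sf); the inverse of log_transform for the same base
    return b ** x

def logit_transform(x, b=2.0):
    # TX_sf = log_b(X_sf / (1 - X_sf)); x must lie strictly between 0 and 1
    return math.log(x / (1.0 - x), b)

round_trip = antilog_transform(log_transform(8.0))  # recovers 8.0 up to rounding
centered = logit_transform(0.5)                     # odds of 1, so the result is 0
```

Mathematically the base must be positive and different from 1, and the logit is defined only for values strictly between 0 and 1; values outside those ranges raise an error in this sketch.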
Lower bound
A constant value C needs to be specified.
If X_sf is smaller than C, then TX_sf = C; otherwise, TX_sf = X_sf
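The Lower bound rule is equivalent to taking the larger of the value and C. A minimal sketch, with a hypothetical function name:

```python
def lower_bound(x, c):
    # Values below C are raised to C; all other values pass through unchanged
    return c if x < c else x

bounded = [lower_bound(x, 1.0) for x in [0.0, 0.5, 1.0, 7.0]]
```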
Multiply by
TX_sf = X_sf × C
A constant value C needs to be specified
Quantile normalization
A rank-based normalization method. For instance, if the transformation is performed on samples, it first ranks all the features in each sample. Let vector V_s be the feature values of sample S sorted in ascending order. The method calculates a vector V_m that is the average of the sorted vectors across all samples, and then replaces each value in V_s with the value in V_m at the same rank. Detailed information can be found in [1].
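The procedure described above, for a transformation on samples, can be sketched as follows. This is a simplified illustration rather than the product's code: the function name and the matrix layout (one row per sample, one column per feature) are assumptions, and tied values are simply ranked in the order they appear, which may differ from how [1] or Partek Flow treats ties.

```python
def quantile_normalize(X):
    """Quantile-normalize the rows (samples) of X.

    Assumed layout: X is a list of rows, one row per sample,
    one column per feature. All rows must have the same length.
    """
    n_samples = len(X)
    # V_s for every sample: its feature values in ascending order
    sorted_rows = [sorted(row) for row in X]
    # V_m: average of the sorted vectors across all samples, rank by rank
    v_m = [sum(col) / n_samples for col in zip(*sorted_rows)]

    result = []
    for row in X:
        # Feature indices ordered from smallest to largest value
        order = sorted(range(len(row)), key=lambda j: row[j])
        out = [0.0] * len(row)
        for rank, j in enumerate(order):
            # Replace each value with the V_m entry at the same rank
            out[j] = v_m[rank]
        result.append(out)
    return result

X = [[5.0, 2.0, 3.0],
     [4.0, 1.0, 6.0]]
normalized = quantile_normalize(X)
```

After normalization every sample contains the same set of values (here 1.5, 3.5, and 5.5), arranged according to the original ranking within that sample.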
RPKM (Reads per kilobase of transcript per million mapped reads [2])
TX_sf = (10⁹ * X_sf)/(TMR_s*L_f)
where X_sf is the raw read count of sample S on feature F,
TMR_s is the total mapped reads of sample S, and
L_f is the length of feature F.

If quantification is performed on an aligned reads data node, the total mapped reads is the number of aligned reads. If quantification is generated from an imported read count text file, the total mapped reads is the sum of all feature reads in the sample.
If the feature is a transcript, transcript length L_f is the sum of the lengths of all the exons. If the feature is a gene, gene length is the distance between the start position of the most upstream exon and the stop position of the most downstream exon. See Bullard et al. [3] for additional comparisons with other normalization packages.

For paired reads, the Normalization option will show up as FPKM (Fragments per kilobase per million mapped reads).
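A small worked example of the RPKM formula may help. The function name and the numbers are hypothetical; the exon lengths are summed to give the transcript length, as described above.

```python
def rpkm(count, total_mapped_reads, feature_length):
    # TX_sf = (10^9 * X_sf) / (TMR_s * L_f)
    return (1e9 * count) / (total_mapped_reads * feature_length)

# Hypothetical transcript with two exons of 400 and 600 bases
exon_lengths = [400, 600]
transcript_length = sum(exon_lengths)  # L_f = 1000

# 100 reads on this transcript in a sample with 2,000,000 mapped reads
value = rpkm(100, 2_000_000, transcript_length)
```

Here the result is 50: 100 reads over 1 kilobase of transcript, in a sample with 2 million mapped reads.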

Subtract
When mean, median, Q1, Q3, std dev, or sum is selected, the corresponding statistic is calculated per sample or per feature, depending on whether transform on Samples or transform on Features is selected
Example: If transform on Samples is selected, Subtract mean is calculated as:
TX_sf = X_sf - M_s
where M_s is the mean of the sample
Example: If transform on Features is selected, Subtract mean is calculated as:
TX_sf = X_sf - M_f
where M_f is the mean of the feature
TMM (Trimmed mean of M-values)
The scaling factors are produced according to the algorithm described in Robinson et al. [4]. The paper by Dillies et al. [5] contains evidence that TMM has an edge over other normalization methods.
TPM (Transcripts per million as described in Wagner et al [6])
The following steps are performed:

...

Partek Flow Documentation
