Partek Flow Documentation

Page tree

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

The design pool can also be restricted by Min error degrees of freedom. The minimal When "Model types configuration" is set to Default , this is automated as follows: it is desirable to keep the error degrees of freedom is at or above six. Therefore, we automatically set to the largest k (k represents the error degree of freedom of the model) in the design model pool, with 0 <= k <=6 for which admissible models exist. Admissible model is one that can be estimated given the specified contrasts. In the above example, when we compare A vs B in Cell type, there are three possible design models. The error degree of freedom of model Cell type is largest and the error degree of freedom of model Cell type, Time, Cell type * Time is the smallest:

...

If the sample size is big, k >=6 in all three models, all the models will be evaluated and the best model will be selected for each feature. However, if the sample size is too small, none of the models will have k >=6, then only the model with maximal k will be used in the calculation. If the maximal k happens to be zero, we are forced to use Poisson response distribution only.

There are two types of p-value, F and Wald., Poisson, negative binomial and normal models can generate p-value using either Wald or F statistics. Lognormal models always employ the F statistics; the more replicates in the study, the less the difference between the two options. When there are no replicates, only Poisson can be used to generate p-value using Wald.

Note: Partek Flow keeps tracking the log status of the data, and no matter whether GSA is performed on logged data or not, the LSMeans, ratio and  fold change calculation is are always in linear scale. Ratio is the ratio of the two LSMeans from the two groups in the comparison (left is the numerator, right is the denominator); Fold change is converted from ratio: when ratio is greater than 1, fold change is same as ratio; when ratio is less than one, fold change is -1/ratio. In other words - fold change value is always >=1 or <=-1, there is no fold change value between -1 and 1. When the LSmean of numerator group is greater than that of denominator group, fold change is greater than 1; when LSmean of numerator group is less than denominator group, fold change is less than 1; when the group groups are the same, fold change is 1. Logratio is ratio is log2 transformed, which is equivalent to logfoldchange is some other software. 

GSA report

If there are multiple design models and multiple distribution types included in the calculation, the fraction of genes using each model and type will be displayed as pie charts in the task result (Figure 9).

...

  • Feature ID information: if transcript level analysis was performed, and the annotation file has both transcript and gene level information, both gene ID and transcript ID are displayed. Otherwise, the table shows only the available information.
  • Total readscounts: total number of read counts across all the samples observations from the input data in linear space. If the input data is quantification output, then the total reads is the sum of reads of raw counts; if the input data is normalized, then the total reads is the sum of normalized reads; if the input data is logged, then the total reads is the sum of anti-logged value.
  • Each contrast outputs p-value, FDR step up p-value, ratio and fold change in linear scale, LSmean of each group comparison in linear scale

When you click on the Optional columns link on the top-left corner of the table, extra information will be displayed in the table when select:

  • Maximum count: maximum number of reads counts across all the observations from the input data.
  • Geometric mean: geometric mean value of the input counts across all observations.
  • Arithmetic mean: arithmetic mean value of input counts across all observations.

Click on View extra details report () icon under View section to get more statistical information about the feature.  In a case that the task doesn't fail, but certain statistical information is not generated, e.g. p-value and/or fold change of a certain comparison are not generated for some or all feature, click on this icon to get more information by mousing over the read exclamation icon

...

Numbered figure captions
SubtitleTextVolcano plot in comparison A vs B. X-axis represents fold change (linear scale), Y-axis represents negative logged p-value (unadjusted), each dot is a feature. The horizontal line represents p-value of 0.05, two vertical lines represent fold change of -2 and 2. Lower left corner displays number of features passing the fold-change and p-value criteria
AnchorNameVolcano plot

Image RemovedImage Added

Feature list filter panel is on the left of the table (Figure 12). Click on the black triangle (  ) to collapse and expand the panel.

...


The filtered result can be saved into a filtered data node by selecting the Generate list button at the lower-left corner of the table ( Image RemovedImage Added ). Selecting the Download button at the lower-right corner of the table downloads the table as a text file to the local computer.

...