Partek Flow Documentation

Page tree

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

This will generate a Scree plot and a PC component loadings table, which are useful for determining how many PCs to use for downstream analysis tasks. 

...

  • Double-click the PCA task node to open the PCA task report

The PCA task report includes  additional-includes the PCA plot, the Scree plot, the component loadings table, and the PC projections table. To switch between these elements, use the buttons in the upper right-hand corner of the task report Image Added

  • Click Image Added to open the Scree plot

The Scree plot lists PCs on the x-axis and the amount of variance explained by each PC on the y-axis, measured in Eigenvalue. The higher the Eigenvalue, the more variance is explained by the PC. Typically, after an initial set of highly informative PCs,  the amount of variance explained by analyzing additional PCs is minimal. By identifying the point where the Scree plot levels off, you can choose an optimal number of PCs to use in downstream analysis steps like graph-based clustering and t-SNE. 

  •  Mouse over the Scree plot to identify the point where PCs begin to add little additional value

In this data set, a reasonable cut-off could be set anywhere between 8 and 25 PCs. We will use 10 in downstream steps. 

Viewing the genes highly correlated with each PC can be useful when choosing how many PCs to include. 

  • Click the first Image Added to open the Component loadings table

This table lists genes on rows and PCs on columns. 

  • Click PC1 to sort the genes in descending order by their correlation with PC1
  • Click PC1 again to switch to ascending order of correlation with PC1

In this case, we see a few marker genes are highly correlated with PC1. 

Graph-based clustering


 

additional-assistance

 

Rate Macro
allowUsersfalse