Partek Flow Documentation

Page tree

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Uniform Manifold Approximation and Projection (UMAP) is a dimensional reduction technique [1]. UMAP aims to preserve the essential high-dimensional structure and present it in a low-dimensional representation. UMAP is particularly useful for visually identifying groups of similar samples in large high-dimensional data sets such as single cell RNA-Seq. 

Running UMAP

We recommend normalizing your data prior to running UMAP, but the task will run on any counts data node. 

  • Click the counts data node
  • Click the Exploratory analysis section of the toolbox
  • Click UMAP
  • Click Finish to run

UMAP produces a UMAP task node. Opening the task report launches a scatter plot showing the UMAP results. 

UMAP vs. t-SNE

Both t-SNE and UMAP are dimensional reduction techniques that are useful for identifying groups of similar samples in large high-dimensional data sets. A comparison of the techniques for visualizing single cell RNA-Seq data by the authors of UMAP suggests that UMAP runs faster, is more reproducible, gives a more meaningful organization of clusters, and preserves more information about the global structure of the data than t-SNE [2].

...

Numbered figure captions
SubtitleTextThe same data visualized by UMAP (left) and t-SNE (right). Cells in both plots are colored by the same Graph-based clustering results. UMAP clearly shows groups of similar clusters, while t-SNE does not.
AnchorNameUMAP vs. t-SNE

Running UMAP

We recommend normalizing your data prior to running UMAP, but the task will run on any counts data node. 

  • Click the counts data node
  • Click the Exploratory analysis section of the toolbox
  • Click UMAP
  • Click Finish to run

UMAP produces a UMAP task node. Opening the task report launches a scatter plot showing the UMAP results. 

Basic UMAP parameters

Initialize output values (--init)

...