Partek Flow Documentation

Page tree

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

 

Table of Contents

This tutorial presents an outline of the basic series of steps for analyzing a 10x Genomics Gene Expression with Feature Barcoding (antibody) data set in Partek Flow starting with the output of Cell Ranger.  

...

Numbered figure captions
SubtitleTextRunning UMAP on the Gene Expression data
AnchorNameRunning UMAP

Image RemovedImage Added

The Analyses tab now includes a UMAP task node (Figure 18).

...

Numbered figure captions
SubtitleTextUMAP calculated on Gene Expression values. Colored by Graph-based clustering results.
AnchorNameUMAP results

Image Modified

An advantage of UMAP over t-SNE is that is preserves more of the global structure of the data. This means that with UMAP, more similar clusters are closer together while dissimilar clusters are further apart. With t-SNE, the relative positions of clusters to each other are often uninformative.  

...

Numbered figure captions
SubtitleTextViewing the 2D UMAP
AnchorName2D UMAP

Image Modified

Classifying cells by clustering, gene, and protein expression

  • Click  to activate the lasso tool
  • Draw a lasso around clusters 3, 4, and 6 (Figure 21) to select them

Numbered figure captions
SubtitleTextSelecting a group of clusters
AnchorNameSelecting a group of clusters

Image Modified

  • Click  to filter to include only the selected cells
  • Click  to rescale the axes to the included cells (Figure 22)

Numbered figure captions
SubtitleTextZooming to a group of clusters in UMAP
AnchorNameViewing one sub-clustering

Image Modified

Because we merged the gene and protein expression data, we can visualize a mix of genes and proteins on the gene expression UMAP.

...

Numbered figure captions
SubtitleTextUMAP colored by gene and protein expression
AnchorNameUMAP colored by three markers

Image Modified

In addition to coloring by the expression of genes and proteins, we can select cells by their expression levels.

...

Numbered figure captions
SubtitleTextSelecting by NKG7 expression
AnchorNameSelecting by NKG7

Image Modified

  • Type CD3 in the ID search bar of the Features tab
  • Click CD3_TotalSeqB from in the drop-down
  • Click  to add a filter for CD3 protein expression

...

  • Type CD4 in the ID search bar of the Features tab
  • Click CD4_TotalSeqB from in the drop-down
  • Click  to add a filter for CD4 protein expression
  • Set the CD4_TotalSeqB filter to <= 2 

...

Numbered figure captions
SubtitleTextFiltering using multiple genes and proteins
AnchorNameFiltering using multiple genes and proteins

Image Modified

We can classify these cells. Because they express the pan T cell maker, CD3, and the cytotoxic marker, NKG7, but not the helper T cell marker, CD4, we can classify these cells as Cytotoxic T cells. 

...

Numbered figure captions
SubtitleTextModifying the selection criteria lets us select helper T cells
AnchorNameSelecting helper T cells

Image Modified

  • Click Classify selection
  • Type Helper T cells for the name
  • Click Save

...

  • Click Clear selection
  • Select Classification from the Color by drop-down menu (Figure 29)

 

 


Numbered figure captions
SubtitleTextViewing cytotoxic and helper T cell classifications
AnchorNameClassified cells

Image Modified

To return to the full data set, we can clear the filter.

...

The zoom level will also be reset (Figure 30).

 

Numbered figure captions
SubtitleTextResetting filters also resets the zoom level
AnchorNameReset zoom to show UMAP

Image Added

In addition to T-cells, we would expect to see B lymphocytes, at least some of which are malignant, in a MALT tumor sample. We can color the plot by expression of a B cell marker to locate these cells on the UMAP plot. 

  • Choose Expression from the Color by drop-down menu
  • Type CD19 in the search box
  • Click CD19_TotalSeqB in the drop-down

 There are several clusters that show high levels of CD19 protein expression (Figure 31). We can filter to these cells to examine them more closely.

 

Numbered figure captions
SubtitleTextViewing CD19 protein expression on the UMAP plot
AnchorNameCD19 expressing cells

Image Added

  • Click Image Added to activate the lasso tool
  • Draw a lasso around the CD19 protein-expressing clusters to select them
  • Click Image Added to filter to include only the selected cells
  • Click Image Added to rescale the axes to the included cells (Figure 32)

Numbered figure captions
SubtitleTextFiltering to CD19 expressing clusters
AnchorNameFiltering to B lymphocytes

Image Added

We can use information from the graph-based clustering results to help us find sub-groups within the CD19 protein-expressing cells.

  • Choose Graph-based from the Color by drop-down menu 

With the help of the Group biomarkers table, we can quickly characterize a few notable sub-groups based on their clusters (Figure 33).

 

Numbered figure captions
SubtitleTextViewing B lymphocyte clusters
AnchorNameViewing B lymphocyte clusters

Image Added

Cluster 7, shown in pink, lists IL7R and CD3D, genes typically expressed by T cells, as two of its top biomarkers. Biomarkers are genes or proteins that are expressed highly in a clusters when compared with the other clusters. Therefore, the cells in cluster 7 are likely doublets as they express both B cell (CD19) and T cell (CD3D) markers. 

  • Choose Graph-based from the Select by drop-down in the Attributes tab of the Selection / Filtering section of the control panel (Figure 34)

Numbered figure captions
SubtitleTextSelecting by cluster
AnchorNameSelecting by cluster

Image Added

  • Click the check box for to select cluster 7
  • Click Classify selection 
  • Name the cells Doublets 
  • Click Save 
  • Click Clear selection 

The biomarkers for clusters 1 and 2 also show an interesting pattern. Cluster 1 lists IGHD as its top biomarker, while cluster 2 lists IGHA1. Both IGHD (Immunoglobulin Heavy Constant Delta) and IGHA1 (Immunoglobulin Heavy Constant Alpha 1) encode classes of the immunoglobulin heavy chain constant region. IGHD is part of IgD, which is expressed by mature B cells, and IGHA1 is part of IgA1, which is produced by plasma cells. We can color the plot by both of these genes to visualize their expression.

  • Click IGHD in the Group biomarkers table
  • Hold Ctrl on your keyboard and click IGHA1 in the Group biomarkers table

This will color the plot by IGHD and IGHA1 (Figure 35).

 

 

Numbered figure captions
SubtitleTextColoring by two genes from the Group biomarkers table
AnchorNameColoring by two biomarkers

Image Added

The clusters on the left show expression of IGHA1 while the larger or the two clusters on the right expresses IGHD. We can use the lasso tool to classify these populations.

  • Select the left-hand cluster with IGHA1 expression (Figure 36)

Numbered figure captions
SubtitleTextSelecting the IGHA1+ cells
AnchorNameSelecting the plasma cells

Image Added

  • Click Classify selection
  • Name them Plasma cells 
  • Click Save
  • Double-click any white-space on the plot to clear the selection

We can now classify the cluster that expresses IGHD as mature B cells. 

  • Draw a lasso around the right-hand cluster (Figure 37)

Numbered figure captions
SubtitleTextSelecting the IGHD+ mature B cells
AnchorNameSelecting the IGHD+ cells

Image Added

  • Click Classify selection
  • Name them Mature B cells
  • Click Save
  • Double-click any white-space on the plot to clear the selection

We can visualize our classifications.

  • Select Classifications from the Color by drop-down menu
  • Click Clear filters to view all cells (Figure 38)

Numbered figure captions
SubtitleTextViewing classifications
AnchorNameViewing all cells

Image Added

 

To use these classifications in downstream, we can apply the classifications.

  • Click Apply classifications 
  • Click Apply to confirm

This will produce a Classified groups data node.