Partek Flow Documentation

Page tree

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Classification in Partek Flow

Cell Ranger v6.0.0[2] has been wrapped in Partek® Flow®  as Cell ranger task. It does not comprehensively cover all of the options and analysis cases Cell Ranger can handle for now, but converts FASTQ files from cellranger mkfastq and performs alignment, filtering, barcode counting, UMI counting. The output gene expression count matrix in .h5 format (both raw and filtered available for users to download in the output page of task details) then becomes the starting point for downstream analysis for scRNA-seq in Flow.

Running Cell ranger in Flow

To run the Cell ranger task in Flow, select Unaligned reads datanode, then select Cell ranger in the 10x Genomics section (Figure 1).

Image Removed

Figure 1. Selecting the Cell ranger task for converting fastqs to Single cell counts.

Users will be asked to create a Cell Ranger 6.0.0 reference genome if it is the first time to run the Cell ranger task in Flow (Figure 2). 

Image Removed

Figure 2. Create Cell Ranger 6.0.0 reference genome for the first time user.

Clicking the big grey button of Create Cell Ranger 6.0.0 reference would pop up a new window where lists the three pre-built reference genomes for human(hg38), mouse(mm10) and the mix of two(hg38-mm10), respectively (Figure 3). They are exactly the same reference genomes (2020-A) that are provided in Cell Ranger by default. In details, the transcriptome annotations are respectively GENCODE v32  for human and vM23 for mouse, which are equivalent to Ensembl 98[3]. References for other organisms currently are not available in Flow Cell ranger, and will be coming in the future.

Image Removed

Figure 3. The available reference genomes in Flow Cell ranger.

Once the right reference has been chosen, simply press the Create button to finish. The reference of ‘hg38’ has been selected as an example here (Figure 4).

Image Removed

Figure 4. Create Human reference genome (hg38) for Cell ranger.

The main task menu will be refreshed as below (Figure 5) if references have been added. After the correct reference has been selected, users can go ahead click the Finish button to run the task as default. 

Image Removed

Figure 5. Run Cell ranger task with reference(hg38) in Flow.

A new data node named ‘Single cell counts’ will be displayed in Flow if the task is running (Figure 6). This data node contains filtered feature barcode count matrix. To open the task report when the task is finished, double click the output data node, or select the ‘Task report’ in the section after single clicking the data node. Users then will find the task report (Figure 7) is the same to the ‘Summary HTML’ from Cell Ranger output.

Image Removed

Figure 6. The running Cell ranger task in Flow.Garnett[1] automated cell type classification has been wrapped as the Classification task in Partek Flow. Just like the original Garnett tool, Classification in Flow works on single-cell data, along with a cell type definition (marker) file, and trains a regression-based classifier. Once a classifier is obtained and published, it can be applied to classify future datasets from similar tissues. To improve the user experience, both maker file(.txt) and classifier file(.rds) have been implemented as library files in Flow. 

Garnett classifiers in Flow

Flow hosts a few pre-trained classifiers as Managed classifiers. The list of available classifiers can be found here[2]. If a managed classifier exists for your data type, we recommend you try it. Besides the Managed classifiers, the Project classifiers trained on the same dataset prior to your classification can also be used to classify cell type. Project classifiers can be promoted to Managed classifiers if users publish them. 


Cell ranger task report in Flow

...

If you need additional assistance, please visit our support page to submit a help ticket or find phone numbers for regional support.

Garnett[1] automated cell type classification has been wrapped as the Classification task in Partek Flow. Just like the original Garnett tool, Classification in Flow works on single-cell data, along with a cell type definition (marker) file, and trains a regression-based classifier. Once a classifier is obtained and published, it can be applied to classify future datasets from similar tissues. To improve the user experience, both maker file(.txt) and classifier file(.rds) have been implemented as library files in Flow. 

Flow hosts a few pre-trained classifiers as Managed classifiers. The list of available classifiers can be found here[2]. If a managed classifier exists for your data type, we recommend you try it. Besides the Managed classifiers, the Project classifiers trained on the same dataset prior to your classification can also be used to classify cell type. Project classifiers can be promoted to Managed classifiers if users publish them. 

















Classify cell type

To classify cell type using a pre-trained classifier:  

...