Partek Flow Documentation

Page tree
Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 9 Next »

A common task in bulk and single-cell RNA-Seq analysis is to filter the data to include only informative genes. Because there is no gold standard for what makes a gene informative or not and ideal gene filtering criteria depend on your experimental design and research question, Partek Flow has a wide variety of flexible filtering options. 

Filter features task can be invoked from any counts or single cell data node. The filter is applied to the values in the selected data node and the output is a filtered version of the input data node.

In the task dialog, click the check box to activate one or more of the filter types, configure the filter(s), and click Finish to run. 

Noise reduction filter

The Noise reduction filter lets you exclude features that meet basic criteria (Figure 1).


Figure 1. Noise reduction filter

 Descriptive statistics you can choose are:

  • Coefficient of variation: std. dev divided by mean of the feature
  • Geometric mean: nth root of the product of the n numbers, n is the number of features
  • Maximum: the highest value of a feature
  • Mean: the average value of a feature
  • Median: value of the mid point of a feature
  • Minimum: lowest value of a feature
  • Range: the difference between the highest and lowest values of a feature
  • Std dev.: the square root of the variance
  • Sum: total value of the feature
  • Variance: the average of the squared differences from the mean
  • Dispersion: variance divided by mean of the feature

For each of these you can choose to exclude features that are:

  • <: less than
  • <=: less than or equal to
  • == equal to
  • >: greater than
  • >=: greater than or equal to

The threshold is set using the text box. The input must be a number; it can be an integer or decimal, positive or negative.

If you select value, you can also choose a percentage of samples or cells that must meet the criteria for the feature to be excluded (Figure 2).


Figure 2. Selecting value exposes additional options

Statistics based filter

The Statistics based filter lets you include a number or percentile of genes based on descriptive statistics (Figure 3).

Figure 3. Statistics based filter

Select Counts to specify a number of top features to include or select Percentiles to specify the top percentile of features to include. 

Descriptive statistics you can choose are:

  • Coefficient of variance
  • Geometric mean
  • Maximum
  • Mean
  • Median
  • Minimum
  • Range
  • Standard deviation (std dev)
  • Sum
  • Variance
  • Dispersion

Feature list filter

If you have added feature lists in Partek Flow using the List management feature, the Filter list filter will be available (Figure 4). 


Figure 4. Feature list filter

You can choose to include or exclude features in any list you have added. 

Select the list using the drop-down menu.

Use the Feature identifier option to choose which identifier from your annotation matches the values in the feature list. 

Filter features task report

The filter features task report lists the filter criteria, reports distribution statistics for the remaining features, and indicates the number and percentage of features that passed the filter (Figure ).


Figure 5. Feature filter report table

If the input was a count matrix data node, sample box plot and sample histograms are provided to show the distribution of features after filtering. These plots are not available if the input was a single cell counts data node. 


Figure 6. Filter features report figures



Additional Assistance

If you need additional assistance, please visit our support page to submit a help ticket or find phone numbers for regional support.

Your Rating: Results: 1 Star2 Star3 Star4 Star5 Star 0 rates


  • No labels