Partek Flow Documentation

Page tree

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Samtools1 (version 1.2) utilizes the mpileup command to look at observed bases for in the reads covering every genomic position surveyed represented in the aligned sequence data and subsequently calculate a Bayesian prior probability of each possible genotypecalculate the likelihood of every possible genotype at a locus.  Subsequently, bcftools utilizes applies the prior probability distributions to calculate actual genotypes and output detected genotype and uses Bayesian inference to call actual genotypes, outputting variant information in Variant Call Format (vcf).  This method can identify both single nucleotide variants and insertions/deletionsdeletion events.  General information about the underlying algorithm are utilized by Samtools is detailed by Li. 2,3

Samtools dialog

Selecting Samtools from the context sensitive menu will bring up the Samtools task dialog, which contains three default sections: Variant detection method, Select Reference sequence, and Advanced options.

In the Variant detection method drop-down list, Against reference will compare base composition for each sample against the reference sequence assembly, independently (Figure 1).   Selection of the Among samples,

 

 

Numbered figure captions
SubtitleTextSelecting a variant detection method in the Samtools dialog
AnchorNamesantools detection method

Image Added

 

Selection of the Among samples (in the absence of sample attributes or when when Paired analysis is  is not selected, ) utilizes information from all samples in the project in a joint variant detection.  Against reference typically performs well when samples have good coverage and may contain unique SNPs.  In instances where there is low coverage, Among samples may perform well for identifying SNPs shared between samples (but not for unique SNPs).

 

 

Numbered figure captions
SubtitleTextSelecting a variant detection method in the Samtools dialog
AnchorNamesantools detection method

Image Removed

 

In the event paired samples exist within the project, Variant detection Among samples can be utilized to identify loci with differing genotypes between the paired samples pair once each sample has been compared to the reference sequence assembly.  To perform this analysis, sample attributes must be added in the Data tab of the project (Figure 2).  Specifically, an attribute must be added for sample ID (shared between the paired samples) and an attribute may must also be added for sample type that differ between differentiates the paired samples.  Examples of the latter can include case and control or tumor and normal.  If these attributes are present, a section for Analysis options will be displayed below the Variant detection method (Figure 3).  To utilize this feature, select Paired analysisMatch ID must then be specified and should correspond to the attribute that references the sample ID shared between the pair.   Selecting Case/control will allow for discriminating genotypes between paired samples in downstream tasks.  Attribute should correspond to the attribute that defines type within sample pairs, and Control can be specified for whatever category relates to the reference sample genotypes.

 

 

Numbered figure captions
SubtitleTextExample of attributes required for paired variant analysis
AnchorNamedata tab paired variant detect

 

 

Numbered figure captions
SubtitleTextSpecifying options for paired variant detection
AnchorNameamong samples paired

 

Select Reference sequence will specify the reference assembly to utilize for variant detection.  If the alignment was generated in Partek® Flow®, the Assembly will be displayed as text in the section, and you do not have the option to change the reference.  In the event that alignment was performed outside of Partek® Flow®, you will need to select the appropriate Assembly utilized for alignment in the drop-down list.  Assemblies previously added to library files (see Library File Management) will be available for selection or New assembly… in the menu can  can be utilized to import the reference sequence to library files from within the task.

Advanced options provides a means to tune parameters in the variant detection for optimal performance.  Upon invoking the task dialog, Option set is set to Default, and these parameters are provided by the Samtools developers.  Clicking Configure button will open a window to tune advanced options (Figure 3).  Moving the mouse cursor over the info button will provide details for each parameter.   Please  Please refer to the Samtools documentation for further details on any of these parameters.

...