Partek Flow Documentation

Page tree

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

SamtoolsSAMtools1 (version 1.2) utilizes the mpileup command to look at observed bases for in the reads covering every genomic position surveyed represented in the aligned sequence data and subsequently calculate a Bayesian prior probability of each possible genotype.  Subsequently, bcftools utilizes the prior probability distributions to calculate actual genotypes and output detected genotype calculate the likelihood of every possible genotype at a locus. Subsequently, bcftools applies the prior probability and uses Bayesian inference to call actual genotypes, outputting variant information in Variant Call Format (vcf).  This This method can identify both single nucleotide variants and insertions/deletionsdeletion events.   General information about the underlying algorithm are utilized by SAMtools is detailed by Li. 2,3

...

SAMtools dialog

Selecting Samtools SAMtools from the context sensitive menu will bring up the Samtools SAMtools task dialog, which contains three default sections: Variant detection method, Select Reference sequence, and Advanced options.

In the Variant detection method drop-down list, Against reference will compare base composition for each sample against the reference sequence assembly, independently (Figure 1).  Selection of the Among samples,



Numbered figure captions
SubtitleTextSelecting a variant detection method in the Samtools dialog
AnchorNamesantools detection method

Image Added


Selection of Among samples (in the absence of sample attributes or when when Paired analysis is  is not selected, ) utilizes information from all samples in the project in a joint variant detection.   Against reference typically performs well when samples have good coverage and may contain unique SNPs.   In instances where there is low coverage, Among samples may perform well for identifying SNPs shared between samples (but not for unique SNPs).

 

 

Numbered figure captions
SubtitleTextSelecting a variant detection method in the Samtools dialog
AnchorNamesantools detection method

Image Removed

 

In the event paired samples exist within the project, Variant detection Among samples can be utilized to identify loci with differing genotypes between the paired samples pair once each sample has been compared to the reference sequence assembly.   In instances where there is limited information to accurately determine genotypes in one or both of the samples, the same genotype may be called for case and control if it differs from the reference. The Filter variants task can be used to exclude these spurious loci. To perform this analysis, sample attributes must be added in the Data tab of the project (Figure 2).   Specifically, an attribute must be added for sample ID (shared between the paired samples) and an attribute may must also be added for sample type that differ between differentiates the paired samples.   Examples of the latter can include case and control or tumor and normal.   If these attributes are present, a section for Analysis options will be displayed below the Variant detection method (Figure 3).   To utilize this feature, select Paired analysis.   Match ID must then be specified and should correspond to the attribute that references the sample ID shared between the pair.   Selecting Selecting Case/control will allow for discriminating genotypes between paired samples in downstream tasks.   Attribute should correspond to the attribute that defines type within sample pairs, and Control can be specified for whatever category relates to the reference sample genotypes.

 

...



...

Numbered figure captions
SubtitleTextExample of attributes required for paired variant analysis
AnchorNamedata tab paired variant detect

Image Modified

...



...

Numbered figure captions
SubtitleTextSpecifying options for paired variant detection
AnchorNameamong samples paired

Image Modified


 Select Reference sequence will specify the reference assembly to utilize for variant detection.   If the alignment was generated in Partek® Flow®, the Assembly will be displayed as text in the section, and you do not have the option to change the reference.   In the event that alignment was performed outside of Partek® Flow®Partek Flow, you will need to select the appropriate Assembly utilized for alignment in the drop-down list.   Assemblies previously added to library files (see Library File Management) will be available for selection or New assembly… in the menu can  can be utilized to import the reference sequence to library files from within the task.

Advanced options provides a means to tune parameters in the variant detection for optimal performance.   Upon invoking the task dialog, Option set is set to Default, and these parameters are provided by the Samtools SAMtools developers.   Clicking Configure button will  will open a window to tune advanced options (Figure 3).   Moving the mouse cursor over the info button will provide details for each parameter.   Please refer to the Samtools SAMtools documentation for further details on any of these parameters.

 

...



Numbered figure captions
SubtitleTextConfiguring advanced Samtools options
AnchorNamesamtools advanced options

Image Modified

...


References

  1. Li H, Handsaker B, Wysoker A, et al. The Sequence Alignment/Map format and SAMtools. Bioinforma Oxf Engl. 2009;25(16):2078-2079.
  2. Li H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics. 2011;27(21):2987-2993.
  3. Li H. Improving SNP discovery by base alignment quality. Bioinformatics. 2011;27(8):1157-1158.


Additional assistance


 

Rate Macro
allowUsersfalse

...