Partek Flow Documentation


The Chromosome view in Partek® Flow® is a visualization tool for next-generation sequencing (NGS) and microarray data. The viewer can display different types of information, including aligned reads, genomic databases (e.g. genes, transcripts, or variants), isoform proportions, and reference sequence.

This chapter will illustrate how to:

Table of Contents

 

Navigating Through the View

You can browse through the results by using one of the tools in the navigation bar (at the top of the view; Figure 4). The Select tracks tool is the topic of a separate section, while the remaining tools are described below.

 

Figure 4. Navigation bar of the Chromosome view (from left): Select tracks tool, Search box, Position box, mode selector (pointer, zoom, pan), zoom tool, bookmarks, save icon (the position in the figure is an example)

You can use the Search box to zoom to genomic features that are available in the annotation track. Start typing a search term and Partek Flow will show you the first 10 suggestions (Figure 5). To select one, use the arrow keys or mouse, or type the full feature name and hit enter.

 

Figure 5. Search box of the Chromosome view. To zoom in on a feature, start typing the feature name; Partek Flow will show suggestions available in the corresponding annotation file (the current annotation is visible in the column on the right) (an example is shown)

The Position box enables you to visualize a region of the genome. Coordinates are accepted in the following format: chromosome:start – end (zero-based). To show an entire chromosome, it is sufficient to enter just the chromosome number. The U-turn icon on the right takes you back to the original view, i.e. resets the zoom level to the view that was shown when the viewer was first opened.
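As an illustration of the coordinate format described above, the following Python sketch parses a position string such as chr1:1000-2000 or a bare chromosome name. It is not Partek Flow code: the function name parse_position, the tolerance for spaces, thousands separators, and an en dash, and the returned tuple are all assumptions made for this example.

```python
import re
from typing import Optional, Tuple

# Hypothetical helper, not part of Partek Flow. It accepts
# "chromosome:start-end" or just "chromosome". Tolerating spaces, commas,
# and an en dash is an assumption about what a user might type.
_POSITION_RE = re.compile(
    r"^\s*(?P<chrom>[^\s:]+)\s*"
    r"(?::\s*(?P<start>\d[\d,]*)\s*[-–]\s*(?P<end>\d[\d,]*))?\s*$"
)


def parse_position(text: str) -> Tuple[str, Optional[int], Optional[int]]:
    """Return (chromosome, start, end); start and end are None for a whole chromosome."""
    match = _POSITION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised position: {text!r}")
    chrom = match.group("chrom")
    if match.group("start") is None:
        return chrom, None, None
    start = int(match.group("start").replace(",", ""))
    end = int(match.group("end").replace(",", ""))
    if end < start:
        raise ValueError(f"end precedes start in {text!r}")
    return chrom, start, end


print(parse_position("chr1:1000-2000"))
print(parse_position("7"))
```

The sketch only validates the shape of the input; it does not check that the chromosome exists in the selected genome or that the coordinates fall within its length.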

Next, the mode selector (Figure 6) helps you to quickly navigate through the results.

 

Figure 6. Mode selector (from left): pointer mode, zoom mode, pan mode

When pointer mode is activated, the appearance of the cursor will change to an arrow (Figure 7). Pointer mode provides details on any item (e.g. short sequencing read, variant, microarray probe, annotation feature) selected on the canvas. The selected item is highlighted by a green box (Figure 7).

 

Figure 7. Highlighted item in Chromosome view: the microarray probe highlighted by a green box was selected using the pointer mode (microarray probes are used just as an example)

When zoom mode is activated, the appearance of the cursor will change to a plus sign. With the zoom mode on, you can magnify a specific region by positioning the cursor to the left of the area of interest and then using <left-click> & drag to move the mouse to the right of the area of interest (Figure 8). When the viewer refreshes, it will come "closer" to the region that was selected (by halving the number of bases displayed on the screen).

 

Figure 8. Using <left-click> & dragging the mouse to zoom into a region of the genome (starting magnification shown on the left). After releasing the left mouse button, Partek Flow will zoom into the highlighted region (right panel; an example is shown)

Alternatively, <left-click> on the canvas and Partek Flow will zoom in one level, halving the number of bases visible on the screen. To zoom out one level, use Ctrl & <left-click>; as a result, the number of visible bases will be roughly doubled.
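The halving and doubling described above can be sketched as simple window arithmetic. The Python function below is an illustration only: the name zoom, centring the new window on the clicked position, and clamping at coordinate 0 are assumptions, since the text does not state how Partek Flow positions the new window.

```python
from typing import Tuple


def zoom(start: int, end: int, center: int, zoom_in: bool = True) -> Tuple[int, int]:
    """Return a new (start, end) window with half or double the width.

    Hypothetical sketch: the window is centred on `center` (an assumption)
    and is not allowed to start before coordinate 0.
    """
    width = end - start
    new_width = max(1, width // 2) if zoom_in else width * 2
    new_start = max(0, center - new_width // 2)
    return new_start, new_start + new_width


print(zoom(1000, 2000, 1500))                 # zoom in one level
print(zoom(1000, 2000, 1500, zoom_in=False))  # zoom out one level
```

A fuller version would also clamp the window to the chromosome length, which is omitted here for brevity.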

When panning mode is activated, the appearance of the cursor will change to four arrows (Figure 9). <Left-click> and drag the canvas to the left or to the right to move upstream or downstream in the genome, respectively.

 

Figure 9. Appearance of the mouse cursor when the panning mode is on

Zooming out and in can also be achieved with the zoom tool (Figure 10) by moving the golden slider left or right, respectively, or by selecting the magnifying glass icons (– and +).

 

 

Figure 10. Zoom tool

The location of an interesting region can be bookmarked. Selecting the bookmark icon (i.e. the star) opens the dialog (Figure 11). To create a new bookmark, type the name of the region in the Create bookmark box and push Create.

 

Figure 11. Bookmark dialog. 'B2M' is shown as an example of an existing bookmark

The next time you want to go directly to the same location, select the name of the bookmark (the example in Figure 11 lists B2M - exon #4 as the bookmark name) and Partek Flow will plot the region as defined in the Location column. To remove a bookmark, select the delete icon.

Once the plot has been modified, you can save the current appearance of the canvas by using the save icon. The resulting dialog (shown in Figure 12) enables you to change the image Format (options include: .svg, .png, .pdf), Size, and Resolution. The image will be saved in your Downloads directory.

 

Figure 12. Save image dialog. Selecting Save saves the current visualisation on the canvas

Selecting Data Tracks for Visualization

Partek Flow plots genomic information on the canvas, which is organized into horizontal sections called tracks. The exact number, type, and presentation of tracks depend on several factors, such as the underlying pipeline, available annotation, and the level of zoom. Tracks are added, removed, or customized via the Select tracks dialog (Figure 13).

 

Figure 13. Select tracks button opens the Select tracks dialog

The content of the Select tracks dialog depends on the data nodes present on the Analysis tab of the current project (an example is shown in Figure 14). The current pipeline is depicted in the center of the window, while data nodes that can be visualised are highlighted by the colour of their layer. Tracks can be turned on or off by selecting the check boxes in the list of possible tracks (and data nodes) on the right. To uncheck all, use the Clear selection button.

 

Figure 14. Select tracks dialog (the pipeline is an example). Data nodes that can be visualised are highlighted by the colour of their layer (sky blue in this example). Tracks can be turned on or off by selecting the check boxes in the list of data nodes and respective tracks (right panel)

For ease of use, the pipeline and the list of tracks are linked: hovering over the track list highlights the matching data node in the pipeline and vice versa, i.e. selecting a data node in the pipeline panel highlights the respective node in the track list (Figure 15). Once you have decided on the tracks that should be plotted, push Display tracks to depict them on the canvas.

Figure 15. Selecting data tracks for the visualisation using the Select tracks dialog (an example). Hovering over the track list highlights the matching data node in the pipeline and vice versa, i.e. selecting a data node in the pipeline panel (e.g. Isoform proportion track) highlights the respective node in the track list (Normalised counts and Isoform proportion)

...

Visualizing the Results Using Data Tracks

The Data tracks section of the Select tracks dialog enables you to specify the tracks for visualization on the canvas. An overview of the available track types is provided in Figure 16. Note that not all tracks are visible at all times and that their presence depends on the zoom level. The tracks can be customised and their appearance changed by using the control panel on the right.

 

Figure 16. Data tracks in Chromosome view (examples). Panels: Alignments track, Isoform proportion track, Variants track, Amino acids track, Reads pileup track, Probe intensities track

Alignments Track

...

Figure 17. Alignments track: different colouring options. When colouring reads by sample, the reads are stacked (on top of each other), i.e. in the example there are more reads in the red sample than in the blue sample. Panels: reads coloured by sample, reads coloured by base calls

Isoform Proportion Track

The Isoform proportion track displays the reads mapped to transcripts and helps to visualize differential expression and alternative splicing, using standard symbols for exons (boxes) and introns (lines connecting the boxes). The size of each transcript is proportional to the number of reads that map to that transcript. The color indicates the samples to which the reads belong. Figure 18 shows a gene with two transcripts in RefSeq database; the top transcript is more abundant than the bottom transcript and is preferentially expressed in the "blue" condition (labeled as 0 uM). The bottom transcript, on the other hand, seems to be expressed at the same level across all three conditions (i.e. 0 uM, 5 uM, 10 uM). The number and structure of transcripts on the plot depend on the transcript model that was used for mapping.

 

Figure 18. Isoform proportion track: the transcripts are shown as present in the transcript model that was used for mapping. Exons are depicted as boxes. The size of each transcript is proportional to the number of reads mapping to it. Colors indicate samples to which the reads belong

Variants Track

Variant tracks show single nucleotide variants (SNVs) and indels, and appear in the Select tracks dialog if the Detect variants task has been performed. Presentation of variants depends on the level of zoom. With low power magnification, SNVs are seen as purple columns and indels as bars (insertions: green bars; deletions: red bars) (Figure 19).

...

Figure 19. Variants track at low power magnification: SNVs are symbolized by purple columns and an insertion is presented as a green bar (an example is shown). A deletion is presented as a red bar (none is visible on the figure)

...

 

Figure 20. Variants track at high power magnification: each SNV is presented as a pie chart and each slice symbolises the relative frequency of each base call (an example is shown). Base call colour codes are given by the track name

At higher magnification, insertions are seen as green boxes, with individual inserted bases presented using a pie chart, while deletions look like red boxes and the affected bases are also presented by a pie chart (Figure 21).
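The pie charts used for variants at high magnification show the relative frequency of each base call at a position. A minimal Python sketch of that calculation follows; the function name base_call_fractions and the plain list of base calls as input are assumptions for illustration and do not reflect how Partek Flow stores or computes these values.

```python
from collections import Counter
from typing import Dict, Iterable


def base_call_fractions(base_calls: Iterable[str]) -> Dict[str, float]:
    """Return the fraction of reads supporting each base at one position.

    Hypothetical helper: each returned value corresponds to the size of
    one pie slice.
    """
    counts = Counter(base.upper() for base in base_calls)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {base: n / total for base, n in sorted(counts.items())}


# Three reads call A and one read calls G at this position.
print(base_call_fractions(["A", "A", "G", "A"]))
```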

 

Figure 21. Variants track at high power magnification: an insertion is presented as a green box, a deletion is presented as a red box (an example is shown). Panels: Insertion, Deletion

Amino Acids Track

The Amino acids track becomes available in the Select tracks dialog after completing the Annotate variants task. The actual appearance of the track depends on the zoom level. With low-power magnification, you will see a message View not available at this zoom level, Please zoom in to view amino acids.

When you zoom closer to the genome, all the amino acids become visible as colored boxes (Figure 22) and are labeled using the single-letter amino acid code. Alternative amino acids are depicted as additional boxes on top of the consensus sequence.

 

Figure 22. Amino acids track at high power magnification: the consensus amino acid sequence is at the bottom of the track, while a variant is shown on the top (a change from Threonine to Proline is shown)

If an amino acid spans two exons, its box will be truncated and the line connecting the exons will be dashed. An example is in Figure 23.

 

Figure 23. Amino acids track: exon-spanning amino acids indicated by truncated boxes (i.e. Alanine on the left) (an example is shown)

An empty gray box on the top of consensus sequence is used to indicate a STOP codon, which is a consequence of a mutation (Figure 24).

 

Figure 24. Amino acids track: a variant which is in fact a STOP codon is represented by an empty box, as seen on the top of the G (an example is shown)

Untranslated bases, such as those downstream of a STOP codon, are depicted by lighter shades. Figure 25 shows two transcripts in an amino acid track; the direction is from left to right, so amino acids downstream of a STOP codon (P > G > L) are lightly shaded.

 

Figure 25. Amino acids track: amino acids downstream of a STOP codon are depicted by lighter shades. The STOP codon is represented by "." in the middle, direction is from right to left (an example is shown)

Reads Pileup Track

The Reads pileup track plots the short sequencing reads, as present in the .bam file. The track is not on by default (go to Select tracks to turn it on) and its appearance depends on the magnification; if you are zoomed out, a message - Zoom in to view individual reads - will be displayed.

...

 

Figure 26. Reads pileup track: short sequencing reads are represented as bars. Paired-end reads are located within a gray box encompassing both pairs. Singletons, such as that on the top right, are depicted as thicker reads (an example is shown)

If you used a junction-aware aligner (such as TopHat or STAR), the junction reads will be depicted using dashed lines, which connect exon-spanning parts of the same read (Figure 27).

 

Figure 27. Reads pileup track: junction reads are depicted using dashed lines. A RefSeq track is added at the top, to visualise the exons (an example is shown)

Deleted bases can also be seen on a Reads pileup track, as fat black lines (Figure 28).

 

Figure 28. Reads pileup track: deleted bases depicted using fat black lines (an example is shown)

Probe Intensities Track

Microarray probes are visualised by the Probe intensities track. The probes are shown as bars and their colour depends on the probe intensity, ranging from white (low) to admiral blue (high) (Figure 29).

 

Figure 29. Probe intensities track: probes are depicted as bars and their colour reflects the intensity (an example is shown)

As with the Reads pileup track, probes may not be visible with low power magnification and you will see a message - Zoom in to view individual microarray probes.

Annotating the Results

Cytoband Track

By default, the Chromosome view shows a cytoband track at the top of the canvas. If a cytoband file for your genome has not been added to Partek Flow, a warning will appear (Figure 30). In that case, go to the Library File Management page and download or create a cytoband file.

 

Figure 30. Warning message indicating that Chromosome view cannot be launched because of a missing cytoband file

The red box (Figure 31) indicates the part of the chromosome that is currently depicted on the canvas.

 

Figure 31. Cytoband track: the highlighted part is currently depicted on the canvas (an example is shown)

 

Reference Genome

The sequence of the reference genome is added to the Chromosome view by default, as long as it has been added to the respective genome on the Library File Management page. However, its presence (or absence) in the viewer depends on the current magnification. At low power, the track is hidden and you will see the message - Track hidden (zoom to view). At high power, on the other hand, the Reference genome track becomes visible (Figure 32) and is supplemented by the genomic coordinates (below the sequence). A vertical guide helps you to align the bases between Aligned reads and Reference genome tracks. Depending on the reference genome file, some bases may be shown in lowercase letters, symbolizing repetitive sequences, or other sequences masked by a tool such as RepeatMasker.
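Lowercase (soft-masked) bases of the kind mentioned above can be located with a short scan over the sequence. The Python sketch below is an illustration under stated assumptions: the function name soft_masked_intervals is hypothetical, the input is a plain sequence string rather than a reference genome file, and the returned intervals are zero-based and half-open.

```python
import re
from typing import List, Tuple


def soft_masked_intervals(sequence: str) -> List[Tuple[int, int]]:
    """Return zero-based, half-open (start, end) runs of lowercase bases.

    Hypothetical helper: lowercase a, c, g, t, and n are treated as
    soft-masked, which is the usual convention for masked genome files.
    """
    return [match.span() for match in re.finditer(r"[acgtn]+", sequence)]


print(soft_masked_intervals("ACGTacgtACnnGT"))
```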

 

 

Figure 32. Reference genome track. Numbers beneath the sequence are coordinates

 

Variant Database

...

The variants will be shown adjacent to the Reference genome track (Figure 33). If the database contains no frequency information on alternative alleles, the alleles will be drawn as bars (an example is the SNP on the left in Figure 33). If the frequency information is available, the relative frequency of each variant will be represented by a column (the SNP on the right in Figure 33).

 

Figure 33. Reference genome track with added variant annotation: single nucleotide variants present in the chosen database are depicted as bars (if no frequency information is available) or columns (columns reflect relative frequency of each alternative allele as stored in the database)

Note that the frequency information for each allele will be parsed out from the chosen database. That information can be retrieved by selecting a variant using the pointer mode and will be shown in the Selection details section of the control panel. Using the example shown in Figure 33, the details of the left database variant can be seen in Figure 34. The most frequent allele at that locus is G (hence, a yellow column is plotted above the Reference genome track), which matches the base call of the reference genome.

 

Figure 34. Selection details section of the control panel showing details of a SNV, as present in the selected database

If your variant database stores indels, they will be depicted using green (insertion) or red (deletion) symbols (Figure 35) pointing to deleted bases.

 

Figure 35. Reference genome track with added variant annotation: insertions are shown in green, deletions in red. In this example, an insertion of a single base has been described in the database, between G and T. An adjacent deletion of T and C bases has also been seen before

Other Annotation Tracks

Additional annotation tracks can be added to the viewer with the help of the Select tracks dialog (Figure 14) as long as they have been associated with the genome you are working on in the Library File Management.

...

 

Figure 36. Transcript database track: a gene with two transcripts is shown as an example. Exons are plotted as boxes and introns as lines connecting them. Untranslated regions (UTRs) are seen as narrow boxes. The arrows indicate directionality

Customizing the View

Controls

Chromosome view can be customized by using the control panel on the left (Figure 37). The Attribute and Order By controls show options depending on the current project, while the content of the Annotate amino acids control depends on the annotation files associated with the current genome build in the Library File Management. In order for any change to take place, push the Apply button.

 

Figure 37. Control panel (an example is shown)

Group data by

The first option, Group data by, specifies the number of Alignments tracks (Figure 38). All will result in only one track, with all the samples on it. Sample creates one track per sample, while Attribute produces one Alignments track per level of the Attribute (i.e. one track per group). 
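The three Group data by options can be thought of as three ways of partitioning samples into tracks. The Python sketch below illustrates that idea only; the function name group_tracks, the dictionary layout of sample attributes, and the "All" track label are assumptions and are not taken from Partek Flow.

```python
from typing import Dict, List, Optional


def group_tracks(
    samples: Dict[str, Dict[str, str]],
    group_by: str = "All",
    attribute: Optional[str] = None,
) -> Dict[str, List[str]]:
    """Map each track name to the samples drawn on it.

    Hypothetical sketch: `samples` maps a sample name to its attributes,
    e.g. {"s1": {"Tissue": "liver"}}.
    """
    if group_by == "All":
        return {"All": list(samples)}
    if group_by == "Sample":
        return {name: [name] for name in samples}
    if group_by == "Attribute":
        if attribute is None:
            raise ValueError("an attribute name is required")
        tracks: Dict[str, List[str]] = {}
        for name, attributes in samples.items():
            tracks.setdefault(attributes[attribute], []).append(name)
        return tracks
    raise ValueError(f"unknown grouping: {group_by!r}")


samples = {
    "s1": {"Tissue": "liver"},
    "s2": {"Tissue": "brain"},
    "s3": {"Tissue": "liver"},
}
print(group_tracks(samples, "Attribute", "Tissue"))
```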

 

Figure 38. Group data by: All creates one Alignments track for the entire project, Sample creates one Alignments track for each sample, Attribute creates one Alignments track for each group (an example is shown). Panels: All, Sample, Attribute

Annotate amino acids by

Annotate amino acids by controls the appearance of the Amino acids track and allows you to pick the transcript database that will be used to plot codons (Figure 39). The drop down list shows the databases currently available for the selected genome (additional databases can be added via Library File Management). 

 

Figure 39. Annotate amino acids by: transcript models currently associated with the chosen genome are displayed in the drop-down list and can be used to plot the Amino acids track (an example is shown)

Color by

The Color by option affects the colouring of the Alignments track and Isoform proportion track. When Sample is selected from the drop-down list, individual samples will be shown on the aforementioned tracks, each sample being given a different colour. If attributes were assigned to samples, they will also be visible in the Color by drop-down (Figure 40) and you will be able to highlight levels of the selected attribute (Figure 41).

 

Figure 40. Color by: the options control colouring of Alignments and Isoform proportion tracks. Sample, Base, and Match options are present by default. If attributes have been assigned to samples, they will appear in the drop-down list. In this example, that is the "Tissue" attribute

Figure 41. Difference between Color by Sample and Color by <Attribute>. Color by Sample uses different colours to depict individual samples; Color by <Attribute> uses different colours to depict levels of the selected sample attribute (as present in the Data tab). Alignments and Isoform proportion tracks are shown (an example). Panels: Color by Sample, Color by <Attribute>

The effect of the option to Color by Base can be seen with high power magnification (Figure 42). Individual base calls are highlighted by different colours. When that option is chosen at low power magnification, all the bases are shown in grey.

 

Figure 42. Color by Base highlights the base calls by colours. Different colours are visible with high power magnification; otherwise all the bases are shown in gray (an example)

Finally, Color by Match can be used to quickly identify mismatches against the reference genome. Matching bases are coloured blue, while mismatched bases are shown in yellow.
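The match and mismatch colouring can be illustrated with a base-by-base comparison. The Python sketch below assumes an ungapped alignment in which the read and the reference segment have the same length; the function name match_colours and the colour names as plain strings are hypothetical and serve only as an example.

```python
from typing import List


def match_colours(read: str, reference: str) -> List[str]:
    """Return "blue" for each matching base and "yellow" for each mismatch.

    Hypothetical sketch: assumes an ungapped alignment, so both strings
    must have the same length. The comparison ignores letter case.
    """
    if len(read) != len(reference):
        raise ValueError("read and reference segment must have equal length")
    return [
        "blue" if read_base.upper() == ref_base.upper() else "yellow"
        for read_base, ref_base in zip(read, reference)
    ]


print(match_colours("ACGT", "ACTT"))
```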

 

Read histogram Y-axis scales

The maximum of the y-axis of Alignments tracks is set by the Read histogram Y-axis scales option (Figure 43). When using Independent, the y-axis for each track is set individually, based on the maximum within that sample. Linked, on the other hand, takes the maximum across all the samples and uses that value as the maximum for every track.
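The difference between Independent and Linked scaling can be sketched in a few lines of Python. This is an illustration only: the function name y_axis_maxima and the representation of coverage as one list of integers per track are assumptions, not Partek Flow internals.

```python
from typing import Dict, List


def y_axis_maxima(coverage: Dict[str, List[int]], mode: str = "Independent") -> Dict[str, int]:
    """Return the y-axis maximum for each track.

    Hypothetical sketch: `coverage` maps a track name to its per-locus
    coverage values.
    """
    per_track = {name: max(values, default=0) for name, values in coverage.items()}
    if mode == "Independent":
        return per_track
    if mode == "Linked":
        shared = max(per_track.values(), default=0)
        return {name: shared for name in per_track}
    raise ValueError(f"unknown mode: {mode!r}")


coverage = {"sample 1": [1, 5, 3], "sample 2": [10, 2, 0]}
print(y_axis_maxima(coverage, "Independent"))
print(y_axis_maxima(coverage, "Linked"))
```

With Linked scaling, the heights of the histograms can be compared directly across tracks; with Independent scaling, low-coverage samples remain readable.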

 

Figure 43. Read histogram Y axis scales. When set to Linked, all the tracks have the same Y axis maximum, which depends on the sample with the highest coverage. Using Independent sets Y axis maximum independently for each sample. Panels: Independent, Linked

Read histogram type

Read histogram type changes the presentation of the Alignments track and should be used in conjunction with the Group data by and Color by controls to get the desired visualisation.

...

Figure 44. Alignments track: total coverage per locus is shown by using "Read histogram type" set to "Sum" and "Group data by" set to <Attribute>

To show average coverage per locus, switch Read histogram type to Average and leave Color by as is (i.e. by group) (Figure 45). With this setting, Chromosome view will calculate the average by dividing the total coverage per locus by the number of samples. Note that using Color by Sample would not make sense here. Although Figure 45 looks quite like Figure 44, the y-axis range is different.
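The relationship between total and average coverage can be shown with a small Python sketch. The function name combine_coverage and the per-sample lists of equal length are assumptions made for the example; only the arithmetic (total per locus divided by the number of samples) comes from the description above.

```python
from typing import Dict, List


def combine_coverage(per_sample: Dict[str, List[int]], histogram_type: str = "Sum") -> List[float]:
    """Combine per-sample coverage into one value per locus.

    Hypothetical sketch: every sample is assumed to cover the same loci,
    in the same order.
    """
    if not per_sample:
        return []
    totals = [sum(column) for column in zip(*per_sample.values())]
    if histogram_type == "Sum":
        return [float(total) for total in totals]
    if histogram_type == "Average":
        return [total / len(per_sample) for total in totals]
    raise ValueError(f"unknown histogram type: {histogram_type!r}")


group = {"sample a": [2, 4, 6], "sample b": [4, 0, 6]}
print(combine_coverage(group, "Sum"))
print(combine_coverage(group, "Average"))
```

The two results have the same shape and differ only by a constant factor, which is why the plots look alike while the y-axis ranges differ.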

 

Figure 45. Alignments track: average coverage per locus is shown by using "Read histogram type" set to "Average", "Group data by" set to "Attribute", and "Color by" set to <Attribute>

Finally, the Overlay option is useful if you want to directly compare base counts over several samples (or groups), as each will be represented by a line (i.e. no stacking). The example in Figure 46 is based on microarray data, showing three groups on the same Alignments track. The red group has the highest base counts, while the counts in the blue group are much lower.

Figure 46. Alignments track: coverage per locus is shown by using "Read histogram type" set to "Overlay". Each plot is a single experimental condition ("Group data by" set to "Attribute", "Color by" set to <Attribute>). Lines are rectangular since microarray data is used (an example)

 

Transcript label

...

 

Figure 47. Transcript label: setting the control to Gene shows only the gene label, while Transcript shows transcript labels. Both transcript database and Isoform proportion tracks are affected. Panels: Transcript label: Gene, Transcript label: Transcript

Reads pileup and probe color

Short sequencing reads can be coloured by strand (Reads pileup color: Strand) or by base (Reads pileup color: Base). Both options are illustrated in Figure 48.

 

Figure 48. Reads pileup color: colouring of the short sequencing reads by Strand or by Base. Panels: Reads pileup color: Strand, Reads pileup color: Base

The Probe color control customizes the appearance of the Probe intensities track (Figure 49). When set to Intensity, the colour of a probe reflects its intensity, using a colour gradient from white (low) to admiral blue (high). Alternatively, when Strand is turned on, probes on the reverse strand are in parakeet green, while probes on the forward strand are in sky blue.

 

Figure 49. Probe color: "Intensity" colors probes proportionally to their intensity, "Strand" uses colors to indicate probe positioning (an example is shown). Panels: Probe color: Intensity, Probe color: Strand

If a variant database is available for the current genome, the variants can be added to the Reference genome track (Figure 33). To show the variants, point the Variant database control to the database of your choice.

...

 

Figure 50. Customize colors dialog: selecting a drop-down arrow opens the color-picker tool

Track Order

The position of the tracks on the canvas can be controlled by using the Track order tool. If you want a track to be visible all the time, i.e. while scrolling up or down, pin it to the top or to the bottom. Figure 51 shows the Cytoband track pinned to the top of the canvas and the Reference genome track pinned to the bottom of the canvas. To unpin a track, click on the pin icon. The track will be unpinned and a message No tracks are pinned to the top / bottom will appear. To pin a track, drag the track name to the No tracks… message. Alternatively, you can use the green arrows to pin a track. When you mouse over an arrow, the new position of the track will be highlighted on the canvas; click on the arrow to accept.

A track can be hidden (meaning it will not be visible) by selecting the red minus, or unhidden by selecting the green plus icon.

...

 

Figure 51. Track order tool: to change the position of a track, drag and drop it to the new position. To pin a track to the top / bottom of the canvas, use the up and down arrows. To unpin a track, select the pin icon. A track can be hidden by clicking on the red minus symbol and unhidden by selecting the green plus. A coloured dot by a track name indicates the layer to which the track belongs (an example is shown)

Selection Details

At the bottom of the control panel you will find the Selection details section (Figure 52). It is used to display information on the element selected on the canvas (using the Pointer mode).

 

Figure 52. Selection details showing information on the element selected on the canvas. The example shows details of a microarray probe. Note the two link-outs ("Browse on UCSC" and "BLAST this sequence")

...
