PGS Documentation

Page tree

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Table of Contents
maxLevel2
minLevel2
excludeAdditional Assistance

During a previous section of this tutorial, a spreadsheet named unexplained_regions was generated. This spreadsheet contains locations where reads map to the genome but are not annotated by the transcript database, in this case, RefSeqGene. The unexplained_regions spreadsheet is potentially very interesting as it may contain novel findings.

  • Right click column 6. Average Coverage and select Sort Descending from the menu
  • Select Find Overlapping Genes from the Tools option in the command toolbar (Figure 1)

Numbered figure captions
SubtitleTextSelecting Find Overlapping Genes from Tools in the command toolbar
AnchorNameFind Overlapping Genes

  • Select Add a new column with the gene nearest to the region in the Find Overlapping Genes dialog (Figure 2)
  • Select OK 

Numbered figure captions
SubtitleTextFind Overlapping Genes
AnchorNameAdd a new column with nearest gene

  • Select RefSeqTranscripts – 2014-01-03 from the Output Overlapping Features dialog (Figure 3)

Please note that it is recommended that you annotate with the same database used when you performed mRNA quantification.

  • Select OK 

Numbered figure captions
SubtitleTextSelect the database to search for overlapping features
AnchorNameSelect Database with Overlapping Features

The closest overlapping feature and the distance to it is now included as columns 7. Overlapping Features and 8. Nearest Features in the unexplained_regions spreadsheet. 

Right-clicking on a row header and selecting Browse to Location will show the reads mapped to the chromosome. For this tutorial, a couple of genes are selected to show regions that are located after a known gene or in the intron of a gene.

  • With the unexplained_regions spreadsheet open, right-click on Average Coverage (column 6) and select Sort Descending (Figure 4)

Numbered figure captions
SubtitleTextSorting the unknown_regions spreadsheet by Average Coverage
AnchorNameSort Decending

  • Select row 45 and Browse to location to show a region within an intron of UNC45B. This may be a novel exon (Figure 5) 

Numbered figure captions
SubtitleTextA region within an intron of UNC45B that might be an novel exon
AnchorNameNovel Exon example

  • Select row 10482 and Browse to location to show a region that starts 1 bp after CD82. This peak may represent an extended exon (Figure 6) 

Numbered figure captions
SubtitleTextA region that starts 1 bp after CD82 that might represent an extended exon
AnchorNameExtended Exon example

While RefSeq was used to identify overlapping features, the choice of which database to use will depend on the biological context of your experiment. For example, you may wish to utilize promoter or miRNA databases if you are interested in regulation of expression. 

 

Additional assistance

 

Rate Macro
allowUsersfalse