PGS Documentation

Page tree

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

To detect differential methylation between CpG loci in different experimental groups, we can perform an ANOVA test. For this tutorial, we will perform a simple onetwo-way ANOVA to compare the methylation states of the four two experimental groups. 

  • Select Defect Detect Differential Methylation from the Analysis section of the Illumina BeadArray Methylation workflow

A new child spreadsheet, mvalue, is created when Detect Differential Methylation is selected. M-values are an alternative metric for measuring methylation. β-values can be easily converted to M-values using the following equation: M-value = log2( β / (1 - β)).

An M-value close to 0 for a CpG site indicates a similar intensity between the methylated and unmethylated probes, which means the CpG site is about half-methylated. Positive M-values mean that more molecules are methylated than unmethylated, while negative M-values mean that more molecules are unmethylated than methylated.  As discussed by Du and colleagues, the β-value has a more intuitive biological interpretation, but the M-value is more statistically valid for the differential analysis of methylation levels.

Because we are performing differential methylation analysis, Partek Genomics Suite automatically creates an M-values spreadsheet to use for statistical analysis. 

  • Select 2. Cell Type and 3. shRNA treatment Gender from the Experimental Factor(s) panel 
  • Select Add Factor > to move 2. Cell Type and 3. shRNA treatment Gender to the ANOVA Factor(s) panel (Figure 1)

Numbered figure captions
SubtitleTextANOVA setup dialog. Experimental factors listed on the left can be added to the ANOVA model.
AnchorNameANOVA setup dialog

Image RemovedImage Added

  • Select Contrasts... 
  • Select Yes for Leave Data is already log transformed?  because M-values are based on logit transformation
  • Select Naive shPOU5F1 
  • Select set to No 
  • Leave Report comparisons as set to Difference

For methylation data, fold-change comparisons are not appropriate. Instead, comparisons should be reported as the difference between groups. 

  • Select 2. Cell Type from the Select Factor/Interaction drop-down menu
  • Select LCLs
  • Select Add Contrast Level > for the upper group Repeat to add Naive shNANOG to the upper group
  • Select B cells
  • Select Naive shCTRL Select Add Contrast Level > for the lower group
  • Select Add Contrast 
  • Repeat steps to create an additional contrast: Naive shPOU5F1 and Naive shNANOG vs. Primed shCTRL 
  • Select the Estimate box in the Other Statistics section of the Configure dialog (Figure 2)

By default, the fold-change value for each contrast will be calculated. Selecting Estimate will also include the difference in methylation levels between the groups at each CpG site in the output. These values will be needed later in the tutorial when we filter the differentially methylated loci. 

 

Numbered figure captions
SubtitleTextConfiguring ANOVA contrasts
AnchorNameConfiguring ANOVA Contrast

Image RemovedImage Added

  • Select Add CombinationSelect OK to close the Configuration dialog

...

  • Select Chromosome is in one column and the physical location is in another column for Choose the column configuration
  • Select Ilmn ID for Marker ID
  • Select CHR for Chromosome i
  • Select MAPINFO for Physical Position
  • Select Close

Select Close. This enable Partek enables Partek Genomics Suite to parse out probe annotation annotations from the manifest file. 

 

Numbered figure captions
SubtitleTextProcessing the annotation file. User needs to point to the columns of the annotation file that contain the probe identifier as well as the chromosome and coordinates of the probe.
AnchorNamespecify genomic position

The results will appear as ANOVA-1way 2way (ANOVAResults), a child spreadsheet of 1 (Differential Methylation Analysis) mvalue. Each row of the spreadsheet represents a single CpG locus (identified by Column ID). 

 

Numbered figure captions
SubtitleTextANOVA spreadsheet (truncated). Each row is a result of an ANOVA at a given CpG locus (identified by the Column ID column). The remaining columns contain annotation and statistical output
AnchorNameANOVA Spreadsheet

Image Removed

 

Going forward, analysis of differentially methylated loci typically includes removal of the probes on X and Y chromosomes (to avoid the problems with inactivation of one X chromosome). To annotate the ANOVA spreadsheet with the information required for filtering, right-click on the Gene Symbol column, select Insert Annotation, tick-mark the CHR filed (Figure 6) and push OK. A new column will be appended to the spreadsheet.

 

Numbered figure captions
SubtitleTextAdding annotation to the spreadsheet. The dialog provides access to the Illumina manifest file
AnchorNameadding annotation

Image Removed

 

To enable filtering right-click on the header of the CHR column > Properties and set the type to categorical (and OK). Now, activate the Interactive Filter tool (Image Removed) . If needed use the drop-down list to point to the CHR column. The column chart represents the number of appearances of each chromosome in the spreadsheet (i.e. the number of probes per chromosome). To remove the probes on the X and the Y chromosome left click on the two right-most columns (the pop up balloon will show you the chromosome label) and the columns will be grayed out (Figure 7).

Numbered figure captions
SubtitleTextUsing Interactive Filter tool to filter out probes by annotation. When pointed to a categorical column, the Interactive Filter tool summarises the content of the column by a column chart. Left-click to exclude a category (two columns on the right were excluded, so they are grayed out), right-click to include only
AnchorNameremoving probes by chromosome

Image Removed

 

After removal of the probes on the sex chromosomes, let us extract all the autosomal probes to a new spreadsheet. Right click on the ANOVA 1-way spreadsheet > Clone... Set the Name of new spreadsheet to AutosomalOnly and make it a child of the top-level spreadsheet (EPIC iDAT) (Figure 8). Push OK. The newly created autosomalonly spreadsheet will be a new starting point for all the downstream steps.

 

Numbered figure captions
SubtitleTextCreating a clone of a spreadsheet as a way of extracting features in a separate spreadsheet. The clone should have the same parent as the original (i.e. template) spreadsheet
AnchorNamecreating a clone of a spreadsheet

Image Removed

 

To come up with a list of differentially methylated loci, proceed to the workflow and select Create Marker List. That will open the List Manager functionality and you will be on the ANOVA Streamlined tab. Factors in the model are listed in the top section, contrasts specified in the model are in the middle, while the filter settings are at the bottom.

Select shNANOG vs Primed contrast to see the current filter configuration. By default, both fold-change and p-value with false discovery rate (FDR) correction are applied, with the number of CpG loci passing the filter given as # Pass. For this tutorial, set the fold-change to > 10 and < -10, and reduce the p-value with FDR down to 0.01 (Figure 9). Then push the Create button to save the list of significant loci under the default name (shNANOG vs. Primed). Repeat the procedure for the shPOU5F1 vs Primed, using the same cut offs.

 

Numbered figure captions
SubtitleTextCreating list of significant CpG loci using List Manager and the ANOVA Streamlined tab. You can specify either a factor or a contrast to work with. The filters are given at the bottom and can be configured as preferred. The Create button generates a new spreadsheet with significant CpG loci (the loci passing the filter)
AnchorNameANOVA streamlined tab

Image Removed

The List Manager can now be closed by selecting the Close button.

 

 

Image Added

For each contrast, a p-value, Difference, Difference (Description), Beta Difference, and Beta Difference (Description) are generated. The Difference column reports the difference in M-values between the two groups while the Beta Difference column reports the difference in beta values between the two groups. 

 

Page Turner
button-linkstrue

 

Additional assistance

 

Rate Macro
allowUsersfalse