PGS Documentation


The approach described in previous sections relies on ANOVA to detect differentially methylated CpG sites and takes individual sites as a starting point for interpretation. Since ANOVA compares M values at each site independently, this strategy is robust to type I/type II probe bias. 

An alternative could be to first summarize all the probes belonging to a CpG island region (i.e. island, N-shore, N-shelf, S-shore, S-shelf) and then use ANOVA to compare regions across the groups. Since the summarization will include both type I and type II probes, you may want to split the analysis in two branches and analyze type I and type II probes independently.  

 



To do this, we need to annotate each probe as type I or type II.

  • Select the mvalue spreadsheet
  • Select Transform from the main toolbar
  • Select Create Transposed Spreadsheet... from the Transform drop-down menu (Figure 1)

Figure 1: Creating a transposed spreadsheet

  • Select Sample ID for Column: and numeric for Data Type:
  • Select OK

A new temporary spreadsheet will be created with a row for each probe and columns for each sample. 

  • Right-click on column 1. ID to bring up the pop-up menu
  • Select Insert Annotation 
  • Select Add as categorical 
  • Select Infinium_Design_Type and UCSC_CpG_Islands_Name from the Column Configuration options (Figure 2)

Figure 2: Adding Infinium design type and CpG island annotations

  • Select OK to add the Infinium design type and UCSC CpG island name as categorical columns on the spreadsheet
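For readers who want to check the logic outside PGS, the transpose and annotation steps above can be sketched in plain Python. This is only an illustration under stated assumptions: the probe IDs, sample names, M values, and annotation entries are made-up toy data, and the helper names transpose and annotate are hypothetical, not part of PGS.

```python
# Toy M values: one row per sample, one column per probe (hypothetical IDs).
m_values = {
    "sample_A": {"cg001": 1.2, "cg002": -0.4, "cg003": 2.1},
    "sample_B": {"cg001": 0.8, "cg002": -0.9, "cg003": 1.7},
}

# Hypothetical annotation: probe ID -> (design type, CpG island name).
annotation = {
    "cg001": ("I", "chr1:100-200"),
    "cg002": ("II", "chr1:100-200"),
    "cg003": ("II", ""),  # empty name: probe outside any UCSC CpG island
}


def transpose(table):
    """Turn {row: {column: value}} into {column: {row: value}}."""
    flipped = {}
    for row_id, row in table.items():
        for col_id, value in row.items():
            flipped.setdefault(col_id, {})[row_id] = value
    return flipped


def annotate(probe_table, probe_annotation):
    """Attach design type and island name to each probe row."""
    rows = []
    for probe_id, per_sample in probe_table.items():
        design_type, island = probe_annotation[probe_id]
        rows.append({
            "ID": probe_id,
            "Infinium_Design_Type": design_type,
            "UCSC_CpG_Islands_Name": island,
            "values": dict(per_sample),
        })
    return rows


# One row per probe, with the two categorical annotation columns added.
probe_rows = annotate(transpose(m_values), annotation)
```

After this step each entry of probe_rows plays the role of one row of the annotated temporary spreadsheet.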

Now, we can use the interactive filter to create separate spreadsheets for type I and type II probes.

  • Select the interactive filter icon to launch the interactive filter
  • Select 2. Infinium_Design_Type from the drop-down menu if not selected by default
  • Left-click the type I column to exclude it 
  • Right-click the temporary spreadsheet in the spreadsheet tree to bring up the pop-up dialog
  • Select Clone... (Figure 3)

Figure 3: Creating a probe list with only Infinium type II probes

  • Name the new spreadsheet female_only_typeII_probes
  • Select OK
  • Save the new spreadsheet; we chose the file name female_only_typeII_probes
  • Repeat the process, this time excluding the type II column, to create a spreadsheet for type I probes
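The effect of the interactive filter can be pictured as splitting the probe rows by their design type. The sketch below assumes toy probe rows and a hypothetical helper split_by_design_type; it is not how PGS implements the filter, only an equivalent operation on small example data.

```python
# Toy annotated probe rows (hypothetical IDs and island names).
probe_rows = [
    {"ID": "cg001", "Infinium_Design_Type": "I",
     "UCSC_CpG_Islands_Name": "chr1:100-200"},
    {"ID": "cg002", "Infinium_Design_Type": "II",
     "UCSC_CpG_Islands_Name": "chr1:100-200"},
    {"ID": "cg003", "Infinium_Design_Type": "II",
     "UCSC_CpG_Islands_Name": ""},
]


def split_by_design_type(rows):
    """Group probe rows by the value of Infinium_Design_Type."""
    groups = {}
    for row in rows:
        groups.setdefault(row["Infinium_Design_Type"], []).append(row)
    return groups


by_type = split_by_design_type(probe_rows)
type_ii_probes = by_type.get("II", [])  # analogous to excluding type I
type_i_probes = by_type.get("I", [])    # analogous to excluding type II
```

Keeping the two lists separate mirrors the two branches of the analysis: each branch is summarized and tested on its own.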

The temporary spreadsheet is no longer needed so we can close it.

  • Close the temporary spreadsheet by selecting it in the spreadsheet tree and selecting the close icon

We can use these spreadsheets to generate lists of M values at CpG island regions.

  • Select spreadsheet female_only_typeII_probes 
  • Select Stat from the main toolbar
  • Select Column Statistics... under Descriptive (Figure 4)

Figure 4: Selecting column statistics

  • Add Mean to the Selected Measure(s) panel 
  • Select Group By and set it to 3. UCSC_CpG_Islands_Name (Figure 5)

Figure 5: Configuring column statistics

  • Select OK 

The new temporary spreadsheet has one CpG island region per row (Figure 6), samples on columns, and the values in the cells represent the mean of M values of all the CpG probes in the region.

Figure 6: New spreadsheet with average M values for probes at each CpG island; probes not at CpG islands are collected into the first row "- Mean"

Note the first row, with label "- Mean". It corresponds to all the probes that map outside of UCSC CpG islands. As it is not needed for the downstream analysis, we will remove it.

  • Right-click on the row header for - Mean
  • Select Delete to remove the row
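The grouped column statistics and the removal of the "- Mean" row amount to a per-region, per-sample mean followed by dropping the group with no island name. A minimal sketch, assuming toy type II probe rows in which an empty island name marks probes outside UCSC CpG islands; the helper mean_by_island is hypothetical and stands in for the Column Statistics dialog.

```python
from statistics import mean

# Toy type II probe rows: island name plus one M value per sample.
type_ii_probes = [
    {"UCSC_CpG_Islands_Name": "chr1:100-200",
     "values": {"sample_A": 1.0, "sample_B": 2.0}},
    {"UCSC_CpG_Islands_Name": "chr1:100-200",
     "values": {"sample_A": 3.0, "sample_B": 4.0}},
    {"UCSC_CpG_Islands_Name": "",  # outside any CpG island
     "values": {"sample_A": -1.0, "sample_B": -2.0}},
]


def mean_by_island(rows):
    """Average the M values of all probes in each island, per sample."""
    grouped = {}
    for row in rows:
        island = row["UCSC_CpG_Islands_Name"]
        for sample, value in row["values"].items():
            grouped.setdefault(island, {}).setdefault(sample, []).append(value)
    return {
        island: {sample: mean(vals) for sample, vals in per_sample.items()}
        for island, per_sample in grouped.items()
    }


island_means = mean_by_island(type_ii_probes)
# Drop the group of probes that map outside CpG islands (the "- Mean" row).
island_means.pop("", None)
```

The result has one entry per CpG island region and one mean M value per sample, matching the layout of the temporary spreadsheet.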

The final step is to transpose the data back to its original orientation.

  • Select Transform from the main toolbar
  • Select Create Transposed Spreadsheet... from the Transform drop-down menu
  • Select 2. Level for Column: and numeric for Data Type:
  • Select OK

The layout of the new transposed spreadsheet is as follows: one sample per row with CpG island regions on columns; cell entries correspond to mean methylation status of the region (Figure 7). The column with a blank value for the column header is the average of all probes not associated with CpG island regions. You can delete this column if you like.

 

Figure 7: Spreadsheet with average M values of probes in each CpG island for each sample

  • Right-click the transposed spreadsheet, 2_transpose
  • Select Save as... from the pop-up menu
  • Name it mvalues_typeII_probes_CpG_islands 
  • Close the source temporary spreadsheet by selecting it in the spreadsheet tree and selecting the close icon
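Transposing back and saving can be sketched the same way. The example below assumes toy region means and writes tab-delimited text to an in-memory buffer rather than to a PGS spreadsheet file; the helpers to_sample_rows and write_tsv and the "Sample ID" header are illustrative choices, not the PGS file format.

```python
import csv
import io

# Toy region-level means: one row per region, one column per sample.
island_means = {
    "chr1:100-200": {"sample_A": 2.0, "sample_B": 3.0},
    "chr2:500-900": {"sample_A": -0.5, "sample_B": 0.25},
}


def to_sample_rows(region_table):
    """Transpose so that each sample is a row and each region a column."""
    samples = {}
    for region, per_sample in region_table.items():
        for sample, value in per_sample.items():
            samples.setdefault(sample, {})[region] = value
    return samples


def write_tsv(sample_table, handle):
    """Write one header line plus one tab-delimited line per sample."""
    regions = sorted({r for row in sample_table.values() for r in row})
    writer = csv.writer(handle, delimiter="\t", lineterminator="\n")
    writer.writerow(["Sample ID"] + regions)
    for sample, row in sample_table.items():
        writer.writerow([sample] + [row[r] for r in regions])


sample_rows = to_sample_rows(island_means)
buffer = io.StringIO()
write_tsv(sample_rows, buffer)
tsv_text = buffer.getvalue()
```

Passing an open file handle instead of the buffer would write the same table to disk.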

The mvalues_typeII_probes_CpG_islands spreadsheet can be used as a starting point for ANOVA and other analyses. You can also repeat the steps above to create an equivalent spreadsheet for type I probes.
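To illustrate what a region-level comparison looks like, here is a minimal sketch of a one-way ANOVA F statistic computed for each CpG island region. It makes several assumptions: a single grouping factor, toy values, hypothetical sample and group names, and hypothetical helpers one_way_anova_f and f_by_region. The ANOVA tool in PGS supports richer models, so treat this only as a picture of the idea, not as a reproduction of its output.

```python
from statistics import mean

# Toy region-level M values: one row per sample, one column per region.
region_means = {
    "s1": {"chr1:100-200": 1.0, "chr2:500-900": 0.2},
    "s2": {"chr1:100-200": 2.0, "chr2:500-900": 0.4},
    "s3": {"chr1:100-200": 3.0, "chr2:500-900": 0.3},
    "s4": {"chr1:100-200": 4.0, "chr2:500-900": 0.1},
    "s5": {"chr1:100-200": 5.0, "chr2:500-900": 0.5},
    "s6": {"chr1:100-200": 6.0, "chr2:500-900": 0.3},
}
# Hypothetical group labels for the samples.
sample_group = {"s1": "control", "s2": "control", "s3": "control",
                "s4": "case", "s5": "case", "s6": "case"}


def one_way_anova_f(groups):
    """F = (SS_between / (k - 1)) / (SS_within / (n - k))."""
    all_values = [v for g in groups for v in g]
    grand_mean = mean(all_values)
    k, n = len(groups), len(all_values)
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    ss_within = sum((v - mean(g)) ** 2 for g in groups for v in g)
    return (ss_between / (k - 1)) / (ss_within / (n - k))


def f_by_region(table, groups):
    """Compute the F statistic independently for every region column."""
    regions = sorted({r for row in table.values() for r in row})
    result = {}
    for region in regions:
        per_group = {}
        for sample, row in table.items():
            per_group.setdefault(groups[sample], []).append(row[region])
        result[region] = one_way_anova_f(list(per_group.values()))
    return result


f_stats = f_by_region(region_means, sample_group)
```

In this toy data the first region separates the two groups clearly and gets a large F, while the second region has equal group means and an F close to zero.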

 
