PGS Documentation

Page tree
Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 2 Next »

What is Hierarchical Clustering?

Hierarchical clustering groups similar objects into clusters.  To start, each row and/or column is considered a cluster. The two most similar clusters are then combined and this process is iterated until all objects are in the same cluster. Hierarchical clustering displays the resulting hierarchy of the clusters in a tree called a dendogram. Hierarchical clustering is useful for exploratory analysis because it shows how samples group together based on similarity of features.

Hierarchical clustering is considered an unsupervised clustering method. Unsupervised clustering methods do not take the identity or attributes of samples into account when clustering. This means that any experimental variables such as treatment, phenotype, tissue, number of expected groups, etc. do not guide or bias cluster building. Alternatively, supervised clustering methods do consider experimental variables when building clusters.

Partek Genomics Suite offers two alternatives to Hierarchical clustering - K-Means clustering and Self-Organizing Map. For a more in-depth description of how Partek Genomics Suite performs these different forms of clustering analysis, please see Chapter 8 Hierarchical & Partitioning Clustering of the Partek Manual. The Partek Manual can be accessed through Partek Genomics Suite under Help > On-Line Help

Hierarchical Clustering in Action

To illustrate the capabilities and customization options of hierarchical clustering in Partek Genomics Suite, we will explore an example of hierarchical clustering drawn from the tutorial Gene Expression Analysis. The data set in this tutorial includes gene expression data from patients with or without Down syndrome. Using this data set, 23 highly differentially expressed genes between Down syndrome and normal patient tissues were identified. These 23 differentially regulated genes were then used to perform hierarchical clustering of the samples. Follow the steps outlined in Hierarchical Clustering and Adding Information to Gene Lists to perform hierarchical clustering and launch the Hierarchical Clustering tab (Figure 1).

 

Figure 1. Heatmap showing results of hierarchical clustering

The right-hand section of the Hierarchical Clustering tab is a heatmap showing relative expression of the genes in the list used to perform clustering. The heatmap can be configured using the properties panel on the left-hand side of the tab. By default, down-regulated genes will be shown in green, genes with no change in expression will be shown in black, and up-regulated genes will be shown in red. The dendograms on the left-hand side and top of the heatmap show clustering of samples as rows and features (probes/genes in this example) as columns. 

 

 

 

Additional Assistance

If you need additional assistance, please visit our support page to submit a help ticket or find phone numbers for regional support.

Your Rating: Results: 1 Star2 Star3 Star4 Star5 Star 0 rates

  • No labels