Hierarchical Clustering (HC)
Metabolites and lipids multivariate statistical analysis in R
Hierarchical clustering is used in lipidomics and metabolomics to group samples with similar lipid or metabolite profiles. These groups of samples are known as clusters. Usually, we expect that samples from two severely different conditions (e.g., healthy and cancer patients) will separate from each other, but samples belonging to one biological group will cluster together. However, the hierarchical clustering applied here is an unsupervised method, different from dimensionality reduction. It can be used to find the data structures if no other clues are available. Except for samples, features like lipids or metabolites can also be clustered, suggesting relationships between molecules and potential biological roles. The clustering can be presented, among others, through dendrograms and heat maps. Here, we will introduce three excellent libraries for hierarchical clustering visualization:
1) ggtree - for preparing dendrograms,
2) ComplexHeatmap - for preparing beautiful, publication-ready heat maps with clustering,
3) InteractiveComplexHeatmap - for turning the ComplexHeatmap outputs into interactive plots.
First, we install all three libraries:
Let's begin!
Last updated