Heat maps with clustering
Metabolites and lipids multivariate statistical analysis in R
Last updated
Metabolites and lipids multivariate statistical analysis in R
Last updated
Heat maps with clustering are one of the most effective tools for visualizing alterations in OMICs data sets. Several lipids or metabolites can be depicted in one visualization across all measured samples. In heat maps, as you have just seen in the previous subchapter, normalized concentrations are assigned to continuously scaled fill colors. Upregulations in a target biological group are frequently presented with light, warm colors like red, orange, and yellow, while downregulations are shown with dark and cold colors like blue, green, and violet. Clustering features (lipids, metabolites) or samples can also give clues to interesting data structures and relationships.
Heat maps are one of the most commonly used visualization techniques for -omics data, as they can convey a substantial amount of information in a single, effective chart, whether it is lipid/metabolized normalized concentrations or clinical parameters across experimental groups (or clusters of samples). Check out the selected examples:
J. Wu et al. Lipidomic signatures align with inflammatory patterns and outcomes in critical illness. DOI: - e.g., Fig. 2c - e, Fig. 4 (the ComplexHeatmap library presented below was used to generate heat maps).
W. Wang et al. Metabolomics facilitates differential diagnosis in common inherited retinal degenerations by exploring their profiles of serum metabolites. DOI: - Fig. 2.
J. Fang et al. Integrated multi-omics analysis unravels the floral scent characteristics and regulation in “Hutou” multi-petal jasmine. DOI: - e.g., Fig. 2b (the ComplexHeatmap again in use!).
D. Wolrab et al. Lipidomic profiling of human serum enables detection of pancreatic cancer. DOI: - Fig. 5e, Fig. 6e.
S. Lam et al. A multi-omics investigation of the composition and function of extracellular vesicles along the temporal trajectory of COVID-19. DOI: - e.g., Fig. 2 (left part of the panel), Fig. 3a & b, etc.
R. Lerner et al. Four-dimensional trapped ion mobility spectrometry lipidomics for high throughput clinical profiling of human blood samples. DOI: - Fig. 5 (the authors of the study published in Nature Communications use heat maps for the comparison and cross-validation of quantified lipid concentrations).
J. Idkowiak et al. Robust and high-throughput lipidomic quantitation of human blood samples using flow injection analysis with tandem mass spectrometry for clinical use. DOI: - e.g., Fig. 3, Fig. 4 B - D, Fig. 7 (the authors use heat maps for the presentation of method validation results (Fig. 3 & 4), and for comparing and presenting trends in the case of Fig. 7).
M. Lange et al. AdipoAtlas: A reference lipidome for human white adipose tissue. DOI: - Fig. 4A & E.
V. de Laat et al. Intrinsic temperature increase drives lipid metabolism towards ferroptosis evasion and chemotherapy resistance in pancreatic cancer. DOI: - Fig. 2d & Fig. 4g
E. Rysman et al. De novo Lipogenesis Protects Cancer Cells from Free Radicals and Chemotherapeutics by Promoting Membrane Lipid Saturation. DOI: - Fig. 1d - e, Fig. 2 (right part of the panel), Fig. 3b.
To prepare an exemplary heat map for the PDAC data set, we will use the ComplexHeatmap library from Bioconductor. We will use the 20 most significant lipids from the Kruskal-Wallis test to create our heat map.
We obtain in the R console:
Hence, we can set the scale of fill colors for our heat map between -3 and 3:
We need to transform the data into a form expected by the function, which will produce our heat map. We need to create a matrix with column names. Here is the code:
Finally, we create our heat map:
To draw our heat map, we run:
And we obtain this beautiful chart:
We see interesting clustering outcomes for features - lipids are mostly grouped into classes, e.g., SM species, LPC species, PC with PC O-/PC P- species. We can finally apply clustering to columns too (samples clustering):
The updated heat map:
After applying the clustering to samples, we observe a separation of healthy volunteers (N) from patients with PDAC (T). For almost all lipids, except for Cer 36:1;O2, a downregulation of their concentrations was found in the serum of patients with pancreatic cancer. Patients with pancreatitis (PAN) do not form a separate cluster and are rather spread between healthy individuals and patients with cancer. Based on the heat map above, we would rather conclude that in their serum lipid profile, we do not observe a characteristic for pancreatic cancer patients' downregulations, e.g. in the long chain SM and Cer profiles, and then in LPC, PC, PC O-/PC P- profiles. As you see, we can make lots of observations based on elegant one image.
The ComplexHeatmap package offers many more amazing designs, which are presented in the detailed vignette of the package. We encourage you to read it carefully as well as the articles published by the authors of the ComplexHeatmap package: