Multivariable association discovery in population-scale meta-omics studies

2021 PLoS Computational Biology 2,052 citations

Abstract

It is challenging to associate features such as human health outcomes, diet, environmental conditions, or other metadata to microbial community measurements, due in part to their quantitative properties. Microbiome multi-omics are typically noisy, sparse (zero-inflated), high-dimensional, extremely non-normal, and often in the form of count or compositional measurements. Here we introduce an optimized combination of novel and established methodology to assess multivariable association of microbial community features with complex metadata in population-scale observational studies. Our approach, MaAsLin 2 (Microbiome Multivariable Associations with Linear Models), uses generalized linear and mixed models to accommodate a wide variety of modern epidemiological studies, including cross-sectional and longitudinal designs, as well as a variety of data types (e.g., counts and relative abundances) with or without covariates and repeated measurements. To construct this method, we conducted a large-scale evaluation of a broad range of scenarios under which straightforward identification of meta-omics associations can be challenging. These simulation studies reveal that MaAsLin 2’s linear model preserves statistical power in the presence of repeated measures and multiple covariates, while accounting for the nuances of meta-omics features and controlling false discovery. We also applied MaAsLin 2 to a microbial multi-omics dataset from the Integrative Human Microbiome (HMP2) project which, in addition to reproducing established results, revealed a unique, integrated landscape of inflammatory bowel diseases (IBD) across multiple time points and omics profiles.

Keywords

Computational biologyScale (ratio)Multivariable calculusPopulationBiologyComputer scienceBioinformaticsData scienceMedicineEngineeringGeographyCartography

Affiliated Institutions

Related Publications

Observer Reliability and Agreement

Abstract The terms observer reliability and observer agreement represent different concepts. Reliability coefficients express the ability to differentiate between subjects. Agre...

2005 Encyclopedia of Biostatistics 62 citations

Publication Info

Year
2021
Type
article
Volume
17
Issue
11
Pages
e1009442-e1009442
Citations
2052
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

2052
OpenAlex

Cite This

Himel Mallick, Ali Rahnavard, Lauren J. McIver et al. (2021). Multivariable association discovery in population-scale meta-omics studies. PLoS Computational Biology , 17 (11) , e1009442-e1009442. https://doi.org/10.1371/journal.pcbi.1009442

Identifiers

DOI
10.1371/journal.pcbi.1009442