Analysis of compositions of microbiomes with bias correctionLin Huang, Shyamal Peddada|Nature Communications|2020 Differential abundance (DA) analysis of microbiome data continues to be a challenging problem due to the complexity of the data. In this article we define the notion of "sampling fraction" and demonstrate a major hurdle in performing DA analysis of microbiome data is the bias introduced by differences in the sampling fractions across samples. We introduce a methodology called Analysis of Compositions of Microbiomes with Bias Correction (ANCOM-BC), which estimates the unknown sampling fractions and corrects the bias induced by their differences among samples. The absolute abundance data are modeled using a linear regression framework. This formulation makes a fundamental advancement in the field because, unlike the existing methods, it (a) provides statistically valid test with appropriate p-values, (b) provides confidence intervals for differential abundance of each taxon, (c) controls the False Discovery Rate (FDR), (d) maintains adequate power, and (e) is computationally simple to implement.
Multigroup analysis of compositions of microbiomes with covariate adjustments and repeated measuresMicrobiome differential abundance analysis methods for two groups are well-established in the literature. However, many microbiome studies involve more than two groups, sometimes even ordered groups such as stages of a disease, and require different types of comparison. Standard pairwise comparisons are inefficient in terms of power and false discovery rates. In this Article, we propose a general framework, ANCOM-BC2, for performing a wide range of multigroup analyses with covariate adjustments and repeated measures. We illustrate our methodology through two real datasets. The first example explores the effects of aridity on the soil microbiome, and the second example investigates the effects of surgical interventions on the microbiome of patients with inflammatory bowel disease.
Analysis of microbial compositions: a review of normalization and differential abundance analysisLin Huang, Shyamal Peddada|npj Biofilms and Microbiomes|2020 Increasingly, researchers are discovering associations between microbiome and a wide range of human diseases such as obesity, inflammatory bowel diseases, HIV, and so on. The first step towards microbiome wide association studies is the characterization of the composition of human microbiome under different conditions. Determination of differentially abundant microbes between two or more environments, known as differential abundance (DA) analysis, is a challenging and an important problem that has received considerable interest during the past decade. It is well documented in the literature that the observed microbiome data (OTU/SV table) are relative abundances with an excess of zeros. Since relative abundances sum to a constant, these data are necessarily compositional. In this article we review some recent methods for DA analysis and describe their strengths and weaknesses.
Environmental toxicants in breast milk of Norwegian mothers and gut bacteria composition and metabolites in their infants at 1 monthBACKGROUND: Early disruption of the microbial community may influence life-long health. Environmental toxicants can contaminate breast milk and the developing infant gut microbiome is directly exposed. We investigated whether environmental toxicants in breastmilk affect the composition and function of the infant gut microbiome at 1 month. We measured environmental toxicants in breastmilk, fecal short-chain fatty acids (SCFAs), and gut microbial composition from 16S rRNA gene amplicon sequencing using samples from 267 mother-child pairs in the Norwegian Microbiota Cohort (NoMIC). We tested 28 chemical exposures: polychlorinated biphenyls (PCBs), polybrominated flame retardants (PBDEs), per- and polyfluoroalkyl substances (PFASs), and organochlorine pesticides. We assessed chemical exposure and alpha diversity/SCFAs using elastic net regression modeling and generalized linear models, adjusting for confounders, and variation in beta diversity (UniFrac), taxa abundance (ANCOM), and predicted metagenomes (PiCRUSt) in low, medium, and high exposed groups. RESULTS: PBDE-28 and the surfactant perfluorooctanesulfonic acid (PFOS) were associated with less microbiome diversity. Some sub-OTUs of Lactobacillus, an important genus in early life, were lower in abundance in samples from infants with relative "high" (> 80th percentile) vs. "low" (< 20th percentile) toxicant exposure in this cohort. Moreover, breast milk toxicants were associated with microbiome functionality, explaining up to 34% of variance in acetic and propionic SCFAs, essential signaling molecules. Per one standard deviation of exposure, PBDE-28 was associated with less propionic acid (- 24% [95% CI - 35% to - 14%] relative to the mean), and PCB-209 with less acetic acid (- 15% [95% CI - 29% to - 0.4%]). Conversely, PFOA and dioxin-like PCB-167 were associated with 61% (95% CI 35% to 87%) and 22% (95% CI 8% to 35%) more propionic and acetic acid, respectively. CONCLUSIONS: Environmental toxicant exposure may influence infant gut microbial function during a critical developmental window. Future studies are needed to replicate these novel findings and investigate whether this has any impact on child health.
Linear and nonlinear correlation estimators unveil undescribed taxa interactions in microbiome dataIt is well-known that human gut microbiota form an ecosystem where microbes interact with each other. Due to complex underlying interactions, some microbes may correlate nonlinearly. There are no measures in the microbiome literature we know of that quantify these nonlinear relationships. Here, we develop a methodology called Sparse Estimation of Correlations among Microbiomes (SECOM) for estimating linear and nonlinear relationships among microbes while maintaining the sparsity. SECOM accounts for both sample and taxon-specific biases in its model. Its statistical properties are evaluated analytically and by comprehensive simulation studies. We test SECOM in two real data sets, namely, forehead and palm microbiome data from college-age adults, and Norwegian infant gut microbiome data. Given that forehead and palm are related to skin, as desired, SECOM discovers each genus to be highly correlated between the two sites, but that is not the case with any of the competing methods. It is well-known that infant gut evolves as the child grows. Using SECOM, for the first time in the literature, we characterize temporal changes in correlations among bacterial families during a baby's first year after birth.