Pulmonary Metagenomic Sequencing Suggests Missed Infections in Immunocompromised ChildrenBACKGROUND: Despite improved diagnostics, pulmonary pathogens in immunocompromised children frequently evade detection, leading to significant mortality. Therefore, we aimed to develop a highly sensitive metagenomic next-generation sequencing (mNGS) assay capable of evaluating the pulmonary microbiome and identifying diverse pathogens in the lungs of immunocompromised children. METHODS: We collected 41 lower respiratory specimens from 34 immunocompromised children undergoing evaluation for pulmonary disease at 3 children's hospitals from 2014-2016. Samples underwent mechanical homogenization, parallel RNA/DNA extraction, and metagenomic sequencing. Sequencing reads were aligned to the National Center for Biotechnology Information nucleotide reference database to determine taxonomic identities. Statistical outliers were determined based on abundance within each sample and relative to other samples in the cohort. RESULTS: We identified a rich cross-domain pulmonary microbiome that contained bacteria, fungi, RNA viruses, and DNA viruses in each patient. Potentially pathogenic bacteria were ubiquitous among samples but could be distinguished as possible causes of disease by parsing for outlier organisms. Samples with bacterial outliers had significantly depressed alpha-diversity (median, 0.61; interquartile range [IQR], 0.33-0.72 vs median, 0.96; IQR, 0.94-0.96; P < .001). Potential pathogens were detected in half of samples previously negative by clinical diagnostics, demonstrating increased sensitivity for missed pulmonary pathogens (P < .001). CONCLUSIONS: An optimized mNGS assay for pulmonary microbes demonstrates significant inoculation of the lower airways of immunocompromised children with diverse bacteria, fungi, and viruses. Potential pathogens can be identified based on absolute and relative abundance. Ongoing investigation is needed to determine the pathogenic significance of outlier microbes in the lungs of immunocompromised children with pulmonary disease.
Unbiased Metagenomic Sequencing for Pediatric Meningitis in Bangladesh Reveals Neuroinvasive Chikungunya Virus Outbreak and Other Unrealized PathogensGlobally, there are an estimated 10.6 million cases of meningitis and 288,000 deaths every year, with the vast majority occurring in low- and middle-income countries. In addition, many survivors suffer from long-term neurological sequelae. Most laboratories assay only for common bacterial etiologies using culture and directed PCR, and the majority of meningitis cases lack microbiological diagnoses, impeding institution of evidence-based treatment and prevention strategies. We report here the results of a validation and application study of using unbiased metagenomic sequencing to determine etiologies of idiopathic (of unknown cause) cases. This included CSF from patients with known neurologic infections, with idiopathic meningitis, and without infection admitted in the largest children's hospital of Bangladesh and environmental samples. Using mNGS and machine learning, we identified and confirmed an etiology (viral or bacterial) in 40% of idiopathic cases. We detected three instances of Chikungunya virus (CHIKV) that were >99% identical to each other and to a strain previously recognized to cause systemic illness only in 2017. CHIKV qPCR of all remaining stored 472 CSF samples from children who presented with idiopathic meningitis in 2017 at the same hospital uncovered an unrecognized CHIKV meningitis outbreak. CSF mNGS can complement conventional diagnostic methods to identify etiologies of meningitis, and the improved patient- and population-level data can inform better policy decisions.
Towards precision quantification of contamination in metagenomic sequencing experimentsMetagenomic next-generation sequencing (mNGS) experiments involving small amounts of nucleic acid input are highly susceptible to erroneous conclusions resulting from unintentional sequencing of occult contaminants, especially those derived from molecular biology reagents. Recent work suggests that, for any given microbe detected by mNGS, an inverse linear relationship between microbial sequencing reads and sample mass implicates that microbe as a contaminant. By associating sequencing read output with the mass of a spike-in control, we demonstrate that contaminant nucleic acid can be quantified in order to identify the mass contributions of each constituent. In an experiment using a high-resolution (n = 96) dilution series of HeLa RNA spanning 3-logs of RNA mass input, we identified a complex set of contaminants totaling 9.1 ± 2.0 attograms. Given the competition between contamination and the true microbiome in ultra-low biomass samples such as respiratory fluid, quantification of the contamination within a given batch of biological samples can be used to determine a minimum mass input below which sequencing results may be distorted. Rather than completely censoring contaminant taxa from downstream analyses, we propose here a statistical approach that allows separation of the true microbial components from the actual contribution due to contamination. We demonstrate this approach using a batch of n = 97 human serum samples and note that despite E. coli contamination throughout the dataset, we are able to identify a patient sample with significantly more E. coli than expected from contamination alone. Importantly, our method assumes no prior understanding of possible contaminants, does not rely on any prior collection of environmental or reagent-only sequencing samples, and does not censor potentially clinically relevant taxa, thus making it a generalized approach to any kind of metagenomic sequencing, for any purpose, clinical or otherwise.
Miniaturization and optimization of 384-well compatible RNA sequencing library preparationPreparation of high-quality sequencing libraries is a costly and time-consuming component of metagenomic next generation sequencing (mNGS). While the overall cost of sequencing has dropped significantly over recent years, the reagents needed to prepare sequencing samples are likely to become the dominant expense in the process. Furthermore, libraries prepared by hand are subject to human variability and needless waste due to limitations of manual pipetting volumes. Reduction of reaction volumes, combined with sub-microliter automated dispensing of reagents without consumable pipette tips, has the potential to provide significant advantages. Here, we describe the integration of several instruments, including the Labcyte Echo 525 acoustic liquid handler and the iSeq and NovaSeq Illumina sequencing platforms, to miniaturize and automate mNGS library preparation, significantly reducing the cost and the time required to prepare samples. Through the use of External RNA Controls Consortium (ERCC) spike-in RNAs, we demonstrated the fidelity of the miniaturized preparation to be equivalent to full volume reactions. Furthermore, detection of viral and microbial species from cell culture and patient samples was also maintained in the miniaturized libraries. For 384-well mNGS library preparations, we achieved cost savings of over 80% in materials and reagents alone, and reduced preparation time by 90% compared to manual approaches, without compromising quality or representation within the library.
The pulmonary metatranscriptome prior to pediatric HCT identifies post-HCT lung injuryLung injury after pediatric allogeneic hematopoietic cell transplantation (HCT) is a common and disastrous complication that threatens long-term survival. To develop strategies to prevent lung injury, novel tools are needed to comprehensively assess lung health in HCT candidates. Therefore, this study analyzed biospecimens from 181 pediatric HCT candidates who underwent routine pre-HCT bronchoalveolar lavage (BAL) at the University Medical Center Utrecht between 2005 and 2016. BAL fluid underwent metatranscriptomic sequencing of microbial and human RNA, and unsupervised clustering and generalized linear models were used to associate microbiome gene expression data with the development of post-HCT lung injury. Microbe-gene correlations were validated using a geographically distinct cohort of 18 pediatric HCT candidates. The cumulative incidence of post-HCT lung injury varied significantly according to 4 pre-HCT pulmonary metatranscriptome clusters, with the highest incidence observed in children with pre-HCT viral enrichment and innate immune activation, as well as in children with profound microbial depletion and concomitant natural killer/T-cell activation (P < .001). In contrast, children with pre-HCT pulmonary metatranscriptomes containing diverse oropharyngeal taxa and lacking inflammation rarely developed post-HCT lung injury. In addition, activation of epithelial-epidermal differentiation, mucus production, and cellular adhesion were associated with fatal post-HCT lung injury. In a separate validation cohort, associations among pulmonary respiratory viral load, oropharyngeal taxa, and pulmonary gene expression were recapitulated; the association with post-HCT lung injury needs to be validated in an independent cohort. This analysis suggests that assessment of the pre-HCT BAL fluid may identify high-risk pediatric HCT candidates who may benefit from pathobiology-targeted interventions.