MicrobiotaProcess: A comprehensive R package for deep mining microbiome

Shuangbin Xu(Zhujiang Hospital), Li Zhan(Southern Medical University), Wenli Tang(Southern Medical University), Qianwen Wang(Southern Medical University), Zehan Dai(Southern Medical University), Lang Zhou(Zhujiang Hospital), Tingze Feng(Southern Medical University), Meijun Chen(Southern Medical University), Tianzhi Wu(Southern Medical University), Erqiang Hu(Southern Medical University), Guangchuang Yu(Zhujiang Hospital)
The Innovation
February 2, 2023
Cited by 274Open Access
Full Text

Abstract

•MicrobiotaProcess is a bioinformatics tool for microbiome profiling.•MicrobiotaProcess defines an MPSE structure to better integrate both primary and intermediate microbiome datasets.•MicrobiotaProcess provides a set of functions under a unified tidy framework, which helps users to explore related datasets more efficiently.•MicrobiotaProcess improves the integration and exploration of downstream data analysis.•MicrobiotaProcess offers many visual methods to quickly render clear and comprehensive visualizations that reveal meaningful insights. The data output from microbiome research is growing at an accelerating rate, yet mining the data quickly and efficiently remains difficult. There is still a lack of an effective data structure to represent and manage data, as well as flexible and composable analysis methods. In response to these two issues, we designed and developed the MicrobiotaProcess package. It provides a comprehensive data structure, MPSE, to better integrate the primary and intermediate data, which improves the integration and exploration of the downstream data. Around this data structure, the downstream analysis tasks are decomposed and a set of functions are designed under a tidy framework. These functions independently perform simple tasks and can be combined to perform complex tasks. This gives users the ability to explore data, conduct personalized analyses, and develop analysis workflows. Moreover, MicrobiotaProcess can interoperate with other packages in the R community, which further expands its analytical capabilities. This article demonstrates the MicrobiotaProcess for analyzing microbiome data as well as other ecological data through several examples. It connects upstream data, provides flexible downstream analysis components, and provides visualization methods to assist in presenting and interpreting results. The data output from microbiome research is growing at an accelerating rate, yet mining the data quickly and efficiently remains difficult. There is still a lack of an effective data structure to represent and manage data, as well as flexible and composable analysis methods. In response to these two issues, we designed and developed the MicrobiotaProcess package. It provides a comprehensive data structure, MPSE, to better integrate the primary and intermediate data, which improves the integration and exploration of the downstream data. Around this data structure, the downstream analysis tasks are decomposed and a set of functions are designed under a tidy framework. These functions independently perform simple tasks and can be combined to perform complex tasks. This gives users the ability to explore data, conduct personalized analyses, and develop analysis workflows. Moreover, MicrobiotaProcess can interoperate with other packages in the R community, which further expands its analytical capabilities. This article demonstrates the MicrobiotaProcess for analyzing microbiome data as well as other ecological data through several examples. It connects upstream data, provides flexible downstream analysis components, and provides visualization methods to assist in presenting and interpreting results.


Related Papers

No related papers found

Powered by citation graph analysis