Statistical Methods for Detecting Differentially Abundant Features in Clinical Metagenomic Samples

TitleStatistical Methods for Detecting Differentially Abundant Features in Clinical Metagenomic Samples
Publication TypeJournal Articles
Year of Publication2009
AuthorsWhite JRobert, Nagarajan N, Pop M.
JournalPLoS Comput BiologyPLoS Comput BiolPLoS Comput BiologyPLoS Comput Biol
Volume5
Type of Article10.1371/journal.pcbi.1000352
Abstract

The emerging field of metagenomics aims to understand the structure and function of microbial communities solely through DNA analysis. Current metagenomics studies comparing communities resemble large-scale clinical trials with multiple subjects from two general populations (e.g. sick and healthy). To improve analyses of this type of experimental data, we developed a statistical methodology for detecting differentially abundant features between microbial communities, that is, features that are enriched or depleted in one population versus another. We show our methods are applicable to various metagenomic data ranging from taxonomic information to functional annotations. We also provide an assessment of taxonomic differences in gut microbiota between lean and obese humans, as well as differences between the functional capacities of mature and infant gut microbiomes, and those of microbial and viral metagenomes. Our methods are the first to statistically address differential abundance in comparative metagenomics studies with multiple subjects, and we hope will give researchers a more complete picture of how exactly two environments differ.