Advanced computational algorithms for microbial community analysis using massive 16S rRNA sequence data

Yijun Sun, Yunpeng Cai, Volker Mai, William Farmerie, Fahong Yu, Jian Li, Steve Goodison

Research output: Contribution to journalArticlepeer-review

39 Scopus citations


With the aid of next-generation sequencing technology, researchers can now obtain millions of microbial signature sequences for diverse applications ranging from human epidemiological studies to global ocean surveys. The development of advanced computational strategies to maximally extract pertinent information from massive nucleotide data has become a major focus of the bioinformatics community. Here, we describe a novel analytical strategy including discriminant and topology analyses that enables researchers to deeply investigate the hidden world of microbial communities, far beyond basic microbial diversity estimation. We demonstrate the utility of our approach through a computational study performed on a previously published massive human gut 16S rRNA data set. The application of discriminant and topology analyses enabled us to derive quantitative disease-associated microbial signatures and describe microbial community structure in far more detail than previously achievable. Our approach provides rigorous statistical tools for sequence-based studies aimed at elucidating associations between known or unknown organisms and a variety of physiological or environmental conditions.

Original languageEnglish (US)
Pages (from-to)e205
JournalNucleic acids research
Issue number22
StatePublished - Dec 2010

ASJC Scopus subject areas

  • Genetics


Dive into the research topics of 'Advanced computational algorithms for microbial community analysis using massive 16S rRNA sequence data'. Together they form a unique fingerprint.

Cite this