Recent years have seen incredible growth in both the scale and specificity of projects analyzing the microbial organisms living in and on the human body (the human microbiome). Such studies typically require subjects to report to clinics for sample collection, a complicated practice that is impractical for large studies. To address these issues, we developed a protocol that allows subjects to collect microbiome samples at home and ship them to laboratories for multiple different types of molecular analysis. Measurements of microbial species, gene, and gene transcript composition within self-collected samples were consistent across sampling methods. In addition, our subsequent analysis of these samples revealed interesting similarities and differences between the measured functional potential and functional activity of the human microbiome.

Abstract

Although the composition of the human microbiome is now well-studied, the microbiota’s >8 million genes and their regulation remain largely uncharacterized. This knowledge gap is in part because of the difficulty of acquiring large numbers of samples amenable to functional studies of the microbiota. We conducted what is, to our knowledge, one of the first human microbiome studies in a well-phenotyped prospective cohort incorporating taxonomic, metagenomic, and metatranscriptomic profiling at multiple body sites using self-collected samples. Stool and saliva were provided by eight healthy subjects, with the former preserved by three different methods (freezing, ethanol, and RNAlater) to validate self-collection. Within-subject microbial species, gene, and transcript abundances were highly concordant across sampling methods, with only a small fraction of transcripts (<5%) displaying between-method variation. Next, we investigated relationships between the oral and gut microbial communities, identifying a subset of abundant oral microbes that routinely survive transit to the gut, but with minimal transcriptional activity there. Finally, systematic comparison of the gut metagenome and metatranscriptome revealed that a substantial fraction (41%) of microbial transcripts were not differentially regulated relative to their genomic abundances. Of the remainder, consistently underexpressed pathways included sporulation and amino acid biosynthesis, whereas up-regulated pathways included ribosome biogenesis and methanogenesis. Across subjects, metatranscriptional profiles were significantly more individualized than DNA-level functional profiles, but less variable than microbial composition, indicative of subject-specific whole-community regulation. The results thus detail relationships between community genomic potential and gene expression in the gut, and establish the feasibility of metatranscriptomic investigations in subject-collected and shipped samples.