Abstract Long chain polyunsaturated fatty acids (LC-PUFAs) are essential for brain structure, development, and function, and adequate dietary quantities of LC-PUFAs are thought to have been necessary for both brain expansion and the increase in brain complexity observed during modern human evolution. Previous studies conducted in largely European populations suggest that humans have limited capacity to synthesize brain LC-PUFAs such as docosahexaenoic acid (DHA) from plant-based medium chain (MC) PUFAs due to limited desaturase activity. Population-based differences in LC-PUFA levels and their product-to-substrate ratios can, in part, be explained by polymorphisms in the fatty acid desaturase (FADS) gene cluster, which have been associated with increased conversion of MC-PUFAs to LC-PUFAs. Here, we show evidence that these high efficiency converter alleles in the FADS gene cluster were likely driven to near fixation in African populations by positive selection ∼85 kya. We hypothesize that selection at FADS variants, which increase LC-PUFA synthesis from plant-based MC-PUFAs, played an important role in allowing African populations obligatorily tethered to marine sources for LC-PUFAs in isolated geographic regions, to rapidly expand throughout the African continent 60–80 kya.

Citation: Mathias RA, Fu W, Akey JM, Ainsworth HC, Torgerson DG, Ruczinski I, et al. (2012) Adaptive Evolution of the FADS Gene Cluster within Africa. PLoS ONE 7(9): e44926. https://doi.org/10.1371/journal.pone.0044926 Editor: Takeo Yoshikawa, Rikagaku Kenkyūsho Brain Science Institute, Japan Received: July 2, 2012; Accepted: August 9, 2012; Published: September 19, 2012 Copyright: © Mathias et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Funding: This work was supported by National Institutes of Health grants P50 AT002782 (F.H.C.) and a CTSA grant to The Johns Hopkins Medical Institutions (R.A.M., I.R.). Additional support was received from the Wake Forest University Health Sciences Center for Public Health Genomics. K.C.B. was supported in part by the Mary Beryl Patch Turnbull Scholar Program and RAM by the MOSAIC initiative of Johns Hopkins University. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Competing interests: The senior author (F.H.C.) has published books with Rodale and Simon and Schuster and is a founder and consultant to GeneSmart Health, Inc., which may be partially related to his research. These potential conflicts of interest have been disclosed to Wake Forest University Health Sciences and to outside sponsors and are institutionally managed. No other authors have a conflict of interest. This does not alter the authors' adherence to all the PLOS ONE policies on sharing data and materials.

Introduction Studies suggest that anatomically modern humans arose in Africa approximately 150 thousand years ago (kya), expanded throughout Africa ∼60–80 kya, and to most parts of Europe and Asia ∼40 kya[1]–[6]. Numerous mitochondrial DNA studies support what Foster and Matsumera [5] describe as a ‘remarkable expansion’ from a small geographic region dating broadly to ∼60–80 kya. Interestingly, this expansion occurred at a period of time when archeological evidence indicates great advances in technological, social, and cognitive behavior [7]. Pivotal genetic and/or environmental (which led to shifts in adaptive and selective pressures) changes that enabled the dramatic expansions first within Africa and then throughout the world remain unknown. The brain is ∼60% (dry weight) lipid and is highly enriched in relatively rare (in nature) LC-PUFAs critical for proper brain structure and function, especially docosahexaenoic acid (DHA) and arachidonic acid (AA). Long chain polyunsaturated fatty acids (LC-PUFAs) are essential for brain structure, development, and function, and adequate dietary quantities of LC-PUFAs are thought to have been necessary for both brain expansion and the increase in brain complexity observed during modern human evolution [8]. In fact, LC-PUFAs have long been considered to be the most limiting nutrients for neural growth and complexity during fetal and early childhood development, but they are not widely available in foods [9], [10]. Additionally, metabolic studies to date indicate that humans have little capacity to synthesize LC-PUFAs, especially DHA, from plant-based MC-PUFAs [11], [12]. Consequently, an unresolved question is how enough DHA and AA were acquired to support large brains and increases in brain complexity throughout human evolution[13]–[15]. The prevailing view is that early human African populations obtained sufficient concentrations of LC-PUFAs to support brain evolution from DHA-enriched marine sources by living at the margins of lakes, rivers, or seashores in central and eastern Africa [8]. But what facilitated the movement away from stable sources of DHA and expansions into a wide range of environmental (including arid) conditions? We recently reported that human populations differ dramatically in their capacities to synthesize LC-PUFAs from plant-based MC-PUFAs. Specifically, we found higher levels of LC-PUFAs in African-American individuals compared to European-Americans, which is attributable in part to variation in the FADS gene cluster on chromosome 11q12–13 [16], [17]. These enzymes are responsible for three desaturation steps in DHA synthesis (Figure 1) and have long been recognized as rate-limiting in the conversion of MC-PUFAs to LC-PUFAs (reviewed in [18]). To date, however, the evolutionary history of functionally important variation in the FADS gene cluster has not been characterized, particularly in African populations and we investigate this in this work. PPT PowerPoint slide

PowerPoint slide PNG larger image

larger image TIFF original image Download: Figure 1. Overview of PUFA metabolism illustrating the critical involvement of FADS1 and FADS2 genes in desaturation steps necessary for the metabolism of intermediate chain unsaturated PUFAs (top dark gray panel) to long chain highly unsaturated PUFAs (bottom light gray panel). Omega 3 (right) and Omega 6 (left) pathways are illustrated along with known genes (in center rectangles) and dietary sources of the PUFAs. https://doi.org/10.1371/journal.pone.0044926.g001

Discussion The current work confirms marked global differences in the allele frequencies of variants in the FADS gene cluster that was first noted in African Americans and European Americans [16], [17], especially at variants strongly associated with the efficiency of conversion of LA and ALA to AA and DHA, respectively. Two independent samples of global genetic variation, the 1KGP and HGDP, reveal empiric evidence for signatures of positive selection above the 99th percentile of the genome in the 300 kb region centered on the FADS loci, making them strong candidates for having been subject to recent positive selection [19]. Jointly, these two sets of data support the hypothesis that advantageous mutations within the FADS gene cluster occurred prior to human migration out of Africa (∼85 kya), and swept to fixation within African but not European or Asian populations. Furthermore, multiple studies prove unequivocally that the derived alleles are associated with enhanced metabolism of MC-PUFA to LC-PUFAs, suggesting this is the driving force behind positive selection at the FADS gene cluster. Archeological evidence for regular active hunting of large animals emerged about 50 thousand years ago (kya); by 12–14 kya, humans begin using fishhooks, bows and arrows; and by 10 kya, they began to domesticate plants and animals [22]. Each of these advances would have ensured that humans had much more reliable dietary sources of LC-PUFAs (e.g., meat, eggs and fish). As to why the evidence of selective pressure and a selective sweep is restricted to within Africa and not beyond, we can speculate that perhaps as small groups of humans carrying the ancestral allele migrated out of Africa ∼40–50 kya, the selective pressure to make LC-PUFA was diminishing as the social and technological capacity to obtain them from their environment markedly increased. Consequently, it is likely that at some point, LC-PUFA synthesis may have become too metabolically expensive when it was readily available in common food sources thereby leading to the maintenance of the ancestral alleles in the non-African populations included here in our study. In summary, our results provide support for the hypothesis that positive selection acted to sweep derived alleles in the FADS region, which are associated with enhanced metabolism of MC-PUFA to LC-PUFAs, to near fixation in African populations. There has been considerable debate on how early humans escaped the developmental vulnerability to obtain sufficient DHA and AA necessary to maintain brain [8], [13], [23], [24] size and complexity, especially in light of studies suggesting that only trace amounts of LC-PUFAs could have been synthesized from plant-derived sources [25]. The evidence presented here from the 1 KGP and HGDP data suggest that a ‘game changing’ event (one or more mutations in the FADS cluster), likely occurred early at a time that could have dramatically impacted the rapid expansion from central source populations, ∼60–80 kya. Klein has suggested that modern patterns of culture and technology were due to a sudden change in cognitive capacities entailing some form of neurological mutation [26] which Mellars suggests occurred ∼80 kya [7]. While it is not possible to determine the cognitive impact of a mutation in the FADS cluster ∼84 kya, it is likely that suddenly having the capacity to more efficiently convert plant-based MC-PUFA to LC-PUFAs would have been an important advantage that would have facilitated expansion and movement into a variety of ecological locations.

Materials and Methods Analysis of the 1KGP Data To fine map the signature of positive selection of the FADS gene cluster in Africans, resequencing data in a 300 kb around this region (chr11∶61467097–61759006, hg19) were analyzed from the Thousand Genomes Project (1 KGP, Phase1integrated variant call set, http://www.1000genomes.org/). A total of 1,092 individuals from 14 populations were included (Table S1). We calculated genetic diversity (π), test statistics from site-frequency-spectrum based neutral tests (i.e., Tajima’s D and Fay & Wu’s H) [27], [28], and population differentiation (pairwise F ST [29]) using 5 kb sliding windows with an overlap of 1 kb. Positive selection can result in a deficit of genetic diversity and an excess of rare variants as measured by Tajima’s D. However, background selection can have a similar effect on genetic diversity as a selective sweep due to positive selection, making it difficult to distinguish between the two [30]. Therefore, we also compared measures of Fay & Wu’s H to better indicate an excess of high-frequency derived variants to gain a more comprehensive view of positive selection, and to distinguish between a selective sweep versus background selection. In order to assess the significance for each 5 kb window within the 300 kb region of interest, for each statistic we used the empirical distribution across all 5 kb windows across the genome to minimize the bias caused by low-coverage sequencing [31]. Phasing information for the resequencing data set was obtained from 1 KGP, and haplotype structure and LD blocks in the region was defined using standard confidence intervals algorithms in Haploview [32]. Only single nucleotide polymorphisms were considered and chimpanzee was treated as an outgroup. Median-joining network was constructed for the selected haplotype block using Network 4.61 (http://www.fluxus-engineering.com/sharenet.htm) [33]. Time to the most recent common ancestor (TMRCA) was estimated based on this network [34] assuming a chimpanzee-human split 6.5 million years ago. Analysis of the HGDP Data Genotypes from 1,043 samples from 52 populations in the Human Genome Diversity Panel (HGDP) available from the HGDP genome browser (http://hgdp.uchicago.edu/) [19] were used to evaluate evidence for positive selection around the FADS gene cluster (chr11∶61467097–61759006, hg19). Two haplotype-based tests were used to evaluate the degree of evidence for recent positive selection in the HGDP: the integrated Haplotype Score (iHS) [35], and the Cross Population Extended Haplotype Homozygosity (XP-EHH) [36]. The iHS is useful for identifying partial selective sweeps (or sweeps in progress) by identifying advantageous alleles at common SNPs that reside on unusually long haplotypes due to the actions of positive selection. However, the iHS has reduced power to detect selection as the advantageous allele approaches fixation. Therefore, we also examined the XP-EHH statistic that includes a comparison to a reference population, making it more powerful for identifying completed or almost completed selective sweeps whereby the advantageous allele is almost fixed in one population but polymorphic in the human population as a whole. Data on both these statistics and allele frequencies were downloaded from the HGDP selection browser.

Supporting Information Figure S1. Summary of sliding window analysis across the 300 kb region (chr11∶61467097–61759006, hg19) region around the FADS gene cluster for each individual 1 KGP population. Genetic diversity π (Panel A), Tajima’s D (Panel C), Fay & Wu’s H (Panel E), and pairwise Fst between YRI and other populations (Panel B) and that between LWK and other populations (Panel D) based upon a window size of 5 kb and an overlap of 1 kb. The horizontal black bar represents FADS1. https://doi.org/10.1371/journal.pone.0044926.s001 (TIFF) Table S1. Sample information for 1,092 unrelated individuals from the 1000 Genomes Project. https://doi.org/10.1371/journal.pone.0044926.s002 (DOCX)

Author Contributions Conceived and designed the experiments: RAM WF JMA DT. Performed the experiments: RAM WF JMA DT. Analyzed the data: RAM WF JMA DT. Contributed reagents/materials/analysis tools: RAM WF JMA DT KCB SS FHC. Wrote the paper: RAM WF JMA HCA DT IR SS KCB FHC.