We report the discovery of an African American Y chromosome that carries the ancestral state of all SNPs that defined the basal portion of the Y chromosome phylogenetic tree. We sequenced ∼240 kb of this chromosome to identify private, derived mutations on this lineage, which we named A00. We then estimated the time to the most recent common ancestor (TMRCA) for the Y tree as 338 thousand years ago (kya) (95% confidence interval = 237–581 kya). Remarkably, this exceeds current estimates of the mtDNA TMRCA, as well as those of the age of the oldest anatomically modern human fossils. The extremely ancient age combined with the rarity of the A00 lineage, which we also find at very low frequency in central Africa, point to the importance of considering more complex models for the origin of Y chromosome diversity. These models include ancient population structure and the possibility of archaic introgression of Y chromosomes into anatomically modern humans. The A00 lineage was discovered in a large database of consumer samples of African Americans and has not been identified in traditional hunter-gatherer populations from sub-Saharan Africa. This underscores how the stochastic nature of the genealogical process can affect inference from a single locus and warrants caution during the interpretation of the geographic location of divergent branches of the Y chromosome phylogenetic tree for the elucidation of human origins.

Main Text

1 Batini C.

Ferri G.

Destro-Bisol G.

Brisighelli F.

Luiselli D.

Sánchez-Diz P.

Rocha J.

Simonson T.

Brehm A.

Montano V.

et al. Signatures of the preagricultural peopling processes in sub-Saharan Africa as revealed by the phylogeography of early Y chromosome lineages. , 2 Cann R.L.

Stoneking M.

Wilson A.C. Mitochondrial DNA and human evolution. , 3 Hammer M.F. A recent common ancestry for human Y chromosomes. , 4 Semino O.

Santachiara-Benerecetti A.S.

Falaschi F.

Cavalli-Sforza L.L.

Underhill P.A. Ethiopians and Khoisan share the deepest clades of the human Y-chromosome phylogeny. 5 Underhill P.A.

Kivisild T. Use of y chromosome and mitochondrial DNA population structure in tracing human migrations. 6 Cruciani F.

Trombetta B.

Massaia A.

Destro-Bisol G.

Sellitto D.

Scozzari R. A revised root for the human Y chromosomal phylogenetic tree: The origin of patrilineal diversity in Africa. , 7 Thomson R.

Pritchard J.K.

Shen P.

Oefner P.J.

Feldman M.W. Recent common ancestry of human Y chromosomes: Evidence from DNA sequence data. , 8 Wei W.

Ayub Q.

Chen Y.

McCarthy S.

Hou Y.

Carbone I.

Xue Y.

Tyler-Smith C. A calibrated human Y-chromosomal phylogeny based on resequencing. 9 Behar D.M.

van Oven M.

Rosset S.

Metspalu M.

Loogväli E.L.

Silva N.M.

Kivisild T.

Torroni A.

Villems R. A “Copernican” reassessment of the human mitochondrial DNA tree from its root. , 10 Schuster S.C.

Miller W.

Ratan A.

Tomsho L.P.

Giardine B.

Kasson L.R.

Harris R.S.

Petersen D.C.

Zhao F.

Qi J.

et al. Complete Khoisan and Bantu genomes from southern Africa. 11 Seielstad M.T.

Minch E.

Cavalli-Sforza L.L. Genetic evidence for a higher female migration rate in humans. , 12 Wilder J.A.

Mobasher Z.

Hammer M.F. Genetic evidence for unequal effective population sizes of human females and males. , 13 Wilkins J.F. Unraveling male and female histories from human genetic data. Characterizing the root of the phylogenetic tree of individual genetic loci has influenced many researchers in their attempts to infer the time and place of the origin of anatomically modern humans (AMHs).Analyses of the nonrecombining portion of the Y chromosome and mtDNA have suggested that recent common ancestors lived in sub-Saharan Africa within the last 200,000 years.However, estimates of the time to the most recent common ancestor (TMRCA) have been consistently lower for the Y chromosome (∼60–140 thousand years ago [kya])than for mtDNA (∼140–240 kya).A variety of evolutionary processes (e.g., natural selection, differential migration of males and females, and/or a skew in the breeding sex ratio) have been invoked to explain this difference.

∼240 kb) of the X-degenerate portion of this chromosome, as well as a subset of these regions (∼180 kb) on a chromosome belonging to the previously known basal lineage A1b (which we rename here as A0). We note that all sampling procedures described herein were approved by and performed according to the ethical standards of the University of Arizona Human Subjects Committee. The ancestral state of each polymorphic site was inferred with pairwise alignments between the human (UCSC Genome Browser hg18) and chimpanzee (panTro3) reference sequences, and mutations were assigned to the branches (i.e., A00 and A0) shown as thick lines in 14 Kong A.

Frigge M.L.

Masson G.

Besenbacher S.

Sulem P.

Magnusson G.

Gudjonsson S.A.

Sigurdsson A.

Jonasdottir A.

Jonasdottir A.

et al. Rate of de novo mutations and the importance of father’s age to disease risk. Figure 1 Genealogy of A00, A0, and the Reference Sequence Show full caption 6 Cruciani F.

Trombetta B.

Massaia A.

Destro-Bisol G.

Sellitto D.

Scozzari R. A revised root for the human Y chromosomal phylogenetic tree: The origin of patrilineal diversity in Africa. Lineages on which mutations were identified and lineages that were used for placing those mutations on the genealogy are indicated with thick and thin lines, respectively. The numbers of identified mutations on a branch are indicated in italics (four mutations in A00 were not genotyped but are indicated as shared by Mbo in this tree). The time estimates (and confidence intervals) are indicated kya for three nodes: the most recent common ancestor, the common ancestor between A0 and the reference (ref), and the common ancestor of A00 chromosomes from an African American individual and the Mbo. Two sets of ages are shown: on the left are estimates (numbers in black) obtained with the mutation rate based on recent whole-genome-sequencing results as described in the main text, and on the right are estimates (numbers in gray) based on the higher mutation rate used by Cruciani et al. Genotyping of a DNA sample that was submitted to a commercial genetic-testing facility demonstrated that the Y chromosome of this African American individual carried the ancestral state of all known Y chromosome SNPs. To further characterize this lineage, which we dubbed A00 (see Figure S1 , available online, for proposed nomenclature), we sequenced multiple regions (totaling240 kb) of the X-degenerate portion of this chromosome, as well as a subset of these regions (∼180 kb) on a chromosome belonging to the previously known basal lineage A1b (which we rename here as A0). We note that all sampling procedures described herein were approved by and performed according to the ethical standards of the University of Arizona Human Subjects Committee. The ancestral state of each polymorphic site was inferred with pairwise alignments between the human (UCSC Genome Browser hg18) and chimpanzee (panTro3) reference sequences, and mutations were assigned to the branches (i.e., A00 and A0) shown as thick lines in Figure 1 on the basis of an infinite-sites model ( Table S1 ). To estimate the TMRCA of the Y chromosome tree that incorporates the newly discovered root defined by A00, we developed a likelihood-based method that uses mutation rates recently estimated by Kong et al.,who performed high-coverage whole-genome sequencing on 78 human pedigrees.

14 Kong A.

Frigge M.L.

Masson G.

Besenbacher S.

Sulem P.

Magnusson G.

Gudjonsson S.A.

Sigurdsson A.

Jonasdottir A.

Jonasdottir A.

et al. Rate of de novo mutations and the importance of father’s age to disease risk. Y is the mutation rate per year per base for the Y chromosome, L is the length of sequence analyzed for the autosomes, T is the average number of new mutations per genome in the autosomes, g is the generation time, b is the increment in mutation rate per base per year increment in the paternal age, F is the number of mutations originating in the mother, and g 0 is the average age of fathers in the study, 14 Kong A.

Frigge M.L.

Masson G.

Besenbacher S.

Sulem P.

Magnusson G.

Gudjonsson S.A.

Sigurdsson A.

Jonasdottir A.

Jonasdottir A.

et al. Rate of de novo mutations and the importance of father’s age to disease risk. L · μ Y = T / g + b ( 1 − g 0 / g ) − F / g . We obtained median values and generated 90% confidence intervals (CIs) for μ Y as a function of g (ranging from 20 to 40 years) by sampling 100,000 times from normal distributions for each of the parameters T ∼ N (63.2 [SD = 0.9]), b ∼ N (2.01 [SD = 0.17]), and F ∼ N (14.2 [SD = 3.12]), which result from the number of observed mutations (4,933 in 78 pedigrees), the linear coefficient in the linear fit of mutations as a function of paternal age (2.01 per year [SEM = 0.17 per year]), and the 71 mutations that can be confirmed as de novo and of maternal origin in five pedigrees, 14 Kong A.

Frigge M.L.

Masson G.

Besenbacher S.

Sulem P.

Magnusson G.

Gudjonsson S.A.

Sigurdsson A.

Jonasdottir A.

Jonasdottir A.

et al. Rate of de novo mutations and the importance of father’s age to disease risk. 9 (the length of autosomal sequence covered in each of the 78 trios) to obtain estimates and CIs for μ Y ( Y (across values of g) of 4.39 × 10−10 ≤ μ Y ≤ 7.07 × 10−10. To obtain point estimates of the ages of nodes in Y at age 30 years (6.17 × 10−10). To obtain upper and lower bounds of the CIs, we used the lower and upper bounds of μ Y , respectively. Estimates of mutation rate in Kong et al. 14 Kong A.

Frigge M.L.

Masson G.

Besenbacher S.

Sulem P.

Magnusson G.

Gudjonsson S.A.

Sigurdsson A.

Jonasdottir A.

Jonasdottir A.

et al. Rate of de novo mutations and the importance of father’s age to disease risk. 15 Conrad D.F.

Keebler J.E.

DePristo M.A.

Lindsay S.J.

Zhang Y.

Casals F.

Idaghdour Y.

Hartl C.L.

Torroja C.

Garimella K.V.

et al. 1000 Genomes Project

Variation in genome-wide mutation rates within and between human families. , 16 Roach J.C.

Glusman G.

Smit A.F.

Huff C.D.

Hubley R.

Shannon P.T.

Rowen L.

Pant K.P.

Goodman N.

Bamshad M.

et al. Analysis of genetic inheritance in a family quartet by whole-genome sequencing. , 17 Abecasis G.R.

Altshuler D.

Auton A.

Brooks L.D.

Durbin R.M.

Gibbs R.A.

Hurles M.E.

McVean G.A. 1000 Genomes Project Consortium

A map of human genome variation from population-scale sequencing. , 18 Awadalla P.

Gauthier J.

Myers R.A.

Casals F.

Hamdan F.F.

Griffing A.R.

Côté M.

Henrion E.

Spiegelman D.

Tarabeux J.

et al. Direct measure of the de novo mutation rate in autism and schizophrenia cohorts. , 19 Nelson M.R.

Wegmann D.

Ehm M.G.

Kessner D.

St Jean P.

Verzilli C.

Shen J.

Tang Z.

Bacanu S.A.

Fraser D.

et al. An abundance of rare functional variants in 202 drug target genes sequenced in 14,002 people. Assuming that mutations in the X-degenerate portion of the Y chromosome follow the linear model presented by Kong et al.,we adjusted for male-specific mutation processes. If μis the mutation rate per year per base for the Y chromosome, L is the length of sequence analyzed for the autosomes, T is the average number of new mutations per genome in the autosomes, g is the generation time, b is the increment in mutation rate per base per year increment in the paternal age, F is the number of mutations originating in the mother, and gis the average age of fathers in the study,then. We obtained median values and generated 90% confidence intervals (CIs) for μas a function of g (ranging from 20 to 40 years) by sampling 100,000 times from normal distributions for each of the parameters T ∼ N (63.2 [SD = 0.9]), b ∼ N (2.01 [SD = 0.17]), and F ∼ N (14.2 [SD = 3.12]), which result from the number of observed mutations (4,933 in 78 pedigrees), the linear coefficient in the linear fit of mutations as a function of paternal age (2.01 per year [SEM = 0.17 per year]), and the 71 mutations that can be confirmed as de novo and of maternal origin in five pedigrees,respectively. We then divided those values by 2.68 × 10(the length of autosomal sequence covered in each of the 78 trios) to obtain estimates and CIs for μ Figure S2 ). This resulted in a range for μ(across values of g) of 4.39 × 10≤ μ≤ 7.07 × 10. To obtain point estimates of the ages of nodes in Figure 1 , we used the median value of μat age 30 years (6.17 × 10). To obtain upper and lower bounds of the CIs, we used the lower and upper bounds of μ, respectively. Estimates of mutation rate in Kong et al.are consistent with those from a variety of large-scale sequencing studies,making it unlikely that our TMRCA estimates are strongly biased by an excess of false negatives in genome sequence data from human pedigrees.

20 Mendez F.L.

Watkins J.C.

Hammer M.F. A haplotype at STAT2 Introgressed from neanderthals and serves as a candidate of positive selection in Papua New Guinea. 6 Cruciani F.

Trombetta B.

Massaia A.

Destro-Bisol G.

Sellitto D.

Scozzari R. A revised root for the human Y chromosomal phylogenetic tree: The origin of patrilineal diversity in Africa. 14 Kong A.

Frigge M.L.

Masson G.

Besenbacher S.

Sulem P.

Magnusson G.

Gudjonsson S.A.

Sigurdsson A.

Jonasdottir A.

Jonasdottir A.

et al. Rate of de novo mutations and the importance of father’s age to disease risk. , 15 Conrad D.F.

Keebler J.E.

DePristo M.A.

Lindsay S.J.

Zhang Y.

Casals F.

Idaghdour Y.

Hartl C.L.

Torroja C.

Garimella K.V.

et al. 1000 Genomes Project

Variation in genome-wide mutation rates within and between human families. , 16 Roach J.C.

Glusman G.

Smit A.F.

Huff C.D.

Hubley R.

Shannon P.T.

Rowen L.

Pant K.P.

Goodman N.

Bamshad M.

et al. Analysis of genetic inheritance in a family quartet by whole-genome sequencing. , 17 Abecasis G.R.

Altshuler D.

Auton A.

Brooks L.D.

Durbin R.M.

Gibbs R.A.

Hurles M.E.

McVean G.A. 1000 Genomes Project Consortium

A map of human genome variation from population-scale sequencing. , 18 Awadalla P.

Gauthier J.

Myers R.A.

Casals F.

Hamdan F.F.

Griffing A.R.

Côté M.

Henrion E.

Spiegelman D.

Tarabeux J.

et al. Direct measure of the de novo mutation rate in autism and schizophrenia cohorts. −9 per base per year 6 Cruciani F.

Trombetta B.

Massaia A.

Destro-Bisol G.

Sellitto D.

Scozzari R. A revised root for the human Y chromosomal phylogenetic tree: The origin of patrilineal diversity in Africa. −10 − 7.07 × 10−10), the estimated TMRCA for the tree incorporating A00 as the basal lineage would be 209 kya, which is only slightly older than current estimates of the TMRCA of mtDNA and the age of the oldest AMH fossil remains. We note, however, that the higher mutation rate produces an estimate for the common ancestor of all non-African Y chromosome haplogroups (C through T) of ∼39 kya 6 Cruciani F.

Trombetta B.

Massaia A.

Destro-Bisol G.

Sellitto D.

Scozzari R. A revised root for the human Y chromosomal phylogenetic tree: The origin of patrilineal diversity in Africa. 21 Rasmussen M.

Guo X.

Wang Y.

Lohmueller K.E.

Rasmussen S.

Albrechtsen A.

Skotte L.

Lindgreen S.

Metspalu M.

Jombart T.

et al. An Aboriginal Australian genome reveals separate human dispersals into Asia. 22 Bowler J.M.

Johnston H.

Olley J.M.

Prescott J.R.

Roberts R.G.

Shawcross W.

Spooner N.A. New ages for human occupation and climatic change at Lake Mungo, Australia. , 23 Shang H.

Tong H.

Zhang S.

Chen F.

Trinkaus E. An early modern human from Tianyuan Cave, Zhoukoudian, China. , 24 Higham T.

Compton T.

Stringer C.

Jacobi R.

Shapiro B.

Trinkaus E.

Chandler B.

Gröning F.

Collins C.

Hillson S.

et al. The earliest evidence for anatomically modern humans in northwestern Europe. , 25 Demeter F.

Shackelford L.L.

Bacon A.M.

Duringer P.

Westaway K.

Sayavongkhamdy T.

Braga J.

Sichanthongtip P.

Khamdalavong P.

Ponche J.L.

et al. Anatomically modern human in Southeast Asia (Laos) by 46 ka. 26 Mendez F.L.

Watkins J.C.

Hammer M.F. Global genetic variation at OAS1 provides evidence of archaic admixture in Melanesian populations. We estimated the TMRCA of human Y chromosomes as 338 kya (95% CI = 237–581 kya). Using a joint likelihoodand the same mutation rate, we also estimated a divergence time between A0 chromosomes and the human reference as 202 kya (95% CI = 125–382 kya), a time that is older than that previously obtained by Cruciani et al. (142 kya).This discrepancy in the age of A0 is due to the fact that the earlier study did not utilize mutation rates based on recently obtained whole-genome sequence data.If we were to use the higher mutation rate (1.0 × 10per base per year) rather than a realistic range derived from whole-genome sequencing (4.39 × 10− 7.07 × 10), the estimated TMRCA for the tree incorporating A00 as the basal lineage would be 209 kya, which is only slightly older than current estimates of the TMRCA of mtDNA and the age of the oldest AMH fossil remains. We note, however, that the higher mutation rate produces an estimate for the common ancestor of all non-African Y chromosome haplogroups (C through T) of ∼39 kya(i.e., versus ∼63 kya for the mutation rate used here). It is difficult to reconcile the younger estimate with the timing of the out-of-Africa dispersal on the basis of the analyses of autosomal DNAand the fossil record outside of Africa.Regardless of which mutation rate is applied, the analysis of relative ages of nodesshows that the TMRCA of the A00-rooted tree is 67% older (95% CI = 35%–126%) than that of the A0-rooted tree.

27 Thomas M.G.

Bradman N.

Flinn H.M. High throughput analysis of 10 microsatellite and 11 diallelic polymorphisms on the human Y-chromosome. 20 Mendez F.L.

Watkins J.C.

Hammer M.F. A haplotype at STAT2 Introgressed from neanderthals and serves as a candidate of positive selection in Papua New Guinea. Figure 2 Map Showing Cameroon and the Approximate Location where Mbo Speakers Live We then genotyped a set of six Y chromosome short tandem repeats (Y-STRs) (DYS19, DYS388, DYS390, DYS391, DYS392, and DYS393) and found that the A00 chromosome carried the following alleles: 16-11-19-10-12-13. Upon searching a large pan-African database consisting of 5,648 samples from ten countries (Cameroon, Nigeria, Ghana, Senegal, Uganda, Tanzania, Malawi, Zimbabwe, Mozambique, and South Sudan), we identified 11 Y chromosomes that were invariant and identical to the A00 chromosome at five of the six Y-STRs (2 of the 11 chromosomes carried DYS19-16, whereas the others carried DYS19-15). These 11 chromosomes were all found in a sample of 174 (∼6.3%) Mbo individuals from western Cameroon ( Figure 2 ). Seven of these Mbo chromosomes were available for further testing, and the genotypes were found to be identical at 37 of 39 SNPs known to be derived on the A00 chromosome (i.e., two of these genotyped SNPs were ancestral in the Mbo samples) ( Table S1 ). Using the joint likelihood for the split of two lineages and a sublineage internal to one of them,we estimated that the most recent common ancestor of the African American and Mbo A00 chromosomes lived between 2.6 and 73 kya (95% CI, maximum likelihood estimate = 17 kya).

28 Bandelt H.J.

Forster P.

Röhl A. Median-joining networks for inferring intraspecific phylogenies. −4 and 2.76 × 10−5 per year, respectively. 29 Ballantyne K.N.

Goedbloed M.

Fang R.

Schaap O.

Lao O.

Wollstein A.

Choi Y.

van Duijn K.

Vermeulen M.

Brauer S.

et al. Mutability of Y-chromosomal microsatellites: rates, characteristics, molecular bases, and forensic implications. , 30 Zhivotovsky L.A.

Underhill P.A.

Cinnioğlu C.

Kayser M.

Morar B.

Kivisild T.

Scozzari R.

Cruciani F.

Destro-Bisol G.

Spedini G.

et al. The effective mutation rate at Y chromosome short tandem repeats, with application to human population-divergence time. Figure 3 Median-Joining Network of A00 Haplotypes Show full caption The network is based on haplotypes (constructed with 95 Y-STRs) of eight Mbo and an African American (AA) individual. All mutations are assumed to be single step and were given equal weight during the construction of the network. Marker names are indicated without “DYS” at the beginning. Table 1 Pairwise and Average STR-Based Estimates of TMRCA for A00 Chromosomes TMRCA (Years) Mbo 52 Mbo 159 Mbo 160 Mbo 170 Mbo 173 Mbo 183 Mbo 186 Mbo 199 African American Average with Mbo a a High mutation rate. Mbo 52 - 80 120 159 239 159 199 120 478 154 Mbo 159 381 - 120 159 239 159 199 120 478 154 Mbo 160 572 572 - 120 199 199 239 159 439 165 Mbo 170 763 763 572 - 239 239 279 199 478 199 Mbo 173 1,144 1,144 953 1,144 - 319 359 279 399 268 Mbo 183 763 763 953 1,144 1,526 - 120 120 558 188 Mbo 186 953 953 1,144 1,335 1,716 572 - 159 598 222 Mbo 199 572 572 763 953 1,335 572 763 - 518 165 African American 2,288 2,288 2,098 2,288 1,907 2,670 2,860 2,479 - 564 Average with Mbo b b Low mutation rate. 736 736 790 953 1,280 899 1,062 790 2,697 - We obtained point estimates of the TMRCA of two haplotypes by dividing the estimate of the number of mutational steps separating the haplotypes (as inferred from the network) by twice the mutational rate per STR and by the number of STRs scored in both haplotypes. Values above and below the diagonal separation correspond to estimates obtained with the high and low mutation rates, respectively (see text). The following abbreviation is used: TMRCA, time to the most recent common ancestor. We also estimated the level of variation among nine A00 lineages (i.e., including one additional Mbo individual) by using a battery of 95 Y-STRs for which all individuals had no missing data; ( Table S2 ). A median-joining networkshows that the African American A00 lineage is 11 mutational steps from the nearest Mbo and that the maximum difference between any pair of Mbo is nine steps ( Figure 3 and Table S2 ). On the basis of these levels of within- and between-group variation, we calculated a second divergence time estimate of 564–2,697 years ( Table 1 ) by assuming a mean Y-STR mutation rate of 1.32 × 10and 2.76 × 10per year, respectively.

9 Behar D.M.

van Oven M.

Rosset S.

Metspalu M.

Loogväli E.L.

Silva N.M.

Kivisild T.

Torroni A.

Villems R. A “Copernican” reassessment of the human mitochondrial DNA tree from its root. , 10 Schuster S.C.

Miller W.

Ratan A.

Tomsho L.P.

Giardine B.

Kasson L.R.

Harris R.S.

Petersen D.C.

Zhao F.

Qi J.

et al. Complete Khoisan and Bantu genomes from southern Africa. 31 McDougall I.

Brown F.H.

Fleagle J.G. Stratigraphic placement and age of modern humans from Kibish, Ethiopia. 32 Behar D.M.

Villems R.

Soodyall H.

Blue-Smith J.

Pereira L.

Metspalu E.

Scozzari R.

Makkan H.

Tzur S.

Comas D.

et al. Genographic Consortium

The dawn of human matrilineal diversity. , 33 Hammer M.F.

Karafet T.

Rasanayagam A.

Wood E.T.

Altheide T.K.

Jenkins T.

Griffiths R.C.

Templeton A.R.

Zegura S.L. Out of Africa and back again: Nested cladistic analysis of human Y chromosome variation. , 34 Ingman M.

Kaessmann H.

Pääbo S.

Gyllensten U. Mitochondrial genome variation and the origin of modern humans. , 35 Underhill P.A.

Shen P.

Lin A.A.

Jin L.

Passarino G.

Yang W.H.

Kauffman E.

Bonné-Tamir B.

Bertranpetit J.

Francalacci P.

et al. Y chromosome sequence variation and the history of human populations. 36 Nielsen R.

Beaumont M.A. Statistical inferences in phylogeography. , 37 Rosenberg N.A.

Nordborg M. Genealogical trees, coalescent theory and the analysis of genetic polymorphisms. 38 Batini C.

Jobling M.A. The jigsaw puzzle of our African ancestry: Unsolved, or unsolvable?. 39 Pickrell J.K.

Patterson N.

Barbieri C.

Berthold F.

Gerlach L.

Güldemann T.

Kure B.

Mpoloka S.W.

Nakagawa H.

Naumann C.

et al. The genetic prehistory of southern Africa. , 40 Schlebusch C.M.

Skoglund P.

Sjödin P.

Gattepaille L.M.

Hernandez D.

Jay F.

Li S.

De Jongh M.

Singleton A.

Blum M.G.

et al. Genomic variation in seven Khoe-San groups reveals adaptation and complex African history. , 41 Veeramah K.R.

Wegmann D.

Woerner A.

Mendez F.L.

Watkins J.C.

Destro-Bisol G.

Soodyall H.

Louie L.

Hammer M.F. An early divergence of KhoeSan ancestors from those of other modern humans is supported by an ABC-based analysis of autosomal resequencing data. In contrast to the estimates of all previous studies, our estimate of the Y chromosome TMRCA predates both that of known mtDNA lineagesand of the likely time of origin of AMHs on the basis of current fossil evidence.Although we identified the A00 lineage in an African American, the unusual Y-STR profile associated with this individual’s Y chromosome allowed us to identify the same divergent lineage in a single ethnic group living in a small region of western Cameroon ( Figure 2 ). Interestingly, contrary to previous Y chromosome and mtDNA studies, we did not identify the most basal lineage in a traditional hunter-gatherer population, such as the Khoisan or Pygmies.Rather, like the majority of groups that now occupy sub-Saharan Africa, the Mbo are a Bantu-speaking population. Given the large stochasticity expected in the genealogical process of a single locus,we should not be surprised that the Y chromosome genealogy does not match the currently accepted phylogeny of human populations on the basis of genome-wide data,which show that the earliest divergence events involved the ancestors of extant traditional hunter-gatherers.Given what we know about the coalescent process, the lack of dense sampling in sub-Saharan Africa (especially compared with Europe) has most likely contributed to the failure to identify more cases of incongruence between genealogical and population divergence.

th and 19th centuries). 42 Eltis D. The volume and structure of the transatlantic slave trade: A reassessment. The large sample size of African Americans was critical for the discovery of the A00 lineage given its very low frequency estimate in sub-Saharan Africa (0.19% [95% CI = 0.11%–0.35%]). However, even the large consumer-based African American data set examined here is a highly biased representation of sub-Saharan Africans because it captures only the genetic diversity inherited from ancestors living at a particular time and place (i.e., essentially the West African coastal source area of the Atlantic Slave Trade, which took place between the 15and 19centuries).It is likely that a much richer understanding of the Y chromosome phylogeny, as well as of genetic variation in general, would be achieved if more dense and even sampling were to be conducted across sub-Saharan Africa, especially given its high level of genetic diversity.