With almost 35,000 branches comprised of 316,000 SNPs, branches on the Y DNA tree are split every day. In fact, roughly 1000 branches are being added to the Y DNA tree of mankind at Family Tree DNA each month. I wrote about how to navigate their public tree, here, and you can view the tree, here. You can also read about Y DNA terminology, here.
Splitting a deep, very old branch into subclades is unusual – and exciting. Finding a new root, taking the entire haplogroup back another notch in time is even more amazing, especially when that root is 46,000 years old.
Haplogroup P is the parent haplogroup of both Q and R.
This portion of the 2010 haplogroup poster provided to Family Tree DNA conference attendees shows the basic branching structure of haplogroup P, R and Q, with haplogroup P being defined at that time by several equivalent SNPs that had not yet been split into any other subgroups or branches of P. Notice that P295 is shown, but not F115 or PF5850 which would be discovered in years to come.
Haplogroup R, a subclade of P, is the most common haplogroup in Europe, with roughly half of European men falling on some branch of haplogroup R.
In Ireland, nearly all men fall into a subgroup of haplogroup R.
A lot of progress has been made in the past decade.
This week, FamilyTreeDNA identified a split in haplogroup P, upstream of haplogroups Q and R, establishing a new root above haplogroup P-P295.
The Previous 2020 Tree
This is a 2020 “before” picture of the tree as it pertains to haplogroup P. You can see P-P295 at the top as the root or beginning mutation that defined haplogroup P. That was, of course, before this new discovery.
At Family Tree DNA, according to this tree where testers self-identify the location of their most distant known patrilineal ancestor, haplogroup P testers are found in multiple Asian locations. Some haplogroup P kits may have only purchased specific SNP tests, not the full Big Y and would actually be placed on downstream branches if they upgraded. Haplogroup P itself is quite rare and generally only found in Siberia, Southeast Asia, and diaspora regions.
Subgroups Q and R are found across Europe and Asia. Additionally, some subgroups of haplogroup Q migrated across the land bridge, Beringia, to populate the Americas.
You might be wondering – if there are only a few people who fall directly into haplogroup P, how was it split?
How Was Haplogroup P Split?
Testing of ancient DNA has been a boon to science and genealogy, both, and one of my particular interests.
Recently, Goran Runfeldt who heads the R&D team at FamilyTreeDNA was reading the paper titled Ancient migrations in Southeast Asia and noticed that in the supplementary material, several genomic files from ancient samples were available to download. Of course, that was just the beginning, because the files had to be aligned and processed – then the accuracy verified – requiring input from other team members including Michael Sager who maintains the Y DNA haplotree.
Additionally, the paper’s authors sequenced the whole genomes of two present-day Jehai people from Northern Parak State, West Malaysia, a small group of traditional hunter-gatherers, many of whom still live in isolation. One of those samples was the individual whose Y DNA provided the new root SNP, P-PF5850, that is located above the previous root of haplogroup P, P-P295.
Until this sample was analyzed by Goran, Michael and team, three SNPs, PF5850, P295 and F115, were considered to be equivalent, because no tie-breaker had surfaced to indicate which SNPs occurred in what order. Now we know that PF5850 happened first and is the root of haplogroup P.
I asked Michael Sager, the phylogeneticist at FamilyTreeDNA, better-known as “Mr. Big Y,” due to his many-years-long Godfather relationship with the Y DNA tree, how he knew where to place PF5850, and how it became a new root.
Michael explained that we know that P-PF5850 is the new root because the three SNPs that indicated the previous root, P295, PF5850 and F115 are present in all previous samples, but mutations at both P295 and F115 are absent in the new sample, indicating that PF5850 preceded what is now the old P root.
The two SNPs, P295 and F115 occurred some time later.
This sample also included more than 300 additional unique mutations that may become branches in the future. As more people test and more ancient samples are found and sequenced, there’s lots of potential for further branching. Even with more than 50,000 NGS Big-Y DNA tests in the Family Tree DNA database, there’s still so much we don’t know, yet to be discovered.
Amazingly, mutation P-PF5850 occurred approximately 46,000 years ago meaning that this branch had remained hidden all this time. For all we know, he might be the only man left alive with this particular lineage of mankind, but it’s likely more will surface eventually.
Michael Sager had previously analyzed samples from The population history of northeastern Siberia since the Pleistocene by Sikora et al. You’ll notice that additional branches of haplogroup P are reflected in ancient samples Yana1 and Yana2 which split P-M45, twice.
Today, haplogroup branches are defined by their SNP name, except for base and main branches such as P, P1, P2, etc. Haplogroup P is very old and you’ll find it referred to as simply P, P1 or P2 in most literature, not by SNP name. Goran labeled the old branch names beside the current SNP names, and provided a preliminary longhand letter+number branch name with the * for explanatory purposes.
The problem with the old letter+number system is that when new upstream branches are inserted, the current haplogroup “P” has to shift down and become something else. That’s problematic when reading papers. In order to understand which SNP the paper is actually referencing, you have to know what SNP was labeled as “P” at the time the paper was written.
For example, a new P was just defined, so P becomes P1, but the previous P1 has to become something else, resulting in a domino effect of renaming. While that’s not a significant issue with haplogroup P, because it has seldom changed, it’s a huge challenge with the 17,000+ haplogroup R branches. Hence, the transition several years ago to using SNP names such as P295 instead of the older letter+number designations such as P, which now needs to become something like P1.
Goran was kind enough to provide additional information as well, including the estimated “Time to Most Recent Common Ancestor,” or TMRCA, a feature currently in development for all haplogroups. You can see that P-PF5850 is estimated to be approximately 46,000 years old, “ca 46 kybp,” meaning “circa 46 thousand years before present.”
The founding ancestor of haplogroup Q lived approximately 31,000 years ago, and ancestral R lived about 28,000 years ago, someplace in Asia. Their common ancestor, P-P226, lived about 33,000 years ago.
How cool is this that you can peer back in time to view these ancient lineages – the story still told in our Y DNA today.
What About You?
If you’re a male, you can upgrade to or purchase a Big Y-700 to participate, here. In addition to discovering where you fall on the tree of mankind, you’ll discover who you match on your direct patrilineal side and where their ancestors are located in the world.
I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.
Thank you so much.
DNA Purchases and Free Transfers
- FamilyTreeDNA – Y, mitochondrial and autosomal DNA testing
- MyHeritage DNA – ancestry autosomal DNA only, not health
- MyHeritage DNA plus Health
- MyHeritage FREE DNA file upload – transfer your results from other vendors free
- AncestryDNA – autosomal DNA only
- 23andMe Ancestry – autosomal DNA only, no Health
- 23andMe Ancestry Plus Health
Genealogy Products and Services
- MyHeritage FREE Tree Builder – genealogy software for your computer
- MyHeritage Subscription with Free Trial
- Legacy Family Tree Webinars – genealogy and DNA classes, subscription-based, some free
- Legacy Family Tree Software – genealogy software for your computer
- Charting Companion – Charts and Reports to use with your genealogy software or FamilySearch
- Legacy Tree Genealogists – professional genealogy research