Mitotree: First, the Tree – Now the Paper

Posted on June 1, 2026 by Roberta Estes

It’s definitely a red-letter day.

Dr. Paul Maier, the lead author on the new paper Mitotree: The Universal Human Mitochondrial Reference Phylogeny at 10x the Resolution has uploaded the paper to the bioRxiv preprint server, here.

I want to congratulate all of the authors, most of whom are members of the FamilyTreeDNA R&D team as either employees or contractors. I’m a contractor and have had the honor of working with these amazing colleagues on this project since 2020.

About Mitotree

Mitotree was officially “born” on February 25, 2025, and the tree has been updated several times since. About 75% of FamilyTreeDNA’s customers who have taken the full-sequence mitochondrial DNA test received a more refined haplogroup with the release of Mitotree or subsequent updates. Those haplogroups are, on average, 2000 years newer than the person’s legacy Phylotree haplogroup, and some are much more recent.

This means that the tree branches have gotten much, much bushier close to the tips. In other words, lots more twigs and leaves!

Unfortunately, about 25% of testers did not receive a new haplogroup because they do not have any qualifying mutations:

Either because they have no additional mutations
Or because they have mutations, but they are unstable
Or because they have mutations, but no other testers have yet tested that match them to split a branch

The good news is that with the addition of haplotype clusters, everyone benefits from new matching and grouping tools. Testers are grouped into clusters on their matches page, and on the Match Time Tree in Discover, which is much more useful for genealogy.

I know this paper has been a long time coming, but it’s well worth the wait.

Mitotree was a massive undertaking. We began with PhyloTree v17 which had 5,438 hand-curated branches constructed from 24,275 full and partial mitochondrial sequences. Phylotree was last updated in 2016 before subsequently being abandoned.

The Million Mito Team developed Mitotree, a robust phylogeny with more than 54,000 branches formed from over 330,000 complete mitochondrial sequences, of which 177,196 are unique sequences.

Let’s Look Under the Hood

There are three critical pieces of information in those statements.

First, the PhyloTree curation and maintenance was not automated, and a paper detailing their build process, what mutations were included or excluded, and under what circumstances was never published.

Approximately once a year, a new PhyloTree was published where newer samples were individually evaluated and new haplogroups were hand-grafted onto an existing backbone tree.

This methodology did not allow for deep splits to become apparent, because the tree itself was never recalculated. This is exactly how haplogroup L7 went undetected until the Million Mito Team recalculated the tree, including the backbone, in 2022, and published this paper about L7’s discovery.

In other words, while PhyloTree was publicly available, there was no recipe for how it was created or maintained.

Clearly, the tree-building process had to be automated, as hand-curation was unsustainable. There were no academic programs in existence capable of handling the number of samples involved. Not even in 2016 for fewer than 25,000 samples, let alone today.

To maintain haplogroup naming consistency, the first thing our team had to do was write software to phylogenetically reverse engineer PhyloTree v17 to establish a common foundation on which to build. This step was essential for consistency and maintaining the established haplogroup naming pattern.

That software also had to be capable of scaling up exponentially. The first versions took weeks to run, which clearly wasn’t an acceptable long-term solution. Still, being able to establish a foundational backbone to build on programmatically was a victory in and of itself.

Second, PhyloTree used partial sequences, meaning HVR1 and HVR2 samples. Early academic researchers did not perform full sequence testing, so the curators of PhyloTree used what was available to the best of their ability.

With over 330,000 full-sequence samples available today, we no longer include partial samples.

Third, 177,196 of the 331,221 full sequence samples used were unique. Before launching the program to construct the tree, identical samples from known immediate relatives are deduped, when possible, in order to reduce unnecessary clutter and processing time.

This means two things. The actual number of testers is greater than 331,000. But more importantly, anyone who thinks that mitochondrial DNA isn’t interesting should take another look. More than half of the sequences used for tree-building are unique, which handily dispels the myth that mitochondrial DNA doesn’t mutate often enough to be useful for genealogy.

The Mitotree initiative has been both scientifically and genealogically successful beyond anything we could have imagined. The base tree includes approximately 180 branches that are older than 30,000 years, including the discovery of haplogroup L7 at 100,000 years old. These branches both expand and more firmly root the oldest portions of the tree.

Amazingly, haplogroup L7 has living descendants whose earliest known family members are found in Turkey, Saudi Arabia, Yemen, the UAE, Palestinian Territory, Ethiopia, Sudan, and South Africa.

Another fun discovery involved Otzi, the Iceman, a mummy found frozen in the Italian Alps who lived more than 5,000 years ago. He was thought to carry an extinct haplogroup, K1ö, named in his honor, but as it turns out, he’s actually a member of haplogroup K1f, a clade with living descendants in Algeria. Additionally, Otzi now matches four ancient burials too, so he does have cousins.

We couldn’t have made these discoveries without the right people testing, so please encourage everyone and dispel the discouraging myth that mitochondrial DNA isn’t useful or interesting. It absolutely IS, and the success stories keep rolling in!

Why Build a Phylogenetic Tree?

Simply put, the history of our ancestors, both recently and reaching back into ancient history, is revealed in the tree – and there’s absolutely no other avenue to reach this information. Ironically, it’s readily available to everyone because everyone has mitochondrial DNA and can easily take the test.

Mitochondrial DNA is different than Y-DNA, which has its own phylogenetic tree based on SNP mutations, and autosomal DNA, which has no tree.

The reason that both Y-DNA and mitochondrial DNA can have phylogenetic trees is that they are inherited from the appropriate parent with only occasional mutations, while autosomal DNA is roughly halved in each generation.

Y-DNA is inherited by males only from their fathers, with no admixture from their mother, while mitochondrial DNA is inherited by everyone from only their mothers, with no admixture from their father.

Autosomal DNA is inherited through random recombination, with half coming from each parent, except for the X chromosome which has its own inheritance pattern. X-DNA is often confused with mitochondrial DNA, but they are entirely different types of DNA. I wrote about that here.

No tree is possible for autosomal DNA, because it gets diced and riced in each generation.

The mutations that occur occasionally and randomly in both Y and mitochondrial DNA form a trail of breadcrumbs leading backward in time, or in our case, they form both the trunk and branches on the tree.

Those unique mutations, once they occur, are inherited by subsequent generations, forming a path back in time.

In current generations, those mutations provide testers with the ability to identify our closest cousins who inherited those same mutations and who have taken either a Big Y-700 test, in males, or a mitochondrial DNA full sequence test for everyone.

In this conceptual example, you can see that Ancestor 1 carries mutation A, as do the next two generations who inherited it from their parent. However, Ancestor 4 now has additional mutation B, so that person carries mutations A+B. This inheritance pattern continues through the apricol lineage as mutations C and D are added in subsequent generations, until “You” are born with A+B+C+D.

Your cousin’s ancestor, on the other hand, was also born to Ancestor 4 and carries both A+B, as seen in the green column. Three generations later, that line added mutation F. Your ancestor 7 added mutation C, so now the apricot and green lineages can easily be genetically distinguished from each other.

When a living person tests, we immediately know, based on the combination of their mutations, if and where they fit in this lineage, because both the apricot and green branches have accumulated unique mutations that the original blue Ancestor 4 and earlier ancestors did not have.

Using our knowledge of the tree branches, when and where they occurred, provides valuable genealogical information, along with fascinating Ancient Connections, both since and prior to the adoption of surnames.

Both Y-DNA and mitochondrial DNA can reach much further back in time than autosomal DNA because they are not diluted with DNA from the other parent in each generation.

So mitochondrial DNA is both broad, meaning many leaves, and deep, meaning it helps us look straight back in time like a laser sight, all the way to the common ancestor of all humanity, Mitochondrial Eve, who lived about 140,000 years ago in Africa.

Mitochondrial DNA Presents Unique Challenges

Mitochondrial DNA presents challenges not found in Y-DNA tree building.

For example, mitochondrial DNA only has 16,569 locations available to utilize, while Y-DNA currently uses roughly 22 million “gold standard” locations on the Y chromosome.

Of those 16,569 mitochondrial locations, some are not reliable enough for tree-building.

Unreliable mutations include:

Insertions, where extra copies of a particular nucleotide (Thymine, Adenine, Cytosine and Guanine) have been inserted at a specific location. Those are indicated by designations such as 309.1C where 309 indicates the marker location, .1 indicates the number of insertions at that location, and C (for Cytosine in this example) indicates the nucleotide inserted.
Heteroplasmies occur when multiple nucleotides are detected at a specific location. They are reported by a different letter than T, A, C or G, depending on which of multiple nucleotides are found. Heteroplasmies tend to “come and go” based on detection and threshold levels, so they can’t be used the same way as more stable mutations for tree building – and are often, but not always, unreliable for genealogy. I wrote about this in the article, What is a Heteroplasmy and Why Do I Care?.

Those locations and types of mutations have been excluded from forming tree branches, or downweighted, because they are too prone to mutating back and forth. However, they *might* be useful for genealogical purposes. Less-than-reliable mutations are now used to create haplotype clusters, even though they aren’t used to create new branches on the Mitotree.

I wrote about how haplogroups and haplotype clusters are formed in these articles:

Weighting and Confidence Factors

Mitotree formation would have been a lot easier if delineations, meaning inclusions and exclusions, were clear, either yes or no, but they aren’t.

Some were obvious from the get-go, such as insertions at location 309 and elsewhere, but other situations were much less obvious.

For example, sometimes there’s a specific location that seems prone to reversion, mutating back and forth, meaning that it mutates, then returns to its original state, then repeats the process.

Reversions are a natural phenomenon that occurs frequently in mitochondrial DNA, but is rarely, if ever, found in Y-DNA.

Let’s look at an example.

Courtesy Dr. Paul Maier

How many reversions at the same location are too many, especially if they are close in the tree?

In the above example, the mutation from A to G occurs just below the first arrow, forming haplogroup L1, a branch of L. The red areas all carry that mutation, subsequently forming eight new branches.

However, one step downstream from that mutation, just above the second arrow, location 7055 back-mutates, or reverts to A from G, which is indicated by the “!”. That reverse mutation forms haplogroup L1c3.

If location 7055 continues to flip back and forth between A and G, at what point do we have less confidence in that location, and at what point should a location be excluded from the tree and prevented from creating or dividing a branch?

The answer is that “it depends,” sometimes on the branch, sometimes on the “group” of other mutations it’s found with, and other factors. Some locations are stable in some parts of the tree, but unstable in others. We certainly never expected to see that!

This means the team had to design and build a weighting methodology so that relevant mutations, such as reversions, are not summarily excluded from tree building but instead carry different confidence weighting levels, depending on the circumstances.

Some samples, such as ancient DNA, were down-weighted in general due to their propensity to contain artifacts resulting from deterioration. Ancient samples can still influence branching, just not as much as a high-quality modern sample.

Furthermore, especially when utilizing academic samples, results with a high number of heteroplasmies are excluded, along with those with ambiguous reads and missing upstream mutations, which were previously inferred with PhyloTree. Academic samples vary in quality and age, and we have no way of knowing which quality criteria were used by that lab at that time.

These types of variances made constructing and updating the Mitotree more challenging than the Y-DNA tree, which is not subject to weighting, resulting from phylogenetic tug-of-war between mutations.

In some situations, the addition of just one test can make the difference between a new branch, or no branch, in a subsequent run of the tree. Due to this type of scenario, and fine-tuning the algorithm, some people’s new haplogroups have reverted to an earlier haplogroup in subsequent Mitotree updates.

The paper and supplemental materials provide details about the exclusion process, types of exclusions, and a list of excluded marker locations.

You can view the confidence of any haplogroup in the Classic Mitotree view in Discover.

My haplogroup, J1c2f, is formed by the mutation G9055A, and you can see that the confidence rank is 7.5 out of 10.

Mousing over the little up-arrow tree icon beside the star explains changes in nearby branches, which can affect the haplogroup’s confidence ranking.

Branches are not renamed for convenience, and only when phylogenetically warranted. Existing haplogroup names used either on PhyloTree, in academic literature, or previously on the Y-Full tree are either maintained or avoided to eliminate potential confusion. No one wants two different haplogroup names depending on which tree is being viewed.

Previously obsoleted names remain permanently obsoleted and are not reused.

The paper explains further about technical corrections and tie-breaker situations. In some cases, potential branches with equal or near-equal weighting are flagged for team review.

Amazing Discoveries

I encourage everyone to read the section in the paper beginning with “Notable discoveries.” These aren’t people, as in Discover’s Notable Connections, but scientific accomplishments achieved with the new Mitotree.

Our knowledge of human migration within and out of Africa has been greatly refined, as well as the ancestral path into and across Eurasia, Asia, and into the Pacific Rim. If you have unusual mitochondrial haplogroups such as L, M, N, P, Q, R or S, you’ll absolutely want to read this.

Of course, in time these haplogroups branch and become Paleolithic haplogroups, then the Gravettian-Mesolithic followed by the Hunter-Gatherers found throughout Europe that we are familiar with. We’ve learned a great deal from rare ancient DNA samples that anchor more modern haplogroups in a place and time, and inform us of migration patterns as well as how now-extinct ghost populations gave rise to current ones.

The earliest humans, whom Mitotree has more firmly anchored, formed a trickle out of Africa that became a bifurcated stream, eventually flowing across the rest of the world. What recorded and even archaeological history cannot tell us can be and is revealed through the patterns held in our DNA today – and Mitotree is our map to read them. Common ancestors are found where our mutations as haplogroups converge, joining as we travel backward in time, piercing an otherwise impenetrable veil.

For those with Native American ancestry, Mitotree expands the two-wave theory, refining it into five or six probable migration surges, depending on how you count, based on a combination of haplogroup ages and distribution.

Summarizing from the paper:

The first wave of haplogroups A2, B2, C1b, C1c, C1d, D1, and D4h3a arrived from Asia, across Beringia or along the Pacific Corridor, about 17,000 to 18,500 years ago, and expanded along the Pacific coast. D4h3a is found almost exclusively in the Pacific region.

This was followed by haplogroup C4c about 15,800 years ago and X2a about 10,000 years ago, which expanded into the interior through the ice-free corridor east of the Rockies after the ice melted.

Next were the Paleo-Eskimo and Na-Dene speakers in haplogroups A2a, D2a, D2b, D2c/D3, and D4b1a2a1a2, who, between 3000 and 7000 years ago, made their way from Alaska, across the polar regions of Canada, into Greenland.

Na-Dene speakers, Apache and Navajo, in haplogroups A2a and B2a made their way southwest between 1300 and 1500 CE, or between 500 and 700 years ago.

Last, the present-day Inuit-Yupik expanded from Beringia to Greenland about 1000 CE.

For additional information, please see the Native American lineages section of the paper.

Mitotree has also clarified the ancestors of the Ainu/Jomon people from Hokkaido, Japan, and their ancient Paleolithic northwest Asian and Siberian relatives. The ancestors of this group and Native Americans share even earlier Asian ancestors.

The history of the Jewish people has been significantly refined as well, expanding on earlier works, and is found in the Counting the newest Jewish founders section of the paper.

43% of Ashkenazi Jewish testers fell into 5 founding lineages where they had no subclades before, but they do now.
Two clades of haplogroup K have now been split 4000 to 5000 years ago in Romania.
There’s new information about the crypto-Jewish community in Portugal, Mountain Jews from Persia and the Caucasus, plus Jewish groups in India, Georgia, Azerbaijan, Israel and Libya.
Additionally, haplogroup M33c9b tells the story of Ashkenazi Silk Road merchants who traveled between China and Europe.

The paper reports the isolation of Sardinian-specific haplogroups and provides substantially greater structural definition for the Saami people, increasing from 22 subclades to more than 300.

The Notable discoveries section is chock full of information.

Genealogy Jump-Start

Today’s tree is ten times larger than the 2016 tree, and will continue to grow as more people take a full sequence mitochondrial DNA test, available at FamilyTreeDNA.

The greatly improved tree alone is not the only facilitator of genealogical success. A dozen reports, including Haplotype Clusters and the Match Time Tree are provided for all full-sequence testers in Discover. I wrote about how to effectively use your matches and Discover to break through genealogy brick walls, here.

There are a couple of things you need to do to increase your opportunities for success and to help Discover and Mitotree.

Genealogy is a team sport, and you can increase everyone’s success rate by completing (and updating) your Earliest Known Ancestor (EKA) and location information, found under “Account Settings” beneath your name in the upper right hand corner when signed on, then “Genealogy”, then “Earliest Known Ancestor”, and by providing a family tree or a link to WikiTree.

Identifying common ancestors is what testing is all about, and these are all important success factors. Everyone wants to identify previously unknown ancestors.

Mitotree is More Than Genealogy

Of course, as genealogists, we’re focused on how to use the new Mitotree information, paired with Discover, to identify brick-walled ancestors and learn more about them. I’ve written specifically about how to do that in these two articles:

Mitotree isn’t just an explosion for genealogy, though – it’s an incredible scientific achievement. Instead of genealogy benefiting from other specialties, now they can benefit from what genealogy has wrought.

Mitotree presents opportunities to rethink and potentially recalculate dating and information in other fields, such as archaeology, medical genetics, forensics, and history.

We know vastly more than ever before, but this is only the beginning.

With each new tester and every ancient genome added to the growing body of evidence, our understanding becomes more refined, revealing insights about our ancestors, and weaving our thread into the broader tapestry of human history.

_____________________________________________________________

Share the Love!

You’re always welcome to forward articles or links to friends and share on social media.

Subscribe!

If you haven’t already subscribed, it’s free. You’ll receive an e-mail whenever I publish by clicking the “follow” button at the top of the main blog page, here.

Help Keep This Blog Free

I receive a small commission when you click a vendor link in my articles and purchase that item. This does NOT increase your price but helps me keep the lights on and this informational blog free for everyone. Please click on the affiliate links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Uploads

FamilyTreeDNA – Y-DNA, mitochondrial and autosomal DNA testing
MyHeritage DNA – Autosomal DNA test
AncestryDNA – Autosomal DNA test
AncestryDNA Plus Traits
23andMe Ancestry – Autosomal DNA only, no Health
23andMe Ancestry Plus Health

Genealogy Products and Services

MyHeritage Subscription with Free Trial
Legacy Family Tree Webinars – Genealogy and DNA classes, subscription-based, some free
Legacy Family Tree Software – Genealogy software for your computer
OldNews – Old Newspapers with links to save to MyHeritage trees
MyHeritage Omni comprehensive “everything included” subscription plan
Newspapers.com – Search newspapers for your ancestors
NewspaperArchive – Search different newspapers for your ancestors

My Books

DNA for Native American Genealogy – by Roberta Estes, for those ordering the e-book from anyplace, or paperback within the United States
DNA for Native American Genealogy – for those ordering the paperback outside the US
The Complete Guide to FamilyTreeDNA – Y-DNA, Mitochondrial, Autosomal and X-DNA – for those ordering the e-book from anyplace, or paperback within the United States
The Complete Guide to FamilyTreeDNA – Y-DNA, Mitochondrial, Autosomal and X-DNA for those ordering the paperback from outside the US

Genealogy Books

Genealogical.com – Lots of wonderful genealogy research books
American Ancestors – Wonderful selection of genealogy books

Genealogy Research

Legacy Tree Genealogists – Professional genealogy research

Discover’s Ancient Connections – How Are You Related?

Posted on May 8, 2025 by Roberta Estes

When FamilyTreeDNA released the new Mitotree, they also introduced their new mtDNA Discover tool, which is a series of 13 reports about each haplogroup, including one titled Ancient Connections.

Ancient Connections shows you ancient relatives from your direct matrilineal line through a mitochondrial DNA test or through a Y-DNA (preferably Big Y-700) test.

Ancient Connections help you connect the present to the past based on archaeological excavations around the world and DNA sequencing of remains. Ancient Connections links you through your DNA to ancient people, cultures, and civilizations that would be impossible to discover any other way. You don’t have to wonder if it’s accurate, or which line it came from, because you know based on the test you took. Discover’s Ancient Connections track the journey of your ancestors and relatives.

Ancient Connections can be very exciting – and it’s easy to get swept away on a wave of jubilation.

Are those people your ancestors, or relatives, or what? How do you know? How can you figure it out?

So let me just answer that question generally before we step through the examples, so you can unveil your own connections.

You are RELATED to both Ancient and Notable Connections. Notable Connections are famous or infamous people who have lived more recently, and their relatives have been tested to identify their haplogroups.
It’s VERY unlikely that Ancient Connections are your direct ancestors – but someone in the line that you share IS your ancestor.
Many factors enter into the equation of how you are related, such as the haplogroup(s), the timeframe, and the location.
The sheer number of people who were living at any specific time makes it very unlikely that any one person with that haplogroup actually was your direct ancestor. They are much more likely to be your distant cousin.

Factors such as whether you share the same haplogroup, similar locations, and the timeframe make a huge difference. Everyone’s situation is different with each Ancient Connection.

Ok, are you ready for some fun???

Let’s find out how to leverage these tools.

Ancient Connections

Ancient connections are fun and can also be quite useful for genealogy.

In this article, I’m going to use a mitochondrial DNA example because full sequence testers at FamilyTreeDNA just received their new Mitotree haplogroup. mtDNA Discover was released with Mitotree, so it’s new too. However, the evaluation process is exactly the same for Y-DNA.

Everyone’s results are unique, so your mileage absolutely WILL vary. What we are going to learn here is a step-by-step analytical process to make sure you’re hearing the message from your ancestors – and interpreting it correctly.

To learn about your new mitochondrial DNA haplogroup and haplotype, read the articles:

Radegonde Lambert

Let’s start with an Acadian woman by the name of Radegonde Lambert. She’s my ancestor, and I wrote about her years ago in the article, Radegonde Lambert (1621/1629-1686/1693), European, Not Native.

At the time, that article caused a bit of a kerfluffle, along with the article, Haplogroup X2b4 is European, Not Native American, because Radegonde’s X2b4 haplogroup had been interpreted by some to mean that her matrilineal ancestors were Native American.

That often happens when a genealogical line abruptly ends and hits a brick wall. What probably began with “I wonder if…”, eventually morphed into “she was Native,” when, in fact, she was not. In Radegonde’s case, it didn’t help any that her haplogroup was X2b4, and some branches of base haplogroup X2 are in fact Native, specifically X2a, However, all branches of X2 are NOT Native, and X2b, which includes X2b4, is not.

The Acadians were French people who established a colony in what is now Nova Scotia in the 1600s. They did sometimes intermarry with the Native people, so either Native or European heritage is always a possibility, and that is exactly why DNA testing is critically important. Let’s just say we’ve had more than one surprise.

I always reevaluate my own work when new data becomes available, so let’s look to see what’s happening with Radegonde Lambert now, with her new haplogroup and mtDNA Discover.

Sign on and Identify Your Haplogroup

You can follow along here, or sign on to your account at FamilyTreeDNA.

The first step is to take note of your new Mitotree haplogroup.

Your haplogroup badge is located near the bottom right of your page after signing in.

The tester who represents Radegonde Lambert has a Legacy Haplogroup of X2b4 and has been assigned a new Mitotree haplogroup of X2b4g.

Click Through to Discover

To view your personal Discover information, click on the Discover link on your dashboard.

You can simply enter a haplogroup in the free version of mtDNA Discover, but customers receive the same categories, but significantly more information if they sign in and click through.

You can follow along on the free version of Discover for haplogroups X2b4 here, and X2b4g here.

Clicking on either the Time Tree, or the Classic Tree shows that a LOT has changed with the Mitotree update.

Each tree has its purpose. Let’s look at the Classic Tree first.

The Classic Tree

I like the Classic Tree because it’s compact, detailed and concise, all in one. Radegonde Lambert’s new haplogroup, X2b4g is a subgroup of X2b4, so let’s start there.

Click on any image to enlarge

Under haplogroup X2b4, several countries are listed, including France. There are also 7 haplotype clusters, which tell you that those testers within the cluster all match each other exactly.

It’s worth noting that the little trowels (which I thought were shovels all along) indicate ancient samples obtained from archaeological digs. In the Discover tools, you’ll find them under Ancient Connections for that haplogroup. We will review those in a minute.

In Mitotree, haplogroup X2b4 has now branched several granular and more specific sub-haplogroups.

Radegonde Lambert’s new haplogroup falls below another new haplogroup, X2b4d’g, which means that haplogroup X2b4d’g is now the parent haplogroup of both haplogroups X2b4d and X2b4g. Both fall below X2b4d’g.

Haplogroup names that include an apostrophe mean it’s an umbrella group from which the two haplogroups descend – in this case, both X2b4d and X2b4g. Apostrophe haplogroups like X2b4d’g are sometimes referred to as Inner Haplogroups.

You can read more about how to understand your haplogroup name, here.

In this case, haplogroup X2b4d’g is defined by mutation G16145A, which is found in both haplogroups X2b4d and X2b4g. Both of those haplogroup have their own defining mutations in addition to G16145A, which caused two branches to form beneath X2b4d’g.

You can see that Radegonde Lambert’s haplogroup X2b4g is defined by mutation C16301T, but right now, that really doesn’t matter for what we’re trying to accomplish.

In descending order, for Radegonde, we have haplogroups:

X2b4
X2b4d’g
X2b4g

Your Match Page

Looking at the tester’s match page, Radegonde’s haplotype cluster number and information about the cluster are found below the haplogroup. You can view your cluster number on:

Your match page
The Match Time Tree beside your name and those of your matches in the same haplotype cluster
The Scientific Details – Variants page

I wrote about haplotype clusters, here.

Click on any image to enlarge

On your match page, which is where most people look first, you are in the same haplogroup and haplotype cluster with anyone whose circle is also checked and is blue. If the little circles are not checked and blue, you don’t share either that haplogroup, haplotype cluster, or haplogroup and haplotype cluster. If you share a haplotype cluster, you will always share the same haplogroup.

Haplotype clusters are important because cluster members match on exactly the same (but less stable) mutations IN ADDITION to haplogroup-defining (more stable) mutations.

However, you may also share an identifiable ancestor with people in different haplotype clusters. Mutations, and back mutations happen – and a lot more often at some mutation locations, which is why they are considered less stable. Normally, though, your own haplotype cluster will hold your closest genealogical matches.

In Discover, you can see that Radegonde’s haplotype cluster, F585777, displays three tester-supplied countries, plus two more. Click on the little plus to expand the countries.

What you’re viewing are the Earliest Known Ancestor (EKA) countries that testers have entered for their direct matrilineal ancestor.

Let’s hope they understood the instructions, and their genealogy information was accurate.

Notice that Canada and France are both probably quite accurate for Radegonde, based on the known history of the Acadians. There were only French and Native women living in Nova Scotia in the 1600s, so Radegonde had to be one or the other.

The US may be accurate for a different tester whose earliest known ancestor (EKA) may have been found in, say, Louisiana. Perhaps that person has hit a brick wall in the US, and that’s all they know.

The US Native American flag is probably attributable to the old “Native” rumor about Radegonde, and the tester didn’t find the Canadian First Nations flag in the “Country of Origin” dropdown list. Perhaps that person has since realized that Radegonde was not Native and never thought to change their EKA designation.

The little globe with “Unknown Origins” is displayed when the tester doesn’t select anything in the “Country of Origin.”

Unfortunately, this person, who knew when Radegonde Lambert lived, did not complete any additional information, and checked the “I don’t know this information” box. Either Canada, or France would have been accurate under the circumstances. If they had tracked Radegonde back to Canada and read about her history, they knew she lived in Canada, was Acadian, and therefore French if she was not Native. Providing location information helps other testers, whose information, in turn, helps you.

Please check your EKA, and if you have learned something new, PLEASE UPDATE YOUR INFORMATION by clicking on the down arrow by your user name in the upper right hand corner, then Account Settings, then Genealogy, then Earliest Known Ancestors.

Don’t hesitate to email your matches and ask them to do the same. You may discover that you have information to share as well. Collaboration is key.

Radegonde’s Discover Haplogroup

First, let’s take a look at Radegonde’s haplogroup, X2b4g, in Discover.

The Discover Haplogroup Story landing page for haplogroup X2b4g provides a good overview. Please READ this page for your own haplogroup, including the little information boxes.

The history of Radegonde’s haplogroup, X2b4g, is her history as well. It’s not just a distant concept, but the history of a woman who is the ancestor of everyone in that haplogroup, but long before surnames. Haplogroups are the only way to lift and peer behind the veil of time to see who our ancestors were, where they lived, and the cultures they were a part of.

We can see that Radegonde’s haplogroup, X2b4g, was born in a woman who lived about 300 CE, Common (or Current) Era, meaning roughly the year 300, which is 1700 years ago, or 1300 years before Radegonde lived.

This means that the tester shares a common ancestor with everyone, including any X2b4g remains, between now and the year 300 when haplogroup X2b4g was born.
This means that everyone who shares haplogroup X2b4g has the same common female ancestor, in whom the mutation that defines haplogroup X2b4g originated. That woman, the common ancestor of everyone in haplogroup X2b4g, lived about the year 300, or 1700 years ago.
Your common ancestor with any one individual in this haplogroup can have lived ANYTIME between very recently (like your Mom) and the date of your haplogroup formation.
Many people misinterpret the haplogroup formation date to mean that’s the date of the MRCA, or most recent common ancestor, of any two people. It’s not, the haplogroup formation date is the date when everyone, all people, in the haplogroup shared ONE ancestor.
The MRCA, or most recent common ancestor, is your closest ancestor in this line with any one person, and the TMRCA is the “time to most recent common ancestor.” It could be your mother, or if your matrilineal first cousin tested, your MRCA is your grandmother, and the TMRCA is when your grandmother was born – not hundreds or thousands of years ago.
Don’t discount mitochondrial DNA testing by thinking that your common ancestor with your matches (MRCA) won’t be found before the haplogroup birth date – the year 300 in Radegonde’s case. The TMRCA for all of Radegonde’s descendants is about 1621 when she was born.
The haplogroup birth date, 1700 years ago, is the common ancestor for EVERYONE in the haplogroup, taken together.
Mitochondrial DNA is useful for BOTH recent genealogy and also reveals more distant ancestors.
Looking back in time helps us understand where Radegonde’s ancestors lived, which cultures they were part of, and where.

There are two ways to achieve that: Radegonde’s upstream or parent haplogroups, and Ancient Connections.

Parent Haplogroups

X2b4g split from X2b4d’g, the parent haplogroup of BOTH X2b4d and X2b4g, around 3700 years ago, or about 1700 BCE (Before Common (or Current) Era).

Looking at either the Classic Tree, the Time Tree (above) or the Match Time Tree, you can see that haplogroup X2b4g has many testers, and none provide any locations other than France, Canada, the US, unknown, and one Native in the midst of a large haplotype cluster comprised of French and Canadian locations. Due to the size of the cluster, it’s only partially displayed in the screen capture above.

You can also see that sister haplogroup X2b4d split from X2b4d’g around the year 1000, and the ancestors of those two testers are reported in Norway.

Many, but not all of the X2b4g testers are descendants of Radegonde. Even if everyone is wrong and Radegonde is not French, that doesn’t explain the other matches, nor how X2b4g’s sister haplogroup is found in Norway.

Clearly, Radegonde isn’t Native, but there’s still more evidence to consider.

Let’s dig a little deeper using Radegonde’s Ancient Connections.

Ancient Connections

While ancestor and location information are user-provided, Ancient Connections are curated from scientifically published papers. There’s no question about where those remains were found.

When signed in to your account, if you’ve taken the mtFull Sequence test, clicking on the Ancient Connections tab in Discover shows a maximum of around 30 Ancient Connections. If you’re viewing the free version of Discover, or you’ve only tested at the HVR1 or HVR1+HVR2 levels, you’ll see two of your closer and one of your most distant Ancient Connections. It’s easy to upgrade to the mtFull.

In Discover, the first group of Ancient Connections are genetically closest to you in time, and the last connections will be your most distant. Some connections may be quite rare and are noted as such.

Please keep in mind that oldest, in this case, Denisova 8 and Sima de los Huesos, will never roll off your list. However, as new studies are released and the results are added to the tree, you may well receive new, closer matches. New results are being added with each Discover update.

It’s very exciting to see your Ancient Connections, but I need to say three things, loudly.

Do NOT jump to conclusions.
These remains are probably NOT YOUR ANCESTORS, but definitely ARE your distant cousins.
Ancient Connections ARE wonderful hints, especially when taken together with each other and additional information.

It’s VERY easy to misinterpret Ancient Connections because you’re excited. I’ve done exactly that. To keep the assumption monster from rearing its ugly head, I have to take a breath and ask myself a specific set of questions. I step through the logical analysis process that I’m sharing with you.

The first thing I always want to know is where the genetically closest set of remains was found, when, and what we know about them, so let’s start there. Keep in mind that the closest remains genetically may not be the most recent set of remains to have lived. For example, my own haplogroup will be the closest genetically, but that person may have lived 2000 years ago. An Ancient Connection in a more distant haplogroup may have lived only 1000 years ago. The closest person genetically is NOT the same as the person who lived the most recently.

Our tester, Radegonde’s descendant, has no Ancient Connections in haplogroup X2b4g or X2b4d’g, but does have two in haplogroup X2b4, so let’s start there.

Discover provides a substantial amount of information about each set of ancient remains. Click on the results you want to view, and the information appears below.

Radegonde’s first Ancient Connection is Carrowkeel 534. The graphic shows the tester, the Ancient Connection being viewed, and their shared ancestor’s haplogroup. In this case, the shared ancestor haplogroup of Carrowkeel 534 and the tester is X2b4, who lived about 5000 years ago.

It’s very easy to look at Carrowkeel 534, become smitten, and assume that this person was your ancestor.

By Shane Finan – Own work, CC BY-SA 4.0, https://commons.wikimedia.org/w/index.php?curid=35098411

It’s especially easy if you WANT that person to be your ancestor. Carrowkeel 534 was buried in a passage tomb in County Sligo, Ireland. I’ve been there.

However, don’t let your emotions get involved – at least not yet.

This is the first example of the steps that determine that these remains are NOT YOUR ANCESTOR.

Carrowkeel 534 was a male, and we all know that males do not pass on their mitochondrial DNA. Well, that’s an inconvenient fact.😊
There are two sets of X2b4 remains in Ancient Connections. Carrowkeel 534 remains are about 4600-5000 years old, and your common ancestor with them lived about 5000 years ago. However, Radegonde was French and migration from Ireland to France is not typical.
The other set of X2b4 remains, Ladoga 16, lived more recently, between the years of 900 and 1200 (or 800-1100 years ago), but they are found in Russia.
Radegonde’s parent haplogroup, X2b4d’g was born about 3700 years ago, which excludes the Russian remains from being Radegonde’s direct ancestor.
Radegonde’s common ancestor with both these sets of remains lived about 5000 years ago, but these remains were not found even close to each other.

In fact, these remains, if walking, are about 3299 km (2049 miles) apart, including two major water crossings.

Given that Radegonde is probably French, finding her ancestor around 5000 years ago in an Irish passage tomb in County Sligo, or in a location east of St. Petersburg, is extremely unlikely.

What IS likely, though, is that X2b4d’g descendants of your common ancestor with both sets of remains, 5000 years ago, went in multiple directions, meaning:

Radegonde’s ancestor found their way to France and along the way incurred the mutations that define X2b4d’g and X2b4g by the year 1600 when she lived, or about four hundred years ago.
Another X2b4 descendant found their way to what is today Ireland between 4600 and 5000 years ago
A third X2b4 descendant found their way to Russia between 800-1100 years ago, and 5000 years ago

If any question remains about the genesis of Radegonde’s ancestors being Native, Ancient Connections disproves it – BUT – there’s still an opportunity for misunderstanding, which we’ll see in a few minutes.

Ancient Connections Analysis Chart

I’ve created an analysis chart, so that I can explain the findings in a logical way.

Legend:

Hap = Haplogroup
M=male
F=female
U=unknown

Please note that ancient samples are often degraded and can be missing important mutations. In other words, the tree placement may be less specific for ancient samples. Every ancient sample is reviewed by FamilyTreeDNA’s genetic anthropologist before it’s placed on the tree.

Ancient samples use carbon dating to determine ages. Sometimes, the carbon date and the calculated haplogroup age are slightly “off.” The haplogroup age is a scientific calculation based on a genetic clock and is not based on either genealogy or ancient burials. The haplogroup age may change as the tree matures and more branches are discovered.

I’m dividing this chart into sections because I want to analyze the findings between groups.

The first entry is the earliest known ancestor of the current lineage – Radegonde Lambert, who was born about 1621, or roughly 400 years ago. I’ve translated all of the years into “years ago” to avoid any confusion.

If you wish to do the same, with CE (Current or Common Era) dates, subtract the date from 2000. 300 CE= (2000-300) or1700 years ago. With BCE dates, add 2000 to the BCE number. 1000 BCE= (1000+2000) or 3000 years ago.

Connection Identity	Age Years Ago	Location & Cultural Group	Hap	Hap Age Years Ago	Shared Hap	Shared Hap Age Years Ago
Radegonde Lambert (F)	400	France or Canada -Acadian	X2b4g	1700	X2b4	5000
Carrowkeel 534 (M)	4600-5100	Sligo, Ireland – Neolithic Europe	X2b4	5000	X2b4	5000
Ladoga 16 (M)	800-1100	Ladoga, Russia Fed – Viking Russia	X2b4	5000	X2b4	5000

Age Years Ago – When the Ancient Connection lived
Hap Age Years Ago – When the haplogroup of the Ancient Connection (X2b4) originated, meaning was born
Shared Hap Age Years Ago – When the Shared Ancestor of everyone in the Shared Haplogroup originated (was born)

In this first section, the haplogroup of the Ancient Connections and the Shared Haplogroup is the same, but that won’t be the case in the following sections. Radegonde Lambert’s haplogroup is different than her shared haplogroup with the Ancient Connections.

Let’s assume we are starting from scratch with Radegonde.

The first question we wanted to answer is whether or not Radegonde is European, presumably French like the rest of the Acadians, or if she was Native. That’s easy and quick.

Native people crossed Beringia, arriving from Asia someplace between 12,000 and 25,000 years ago in multiple waves of migration that spread throughout both North and South America.

Therefore, given that the first two samples, Carrowkeel 534 and Ladoga 16, share haplogroup X2b4, an upstream haplogroup with Radegonde Lambert, and haplogroup X2b4 was formed around 5000 years ago, the answer is that Radegonde’s X2b4 ancestor, whoever that was, clearly lived in Europe, NOT the Americas.

According to Discover, Haplogroup X2b4:

Was formed about 5000 years ago
Has 16 descendant haplogroups
Has 29 unnamed lineages (haplotype clusters or individuals with no match)
Includes testers whose ancestors are from 23 countries

The Country Frequency map shows the distribution of X2b4, including all descendant haplogroups. Please note that the percentages given are for X2b4 as a percentage of ALL haplogroups found in each colored country. Don’t be misled by the relative physical size of the US and Canada as compared to Europe.

The table view shows the total number of self-identified locations of the ancestors of people in haplogroup X2b4 and all downstream haplogroups.

The Classic Tree that we looked at earlier provides a quick view of X2b4, each descendant haplogroup and haplotype cluster, and every country provided by the 331 X2b4 testers.

For the X2b4 Ancient Connections, we’ve already determined:

That Radegonde’s ancestors were not Native
Carrowkeel 534 is a male and cannot be Radegonde’s ancestor. It’s extremely likely that Carrowkeel 534’s mother is not Radegonda’s ancestor either, based on several factors, including location.
Based on dates of when Ladoga 16 lived, and because he’s a male, he cannot be the ancestor of Radegonde Lambert.

Radegonda’s haplogroup was formed long before Ladoga 16 lived. Each Ancient Connection has this comparative Time Tree if you scroll down below the text.

Both Carrowkeel and Ladoga share an ancestor with our tester, and Radegonde, about 5000 years ago.

Think about how many descendants the X2b4 ancestor probably had over the next hundreds to thousands of years.

We know one thing for sure, absolutely, positively – X2b4 testers and descendant haplogroups live in 32 countries. People migrate – and with them, their haplogroups.

What can we learn about the genealogy and history of Radegonde Lambert and her ancestors?

We find the same haplogroup in multiple populations or cultures, at different times and in multiple places. Country boundaries are political and fluid. What we are looking for are patterns, or sometimes, negative proof, which is often possible at the continental level.

X2b4, excluding downstream haplogroups, is found in the following locations:

Bulgaria
Canada (2)
Czech Republic
England (2)
Finland (2)
France (3)
Germany (4)
Portugal
Scotland (2)
Slovakia (2)
Sweden (2)
UK (2)
Unknown (11)
US (2)

Note that there are three people in France with haplogroup X2b4 but no more refined haplogroup.

Looking at X2b4’s downstream haplogroups with representation in France, we find:

X2b4a (none)
X2b4b (none)
X2b4b1 (1)
X2b4d’g (none)
X2b4d (none)
X2b4g (24) – many from Radegonde’s line
X2b4e and subgroups (none)
X2b4f (none)
X2b4j and subgroups (none)
X2b4k (none)
X2b4l (1)
X2b4m (none)
X2b4n and subgroups (none)
X2b4o (none)
X2b4p (none)
X2b4r (none)
X2b4+16311 (none)

I was hoping that there would be an Ancient Connection for X2b4, X2b4d’g, or X2b4g someplace in or even near France – because that makes logical sense if Radegonde is from France.

All I can say is “not yet,” but new ancient sites are being excavated and papers are being released all the time.

Ok, so moving back in time, let’s see what else we can determine from the next set of Ancient Connections. Haplogroup X2b1”64 was formed about 5050 years ago.

Connection Identity	Age Years Ago	Location & Cultural Group	Hap	Hap Age Years Ago	Shared Hap	Shared Hap Age Years Ago
Radegonde Lambert (F)	400	France or Canada	X2b4g	1700
Carrowkeel 534 (M)	5100-4600	Sligo, Ireland – Neolithic Europe	X2b4	5000	X2b4	5000
Ladoga 16 (M)	800-1100	Ladoga, Russia Fed – Viking Russia	X2b4	5000	X2b4	5000
Parknabinnia 186 (M)	5516-5359	Clare, Ireland – Neolithic Europe	X2b1”64	5516-5259	X2b1”64	Before 5050 years ago
Rössberga 2 (M)	5339-5025	Vastergotland, Sweden – Funnel Beaker	X2b1”64	5516-5259	X2b1”64	Before 5050
Rössberga 29 (M)	5366-5100	Vastergotland, Sweden – Funnel Beaker and Early Plague	X2b1”64	5516-5259	X2b1”64	Before 5050
Rössberga 38 (M)	5340-5022	Vastergotland, Sweden – Funnel Beaker	X2b1”64	5516-5259	X2b1”64	Before 5050
Monte Sirai 797263 (U)	2600-2400	Monte Sirai, Italy (Sardinia) – Phoenicians	X2b35a1	3350	X2b1”64	5050
Bogovej 361 (F)	1000-1100	Lengeland, Denmark – Viking Denmark	X2b1”64	5516-5259	X2b1”64	5050
Ladoga 410 (M)	800-1000	Leningrad Oblast, Russia – Viking Russia	X2b1”64	5516-5259	X2b1”64	5050

Our first group ended with haplogroup X2b4, and our second group consists of haplogroup X2b1”64, the parent haplogroup of X2b4. X2b1”64 is a significantly larger haplogroup with many downstream branches found throughout Europe, parts of western Asia, the Levant, India, and New Zealand (which probably reflects a colonial era settler). The Country Frequency Map and Table are found here.

X2b1”64 is just slightly older than X2b4, but it’s much more widespread, even though they were born about the same time. Keep in mind that haplogroup origination dates shift as the tree is developed.

These seven individuals who share X2b1”64 as their haplogroup could be related to each other individually, meaning their MRCA, anytime between when they lived and when their haplogroup was formed.
The entire group of individuals all share the same haplogroup, so they all descend from the one woman who formed X2b1”64 about 5050 years ago. She is the shared ancestor of everyone in the haplogroup.

One X2b4 and one X2b1”64 individual are found in the same archaeological site in Russia. Their common ancestor would have lived between the time they both lived, about 800 years ago, to about 5000 years ago. It’s also possible that one of the samples could be incomplete.

A second X2b1”64 Ancient Connection is found in the Court Tomb in County Clare, Ireland, not far from the Carrowkeel 534 X2b4 site.

However, Monte Sirai is fascinating, in part because it’s not found near any other site. Monte Sirai is found all the way across France, on an island in the Tyrrhenian Sea.

It may be located “across France” today, but we don’t know that the Phoenician Monte Sirai site is connected with the Irish sites. We can’t assume that the Irish individuals arrived as descendants of the Monte Sirai people, even though it would conveniently fit our narrative – crossing France. Of course, today’s path includes ferries, which didn’t exist then, so if that trip across France did happen, it could well have taken a completely different path. We simply don’t know and there are very few samples available.

Three Ancient Connections are found in the Rössberga site in Sweden and another in Denmark.

Adding all of the Ancient sites so far onto the map, it looks like we have two clusters, one in the northern latitudes, including Denmark, Sweden, and Russia, and one in Ireland with passage burials, plus one single Connection in Monte Sirai.

If I were to approximate a central location between all three, that might be someplace in Germany or maybe further east. But remember, this is 5000 years ago and our number of samples, as compared to the population living at the time is EXTREMELY LIMITED.

Let’s move on to the next group of Ancient Connections, who have different haplogroups but are all a subset of haplogroup X2.

Identity	Age Years Ago	Location & Cultural Group	Hap	Hap Age Years Ago	Shared Hap	Shared Hap Age Years Ago
Radegonde Lambert (F)	400	France or Canada	X2b4g	1700
Carrowkeel 534 (M)	5100-4600	Sligo, Ireland – Neolithic Europe	X2b4	5000	X2b4	5000
Ladoga 16 (M)	800-1100	Ladoga, Russia Fed – Viking Russia	X2b4	5000	X2b4	5000
Parknabinnia 186 (M)	5516-5359	Clare, Ireland – Neolithic Europe	X2b1”64	5516-5259	X2b1”64	Before 5050
Ross Rössberga 2 (M)	5339-5025	Vastergotland, Sweden – Funnel Beaker	X2b1”64	5516-5259	X2b1”64	Before 5050
Rössberga 29 (M)	5366-5100	Vastergotland, Sweden – Funnel Beaker and Early Plague	X2b1”64	5516-5259	X2b1”64	Before 5050
Rössberga 38 (M)	5340-5022	Vastergotland, Sweden – Funnel Beaker	X2b1”64	5516-5259	X2b1”64	Before 5050
Monte Sirai 797263 (U)	2600-2400	Monte Sirai, Italy (Sardinia) – Phoenicians	X2b35a1	3350	X2b1”64	5050
Bogovej 361 (F)	1000-1100	Lengeland, Denmark – Viking Denmark	X2b1”64	5516-5259	X2b1”64	5050
Ladoga 410 (M)	800-1000	Leningrad Oblast, Russia – Viking Russia	X2b1”64	5516-5259	X2b1”64	5050
Barcin 31 (M)	8236-8417	Derekoy, Turkey – Neolithic Anatolia Ceramic	X2m2’5’7^	9200	X2b”aq	13,000
Abasar 55 (M)	500-800	Abasár Bolt-tető, Abasar, Hungary – Medieval Hungary	X2m1e	5350	X2b”aq	13,000
Gerdrup 214	3779-3889	Gerdrup, Sealand, Denmark – Middle Bronze Age	X2c1	3400	X2+225	13,000
Sweden Skara 275	800-1100	Varnhem, Skara, Sweden – Viking Sweden	X2c1	3400	X2+225	13,000
Kopparsvik 225	950-1100	Gotland, Sweden – Viking Sweden	X2z	5650	X2+225	13,000
Sandomierz 494	900-1100	Sandomierz, Poland – Viking Poland	X2c2b	1650	X2+225	13,000
Kennewick man	8390-9250	Kennewick, Washington – Native American	X2a2’3’4^	10,450	X2	13,000
Roopkund 39	80-306	Roopkund Lake, Uttarakhand, India – Historical India	X2d	13,000	X2	13,000

The next several Ancient Connections have haplogroups that are a subgroup of haplogroup X2. These people lived sometime between 500 years ago in Hungary, and 8390-9250 years ago when Kennewick Man lived in the present-day state of Washington in the US. Kennewick Man merits his own discussion, so let’s set him aside briefly while we discuss the others.

The important information to be gleaned here isn’t when these people lived, but when Radegonde shared a common ancestor with each of them. The shared haplogroup with all of these individuals was born about 13,000 years ago.

Looking at the map again, and omitting both X2 samples, we can see that the descendants of that shared ancestor 13,000 years ago are found more widely dispersed.

Including these additional burials on our map, it looks like we have a rather large Swedish and Viking cluster, where several of the older burials occurred prior to the Viking culture. We have a Southeastern Europe cluster, our two Irish tomb burials, and our remaining single Monte Sirai Phoenician burial on the island of Sardinia.

Stepping back one more haplogroup to X2, which was born about the same time, we add a burial in India, and Kennewick Man.

The Migration Map

The Migration map in Discover provides two different features.

The first is the literal migration map for the various ancestral haplogroups as they migrated out of Africa, if in fact yours did, culminating in your base haplogroup. In this case, the base haplogroup is X2, which is shown with the little red circle placed by FamilyTreeDNA. I’ve added the red squares, text and arrows for emphasis.
The second feature is the mapped Ancient Connections, shown with little brown trowels. Clicking on each one opens a popup box.

After haplogroup X2 was formed, it split into haplogroups X2a and X2b.

The X2a group, Kennewick Man’s ancestors, made their way eastward, across eastern Russia to Beringia where they crossed into the Americas.

They either crossed Beringia, follow the Pacific coastline, or both, eventually making their way inland, probably along the Hood River, to where Kennewick Man was found some 2,800 years later on the banks of the Kennewick River.

The X2b group made their way westward, across western Europe to a location, probably France, where Radegonde Lamberts’ ancestors lived, and where Radegonde set sail for Nova Scotia.

After being separated for nearly 13,000 years, the descendants of the single woman who founded haplogroup X2 and lived someplace in central Asia around 13,000 years ago would find themselves on opposite coasts of the same continent.

So, no, Radegonde Lambert was not Native American, but her 600^th matrilineal cousin or so, Kennewick Man, absolutely was.

Radegonde Lambert and Kennewick Man

Here’s where confirmation bias can rear its ugly head. If you’re just scanning the Ancient Connections and see Kennewick Man, it would be easy to jump to conclusions, leap for joy, slap a stamp of “confirmed Native American” on Radegonde Lambert, and never look further. And if one were to do that, they would be wrong.

Let’s work through our evaluation process using Discover.

Radegonde Lambert and Kinnewick Man, an early Native American man whose remains were found Kennewick, Washington in 1996, are both members of the broader haplogroup X2. Kennewick Man lived between 8290 and 9350 years ago, and their shared ancestor lived about 13,000 years ago – in Asia, where mitochondrial haplogroup X2 originated. This is the perfect example of one descendant line of a haplogroup, X2 in this case, going in one direction and a second one traveling in the opposite direction.

Two small groups of people were probably pursuing better hunting grounds, but I can’t help but think of a tundra version of the Hatfields and McCoys and cousin spats.

“I’m going this way. There are better fish on that side of the lake, and I won’t have to put up with you.”

“Fine, I’m going that way. There are more bears and better hunting up there anyway.”

Their wives, who are sisters, “Wait, when will I ever see my sister again?”

One went east and one went west.

X2a became Native American and X2b became European.

Looking back at our information about Kennewick Man, his haplogroup was born significantly before he lived.

He was born about 8390-9250 years ago, so let’s say 8820 years ago, and his haplogroup was born 10,500 years ago, so about 1680 years before he lived. That means there were many generations of women who carried that haplogroup before Kennewick Man.

Let’s Compare

Discover has a compare feature.

I want to Compare Radegonde Lambert’s haplogroup with Kennewick Man’s haplogroup X2a2’3’4^.

The Compare tool uses the haplogroup you are viewing, and you enter a second haplogroup to compare with the first.

The ancestral path to the shared ancestor, meaning their shared haplogroup, is given for each haplogroup entered. That’s X2 in this case. Then, from the shared haplogroup back in time to Mitochondrial Eve.

I prefer to view this information in table format, so I created a chart and rounded the haplogroup ages above X2.

	Hap Age – Years Ago	Radegonde’s Line	Shared Ancestors and Haplogroups	Kennewick’s Line	Hap Age – Years Ago
	143,000		mt-Eve
	130,000		L1”7
	119,000		L2”7
	99,000		L2’3’4’6
	92,000		L3’4’6
	73,500		L3’4
	61,000		L3
	53,000		N
	53,000		N+8701
	25,000		X
	22,500		X1’2’3’7’8
	13,000		X2 – Asia
	13,000	X2+225		X2a	10,500
	12,900	X2b”aq		X2a2’3’4^	10,400	Kennewick Man born c 8800 years ago
	11,000	X2b
	5,500	X2b1”64
	5,000	X2b4
	1,900	X2b4d’g
Radegonde Lambert born c 1661 – 400 years ago	1,700	X2b4g

More Ancient Connections

Radegonde Lambert’s matrilineal descendants have an additional dozen Ancient Connections that are found in upstream haplogroup N-8701. Their shared ancestors with Radegonde reach back to 53,000 years ago in a world far different than the one we inhabit today. I’m not going to list or discuss them, except for one.

Identity	Age Years Ago	Location & Cultural Group	Hap	Hap Age Years Ago	Shared Hap	Shared Hap Age Years Ago
Radegonde Lambert (F)	400	France or Canada	X2b4g	1700
Carrowkeel 534 (M)	5100-4600	Sligo, Ireland – Neolithic Europe	X2b4	5000	X2b4	5000
Ladoga 16 (M)	800-1100	Ladoga, Russia Fed – Viking Russia	X2b4	5000	X2b4	5000
Parknabinnia 186 (M)	5516-5359	Clare, Ireland – Neolithic Europe	X2b1”64	5516-5259	X2b1”64	Before 5050
Rössberga 2 (M)	5339-5025	Vastergotland, Sweden – Funnel Beaker	X2b1”64	5516-5259	X2b1”64	Before 5050
Rössberga 29 (M)	5366-5100	Vastergotland, Sweden – Funnel Beaker and Early Plague	X2b1”64	5516-5259	X2b1”64	Before 5050
Rössberga 38 (M)	5340-5022	Vastergotland, Sweden – Funnel Beaker	X2b1”64	5516-5259	X2b1”64	Before 5050
Monte Sirai 797263 (U)	2600-2400	Monte Sirai, Italy (Sardinia) – Phoenicians	X2b35a1	3350	X2b1”64	5050
Bogovej 361 (F)	1000-1100	Lengeland, Denmark – Viking Denmark	X2b1”64	5516-5259	X2b1”64	5050
Ladoga 410 (M)	800-1000	Leningrad Oblast, Russia – Viking Russia	X2b1”64	5516-5259	X2b1”64	5050
Barcin 31 (M)	8236-8417	Derekoy, Turkey – Neolithic Anatolia Ceramic	X2m2’5’7^	9200	X2b”aq	13,000
Abasar 55 (M)	500-800	Abasár Bolt-tető, Abasar, Hungary – Medieval Hungary	X2m1e	5350	X2b”aq	13,000
Gerdrup 214	3779-3889	Gerdrup, Sealand, Denmark – Middle Bronze Age	X2c1	3400	X2+225	13,000
Kopparsvik 225	950-1100	Gotland, Sweden – Viking Sweden	X2z	5650	X2+225	13,000
Sandomierz 494	900-1100	Sandomierz, Poland – Viking Poland	X2c2b	1650	X2+225	13,000
Sweden Skara 275	800-1100	Varnhem, Skara, Sweden – Viking Sweden	X2c1	3400	X2+225	13,000
Kennewick man	8390-9250	Kennewick, Washington – Native American	X2a2’3’4^	10,450	X2	13,000
Roopkund 39	80-306	Roopkund Lake, Uttarakhand, India – Historical India	X2d	13,000	X2	13,000
Ranis 10	43,500-47,000	Ranis, Germany – LRJ Hunger Gatherer	N3’10	53,000	N+8701	53,000
Zlatý kůň woman	47,000	Czech Republic –	N+8701	53,000	N+8701	53,000

Zlatý kůň Woman

Zlatý kůň Woman lived some 43,000 years ago and her remains were discovered in the Czech Republic in 1950.

Believed to be the first anatomically modern human to be genetically sequenced, she carried about 3% Neanderthal DNA. Europeans, Asians and indigenous Americans carry Neanderthal DNA as well.

Unlike many early remains, Zlatý kůň Woman’s facial bones have been scanned and her face approximately reconstructed.

There’s something magical about viewing a likeness of a human that lived more than 40,000 years ago, and to whom I’m at least peripherally related.

Like all other Ancient Connections, it’s unlikely that I descend from Zlatý kůň Woman herself, but she is assuredly my very distant cousin.

What else do we know about Zlatý kůň Woman? Quoting from her Ancient Connection:

She lived during one of the coldest periods of the last ice age, surviving in harsh tundra conditions as part of a small hunter-gatherer group. She died as a young adult, though the cause of death remains unknown.

Her brain cavity was larger than that of modern humans in the comparative database, another trait showing Neanderthal affinity. While the exact colors of her features cannot be determined from available evidence, researchers created both a scientific grayscale model and a speculative version showing her with dark curly hair and brown eyes.

Zlatý kůň Woman may or may not have direct descendants today, but her haplogroup ancestors certainly do, and Radegonde Lambert is one of them, which means Radegonde’s matrilineal ancestors and descendants are too.

Ancient Connections for Genealogy

While Ancient Connections are fun, they are more than just amusing.

You are related through your direct matrilineal (mitochondrial) line to every one of your mtDNA Discover Ancient Connections. Everyone, males and females, can take a mitochondrial DNA test.

I find people to test for the mitochondrial DNA of each of my ancestral lines – like Radegonde Lambert, for example. I wrote about various methodologies to find your lineages, or people to test for them, in the article, Lineages Versus Ancestors – How to Find and Leverage Yours.

Radegonde’s mitochondrial DNA is the only key I have into her past, both recent and distant. It’s the only prayer I have of breaking through that brick wall, now or in the future.

Interpreted correctly, and with some luck, the closer Ancient Connections can provide genealogical insight into the origins of our ancestors. Not just one ancestor, but their entire lineage. While we will never know their names, we can learn about their cultural origins – whether they were Vikings, Phoenicians or perhaps early Irish buried in Passage Graves.

On a different line, an Ancient Connection burial with an exact haplogroup match was discovered beside the Roman road outside the European town where my ancestral line was believed to have been born.

Ancient Connections are one small glimpse into the pre-history of our genetic line. There are many pieces that are missing and will, in time, be filled in by ancient remains, Notable Connections, and present-day testers.

Check your matches and your Ancient Connections often. You never know when that magic piece of information you desperately need will appear.

What is waiting for you?

_____________________________________________________________

Share the Love!

You’re always welcome to forward articles or links to friends and share on social media.

If you haven’t already subscribed (it’s free,) you can receive an e-mail whenever I publish by clicking the “follow” button on the main blog page, here.

You Can Help Keep This Blog Free

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase your price but helps me keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Uploads

FamilyTreeDNA – Y, mitochondrial and autosomal DNA testing
MyHeritage DNA – Autosomal DNA test
MyHeritage FREE DNA file upload – Upload your DNA file from other vendors free
AncestryDNA – Autosomal DNA test
AncestryDNA Plus Traits
23andMe Ancestry – Autosomal DNA only, no Health
23andMe Ancestry Plus Health

Genealogy Products and Services

MyHeritage Subscription with Free Trial
Legacy Family Tree Webinars – Genealogy and DNA classes, subscription-based, some free
Legacy Family Tree Software – Genealogy software for your computer
OldNews – Old Newspapers with links to save to MyHeritage trees
MyHeritage Omni comprehensive “everything included” subscription plan
Newspapers.com – Search newspapers for your ancestors
NewspaperArchive – Search different newspapers for your ancestors

My Books

DNA for Native American Genealogy – by Roberta Estes, for those ordering the e-book from anyplace, or paperback within the United States
DNA for Native American Genealogy – for those ordering the paperback outside the US
The Complete Guide to FamilyTreeDNA – Y-DNA, Mitochondrial, Autosomal and X-DNA – for those ordering the e-book from anyplace, or paperback within the United States
The Complete Guide to FamilyTreeDNA – Y-DNA, Mitochondrial, Autosomal and X-DNA for those ordering the paperback from outside the US

Genealogy Books

Genealogical.com – Lots of wonderful genealogy research books
American Ancestors – Wonderful selection of genealogy books

Genealogy Research

Legacy Tree Genealogists – Professional genealogy research

The Best of 2022

Posted on January 6, 2023 by Roberta Estes

It’s that time of year where we look both backward and forward.

Thank you for your continued readership! Another year under our belts!

I always find it interesting to review the articles you found most interesting this past year.

In total, I published 97 articles in 2022, of which 56 were directly instructional about genetic genealogy. I say “directly instructional,” because, as you know, the 52 Ancestors series of articles are instructional too, but told through the lives of my ancestors. That leaves 41 articles that were either 52 Ancestors articles, or general in nature.

It has been quite a year.

2022 Highlights

In a way, writing these articles serves as a journal for the genetic genealogy community. I never realized that until I began scanning titles a year at a time.

Highlights of 2022 include:

Multiple ancient DNA burial sites with Y and mitochondrial DNA results
Being a guest on Shamele Jordon’s Genealogy Quick Start television program and the Research Like a Pro Podcast with Nicole Dyer and Diana Elder. I participated in several other special events with organizations in 2022 as well. (No wonder I’m tired.)
Several RootsTech articles – RootsTech offered so much in 2022 – and 2023 is going to be amazing too, both in person and virtually. You can still view the 2022 articles here.
Genetic Affairs released their amazing AutoKinship
The East Coast Genetic Genealogy Conference (ECGGC) launched for the first time in 2022 and will also be having a 2023 fall conference, October 6-8. Yes, I’m planning to present there too, hopefully in person.
The 1950 census release on April Fool’s Day – I still can’t find my mother.
The Million Mito team update and also the publication of the amazing haplogroup L7 discovery paper, plus an accompanying video. At year-end, the team was honored that our paper was included in the Nature’s Editor’s Choice Collection.
Mitochondrial DNA webinar at Legacy Family Tree Webinars. That’s still available along with hundreds of other titles, here.
Ancestry’s SideView and that Ancestry only shows shared matches of 20 cM and greater which is very confusing for genealogists. Also, how to share DNA results and tree access with others, which is crucial for research.
FamilyTreeDNA’s amazing DISCOVER Haplogroup Reports tool for Y-DNA launched. Not long after that, their Y-DNA tree passed 60,000 branches.
Tips and tricks for working with Theories of Family Relativity at MyHeritage. What a great tool!
I launched the new In Search Of series to celebrate the 10^th anniversary of this blog. I then created a resource page that includes all six of the In Search Of articles to date, and there will be more in 2023. 2022 topics include:
- Discovering that (ouch) you’re not genetically related to your family
- Endogamy
- Vendor comparisons and testing strategy
- How to tell the difference between full and half siblings
- Determining match relationships,
Downloading Match and Segment files at the various vendors
Endogamy – how to tell if you have it and what to do
Vendor Features, Strengths and Testing Strategies – these details change often
Why connecting your DNA test (correctly) is important at each vendor and how to do it
Tools to determine if your female ancestor was Native American, or not.
Full or half siblings and how to tell the difference
Big Y-DNA case study with a jaw-dropping outcome
MyHeritage’s new artificial intelligence (AI) time machine lets you see yourself and your ancestors in period and historical settings. I’m still super geeked by this and you’re likely to see more from me about this in 2023. This is just pure fun!!
Basic education critical for everyone about your chromosomes and genealogy. You can’t understand genetic genealogy without understanding this basic information, and why people who match you on the same segment may not match each other. Don’t be lulled into incorrect conclusions.

Which articles were your favorites that were published in 2022, and why?

Your Favorites

Often, the topics I select for articles are directly related to your comments, questions and suggestions, especially if I haven’t covered the topic previously, or it needs to be featured again. Things change in this industry, often. That’s a good thing!

However, some articles become forever favorites. Current articles don’t have enough time to amass the number of views accumulated over years for articles published earlier, so recently published articles are often NOT found in the all-time favorites list.

Based on views, what are my readers’ favorites and what do they find most useful?

In the chart below, the 2022 ranking is not just the ranking of articles published in 2022, but the ranking of all articles based on 2022 views alone. Not surprisingly, six of the 15 favorite 2022 articles were published in 2022.

The All-Time Ranking is the ranking for those 2022 favorites IF they fell within the top 15 in the forever ranking, over the entire decade+ that this blog has existed.

Drum roll please!!!

Article Title	Publication Date	2022 Ranking	All-Time Ranking
Concepts – Calculating Ethnicity Percentages	January 2017	1	2
Proving Native American Ancestry Using DNA	December 2012	2	1
Ancestral DNA Percentages – How Much of Them in in You?	June 2017	3	5
AutoKinship at GEDmatch by Genetic Affairs	February 2022	4
442 Ancient Viking Skeletons Hold DNA Surprises – Does Your Y or Mitochondrial DNA Match? Daily Updates Here	September 2020	5
The Origins of Zana of Abkhazia	July 2021	6
Full or Half Siblings	April 2019	7	15
Ancestry Rearranged the Furniture	January 2022	8
DNA from 459 Ancient British Isles Burials Reveals Relationships – Does Yours Match?	February 2022	9
DNA Inherited from Grandparents and Great-Grandparents	January 2020	10
Ancestry Only Shows Shared Matches of 20 cM and Greater – What That Means & Why It Matters	May 2022	11
How Much Indian Do I Have in Me???	June 2015	12	8
Top Ten RootsTech 2022 DNA Sessions + All DNA Session Links	March 2022	13
FamilyTreeDNA DISCOVER Launches – Including Y DNA Haplogroup Ages	June 2022	14
Ancient Ireland’s Y and Mitochondrial DNA – Do You Match???	November 2020	15

2023 Suggestions

I have a few articles already in the works for 2023, including some surprises. I’ll unveil one very soon.

We will be starting out with:

Information about RootsTech where I’ll be giving at least 7 presentations, in person, and probably doing a book signing too. Yes, I know, 7 sessions – what was I thinking? I’ve just missed everyone so very much.
An article about how accurately Ancestry’s ThruLines predicts Potential Ancestors and a few ways to prove, or disprove, accuracy.
The continuation of the “In Search Of” series.

As always, I’m open for 2023 suggestions.

In the comments, let me know what topics you’d like to see.

_____________________________________________________________

Follow DNAexplain on Facebook, here or follow me on Twitter, here.

Share the Love!

You’re always welcome to forward articles or links to friends and share on social media.

If you haven’t already subscribed (it’s free,) you can receive an email whenever I publish by clicking the “follow” button on the main blog page, here.

You Can Help Keep This Blog Free

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Uploads

FamilyTreeDNA – Y, mitochondrial and autosomal DNA testing
MyHeritage DNA – Autosomal DNA test
MyHeritage FREE DNA file upload – Upload your DNA file from other vendors free
AncestryDNA – Autosomal DNA test
23andMe Ancestry – Autosomal DNA only, no Health
23andMe Ancestry Plus Health

Genealogy Products and Services

MyHeritage FREE Tree Builder – Genealogy software for your computer
MyHeritage Subscription with Free Trial
Legacy Family Tree Webinars – Genealogy and DNA classes, subscription-based, some free
Legacy Family Tree Software – Genealogy software for your computer
Newspapers.com – Search newspapers for your ancestors
NewspaperArchive – Search different newspapers for your ancestors

My Book

DNA for Native American Genealogy – by Roberta Estes, for those ordering the e-book from anyplace, or paperback within the United States
DNA for Native American Genealogy – for those ordering the paperback outside the US

Genealogy Books

Genealogical.com – Lots of wonderful genealogy research books

Genealogy Research

Legacy Tree Genealogists – Professional genealogy research

2021 Favorite Articles

Posted on December 31, 2021 by Roberta Estes

It’s that time of the year again when we welcome the next year.

2021 was markedly different than anything that came before. (Is that ever an understatement!)

Maybe you had more time for genealogy and spent time researching!

So, what did we read in 2021? Which of my blog articles were the most popular?

In reverse order, beginning with number 10, we have:

How Much Indian Do I Have in Me?

This timeless article published in 2015 explains how to calculate the amount of any specific heritage you carry based on your ancestors.

Migration Pedigree Chart

Just something fun that’s like your regular pedigree chart, except color coded locations instead of ancestors. Here’s mine

AutoSegment Triangulation Cluster Tool at GEDmatch

The Autosegment Triangulation Cluster Tool is a brand new tool introduced in October 2021. Created by Genetic Affairs for GEDmatch, this tool combines autoclusters and triangulation.

DNA Inherited from Grandparents and Great-Grandparents

Many people don’t realize that we actually don’t inherit exactly 25% of our DNA from each grandparent, nor why.

This enlightening article co-authored with statistician Philip Gammon explains how this works, and why it affects all of your matches.

442 Ancient Viking Skeletons Hold DNA Surprises – Does Your Y or Mitochondrial DNA Match?

Who doesn’t love learning about ancient DNA and the messages it conveys. Does your Y or mitochondrial DNA match any of these burials? Take a look. You might be surprised.

Full or Half Siblings?

How can you tell if you are full or half siblings with another person? You might think this is a really straightforward question with an easy answer, but it isn’t. And trust me, if you EVER find yourself in a position of needing to know, you really need to know urgently.

Ancestral DNA Percentages – How Much of Them is in You?

Using simple match, it’s easy to figure how much of your ancestor’s DNA you “should” have, but that’s now how inheritance actually works. This article explains why and shows different inheritance scenarios.

Clock is Ticking: In 28 Days Ancestry Can Do Anything They Want With Every Image in Your Tree

That 28 day timer has expired, but the article can still be useful in terms of educating yourself. This should also be read in conjunction with Ancestry Retreats, by Judy Russell.

Concepts: Calculating Ethnicity Percentages

If I had a dollar for every time I’ve heard someone say that their ethnicity percentages were “wrong,” I’d be a rich woman, living in a villa in sun-drenched Tuscany😊

This extremely popular article has either been first or second every year since it was published. Ethnicity is both exciting and perplexing.

As genealogists, the first thing we need to do is to calculate what, according to our genealogy, we would expect those percentages to be. Of course, we also need to factor in the fact that we don’t inherit exactly the same amount of DNA from each grandparent. I explain how I calculated my “expected” percentages of ethnicity based on my known tree. That’s the best place to start.

Please note that I am no longer updating the vendor comparison charts in the article. Some vendors no longer release updates to the entire database at the same time, and some “tweak” results periodically without making an announcement. You’ll need to compare your own results at the different vendors at the same point in time to avoid comparing apples and oranges.

The #1 Article for 2021 is…

Proving Native American Ancestry Using DNA

This article has either been first (7 times) or second (twice) for 9 years running. Now you know why I chose this topic for my new book, DNA for Native American Genealogy.

If you’re searching for your Native American ancestry, I’ve provided step-by-step instructions, both with and without some percentage of Native showing in your autosomal DNA percentages.

Make 2022 a Great Year!

Here’s wishing you the best in 2022. I hope your brick walls cave. What are you doing to help that along? Do you have a strategy in mind?

__________________________________________________________

Follow DNAexplain on Facebook, here or follow me on Twitter, here. You can also subscribe to receive emails when I publish articles by clicking the “Follow” button at www.DNAexplain.com.

You’re always welcome to forward articles or links to friends.

Help Out, Please

Thank you so much.

DNA Purchases and Free Uploads

FamilyTreeDNA – Y, mitochondrial, and autosomal DNA testing
MyHeritage DNA – Autosomal DNA test
MyHeritage FREE DNA file upload – Upload your DNA file from other vendors free
AncestryDNA – Autosomal DNA test
23andMe Ancestry – Autosomal DNA only, no Health
23andMe Ancestry Plus Health

Genealogy Products and Services

MyHeritage FREE Tree Builder – Genealogy software for your computer
MyHeritage Subscription with Free Trial
Legacy Family Tree Webinars – Genealogy and DNA classes, subscription-based, some free
Legacy Family Tree Software – Genealogy software for your computer
Charting Companion – Charts and Reports to use with your genealogy software or FamilySearch
RootsMagic Software – Genealogy software for your computer
Newspapers.com – Search newspapers for your ancestors
NewspaperArchive– Search different newspapers for your ancestors

My Book

DNA for Native American Genealogy – by Roberta Estes
DNA for Native American Genealogy – for those ordering outside the US

Genealogy Books

Genealogical.com – Lots of wonderful genealogy research books

Genealogy Research

Legacy Tree Genealogists – Professional genealogy research

2018 – The Year of the Segment

Posted on January 1, 2019 by Roberta Estes

Looking in the rear view mirror, what a year! Some days it’s been hard to catch your breath things have been moving so fast.

What were the major happenings, how did they affect genetic genealogy and what’s coming in 2019?

The SNiPPY Award

First of all, I’m giving an award this year. The SNiPPY.

Yea, I know it’s kinda hokey, but it’s my way of saying a huge thank you to someone in this field who has made a remarkable contribution and that deserves special recognition.

Who will it be this year?

Drum roll…….

The 2018 SNiPPY goes to…

DNAPainter – The 2018 SNiPPY award goes to DNAPainter, without question. Applause, everyone, applause! And congratulations to Jonny Perl, pictured below at Rootstech!

Jonny Perl created this wonderful, visual tool that allows you to paint your matches with people on your chromosomes, assigning the match to specific ancestors.

I’ve written about how to use the tool with different vendors results and have discovered many different ways to utilize the painted segments. The DNA Painter User Group is here on Facebook. I use DNAPainter EVERY SINGLE DAY to solve a wide variety of challenges.

What else has happened this year? A lot!

Ancient DNA – Academic research seldom reports on Y and mitochondrial DNA today and is firmly focused on sequencing ancient DNA. Ancient genome sequencing has only recently been developed to a state where at least some remains can be successfully sequenced, but it’s going great guns now. Take a look at Jennifer Raff’s article in Forbes that discusses ancient DNA findings in the Americas, Europe, Southeast Asia and perhaps most surprising, a first generation descendant of a Neanderthal and a Denisovan.

From Early human dispersals within the Americas by Moreno-Mayer et al, Science 07 Dec 2018

Inroads were made into deeper understanding of human migration in the Americas as well in the paper Early human dispersals within the Americas by Moreno-Mayer et al.

I look for 2019 and on into the future to hold many more revelations thanks to ancient DNA sequencing as well as using those sequences to assist in understanding the migration patterns of ancient people that eventually became us.

Barbara Rae-Venter and the Golden State Killer Case

Using techniques that adoptees use to identify their close relatives and eventually, their parents, Barbara Rae-Venter assisted law enforcement with identifying the man, Joseph DeAngelo, accused (not yet convicted) of being the Golden State Killer (GSK).

A very large congratulations to Barbara, a retired patent attorney who is also a genealogist. Nature recognized Ms. Rae-Venter as one of 2018’s 10 People Who Mattered in Science.

DNA in the News

DNA is also represented on the 2018 Nature list by Viviane Slon, a palaeogeneticist who discovered an ancient half Neanderthal, half Denisovan individual and sequenced their DNA and He JianKui, a Chinese scientist who claims to have created a gene-edited baby which has sparked widespread controversy. As of the end of the year, He Jiankui’s research activities have been suspended and he is reportedly sequestered in his apartment, under guard, although the details are far from clear.

In 2013, 23andMe patented the technology for designer babies and I removed my kit from their research program. I was concerned at the time that this technology knife could cut two ways, both for good, eliminating fatal disease-causing mutations and also for ethically questionable practices, such as eugenics. I was told at the time that my fears were unfounded, because that “couldn’t be done.” Well, 5 years later, here we are. I expect the debate about the ethics and eventual regulation of gene-editing will rage globally for years to come.

Elizabeth Warren’s DNA was also in the news when she took a DNA test in response to political challenges. I wrote about what those results meant scientifically, here. This topic became highly volatile and politicized, with everyone seeming to have a very strongly held opinion. Regardless of where you fall on that opinion spectrum (and no, please do not post political comments as they will not be approved), the topic is likely to surface again in 2019 due to the fact that Elizabeth Warren has just today announced her intention to run for President. The good news is that DNA testing will likely be discussed, sparking curiosity in some people, perhaps encouraging them to test. The bad news is that some of the discussion may be unpleasant at best, and incorrect click-bait at worst. We’ve already had a rather unpleasant sampling of this.

Law Enforcement and Genetic Genealogy

The Golden State Killer case sparked widespread controversy about using GedMatch and potentially other genetic genealogy data bases to assist in catching people who have committed violent crimes, such as rape and murder.

GedMatch, the database used for the GSK case has made it very clear in their terms and conditions that DNA matches may be used for both adoptees seeking their families and for other uses, such as law enforcement seeking matches to DNA sequenced during a criminal investigation. Since April 2018, more than 15 cold case investigations have been solved using the same technique and results at GedMatch. Initially some people removed their DNA from GedMatch, but it appears that the overwhelming sentiment, based on uploads, is that people either aren’t concerned or welcome the opportunity for their DNA matches to assist apprehending criminals.

Parabon Nanolabs in May established a genetic genealogy division headed by CeCe Moore who has worked in the adoptee community for the past several years. The division specializes in DNA testing forensic samples and then assisting law enforcement with the associated genetic genealogy.

Currently, GedMatch is the only vendor supporting the use of forensic sample matching. Neither 23anMe nor Ancestry allow uploaded data, and MyHeritage and Family Tree DNA’s terms of service currently preclude this type of use.

MyHeritage

Wow talk about coming onto the DNA world stage with a boom.

MyHeritage went from a somewhat wobbly DNA start about 2 years ago to rolling out a chromosome browser at the end of January and adding important features such as SmartMatching which matches your DNA and your family trees. Add triangulation to this mixture, along with record matching, and you’re got a #1 winning combination.

It was Gilad Japhet, the MyHeritage CEO who at Rootstech who christened 2018 “The Year of the Segment,” and I do believe he was right. Additionally, he announced that MyHeritage partnered with the adoption community by offering 15,000 free kits to adoptees.

In November, MyHeritage hosted MyHeritage LIVE, their first user conference in Oslo, Norway which focused on both their genealogical records offerings as well as DNA. This was a resounding success and I hope MyHeritage will continue to sponsor conferences and invest in DNA. You can test your DNA at MyHeritage or upload your results from other vendors (instructions here). You can follow my journey and the conference in Olso here, here, here, here and here.

GDPR

GDPR caused a lot of misery, and I’m glad the implementation is behind us, but the the ripples will be affecting everyone for years to come.

GDPR, the European Data Protection Regulation which went into effect on May 25, 2018 has been a mixed and confusing bag for genetic genealogy. I think the concept of users being in charge and understanding what is happened with their data, and in this case, their data plus their DNA, is absolutely sound. The requirements however, were created without any consideration to this industry – which is small by comparison to the Googles and Facebooks of the world. However, the Googles and Facebooks of the world along with many larger vendors seem to have skated, at least somewhat.

Other companies shut their doors or restricted their offerings in other ways, such as World Families Network and Oxford Ancestors. Vendors such as Ancestry and Family Tree DNA had to make unpopular changes in how their users interface with their software – in essence making genetic genealogy more difficult without any corresponding positive return. The potential fines, 20 million plus Euro for any company holding data for EU residents made it unwise to ignore the mandates.

In the genetic genealogy space, the shuttering of both YSearch and MitoSearch was heartbreaking, because that was the only location where you could actually compare Y STR and mitochondrial HVR1/2 results. Not everyone uploaded their results, and the sites had not been updated in a number of years, but the closure due to GDPR was still a community loss.

Today, mitoydna.org, a nonprofit comprised of genetic genealogists, is making strides in replacing that lost functionality, plus, hopefully more.

On to more positive events.

Family Tree DNA

In April, Family Tree DNA announced a new version of the Big Y test, the Big Y-500 in which at least 389 additional STR markers are included with the Big Y test, for free. If you’re lucky, you’ll receive between 389 and 439 new markers, depending on how many STR markers above 111 have quality reads. All customers are guaranteed a minimum of 500 STR markers in total. Matching was implemented in December.

These additional STR markers allow genealogists to assemble additional line marker mutations to more granularly identify specific male lineages. In other words, maybe I can finally figure out a line marker mutation that will differentiate my ancestor’s line from other sons of my founding ancestor😊

In June, Family Tree DNA announced that they had named more than 100,000 SNPs which means many haplogroup additions to the Y tree. Then, in September, Family Tree DNA published their Y haplotree, with locations, publicly for all to reference.

I was very pleased to see this development, because Family Tree DNA clearly has the largest Y database in the industry, by far, and now everyone can reap the benefits.

In October, Family Tree DNA published their mitochondrial tree publicly as well, with corresponding haplogroup locations. It’s nice that Family Tree DNA continues to be the science company.

You can test your Y DNA, mitochondrial or autosomal (Family Finder) at Family Tree DNA. They are the only vendor offering full Y and mitochondrial services complete with matching.

2018 Conferences

Of course, there are always the national conferences we’re familiar with, but more and more, online conferences are becoming available, as well as some sessions from the more traditional conferences.

I attended Rootstech in Salt Lake City in February (brrrr), which was lots of fun because I got to meet and visit with so many people including Mags Gaulden, above, who is a WikiTree volunteer and writes at Grandma’s Genes, but as a relatively expensive conference to attend, Rootstech was pretty miserable. Rootstech has reportedly made changes and I hope it’s much better for attendees in 2019. My attendance is very doubtful, although I vacillate back and forth.

On the other hand, the MyHeritage LIVE conference was amazing with both livestreamed and recorded sessions which are now available free here along with many others at Legacy Family Tree Webinars.

Family Tree University held a Virtual DNA Conference in June and those sessions, along with others, are available for subscribers to view.

The Virtual Genealogical Association was formed for those who find it difficult or impossible to participate in local associations. They too are focused on education via webinars.

Genetic Genealogy Ireland continues to provide their yearly conference sessions both livestreamed and recorded for free. These aren’t just for people with Irish genealogy. Everyone can benefit and I enjoy them immensely.

Bottom line, you can sit at home and educate yourself now. Technology is wonderful!

2019 Conferences

In 2019, I’ll be speaking at the National Genealogical Society Family History Conference, Journey of Discovery, in St. Charles, providing the Special Thursday Session titled “DNA: King Arthur’s Mighty Genetic Lightsaber” about how to use DNA to break through brick walls. I’ll also see attendees at Saturday lunch when I’ll be providing a fun session titled “Twists and Turns in the Genetic Road.” This is going to be a great conference with a wonderful lineup of speakers. Hope to see you there.

There may be more speaking engagements at conferences on my 2019 schedule, so stay tuned!

The Leeds Method

In September, Dana Leeds publicized The Leeds Method, another way of grouping your matches that clusters matches in a way that indicates your four grandparents.

I combine the Leeds method with DNAPainter. Great job Dana!

Genetic Affairs

In December, Genetic Affairs introduced an inexpensive subscription reporting and visual clustering methodology, but you can try it for free.

I love this grouping tool. I have already found connections I didn’t know existed previously. I suggest joining the Genetic Affairs User Group on Facebook.

DNAGedcom.com

I wrote an article in January about how to use the DNAGedcom.com client to download the trees of all of your matches and sort to find specific surnames or locations of their ancestors.

However, in December, DNAGedcom.com added another feature with their new DNAGedcom client just released that downloads your match information from all vendors, compiles it and then forms clusters. They have worked with Dana Leeds on this, so it’s a combination of the various methodologies discussed above. I have not worked with the new tool yet, as it has just been released, but Kitty Cooper has and writes about it here. If you are interested in this approach, I would suggest joining the Facebook DNAGedcom User Group.

Rootsfinder

I have not had a chance to work with Rootsfinder beyond the very basics, but Rootsfinder provides genetic network displays for people that you match, as well as triangulated views. Genetic networks visualizations are great ways to discern patterns. The tool creates match or triangulation groups automatically for you.

Training videos are available at the website and you can join the Rootsfinder DNA Tools group at Facebook.

Chips and Imputation

Illumina, the chip maker that provides the DNA chips that most vendors use to test changed from the OmniExpress to the GSA chip during the past year. Older chips have been available, but won’t be forever.

The newer GSA chip is only partially compatible with the OmniExpress chip, providing limited overlap between the older and the new results. This has forced the vendors to use imputation to equalize the playing field between the chips, so to speak.

This has also caused a significant hardship for GedMatch who is now in the position of trying to match reasonably between many different chips that sometimes overlap minimally. GedMatch introduced Genesis as a sandbox beta version previously, but are now in the process of combining regular GedMatch and Genesis into one. Yes, there are problems and matching challenges. Patience is the key word as the various vendors and GedMatch adapt and improve their required migration to imputation.

DNA Central

In June Blaine Bettinger announced DNACentral, an online monthly or yearly subscription site as well as a monthly newsletter that covers news in the genetic genealogy industry.

Many educators in the industry have created seminars for DNACentral. I just finished recording “Getting the Most out of Y DNA” for Blaine.

Even though I work in this industry, I still subscribed – initially to show support for Blaine, thinking I might not get much out of the newsletter. I’m pleased to say that I was wrong. I enjoy the newsletter and will be watching sessions in the Course Library and the Monthly Webinars soon.

If you or someone you know is looking for “how to” videos for each vendor, DNACentral offers “Now What” courses for Ancestry, MyHeritage, 23andMe, Family Tree DNA and Living DNA in addition to topic specific sessions like the X chromosome, for example.

Social Media

2018 has seen a huge jump in social media usage which is both bad and good. The good news is that many new people are engaged. The bad news is that people often given faulty advice and for new people, it’s very difficult (nigh on impossible) to tell who is credible and who isn’t. I created a Help page for just this reason.

You can help with this issue by recommending subscribing to these three blogs, not just reading an article, to newbies or people seeking answers.

https://dna-explained.com/ (this blog)
https://thegeneticgenealogist.com/ (Blaine Bettinger’s)
https://www.legalgenealogist.com/ (Judy Russell’s)

Always feel free to post links to my articles on any social media platform. Share, retweet, whatever it takes to get the words out!

The general genetic genealogy social media group I would recommend if I were to select only one would be Genetic Genealogy Tips and Techniques. It’s quite large but well-managed and remains positive.

I’m a member of many additional groups, several of which are vendor or interest specific.

Genetic Snakeoil

Now the bad news. Everyone had noticed the popularity of DNA testing – including shady characters.

Be careful, very VERY careful who you purchase products from and where you upload your DNA data.

If something is free, and you’re not within a well-known community, then YOU ARE THE PRODUCT. If it sounds too good to be true, it probably is. If it sounds shady or questionable, it’s probably that and more, or less.

If reputable people and vendors tell you that no, they really can’t determine your Native American tribe, for example, no other vendor can either. Just yesterday, a cousin sent me a link to a “tribe” in Canada that will, “for $50, we find one of your aboriginal ancestors and the nation stamps it.” On their list of aboriginal people we find one of my ancestors who, based on mitochondrial DNA tests, is clearly NOT aboriginal. Snake oil comes in lots of flavors with snake oil salesmen looking to prey on other people’s desires.

When considering DNA testing or transfers, make sure you fully understand the terms and conditions, where your DNA is going, who is doing what with it, and your recourse. Yes, read every single word of those terms and conditions. For more about legalities, check out Judy Russell’s blog.

Recommended Vendors

All those DNA tests look yummy-good, but in terms of vendors, I heartily recommend staying within the known credible vendors, as follows (in alphabetical order).

For genetic genealogy for ethnicity AND matching:

23andMe
Ancestry
Family Tree DNA
GedMatch (not a vendor because they don’t test DNA, but a reputable third party)
MyHeritage

You can read about Which DNA Test is Best here although I need to update this article to reflect the 2018 additions by MyHeritage.

Understand that both 23andMe and Ancestry will sell your DNA if you consent and if you consent, you will not know who is using your DNA, where, or for what purposes. Neither Family Tree DNA, GedMatch, MyHeritage, Genographic Project, Insitome, Promethease nor LivingDNA sell your DNA.

The next group of vendors offers ethnicity without matching:

Genographic Project by National Geographic Society
Insitome
LivingDNA (currently working on matching, but not released yet)

Health (as a consumer, meaning you receive the results)

23andMe (limited health)
Promethease

Medical (as a contributor, meaning you are contributing your DNA for research)

23andMe
Ancestry
DNA.Land (not a testing vendor, doesn’t test DNA)

There are a few other niche vendors known for specific things within the genetic genealogy community, many of whom are mentioned in this article, but other than known vendors, buyer beware. If you don’t see them listed or discussed on my blog, there’s probably a reason.

What’s Coming in 2019

Just like we couldn’t have foreseen much of what happened in 2018, we don’t have access to a 2019 crystal ball, but it looks like 2019 is taking off like a rocket. We do know about a few things to look for:

MyHeritage is waiting to see if envelope and stamp DNA extractions are successful so that they can be added to their database.
www.totheletterDNA.com is extracting (attempting to) and processing DNA from stamps and envelopes for several people in the community. Hopefully they will be successful.
LivingDNA has been working on matching since before I met with their representative in October of 2017 in Dublin. They are now in Beta testing for a few individuals, but they have also just changed their DNA processing chip – so how that will affect things and how soon they will have matching ready to roll out the door is unknown.
Ancestry did a 2018 ethnicity update, integrating ethnicity more tightly with Genetic Communities, offered genetic traits and made some minor improvements this year, along with adding one questionable feature – showing your matches the location where you live as recorded in your profile. (23andMe subsequently added the same feature.) Ancestry recently said that they are promising exciting new tools for 2019, but somehow I doubt that the chromosome browser that’s been on my Christmas list for years will be forthcoming. Fingers crossed for something new and really useful. In the mean time, we can download our DNA results and upload to MyHeritage, Family Tree DNA and GedMatch for segment matching, as well as utilize Ancestry’s internal matching tools. DNA+tree matching, those green leaf shared ancestor hints, is still their strongest feature.
The Family Tree DNA Conference for Project Administrators will be held March 22-24 in Houston this year, and I’m hopeful that they will have new tools and announcements at that event. I’m looking forward to seeing many old friends in Houston in March.

Here’s what I know for sure about 2019 – it’s going to be an amazing year. We as a community and also as individual genealogists will be making incredible discoveries and moving the ball forward. I can hardly wait to see what quandaries I’ve solved a year from now.

What mysteries do you want to unravel?

I’d like to offer a big thank you to everyone who made 2018 wonderful and a big toast to finding lots of new ancestors and breaking down those brick walls in 2019.

Happy New Year!!!

______________________________________________________________

Disclosure

I receive a small contribution when you click on some (but not all) of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

Legacy Tree Genealogists for genealogy research

Homo Naledi – A New Species Discovered

Posted on September 11, 2015 by Roberta Estes

“Homo naledi” by Berger et al. 2015 – http://elifesciences.org/lookup/doi/10.7554/eLife.09560.019. Licensed under CC BY 4.0 via Commons – https://commons.wikimedia.org/wiki/File:Homo_naledi.jpg#/media/File:Homo_naledi.jpg

The Cradle of Humankind World Heritage Site near Johannesburg, South Africa has once again produced bones. Previous finds, nearly one third of all ancient hominin fossils found, date to 3.5 million years of age. This new find may be the bones of our ancestor, but regardless, they certainly are the bones of a new, previously unknown, species.

The announcement came this week and articles can be seen online in several locations. The National Geographic Society is a partner in the excavation and retrieval of the bones from a very difficult cave, Rising Star, through only a very small opening following a precipitous decline. Stated bluntly – this is a “scare the hell out of you” cave. Not exactly convenient or inviting.

“Dinaledi Chamber illustration” by Paul H. G. M. Dirks et al – http://elifesciences.org/content/4/e09561. Licensed under CC BY 4.0 via Commons – https://commons.wikimedia.org/wiki/File:Dinaledi_Chamber_illustration.jpg#/media/File:Dinaledi_Chamber_illustration.jpg

There was more than one skeleton present. In this article and video from the New York Times, you can see that many bones were recovered, quite obviously from multiple individuals. More than 1550 in total – representing at least 15 different individuals. How did they get in this extremely remote cave with very limited access in the first place? And why?

Is this a separate species from ours, or our ancestors? How long ago did they live, and where do they fit on the family tree? The scientists are now referring to the ancient family tree as a braided stream – a river that divides into channels only to converge again later.

These announcements are being followed by a special on Nova/National Geographic Special titled the “Dawn of Humanity” which premieres on Sept. 16, 2015 at 9 PM ET/8 Central on PBS and is streaming online now. This documentary details the discovery and excavation of the fossils in the cave including Homo Naledi.

In the mean time, take a look at this wonderful article, chock full of pictures of course, by National Geographic. If you subscribe to the National Geographic magazine, guess what will be on the cover of the October issue???

This article in New Scientist has a great reconstruction of the Homo Naledi skull, and states that no attempt has yet been made to extract DNA. I continue to remind myself that patience is a virtue.

______________________________________________________________

Disclosure

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

Legacy Tree Genealogists for genealogy research

Anzick Matching Update

Posted on January 5, 2015 by Roberta Estes

In response to my article about haplogroup C3*, a regular contributor, Armando, left the following comment:

“Roberta, there was a problem with the way Felix was processing files and he had to change the Clovis Anzick file three times at Gedmatch. The last one is kit F999919 uploaded October 8, 2014. You can see his post on that at http://www.fc.id.au/2014/10/new-clovis-anzick-1-kit-in-gedmatch.html

If you do one-to-many matching on Clovis Anzick F999919 at Gedmatch there is not a single person that reports to have mtDNA M. Your extracts for Clovis Anzick are from September 24, 2014 and therefore are based on a bad file which was kit F999912. The older bad kits F999912 and F999913 have been deleted from Gedmatch. Felix mentions the updates at http://www.fc.id.au/2014/09/clovis-anzick-1-dna-match-living-people.html“

This comment came in on Christmas Eve, and I replied that I would look into this after the holidays.

Given that it was Christmas Eve, I certainly wasn’t going to bother anyone over the holidays with questions, so I quickly ran a one to many compare for the current Anzick kit, F999919, and found at 5cM and below that there were 4 haplogroup M matches.

As I did before, I sent emails to those who provided e-mail addresses asking about their matrilineal heritage.

The first thing I wanted to do, of course, was to check with Felix. I knew that Felix had updated the kits, but my understanding was that he added SNPs from the various companies to create a single file with all the SNPs from all three testing companies, not that any file was bad, so to speak.

I asked Felix if the original files had problems or were bad, and here is his response.

“I can assure you none of the earlier/older versions uploaded to GEDmatch (kit# F999912 and F999913) of Clovis Anzick was bad.

F999912 – Contains only FTDNA SNPs extracted from VCF file provided by authors.
F999913 – Contains all SNPs used by DNA testing companies extracted from VCF file provided by authors.
F999919 – Contains all SNPs used by DNA testing companies processed from BAM file provided by authors.

Source files:

VCF Source: http://www.cbs.dtu.dk/suppl/clovis/data/Anzick-1/genotypes/
BAM Source: http://www.cbs.dtu.dk/suppl/clovis/data/Anzick-1/bams/

I removed the earlier versions not because they are bad but only to avoid redundancy for the same sample kit, and processed BAM file (which is a 41 GB file) contains significantly more SNPs compared to VCF source. Because the latest file has more SNPs, it is possible that some missing SNPs in earlier uploads (which was assumed as matching in GEDmatch) may actually have mismatches in new file and thus, could fall below the thresholds or could break the previously matching segment.

The difference in matches between F999912 and F999919 kit for Clovis Anzick is similar to difference in matches between a 23andMe V4 kit and V3 kit for the same person.”

After thinking about this some, it occurred to me that perhaps GedMatch was treating different files from different vendors differently in their matching and sorting routines. That might account for a difference in matching. So, I asked John Olson at GedMatch.

John’s reply is as follows:

“At one time, I did use different thresholds depending on which vendor was being compared to which other vendor. That was a holdover from when FTDNA had Affymetrix kits that were producing somewhat different results than Illumina kits. I have since changed the one-to-many thresholds to 5cM/500 SNPs for all comparisons. The one-to-one thresholds default to 7cm/700 SNPs. I believe I made that change about a year ago, but it may have been longer. At any rate, they are all the same now, and I’m pretty sure they are all the same since Felix has introduced the F9999xx kits. Another change made within the past year is to treat A=T and C=G for all comparisons. This was done to get rid of single SNP errors in the few cases where one vendor was reporting a different strand than another vendor. In a few cases, I have observed that this “heals” some single-SNP breaks in otherwise continuous matching segments.

It is possible that older one-to-many comparisons may have been made under slightly different conditions than newer ones. Older comparisons made with a 3cm/300 SNP threshold may show larger total segment match if they contained many very small matching segments. This usually happens with endogamous populations. Comparisons affected by the change to A=T, C=G may show a larger matching segment where 2 smaller matching segments existed previously.

Another issue to be aware of when comparing artificial kits is that there may be large gaps between the defined SNPs. So, even if there is a gap of a million SNPs, the GEDmatch comparison algorithm will treat them as contiguous. This works OK when everybody is using the same SNPs, but when the list of SNPs is significantly different, it may produce matches that are bogus. This is particularly obvious when generating artificial kits that are missing large segments of data. I have had to deal with this issue with phased kits and Lazarus kits by introducing the concept of a “hard break” that forces a break between smaller matching segments.”

I wanted to know how the three files that Felix prepared compared relative to the matches they produced. I originally ran several comparisons with each of the first two versions, kits F999912 and F999913, and I didn’t save all of the original files, but I do have at least one file saved from each version. Therefore, I dropped all three sets of results (F999912, F999913 and F999919) into a spreadsheet to see how matching compared between the three Anzick file versions.

Keep in mind that the first file (F999912) contained just the FTDNA SNPs, while the second (F999913) and third (F999919) files contain the SNPs from all of the testing companies. This could potentially make the participant files appear to have missing segments when the matching routine at GedMatch sees SNPs in the Anzick file not in the participant files. However, this shouldn’t be much different than comparing a file from two different vendors except that the Anzick file has the SNPs from all three vendors combined.

The first file from 9-23 at the default threshold had 491 matches, but I subsequently lowered the threshold so I could see as many matches as possible.

GedMatch only shows you your closest 1500 matches, although I now know that as of 12-31-2014, there were a total of 3442 Anzick matches at the 5cM threshold.

The second file from 9-29, run at 6cM had more than 1500 matches. I ran the third kit at default settings on December 27^th and it has 720 matches.

One would expect that the second and third files would have the effect of including more matches from both 23andMe and Ancestry since all of the SNPs utilized by those companies are included (if they are available in the Anzick sample.) We also have to remember that there are new files being uploaded from all three vendor sites on a daily basis, so the total available to match is also increasing. Of the 721 kit matches to F999919, 31 were shades of green which indicate that they have been uploaded during the last 30 days, so we could probably presume that about double that number were uploaded (and match) in two months or triple in three months, so probably about 100 new kits. Those kits would show in the match extraction for this month but not for the first month and possibly not for the second. However, all the kits that matched the first month at the highest threshold should still be showing in the second and third month. Let’s see if that holds true.

I dropped all three sets of data into a spreadsheet and colorized the rows.

Blue = F999912, first extraction, 9-23-2014
Yellow = F999913, second extraction, 9-29-2014
Pink = F999919, third extraction, 12-27-2014

Then I counted the number of blue rows, which are the first extraction, that had matches to both yellow and pink, or only yellow, the second extraction, or only pink, the third (current) extraction, or no matches at all.

You can see that the green grouping shows that all three match each other. The match between A003479 in both the second and third extraction could be because the kit was not present when the first extraction was done.

	All 3 match	1st to 2nd Only	1st to 3rd Only	No Match
Percent First Extraction Matches to Other Extractions	54%	36%	5%	5%

By percent, this is how the matching between kits worked. About half of the kits in the first extraction continued to match kits in both subsequent extractions. Of the remaining half, three quarters of the balance matches the second extraction only and a few match just the third extraction or no extraction at all. For the most part, there is no evident reason upon inspection why the kits would not match the second or third extraction, so the cause has to be a result of the additional SNPs or the matching routine or both. This is not to imply that the results are problematic, just that they are different than I would have expected.

A very low percentage of kits matched only between the first and third extracts and the same percentage had no matches in either the second or third extraction.

I took a closer look at the kits with no matches at all. All of them had relatively low threshold total cM and largest segment size. The smallest total cM was 7.1 and the largest was 8.2. The smallest segment was 7.1 and the largest segment was also 8.2. All of these entries had the total cM equal to the largest cM. It appears that these simply slipped below the match threshold, but that doesn’t appear to be the case because in the current (pink) extract, a total of 171 entries were at or below 8.2 total cM and 8.2 largest cM and several kits had the exact same cM as the kits that didn’t show up from the first (blue) extract as a match – so obviously something truly was different in the SNPs or how the matching was done.

Is there any correlation to the kits in the original extract that didn’t match any other extract in terms of which testing company the participants utilized?

One Ancestry kit (4%), 18 23andMe kits (64%), 7 Family Tree DNA kits (25%) and 2 FN kits (7%) didn’t match anyone. But how many kits were in the original extract from the various companies?

	Original Kit Matches	Second KitMatches	Current Kit Matches
Ancestry Kits (A)	26 (5%)	438 (29%)	199 (28%)
FTDNA Kits (F)	94 (19%)	295 (20%)	121 (17%)
Other F+ Kits*	15 (3%)	35 (2%)	15 (2%)
23andMe Kits (M)	354 (72%)	732 (49%)	382 (53%)

*FB, FN, FE, FV

The effect of the additional SNPs in the kits seems to have been to increase the Ancestry kit matches significantly.

It was interesting to see how the same person’s kit from different vendors compared as well. In this random example, the Family Finder kit has a higher total cM and largest segment than the 23andMe v3 kit.

Here’s a kit from one person at all three vendors, but the 23andMe kit is version 4, in which 23andMe significantly reduced the number of SNPs tested by about one third, from about 900,000 to about 600,000.

I wondered if there is a difference in what is reported based on the threshold selected. Now at first glance, one would think, “well of course there is a difference,” but the difference should be on the bottom end of the list. In other words, the top matches should be the top matches at 7cM, 6cM, 5cM, etc. The top matches at 7cM would still be the top at 6cM, just more smaller matches appended to the end of the match list – or that is what I would expect.

Let’s see if this holds true with the current file.

I ran the “one to many” option for the current Anzick kit, F999919, at seven different levels, on the same day, one right after the other, as follows:

7cM, 700 SNPs
6cM, 600 SNPs
5cM, 500 SNPs
4cM, 400 SNPs
3cM, 300 SNPs
2cM, 200 SNPs
1cM, 100 SNPs

The first extract produced 719 records. The rest were all over the 1500 threshold, so we only see the first 1500. Normally, for genealogy the 1500 threshold would certainly be adequate, but for research, the threshold is frustrating.

To make this easier let me say that the extracts from 5cM down through 1cM were exactly the same, but the extracts at 7, 6 and 5cM, respectively, were not.

Discussions with John Olson at GedMatch shed some light on why the 5cM through 1cM extracts were exactly the same.

“For the past year or so, the database has only stored matches down to 5 cM.”

I sure wish I had known that BEFORE I did all of those extracts.

I combined and color coded all 7 extractions into a spreadsheet.

Most of the grouping look like this where blue=7cm, pink=6cM, grn=5cM, purple=4cm, teal=3cm,apricot=2cm, yellow=1cm. Nice rainbows.

All of the matches from the 7cM extraction, with the exception of a few X matches at the end, some of which have no matches on chromosomes 1-22, are included in the 6cM and 5cM extractions, but after the first several records, they are not in the same position. In other words, they are not the top 719, in the same order, in either the 5 or 6cM extraction, but the 5cM through 1cM extractions are identical. Of course, now we know why the 5cM through 1cM matches are exact. From here forth in the article, I won’t mention the 4cM-1cM extracts because they are the same as the 5cM extract.

For example, looking at the kit in position 712, the last non-X match in the 7cM extract – you find this same kit at row 1140 in the 6cM extract and row 1489 in the 5cM extract.

The 6cM extract appears to have some issues. I ran this twice with the same parameters to be sure there wasn’t an error in how it was set up, and the two runs were identical.

There are about 350 individuals who show up in the 6cM extract who should show up in the 5cM extract as well, but who don’t show in the 5cM extract. They are under the threshold for the 7cM extract, so that is correct, but why are these 350 individuals not appearing as matches at the 5cM threshold?

The kits noted above are the largest non-matching total cM and largest cM that don’t show up in the 5cM extract. The smallest matches are 6.1 and 6.1, respectively.

Checking the 5cM extract, below, there are files with smaller total cMs and a smaller largest segment that are showing as matches.

However, looking at the kits with the smallest cMs at the 5cM level, the smallest total cMs is 6.9 and it is combined with the largest segment of 6.9 as well, so that is above the 6.8 and 6.8 shown above. The smallest individual segment is 5.1 but the total cM for that individual is 10.1. So obviously the matching threshold at GedMatch is some combination of both the total cM and the largest segment. This is somewhat unexpected, but doesn’t seem to be a red flag, just how this system works.

So, where are we?

I am glad to have Felix confirm that the files weren’t “bad,” only truly “new and improved,” and that the matching between the various files is pretty much as expected – and from various tests run, everything pretty much looks kosher. The newer files with all of the SNPs utilized by the companies seem to level the playing field, allowing Ancestry kits a better chance of matching.

Aside from my intense interest due to the Native American connection, this is also how I’ve been extracting potential Native American mitochondrial haplogroups from the Anzick matches, including haplogroup M, for my research notes. M is potentially a Native American haplogroup, but is as yet unproven. With haplogroup M showing up in these people who are often heavily Native, and often from Mexico, Central and South America where 80% of the mitochondrial population is believed to be of Native American heritage, it seems prudent to add them to my research notes for further research and possible proof in the future. I contact individuals and ask about their matrilineal heritage. If they don’t have Asian or genealogically proven heritage elsewhere, and their families emerge from the areas with high Native frequencies, I include them on the research list.

In the three days between the two extracts this past week, three of the four haplogroup M individuals were pushed below the match threshold and are no longer visible at the default level. Yes, I have confirmed hat they are still there just not visible at the 1500 match threshold.

I have contacted the individuals with e-mail addresses, asking about their matrilineal heritage. One person said the tester’s mother’s heritage was from India, so that haplogroup M is not on the research list, of course, because it is proven to be from elsewhere – a place where haplogroup M and subgroups are quite common.

In total, there were 15 new potentially Native DNA mitochondrial DNA haplogroups listed in the 12-27 extract. I’ll be adding those to my research notes as soon as I have the opportunity to contact these folks and ask about their known matrilineal genealogy.

I didn’t really anticipate that there would be so much change, nor so quickly, so it looks like I’m going to have to check the Anzick matches for potential Native mitochondrial haplogroups much more often.

Since it looks like there may be lots of additions over time, far more than I expected, I’ll also be going back and making better notes in my research file. I will, for example, note the kit number and date for all of the extractions. For this and future extractions, I’ll also be listing the number of results per haplogroup. I think that would be valuable information as well.

I’d like to thank Armando for raising this topic. The research into matching with a kit that has the entire spectrum of SNPs from all three of the companies has been quite interesting. In fact, unless Felix has added all of the SNPs to the other ancient kits, this is the only kit in existence that has all of the SNPs from all of the companies included.

My thanks to Felix Immanuel (formerly Felix Chandrakumar) and John Olson for assistance with research for this article.

______________________________________________________________

Disclosure

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

Legacy Tree Genealogists for genealogy research

2014 Top Genetic Genealogy Happenings – A Baker’s Dozen +1

Posted on December 30, 2014 by Roberta Estes

It’s that time again, to look over the year that has just passed and take stock of what has happened in the genetic genealogy world. I wrote a review in both 2012 and 2013 as well. Looking back, these momentous happenings seem quite “old hat” now. For example, both www.GedMatch.com and www.DNAGedcom.com, once new, have become indispensable tools that we take for granted. Please keep in mind that both of these tools (as well as others in the Tools section, below) depend on contributions, although GedMatch now has a tier 1 subscription offering for $10 per month as well.

So what was the big news in 2014?

Beyond the Tipping Point

Genetic genealogy has gone over the tipping point. Genetic genealogy is now, unquestionably, mainstream and lots of people are taking part. From the best I can figure, there are now approaching or have surpassed three million tests or test records, although certainly some of those are duplicates.

500,000+ at 23andMe
700,000+ at Ancestry
700,000+ at Genographic

The organizations above represent “one-test” companies. Family Tree DNA provides various kinds of genetic genealogy tests to the community and they have over 380,000 individuals with more than 700,000 test records.

In addition to the above mentioned mainstream firms, there are other companies that provide niche testing, often in addition to Family Tree DNA Y results.

In addition, there is what I would refer to as a secondary market for testing as well which certainly attracts people who are not necessarily genetic genealogists but who happen across their corporate information and decide the test looks interesting. There is no way of knowing how many of those tests exist.

Additionally, there is still the Sorenson data base with Y and mtDNA tests which reportedly exceeded their 100,000 goal.

Spencer Wells spoke about the “viral spread threshold” in his talk in Houston at the International Genetic Genealogy Conference in October and terms 2013 as the year of infection. I would certainly agree.

Autosomal Now the New Normal

Another change in the landscape is that now, autosomal DNA has become the “normal” test. The big attraction to autosomal testing is that anyone can play and you get lots of matches. Earlier in the year, one of my cousins was very disappointed in her brother’s Y DNA test because he only had a few matches, and couldn’t understand why anyone would test the Y instead of autosomal where you get lots and lots of matches. Of course, she didn’t understand the difference in the tests or the goals of the tests – but I think as more and more people enter the playground – percentagewise – fewer and fewer do understand the differences.

Case in point is that someone contacted me about DNA and genealogy. I asked them which tests they had taken and where and their answer was “the regular one.” With a little more probing, I discovered that they took Ancestry’s autosomal test and had no clue there were any other types of tests available, what they could tell him about his ancestors or genetic history or that there were other vendors and pools to swim in as well.

A few years ago, we not only had to explain about DNA tests, but why the Y and mtDNA is important. Today, we’ve come full circle in a sense – because now we don’t have to explain about DNA testing for genealogy in general but we still have to explain about those “unknown” tests, the Y and mtDNA. One person recently asked me, “oh, are those new?”

Ancient DNA

This year has seen many ancient DNA specimens analyzed and sequenced at the full genomic level.

The year began with a paper titled, “When Populations Collide” which revealed that contemporary Europeans carry between 1-4% of Neanderthal DNA most often associated with hair and skin color, or keratin. Africans, on the other hand, carry none or very little Neanderthal DNA.

http://dna-explained.com/2014/01/30/neanderthal-genome-further-defined-in-contemporary-eurasians/

A month later, a monumental paper was published that detailed the results of sequencing a 12,500 Clovis child, subsequently named Anzick or referred to as the Anzick Clovis child, in Montana. That child is closely related to Native American people of today.

http://dna-explained.com/2014/02/13/clovis-people-are-native-americans-and-from-asia-not-europe/

In June, another paper emerged where the authors had analyzed 8000 year old bones from the Fertile Crescent that shed light on the Neolithic area before the expansion from the Fertile Crescent into Europe. These would be the farmers that assimilated with or replaced the hunter-gatherers already living in Europe.

http://dna-explained.com/2014/06/09/dna-analysis-of-8000-year-old-bones-allows-peek-into-the-neolithic/

Svante Paabo is the scientist who first sequenced the Neanderthal genome. Here is a great interview and speech. This man is so interesting. If you have not read his book, “Neanderthal Man, In Search of Lost Genomes,” I strongly recommend it.

http://dna-explained.com/2014/07/22/finding-your-inner-neanderthal-with-evolutionary-geneticist-svante-paabo/

In the fall, yet another paper was released that contained extremely interesting information about the peopling and migration of humans across Europe and Asia. This was just before Michael Hammer’s presentation at the Family Tree DNA conference, so I covered the paper along with Michael’s information about European ancestral populations in one article. The take away messages from this are two-fold. First, there was a previously undefined “ghost population” called Ancient North Eurasian (ANE) that is found in the northern portion of Asia that contributed to both Asian populations, including those that would become the Native Americans and European populations as well. Secondarily, the people we thought were in Europe early may not have been, based on the ancient DNA remains we have to date. Of course, that may change when more ancient DNA is fully sequenced which seems to be happening at an ever-increasing rate.

http://dna-explained.com/2014/10/21/peopling-of-europe-2014-identifying-the-ghost-population/

Ancient DNA Available for Citizen Scientists

If I were to give a Citizen Scientist of the Year award, this year’s award would go unquestionably to Felix Chandrakumar for his work with the ancient genome files and making them accessible to the genetic genealogy world. Felix obtained the full genome files from the scientists involved in full genome analysis of ancient remains, reduced the files to the SNPs utilized by the autosomal testing companies in the genetic genealogy community, and has made them available at GedMatch.

http://dna-explained.com/2014/09/22/utilizing-ancient-dna-at-gedmatch/

If this topic is of interest to you, I encourage you to visit his blog and read his many posts over the past several months.

https://plus.google.com/+FelixChandrakumar/posts

The availability of these ancient results set off a sea of comparisons. Many people with Native heritage matched Anzick’s file at some level, and many who are heavily Native American, particularly from Central and South America where there is less admixture match Anzick at what would statistically be considered within a genealogical timeframe. Clearly, this isn’t possible, but it does speak to how endogamous populations affect DNA, even across thousands of years.

http://dna-explained.com/2014/09/23/analyzing-the-native-american-clovis-anzick-ancient-results/

Because Anzick is matching so heavily with the Mexican, Central and South American populations, it gives us the opportunity to extract mitochondrial DNA haplogroups from the matches that either are or may be Native, if they have not been recorded before.

http://dna-explained.com/2014/09/23/analyzing-the-native-american-clovis-anzick-ancient-results/

Needless to say, the matches of these ancient kits with contemporary people has left many people questioning how to interpret the results. The answer is that we don’t really know yet, but there is a lot of study as well as speculation occurring. In the citizen science community, this is how forward progress is made…eventually.

http://dna-explained.com/2014/09/25/ancient-dna-matches-what-do-they-mean/

http://dna-explained.com/2014/09/30/ancient-dna-matching-a-cautionary-tale/

More ancient DNA samples for comparison:

http://dna-explained.com/2014/10/04/more-ancient-dna-samples-for-comparison/

A Siberian sample that also matches the Malta Child whose remains were analyzed in late 2013.

http://dna-explained.com/2014/11/12/kostenki14-a-new-ancient-siberian-dna-sample/

Felix has prepared a list of kits that he has processed, along with their GedMatch numbers and other relevant information, like gender, haplogroup(s), age and location of sample.

http://www.y-str.org/p/ancient-dna.html

Furthermore, in a collaborative effort with Family Tree DNA, Felix formed an Ancient DNA project and uploaded the ancient autosomal files. This is the first time that consumers can match with Ancient kits within the vendor’s data bases.

https://www.familytreedna.com/public/Ancient_DNA

Recently, GedMatch added a composite Archaic DNA Match comparison tool where your kit number is compared against all of the ancient DNA kits available. The output is a heat map showing which samples you match most closely.

Indeed, it has been a banner year for ancient DNA and making additional discoveries about DNA and our ancestors. Thank you Felix.

Haplogroup Definition

That SNP tsunami that we discussed last year…well, it made landfall this year and it has been storming all year long…in a good way. At least, ultimately, it will be a good thing. If you asked the haplogroup administrators today about that, they would probably be too tired to answer – as they’ve been quite overwhelmed with results.

The Big Y testing has been fantastically successful. This is not from a Family Tree DNA perspective, but from a genetic genealogy perspective. Branches have been being added to and sawed off of the haplotree on a daily basis. This forced the renaming of the haplogroups from the old traditional R1b1a2 to R-M269 in 2012. While there was some whimpering then, it would be nothing like the outright wailing now that would be occurring as haplogroup named reached 20 or so digits.

Alice Fairhurst discussed the SNP tsunami at the DNA Conference in Houston in October and I’m sure that the pace hasn’t slowed any between now and then. According to Alice, in early 2014, there were 4115 individual SNPs on the ISOGG Tree, and as of the conference, there were 14,238 SNPs, with the 2014 addition total at that time standing at 10,213. That is over 1000 per month or about 35 per day, every day.

Yes, indeed, that is the definition of a tsunami. Every one of those additions requires one of a number of volunteers, generally haplogroup project administrators to evaluate the various Big Y results, the SNPs and novel variants included, where they need to be inserted in the tree and if branches need to be rearranged. In some cases, naming request for previously unknown SNPs also need to be submitted. This is all done behind the scenes and it’s not trivial.

The project I’m closest to is the R1b L-21 project because my Estes males fall into that group. We’ve tested several, and I’ll be writing an article as soon as the final test is back.

The tree has grown unbelievably in this past year just within the L21 group. This project includes over 700 individuals who have taken the Big Y test and shared their results which has defined about 440 branches of the L21 tree. Currently there are almost 800 kits available if you count the ones on order and the 20 or so from another vendor.

Here is the L21 tree in January of 2014

Compare this with today’s tree, below.

Michael Walsh, Richard Stevens, David Stedman need to be commended for their incredible work in the R-L21 project. Other administrators are doing equivalent work in other haplogroup projects as well. I big thank you to everyone. We’d be lost without you!

One of the results of this onslaught of information is that there have been fewer and fewer academic papers about haplogroups in the past few years. In essence, by the time a paper can make it through the peer review cycle and into publication, the data in the paper is often already outdated relative to the Y chromosome. Recently a new paper was released about haplogroup C3*. While the data is quite valid, the authors didn’t utilize the new SNP naming nomenclature. Before writing about the topic, I had to translate into SNPese. Fortunately, C3* has been relatively stable.

http://dna-explained.com/2014/12/23/haplogroup-c3-previously-believed-east-asian-haplogroup-is-proven-native-american/

10^th Annual International Conference on Genetic Genealogy

The Family Tree DNA International Conference on Genetic Genealogy for project administrators is always wonderful, but this year was special because it was the 10^th annual. And yes, it was my 10^th year attending as well. In all these years, I had never had a photo with both Max and Bennett. Everyone is always so busy at the conferences. Getting any 3 people, especially those two, in the same place at the same time takes something just short of a miracle.

Ten years ago, it was the first genetic genealogy conference ever held, and was the only place to obtain genetic genealogy education outside of the rootsweb genealogy DNA list, which is still in existence today. Family Tree DNA always has a nice blend of sessions. I always particularly appreciate the scientific sessions because those topics generally aren’t covered elsewhere.

http://dna-explained.com/2014/10/11/tenth-annual-family-tree-dna-conference-opening-reception/

http://dna-explained.com/2014/10/12/tenth-annual-family-tree-dna-conference-day-2/

http://dna-explained.com/2014/10/13/tenth-annual-family-tree-dna-conference-day-3/

http://dna-explained.com/2014/10/15/tenth-annual-family-tree-dna-conference-wrapup/

Jennifer Zinck wrote great recaps of each session and the ISOGG meeting.

http://www.ancestorcentral.com/decennial-conference-on-genetic-genealogy/

http://www.ancestorcentral.com/decennial-conference-on-genetic-genealogy-isogg-meeting/

http://www.ancestorcentral.com/decennial-conference-on-genetic-genealogy-sunday/

I thank Family Tree DNA for sponsoring all 10 conferences and continuing the tradition. It’s really an amazing feat when you consider that 15 years ago, this industry didn’t exist at all and wouldn’t exist today if not for Max and Bennett.

Education

Two educational venues offered classes for genetic genealogists and have made their presentations available either for free or very reasonably. One of the problems with genetic genealogy is that the field is so fast moving that last year’s session, unless it’s the very basics, is probably out of date today. That’s the good news and the bad news.

http://dna-explained.com/2014/11/12/genetic-genealogy-ireland-2014-presentations

http://dna-explained.com/2014/09/26/educational-videos-from-international-genetic-genealogy-conference-now-available/

In addition, three books have been released in 2014.

In January, Emily Aulicino released Genetic Genealogy, The Basics and Beyond.

In October, Richard Hill released “Guide to DNA Testing: How to Identify Ancestors, Confirm Relationships and Measure Ethnicity through DNA Testing.”

Most recently, David Dowell’s new book, NextGen Genealogy: The DNA Connection was released right after Thanksgiving.

Ancestor Reconstruction – Raising the Dead

This seems to be the year that genetic genealogists are beginning to reconstruct their ancestors (on paper, not in the flesh) based on the DNA that the ancestors passed on to various descendants. Those segments are “gathered up” and reassembled in a virtual ancestor.

I utilized Kitty Cooper’s tool to do just that.

http://dna-explained.com/2014/10/03/ancestor-reconstruction/

I know it doesn’t look like much yet but this is what I’ve been able to gather of Henry Bolton, my great-great-great-grandfather.

Kitty did it herself too.

http://blog.kittycooper.com/2014/08/mapping-an-ancestral-couple-a-backwards-use-of-my-segment-mapper/

http://blog.kittycooper.com/2014/09/segment-mapper-tool-improvements-another-wold-dna-map/

Ancestry.com wrote a paper about the fact that they have figured out how to do this as well in a research environment.

http://corporate.ancestry.com/press/press-releases/2014/12/ancestrydna-reconstructs-partial-genome-of-person-living-200-years-ago/

http://www.thegeneticgenealogist.com/2014/12/16/ancestrydna-recreates-portions-genome-david-speegle-two-wives/

GedMatch has created a tool called, appropriately, Lazarus that does the same thing, gathers up the DNA of your ancestor from their descendants and reassembles it into a DNA kit.

Blaine Bettinger has been working with and writing about his experiences with Lazarus.

http://www.thegeneticgenealogist.com/2014/10/20/finally-gedmatch-announces-monetization-strategy-way-raise-dead/

http://www.thegeneticgenealogist.com/2014/12/09/recreating-grandmothers-genome-part-1/

http://www.thegeneticgenealogist.com/2014/12/14/recreating-grandmothers-genome-part-2/

Tools

Speaking of tools, we have some new tools that have been introduced this year as well.

Genome Mate is a desktop tool used to organize data collected by researching DNA comparsions and aids in identifying common ancestors. I have not used this tool, but there are others who are quite satisfied. It does require Microsoft Silverlight be installed on your desktop.

The Autosomal DNA Segment Analyzer is available through www.dnagedcom.com and is a tool that I have used and found very helpful. It assists you by visually grouping your matches, by chromosome, and who you match in common with.

Charting Companion from Progeny Software, another tool I use, allows you to colorize and print or create pdf files that includes X chromosome groupings. This greatly facilitates seeing how the X is passed through your ancestors to you and your parents.

WikiTree is a free resource for genealogists to be able to sort through relationships involving pedigree charts. In November, they announced Relationship Finder.

Probably the best example I can show of how WikiTree has utilized DNA is using the results of King Richard III.

By clicking on the DNA icon, you see the following:

And then Richard’s Y, mitochondrial and X chromosome paths.

Since Richard had no descendants, to see how descendants work, click on his mother, Cecily of York’s DNA descendants and you’re shown up to 10 generations.

While this isn’t terribly useful for Cecily of York who lived and died in the 1400s, it would be incredibly useful for finding mitochondrial descendants of my ancestor born in 1802 in Virginia. I’d love to prove she is the daughter of a specific set of parents by comparing her DNA with that of a proven daughter of those parents! Maybe I’ll see if I can find her parents at WikiTree.

Kitty Cooper’s blog talks about additional tools. I have used Kitty’s Chromosome mapping tools as discussed in ancestor reconstruction.

Felix Chandrakumar has created a number of fun tools as well. Take a look. I have not used most of these tools, but there are several I’ll be playing with shortly.

Exits and Entrances

With very little fanfare, deCODEme discontinued their consumer testing and reminded people to download their date before year end.

http://dna-explained.com/2014/09/30/decodeme-consumer-tests-discontinued/

I find this unfortunate because at one time, deCODEme seemed like a company full of promise for genetic genealogy. They failed to take the rope and run.

On a sad note, Lucas Martin who founded DNA Tribes unexpectedly passed away in the fall. DNA Tribes has been a long-time player in the ethnicity field of genetic genealogy. I have often wondered if Lucas Martin was a pseudonym, as very little information about Lucas was available, even from Lucas himself. Neither did I find an obituary. Regardless, it’s sad to see someone with whom the community has worked for years pass away. The website says that they expect to resume offering services in January 2015. I would be cautious about ordering until the structure of the new company is understood.

http://www.dnatribes.com/

In the last month, a new offering has become available that may be trying to piggyback on the name and feel of DNA Tribes, but I’m very hesitant to provide a link until it can be determined if this is legitimate or bogus. If it’s legitimate, I’ll be writing about it in the future.

However, the big news exit was Ancestry’s exit from the Y and mtDNA testing arena. We suspected this would happen when they stopped selling kits, but we NEVER expected that they would destroy the existing data bases, especially since they maintain the Sorenson data base as part of their agreement when they obtained the Sorenson data.

http://dna-explained.com/2014/10/02/ancestry-destroys-irreplaceable-dna-database/

The community is still hopeful that Ancestry may reverse that decision.

Ancestry – The Chromosome Browser War and DNA Circles

There has been an ongoing battle between Ancestry and the more seasoned or “hard-core” genetic genealogists for some time – actually for a long time.

The current and most long-standing issue is the lack of a chromosome browser, or any similar tools, that will allow genealogists to actually compare and confirm that their DNA match is genuine. Ancestry maintains that we don’t need it, wouldn’t know how to use it, and that they have privacy concerns.

Other than their sessions and presentations, they had remained very quiet about this and not addressed it to the community as a whole, simply saying that they were building something better, a better mousetrap.

In the fall, Ancestry invited a small group of bloggers and educators to visit with them in an all-day meeting, which came to be called DNA Day.

http://dna-explained.com/2014/10/08/dna-day-with-ancestry/

In retrospect, I think that Ancestry perceived that they were going to have a huge public relations issue on their hands when they introduced their new feature called DNA Circles and in the process, people would lose approximately 80% of their current matches. I think they were hopeful that if they could educate, or convince us, of the utility of their new phasing techniques and resulting DNA Circles feature that it would ease the pain of people’s loss in matches.

I am grateful that they reached out to the community. Some very useful dialogue did occur between all participants. However, to date, nothing more has happened nor have we received any additional updates after the release of Circles.

Time will tell.

http://dna-explained.com/2014/11/18/in-anticipation-of-ancestrys-better-mousetrap/

http://dna-explained.com/2014/11/19/ancestrys-better-mousetrap-dna-circles/

DNA Circles, while interesting and somewhat useful, is certainly NOT a replacement for a chromosome browser, nor is it a better mousetrap.

http://dna-explained.com/2014/11/30/chromosome-browser-war/

In fact, the first thing you have to do when you find a DNA Circle that you have not verified utilizing raw data and/or chromosome browser tools from either 23andMe, Family Tree DNA or Gedmatch, is to talk your matches into transferring their DNA to Family Tree DNA or download to Gedmatch, or both.

http://dna-explained.com/2014/11/27/sarah-hickerson-c1752-lost-ancestor-found-52-ancestors-48/

I might add that the great irony of finding the Hickerson DNA Circle that led me to confirm that ancestry utilizing both Family Tree DNA and GedMatch is that today, when I checked at Ancestry, the Hickerson DNA Circle is no longer listed. So, I guess I’ve been somehow pruned from the circle. I wonder if that is the same as being voted off of the island. So, word to the wise…check your circles often…they change and not always in the upwards direction.

The Seamy Side – Lies, Snake Oil Salesmen and Bullys

Unfortunately a seamy side, an underbelly that’s rather ugly has developed in and around the genetic genealogy industry. I guess this was to be expected with the rapid acceptance and increasing popularity of DNA testing, but it’s still very unfortunate.

Some of this I expected, but I didn’t expect it to be so…well…blatant.

I don’t watch late night TV, but I’m sure there are now DNA diets and DNA dating and just about anything else that could be sold with the allure of DNA attached to the title.

I googled to see if this was true, and it is, although I’m not about to click on any of those links.

Unfortunately, within the ever-growing genetic genealogy community a rather large rift has developed over the past couple of years. Obviously everyone can’t get along, but this goes beyond that. When someone disagrees, a group actively “stalks” the person, trying to cost them their employment, saying hate filled and untrue things and even going so far as to create a Facebook page titled “Against<personname>.” That page has now been removed, but the fact that a group in the community found it acceptable to create something like that, and their friends joined, is remarkable, to say the least. That was accompanied by death threats.

Bullying behavior like this does not make others feel particularly safe in expressing their opinions either and is not conducive to free and open discussion. As one of the law enforcement officers said, relative to the events, “This is not about genealogy. I don’t know what it is about, yet, probably money, but it’s not about genealogy.”

Another phenomenon is that DNA is now a hot topic and is obviously “selling.” Just this week, this report was published, and it is, as best we can tell, entirely untrue.

http://worldnewsdailyreport.com/usa-archaeologists-discover-remains-of-first-british-settlers-in-north-america/

There were several tip offs, like the city (Lanford) and county (Laurens County) is not in the state where it is attributed (it’s in SC not NC), and the name of the institution is incorrect (Johns Hopkins, not John Hopkins). Additionally, if you google the name of the magazine, you’ll see that they specialize in tabloid “faux reporting.” It also reads a lot like the King Richard genuine press release.

http://urbanlegends.about.com/od/Fake-News/tp/A-Guide-to-Fake-News-Websites.01.htm

Earlier this year, there was a bogus institutional site created as well.

On one of the DNA forums that I frequent, people often post links to articles they find that are relevant to DNA. There was an interesting article, which has now been removed, correlating DNA results with latitude and altitude. I thought to myself, I’ve never heard of that…how interesting. Here’s part of what the article said:

Researchers at Aberdeen College’s Havering Centre for Genetic Research have discovered an important connection between our DNA and where our ancestors used to live.

Tiny sequence variations in the human genome sometimes called Single Nucleotide Polymorphisms (SNPs) occur with varying frequency in our DNA. These have been studied for decades to understand the major migrations of large human populations. Now Aberdeen College’s Dr. Miko Laerton and a team of scientists have developed pioneering research that shows that these differences in our DNA also reveal a detailed map of where our own ancestors lived going back thousands of years.

Dr. Laerton explains: “Certain DNA sequence variations have always been important signposts in our understanding of human evolution because their ages can be estimated. We’ve known for years that they occur most frequently in certain regions [of DNA], and that some alleles are more common to certain geographic or ethnic groups, but we have never fully understood the underlying reasons. What our team found is that the variations in an individual’s DNA correlate with the latitudes and altitudes where their ancestors were living at the time that those genetic variations occurred. We’re still working towards a complete understanding, but the knowledge that sequence variations are connected to latitude and altitude is a huge breakthrough by itself because those are enough to pinpoint where our ancestors lived at critical moments in history.”

The story goes on, but at the bottom, the traditional link to the publication journal is found.

The full study by Dr. Laerton and her team was published in the September issue of the Journal of Genetic Science.

I thought to myself, that’s odd, I’ve never heard of any of these people or this journal, and then I clicked to find this.

About that time, Debbie Kennett, DNA watchdog of the UK, posted this:

April Fools Day appears to have arrived early! There is no such institution as Aberdeen College founded in 1394. The University of Aberdeen in Scotland was founded in 1495 and is divided into three colleges: http://www.abdn.ac.uk/about/colleges-schools-institutes/colleges-53.php

The picture on the masthead of the “Aberdeen College” website looks very much like a photo of Aberdeen University. This fake news item seems to be the only live page on the Aberdeen College website. If you click on any other links, including the link to the so-called “Journal of Genetic Science”, you get a message that the website is experienced “unusually high traffic”. There appears to be no such journal anyway.

We also realized that Dr. Laerton, reversed, is “not real.”

I still have no idea why someone would invest the time and effort into the fake website emulating the University of Aberdeen, but I’m absolutely positive that their motives were not beneficial to any of us.

What is the take-away of all of this? Be aware, very aware, skeptical and vigilant. Stick with the mainstream vendors unless you realize you’re experimenting.

King Richard

The much anticipated and long-awaited DNA results on the remains of King Richard III became available with a very unexpected twist. While the science team feels that they have positively identified the remains as those of Richard, the Y DNA of Richard and another group of men supposed to have been descended from a common ancestor with Richard carry DNA that does not match.

http://dna-explained.com/2014/12/09/henry-iii-king-of-england-fox-in-the-henhouse-52-ancestors-49/

http://dna-explained.com/2014/12/05/mitochondrial-dna-mutation-rates-and-common-ancestors/

Debbie Kennett wrote a great summary article.

http://cruwys.blogspot.com/2014/12/richard-iii-and-use-of-dna-as-evidence.html

More Alike than Different

One of the life lessons that genetic genealogy has held for me is that we are more closely related that we ever knew, to more people than we ever expected, and we are far more alike than different. A recent paper recently published by 23andMe scientists documents that people’s ethnicity reflect the historic events that took place in the part of the country where their ancestors lived, such as slavery, the Trail of Tears and immigration from various worldwide locations.

From the 23andMe blog:

The study leverages samples of unprecedented size and precise estimates of ancestry to reveal the rate of ancestry mixing among American populations, and where it has occurred geographically:

All three groups – African Americans, European Americans and Latinos – have ancestry from Africa, Europe and the Americas.
Approximately 3.5 percent of European Americans have 1 percent or more African ancestry. Many of these European Americans who describe themselves as “white” may be unaware of their African ancestry since the African ancestor may be 5-10 generations in the past.
European Americans with African ancestry are found at much higher frequencies in southern states than in other parts of the US.

The ancestry proportions point to the different regional impacts of slavery, immigration, migration and colonization within the United States:

The highest levels of African ancestry among self-reported African Americans are found in southern states, especially South Carolina and Georgia.
One in every 20 African Americans carries Native American ancestry.
More than 14 percent of African Americans from Oklahoma carry at least 2 percent Native American ancestry, likely reflecting the Trail of Tears migration following the Indian Removal Act of 1830.
Among self-reported Latinos in the US, those from states in the southwest, especially from states bordering Mexico, have the highest levels of Native American ancestry.

http://news.sciencemag.org/biology/2014/12/genetic-study-reveals-surprising-ancestry-many-americans?utm_campaign=email-news-weekly&utm_source=eloqua

23andMe provides a very nice summary of the graphics in the article at this link:

http://blog.23andme.com/wp-content/uploads/2014/10/Bryc_ASHG2014_textboxes.pdf

The academic article can be found here:

http://www.cell.com/ajhg/home

2015

So what does 2015 hold? I don’t know, but I can’t wait to find out. Hopefully, it holds more ancestors, whether discovered through plain old paper research, cousin DNA testing or virtually raised from the dead!

What would my wish list look like?

More ancient genomes sequenced, including ones from North and South America.
Ancestor reconstruction on a large scale.
The haplotree becoming fleshed out and stable.
Big Y sequencing combined with STR panels for enhanced genealogical research.
Improved ethnicity reporting.
Mitochondrial DNA search by ancestor for descendants who have tested.
More tools, always more tools….
More time to use the tools!

Here’s wishing you an ancestor filled 2015!

______________________________________________________________

Disclosure

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

Legacy Tree Genealogists for genealogy research

Kostenki14 – A New Ancient Siberian DNA Sample

Posted on November 12, 2014 by Roberta Estes

This week, published in Science, we find another ancient DNA full genome sequence from Siberia in an article titled “Genomic structure in Europeans dating back at least 36,200 years” by Seguin-Orlando et al.. This sample, partially shown above, is quite old and closely related to the Mal’ta child, also found in Siberia from about 24,000 years ago. Interestingly enough, K14 carries more Neanderthal DNA than current Europeans. This skeleton was actually excavated in 1954, but was only recently genetically analyzed.

From the paper, this map above shows the locations of recently analyzed ancient DNA samples. Note that even though K14 and Mal’ta child are similar, they are not located in close geographic proximity.

Also from the paper, this chart of population clusters is quite interesting, because we can see which of these ancient samples share some heritage with today’s indigenous American populations, shown in grey. UPGH=Upper Paleolithic Hunter-Gatherer, MHG=Mesolithic Hunter Gatherer, which is later in time that Paleolithic, and NEOL=Neolithic indicating the farming population that arrived in Europe approximately 7,000-10,000 years ago from the Middle East

You can see that the Neolithic samples show no trace of ancestry with today’s Native people, but both pre-Neolithic Hunter-Gatherer cultures show some amount of shared ancestry with Native people. However, to date, MA1, the Malta child is the most closely related and carries the most DNA in common with today’s Native people.

Felix Chandrakumar is currently preparing the K14 genome for addition to the ancient DNA kits at GedMatch. It will be interesting to see if this sample also matches currently living individuals.

Also from the K14 paper, you can see on the map below where K14 matches current worldwide and European populations, where the warmer colors, i.e. red, indicated a closer match.

Of interest to genealogists and population geneticists, K14’s mitochondrial haplogroup is U2 and his Y haplogroup is C-M130, the same as LaBrana, a late Mesolithic hunter-gatherer found in northern Spain. Haplogroup C is, of course, one of the base haplogroups for the Native people of the Americas.

The K14 paper further fleshes out the new peopling of Europe diagram discussed in my Peopling of Europe article.

This map, from the Lazardis “Ancient human genomes suggest three ancestral populations for present-day Europeans” paper published in September 2014, shows the newly defined map including Ancient North Eurasian in this diagram.

K14 adds to this diagram in the following manner, although the paths are flipped right to left.

Blue represent current populations, red are ancient remains and green are ancestral populations.

Dienekes wrote about this find as well, here.

Paper Abstract:

The origin of contemporary Europeans remains contentious. We obtain a genome sequence from Kostenki 14 in European Russia dating to 38,700 to 36,200 years ago, one of the oldest fossils of Anatomically Modern Humans from Europe. We find that K14 shares a close ancestry with the 24,000-year-old Mal’ta boy from central Siberia, European Mesolithic hunter-gatherers, some contemporary western Siberians, and many Europeans, but not eastern Asians. Additionally, the Kostenki 14 genome shows evidence of shared ancestry with a population basal to all Eurasians that also relates to later European Neolithic farmers. We find that Kostenki 14 contains more Neandertal DNA that is contained in longer tracts than present Europeans. Our findings reveal the timing of divergence of western Eurasians and East Asians to be more than 36,200 years ago and that European genomic structure today dates back to the Upper Paleolithic and derives from a meta-population that at times stretched from Europe to central Asia.

You can read the full paper at the two links below.

http://www.sciencemag.org/content/early/2014/11/05/science.aaa0114

http://www2.zoo.cam.ac.uk/manica/ms/2014_Seguin_Orlando_et_al_Science.pdf

It’s been a great year for ancient DNA analysis and learning about our ancestral human populations.

However, I have one observation I just have to make about this particular find.

What amazing teeth. Obviously, this culture did not consume sugar!

______________________________________________________________

Disclosure

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

Legacy Tree Genealogists for genealogy research

Peopling of Europe 2014 – Identifying the Ghost Population

Posted on October 21, 2014 by Roberta Estes

Beginning with the full sequencing of the Neanderthal genome, first published in May 2010 by the Max Planck Institute with Svante Paabo at the helm, and followed shortly thereafter with a Denisovan specimen, we began to unravel our ancient history.

Neanderthal man, reconstructed at the National Museum of Nature and Science in Tokyo

The photo below shows a step in the process of extracting DNA from ancient bones at Max Planck.

Our Y and mitochondrial DNA haplogroups take us back thousands of years in time, but at some point, where and how people were settling and intermixing becomes fuzzy. Ancient DNA can put the people of that time and place in context. We have discovered that current populations do not necessarily represent the ancient populations of a particular locale.

Recent information discovered from ancient burials tells us that the people of Europe descend from a 3 pronged model. Until recently, it was believed that Europeans descended from Paleolithic hunter-gatherers and Neolithic farmers, a two-pronged model.

Previously, it was believed that Europe was peopled by the ancient hunter-gatherers, the Paleolithic, who originally settled in Europe beginning about 45,000 years ago. At this time, the Neanderthal were already settled in Europe but weren’t considered to be anatomically modern humans, and it was believed, incorrectly, that the two groups did not interbreed. These hunter-gatherers were the people who settled in Europe before the last major ice age, the Younger Dryas, taking refuge in the southern portions of Europe and Eurasia, and repeopling the continent after the ice receded, about 12,000 years ago. By that time, the Neanderthals were gone, or as we now know, at least partially assimilated.

This graphic shows Europe during the last ice age.

The second settlement wave, the agriculturalist farmers from the Near East either overran or integrated with the hunter-gatherers in the Neolithic period, depending on which theory you subscribe to, about 8000-10,000 years ago.

2012 – Ancient Northern European (ANE) Hints

Beginning in 2012, we began to see hints of a third lineage that contributed to the peopling of Europe as well, from the north. Buried in the 2012 paper, Estimating admixture proportions and dates with ADMIXTOOLS by Patterson et al, was a very interesting tidbit. This new technique showed a third population, referred to by many as a “ghost population”, because no one knew who they were, that contributed to the European population.

The new population was termed Ancient North Eurasian, or ANE.

Dienekes covered this paper in his blog, but without additional information, in the community in general, there wasn’t much more than a yawn.

2013 – Mal’ta Child Stirs Excitement

The first real hint of meat on the bones of ANE came in the form of ancient DNA analysis of a 24,000 year old Siberian boy that has come to be named Mal’ta (Malta) Child. In the original paper, by Raghaven et al, Upper Palaeolithic Siberian genome reveals dual ancestry of Native Americans, he was referred to as MA-1. I wrote about this in my article titled Native American Gene Flow – Europe?, Asia and the Americas. Dienekes wrote about this paper as well.

This revelation caused quite a stir, because it was reported that the Ancestor of Native Americans in Asia was 30% Western Eurasian. Unfortunately, in some cases, this was immediately interpreted to mean that Native Americans had come directly from Europe which is not what this paper said, nor inferred. It was also inferred that the haplogroups of this child, R* (Y) and U (mtDNA) were Native American, which is also incorrect. To date, there is no evidence for migration to the New World from Europe in ancient times, but that doesn’t mean we aren’t still looking for that evidence in early burials.

What this paper did show was that Europeans and Native Americans shared a common ancestor, and that the Siberian population had contributed to the European population as well as the Native American population. In other words, descendants settled in both directions, east and west.

The most fascinating aspect of this paper was the match distribution map, below, showing which populations Malta child matched most closely.

As you can see, MA-1, Malta Child, matches the Native American population most closely, followed by the northern European and Greenland populations. The further south in Europe and Asia, the more distant the matches and the darker the blue.

2013 – Michael Hammer and Haplogroup R

Last fall at the Family Tree DNA conference, Dr. Michael Hammer, from the Hammer Lab at the University of Arizona discussed new findings relative to ancient burials, specifically in relation to haplogroup R, or more specifically, the absence of haplogroup R in those early burials.

Based on the various theories and questions, ancient burials were enlightening.

In 2013, there were a total of 32 burials from the Neolithic period, after farmers arrived from the Near East, and haplogroup R did not appear. Instead, haplogroups G, I and E were found.

What this tells us is that haplogroup R, as well as other haplogroup, weren’t present in Europe at this time. Having said this, these burials were in only 4 locations and, although unlikely, R could be found in other locations.

Last year, Dr. Hammer concluded that haplogroup R was not found in the Paleolithic and likely arrived with the Neolithic farmers. That shook the community, as it had been widely believed that haplogroup R was one of the founding European haplogroups.

While this provided tantalizing information, we still needed additional evidence. No paper has yet been published that addresses these findings. The mass full sequencing of the Y chromosome over this past year with the introduction of the Big Y will provide extremely valuable information about the Y chromosome and eventually, the migration path into and across Europe.

2014 – Europe’s Three Ancient Tribes

In September 2014, another paper was published by Lazaridis et al that more fully defined this new ANE branch of the European human family tree. An article in BBC News titled Europeans drawn from three ancient ‘tribes’ describes it well for the non-scientist. Of particular interest in this article is the artistic rendering of the ancient individual, based on their genetic markers. You’ll note that they had dark skin, dark hair and blue eyes, a rather unexpected finding.

In discussing the paper, David Reich from Harvard, one of the co-authors, said, “Prior to this paper, the models we had for European ancestry were two-way mixtures. We show that there are three groups. This also explains the recently discovered genetic connection between Europeans and Native Americans. The same Ancient North Eurasian group contributed to both of them.”

The paper, Ancient human genomes suggest three ancestral populations for present-day Europeans, appeared as a letter in Nature and is behind a paywall, but the supplemental information is free.

The article summary states the following:

We sequenced the genomes of a ~7,000-year-old farmer from Germany and eight ~8,000-year-old hunter-gatherers from Luxembourg and Sweden. We analysed these and other ancient genomes1, 2, 3, 4 with 2,345 contemporary humans to show that most present-day Europeans derive from at least three highly differentiated populations: west European hunter-gatherers, who contributed ancestry to all Europeans but not to Near Easterners; ancient north Eurasians related to Upper Palaeolithic Siberians3, who contributed to both Europeans and Near Easterners; and early European farmers, who were mainly of Near Eastern origin but also harboured west European hunter-gatherer related ancestry. We model these populations’ deep relationships and show that early European farmers had ~44% ancestry from a ‘basal Eurasian’ population that split before the diversification of other non-African lineages.

This paper utilized ancient DNA from several sites and composed the following genetic contribution diagram that models the relationship of European to non-European populations.

Present day samples are colored purple, ancient in red and reconstructed ancestral populations in green. Solid lines represent descent without admixture and dashed lines represent admixture. WHG=western European hunter-gatherer, EEF=early European farmer and ANE=ancient north Eurasian

2014 – Michael Hammer on Europe’s Ancestral Population

For anyone interested in ancient DNA, 2014 has been a banner years. At the Family Tree DNA conference in Houston, Texas, Dr. Michael Hammer brought the audience up to date on Europe’s ancestral population, including the newly sequenced ancient burials and the information they are providing.

Dr. Hammer said that ancient DNA is the key to understanding the historical processes that led up to the modern. He stressed that we need to be careful inferring that the current DNA pattern is reflective of the past because so many layers of culture have occurred between then and now.

Until recently, it was assumed that the genes of the Neolithic farmers replaced those of the Paleolithic hunter-gatherers. Ancient DNA is suggesting that this is not true, at least not on a wholesale level.

The theory, of course, is that we should be able to see them today if they still exist. The migration and settlement pattern in the slide below was from the theory set forth in the 1990s.

In 2013, Dr. Hammer discussed the theory that haplogroup R1b spread into Europe with the farmers from the Near East in the Neolithic. This year, he expanded upon that topic that based on the new findings from ancient burials.

Last year, Dr. Hammer discussed 32 burials from 4 sites. Today, we have information from 15 ancient DNA sites and many of those remains have been full genome sequenced.

Information from papers and recent research suggests that Europeans also have genes from a third source lineage, nicknamed the “ghost population of North Eurasia.”

Scientists are finding a signal of northeast Asian related admixture in northern Europeans, first suggested in 2012. This was confirmed with the sequencing of Malta child and then in a second sequencing of Afontova Gora2 in south central Siberia.

We have complete genomes from nine ancient Europeans – Mesolithic hunter gatherers and Neothilic farmers. Hammer refers to the Mesolithic here, which is a time period between the Paleolithic (hunter gatherers with stone tools) and the Neolithic (farmers).

In the PCA charts, shown above, you can see that Europeans and people from the Near East cluster separately, except for a bridge formed by a few Mediterranean and Jewish populations. On the slide below, the hunter-gatherers (WHG) and early farmers (EEF) have been overlayed onto the contemporary populations along with the MA-1 (Malta Child) and AG2 (Afontova Gora2) representing the ANE.

When sequenced, separate groups formed including western hunter gathers and early european farmers include Otzi, the iceman. A third group is the north south clinal variation with ANE contributing to northern European ancestry. The groups are represented by the circles, above.

Dr. Hammer said that the team who wrote the “Ancient Human Genomes” paper just recently published used an F3 test, results shown above, which shows whether populations are an admixture of a reference population based on their entire genome. He mentioned that this technique goes well beyond PCA.

Mapped onto populations today, most European populations are a combination of the three early groups. However, the ANE is not found in the ancient Paleolithic or Neolithic burials. It doesn’t arrive until later.

This tells us that there was a migration event 45,000 years ago from the Levant, followed about 7000 years ago by farmers from the Near East, and that ANE entered the population some time after that. All Europeans today carry some amount of ANE, but ancient burials do not.

These burials also show that southern Europe has more Neolithic farmer genes and northern Europe has more Paleolithic/Mesolithic hunter-gatherer genes.

Pigmentation for light skin came with farmers – blue eyes existed in hunter gatherers even though their skin was dark.

Dr. Hammer created these pie charts of the Y and mitochondrial haplogroups found in the ancient burials as compared to contemporary European haplogroups.

The pie chart on the left shows the haplogroups of the Mesolithic burials, all haplogroup I2 and subclades. Note that in the current German population today, no I2a1b and no I1 was found. The chart on the right shows current Germans where haplogroup I is a minority.

Therefore, we can conclude that haplogroup I is a good candidate to be identified as a Paleolithic/Mesolithic haplogroup.

This information shows that the past is very different from today.

In 2014 we have many more burials that have been sequenced than last year, as shown on the map above.

Green represents Neolithic farmers, red are Mesolithic hunter-gatherers, brown at bottom right represents more recent samples from the Metallic age.

There are a total of 48 Neolithic burials where haplogroup G dominates. In the Mesolithic, there are a total of six haplogroup I.

This suggests that haplogroup I is a good candidate to be the father of the Paleolithic/Mesolithic and haplogroup G, the founding father of the Neolithic.

In addition to haplogroup G in the Neolithic, one sample of both E1b1b1 (M35) and C were also found in Spain. E1b1b1 isn’t surprising given it’s north African genesis, but C was quite interesting.

The Metal ages, which according to wiki begin about 3300BC in Europe, is where haplogroup R, along with I1, first appear.

Please note that the diffusion of melallurgy map above is not part of Dr. Hammer’s presentation. I have added it for clarification.

Nothing is constant in Europe. The Y DNA was very upheaved, as indicated on the graphic above. Mitochondrial DNA shifted from pre-Neolithic to Neolithic which isn’t terribly different from the present day.

Dr. Hammer did not say this, but looking at the Y versus the mtDNA haplogroups, I wonder if this suggests that indeed there was more of a replacement of the males in the population, but that the females were more widely assimilated. This would certainly make sense, especially if the invaders were warriors and didn’t have females with them. They would have taken partners from the invaded population.

Haplogroup G represents the spread of farming into Europe.

The most surprising revelation is that haplogroup R1b appears to have emerged after the Neolithic agriculture transition. Given that just three years ago we thought that haplogroup R1b was one of the original European settlers thousands of years ago, based on the prevalence of haplogroup R in Europe today, at about 50%, this is a surprising turn of events. Last year’s revelation that R was maybe only 7000-8000 years old in Europe was a bit of a whammy, but the age of R in Europe in essence just got halved again and the source of R1b changed from the Near East to the Asian steppes.

Obviously, something conferred an advantage to these R1b men. Given that they arrived in the early Metalic age, was it weapons and chariots that enabled the R1b men who arrived to quickly become more than half of the population?

The Bronze Age saw the first use of metal to create weapons. Warrior identity became a standard part of daily life. Celts ranged over Europe and were the most dominant iron age warriors. Indo-European languages and chariots arrived from Asia about this time.

The map above shows the Hallstadt and LaTene Celtic cultures in Europe, about 600BC. This was not a slide presented by Dr. Hammer.

Haplogroup R1b was not found in an ancient European context prior to a Bell Beaker period burial in Germany 4.8-4.0 kya (thousand years ago, i.e. 4,800-4,000 years ago). R1b arrives about 4.6 kya and is also found in a Corded Ware culture burial in Germany. A late introduction of these lineages which now predominate in Europe corresponds to the autosomal signal of the entry of Asian and Eastern European steppe invaders into western Europe.

Local expansion occurred in Europe of R1b subgroups U106, L21 and U152.

A current haplogroup R distribution map that reflects the findings of this past year is shown above.

Haplogroup I is interesting for another reason. It looks like haplogroup I2a1b (M423) may have been replaced by I1 which expanded after the Mesolithic.

On the slide above, the Loschbour sample from Luxembourg was mapped onto a current haplogroup I SNP map where his closest match is a current day Russian.

One of the benefits of ancient DNA genome processing is that we will be able to map current trees into maps of old SNPs and be able to tell who we match most closely.

Autosomal DNA can also be mapped to see how much of our DNA is from which ancient population.

Dr. Hammer mapped the percentages of European Mesolithic/Paleolithic hunter-gatherers in blue, Neolithic Farmers from the Near East in magenta and Asian Steppe Invaders representing ANE in yellow, over current populations. Note the ancient DNA samples at the top of the list. None of the burials except for Malta Child carry any yellow, indicating that the ANE entered the European population with the steppe invaders; the same group that brought us haplogroup R and possibly I1.

Dr. Hammer says that ANE was introduced to and assimilated into the European population by one or more incursions. We don’t know today if ANE in Europeans is a result of a single blast event or multiple events. He would like to do some model simulations and see if it is related to timing and arrival of swords and chariots.

We know too that there are more recent incursions, because we’re still missing major haplogroups like J.

The further east you go, meaning the closer to the steppes and Volga region, the less well this fits the known models. In other words, we still don’t have the whole story.

At the end of the presentation, Michael was asked if the whole genomes sequenced are also obtaining Y STR data, which would allow us to compare our results on an individual versus a haplogroup level. He said he didn’t know, but he would check.

Family Tree DNA was asked if they could show a personal ancient DNA map in myOrigins, perhaps as an alternate view. Bennett took a vote and that seemed pretty popular, which he interpreted as a yes, we’d like to see that.

In Summary

The advent of and subsequent drop in the price of whole genome sequencing combined with the ability to extract ancient DNA and piece it back together have provided us with wonderful opportunities. I think this is jut the proverbial tip of the iceberg, and I can’t wait to learn more.

If you are interested in other articles I’ve written about ancient DNA, check out these links: