DNA: In Search of…Signs of Endogamy

This is the fourth in our series of articles about searching for unknown close family members, specifically parents, grandparents, or siblings. However, these same techniques can be applied by genealogists to ancestors further back in time as well.

In this article, we discuss endogamy – how to determine if you have it, from what population, and how to follow the road signs.

After introductions, we will be covering the following topics:

  • Pedigree collapse and endogamy
  • Endogamous groups
  • The challenge(s) of endogamy
  • Endogamy and unknown close relatives (parents, grandparents)
  • Ethnicity and Populations
  • Matches
  • AutoClusters
  • Endogamous Relationships
  • Endogamous DNA Segments
  • “Are Your Parents Related?” Tool
  • Surnames
  • Projects
  • Locations
  • Y DNA, Mitochondrial DNA, and Endogamy
  • Endogamy Tools Summary Tables
    • Summary of Endogamy Tools by Vendor
    • Summary of Endogamous Populations Identified by Each Tool
    • Summary of Tools to Assist People Seeking Unknown Parents and Grandparents

What Is Endogamy and Why Does It Matter?

Endogamy occurs when a group or population of people intermarry among themselves for an extended period of time, without the introduction of many or any people from outside of that population.

The effect of this continual intermarriage is that the founders’ DNA simply gets passed around and around, eventually in small segments.

That happens because there is no “other” DNA to draw from within the population. Knowing or determining that you have endogamy helps make sense of DNA matching patterns, and those patterns can lead you to unknown relatives, both close and distant.

This Article

This article serves two purposes.

  • This article is educational and relevant for all researchers. We discuss endogamy using multiple tools and examples from known endogamous people and populations.
  • In order to be able to discern endogamy when we don’t know who our parents or grandparents are, we need to know what signs and signals to look for, and why, which is based on what endogamy looks like in people who know their heritage.

There’s no crystal ball – no definitive “one-way” arrow, but there are a series of indications that suggest endogamy.

Depending on the endogamous population you’re dealing with, those signs aren’t always the same.

If you’re sighing now, I understand – but that’s exactly WHY I wrote this article.

We’re covering a lot of ground, but these road markers are invaluable diagnostic tools.

I’ve previously written about endogamy in the articles:

Let’s start with definitions.

Pedigree Collapse and Endogamy

Pedigree collapse isn’t the same as endogamy. Pedigree collapse is when you have ancestors that repeat in your tree.

In this example, the parents of our DNA tester are first cousins, which means the tester shares great-grandparents on both sides and, of course, the same ancestors from there on back in their tree.

This also means they share more of those ancestors’ DNA than they would normally share.

John Smith and Mary Johnson are both in the tree twice, in the same position as great-grandparents. Normally, Tester Smith would carry approximately 12.5% of each of his great-grandparents’ DNA, assuming for illustration purposes that exactly 50% of each ancestor’s DNA is passed in each generation. In this case, due to pedigree collapse, 25% of Tester Smith’s DNA descends from John Smith, and another 25% descends from Mary Johnson, double what it would normally be. 25% is the amount of DNA contribution normally inherited from grandparents, not great-grandparents.
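The arithmetic above can be sketched in a few lines of Python. This is only an illustration under the same simplifying assumption used in the text, that exactly 50% of each ancestor’s DNA is passed in each generation. The function name `expected_share` is made up for this example.

```python
from fractions import Fraction


def expected_share(generations_back_per_path):
    """Add up the expected DNA share for every place an ancestor appears in the tree.

    Each entry is how many generations back that appearance is
    (1 = parent, 2 = grandparent, 3 = great-grandparent).
    Assumes exactly half of an ancestor's DNA is passed on per generation.
    """
    return sum(Fraction(1, 2 ** g) for g in generations_back_per_path)


# John Smith appears once as a great-grandparent in a tree without collapse.
normal = expected_share([3])

# With first-cousin parents, John Smith appears twice as a great-grandparent.
collapsed = expected_share([3, 3])

print(f"appears once:  {float(normal):.1%}")
print(f"appears twice: {float(collapsed):.1%}")
```

Real inheritance is random, so actual percentages will wander around these expected values.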

While we may find first cousin marriages a bit eyebrow-raising today, they were quite common in the past. Both laws and customs varied with the country, time, social norms, and religion.

Pedigree Collapse and Endogamy Are NOT the Same

You might think that pedigree collapse and endogamy are one and the same, but there’s a difference. Pedigree collapse can lead to endogamy, but it takes more than one instance of pedigree collapse to morph into endogamy within a population. Population is the key word for endogamy.

The main difference is that pedigree collapse occurs with known ancestors in more recent generations for one person, while endogamy is longer-term and systemic in a group of people.

Picture a group of people, all descended from Tester Smith’s great-grandparents intermarrying. Now you have the beginnings of endogamy. A couple hundred or a few hundred years later, you have true endogamy.

In other words, endogamy is pedigree collapse on a larger scale – think of a village or a church.

My ancestors’ village of Schnait, in Germany, is shown above in 1685. One church and maybe 30 or 40 homes. According to church and other records, the same families had inhabited this village, and region, for generations. It’s a sure bet that both pedigree collapse and endogamy existed in this small community.

If pedigree collapse happens over and over again because there are no other people within the community to marry, then you have endogamy. In other words, with endogamy, you assuredly DO have historical pedigree collapse, generally back in time, often before you can identify those specific ancestors – because everyone descends from the same set of founders.

Endogamy Doesn’t Necessarily Indicate Recent Pedigree Collapse

With deep, historic endogamy, you don’t necessarily have recent pedigree collapse, and in fact, many people do not. Jewish people are a good example of this phenomenon. They shared ancestors for hundreds or thousands of years, depending on which group we are referring to, but in recent, known, generations, many Jewish people aren’t related. Still, their DNA often matches each other.

The good news is that there are telltale signs and signals of endogamy.

The bad news is that not all of these are obvious enough to help people seeking clues about unknown close relatives, and some other “signs” aren’t what they are believed to be.

Let’s step through each endogamy identifier, or “hint,” and then we will review how we can best utilize this information.

First, let’s take a look at groups that are considered to be endogamous.

Endogamous Groups

Jewish People – Specifically groups that were isolated from other groups of Jewish (and other) people: Ashkenazi (Germany, Northern France, and diaspora), Sephardic (Spanish, Iberia, and diaspora), Mizrahi (Israel, Middle Eastern, and diaspora), Ethiopian Jews, and possibly Jews from other locations such as Mountain Jews from Kazakhstan and the Caucasus.

Acadians – Descendants of about 60 French families who settled in “Acadia” beginning about 1604, primarily in what is now Nova Scotia, and intermarried among themselves and with the Mi’kmaq people. Expelled by the English in 1755, they were scattered in groups to various diasporic regions where they continued to intermarry and where their descendants are found today. Some Acadians became the Cajuns of Louisiana.

Anabaptist Protestant Faiths – Amish, Mennonite, and Brethren (Dunkards) and their offshoots are Protestant religious sects founded in Europe in the 16th, 17th, and 18th centuries on the principle of baptizing only adults or people who are old enough to choose to follow the faith, or rebaptizing people who had been previously baptized as children. These Anabaptist faiths tend to marry within their own group or church and often expel those who marry outside of the faith. Many emigrated to the American colonies and elsewhere, seeking religious freedom. Occasionally those groups would locate in close proximity and intermarry, but would not marry outside the Anabaptist denominations.

Native American (Indigenous) People – all indigenous peoples found in North and South America before European colonization descended from a small number of original founders who probably arrived at multiple times.

Indigenous Pacific Islanders – Including indigenous peoples of Australia, New Zealand, and Hawaii prior to colonization. They are probably as endogamous as Native American people, but I don’t have specific examples to share.

Villages – European or other villages with little inflow or whose residents were restricted from leaving over hundreds of years.

Other groups may have significant multiple lines of pedigree collapse and therefore become endogamous over time. Some people from Newfoundland, French Canadians, and Mormons (Church of Jesus Christ of Latter-Day Saints) come to mind.

Endogamy is a process that occurs over time.

Endogamy and Unknown Relatives

If you know who your relatives are, you may already know you’re from an endogamous population, but if you’re searching for close relatives, it’s helpful to be able to determine if you have endogamous heritage, at least in recent generations.

If you know nothing about either parent, some of these tools won’t help you, at least not initially, but others will. However, as you add to your knowledge base, the other tools will become more useful.

If you know the identity of one parent, this process becomes at least somewhat easier.

In future articles, we will search specifically for parents and each of your four grandparents. In this article, I’ll review each of the diagnostic tools and techniques you can use to determine if you have endogamy, and perhaps pinpoint the source.

The Challenge

People with endogamous heritage are related in multiple, unknown ways, over many generations. They may also be related in known ways in recent generations.

If both of your parents share the SAME endogamous culture or group of relatives:

  • You may have significantly more autosomal DNA matches than people without endogamy, unless that group of people is under-sampled. Jewish people have significantly more matches, but Native people have fewer due to under-sampling.
  • You may experience a higher-than-normal cM (centiMorgan) total for estimated relationships, especially more distant relationships, 3C and beyond.
  • You will have many matches related to you on both your maternal and paternal sides.
  • Parts of your autosomal DNA will be the same on both your mother’s and father’s sides, meaning your DNA will be fully identical in some locations. (I’ll explain more in a minute.)

If either (or both) of your parents are from an endogamous population, you:

  • Will, in some cases, carry identifying Y and mitochondrial DNA that points to a specific endogamous group. This is true for Native people, can be true for Jewish people and Pacific Islanders, but is not true for Anabaptist people.

One Size Does NOT Fit All

Please note that there is no “one size fits all.”

Each or any of these tools may provide relevant hints, depending on:

  • Your heritage
  • How many other people have tested from the relevant population group
  • How many close or distant relatives have tested
  • If your parents share the same heritage
  • Your unique DNA inheritance pattern
  • If your parents, individually, were fully endogamous or only partly endogamous, and how far back generationally that endogamy occurred

For example, in my own genealogy, my maternal grandmother’s father was Acadian on his father’s side. While I’m not fully endogamous, I have significantly more matches through that line proportionally than on my other lines.

I have Brethren endogamy on my mother’s side via her paternal grandmother.

Endogamous ancestors are shown with red stars on my mother’s pedigree chart, above. However, please note that her maternal and paternal endogamous ancestors are not from the same endogamous population.

However, I STILL have fewer matches on my mother’s side in total than on my father’s side because my mother has recent Dutch and recent German immigrant ancestors, which reduces her total number of matches. Neither of those lines has had as much time to produce descendants in the US, and Europe is under-sampled when compared with the US, where more people tend to take DNA tests because they are searching for where they came from.

My father’s ancestors have been in the US since it was a British Colony, and I have many more cousins who have tested on his side than mother’s.

If you looked at my pedigree chart and thought to yourself, “that’s messy,” you’d be right.

The “endogamy means more matches” axiom does not hold true for me, comparatively, between my parents – in part because my mother’s German and Dutch lines are such recent immigrants.

The number of matches alone isn’t going to tell this story.

We are going to need to look at several pieces and parts for more information. Let’s start with ethnicity.

Ethnicity and Populations

Ethnicity can be a double-edged sword. It can tell you exactly nothing you couldn’t discern by looking in the mirror, or, conversely, it can be a wealth of information.

Ethnicity reveals the parts of the world where your ancestors originated. When searching for recent ancestors, you’re most interested in majority ethnicity, meaning the 50% of your DNA that you received from each of your parents.

Ethnicity results at each vendor are easy to find and relatively easy to understand.

This individual at FamilyTreeDNA is 100% Ashkenazi Jewish.

If they were 50% Jewish, we could then estimate, and that’s an important word, that either one of their parents was fully Jewish, and not the other, or that two of their grandparents were Jewish, although not necessarily on the same side.
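To see why “estimate” is the important word, here’s a small sketch that lists the grandparent combinations consistent with a given percentage. It assumes, purely for illustration, that each grandparent contributes exactly 25% and is either fully from the population or not at all. The names `GRANDPARENTS`, `grandparent_scenarios`, and `same_side` are invented for this example.

```python
from itertools import combinations

GRANDPARENTS = (
    "paternal grandfather",
    "paternal grandmother",
    "maternal grandfather",
    "maternal grandmother",
)


def grandparent_scenarios(percent):
    """List every set of grandparents that could account for the percentage.

    Assumes each grandparent contributes exactly 25% and is either fully
    from the population or not from it at all.
    """
    count = round(percent / 25)
    return list(combinations(GRANDPARENTS, count))


def same_side(scenario):
    """True when all grandparents in the scenario belong to one parent."""
    return len({name.split()[0] for name in scenario}) == 1


for scenario in grandparent_scenarios(50):
    label = "one fully Jewish parent" if same_side(scenario) else "one grandparent on each side"
    print(f"{' + '.join(scenario)}: {label}")
```

Ethnicity alone can’t tell these scenarios apart, which is why the other tools in this article matter.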

On the other hand, my mother’s ethnicity, shown below, has nothing remarkable that would point to any majority endogamous population, yet she has two.

The only hint of endogamy from ethnicity would be her ~1% Americas, and that isn’t relevant for finding close relatives. However, minority ancestry is very relevant for identifying Native ancestors, which I wrote about, here.

You can correlate or track your ethnicity segments to specific ancestors, which I discussed in the article, Native American & Minority Ancestors Identified Using DNAPainter Plus Ethnicity Segments, here.

Since I wrote that article, FamilyTreeDNA has added the feature of ethnicity or population Chromosome Painting, based on where each of your populations fall on your chromosomes.

In this example on chromosome 1, I have European ancestry (blue), except for the pink Native segment, which occurs in the same location on my mother’s chromosome 1 as well.

Both 23andMe and FamilyTreeDNA provide chromosome painting AND the associated segment information so you can identify the relevant ancestors.

Ancestry is in the process of rolling out an ethnicity painting feature, BUT, it has no segment or associated matching information. While it’s interesting eye candy, it’s not terribly useful beyond the ethnicity information that Ancestry already provides. However, Jonny Perl at DNAPainter has devised a way to estimate Ancestry’s start and stop locations, here. Way to go Jonny!

Now all you need to do is convince your Ancestry matches to upload their DNA file to one of the three databases, FamilyTreeDNA, MyHeritage, and GEDMatch, that accept transfers, aka uploads. This allows matching with segment data so that you can identify who matches you on that segment, track your ancestors, and paint your ancestral segments at DNAPainter.

I provided step-by-step instructions, here, for downloading your raw DNA file from each vendor in order to upload the file to another vendor.

Ethnicity Sides

Three of the four DNA testing vendors, 23andMe, FamilyTreeDNA, and recently, Ancestry, attempt to phase your ethnicity DNA, meaning to assign it to one parental “side” or the other – both in total and on each chromosome.

Here’s Ancestry’s SideView, where your DNA is estimated to belong to parent 1 and parent 2. I detailed how to determine which side is which, here, and while that article was written specifically pertaining to Ancestry’s SideView, the technique is relevant for all the vendors who attempt to divide your DNA into parents, a technique known as phasing.

I say “attempt” because phasing may or may not be accurate, meaning the top chromosome may not always be parent 1, and the bottom chromosome may not always be parent 2.

Here’s an example at 23andMe.

See the two yellow segments. They are both assigned as Native. I happen to know one is from the mother and one is from the father, yet they are both displayed on the “top” chromosome, which one would interpret to be the same parent.

I am absolutely positive this is not the case because this is a close family member, and I have the DNA of the parent who contributed the Native segment on chromosome 1, on the top chromosome. That parent does not have a Native segment on chromosome 2 to contribute. So that Native segment had to be contributed by the other parent, but it’s also shown on the top chromosome.

The DNA segments circled in purple belong together on the same “side” and were contributed to the tester by the same parent. The Native segment on chromosome 2 abuts a purple African segment, suggesting perhaps that the ancestor who contributed that segment was mixed between those ethnicities. In the US, that suggests enslavement.

The other African segments, circled, are shown on the second chromosome in each pair.

To be clear, parent 1 is not assigned by the vendors to either mother or father and will differ by person. Your parent 1, or the parent on the top chromosome may be your mother and another person’s parent 1 may be their father.

As shown in this example, parents can vary by chromosome, a phenomenon known as “strand swap.” Occasionally, the DNA can even be swapped within a chromosome assignment.

You can, however, get an idea of the division of your DNA at any specific location. As shown above, you can only have a maximum of two populations of DNA on any one chromosome location.

In our example above, this person’s majority ancestry is European (blue). On each chromosome where we find a minority segment, the opposite chromosome in the same location is European, meaning blue.

Let’s look at another example.

At FamilyTreeDNA, the person whose ethnicity painting is shown below has a Native American (pink) ancestor on their father’s side. FamilyTreeDNA has correctly phased or identified their Native segments as all belonging to the second chromosome in each pair.

Looking at chromosome 18, for example, most of their father’s chromosome is Native American (pink). The other parent’s chromosome is European (dark blue) at those same locations.

If one parent were entirely of one ethnicity and the other parent entirely of a different ethnicity, then one bar of each chromosome would be all pink, for example, and the other would be entirely blue, representing the other ethnicity.

Phasing ethnicity or populations to maternal and paternal sides is not foolproof, and each chromosome is phased individually.

Ethnicity can, in some cases, give you a really good idea of what you’re dealing with in terms of heritage and endogamy.

If someone had an Ashkenazi Jewish father and European mother, for example, one copy of each chromosome would be yellow (Ashkenazi Jewish), and one would be blue (European).

However, if each of their parents were half European Jewish and half European (not Jewish), then their different colored segments would be scattered across their entire set of chromosomes.

In this case, both of the tester’s parents are mixed – European Jewish (green) and Western Europe (blue.) We know both parents are admixed from the same two populations because in some locations, both parents contributed blue (Western Europe), and in other locations, both contributed Jewish (green) segments.

Both MyHeritage and Ancestry provide a secondary tool that’s connected to ethnicity, but different and generally in more recent times.

Ancestry’s DNA Communities

While your ethnicity may not point to anything terribly exciting in terms of endogamy, DNA Communities might. Ancestry says that a DNA Community is a group of people who share DNA because their relatives recently lived in the same place at the same time, and that communities are much smaller than ethnicity regions and reach back only about 50-300 years.

Based on the ancestors’ locations in the trees of me and my matches, Ancestry has determined that I’m connected to two communities. In my case, the blue group is clearly my father’s line. The orange group could be either parent, or even a combination of both.

My endogamous Brethren could be showing up in Maryland, Pennsylvania, and Ohio, but it’s uncertain, in part, because my father’s ancestral lines are found in Virginia, West Virginia, and Maryland too.

These aren’t useful for me, but they may be more useful for fully endogamous people, especially in conjunction with ethnicity.

My Acadian cousin’s European ethnicity isn’t informative.

However, viewing his DNA Communities puts his French heritage into perspective, especially combined with his match surnames.

I wrote about DNA Communities when it was introduced with the name Genetic Communities, here.

MyHeritage’s Genetic Groups

MyHeritage also provides a similar feature that shows where my matches’ ancestors lived in the same locations as mine.

One difference, though, is that testers can adjust their ethnicity results confidence level from high, above, to low, below where one of my Genetic Groups overlaps my ethnicity in the Netherlands.

You can also sort your matches by Genetic Groups.

The results show you not only who is in the group, but how many of your matches are in that group too, which provides perspective.

I wrote about Genetic Groups, here.

Next, let’s look at how endogamy affects your matches.

Matches

The number of matches for a person from an entirely endogamous community may be quite different from the number for a person with no endogamy.

FamilyTreeDNA provides a Family Matching feature that triangulates your matches and assigns them to your paternal or maternal side by using known matches that you have linked to their profile cards in your tree. You must link people for the Family Matching feature known as “bucketing” to be enabled.

The people you link are then processed for shared matches on the same chromosome segment(s). Triangulated individuals are then deposited in your maternal, paternal, and both buckets.
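To make the bucketing idea concrete, here’s a rough Python sketch. It is NOT FamilyTreeDNA’s actual algorithm, which isn’t published in this article. It assumes each segment is a hypothetical `(chromosome, start, end)` tuple and that the maternal and paternal segment lists come from linked relatives whose shared segments with you are already confirmed. The function names are made up.

```python
def overlaps(a, b):
    """True when two (chromosome, start, end) segments overlap on one chromosome."""
    return a[0] == b[0] and max(a[1], b[1]) < min(a[2], b[2])


def bucket(match_segments, maternal_segments, paternal_segments):
    """Place a match in a bucket based on which linked side its segments overlap.

    Simplification: a real triangulation also requires the match and the
    linked relative to match each other on that segment.
    """
    maternal = any(overlaps(m, s) for m in match_segments for s in maternal_segments)
    paternal = any(overlaps(m, s) for m in match_segments for s in paternal_segments)
    if maternal and paternal:
        return "Both"
    if maternal:
        return "Maternal"
    if paternal:
        return "Paternal"
    return "Unassigned"


# Hypothetical segments shared with a linked maternal and a linked paternal relative.
maternal_side = [(1, 30, 80)]
paternal_side = [(2, 0, 40)]

print(bucket([(1, 10, 50)], maternal_side, paternal_side))
print(bucket([(1, 10, 50), (2, 5, 20)], maternal_side, paternal_side))
print(bucket([(3, 0, 10)], maternal_side, paternal_side))
```

The more known relatives you link, the more segments each side has to compare against, and the fewer matches are left unassigned.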

Obviously, your two parents are the best people to link, but if they haven’t tested (or uploaded their DNA file from another vendor) and you have other known relatives, link them using the Family Tree tab at the top of your personal page.

I uploaded my Ancestry V4 kit to use as an example for linking. Let’s pretend that’s my sister. If I had not already linked my Ancestry V4 kit to “my sister’s” profile card, I’d want to do that and link other known individuals the same way. Just drag and drop the match to the correct profile card.

Note that a full or half sibling will be listed as such at FamilyTreeDNA, but an identical twin will show as a potential parent/child match to you. You’re much more likely to find a parent than an identical twin, but just be aware.

I’ve created a table of FamilyTreeDNA bucketed match results, by category, comparing the number of matches in endogamous categories with non-endogamous.

| Tester | Total Matches | Maternal Matches | Paternal Matches | Both | % Both | % DNA Unassigned |
|---|---|---|---|---|---|---|
| 100% Jewish | 34,637 | 11,329 | 10,416 | 4,806 | 13.9 | 23.3 |
| 100% Jewish | 32,973 | 10,700 | 9,858 | 4,606 | 14 | 23.7 |
| 100% Jewish | 32,255 | 9,060 | 10,970 | 3,892 | 12 | 25.8 |
| 75% Jewish | 24,232 | 11,846 | Only mother linked | Only mother linked | Only mother linked | |
| 100% Acadian | 8,093 | 3,826 | 2,299 | 1,062 | 13 | 11 |
| 100% Acadian | 7,828 | 3,763 | 1,825 | 923 | 11.8 | 17 |
| Not Endogamous | 6,760 | 3,845 | 1,909 | 13 | 0.19 | 14.5 |
| Not Endogamous | 7,723 | 1,470 | 3,317 | 6 | 0.08 | 38 |
| 100% Native American | 1,115 | Unlinked | Unlinked | Unlinked | | |
| 100% Native American | 885 | 290 | Unknown | Can’t calculate without at least one link on both sides | | |

The 100% Jewish, Acadian, and Not Endogamous testers have all linked both of their parents, so their matches, if valid (meaning not identical by chance, which I discussed here), will match them plus one or the other parent.

One person is 75% Jewish and has only linked their Jewish mother.

The Native people have not tested their parents, and the first Native person has not linked anyone in their tree. The second Native person has only linked a few maternal matches, but their mother has not tested. They are seeking their father.

It’s very difficult to find people who are fully Native as testers. Furthermore, Native people are under-sampled. If anyone knows of fully Native (or other endogamous) people who have tested and linked their parents or known relatives in their trees, and will allow me to use their total match numbers anonymously, please let me know.

As you can see, Jewish, Acadian, and Native people are 100% endogamous, but many more Jewish people than Native people have tested, so you CAN’T judge endogamy by the total number of matches alone.

In fact, in order:

  • Fully Jewish testers have about 4-5 times as many matches as the Acadian and Non-endogamous testers
  • Acadian and Non-endogamous testers have about 6-9 times as many matches as the Native American testers
  • Fully Jewish people have about 30 times more matches than the Native American testers
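If you’d like to check comparisons like these yourself, here’s a short sketch that averages the Total Matches for each group in the table and compares the groups. The numbers are copied from the table. The dictionary layout and the `ratio` helper are just one way to arrange them.

```python
from statistics import mean

# Total Matches per tester, copied from the bucketed match table.
total_matches = {
    "Jewish": [34637, 32973, 32255],
    "Acadian": [8093, 7828],
    "Not Endogamous": [6760, 7723],
    "Native American": [1115, 885],
}

averages = {group: mean(values) for group, values in total_matches.items()}


def ratio(group_a, group_b):
    """How many times more matches group_a averages than group_b."""
    return averages[group_a] / averages[group_b]


for a, b in [
    ("Jewish", "Acadian"),
    ("Jewish", "Not Endogamous"),
    ("Acadian", "Native American"),
    ("Not Endogamous", "Native American"),
    ("Jewish", "Native American"),
]:
    print(f"{a} vs. {b}: {ratio(a, b):.1f} times")
```

With only two or three testers per group, these ratios are rough guides, not statistics.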

If a person’s endogamy with a particular population is only on their maternal or paternal side, they won’t have a significant number of people related to both sides, meaning few people will fall into the “Both” bucket. People that will always be found in the “Both” bucket are full siblings and their descendants, along with descendants of the tester, assuming their match is linked to their profiles in the tester’s tree.

In the case of our Jewish testers, you can easily see that the “Both” bucket is very high. The Acadians are also higher than one would reasonably expect without endogamy. A non-endogamous person might have a few matches on both sides, assuming the parents are not related to each other.

A high number of “Both” matches is a very good indicator of endogamy within the same population on both parents’ sides.

The percentage of people who are assigned to the “Both” bucket is between 11% and 14% in the endogamous groups, and less than 1% in the non-endogamous group, so statistically not relevant.

As demonstrated by the Native people compared to the Jewish testers, the total number of matches can be deceiving.

However, being related to both parents, as indicated by the “Both” bucket, unless you have pedigree collapse, is a good indicator of endogamy.

Of course, if you don’t know who your relatives are, you can’t link them in your tree, so this type of “hunt” won’t generally help people seeking their close family members.

However, you may notice that you’re matching people PLUS both of their parents. If that’s the case, start asking questions of those matches about their heritage.

A very high number of total matches, as compared to non-endogamous people, combined with some other hints might well point to Jewish heritage.

I included the % DNA Unassigned category because this category, when both parents are linked, is the percentage of matches by chance, meaning the match doesn’t match either of the tester’s parents. All of the testers with matches listed in the “Both” category have linked both of their parents, not just maternal and paternal relatives.
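Here’s how the two percentage columns in the table can be derived from the bucket counts, as a minimal sketch. The 5% cutoff in `suggests_shared_endogamy` is my own illustrative assumption, chosen only because it falls between the 11% to 14% seen in the endogamous testers and the less than 1% seen in the non-endogamous testers. It is not a vendor threshold.

```python
def bucket_percentages(total, maternal, paternal, both):
    """Return the % Both and % Unassigned for one tester's bucketed matches."""
    unassigned = total - maternal - paternal - both
    return {
        "pct_both": 100 * both / total,
        "pct_unassigned": 100 * unassigned / total,
    }


def suggests_shared_endogamy(pct_both, cutoff=5.0):
    """Illustrative rule of thumb only; the cutoff is an assumption."""
    return pct_both >= cutoff


# First 100% Jewish tester and first Not Endogamous tester from the table.
jewish = bucket_percentages(34637, 11329, 10416, 4806)
not_endogamous = bucket_percentages(6760, 3845, 1909, 13)

for label, result in [("100% Jewish", jewish), ("Not Endogamous", not_endogamous)]:
    print(
        f"{label}: {result['pct_both']:.2f}% Both, "
        f"{result['pct_unassigned']:.1f}% unassigned, "
        f"shared endogamy hinted: {suggests_shared_endogamy(result['pct_both'])}"
    )
```

Remember that this only works when both parents, or known relatives on both sides, are linked.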

Matching Location at MyHeritage

MyHeritage provides a matching function by location. Please note that it’s the location of the tester, but that may still be quite useful.

The locations are shown in the most-matches to least-matches order. Clicking on the location shows the people who match you who are from that location. This would be the most useful in situations where recent immigration has occurred. In my case, my great-grandfather from the Netherlands arrived in the 1860s, and my German ancestors arrived in the 1850s. Neither of those groups is endogamous, though, unless it would be on a village level.

AutoClusters

Let’s shift to Genetic Affairs, a third-party tool available to everyone.

Using their AutoCluster function, Genetic Affairs clusters your matches together who match both each other and you.

This is an example of the first few clusters in my AutoCluster. You can see that I have several colored clusters of various sizes, but none are huge.

Compare that to the following endogamous cluster, sample courtesy of EJ Blom at Genetic Affairs.

If your AutoCluster at Genetic Affairs looks something like this, a huge orange blob in the upper left hand corner, you’re dealing with endogamy.

Please also note that the size of your cluster is also a function of both the number of testers and the match threshold you select. I always begin by using the defaults. I wrote about using Genetic Affairs, here.

If you tested at or transferred to MyHeritage, they too license AutoClusters, but have optimized the algorithm to tease out endogamous matches so that their Jewish customers, in particular, don’t wind up with a huge orange block of interrelated people.

You won’t see the “endogamy signature” huge cluster in the corner, so you’re less likely to be able to discern endogamy from a MyHeritage cluster alone.

The commonality between these Jewish clusters at MyHeritage is that they all tend to be rather uniform in size and small, with lots of grey connecting almost all the blocks.

Grey cells indicate people who match people in two colored groups. In other words, there is often no clear division in clusters between the mother’s side and the father’s side in Jewish clusters.

In non-endogamous situations, even if you can’t identify the parents, the clusters should still fall into two sides, meaning a group of clusters for each parent’s side that are not related to each other.
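The “two sides” idea can be illustrated with a toy example. This sketch is NOT the Genetic Affairs or MyHeritage algorithm. It simply treats every pair of your matches who also match each other as a link and groups linked people together (connected components), which is a deliberately simple stand-in for real clustering. All names and match IDs are hypothetical.

```python
from collections import defaultdict


def cluster_matches(shared_pairs):
    """Group matches who are linked to each other, largest group first."""
    graph = defaultdict(set)
    for a, b in shared_pairs:
        graph[a].add(b)
        graph[b].add(a)

    seen, clusters = set(), []
    for start in graph:
        if start in seen:
            continue
        stack, component = [start], set()
        while stack:
            node = stack.pop()
            if node in component:
                continue
            component.add(node)
            stack.extend(graph[node] - component)
        seen |= component
        clusters.append(component)
    return sorted(clusters, key=len, reverse=True)


def largest_cluster_share(clusters):
    """Fraction of all clustered matches that sit in the biggest cluster."""
    total = sum(len(c) for c in clusters)
    return len(clusters[0]) / total if total else 0.0


# Two unrelated sides: A-B-C never match X-Y.
two_sides = [("A", "B"), ("B", "C"), ("X", "Y")]
# Endogamy-like pattern: one extra link ties both sides together.
interrelated = two_sides + [("C", "X")]

print(largest_cluster_share(cluster_matches(two_sides)))
print(largest_cluster_share(cluster_matches(interrelated)))
```

When nearly everyone lands in one group, you’re looking at the same effect as the huge orange block, or the grey cells connecting almost all the small clusters.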

You can read more about Genetic Affairs clusters and their tools, here. DNAGedcom.com also provides a clustering tool.

Endogamous Relationships

Endogamous estimated relationships are sometimes high. Please note the word, “sometimes.”

Using the Shared cM Project tool relationship chart, here, at DNAPainter, people with heavy endogamy will discover that estimated relationships MAY be on the high side, or the relationships may, perhaps, be estimated too “close” in time. That’s especially true for more distant relationships, but surprisingly, it’s not always true. The randomness of inheritance still comes into play, and so do potential unknown relatives. Hence, the words “may” are bolded and underscored.

Unfortunately, it’s often stated as “conventional wisdom” that Jewish matches are “always” high, and first cousins appear as siblings. Let’s see what the actual data says.

At DNAPainter, you can either enter the amount of shared DNA (cM), or the percent of shared DNA, or just use the chart provided.

I’ve assembled a compilation of close relationships in kits that I have access to or from people who were generous enough to share their results for this article.

I’ve used Jewish results, which is a highly endogamous population, compared with non-endogamous testers.

The “Jewish Actual” column reports the total amount of shared DNA with that person. In other words, someone to their grandparent. The Average Range is the average plus the range from DNAPainter. The Percent Difference is the % difference between the actual number and the DNAPainter average.

You’ll see fully Jewish testers, at left, matching with their family members, and a Non-endogamous person, at right, matching with their same relative.

Relationship | Jewish Actual (cM) | Percent Difference than Average | Average (Range) | Non-endogamous Actual (cM) | Percent Difference than Average
Grandparent | 2141 | 22 | 1754 (984-2482) | 1742 | <1 lower
Grandparent | 1902 | 8.5 | 1754 (984-2482) | 1973 | 12
Sibling | 3039 | 16 | 2613 (1613-3488) | 2515 | 3.5 lower
Sibling | 2724 | 4 | 2613 (1613-3488) | 2761 | 5.5
Half-Sibling | 2184 | 24 | 1759 (1160-2436) | 2127 | 21
Half-Sibling | 2128 | 21 | 1759 (1160-2436) | 2352 | 34
Aunt/Uncle | 2066 | 18.5 | 1741 (1201-2282) | 1849 | 6
Aunt/Uncle | 2031 | 16.5 | 1741 (1201-2282) | 2097 | 20
1C | 1119 | 29 | 866 (396-1397) | 959 | 11
1C | 909 | 5 | 866 (396-1397) | 789 | 9 lower
1C1R | 514 | 19 | 433 (102-980) | 467 | 8
1C1R | 459 | 6 | 433 (102-980) | 395 | 9 lower
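
For readers who like to check the arithmetic, here is a minimal Python sketch of how the Percent Difference values can be reproduced. The averages are copied from the table above. The function name `percent_difference` and the dictionary are hypothetical names made up for this illustration and are not part of any DNAPainter tool.

```python
# Averages (in cM) copied from the Average column of the table above.
AVERAGE_CM = {
    "Grandparent": 1754,
    "Sibling": 2613,
    "Half-Sibling": 1759,
    "Aunt/Uncle": 1741,
    "1C": 866,
    "1C1R": 433,
}


def percent_difference(actual_cm, relationship):
    """Return how far actual_cm sits above (+) or below (-) the average, in percent."""
    average = AVERAGE_CM[relationship]
    return (actual_cm - average) / average * 100


# Two rows from the table: a Jewish 1C and a non-endogamous 1C.
for label, actual in [("Jewish 1C", 1119), ("Non-endogamous 1C", 789)]:
    diff = percent_difference(actual, "1C")
    direction = "higher" if diff >= 0 else "lower"
    print(f"{label}: {abs(diff):.1f}% {direction} than average")
```

The two first cousin rows come out at roughly 29% higher and 9% lower than the average, in line with the table.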

These totals are from FamilyTreeDNA except one from GEDMatch (one Jewish Half-sibling).

Totals may vary by vendor, even when matching with the same person. 23andMe includes the X segments in the total cMs and also counts fully identical segments twice. MyHeritage imputation seems to err on the generous side.

However, in these dozen examples:

  • You can see that the Jewish actual amount of DNA shared is always more than the average in the estimate.
  • The red means the overage is more than 100 cM larger.
  • The percentage difference is probably more meaningful because 100 cM is a much smaller percentage of a 1754 cM grandparent connection than of a 433 cM 1C1R connection.

However, you can’t tell anything about endogamy by just looking at any one sample, because:

  • Some of the Non-Endogamous matches are high too. That’s just the way of random inheritance.
  • All of the actual Jewish match numbers are within the published ranges, but on the high side.

Furthermore, it can get more complex.

Half Endogamous

I requested assistance from Jewish genealogy researchers, and a lovely lady, Sharon, reached out, compiled her segment information, and shared it with me, granting permission to share with you. A HUGE thank you to Sharon!

Sharon is half-Jewish via one parent, and her half-sibling is fully Jewish. Their half-sibling match to each other at Ancestry is 1756 cM with a longest segment of 164 cM.

How does Jewish matching vary if you’re half-Jewish versus fully Jewish? Let’s look at 21 people who match both Sharon and her fully Jewish half-sibling.

Sharon shared the differences in 21 known Jewish matches with her and her half-sibling. I’ve added the Relationship Estimate Range from DNAPainter and colorized the highest of the two matches in yellow. Bolding in the total cM column shows a value above the average range for that relationship.

Total Matching cMs is on the left, with Longest Segment on the right.

While this is clearly not a scientific study, it is a representative sample.

The fully Jewish sibling carries more Jewish DNA, which is available for other Jewish matches to match as a function of endogamy (identical by chance/population), so I would have expected the fully Jewish sibling to match most if not all Jewish testers at a higher level than the half-Jewish sibling.

However, that’s not universally what we see.

The fully Jewish sibling is not always the sibling who shares the most DNA with the other Jewish testers, and the half-Jewish tester has the larger “Longest Segment” more often than not.

Approximately two-thirds of the time (13/21), the fully Jewish person does have a higher total matching cM, but about one-third of the time (8/21), the half-Jewish sibling has a higher matching cM.

About one-fourth of the time (5/21), the fully Jewish sibling has the longest matching segment, and about two-thirds of the time (13/21), the half-Jewish sibling does. In three cases, or about 14% of the time, the longest segment is equal which may indicate that it’s the same segment.

Because of endogamy, Jewish matches are more likely to have:

  • Larger than average total cM for the specific relationship
  • More and smaller matching segments

However, as we have seen, neither of those is definitive, nor always true. Jewish matches and relationships are not always overestimated.

Ancestry and Timber

Please note that Ancestry downweights some matches by removing some segments using their Timber algorithm. Based on my matches and other accounts that I manage, Ancestry does not downweight in the 2-3rd cousin category, which is 90 cM and above, but they do begin downweighting in the 3-4th cousin category, below 90 cM, where my “Extended Family” category begins.

If you’ve tested at Ancestry, you can check for yourself.

By clicking on the amount of DNA you share with your match on your match list at Ancestry, shown above, you will be taken to another page where you will be able to view the unweighted shared DNA with that match, meaning the amount of DNA shared before the downweighting and removal of some segments, shown below.

Given the downweighting, and the information in the spreadsheet provided by Sharon, it doesn’t appear that any of those matches would have been in a category to be downweighted.

Therefore, for these and other close matches, Timber wouldn’t be a factor, but would potentially be in more distant matches.

Endogamous Segments

Endogamous matches tend to have more, smaller segments. Those small amounts of matching DNA add up and tend to skew the total cM upwards.

How and why does this happen?

Ancestral DNA from further back in time tends to be broken into smaller segments.

Sometimes, especially in endogamous situations, two smaller segments, at one time separated from each other, manage to join back together again and form a match, but the match is only due to ancestral segments – not because of a recent ancestor.

Please note that different vendors have different minimum matching cM thresholds, so smaller matches may not be available at all vendors. Remember that factors like Timber and imputation can affect matching as well.

Let’s take a look at an example. I’ve created a chart where two ancestors have their blue and pink DNA broken into 4 cM segments.

They have children, a blue child and a pink child, and the two children, shown above, each inherited the same blue 4 cM segment and the same pink 4 cM segment from their respective parents. The other unlabeled pink and blue segments are not inherited by these two children, so those unlabeled segments are irrelevant in this example.

The parents may have had other children who inherited those same 4 cM labeled pink and blue segments as well, and if not, the parents’ siblings were probably passing at least some of the same DNA down to their descendants too.

The blue and pink children had children, and their children had children – for several generations.

Time passed, and their descendants became an endogamous community. Those pink and blue 4 cM segments may at some time be lost during recombination in the descendants of each of their children, shown by “Lost pink” and “Lost blue.”

However, because there is only a very limited amount of DNA within the endogamous community, their descendants may regain those same segments again from their “other parent” during recombination, downstream.

In each generation, the DNA of the descendant carrying the original blue or pink DNA segment is recombined with their partner. Given that the partners are both members of the same endogamous community, the two people may have the same pink and/or blue DNA segments. If one parent doesn’t carry the pink 4 cM segment, for example, their offspring may receive that ancestral pink segment from the other parent.

They could potentially, and sometimes do, receive that ancestral segment from both parents.

In our example, the descendants of the blue child, at left, lost the pink 4 cM segment in generation 3, but a few generations later, in generation 11, that descendant child inherited that same pink 4 cM segment from their other parent. Therefore, both the 4 cM blue and 4 cM pink segments are now available to be inherited by the descendants in that line. I’ve shown the opposite scenario in the generational inheritance at right where the blue segment is lost and regained.

Once rejoined, that pink and blue segment can be passed along together for generations.

The important part, though, is that once those two segments butt up against each other again during recombination, they aren’t just two separate 4 cM segments, but one segment that is 8 cM long – that is now equal to or above the vendors’ matching threshold.
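
Here is a minimal Python sketch of that rejoining effect. It is only an illustration of the idea, not any vendor’s matching algorithm. The 7 cM threshold, the function names, and the segment positions are all assumptions chosen for the example.

```python
# Illustrative only: real vendors use their own thresholds and matching algorithms.
MATCH_THRESHOLD_CM = 7.0  # assumed value for this sketch


def merge_adjacent(segments):
    """Merge segments that touch or overlap on the same chromosome.

    Each segment is a (start_cm, end_cm) tuple on a genetic-map scale.
    """
    merged = []
    for start, end in sorted(segments):
        if merged and start <= merged[-1][1]:
            # This segment butts up against (or overlaps) the previous one.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def reportable(segments, threshold=MATCH_THRESHOLD_CM):
    """Keep only segments long enough to be reported as a match."""
    return [(s, e) for s, e in segments if e - s >= threshold]


blue = (10.0, 14.0)  # 4 cM blue segment
pink = (14.0, 18.0)  # 4 cM pink segment, directly adjacent to the blue one

print(reportable([blue]))                        # blue alone is too small
print(reportable(merge_adjacent([blue, pink])))  # rejoined, they form one 8 cM segment
```

On its own, neither 4 cM segment clears the assumed threshold. Once they sit side by side, the combined 8 cM segment does.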

This is why people descended from endogamous populations often have the following matching characteristics:

  • More matches
  • Many smaller segment matches
  • Their total cM is often broken into more, smaller segments

What do more, smaller segments look like, exactly?

More, Smaller Segments

All of our vendors except Ancestry have a chromosome browser for their customers to compare their DNA to that of their matches visually.

Let’s take a look at some examples of what endogamous and non-endogamous matches look like.

For example, here’s a screen shot of a random Jewish second cousin match – 298 cM total, divided into 12 segments, with a longest segment of 58 cM.

A second Jewish 2C with 323 cM total, across 19 segments, with a 69 cM longest block.

A fully Acadian 2C match with 600 cM total, across 27 segments, with a longest segment of 69 cM.

A second Acadian 2C with 332 cM total, across 20 segments, with a longest segment of 42 cM.

Next, a non-endogamous 2C match with 217 cM, across 7 segments, with a longest segment of 72 cM.

Here’s another non-endogamous 2C example, with 169 shared cM, across 6 segments, with a longest segment of 70 cM.

Here’s the second cousin data in a summary table. The take-away is the number of segments relative to the total cM.

Tester Population | Total cM | Longest Block (cM) | Total Segments
Jewish 2C | 298 | 58 | 12
Jewish 2C | 323 | 69 | 19
Acadian 2C | 600 | 69 | 27
Acadian 2C | 332 | 42 | 20
Non-endogamous 2C | 217 | 72 | 7
Non-endogamous 2C | 169 | 70 | 6
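
One simple way to put numbers on “more, smaller segments” is to compute the average segment size and the share of the total that the longest block represents. The sketch below does that for the six second cousin matches in the table above. The function name and the choice of these two ratios are my own assumptions for illustration, not an established endogamy test.

```python
# (population, total cM, longest block cM, segment count) from the table above.
second_cousins = [
    ("Jewish", 298, 58, 12),
    ("Jewish", 323, 69, 19),
    ("Acadian", 600, 69, 27),
    ("Acadian", 332, 42, 20),
    ("Non-endogamous", 217, 72, 7),
    ("Non-endogamous", 169, 70, 6),
]


def average_segment_cm(total_cm, segment_count):
    """Average segment size: total shared cM divided by the number of segments."""
    return total_cm / segment_count


for population, total_cm, longest_cm, count in second_cousins:
    avg = average_segment_cm(total_cm, count)
    share = longest_cm / total_cm * 100  # longest block as a percent of the total
    print(f"{population:15} avg segment {avg:5.1f} cM, longest block {share:4.1f}% of total")
```

In these six examples, the endogamous matches average roughly 17 to 25 cM per segment, with the longest block under a quarter of the total. The non-endogamous matches average roughly 28 to 31 cM per segment, with the longest block making up a third or more of the total. Six matches are far too few to set any kind of cutoff, so treat this as a way to look at your own data, not as a rule.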

You can see more examples and comparisons between Native American, Jewish and non-endogamous DNA individuals in the article, Concepts – Endogamy and DNA Segments.

I suspect that a savvy mathematician could predict endogamy based on longest block and total segment information.

Lara Diamond, a mathematician who writes at Lara’s Jewnealogy, might be up for this challenge. She just published compiled matching and segment information in her Ashkenazic Shared DNA Survey Results for those who are interested. You can also contribute to Lara’s data, here.

Endogamy, Segments, and Distant Relationships

While not relevant to searching for close relatives, heavily endogamous matches 3C and more distant, to quote one of my Jewish friends, “dissolve into a quagmire of endogamy and are exceedingly difficult to unravel.”

In my own Acadian endogamous line, I often simply have to label them “Acadian” because the DNA tracks back to so many ancestors in different lines. In other words, I can’t tell which ancestor the match is actually pointing to because the same DNA segment or segments is/are carried by several ancestors and their descendants due to founder effect.

The difference with the Acadians is that we can actually identify many or most of them, at least at some point in time. As my cousin, Paul LeBlanc, once said, if you’re related to one Acadian, you’re related to all Acadians. Then he proceeded to tell me that he and I are related 137 different ways. My head hurts!

It’s no wonder that endogamy is incredibly difficult beyond the first few generations when it turns into something like multi-colored jello soup.

“Are Your Parents Related?” Tool

There’s another tool that you can utilize to determine if your parents are related to each other.

To make that determination, you need to know about Runs of Homozygosity (ROH).

ROH means that the DNA on both strands or copies of the same chromosome is identical.

For a few locations in a row, ROH can easily happen just by chance, but the longer the segment, the less likely that commonality occurs simply by chance.

The good news is that you don’t need to know the identity of either of your parents. You don’t need either of your parents’ DNA tests – just your own. You’ll need to upload your DNA file to GEDmatch, which is free.

Click on “Are your parents related?”

GEDMatch analyzes your DNA to see if any of your DNA, above a reasonable matching threshold, is identical on both strands, indicating that you inherited the exact same DNA from both of your parents.

A legitimate match, meaning one that’s not by chance, will include many contiguous matching locations, generally a minimum of 500 SNPs or locations in a row. GEDmatch’s minimum threshold for identifying identical ancestral DNA (ROH) is 200 contiguous SNPs and 7 cM.
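
To make the idea of a run concrete, here is a minimal Python sketch that scans one chromosome for stretches where both copies carry the same allele. This is not GEDmatch’s algorithm. The function name, the input layout, and the default thresholds of 200 SNPs and 7 cM are assumptions for illustration, and the sketch makes no allowance for no-calls or genotyping errors, which real tools have to deal with.

```python
def find_roh(snps, min_snps=200, min_cm=7.0):
    """Return (start_cm, end_cm, snp_count) for each qualifying homozygous run.

    snps: list of (position_cm, allele_1, allele_2) tuples, sorted by position
    on a single chromosome. The default thresholds are illustrative assumptions.
    """
    runs = []
    run = []  # positions in the current unbroken homozygous stretch
    # A heterozygous sentinel at the end flushes a run that reaches the chromosome end.
    for position_cm, allele_1, allele_2 in list(snps) + [(None, "A", "B")]:
        if allele_1 == allele_2:
            run.append(position_cm)
            continue
        if run and len(run) >= min_snps and run[-1] - run[0] >= min_cm:
            runs.append((run[0], run[-1], len(run)))
        run = []
    return runs


# Toy data with tiny thresholds so the behavior is easy to follow.
toy = [
    (0.0, "A", "A"),
    (1.0, "C", "C"),
    (2.0, "G", "G"),
    (3.0, "A", "G"),  # heterozygous location ends the run
    (4.0, "T", "T"),
]
print(find_roh(toy, min_snps=3, min_cm=2.0))
```

With the toy thresholds, the first three locations form one run. With the much larger default thresholds, the same toy data reports nothing, which is how tiny identical-by-chance slivers get ignored.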

Here’s my result, including the graphic for the first two chromosomes. Notice the tiny green bars that show identical by chance tiny sliver segments.

I have no significant identical DNA, meaning my parents are not related to each other.

Next, let’s look at an endogamous example where there are small, completely identical segments across a person’s chromosomes.

This person’s Acadian parents are related to each other, but distantly.

Next, let’s look at a Jewish person’s results.

You’ll notice larger green matching ROH, but not over 200 contiguous SNPs and 7 cM.

GEDMatch reports that this Jewish person’s parents are probably not related within recent generations, but it’s clear that they do share DNA in common.

People whose parents are distantly related have relatively small, scattered matching segments. However, if you’re seeing larger ROH segments that would be large enough to match in a genealogical setting, meaning multiple segments greater than 7 cM and 500 SNPs, you may be dealing with a different type of situation where cousins have married in recent generations. The larger the matching segments, generally, the closer in time.

Blogger Kitty Cooper wrote an article, here, about discovering that your parents are related at the first cousin level, and what their GEDMatch “Are Your Parents Related” results look like.

Let’s look for more clues.

Surnames

There MAY be an endogamy clue in the surnames of the people you match.

Viewing surnames is easier if you download your match list, which you can do at every vendor except Ancestry. I’m not referring to the segment data, but the information about your matches themselves.

I provided instructions in the recent article, How to Download Your DNA Match Lists and Segment Files, here.

If you suspect endogamy for any reason, look at your closest matches and see if there is a discernable trend in the surnames, or locations, or any commonality between your matches to each other.

For example, Jewish, Acadian, and Native surnames may be recognizable, as may locations.

You can evaluate in either or both of two ways:

  • The surnames of your closest matches. Closest matches listed first will be your default match order.
  • Your most frequently occurring surnames, minus extremely common names like Smith, Jones, etc., unless they are also in your closest matches. To utilize this type of matching, sort the spreadsheet in surname order and then scan or count the number of people with each surname.
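
If you would rather not count by hand, a few lines of Python can tally surnames from a downloaded match list. This is a minimal sketch under stated assumptions: the sample rows are made up, and the column name “Last Name” is hypothetical, since every vendor labels its download columns differently.

```python
import csv
import io
from collections import Counter

# Hypothetical sample standing in for a downloaded match list. Real vendor
# files use their own column names, so adjust "Last Name" to match your file.
SAMPLE_CSV = """Full Name,Last Name
A. Bergeron,Bergeron
B. Hebert,Hebert
C. Bergeron,Bergeron
D. Smith,Smith
E. Muise,Muise
"""

COMMON_SURNAMES = {"Smith", "Jones"}  # extend as needed


def surname_counts(csv_file, column="Last Name", skip=COMMON_SURNAMES):
    """Count surnames in a match list, ignoring blanks and very common names."""
    counts = Counter()
    for row in csv.DictReader(csv_file):
        surname = (row.get(column) or "").strip()
        if surname and surname not in skip:
            counts[surname] += 1
    return counts


counts = surname_counts(io.StringIO(SAMPLE_CSV))
for surname, n in counts.most_common(10):
    print(surname, n)
```

To run this against your own download, open the file with `open("matches.csv", newline="", encoding="utf-8")` and pass it to `surname_counts`. The file name here is just a placeholder.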

Here are some examples from our testers.

Jewish – Closest surname matches.

  • Roth
  • Weiss
  • Goldman
  • Schonwald
  • Levi
  • Cohen
  • Slavin
  • Goodman
  • Sender
  • Trebatch

Acadian – Closest surname matches.

  • Bergeron
  • Hebert
  • Bergeron
  • Marcum
  • Muise
  • Legere
  • Gaudet
  • Perry
  • Verlander
  • Trombley

Native American – Closest surname matches.

  • Ortega
  • Begay
  • Valentine
  • Hayes
  • Montoya
  • Sun Bear
  • Martin
  • Tsosie
  • Chiquito
  • Yazzie

You may recognize these categories of surnames immediately.

If not, Google is your friend. Eliminate common surnames, then Google for a few together at a time and see what emerges.

The most unusual surnames are likely your best bets.

Projects

Another way to get some idea of what groups people with these surnames might belong to is to enter the surname in the FamilyTreeDNA surname search.

Go to the main FamilyTreeDNA page, but DO NOT sign on.

Scroll down until you see this image.

Type the surname into the search box. You’ll see how many people have tested with that surname, along with projects where project administrators have included that surname indicating that the project may be of interest to at least some people with that surname.

Here’s a portion of the project list for Cohen, a traditional Jewish surname.

These results are for Muise, an Acadian surname.

Clicking through to relevant surname projects, and potentially contacting the volunteer project administrator, can go a very long way in helping you gather and sift information. Clearly, they have an interest in this topic.

For example, here’s the Muise surname in the Acadian AmerIndian project. Two great hints here – Acadian heritage and Halifax, Nova Scotia.

Repeat for the balance of surnames on your list to look for commonalities, including locations on the public project pages.

Locations

Some of the vendor match files include location information. Each person on your match list will have the opportunity at the vendor where they tested to include location information in a variety of ways, either for their ancestors or themselves.

Where possible, it’s easiest to sort or scan the download file for this type of information.

Ancestry does not provide or facilitate a match list download, but you can still create your own list for your closest 20 or 30 matches in a spreadsheet.

MyHeritage provides common surname and ancestral location information for every match. How cool is that!

Y DNA, Mitochondrial DNA, and Endogamy

Haplogroups for both Y and mitochondrial DNA can indicate and sometimes confirm endogamy. In other cases, the haplogroup won’t help, but the matches and their location information just might.

FamilyTreeDNA is the only vendor that provides Y DNA and mitochondrial DNA tests that include highly granular haplogroups along with matches and additional tools.

23andMe provides high-level haplogroups which may or may not be adequate to pinpoint a haplogroup that indicates endogamy.

Of course, only males carry Y DNA that tracks to the direct paternal (surname) line, but everyone carries their mother’s mitochondrial DNA that represents their mother’s mother’s mother’s, or direct matrilineal line.

Some haplogroups are known to be closely associated with particular ethnicities or populations, like Native Americans, Pacific Islanders, and some Jewish people.

Haplogroups reach back in time before genealogy and can give us a sense of community that’s not available by either looking in the mirror or through traditional records.

This Native American man is a member of high-level haplogroup Q-M242. However, some men who carry this haplogroup are not Native, but are of European or Middle Eastern origin.

I entered the haplogroup in the FamilyTreeDNA Discover tool, which I wrote about, here.

Checking the information about this haplogroup reveals that their common ancestor descended from an Asian man about 30,000 years ago.

The migration path in the Americas explains why this person would have an endogamous heritage.

Our tester would receive a much more refined haplogroup if he upgraded to the Big Y test at FamilyTreeDNA, which would remove all doubt.

However, even without additional testing, information about his matches at FamilyTreeDNA may be very illuminating.

The Q-M242 Native man’s Y DNA matches men with more granular haplogroups, shown above, at left. On the Haplogroup Origins report, you can see that these people have all selected the “US (Native American)” country option.

Another useful tool would be to check the public Y haplotree, here, and the public mitochondrial tree here, for self-reported ancestor location information for a specific haplogroup.

Here’s an example of mitochondrial haplogroup A2 and a few subclades on the public mitochondrial tree. You can see that the haplogroup is found in Mexico, the US (Native), Canada, and many additional Caribbean, South, and Central American countries.

Of course, Y DNA and mitochondrial DNA (mtDNA) each tell a laser-focused story of one specific line. The great news, if you’re seeking information about your mother or father, is that Y DNA follows your father’s direct paternal (surname) line, and mitochondrial DNA follows your mother’s direct matrilineal line.

By combining Y and mitochondrial DNA results with ethnicity, autosomal matching, and the wide range of other tools that open doors, you will be able to reveal a great deal of information about whether you have endogamous heritage or not – and if so, from where.

I’ve provided a resource for stepping through and interpreting your Y DNA results, here, and mitochondrial DNA, here.

Discover for Y DNA Only

If you’re a female, you may feel left out of Y DNA testing and what it can tell you about your heritage. However, there’s a back door.

You can utilize the Y DNA haplogroups of your closest autosomal matches at both FamilyTreeDNA and 23andMe to reveal information.

Haplogroup information is available in the download files for both vendors, in addition to the Family Finder table view, below, at FamilyTreeDNA, or on your individual matches’ profile cards at both 23andMe and FamilyTreeDNA.

You can enter any Y DNA haplogroup in the FamilyTreeDNA Discover tool, here.

You’ll be treated to:

  • Your Haplogroup Story – how many testers have this haplogroup (so far), where the haplogroup is from, and the haplogroup’s age. In this case, the haplogroup was born in the Netherlands about 250 years ago, give or take 200 years. I know that it was 1806 or earlier based on the common ancestor of the men who tested.
  • Country Frequency – heat map of where the haplogroup is found in the world.
  • Notable Connections – famous and infamous (this haplogroup’s closest notable person is Leo Tolstoy).
  • Migration Map – migration path out of Africa and through the rest of the world.
  • Ancient Connections – ancient burials. His closest ancient match is from about 1000 years ago in Ukraine. Their shared ancestor lived about 2000 years ago.
  • Suggested Projects – based on the surname, projects that other matches have joined, and haplogroups.
  • Scientific Details – age estimates, confidence intervals, graphs, and the mutations that define this haplogroup.

I wrote about the Discover tool in the article, FamilyTreeDNA DISCOVER Launches – Including Y DNA Haplogroup Ages.

Endogamy Tools Summary Tables

Endogamy is a tough nut sometimes, especially if you’re starting from scratch. In order to make this topic a bit easier and to create a reference tool for you, I’ve created three summary tables.

  • Various endogamy-related tools available at each vendor which will or may assist with evaluating endogamy
  • Tools and their ability to detect endogamy in different groups
  • Tools best suited to assist people seeking information about unknown parents or grandparents

Summary of Endogamy Tools by Vendor

Please note that GEDMatch is not a DNA testing vendor, but they accept uploads and do have some tools that the testing vendors do not.

Tool | 23andMe | Ancestry | FamilyTreeDNA | MyHeritage | GEDMatch
Ethnicity | Yes | Yes | Yes | Yes | Use the vendors
Ethnicity Painting | Yes + segments | Yes, limited | Yes + segments | Yes |
Ethnicity Phasing | Yes | Partial | Yes | No |
DNA Communities | No | Yes | No | No |
Genetic Groups | No | No | No | Yes |
Family Matching aka Bucketing | No | No | Yes | No |
Chromosome Browser | Yes | No | Yes | Yes | Yes
AutoClusters | Through Genetic Affairs | No | Through Genetic Affairs | Yes, included | Yes, with subscription
Match List Download | Yes, restricted # of matches | No | Yes | Yes | Yes
Projects | No | No | Yes | No |
Y DNA | High-level haplogroup only | No | Yes, full haplogroup with Big Y, matching, tools, Discover | No |
Mitochondrial DNA | High-level haplogroup only | No | Yes, full haplogroup with mtFull, matching, tools | No |
Public Y Tree | No | No | Yes | No |
Public Mito Tree | No | No | Yes | No |
Discover Y DNA – public | No | No | Yes | No |
ROH | No | No | No | No | Yes

Summary of Endogamous Populations Identified by Each Tool

The following chart provides a guideline for which tools are useful for the following types of endogamous groups. Bolded tools require that both parents be descended from the same endogamous group, but several other tools give more definitive results with higher amounts of endogamy.

Y and mitochondrial DNA testing are not affected by admixture, autosomal DNA or anything from the “other” parent.

Tool | Jewish | Acadian | Anabaptist | Native | Other/General
Ethnicity | Yes | No | No | Yes | Pacific Islander
Ethnicity Painting | Yes | No | No | Yes | Pacific Islander
Ethnicity Phasing | Yes, if different | No | No | Yes, if different | Pacific Islander, if different
DNA Communities | Yes | Possibly | Possibly | Yes | Pacific Islander
Genetic Groups | Yes | Possibly | Possibly | Yes | Pacific Islander
Family Matching aka Bucketing | Yes | Yes | Possibly | Yes | Pacific Islander
Chromosome Browser | Possibly | Possibly | Yes, once segments or ancestors identified | Possibly | Pacific Islander, possibly
Total Matches | Yes, compared to non-endogamous | No | No | No | No, unknown
AutoClusters | Yes | Yes | Uncertain, probably | Yes | Pacific Islander
Estimated Relationships High | Not always | Sometimes | No | Sometimes | Uncertain, probably
Relationship Range High | Possibly, sometimes | Possibly | Possibly | Possibly | Pacific Islander, possibly
More, Smaller Segments | Yes | Yes | Probably | Yes | Pacific Islander, probably
Parents Related | Some but minimal | Possibly | Uncertain | Probably similar to Jewish | Uncertain, Possibly
Surnames | Probably | Probably | Probably Not | Possibly | Possibly
Locations | Possibly | Probably | Probably Not | Probably | Probably Pacific Islander
Projects | Probably | Probably | Possibly | Possibly | Probably Pacific Islander
Y DNA | Yes, often | Yes, often | No | Yes | Pacific Islander
Mitochondrial DNA | Yes, often | Sometimes | No | Yes | Pacific Islander
Y public tree | Probably not alone | No | No | Yes | Pacific Islander
MtDNA public tree | Probably not | No | No | Yes | Pacific Islander
Y DNA Discover | Yes | Possibly | Probably not, maybe projects | Yes | Pacific Islander

Summary of Endogamy Tools to Assist People Seeking Unknown Parents and Grandparents

This table provides a summary of when each of the various tools can be useful to:

  • People seeking unknown close relatives
  • People who already know who their close relatives are, but are seeking additional information or clues about their genealogy

I considered rating these on a 1 to 10 scale, but the relative usefulness of these tools is dependent on many factors, so different tools will be more or less useful to different people.

For example, ethnicity is very useful if someone is admixed from different populations, or even 100% of a specific endogamous population. It’s less useful if the tester is 100% European, regardless of whether they are seeking close relatives or not. Conversely, even “vanilla” ethnicity can be used to rule out majority or recent admixture with many populations.

Tools | Unknown Close Relative Seekers | Known Close Relatives – Enhance Genealogy
Ethnicity | Yes, to identify or rule out populations | Yes
Ethnicity Painting | Yes, possibly, depending on population | Yes, possibly, depending on population
Ethnicity Phasing | Yes, possibly, depending on population | Yes, possibly, depending on population
DNA Communities | Yes, possibly, depending on population | Yes, possibly, depending on population
Genetic Groups | Possibly, depending on population | Possibly, depending on population
Family Matching aka Bucketing | Not if parents are entirely unknown, but yes if one parent is known | Yes
Chromosome Browser | Unlikely | Yes
AutoClusters | Yes | Yes, especially at MyHeritage if Jewish
Estimated Relationships High | Not | No
Relationship Range High | Not reliably | No
More, Smaller Segments | Unlikely | Unlikely other than confirmation
Match List Download | Yes | Yes
Surnames | Yes | Yes
Locations | Yes | Yes
Projects | Yes | Yes
Y DNA | Yes, males only, direct paternal line, identifies surname lineage | Yes, males only, direct paternal line, identifies and correctly places surname lineage
Mitochondrial DNA | Yes, both sexes, direct matrilineal line only | Yes, both sexes, direct matrilineal line only
Public Y Tree | Yes for locations | Yes for locations
Public Mito Tree | Yes for locations | Yes for locations
Discover Y DNA | Yes, for heritage information | Yes, for heritage information
Parents Related – ROH | Possibly | Less useful

Acknowledgments

A HUGE thank you to several people who contributed images and information in order to provide accurate and expanded information on the topic of endogamy. Many did not want to be mentioned by name, but you know who you are!!!

If you have information to add, please post in the comments.


Top Ten RootsTech 2022 DNA Sessions + All DNA Session Links

The official dates of RootsTech 2022 were March 3-5, but the sessions and content in the vendor booths are still available. I’ve compiled a list of the sessions focused on DNA, with web links on the RootsTech YouTube channel.

YouTube reports the number of views, so I was able to compile that information as of March 8, 2022.

I do want to explain a couple of things to add context to the numbers.

Most speakers recorded their sessions, but a few offered live sessions which were recorded, then posted later for participants to view. However, there have been glitches in that process. While the sessions were anticipated to be available an hour or so later, that didn’t quite happen, and a couple still aren’t posted. I’m sure the presenters are distressed by this, so be sure to watch those when they are up and running.

The Zoom rooms where participants gathered for the live sessions were restricted to 500 attendees. The YouTube number of views does not include the number of live viewers, so you’ll need to add an additional number, up to 500.

When you see a number before the session name, whether recorded or live, that means that the session is part of a series. RootsTech required speakers to divide longer sessions into a series of shorter sessions no longer than 15-20 minutes each. The goal was for viewers to be able to watch the sessions one after the other, as one class, or separately, and still make sense of the content. Let’s just say this was the most challenging thing I’ve ever done as a presenter.

For recorded series sessions, these are posted as 1, 2 and 3, as you can see below with Diahan Southard’s sessions. However, with my live session series, that didn’t happen. It looks like my sessions are a series, but when you watch them, parts 1, 2 and 3 are recorded and presented as one session. Personally, I’m fine with this, because I think the information makes a lot more sense this way. However, it makes comparisons difficult.

This was only the second year for RootsTech to be virtual and the conference is absolutely HUGE, so live and learn. Next year will be smoother and hopefully, at least partially in-person too.

When I “arrived” to present my live session, “Associating Autosomal DNA Segments With Ancestors,” my lovely moderator, Rhett, told me that they were going to livestream my session to the RootsTech page on Facebook as well because they realized that the 500 Zoom seat limit had been a problem the day before with some popular sessions. I have about 9000 views for that session and more than 7,400 of them are on the RootsTech Facebook page – and that was WITHOUT any advance notice or advertising. I know that the Zoom room was full in addition. I felt kind of strange about including my results in the top ten because I had that advantage, but I didn’t know quite how to otherwise count my session. As it turns out, all sessions with more than 1000 views made it into the top ten so mine would have been there one way or another. A big thank you to everyone who watched!

I hope that the RootsTech team notices that the most viewed session is the one that was NOT constrained by the 500-seat limit AND was live-streamed on Facebook. Seems like this might be a great way to increase session views for everyone next year. Hint, hint!!!

I also want to say a huge thank you to all of the presenters for producing outstanding content. The sessions were challenging to find, plus RootsTech is always hectic, even virtually. So, I know a LOT of people will want to view these informative sessions, now that you know where to look and have more time. Please remember to “like” the session on YouTube as a way of thanking your presenter.

With 140 DNA-focused sessions available, you can watch a new session, and put it to use, every other day for the next year! How fun is that! You can use this article as your own playlist.

Please feel free to share this article with your friends and genealogy groups so everyone can learn more about using DNA for genealogy.

Ok, let’s look at the top 10. Drum roll please…

Top 10 Most Viewed RootsTech Sessions

Session Title Presenter YouTube Link Views
1 1. Associating Autosomal DNA Segments With Ancestors Roberta Estes (live) https://www.youtube.com/watch?v=_IHSCkNnX48 ~9000: 1019 + 500 live viewers + 7,400+ Facebook
2 1. What to Do with Your DNA Test Results in 2022 (part 1 of 3) Diahan Southard https://www.youtube.com/watch?v=FENAKAYLXX4 7428
3 Who Is FamilyTreeDNA? FamilyTreeDNA – Bennett Greenspan https://www.youtube.com/watch?v=MHFtwoatJ-A 2946
4 2. What to Do with Your DNA Test Results in 2022 (part 2 of 3) Diahan Southard https://www.youtube.com/watch?v=mIllhtONhlI 2448
5 Latest DNA Painter Releases DNAPainter Jonny Perl (live) https://www.youtube.com/watch?v=iLBThU8l33o 2230 + live viewers
6 DNA Painter Introduction DNAPainter – Jonny Perl https://www.youtube.com/watch?v=Rpe5LMPNmf0 1983
7 3. What to Do with Your DNA Test Results in 2022 (part 3 of 3) Diahan Southard https://www.youtube.com/watch?v=hemY5TuLmGI 1780
8 The Tree of Mankind Age Estimates Paul Maier https://www.youtube.com/watch?v=jjkL8PWAEwk 1638
9 A Sneak Peek at FamilyTreeDNA Coming Attractions FamilyTreeDNA (live) https://www.youtube.com/watch?v=K9sKqNScvnE 1270 + live viewers
10 Extending Time Horizons with DNA Rob Spencer (live) https://www.youtube.com/watch?v=wppXD1Zz2sQ 1037 + live viewers

All DNA-Focused Sessions

I know you’ll find LOTS of goodies here. Which ones are your favorites?

  Session Presenter YouTube Link Views
1 Estimating Relationships by Combining DNA from Multiple Siblings Amy Williams https://www.youtube.com/watch?v=xs1U0ohpKSA 201
2 Overview of HAPI-DNA.org Amy Williams https://www.youtube.com/watch?v=FjNiJgWaBeQ 126
3 How do AncestryDNA® Communities help tell your story? | Ancestry® Ancestry https://www.youtube.com/watch?v=EQNpUxonQO4 183
4 AncestryDNA® 201 Ancestry – Crista Cowan https://www.youtube.com/watch?v=lbqpnXloM5s 494
5 Genealogy in a Minute: Increase Discoveries by Attaching AncestryDNA® Results to Family Tree Ancestry – Crista Cowan https://www.youtube.com/watch?v=iAqwSCO8Pvw 369
6 AncestryDNA® 101: Beginner’s Guide to AncestryDNA® | Ancestry® Ancestry – Lisa Elzey https://www.youtube.com/watch?v=-N2usCR86sY 909
7 Hidden in Plain Sight: Free People of Color in Your Family Tree Cheri Daniels https://www.youtube.com/watch?v=FUOcdhO3uDM 179
8 Finding Relatives to Prevent Hereditary Cancer ConnectMyVariant – Dr. Brian Shirts https://www.youtube.com/watch?v=LpwLGgEp2IE 63
9 Piling on the chromosomes Debbie Kennett https://www.youtube.com/watch?v=e14lMsS3rcY 465
10 Linking Families With Rare Genetic Condition Using Genealogy Deborah Neklason https://www.youtube.com/watch?v=b94lUfeAw9k 43
11 1. What to Do with Your DNA Test Results in 2022 Diahan Southard https://www.youtube.com/watch?v=FENAKAYLXX4 7428
12 2. What to Do with Your DNA Test Results in 2022 Diahan Southard https://www.youtube.com/watch?v=mIllhtONhlI 2448
13 3. What to Do with Your DNA Test Results in 2022 Diahan Southard https://www.youtube.com/watch?v=hemY5TuLmGI 1780
14 DNA Testing For Family History Diahan Southard https://www.youtube.com/watch?v=kCLuOCC924s 84
15 Understanding Your DNA Ethnicity Estimate at 23andMe Diana Elder https://www.youtube.com/watch?v=xT1OtyvbVHE 66
16 Understanding Your Ethnicity Estimate at FamilyTreeDNA Diana Elder https://www.youtube.com/watch?v=XosjViloVE0 73
17 DNA Monkey Wrenches Katherine Borges https://www.youtube.com/watch?v=Thv79pmII5M 245
18 Advanced Features in your Ancestral Tree and Fan Chart DNAPainter – Jonny Perl https://www.youtube.com/watch?v=4u5Vf13ZoAc 425
19 DNA Painter Introduction DNAPainter – Jonny Perl https://www.youtube.com/watch?v=Rpe5LMPNmf0 1983
20 Getting Segment Data from 23andMe DNA Matches DNAPainter – Jonny Perl https://www.youtube.com/watch?v=8EBRI85P3KQ 134
21 Getting segment data from FamilyTreeDNA DNA matches DNAPainter – Jonny Perl https://www.youtube.com/watch?v=rWnxK86a12U 169
22 Getting segment data from Gedmatch DNA matches DNAPainter – Jonny Perl https://www.youtube.com/watch?v=WF11HEL8Apk 163
23 Getting segment data from Geneanet DNA Matches DNAPainter – Jonny Perl https://www.youtube.com/watch?v=eclj8Ap0uK4 38
24 Getting segment data from MyHeritage DNA matches DNAPainter – Jonny Perl https://www.youtube.com/watch?v=9rGwOtqbg5E 160
25 Inferred Chromosome Mapping: Maximize your DNA Matches DNAPainter – Jonny Perl https://www.youtube.com/watch?v=tzd5arHkv64 688
26 Keeping track of your genetic family tree in a fan chart DNAPainter – Jonny Perl https://www.youtube.com/watch?v=W3Hcno7en94 806
27 Mapping a DNA Match in a Chromosome Map DNAPainter – Jonny Perl https://www.youtube.com/watch?v=A61zQFBWaiY 423
28 Setting up an Ancestral Tree and Fan Chart and Exploring Tree Completeness DNAPainter – Jonny Perl https://www.youtube.com/watch?v=lkJp5Xk1thg 77
29 Using the Shared cM Project Tool to Evaluate DNA Matches DNAPainter – Jonny Perl https://www.youtube.com/watch?v=vxhn9l3Dxg4 763
30 Your First Chromosome Map: Using your DNA Matches to Link Segments to Ancestors DNAPainter – Jonny Perl https://www.youtube.com/watch?v=tzd5arHkv64 688
31 DNA Painter for absolute beginners DNAPainter (Jonny Perl) https://www.youtube.com/watch?v=JwUWW4WHwhk 1196
32 Latest DNA Painter Releases DNAPainter (live) https://www.youtube.com/watch?v=iLBThU8l33o 2230 + live viewers
33 Unraveling your genealogy with DNA segment networks using AutoSegment from Genetic Affairs Evert-Jan Blom https://www.youtube.com/watch?v=rVpsJSqOJZI 162
34 Unraveling your genealogy with genetic networks using AutoCluster Evert-Jan Blom https://www.youtube.com/watch?v=ZTKSz_X7_zs 201
35 Unraveling your genealogy with reconstructed trees using AutoTree & AutoKinship from Genetic Affairs Evert-Jan Blom https://www.youtube.com/watch?v=OmDQoAn9tVw 143
36 Research Like a Pro with DNA – A Genealogist’s Guide to Finding and Confirming Ancestors with DNA Family Locket Genealogists https://www.youtube.com/watch?v=NYpLscJJQyk 183
37 How to Interpret a DNA Network Graph Family Locket Genealogists – Diana Elder https://www.youtube.com/watch?v=i83WRl1uLWY 393
38 Find and Confirm Ancestors with DNA Evidence Family Locket Genealogists – Nicole Dyer https://www.youtube.com/watch?v=DGLpV3aNuZI 144
39 How To Make A DNA Network Graph Family Locket Genealogists – Nicole Dyer https://www.youtube.com/watch?v=MLm_dVK2kAA 201
40 Create A Family Tree With Your DNA Matches-Use Lucidchart To Create A Picture Worth A Thousand Words Family Locket Genealogists – Robin Wirthlin https://www.youtube.com/watch?v=RlRIzcW-JI4 270
41 Charting Companion 7 – DNA Edition Family Tree Maker https://www.youtube.com/watch?v=k2r9rkk22nU 316
42 Family Finder Chromosome Browser: How to Use FamilyTreeDNA https://www.youtube.com/watch?v=w0_tgopBn_o 750
43 FamilyTreeDNA: 22 Years of Breaking Down Brick Walls FamilyTreeDNA https://www.familysearch.org/rootstech/session/familytreedna-22-years-of-breaking-down-brick-walls Not available
44 Review of Autosomal DNA, Y-DNA, & mtDNA FamilyTreeDNA  – Janine Cloud https://www.youtube.com/watch?v=EJoQVKxgaVY 77
45 Who Is FamilyTreeDNA? FamilyTreeDNA – Bennett Greenspan https://www.youtube.com/watch?v=MHFtwoatJ-A 2946
46 Part 1: How to Interpret Y-DNA Results, A Walk Through the Big Y FamilyTreeDNA – Casimir Roman https://www.youtube.com/watch?v=ra1cjGgvhRw 684
47 Part 2: How to Interpret Y-DNA Results, A Walk Through the Big Y FamilyTreeDNA – Casimir Roman https://www.youtube.com/watch?v=CgqcjBD6N8Y 259
48 Big Y-700: A Brief Overview FamilyTreeDNA – Janine Cloud https://www.youtube.com/watch?v=IefUipZcLCQ 96
49 Mitochondrial DNA & The Million Mito Project FamilyTreeDNA – Janine Cloud https://www.youtube.com/watch?v=5Zppv2uAa6I 179
50 Mitochondrial DNA: What is a Heteroplasmy FamilyTreeDNA – Janine Cloud https://www.youtube.com/watch?v=ZeGTyUDKySk 57
51 Y-DNA Big Y: A Lifetime Analysis FamilyTreeDNA – Janine Cloud https://www.youtube.com/watch?v=E6NEU92rpiM 154
52 Y-DNA: How SNPs Are Added to the Y Haplotree FamilyTreeDNA – Janine Cloud https://www.youtube.com/watch?v=CGQaYcroRwY 220
53 Family Finder myOrigins: Beginner’s Guide FamilyTreeDNA – Katy Rowe https://www.youtube.com/watch?v=VrJNpSv8nlA 88
54 Mitochondrial DNA: Matches Map & Results for mtDNA FamilyTreeDNA – Katy Rowe https://www.youtube.com/watch?v=YtA1j01MOvs 190
55 Mitochondrial DNA: mtDNA Mutations Explained FamilyTreeDNA – Katy Rowe https://www.youtube.com/watch?v=awPs0cmZApE 340
56 Y-DNA: Haplotree and SNPs Page Overview FamilyTreeDNA – Katy Rowe https://www.youtube.com/watch?v=FOuVhoMD-hw 432
57 Y-DNA: Understanding the Y-STR Results Page FamilyTreeDNA – Katy Rowe https://www.youtube.com/watch?v=gCeZz1rQplI 148
58 Y-DNA: What Is Genetic Distance? FamilyTreeDNA – Katy Rowe https://www.youtube.com/watch?v=qJ6wY6ILhfg 149
59 DNA Tools: myOrigins 3.0 Explained, Part 1 FamilyTreeDNA – Paul Maier https://www.youtube.com/watch?v=ACgY3F4-w78 74
60 DNA Tools: myOrigins 3.0 Explained, Part 2 FamilyTreeDNA – Paul Maier https://www.youtube.com/watch?v=h7qU36bIFg0 50
61 DNA Tools: myOrigins 3.0 Explained, Part 3 FamilyTreeDNA – Paul Maier https://www.youtube.com/watch?v=SWlGPm8BGyU 36
62 African American Genealogy Research Tips FamilyTreeDNA – Sherman McRae https://www.youtube.com/watch?v=XdbkM58rXIQ 153
63 Connecting With My Ancestors Through Y-DNA FamilyTreeDNA – Sherman McRae https://www.youtube.com/watch?v=xbo1XnLkuQU 200
64 Join The Million Mito Project FamilyTreeDNA (Join link) https://www.familysearch.org/rootstech/session/join-the-million-mito-project link
65 View the World’s Largest mtDNA Haplotree FamilyTreeDNA (Link to mtDNA tree) https://www.familytreedna.com/public/mt-dna-haplotree/L n/a
66 View the World’s Largest Y Haplotree FamilyTreeDNA (Link to Y tree) https://www.familytreedna.com/public/y-dna-haplotree/A link
67 A Sneak Peek at FamilyTreeDNA Coming Attractions FamilyTreeDNA (live) https://www.youtube.com/watch?v=K9sKqNScvnE 1270 + live viewers
68 DNA Upload: How to Transfer Your Autosomal DNA Data FamilyTreeDNA – Katy Rowe https://www.youtube.com/watch?v=CS-rH_HrGlo 303
69 Family Finder myOrigins: How to Compare Origins With Your DNA Matches FamilyTreeDNA – Katy Rowe https://www.youtube.com/watch?v=7mBmWhM4j9Y 145
70 Join Group Projects at FamilyTreeDNA FamilyTreeDNA (link to learning center article) https://www.familysearch.org/rootstech/session/join-group-projects-at-familytreedna link
71 Product Demo – Unraveling your genealogy with reconstructed trees using AutoKinship GEDmatch https://www.youtube.com/watch?v=R7_W0FM5U7c 803
72 Towards a Genetic Genealogy Driven Irish Reference Genome Gerard Corcoran https://www.youtube.com/watch?v=6Kx8qeNiVmo 155
73 Discovering Biological Origins in Chile With DNA: Simple Triangulation Gonzalo Alexis Luengo Orellana https://www.youtube.com/watch?v=WcVby54Uigc 40
74 Cousin Lynne: An Adoption Story International Association of Jewish Genealogical Societies https://www.youtube.com/watch?v=AptMcV4_B4o 111
75 Using DNA Testing to Uncover Native Ancestry Janine Cloud https://www.youtube.com/watch?v=edzebJXepMA 205
76 1. Forensic Genetic Genealogy Jarrett Ross https://www.youtube.com/watch?v=0euIDZTmx5g 58
77 Reunited and it Feels so Good Jennifer Mendelsohn https://www.youtube.com/watch?v=X-hxjm7grBE 57
78 Genealogical Research and DNA Testing: The Perfect Companions Kimberly Brown https://www.youtube.com/watch?v=X82jA3xUVXk 80
79 Finding a Jewish Sperm Donor Kitty Munson Cooper https://www.youtube.com/watch?v=iKRjFfNcpug 164
80 Using DNA in South African Genealogy Linda Farrell https://www.youtube.com/watch?v=HXkbBWmORM0 141
81 Using DNA Group Projects In Your Family History Research Mags Gaulden https://www.youtube.com/watch?v=0tX7QDib4Cw 165
82 2. The Expansion of Genealogy Into Forensics Marybeth Sciaretta https://www.youtube.com/watch?v=HcEO-rMe3Xo 35
83 DNA Interest Groups That Keep ’em Coming Back McKell Keeney (live) https://www.youtube.com/watch?v=HFwpmtA_QbE 180 plus live viewers
84 Searching for Close Relatives with Your DNA Results Mckell Keeney (live) https://www.familysearch.org/rootstech/session/searching-for-close-relatives-with-your-dna-results Not yet available
85 Top Ten Reasons To DNA Test For Family History Michelle Leonard https://www.youtube.com/watch?v=1B9hEeu_dic 181
86 Top Tips For Identifying DNA Matches Michelle Leonard https://www.youtube.com/watch?v=-3Oay_btNAI 306
87 Maximising Messages Michelle Patient https://www.youtube.com/watch?v=4TRmn0qzHik 442
88 How to Filter and Sort Your DNA Matches MyHeritage https://www.youtube.com/watch?v=fmIgamFDvc8 88
89 How to Get Started with Your DNA Matches MyHeritage https://www.youtube.com/watch?v=JPOzhTxhU0E 447
90 How to Track DNA Kits in MyHeritage MyHeritage https://www.youtube.com/watch?v=2W0zBbkBJ5w 28
91 How to Upload Your DNA Data to MyHeritage MyHeritage https://www.youtube.com/watch?v=nJ4RoZOQafY 82
92 How to Use Genetic Groups MyHeritage https://www.youtube.com/watch?v=PtDAUHN-3-4 62
My Story: Hope MyHeritage https://www.youtube.com/watch?v=qjyggKZEXYA 133
93 MyHeritage Keynote, RootsTech 2022 MyHeritage https://www.familysearch.org/rootstech/session/myheritage-keynote-rootstech-2022 Not available
94 Using Labels to Name Your DNA Match List MyHeritage https://www.youtube.com/watch?v=enJjdw1xlsk 139
95 An Introduction to DNA on MyHeritage MyHeritage – Daniel Horowitz https://www.youtube.com/watch?v=1I6LHezMkgc 60
96 Using MyHeritage’s Advanced DNA Tools to Shed Light on Your DNA Matches MyHeritage – Daniel Horowitz https://www.youtube.com/watch?v=Pez46Xw20b4 110
97 You’ve Got DNA Matches! Now What? MyHeritage – Daniel Horowitz https://www.youtube.com/watch?v=gl3UVksA-2E 260
98 My Story: Lizzie and Ayla MyHeritage – Elizbeth Shaltz https://www.youtube.com/watch?v=NQv6C8G39Kw 147
99 My Story: Fernando and Iwen MyHeritage – Fernando Hermansson https://www.youtube.com/watch?v=98-AR0M7fFE 165
100 Using the Autocluster and the Chromosome Browser to Explore Your DNA Matches MyHeritage – Gal Zruhen https://www.youtube.com/watch?v=a7aQbfP7lWU 115
101 My Story : Kara Ashby Utah Wedding MyHeritage – Kara Ashby https://www.youtube.com/watch?v=Qbr_gg1sDRo 200
102 When Harry Met Dotty – using DNA to break down brick walls Nick David Barratt https://www.youtube.com/watch?v=8SdnLuwWpJs 679
103 How to Add a DNA Match to Airtable Nicole Dyer https://www.youtube.com/watch?v=oKxizWIOKC0 161
104 How to Download DNA Match Lists with DNAGedcom Client Nicole Dyer https://www.youtube.com/watch?v=t9zTWnwl98E 124
105 How to Know if a Matching DNA Segment is Maternal or Paternal Nicole Dyer https://www.youtube.com/watch?v=-zd5iat7pmg 161
106 DNA Basics Part I Centimorgans and Family Relationships Origins International, Inc. dba Origins Genealogy https://www.youtube.com/watch?v=SI1yUdnSpHA 372
107 DNA Basics Part II Clustering and Connecting Your DNA Matches Origins International, Inc. dba Origins Genealogy https://www.youtube.com/watch?v=ECs4a1hwGcs 333
108 DNA Basics Part III Charting Your DNA Matches to Get Answers Origins International, Inc. dba Origins Genealogy https://www.youtube.com/watch?v=qzybjN0JBGY 270
109 2. Using Cluster Auto Painter Patricia Coleman https://www.youtube.com/watch?v=-nfLixwxKN4 691
110 3. Using Online Irish Records Patricia Coleman https://www.youtube.com/watch?v=mZsB0l4z4os 802
111 Exploring Different Types of Clusters Patricia Coleman https://www.youtube.com/watch?v=eEZBFPC8aL4 972
112 The Million Mito Project: Growing the Family Tree of Womankind Paul Maier https://www.youtube.com/watch?v=cpctoeKb0Kw 541
113 The Tree of Mankind Age Estimates Paul Maier https://www.youtube.com/watch?v=jjkL8PWAEwk 1638
114 Y-DNA and Mitochondrial DNA Testing Plans Paul Woodbury https://www.youtube.com/watch?v=akymSm0QKaY 168
115 Finding Biological Family Price Genealogy https://www.youtube.com/watch?v=4xh-r3hZ6Hw 137
116 What Y-DNA Testing Can Do for You Richard Hill https://www.youtube.com/watch?v=a094YhIY4HU 191
117 Extending Time Horizons with DNA Rob Spencer (live) https://www.youtube.com/watch?v=wppXD1Zz2sQ 1037 + live viewers
118 DNA for Native American Ancestry by Roberta Estes Roberta Estes https://www.youtube.com/watch?v=EbNyXCFfp4M 212
119 1. Associating Autosomal DNA Segments With Ancestors Roberta Estes (live) https://www.youtube.com/watch?v=_IHSCkNnX48 ~9000: 1019 + 500 live viewers + 7,400+ Facebook
120 1. What Can I Do With Ancestral DNA Segments? Roberta Estes (live) https://www.youtube.com/watch?v=Suv3l4iZYAQ 325 plus live viewers
121 Native American DNA – Ancient and Contemporary Maps Roberta Estes (live) https://www.youtube.com/watch?v=dFTl2vXUz_0 212 plus 483 live viewers
122 How Can DNA Enhance My Family History Research? Robin Wirthlin https://www.youtube.com/watch?v=f3KKW-U2P6w 102
123 How to Analyze a DNA Match Robin Wirthlin https://www.youtube.com/watch?v=LTL8NbpROwM 367
124 1. Jewish Ethnicity & DNA: History, Migration, Genetics Schelly Talalay Dardashti https://www.youtube.com/watch?v=AIJyphGEZTA 82
125 2. Jewish Ethnicity & DNA: History, Migration, Genetics Schelly Talalay Dardashti https://www.youtube.com/watch?v=VM3MCYM0hkI 72
126 Ask us about DNA Talking Family History (live) https://www.youtube.com/watch?v=kv_RfR6OPpU 96 plus live viewers
127 1. An Introduction to Visual Phasing Tanner Blair Tolman https://www.youtube.com/watch?v=WNhErW5UVKU 183
128 2. An Introduction to Visual Phasing Tanner Blair Tolman https://www.youtube.com/watch?v=CRpQ8EVOShI 110
129 Common Problems When Doing Visual Phasing Tanner Blair Tolman https://www.youtube.com/watch?v=hzFxtBS5a8Y 68
130 Cross Visual Phasing to Go Back Another Generation Tanner Blair Tolman https://www.youtube.com/watch?v=MrrMqhfiwbs 64
131 DNA Basics Tanner Blair Tolman https://www.youtube.com/watch?v=OCMUz-kXNZc 155
132 DNA Painter and Visual Phasing Tanner Blair Tolman https://www.youtube.com/watch?v=2-eh1L4wOmQ 155
133 DNA Painter Part 2: Chromosome Mapping Tanner Blair Tolman https://www.youtube.com/watch?v=zgOJDRG7hJc 172
134 DNA Painter Part 3: The Inferred Segment Generator Tanner Blair Tolman https://www.youtube.com/watch?v=96ai8nM4lzo 100
135 DNA Painter Part 4: The Distinct Segment Generator Tanner Blair Tolman https://www.youtube.com/watch?v=Pu-WIEQ_8vc 83
136 DNA Painter Part 5: Ancestral Trees Tanner Blair Tolman https://www.youtube.com/watch?v=dkYDeFLduKA 73
137 Understanding Your DNA Ethnicity Results Tanner Blair Tolman https://www.youtube.com/watch?v=4tAd8jK6Bgw 518
138 What’s New at GEDmatch Tim Janzen https://www.youtube.com/watch?v=AjA59BG_cF4 515
139 What Does it Mean to Have Neanderthal Ancestry? Ugo Perego https://www.youtube.com/watch?v=DshCKDW07so 190
140 Big Y-700 Your DNA Guide https://www.youtube.com/watch?v=rIFC69qswiA 143
141 Next Steps with Your DNA Your DNA Guide – Diahan Southard (live) https://www.familysearch.org/rootstech/session/next-steps-with-your-dna Not yet available

Additions:

142 Adventures of an Amateur Genetic Genealogist Geoff Nelson https://www.familysearch.org/rootstech/session/adventures-of-an-amateur-genetic-genealogist 291

____________________________________________________________

Sign Up Now – It’s Free!

If you enjoyed this article, subscribe to DNAeXplain for free, to automatically receive new articles by email each week.

Here’s the link. Just look for the little grey “follow” button on the right-hand side on your computer screen below the black title bar, enter your e-mail address, and you’re good to go!

In case you were wondering, I never have nor ever will share or use your e-mail outside of the intended purpose.

_____________________________________________________________

Follow DNAexplain on Facebook, here or follow me on Twitter, here.

Share the Love!

You’re always welcome to forward articles or links to friends and share on social media.

If you haven’t already subscribed (it’s free,) you can receive an email whenever I publish by clicking the “follow” button on the main blog page, here.

You Can Help Keep This Blog Free

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Uploads

Genealogy Products and Services

My Book

Genealogy Books

Genealogy Research

2021 Favorite Articles

It’s that time of the year again when we welcome the next year.

2021 was markedly different than anything that came before. (Is that ever an understatement!)

Maybe you had more time for genealogy and spent time researching!

So, what did we read in 2021? Which of my blog articles were the most popular?

In reverse order, beginning with number 10, we have:

This timeless article published in 2015 explains how to calculate the amount of any specific heritage you carry based on your ancestors.

Just something fun that’s like your regular pedigree chart, except with color-coded locations instead of ancestors. Here’s mine.

The Autosegment Triangulation Cluster Tool is a brand new tool introduced in October 2021. Created by Genetic Affairs for GEDmatch, this tool combines autoclusters and triangulation.

Many people don’t realize that we actually don’t inherit exactly 25% of our DNA from each grandparent, nor why.

This enlightening article co-authored with statistician Philip Gammon explains how this works, and why it affects all of your matches.

Who doesn’t love learning about ancient DNA and the messages it conveys? Does your Y or mitochondrial DNA match any of these burials? Take a look. You might be surprised.

How can you tell if you are full or half siblings with another person? You might think this is a really straightforward question with an easy answer, but it isn’t. And trust me, if you EVER find yourself in a position of needing to know, you really need to know urgently.

Using simple math, it’s easy to figure how much of your ancestor’s DNA you “should” have, but that’s not how inheritance actually works. This article explains why and shows different inheritance scenarios.

That 28 day timer has expired, but the article can still be useful in terms of educating yourself. This should also be read in conjunction with Ancestry Retreats, by Judy Russell.

If I had a dollar for every time I’ve heard someone say that their ethnicity percentages were “wrong,” I’d be a rich woman, living in a villa in sun-drenched Tuscany😊

This extremely popular article has either been first or second every year since it was published. Ethnicity is both exciting and perplexing.

As genealogists, the first thing we need to do is to calculate what, according to our genealogy, we would expect those percentages to be. Of course, we also need to factor in the fact that we don’t inherit exactly the same amount of DNA from each grandparent. I explain how I calculated my “expected” percentages of ethnicity based on my known tree. That’s the best place to start.
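For readers who like to see the arithmetic, here is a minimal sketch of that kind of “expected” calculation. It is an illustration only, not the spreadsheet method from the original article. It assumes a hypothetical tree where every ancestor is tagged with a single region and where an ancestor n generations back contributes exactly 1/2^n of your DNA. As noted above, real inheritance is not that tidy, so treat the output as a starting estimate. The function name and the sample tree are made up for this example.

```python
from collections import defaultdict
from fractions import Fraction


def expected_percentages(ancestors):
    """Sum the paper-genealogy share for each region.

    ancestors: list of (generations_back, region) tuples.
    Assumption: an ancestor n generations back contributes 1 / 2**n.
    """
    totals = defaultdict(Fraction)
    for generations_back, region in ancestors:
        totals[region] += Fraction(1, 2 ** generations_back)
    # Convert exact fractions to percentages for display.
    return {region: float(share * 100) for region, share in totals.items()}


# Hypothetical tree: eight great-grandparents (3 generations back).
great_grandparents = (
    [(3, "Germany")] * 4 + [(3, "England")] * 3 + [(3, "Netherlands")]
)
result = expected_percentages(great_grandparents)
print(result)
```

Comparing numbers like these against each vendor’s estimate is one way to see where the estimates and your paper trail agree or diverge.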

Please note that I am no longer updating the vendor comparison charts in the article. Some vendors no longer release updates to the entire database at the same time, and some “tweak” results periodically without making an announcement. You’ll need to compare your own results at the different vendors at the same point in time to avoid comparing apples and oranges.

The #1 Article for 2021 is…

  1. Proving Native American Ancestry Using DNA

This article has either been first (7 times) or second (twice) for 9 years running. Now you know why I chose this topic for my new book, DNA for Native American Genealogy.

If you’re searching for your Native American ancestry, I’ve provided step-by-step instructions, both with and without some percentage of Native showing in your autosomal DNA percentages.

Make 2022 a Great Year!

Here’s wishing you the best in 2022. I hope your brick walls cave. What are you doing to help that along? Do you have a strategy in mind?


DNA for Native American Genealogy – Hot Off the Press!

Drum roll please…my new book, DNA for Native American Genealogy, was just released today, published by Genealogical.com.

I’m so excited! I expected publication around the holidays. What a pleasant surprise.

This 190-page book has been a labor of love, almost a year in the making. There’s a lot.

  • Vendor Tools – The book incorporates information about how to make the best use of the autosomal DNA tools offered by all 4 of the major testing vendors: FamilyTreeDNA, MyHeritage, Ancestry, and 23andMe.
  • Chromosome Painting – I’ve detailed how to use DNAPainter to identify which ancestor(s) your Native heritage descends from by painting your population/ethnicity segments provided by FamilyTreeDNA and 23andMe.
  • Y and Mitochondrial DNA – I’ve described how and when to utilize the important Y and mitochondrial DNA tests, for you and other family members.
  • Maps – Everyone wants to know about ancient DNA. I’ve included ancient DNA information complete with maps of ancient DNA sites by major Native haplogroups, gathered from many academic papers, as well as mapped contemporary DNA locations.
  • Haplogroups – Locations in the Americas, by haplogroup, where individual haplogroups and subgroups are found. Some haplogroups are regional in nature. If you happen to have one of these haplogroups, that’s a BIG HINT about where your ancestor lived.
  • Tribes – Want to know, by tribe, which haplogroups have been identified? Got you covered there too.
  • Checklist – I’ve provided a checklist type of roadmap for you to follow, along with an extensive glossary.
  • Questions – I’ve answered lots of frequently asked questions. For example – what about joining a tribe? I’ve explained how tribes work in the US and Canada, complete with links for relevant forms and further information.

But wait, there’s more…

New Revelations!!!

There is scientific evidence suggesting that two haplogroups not previously identified as Native are actually found in very low frequencies in the Native population. Not only do I describe these haplogroups, but I provide their locations on a map.

I hope other people will test and come forward with similar results in these same haplogroups to further solidify this finding.

It’s important to understand the criteria required for including these haplogroups as (potentially) Native. In general, they:

  • Must be found multiple times outside of a family group
  • Must be unexplained by any other scenario
  • Must be well-documented both genetically as well as using traditional genealogical records
  • Must be otherwise absent in the surrounding populations

This part of the research for the book was absolutely fascinating to me.

Description

Here’s the book description at Genealogical.com:

DNA for Native American Genealogy is the first book to offer detailed information and advice specifically aimed at family historians interested in fleshing out their Native American family tree through DNA testing.

Figuring out how to incorporate DNA testing into your Native American genealogy research can be difficult and daunting. What types of DNA tests are available, and which vendors offer them? What other tools are available? How is Native American DNA determined or recognized in your DNA? What information about your Native American ancestors can DNA testing uncover? This book addresses those questions and much more.

Included are step-by-step instructions, with illustrations, on how to use DNA testing at the four major DNA testing companies to further your genealogy and confirm or identify your Native American ancestors. Among the many other topics covered are the following:

    • Tribes in the United States and First Nations in Canada
    • Ethnicity
    • Chromosome painting
    • Population Genetics and how ethnicity is assigned
    • Genetic groups and communities
    • Y DNA paternal direct line male testing for you and your family members
    • Mitochondrial DNA maternal direct line testing for you and your family members
    • Autosomal DNA matching and ethnicity comparisons
    • Creating a DNA pedigree chart
    • Native American haplogroups, by region and tribe
    • Ancient and contemporary Native American DNA

Special features include numerous charts and maps; a roadmap and checklist giving you clear instructions on how to proceed; and a glossary to help you decipher the technical language associated with DNA testing.

Purchase the Book and Participate

I’ve included answers to questions that I’ve received repeatedly for many years about Native American heritage and DNA. Why Native DNA might show in your DNA, why it might not – along with alternate ways to seek that information.

You can order DNA for Native American Genealogy, here.

For customers in Canada and outside the US, you can use the Amazon link, here, to reduce the high shipping/customs costs.

I hope you’ll use the information in the book to determine the appropriate tests for your situation and fully utilize the tools available to genealogists today to either confirm those family rumors, put them to rest – or maybe discover a previously unknown Native ancestor.

Please feel free to share this article with anyone who might be interested.


A Triangulation Checklist Born From the Question; “Why NOT Use Close Relatives for Triangulation?”

One of my readers asked why we don’t use close relatives for triangulation.

This is a great question because not using close relatives for triangulation seems counter-intuitive.

I used to ask my kids and eventually my students and customers if they wanted the quick short answer or the longer educational answer.

The short answer is “because close relatives are too close to reliably form the third leg of the triangle.” Since you share so much DNA with close relatives, someone matching you who is identical by chance can also match them for exactly the same reason.

If you trust me and you’re good with that answer, wonderful. But I hope you’ll keep reading because there’s so much to consider, not to mention a few gotchas. I’ll share my methodology, techniques, and workarounds.

We’ll also discuss absolutely wonderful ways to utilize close relatives in the genetic genealogical process – just not for triangulation.

At the end of this article, I’ve provided a working triangulation checklist for you to use when evaluating your matches.

Let’s go!

The Step-by-Step Educational Answer😊

Some people see “evidence” they believe conflicts with the concept that you should not use close relatives for triangulation. I understand that, because I’ve gone down that rathole too, so I’m providing the “educational answer” that explains exactly WHY you should not use close relatives for triangulation – and what you should do.

Of course, we need to answer the question, “Who actually are close relatives?”

I’ll explain the best ways to utilize close relatives in genetic genealogy, and why some matches are deceptive.

You’ll need to understand the underpinnings of DNA inheritance and also of how the different vendors handle DNA matching behind the scenes.

The purpose of autosomal DNA triangulation is to confirm that a segment is passed down from a particular ancestor to you and a specific set of your matches.

Triangulation, of course, implies 3, so at least three people must all match each other on a reasonably sized portion of the same DNA segment for triangulation to occur.
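To make the "all three must match each other on the same stretch" requirement concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not any vendor's method: the function name `triangulates`, the segment positions, and the `min_overlap` value of 7 are all hypothetical placeholders, since "reasonably sized" is not given a number here.

```python
from itertools import combinations

def triangulates(pair_segments, people, min_overlap):
    """True if every pair in `people` matches and all of the pairwise
    segments share a common stretch at least `min_overlap` long."""
    segments = []
    for a, b in combinations(people, 2):
        seg = pair_segments.get(frozenset((a, b)))
        if seg is None:          # one pair does not match at all
            return False
        segments.append(seg)
    common_start = max(start for start, _ in segments)
    common_end = min(end for _, end in segments)
    return common_end - common_start >= min_overlap

# Hypothetical (start, end) positions on one chromosome, in made-up units.
pair_segments = {
    frozenset(("you", "cousin_1")): (10, 40),
    frozenset(("you", "cousin_2")): (15, 45),
    frozenset(("cousin_1", "cousin_2")): (12, 38),
}

print(triangulates(pair_segments, ["you", "cousin_1", "cousin_2"], min_overlap=7))
```

If the cousin_1 to cousin_2 entry is missing, the function returns False. You would still match each cousin, but with only two legs there is no triangle.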

Matching just one person only provides you with one path to that common ancestor. It’s possible that you match that person due to a different ancestor that you aren’t aware of, or due to chance recombination of DNA.

It’s possible that you or your match inherited part of that DNA from the maternal side and part from the paternal side, meaning that you are matching that other person’s DNA by chance.

I wrote about identical by descent (IBD), which is an accurate genealogically meaningful match, and identical by chance (IBC) which is a false match, in the article Concepts – Identical by…Descent, State, Population and Chance.

I really want you to understand why close relatives really shouldn’t be used for triangulation, and HOW close relative matches should be used, so we’re going to discuss all of the factors that affect and influence this topic – both the obvious and little-understood.

  • Legitimate Matches
  • Inheritance and Triangulation
  • Parental Cross-Matching
  • Parental Phasing
  • Automatic Phasing at FamilyTreeDNA
  • Parental Phasing Caveats
  • Pedigree Collapse
  • Endogamy
  • How Many Identical-by-Chance Matches Will I Have?
  • DNA Doesn’t Skip Generations (Seriously, It Doesn’t)
  • Your Parents Have DNA That You Don’t (And How to Use It)
  • No DNA Match Doesn’t Mean You’re Not Related
  • Imputation
  • Ancestry Issues and Workarounds
  • Testing Close Relatives is VERY Useful – Just Not for Triangulation
  • Triangulated Matches
  • Building Triangulation Evidence – Ingredients and a Recipe
  • Aunts/Uncles
  • Siblings
  • How False Positives Work and How to Avoid Them
  • Distant Cousins Are Best for Triangulation & Here’s Why
  • Where Are We? A Triangulation Checklist for You!
  • The Bottom Line

Don’t worry, these sections are logical and concise. I considered making this into multiple articles, but I really want it in one place for you. I’ve created lots of graphics with examples to help out.

Let’s start by dispelling a myth.

DNA Doesn’t Skip Generations!

Recently, someone emailed to let me know that they had “stopped listening to me” in a presentation when I said that if a match did not also match one of your parents, it was a false match. That person informed me that they had worked on their tree for three years at Ancestry and they have “proof” of DNA skipping generations.

Nope, sorry. That really doesn’t happen, but a person who doesn’t understand either how DNA works or how the vendor they are using presents DNA results could misunderstand or misinterpret the results.

You can watch my RootsTech session, DNA Triangulation: What, Why and How, for free here. I’m thrilled that this session is now being used in courses at two different universities.

DNA really doesn’t skip generations. You CANNOT inherit DNA that your parents didn’t have.

Full stop.

Your children cannot inherit DNA from you that you don’t carry. If you don’t have that DNA, your children and their descendants can’t have it either, at least not from you. They of course do inherit DNA from their other parent.

I think historically, the “skipping generations” commentary was connected to traits. For example, Susie has dimples (or whatever) and so did her maternal grandmother, but her mother did not, so Susie’s dimples were said to have “skipped a generation.” Of course, we don’t know anything about Susie’s other grandparents, if Susie’s parents share ancestors, recessive/dominant genes or even how many genetic locations are involved with the inheritance of “dimples,” but I digress.

DNA skipping generations is a fallacy.

You cannot legitimately match someone that your parent does not, at least not through that parent’s side of the tree.

But here’s the caveat. You can’t legitimately match someone that neither of your parents matches, with the rare exception of:

  • Relatively recent pedigree collapse that occurs when you have the same ancestors on both sides of your tree, meaning your parents are related, AND
  • The process of recombination just happened to split and recombine a segment of DNA in segments too small for your match to match your parents individually, but large enough when recombined to match you.

We’ll talk about that more in a minute.

However, the person working with Ancestry trees can’t make this determination because Ancestry doesn’t provide segment information. Ancestry also handles DNA differently than other vendors, which we’ll also discuss shortly.

We’ll review all of this, but let’s start at the beginning and explain how to determine if our matches are legitimate, or not.

Legitimate Matches

Legitimate matches occur when the DNA of your ancestor is passed from that ancestor to their descendants, and eventually to you and a match in an unbroken pathway.

Unbroken means that every ancestor between you and that ancestor carried and then passed on the segment of the ancestor’s DNA that you carry today. The same is true for your match who carries the same segment of DNA from your common ancestor.

False positive matches occur when the DNA of a male and female combine randomly to look like a legitimate match to someone else.

Thankfully, there are ways to tell the difference.

Inheritance and Triangulation

Remember, you inherit two copies of each of your chromosomes 1-22, one copy from your mother and one from your father. You inherit half of the DNA that each parent carries, but it’s mixed together in you so the labs can’t readily tell which nucleotide, A, C, T, or G you received from which parent. I’m showing your maternal and paternal DNA in the graphic below, stacked neatly together in a column – but in reality, it could be AC in one position and CA in the next.

For matching, all that matters is that the nucleotide that matches your match is present in one of those two locations. In this case, A for your mother’s side and C for your father’s side. If you’re interested, you can read more about that in the article, Hit a Genealogy Home Run Using Your Double-Sided Two-Faced Chromosomes While Avoiding Imposters.

You can see in this example that you inherited all As from your Mom and all Cs from your Dad.

  • A legitimate maternal match would match you on all As on this particular example segment.
  • A legitimate paternal match would match you on all Cs on this particular segment.
  • A false positive match will match you on some random combination of As and Cs that make it look like they match you legitimately, but they don’t.
  • A false positive match will NOT match either your mother or your father.

To be very clear, technically a false positive match DOES match your DNA – but they don’t match your DNA because you share a common ancestor with your match. They match you because random recombination on their side causes you to match each other by chance.

In other words, if part of your DNA came from your Mom’s side and part from your Dad’s but it randomly fell in the correct positional order, you’d still match someone whose DNA was from only their mother or father’s side. That’s exactly the situation shown above and below.

Looking at our example again, it’s evident that your identical by chance (IBC) match’s A locations (1, 3, 5, 7 & 9) will match your Mom. C locations (2, 4, 6, 8 & 10) will match your Dad, but the nonmatching segments interleaved in-between that match alternating parents will prevent your match from matching either of your parents. In other words, out of 10 contiguous locations in our example, your IBC match has 5 As alternated with 5 Cs, so they won’t match either of your parents who have 10 As or 10 Cs in a row.

This recombination effect can work in either direction. Either or both matching people’s DNA could be randomly mixed causing them to match each other, but not their parents.

Regardless of whose DNA is zigzagging back and forth between maternal and paternal, the match is not genealogical and does not confirm a common ancestor.
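The zigzag effect can be sketched in a few lines of Python. This is a toy model of the 10-position example, not any vendor's matching algorithm. The function name `half_identical`, the simplification that each parent carries the same letter on both copies, and the filler letter G are all assumptions made for illustration.

```python
# Toy model of the 10-position example; not a vendor matching algorithm.

def half_identical(person_a, person_b):
    """True if, at every position, the two people share at least one nucleotide."""
    return all(set(a) & set(b) for a, b in zip(person_a, person_b))

POSITIONS = 10

# Assumption: each parent carries the same letter on both copies,
# purely to keep the example easy to read.
mom = ["AA"] * POSITIONS
dad = ["CC"] * POSITIONS
you = ["AC"] * POSITIONS   # one A from Mom and one C from Dad at each position

# The identical by chance match carries A at positions 1, 3, 5, 7, 9 and
# C at positions 2, 4, 6, 8, 10. The second letter (G) is a hypothetical
# filler that matches nobody in this example.
ibc_match = ["AG" if i % 2 == 0 else "CG" for i in range(POSITIONS)]

print(half_identical(you, ibc_match))   # True
print(half_identical(mom, ibc_match))   # False
print(half_identical(dad, ibc_match))   # False
```

The match lines up with you at all ten positions because you can supply either an A or a C everywhere. Neither parent can do that on their own, so the run is broken at every other position.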

This is exactly why triangulation works and is crucial.

If you legitimately match a third person, shown below, on your maternal side, they will match you, your first legitimate maternal match, and your Mom because they carry all As. But they WON’T match the person who is matching you because they are identical by chance, shown in grey below.

The only person your identical by chance match matches in this group is you, and only because of the chance recombination of parental DNA.

That third person WILL also match all other legitimate maternal matches on this segment.

In the graphic above, we see that while the grey identical by chance person matches you because of the random combination of As from your mother and Cs from your father, your legitimate maternal matches won’t match your identical by chance match.

This is the first step in identifying false matches.

Parental Cross-Matching

Removing the identical by chance match, and adding in the parents of your legitimate maternal match, we see that your maternal match, above, matches you because you both have all As inherited from one parent, not from a combination of both parents.

We know that because we can see the DNA of both parents of both matches in this example.

The ideal situation occurs when two people match and they have both had their parents tested. We need to see if each person matches the other person’s parents.

We can see that you do NOT match your match’s father and your match does NOT match your father.

You do match your match’s mother and your match does match your mother. I refer to this as Parental Cross-matching.

Your legitimate maternal matches will also match each other and your mother if she is available for testing.

All the people in yellow match each other, while the two parents in gray do not match any of your matches. An entire group of legitimate maternal matches on this segment, no matter how many, will all match each other.

If another person matches you and the other yellow people, you’ll still need to see if you match their parents, because if not, that means they are matching you on all As because their two parents’ DNA combined just happened, by chance, to contribute an A in all of those positions.

In this last example, your new match, in green, matches you, your legitimate match and both of your mothers, BUT, none of the four yellow people match either of the new match’s parents. You can see that the new green match inherited their As from the DNA of their mother and father both, randomly zigzagging back and forth.

The four yellow matches phase parentally as we just proved with cross matching to parents. The new match at first glance appears to be a legitimate match because they match all of the yellow people – but they aren’t because the yellow people don’t match the green person’s parents.

To tell the difference between legitimate matches and identical by chance matches, you need two things, in order.

  • Parental matching known as parental phasing along with parental cross-matching, if possible, AND
  • Legitimate identical by descent (IBD) triangulated matches

If you have the ability to perform parental matching, called phasing, that’s the easiest first step in eliminating identical by chance matches. However, few match pairs will have parents for everyone. You can use triangulation without parental phasing if parents aren’t available.

Let’s talk about both, including when and how close relatives can and cannot be used.

Parental Phasing

The technique of confirming your match to be legitimate by your match also matching one of your parents is called parental phasing.

If we have the parents of both people in a match pair available for matching, we can easily tell if the match does NOT match either parent. That’s Parental Cross Matching. If either match does NOT match one of the other person’s parents, the match is identical by chance, also known as a false positive.

See how easy that was!

If you, for example, are the only person in your match pair to have parents available, then you can parentally phase the match on your side if your match matches one of your parents. However, because your match’s parents are unavailable, your match to them cannot be verified as legitimate on their side. So you are not phased to their parents.

If you only have one of your parents available for matching, and your match does not match that parent, you CANNOT presume that because your match does NOT match that parent, the match is a legitimate match for the other, missing, parent.

There are four possible match conditions:

  • Maternal match
  • Paternal match
  • Matches neither parent which means the match is identical by chance meaning a false positive
  • Matches both parents in the case of pedigree collapse or endogamy

If two matching people do match one parent of both matches (parental cross-matching), then the match is legitimate. In other words, if we match, I need to match one of your parents and you need to match one of mine.
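The four match conditions, plus the one-parent caution above, can be written out as a small decision function. This is only a sketch of the logic for a single segment. The function name `classify_match` and the use of `None` to mean "that parent has not tested" are assumptions for illustration.

```python
def classify_match(matches_mother, matches_father):
    """Label a match on one segment.

    Each argument is True, False, or None (None = that parent has not tested).
    """
    if matches_mother and matches_father:
        return "both (pedigree collapse or endogamy)"
    if matches_mother:
        return "maternal"
    if matches_father:
        return "paternal"
    if matches_mother is None or matches_father is None:
        # A miss on the one tested parent does NOT prove the other side.
        return "undetermined"
    return "identical by chance"

print(classify_match(True, False))    # maternal
print(classify_match(False, False))   # identical by chance
print(classify_match(False, None))    # undetermined
```

For parental cross-matching, both people in the match pair would run the same check against their own parents, and the match is only treated as legitimate when neither side comes back "identical by chance."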

It’s important to compare your matches’ DNA to generationally older direct family members such as parents or grandparents, if that’s possible. If your grandparents are available, it’s possible to phase your matches back another generation.

Automatic Phasing at FamilyTreeDNA

FamilyTreeDNA automatically phases your matches to your parents if you test that parent, create or upload a GEDCOM file, and link your test and theirs to your tree in the proper places.

FamilyTreeDNA’s Family Matching assigns or “buckets” your matches maternally and paternally. Matches are assigned as maternal or paternal matches if one or both parents have tested.

Additionally, FamilyTreeDNA uses triangulated matches from other linked relatives within your tree even if your parents have not tested. If you don’t have your parents, the more people you identify and link to your tree in the proper place, the more people will be assigned to maternal and paternal buckets. FamilyTreeDNA is the only vendor that does this. I wrote about this process in the article, Triangulation in Action at Family Tree DNA.

Parental Phasing Caveats

There are very rare instances where parental phasing may be technically accurate, but not genealogically relevant. By this, I mean that a parent may actually match one of your matches due to endogamy or a population level match, even if it’s considered a false positive because it’s not relevant in a genealogical timeframe.

Conversely, a parent may not match when the segment is actually legitimate, but it’s quite rare and only when pedigree collapse has occurred in a very specific set of circumstances where both parents share a common ancestor.

Let’s take a look at that.

Pedigree Collapse

It’s not terribly uncommon in the not-too-distant past to find first cousins marrying each other, especially in rather closely-knit religious communities. I encounter this in Brethren, Mennonite and Amish families often where the community was small and out-marrying was frowned upon and highly discouraged. These families and sometimes entire church congregations migrated cross-country together for generations.

When pedigree collapse is present, meaning the mother and father share a common ancestor not far in the past, it is possible to inherit half of one segment from Mom and the other half from Dad where those halves originated with the same ancestral couple.

For example, let’s say the matching segment between you and your match is 12 cM in length, shown below. You inherited the blue segment from your Dad and the neighboring peach segment from Mom – shown just below the segment numbers. You received 6 cM from both parents.

Another person’s DNA does match you, shown in the bottom row, but they are not shown on the DNA match list of either of your parents. That’s because the DNA segments of the parents just happened to recombine in 6 cM pieces, respectively, which is below the 7 cM matching threshold of the vendor in this example.

If the person matched you at 12 cM where you inherited 8 cM from one parent and 4 from the other, that person would show on one parent’s match list, but not the other. They would not be on the parent’s match list who contributed only 4 cM simply because the DNA divided and recombined in that manner. They would match you on a longer segment than they match your parent at 8 cM which you might notice as “odd.”

Let’s look at another example.

If the matching segment is 20 cM, the person will match you and both of your parents on different pieces of the same segment, given that both segments are above 7 cM. In this case, your match who matches you at 20 cM will match each of your parents at 10 cM.

You would be able to tell that the end location of Dad’s segment is the same as the start location of Mom’s segment.
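The three scenarios above come down to simple threshold arithmetic, which this short Python sketch walks through. The 7 cM threshold is the one used in this example. The function name `who_sees_match` is hypothetical, and assigning the 8 cM piece to Dad in the middle case is an arbitrary choice.

```python
THRESHOLD_CM = 7  # matching threshold of the vendor in this example

def who_sees_match(cm_from_dad, cm_from_mom, threshold=THRESHOLD_CM):
    """For one segment you inherited as two adjoining pieces, report
    whose match list would show the other person."""
    return {
        "you": cm_from_dad + cm_from_mom >= threshold,
        "dad": cm_from_dad >= threshold,
        "mom": cm_from_mom >= threshold,
    }

for dad_cm, mom_cm in [(6, 6), (8, 4), (10, 10)]:
    print(dad_cm, mom_cm, who_sees_match(dad_cm, mom_cm))
```

With 6 + 6 cM, only you see the match. With 8 + 4 cM, you and one parent see it. With 10 + 10 cM, all three of you do, each parent on their own half of the segment.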

This is NOT common and is NOT the “go to” answer when you think someone “should” match your parent and does not. It may be worth considering in known pedigree collapse situations.

You can see why someone observing this phenomenon could “presume” that DNA skipped a generation because the person matches you on segments where they don’t match your parent. But DNA didn’t skip anything at all. This circumstance was caused by a combination of pedigree collapse, random division of DNA, then random recombination in the same location where that same DNA segment was divided earlier. Clearly, this sequence of events is not something that happens often.

If you’ve uploaded your DNA to GEDmatch, you can select the “Are your parents related?” function which scans your DNA file for runs of homozygosity (ROH) where your DNA is exactly the same in both parental locations for a significant distance. Because you inherited the exact same sequence from both parents, this suggests that your parents share an ancestor.

If you didn’t inherit the same segment of DNA from both parents, or the segment is too short, then your parents won’t show as “being related,” even if they do share a common ancestor.
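The idea of scanning for runs of homozygosity can be illustrated with a toy Python function. To be clear, this is not GEDmatch's implementation, which works with real genetic distances and its own thresholds. The sketch simply counts consecutive positions where both letters are the same, and the name `runs_of_homozygosity`, the `min_run` parameter, and the sample data are all made up.

```python
def runs_of_homozygosity(genotypes, min_run):
    """Return (start, end) index pairs, end exclusive, for stretches of at
    least `min_run` consecutive positions where both letters are identical."""
    runs = []
    start = None
    for i, genotype in enumerate(genotypes):
        if genotype[0] == genotype[1]:
            if start is None:
                start = i            # a new homozygous run begins here
        else:
            if start is not None and i - start >= min_run:
                runs.append((start, i))
            start = None
    # Close out a run that reaches the end of the data.
    if start is not None and len(genotypes) - start >= min_run:
        runs.append((start, len(genotypes)))
    return runs

sample = ["AA", "CC", "AG", "TT", "GG", "CC", "AA", "CT", "GG"]
print(runs_of_homozygosity(sample, min_run=3))   # [(3, 7)]
```

The first two positions are homozygous too, but that run is shorter than `min_run`, so it is ignored. That mirrors the point that a segment that is too short won't flag your parents as related.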

Now, let’s look at the opposite situation. Parental phasing and ROH sometimes do occur when common ancestors are far back in time and the match is not genealogically relevant.

Endogamy

I often see non-genealogical matching occur when dealing with endogamy. Endogamy occurs when an entire population has been isolated genetically for a long time. In this circumstance, a substantial part of the population shares common DNA segments because there were few original population founders. Much of the present-day population carries that same DNA. Many people within that population would match on that segment. Think about the Jewish community and indigenous Americans.

Consider our original example, but this time where much of the endogamous population carries all As in these positions because one of the original founders carried that nucleotide sequence. Many people would match lots of other people regardless of whether they are a close relative or share a distant ancestor.

People with endogamous lines do share relatives, but that matching DNA segment originated in ancestors much further back in time. When dealing with endogamy, I use parental phasing as a first step, if possible, then focus on larger matches, generally 20 cM or greater. Smaller matches either aren’t relevant or you often can’t tell if/how they are.

At FamilyTreeDNA, people with endogamy will find many people bucketed on the “Both” tab meaning they triangulate with people linked on both sides of the tester’s tree.

An example of a Jewish person’s bucketed matches based on triangulation with relatives linked in their tree is shown above.

Your siblings, their children, and your children will be related on both your mother’s and father’s sides, but other people typically won’t be unless you have experienced either pedigree collapse where you are related both maternally and paternally through the same ancestors or you descend from an endogamous population.

How Many Identical-by-Chance Matches Will I Have?

If you have both parents available to test, and you’re not dealing with either pedigree collapse or endogamy, you’ll likely find that about 15-20% of your matches don’t match your parents on the same segment and are identical by chance.

With endogamy, you’ll have MANY more matches on your endogamous lines and you’ll have some irrelevant matches, often referred to as “false positive” matches even though they technically aren’t, even using parental phasing.

Your Parents Have DNA That You Don’t

Sometimes people are confused when reviewing their matches and their parent’s match to the same person, especially when they match someone and their parent matches them on a different or an additional segment.

If you match someone on a specific segment and your parents do not, that’s a false positive FOR THAT SEGMENT. Every segment has its own individual history and should be evaluated individually. You can match someone on two segments, one from each parent. Or three segments, one from each parent and one that’s identical by chance. Don’t assume.

Often, your match will match both you and your parent on the same segment – which is a legitimate parentally phased match.

But what if your match matches your parent on a different segment where they don’t match you? That’s a false positive match for you.

Keep in mind that it is possible for one of your matches to match your parent on a separate or an additional segment that IS legitimate. You simply didn’t inherit that particular segment from your parent.

That’s NOT the same situation as someone matching you that does NOT match one of your parents on the same segment – which is an identical by chance or false match.

Your parent having a match that does not match you is the reverse situation.

I have several situations where I match someone on one segment, and they match my parent on the same segment. Additionally, that person matches my parent on another segment that I did NOT inherit from that parent. That’s perfectly normal.

Remember, you only inherit half of your parent’s DNA, so you literally did NOT inherit the other half of their DNA. Your mother, for example, should have twice as many matches as you on her side because roughly half of her matches won’t match you.

That’s exactly why testing your parents and close family members is so critical. Their matches are as valid and relevant to your genealogy as your own. The same is true for other relatives, such as aunts and uncles with whom you share ALL of the same ancestors.

You need to work with your family member’s matches that you don’t share.

No DNA Match Doesn’t Mean You’re Not Related

Some people think that not matching someone on a DNA test is equivalent to saying they aren’t related. Not sharing DNA doesn’t mean you’re not related.

People are often disappointed when they don’t match someone they think they should and interpret that to mean that the testing company is telling them they “aren’t related.” They are upset and take issue with this characterization. But that’s not what it means.

Let’s analyze this a bit further.

First, not sharing DNA with a second cousin once removed (2C1R) or more distant does NOT mean you’re NOT related to that person. It simply means you don’t share any measurable DNA ABOVE THE VENDOR THRESHOLD.

All known second cousins match, but about 10% of third cousins don’t match, and so forth on up the line with each generation further back in time having fewer cousins that match each other.

If you have tested close relatives, check to see if that cousin matches your relatives.

Second, it’s possible to match through the “other” or unexpected parent. I certainly didn’t think this would be the case in my family, because my father is from Appalachia and my mother’s family is primarily from the Netherlands, Germany, Canada, and New England. But I was wrong.

All it took was one German son that settled in Appalachia, and voila, a match through my mother that I surely thought should have been through my father’s side. I have my mother’s DNA and sure enough, my match that I thought should be on my father’s side matches Mom on the same segment where they match me, along with several triangulated matches. Further research confirmed why.

I’ve also encountered situations where I legitimately match someone on both my mother’s and father’s side, on different segments.

Third, imputation can be important for people who don’t match and think they should. Imputation can also cause matching segment length to be overreported.

Ok, so what’s imputation and why do I care?

Imputation

Every DNA vendor today has to use some type of imputation.

Let me explain, in general, what imputation is and why vendors use it.

Over the years, DNA processing vendors who sell DNA chips to testing companies have changed their DNA chips pretty substantially. While genealogical autosomal tests test about 700,000 DNA locations, plus or minus, those locations have changed over time. Today, some of these chips only have 100,000 or so chip locations in common with chips either currently or previously utilized by other vendors.

The vendors who do NOT accept uploads, such as 23andMe or Ancestry, have to develop methods to make their newest customers on their DNA processing vendor’s latest chip compatible with their first customer who was tested on their oldest chip – and all iterations in-between.

Vendors who do accept transfers/uploads from other vendors have to equalize any number of vendors’ chips when their customers upload those files.

Imputation is the scientific way to achieve this cross-platform functionality and has been widely used in the industry since 2017.

Imputation, in essence, fills in the blanks between tested locations with the “most likely” DNA found in the human population based on what’s surrounding the blank location.

Think of the word C_T. There are a limited number of letters and words that are candidates for C_T. If you use the word in a sentence, your odds of accuracy increase dramatically. Think of a genetic string of nucleotides as a sentence.

Imputation can be incorrect and can cause both false positive and false negative matches.

For the most part, imputation does not affect close family matches as much as more distant matches. In other words, imputation is NOT going to cause close family members not to match.

Imputation may cause more distant family members not to match, or to have a false positive match when imputation is incorrect.

Imputation is actually MUCH less problematic than I initially expected.

The most likely effect of imputation is to cause a match to be just above or below the vendor threshold.

How can we minimize the effects of imputation?

  • Generally, the best result will be achieved if both people test at the same vendor where their DNA is processed on the same chip and less imputation is required.
  • Upload the results of both people to both MyHeritage and FamilyTreeDNA. If your match results are generally consistent at those vendors, imputation is not a factor.
  • GEDmatch does not use imputation but attempts to overcome files with low overlapping regions by allowing larger mismatch areas. I find their matches to be less accurate than at the various vendors.

Additionally, Ancestry has a few complicating factors.

Ancestry Issues

AncestryDNA is different in three ways.

  • Ancestry doesn’t provide segment information so it’s impossible to triangulate or identify the segment or chromosome where people match. There is no chromosome browser or triangulation tool.
  • Ancestry down-weights and removes some segments in areas where they feel that people are “too matchy.” You can read Ancestry’s white papers here and here.

These “personal pileup regions,” as they are known, can be important genealogically. In my case, these are my mother’s Acadian ancestors. Yes, this is an endogamous population and also suffers from pedigree collapse, but since this is only one of my mother’s great-grandparents, this match information is useful and should not be removed.

  • Ancestry doesn’t show matches in common if the shared segments are less than 20cM. Therefore, you may not see someone on a shared match list with a relative when they actually are a shared match.

If two people both match a third person on less than a 20 cM segment at Ancestry, the third person won’t appear on the other person’s shared match list. So, if I match John Doe on 19 cM of DNA, and I looked at the shared matches with my Dad, John Doe does NOT appear on the shared match list of me and my Dad – even though he is a match to both of us at 19 cM.

The only way to determine if John Doe is a shared match is to check my Dad’s and my match list individually, which means Dad and I will need to individually search for John Doe.
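The John Doe situation can be mimicked with two small match lists in Python. This is only a sketch of the behavior described above, not Ancestry's code. Requiring 20 cM on both sides is an assumption about how the cutoff is applied, and the names, cM values, and function names are invented.

```python
SHARED_MATCH_MIN_CM = 20  # cutoff described above

def shared_match_list(my_matches, relative_matches, min_cm=SHARED_MATCH_MIN_CM):
    """Names that would appear on a shared match list under the 20 cM rule."""
    return sorted(
        name
        for name, cm in my_matches.items()
        if name in relative_matches
        and cm >= min_cm
        and relative_matches[name] >= min_cm
    )

def found_on_both_lists(my_matches, relative_matches):
    """Names found by checking each match list individually."""
    return sorted(set(my_matches) & set(relative_matches))

# Hypothetical match lists: name -> shared cM
mine = {"John Doe": 19, "Jane Roe": 45}
dads = {"John Doe": 19, "Jane Roe": 60}

print(shared_match_list(mine, dads))     # ['Jane Roe']
print(found_on_both_lists(mine, dads))   # ['Jane Roe', 'John Doe']
```

John Doe is on both individual lists but missing from the shared list, which is exactly the gap that can look like DNA "skipping" a generation.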

Caveat here – Ancestry’s search sometimes does not work correctly.

Might someone who doesn’t understand that the shared match list doesn’t show everyone who shares DNA with both people presume that the ancestral DNA of that ancestor “skipped a generation” because John Doe matches me with a known ancestor, and not Dad on our shared match list? I mean, wouldn’t you think that a shared match would be shown on a tab labeled “Shared Matches,” especially since there is no disclaimer?

Yes, people can be forgiven for believing that somehow DNA “skipped” a generation in this circumstance, especially if they are relatively inexperienced and they don’t understand Ancestry’s anomalies or know that they need to or how to search for matches individually.

Even if John Doe does match me and Dad both, we still need to confirm that it’s on the same segment AND it’s a legitimate match, not IBC. You can’t perform either of these functions at Ancestry, but you can elsewhere.

Ancestry Workarounds

To obtain this functionality, people can upload their DNA files for free to both FamilyTreeDNA and MyHeritage, companies that do provide full shared DNA reporting (in common with) lists of ALL matches and do provide segment information with chromosome browsers. Furthermore, both provide triangulation in different ways.

Matching is free, but an inexpensive unlock is required at both vendors to access advanced tools such as Family Matching (bucketing) and triangulation at FamilyTreeDNA and phasing/triangulation at MyHeritage.

I wrote about Triangulation in Action at FamilyTreeDNA, here.

MyHeritage actually brackets triangulated segments for customers on their chromosome browser, including parents, so you get triangulation and parental phasing at the same time if you and your parent have both tested or uploaded your DNA file to MyHeritage. You can upload, for free, here.

In this example, my mother is matching to me in red on the entire length of chromosome 18, of course, and three other maternal cousins triangulate with me and mother inside the bracketed portion of chromosome 18. Please note that if any one of the people included in the chromosome browser comparison does not triangulate, no bracket is drawn around any others who do triangulate. It’s all or nothing. I remove people one by one to see if people triangulate – or build one by one with my mother included.

I wrote about Triangulation in Action at MyHeritage, here.

People can also upload to GEDmatch, a third-party site. While GEDmatch is less reliable for matching, you can adjust your search thresholds, which you cannot do at other vendors. I don’t recommend routinely working below 7 cM. I occasionally use GEDmatch to see if a pedigree collapse segment has recombined below another vendor’s segment matching threshold.

Do NOT check the box to prevent hard breaks when selecting the One-to-One comparison. Checking that box allows GEDmatch to combine smaller matching segments into mega-segments for matching.

I wrote about Triangulation in Action at GEDmatch, here.

Transferring/Uploading Your DNA 

If you want to transfer your DNA to one of these vendors, you must download the DNA file from one vendor and upload it to another. That process does NOT remove your DNA file from the vendor where you tested, unless you select that option entirely separately.

I wrote full step-by-step transfer/upload instructions for each vendor, here.

Testing Close Relatives Is VERY Useful – Just Not for Triangulation

Of course, your best bet if you don’t have your parents available to test is to test as many of your grandparents, great-aunts/uncles, aunts, and uncles as possible. Test your siblings as well, because they will have inherited some of the same and some different segments of DNA from your parents – which means they carry different pieces of your ancestors’ DNA.

Just because close relatives don’t make good triangulation candidates doesn’t mean they aren’t valuable. Close relatives are golden because when they DO share a match with you, you know where to start looking for a common ancestor, even if your relative matches that person on a different segment than you do.

Close relatives are also important because they will share pieces of your common ancestor’s DNA that you don’t. Their matches can unlock the answers to your genealogy questions.

Ok, back to triangulation.

Triangulated Matches

A triangulated match is, of course, when three people all descend from a common ancestor and match each other on the same segment of DNA.

That means all three people’s DNA matches each other on that same segment, confirming that the match is not by chance, and that segment did descend from a common ancestor or ancestral couple.

But, is this always true? You’re going to hate this answer…

“It depends.”

You knew that was coming, didn’t you! 😊

It depends on the circumstances and relationships of the three people involved.

  • One of those three people can match the other two by chance, not by descent, especially if two of those people are close relatives to each other.
  • Identical by chance means that one of you didn’t inherit that DNA from one single parent. That zigzag phenomenon.
  • Furthermore, triangulated DNA is only valid as far back as the closest common ancestor of any two of the three people.

Let’s explore some examples.

Building Triangulation Evidence – Ingredients and a Recipe

The strongest case of triangulation is when:

  • You and at least two additional cousins match on the same segment AND
  • Descend through different children of the common ancestral couple

Let’s look at a valid triangulated match.

In this first example, the magenta segment of DNA is at least partially shared by four of the six cousins and triangulates to their common great-grandfather. Let’s say that these cousins then match with two other people descended from different children of their great-great-great-grandparents on this same segment. Then the entire triangulation group will have confirmed that segment’s origin and push the descent of that segment back another two generations.

These people all coalesce into one line with their common great-grandparents.

I’m only showing 3 generations in this triangulated match, but the concept is the same no matter how many generations you reach back in time. Although, over time, segments inherited from any specific ancestor become smaller and smaller until they are no longer passed to the next generation.

In this pedigree chart, we’re only tracking the magenta DNA which is passed generation to generation in descendants.

Eventually, of course, those segments become smaller and indistinguishable as they either aren’t passed on at all or drop below vendor matching thresholds.

This chart shows the average amount of DNA you would carry from each generational ancestor. You inherit half of each parent’s DNA, but back further than that, you don’t receive exactly half of any ancestor’s DNA in any generation. Larger segments are generally cut in two and passed on partially, but smaller segments are often either passed on whole or not at all.

On average, you’ll carry about 7 cM from each of your eight-times-great-grandparents. In reality, you may carry more or you may not carry any. You are unlikely to carry the same segment as any other randomly chosen descendant, but we know it happens, and you’ll find those matches if enough (or the right) descendants test.

Putting this another way, if you divide all of your approximately 7000 cM of DNA into 7 cM segments of equal length – you’ll have 1000 7 cM segments. So will every other descendant of your eight-times-great-grandparent. You can see how small the chances are of you both inheriting that same exact 7 cM segment through ten inheritance/transmission events, each. Yet it does happen.
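The arithmetic behind those averages can be sketched in a few lines of Python. This assumes the approximate 7000 cM total used above and a simple halving in every generation, which is only an average; the function name `expected_cm` is made up for illustration.

```python
TOTAL_CM = 7000  # approximate total used in the text above


def expected_cm(generations_back, total_cm=TOTAL_CM):
    """Average cM from one ancestor, halving the total each generation."""
    return total_cm / (2 ** generations_back)


# Eight-times-great-grandparents are 10 generations back.
for gen in (1, 2, 3, 10):
    print(gen, round(expected_cm(gen), 1))

# Number of equal 7 cM slots in 7000 cM.
slots = TOTAL_CM // 7
print(slots)
```

Ten generations back, the average works out to just under 7 cM, which is why a single ancestor at that distance typically contributes one small segment or nothing at all.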

I have several triangulated matches with descendants of Charles Dodson and his wife, Anne through multiple of their 9 (or so) children, ten generations back in my tree. Those triangulated matches range from 7-38 cM. It’s possible that those three largest matches at 38 cM could be related through multiple ancestors because we all have holes in our trees – including Anne’s surname.

It helps immensely that Charles Dodson had several children who were quite prolific as well.

Of course, the further back in time, the more “proof” is necessary to eliminate other unknown common ancestors. This is exactly why matching through different children is important for triangulation and ancestor confirmation.

We confirm the common ancestor by checking that all of the descendants who match the tester on the same segment also match each other. This greatly reduces the chances that these people are matching by chance. The more people in the triangulation group, the stronger the evidence. Of course, parental phasing or cross-matching, where available, is an added confirmation bonus.
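The "everyone matches everyone on the same segment" test can be sketched as a small Python check. The names, segment positions, and both helper functions are hypothetical, and the sketch ignores cM/SNP thresholds and phasing; it only shows the pairwise logic.

```python
from itertools import combinations

# Hypothetical pairwise segment matches on one chromosome:
# frozenset({person, person}) -> (start, end) of the matching segment.
pair_segments = {
    frozenset({"me", "cousin_a"}): (10.0, 32.0),
    frozenset({"me", "cousin_b"}): (14.0, 30.0),
    frozenset({"cousin_a", "cousin_b"}): (12.0, 28.0),
}


def common_overlap(segments):
    """Return the (start, end) region shared by all segments, or None."""
    start = max(s for s, _ in segments)
    end = min(e for _, e in segments)
    return (start, end) if start < end else None


def triangulates(people, pair_segments):
    """True only if every pair matches and all pairs share one region."""
    segments = []
    for a, b in combinations(people, 2):
        seg = pair_segments.get(frozenset({a, b}))
        if seg is None:
            return False  # one pair does not match at all
        segments.append(seg)
    return common_overlap(segments) is not None


group = ["me", "cousin_a", "cousin_b"]
print(triangulates(group, pair_segments))
print(triangulates(group + ["cousin_c"], pair_segments))
```

Adding a person who has no recorded match with the others makes the whole group fail, which mirrors the all-or-nothing behavior of a triangulation group.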

In our magenta inheritance example, we saw that three of the males and one of the females from three different descendants of the great-grandparents all carry at least a portion of that magenta segment of great-grandpa’s DNA.

Now, let’s take a look at a different scenario.

Why can’t siblings or close relatives be used as two of the three people needed for triangulation?

Aunts and Uncles

We know that the best way to determine if a match is valid is by parental phasing – your match also matching to one of your parents.

If both parents aren’t available, looking for close family matches in common with your match is the next hint that genealogists seek.

Let’s say that you and your match both match your aunt or uncle in common or their children.

DNA that you share with your aunts or uncles only pushes your common ancestor back to your grandparents.

At that point, your match is in essence matching to a segment that belongs to your grandparents. Your match’s DNA, or your grandparents’ DNA, could have randomly recombined, and you and your aunt/cousins could be matching that third person by chance.

Ok, then, what about siblings?

Siblings

The most recent common ancestor (MRCA) of you and someone who also matches your sibling is your parents. Therefore, you and your sibling actually only count as one “person” in this scenario. In essence, it’s the DNA of your parent(s) that is matching that third person, so it’s not true triangulation. It’s the same situation as above with aunts/uncles, except the common ancestor is closer than your grandparents.

The DNA of your parents could have recombined in both siblings to look like a match to your match’s family. Or vice versa. Remember Parental Cross-Matching.

If you and a sibling inherited EXACTLY the same segment of your Mom’s and Dad’s DNA, and you match someone by chance – that person will match your sibling by chance as well.

In this example, you can see that both siblings 1 and 2 inherited the exact same segments of DNA at the same locations from both of their parents.

Of course, they also inherited segments at different locations that we’re not looking at that won’t match exactly between siblings, unless they are identical twins. But in this case, the inherited segments of both siblings will match someone whose DNA randomly combined with green or magenta dots in these positions to match a cross-section of both parents.

How False Positives Work and How to Avoid Them

We saw in our first example, displayed again above, what a valid triangulated match looks like. Now let’s expand this view and take a look more specifically at how false positive matches occur.

On the left-hand (blue) side of this graphic, we see four siblings that descend through their father from Great-grandpa who contributed that large magenta segment of DNA. That segment becomes reduced in descendants in subsequent generations.

In downstream generations, we can see gold, white and green segments being added to the DNA inherited by the four children from their ancestors’ spouses. Dad’s DNA is shown on the left side of each child, and Mom’s on the right.

  • Blue Children 1 and 2 inherited the same segments of DNA from Mom and Dad. Magenta from Dad and green from Mom.
  • Blue Child 3 inherited two magenta segments from Dad in positions 1 and 2 and one gold segment from Dad in position 3. They inherited all white segments from Mom.
  • Blue Child 4 inherited all gold segments from Dad and all white segments from Mom.

The family on the blue left-hand side is NOT related to the pink family shown at right. That’s important to remember.

I’ve intentionally constructed this graphic so that you can see several identical by chance (IBC) matches.

Child 5, the first pink sibling, carries a white segment in position 1 from Dad and gold segments in positions 2 and 3 from Dad. From Mom, they inherited a green segment in position 1, magenta in position 2 and green in position 3.

IBC Match 1 – Looking at the blue siblings, we see that based on the DNA inherited from Pink Child 5’s parents, Pink Child 5 matches Blue Child 4 with white, gold and gold in positions 1-3, even though they weren’t inherited from the same parent in Blue Child 4. I circled this match in blue.

IBC Match 2 – Pink Child 5 also matches Blue Children 1 and 2 (red circles) because Pink Child 5 has green, magenta, and green in positions 1-3 and so do Blue Children 1 and 2. However, Blue Children 1 and 2 inherited the green and magenta segments from Mom and Dad respectively, not just from one parent.

Pink Child 5 matches Blue Children 1, 2 and 4, but not because they match by descent, but because their DNA zigzags back and forth between the blue children’s DNA contributed by both parents.

Therefore, while Pink Child 5 matches three of the Blue Children, they do not match either parent of the Blue Children.

IBC Match 3 – Pink Child 6 matches Blue Child 3 with white, magenta and gold in positions 1-3 based on the same colors of dots in those same positions found in Blue Child 3 – but inherited both paternally and maternally.

You can see that if we had the four parents available to test, none of the Pink Children would match either the Blue Children’s mother or father, and none of the Blue Children would match either of the Pink Children’s mother or father.

This is why we can’t use either siblings or close family relatives for triangulation.
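The colored-dot examples can be replayed in a short Python sketch. It uses the three positions and the colors described above for Blue Child 3, Blue Child 4, and the two pink strands; the two functions are invented for illustration and are far simpler than any vendor’s real matching algorithm.

```python
# Each child is (strand from Dad, strand from Mom); one color per position.
blue_child_3 = (("magenta", "magenta", "gold"), ("white", "white", "white"))
blue_child_4 = (("gold", "gold", "gold"), ("white", "white", "white"))

# Strands carried by the unrelated pink children, as described above.
pink_5_from_dad = ("white", "gold", "gold")
pink_6_strand = ("white", "magenta", "gold")


def unphased_match(strand, child):
    """True if each position of strand appears on either of child's strands."""
    dad, mom = child
    return all(c in (d, m) for c, d, m in zip(strand, dad, mom))


def phased_match(strand, child):
    """True only if strand equals one whole parental strand of child."""
    return strand in child


pairs = ((pink_5_from_dad, blue_child_4), (pink_6_strand, blue_child_3))
for strand, child in pairs:
    print(unphased_match(strand, child), phased_match(strand, child))
```

Both pairs match when the two strands are allowed to mix position by position, and neither matches a single parental strand. That combination is the signature of an identical by chance match.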

Distant Cousins Are Best for Triangulation & Here’s Why

When triangulating with 3 people, the most recent common ancestor (MRCA) intersection of the closest two people is the place at which triangulation turns into only two lines being compared and ceases being triangulation. Triangle means 3.

If siblings are 2 of the 3 matching people, then their parents are essentially being compared to the third person.

If you, your aunt/uncle, and a third person match, your grandparents are the place in your tree where three lines converge into two.

The same holds true if you’re matching against a sibling pair on your match’s side, or a match and their aunt/uncle, etc.

The further back in your tree you can push that MRCA intersection, the more your triangulated match provides confirming evidence of a common ancestor and that the match is valid and not caused by random recombination.

That’s exactly what the descendants of Charles Dodson have been able to do through triangulation with multiple descendants from several of his children.

It’s also worth mentioning at this point that autosomal DNA testing uses hundreds or thousands of locations in a comparison window, not 3 or 6 dots like in my example. The probability of DNA randomly matching by chance drops as segment length and SNP density increase. SNP density is the number of SNP locations tested within that cM range.

Hence a 7 cM/500 SNP minimum is the combined rule of thumb. At that level, roughly half of your matches will be valid and half will be identical by chance unless you’re dealing with endogamy. Then, raise your threshold accordingly.
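As a rough illustration, the combined rule of thumb can be written as a simple filter in Python. The segment rows and the `passes_threshold` helper are hypothetical, and the 10 cM value used for the endogamy case is only an example of "raising the threshold," not a recommended number.

```python
# Hypothetical segment rows: (match name, chromosome, cM, SNP count).
segments = [
    ("Match 1", 3, 12.4, 1800),
    ("Match 2", 7, 7.1, 450),
    ("Match 3", 11, 6.2, 900),
    ("Match 4", 18, 9.0, 650),
]


def passes_threshold(cm, snps, min_cm=7.0, min_snps=500):
    """Apply the combined cM and SNP rule of thumb to one segment."""
    return cm >= min_cm and snps >= min_snps


kept = [row[0] for row in segments if passes_threshold(row[2], row[3])]
print(kept)

# With endogamy, raise the cM threshold; 10 cM here is only an example value.
kept_endogamy = [
    row[0] for row in segments if passes_threshold(row[2], row[3], min_cm=10.0)
]
print(kept_endogamy)
```

A segment has to clear both limits, so a 7.1 cM segment with only 450 SNPs is dropped just like a 6.2 cM segment with plenty of SNPs.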

Ok, So Where are We? A Triangulation Checklist for You!

I know this has been a relatively long educational article, but it’s important to really understand that testing close relatives is VERY important, but also why we can’t effectively use them for triangulation.

Here’s a handy-dandy summary matching/triangulation checklist for you to use as you work through your matches.

  • You inherit half of each of your parents’ DNA. There is no other place for you to obtain or inherit your DNA. There is no DNA fairy sprinkling you with DNA from another source:)
  • DNA does NOT skip generations, although in occasional rare circumstances, it may appear that this happened. In this situation, it’s incumbent upon you, the genealogist, to PROVE that an exception has occurred if you really believe it has. Those circumstances might be pedigree collapse or perhaps imputation. You’ll need to compare matches at vendors who provide a chromosome browser, triangulation, and full shared match list information. Never assume that you are the exception without hard and fast proof. We all know about assume, right?
  • Your siblings inherit half of your parents’ DNA too, but not the same exact half that you or your other siblings did (unless they are identical twins). You and a sibling may inherit the exact same DNA from either or both of your parents on certain segments.
  • Your matches may match your parents on different or an additional segment that you did not inherit.
  • Every segment has an individual history. Evaluate every matching segment separately. One matching segment with someone could be maternal, one paternal, and one identical by chance.
  • You can confirm matches as valid if your match matches one of your parents, and you match one of your match’s parents. Parental Phasing is when your match matches your parent. Parental Cross-Matching is when you both match one of each other’s parents. To be complete, both people who match each other need to match one of the parents of the other person. This rule still holds even if you have a known common ancestor. I can’t even begin to tell you how many times I’ve been fooled.
  • 15-20% (or more with endogamy) of your matches will be identical by chance because either your DNA or your match’s DNA aligns in such a way that while they match you, they don’t match either of your parents.
  • Your siblings, aunts, and uncles will often inherit the same DNA as you – which means that identical by chance matches will also match them. That’s why we don’t use close family members for triangulation. We do utilize close family members to generate common match hints. (Remember the 20 cM shared match caveat at Ancestry)
  • While your siblings, aunts, and uncles are too close to use for triangulation, they are wonderful to identify ancestral matches. Some of their matches will match you as well, and some will not because your close family members inherited segments of your ancestor’s DNA that you did not. Everyone should test their oldest family members.
  • Triangulate your close family member’s matches separately from your own to shed more light on your ancestors.
  • Endogamy may interfere with parental phasing, meaning you may match because you and/or your match may have inherited some of the same DNA segment(s) from both sides of your tree and/or more DNA than might otherwise be expected.
  • Pedigree collapse needs to be considered when using parental phasing, especially when the same ancestor appears on both sides of your family tree. You may share more DNA with a match than expected.
  • Conversely, with pedigree collapse, your match may not match your parents, or vice versa, if a segment happens to have recombined in you in a way that drops the matching segments of your parents beneath the vendor’s match threshold.
  • While you will match all of your second cousins, you will only match approximately 90% of your third cousins and proportionally fewer as your relationship reaches further back in time.
  • Not being a DNA match with someone does NOT mean you’re NOT related to them, unless of course, you’re a second cousin (2C) or closer. It simply means you don’t carry any common ancestral segments above vendor thresholds.
  • At 2C or closer, if you’re not a DNA match, other alternative situations need to be considered – including the transfer/upload of the wrong person’s DNA file.
  • Imputation, a scientific process required of vendors, may interfere with matching, especially in more distant relatives who have tested on different platforms.
  • Imputation artifacts will be less obvious when people are more closely related, meaning closer relatives can be expected to match on more and larger segments and imputation errors make less difference.
  • Imputation will not cause close relatives, meaning 2C or closer, to not match each other.
  • In addition to not supporting segment matching information, Ancestry down-weights some segments, removes some matching DNA, and does not show shared matches below 20cM, causing some people to misinterpret their lack of common matches in various ways.
  • To resolve questions about matching issues at Ancestry, testers can transfer/upload their DNA files to MyHeritage, FamilyTreeDNA, and GEDmatch and look for consistent matches on the same segment. Start and end locations may vary to some extent between vendors, but the segment should be basically in the same location and roughly the same size.
  • GEDmatch does not use imputation but allows larger non-matching segments to combine as a single segment which sometimes causes extremely “generous” matches. GEDmatch matching is less reliable than FamilyTreeDNA or MyHeritage, but you can adjust the matching thresholds.
  • The best situation for matching is for both people to test at the same vendor who supports and provides segment data and a chromosome browser such as 23andMe, FamilyTreeDNA, or MyHeritage.
  • Siblings cannot be used for triangulation because the most recent common ancestor (MRCA) between you and your siblings is your parents. Therefore, the “three” people in the triangulation group are reduced to two lines immediately.
  • Uncles and aunts should not be used for triangulation because the most recent common ancestors between you and your aunts and uncles are your grandparents.
  • Conversely, you should not consider triangulating with siblings and close family members of your matches as proof of an ancestral relationship.
  • A triangulation group of 3 people is only confirmation as far back as when two of those people’s lines converge and reach a common ancestor.
  • Identical by chance (IBC) matching occurs when DNA from the maternal and paternal sides are mixed positionally in the child to resemble a maternal/paternal side match with someone else.
  • Identical by chance DNA admixture (when compared to a match) could have occurred in your parents’ or grandparents’ generation, or earlier, so the further back in time that people in a triangulation group reach, the more reliable the triangulation group is likely to be.
  • The larger the segments and/or the triangulation group, the stronger the evidence for a specific confirmed common ancestor.
  • Early families with a very large number of descendants may have many matching and triangulated members, even 9 or 10 generations later.
  • While exactly 50% of each ancestor’s DNA is not passed in each generation, on average, you will carry 7 cM of your ancestors 10 generations back in your tree. However, you may carry more, or none.
  • The percentage of matching descendants decreases with each generation beyond great-grandparents.
  • The ideal situation for triangulation is a significant number of people, greater than three, who match on the same reasonably sized segment (7 cM/500 SNP or larger) and descend from the same ancestor (or ancestral couple) through different children whose spouses in descendant generations are not also related.
  • This means that tree completion is an important factor in match/triangulation reliability.
  • Triangulating through different children of the ancestral couple makes it significantly less likely that a different unknown common ancestor is contributing that segment of DNA – like an unknown wife in a descendant generation.

Whew!!!

The Bottom Line

Here’s the bottom line.

  1. Don’t use close relatives to triangulate.
  2. Use parents for Parental Phasing.
  3. Use Parental Cross-Matching when possible.
  4. Use close relatives to look for shared common matches that may lead to triangulation possibilities.
  5. Triangulate your close relatives’ DNA in addition to your own for bonus genealogical information. They will match people that you don’t.
  6. For the most reliable triangulation results, use the most distant relatives possible, descended through different children of the common ancestral couple.
  7. Keep this checklist of best practices, cautions, and caveats handy and check the list as necessary when evaluating the strength of any match or triangulation group. It serves as a good reminder for what to check if something seems “off” or unusual.

Feel free to share and pass this article (and checklist) on to your genealogy buddies and matches as you explain triangulation and collaborate on your genealogy.

Have fun!!!

_____________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

Concepts – Segment Size, Legitimate and False Matches

Matchmaker, matchmaker, make me a match!

One of the questions I often receive about autosomal DNA is, “What, EXACTLY, is a match?”  The answer at first glance seems evident, meaning when you and someone else are shown on each other’s match lists, but it really isn’t that simple.

What I’d like to discuss today is what actually constitutes a match – and the difference between legitimate or real matches and false matches, also called false positives.

Let’s look at a few definitions before we go any further.

Definitions

  • A Match – when you and another person are found on each other’s match lists at a testing vendor. You may match that person on one or more segments of DNA.
  • Matching Segment – when a particular segment of DNA on a particular chromosome matches to another person. You may have multiple segment matches with someone, if they are closely related, or only one segment match if they are more distantly related.
  • False Match – also known as a false positive match. This occurs when you match someone and the match is not identical by descent (IBD), but identical by chance (IBC), meaning that your DNA and theirs just happened to match, as a happenstance function of your mother and father’s DNA aligning in such a way that you match the other person, but neither your mother nor your father matches that person on that segment.
  • Legitimate Match – meaning a match that is a result of the DNA that you inherited from one of your parents. This is the opposite of a false positive match.  Legitimate matches are identical by descent (IBD.)  Some IBD matches are considered to be identical by population, (IBP) because they are a result of a particular DNA segment being present in a significant portion of a given population from which you and your match both descend. Ideally, legitimate matches are not IBP and are instead indicative of a more recent genealogical ancestor that can (potentially) be identified.

You can read about Identical by Descent and Identical by Chance here.

  • Endogamy – an occurrence in which people intermarry repeatedly with others in a closed community, effectively passing the same DNA around and around in descendants without introducing different/new DNA from non-related individuals. People from endogamous communities, such as Jewish and Amish groups, will share more DNA and more small segments of DNA than people who are not from endogamous communities.  Fully endogamous individuals have about three times as many autosomal matches as non-endogamous individuals.
  • False Negative Match – a situation where someone who should match doesn’t. False negatives are very difficult to discern.  We most often see them when a match is hovering at a match threshold and by lowering the threshold slightly, the match is then exposed.  False negative segments can sometimes be detected when comparing DNA of close relatives and can be caused by read errors that break a segment in two, resulting in two segments that are too small to be reported individually as a match.  False negatives can also be caused by population phasing, which strips out segments that are deemed to be “too matchy” by Ancestry’s Timber algorithm.
  • Parental or Family Phasing – utilizing the DNA of your parents or other close family members to determine which side of the family a match derives from. Actual phasing means to determine which parts of your DNA come from which parent by comparing your DNA to at least one, if not both parents.  The results of phasing are that we can identify matches to family groups such as the Phased Family Finder results at Family Tree DNA that designate matches as maternal or paternal based on phased results for you and family members, up to third cousins.
  • Population Based Phasing – In another context, phasing can refer to academic phasing where some DNA that is population based is removed from an individual’s results before matching to others. Ancestry does this with their Timber program, effectively segmenting results and sometimes removing valid IBD segments.  This is not the type of phasing that we will be referring to in this article and parental/family phasing should not be confused with population/academic phasing.

IBD and IBC Match Examples

It’s important to understand the definitions of Identical by Descent and Identical by Chance.

I’ve created some easy examples.

Let’s say that a match is defined as any 10 DNA locations in a row that match.  To keep this comparison simple, I’m only showing 10 locations.

In the examples below, you are the first person, on the left, and your DNA strands are showing.  You have a pink strand that you inherited from Mom and a blue strand inherited from Dad.  Mom’s 10 locations are all filled with A and Dad’s locations are all filled with T.  Unfortunately, Mother Nature doesn’t keep your Mom’s and Dad’s strands on one side or the other, so their DNA is mixed together in you.  In other words, you can’t tell which parts of your DNA are whose.  However, for our example, we’re keeping them separate because it’s easier to understand that way.

Legitimate Match – Identical by Descent from Mother

matches-ibd-mom

In the example above, Person B, your match, has all As.  They will match you and your mother, both, meaning the match between you and person B is identical by descent.  This means you match them because you inherited the matching DNA from your mother. The matching DNA is bordered in black.

Legitimate Match – Identical by Descent from Father

In this second example, Person C has all Ts and matches both you and your Dad, meaning the match is identical by descent from your father’s side.

matches-ibd-dad

You can clearly see that you can have two different people match you on the same exact segment location, but not match each other.  Person B and Person C both match you on the same location, but they very clearly do not match each other because Person B carries your mother’s DNA and Person C carries your father’s DNA.  These three people (you, Person B and Person C) do NOT triangulate, because B and C do not match each other.  The article, “Concepts – Match Groups and Triangulation” provides more details on triangulation.

Triangulation is how we prove that individuals descend from a common ancestor.

If Person B and Person C both descended from your mother’s side and matched you, then they would both carry all As in those locations, and they would match you, your mother and each other.  In this case, they would triangulate with you and your mother.

False Positive or Identical by Chance Match

This third example shows that Person D does technically match you, because they have all As and Ts, but they match you by zigzagging back and forth between your Mom’s and Dad’s DNA strands.  Of course, there is no way for you to know this without matching Person D against both of your parents to see if they match either parent.  If your match does not match either parent, the match is a false positive, meaning it is not a legitimate match.  The match is identical by chance (IBC.)

matches-ibc

One clue as to whether a match is IBC or IBD, even without your parents, is whether the person matches you and other close relatives on this same segment.  If not, then the match may be IBC. If the match also matches close relatives on this segment, then the match is very likely IBD.  Of course, the segment size matters too, which we’ll discuss momentarily.
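The A and T examples above can be replayed in a few lines of Python. This is a simplified sketch: it models only the single strand you inherited from each parent, and the zigzag pattern chosen for Person D and the `classify` helper are hypothetical.

```python
MOM = "A" * 10  # your strand inherited from Mom
DAD = "T" * 10  # your strand inherited from Dad


def matches_you(person):
    """Unphased comparison: each location may match either strand."""
    return all(p in (m, d) for p, m, d in zip(person, MOM, DAD))


def classify(person):
    """Label a match using the parental strands as the phasing check."""
    if not matches_you(person):
        return "no match"
    if person == MOM:
        return "IBD (maternal)"
    if person == DAD:
        return "IBD (paternal)"
    return "IBC (zigzag)"


person_b = "AAAAAAAAAA"
person_c = "TTTTTTTTTT"
person_d = "ATATTAATAT"  # hypothetical zigzag pattern

for name, seq in (("B", person_b), ("C", person_c), ("D", person_d)):
    print(name, classify(seq))
```

All three people "match you" when the strands are not separated, but only Person B and Person C survive the comparison to a single parent.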

If a person triangulates with 2 or more relatives who descend from the same ancestor, then the match is identical by descent, and not identical by chance.

False Negative Match

This last example shows a false negative.  The DNA of Person E had a read error at location 5, meaning that there are not 10 locations in a row that match.  This causes you and Person E to NOT be shown as a match, creating a false negative situation, because you actually would match if Person E hadn’t had the read error.

matches-false-negative

Of course, false negatives are by definition very hard to identify, because you can’t see them.
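A small Python sketch shows how a single read error can hide a match. It assumes the conceptual threshold of 10 matching locations in a row used in these examples. The letters for Person E and the helper name `longest_matching_run` are made up for illustration.

```python
def longest_matching_run(person, other):
    """Longest run of consecutive locations where the two people share a letter."""
    best = run = 0
    for mine, theirs in zip(person, other):
        run = run + 1 if set(mine) & set(theirs) else 0
        best = max(best, run)
    return best

THRESHOLD = 10  # conceptual threshold: 10 locations in a row

you = ["AT"] * 10             # an A from Mom and a T from Dad at each location
person_e = ["AC"] * 10        # hypothetical: shares the A with you everywhere
person_e_misread = list(person_e)
person_e_misread[4] = "CC"    # simulated read error at location 5

print(longest_matching_run(you, person_e) >= THRESHOLD)          # reported as a match
print(longest_matching_run(you, person_e_misread) >= THRESHOLD)  # match is hidden
```

With the simulated error, the longest run drops from 10 locations to 5, so the match falls under the threshold and is never shown.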

Comparisons to Your Parents

Legitimate matches will phase to your parents – meaning that you will match Person B on the same amount of a specific segment, or a smaller portion of that segment, as one of your parents.

False matches mean that you match the person, but neither of your parents matches that person, meaning that the segment in question is identical by chance, not by descent.

Comparing your matches to both of your parents is the easiest litmus paper test of whether your matches are legitimate or not.  Of course, the caveat is that you must have both of your parents available to fully phase your results.

Many of us don’t have both parents available to test, so let’s take a look at how often false positive matches really do occur.

False Positive Matches

How often do false matches really happen?

The answer to that question depends on the size of the segments you are comparing.

Very small segments, say at 1cM, are very likely to match randomly, because they are so small.  You can read more about SNPs and centiMorgans (cM) here.

As a rule of thumb, the larger the matching segment as measured in cM, with more SNPs in that segment:

  • The stronger the match is considered to be
  • The more likely the match is to be IBD and not IBC
  • The closer in time the common ancestor, facilitating the identification of said ancestor

Just in case we forget sometimes, identifying ancestors IS the purpose of genetic genealogy, although it seems like we sometimes get all geeked out by the science itself and process of matching!  (I can hear you thinking, “speak for yourself, Roberta.”)

It’s Just a Phase!!!

Let’s look at an example of phasing a child’s matches against those of their parents.

In our example, we have a non-endogamous female child (so they inherit an X chromosome from both parents) whose matches are being compared to her parents.

I’m utilizing files from Family Tree DNA. Ancestry does not provide segment data, so Ancestry files can’t be used.  At 23andMe, coordinating the security surrounding 3 individuals’ results and trying to make sure that the child and both parents all have access to the same individuals through sharing would be a nightmare, so the only vendor whose results you can reasonably utilize for phasing is Family Tree DNA.

You can download the matches for each person by chromosome segment by selecting the chromosome browser and the “Download All Matches to Excel (CSV Format)” at the top right above chromosome 1.

matches-chromosomr-browser

All segment matches 1cM and above will be downloaded into a CSV file, which I then save as an Excel spreadsheet.

I downloaded the files for both parents and the child. I deleted segments below 3cM.

About 75% of the rows in the files were segments below 3cM. In part, I deleted these segments due to the sheer size and the fact that the segment matching was a manual process.  In part, I did this because I already knew that segments below 3 cM weren’t terribly useful.

Rows               Father    Mother    Child
Total              26,887    20,395    23,681
< 3 cM removed     20,461    15,025    17,784
Total Processed     6,426     5,370     5,897
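The figures in the table can be checked with a few lines of Python. The numbers are taken from the table above; the `summarize` helper is just an illustrative name.

```python
# (total rows, rows under 3 cM removed), copied from the table above
rows = {
    "Father": (26887, 20461),
    "Mother": (20395, 15025),
    "Child": (23681, 17784),
}

def summarize(total, removed):
    """Return rows kept and the percentage of rows that were removed."""
    return total - removed, round(100 * removed / total, 1)

for person, (total, removed) in rows.items():
    kept, pct_removed = summarize(total, removed)
    print(f"{person}: {kept} rows kept, {pct_removed}% removed")
```

Each file loses roughly three quarters of its rows, which is consistent with the "about 75%" figure.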

Because I have the ability to phase these matches against both parents, I wanted to see how many of the matches in each category were indeed legitimate matches and how many were false positives, meaning identical by chance.

How does one go about doing that, exactly?

Downloading the Files

Let’s talk about how to make this process easy, at least as easy as possible.

Step one is downloading the chromosome browser matches for all 3 individuals, the child and both parents.

First, I downloaded the child’s chromosome browser match file and opened the spreadsheet.

Second, I downloaded the mother’s file, colored all of her rows pink, then appended the mother’s rows into the child’s spreadsheet.

Third, I did the same with the father’s file, coloring his rows blue.

After I had all three files in one spreadsheet, I sorted the columns by segment size and removed the segments below 3cM.

Next, I sorted the remaining items on the spreadsheet, in order, by column, as follows:

  • End
  • Start
  • Chromosome
  • Matchname

matches-both-parents

My resulting spreadsheet looked like this.  Sorting in the order prescribed provides you with the matches to each person in chromosome and segment order, facilitating easy (OK, relatively easy) visual comparison for matching segments.
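If you would rather script this step than sort in a spreadsheet, here is a minimal Python sketch. It assumes the sort order listed above is applied one column at a time, least significant first, which ends up the same as one sort by matchname, then chromosome, then start, then end. The rows and column positions are invented for illustration and are not the vendor's file layout.

```python
from operator import itemgetter

# Hypothetical rows: (owner, matchname, chromosome, start, end, cM)
rows = [
    ("child",  "Mess",   11, 5_000_000, 9_000_000, 4.2),
    ("father", "Mess",   11, 5_000_000, 9_000_000, 4.2),
    ("child",  "Donald",  3, 1_000_000, 2_500_000, 3.1),
    ("mother", "Mess",    2, 7_000_000, 8_000_000, 3.5),
    ("child",  "Gaff",    1, 1_000_000, 1_200_000, 1.4),
]

# Remove segments below 3 cM.
rows = [r for r in rows if r[5] >= 3.0]

# Successive stable sorts: End, then Start, then Chromosome, then Matchname.
stepwise = rows
for column in (4, 3, 2, 1):
    stepwise = sorted(stepwise, key=itemgetter(column))

# The same ordering expressed as a single multi-key sort.
combined = sorted(rows, key=itemgetter(1, 2, 3, 4))

for row in combined:
    print(row)
```

Either way, each person's segments end up together in chromosome and position order, with the child's and parents' rows for the same segment next to each other.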

I then colored all of the child’s NON-matching segments green so that I could see (and eventually filter the matchname column by) the green color indicating that they were NOT matches.  Do this only for the child, or the white (non-colored) rows.  The child’s matchname only gets colored green if there is no corresponding match to a parent for that same person on that same chromosome segment.

matches-child-some-parents

All of the child’s matches that DON’T have a corresponding parent match in pink or blue for that same person on that same segment will be colored green.  I’ve boxed the matches so you can see that they do match, and that they aren’t colored green.

In the above example, Donald and Gaff don’t match either parent, so they are all green.  Mess does match the father on some segments, so those segments are boxed, but the rest of Mess doesn’t match a parent, so is colored green.  Sarah doesn’t match any parent, so she is entirely green.

Yes, you do manually have to go through every row on this combined spreadsheet.
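The row-by-row comparison could, in principle, be scripted. The Python sketch below is not the process used here, just one way to flag the "green" rows. It treats any overlap with a parent's segment for the same person on the same chromosome as a corresponding match, which is an assumption. The field names and positions are invented.

```python
from collections import namedtuple

# Field names are illustrative; they are not the vendor's column headers.
Segment = namedtuple("Segment", "owner matchname chromosome start end")

def overlaps(a, b):
    """True if both rows are for the same match and share part of a chromosome."""
    return (a.matchname == b.matchname
            and a.chromosome == b.chromosome
            and a.start < b.end and b.start < a.end)

def green_rows(child_rows, parent_rows):
    """Child rows with no overlapping parent row for the same person."""
    return [c for c in child_rows
            if not any(overlaps(c, p) for p in parent_rows)]

# Made-up positions, loosely following the Mess and Sarah example.
child = [
    Segment("child", "Mess", "11", 20_000_000, 24_000_000),
    Segment("child", "Mess", "2", 7_000_000, 8_000_000),
    Segment("child", "Sarah", "4", 1_000_000, 3_000_000),
]
parents = [
    Segment("father", "Mess", "11", 20_000_000, 24_000_000),
]

for row in green_rows(child, parents):
    print(row.matchname, row.chromosome)
```

A script like this only narrows the work. Borderline overlaps still deserve a look by eye.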

If you’re going to phase your matches against your parent or parents, you’ll want to know what to expect.  Just because you’ve seen one match does not mean you’ve seen them all.

What is a Match?

So, finally, the answer to the original question, “What is a Match?”  Yes, I know this was the long way around the block.

In the exercise above, we weren’t evaluating matches, we were just determining whether or not the child’s match also matched the parent on the same segment, but sometimes it’s not clear whether they do or do not match.

matches-child-mess

In the case of the second match with Mess on chromosome 11, above, the starting and ending locations, and the number of cM and segments are exactly the same, so it’s easy to determine that Mess matches both the child and the father on chromosome 11. All matches aren’t so straightforward.

Typical Match

matches-typical

This looks like your typical match for one person, in this case, Cecelia.  The child (white rows) matches Cecelia on three segments that don’t also match the child’s mother (pink rows.)  Those non-matching child’s rows are colored green in the match column.  The child matches Cecelia on two segments that also match the mother, on chromosome 20 and the X chromosome.  Those matching segments are boxed in black.

The segments in both of these matches have exact overlaps, meaning they start and end in exactly the same location, but that’s not always the case.

And for the record, matches that begin and/or end in the same location are NOT more likely to be legitimate matches than those that start and end in different locations.  Vendors use small buckets for matching, and if you fall into any part of the bucket, even if your match doesn’t entirely fill the bucket, the bucket is considered occupied.  So what you’re seeing are the “fuzzy” bucket boundaries.

(Over)Hanging Chad

matches-overhanging

In this case, Chad’s match overhangs on each end.  You can see that Chad’s match to the child begins at 52,722,923 before the mother’s match at 53,176,407.

At the end location, the child’s matching segment also extends beyond the mother’s, meaning the child matches Chad on a longer segment than the mother.  This means that the segment sections before 53,176,407 and after 61,495,890 are false positive matches, because Chad does not also match the child’s mother on these portions of the segment.

This segment still counts as a match though, because on the majority of the segment, Chad does match both the child and the mother.

Nested Match

matches-nested

This example shows a nested match, where the parent’s match to Randy begins before the child’s and ends after the child’s, meaning that the child’s matching DNA segment to Randy is entirely nested within the mother’s.  In other words, pieces got shaved off of both ends of this segment when the child was inheriting from her mother.
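The exact, overhanging and nested cases come down to comparing start and end positions. Here is a rough Python sketch. The `classify` helper and its labels are hypothetical. The mother's positions for Chad come from the example above, while the child's end position is invented because it isn't quoted.

```python
def classify(child, parent):
    """Describe how a child's (start, end) segment sits relative to a parent's."""
    c_start, c_end = child
    p_start, p_end = parent
    if c_end <= p_start or p_end <= c_start:
        return "no overlap"
    if (c_start, c_end) == (p_start, p_end):
        return "exact"
    if p_start <= c_start and c_end <= p_end:
        return "nested"
    return "overhanging"  # the child's segment extends past the parent's on at least one end

# Chad: the child's start and the mother's start and end are from the article;
# the child's end position (61,800,000) is made up.
print(classify((52_722_923, 61_800_000), (53_176_407, 61_495_890)))
# A made-up nested case: the child's segment sits inside the mother's.
print(classify((30_000_000, 40_000_000), (28_000_000, 45_000_000)))
```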

No Common Matches

matches-no-common

Sometimes, the child and the parent will both match the same person, but there are no common segments.  Don’t read more into this than what it is.  The child’s matches to Mary are false matches.  We have no way to judge the mother’s matches, except for segment size probability, which we’ll discuss shortly.

Look Ma, No Parents

matches-no-parents

In this case, the child matches Don on 5 segments, including a reasonably large segment on chromosome 9, but there are no matches between Don and either parent.  I went back and looked at this to be sure I hadn’t missed something.

This could, possibly, be an instance of an unseen false negative, meaning perhaps there is a read issue in the parent’s file on chromosome 9, precluding a match.  However, in this case, since Family Tree DNA does report matches down to 1cM, it would have to be an awfully large read error for that to occur.  Family Tree DNA does have quality control standards in place and each file must pass the quality threshold to be put into the matching database.  So, in this case, I doubt that the problem is a false negative.

Just because there are multiple IBC matches to Don doesn’t mean any of those are incorrect.  It’s just the way that the DNA is inherited and it’s why this type of a match is called identical by chance – the key word being chance.

Split Match

matches-split

This split match is very interesting.  If you look closely, you’ll notice that Diane matches Mom on the entire segment on chromosome 12, but the child’s match is broken into two.  However, the number of SNPs adds up to the same, and the number of cM is close.  This suggests that there is a read error in the child’s file forcing the child’s match to Diane into two pieces.

If the segments broken apart were smaller, under the match threshold, and there were no other higher matches on other segments, this match would not be shown and would fall into the False Negative category.  However, since that’s not the case, it’s a legitimate match and just falls into the “interesting” category.
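One way to spot a split like this in a script is to join a person's segments when the gap between them is very small. The Python sketch below is only an illustration. The `max_gap` value and the positions are arbitrary, and a small gap is a hint of a read error, not proof.

```python
def merge_split(segments, max_gap):
    """Join consecutive (start, end) segments whose gap is at most max_gap base pairs."""
    merged = []
    for start, end in sorted(segments):
        if merged and start - merged[-1][1] <= max_gap:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged

# Made-up positions: the child's match is in two pieces, the mother's is in one.
child_pieces = [(10_000_000, 14_000_000), (14_050_000, 19_000_000)]
mother_piece = [(10_000_000, 19_000_000)]

print(merge_split(child_pieces, max_gap=100_000) == mother_piece)
```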

The Deceptive Match

matches-surname

Don’t be fooled by seeing a family name in the match column and deciding it’s a legitimate match.  Harrold is a family surname and Mr. Harrold does not match either of the child’s parents, on any segment.  So not a legitimate match, no matter how much you want it to be!

Suspicious Match – Probably not Real

matches-suspicious

This technically is a match, because part of the DNA that Daryl matches between Mom and the child does overlap, from 111,236,840 to 113,275,838.  However, if you look at the entire match, you’ll notice that not a lot of that segment overlaps, and the number of cMs is already low in the child’s match.  There is no way to calculate the number of cMs and SNPs in the overlapping part of the segment, but suffice it to say that it’s smaller, and probably substantially smaller, than the 3.32 total match for the child.

It’s up to you whether you actually count this as a match or not.  I just hope this isn’t one of those matches you REALLY need.  However, in this case, the Mom’s match at 15.46 cM is 99% likely to be a legitimate match, so you really don’t need the child’s match at all!!!
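The size of the overlap can at least be measured in base pairs. The Python sketch below uses the overlap boundaries quoted above (111,236,840 to 113,275,838). Which person's segment starts or ends at those positions is an assumption, and the outer ends are invented. Base pairs are only a rough stand-in, because cM and SNP counts do not scale evenly along a chromosome.

```python
def overlap_bp(a, b):
    """Length in base pairs of the stretch shared by two (start, end) segments."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]))

# The overlap boundaries come from the article; the outer ends are made up.
child_segment = (109_000_000, 113_275_838)  # start is hypothetical
mom_segment = (111_236_840, 125_000_000)    # end is hypothetical

shared = overlap_bp(child_segment, mom_segment)
child_length = child_segment[1] - child_segment[0]

print(shared)                            # base pairs in common
print(round(shared / child_length, 2))   # fraction of the child's segment that overlaps
```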

So, Judge Judy, What’s the Verdict?

How did our parental phasing turn out?  What did we learn?  How many segments matched both the child and a parent, and how many were false matches?

In each cM Size category below, I’ve included the total number of child’s match rows found in that category, the number of parent/child matches, the percent of parent/child matches, the number of matches to the child that did NOT match the parent, and the percent of non-matches. A non-match means a false match.

So, what’s the verdict?

matches-parent-child-phased-segment-match-chart

It’s interesting to note that we just approach the 50% mark for phased matches in the 7-7.99 cM bracket.

The bracket just beneath that, 6-6.99 shows only a 30% parent/child match rate, as does 5-5.99.  At 3 cM and 4 cM few matches phase to the parents, but some do, and could potentially be useful in groups of people descended from a known common ancestor and in conjunction with larger matches on other segments. Certainly segments at 3 cM and 4 cM alone aren’t very reliable or useful, but that doesn’t mean they couldn’t potentially be used in other contexts, nor are they always wrong. The smaller the segment, the less confidence we can have based on that segment alone, at least below 9-15cM.

Above the 50% match level, we quickly reach the 90th percentile in the 9-9.99 cM bracket, and above 10 cM, we’re virtually assured of a phased match, but not quite 100% of the time.

It isn’t until we reach the 16cM category that we actually reach the 100% bracket, and there is still an outlier found in the 18-18.99 cM group.
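For anyone who wants to build the same kind of chart from their own phased spreadsheet, the tally is simple to script. This Python sketch groups segments into 1 cM brackets and reports the percentage that matched a parent. The sample data is invented and is not taken from the chart.

```python
import math
from collections import defaultdict

def phased_rate_by_bracket(segments):
    """segments: iterable of (cM, matched_parent) pairs.

    Returns {bracket_floor: (total, phased, percent_phased)}.
    """
    counts = defaultdict(lambda: [0, 0])
    for cm, matched_parent in segments:
        bracket = math.floor(cm)          # 7.2 cM falls in the 7-7.99 bracket
        counts[bracket][0] += 1
        counts[bracket][1] += bool(matched_parent)
    return {b: (total, phased, round(100 * phased / total, 1))
            for b, (total, phased) in sorted(counts.items())}

# Made-up example rows: (segment size in cM, did it also match a parent?)
sample = [(3.1, False), (3.9, False), (7.2, True), (7.8, False), (10.5, True)]

for bracket, (total, phased, pct) in phased_rate_by_bracket(sample).items():
    print(f"{bracket}-{bracket}.99 cM: {phased} of {total} phased ({pct}%)")
```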

I went back and checked all of the 10 cM and over non-matches to verify that I had not made an error.  If I made errors, they were likely counting too many as NON-matches, and not the reverse, meaning I failed to visually identify matches.  However, with almost 6000 spreadsheet rows for the child, a few errors wouldn’t affect the totals significantly or even noticeably.

I hope that other people in non-endogamous populations will do the same type of double parent phasing and report on their results in the same type of format.  This experiment took about 2 days.

Furthermore, I would love to see this same type of experiment for endogamous families as well.

Summary

If you can phase your matches to either or both of your parents, absolutely, do.  This exercise shows why, if you have only one parent to match against, you can’t just assume that anyone who doesn’t match you on your one parent’s side automatically matches you from the other parent. At least, not below about 15 cM.

Whether you can phase against your parent or not, this exercise should help you analyze your segment matches with an eye towards determining whether or not they are valid, and what different kinds of matches mean to your genealogy.

If nothing else, at least we can quantify the relative likelihood, based on the size of the matching segment, that in a non-endogamous population a match would match a parent, if we had one to match against, meaning that they are a legitimate match.  Did you get all that?

In a nutshell, we can look at the Parent/Child Phased Match Chart produced by this exercise and say that our 8.5 cM match has about a 66% chance of being a legitimate match, and our 10.5 cM match has a 95% chance of being a legitimate match.

You’re welcome.

Enjoy!!

______________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

Concepts – Identical by…Descent, State, Population and Chance

In genetic genealogy, what does it mean when someone says they are “identical by” something…and what are those various somethings?

In autosomal DNA, where your DNA on chromosomes 1-22 (and sometimes X) is compared to other people for matches of a size that indicates a genealogical relationship, you can actually match people in different ways, for different reasons.

But first, let’s make one thing perfectly clear. There is only one way to obtain your autosomal DNA – and that’s through your parents, 50% from each parent.  However, how much of their (and your) ancestor’s DNA you receive is not necessarily half of what they received from that ancestor.

If you receive ANY DNA from that ancestor, it MUST BE through your parents. There is no other way to inherit DNA.

Period.

No. Other. Way.

If you would like to read the Concepts article about inheritance and matching, click here. If you don’t understand autosomal DNA inheritance and matching concepts, you won’t be able to understand the rest of this article.

Identical by Descent (IBD)

When you match someone because you share DNA from a common ancestor, that is called Identical by Descent, or IBD. That’s what you want.  That’s a good thing, genealogically speaking.

Let’s take a look at how an IBD segment of DNA works. In the graphic below, the strand location is in the first column.  The next two pink columns are the two strands that your mother carries, one from her Mom and one from her Dad – and the values in each location from each parent.  Columns 4 and 5 are the two blue strands of DNA carried by your Dad, one from his Mom and one from his Dad.  The final two columns are what you inherited from both your mother and your father.  In this case, we made it easy and you simply inherited one of each of their strands entirely.  Yes, that does happen in some cases for a particular chromosome segment, but not all of the time.  Conceptually, for this example, it doesn’t matter.

Identical 1

Your Inheritance

In this example, you inherited strand 1 from your Mom, all As and strand 2 from Dad, all Gs. Your match, shown in the graphic below, matches you on all As, so also matches your mother.  This phenomenon is called parental phasing, which means we know it’s a legitimate match because the person matches both you and one of your parents.

For purposes of this conceptual discussion you must match on all 10 locations for this to be considered a matching segment. So in this case, your matching threshold is “10 locations.”

Identical 2

Your Match Matches You and Your Mother’s DNA – Identical by Descent

Now, understand that while I’ve shown “You” with your strands color coded so you can see who you received which pieces of DNA from – that’s not how your DNA really looks. There is no color coding in nature.  I’ve added color coding to make understanding these concepts easier.

This is how your DNA and your parents’ DNA really look:

Identical 3

Notice that in your parents, their parents’ strands are mixed back and forth, so you really can’t tell which DNA came from whom.  It’s the same for you too.

What the matching software has to do is to look for a common letter between you and your match.

So, at location 1, you inherited an A and a G from your parents. Your match has an A and a T, so you and your match share a common A.  If you look at all of your match’s locations, they share a common A with you on all of those locations.  It just so happens you received that A from your mother – but without your Mom to compare to – you have no way to know which parent that particular DNA value came from.  So, the best matching software can do is to tell you that indeed, you do match – on 10 locations in a row – so this is considered a match and will be reported as such on your match list.

Why you match is another matter altogether.

And, ahem….there is another way to match someone, aside from receiving ancestral DNA from your parents. I know, this is a bad joke, isn’t it?  Yes, it is, but it’s real.

So, to summarize, there is no other way to obtain your DNA except 50% from one parent and 50% from the other.

However there are two ways to match someone:

  • Identical by Descent, IBD, meaning you match someone because you share the same DNA segment that you received from an ancestor through a parent, as shown above.
  • Identical by Chance, IBC, meaning that you match someone, but randomly – not by inheritance.  How the heck can that happen?

Let’s look at how that can happen.

Identical by Chance (IBC)

Because you receive a strand of DNA from each of your parents, but that DNA is all intermixed in you, someone can match you without matching an ancestral DNA segment that you inherited from an ancestor.  Instead, by chance, they match DNA that bounces back and forth between your parents’ DNA.

Identical 4

Your Match Matches Neither of your Parents’ Strands of DNA – Identical by Chance

In this example, you can see that you inherited the same strands from your parents as in example 1 above, but your match is now matching you, not on your mother’s strand 1, all As, but on a combination of A from your mother and G from your father. Therefore, they don’t match either of your parents on this segment, because they are matching you by chance and not because you share a strand of DNA that you received from a common ancestor on this segment with your match.

This is easy to discern because while they match you, they won’t match either of your parents on that segment, because the match is not on an ancestral DNA segment, passed down from an ancestor. Using parental phasing, you compare your matches to your parents to see which “side” they fall on.  If they fall on neither parent’s side, then they are IBC or identical by chance.
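Here is a small Python sketch of the idea, using the conceptual rule that two people match at a location when they share a letter. Your strands (all As from Mom, all Gs from Dad) follow the example. Your parents' other strands and both matches' letters are made up, and `phase` is a hypothetical helper.

```python
def genotype(strand1, strand2):
    """Pair up two strands into unordered letter pairs, one per location."""
    return [a + b for a, b in zip(strand1, strand2)]

def matches_on_all(person, other):
    """True if the two people share at least one letter at every location."""
    return all(set(x) & set(y) for x, y in zip(person, other))

def phase(match, mom, dad):
    """Say which parent a match phases to, or IBC if it phases to neither."""
    on_mom = matches_on_all(match, mom)
    on_dad = matches_on_all(match, dad)
    if on_mom and on_dad:
        return "both sides"
    if on_mom:
        return "Mom's side"
    if on_dad:
        return "Dad's side"
    return "IBC"

mom = genotype("A" * 10, "C" * 10)   # her second strand is hypothetical
dad = genotype("G" * 10, "T" * 10)   # his second strand is hypothetical
you = genotype("A" * 10, "G" * 10)   # all As from Mom, all Gs from Dad

zigzag = genotype("AGAGAGAGAG", "CTCTCTCTCT")  # alternates between your A and your G
maternal = genotype("A" * 10, "A" * 10)        # carries Mom's all-A strand

for name, person in (("zigzag", zigzag), ("maternal", maternal)):
    print(name, matches_on_all(you, person), phase(person, mom, dad))
```

Both people match you on all 10 locations, but only the second one also matches a parent. The zigzag match disappears as soon as a parent is used for comparison.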

Identical 5

Identical By Chance Identified Through Parental Phasing

In this example, you can see that you match all of these people. By using parental phasing, you can tell that you are identical by descent (IBD) to everyone except John, who matches neither of your parents, so your match to John is identical by chance (IBC).  We will talk more in an upcoming article about Parental Phasing.

If you don’t have your parents to compare to, and you match multiple people on the same segment, there should be 2 groups of people who all match each other on that segment – one group from your Mom’s side and one from your Dad’s side – even if you can’t identify your common ancestor. If there are people who don’t fit into either of those two groups, because they don’t match those group members, then the misfits are identical by chance.

Even if your parents are unavailable, this is a situation where testing other relatives helps, and the closer the better, because those relatives will also fall into those match groups and will help identify which group is from which side of your family, and which ancestral line.

In the example below, using the same people from the phased parent example above, we no longer have our parents to compare to, but we do have an aunt, Mom’s sister, and an uncle, Dad’s brother. By comparing those who match us to our close relatives – if everyone in the match group matches each other, then we know they are IBD and they come from Mom’s side of the family or Dad’s side of the family.

Identical 6

Identical By Chance Identified Through Close Family Match Groups

In general matching, meaning not on specific segments, just on your match list, if John and I match, but John doesn’t match mother’s sister, it could mean that John matches me on a different segment that my aunt didn’t inherit from my grandparents but that my mother did. So the match could be valid, even though he doesn’t match my aunt.

However, moving to the segment matching level, shown above, we can differentiate, at least for that segment.  This is yet another example of why segment analysis tools are so critically important.

If we only had one matching group, the green above, we would not be able to say that John was IBC on this segment, because John might be matching me on Dad’s side.

But in this case, we have proof points on both sides of this same segment, with two match groups, green from Mom and blue from Dad.  Mom’s side has a match group of 4+me (including her sister) who all match each other on this same segment, indicating that they all descend through my mother’s side of my tree.  On Dad’s side, we have his brother and two other people who match each other and me on those same segments.

Since John matches no one in either match group on either side, his match to me on this segment must be IBC.  You can read more about match groups and confidence here.
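The match group logic can be sketched in Python as well. The example below groups people who match each other on the same segment and treats anyone left alone as a likely IBC match. It uses connected groups, which is looser than requiring that everyone in a group matches everyone else. Other than John, the names are placeholders.

```python
from itertools import combinations

def match_groups(people, pairs):
    """Group people who are linked by matches on the same segment."""
    neighbors = {p: set() for p in people}
    for a, b in pairs:
        neighbors[a].add(b)
        neighbors[b].add(a)
    groups, seen = [], set()
    for person in people:
        if person in seen:
            continue
        stack, group = [person], set()
        while stack:
            current = stack.pop()
            if current in group:
                continue
            group.add(current)
            stack.extend(neighbors[current] - group)
        seen |= group
        groups.append(group)
    return groups

# Everyone listed matches me on this segment; names other than John are placeholders.
moms_side = ["Aunt", "Cousin 1", "Cousin 2", "Cousin 3"]
dads_side = ["Uncle", "Cousin 4", "Cousin 5"]
people = moms_side + dads_side + ["John"]

# Within each side, everyone matches everyone else on the segment.
pairs = list(combinations(moms_side, 2)) + list(combinations(dads_side, 2))

groups = match_groups(people, pairs)
misfits = [g for g in groups if len(g) == 1]
print(groups)
print(misfits)   # people who match no one in either group
```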

Identical by chance segments tend to be smaller segments, because the chances of matching more locations in a row by chance diminish as the number of locations increases.
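A toy calculation shows why. Suppose, purely for illustration, that each location has the same independent chance of matching at random. Real DNA does not behave this neatly, and the 0.5 used below is an arbitrary number, not a measured rate.

```python
def chance_of_run(per_location_chance, locations):
    """Probability that every one of `locations` independent locations matches by chance."""
    return per_location_chance ** locations

for n in (1, 5, 10, 20):
    print(n, chance_of_run(0.5, n))
```

Every added location multiplies the chance by the same fraction, so long runs of chance matches become rare quickly.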

Ok, so now you’ve got this – the two ways to match. Identical by descent (IBD) and identical by chance (IBC,) nature’s cruel joke.

So, what the heck are identical by state (IBS) and identical by population (IBP)?

Good questions.

Identical by State (IBS)

Identical by state is really an archaic term now, but you’ll likely still run into it from time to time. Understand that genetic genealogy is still a really new field of discovery.  Initially, terms weren’t defined very well and have since evolved.  IBD was used to mean a match where you could find a common ancestral line.  IBS, or identical by state, was often used when one could not find the ancestral line.  What this implied was that the match was not genealogical in nature.  But that often wasn’t true.  Just because we can’t determine who the common ancestor is, doesn’t mean that common ancestor doesn’t exist.  After we have more matches, we may well figure out the common ancestor at a later time.

What are some reasons we might not be able to figure out who our common ancestor is?

  • There’s a NPE or undocumented adoption in one line or the other.
  • The pedigree chart of one or both people doesn’t go back far enough in time.
  • The pedigree chart of one or both people is incorrect.
  • Not enough people have tested to connect the dots between the DNA. For example, we may share a common surname, Dodson, but be unable to actually pinpoint which Dodson line/ancestor we share.
  • The match is identical by population (IBP) and not in a genealogical timeframe. We see this most often in highly endogamous populations.
  • The match is identical by chance (IBC) and there is no common ancestor.

The tendency in the past has been to assume that if you can’t find the ancestor, then the problem MUST be that the match is Identical by State. But the problem is that identical by state includes two categories that are mutually exclusive: Identical by Chance and Identical by Population.

Identical by chance means there is no common ancestor, as we illustrated above.

Identical by Population means there IS a common ancestor, and you did receive your DNA from that ancestor, but you may not be able to figure out who it was because it’s too far back in time and many people from that same population base share that DNA segment.

So, today, we don’t say IBS anymore. We say IBD, and if it’s not IBD, then it’s either IBC or IBP, but not IBS. If someone says IBS, you need to ask and see if you can determine whether they mean IBC or IBP, or if they are trying to say something else like “I can’t identify the common ancestor so it must be IBS.”

Identical by Population (IBP)

Identical by population means that a large portion of a population group shares a particular segment of DNA. Some people feel IBP segments are not useful and want all of these segments to be stripped away by population (or academic) based phasing software.

In some cases, if an individual is 100% Jewish, for example, they will have many IBP segments from within the highly endogamous Jewish population. They don’t have any other ancestral DNA segments from ancestors who aren’t Jewish to contrast against in their DNA, so their IBP segments are not useful to them, and are, in fact, just the opposite.  There are too many IBP segments and they are in the way – often referred to as “noise” because they are not genealogically useful, even though they are descended from an ancestor (IBD).  So, yes, IBP is a subset of IBD.

However, for someone who has the following genealogy, these same population based endogamous segments can be extremely useful and informative.

Identical 7

In this conceptual pedigree chart, the Jewish person married a non-Jewish person with deep colonial American ancestry. Their child “Colonial Jew” married someone who was mixed “Irish Asian.”  The person at the bottom, “me,” is not themselves endogamous but has several widely variant lines in their heritage including endogamous lines.

If I’m lucky enough to have an African population segment, that tells me very clearly which genealogical line that match is probably from. But if those IBP segments are removed, they can’t inform me in this situation.

Same with Jewish, or Asian, or Native American.

Let’s see how this might work in real matching.

Let’s say your mother’s A value is only found in African populations, and it’s found in very high proportions in African populations and much less frequently anyplace else in the world, except for where Africans settled.

Identical 8

Identical By Population Example Where Mother’s A Equals African

A few match outcomes are possible:

  1. You match with someone and you can discern a common ancestor or at least an ancestral line because you have only one African genealogical line – an ancestor in your mother’s line, like in the pedigree chart above.
  2. You match with someone and you cannot discern a common ancestor because many or all of your lines are African, similar to the Jewish example.
  3. You match with someone and you identify a common ancestor, but later a second genealogical line matches on that same segment because the segment is so common in the African population. This means you could have received that actual DNA segment from either ancestral line.
  4. Some DNA testing company runs academic or population based phasing software against your DNA and removes that segment entirely because they’ve decided that it occurs too frequently in a population to be useful. In this case, you won’t match that person at all.
  5. Some DNA testing company runs academic or population based phasing software against your DNA and removes that segment entirely because they’ve decided that particular segment in your results is “too matchy” so it must therefore be “invalid” and population based. This is often referred to as a “pile-up” and means that you have proportionally more matches on that segment than you do on other segments. If your “pile-up” segments are removed in this case, again, you won’t match at all. This is exactly what happened to my Acadian matches when Ancestry implemented their Timber phasing software, which removes pile-ups.

The graph below was provided to me at Ancestry DNA Day as an example of my own “pile-up” areas in my genome.

genome pileups

Ancestry, with their Timber routine, uses population phasing and removes the areas of your DNA that they deem “too matchy.” This helps Jewish and other heavily endogamous people by removing truly population based matches that are spurious and for which the contributing ancestor is impossible to discern.  An endogamous individual could achieve much of the same effect by utilizing a higher matching threshold for their own matches, although that’s not an option at Ancestry.

However, for those of us who are not entirely endogamous, but who may have endogamous lines or lines from different parts of the world, population based phasing removes valuable informational segments and therefore, prevents valuable matches. When Ancestry ran Timber against my results, I lost all but one of my Acadian matches.  Yes, Acadians are heavily endogamous, but in my case, that line accounts for 1 of my 16 great-great-grandparents.  Believe me, if I had a tool to put all of my autosomal matches in one of 16 buckets, I would think it was a wonderful day!!!

16 gggrandparents

Because of endogamy, I actually carry MORE Acadian DNA than I would otherwise carry from a non-endogamous population – so yes, I am very matchy to my Acadian cousins, especially on smaller segments – or I was until Ancestry stripped all of that away.  Thankfully, I still have all of my matches at Family Tree DNA.

Why is endogamous DNA more matchy? Because endogamous populations only have the founders’ DNA and they just keep passing the same founder DNA around and around.

Ironically, another term for this kind of phasing is “excess IBD” phasing. This means that “someone” decides unilaterally how much matching one “should” have and just chops the rest off at that threshold.  Clearly, that threshold for a fully Jewish person and me would be very different – and one size absolutely does NOT fit all.

I want to show you one more example of what population based phasing does. It chops the heart out of segments that would otherwise match.

People whose parents also test should match each parent on exactly 22 segments, one for each chromosome – because each child is a 100% match to their parents. If there is a read error or two (or three), then let’s say they could have as many as 25 matching segments, because some chromosomes are chopped in two because of a technical issue.  It occasionally happens.

At Ancestry, we’re seeing 80 to 120 matching segments for each parent/child pair, which means Timber is removing 58 to roughly 100 legitimate segments that you received from your parent.  One individual reported that they match one parent on 150 different segments, meaning that Ancestry removed 128 segments they decided are “too matchy” but that are very clearly ancestral, or IBD, because all of your DNA must match your parent’s DNA on the strand they gave you.  However, because of Timber’s removal of “too matchy” segments, the person no longer matches their parent on any of those 58 to 128 removed segments.  And remember, there is only one way to receive your DNA, so all of your DNA must match that of your parents.  You have no invalid matches to your parents’ DNA.  You can read more here.
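The arithmetic behind those numbers is simple enough to check yourself. The sketch below assumes, as the text does, that an untouched parent/child comparison shows one segment per chromosome (22 in total), so every segment reported beyond 22 reflects one piece that was cut out. The function name is my own, for illustration only.

```python
EXPECTED_SEGMENTS = 22  # one full-length match per chromosome, as described above


def removed_segments(reported_segments, expected=EXPECTED_SEGMENTS):
    """Estimate how many pieces were cut out of a parent/child comparison."""
    return max(reported_segments - expected, 0)


for reported in (80, 120, 150):
    print(reported, "reported segments ->", removed_segments(reported), "removed")
```

This ignores the occasional read error mentioned earlier, which could account for two or three of the extra segments.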

Here’s a visual of what IBP phased matching does to you. Recall in our example that you need 10 contiguous matching locations to be considered a match.  I’m showing 20 locations in this example.

Identical 9

Normal Matching – No Population or Academic Phasing

In this first example, the DNA you inherited from your mother is a combination of T and A, where A=African. Notice that only part of what you inherited from your mother is the A this time.

In normal matching without IBP phasing, above, the matching threshold is still 10, but you match your match on a segment that totals 20 locations or units. Now it’s up to you to see if you can identify your common ancestor.

In the IBP phased example, below, your African DNA is removed as a result of population based phasing software. Your African DNA used to be where the red spot with no values is showing in the You 1 column.  Therefore, you still match on the Ts, but you only have a contiguous run of 7 Ts, then the 7 As that phasing deleted, then 6 more matching Ts.  The problem is, of course, that instead of a nice matching segment of 20 units, above, you now have no match at all because you don’t have 10 matching locations in a row.  Of course, the same IBP phasing would apply to your mother, so your match would not match your mother either, which means that a valid parentally phased match is not reported.

Identical 10

Population Based Phased Matching Example Removing African
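The two examples can also be worked through in a few lines of code. This is a minimal sketch of the toy model used in this article (20 locations, a threshold of 10 contiguous matching locations, and a 7/7/6 layout of Ts and As), not a vendor's real matching algorithm. The function names are my own.

```python
def longest_run(you, match, removed=frozenset()):
    """Longest contiguous run of locations where you and your match agree.

    Any location listed in `removed` breaks the run, just as a deleted
    region does in the phased example.
    """
    best = current = 0
    for i, (a, b) in enumerate(zip(you, match)):
        if i not in removed and a == b:
            current += 1
            best = max(best, current)
        else:
            current = 0
    return best


def is_match(you, match, threshold=10, removed=frozenset()):
    """True when the longest contiguous run reaches the matching threshold."""
    return longest_run(you, match, removed) >= threshold


you = "T" * 7 + "A" * 7 + "T" * 6   # 20 locations, as in the examples
match = you                          # your match carries the same values

african = frozenset(range(7, 14))    # the 7 "A" locations removed by phasing

print(longest_run(you, match), is_match(you, match))
print(longest_run(you, match, african), is_match(you, match, removed=african))
```

With nothing removed, the run is 20 locations long and you are a match. With the middle 7 locations removed, the longest run left is 7, which is below the threshold of 10, so no match is reported.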

What’s worse, you’ll never have that opportunity to see if you can find your common ancestor, because you and your match will never be reported as a match. This is a lost opportunity.  In the first “normal matching” example, you may never BE able to find that common ancestor, but you have the opportunity to try.  In the second IBP phased matching example, you certainly won’t ever find your common ancestor because you’re not shown as a match.  When population based or academic phasing is involved, you’ll never know what you are missing.

This chopping phenomenon is not a rare occurrence with population based phasing. In fact, if you divide 100 removed segments by 22 chromosomes, there are approximately 4.5 artificial “chops” taken out of every one of your 22 chromosomes with each parent at Ancestry, and in some cases, more.  The person who now matches their parent on 150 segments has an average of 5.8 artificial, phasing-induced chops in each chromosome.  When Ancestry implemented Timber, many people lost between 80% and 90% of their total matches.  Mine went from 13,100 to 3,350, a loss of about 75%.  At least some of those were valid matches for which we had identified common ancestral lines.
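For anyone who wants to run the same numbers on their own results, here is a small sketch of the two calculations in that paragraph: average chops per chromosome and percentage of matches lost. The function names are hypothetical, and the inputs are the figures quoted above.

```python
def chops_per_chromosome(removed_segments, chromosomes=22):
    """Average number of removed pieces per chromosome."""
    return removed_segments / chromosomes


def percent_lost(before, after):
    """Percentage of matches lost between two match counts."""
    return 100 * (before - after) / before


print(round(chops_per_chromosome(100), 1))   # 100 removed segments
print(round(chops_per_chromosome(128), 1))   # the person with 150 segments
print(round(percent_lost(13_100, 3_350), 1)) # my own match count before and after
```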

So, identical by population (IBP) doesn’t necessarily mean bad, unless you’re entirely endogamous. If you’re entirely endogamous, then IBP means challenging and can generally be overcome by looking at larger matching segments, which are less likely to be either IBP or IBC.

Identical by population can be very useful in someone not entirely endogamous in that it preserves ancestral DNA in a given population. In people who carry a combination of different endogamous lines, such as Jewish and Acadian, this phenomenon can actually be very useful, because it increases your chances of matching other individuals from that ancestral line – and being able to assign them appropriately.

Identical by What?

So, in summary, you are either identical because you received DNA from a common ancestor (IBD) or identical by chance (IBC) because nature is playing a mean joke on you and you match, literally, by chance because your match’s DNA is zigzagging back and forth between your parents’ DNA.  And by the way, you can match someone IBD on one segment and the same person IBC or IBP on others.

If you match someone but that person does not also match either of your parents, then it’s an IBC, identical by chance, match. Measuring a match against both yourself and your parents to determine if the match is IBC or IBD is called parental phasing.  We will have a Concepts article shortly about Parental Phasing, so stay tuned.
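As a rough illustration of that rule, the sketch below sorts a match by whether they also match your mother, your father, both, or neither on the same segment. It assumes you already have, for one segment, the list of people who match each parent. The function name and the labels it returns are my own invention, not any vendor's terminology.

```python
def classify_match(match_id, mother_matches, father_matches):
    """Classify one of your matches on a segment using your parents' matches.

    mother_matches / father_matches: ids of people who match that parent
    on the same segment (assumed to be known already).
    """
    on_mother = match_id in mother_matches
    on_father = match_id in father_matches
    if on_mother and on_father:
        return "matches both parents"
    if on_mother:
        return "maternal"
    if on_father:
        return "paternal"
    return "IBC"  # matches you but neither parent: identical by chance


# Hypothetical data for a single segment.
mother_matches = {"Ann", "Bob"}
father_matches = {"Cal"}

for person in ("Ann", "Cal", "John"):
    print(person, "->", classify_match(person, mother_matches, father_matches))
```

A match that passes this test is not IBC, but it may still be IBD or IBP. Parental phasing only rules out chance.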

If you don’t have parents to match against, your matches on any segment should cleanly cluster into two matching groups where you match them and your matches also match each other on that same segment. One group for your mother’s side and one group for your father’s side.  Those who match you but don’t fall into one group or the other are identical by chance, like John in our example.  Of course, you won’t be able to sort these out until you have several matches on that segment.  This is also why testing all available upstream family members is so useful.
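Here is one way that sorting could be sketched, assuming you have a list of people who match you on a segment and a list of which of those people also match each other on that same segment. It groups people who are connected through mutual matches, treats the two largest groups as your two parental sides (without knowing which is which), and sets everyone else aside as possibly identical by chance. The function name and the sample names other than John are hypothetical, and this is a simplification of real triangulation work.

```python
def cluster_segment_matches(matches, pairs):
    """Split your matches on one segment into up to two groups plus leftovers.

    matches: ids of people who match you on the segment.
    pairs:   (a, b) tuples of matches who also match EACH OTHER on that segment.
    """
    neighbors = {m: set() for m in matches}
    for a, b in pairs:
        if a in neighbors and b in neighbors:
            neighbors[a].add(b)
            neighbors[b].add(a)

    groups, seen = [], set()
    for m in matches:
        if m in seen:
            continue
        # Collect everyone reachable from m through mutual matches.
        group, stack = set(), [m]
        while stack:
            current = stack.pop()
            if current in group:
                continue
            group.add(current)
            stack.extend(neighbors[current] - group)
        seen |= group
        groups.append(group)

    groups.sort(key=len, reverse=True)
    # A "group" of one person has nobody confirming it, so it is not a side.
    sides = [g for g in groups if len(g) >= 2][:2]
    leftover = set(matches) - set().union(*sides)
    return sides, leftover


matches = ["Ann", "Bob", "Cal", "Dee", "Eve", "John"]
pairs = [("Ann", "Bob"), ("Bob", "Cal"), ("Dee", "Eve")]

sides, leftover = cluster_segment_matches(matches, pairs)
print(sides)     # two groups, one per parent (which parent is unknown)
print(leftover)  # people who fit neither group
```

Someone left over is only a candidate for IBC. With few matches on a segment, a genuine relative can end up alone simply because nobody else from that line has tested yet.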

If you’re not IBC, you’re IBD meaning that you and your match received that DNA segment from a common ancestor, whether or not you can identify that ancestor.

Identical by population (IBP) is a type or subset of identical by descent (IBD) where many people from that same population group carry the same DNA segment. This is seen in its most pronounced fashion in heavily endogamous populations such as Ashkenazi Jews.

If you are from a highly endogamous population, you will have many IBP matches, generally on smaller segments that have been chopped up over time, and you will want to use a higher matching threshold, perhaps 10cM or higher, for genealogical matching.

If you have endogamous lines in your tree, but are not entirely endogamous, IBP segments may actually be beneficial because you may be able to attribute matches to a specific line, even if not the specific ancestor in that line.

The smaller the segment, the more likely it is to be less useful to you, whether IBD or IBP – but that isn’t to say all small segments should be disregarded because they are assumed to be either IBC or not useful. That’s not the case.  Some are IBD and all IBD segments have the potential to be very useful.  Kitty Cooper just recently reported another wonderful success story using a 6cM triangulated segment.

If you’re highly endogamous, or looking only for the low hanging fruit, which is more likely to be immediately rewarding, then work with only larger segment matches. They are less likely to be IBC or IBP and more likely to yield results more quickly.  I always begin with the largest matching segments, because not only are they easier to assign to an ancestor, but those matching people may also have smaller matching segments that I can tentatively (pending triangulation) attribute to that specific ancestor as well.
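If you download your match list to a spreadsheet or file, this "largest first" approach is easy to apply. The sketch below assumes each match is a (name, largest segment in cM) pair, keeps only those at or above a threshold, and sorts them with the biggest segment first. The 10cM default is the figure suggested above for highly endogamous testers. The function name and the sample data are made up for illustration.

```python
def prioritize_matches(matches, min_cm=10.0):
    """Keep matches whose largest segment is at least min_cm, largest first.

    matches: list of (name, largest_segment_cm) tuples.
    """
    kept = [m for m in matches if m[1] >= min_cm]
    return sorted(kept, key=lambda m: m[1], reverse=True)


# Hypothetical match list.
my_matches = [("Ann", 6.2), ("Bob", 34.5), ("Cal", 10.0), ("Dee", 17.8)]

for name, cm in prioritize_matches(my_matches):
    print(f"{name}: {cm} cM")
```

The smaller matches are set aside here, not thrown away. Lowering `min_cm` later brings them back when you are ready to pan for the smaller flakes.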

Here’s a handy-dandy cheat sheet if you’re having trouble remembering “Identical by What.”

Identical by Chart

Understand that working with genetic genealogy and autosomal DNA is much like panning for gold. You may get lucky and find a large nugget or two smiling at you from on top of the pile, but the majority of your rewards will come from the hard work of sifting and panning and accumulating those small golden flakes that aren’t immediately obvious and useful.  Cumulatively, they may well hold your family secrets and the keys to locks long ago frozen shut.

Here’s hoping all your matches are IBD!!!!!
