Connect Your DNA Test, and Others, to Your Tree

To optimize your DNA tests, each tester needs to take advantage of the features offered by each vendor.

In order to do that, we need to perform the following tasks.

  1. Upload or create a tree (except at 23andMe who does not support trees)
  2. Connect our own test to our own profile card on our tree
  3. Connect other tests we manage to their (or our) tree, depending on the vendor
  4. Connect matches who are known relatives to their profiles on our tree

Each vendor handles these situations differently, so we’ll look at each one of the vendors with step-by-step instructions for handling those situations. We all want to get the most out of the tests we’ve taken!

Plant a Tree

If you have not created or uploaded a tree at each one of the vendors (except 23andMe who does not support genealogy trees), please do so. However, 23andMe does provide for links to your tree elsewhere, so we will review that function.

I manage my “master tree” on my own computer, but I also maintain trees at both Ancestry and MyHeritage where I attach documents and research found at that vendor. I also update my ancestors at WikiTree to be sure other researchers benefit from new discoveries.

I have not uploaded my full tree from my computer anyplace because I have many private notes that are not appropriate for disclosure, not to mention speculative and unproven relationships. I created a pared-down tree at one time to upload to both Ancestry and MyHeritage, and build those trees out from there.

I’m often asked about replacing your tree at the various vendors with an updated tree. If you do that, be aware that you will lose your DNA connections and document links. I do NOT recommend that. I simply maintain multiple trees. I wrote about this in the article, “Genealogy Tree Replacement – Should I or Shouldn’t I?” If you are considering that option, PLEASE read that article first.

RootsMagic, Family Tree Maker, and Legacy Family Tree Software all provide a syncing option with various vendors and FamilySearch, although not every vendor allows access to each of those software companies. I probably should experiment with the syncing option, but given a family member’s terrible experience some years back, I’ve been unwilling to do that. My biggest fear is that I will corrupt the file and not notice it until it’s far too late to revert to a backup.

When you upload or create a tree, make sure deceased and living people are marked as such, and you’ve opted to share your tree. If you don’t, you accidentally have a private tree. Worse yet, you might not realize it. I wrote about that in Quick Tip: Trees, Death Dates and Unintentionally Private Ancestors.

Now, let’s take a look at each vendor.

23andMe

23andMe does not support traditional genealogy trees, but they do provide a location for you to link your tree at another vendor or source.

Under your name at the right side, you’ll see “View Your Profile” under the dropdown.

I’ve not been able to find a generic Ancestry tree link that will allow non-Ancestry subscribers to view my tree, but it’s easy to do at MyHeritage. Simply open your tree at  MyHeritage and just copy the link at the top. Don’t worry, people won’t see anyone living.

If you want to use “one world” types of trees, you can also link to other trees such as FamilySearch or WikiTree, but just remember that you don’t control that content.

You don’t need to connect yourself to your tree at 23andMe, because there is no genealogy tree. However, 23andMe constructs a “genetic tree” for you using your closest matches, based on how you match other people, and how they match each other.

You can view your tree under “Family and Friends,” then “Family Tree.”

I added my ancestors’ names so it’s easy to keep straight. You can do that by simply clicking on the colored circle representing the ancestor, starting with your parents.

If you know that one of your matching relatives is not in exactly the correct tree location, you can click on their circle, and then click on Edit to make modifications.

You may want to add a relative that you can identify but who isn’t connected on the tree that 23andMe constructed.

Looking on the far-right side of the tree, in the lower corner, you’ll see “Add a Relative.” Click there and follow the instructions.

Ancestry

At Ancestry, you need to link your test to “you” in a tree. Your test can only be linked to one person in one tree at a time. You can change this, but you will lose any ThruLines you currently have. They will be regenerated based on the new tree you connect your test to, but based on the tree and other factors, they may not be the same. My recommendation is if you’re going to disconnect yourself and reconnect yourself elsewhere, record everything first.

Alternatively, you can take a second DNA test and simply link that second test to another tree. IMHO, that’s a better alternative. You can leave one in place as your research tree and use the second test to experiment with.

To link your test to your tree, select the “DNA” tab. At far right, you’ll see “Settings.”

You need to tell Ancestry who you are in your tree. Click on “Settings,” then scroll to “Tree Link.”

You can also link other tests you directly manage to their placards in your tree as well.

These links allow Ancestry to form ThruLines using both DNA matches and common ancestors in trees for 7 generations.

On your DNA Match page, Ancestry will ask you if you recognize a match.

If you click on “Yes,” you’ll be asked which side the match is on.

Then you’ll be given a long list of possible relationships in most-likely to least-likely order. Literally, Erik is the last option offered.

Select and confirm.

I’m not positive exactly HOW this helps Ancestry help you, but I suspect it confirms and helps Ancestry perfect ThruLines, relationship predictions, and perhaps even “sides” of ethnicity.

I wrote about Optimizing Your Tree at Ancestry for More Hints and DNA ThruLines.

FamilyTreeDNA

At FamilyTreeDNA, every DNA test kit has its own kit number and associated tree, so you don’t need to tell FamilyTreeDNA who you are if you create a tree from scratch on their site.

FamilyTreeDNA offers a unique family matching feature that sorts your matches into maternal and paternal sides.

In order to take advantage of this, you will need a tree. You can upload a GEDCOM file, although the upload at FamilyTreeDNA does not seem to do well with very large files.

If you don’t have a GEDCOM file on your computer, you can download a tree from either Ancestry or MyHeritage and upload to FamilyTreeDNA.

I wrote about this in the article Download Your Ancestry Tree and Upload it Elsewhere for Added Benefit.

If you upload a tree, you’ll be asked to select the person in the tree that is “you,” meaning the person who tested their DNA.

You’ll want to link known matches to your tree to enable Family Matching, aka bucketing, so that FamilyTreeDNA can divide and assign your matches maternally and paternally.

If you are building your tree at FamilyTreeDNA from scratch, simply click to begin and complete the information on the placards to add your information, then your parents, building out from there. You’ll want to add the ancestral lines to connect with your closest matches on your match list.

Family matching, or bucketing, is enabled by linking known matches to their proper place on your tree. FamilyTreeDNA then evaluates each match, determining if they match a common segment with you and someone you’ve linked. If that match does share a segment with both of you, meaning they triangulate, then that person is assigned either maternally, paternally, or both. I wrote about Triangulation in Action at FamilyTreeDNA, here.

The best people to link are your parents and grandparents, of course, but that’s not always an option. You’ll want to link as many matches as you can.

To link people, either click on the Family Tree tab at the top of the page, or on the “Link on Family Tree” under Relationship Range for individual matches.

Simply click on “Link Matches,” then drag and drop your match to their placard.

Here’s an example of linking parents.

Once someone is linked, the green dot will appear signifying that they are linked, and which type of test. Green is a Family Finder autosomal test, blue means they’ve taken a Y DNA test, and pink is a mitochondrial DNA test.

If your parents aren’t available to test, link every upstream relative that you can identify. By this, I mean that your children and full siblings will match you on both sides, so aren’t helpful for parental-side assignment.

People who have DNA tests from both parents can expect around 80% of their matches to be assigned maternally, paternally, or both.

If you have relatives who have tested at other vendors, you can ask them to upload to FamilyTreeDNA for free matching.

MyHeritage

At MyHeritage, you will connect yourself and any relatives whose tests you manage to your tree.

Under “DNA,” select “Manage DNA kits.”

At the right, you’ll click on the three dots, also known as a hamburger menu (who knew.)

Select Assign (if this is a new test or a transfer) or Re-assign a kit.

Be sure to do this for every kit you manage. I made that mistake and wrote about how I discovered and fixed the problem, here. Kit assignment enables Theories of Family Relativity and other super-helpful features.

I wrote about several things you can do to optimize your chances of receiving Theories of Family Relativity, here.

You can upload DNA kits to MyHeritage from tests taken at other vendors, here.

Fish in All the Ponds

I have provided step-by-step download/upload instructors for all vendors, here. It’s important to fish in all available ponds by making sure you have DNA tests at all four vendors. Then, upload or create trees and complete this bit of housekeeping to increase your chances of catching fish!

_____________________________________________________________

Follow DNAexplain on Facebook, here or follow me on Twitter, here.

Share the Love!

You’re always welcome to forward articles or links to friends and share on social media.

If you haven’t already subscribed (it’s free,) you can receive an email whenever I publish by clicking the “follow” button on the main blog page, here.

You Can Help Keep This Blog Free

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Uploads

Genealogy Products and Services

My Book

Genealogy Books

Genealogy Research

DNA: In Search of…Full and Half-Siblings

This is the fifth article in our series of articles about searching for unknown close family members, specifically; parents, grandparents, or siblings. However, these same techniques can be applied by genealogists to identify ancestors further back in time as well.

Please note that if a family member has tested and you do NOT see their results, ask them to verify that they have chosen to allow matching and for other people to view them in their match list. That process varies at different vendors.

You can also ask if they can see you in their results.

All Parties Need to Test

Searching for unknown siblings isn’t exactly searching, because to find them, they, themselves, or their descendant(s) must have taken a DNA test at the same vendor where you tested or uploaded a DNA file.

You may know through any variety of methods that they exist, or might exist, but if they don’t take a DNA test, you can’t find them using DNA. This might sound obvious, but I see people commenting and not realizing that the other sibling(s) must test too – and they may not have.

My first questions when someone comments in this vein are:

  1. Whether or not they are positive their sibling actually tested, meaning actually sent the test in to the vendor, and it was received by the testing company. You’d be surprised how many tests are living in permanent residence on someone’s countertop until it gets pushed into the drawer and forgotten about.
  2. If the person has confirmed that their sibling has results posted. They may have returned their test, but the results aren’t ready yet or there was a problem.
  3. AND that both people have authorized matching and sharing of results. Don’t hesitate to reach out to your vendor’s customer care if you need help with this.

Sibling Scenarios

The most common sibling scenarios are when one of two things happens:

  • A known sibling tests, only to discover that they don’t match you in the full sibling range, or not at all, when you expected they would
  • You discover a surprise match in the full or half-sibling range

Let’s talk about these scenarios and how to determine:

  • If someone is a sibling
  • If they are a full or half-sibling
  • If a half-sibling, if they descend from your mother or father

As with everything else genetic, we’ll be gathering and analyzing different pieces of evidence along the way.

Full and Half-Siblings

Just to make sure we are all on the same page:

  • A full sibling is someone who shares both parents with you.
  • A half-sibling is someone who shares one parent with you, but not the other parent.
  • A step-sibling is someone who shares no biological parents with you. This situation occurs when your parent marries their parent, after you are both born, and their parent becomes your step-parent. You share neither of your biological parents with a step-sibling, so you share no DNA and will not show up on each other’s match lists.
  • A three-quarters sibling is someone with whom you share one parent, but two siblings are the other parent. For example, you share the same mother, but one brother fathered you, and your father’s brother fathered your sibling. Yes, this can get very messy and is almost impossible for a non-professional to sort through, if even then. (This is not a solicitation. I do not take private clients.) We will not be addressing this situation specifically.

Caution

With any search for unknown relatives, you have no way of knowing what you will find.

In one’s mind, there are happy reunions, but you may experience something entirely different. Humans are human. Their stories are not always happy or rosy. They may have made mistakes they regret. Or they may have no regrets about anything.

Your sibling may not know about you or the situation under which you, or they, were born. Some women were victims of assault and violence, which is both humiliating and embarrassing. I wrote about difficult situations, here.

Your sibling or close family member may not be receptive to either you, your message, or even your existence. Just be prepared, because the seeking journey may not be pain-free for you or others, and may not culminate with or include happy reunions.

On the other hand, it may.

Please step back and ponder a bit about the journey you are about to undertake and the possible people that may be affected, and how. This box, once opened, cannot be closed again. Be sure you are prepared.

On the other hand, sometimes that box lid pops off, and the information simply falls in your lap one day when you open your match list, and you find yourself sitting there, in shock, staring at a match, trying to figure out what it all means.

Congratulations, You Have a Sibling!

This might not be exactly what runs through your mind when you see that you have a very close match that you weren’t expecting.

The first two things I recommend when making this sort of discovery, after a few deep breaths, a walk, and a cup of tea, are:

  • Viewing what the vendor says
  • Using the DNAPainter Shared cM Relationship Chart

Let’s start with DNAPainter.

DNAPainter

DNAPainter provides a relationship chart, here, based on the values from the Shared cM Project.

You can either enter a cM amount or a percentage of shared DNA. I prefer the cM amount, but it doesn’t really matter.

I’ll enter 2241 cM from a known half-sibling match. To enter a percent, click on the green “enter %.”

As you can see, statistically speaking, this person is slightly more likely to be a half-sibling than they are to be a full sibling. In reality, they could be either.

Looking at the chart below, DNAPainter highlights the possible relationships from the perspective of “Self.”

The average of all the self-reported relationships is shown, on top, so 2613 for a full sibling. The range is shown below, so 1613-3488 for a full sibling.

In this case, there are several possibilities for two people who share 2241 cM of DNA.

I happen to know that these two people are half-siblings, but if I didn’t, it would be impossible to tell from this information alone.

The cM range for full siblings is 1613-3488, and the cM range for half-siblings is 1160-2436.

  • The lower part of the matching range, from 1160-1613 cM is only found in half-siblings.
  • The portion of the range from 1613-2436 cM can be either half or full siblings.
  • The upper part of the range, from 2436-3488 cM is only found in full siblings.

If your results fall into the center portion of the range, you’re going to need to utilize other tools. Fortunately, we have several.

If you’ve discovered something unexpected, you’ll want to verify using these tools, regardless. Use every tool available. Ranges are not foolproof, and the upper and lower 10% of the responses were removed as outliers. You can read more about the shared cM Project, here and here.

Furthermore, people may be reporting some half-sibling relationships as full sibling relationships, because they don’t expect to be half-siblings, so the ranges may be somewhat “off.”

Relationship Probability Calculator

Third-party matching database, GEDmatch, provides a Relationship Probability Calculator tool that is based on statistical probability methods without compiled user input. Both tools are free, and while I haven’t compared every value, both seem to be reasonably accurate, although they do vary somewhat, especially at the outer ends of the ranges.

When dealing with sibling matches, if you are in all four databases, GEDmatch is a secondary resource, but I will include GEDmatch when they have a unique tool as well as in the summary table. Some of your matches may be willing to upload to GEDmatch if the vendor where you match doesn’t provide everything you need and GEDmatch has a supplemental offering.

Next, let’s look at what the vendors say about sibling matches.

Vendors

Each of the major vendors reports sibling relationships in a slightly different way.

Sibling Matches at Ancestry

Ancestry reports sibling relationships as Sister or Brother, but they don’t say half or full.

If you click on the cM portion of the link, you’ll see additional detail, below

Ancestry tells you that the possible relationships are 100% “Sibling.” The only way to discern the difference between full and half is by what’s next.

If the ONLY relationship shown is Sibling at 100%, that can be interpreted to mean this person is a full sibling, and that a half-sibling or other relationship is NOT a possibility.

Ancestry never stipulates full or half.

The following relationship is a half-sibling at Ancestry.

Ancestry identifies that possible range of relationships as “Close Family to First Cousin” because of the overlaps we saw in the DNAPainter chart.

Clicking through shows that there is a range of possible relationships, and Ancestry is 100% sure the relationship is one of those.

DNAPainter agrees with Ancestry except includes the full-sibling relationship as a possibility for 1826 cM.

Sibling Matches at 23andMe

23andMe does identify full versus half-siblings.

DNAPainter disagrees with 23andMe and claims that anyone who shares 46.2% of their DNA is a parent/child.

However, look at the fine print. 23andMe counts differently than any of the other vendors, and DNAPainter relies on the Shared cM Project, which relies on testers entering known relationship matching information. Therefore, at any other vendor, DNAPainter is probably exactly right.

Before we understand how 23andMe counts, we need to understand about half versus fully identical segments.

To determine half or full siblings, 23andMe compares two things:

  1. The amount of shared matching DNA between two people
  2. Fully Identical Regions (FIR) of DNA compared to Half Identical Regions (HIR) of DNA to determine if any of your DNA is fully identical, meaning some pieces of you and your sibling’s DNA is exactly the same on both your maternal and paternal chromosomes.

Here’s an example on any chromosome – I’ve randomly selected chromosome 12. Which chromosome doesn’t matter, except for the X, which is different.

Your match isn’t broken out by maternal and paternal sides. You would simply see, on the chromosome browser, that you and your sibling match at these locations, above.

In reality, though, you have two copies of each chromosome, one from Mom and one from Dad, and so does your sibling.

In this example, Mom’s chromosome is visualized on top, and Dad’s is on the bottom, below, but as a tester, you don’t know that. All you know is that you match your sibling on all of those blue areas, above.

However, what’s actually happening in this example is that you are matching your sibling on parts of your mother’s chromosome and parts of your father’s chromosome, shown above as green areas

23andMe looks at both copies of your chromosome, the one you inherited from Mom, on top, and Dad, on the bottom, to see if you match your sibling on BOTH your mother’s and your father’s chromosomes in that location.

I’ve boxed the green matching areas in purple where you match your sibling fully, on both parents’ chromosomes.

If you and your sibling share both parents, you will share significant amounts of the same DNA on both copies of the same chromosomes, meaning maternal and paternal. In other words, full siblings share some purple fully identical regions (FIR) of DNA with each other, while half-siblings do not (unless they are also otherwise related) because half-siblings only share one parent with each other. Their DNA can’t be fully identical because they have a different parent that contributed the other copy of their chromosome.

Total Shared DNA Fully Identical DNA from Both Parents
Full Siblings ~50% ~25%
Half Siblings ~25% 0
  • Full siblings are expected to share about 50% of the same DNA. In other words, their DNA will match at that location. That’s all the green boxed locations, above.
  • Full siblings are expected to share about 25% of the same DNA from BOTH parents at the same location on BOTH copies of their chromosomes. These are fully identical regions and are boxed in purple, above.

You’ll find fully identical segments about 25% of the time in full siblings, but you won’t find fully identical segments in half-siblings. Please note that there are exceptions for ¾ siblings and endogamous populations.

You can view each match at 23andMe to see if you have any completely identical regions, shown in dark purple in the top comparison of full siblings. Half siblings are shown in the second example, with less total matching DNA and no FIR or completely identical regions.

Please note that your matching amount of DNA will probably be higher at 23andMe than at other companies because:

  • 23andMe includes the X chromosome in the match totals
  • 23andMe counts fully identical matching regions twice. For full siblings, that’s an additional 25%

Therefore, a full sibling with an X match will have a higher total cM at 23andMe than the same siblings elsewhere because not only is the X added into the total, the FIR match region is added a second time too.

Fully Identical Regions (FIR) and Half Identical Regions (HIR) at GEDmatch

At GEDMatch, you can compare two people to each other, with an option to display the matching information and a painted graphic for each chromosome that includes FIR and HIR.

If you need to know if you and a match share fully identical regions and you haven’t tested at 23andMe, you can both upload your DNA data file to GEDmatch and use their One to One Autosomal DNA Comparison.

On the following page, simply enter both kit numbers and accept the defaults, making sure you have selected one of the graphics options.

While GEDmatch doesn’t specifically tell you whether someone is a full or half sibling, you can garner additional information about the relationship based on the graphic at GEDmatch.

GEDMatch shows both half and fully identical regions.

The above match is between two full siblings using a 7 cM threshold. The blue on the bottom bar indicates a match of 7 cM or larger. Black means no match.

The green regions in the top bar indicate places where these two people carry the same DNA on both copies of their chromosome 1. This means that both people inherited the same DNA from BOTH parents on the green segments.

In the yellow regions, the siblings inherited the same DNA from ONE parent, but different DNA in that region from the other parent. They do match each other, just on one of their chromosomes, not both.

Without a tool like this to differentiate between HIR and FIR, you can’t tell if you’re matching someone on one copy of your chromosome, or on both copies.

In the areas marked with red on top, which corresponds to the black on the bottom band, these two siblings don’t match each other because they inherited different DNA from both parents in that region. The yellow in that region is too scattered to be significant.

Full siblings generally share a significant amount of FIR, or fully identical regions of DNA – about 25%.

Half siblings will share NO significant amount of FIR, although some will be FIR on very small, scattered green segments simply by chance, as you can see in the example, below.

This half-sibling match shares no segments large enough to be a match (7 cM) in the black section. In the blue matching section, only a few small green fragments of DNA match fully, which, based on the rest of that matching segment, must be identical by chance or misreads. There are no significant contiguous segments of fully identical DNA.

When dealing with full or half-siblings, you’re not interested in small, scattered segments of fully identical regions, like those green snippets on chromosome 6, but in large contiguous sections of matching DNA like the chromosome 1 example.

GEDmatch can help when you match when a vendor does not provide FIR/HIR information, and you need additional assistance.

Next, let’s look at full and half-siblings at FamilyTreeDNA

Sibling Matches at FamilyTreeDNA

FamilyTreeDNA does identify full siblings.

Relationships other than full siblings are indicated by a range. The two individuals below are both half-sibling matches to the tester.

The full range when mousing over the relationship ranges is shown below.

DNAPainter agrees except also gives full siblings as an option for the two half-siblings.

FamilyTreeDNA also tells you if you have an X match and the size of your X match.

We will talk about X matching in a minute, which, when dealing with sibling identification, can turn out to be very important.

Sibling Matches at MyHeritage

MyHeritage indicates brother or sister for full siblings

MyHeritage provides other “Estimated relationships” for matches too small to be full siblings.

DNAPainter’s chart agrees with this classification, except adds additional relationship possibilities.

Be sure to review all of the information provided by each vendor for close relationships.

View Close Known Relationships

The next easiest step to take is to compare your full or half-sibling match to known close family members from your maternal and paternal sides, respectively. The closer the family members, the better.

It’s often not possible to determine if someone is a half sibling or a full sibling by centiMorgans (cMs) alone, especially if you’re searching for unknown family members.

Let’s start with the simplest situation first.

Let’s say both of your parents have tested, and of course, you match both of them as parents.

Your new “very close match” is in the sibling range.

The first thing to do at each vendor is to utilize that vendor’s shared matches tool and see whether your new match matches one parent, or both.

Here’s an example.

Close Relationships at FamilyTreeDNA

This person has a full sibling match, but let’s say they don’t know who this is and wants to see if their new sibling matches one or both of their parents.

Select the match by checking the box to the left of the match name, then click on the little two-person icon at far right, which shows “In Common” matches

You can see on the resulting shared match list that both of the tester’s parents are shown on the shared match list.

Now let’s make this a little more difficult.

No Parents, No Problem

Let’s say neither of your parents has tested.

If you know who your family is and can identify your matches, you can see if the sibling you match matches other close relatives on both or either side of your family.

You’ll want to view shared matches with your closest known match on both sides of your tree, beginning with the closest first. Aunts, uncles, first cousins, etc.

You will match all of your family members through second cousins, and 90% of your third cousins. You can view additional relationship percentages in the article, How Much of Them is in You?.

I recommend, for this matching purpose, to utilize 2nd cousins and closer. That way you know for sure if you don’t share them as a match with your sibling, it’s because the sibling is not related on that side of the family, not because they simply don’t share any DNA due to their distance.

In this example, you have three sibling matches. Based on your and their matches to the same known first and second cousins, you can see that:

  • Sibling 1 is your full sibling, because you both match the same maternal and paternal first and second cousins
  • Sibling 2 is your paternal half-sibling because you both match paternal second cousins and closer, but not maternal cousins.
  • Sibling 3 is your maternal half-sibling because you both match maternal second cousins and closer, but not paternal cousins.

Close Relationships at Ancestry

Neither of my parents have tested, but my first cousin on my mother’s side has. Let’s say I have a suspected sibling or half-sibling match, so I click on the match’s name, then on Shared Matches.

Sure enough, my new match also matches my first cousin that I’ve labeled as “on my mother’s side.”

If my new match in the sibling range also matches my second cousins or closer on my father’s side, the new match is a full sibling, not a half-sibling.

Close Relationships at MyHeritage

Comparing my closest match provided a real surprise. I wonder if I’ve found a half-sibling to my mother.

Now, THIS is interesting.

Hmmm. More research is needed, beginning with the age of my match. MyHeritage provides ages if the MyHeritage member authorizes that information to be shared.

Close Relationships at 23andMe

Under DNA Relatives, click on your suspected sibling match, then scroll down and select “Find Relatives in Common.”

The Relatives in Common list shows people that match both of you.

The first common match is very close and a similar relationship to my closest match on my father’s side. This would be expected of a sibling. I have no common matches with this match to anyone on my mother’s side, so they are only related on my father’s side. Therefore they are a paternal half-sibling, not a full sibling.

More Tools Are Available

Hopefully, by now, you’ve been able to determine if your mystery match is a sibling, and if so, if they are a half or full sibling, and through which parent.

We have some additional tools that are relevant and can be very informative in some circumstances. I suggest utilizing these tools, even if you think you know the answer.

In this type of situation, there’s no such thing as too much information.

X Matching

X matching, or lack thereof, may help you determine how you are related to someone.

There are two types of autosomal DNA. The X chromosome versus chromosomes 1-22. The X chromosome (number 23) has a unique inheritance path that distinguishes it from your other chromosomes.

The X chromosome inheritance path also differs between men and women.

Here’s my pedigree chart in fan form, highlighting the ancestors who may have contributed a portion of their X chromosome to me. In the closest generation, this shows that I inherited an X chromosome from both of my parents, and who in each of their lines could have contributed an X to them.

The white or uncolored positions, meaning ancestors, cannot contribute any portion of an X chromosome to me based on how the X chromosome is inherited.

You’ll notice that my father inherited none of his X chromosome from any of his paternal ancestors, so of course, I can’t inherit what he didn’t inherit. There are a very limited number of ancestors on my father’s side whom I can inherit any portion of an X chromosome from.

Men receive their Y chromosome from their fathers, so men ONLY receive an X chromosome from their mother.

Therefore, men MUST pass their mother’s X chromosome on to their female offspring because they don’t have any other copy of the X chromosome to pass on.

Men pass no X chromosome to sons.

We don’t need to worry about a full fan chart when dealing with siblings and half-siblings.

We only need to be concerned with the testers plus one generation (parents) when utilizing the X chromosome in sibling situations.

These two female Disney Princesses, above, are full siblings, and both inherited an X chromosome from BOTH their mother and father. However, their father only has one X (red) chromosome to give them, so the two females MUST match on the entire red X chromosome from their father.

Their mother has two X chromosomes, green and black, to contribute – one from each of her parents.

The full siblings, Melody, and Cinderella:

  • May have inherited some portion of the same green and black X chromosomes from their mother, so they are partial matches on their mother’s X chromosome.
  • May have inherited the exact same full X chromosome from their mother (both inherited the entire green or both inherited the entire black), so they match fully on their mother’s X chromosome.
  • May have inherited the opposite X from different maternal grandparents. One inherited the entire green X and one inherited the entire black X, so they don’t match on their mother’s X chromosome.

Now, let’s look at Cinderella, who matches Henry.

This female and male full sibling match can’t share an X chromosome on the father’s side, because the male’s father doesn’t contribute an X chromosome to him. The son, Henry, inherited a Y chromosome instead from his father, which is what made them males.

Therefore, if a male and female match on the X chromosome, it MUST be through HIS mother, but could be through either of her parents. In a sibling situation, an X match between a male and female always indicates the mother.

In the example above, the two people share both of their mother’s X chromosomes, so are definitely (at least) maternally related. They could be full siblings, but we can’t determine that by the X chromosome in this situation, with males.

However, if the male matches the female on HER father’s X chromosome, there a different message, example below.

You can see that the male is related to the female on her father’s side, where she inherited the entire magenta X chromosome. The male inherited a portion of the magenta X chromosome from his mother, so these two people do have an X match. However, he matches on his mother’s side, and she matches on her father’s side, so that’s clearly not the same parent.

  • These people CAN NOT be full siblings because they don’t match on HER mother’s side too, which would also be his mother’s side if they were full siblings.
  • They cannot be maternal half-siblings because their X DNA only matches on her father’s side, but they wouldn’t know that unless she knew which side was which based on share matches.
  • They cannot be paternal half-siblings because he does not have an X chromosome from his father.

They could, however, be uncle/aunt-niece/nephew or first cousins on his mother’s side and her father’s side. (Yes, you’re definitely going to have to read this again if you ever need male-female X matching.)

Now, let’s look at X chromosome matching between two males. It’s a lot less complicated and much more succinct.

Neither male has inherited an X chromosome from their father, so if two males DO match on the X, it MUST be through their mother. In terms of siblings, this would mean they share the same mother.

However, there is one slight twist. In the above example, you can see that the men inherited a different proportion of the green and black X chromosomes from their common mother. However, it is possible that the mother could contribute her entire green X chromosome to one son, Justin in this example, and her entire black X chromosome to Henry.

Therefore, even though Henry and Justin DO share a mother, their X chromosome would NOT match in this scenario. This is rare but does occasionally happen.

Based on the above examples, the X chromosome may be relevant in the identification of full or half siblings based on the sexes of the two people who otherwise match at a level indicating a full or half-sibling relationship.

Here’s a summary chart for sibling X matching.

X Match Female Male
Female Will match on shared father’s full X chromosome, mother’s X is the same rules as chromosomes 1-22 Match through male’s mother, but either of female’s parents. If the X match is not through the female’s mother, they are not full siblings nor maternal half-siblings. They cannot have an X match through the male’s father. They are either full or half-siblings through their mother if they match on both of their mother’s side. If they match on his mother’s side, and her father’s side, they are not siblings but could be otherwise closely related.
Male Match through male’s mother, but either of female’s parents. If the X match is not through the female’s mother, they are not full siblings nor maternal half-siblings. They cannot have an X match through the male’s father. They are either full or half-siblings through their mother if they match on both or their mother’s side. If they match on his mother’s side, and her father’s side, they are not siblings but could be otherwise closely related. Both males are related on their mother’s side – either full or half-siblings.

Here’s the information presented in a different way.

DOES match X summary:

  • If a male DOES match a female on the X, he IS related to her through HIS mother’s side, but could match her on her mother or father’s side. If their match is not through her mother, then they are not full siblings nor maternal half-siblings. They cannot match through his father, so they cannot be paternal half-siblings.
  • If a female DOES match a female on the X, they could be related on either side and could be full or half-siblings.
  • If a male DOES match a male on the X, they ARE both related through their mother. They may also be related on their father’s side, but the X does not inform us of that.

Does NOT match X summary:

  • If a male does NOT match a female on the X, they are NOT related through HIS mother and are neither full siblings nor maternal half-siblings. Since a male does not have an X chromosome from his father, they cannot be paternal half-siblings based on an X match.
  • If a male does NOT match a male, they do NOT share a mother.
  • If a female does NOT match another female on the X, they are NOT full siblings and are NOT half-siblings on their paternal side. Their father only has one X chromosome, and he would have given the same X to both daughters.

Of the four autosomal vendors, only 23andMe and FamilyTreeDNA report X chromosome results and matching, although the other two vendors, MyHeritage and Ancestry, include the X in their DNA download file so you can find X matches with those files at either FamilyTreeDNA or GEDMatch if your match has or will upload their file to either of those vendors. I wrote step-by-step detailed download/upload instructions, here.

X Matching at FamilyTreeDNA

In this example from FamilyTreeDNA, the female tester has discovered two half-sibling matches, both through her father. In the first scenario, she matches a female on the full X chromosome (181 cM). She and her half-sibling MUST share their father’s entire X chromosome because he only had one X, from his mother, to contribute to both of his daughters.

In the second match to a male half-sibling, our female tester shares NO X match because her father did not contribute an X chromosome to his son.

If we didn’t know which parents these half-sibling matches were through, we can infer from the X matching alone that the male is probably NOT through the mother.

Then by comparing shared matches with each sibling, Advanced Matches, or viewing the match Matrix, we can determine if the siblings match each other and are from the same or different sides of the family.

Under Additional Tests and Tools, Advanced Matching, FamilyTreeDNA provides an additional tool that can show only X matches combined with relationships.

Of course, you’ll need to view shared matches to see which people match the mother and/or match the father.

To see who matches each other, you’ll need to use the Matrix tool.

At FamilyTreeDNA, the Matrix, located under Autosomal DNA Results and Tools, allows you to select your matches to see if they also match each other. If you have known half-siblings, or close relatives, this is another way to view relationships.

Here’s an example using my father and two paternal half-siblings. We can see that the half-siblings also match each other, so they are (at least) half-siblings on the paternal side too.

If they also matched my mother, we would be full siblings, of course.

Next, let’s use Y DNA and mitochondrial DNA.

Y DNA and Mitochondrial DNA

In addition to autosomal DNA, we can utilize Y DNA and mitochondrial DNA (mtDNA) in some cases to identify siblings or to narrow or eliminate relationship possibilities.

Given that Y DNA and mitochondrial DNA both have distinctive inheritance paths, full and half-siblings will, or will not, match under various circumstances.

Y DNA

Y DNA is passed intact from father to son, meaning it’s not admixed with any of the mother’s DNA. Daughters do not inherit Y DNA from their father, so Y DNA is only useful for male-to-male comparisons.

Two types of Y DNA are used for genealogy, STR markers for matching, and haplogroups, and both are equally powerful in slightly different ways.

Y DNA at FamilyTreeDNA

Men can order either 37 or 111 STR marker tests, or the BIg Y which provides more than 700 markers and more. FamilyTreeDNA is the only one of the vendors to offer Y DNA testing that includes STR markers and matching between men.

Men who order these tests will be compared for matching on either 37, 111 or 700 STR markers in addition to SNP markers used for haplogroup identification and assignment.

Fathers will certainly match their sons, and paternal line brothers will match each other, but they will also match people more distantly related.

However, if two men are NOT either full or half siblings on the paternal side, they won’t match at 111 markers.

If two men DON’T match, especially at high marker levels, they likely aren’t siblings. The word “likely” is in there because, very occasionally, a large deletion occurs that prevents STR matching, especially at lower levels.

Additionally, men who take the 37 or 111 marker test also receive an estimated haplogroup at a high level for free, without any additional testing.

However, if men take the Big Y-700 test, they not only will (or won’t) match on up to 700 STR markers, they will also receive a VERY refined haplogroup via SNP marker testing that is often even more sensitive in terms of matching than STR markers. Between these two types of markers, Y DNA testing can place men very granularly in relation to other men.

Men can match in two ways on Y DNA, and the results are very enlightening.

If two men match on BOTH their most refined haplogroup (Big Y test) AND STR markers, they could certainly be siblings or father/son. They could also be related on the same line for another reason, such as known or unknown cousins or closer relationships like uncle/nephew. Of course, Y DNA, in addition to autosomal matching, is a powerful combination.

Conversely, if two men don’t have a similar or close haplogroup, they are not a father and son or paternal line siblings.

FamilyTreeDNA offers both inexpensive entry-level testing (37 and 111 markers) and highly refined advanced testing of most of the Y chromosome (Big Y-700), so haplogroup assignments can vary widely based on the test you take. This makes haplogroup matching and interpretation a bit more complex.

For example, haplogroups R-M269 and I-BY14000 are not related in thousands of years. One is haplogroup R, and one is haplogroup I – completely different branches of the Y DNA tree. These two men won’t match on STR markers or their haplogroup.

However, because FamilyTreeDNA provides over 50,000 different haplogroups, or tree branches, for Big Y testers, and they provide VERY granular matching, two father/son or sibling males who have BOTH tested at the Big Y-700 level will have either the exact same haplogroup, or at most, one branch difference on the tree if a mutation occurred between father and son.

If both men have NOT tested at the Big Y-700 level, their haplogroups will be on the same branch. For example, a man who has only taken a 37/111 marker STR test may be estimated at R-M269, which is certainly accurate as far as it goes.

His sibling who has taken a Big Y test will be many branches further downstream on the tree – but on the same large haplogroup R-M269 branch. It’s essential to pay attention to which tests a Y DNA match has taken when analyzing the match.

The beauty of the two kinds of tests is that even if one haplogroup is very general due to no Big Y test, their STR markers should still match. It’s just that sometimes this means that one hand is tied behind your back.

Y DNA matching alone can eliminate the possibility of a direct paternal line connection, but it cannot prove siblingship or paternity alone – not without additional information.

The Advanced Matching tool will provide a list of matches in all categories selected – in this case, both the 111 markers and the Family Finder test. You can see that one of these men is the father of the tester, and one is the full sibling.

You can view haplogroup assignments on the public Y DNA tree, here. I wrote about using the public tree, here.

In addition, recently, FamilyTreeDNA launched the new Y DNA Discover tool, which explains more about haplogroups, including their ages and other fun facts like migration paths along with notable and ancient connections. I wrote about using the Discover tool, here.

Y DNA at 23andMe

Testers receive a base haplogroup with their autosomal test. 23andMe tests a limited number of Y DNA SNP locations, but they don’t test many, and they don’t test STR markers, so there is no Y DNA matching and no refined haplogroups.

You can view the haplogroups of your matches. If your male sibling match does NOT share the same haplogroup, the two men are not paternal line siblings. If two men DO share the same haplogroup, they MIGHT be paternal siblings. They also might not.

Again, autosomal close matching plus haplogroup comparisons include or exclude paternal side siblings for males.

Paternal side siblings at 23andMe share the same haplogroup, but so do many other people. These two men could be siblings. The haplogroups don’t exclude that possibility. If the haplogroups were different, that would exclude being either full or paternal half-siblings.

Men can also compare their mitochondrial DNA to eliminate a maternal relationship.

These men are not full siblings or maternal half-siblings. We know, unquestionably, because their mitochondrial haplogroups don’t match.

23andMe also constructs a genetic tree, but often struggles with close relative placement, especially when half-relationships are involved. I do not recommend relying on the genetic tree in this circumstance.

Mitochondrial DNA

Mitochondrial DNA is passed from mothers to all of their children, but only females pass it on. If two people, males or females, don’t match on their mitochondrial DNA test, with a couple of possible exceptions, they are NOT full siblings, and they are NOT maternal half-siblings.

Mitochondrial DNA at 23andMe

23andMe provides limited, base mitochondrial haplogroups, but no matching. If two people don’t have the same haplogroup at 23andMe, they aren’t full or maternal siblings, as illustrated above.

Mitochondrial DNA at FamilyTreeDNA

FamilyTreeDNA provides both mitochondrial matching AND a much more refined haplogroup. The full sequence test (mtFull), the only version sold today, is essential for reliable comparisons.

Full siblings or maternal half-siblings will always share the same haplogroup, regardless of their sex.

Generally, a full sibling or maternal half-sibling match will match exactly at the full mitochondrial sequence (FMS) level with a genetic distance of zero, meaning fully matching and no mismatching mutations.

There are rare instances where maternal siblings or even mothers and children do not match exactly, meaning they have a genetic distance of greater than 0, because of a mutation called a heteroplasmy.

I wrote about heteroplasmies, here.

Like Y DNA, mitochondrial DNA cannot identify a sibling or parental relationship without additional evidence, but it can exclude one, and it can also provide much-needed evidence in conjunction with autosomal matching. The great news is that unlike Y DNA, everyone has mitochondrial DNA and it comes directly from their mother.

Once again, FamilyTreeDNA’s Advanced Matching tool provides a list of people who match you on both your mitochondrial DNA test and the Family Finder autosomal test, including transfers/uploads, and provides a relationship.

You can see that our tester matches both a full sibling and their mother. Of course, a parent/child match could mean that our tester is a female and one of her children, of either sex, has tested.

Below is an example of a parent-child match that has experienced a heteroplasmy.

Based on the comparison of both the mitochondrial DNA test, plus the autosomal Family Finder test, you can verify that this is a close family relationship.

You can also eliminate potential relationships based on the mitochondrial DNA inheritance path. The mitochondrial DNA of full siblings and maternal half-siblings will always match at the full sequence and haplogroup level, and paternal half-siblings will never match. If paternal half-siblings do match, it’s happenstance or because of a different reason.

Sibling Summary and Checklist

I’ve created a quick reference checklist for you to use when attempting to determine whether or not a match is a sibling, and, if so, whether they are half or full siblings. Of course, these tools are in addition to the DNAPainter Shared cM Tool and GEDmatch’s Relationship Predictor Calculator.

FamilyTreeDNA Ancestry 23andMe MyHeritage GEDmatch
Matching Yes Yes Yes Yes Yes
Shared Matches Yes – In Common With Yes – Shared Matches Yes – Relatives in Common Yes – Review DNA Match Yes – People who match both or 1 of 2 kits
Relationship Between Shared Matches No No No Yes, under shared match No
Matches Match Each Other* Yes, Matrix No Yes, under “View DNA details,” then, “compare with more relatives” Partly, through triangulation Yes, can match any kits
Full Siblings Yes Sibling, implies full Yes Brother, Sister, means full No
Half Siblings Sibling, Uncle/Aunt-Niece/Nephew, Grandparent-Grandchild Close Family – 1C Yes Half sibling, aunt/uncle-niece-nephew No
Fully Identical Regions (FIR) No No Yes No Yes
Half Identical Regions (HIR) No No Yes No Yes
X matching Yes No Yes No Yes
Unusual Reporting or Anomalies No No, Timber is not used on close relationships X match added into total, FIR added twice No Matching amount can vary from vendors
Y DNA Yes, STRs, refined haplogroups, matching No High-level haplogroup only, no matching No No, only if tester enters haplogroup manually
Mitochondrial DNA Yes, full sequence, matching, refined haplogroup No High-level haplogroup only, no matching No No, only if tester enters haplogroup manually
Combined Tools (Autosomal, X, Y, mtDNA) Yes No No No No

*Autoclusters through Genetic Affairs show cluster relationships of matches to the tester and to each other, but not all matches are included, including close matches. While this is a great tool, it’s not relevant for determining close and sibling relationships. See the article, AutoClustering by Genetic Affairs, here.

Additional Resources

Some of you may be wondering how endogamy affects sibling numbers.

Endogamy makes almost everything a little more complex. I wrote about endogamy and various ways to determine if you have an endogamous heritage, here.

Please note that half-siblings with high cM matches also fall into the range of full siblings (1613-3488), with or without endogamy. This may be, but is not always, especially pronounced in endogamous groups.

As another resource, I wrote an earlier article, Full or Half Siblings, here, that includes some different examples.

Strategy

You have a lot of quills in your quiver now, and I wish you the best if you’re trying to unravel a siblingship mystery.

You may not know who your biological family is, or maybe your sibling doesn’t know who their family is, but perhaps your close relatives know who their family is and can help. Remember, the situation that has revealed itself may be a shock to everyone involved.

Above all, be kind and take things slow. If your unexpected sibling match becomes frightened or overwhelmed, they may simply check out and either delete their DNA results altogether or block you. They may have that reaction before you have a chance to do anything.

Because of that possibility, I recommend performing your analysis quickly, along with taking relevant screenshots before reaching out so you will at least have that much information to work with, just in case things go belly up.

When you’re ready to make contact, I suggest beginning by sending a friendly, short, message saying that you’ve noticed that you have a close match (don’t say sibling) and asking what they know about their family genealogy – maybe ask who their grandparents are or if they have family living in the area where you live. I recommend including a little bit of information about yourself, such as where you were born and are from.

I also refrain from using the word adoption (or similar) in the beginning or giving too much detailed information, because it sometimes frightens people, especially if they know or discover that there’s a painful or embarrassing family situation.

And, please, never, ever assume the worst of anyone or their motives. They may be sitting at their keyboard with the same shocked look on their face as you – especially if they have, or had, no idea. They may need space and time to reach a place of acceptance. There’s just nothing more emotionally boat-capsizing in your life than discovering intimate and personal details about your parents, one or both, especially if that discovery is disappointing and image-altering.

Or, conversely, your sibling may have been hoping and waiting just for you!

Take a deep breath and let me know how it goes!

Please feel free to share this article with anyone who could benefit.

_____________________________________________________________

Follow DNAexplain on Facebook, here or follow me on Twitter, here.

Share the Love!

You’re always welcome to forward articles or links to friends and share on social media.

If you haven’t already subscribed (it’s free,) you can receive an email whenever I publish by clicking the “follow” button on the main blog page, here.

You Can Help Keep This Blog Free

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Uploads

Genealogy Products and Services

My Book

Genealogy Books

Genealogy Research

DNA: In Search of…Signs of Endogamy

This is the fourth in our series of articles about searching for unknown close family members, specifically; parents, grandparents, or siblings. However, these same techniques can be applied by genealogists to ancestors further back in time as well.

In this article, we discuss endogamy – how to determine if you have it, from what population, and how to follow the road signs.

After introductions, we will be covering the following topics:

  • Pedigree collapse and endogamy
  • Endogamous groups
  • The challenge(s) of endogamy
  • Endogamy and unknown close relatives (parents, grandparents)
  • Ethnicity and Populations
  • Matches
  • AutoClusters
  • Endogamous Relationships
  • Endogamous DNA Segments
  • “Are Your Parents Related?” Tool
  • Surnames
  • Projects
  • Locations
  • Y DNA, Mitochondrial DNA, and Endogamy
  • Endogamy Tools Summary Tables
    • Summary of Endogamy Tools by Vendor
    • Summary of Endogamous Populations Identified by Each Tool
    • Summary of Tools to Assist People Seeking Unknown Parents and Grandparents

What Is Endogamy and Why Does It Matter?

Endogamy occurs when a group or population of people intermarry among themselves for an extended period of time, without the introduction of many or any people from outside of that population.

The effect of this continual intermarriage is that the founders’ DNA simply gets passed around and around, eventually in small segments.

That happens because there is no “other” DNA to draw from within the population. Knowing or determining that you have endogamy helps make sense of DNA matching patterns, and those patterns can lead you to unknown relatives, both close and distant.

This Article

This article serves two purposes.

  • This article is educational and relevant for all researchers. We discuss endogamy using multiple tools and examples from known endogamous people and populations.
  • In order to be able to discern endogamy when we don’t know who our parents or grandparents are, we need to know what signs and signals to look for, and why, which is based on what endogamy looks like in people who know their heritage.

There’s no crystal ball – no definitive “one-way” arrow, but there are a series of indications that suggest endogamy.

Depending on the endogamous population you’re dealing with, those signs aren’t always the same.

If you’re sighing now, I understand – but that’s exactly WHY I wrote this article.

We’re covering a lot of ground, but these road markers are invaluable diagnostic tools.

I’ve previously written about endogamy in the articles:

Let’s start with definitions.

Pedigree Collapse and Endogamy

Pedigree collapse isn’t the same as endogamy. Pedigree collapse is when you have ancestors that repeat in your tree.

In this example, the parents of our DNA tester are first cousins, which means the tester shares great-grandparents on both sides and, of course, the same ancestors from there on back in their tree.

This also means they share more of those ancestors’ DNA than they would normally share.

John Smith and Mary Johnson are both in the tree twice, in the same position as great-grandparents. Normally, Tester Smith would carry approximately 12.5% of each of his great-grandparents’ DNA, assuming for illustration purposes that exactly 50% of each ancestor’s DNA is passed in each generation. In this case, due to pedigree collapse, 25% of Tester Smith’s DNA descends from John Smith, and another 25% descends from Mary Johnson, double what it would normally be. 25% is the amount of DNA contribution normally inherited from grandparents, not great-grandparents.

While we may find first cousin marriages a bit eyebrow-raising today, they were quite common in the past. Both laws and customs varied with the country, time, social norms, and religion.

Pedigree Collapse and Endogamy is NOT the Same

You might think that pedigree collapse and endogamy is one and the same, but there’s a difference. Pedigree collapse can lead to endogamy, but it takes more than one instance of pedigree collapse to morph into endogamy within a population. Population is the key word for endogamy.

The main difference is that pedigree collapse occurs with known ancestors in more recent generations for one person, while endogamy is longer-term and systemic in a group of people.

Picture a group of people, all descended from Tester Smith’s great-grandparents intermarrying. Now you have the beginnings of endogamy. A couple hundred or a few hundred years later, you have true endogamy.

In other words, endogamy is pedigree collapse on a larger scale – think of a village or a church.

My ancestors’ village of Schnait, in Germany, is shown above in 1685. One church and maybe 30 or 40 homes. According to church and other records, the same families had inhabited this village, and region, for generations. It’s a sure bet that both pedigree collapse and endogamy existed in this small community.

If pedigree collapse happens over and over again because there are no other people within the community to marry, then you have endogamy. In other words, with endogamy, you assuredly DO have historical pedigree collapse, generally back in time, often before you can identify those specific ancestors – because everyone descends from the same set of founders.

Endogamy Doesn’t Necessarily Indicate Recent Pedigree Collapse

With deep, historic endogamy, you don’t necessarily have recent pedigree collapse, and in fact, many people do not. Jewish people are a good example of this phenomenon. They shared ancestors for hundreds or thousands of years, depending on which group we are referring to, but in recent, known, generations, many Jewish people aren’t related. Still, their DNA often matches each other.

The good news is that there are telltale signs and signals of endogamy.

The bad news is that not all of these are obvious, meaning as an aid to people seeking clues about unknown close relatives, and other “signs” aren’t what they are believed to be.

Let’s step through each endogamy identifier, or “hint,” and then we will review how we can best utilize this information.

First, let’s take a look at groups that are considered to be endogamous.

Endogamous Groups

Jewish PeopleSpecifically groups that were isolated from other groups of Jewish (and other) people; Ashkenazi (Germany, Northern France, and diaspora), Sephardic (Spanish, Iberia, and diaspora), Mizrahi (Israel, Middle Eastern, and diaspora,) Ethiopian Jews, and possibly Jews from other locations such as Mountain Jews from Kazakhstan and the Caucasus.

AcadiansDescendants of about 60 French families who settled in “Acadia” beginning about 1604, primarily on the island of Nova Scotia, and intermarried among themselves and with the Mi’kmaq people. Expelled by the English in 1755, they were scattered in groups to various diasporic regions where they continued to intermarry and where their descendants are found today. Some Acadians became the Cajuns of Louisiana.

Anabaptist Protestant FaithsAmish, Mennonite, and Brethren (Dunkards) and their offshoots are Protestant religious sects founded in Europe in the 14th, 15th, and 16th centuries on the principle of baptizing only adults or people who are old enough to choose to follow the faith, or rebaptizing people who had been previously baptized as children. These Anabaptist faiths tend to marry within their own group or church and often expel those who marry outside of the faith. Many emigrated to the American colonies and elsewhere, seeking religious freedom. Occasionally those groups would locate in close proximity and intermarry, but not marry outside of other Anabaptist denominations.

Native American (Indigenous) People – all indigenous peoples found in North and South America before European colonization descended from a small number of original founders who probably arrived at multiple times.

Indigenous Pacific Islanders – Including indigenous peoples of Australia, New Zealand, and Hawaii prior to colonization. They are probably equally as endogamous as Native American people, but I don’t have specific examples to share.

Villages – European or other villages with little inflow or whose residents were restricted from leaving over hundreds of years.

Other groups may have significant multiple lines of pedigree collapse and therefore become endogamous over time. Some people from Newfoundland, French Canadians, and Mormons (Church of Jesus Christ of Latter-Day Saints) come to mind.

Endogamy is a process that occurs over time.

Endogamy and Unknown Relatives

If you know who your relatives are, you may already know you’re from an endogamous population, but if you’re searching for close relatives, it’s helpful to be able to determine if you have endogamous heritage, at least in recent generations.

If you know nothing about either parent, some of these tools won’t help you, at least not initially, but others will. However, as you add to your knowledge base, the other tools will become more useful.

If you know the identity of one parent, this process becomes at least somewhat easier.

In future articles, we will search specifically for parents and each of your four grandparents. In this article, I’ll review each of the diagnostic tools and techniques you can use to determine if you have endogamy, and perhaps pinpoint the source.

The Challenge

People with endogamous heritage are related in multiple, unknown ways, over many generations. They may also be related in known ways in recent generations.

If both of your parents share the SAME endogamous culture or group of relatives:

  • You may have significantly more autosomal DNA matches than people without endogamy, unless that group of people is under-sampled. Jewish people have significantly more matches, but Native people have fewer due to under-sampling.
  • You may experience a higher-than-normal cM (centiMorgan) total for estimated relationships, especially more distant relationships, 3C and beyond.
  • You will have many matches related to you on both your maternal and paternal sides.
  • Parts of your autosomal DNA will be the same on both your mother’s and father’s sides, meaning your DNA will be fully identical in some locations. (I’ll explain more in a minute.)

If either (or both) of your parents are from an endogamous population, you:

  • Will, in some cases, carry identifying Y and mitochondrial DNA that points to a specific endogamous group. This is true for Native people, can be true for Jewish people and Pacific Islanders, but is not true for Anabaptist people.

One Size Does NOT Fit All

Please note that there is no “one size fits all.”

Each or any of these tools may provide relevant hints, depending on:

  • Your heritage
  • How many other people have tested from the relevant population group
  • How many close or distant relatives have tested
  • If your parents share the same heritage
  • Your unique DNA inheritance pattern
  • If your parents, individually, were fully endogamous or only partly endogamous, and how far back generationally that endogamy occurred

For example, in my own genealogy, my maternal grandmother’s father was Acadian on his father’s side. While I’m not fully endogamous, I have significantly more matches through that line proportionally than on my other lines.

I have Brethren endogamy on my mother’s side via her paternal grandmother.

Endogamous ancestors are shown with red stars on my mother’s pedigree chart, above. However, please note that her maternal and paternal endogamous ancestors are not from the same endogamous population.

However, I STILL have fewer matches on my mother’s side in total than on my father’s side because my mother has recent Dutch and recent German immigrants which reduces her total number of matches. Neither of those lines have had as much time to produce descendants in the US, and Europe is under-sampled when compared with the US where more people tend to take DNA tests because they are searching for where they came from.

My father’s ancestors have been in the US since it was a British Colony, and I have many more cousins who have tested on his side than mother’s.

If you looked at my pedigree chart and thought to yourself, “that’s messy,” you’d be right.

The “endogamy means more matches” axiom does not hold true for me, comparatively, between my parents – in part because my mother’s German and Dutch lines are such recent immigrants.

The number of matches alone isn’t going to tell this story.

We are going to need to look at several pieces and parts for more information. Let’s start with ethnicity.

Ethnicity and Populations

Ethnicity can be a double-edged sword. It can tell you exactly nothing you couldn’t discern by looking in the mirror, or, conversely, it can be a wealth of information.

Ethnicity reveals the parts of the world where your ancestors originated. When searching for recent ancestors, you’re most interested in majority ethnicity, meaning the 50% of your DNA that you received from each of your parents.

Ethnicity results at each vendor are easy to find and relatively easy to understand.

This individual at FamilyTreeDNA is 100% Ashkenazi Jewish.

If they were 50% Jewish, we could then estimate, and that’s an important word, that either one of their parents was fully Jewish, and not the other, or that two of their grandparents were Jewish, although not necessarily on the same side.

On the other hand, my mother’s ethnicity, shown below, has nothing remarkable that would point to any majority endogamous population, yet she has two.

The only hint of endogamy from ethnicity would be her ~1% Americas, and that isn’t relevant for finding close relatives. However, minority ancestry is very relevant for identifying Native ancestors, which I wrote about, here.

You can correlate or track your ethnicity segments to specific ancestors, which I discussed in the article, Native American & Minority Ancestors Identified Using DNAPainter Plus Ethnicity Segments, here.

Since I wrote that article, FamilyTreeDNA has added the feature of ethnicity or population Chromosome Painting, based on where each of your populations fall on your chromosomes.

In this example on chromosome 1, I have European ancestry (blue,) except for the pink Native segment, which occurs on the following segment in the same location on my mother’s chromosome 1 as well.

Both 23andMe, and FamilyTreeDNA provide chromosome painting AND the associated segment information so you can identify the relevant ancestors.

Ancestry is in the process of rolling out an ethnicity painting feature, BUT, it has no segment or associated matching information. While it’s interesting eye candy, it’s not terribly useful beyond the ethnicity information that Ancestry already provides. However, Jonny Perl at DNAPainter has devised a way to estimate Ancestry’s start and stop locations, here. Way to go Jonny!

Now all you need to do is convince your Ancestry matches to upload their DNA file to one of the three databases, FamilyTreeDNA, MyHeritage, and GEDMatch, that accept transfers, aka uploads. This allows matching with segment data so that you can identify who matches you on that segment, track your ancestors, and paint your ancestral segments at DNAPainter.

I provided step-by-step instructions, here, for downloading your raw DNA file from each vendor in order to upload the file to another vendor.

Ethnicity Sides

Three of the four DNA testing vendors, 23andMe, FamilyTreeDNA, and recently, Ancestry, attempt to phase your ethnicity DNA, meaning to assign it to one parental “side” or the other – both in total and on each chromosome.

Here’s Ancestry’s SideView, where your DNA is estimated to belong to parent 1 and parent 2. I detailed how to determine which side is which, here, and while that article was written specifically pertaining to Ancestry’s SideView, the technique is relevant for all the vendors who attempt to divide your DNA into parents, a technique known as phasing.

I say “attempt” because phasing may or may not be accurate, meaning the top chromosome may not always be parent 1, and the bottom chromosome may not always be chromosome 2.

Here’s an example at 23andMe.

See the two yellow segments. They are both assigned as Native. I happen to know one is from the mother and one is from the father, yet they are both displayed on the “top” chromosome, which one would interpret to be the same parent.

I am absolutely positive this is not the case because this is a close family member, and I have the DNA of the parent who contributed the Native segment on chromosome 1, on the top chromosome. That parent does not have a Native segment on chromosome 2 to contribute. So that Native segment had to be contributed by the other parent, but it’s also shown on the top chromosome.

The DNA segments circled in purple belong together on the same “side” and were contributed to the tester by the same parent. The Native segment on chromosome 2 abuts a purple African segment, suggesting perhaps that the ancestor who contributed that segment was mixed between those ethnicities. In the US, that suggests enslavement.

The other African segments, circled, are shown on the second chromosome in each pair.

To be clear, parent 1 is not assigned by the vendors to either mother or father and will differ by person. Your parent 1, or the parent on the top chromosome may be your mother and another person’s parent 1 may be their father.

As shown in this example, parents can vary by chromosome, a phenomenon known as “strand swap.” Occasionally, the DNA can even be swapped within a chromosome assignment.

You can, however, get an idea of the division of your DNA at any specific location. As shown above, you can only have a maximum of two populations of DNA on any one chromosome location.

In our example above, this person’s majority ancestry is European (blue.) On each chromosome where we find a minority segment, the opposite chromosome in the same location is European, meaning blue.

Let’s look at another example.

At FamilyTreeDNA, the person whose ethnicity painting is shown below has a Native American (pink) ancestor on their father’s side. FamilyTreeDNA has correctly phased or identified their Native segments as all belonging to the second chromosome in each pair.

Looking at chromosome 18, for example, most of their father’s chromosome is Native American (pink). The other parent’s chromosome is European (dark blue) at those same locations.

If one of the parents was of one ethnicity, and the other parent is a completely different ethnicity, then one bar of each chromosome would be all pink, for example, and one would be entirely blue, representing the other ethnicity.

Phasing ethnicity or populations to maternal and paternal sides is not foolproof, and each chromosome is phased individually.

Ethnicity can, in some cases, give you a really good idea of what you’re dealing with in terms of heritage and endogamy.

If someone had an Ashkenazi Jewish father and European mother, for example, one copy of each chromosome would be yellow (Ashkenazi Jewish), and one would be blue (European.)

However, if each of their parents were half European Jewish and half European (not Jewish), then their different colored segments would be scattered across their entire set of chromosomes.

In this case, both of the tester’s parents are mixed – European Jewish (green) and Western Europe (blue.) We know both parents are admixed from the same two populations because in some locations, both parents contributed blue (Western Europe), and in other locations, both contributed Jewish (green) segments.

Both MyHeritage and Ancestry provide a secondary tool that’s connected to ethnicity, but different and generally in more recent times.

Ancestry’s DNA Communities

While your ethnicity may not point to anything terribly exciting in terms of endogamy, Genetic Communities might. Ancestry says that a DNA Community is a group of people who share DNA because their relatives recently lived in the same place at the same time, and that communities are much smaller than ethnicity regions and reach back only about 50-300 years.

Based on the ancestors’ locations in the trees of me and my matches, Ancestry has determined that I’m connected to two communities. In my case, the blue group is clearly my father’s line. The orange group could be either parent, or even a combination of both.

My endogamous Brethren could be showing up in Maryland, Pennsylvania, and Ohio, but it’s uncertain, in part, because my father’s ancestral lines are found in Virginia, West Virginia, and Maryland too.

These aren’t useful for me, but they may be more useful for fully endogamous people, especially in conjunction with ethnicity.

My Acadian cousin’s European ethnicity isn’t informative.

However, viewing his DNA Communities puts his French heritage into perspective, especially combined with his match surnames.

I wrote about DNA Communities when it was introduced with the name Genetic Communities, here.

MyHeritage’s Genetic Groups

MyHeritage also provides a similar feature that shows where my matches’ ancestors lived in the same locations as mine.

One difference, though, is that testers can adjust their ethnicity results confidence level from high, above, to low, below where one of my Genetic Groups overlaps my ethnicity in the Netherlands.

You can also sort your matches by Genetic Groups.

The results show you not only who is in the group, but how many of your matches are in that group too, which provides perspective.

I wrote about Genetic Groups, here.

Next, let’s look at how endogamy affects your matches.

Matches

The number of matches that a person has who is from an entirely endogamous community and a person with no endogamy may be quite different.

FamilyTreeDNA provides a Family Matching feature that triangulates your matches and assigns them to your paternal or maternal side by using known matches that you have linked to their profile cards in your tree. You must link people for the Family Matching feature known as “bucketing” to be enabled.

The people you link are then processed for shared matches on the same chromosome segment(s). Triangulated individuals are then deposited in your maternal, paternal, and both buckets.

Obviously, your two parents are the best people to link, but if they haven’t tested (or uploaded their DNA file from another vendor) and you have other known relatives, link them using the Family Tree tab at the top of your personal page.

I uploaded my Ancestry V4 kit to use as an example for linking. Let’s pretend that’s my sister. If I had not already linked my Ancestry V4 kit to “my sister’s” profile card, I’d want to do that and link other known individuals the same way. Just drag and drop the match to the correct profile card.

Note that a full or half sibling will be listed as such at FamilyTreeDNA, but an identical twin will show as a potential parent/child match to you. You’re much more likely to find a parent than an identical twin, but just be aware.

I’ve created a table of FamilyTreeDNA bucketed match results, by category, comparing the number of matches in endogamous categories with non-endogamous.

Total Matches Maternal Matches Paternal Matches Both % Both % DNA Unassigned
100% Jewish 34,637 11,329 10,416 4,806 13.9 23.3
100% Jewish 32,973 10,700 9,858 4,606 14 23.7
100% Jewish 32,255 9,060 10,970 3,892 12 25.8
75% Jewish 24,232 11,846 Only mother linked Only mother linked Only mother linked
100% Acadian 8093 3826 2299 1062 13 11
100% Acadian 7828 3763 1825 923 11.8 17
Not Endogamous 6760 3845 1909 13 0.19 14.5
Not Endogamous 7723 1470 3317 6 0.08 38
100% Native American 1,115 Unlinked Unlinked Unlinked
100% Native American 885 290 Unknown Can’t calculate without at least one link on both sides

The 100% Jewish, Acadian, and Not Endogamous testers both have linked their parents, so their matches, if valid (meaning not identical by chance, which I discussed here,) will match them plus one or the other parent.

One person is 75% Jewish and has only linked their Jewish mother.

The Native people have not tested their parents, and the first Native person has not linked anyone in their tree. The second Native person has only linked a few maternal matches, but their mother has not tested. They are seeking their father.

It’s very difficult to find people who are fully Native as testers. Furthermore, Native people are under-sampled. If anyone knows of fully Native (or other endogamous) people who have tested and linked their parents or known relatives in their trees, and will allow me to use their total match numbers anonymously, please let me know.

As you can see, Jewish, Acadian, and Native people are 100% endogamous, but many more Jewish people than Native people have tested, so you CAN’T judge endogamy by the total number of matches alone.

In fact, in order:

  • Fully Jewish testers have about 4-5 times as many matches as the Acadian and Non-endogamous testers
  • Acadian and Non-endogamous testers have about 5-6 times as many matches as the Native American testers
  • Fully Jewish people have about 30 times more matches than the Native American testers

If a person’s endogamy with a particular population is only on their maternal or paternal side, they won’t have a significant number of people related to both sides, meaning few people will fall into the “Both” bucket. People that will always be found in the ”Both” bucket are full siblings and their descendants, along with descendants of the tester, assuming their match is linked to their profiles in the tester’s tree.

In the case of our Jewish testers, you can easily see that the “Both” bucket is very high. The Acadians are also higher than one would reasonably expect without endogamy. A non-endogamous person might have a few matches on both sides, assuming the parents are not related to each other.

A high number of “Both” matches is a very good indicator of endogamy within the same population on both parents’ sides.

The percentage of people who are assigned to the “Both” bucket is between 11% and 14% in the endogamous groups, and less than 1% in the non-endogamous group, so statistically not relevant.

As demonstrated by the Native people compared to the Jewish testers, the total number of matches can be deceiving.

However, being related to both parents, as indicated by the “Both” bucket, unless you have pedigree collapse, is a good indicator of endogamy.

Of course, if you don’t know who your relatives are, you can’t link them in your tree, so this type of “hunt” won’t generally help people seeking their close family members.

However, you may notice that you’re matching people PLUS both of their parents. If that’s the case, start asking questions of those matches about their heritage.

A very high number of total matches, as compared to non-endogamous people, combined with some other hints might well point to Jewish heritage.

I included the % DNA Unassigned category because this category, when both parents are linked, is the percentage of matches by chance, meaning the match doesn’t match either of the tester’s parents. All of the people with people listed in “Both” categories have linked both of their parents, not just maternal and paternal relatives.

Matching Location at MyHeritage

MyHeritage provides a matching function by location. Please note that it’s the location of the tester, but that may still be quite useful.

The locations are shown in the most-matches to least-matches order. Clicking on the location shows the people who match you who are from that location. This would be the most useful in situations where recent immigration has occurred. In my case, my great-grandfather from the Netherlands arrived in the 1860s, and my German ancestors arrived in the 1850s. Neither of those groups are endogamous, though, unless it would be on a village level.

AutoClusters

Let’s shift to Genetic Affairs, a third-party tool available to everyone.

Using their AutoCluster function, Genetic Affairs clusters your matches together who match both each other and you.

This is an example of the first few clusters in my AutoCluster. You can see that I have several colored clusters of various sizes, but none are huge.

Compare that to the following endogamous cluster, sample courtesy of EJ Blom at Genetic Affairs.

If your AutoCluster at Genetic Affairs looks something like this, a huge orange blob in the upper left hand corner, you’re dealing with endogamy.

Please also note that the size of your cluster is also a function of both the number of testers and the match threshold you select. I always begin by using the defaults. I wrote about using Genetic Affairs, here.

If you tested at or transferred to MyHeritage, they too license AutoClusters, but have optimized the algorithm to tease out endogamous matches so that their Jewish customers, in particular, don’t wind up with a huge orange block of interrelated people.

You won’t see the “endogamy signature” huge cluster in the corner, so you’re less likely to be able to discern endogamy from a MyHeritage cluster alone.

The commonality between these Jewish clusters at MyHeritage is that they all tend to be rather uniform in size and small, with lots of grey connecting almost all the blocks.

Grey cells indicate people who match people in two colored groups. In other words, there is often no clear division in clusters between the mother’s side and the father’s side in Jewish clusters.

In non-endogamous situations, even if you can’t identify the parents, the clusters should still fall into two sides, meaning a group of clusters for each parent’s side that are not related to each other.

You can read more about Genetic Affairs clusters and their tools, here. DNAGedcom.com also provides a clustering tool.

Endogamous Relationships

Endogamous estimated relationships are sometimes high. Please note the word, “sometimes.”

Using the Shared cM Project tool relationship chart, here, at DNAPainter, people with heavy endogamy will discover that estimated relationships MAY be on the high side, or the relationships may, perhaps, be estimated too “close” in time. That’s especially true for more distant relationships, but surprisingly, it’s not always true. The randomness of inheritance still comes into play, and so do potential unknown relatives. Hence, the words “may” are bolded and underscored.

Unfortunately, it’s often stated as “conventional wisdom” that Jewish matches are “always” high, and first cousins appear as siblings. Let’s see what the actual data says.

At DNAPainter, you can either enter the amount of shared DNA (cM), or the percent of shared DNA, or just use the chart provided.

I’ve assembled a compilation of close relationships in kits that I have access to or from people who were generous enough to share their results for this article.

I’ve used Jewish results, which is a highly endogamous population, compared with non-endogamous testers.

The “Jewish Actual” column reports the total amount of shared DNA with that person. In other words, someone to their grandparent. The Average Range is the average plus the range from DNAPainter. The Percent Difference is the % difference between the actual number and the DNAPainter average.

You’ll see fully Jewish testers, at left, matching with their family members, and a Non-endogamous person, at right, matching with their same relative.

Relationship Jewish Actual Percent Difference than Average Average -Range Non-endogamous Actual Percent Difference than Average
Grandparent 2141 22 1754 (984-2482) 1742 <1 lower
Grandparent 1902 8.5 1754 (984-2482) 1973 12
Sibling 3039 16 2613 (1613-3488) 2515 3.5 lower
Sibling 2724 4 2613 (1613-3488) 2761 5.5
Half-Sibling 2184 24 1759 (1160-2436) 2127 21
Half-Sibling 2128 21 1759 (1160-2436) 2352 34
Aunt/Uncle 2066 18.5 1741 (1201-2282) 1849 6
Aunt/Uncle 2031 16.5 1741 (1201-2282) 2097 20
1C 1119 29 866 (396-1397) 959 11
1C 909 5 866 (396-1397) 789 9 lower
1C1R 514 19 433 (102-980) 467 8
1C1R 459 6 433 (102-980) 395 9 lower

These totals are from FamilyTreeDNA except one from GEDMatch (one Jewish Half-sibling).

Totals may vary by vendor, even when matching with the same person. 23andMe includes the X segments in the total cMs and also counts fully identical segments twice. MyHeritage imputation seems to err on the generous side.

However, in these dozen examples:

  • You can see that the Jewish actual amount of DNA shared is always more than the average in the estimate.
  • The red means the overage is more than 100 cM larger.
  • The percentage difference is probably more meaningful because 100 cM is a smaller percentage of a 1754 grandparent connection than compared to a 433 cM 1C1R.

However, you can’t tell anything about endogamy by just looking at any one sample, because:

  • Some of the Non-Endogamous matches are high too. That’s just the way of random inheritance.
  • All of the actual Jewish match numbers are within the published ranges, but on the high side.

Furthermore, it can get more complex.

Half Endogamous

I requested assistance from Jewish genealogy researchers, and a lovely lady, Sharon, reached out, compiled her segment information, and shared it with me, granting permission to share with you. A HUGE thank you to Sharon!

Sharon is half-Jewish via one parent, and her half-sibling is fully Jewish. Their half-sibling match to each other at Ancestry is 1756 cM with a longest segment of 164 cM.

How does Jewish matching vary if you’re half-Jewish versus fully Jewish? Let’s look at 21 people who match both Sharon and her fully Jewish half-sibling.

Sharon shared the differences in 21 known Jewish matches with her and her half-sibling. I’ve added the Relationship Estimate Range from DNAPainter and colorized the highest of the two matches in yellow. Bolding in the total cM column shows a value above the average range for that relationship.

Total Matching cMs is on the left, with Longest Segment on the right.

While this is clearly not a scientific study, it is a representative sample.

The fully Jewish sibling carries more Jewish DNA, which is available for other Jewish matches to match as a function of endogamy (identical by chance/population), so I would have expected the fully Jewish sibling to match most if not all Jewish testers at a higher level than the half-Jewish sibling.

However, that’s not universally what we see.

The fully Jewish sibling is not always the sibling with the highest number of matches to the other Jewish testers, although the half-Jewish tester has the larger “Longest Segment” more often than not.

Approximately two-thirds of the time (13/21), the fully Jewish person does have a higher total matching cM, but about one-third of the time (8/21), the half-Jewish sibling has a higher matching cM.

About one-fourth of the time (5/21), the fully Jewish sibling has the longest matching segment, and about two-thirds of the time (13/21), the half-Jewish sibling does. In three cases, or about 14% of the time, the longest segment is equal which may indicate that it’s the same segment.

Because of endogamy, Jewish matches are more likely to have:

  • Larger than average total cM for the specific relationship
  • More and smaller matching segments

However, as we have seen, neither of those are definitive, nor always true. Jewish matches and relationships are not always overestimated.

Ancestry and Timber

Please note that Ancestry downweights some matches by removing some segments using their Timber algorithm. Based on my matches and other accounts that I manage, Ancestry does not downweight in the 2-3rd cousin category, which is 90 cM and above, but they do begin downweighting in the 3-4th cousin category, below 90 cM, where my “Extended Family” category begins.

If you’ve tested at Ancestry, you can check for yourself.

By clicking on the amount of DNA you share with your match on your match list at Ancestry, shown above, you will be taken to another page where you will be able to view the unweighted shared DNA with that match, meaning the amount of DNA shared before the downweighting and removal of some segments, shown below.

Given the downweighting, and the information in the spreadsheet provided by Sharon, it doesn’t appear that any of those matches would have been in a category to be downweighted.

Therefore, for these and other close matches, Timber wouldn’t be a factor, but would potentially be in more distant matches.

Endogamous Segments

Endogamous matches tend to have smaller and more segments. Small amounts of matching DNA tend to skew the total DNA cM upwards.

How and why does this happen?

Ancestral DNA from further back in time tends to be broken into smaller segments.

Sometimes, especially in endogamous situations, two smaller segments, at one time separated from each other, manage to join back together again and form a match, but the match is only due to ancestral segments – not because of a recent ancestor.

Please note that different vendors have different minimum matching cM thresholds, so smaller matches may not be available at all vendors. Remember that factors like Timber and imputation can affect matching as well.

Let’s take a look at an example. I’ve created a chart where two ancestors have their blue and pink DNA broken into 4 cM segments.

They have children, a blue child and a pink child, and the two children, shown above, each inherited the same blue 4 cM segment and the same pink 4 cM segment from their respective parents. The other unlabeled pink and blue segments are not inherited by these two children, so those unlabeled segments are irrelevant in this example.

The parents may have had other children who inherited those same 4 cM labeled pink and blue segments as well, and if not, the parents’ siblings were probably passing at least some of the same DNA down to their descendants too.

The blue and pink children had children, and their children had children – for several generations.

Time passed, and their descendants became an endogamous community. Those pink and blue 4 cM segments may at some time be lost during recombination in the descendants of each of their children, shown by “Lost pink” and “Lost blue.”

However, because there is only a very limited amount of DNA within the endogamous community, their descendants may regain those same segments again from their “other parent” during recombination, downstream.

In each generation, the DNA of the descendant carrying the original blue or pink DNA segment is recombined with their partner. Given that the partners are both members of the same endogamous community, the two people may have the same pink and/or blue DNA segments. If one parent doesn’t carry the pink 4 cM segment, for example, their offspring may receive that ancestral pink segment from the other parent.

They could potentially, and sometimes do, receive that ancestral segment from both parents.

In our example, the descendants of the blue child, at left, lost the pink 4 cM segment in generation 3, but a few generations later, in generation 11, that descendant child inherited that same pink 4 cM segment from their other parent. Therefore, both the 4 cM blue and 4 cM pink segments are now available to be inherited by the descendants in that line. I’ve shown the opposite scenario in the generational inheritance at right where the blue segment is lost and regained.

Once rejoined, that pink and blue segment can be passed along together for generations.

The important part, though, is that once those two segments butt up against each other again during recombination, they aren’t just two separate 4 cM segments, but one segment that is 8 cM long – that is now equal to or above the vendors’ matching threshold.

This is why people descended from endogamous populations often have the following matching characteristics:

  • More matches
  • Many smaller segment matches
  • Their total cM is often broken into more, smaller segments

What does more, smaller segments, look like, exactly?

More, Smaller Segments

All of our vendors except Ancestry have a chromosome browser for their customers to compare their DNA to that of their matches visually.

Let’s take a look at some examples of what endogamous and non-endogamous matches look like.

For example, here’s a screen shot of a random Jewish second cousin match – 298 cM total, divided into 12 segments, with a longest segment of 58 cM,

A second Jewish 2C with 323 cM total, across 19 segments, with a 69 cM longest block.

A fully Acadian 2C match with 600 cM total, across 27 segments, with a longest segment of 69 cM.

A second Acadian 2C with 332 cM total, across 20 segments, with a longest segment of 42 cM.

Next, a non-endogamous 2C match with 217 cM, across 7 segments, with a longest segment of 72 cM.

Here’s another non-endogamous 2C example, with 169 shared cM, across 6 segments, with a longest segment of 70 cM.

Here’s the second cousin data in a summary table. The take-away from this is the proportion of total segments

Tester Population Total cM Longest Block Total Segments
Jewish 2C 298 58 12
Jewish 2C 323 69 19
Acadian 2C 600 69 27
Acadian 2C 332 42 20
Non-endogamous 2C 217 72 7
Non-endogamous 2C 169 70 6

You can see more examples and comparisons between Native American, Jewish and non-endogamous DNA individuals in the article, Concepts – Endogamy and DNA Segments.

I suspect that a savvy mathematician could predict endogamy based on longest block and total segment information.

Lara Diamond, a mathematician, who writes at Lara’s Jewnealogy might be up for this challenge. She just published compiled matching and segment information in her Ashkenazic Shared DNA Survey Results for those who are interested. You can also contribute to Laura’s data, here.

Endogamy, Segments, and Distant Relationships

While not relevant to searching for close relatives, heavily endogamous matches 3C and more distant, to quote one of my Jewish friends, “dissolve into a quagmire of endogamy and are exceedingly difficult to unravel.”

In my own Acadian endogamous line, I often simply have to label them “Acadian” because the DNA tracks back to so many ancestors in different lines. In other words, I can’t tell which ancestor the match is actually pointing to because the same DNA segments or segments is/are carried by several ancestors and their descendants due to founder effect.

The difference with the Acadians is that we can actually identify many or most of them, at least at some point in time. As my cousin, Paul LeBlanc, once said, if you’re related to one Acadian, you’re related to all Acadians. Then he proceeded to tell me that he and I are related 137 different ways. My head hurts!

It’s no wonder that endogamy is incredibly difficult beyond the first few generations when it turns into something like multi-colored jello soup.

“Are Your Parents Related?” Tool

There’s another tool that you can utilize to determine if your parents are related to each other.

To determine if your parents are related to each other, you need to know about ROH, or Runs of Homozygosity (ROH).

ROH means that the DNA on both strands or copies of the same chromosome is identical.

For a few locations in a row, ROH can easily happen just by chance, but the longer the segment, the less likely that commonality occurs simply by chance.

The good news is that you don’t need to know the identity of either of your parents. You don’t need either of your parent’s DNA tests – just your own. You’ll need to upload your DNA file to GEDmatch, which is free.

Click on “Are your parents related?”

GEDMatch analyzes your DNA to see if any of your DNA, above a reasonable matching threshold, is identical on both strands, indicating that you inherited the exact same DNA from both of your parents.

A legitimate match, meaning one that’s not by chance, will include many contiguous matching locations, generally a minimum of 500 SNPs or locations in a row. GEDmatch’s minimum threshold for identifying identical ancestral DNA (ROH) is 200 cM.

Here’s my result, including the graphic for the first two chromosomes. Notice the tiny green bars that show identical by chance tiny sliver segments.

I have no significant identical DNA, meaning my parents are not related to each other.

Next, let’s look at an endogamous example where there are small, completely identical segments across a person’s chromosome

This person’s Acadian parents are related to each other, but distantly.

Next, let’s look at a Jewish person’s results.

You’ll notice larger green matching ROH, but not over 200 contiguous SNPs and 7 cM.

GEDMatch reports that this Jewish person’s parents are probably not related within recent generations, but it’s clear that they do share DNA in common.

People whose parents are distantly related have relatively small, scattered matching segments. However, if you’re seeing larger ROH segments that would be large enough to match in a genealogical setting, meaning multiple greater than 7 cM and 500 SNPs,, you may be dealing with a different type of situation where cousins have married in recent generations. The larger the matching segments, generally, the closer in time.

Blogger Kitty Cooper wrote an article, here, about discovering that your parents are related at the first cousin level, and what their GEDMatch “Are Your Parents Related” results look like.

Let’s look for more clues.

Surnames

There MAY be an endogamy clue in the surnames of the people you match.

Viewing surnames is easier if you download your match list, which you can do at every vendor except Ancestry. I’m not referring to the segment data, but the information about your matches themselves.

I provided instructions in the recent article, How to Download Your DNA Match Lists and Segment Files, here.

If you suspect endogamy for any reason, look at your closest matches and see if there is a discernable trend in the surnames, or locations, or any commonality between your matches to each other.

For example, Jewish, Acadian, and Native surnames may be recognizable, as may locations.

You can evaluate in either or both of two ways:

  • The surnames of your closest matches. Closest matches listed first will be your default match order.
  • Your most frequently occurring surnames, minus extremely common names like Smith, Jones, etc., unless they are also in your closest matches. To utilize this type of matching, sort the spreadsheet in surname order and then scan or count the number of people with each surname.

Here are some examples from our testers.

Jewish – Closest surname matches.

  • Roth
  • Weiss
  • Goldman
  • Schonwald
  • Levi
  • Cohen
  • Slavin
  • Goodman
  • Sender
  • Trebatch

Acadian – Closest surname matches.

  • Bergeron
  • Hebert
  • Bergeron
  • Marcum
  • Muise
  • Legere
  • Gaudet
  • Perry
  • Verlander
  • Trombley

Native American – Closest surname matches.

  • Ortega
  • Begay
  • Valentine
  • Hayes
  • Montoya
  • Sun Bear
  • Martin
  • Tsosie
  • Chiquito
  • Yazzie

You may recognize these categories of surnames immediately.

If not, Google is your friend. Eliminate common surnames, then Google for a few together at a time and see what emerges.

The most unusual surnames are likely your best bets.

Projects

Another way to get some idea of what groups people with these surnames might belong to is to enter the surname in the FamilyTreeDNA surname search.

Go to the main FamilyTreeDNA page, but DO NOT sign on.

Scroll down until you see this image.

Type the surname into the search box. You’ll see how many people have tested with that surname, along with projects where project administrators have included that surname indicating that the project may be of interest to at least some people with that surname.

Here’s a portion of the project list for Cohen, a traditional Jewish surname.

These results are for Muise, an Acadian surname.

Clicking through to relevant surname projects, and potentially contacting the volunteer project administrator can go a very long way in helping you gather and sift information. Clearly, they have an interest in this topic.

For example, here’s the Muise surname in the Acadian AmerIndian project. Two great hints here – Acadian heritage and Halifax, Nova Scotia.

Repeat for the balance of surnames on your list to look for commonalities, including locations on the public project pages.

Locations

Some of the vendor match files include location information. Each person on your match list will have the opportunity at the vendor where they tested to include location information in a variety of ways, either for their ancestors or themselves.

Where possible, it’s easiest to sort or scan the download file for this type of information.

Ancestry does not provide or facilitate a match list, but you can still create your own for your closest 20 or 30 matches in a spreadsheet.

MyHeritage provides common surname and ancestral location information for every match. How cool is that!

Y DNA, Mitochondrial DNA, and Endogamy

Haplogroups for both Y and mitochondrial DNA can indicate and sometimes confirm endogamy. In other cases, the haplogroup won’t help, but the matches and their location information just might.

FamilyTreeDNA is the only vendor that provides Y DNA and mitochondrial DNA tests that include highly granular haplogroups along with matches and additional tools.

23andMe provides high-level haplogroups which may or may not be adequate to pinpoint a haplogroup that indicates endogamy.

Of course, only males carry Y DNA that tracks to the direct paternal (surname) line, but everyone carries their mother’s mitochondrial DNA that represents their mother’s mother’s mother’s, or direct matrilineal line.

Some haplogroups are known to be closely associated with particular ethnicities or populations, like Native Americans, Pacific Islanders, and some Jewish people.

Haplogroups reach back in time before genealogy and can give us a sense of community that’s not available by either looking in the mirror or through traditional records.

This Native American man is a member of high-level haplogroup Q-M242. However, some men who carry this haplogroup are not Native, but are of European or Middle Eastern origin.

I entered the haplogroup in the FamilyTreeDNA Discover tool, which I wrote about, here.

Checking the information about this haplogroup reveals that their common ancestor descended from an Asian man about 30,000 years ago.

The migration path in the Americans explains why this person would have an endogamous heritage.

Our tester would receive a much more refined haplogroup if he upgraded to the Big Y test at FamilyTreeDNA, which would remove all doubt.

However, even without additional testing, information about his matches at FamilyTreeDNA may be very illuminating.

The Q-M242 Native man’s Y DNA matches men with more granular haplogroups, shown above, at left. On the Haplogroup Origins report, you can see that these people have all selected the “US (Native American)” country option.

Another useful tool would be to check the public Y haplotree, here, and the public mitochondrial tree here, for self-reported ancestor location information for a specific haplogroup.

Here’s an example of mitochondrial haplogroup A2 and a few subclades on the public mitochondrial tree. You can see that the haplogroup is found in Mexico, the US (Native,) Canada, and many additional Caribbean, South, and Central American countries.

Of course, Y DNA and mitochondrial DNA (mtDNA) tell a laser-focused story of one specific line, each. The great news, if you’re seeking information about your mother or father, the Y is your father’s direct paternal (surname) line, and mitochondrial is your mother’s direct matrilineal line.

Y and mitochondrial DNA results combined with ethnicity, autosomal matching, and the wide range of other tools that open doors, you will be able to reveal a great deal of information about whether you have endogamous heritage or not – and if so, from where.

I’ve provided a resource for stepping through and interpreting your Y DNA results, here, and mitochondrial DNA, here.

Discover for Y DNA Only

If you’re a female, you may feel left out of Y DNA testing and what it can tell you about your heritage. However, there’s a back door.

You can utilize the Y DNA haplogroups of your closest autosomal matches at both FamilyTreeDNA and 23andMe to reveal information

Haplogroup information is available in the download files for both vendors, in addition to the Family Finder table view, below, at FamilyTreeDNA, or on your individual matches profile cards at both 23andMe and FamilyTreeDNA.

You can enter any Y DNA haplogroup in the FamilyTreeDNA Discover tool, here.

You’ll be treated to:

  • Your Haplogroup Story – how many testers have this haplogroup (so far), where the haplogroup is from, and the haplogroup’s age. In this case, the haplogroup was born in the Netherlands about 250 years ago, give or take 200 years. I know that it was 1806 or earlier based on the common ancestor of the men who tested.
  • Country Frequency – heat map of where the haplogroup is found in the world.
  • Notable Connections – famous and infamous (this haplogroup’s closest notable person is Leo Tolstoy).
  • Migration Map – migration path out of Africa and through the rest of the world.
  • Ancient Connections – ancient burials. His closest ancient match is from about 1000 years ago in Ukraine. Their shared ancestor lived about 2000 years ago.
  • Suggested Projects – based on the surname, projects that other matches have joined, and haplogroups.
  • Scientific Details – age estimates, confidence intervals, graphs, and the mutations that define this haplogroup.

I wrote about the Discover tool in the article, FamilyTreeDNA DISCOVER Launches – Including Y DNA Haplogroup Ages.

Endogamy Tools Summary Tables

Endogamy is a tough nut sometimes, especially if you’re starting from scratch. In order to make this topic a bit easier and to create a reference tool for you, I’ve created three summary tables.

  • Various endogamy-related tools available at each vendor which will or may assist with evaluating endogamy
  • Tools and their ability to detect endogamy in different groups
  • Tools best suited to assist people seeking information about unknown parents or grandparents

Summary of Endogamy Tools by Vendor

Please note that GEDMatch is not a DNA testing vendor, but they accept uploads and do have some tools that the testing vendors do not.

 Tool 23andMe Ancestry FamilyTreeDNA MyHeritage GEDMatch
Ethnicity Yes Yes Yes Yes Use the vendors
Ethnicity Painting Yes + segments Yes, limited Yes + segments Yes
Ethnicity Phasing Yes Partial Yes No
DNA Communities No Yes No No
Genetic Groups No No No Yes
Family Matching aka Bucketing No No Yes No
Chromosome Browser Yes No Yes Yes Yes
AutoClusters Through Genetic Affairs No Through Genetic Affairs Yes, included Yes, with subscription
Match List Download Yes, restricted # of matches No Yes Yes Yes
Projects No No Yes No
Y DNA High-level haplogroup only No Yes, full haplogroup with Big Y, matching, tools, Discover No
Mitochondrial DNA High-level haplogroup only No Yes, full haplogroup with mtFull, matching, tools No
Public Y Tree No No Yes No
Public Mito Tree No No Yes No
Discover Y DNA – public No No Yes No
ROH No No No No Yes

Summary of Endogamous Populations Identified by Each Tool

The following chart provides a guideline for which tools are useful for the following types of endogamous groups. Bolded tools require that both parents be descended from the same endogamous group, but several other tools give more definitive results with higher amounts of endogamy.

Y and mitochondrial DNA testing are not affected by admixture, autosomal DNA or anything from the “other” parent.

Tool Jewish Acadian Anabaptist Native Other/General
Ethnicity Yes No No Yes Pacific Islander
Ethnicity Painting Yes No No Yes Pacific Islander
Ethnicity Phasing Yes, if different No No Yes, if different Pacific Islander, if different
DNA Communities Yes Possibly Possibly Yes Pacific Islander
Genetic Groups Yes Possibly Possibly Yes Pacific Islander
Family Matching aka Bucketing Yes Yes Possibly Yes Pacific Islander
Chromosome Browser Possibly Possibly Yes, once segments or ancestors identified Possibly Pacific Islander, possibly
Total Matches Yes, compared to non-endogamous No No No No, unknown
AutoClusters Yes Yes Uncertain, probably Yes Pacific Islander
Estimated Relationships High Not always Sometimes No Sometimes Uncertain, probably
Relationship Range High Possibly, sometimes Possibly Possibly Possibly Pacific Islander, possibly
More, Smaller Segments Yes Yes Probably Yes Pacific Islander, probably
Parents Related Some but minimal Possibly Uncertain Probably similar to Jewish Uncertain, Possibly
Surnames Probably Probably Probably Not Possibly Possibly
Locations Possibly Probably Probably Not Probably Probably Pacific Islander
Projects Probably Probably Possibly Possibly Probably Pacific Islander
Y DNA Yes, often Yes, often No Yes Pacific Islander
Mitochondrial DNA Yes, often Sometimes No Yes Pacific Islander
Y public tree Probably not alone No No Yes Pacific Islander
MtDNA public tree Probably not No No Yes Pacific Islander
Y DNA Discover Yes Possibly Probably not, maybe projects Yes Pacific Islander

Summary of Endogamy Tools to Assist People Seeking Unknown Parents and Grandparents

This table provides a summary of when each of the various tools can be useful to:

  • People seeking unknown close relatives
  • People who already know who their close relatives are, but are seeking additional information or clues about their genealogy

I considered rating these on a 1 to 10 scale, but the relative usefulness of these tools is dependent on many factors, so different tools will be more or less useful to different people.

For example, ethnicity is very useful if someone is admixed from different populations, or even 100% of a specific endogamous population. It’s less useful if the tester is 100% European, regardless of whether they are seeking close relatives or not. Conversely, even “vanilla” ethnicity can be used to rule out majority or recent admixture with many populations.

Tools Unknown Close Relative Seekers Known Close Relatives – Enhance Genealogy
Ethnicity Yes, to identify or rule out populations Yes
Ethnicity Painting Yes, possibly, depending on population Yes, possibly, depending on population
Ethnicity Phasing Yes, possibly, depending on population Yes, possibly, depending on population
DNA Communities Yes, possibly, depending on population Yes, possibly, depending on population
Genetic Groups Possibly, depending on population Possibly, depending on population
Family Matching aka Bucketing Not if parents are entirely unknown, but yes if one parent is known Yes
Chromosome Browser Unlikely Yes
AutoClusters Yes Yes, especially at MyHeritage if Jewish
Estimated Relationships High Not No
Relationship Range High Not reliably No
More, Smaller Segments Unlikely Unlikely other than confirmation
Match List Download Yes Yes
Surnames Yes Yes
Locations Yes Yes
Projects Yes Yes
Y DNA Yes, males only, direct paternal line, identifies surname lineage Yes, males only, direct paternal line, identifies and correctly places surname lineage
Mitochondrial DNA Yes, both sexes, direct matrilineal line only Yes, both sexes, direct matrilineal line only
Public Y Tree Yes for locations Yes for locations
Public Mito Tree Yes for locations Yes for locations
Discover Y DNA Yes, for heritage information Yes, for heritage information
Parents Related – ROH Possibly Less useful

Acknowledgments

A HUGE thank you to several people who contributed images and information in order to provide accurate and expanded information on the topic of endogamy. Many did not want to be mentioned by name, but you know who you are!!!

If you have information to add, please post in the comments.

_____________________________________________________________

Follow DNAexplain on Facebook, here or follow me on Twitter, here.

Share the Love!

You’re always welcome to forward articles or links to friends and share on social media.

If you haven’t already subscribed (it’s free,) you can receive an email whenever I publish by clicking the “follow” button on the main blog page, here.

You Can Help Keep This Blog Free

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Uploads

Genealogy Products and Services

My Book

Genealogy Books

Genealogy Research

In Search of…Vendor Features, Strengths, and Testing Strategies

This is the third in our series of articles about searching for unknown close family members, specifically; parents, grandparents, or siblings. However, these same techniques can be applied to ancestors further back in time too.

In this article, we are going to discuss your goals and why testing or uploading to multiple vendors is advantageous – even if you could potentially solve the initial mystery at one vendor. Of course, the vendor you test with first might not be the vendor where the mystery will be solved, and data from multiple vendors might just be the combination you need.

Testing Strategy – You Might Get Lucky

I recommended in the first article that you go ahead and test at the different vendors.

Some people asked why, and specifically, why you wouldn’t just test at one vendor with the largest database first, then proceed to the others if you needed to.

That’s a great question, and I want to discuss the pros and cons in this article more specifically.

Clearly, that is one strategy, but the approach you select might differ based on a variety of considerations:

  • You may only be interested in obtaining the name of the person you are seeking – or – you may be interested in finding out as much as possible.
  • You may find that your best match at one company is decidedly unhelpful, and may even block you or your efforts, while someone elsewhere may be exactly the opposite.
  • Solving your mystery may be difficult and painful at one vendor, but the answer may be infinitely easier at a different vendor where the answer may literally be waiting.
  • There may not be enough, or the right information, or matches, at any one vendor, but the puzzle may be solvable by combining information from multiple vendors and tests. Every little bit helps.
  • You may have a sense of urgency, especially if you hope to meet the person and you’re searching for parents, siblings or grandparents who may be aging.
  • You may be cost-sensitive and cannot afford more than one test at a time. Fortunately, our upload strategy helps with that too. Also, watch for vendor sales or bundles.

From the time you order your DNA test, it will be about 6-8 weeks, give or take a week or two in either direction, before you receive results.

When those results arrive, you might get lucky, and the answer you seek is immediately evident with no additional work and just waiting for you at the first testing company.

If that’s the case, you got lucky and hit the jackpot. If you’re searching for both parents, that means you still have one parent to go.

Unidentified grandparents can be a little more difficult, because there are four of them to sort between.

If you discover a sibling or half-sibling, you still need to figure out who your common parent is. Sometimes X, Y, and mitochondrial DNA provides an immediate answer and is invaluable in these situations.

It’s more likely that you’ll find a group of somewhat more distant relatives. You may be able to figure out who your common grandparents or great-grandparents are, but not your parent(s) initially. Often, the closer generation or two is actually the most difficult because you’re dealing with contemporary records which are not publicly available, fewer descendants, and the topic may be very uncomfortable for some people. It’s also complicated because you’re often not dealing with “full” relationships, but “half,” as in half-sibling, half-niece, half-1C, etc.

You may spend a substantial amount of time trying to solve this puzzle at the first vendor before ordering your next test.

That second test will also take about 6-8 weeks, give or take. I recommend that you order the first two autosomal tests, now.

Order Your First Two Autosomal Tests

The two testing companies with the largest autosomal databases for comparison, Ancestry, and 23andMe, DO NOT accept DNA file uploads from other companies, so you’ll need to test with each individually.

Fortunately, you CAN transfer your autosomal DNA tests to both MyHeritage and FamilyTreeDNA, for free.

You will have different matches at each company. Some people will be far more responsive and helpful than others.

I recommend that you go ahead and order both the Ancestry and 23andMe tests initially, then upload the first one that comes back with results to both FamilyTreeDNA and MyHeritage. Complete, step-by-step download/upload instructions can be found here.

You can also upload your DNA file to a fifth company, Living DNA, but they are significantly smaller and heavily focused on England and Great Britain. However, if that’s where you’re searching, this might be where you find important matches.

You can also upload to GEDMatch, a popular third-party database, but since you’re going to be in the databases of the four major testing companies, there is little to be gained at GEDMatch in terms of people who have not tested at one of the major companies. Do NOT upload to GEDMatch INSTEAD of testing or uploading to the four major sites, as GEDMatch only has a small fraction of the testers in each of the vendor databases.

What GEDMatch does offer is a chromosome browser – something that Ancestry does NOT offer, along with other clustering tools which you may find useful. I recommend GEDMatch in addition to the others, if needed or desired.

Ordering Y and Mitochondrial DNA Tests

We reviewed the basics of the different kinds of DNA, here.

Some people have asked why, if autosomal DNA shows relatives on all of your lines, would one would want to order specific tests that focus on just one line?

It just so happens that the two lines that Y and mitochondrial DNA test ARE the two lines you’re seeking – direct maternal – your mother (and her mother), and direct paternal, your father (and his father.)

These two tests are different kinds of DNA tests, testing a different type of DNA, and provide very focused information, and matches, not available from autosomal DNA tests.

For men, Y DNA can reveal your father’s surname, which can be an invaluable clue in narrowing paternal candidates. Knowing that my brother’s Y DNA matched several men with the surname of Priest made me jump for joy when he matched a woman of that same last name at another vendor.

Here’s a quote from one of the members of a Y DNA project where I’m the volunteer administrator:

“Thank you for your help understanding and using all 4 kinds of my DNA results. By piecing the parts together, I identified my father. Specifically, without Y DNA testing, and the Big Y test, I would not have figured out my parental connection, and then that my paternal line had been assigned to the wrong family. STR testing gave me the correct surname, but the Big Y test showed me exactly where I fit, and disproved that other line. I’m now in touch with my father, and we both know who our relatives are – two things that would have never happened otherwise.”

If you fall into the category of, “I want to know everything I can now,” then order both Y and mitochondrial DNA tests initially, along with those two autosomal tests.

You will need to order Y (males only) and mitochondrial DNA tests separately from the autosomal Family Finder test, although you should order on the same account as your Family Finder test at FamilyTreeDNA.

If you take the Family Finder autosomal test at FamilyTreeDNA or upload your autosomal results from another vendor, you can simply select to add the Y and mitochondrial DNA tests to your account, and they will send you a swab kit.

Conversely, you can order either a Y or mitochondrial DNA test, and then add a Family Finder or upload a DNA file if you’ve already taken an autosomal DNA test to that account too. Note – these might not be current prices – check here for sales.

You will want all 3 of your tests on the same account so that you can use the Advanced Matches feature.

Using Advanced Matches, you’ll be able to view people who match you on combinations of multiple kinds of tests.

For example, if you’re a male, you can see if your Y DNA matches also match you on the Family Finder autosomal test, and if so, how closely?

Here’s an example.

In this case, I requested matches to men with 111 markers who also match the tester on the Family Finder test. I discovered both a father and a full sibling, plus a few more distant matches. There were ten total combined matches to work with, but I’ve only shown five for illustration purposes.

This information is worth its weight in gold.

Is the Big Y Test Worth It?

People ask if the Big Y test is really worth the extra money.

The answer is, “it depends.”

If all you’re looking for are matching surnames, then the answer is probably no. A 37 or 111 marker test will probably suffice. Eventually, you’ll probably want to do the Big Y, though.

If you’re looking for exact placement on the tree, with an estimated distance to other men who have taken that test, then the answer is, “absolutely.” I wish the Big Y test had been available back when I was hunting for my brother’s biological family.

The Big Y test provides a VERY specific haplogroup and places you very accurately in your location on the Y DNA tree, along with other men of your line, assuming they have tested. You may find the surname, as well as being placed within a generation or a few of current in that family line.

Additionally, the Discover page provides estimates of how far in the past you share a common ancestor with other people that share the same haplogroup. This can be a HUGE boon to a male trying to figure out his surname line and how closely in time he’s related to his matches.

Big Y NPE Examples

Y DNA SNP mutations tested with the Big Y test accrue a mutation about every generation, or so. Sometimes we see mutations in every generation.

Here’s an example from my Campbell line. Haplogroups are listed in the top three rows.

I created this spreadsheet, but FamilyTreeDNA provides a block tree for Big Y testers. I’ve added the genealogy of the testers, with the various Big Y testers at the bottom and common ancestors above, in bold.

We have two red NPE lines showing. The MacFarlane tester matches M. Campbell VERY closely, and two Clark males match W. Campbell and other Campbells quite closely. We utilized autosomal plus the Y results to determine where the unknown parentage events occurred. Today, if you’re a Clark or MacFarlane male, or a male by any other surname who was fathered by a Y chromosome Campbell male (by any surname), you’ll know exactly where you fit in this group of testers on your direct paternal line.

Y DNA is important because men often match other men with the same surname, which is a HUGE clue, especially in combination with autosomal DNA results. I say “often,” because it’s possible that no one in your line has tested, or that your father’s surname is not his biological surname either.

Y and mitochondrial DNA matches can be HUGELY beneficial pieces of information either by confirming a close autosomal relationship on that line, or eliminating the possibility.

Lineage-Specific Population Information

In addition to matching other people, both Y and mitochondrial DNA tests provide you with lineage-specific population or “ethnicity” information for this specific line which helps you focus your research.

For example, if you view the Y DNA Haplogroup Origins shown for this tester, you’ll discover that these matches are Jewish.

The tester might not be Jewish on any other genealogical line, but they definitely have Jewish ancestry on their Y DNA, paternal, line.

The same holds true for mitochondrial DNA as well. The main difference with mitochondrial DNA is that the surname changes with each generation, haplogroups today (pre-Million Mito) are less specific, and fewer people have been tested.

Y and Mitochondrial DNA Benefits

Knowing your Y and mitochondrial DNA haplogroups not only arm you with information about yourself, they provide you with matching tools and an avenue to include or exclude people as your direct line paternal or maternal ancestors.

Your Y and mitochondrial DNA can also provide CRITICALLY IMPORTANT information about whether that direct line ancestor belonged to an endogamous population, and where they came from.

For example, both Jewish and Native populations are endogamous populations, meaning highly intermarried for many generations into the past.

Knowing that helps you adjust your autosomal relationship analysis.

Why Order Multiple Tests Initially Instead of Waiting?

If you’ve been adding elapsed time, two autosomal tests (Ancestry and 23andMe), two uploads (to FamilyTreeDNA and MyHeritage,) a Y DNA test, and a mitochondrial DNA test, if all purchased serially, one following the other, means you’ll be waiting approximately 6-8 months.

Do you want to wait 6-8 months for all of your results? Can you afford to?

Part of this answer has to do with what, exactly, you’re seeking, and how patient you are.

Only you can answer that question.

A Name or Information?

Are you seeking the name or identity of a person, or are you seeking information about that person?

Most people don’t just want to put a name to the person they are seeking – they want to learn about them and the rest of the family that door opens.

You will have different matches at each company. Even after you identify the person you seek, the people you match may have trees you can view, with family photos and other important information. (Remember, you can’t see living people in trees.) Your matches may have first-person information about your relative and may know them if they are living, or have known them.

Furthermore, you may have the opportunity to meet that person. Time delayed may not be able to be recovered or regained.

One cousin that I assisted discovered that his father had died just six weeks before he broke through that wall and made the connection.

Working with data from all vendors simultaneously will allow you to combine that data and utilize it together. Using your “best” matches at each company, augmented by X, Y, and/or mitochondrial DNA, can make MUCH shorter work of this search.

Your closest autosomal matches are the most important and insightful. In this series, I will be working with the top 15 autosomal results at each vendor, at least initially. This approach provides me with the best chance of meaningful close relationship discoveries.

Data and Vendor Results Integration

Here’s a table of my two closest maternal and paternal matches at the four major vendors. I can assign these to maternal or paternal sides, because I know the identity of my parents, and I know some of these people. If an adoptee was doing this, the top 4 could all be from one parent, which is why we work with the top 15 or so matches.

Vendor Closest Maternal Closest Paternal Comments
Ancestry 1C, 1C1R Half-1C, 2C I recognized both of the maternal and neither of the paternal.
23andMe 2C, 2C 1C1R, half-gr-niece Recognized both maternal, one paternal
MyHeritage Mother uploaded, 1C Half-niece, half-1C Recognized both maternal, one paternal
FamilyTreeDNA Mother tested, 1C1R Parent/child, half-gr-niece uploaded Recognized all 4

To be clear, I tested my mother’s mitochondrial DNA before she passed away, but because FamilyTreeDNA archives DNA samples for 25 years, as the owner/manager of her DNA kit, I was able to order the Family Finder test after she had passed away. Her tests are invaluable today.

Then, years later, I uploaded her results to MyHeritage.

If I was an adopted child searching for my mother, I would find her results in both databases today. She’ll never be at either 23andMe or Ancestry because she passed away before she could test there and they don’t accept uploads.

Looking at the other vendors, my half-niece at MyHeritage is my paternal half-sibling’s daughter. My half-sibling is deceased, so this is as close as I’ll ever get to matching her.

At 23andMe, the half-great-niece is my half-siblings grandchild.

It’s interesting that I have no matches to descendants of my other half-sibling, who is also deceased. Maybe I should ask if any of his children or grandchildren have tested. Hmmmm…..

You can see that I stand a MUCH BETTER chance of figuring out close relatives using the combined closest matches of all four databases instead of the top matches from just one database. It doesn’t matter if the database is large if the right person or people didn’t test there.

Combine Resources

I’ll be providing analysis methodologies for working with results from all of the vendors together, just in case your answer is not immediately obvious. Taking multiple DNA tests facilitates using all of these tools immediately, not months later. Solving the puzzle sooner means you may not miss valuable opportunities.

You may also discover that the door slams shut with some people, or they may not respond to your queries, but another match may be unbelievably helpful. Don’t limit your possibilities.

Let’s take a look at the strengths of each vendor.

Vendor Strengths and Things to Know

Every vendor has product strengths and idiosyncracies that the others do not. All vendors provide matches and shared matches. Each vendor provides ethnicity tools which certainly can be useful, but the features differ and will be covered elsewhere.

  • AncestryAncestry has the largest autosomal database and includes ThruLines, but no Y or mitochondrial DNA testing, no clusters, no chromosome browser, no triangulation, and no X chromosome matching or reporting. Ancestry provides genealogical records, advanced tools, and full tree access to your matches’ trees with an Ancestry subscription. Ancestry does not allow downloading your match list or segment match information, but the other vendors do.
  • 23andMe 23andMe has the second largest database. They provide triangulation and genetic trees that include your closest matches. Many people test at 23andMe for health and wellness information, so 23andMe has people in their database who are not specifically interested in genealogy and probably won’t have tested elsewhere, but may be invaluable to your search. 23andMe provides Y and mtDNA high-level haplogroups only, but no matching or other haplogroup information. If you purchase a new test or have a V5 ancestry+health current test, you can expand your matches from a limit of 1500 to about 5000 with an annual membership. For seeking close relatives, you don’t need those features, but you may want them for genealogy. 23andMe is the only vendor that limits their customers’ matches.
  • MyHeritageMyHeritage has the third largest database that includes lots of European testers. MyHeritage provides triangulation, Theories of Family Relativity, and an integrated cluster tool* but does not report X matches and does not offer Y or mitochondrial DNA testing. MyHeritage accepts autosomal DNA file uploads from other testing companies for free and provides access to advanced DNA features for a one-time unlock fee. MyHeritage includes genealogical records and full feature access to advanced DNA tools with a Complete Subscription. (Free 15 days trial subscription, here.)
  • FamilyTreeDNA Family Finder (autosomal)FamilyTreeDNA is the oldest DNA testing company, meaning their database includes people who initially tested 20+ years ago and have since passed away. This, in essence, gets you one generation further back in time, with the possibility of stronger matches. Their Family Matching feature buckets and triangulates your matches, assigning them to your maternal or paternal sides if you link known matches to their proper place in your tree, even if your parents have not tested. FamilyTreeDNA accepts uploads from other testing companies for free and provides advanced DNA features for a one time unlock fee.
  • FamilyTreeDNAFamilyTreeDNA is the only company that offers both Y and mitochondrial DNA testing products that include matching, integration with autosomal test results, and other tools. These two tests are lineage-specific and don’t have to be sorted from your other ancestral lines.

I wrote about using Y DNA results, here.

I wrote about using mitochondrial DNA results, here.

*Third parties such as Genetic Affairs provide clustering tools for both 23andMe and FamilyTreeDNA. Clustering is integrated at MyHeritage. Ancestry does not provide a tool for nor allow third-party clustering. If the answer you seek isn’t immediately evident, Genetic Affairs clustering tools group people together who are related to each other, and you, and create both genetic and genealogical trees based on shared matches. You can read more about their tools, here.

Fish in all the Ponds and Use All the Bait Possible

Here’s the testing and upload strategy I recommend, based on the above discussion and considerations. The bottom line is this – if you want as much information as possible, as quickly as possible, order the four tests in red initially. Then transfer the first autosomal test results you receive to the two companies identified in blue. Optionally, GEDMatch may have tools you want to work with, but they aren’t a testing company.

What When Ancestry 23andMe MyHeritage FamilyTreeDNA
Order autosomal Initially X X    
Order Y 111 or Big-Y DNA test if male Initially       X
Order mitochondrial DNA test Initially if desired       X
Upload free autosomal When Ancestry or 23andMe results are available     X X
Unlock Advanced Tools When you upload     $29 $19
Optional GEDMatch free upload If desired, can subscribe for advanced tools

When you upload an autosomal DNA file to a vendor site, only upload one file per site, per tester. Otherwise, multiple tests simply glom up everyone’s match list with multiple matches to the same person.

Multiple vendor sites will hopefully provide multiple close matches, which increase your opportunity to discover INFORMATION about your family, not just the identity of the person you seek.

Or maybe you prefer to wait and order these DNA tests serially, waiting until one set of results is back and you’re finished working with them before ordering the next one. If so, that means you’re a MUCH more patient person than me. 😊

Our next article in this series will be about endogamy, how to know if it applies to you, and what that means to your search.

_____________________________________________________________

Follow DNAexplain on Facebook, here or follow me on Twitter, here.

Share the Love!

You’re always welcome to forward articles or links to friends and share on social media.

If you haven’t already subscribed (it’s free,) you can receive an email whenever I publish by clicking the “follow” button on the main blog page, here.

You Can Help Keep This Blog Free

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Uploads

Genealogy Products and Services

My Book

Genealogy Books

Genealogy Research

DNA: In Search of… New Series Launches

Today, I’m excited to announce a new series titled “DNA: In Search of…”

I receive queries every single day about how to search for either unknown parents or unknown grandparents using genetic genealogy.

While some of the techniques are the same when searching for these different people, others vary and depend on a combination of factors: Continue reading

FamilyTreeDNA DISCOVER™ Launches – Including Y DNA Haplogroup Ages

FamilyTreeDNA just released an amazing new group of public Y DNA tools.

Yes, a group of tools – not just one.

The new Discover tools, which you can access here, aren’t just for people who have tested at FamilyTreeDNA . You don’t need an account and it’s free for everyone. All you need is a Y DNA haplogroup – from any source.

I’m going to introduce each tool briefly because you’re going to want to run right over and try Discover for yourself. In fact, you might follow along with this article.

Y DNA Haplogroup Aging

The new Discover page provides seven beta tools, including Y DNA haplogroup aging.

Haplogroup aging is THE single most requested feature – and it’s here!

Discover also scales for mobile devices.

Free Beta Tool

Beta means that FamilyTreeDNA is seeking your feedback to determine which of these tools will be incorporated into their regular product, so expect a survey.

If you’d like changes or something additional, please let FamilyTreeDNA know via the survey, their support line, email or Chat function.

OK, let’s get started!

Enter Your Haplogroup

Enter your Y DNA haplogroup, or the haplogroup you’re interested in viewing.

If you’re a male who has tested with FamilyTreeDNA , sign on to your home page and locate your haplogroup badge at the lower right corner.

If you’re a female, you may be able to test a male relative or find a haplogroup relevant to your genealogy by visiting your surname group project page to locate the haplogroup for your ancestor.

I’ll use one of my genealogy lines as an example.

In this case, several Y DNA testers appear under my ancestor, James Crumley, in the Crumley DNA project.

Within this group of testers, we have two different Big Y haplogroups, and several estimated haplogroups from testers who have not upgraded to the Big Y.

If you’re a male who has tested at either 23andMe or LivingDNA, you can enter your Y DNA haplogroup from that source as well. Those vendors provide high-level haplogroups.

The great thing about the new Discover tool is that no matter what haplogroup you enter, there’s something for you to enjoy.

I’m going to use haplogroup I-FT272214, the haplogroup of my ancestor, James Crumley, confirmed through multiple descendants. His son John’s descendants carry haplogroup I-BY165368 in addition to I-FT272214, which is why there are two detailed haplogroups displayed for this grouping within the Crumley haplogroup project, in addition to the less-refined I-M223.

Getting Started

When you click on Discover, you’ll be asked to register briefly, agree to terms, and provide your email address.

Click “View my report” and your haplogroup report will appear.

Y DNA Haplogroup Report

For any haplogroup you enter, you’ll receive a haplogroup report that includes 7 separate pages, shown by tabs at the top of your report.

Click any image to enlarge

The first page you’ll see is the Haplogroup Report.

On the first page, you’ll find Haplogroup aging. The TMRCA (time to most recent common ancestor) is provided, plus more!

The report says that haplogroup I-FT272214 was “born,” meaning the mutation that defines this haplogroup, occurred about 300 years ago, plus or minus 150 years.

James Crumley was born about 1710. We know his sons carry haplogroup I-FT272214, but we don’t know when that mutation occurred because we don’t have upstream testers. We don’t know who his parents were.

Three hundred years before the birth of our Crumley tester would be about 1670, so roughly James Crumley’s father’s generation, which makes sense.

James’ son John’s descendants have an additional mutation, so that makes sense too. SNP mutations are known to occur approximately every 80 years, on average. Of course, you know what average means…may not fit any specific situation exactly.

The next upstream haplogroup is I-BY100549 which occurred roughly 500 years ago, plus or minus 150 years. (Hint – if you want to view a haplogroup report for this upstream haplogroup, just click on the haplogroup name.)

There are 5 SNP confirmed descendants of haplogroup I-FT272214 claiming origins in England, all of whom are in the Crumley DNA project.

Haplogroup descendants mean this haplogroup and any other haplogroups formed on the tree beneath this haplogroup.

Share

If you scroll down a bit, you can see the share button on each page. If you think this is fun, you can share through a variety of social media resources, email, or copy the link.

Sharing is a good way to get family members and others interested in both genealogy and genetic genealogy. Light the spark!

I’m going to be sharing with collaborative family genealogy groups on Facebook and Twitter. I can also share with people who may not be genealogists, but who will think these findings are interesting.

If you keep scrolling under the share button or click on “Discover More” you can order Y DNA tests if you’re a biological male and haven’t already taken one. The more refined your haplogroup, the more relevant your information will be on the Discover page as well as on your personal page.

Scrolling even further down provides information about methods and sources.

Country Frequency

The next tab is Country Frequency showing the locations where testers with this haplogroup indicate that their earliest known ancestors are found.

The Crumley haplogroup has only 5 people, which is less than 1% of the people with ancestors from England.

However, taking a look at haplogroup R-M222 with many more testers, we see something a bit different.

Ireland is where R-M222 is found most frequently. 17% of the men who report their ancestors are from Ireland belong to haplogroup R-M222.

Note that this percentage also includes haplogroups downstream of haplogroup R-M222.

Mousing over any other location provides that same information for that area as well.

Seeing where the ancestors of your haplogroup matches are from can be extremely informative. The more refined your haplogroup, the more useful these tools will be for you. Big Y testers will benefit the most.

Notable Connections

On the next page, you’ll discover which notable people have haplogroups either close to you…or maybe quite distant.

Your first Notable Connection will be the one closest to your haplogroup that FamilyTreeDNA was able to identify in their database. In some cases, the individual has tested, but in many cases, descendants of a common ancestor tested.

In this case, Bill Gates is our closest notable person. Our common haplogroup, meaning the intersection of Bill Gates’s haplogroup and my Crumley cousin’s haplogroup is I-L1195. The SNP mutation that defines haplogroup I-L1145 occurred about 4600 years ago. Both my Crumley cousin and Bill Gates descend from that man.

If you’re curious and want to learn more about your common haplogroup, remember, you can enter that haplogroup into the Discover tool. Kind of like genetic time travel. But let’s finish this one first.

Remember that CE means current era, or the number of years since the year “zero,” which doesn’t technically exist but functions as the beginning of the current era. Bill Gates was born in 1955 CE

BCE means “before current era,” meaning the number of years before the year “zero.” So 2600 BCE is approximately 4600 years ago.

Click through each dot for a fun look at who you’re “related to” and how distantly.

This tool is just for fun and reinforces the fact that at some level, we’re all related to each other.

Maybe you’re aware of more notables that could be added to the Discover pages.

Migration Map

The next tab provides brand spanking new migration maps that show the exodus of the various haplogroups out of Africa, through the Middle East, and in this case, into Europe.

Additionally, the little shovel icons show the ancient DNA sites that date to the haplogroup age for the haplogroup shown on the map, or younger. In our case, that’s haplogroup I-M223 (red arrow) that was formed about 16,000 years ago in Europe, near the red circle, at left. These haplogroup ancient sites (shovels) would all date to 16,000 years ago or younger, meaning they lived between 16,000 years ago and now.

Click to enlarge

By clicking on a shovel icon, more information is provided. It’s very interesting that I-L1145, the common haplogroup with Bill Gates is found in ancient DNA in Cardiff, Wales.

This is getting VERY interesting. Let’s look at the rest of the Ancient Connections.

Ancient Connections

Our closest Ancient Connection in time is Gen Scot 24 (so name in an academic paper) who lived in the Western Isles of Scotland.

These ancient connections are more likely cousins than direct ancestors, but of course, we can’t say for sure. We do know that the first man to develop haplogroup I-L126, about 2500 years ago, is an ancestor to both Gen Scot 24 and our Crumley ancestor.

Gen Scot 24 has been dated to 1445-1268 BCE which is about 3400 years ago, which could actually be older than the haplogroup age. Remember that both dating types are ranges, carbon dating is not 100% accurate, and ancient DNA can be difficult to sequence. Haplogroup ages are refined as more branches are discovered and the tree grows.

The convergence of these different technologies in a way that allows us to view the past in the context of our ancestors is truly amazing.

All of our Crumley cousin’s ancient relatives are found in Ireland or Scotland with the exception of the one found in Wales. I think, between this information and the haplogroup formation dates, it’s safe to say that our Crumley ancestors have been in either Scotland or Ireland for the past 4600 years, at least. And someone took a side trip to Wales, probably settled and died there.

Of course, now I need to research what was happening in Ireland and Scotland 4600 years ago because I know my ancestors were involved.

Suggested Projects

I’m EXTREMELY pleased to see suggested projects for this haplogroup based on which projects haplogroup members have joined.

You can click on any of the panels to read more about the project. Remember that not everyone joins a project because of their Y DNA line. Many projects accept people who are autosomally related or descend from the family through the mitochondrial line, the direct mother’s line.

Still, seeing the Crumley surname project would be a great “hint” all by itself if I didn’t already have that information.

Scientific Details

The Scientific Details page actually has three tabs.

The first tab is Age Estimate.

The Age Estimate tab provides more information about the haplogroup age or TMRCA (Time to Most Recent Common Ancestor) calculations. For haplogroup I-FT272214, the most likely creation date, meaning when the SNP occurred, is about 1709, which just happens to align well with the birth of James Crumley about 1710.

However, anyplace in the dark blue band would fall within a 68% confidence interval (CI). That would put the most likely years that the haplogroup-defining SNP mutation took place between 1634 and 1773. At the lower end of the frequency spectrum, there’s a 99% likelihood that the common ancestor was born between 1451 and 1874. That means we’re 99% certain that the haplogroup defining SNP occurred between those dates. The broader the date range, the more certain we can be that the results fall into that range.

The next page, Variants, provides the “normal” or ancestral variant and the derived or mutated variant or SNP (Single Nucleotide Polymorphism) in the position that defines haplogroup I-FT272214.

The third tab displays FamilyTreeDNA‘s public Y DNA Tree with this haplogroup highlighted. On the tree, we can see this haplogroup, downstream haplogroups as well as upstream, along with their country flags.

Your Personal Page

If you have already taken a DNA test at FamilyTreeDNA, you can find the new Discover tool conveniently located under “Additional Tests and Tools.”

If you are a male and haven’t yet tested, then you’ll want to order a Y DNA test or upgrade to the Big Y for the most refined haplogroup possible.

Big Y tests and testers are why the Y DNA tree now has more than 50,000 branches and 460,000 variants. Testing fuels growth and growth fuels new tools and possibilities for genealogists.

What Do You Think?

Do you like these tools?

What have you learned? Have you shared this with your family members? What did they have to say? Maybe we can get Uncle Charley interested after all!

Let me know how you’re using these tools and how they are helping you interpret your Y DNA results and assist your genealogy.

_____________________________________________________________

Follow DNAexplain on Facebook, here or follow me on Twitter, here.

Share the Love!

You’re always welcome to forward articles or links to friends and share on social media.

If you haven’t already subscribed (it’s free,) you can receive an email whenever I publish by clicking the “follow” button on the main blog page, here.

You Can Help Keep This Blog Free

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Uploads

Genealogy Products and Services

My Book

Genealogy Books

Genealogy Research

Mitochondrial Eve Gets a Great-Granddaughter: African Mitochondrial Haplogroup L7 Discovered

Such wonderful news today!

We have a birth announcement, of sorts, detailed in our new paper released just today,  “African mitochondrial haplogroup L7: a 100,000-year-old maternal human lineage discovered through reassessment and new sequencing.”

Woohoo, Mitochondrial Eve has a new great-granddaughter!

Back in 2018, Goran Runfeldt and Bennett Greenspan at FamilyTreeDNA noticed something unusual about a few mitochondrial DNA sequences, but there weren’t enough sequences to be able to draw any conclusions. As time went on, more sequences became available, both in the FamilyTreeDNA database and in the academic community, including an ancient sequence.

This group of sequences did not fit cleanly into the phylogenetic tree as structured and seemed to cluster together, but more research and analysis were needed.

Were these unique sequences a separate branch? One branch or several? What would creating that branch do to the rest of the tree?

Given that Phylotree, last updated in 2016, did not contain an applicable branch, what were we to do with these puzzle pieces that really didn’t fit?

These discussions, and others similar, led to the decision to launch the Million Mito Project to update the mitochondrial phylogenetic tree which is now 6 years old and seriously out-of-date. For the record, phylogenetics on this scale is EXTREMELY challenging, which is probably why Phylotree hasn’t been updated, but that’s a topic for another article, another day. Today is the day to celebrate haplogroup L7.

Haplogroup L7

The Million Mito team knew there were lots of candidate haplogroups waiting to be formed near the ends of the branches of the phylotree, but what we didn’t expect was a new haplogroup near the root of the tree.

Put another way, in terms that genealogists are used to, the new branch is Eve’s great-granddaughter.

Haplogroup L now has 8 branches, instead of 7, beginning with L0. We named this new branch haplogroup L7 in order not to disrupt the naming patterns in the existing tree.

Let’s take a look.

I used the phylogenetic tree from our paper and added Eve.

Just to be clear, we aren’t talking literal daughters and granddaughters. These are phylogenetic daughters which represent many generations between each (known) branch. Of course, we can only measure the branches that survived and are tested today or are found in ancient DNA.

The only way we have of discovering and deciphering Eve and her “tree” of descendants is identifying mutations that occurred, providing breadcrumbs back in time that allow us to reconstruct Eve’s mitochondrial DNA sequence.

Those mutations are then carried forever in daughter branches (barring a back-mutation). This means that, yes, you and I have all of those mutations today – in addition to several more that define our individual branches.

You can see that Eve has two daughter branches. One branch, at left, is L0.

Eve’s daughter to the right, which I’ve labeled, is the path to the new L7 branch.

Before this new branch was identified, haplogroup L5 existed. Now, Eve has a new great-granddaughter branch L5’7 that then splits into two branches; L5 and L7.

L5 is the existing branch, but L7 is the new branch that includes a few sequences formerly misattributed to L5.

Even more exciting, the newly discovered haplogroup L7 has sub-branches too, including L7a, L7a1, L7b1 and L7b2.

In fact, haplogroup L7 has a total of 13 sublineages.

How Cool is This?!!

Haplogroup L7 is 100,000 years old. This is the oldest lineage since haplogroup L5 was discovered 20 years ago. To put this in perspective, that’s about the same time the first full sequence mitochondrial DNA test was offered to genealogists.

It took 20 years for enough people to test, and two eagle-eyed scientists to notice something unusual.

Hundreds of thousands of people have had their mitochondrial DNA tested, and so far, only 19 people are assigned to haplogroup L7 or a subgroup.

One of those people, shown as L7a* on the tree above, is 80,000 years removed from their closest relative. Yes, their DNA is hens-teeth rare. No, they don’t have any matches at FamilyTreeDNA, just in case you were wondering😊

However, in time, as more people test, they may well have matches. This is exactly why I encourage everyone to take a mitochondrial DNA test. If someone is discouraged from testing, you never know who they might have matched – or how rare their DNA may be. If they don’t test, that opportunity is lost forever – to them, to other people waiting for a match, and to science.

Are there other people out there with this haplogroup, in either Africa or the diaspora? Let’s hope so!

With so few L7 people existing today, it looks like this lineage might have been on the verge of extinction at some point, but somehow survived and is now found in a few places around the world.

Ancient DNA

One 16,000-year-old ancient DNA sample from Malawi has been reclassified from L5 to L7.

This figure from the paper shows the distribution of haplogroup L within Africa, and the figure below shows the Haplogroup L7 range within Africa, with Tanzania having the highest frequency. Malawi abuts Tanzania on the Southwest corner.

Where in the World?

Checking on the public tree at FamilyTreeDNA, you can see the new L5’7 branch with L7 and sub-haplogroups beneath.

We find L7 haplogroups in present-day testers from:

  • South Africa
  • Kenya
  • Ethiopia
  • Sudan
  • United Arab Emirates
  • Yemen
  • Tanzania

It’s also found in people who live in two European countries now, but with their roots reaching back into Africa. Surprisingly, no known African-Americans have yet tested with this haplogroup. I suspect finding the haplogroup in the Americas is just a matter of time, and testing.

The FamilyTreeDNA customers who are lucky enough to be in haplogroup L7 have had their haplogroup badges updated.

If you are haplogroup L at FamilyTreeDNA, check and see if you have a new badge.

Credit Where Credit is Due

I want to give a big shout-out to my colleagues and co-authors. Dr. Paul Maier (lead author,) Dr. Miguel Vilar and Goran Runfeldt.

I can’t even begin to express the amount of heavy lifting these fine scientists did on the long journey from initial discovery to publication. This includes months of analysis, writing the paper, creating the graphics, and recording a video which will be available soon.

I’m especially grateful to people like you who test their DNA, and academic researchers who continue to sequence mitochondrial DNA in both contemporary and ancient samples. Without testers, there would be no scientific discoveries, nor genealogy matching. If you haven’t yet tested, you can order (or upgrade) a mitochondrial DNA test here.

I also want to thank both Bennett Greenspan, Founder, and President, Emeritus of FamilyTreeDNA who initially greenlit the Million Mito Project in early 2020, and Dr. Lior Rauschberger, CEO who continues to support this research.

FamilyTreeDNA paid the open access fees so the paper is free for everyone, here, and not behind a paywall. If you’re downloading the pdf, be sure to download the supplements too. Lots of graphics and images that enhance the article greatly.

Congratulations to Mitochondrial Eve for this new branch in her family tree. Of course, her family tree is your family and mine – the family of man and womankind!

_____________________________________________________________

Follow DNAexplain on Facebook, here or follow me on Twitter, here.

Share the Love!

You’re always welcome to forward articles or links to friends and share on social media.

If you haven’t already subscribed (it’s free,) you can receive an email whenever I publish by clicking the “follow” button on the main blog page, here.

You Can Help Keep This Blog Free

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Uploads

Genealogy Products and Services

My Book

Genealogy Books

Genealogy Research

Honors and Accolades – Thanks to You!

Before I share the good news, I’d like to thank you, my readers and followers – my tribe – for being my fans.

For reading what I write and watching what I produce. For sharing your thoughts. For inspiring me with your stories and questions. For supporting me.

It’s because of you that I’m privileged to write this article about recent honors and accolades.

Three, to be specific, or four, depending on how you count.

I’m truly humbled.

All three notifications arrived in my inbox within a few days this past week, which also corresponded to a difficult death anniversary in my family, so I really needed this boost.

Family Tree Magazine’s 101 Best Genealogy Websites

Family Tree Magazine compiles an annual best of the best list of 101 genealogy resources for genealogy enthusiasts to research our family trees.

I’m very pleased that DNAeXplained is included again this year.

You can see the full list of honorees, here and you can click on each category to learn more.

I encourage you to try something new.

How many of these sites have you never utilized? What might be waiting for you? Do you have a particularly thorny brick wall that needs to fall?

Maybe some of these resources don’t pertain to your areas of research, but others may.

You might have used some in the past but need to check back occasionally.

For example, DeadFred. You could find photos of your long-lost relatives, and you can also submit orphaned photos there as well.

You know I’m already searching for the surname of every ancestor in my tree that died after the advent of cameras in the mid-1800s! If not them, then maybe their children or siblings. Hope springs eternal!

I’m going to try one new website from the Family Tree Magazine list every day.

Which resource are you trying first? Let me know how it goes and if you find something fun.

Legacy Family Tree Webinar’s Top 10

I received an email from Geoff Rasmussen with Legacy Family Tree Webinars announcing that my webinar, Wringing Every Drop out of Mitochondrial DNA ranked number 5 in the top 10 webinars for May.

Truthfully, I was pleasantly surprised because mitochondrial DNA has often been the “neglected” DNA that we all carry. Hopefully, that “neglected” status will change and more people will test now that they understand how beneficial this tool can be, which means additional and more meaningful matches for all of us.

More than 2,200 people have viewed this webinar so far and received the extensive companion syllabus.

You can watch too by joining Legacy Family Tree Webinars, here, which gives you access to all 1787 webinars, and counting. New webinars are literally added daily, and you can register to watch live webinars along with recently recorded webinars for free for the first 7 days. Take a look.

If you haven’t yet tested your mitochondrial DNA, please do by clicking here.

By taking a mitochondrial DNA test at FamilyTreeDNA, you’ll also become a part of the exciting Million Mito Project which is literally rewriting the history of womankind.

 E-book Release and Lovely Book Review

I received a note from my publisher, Genealogical.com, who is also on Family Tree Magazine’s “Best Of” list again this year, telling me that my book, DNA for Native American Genealogy has been released as an e-book AND has received a major book review by Dr. Margaret McMahon. I think this should count as two really good things, not just one.

I wrote about the contents of my book, here, but Dr. Mac, as she is known, summed things up succinctly in her statement, “This book picks up where the theories end and your work begins.”

That was my goal, to educate my readers, explain the various tests and results, and provide a research roadmap. Do you have a family story of a Native American ancestor? Are you looking for answers?

Dr. Mac’s book review corresponds well with the recent release of the book in e-book (e-Pub) format. Here’s how to order:

Thank You, Thank You

Once again, thank you for your continuing support. I’ll have more interesting news soon!

_____________________________________________________________

Follow DNAexplain on Facebook, here or follow me on Twitter, here.

Share the Love!

You’re always welcome to forward articles or links to friends and share on social media.

If you haven’t already subscribed (it’s free,) you can receive an email whenever I publish by clicking the “follow” button on the main blog page, here.

You Can Help Keep This Blog Free

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Uploads

Genealogy Products and Services

My Book

Genealogy Books

Genealogy Research

Ancestry’s SideView™ – Dividing Your Ethnicity in Two

Recently, Ancestry introduced a new view of your ethnicity called SideView™. In a nutshell, AncestryDNA uses your DNA matches to attempt to divide your ethnicity into regions inherited from Parent 1 and Parent 2.

Based on your matches and the common DNA they share with you, Ancestry strives to divide your ethnicity into parental “sides,” although Ancestry can’t tell you which side is maternal and which side is paternal.

Even though Ancestry can’t tell you which side is which parent, there are tricks that might help you do just that.

Ethnicity

Before we look at SideView, let’s have a quick review of ethnicity estimates and how they do and don’t work.

Every vendor creates their own proprietary mathematical algorithm to determine their customers’ ethnicity or population percentages based on their own customer database and other resources.

“Country” boundaries change and people migrate. The article, Making Sense of Ethnicity Updates may be helpful.

If you haven’t done so, create a spreadsheet or chart identifying the amount of DNA you would inherit from each ancestor if exactly 50% of each ancestor’s DNA was passed down in each generation. Your spreadsheet may/will help you identify which “side” belongs to which parent. I provided instructions for calculating your expected ethnicity percentages based on your genealogy in the article, Concepts – Calculating Ethnicity Percentages.

My Genealogy

I’ve updated my genealogy totals slightly since that 2017 article because I’ve been able to push some of those lines back in time, either genealogically, via Big-Y or full sequence mitochondrial DNA testing and matching, or a combination of both.

Here’s an updated chart. I’ve included the last two ethnicity percentage results from each vendor except MyHeritage because their ethnicity results have remained the same for several years although they released Genetic Groups to complement ethnicity in 2020.

I’ve clustered geographies in regions because the vendors measure locations differently. Locations sometimes change within the same vendor with different releases.

The earlier “Unknown” genealogy category is gone now because I’ve been able to assign those ancestors to a geographic region if not an exact “country.”

The Genealogy Percent column, with a header and totals in yellow, details the geographic source for each of my 64 great-great-great-great-grandparents who each contributed approximately 1.5625% of my DNA. Of course, we know that DNA isn’t divided exactly in half in each generation, and I possibly inherited none of the DNA of some of those people and more than 1.5625% from others. Regardless, this is the best measuring stick of what I should expect and a way to determine if my ethnicity results are in the right ballpark.

The yellow cells in the vendor column totals reflect the “best fit” for my known genealogy percentages when compared against the expected percentages. In the Native grouping, vendors receive a yellow cell for identifying that heritage.

OK, now let’s take a look at Ancestry’s new SideView.

Finding SideView

At Ancestry, your ethnicity estimate, as well as your new SideView results, are found in the DNA Story section of your DNA Results Summary tab.

Ancestry does update your ethnicity estimate from time to time, so yours may have changed since you last viewed your results.

Ideally, if exactly half of the DNA of each ancestor was passed down in each generation, then I would have the amount of DNA shown in my personal chart, assuming my genealogy is accurate with no adoptions or unexpected parent events.

Also ideally, I would show exactly half of each of my parent’s ethnicity.

But that’s not how it works. While we do inherit half of our DNA from each parent, they can randomly give us all of a segment of DNA from one ancestor and not any of a segment of DNA from a different ancestor.

I wrote about how DNA is passed to children in the article, Ancestral DNA Percentages – How Much of Them is in You?. This explains how and why you might not inherit the exact amount of a specific ethnicity you would expect. Of course, that combined with each vendor’s different algorithms may produce results that surprise you, although the discrepancy should be relatively small.

Keep in mind how ethnicity inheritance works as you view your ethnicity results, including SideView.

SideView Results

Here are my Sideview results.

Remember, we don’t know which parent is “1” and which is “2.” Click on “How we identify this” to learn about how SideView works.

Here’s a more detailed description along with some nice graphics.

Analyzing My Results

SideView appears right beside your ethnicity map, so be sure to consult that map. Note that regions reflect populations, not necessarily countries as boundaries are drawn today.

The first thing I noticed is that my significant Dutch heritage, along with my French is missing in my ethnicity results as well as on the map.

How is this possible?

The arrows point to the Netherlands and France. These are important pieces of my ancestry on my Mother’s side. Mom was 25% Dutch so I should be about 12.5%. My maternal side genealogical breakdown is shown in the chart below.

Mom % I Should Inherit From Mom
German 50 25
Dutch 25 12.5
French/Acadian 12.5 6.25
England 12.5 6.25
Native ~2 in the Acadian line ~1

Of course, my Native American is also missing at Ancestry, even though the other three major vendors identify those segments. The two vendors who paint ethnicity by segment, 23andMe and FamilyTreeDNA report Native on the same segment on my chromosome 1, so it’s unlikely that both of those vendors are in error in exactly the same way.

While Native is critically important to my genealogy, it is a small percentage. Missing a small percentage, while frustrating, is more understandable than missing a larger percentage.

My Dutch ancestors at 12.5% and French at 6.25% are not trivial and together comprise more than one-third of my mother’s heritage and more than one-sixth of mine. And yes, in case you’re wondering, I do match many other people who descend from these same ancestors so it’s not a mistaken or misidentified ancestor issue.

My father was kind of a colonial mutt. Scottish, Irish, and English with a small smattering of African and Native along with 1.5% Scandinavian/Nordic. The African in the later versions tends to show as Middle Eastern or North African, or doesn’t show at all, but that segment with a small Native one cluster together on the same chromosome. I also match other people who are Native/African on those segments as well.

However, given that neither of those ethnicities appear at Ancestry, we don’t have those to work with, nor do we have specific segment information.

Let’s work with what we do have.

View Breakdown

I wish Ancestry did not say “Now, you can see which ethnicities you inherited from each parent,” because while that’s the goal, it isn’t always the case. Lots of people will simply accept that statement at face value.

Click on View breakdown.

You’ll see your results broken into two sides with the reported regions noted at the bottom. All regions are showing in the circle by default.

To see how this works, click on any single or combination of regions.

Determining Sides

What can we do to determine which side is which parent?

Let’s start with ethnicities or regions which should be unique to one parent and not the other.

I clicked on both Norway and Sweden/Denmark since I know that one couple on my father’s side is Scandinavian/Nordic, but I discovered that Ancestry assigned pieces of those regions to both Parent 1 and Parent 2.

I’m positive that my mother did not have any ancestors in the past 6 generations and significantly further back that were Scandinavian or Nordic, BUT, Germany and the Netherlands both border those regions. People traveled, wars happened and populations as a whole mixed, so while I’m confident of my genealogy, this actual ethnicity may be accurate even though it does not reflect genealogical locations. It may well reflect populations and admixture.

What I am sure of is that I can’t use these particular regions to identify which side is maternal or paternal.

Detailed Comparison

Let’s look at the detailed comparison you’ll see by scrolling down.

Can I identify any of these regions as solely connected with only one parent?

Yes, I can. Ancestry has assigned Germanic Europe to only one parent, and Mom is 50% German, so Parent 1 has to be Mother. I should expect to be assigned roughly half of what my mother has – so about 25% Germanic.

Mother has no Irish, so Ireland has to be Dad, which also correlates to known genealogy.

However, the rest of the ethnicity results are questionable, including Mom’s missing Dutch and her missing England and Northwest Europe which should total in the neighborhood of 37.5%. I would be expected to inherit about 18.75% of that from her. Where is it?

No Segments

I very much wish Ancestry provided segment information.

Using segment matching information from the other three vendors, including ethnicity segment information from both 23andMe and FamilyTreeDNA, I’ve painted my segments at DNAPainter, so I know which ancestors or ancestral lines contributed which DNA segments.

If Ancestry, provided users with segment information, I could determine which ethnicities they have attributed to which ancestors and maybe unravel why.

Another Possible Clue

There is one additional way I might be able to figure out where Ancestry attributed Mom’s Dutch and French heritage.

Given that I know which of my closest matches are maternal and paternal, I can utilize shared matching plus shared ethnicity to look for similarities. Just click on the match with someone, then on the Ethnicity tab.

The closest match on my mother’s side is my first cousin who also descends from my maternal grandparents. My cousin’s relevant parent should have roughly the same amount of the same ethnicities as my Mom since they were siblings, taking into account that not all of our ancestors’ DNA is passed in exactly half and siblings, unless they are identical twins, don’t inherit all of the same DNA from their parents. Of course, that means my first cousin should share roughly the same amount of DNA/ethnicity from our common grandparents as I do.

My cousin’s other parent is European with what appears to be a significant number of German ancestors, so we need to take that into account when viewing my cousin’s shared ethnicity comparison with me, above.

I can see that my cousin has 4% French and 1% Native, but that percentage might have been contributed by their other parent, especially since there is a French surname in that line.

If my cousin’s other parent had been African or Asian or an ethnicity that is different from the ethnicity of our shared line, it would be easier to compare our results meaningfully.

In this case, the shared match ethnicity comparison did not help, but your mileage may vary based on your unique circumstances.

Assign the Parent

If you are fortunate enough to be able to determine which parent is which, you can assign Parent 1 and 2 as maternal or paternal at Ancestry by clicking on the “Edit parents” icon at top right on the Detailed Comparison page.

I selected side 1 as Maternal based on the 35% Germanic Europe which is very clearly my mother’s side.

What I wish we could do, but we can’t, is to explain why we disagree with some portion of an assigned ethnicity. Ancestry does have my tree and I do have Thrulines from these ancestors, so the information is available for comparison should Ancestry choose to utilize that resource.

You can undo your selections by selecting “Back” or click on “Sounds good.”

I initially clicked on “Sounds good,” even though that bothers me. I hope that I’m not confirming something that’s incorrect, given my Mother’s missing Dutch and French, and that I’m not going to make *something* worse in the future by baking in bad ingredients. I’m not comfortable confirming something that’s significantly wrong. On the other hand, Parent 1 is clearly my mother, so I’m conflicted and I really don’t know exactly what I’m confirming to Ancestry.

In other words, we don’t know what Ancestry is doing under the hood with this information, if anything, other than labeling your sides.

Ultimately, I clicked “Back”, at least for now, to leave my sides unassigned until there is some benefit to me to identify the parental sides and I know I’m not confirming something that shouldn’t be confirmed. In other words, I know which parent is which, but I do NOT want to confirm that these ethnicities are fundamentally accurately assigned, because they are not.

Does Testing Your Parents Make a Difference?

If you’re wondering if testing your parents makes a difference with SideView predictions, it does not.

Ancestry is NOT utilizing your parents’ DNA for SideView ethnicity division, even if your parent or parents have tested, which Ancestry confirms in their documentation.

If you’re wondering why Ancestry doesn’t use your parent’s DNA to improve your SideView results, remember that someone who matches you at the parent/child level can be either your parent or your child. Often trees are either absent or incorrect, so Ancestry cannot simply assume anything.

Benefits of SideView?

What do you think?

Is there a benefit to SideView or is it simply interesting window dressing?

Are your SideView results accurate?

Do you feel that Sideview is accurate enough to be genealogically useful?

Are you able to utilize Sideview for your genealogy? If so, how?

_____________________________________________________________

Follow DNAexplain on Facebook, here or follow me on Twitter, here.

Share the Love!

You’re always welcome to forward articles or links to friends and share on social media.

If you haven’t already subscribed (it’s free,) you can receive an email whenever I publish by clicking the “follow” button on the main blog page, here.

You Can Help Keep This Blog Free

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Uploads

Genealogy Products and Services

My Book

Genealogy Books

Genealogy Research

Sarah Rash’s and Perhaps Mary Warren’s Mitochondrial DNA – 52 Ancestors #359

Using the FamilySearch “Relatives at RootsTech” app that was available in the month or so surrounding RootsTech (but not now), I connected with a cousin who is a direct matrilineal descendant of Sarah Rash, our common ancestor.

My cousin, who descends through Sarah’s daughter Rhoda Shepherd, very kindly agreed to take a full sequence mitochondrial DNA test so we now have information about Sarah Rash’s matrilineal origins.

I wrote about Sarah Rash and what we know of her life in Sarah Rash (1748-1829), Church Founder and Grandmother of Nearly 100.

Mitochondrial DNA Inheritance

Women contribute their mitochondrial DNA to all of their children, but only females pass it on. Therefore, mitochondrial DNA is never divided, watered down or mixed with the DNA of the father. Mitochondrial DNA provides an invaluable periscope view directly back in time for our matrilineal ancestors – our direct mother’s, mother’s, mother’s line on up our tree.

Sarah Rash was born to Joseph Rash and wife, Mary, purportedly Mary Warren.  Sarah’s mitochondrial DNA also belongs to her mother Mary. That would be Mary Warren if indeed Mary Warren is Sarah’s mother. Mary Warren’s parents are unknown. However, there is a Warren family in Spotsylvania County, VA, where the Rash family lived in that timeframe.

Goals

My goals for seeking a mitochondrial DNA test for Sarah Rash’s descendant are:

  • To confirm Sarah’s genealogical accuracy by matching another descendant, preferably through another daughter or sister of Sarah.
  • To learn what we can from Sarah’s haplogroup. You don’t know what you don’t know.
  • To gather evidence to confirm or perhaps disprove that Sarah’s mother is Mary Warren.
  • To potentially extend Sarah’s line backward in time.

The Process

Several people have asked me to step through the analysis process that I use for mitochondrial DNA results, so let’s do that.

What can we tell about Sarah’s ancestors through her mitochondrial DNA?.

Sarah’s Matrilineal Line is Not Native

Sometimes when the mother of an early pioneer settler can’t be identified, the “go-to” assumption is that she might be Native American.

Sarah’s haplogroup is U5a2a1d which is definitely NOT Native.

We can dispel this thought permanently.

Since Sarah’s matrilineal ancestors aren’t Native, where are they from?

Where Are Sarah’s Ancestors From?

Using the public mitochondrial tree, here, we see the following countries displayed for haplogroup U5a2a1d.

Sarah’s haplogroup is found most often in the US, which means brick-walled here, followed by England, Ireland, and less-frequent other locations. Note that two people claim Native, the feather, but that can mean either they are mistaken, or they have entered information for their mother’s “side” of the family or their literal “oldest ancestor,” not their specific matrilineal line.

Regardless, haplogroup U is unquestionably not Native.

Matches Map

Sometimes the matches map, which shows the geographic locations of your matches’ most distant matrilineal linear ancestor is very informative, but not so in this case.

Of 74 full sequence matches, only 4, plus the tester whose pin is white, have entered the locations of their matrilineal ancestors.

One of these contains a male name, so we know that’s incorrect.

This is really sad – a wasted opportunity. Imagine how useful this could be with 74 pins instead of 4, and one of those being recorded incorrectly.

Mutations

The mutations tab shows you the mutations you have that are either extra or missing from your haplogroup assignment. This means that these may be combined in the next version of the haplotree to form a new haplogroup.

My cousin has 5 extra mutations, but at least three of those are in unstable areas that I’m sure will not be utilized as haplogroup-forming. The other two mutations are insertions at one single location and I doubt those will be used either.

I wrote about haplogroup formation in the article, Mitochondrial DNA: Part 3 – Haplogroups Unraveled, including a list of unstable and common mutations. Suffice it to say that very common locations like 16519 and 315 insertions aren’t useful to form haplogroups. Some very common mutations, such as insertions at locations 309 and 315 and deletions at 522 and 523 aren’t even counted in matching/differences.

What these unstable mutations actually tell me, relative to Sarah Rash’s DNA is that I need to pay attention to the GD1 (genetic distance of 1) matches, meaning people who have only one mutation difference from my cousin. Given that my cousin’s extra mutations, differences from her defined haplogroup, are in unstable regions, close matches such as GD1 or even GD2 could be quite relevant. It all depends on the difference.

Of course, we can’t see the mutations of the people my cousin matches, so those with a GD1 or GD2 may have mutations on a stable marker that my cousin doesn’t have.

Matches

My cousin has a total of 74 full sequence matches, of which 31 are exact matches, 18 have trees and 12 have listed an earliest known ancestor (EKA). If you haven’t done so, here’s how to enter your EKA.

Of course, the EKA of my cousin’s matches may or may not agree with the earliest matrilineal person in their tree. And the tree may or may not have more than one or two people. Regardless, every hint is worth follow-up.

Think of these as diamonds in the rough.

Trees

I viewed the trees of each of the matches that have uploaded trees. I also made a list of the earliest known ancestors for matches that didn’t have trees so I could be cognizant of watching for those names.

Many trees only had a few generations, but I used Ancestry, FamilySearch, MyHeritage, and WikiTree to see if I could reasonably complete the tree back a little further. Of these, I particularly like WikiTree because I think it tends to be more accurate AND it allows for people to enter that they carry the mitochondrial DNA of specific ancestors. As it turns out, no one has done that for Sarah Rash, or her purported mother, Mary Warren, but if they had, it would provide a confirmation opportunity.

I did find something quite interesting.

Who is Jane Davis?

The EKA of Elizabeth, one of my cousin’s matches, is Jane Davis who was born in 1690.

Unfortunately, Elizabeth did not upload a GEDCOM file or create a tree, so I turned to other trees elsewhere to see what I could unearth about Jane Davis.

I need to state emphatically that what I’m about to tell you needs to be taken with the entire salt lick, not just a grain.

Remember, we’re looking for hints and evidence here, not foregone conclusions – although admittedly, those would be nice.

According to (cringe) some trees, Jane Davis was the wife of one William Warren who was born 1678 in Surry County, VA and died on September 29, 1764 in Edgecombe County, NC. I have not confirmed any of this. Gathering evidence is the first step in the process.

IF this is accurate, William Warren and Jane Davis may be the parents of Mary Warren, the purported mother of Sarah Rash.

Notice all of those weasel words – if, may, purported. That’s where we have to start. In weaselworld.

Obviously, this needs a LOT of traditional genealogy work, but here’s the great news…I now have something to work with and someone else, Elizabeth, who appears one way or another to be descended from this line.

The Good News

Whether or not Jane Davis is accurate or not, I’d wager that we are looking at the same line because Elizabeth matches my cousin’s mitochondrial DNA. I need to email Elizabeth to see if she descends through Sarah Rash. If so, that’s confirmation of this line.

If not, and she descends through a daughter of someone else in this same line, like one of Mary Warren’s sisters, that’s evidence and a HUGE HINT that I can use to confirm Mary Warren as the mother of Sarah Rash. Confirming her mother would also confirm that Mary’s father is William Warren – so would provide evidence for both of Sarah’s parents.

Additional Tools – Advanced Matches

Next, I used Advanced Matches to query for anyone who matches at both the full sequence level and in Family Finder. There were no matches, which doesn’t surprise me since it’s quite a way back in time.

Notice that the link to upload a family tree is in this section, along with the public haplotree I used earlier.

Family Finder

Checking my cousin’s Family Finder matches and searching for surnames, I immediately checked for myself and my known cousins from that line. No cigar, but our common ancestor is many generations in the past.

Checking the Rash surname for my cousin shows a match to someone who descends from Joseph Rash’s brother, William Rash whose children also migrated to Claiborne County, TN along with Sarah Rash’s daughter, Elizabeth Shepherd who married William McNiel.

My cousin has numerous autosomal matches to the McNiel line as well. The Vannoy, McNiel, Shepherd, and Rash lines were all found in Wilkes County, NC together before migrating to Claiborne and Hancock Counties in Tennessee. Before Wilkes County, the Rash, Warren, and McNiel families were in Spotsylvania and nearby counties in Virginia.

Goal Fulfillment

How did we do fulfilling our original goals?

Goal Comment
To confirm Sarah’s genealogical accuracy by matching another descendant. Perhaps – We have that lead to follow up on with Elizabeth and her EKA of Jane Davis. We also have several relevant autosomal matches.
To learn what we can learn from her haplogroup. Yes – Not Native and probably from England or Ireland. That is useful and makes sense.
To confirm her mother as Mary Warren. We now have hints and tools. We need to hear what Elizabeth has to say. I may be able to extract more information by viewing trees individually with people my cousin matches on Family Finder.
To potentially extend Sarah’s line backward in time. We now have a great hint and information to work with, both mitochondrial and autosomal. Jane Davis may be the wife of William Warren, which might well confirm Mary Warren as the daughter of William Warren. It’s too soon to tell but my fingers are crossed for a descendant of Jane Davis from a different daughter through all females.

Sometimes answers come in a gulley-washer, and other times, we have to dig and sift over time for the gems. Let’s create a plan.

What’s Next?

There’s a lot we can do, but maybe one of the best places to start would be to attempt to assemble information about the Warren families of Spotsylvania County, VA. This Thomas Warren might be a good place to begin or maybe work my way up from Mary Warren, here.

I need to focus on both traditional genealogy and genetic autosomal matches at all of the vendors. My cousin’s DNA is only at FamilyTreeDNA, but my results and those of several other cousins are found at several vendors.

I can use Genetic Affairs’ tools to see if I cluster with other people descended from the Warren family. My cousin can set up an account and do the same thing if she wishes. AutoTree and AutoKinship may help with that.

Using traditional genealogy, if I can identify other sisters of Mary Warren (daughters of Jane Davis,) I can ask people descended from them through all females to take a mitochondrial DNA test. If they match my cousin, that’s an exceptionally compelling piece of evidence.

Of course, I can do more work on the mitochondrial DNA matches we already have by emailing and asking for genealogy information. The piece of evidence we need might be right under our noses.

The Warren Family

If you descend from a Warren family in the Spotsylvania County area in the 1600s through 1700s, would you please check your matches to see if you have me, Vannoy, McNiel, McNeil, Rash or Shepherd matches? I’d love to narrow this down.

If you descend through all females from William Warren or another Warren family who would have been having children in the Spotsylvania County from about 1710 to maybe 1740, would you please reach out to me? If we can pinpoint a likely family for Mary Warren who was reportedly born in 1726, I’d love to do a confirming mitochondrial DNA test.

_____________________________________________________________

Follow DNAexplain on Facebook, here or follow me on Twitter, here.

Share the Love!

You’re always welcome to forward articles or links to friends and share on social media.

If you haven’t already subscribed (it’s free,) you can receive an email whenever I publish by clicking the “follow” button on the main blog page, here.

You Can Help Keep This Blog Free

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Uploads

Genealogy Products and Services

My Book

Genealogy Books

Genealogy Research