Genetic Affairs: AutoPedigree Combines AutoTree with WATO to Identify Your Potential Tree Locations

July 2020 Update: Please note that Ancestry issues a cease-and-desist order against Genetic Affairs, and this tool no longer works at Ancestry. The great news is that it still works at the other vendors, and you can ask Ancestry matches to transfer, which is free.

If you’re an adoptee or searching for an unknown parent or ancestor, AutoPedigree is just what you’ve been waiting for.

By now, we’re all familiar with Genetic Affairs who launched in 2018 with their signature autocluster tool. AutoCluster groups your matches into clusters by who your matches match with each other, in addition to you.

browser autocluster

A year later, in December 2019, Genetic Affairs introduced AutoTree, automated tree reconstruction based on your matches trees at Ancestry and Family Finder at Family Tree DNA, even if you don’t have a tree.

Now, Genetic Affairs has introduced AutoPedigree, a combination of the AutoTree reconstruction technology combined with WATO, What Are the Odds, as seen here at DNAPainter. WATO is a statistical probability technique developed by the DNAGeek that allows users to review possible positions in a tree for where they best fit.

Here’s the progressive functionality of how the three Genetic Affairs tools, combined, function:

  • AutoCluster groups people based on if they match you and each other
  • AutoTree finds common ancestors for trees from each cluster
  • Next, AutoTree finds the trees of all matches combined, including from trees of your DNA matches not in clusters
  • AutoPedigree checks to see if a common ancestor tree meets the minimum requirement which is (at least) 3 matches of greater to or equal to 30-40 cM. If yes, an AutoPedigree with hypotheses is created based on the common ancestor of the matching people.
  • Combined AutoPedigrees then reviews all AutoTrees and AutoPedigrees that have common ancestors and combine them into larger trees.

Let’s look at examples, beginning with DNAPainter who first implemented a form of WATO.

DNA Painter

Let’s say you’re trying to figure out how you’re related to a group of people who descend from a specific ancestral couple. This is particularly useful for someone seeking unknown parents or other unknown relationships.

DNA tools are always from the perspective of the tester, the person whose kit is being utilized.

At DNAPainter, you manually create the pedigree chart beginning with a common couple and creating branches to all of their descendants that you match.

This example at DNAPainter shows the matches with their cM amounts in yellow boxes.

xAutoPedigree DNAPainter WATO2

The tester doesn’t know where they fit in this pedigree chart, so they add other known lines and create hypothesis placeholder possibilities in light blue.

In other words, if you’re searching for your mother and you were born in 1970, you know that your mother was likely born between 1925 (if she was 45 when she gave birth to you) and 1955 (if she was 15 when she gave birth to you.) Therefore, in the family you create, you’d search for parents who could have given birth to children during those years and create hypothetical children in those tree locations.

The WATO tool then utilizes the combination of expected cMs at that position to create scores for each hypothesis position based on how closely or distantly you match other members of that extended family.

The Shared cM Project, created and recently updated by Blaine Bettinger is used as the foundation for the expected centimorgan (cM) ranges of each relationship. DNAPainter has automated the possible relationships for any given matching cM amount, here.

In the graphic above, you can see that the best hypothesis is #2 with a score of 1, followed by #4 and #5 with scores of 3 each. Hypothesis 1 has a score of 63.8979 and hypothesis 3 has a score of 383.

You’ll need to scroll to the bottom to determine which of the various hypothesis are the more likely.

Autopedigree DNAPainter calculated probability

Using DNAPainter’s WATO implementation requires you to create the pedigree tree to test the hypothesis. The benefit of this is that you can construct the actual pedigree as known based on genealogical research. The down-side, of course, is that you have to do the research to current in each line to be able to create the pedigree accurately, and that’s a long and sometimes difficult manual process.

Genetic Affairs and WATO

Genetic Affairs takes a different approach to WATO. Genetic Affairs removes the need for hand entry by scanning your matches at Ancestry and Family Tree DNA, automatically creating pedigrees based on your matches’ trees. In addition, Genetic Affairs automatically creates multiple hypotheses. You may need to utilize both approaches, meaning Genetic Affairs and DNAPainter, depending on who has tested, tree completeness at the vendors, and other factors.

The great news is that you can import the Genetic Affairs reconstructed trees into DNAPainter’s WATO tool instead of creating the pedigrees from scratch. Of course, Genetic Affairs can only use the trees someone has entered. You, on the other hand, can create a more complete tree at DNAPainter.

Combining the two tools leverages the unique and best features of both.

Genetic Affairs AutoPedigree Options

Recently, Genetic Affairs released AutoPedigree, their new tool that utilizes the reconstructed AutoTrees+WATO to place the tester in the most likely region or locations in the reconstructed tree.

Let’s take a look at an example. I’m using my own kit to see what kind of results and hypotheses exist for where I fit in the tree reconstructed from my matches and their trees.

If you actually do have a tree, the AutoTree portion will simply be counted as an equal tree to everyone else’s trees, but AutoPedigree will ignore your tree, creating hypotheses as if it doesn’t exist. That’s great for adoptees who may have hypothetical trees in progress, because that tree is disregarded.

First, sign on to your account at Genetic Affairs and select the AutoPedigree option for either Ancestry or Family Tree DNA which reconstructs trees and generates hypotheses automatically. For AutoPedigree construction, you cannot combine the results from Ancestry and FamilyTreeDNA like you can when reconstructing trees alone. You’ll need to do an AutoPedigree run for each vendor. The good news is that while Ancestry has more testers and matches, FamilyTreeDNA has many testers stretching back 20 years or so in the past who passed away before testing became available at Ancestry. Often, their testers reach back a generation or two further. You can easily transfer Ancestry (and other) results to Family Tree DNA for free to obtain more matches – step-by-step instructions here.

At Genetic Affairs, you should also consider including half-relations, especially if you are dealing with an unknown parent situation. Selecting half-relationships generates very large trees, so you might want to do the first run without, then a second run with half relationships selected.

AutoPedigree options

Results

I ran the program and opened the resulting email with the zip file. Saving that file automatically unzips for me, displaying the following 5 files and folders.

Autopedigree cluster

Clicking on the AutoCluster HTML link reveals the now-familiar clusters, shown below.

Autopedigree clusters

I have a total of 26 clusters, only partially shown above. My first peach cluster and my 9th blue cluster are huge.

Autopedigree 26 clusters

That’s great news because it means that I have a lot to work with.

autopedigree folder

Next, you’ll want to click to open your AutoPedigree folder.

For each cluster, you’ll have a corresponding AutoPedigree file if an AutoPedigree can be generated from the trees of the people in that cluster.

My first cluster is simply too large to show successfully in blog format, so I’m selecting a smaller cluster, #21, shown below with the red arrow, with only 6 members. Why so small, you ask? In part, because I want to illustrate the fact that you really don’t need a lot of matches for the AutoPedigree tool to be useful.

Autopedigree multiple clusters

Note also that this entire group of clusters (blue through brown) has members in more than one cluster, indicated by the grey cells that mean someone is a member of at least 2 clusters. That tells me that I need to include the information from those clusters too in my analysis. Fortunately, Genetic Affairs realizes that and provides a combined AutoPedigree tool for that as well, which we will cover later in the article. Just note for now that the blue through brown clusters seem to be related to cluster 21.

Let’s look at cluster 21.

autopedigree cluster 21

In the AutoPedigree folder, you’ll see cluster files when there are trees available to create pedigrees for individual clusters. If you’re lucky, you’ll find 2 files for some clusters.

autopedigree ancestors

At the top of each cluster AutoPedigree file, Genetic Affairs shows you the home couple of the descendant group shown in the matches and their corresponding trees.

Autopedigree WATO chart

Image 1 – click to enlarge

I don’t expect you to be able to read everything in the above pedigree chart, just note the matches and arrows.

You can see three of my cousins who match, labeled with “Ancestry.” You also see branches that generate a viable hypothesis. When generating AutoPedigrees, Genetic Affairs truncates any branches that cannot result in a viable hypothesis for placing the tester in a viable location on the tree, so you may not see all matches.

Autopedigree hyp 1

Image 2 – click to enlarge

On the top branch, you’ll see hyp-1-child1 which is the first hypothesis, with the first child. Their child is hyp-2- child2, and their child is hyp-3-child3. The tester (me, in this case) cannot be the persons shown with red flags, called badges, based on how I match other people and other tree information such as birth and death dates.

Think of a stoplight, red=no, green are your best bets and the rest are yellow, meaning maybe. AutoPedigree makes no decisions, only shows you options, and calculated mathematically how probable each location is to be correct.

Remember, these “children,” meaning hypothesis 1-child 1 may or may not have actually existed. These relationships are hypothetical showing you that IF these people existed, where the tester could appear on the tree.

We know that I don’t fit on the branch above hypothesis 1, because I only match the descendant of Adam Lentz at 44.2 cM which is statistically too low for me to also inhabit that branch.

I’ve included half relationships, so we see hyp-7-child1-half too, which is a half-sibling.

The rankings for hypotheses 1, 2, and 7 all have red badges, meaning not possible, so they have a score of 0. Hypothesis 3 and 8 are possible, with a ranking of 16, respectively.

autopedigree my location

Image 3 – click to enlarge

Looking now at the next segment of the tree, you see that based on how I match my Deatsman and Hartman cousins, I can potentially fit in any portion of the tree with green badges (in the red boxes) or yellow badges.

You can also see where I actually fit in the tree. HOWEVER, that placement is from AutoTree, the tree reconstruction portion, based on the fact that I have a tree (or someone has a tree with me in it). My own tree is ignored for hypothesis generation for the AutoPedigree hypothesis generation portion.

Had my first cousins once removed through my grandfather John Ferverda’s brother, Roscoe, tested AND HAD A TREE, there would have been no question where I fit based on how I match them.

autopedigree cousins

As it turns out they did test, but provided no tree meaning that Genetic Affairs had no tree to work with.

Remember that I mentioned that my first cluster was huge. Many more matches mean that Genetic Affairs has more to work with. From that cluster, here’s an example of a hypothesis being accurate.

autopedigree correct

Image 4 – click to enlarge

You can see the hypothetical line beneath my own line, with hypothesis 104, 105, 106, 107, 108. The AutoTree portion of my tree is shown above, with my father and grandparents and my name in the green block. The AutoPedigree portion ignores my own tree, therefore generating the hypothesis that’s where I could fit with a rank of 2. And yes, that’s exactly where I fit in the tree.

In this case, there were some hypotheses ranked at 1, but they were incorrect, so be sure to evaluate all good (green) options, then yellow, in that order.

Genetic Affairs cannot work with 23andMe results for AutoPedigree because 23andMe doesn’t provide or support trees on their site. AutoClusters are integrated at MyHeritage, but not the AutoTree or AutoPedigree functions, and they cannot be run separately.

That leaves Family Tree DNA and Ancestry.

Combined AutoPedigree

After evaluating each of the AutoPedigrees generated for each cluster for which an AutoPedigree can be generated, click on the various cluster combined autopedigrees.

autopedigree combined

You can see that for cluster 1, I have 7 separate AutoPedigrees based on common ancestors that were different. I have 3 AutoPedigrees also for cluster 9, and 2 AutoPedigrees for 15, 21, and 24.

I have no AutoPedigrees for clusters 2, 3, 5, 6, 7, 8, 14, 17, 18, and 22.

Moving to the combined clusters, the numbers of which are NOT correlated to the clusters themselves, Genetic Affairs has searched trees and combined ancestors in various clusters together when common ancestors were found.

Autopedigree multiple clusters

Remember that I asked you to note that the above blue through brown clusters seem to have commonality between the clusters based on grey cell matches who are found in multiple groups? In fact, these people do share common ancestors, with a large combined AutoPedigree being generated from those multiple clusters.

I know you can’t read the tree in the image that follows. I’m only including it so you’ll see the scale of that portion of my tree that can be reconstructed from my matches with hypotheses of where I fit.

autopedigree huge

Image 5 – click to enlarge

These larger combined pedigrees are very useful to tie the clusters together and understand how you match numerous people who descend from the same larger ancestral group, further back in time.

Integration with DNAPainter

autopedigree wato file

Each AutoPedigree file and combined cluster AutoPedigree file in the AutoPedigree folder is provided in WATO format, allowing you to import them into DNAPainter’s WATO tool.

autopedigree dnapainter import

You can manually flesh out the trees based on actual genealogy in WATO at DNAPainter, manually add matches from GEDmatch, 23andMe or MyHeritage or matches from vendors where your matches trees may not exist but you know how your match connects to you.

Your AutoTree Ancestors

But wait, there’s more.

autopedigree ancestors folder

If you click on the Ancestors folder, you’ll see 5 options for tree generations 3-7.

autopedigree ancestor generations

My three-generation auto-generated reconstructed tree looks like this:

autopedigree my tree

Selecting the 5th generation level displays Jacob Lentz and Frederica Ruhle, the couple shown in the AutoCluster 21 and AutoPedigree examples earlier. The color-coding indicates the source of the ancestors in that position.

Autopedigree expanded tree

click to enlarge

You will also note that Genetic Affairs indicates how many matches I have that share this common ancestor along with which clusters to view for matches relevant to specific ancestors. How cool is this?!!

Remember that you can also import the genetic match information for each AutoTree cluster found at Family Tree DNA into DNAPainter to paint those matches on your chromosomes using DNAPainter’s Cluster Auto Painter.

If you run AutoCluster for matches at 23andMe, MyHeritage, or FamilyTreeDNA, all vendors who provide segment information, you can also import that cluster segment information into DNAPainter for chromosome painting.

However, from that list of vendors, you can only generate AutoTrees and AutoPedigrees at Family Tree DNA. Given this, it’s in your best interest for your matches to test at or upload their DNA (plus tree) to Family Tree DNA who supports trees AND provides segment information, both, and where you can run AutoTree and AutoPedigree.

Have you painted your clusters or generated AutoTrees? If you’re an adoptee or looking for an unknown parent or grandparent, the new AutoPedigree function is exactly what you need.

Documentation

Genetic Affairs provides complete instructions for AutoPedigree in this newsletter, along with a user manual here, and the Facebook Genetic Affairs User Group can be found here.

I wrote the introductory article, AutoClustering by Genetic Affairs, here, and Genetic Affairs Reconstructs Trees from Genetic Clusters – Even Without Your Tree or Common Ancestors, here. You can read about DNAPainter, here.

Transfer your DNA file, for free, from Ancestry to Family Tree DNA or MyHeritage, by following the easy instructions, here.

Have fun! Your ancestors are waiting.

_____________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Products and Services

Genealogy Research

 

Concepts: Chromosome Browser – What Is It, How Do I Use It, and Why Do I Care?

The goal of genetic genealogy is to utilize DNA matches to verify known ancestors and identify unknown ancestors.

A chromosome browser is a tool that allows testers to visualize and compare their DNA on each chromosome with that of their genetic matches. How to utilize and interpret that information becomes a little more tricky.

I’ve had requests for one article with all the information in one place about chromosome browsers:

  • What they are
  • How and when to use them
  • Why you’d want to

I’ve included a feature comparison chart and educational resource list at the end.

I would suggest just reading through this article the first time, then following along with your own DNA results after you understand the basic landscape. Using your own results is the best way to learn anything.

What Does a Chromosome Browser Look Like?

Here’s an example of a match to my DNA at FamilyTreeDNA viewed on their chromosome browser.

browser example.png

On my first 16 chromosomes, shown above, my 1C1R (first cousin once removed,) Cheryl, matches me where the chromosomes are painted blue. My chromosome is represented by the grey background, and her matching portion by the blue overlay.

Cheryl matches me on some portion of all chromosomes except 2, 6, and 13, where we don’t match at all.

You can select any one person, like Cheryl, from your match list to view on a chromosome browser to see where they match you on your chromosomes, or you can choose multiple matches, as shown below.

browser multiple example.png

I selected my 7 closest matches that are not my immediate family, meaning not my parents or children. I’m the background grey chromosome, and each person’s match is painted on top of “my chromosome” in the location where they match me. You see 7 images of my grey chromosome 1, for example, because each of the 7 people being compared to me are shown stacked below one another.

Everyplace that Cheryl matches me is shown on the top image of each chromosome, and our matching segment is shown in blue. The same for the second red copy of the chromosome, representing Don’s match to me. Each person I’ve selected to match against is shown by their own respective color.

You’ll note that in some cases, two people match me in the same location. Those are the essential hints we are looking for. We’ll be discussing how to unravel, interpret, and use matches in the rest of this article.

browser MyHeritage example.png

The chromosome browser at MyHeritage looks quite similar. However, I have a different “top 7” matches because each vendor has people who test on their platform who don’t test or transfer elsewhere.

Each vendor that supports chromosome browsers (FamilyTreeDNA, MyHeritage, 23andMe, and GedMatch) provides their own implementation, of course, but the fundamentals of chromosome browsers, how they work and what they are telling us is universal.

Why Do I Need a Chromosome Browser?

“But,” you might say, “I don’t need to compare my DNA with my matches because the vendors already tell me that I match someone, which confirms that we are related and share a common ancestor.”

Well, not exactly. It’s not quite that straightforward.

Let’s take a look at:

  • How and why people match
  • What matches do and don’t tell you
  • Both with and without a chromosome browser

In part, whether you utilize a chromosome browser or not depends on which of the following you seek:

  • A broad-brush general answer; yes or no, I match someone, but either I don’t know how are related, or have to assume why. There’s that assume word again.
  • To actually confirm and prove your ancestry, getting every ounce of value out of your DNA test.

Not everyone’s goals are the same. Fortunately, we have an entire toolbox with a wide range of tools. Different tools are better suited for different tasks.

People seeking unknown parents should read the article, Identifying Unknown Parents and Individuals Using DNA Matching because the methodology for identifying unknown parents is somewhat different than working with genealogy. This article focuses on genealogy, although the foundation genetic principles are the same.

If you’re just opening your DNA results for the first time, the article, First Steps When Your DNA Results are Ready – Sticking Your Toe in the Genealogy Water would be a great place to start.

Before we discuss chromosome browsers further, we need to talk about DNA inheritance.

Your Parents

Every person has 2 copies of each of their 22 chromosomes – one copy contributed by their mother and one copy contributed by their father. A child receives exactly half of the autosomal DNA of each parent. The DNA of each parent combines somewhat randomly so that you receive one chromosome’s worth of DNA from each of your parents, which is half of each parent’s total.

On each chromosome, you receive some portion of the DNA that each parent received from their ancestors, but not exactly half of the DNA from each individual ancestor. In other words, it’s not sliced precisely in half, but served up in chunks called segments.

Sometimes you receive an entire segment of an ancestor’s DNA, sometimes none, and sometimes a portion that isn’t equal to half of your parent’s segment.

browser inheritance.png

This means that you don’t receive exactly half of the DNA of each of your grandparents, which would be 25% each. You might receive more like 22% from one maternal grandparent and 28% from the other maternal grandparent for a total of 50% of the DNA you inherit from your parents. The other 50% of your DNA comes from the other parent, of course. I wrote about that here.

There’s one tiny confounding detail. The DNA of your Mom and Dad is scrambled in you, meaning that the lab can’t discern scientifically which side is which and can’t tell which pieces of DNA came from Mom and which from Dad. Think of a genetic blender.

Our job, using genetic genealogy, is to figure out which side of our family people who match us descend from – which leads us to our common ancestor(s).

Parallel Roads

For the purposes of this discussion, you’ll need to understand that the two copies you receive of each chromosome, one from each parent, have the exact same “addresses.” Think of these as parallel streets or roads with identical addresses on each road.

browser street.png

In the example above, you can see Dad’s blue chromosome and Mom’s red chromosome as compared to me. Of course, children and parents match on the full length of each chromosome.

I’ve divided this chromosome into 6 blocks, for purposes of illustration, plus the centromere where we generally find no addresses used for genetic genealogy.

In the 500 block, we see that the address of 510 Main (red bar) could occur on either Dad’s chromosome, or Mom’s. With only an address and nothing more, you have no way to know whether your match with someone at 510 Main is on Mom’s or Dad’s side, because both streets have exactly the same addresses.

Therefore, if two people match you, at the same address on that chromosome, like 510 Main Street, they could be:

  • Both maternal matches, meaning both descended from your mother’s ancestors, and those two people will also match each other
  • Both paternal matches, meaning both descended from your father’s ancestors, and those two people will also match each other
  • One maternal and one paternal match, and those two people will not match each other

Well then, how do we know which side of the family a match descends from, and how do we know if we share a common ancestor?

Good question!

Identical by Descent

If you and another person match on a reasonably sized DNA segment, generally about 7 cM or above, your match is probably “identical by descent,” meaning not “identical by chance.” In this case, then yes, a match does confirm that you share a common ancestor.

Identical by descent (IBD) means you inherited the piece of DNA from a common ancestor, inherited through the relevant parent.

Identical by chance (IBC) means that your mom’s and dad’s DNA just happens to have been inherited by you randomly in a way that creates a sequence of DNA that matches that other person. I wrote about both IBD and IBC here.

MMB stats by cM 2

This chart, courtesy of statistician Philip Gammon, from the article Introducing the Match-Maker-Breaker Tool for Parental Phasing shows the percentage of time we expect matches of specific segment sizes to be valid, or identical by descent.

Identical by Chance

How does this work?

How is a match NOT identical by descent, meaning that it is identical by chance and therefore not a “real” or valid match, a situation also known as a false positive?

browser inheritance grid.png

The answer involves how DNA is inherited.

You receive a chromosome with a piece of DNA at every address from both parents. Of course, this means you have two pieces of DNA at each address. Therefore people will match you on either piece of DNA. People from your Dad’s side will match you on the pieces you inherited from him, and people from your Mom’s side will match you on the pieces you inherited from her.

However, both of those matches have the same address on their parallel streets as shown in the illustration, above. Your matches from your mom’s side will have all As, and those from your dad’s side will have all Ts.

The problem is that you have no way to know which pieces you inherited from Mom and from Dad – at least not without additional information.

You can see that for 10 contiguous locations (addresses), which create an example “segment” of your DNA, you inherited all As from your Mom and all Ts from your Dad. In order to match you, someone would either need to have an A or a T in one of their two inherited locations, because you have an A and a T, both. If the other person has a C or a G, there’s no match.

Your match inherited a specific sequence from their mother and father, just like you did. As you can see, even though they do match you because they have either an A or a T in all 10 locations – the As and Ts did not all descend from either their mother or father. Their random inheritance of Ts and As just happens to match you.

If your match’s parents have tested, you won’t match either of their parents nor will they match either of your parents, which tells you immediately that this match is by chance (IBC) and not by descent (IBD), meaning this segment did not come from a common ancestor. It’s identical by chance and, therefore, a false positive.

If We Match Someone Else In Common, Doesn’t That Prove Identical by Descent?

Nope, but I sure wish it did!

The vendors show you who else you and your match both match in common, which provides a SUGGESTION as to your common ancestor – assuming you know which common ancestor any of these people share with you.

browser icw.png

However, shared matches are absolutely NOT a guarantee that you, your match, and your common matches all share the same ancestor, unless you’re close family. Your shared match could match you or your match through different ancestors – or could be identical by chance.

How can we be more confident of what matching is actually telling us?

How can we sort this out?

Uncertainties and Remedies

Here’s are 9 things you DON’T know, based on matching alone, along with tips and techniques to learn more.

  1. If your match to Person A is below about 20cM, you’ll need to verify that it’s a legitimate IBD match (not IBC). You can achieve this by determining if Person A also matches one of your parents and if you match one of Person A’s parents, if parents have tested.

Not enough parents have tested? An alternative method is by determining if you and Person A both match known descendants of the candidate ancestors ON THE SAME SEGMENT. This is where the chromosome browser enters the picture.

In other words, at least three people who are confirmed to descend from your presumptive common ancestor, preferably through at least two different children, must match on a significant portion of the same segment.

Why is that? Because every segment has its own unique genealogical history. Each segment can and often does lead to different ancestors as you move further back in time.

In this example, I’m viewing Buster, David, and E., three cousins descended from the same ancestral couple, compared to me on my chromosome browser. I’m the background grey, and they show in color. You can see that all three of them match me on at least some significant portion of the same segment of chromosome 15.

browser 3 cousins.png

If those people also match each other, that’s called triangulation. Triangulation confirms descent from a common ancestral source.

In this case, I already know that these people are related on my paternal side. The fact that they all match my father’s DNA and are therefore all automatically assigned to my paternal matching tab at Family Tree DNA confirms my paper-trail genealogy.

I wrote detailed steps for triangulation at Family Tree DNA, here. In a nutshell, matching on the same segment to people who are bucketed to the same parent is an automated method of triangulation.

Of course, not everyone has the luxury of having their parents tested, so testing other family members, finding common segments, and assigning people to their proper location in your tree facilitates confirmation of your genealogy (and automating triangulation.)

The ONLY way you can determine if people match you on the same segment, and match each other, is having segment information available to you and utilizing a chromosome browser.

browser MyHeritage triangulation.png

In the example above, the MyHeritage triangulation tool brackets matches that match you (the background grey) and who are all triangulated, meaning they all also match each other. In this case, the portion where all three people match me AND each other is bracketed. I wrote about triangulation at MyHeritage here.

  1. If you match several people who descend from the same ancestor, John Doe, for example, on paper, you CANNOT presume that your match to all of those people is due to a segment of DNA descended from John Doe or his wife. You may not match any of those people BECAUSE OF or through segments inherited from John Doe or his wife. You need segment information and a chromosome browser to view the location of those matches.

Assuming these are legitimate IBD matches, you may share another common line, known or unknown, with some or all of those matches.

It’s easy to assume that because you match and share matches in common with other people who believe they are descended from that same ancestor:

  • That you’re all matching because of that ancestor.
  • Even on the same segments.

Neither of those presumptions can be made without additional information.

Trust me, you’ll get yourself in a heap o’ trouble if you assume. Been there, done that. T-shirt was ugly.

Let’s look at how this works.

browser venn.png

Here’s a Venn diagram showing me, in the middle, surrounded by three of my matches:

  • Match 1 – Periwinkle, descends from Lazarus Estes and Elizabeth Vannoy
  • Match 2 – Teal, descends from Joseph Bolton and Margaret Claxton
  • Match 3 – Mustard, descends from John Y. Estes and Rutha Dodson

Utilizing a chromosome browser, autocluster software, and other tools, we can determine if those matches also match each other on a common segment, which means they triangulate and confirm common ancestral descent.

Of course, those people could match each other due to a different ancestor, not necessarily the one I share with them nor the ancestors I think we match through.

If they/we do all match because they descend from a common ancestor, they can still match each other on different segments that don’t match me.

I’m in the center. All three people match me, and they also match each other, shown in the overlap intersections.

Note that the intersection between the periwinkle (Match 1) and teal (Match 2) people, who match each other, is due to the wives of the children of two of my ancestors. In other words, their match to each other has absolutely nothing to do with their match to me. This was an “aha’ moment for me when I first realized this was a possibility and happens far more than I ever suspected.

The intersection of the periwinkle (Match 1) and mustard (Match 3) matches is due to the Dodson line, but on a different segment than they both share with me. If they had matched each other and me on the same segment, we would be all triangulated, but we aren’t.

The source of the teal (Match 2) to mustard (Match 3) is unknown, but then again, Match 3’s tree is relatively incomplete.

Let’s take a look at autocluster software which assists greatly with automating the process of determining who matches each other, in addition to who matches you.

  1. Clustering technology, meaning the Leeds method as automated by Genetic Affairs and DNAGedcom help, but don’t, by themselves, resolve the quandary of HOW people match you and each other.

People in a colored cluster all match you and each other – but not necessarily on the same segment, AND, they can match each other because they are related through different ancestors not related to your ancestor. The benefit of autocluster software is that this process is automated. However, not all of your matches will qualify to be placed in clusters.

browser autocluster.png

My mustard cluster above includes the three people shown in the chromosome browser examples – and 12 more matches that can be now be researched because we know that they are all part of a group of people who all match me, and several of whom match each other too.

My matches may not match each other for a variety of reasons, including:

  • They are too far removed in time/generations and didn’t inherit any common ancestral DNA.
  • This cluster is comprised of some people matching me on different (perhaps intermarried) lines.
  • Some may be IBC matches.

Darker grey boxes indicate that those people should be in both clusters, meaning the red and mustard clusters, because they match people in two clusters. That’s another hint. Because of the grid nature of clusters, one person cannot be associated with more than 2 clusters, maximum. Therefore, people like first cousins who are closely related to the tester and could potentially be in many clusters are not as useful in clusters as they are when utilizing other tools.

  1. Clusters and chromosome browsers are much less complex than pedigree charts, especially when dealing with many people. I charted out the relationships of the three example matches from the Venn diagram. You can see that this gets messy quickly, and it’s much more challenging to visualize and understand than either the chromosome browser or autoclusters.

Having said that, the ultimate GOAL is to identify how each person is related to you and place them in their proper place in your tree. This, cumulatively with your matches, is what identifies and confirms ancestors – the overarching purpose of genealogy and genetic genealogy.

Let’s take a look at this particular colorized pedigree chart.

Browser pedigree.png

click to enlarge

The pedigree chart above shows the genetic relationship between me and the three matches shown in the Venn diagram.

Four descendants of 2 ancestral couples are shown, above; Joseph Bolton and Margaret Claxton, and John Y. Estes and Rutha Dodson. DNA tells me that all 3 people match me and also match each other.

The color of the square (above) is the color of DNA that represents the DNA segment that I received and match with these particular testers. This chart is NOT illustrating how much DNA is passed in each generation – we already know that every child inherits half of the DNA of each parent. This chart shows match/inheritance coloring for ONE MATCHING SEGMENT with each match, ONLY.

Let’s look at Joseph Bolton (blue) and Margaret Claxton (pink). I descend through their daughter, Ollie Bolton, who married William George Estes, my grandfather. The DNA segment that I share with blue Match 2 (bottom left) is a segment that I inherited from Joseph Bolton (blue). I also carry inherited DNA from Margaret Claxton too, but that’s not the segment that I share with Match 2, which is why the path from Joseph Bolton to me, in this case, is blue – and why Match 2 is blue. (Just so you are aware, I know this segment descends from Joseph Bolton, because I also match descendants of Joseph’s father on this segment – but that generation/mtach is not shown on this pedigree chart.)

If I were comparing to someone else who I match through Margaret Claxton, I would color the DNA from Margaret Claxton to me pink in that illustration. You don’t have to DO this with your pedigree chart, so don’t worry. I created this example to help you understand.

The colored dots shown on the squares indicate that various ancestors and living people do indeed carry DNA from specific ancestors, even though that’s not the segment that matches a particular person. In other words, the daughter, Ollie, of Joseph Bolton and Margaret Claxton carries 50% pink DNA, represented by the pink dot on blue Ollie Bolton, married to purple William George Estes.

Ollie Bolton and William George Estes had my father, who I’ve shown as half purple (Estes) and half blue (Bolton) because I share Bolton DNA with Match 2, and Estes DNA with Match 1. Obviously, everyone receives half of each parent’s DNA, but in this case, I’m showing the path DNA descended for a specific segment shared with a particular match.

I’ve represented myself with the 5 colors of DNA that I carry from these particular ancestors shown on the pedigree chart. I assuredly will match other people with DNA that we’ve both inherited from these ancestors. I may match these same matches shown with DNA that we both inherited from other ancestors – for example, I might match Match 2 on a different segment that we both inherited from Margaret Claxton. Match 2 is my second cousin, so it’s quite likely that we do indeed share multiple segments of DNA.

Looking at Match 3, who knows very little about their genealogy, I can tell, based on other matches, that we share Dodson DNA inherited through Rutha Dodson.

I need to check every person in my cluster, and that I share DNA with on these same segment addresses to see if they match on my paternal side and if they match each other.

  1. At Family Tree DNA, I will be able to garner more information about whether or not my matches match each other by using the Matrix tool as well as by utilizing Phased Family Matching.

At Family Tree DNA, I determined that these people all match in common with me and Match 1 by using the “In Common With” tool. You can read more about how to use “In Common With” matching, here.

browser paternal.png

Family Matching phases the matches, assigning or bucketed them maternally or paternally (blue and red icons above), indicating, when possible, if these matches occur on the same side of your family. I wrote about the concept of phasing, here, and Phased Family Matching here and here.

Please note that there is no longer a limit on how distantly related a match can be in order to be utilized in Phased Family Matching, so long as it’s over the phase-matching threshold and connected correctly in your tree.

browser family tree dna link tree.png

Bottom line, if you can figure out how you’re related to someone, just add them into your tree by creating a profile card and link their DNA match to them by simply dragging and dropping, as illustrated above.

Linking your matches allows Family Matching to maternally or paternally assign other matches that match both you and your tree-linked matches.

If your matches match you on the same segment on the same parental side, that’s segment triangulation, assuming the matches are IBD. Phased Family Matching does this automatically for you, where possible, based on who you have linked in your tree.

For matches that aren’t automatically bucketed, there’s another tool, the Matrix.

browser matrix.png

In situations where your matches aren’t “bucketed” either maternally or paternally, the Matrix tool allows you to select matches to determine whether your matches also match each other. It’s another way of clustering where you can select specific people to compare. Note that because they also match each other (blue square) does NOT mean it’s on the same segment(s) where they match you. Remember our Venn diagram.

browser matrix grid.png

  1. Just because you and your matches all match each other doesn’t mean that they are matching each other because of the same ancestor. In other words, your matches may match each other due to another or unknown ancestor. In our pedigree example, you can see that the three matches match each other in various ways.
browser pedigree match.png

click to enlarge

  • Match 1 and Match 2 match each other because they are related through the green Jones family, who is not related to me.
  • Match 2 and Match 3 don’t know why they match. They both match me, but not on the same segment they share with each other.
  • Match 1 and Match 3 match through the mustard Dodson line, but not on the same segment that matches me. If we all did match on the same segment, we would be triangulated, but we wouldn’t know why Match 3 was in this triangulation group.
  1. Looking at a downloaded segment file of your matches, available at all testing vendors who support segment information and a chromosome browser, you can’t determine without additional information whether your matches also match each other.

browser chr 15.png

Here’s a group of people, above, that we’ve been working with on chromosome 15.

My entire match-list shows many more matches on that segment of chromosome 15. Below are just a few.

browser chr 15 all

Looking at seven of these people in the chromosome browser, we can see visually that they all overlap on part of a segment on chromosome 15. It’s a lot easier to see the amount of overlap using a browser as opposed to the list. But you can only view 7 at a time in the browser, so the combination of both tools is quite useful. The downloaded spreadsheet shows you who to select to view for any particular segment.

browser chr 15 compare.png

The critical thing to remember is that some matches will be from tyour mother’s side and some from your father’s side.

Without additional information and advanced tools, there’s no way to tell the difference – unless they are bucketed using Phased Family Matching at Family Tree DNA or bracketed with a triangulation bracket at MyHeritage.

At MyHeritage, this assumes you know the shared ancestor of at least one person in the triangulation group which effectively assigns the match to the maternal or paternal side.

Looking at known relatives on either side, and seeing who they also match, is how to determine whether these people match paternally or maternally. In this example below, the blue people are bucketed paternally through Phased Family Matching, the pink maternally, and the white rows aren’t bucketed and therefore require additional evaluation.

browser chr 15 maternal paternal.png

Additional research shows that Jonathan is a maternal match, but Robert and Adam are identical by chance because they don’t match either of my parents on this segment. They might be valid matches on other segments, but not this one.

browser chr 15 compare maternal paternal.png

  1. Utilizing relatives who have tested is a huge benefit, and why we suggest that everyone test their closest upstream relatives (meaning not children or grandchildren.) Testing all siblings is recommended if both parents aren’t available to test, because every child received different parts of their parents’ DNA, so they will match different relatives.

After deleting segments under 7 cM, I combine the segment match download files of multiple family members (who agree to allow me to aggregate their matches into one file for analysis) so that I can create a master match file for a particular family group. Sorting by match name, I can identify people that several of my cousins’ match.

browser 4 groups.png

This example is from a spreadsheet where I’ve combined the results of about 10 collaborating cousins to determine if we can break through a collective brick wall. Sorted by match name, this table shows the first 4 common matches that appear on multiple cousin’s match lists. Remember that how these people match may have nothing to do with our brick wall – or it might.

Note that while the 4 matches, AB, AG, ag, and A. Wayne, appear in different cousins’ match lists, only one shares a common segment of DNA: AB triangulates with Buster and Iona. This is precisely WHY you need segment information, and a chromosome browser, to visualize these matches, and to confirm that they do share a common DNA segment descended from a specific ancestor.

These same people will probably appear in autocluster groups together as well. It’s worth noting, as illustrated in the download example, that it’s much more typical for “in common with” matches to match on different segments than on the same segment. 

  1. Keep in mind that you will match both your mother and father on every single chromosome for the entire length of each chromosome.

browser parent matching.png

Here’s my kit matching with my father, in blue, and mother, in red on chromosomes 1 and 2.

Given that I match both of my parents on the full chromosome, inheriting one copy of my chromosome from each parent, it’s impossible to tell by adding any person at random to the chromosome browser whether they match me maternally or paternally. Furthermore, many people aren’t fortunate enough to have parents available for testing.

To overcome that obstacle, you can compare to known or close relatives. In fact, your close relatives are genetic genealogy gold and serve as your match anchor. A match that matches you and your close relatives can be assigned either maternally or paternally. I wrote about that here.

browser parent plus buster.png

You can see that my cousin Buster matches me on chromosome 15, as do both of my parents, of course. At this point, I can’t tell from this information alone whether Buster matches on my mother’s or father’s side.

I can tell you that indeed, Buster does match my father on this same segment, but what if I don’t have the benefit of my father’s DNA test?

Genealogy tells me that Buster matches me on my paternal side, through Lazarus Estes and Elizabeth Vannoy. Given that Buster is a relatively close family member, I already know how Buster and I are related and that our DNA matches. That knowledge will help me identify and place other relatives in my tree who match us both on the same segment of DNA.

To trigger Phased Family Matching, I placed Buster in the proper place in my tree at Family Tree DNA and linked his DNA. His Y DNA also matches the Estes males, so no adoptions or misattributed parental events have occurred in the direct Estes patrilineal line.

browser family tree dna tree.png

I can confirm this relationship by checking to see if Buster matches known relatives on my father’s side of the family, including my father using the “in common with” tool.

Buster matches my father as well as several other known family members on that side of the family on the same segments of DNA.

browser paternal bucket.png

Note that I have a total of 397 matches in common with Buster, 140 of which have been paternally bucketed, 4 of which are both (my children and grandchildren), and 7 of which are maternal.

Those maternal matches represent an issue. It’s possible that those people are either identical by chance or that we share both a maternal and paternal ancestor. All 7 are relatively low matches, with longest blocks from 9 to 14 cM.

Clearly, with a total of 397 shared matches with Buster, not everyone that I match in common with Buster is assigned to a bucket. In fact, 246 are not. I will need to take a look at this group of people and evaluate them individually, their genealogy, clusters, the matrix, and through the chromosome browser to confirm individual matching segments.

There is no single perfect tool.

Every Segment Tells a Unique History

I need to check each of the 14 segments that I match with Buster because each segment has its own inheritance path and may well track back to different ancestors.

browser buster segments.png

It’s also possible that we have unknown common ancestors due to either adoptions, NPEs, or incorrect genealogy, not in the direct Estes patrilineal line, but someplace in our trees.

browser buster paint.png

The best way to investigate the history and genesis of each segment is by painting matching segments at DNAPainter. My matching segments with Buster are shown painted at DNAPainter, above. I wrote about DNAPainter, here.

browser overlap.png

By expanding each segment to show overlapping segments with other matches that I’ve painted and viewing who we match, we can visually see which ancestors that segment descends from and through.

browser dnapainter walk back.png

These roughly 30 individuals all descend from either Lazarus Estes and Elizabeth Vannoy (grey), Elizabeth’s parents (dark blue), or her grandparents (burgundy) on chromosome 15.

As more people match me (and Buster) on this segment, on my father’s side, perhaps we’ll push this segment back further in time to more distant ancestors. Eventually, we may well be able to break through our end-of-line brick wall using these same segments by looking for common upstream ancestors in our matches’ trees.

Arsenal of Tools

This combined arsenal of tools is incredibly exciting, but they all depend on having segment information available and understanding how to use and interpret segment and chromosome browser match information.

One of mine and Buster’s common segments tracks back to end-of-line James Moore, born about 1720, probably in Virginia, and another to Charles Hickerson born about 1724. It’s rewarding and exciting to be able to confirm these DNA segments to specific ancestors. These discoveries may lead to breaking through those brick walls eventually as more people match who share common ancestors with each other that aren’t in my tree.

This is exactly why we need and utilize segment information in a chromosome browser.

We can infer common ancestors from matches, but we can’t confirm segment descent without specific segment information and a chromosome browser. The best we can do, otherwise, is to presume that a preponderance of evidence and numerous matches equates to confirmation. True or not, we can’t push further back in time without knowing who else matches us on those same segments, and the identity of their common ancestors.

The more evidence we can amass for each ancestor and ancestral couple, the better, including:

  • Matches
  • Shared “In Common With” Matches, available at all vendors.
  • Phased Family Matching at Family Tree DNA assigns matches to maternal or paternal sides based on shared, linked DNA from known relatives.
  • The Matrix, a Family Tree DNA tool to determine if matches also match each other. Tester can select who to compare.
  • ThruLines from Ancestry is based on a DNA match and shared ancestors in trees, but no specific segment information or chromosome browser. I wrote about ThruLines here and here.
  • Theories of Family Relativity, aka TOFR, at MyHeritage, based on shared DNA matches, shared ancestors in trees and trees constructed between matches from various genealogical records and sources. MyHeritage includes a chromosome browser and triangulation tool. I wrote about TOFR here and here.
  • Triangulation available through Phased Family Matching at Family Tree DNA and the integrated triangulation tool at MyHeritage. Triangulation between only 3 people at a time is available at 23andMe, although 23andMe does not support trees. See triangulation article links in the Resource Articles section below.
  • AutoClusters at MyHeritage (cluster functionality included), at Genetic Affairs (autoclusters plus tree reconstruction) and at DNAGedcom (including triangulation).
  • Genealogical information. Please upload your trees to every vendor site.
  • Y DNA and mitochondrial DNA confirmation, when available, through Family Tree DNA. I wrote about the 4 Kinds of DNA for Genetic Genealogy, here and the importance of Y DNA confirmation here, and how not having that information can trip you up.
  • Compiled segment information at DNAPainter allows you to combine segment information from various vendors, paint your maternal and paternal chromosomes, and visually walk segments back in time. Article with DNAPainter instructions is found here.

Autosomal Tool Summary Table

In order to help you determine which tool you need to use, and when, I’ve compiled a summary table of the types of tools and when they are most advantageous. Of course, you’ll need to read and understand about each tool in the sections above. This table serves as a reminder checklist to be sure you’ve actually utilized each relevant tool where and how it’s appropriate.

Family Tree DNA MyHeritage Ancestry 23andMe GedMatch
DNA Matches Yes Yes Yes Yes, but only highest 2000 minus whoever does not opt -in Yes, limited matches for free, more with subscription (Tier 1)
Download DNA Segment Match Spreadsheet Yes Yes No, must use DNAGedcom for any download, and no chromosome segment information Yes Tier 1 required, can only download 1000 through visualization options
Segment Spreadsheet Benefits View all matches and sort by segment, target all people who match on specific segments for chromosome browser View all matches and sort by segment, target all people who match on specific segments for chromosome browser No segment information but matches might transfer elsewhere where segment information is available View up to 2000 matches if matches have opted in. If you have initiated contact with a match, they will not drop off match list. Can download highest 1000 matches, target people who match on specific segments
Spreadsheet Challenges Includes small segments, I delete less than 7cM segments before using No X chromosome included No spreadsheet and no segment information Maximum of 2000 matches, minus those not opted in Download limited to 1000 with Tier 1, download not available without subscription
Chromosome Segment Information Yes Yes No, only total and longest segment, no segment address Yes Yes
Chromosome Browser Yes, requires $19 unlock if transfer Yes, requires $29 unlock or subscription if transfer No Yes Yes, some features require Tier 1 subscription
X Chromosome Included Yes No No Yes Yes, separate
Chromosome Browser Benefit Visual view of 7 or fewer matches Visual view of 7 or fewer matches, triangulation included if ALL people match on same portion of common segment No browser Visual view of 5 or fewer matches Unlimited view of matches, multiple options through comparison tools
Chromosome Browser Challenges Can’t tell whether maternal or paternal matches without additional info if don’t select bucketed matches Can’t tell whether maternal or paternal without additional info if don’t triangulate or you don’t know your common ancestor with at least one person in triangulation group No browser Can’t tell whether maternal or paternal without other information Can’t tell whether maternal or paternal without other information
Shared “In Common With” Matches Yes Yes Yes Yes, if everyone opts in Yes
Triangulation Yes, Phased Family Matching, plus chromosome browser Yes, included in chromosome browser if all people being compared match on that segment No, and no browser Yes, but only for 3 people if “Shared DNA” = Yes on Relatives in Common Yes, through multiple comparison tools
Ability to Know if Matches Match Each Other (also see autoclusters) Yes, through Matrix tool or if match on common bucketed segment through Family Matching Yes, through triangulation tool if all match on common segment No Yes, can compare any person to any other person on your match list Yes, through comparison tool selections
Autoclusters Can select up to 10 people for Matrix grid, also available for entire match list through Genetic Affairs and DNAGedcom which work well Genetic Affairs clustering included free, DNAGedcom has difficulty due to timeouts No, but Genetic Affairs and DNAGedcom work well No, but Genetic Affairs and DNAGedcom work well Yes, Genetic Affairs included in Tier 1 for selected kits, DNAGedcom is in beta
Trees Can upload or create tree. Linking you and relatives who match to tree triggers Phased Family Matching Can upload or create tree. Link yourself and kits you manage assists Theories of Family Relativity Can upload or create tree. Link your DNA to your tree to generate ThruLines. Recent new feature allows linking of DNA matches to tree. No tree support but can provide a link to a tree elsewhere Upload your tree so your matches can view
Matching and Automated Tree Construction of DNA Matches who Share Common Ancestors with You Genetic Affairs for matches with common ancestors with you Not available Genetic Affairs for matches with common ancestors with you No tree support Not available
Matching and Automated Tree Construction for DNA Matches with Common Ancestors with Each Other, But Not With You Genetic Affairs for matches with common ancestors with each other, but not with you Not available Genetic Affairs for matches with common ancestors with each other, but not with you No tree support Not available
DNAPainter Segment Compilation and Painting Yes, bucketed Family Match file can be uploaded which benefits tester immensely. Will be able to paint ethnicity segments soon. Yes No segment info available, encourage your matches to upload elsewhere Yes, and can paint ethnicity segments from 23andMe, Yes, but only for individually copied matches or highest 1000.
Y DNA and Mitochondrial Matching Yes, both, includes multiple tools, deep testing and detailed matching No No No, base haplogroup only, no matching No, haplogroup only if field manually completed by tester when uploading autosomal DNA file

Transfer Your DNA

Transferring your DNA results to each vendor who supports segment information and accepts transfers is not only important, it’s also a great way to extend your testing collar. Every vendor has strengths along with people who are found there and in no other database.

Ancestry does not provide segment information nor a chromosome browser, nor accept uploads, but you have several options to transfer your DNA file for free to other vendors who offer tools.

23andMe does provide a chromosome browser but does not accept uploads. You can download your DNA file and transfer free to other vendors.

I wrote detailed upload/download and transfer instructions for each vendor, here.

Two vendors and one third party support transfers into their systems. The transfers include matching. Basic tools are free, but all vendors charge a minimal fee for unlocking advanced tools, which is significantly less expensive than retesting:

Third-party tools that work with your DNA results include:

All vendors provide different tools and have unique strengths. Be sure that your DNA is working as hard as possible for you by fishing in every pond and utilizing third party tools to their highest potential.

Resource Articles

Explanations and step by step explanations of what you will see and what to do, when you open your DNA results for the first time.

Original article about chromosomes having 2 sides and how they affect genetic genealogy.

This article explains what triangulation is for autosomal DNA.

Why some matches may not be valid, and how to tell the difference.

This article explains the difference between a match group, meaning a group of people who match you, and triangulation, where that group also matches each other. The concepts are sound, but this article relies heavily on spreadsheets, before autocluster tools were available.

Parental phasing means assigning segment matches to either your paternal or maternal side.

Updated, introductory article about triangulation, providing the foundation for a series of articles about how to utilize triangulation at each vendor (FamilyTreeDNA, MyHeritage, 23andMe, GEDmatch, DNAPainter) that supports triangulation.

These articles step you through triangulation at each vendor.

DNAPainter facilitates painting maternally and paternally phased, bucketed matches from FamilyTreeDNA, a method of triangulation.

Compiled articles with instructions and ideas for using DNAPainter.

Autoclustering tool instructions.

How and why The Leeds Method works.

Step by step instructions for when and how to use FamilyTreeDNA’s chromosome browser.

Close family members are the key to verifying matches and identifying common ancestors.

This article details how much DNA specific relationships between people can expect to share.

Overview of transfer information and links to instruction articles for each vendor, below.

_____________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Products and Services

Genealogy Research

Fun DNA Stuff

  • Celebrate DNA – customized DNA themed t-shirts, bags, and other items

DNA Testing Sales Decline: Reason and Reasons

If you’re involved in genetic genealogy, you’ve probably noticed the recent announcements by both 23andMe and Ancestry relative to workforce layoffs as a result of declining sales.

Layoffs

In January, 23andMe announced that it was laying off 100 people which equated to 14% of its staff.

Following suit, Ancestry this week announced that they are laying off 100 people, 6% of their work force. They discuss their way forward, here.

One shift of this type can be a blip, but two tends to attract attention because it *could* indicate a trend. Accordingly, several articles have been written about possible reasons why this might be occurring. You can read what TechCrunch says here, Business Insider here, and The Verge, here.

Depending on who you talk to and that person’s perspective, the downturn is being attributed to:

  • Market Saturation
  • No Repeat Sales
  • Privacy Concerns
  • FAD Over

Ok, So What’s Happening?

Between Ancestry and 23andMe alone, more than 26 million DNA tests have been sold, without counting the original DNA testing company, FamilyTreeDNA along with MyHeritage who probably have another 4 or 5 million between them.

Let’s say that’s a total of 30 million people in DNA databases that offer matching. The total population of the US is estimated to be about 329 million, including children, which means that one person in 10 or 11 people in the US has now tested. Of course, DNA testing reaches worldwide, but it’s an interesting comparison indicating how widespread DNA testing has become overall.

This slowing of new sales shouldn’t really surprise anyone. In July 2019, Illumina, the chip maker who supplies equipment and supplies to the majority of the consumer DNA testing industry said that the market was softening after a drop in their 2019 second quarter revenue.

Also last year, Ancestry and MyHeritage both announced health products, a move which would potentially generate a repeat sale from someone who has already tested their DNA for genealogy purposes. I suspected at the time this might be either a pre-emptive strike, or in response to slowed sales.

In November 2019, Family Tree DNA announced an extensive high-end health test through Tovana which tests the entire Exome, the portion of our DNA useful for medical and health analysis.

In a sense, this health focus too is trendy, but moves away from genealogy into an untapped area.

23andMe who, according to their website, has obtained $791 million in venture capital or equity funding has always been focused on medical research. In July of 2018 GlaxoSmithKline infused $300 million into 23andMe in exchange for access to DNA results of their 5 million customers who have opted-in to medical research, according to Genengnews. If you divide the 300 million investment by 5 million opted-in customers, 23andMe received $60 per DNA kit.

That 5 million number is low though, based on other statements by 23andMe which suggests they have 10 million total customers, 80% of which opt-in for medical research. That would be a total of 8 million DNA results available to investors.

Divide $791 million by 8 million kits and 23andMe, over the years, has received roughly $99 for each customer who has opted in to research.

We know who Ancestry has partnered with for research, but not how much Ancestry has received.

There’s very big money, huge money, in collaborating with Big Pharma and others. Given the revenue potential, it’s amazing that the other two vendors, Family Tree DNA and MyHeritage, haven’t followed suit, but they haven’t.

Additionally, in January, 23andMe sold the rights to a new drug it developed in-house as a potential treatment for inflammatory diseases for a reported (but unconfirmed by 23andMe) $5 million.

It’s ironic that two companies who just announced layoffs are the two who have partnered to sell access to their opted-in customers’ DNA results.

My Thoughts

I’ve been asked several times about my thoughts on this shift within the industry. I have refrained from saying much, because I think there has been way too much “hair on fire” clickbait reporting that is fanning the flames of fear, not only in the customer base, but in general.

I am sharing my thoughts, and while they are not entirely positive, in that there is clearly room for improvement, I want to emphasize that I am very upbeat about this industry as a whole, and this article ends very positively with suggestions for exactly that – so please read through.

Regardless of why, fewer new people are testing which of course results in fewer sales, and fewer new matches for us.

My suspicion is that each of the 4 reasons given above is accurate to some extent, and the cumulative effect plus a couple of other factors is the reason we’re seeing the downturn.

Let’s take a look at each one.

Market Saturation

Indeed, we’ve come a very long way from the time when DNA was a verboten topic on the old RootsWeb mailing lists and boards.

Early DNA adopters back then were accused of “cheating,” and worse. Our posts were deleted immediately. How times have changed!

As the technology matured, 23andMe began offering autosomal testing accompanied by cousin matching.

Ancestry initially stepped into the market with Y and mitochondrial DNA testing, but ultimately destroyed that database which included Y and mitochondrial DNA results from Relative Genetics, a company they had previously acquired. People in those databases, as well as who had irreplaceable samples in Sorenson, which Ancestry also purchased and subsequently took offline permanently have never forgotten.

Those genealogists have probably since tested at Ancestry, but they may be more inclined to test the rest of their family at places like Family Tree DNA and MyHeritage who have chromosome browsers and tools that support more serious researchers.

I think a contributing factor is that fewer “serious genealogists” are coming up in the ranks. The perception that all you need to do is enter a couple of generations and click on a few leaves, and you’re “done” misleads people as to the complexity and work involved in genealogical research. Not to mention how many of those hints are inaccurate and require analysis.

Having said that, I view each one of these people who are encouraged for the first time by an ad, even if it is misleading in its simplicity, as a potential candidate. We were all baby genealogists once, and some of us stayed for reasons known only to us. Maybe we have the genealogy gene😊

But yes, I would agree that the majority, by far, of serious genealogists have already tested someplace. What they have not done universally is transferred from 23andMe and Ancestry to the other companies that can help them, such as MyHeritage, FamilyTreeDNA and GEDmatch. If they had, the customer numbers at those companies would be higher. We all need to fish in every pond.

Advertising and Ethnicity

The DNA ads over the last few years have focused almost exclusively on ethnicity – the least reliable aspect of genetic genealogy – but also the “easiest” to understand if a customer takes their ethnicity percentages at face value. And of course, every consumer that purchases a test as a result of one of these ads does exactly that – spits or swabs, mails and opens their results to see what they “are” – full of excited anticipation.

Many people have absolutely no idea there’s more, like cousin matching – and many probably wouldn’t care.

The buying public who purchases due to these ads are clearly not early adopters, and most likely are not genealogists. One can hope that at least a few of them get hooked as a result, or at least enter a minimal tree.

Unfortunately, of the two companies experiencing layoffs, only Ancestry supports trees. Genealogy revolves around trees, pure and simple.

23andMe has literally had years to do so and has refused to natively support trees. Their FamilySearch link is not the same as supporting trees and tree matching. Their attempt at creating a genetic tree is laudable and has potential, but it’s not something that can be translated into a genealogical benefit for most people. I’m guessing that there aren’t any genealogists working for 23andMe, or they aren’t “heard” amid the vervre surrounding medical research.

All told, I’m not surprised that the two companies who are experiencing the layoffs are the two companies whose ads we saw most often focused on ethnicity, especially Ancestry. Who can forget the infamous kilt/leiderhosen ad that Ancestry ran? I still cringe.

Many people who test for ethnicity never sign on again – especially if they are unhappy with the results.

Ancestry and 23andMe spent a lot on ad campaigns, ramped up for the resulting sales, but now the ads are less effective, so not being run as much or at all. Sales are down. Who’s to say which came first, the chicken (fewer ads) or the egg (lower sales.)

This leads us to the next topic, add on sales.

No Repeat Sales

DNA testing, unless you have something else to offer customers is being positioned as a “one and done” sale, meaning that it’s a single purchase with no potential for additional revenue. While that’s offered as a reason for the downturn, it’s not exactly true for DNA test sales.

Ancestry clearly encourages customers to subscribe to their records database by withholding access to some DNA features without a subscription. For Ancestry, DNA is the bait for a yearly repeat sale of a subscription. Genealogists subscribe, of course, but people who aren’t genealogists don’t see the benefit.

Ancestry does not allow transfers into their database, which would provide for additional revenue opportunity. I suspect the reason is twofold. First, they want the direct testing revenue, but perhaps more importantly, in order to sell their customer’s DNA who have agreed to participate in research, or partner with research firms, those customers need to have tested on Ancestry’s custom chip. This holds true for 23andMe as well.

Through the 23andMe financial information in the earlier section, it’s clear that while the consumer only pays a one time fee to test, multiple research companies will pay over and over for access to that compiled consumer information.

Ancestry and 23andMe have the product, your opted-in DNA test that you paid for, and they can sell it over and over again. Hopefully, this revenue stream helps to fund development of genetic genealogical tools.

MyHeritage also provides access to advanced DNA tools by selling a subscription to their records database after a free trial. MyHeritage has integrated their DNA testing with genealogical records to provide their advanced Theories of Family Relativity tool, a huge boon to genealogists.

While Family Tree DNA doesn’t have a genealogical records database like Ancestry and MyHeritage, they provide Y DNA and mitochondrial DNA testing, in addition to the autosomal Family Finder test. If more people tested Y DNA and mitochondrial DNA, more genealogical walls would fall due to the unique inheritance path and the fact that neither Y nor mitochondrial DNA is admixed with DNA from the other parent.

Generally, only genealogists know about and are going to order Y DNA and mtDNA tests, or sponsor others to take them to learn more about their ancestral lines. These tests don’t provide yearly revenue like an ongoing subscription, but at least the fact that Family Tree DNA offers three different tests does provide the potential for at least some additional sales.

Both MyHeritage and FamilyTreeDNA encourage uploads, and neither sell, lease or share your DNA for medical testing. You can find upload instructions, here.

In summary of this section, all of the DNA testing companies do have some sort of additional (potential) revenue stream from DNA testing, so it’s not exactly “one and done.”

Health Testing Products

As for health testing, 23andMe has always offered some level of health information for their customers. Health and research has always been their primary focus. Health and genealogy was originally bundled into one test. Today, DNA ancestry tests with the health option at 23andMe cost more than a genealogy-only test and are two separate products.

MyHeritage also offers a genealogy only DNA test and a genealogy plus health DNA test.

In 2019, both Ancestry and MyHeritage added health testing to their menu as upgrades for existing customers.

In November 2019, FamilyTreeDNA announced an alliance with Tovana for their customers to order a full exome grade medical test and accompanying report. I recently received mine and am still reviewing the results – they are extensive.

It’s clear that all four companies see at least some level of consumer interest in health and traits as a lucrative next step.

Medical Research and DNA Sales

Both Ancestry and 23andMe are pursuing and have invested in relationships with research institutions or Big Pharma. I have concerns with how this is handled. You may not.

I’m supportive of medical research, but I’m concerned that most people have no idea of the magnitude and scope of the contracts between Ancestry and 23andMe with Big Pharma and others, in part, because the details are not public. Customers may also not be aware of exactly what they are opting in to, what it means or where their DNA/DNA results are going.

As a consumer, I want to know where my DNA is, who is using it, and for what purpose. I don’t want my DNA to wind up being used for a nefarious purpose or something I don’t approve of. Think Uighurs in China by way of example. BGI Genetics, headquartered in China but with an Americas division and facilities in Silicon Valley has been a major research institute for years. I want to know what my DNA is being used for, and by whom. The fact that the companies won’t provide their customers with that information makes me makes me immediately wonder why not.

I would like to be able to opt-in for specific studies, not blindly for every use that is profitable to the company involved, all without my knowledge. No blank checks. For example, I opted out of 23andMe research when they patented the technology for designer babies.

Furthermore, I feel that if someone is going to profit from my DNA, it should be me since I paid for the sequencing. At minimum, a person whose DNA is used in these studies should receive some guarantee that they will be provided with any drug in which their DNA is used for development, in particular if their insurance doesn’t pay and they cannot afford the drug.

Drug prices have risen exponentially in the US recently, with many people no longer able to afford their medications. For example, the price of insulin has tripled over the last decade, causing people to ration or cut back on their insulin, if not go without altogether. It would be the greatest of ironies if the very people whose DNA was sold and used to create a drug had no access to it.

Of course, Ancestry and 23andMe are not required to inform consumers of which studies their DNA or DNA results are used for, so we don’t know. Always read all of the terms and conditions, and all links when authorizing anything.

Both companies indicate that your DNA results are anonymized before being shared, but we now know that’s not really possible anymore, because it’s relatively easy to re-identify someone. This is exactly how adoptees identify their biological parents through genetic matches. Dr. Yaniv Erlich reported in the journal Science November 2018 that more than 60% of Europeans could be reidentified through a genealogy database of only 1.28 million individuals.

I think greater transparency and a change in policy favoring the consumer would go a long way to instilling more confidence in the outside research relationships that both Ancestry and 23andMe pursue and maintain. It would probably increase their participation level as well if people could select the research initiatives to which they want to contribute their DNA.

Privacy Concerns

The news has been full of articles about genetic privacy, especially in the months since the Golden State Killer case was solved. That was only April 2018, but it seems like eons ago.

Unfortunately, much of what has been widely reported is inaccurate. For example, no company has ever thrown the data base open for the FBI or anyone to rummage through like a closet full of clothes. However, headlines and commentary like that attract outrage and hundreds of thousands of clicks. In the news and media industry, “it’s all about eyeballs.”

In one case, an article I interviewed for extensively in an educational capacity was written accurately, but the headline was awful. The journalist in question replied that the editors write the headlines, not the reporters.

One instance of this type of issue would be pretty insignificant, but the news in this vein hasn’t abated, always simmering just below the surface waiting for something to fan the flames. Outrage sells.

For the most part, those within the genealogy community at least attempt to sort out what is accurate reporting and what is not, but those people are the ones who have already tested.

People outside the genealogy community just know that they’ve now seen repeated headlines reporting that their genetic privacy either has been, could be or might be breached, and they are suspicious and leery. I would be too. They have no idea what that actually means, what is actually occurring, where, or that they are probably far more at risk on social media sites.

These people are not genealogists, and now they look at ads and think to themselves, “yes, I’d like to do that, but…”

And they never go any further.

People are frightened and simply disconnect from the topic – without testing.

If, as a consumer, you see several articles or posts saying that <fill in car model> is really bad, when you consider a purchase, even if you initially like that model, you’ll remember all of those negative messages. You may never realize that the source was the competition which would cause you to interpret those negative comments in a completely different light.

I think that some of the well-intentioned statements made by companies to reassure their existing and potential customers have actually done more harm than good by reinforcing that there’s a widespread issue. “You’re safe with us” can easily be interpreted as, “there’s something to be afraid of.”

Added to that is the sensitive topic of adoptee and unknown parent searches.

Reunion stories are wonderfully touching, and we all love them, but you seldom see the other side of the coin. Not every story has a happy ending, and many don’t. Not every parent wants to be found for a variety of reasons. If you’re the child and don’t want to find your parents, don’t test, but it doesn’t work the other way around. A parent can often be identified by their relatives’ DNA matches to their child.

While most news coverage reflects positive adoptee reunion outcomes, that’s not universal, and almost every family has a few lurking skeletons. People know that. Some people are fearful of what they might discover about themselves or family members and are correspondingly resistant to DNA testing. Realizing you might discover that your father isn’t your biological father if you DNA test gives people pause. It’s a devastating discovery and some folks decide they’d rather not take that chance, even though they believe it’s not possible.

The genealogical search techniques for identifying unknown parents or close relatives and the technique used by law enforcement to identify unknown people, either bodies or perpetrators is exactly the same. If you are in one of the databases, who you match can provide a very big hint to someone hunting for the identify of an unknown person.

People who are not genealogists, adoptees or parents seeking to find children placed for adoption may be becoming less comfortable with this idea in general.

Of course, the ability for law enforcement to upload kits to GedMatch/Verogen and Family Tree DNA, under specific controlled conditions, has itself been an explosive and divisive topic within and outside of the genealogy community since April 2018.

These law enforcement kits are either cold case remains of victims, known as “Does,” or body fluids from the scenes of violent crimes, such as rape, murder and potentially child abduction and aggravated assault. To date, since the Golden State Killer identification, numerous cases have produced a “solve.” ISOGG, a volunteer organization, maintains a page of known cases solved, here.

GEDmatch encourages people to opt-in for law-enforcement matching, meaning that their kit can be seen as a match to kits uploaded by law enforcement agencies or companies working on behalf of law enforcement agencies. If a customer doesn’t opt-in, their kit can’t be seen as a match to a law enforcement kit.

Family Tree DNA initially opted-out all EU kits from law enforcement matching, due to GDPR, and provides the option for their customers to opt-out of law-enforcement matching.

Neither MyHeritage, Ancestry nor 23andMe cooperate with law enforcment under any circumstances and have stated that they will actively resist all subpoenaes in court.

ISOGG provides a FAQ on Investigative Genetic Genealogy, here.

The two sides of the argument have rather publicly waged war on each other in an ongoing battle to convince people of the merits of their side of the equation, including working with news organizations.

Unfortunately, this topic is akin to arguing over politics. No one changes their mind, and everyone winds up mad.

Notice I’m not linking any articles here, not even my own. I do not want to fan these flames, but I would be remiss if I didn’t mention that the topic of law enforcement usage itself, the on-going public genetic genealogy community war and resulting media coverage together have very probably contributed to the lagging sales. I’d also be remiss if I didn’t mention that while a great division of opinion exists, and many people are opposed, there are also many people who are extremely supportive.

All of this, combined, intentionally or not, has introduced FUD, fear, uncertainty and doubt – a very old disinformation “sales technique.”

In a sense, for consumers, this has been like watching pigs mud-wrestle.

As my dad used to say, “Never mud-wrestle with a pig. The pig enjoys it, you get muddy and the spectators can’t tell the difference.” The spectators in this case vote with their lack of spending and no one is a winner.

DNA Testing Was A FAD

Another theory is that genealogy DNA testing was just a FAD whose time has come and gone. I think the FAD was ethnicity testing, and that chicken has come home to roost.

Both 23andMe and Ancestry clearly geared up for testers attracted by their very successful ads. I was just recently on a cruise, and multiple times I heard people at another table discussing their ethnicity results from some unnamed company. They introduced the topic by saying, “I did my DNA.”

The discussion was almost always the same. Someone said that they thought their ethnicity was pretty accurate, someone else said theirs was awful, and the discussion went from there. Not one time did anyone ever mention a company name, DNA matching or any other functionality. I’m not even sure they understood there are different DNA testing companies.

If I was a novice listening-in, based on that discussion, I would have learned to doubt the accuracy of “doing my DNA.”

If most of the people who purchased ethnicity tests understood in advance that ethnicity testing truly is “just an estimate,” they probably wouldn’t have purchased in the first place. If they understood the limitations and had properly set expectations, perhaps they would not have been as unhappy and disenchanted with their results. I realize that’s not very good marketing, but I think that chicken coming home to roost is a very big part of what we’re seeing now.

The media has played this up too, with stories about how the ethnicity of identical twins doesn’t match. If people bother to read more than the headline, and IF it’s a reasonably accurate article, they’ll come to understand why and how that might occur. If not, what they’ll take away is that DNA testing is wrong and unreliable. So don’t bother.

Furthermore, most people don’t understand that ethnicity testing and cousin matching are two entirely different aspects of a DNA test. The “accuracy” of ethnicity is not related to the accuracy of cousin matching, but once someone questions the credibility of DNA testing – their lack of confidence is universal.

I would agree, the FAD is over – meaning lots of people testing primarily for ethnicity. I think the marketing challenge going forward is to show people that DNA testing can be useful for other things – and to make that easy.

Ethnicity was the low hanging fruit and it’s been picked.

Slowed Growth – Not Dead in the Water

The rate of growth has slowed. This does not by any stretch of the imagination mean that genetic genealogy or DNA testing is dead in the water. DNA fishes for us 365x24x7.

For example, just today, I received a message from 23andMe that 75 new relatives have joined 23andMe. I also received match notifications from Family Tree DNA and MyHeritage.  Hey – calorie-free treats!!!

These new matches are nothing to sneeze at. I remember when I was thrilled over ONE new match.

I have well over 100,000 matches if you combine my matches at the four vendors.

Without advanced tools like triangulation, Phased Family Matching, Theories of Family Relativity, ThruLines, DNAPainter, DNAgedcom and Genetic Affairs, I’d have absolutely no prayer of grouping and processing this number of matches for genealogy.

Even if I received no new matches for the next year, I’d still not be finished analyzing the autosomal matches I already have.

This Too Shall Pass

At least I hope it will.

I think people will still test, but the market has corrected. This level of testing is probably the “new normal.”

Neither Ancestry or 23andMe are spending the big ad dollars – or at least not as big.

In order for DNA testing companies to entice customers into purchasing subscriptions or add-on products, tools need to be developed or enhanced that encourage customers to return to the site over and over. This could come in the form of additional results or functionality calculated on their behalf.

That “on their behalf” point is important. Vendors need to focus on making DNA fun, and productive, not work. New tools, especially in the last year or two, have taken a big step in that direction. Make the customer wonder every day what gift is waiting for him or her that wasn’t there yesterday. Make DNA useful and fun!

I would call this “DNA crack.” 😊

Cooking Up DNA Crack!

In order to assist the vendors, I’ve compiled one general suggestion plus what I would consider to be the “Big 3 Wish List” for each of their DNA products in term of features or improvements that would encourage customers to either use or return to their sites. (You’re welcome.)

I don’t want this to appear negative, so I’ve also included the things I like most about each vendor.

If you have something to add, please feel free to comment in a positive fashion.

Family Tree DNA

I Love: Y and Mitochondrial DNA, Phased Family Matching, and DNA projects

General Suggestion – Fix chronic site loading issues which discourage customers

  • Tree Matching – fix the current issues with trees and implement tree matching for DNA matches
  • Triangulation – including by match group and segment
  • Clustering – some form of genetic networks

MyHeritage

I Love: Theories of Family Relativity, triangulation, wide variety of filters, SmartMatches and Record Matches

General – Clarify confusing subscription options in comparative grid format

  • Triangulation by group and segment
  • View DNA matches by ancestor
  • Improved Ethnicity

Ancestry

I Love: Database size, ThruLines, record and DNA hints (green leaves)

General – Focus on the customers’ needs and repeated requests

  • Accept uploads
  • Chromosome Browser (yes, I know this is a dead horse, but that doesn’t change the need)
  • Triangulation (dead horse’s brother)

23andMe

I Love: Triangulation, Ethnicity quality, ethnicity segments identified, painted and available for download

General – Focus on genealogy tools if you’re going to sell a genealogy test

  • Implement individual customer trees – not Family Search
  • Remove 2000 match limit (which is functionally less after 23andMe hides the people not opted into matching)
  • DNA + Tree Matching

Summary

In summary, we, as consumers need to maintain our composure, assuring others that no one’s hair is on fire and the sky really is not falling. We need to calmly educate as opposed to frighten.

Just the facts.

Other approaches don’t serve us in the end. Frightening people away may “win” the argumentative battle of the day, but we all lose the war if people are no longer willing to test.

This is much like a lifeboat – we all succeed together, or we all lose.

Everybody row!

As genealogists, we need to:

  • Focus on verifying ancestors and solving genealogy challenges
  • Sharing those victories with others, including family members
  • Encourage our relatives to test, and transfer so that their testing investment provides as much benefit as possible
  • Offer to help relatives with the various options on each vendor’s platform
  • Share the joy

People share exciting good news with others, especially on Facebook and social media platforms, and feel personally invested when you share new results with them. Collaboration bonds people.

A positive attitude, balanced perspective and excitement about common ancestors goes a very, very long was in terms of encouraging others.

We have more matches now than ever before, along with more and better tools. Matches are still rolling in, every single day.

New announcements are expected at Rootstech in a couple short weeks.

There’s so much opportunity and work to do.

The sky is not falling. It rained a bit.

The seas may have been stormy, but as a genealogist, the sun is out and a rising tide lifts us all.

Rising tide

_____________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Products and Services

Genealogy Research

Fun DNA Stuff

  • Celebrate DNA – customized DNA themed t-shirts, bags and other items

2018 – The Year of the Segment

Looking in the rear view mirror, what a year! Some days it’s been hard to catch your breath things have been moving so fast.

What were the major happenings, how did they affect genetic genealogy and what’s coming in 2019?

The SNiPPY Award

First of all, I’m giving an award this year. The SNiPPY.

Yea, I know it’s kinda hokey, but it’s my way of saying a huge thank you to someone in this field who has made a remarkable contribution and that deserves special recognition.

Who will it be this year?

Drum roll…….

The 2018 SNiPPY goes to…

DNAPainter – The 2018 SNiPPY award goes to DNAPainter, without question. Applause, everyone, applause! And congratulations to Jonny Perl, pictured below at Rootstech!

Jonny Perl created this wonderful, visual tool that allows you to paint your matches with people on your chromosomes, assigning the match to specific ancestors.

I’ve written about how to use the tool  with different vendors results and have discovered many different ways to utilize the painted segments. The DNA Painter User Group is here on Facebook. I use DNAPainter EVERY SINGLE DAY to solve a wide variety of challenges.

What else has happened this year? A lot!

Ancient DNA – Academic research seldom reports on Y and mitochondrial DNA today and is firmly focused on sequencing ancient DNA. Ancient genome sequencing has only recently been developed to a state where at least some remains can be successfully sequenced, but it’s going great guns now. Take a look at Jennifer Raff’s article in Forbes that discusses ancient DNA findings in the Americas, Europe, Southeast Asia and perhaps most surprising, a first generation descendant of a Neanderthal and a Denisovan.

From Early human dispersals within the Americas by Moreno-Mayer et al, Science 07 Dec 2018

Inroads were made into deeper understanding of human migration in the Americas as well in the paper Early human dispersals within the Americas by Moreno-Mayer et al.

I look for 2019 and on into the future to hold many more revelations thanks to ancient DNA sequencing as well as using those sequences to assist in understanding the migration patterns of ancient people that eventually became us.

Barbara Rae-Venter and the Golden State Killer Case

Using techniques that adoptees use to identify their close relatives and eventually, their parents, Barbara Rae-Venter assisted law enforcement with identifying the man, Joseph DeAngelo, accused (not yet convicted) of being the Golden State Killer (GSK).

A very large congratulations to Barbara, a retired patent attorney who is also a genealogist. Nature recognized Ms. Rae-Venter as one of 2018’s 10 People Who Mattered in Science.

DNA in the News

DNA is also represented on the 2018 Nature list by Viviane Slon, a palaeogeneticist who discovered an ancient half Neanderthal, half Denisovan individual and sequenced their DNA and He JianKui, a Chinese scientist who claims to have created a gene-edited baby which has sparked widespread controversy. As of the end of the year, He Jiankui’s research activities have been suspended and he is reportedly sequestered in his apartment, under guard, although the details are far from clear.

In 2013, 23andMe patented the technology for designer babies and I removed my kit from their research program. I was concerned at the time that this technology knife could cut two ways, both for good, eliminating fatal disease-causing mutations and also for ethically questionable practices, such as eugenics. I was told at the time that my fears were unfounded, because that “couldn’t be done.” Well, 5 years later, here we are. I expect the debate about the ethics and eventual regulation of gene-editing will rage globally for years to come.

Elizabeth Warren’s DNA was also in the news when she took a DNA test in response to political challenges. I wrote about what those results meant scientifically, here. This topic became highly volatile and politicized, with everyone seeming to have a very strongly held opinion. Regardless of where you fall on that opinion spectrum (and no, please do not post political comments as they will not be approved), the topic is likely to surface again in 2019 due to the fact that Elizabeth Warren has just today announced her intention to run for President. The good news is that DNA testing will likely be discussed, sparking curiosity in some people, perhaps encouraging them to test. The bad news is that some of the discussion may be unpleasant at best, and incorrect click-bait at worst. We’ve already had a rather unpleasant sampling of this.

Law Enforcement and Genetic Genealogy

The Golden State Killer case sparked widespread controversy about using GedMatch and potentially other genetic genealogy data bases to assist in catching people who have committed violent crimes, such as rape and murder.

GedMatch, the database used for the GSK case has made it very clear in their terms and conditions that DNA matches may be used for both adoptees seeking their families and for other uses, such as law enforcement seeking matches to DNA sequenced during a criminal investigation. Since April 2018, more than 15 cold case investigations have been solved using the same technique and results at GedMatch. Initially some people removed their DNA from GedMatch, but it appears that the overwhelming sentiment, based on uploads, is that people either aren’t concerned or welcome the opportunity for their DNA matches to assist apprehending criminals.

Parabon Nanolabs in May established a genetic genealogy division headed by CeCe Moore who has worked in the adoptee community for the past several years. The division specializes in DNA testing forensic samples and then assisting law enforcement with the associated genetic genealogy.

Currently, GedMatch is the only vendor supporting the use of forensic sample matching. Neither 23anMe nor Ancestry allow uploaded data, and MyHeritage and Family Tree DNA’s terms of service currently preclude this type of use.

MyHeritage

Wow talk about coming onto the DNA world stage with a boom.

MyHeritage went from a somewhat wobbly DNA start about 2 years ago to rolling out a chromosome browser at the end of January and adding important features such as SmartMatching which matches your DNA and your family trees. Add triangulation to this mixture, along with record matching, and you’re got a #1 winning combination.

It was Gilad Japhet, the MyHeritage CEO who at Rootstech who christened 2018 “The Year of the Segment,” and I do believe he was right. Additionally, he announced that MyHeritage partnered with the adoption community by offering 15,000 free kits to adoptees.

In November, MyHeritage hosted MyHeritage LIVE, their first user conference in Oslo, Norway which focused on both their genealogical records offerings as well as DNA. This was a resounding success and I hope MyHeritage will continue to sponsor conferences and invest in DNA. You can test your DNA at MyHeritage or upload your results from other vendors (instructions here). You can follow my journey and the conference in Olso here, here, here, here and here.

GDPR

GDPR caused a lot of misery, and I’m glad the implementation is behind us, but the the ripples will be affecting everyone for years to come.

GDPR, the European Data Protection Regulation which went into effect on May 25,  2018 has been a mixed and confusing bag for genetic genealogy. I think the concept of users being in charge and understanding what is happened with their data, and in this case, their data plus their DNA, is absolutely sound. The requirements however, were created without any consideration to this industry – which is small by comparison to the Googles and Facebooks of the world. However, the Googles and Facebooks of the world along with many larger vendors seem to have skated, at least somewhat.

Other companies shut their doors or restricted their offerings in other ways, such as World Families Network and Oxford Ancestors. Vendors such as Ancestry and Family Tree DNA had to make unpopular changes in how their users interface with their software – in essence making genetic genealogy more difficult without any corresponding positive return. The potential fines, 20 million plus Euro for any company holding data for EU residents made it unwise to ignore the mandates.

In the genetic genealogy space, the shuttering of both YSearch and MitoSearch was heartbreaking, because that was the only location where you could actually compare Y STR and mitochondrial HVR1/2 results. Not everyone uploaded their results, and the sites had not been updated in a number of years, but the closure due to GDPR was still a community loss.

Today, mitoydna.org, a nonprofit comprised of genetic genealogists, is making strides in replacing that lost functionality, plus, hopefully more.

On to more positive events.

Family Tree DNA

In April, Family Tree DNA announced a new version of the Big Y test, the Big Y-500 in which at least 389 additional STR markers are included with the Big Y test, for free. If you’re lucky, you’ll receive between 389 and 439 new markers, depending on how many STR markers above 111 have quality reads. All customers are guaranteed a minimum of 500 STR markers in total. Matching was implemented in December.

These additional STR markers allow genealogists to assemble additional line marker mutations to more granularly identify specific male lineages. In other words, maybe I can finally figure out a line marker mutation that will differentiate my ancestor’s line from other sons of my founding ancestor😊

In June, Family Tree DNA announced that they had named more than 100,000 SNPs which means many haplogroup additions to the Y tree. Then, in September, Family Tree DNA published their Y haplotree, with locations, publicly for all to reference.

I was very pleased to see this development, because Family Tree DNA clearly has the largest Y database in the industry, by far, and now everyone can reap the benefits.

In October, Family Tree DNA published their mitochondrial tree publicly as well, with corresponding haplogroup locations. It’s nice that Family Tree DNA continues to be the science company.

You can test your Y DNA, mitochondrial or autosomal (Family Finder) at Family Tree DNA. They are the only vendor offering full Y and mitochondrial services complete with matching.

2018 Conferences

Of course, there are always the national conferences we’re familiar with, but more and more, online conferences are becoming available, as well as some sessions from the more traditional conferences.

I attended Rootstech in Salt Lake City in February (brrrr), which was lots of fun because I got to meet and visit with so many people including Mags Gaulden, above, who is a WikiTree volunteer and writes at Grandma’s Genes, but as a relatively expensive conference to attend, Rootstech was pretty miserable. Rootstech has reportedly made changes and I hope it’s much better for attendees in 2019. My attendance is very doubtful, although I vacillate back and forth.

On the other hand, the MyHeritage LIVE conference was amazing with both livestreamed and recorded sessions which are now available free here along with many others at Legacy Family Tree Webinars.

Family Tree University held a Virtual DNA Conference in June and those sessions, along with others, are available for subscribers to view.

The Virtual Genealogical Association was formed for those who find it difficult or impossible to participate in local associations. They too are focused on education via webinars.

Genetic Genealogy Ireland continues to provide their yearly conference sessions both livestreamed and recorded for free. These aren’t just for people with Irish genealogy. Everyone can benefit and I enjoy them immensely.

Bottom line, you can sit at home and educate yourself now. Technology is wonderful!

2019 Conferences

In 2019, I’ll be speaking at the National Genealogical Society Family History Conference, Journey of Discovery, in St. Charles, providing the Special Thursday Session titled “DNA: King Arthur’s Mighty Genetic Lightsaber” about how to use DNA to break through brick walls. I’ll also see attendees at Saturday lunch when I’ll be providing a fun session titled “Twists and Turns in the Genetic Road.” This is going to be a great conference with a wonderful lineup of speakers. Hope to see you there.

There may be more speaking engagements at conferences on my 2019 schedule, so stay tuned!

The Leeds Method

In September, Dana Leeds publicized The Leeds Method, another way of grouping your matches that clusters matches in a way that indicates your four grandparents.

I combine the Leeds method with DNAPainter. Great job Dana!

Genetic Affairs

In December, Genetic Affairs introduced an inexpensive subscription reporting and visual clustering methodology, but you can try it for free.

I love this grouping tool. I have already found connections I didn’t know existed previously. I suggest joining the Genetic Affairs User Group on Facebook.

DNAGedcom.com

I wrote an article in January about how to use the DNAGedcom.com client to download the trees of all of your matches and sort to find specific surnames or locations of their ancestors.

However, in December, DNAGedcom.com added another feature with their new DNAGedcom client just released that downloads your match information from all vendors, compiles it and then forms clusters. They have worked with Dana Leeds on this, so it’s a combination of the various methodologies discussed above. I have not worked with the new tool yet, as it has just been released, but Kitty Cooper has and writes about it here.  If you are interested in this approach, I would suggest joining the Facebook DNAGedcom User Group.

Rootsfinder

I have not had a chance to work with Rootsfinder beyond the very basics, but Rootsfinder provides genetic network displays for people that you match, as well as triangulated views. Genetic networks visualizations are great ways to discern patterns. The tool creates match or triangulation groups automatically for you.

Training videos are available at the website and you can join the Rootsfinder DNA Tools group at Facebook.

Chips and Imputation

Illumina, the chip maker that provides the DNA chips that most vendors use to test changed from the OmniExpress to the GSA chip during the past year. Older chips have been available, but won’t be forever.

The newer GSA chip is only partially compatible with the OmniExpress chip, providing limited overlap between the older and the new results. This has forced the vendors to use imputation to equalize the playing field between the chips, so to speak.

This has also caused a significant hardship for GedMatch who is now in the position of trying to match reasonably between many different chips that sometimes overlap minimally. GedMatch introduced Genesis as a sandbox beta version previously, but are now in the process of combining regular GedMatch and Genesis into one. Yes, there are problems and matching challenges. Patience is the key word as the various vendors and GedMatch adapt and improve their required migration to imputation.

DNA Central

In June Blaine Bettinger announced DNACentral, an online monthly or yearly subscription site as well as a monthly newsletter that covers news in the genetic genealogy industry.

Many educators in the industry have created seminars for DNACentral. I just finished recording “Getting the Most out of Y DNA” for Blaine.

Even though I work in this industry, I still subscribed – initially to show support for Blaine, thinking I might not get much out of the newsletter. I’m pleased to say that I was wrong. I enjoy the newsletter and will be watching sessions in the Course Library and the Monthly Webinars soon.

If you or someone you know is looking for “how to” videos for each vendor, DNACentral offers “Now What” courses for Ancestry, MyHeritage, 23andMe, Family Tree DNA and Living DNA in addition to topic specific sessions like the X chromosome, for example.

Social Media

2018 has seen a huge jump in social media usage which is both bad and good. The good news is that many new people are engaged. The bad news is that people often given faulty advice and for new people, it’s very difficult (nigh on impossible) to tell who is credible and who isn’t. I created a Help page for just this reason.

You can help with this issue by recommending subscribing to these three blogs, not just reading an article, to newbies or people seeking answers.

Always feel free to post links to my articles on any social media platform. Share, retweet, whatever it takes to get the words out!

The general genetic genealogy social media group I would recommend if I were to select only one would be Genetic Genealogy Tips and Techniques. It’s quite large but well-managed and remains positive.

I’m a member of many additional groups, several of which are vendor or interest specific.

Genetic Snakeoil

Now the bad news. Everyone had noticed the popularity of DNA testing – including shady characters.

Be careful, very VERY careful who you purchase products from and where you upload your DNA data.

If something is free, and you’re not within a well-known community, then YOU ARE THE PRODUCT. If it sounds too good to be true, it probably is. If it sounds shady or questionable, it’s probably that and more, or less.

If reputable people and vendors tell you that no, they really can’t determine your Native American tribe, for example, no other vendor can either. Just yesterday, a cousin sent me a link to a “tribe” in Canada that will, “for $50, we find one of your aboriginal ancestors and the nation stamps it.” On their list of aboriginal people we find one of my ancestors who, based on mitochondrial DNA tests, is clearly NOT aboriginal. Snake oil comes in lots of flavors with snake oil salesmen looking to prey on other people’s desires.

When considering DNA testing or transfers, make sure you fully understand the terms and conditions, where your DNA is going, who is doing what with it, and your recourse. Yes, read every single word of those terms and conditions. For more about legalities, check out Judy Russell’s blog.

Recommended Vendors

All those DNA tests look yummy-good, but in terms of vendors, I heartily recommend staying within the known credible vendors, as follows (in alphabetical order).

For genetic genealogy for ethnicity AND matching:

  • 23andMe
  • Ancestry
  • Family Tree DNA
  • GedMatch (not a vendor because they don’t test DNA, but a reputable third party)
  • MyHeritage

You can read about Which DNA Test is Best here although I need to update this article to reflect the 2018 additions by MyHeritage.

Understand that both 23andMe and Ancestry will sell your DNA if you consent and if you consent, you will not know who is using your DNA, where, or for what purposes. Neither Family Tree DNA, GedMatch, MyHeritage, Genographic Project, Insitome, Promethease nor LivingDNA sell your DNA.

The next group of vendors offers ethnicity without matching:

  • Genographic Project by National Geographic Society
  • Insitome
  • LivingDNA (currently working on matching, but not released yet)

Health (as a consumer, meaning you receive the results)

Medical (as a contributor, meaning you are contributing your DNA for research)

  • 23andMe
  • Ancestry
  • DNA.Land (not a testing vendor, doesn’t test DNA)

There are a few other niche vendors known for specific things within the genetic genealogy community, many of whom are mentioned in this article, but other than known vendors, buyer beware. If you don’t see them listed or discussed on my blog, there’s probably a reason.

What’s Coming in 2019

Just like we couldn’t have foreseen much of what happened in 2018, we don’t have access to a 2019 crystal ball, but it looks like 2019 is taking off like a rocket. We do know about a few things to look for:

  • MyHeritage is waiting to see if envelope and stamp DNA extractions are successful so that they can be added to their database.
  • www.totheletterDNA.com is extracting (attempting to) and processing DNA from stamps and envelopes for several people in the community. Hopefully they will be successful.
  • LivingDNA has been working on matching since before I met with their representative in October of 2017 in Dublin. They are now in Beta testing for a few individuals, but they have also just changed their DNA processing chip – so how that will affect things and how soon they will have matching ready to roll out the door is unknown.
  • Ancestry did a 2018 ethnicity update, integrating ethnicity more tightly with Genetic Communities, offered genetic traits and made some minor improvements this year, along with adding one questionable feature – showing your matches the location where you live as recorded in your profile. (23andMe subsequently added the same feature.) Ancestry recently said that they are promising exciting new tools for 2019, but somehow I doubt that the chromosome browser that’s been on my Christmas list for years will be forthcoming. Fingers crossed for something new and really useful. In the mean time, we can download our DNA results and upload to MyHeritage, Family Tree DNA and GedMatch for segment matching, as well as utilize Ancestry’s internal matching tools. DNA+tree matching, those green leaf shared ancestor hints, is still their strongest feature.
  • The Family Tree DNA Conference for Project Administrators will be held March 22-24 in Houston this year, and I’m hopeful that they will have new tools and announcements at that event. I’m looking forward to seeing many old friends in Houston in March.

Here’s what I know for sure about 2019 – it’s going to be an amazing year. We as a community and also as individual genealogists will be making incredible discoveries and moving the ball forward. I can hardly wait to see what quandaries I’ve solved a year from now.

What mysteries do you want to unravel?

I’d like to offer a big thank you to everyone who made 2018 wonderful and a big toast to finding lots of new ancestors and breaking down those brick walls in 2019.

Happy New Year!!!

______________________________________________________________

Disclosure

I receive a small contribution when you click on some (but not all) of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

AutoClustering by Genetic Affairs

The company Genetic Affairs launched a few weeks ago with an offer to regularly visit your vendor accounts at Family Tree DNA, Ancestry and 23andMe, and compile a spreadsheet of your matches, download it, and send it to you in an e-mail. They then update your match list at regular intervals of your choosing.

I didn’t take advantage of this, mostly because Ancestry doesn’t provide me with segment information and while 23andMe and Family Tree DNA both do, I maintain a master spreadsheet that the new matches wouldn’t integrate with. Granted, I could sort by match date and add only the new ones to my master spreadsheet, but it was never a priority. That was yesterday.

AutoClustering

That changed this week. Genetic Affairs introduced a new AutoClustering tool that provides users with clustered matches. I’m salivating and couldn’t get signed up quickly enough.

Please note that I’ve cropped the names for this article – the Genetic Affairs display shows you the entire name.

In short, each tiny square node represents a three-way match, between you and both of the people in the intersection of the grid. This does NOT mean they are triangulated, but it does mean there’s a really good chance they would triangulate. Think of this as the Family Tree DNA matrix on steroids and automated.

This tool allows me by using my mother’s test as well to actually triangulate my matches. If they are on my mother’s side of the tree, match me and mother both, and are in the match matrix, they must triangulate on my mother’s side of my tree if they both match me on the same segment.

With this information, I can check the chromosome browser, comparing my chromosomes to those other two individuals in the matrix to see if we share a common segment – or I can simply sort the spreadsheet provided with the AutoCluster results. Suddenly that delivery service is extremely convenient!

No, this service is not free, but it’s quite reasonable. I’m going to step through the process. Note that at times, the website seemed to be unresponsive especially when moving from one step to another. Refreshing the page remedied the problem.

Account Setup

Go to www.geneticaffairs.com. Click on Register to set up your account, which is very easy.

After registering, move to step 2, “Add website.”

Add websites where you have accounts. All of your own profiles plus the other people’s that you manage at both Ancestry and 23andMe are included when you register that site in your profile.

You’ll need your signon information and password for each site.

At Family Tree DNA, you’ll need to add a new website for each account since every account has its own kit number and password.

I added my own account and my mother’s account since mother’s DNA is every bit as relevant to my genealogy as my own, AND, I only received half of her DNA which means she will have many matches that I don’t.

When you’re finished adding accounts, click on “Websites and Profiles” at the top to open the website tab of your choosing and click on the blue circular arrows AutoCluster link. You are telling the system to go out and gather your matches from the vendor and then cluster your matches together, generating an AutoCluster graphic file.

There are several more advanced options, but I’m going to run initially with Approach A, the default level. This will exclude my closest matches. Your closest matches will fall into multiple cluster groups, and the software is not set up to accommodate that – so they will wind up as a grey nonclustered square. That’s not all bad, but you’ll want to experiment to see which parameters are best for you.

If you have half-siblings, you may want to work with alternate settings because that half-sibling is important in terms of phasing your matches to maternal or paternal sides.

Asking me if “I’m sure” always causes me to really sit back and think about what I’ve done. Like, do I want to delete my account. In this case, it’s “overworry” because the system is just asking if you want to spend 25 credits, which is less than a dollar and probably less than a quarter. Right now, you’re using your free initial credits anyway.

The first time you set up an account, Genetic Affairs signs in to your account to assure that your login information is accurate.

I selected my profile and my mother’s profile at Family Tree DNA, plus one profile each at 23andMe and Ancestry. I have two profiles at both 23andMe (V3 and V4) and Ancestry (V1 and V2).

When making my selections, I wasn’t clear about the meaning of “minimum DNA match” initially, but it means fourth cousin and closer, NOT fourth and more distant.

My recommendation until you get the hang of things is to use the first default option, at least initially, then experiment.

Welcome

While I was busy ordering AutoClusters, Genetic Affairs was sending me a welcome e-mail.

Hello Roberta Estes,

Thank you for joining Genetic Affairs! We hope you will enjoy our services.

We have a manual available as well as a frequently asked questions section that both provide background information how to use our website.

You currently have 200 credits which can be supplemented using single payments and/or monthly subscriptions. Check out our prices page for more information concerning our rates.

Please let us know if anything is unclear, we can be reached using the contact form.

The great news is that everyone begins with 200 free credits which may last you for quite some time.  Or not. Consider them introductory crack from your new pusher.

Options

Genetic affairs will sign on your account at either Ancestry, 23andMe or Family Tree DNA, or all 3, periodically and provide you with match information about your new matches at each website. You select the interval when you configure your account. After each update, you can order a new AutoCluster if you wish.

Each update, and each AutoCluster request has a cost in points, sold as credits, associated with the service.

To purchase credits after you use your initial 200, you will need to enter your credit card information in the Settings Page, which is found in the dropdown (down arrow) right beside your profile photo.

You can select from and enroll in several plans.

Prices which varies by how often you want updates to be performed and for how many accounts. To see the various service offerings and cost, click here.

Here’s an example calculation for weekly updates:

This is exactly what I need, so it looks like this service will cost me $2.16 per month, plus any Autoclustering which is 25 credits each time I AutoCluster. Therefore, I’ll add another 100 credits for a total of $3.16 per month.

It looks like the $5 per month package will do for me. But don’t worry about that right now, because you’re enjoying your free crack, um, er, credits.

Ok, the e-mail with my results has just arrived after the longest 10 minutes on earth, so let’s take a look!

The Results E-mail

In a few minutes (or longer) after you order, an e-mail with the autoclustering results will arrive. Check your spam filter. Some of my e-mails were there, and some reports simply had to be reordered. One report never arrived after being ordered 3 times.

The e-mail when it arrives states the following:

Hello Roberta Estes,

For profile Roberta Estes: An AutoCluster analysis has been performed (access it through the attached HTML file).

As requested, cM thresholds of 250 cM and 50 cM were used. A total number of 176 matches were identified that were used for a AutoCluster analysis. There should be two CSV files attached to this email and if enough matches can be clustered, an additional HTML file. The first CSV file contains all matches that were identified. The second CSV file contains a spreadsheet version of the AutoCluster analysis. The HTML file will contain a visual representation of the AutoCluster analysis if enough matches were present for the clustering analysis. Please note that some files might be displayed incorrectly when directly opened from this email. Instead, save them to your local drive and open the files from there.

Attached I found 3 files:

  • Matches list
  • Autocluster grid csv file
  • Autocluster html file that shows the cluster itself

The Match Spreadsheet

The first thing that will arrive in your e-mail is a spreadsheet of your matches for the account you configured and ordered an AutoCluster for.

In the e-mail, your top 20 matches are listed, which initially confused me, because I wondered if that means they are not in the spreadsheet. They are.

At 23andMe, I initially selected 5th cousins and closer, which was the most distant match option provided. I had a total of 1233 matches.

23andMe caps your account at 2000 (unless you have communicated with people who are further than 2000 away, in which case they remain on your list), but you can’t modify the Genetic Affairs profile to include any people more distant than 5th cousins

Note that the 23andMe download shows you information about your match, but NOT the actual matching segment information☹

At Ancestry, I selected 4th cousin and closer and I received a total of 2698 matches. I could select “distant cousin” which would result in additional matches being downloaded and a different autoclustering diagram. I may experiment with this with my V2 account and compare them side by side.

This Ancestry information provides an important clue for me, because the matches I work with are generally only my Shared Ancestor Hints matches. If the Viewed field equals false, this tells  me immediately that I didn’t have a shared ancestor hint – but now because of the clustering, I know where they might fit.

At Family Tree DNA, I selected 4th cousin, but I could have selected 5th cousins. I have a total of 1500 matches.

This report does include the segment information (Yay!) and my only wish here would be to merge the two downloads available at Family Tree DNA, meaning the segment information and the match information. I’d like to know which of these are assigned to maternal or paternal buckets, or both.

AutoClustering

The Autocluster csv file is interesting in that it shows who matches whom. It’s the raw data used to construct the colored grid.

My matches are numbered in their column. For example, person M.B. is person 1. Every person that matches person 1 is noted at left with a 1 in that column.  Look at the second person under the Name column, C. W., who matches person 1 (M.B.), 2 (C.W.), 3 (T.F.), 4 (purple) and 5 (A.D.).

All of these people are in the same cluster, number 3, which you’ll see below.

The AutoCluster Graph

Finally, we get to the meat of the matter, the cluster graph.

Caveat – I experienced a significant amount of difficulty with both my account and my graph. If your graph does not display correctly, save the file to your system and click to open the file from your hard drive. Try Edge or Internet explorer if Chrome doesn’t work correctly. If it still doesn’t display accurately, notify GeneticAffairs at info@geneticaffairs.com. Consider this software release late alpha or early beta. Personally, I’m just grateful for the tool.

When you first open the html file, you’ll be able to see your matches “fly” into place. That’s pretty cool. Actually, that’s a metaphor for what I want all of my genealogy to do.

This grid shows the people who match me and each other as well, so a trio – although this does NOT mean the three of us match on the same segment.

The first person is Debbie, a known cousin on my father’s side. She and all of the other 12 people match me and each other as well and are shown in the orange cluster at the top left.

I know that my common ancestor couple with Debbie is Lazarus Estes and Elizabeth Vannoy, so it’s very likely that all of these same people share the same ancestral line, although perhaps not the same ancestral couple. For example, they could descend from anyone upstream of Lazarus and Elizabeth. Some may have known ancestors on either the Estes or Vannoy side, which will help determine who the actual oldest common ancestors are.

You’ll notice people in grey squares that aren’t in the cluster, but match me and Debbie both. This means that they would fall into two different clusters and the software can’t accommodate that. You may find your closest relatives in this grey never-never-land. Don’t ignore the grey squares because they are important too.

The second green cluster is also on my father’s side and represents the Vannoy line. My common ancestor with several matches is Joel Vannoy and Phoebe Crumley.

Working my way through each cluster, I can discern which common ancestor I match by recognizing my cousins or people who I’ve already shared genealogy with.

The third red cluster is on my mother’s side and I know that it’s my Jacob Lentz and Fredericka Ruhle line. I can verify this by looking at my mother’s AutoCluster file to see if the same people appear in her cluster.

You can also view this grid by name, # of shared matches and the # of shared cMs with the tester. Those displays are nice but not nearly as informative at the AutoClusters.

Scroll for More Match Information

Be sure to scroll down below the grid (yes, there is something below the grid!) and read the text where you’re provided a list of people who qualify to be included in the clusters, but don’t match anyone else at the criteria selection level you chose – so they aren’t included in the grid. This too is informative.  For example, my cousin Christine is there which tells me that our mutual line may not be represented by a cluster. This isn’t surprising, since our common ancestor immigrated in the 1850s – so not a lot of descendants today.

You’re also provided with AutoCluster match information, including whether or not your match has a tree. I do have notes on my matches at Family Tree DNA for several of these people, but unfortunately, the file download did not pick those notes up.

However, the fact that these matches are displayed “by cluster” is invaluable.

You can bet your socks that I’m clicking on the “tree” hotlink and signing on to FTDNA right now to see if any of these people have recognizable ancestors (or surnames) of either Elizabeth Vannoy or Lazarus Estes, or upstream. Some DO! Glory be!

Better yet, their DNA may descend from one of my dead-ends in this line, so I’ll be carefully recording any genealogical information that I can obtain to either confirm the known ancestors or break through those stubborn walls.

Dead ends would become evident by multiple people in the cluster sharing a different ancestor than one you’re already familiar with. Look carefully for patterns. Could this be the key to solving the mystery of who the mother of Nancy Ann Moore is? Or several other brick walls that I’d love to fall, just in time for Christmas. Who doesn’t have brick walls?

By signing on to Family Tree DNA and looking carefully at the trees and surnames of the people in each group, I was able to quickly identify the common line and assign an ancestor to most of the matching groups.

This also means I’ll now be able to make notes on these matches at Family Tree DNA paint these in DNAPainter! (I’ve written several articles about using DNAPainter which you can read by entering DNAPainter into the search box on this blog.)

Mom’s Acadian Cluster

Endogamy is always tough and this tool isn’t any different. Lots of grey squares which mean people would fit into multiple clusters. That’s the hallmark of endogamy.

My Mom’s largest clustered group is Acadian, which is endogamous, and her orange cluster has a very interesting subgroup structure.

If you look, the larger loosely connected orange group extends quite some way down the page, but within that group, there seems to be a large, almost solid orange group in the lower right. I’m betting that almost solid group to the right lower part of the orange region represents a particular ancestral line within the endogamous Acadian grouping.

Also of interest, my Mom’s green cluster is the same as my red Jacob Lentz/Frederica Ruhle cluster group, with many of the same individuals. This confirms that these people match me and that other person on Mom’s side, so whoever in this group matches me and any other person on the same segment is triangulated to my Mom’s side of my genealogy.

You can also use this information in conjunction with your parental bucketing at Family Tree DNA.

In Summary

I’m still learning about this tool, it’s limitations and possibilities. The software is new and not bug-free, but the developer is working to get things straightened out. I don’t think he expected such a deluge of desperate genealogists right away and we’ve probably swamped his servers and his inbox.

I haven’t yet experimented with changing the parameters to see who is included and who isn’t in various runs. I’ll be doing that over the next several days, and I’ll be applying the confirmed ancestral segments I discover in DNAPainter!

This is going to be a lot of fun. I may not surface again until 2019😊

______________________________________________________________

Disclosure

I receive a small contribution when you click on some (but not all) of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research