Genetic Affairs AutoClustering, SuperClusters and brand-new AutoTree tree reconstruction are to-die-for features for traditional genealogists. For adoptees or people seeking unknown parentage, they are the best thing since sliced bread, automating tasks previously peformed manually over labor-filled hours, days and months.
Why Genetic Affairs?
MyHeritage has integrated a version of Genetic Affairs directly into their product offering on the MyHeritage website so every MyHeritage DNA customer receives clustering functionality, free, through MyHeritage, but not tree reconstruction.
GedMatch has also implemented an autocluster version for Tier 1 users, but GedMatch’s version only works at GedMatch, of course, and does not include the new tree reconstruction feature.
This article pertains to the functionality of the features available directly through Genetic Affairs, including:
- Clustering your matches visually to identify ancestral lines of people that match you and each other
- Reports by cluster including common surnames and locations
- Analysis of trees within each cluster to identify common ancestors
- Partially reconstructs trees with your known ancestors for each cluster
- Partially reconstructs trees between your matches even if you don’t have a tree or don’t share the common ancestor
Genetic Affairs provides visualization for linked DNA matches along with critically important clues to help you figure out just how you are related to these people, and these clusters of interrelated people. The Genetic Affairs user manual can be found here.
Each time you run Genetic Affairs is called an analysis. Each analysis scans your kit at the selected vendor(s) for all current matches. A few minutes later, you receive a zip file via e-mail with two or three files depending on your selections at Genetic Affairs and the tree availabilty of the vendor:
- Autocluster file including the visual clusters plus additional information
- Excel spreadsheet of cluster members and relevant information such as common ancestors and common locations
- Tree file containing reconstructed trees (23andMe does not support trees, so no trees are available for 23andMe clusters)
Let’s look at each feature. Grab a cup of coffee and head for the computer.
Selecting Analysis Options
I encourage you to experiment. Selecting a wider range of cM (centimorgans) results in a larger file, but may also mean that the analysis times out.
For this report, I’m utilizing my matches at FamilyTreeDNA and selected a cM range of 50 minimum and 250 maximum. I wanted a minimum cluster size of 2 people, meaning 2 in addition to me. This resulted in 249 total matches that met that criteria and 20 people who met the cM criteria but did not have another person with whom to cluster.
I tried a second analysis using 20 cM – 300 cM resulting in a much larger file with 499 people in the cluster group. Currently, 499 is the maximum that will be processed.
On the Genetic Affairs Profiles page, I can view all of the profiles I manage. Users can schedule updates where Genetic Affairs automatically scans for matches and produces reports.
By clicking on the Autoscan button, you can schedule automated recurring scans with e-mail notification.
You can scan daily, weekly, monthly or never – whatever interval you select.
You can select both the minimum level of DNA match and the minimum cM. The lowest you can select is 9cM.
You can view any e-mails that have been sent to you by Genetic Affairs. The green envelope means that there’s something in your e-mail box. This answers the question about whether the report was completed and sent. If the report has been sent, but is not in your e-mail, check your spam filter.
Starting the Scan
Back on the Genetic Affairs profiles page, you can initiate an autocluster by clicking on the AutoCluster button where you’ll see the options based on which vendor you’ve selected.
For example, at Ancestry, you can include only people in a particular group or only starred matches.
23andMe includes surname enrichment and triangulated groups options.
FamilyTreeDNA and Ancestry both include the “AutoTree – identify common ancestors from trees” option. It’s very important that you click this box if you select the “Default AutoCluster” option – or you won’t get the reconstructed trees.
Of course, you can always run the analysis again.
If you click on the “AutoTree AutoCluster” function, the AutoTree box is already checked for you.
Rule Based AutoCluster
The “Rule based AutoCluster” is a dream-come-true for people seeking unknown parents or ancestors in a relatively recent timeframe.
The “Rule based AutoCluster” provides you with options that allow you to do three things:
- NOT – Exclude your matches with someone else. For example, your mother has tested. You can use the NOT rule to exclude anyone you might match through your mother’s side, providing you with clusters from your father’s side.
- AND – Combine your results with someone else’s. If you have identified a half-sibling, you can view only clusters of only people who match you AND your half sibling.
- OR – Combined rules. You can request a cluster of everyone in clusters with person A but not in a cluster with person B. In this case, if you match a number of half siblings, you can include all of their matches, except people who match them through their “other” parent, if that parent has tested.
Genetic Affairs has provided some graphics and examples here, but you may have to be a member of the site to access this page because the options are customized for you. So I’ll include the non-customized information, below. You can click these to open in a separate window and enlarge.
The “Rule based AutoCluster” explanations provided by Genetic Affairs.
Read the details of how these tools work. They are powerful, so don’t assume you understand without reading carefully.
Now let’s cluster!
Clustering Your Matches
At Genetic Affairs, if you initiate clustering by clicking on the AutoCluster button, you’ll need to put a checkmark in the AutoTree function box. If you began by clicking the AutoTree button, the box is automatically checked for you.
A few minutes later, you’ll receive an email with a zipped file. Save this file to someplace on your computer where you can find it, and open the zipped file by clicking.
You’ll see the files, above.
Click on the chrome AutoCluster HTML file which will display in your browser.
The first thing you will see is your visual autocluster. It’s so much fun to watch your matches “fly” into place!
Each of the people in this cluster are somehow related to the other people in the custer who have cells of the same color. The people with grey cells are included in two clusters – meaning the one to the right and the one above, both.
The names of the matches are listed to the left and above the display.
The legend is to the right.
I have a total of 41 clusters.
Scrolling down the page, each cluster has additional information, and each column is searchable or selectable, including comments I’ve entered at the vendor.
Just by looking at these first 3 matches, I know immediately which side of the family and which ancestors are involved with this cluster. I can look at my notes, to the right, which indicate whether I’ve identified our common ancestor. I paint identified matches at DNAPainter which I’ve entered into the notes field at the vendor.
If I’m signed in to my account at the vendor, I can click on my match’s tree link, above, and take a look. Keep in mind that these people can be related to you, and each other, through multiple ancestors.
You can hover over any person in the grid, above, to view additional information. For each person whose square is grey, indicating membership in (at least) two clusters, you can hover over the grey square and view the members of both clusters. In this case, I’m hovered over the grey square of Brooke and E.H and the black box shows me who is in both people’s clusters.
Note that while a match could be related to you through several ancestors, and hence be in more than 2 clusters, because of the grid nature of clustering, a match can only be displayed in a maximum of 2 different clusters.
Looking at the auto-generated table below, I see the common surnames in cluster 1. Keep in mind that many of these people maybe related to each other through a spouse that you aren’t. Your ancestor’s brother’s children, for example, are also related to each other through your ancestor’s brother’s wife.
I know that Vannoy is the common line, but Upton isn’t my ancestor – at least not that I know of. However, a surname with 20 people in a cluster needs to be investigated and evaluated. Do I have any missing wives in this line? Here’s a really great place to start digging.
In this case, it turns out that one of my ancestor’s children married an Upton, and several of his descendants have tested.
Let’s see what other tools we have.
The Ancestor Spreadsheet
Opening the spreadsheet file, I see several rows and columns.
The common ancestor between the people in the rows is listed at left. The green cells are from my tree.
Two example ancestors are shown above, Mary McDowell and William Harrell, who just happen to have been married to each other.
Scrolling on down, I see rows without green cells.
These people share a common ancestor in their trees, an ancestor that isn’t in my tree. Presumably this is an ancestor I don’t share with them – or one I haven’t identified.
For example, “Bev” and “van” share William Grubb. “Vicki” and “Mark” share Martha Helen Smith. I don’t share either of these ancestors, but Martha Smith married Alvis Winster Bolton, the son of my ancestor – so I know why Martha Helen Smith appears as a common person in the trees of my matches, but not me.
Further down in the same cluster, I notice that one match shares multiple lines in our trees. Therefore, our DNA match could be on either line, or some segments from one line and some from the other.
Scrolling to the bottom of each cluster’s sheet, common locations are provided.
While the designation of “Tennessee” isn’t terribly exciting, scrolling further down provides a list by county, and that IS exciting, especially if you’re chasing a brick wall. Sometimes a group of ancestors in a location where you’re seeking a female’s family is very suggestive especially when combined with ancestral names and surnames.
Let’s move on to the third group of files, Trees.
The Tree File
Click on the tree file and you’ll see the following.
For each cluster where trees can be reconstructed, you’ll see two files for cluster 1:
- Ancestors 1
- Tree 1
Opening the file labeled Ancestors 1, I see the following information for the first ancestor, meaning a common ancestor between the two people listed below that ancestor. You can click to enlarge these images.
Opening the corresponding Tree 1 file, I see that Genetic Affairs has reconstructed the tree between me and the other testers as best it can based on the provided trees.
Looking at the tree for cluster 3, below, I see this line in cluster 1, above, has been extended because Sarah, the pink match and me all share a common ancestor, Elizabeth Shepherd.
Looking at another cluster, below, while I don’t share an ancestor in a tree, three people that I match at a relatively high level do.
As you can see, their common ancestor is Anne Adelaide Chiasson. This is my Acadian line, so our common ancestor or ancestors must be someplace on up that tree, or the result of an undocumented adoption, or a missing ancestor in our trees.
Constructing the trees of your matches to each other, even when you don’t have a common ancestor in your tree, is the best feature of all.
Clustering plus tree reconstruction, especially in combination with the other clues, is the key to breaking through those unyielding brick walls.
Just as I was getting ready to publish this article, Genetic Affairs released a new feature called Super AutoCluster.
I absolutely love this, because it combines your clusters from multiple vendors – today Ancestry, who does not provide segment information, along with Family Tree DNA, who provides invaluable segment information.
This combination can be extremely powerful.
Next, you’ll see the screen confirming the kits to use. The combined autocluster tool is limited to a total of 500 matches, or 250 at each account. However, that’s more than enough to make some great progress.
Press “Perform Analysis.”
Drum roll please…
Voila, your combined cluster.
In this example, you can see the large peach and purple Ancestry clusters. The green red, brown and pink smaller clusters are Family Tree DNA clusters. The Family Tree DNA clusters have tiny little Fs in their cells. If you click the above graphic to enlarge, you can see the Fs.
If you look closely at the cells labeled here with “common names,” you’ll see “N” in the cells indicating a common names for you to check out within that cluster.
The “Common Ancestors” box shows the people who connect to both clusters.
There are also a number of people that span the green and red Family Tree DNA clusters too.
Genetic Affairs then proceeds to combine the clustered DNA matches and trees for you from both vendors.
In addition to the cluster graph and spreadsheet information that now includes combined information, you’ll see a much larger clustered tree.
And again, the best part is that even if you don’t know how you connect to people through trees, their tree and ancestors will be connected, even if you’re absent. You’ll be present in the genetic cluster itself, so you can work the combined tree cluster to see where you might fit in that branch of the family. Because trust me, you do fit – somehow, someplace.
Genetic Affairs uses a “credit” payment system. Your first 200 credits are free so you can learn. These may last you for weeks or months, depending on how often you run the clusters. If you manage multiple kits, you’ll use credits more quickly, but it’s worth every last dollar. Genetic Affairs is very inexpensive. I manage multiple accounts and I spend around $5 per month. You can read about Genetic Affairs’ payment plans and see sample calculations here.
My recommendation is simply to dive in and use your free credits. By the way, I’m gifting myself with a “credit purchase” for Christmas😊
Genetic Affairs is a wonderful genealogy gift idea for serious genealogists, adoptees or people seeking unknown parents or ancestors in recent generations.
Have You Tested or Transferred With All 4 Vendors?
If you haven’t yet tested at or transferred to each of the main 4 vendors, clustering, reconstructed trees and SuperClusters is yet another reason to do so. Additionally, every close relative’s DNA holds hints that yours doesn’t, so be sure to test them too.
I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.
Thank you so much.
DNA Purchases and Free Transfers
- MyHeritage DNA only
- MyHeritage DNA plus Health
- MyHeritage FREE DNA file upload
- 23andMe Ancestry
- 23andMe Ancestry Plus Health
- Legacy Tree Genealogists for genealogy research