Family Tree DNA’s Mitochondrial Haplotree

On September 27th, 2018 Family Tree DNA published the largest Y haplotree in the world, based on SNP tests taken by customers. Now, less than two weeks later, they’ve added an exhaustive mitochondrial DNA (mtDNA) public haplotree as well, making this information universally available to everyone.

Family Tree DNA’s mtDNA Haplotree is based on the latest version of the mtDNA Phylotree. The new Family Tree DNA tree includes 5,434 branches derived from more than 150,000 full sequence results from 180+ different countries of origin. Family Tree DNA‘s tree has SIX TIMES more samples than the Phylotree. Furthermore, Family Tree DNA only includes full sequence results, where Phylotree includes partial results.

This new tree is a goldmine! What does it provide that that’s unique? Locations – lots of locations!

The Official Phylotree

Unlike the Y DNA tree, which is literally defined and constructed by the genetic community, new mitochondrial DNA branches cannot be added to the official mitochondrial Phylotree by Family Tree DNA. Haplogroups, meaning new branches in the form of SNPs are added to the Y tree as new SNPs are discovered and inserted into the tree in their proper location. The mitochondrial DNA phylotree can’t be expanded by a vendor in that manner.

The official mitochondrial Phylotree is maintained at www.phylotree.org and is episodically updated. The most recent version was mtDNA tree build 17, published and updated in February 2016. You can view version history here.

Mitochondrial Phylogenic Tree Version 17

Version 17 of the official mitochondrial tree consists of approximately 5,400 nodes, or branches with a total of 24,275 samples uploaded by both private individuals and academic researchers which are then utilized to define haplogroup branches.

Individuals can upload their own full sequence results from Family Tree DNA, but they must be in a specific format. I keep meaning to write detailed instructions about how to submit your full sequence test results, but so far, that has repeatedly slipped off of the schedule. I’ll try to do this soon.

In a nutshell, download your FASTA file from Family Tree DNA and continue with the submission process here. The instructions are below the submission box, so scroll down.

In any case, the way that new branches are added to the phylotree is when enough new results with a specific mutation are submitted and evaluated, the tree will have a new branch added in the next version. That magic number of individuals with the same mutation was 3 in the past, but now that so many more people are testing, I’m not sure if that number holds, or if it should. Spontaneous mutations can and do happen at the same location. The Phylotree branches mean that the haplogroup defining mutations indicate a common ancestor, not de novo separate mutations. That’s why analysis has to be completed on each candidate branch.

How do Mitochondrial DNA Branches Work?

If you are a member of haplogroup J1c2f today, and a certain number of people in that haplogroup have another common mutation, that new mutation may be assigned the designation of 1, as in J1c2f1, where anyone in haplogroup J1c2f who has that mutation will be assigned to J1c2f1.

While the alternating letter/number format is very easy to follow, some problems and challenges do exist with the alternating letter/number haplogroup naming system.

The Name of the Game

The letter number system works fine if not many new branches are added, branches don’t shuffle and if the growth is slow. However, that’s not the case anymore.

If you recall, back in July of 2012, which is equivalent to the genetic dark ages (I know, right), the Y tree was also represented with the same type of letter number terminology used on the mitochondrial tree today.

For example, Y DNA haplogroup R-M269 was known as R1b1a2, and before that the same haplogroup was known as R1b1c. The changes occurred because so many new haplgroups were being discovered that a new sprout wasn’t added from time to time, but entire branches had to be sawed off and either discarded or grafted elsewhere. It became obvious that while the R1b1a2 version was nice, because it was visually obvious that R1b1a2a was just one step below R1b1a2, that long term, that format just wasn’t going to be able to work anymore. New branches weren’t just sprouting, wholesale shuffling was occurring. Believe it or not, we’re still on the frontier of genetic science.

In 2012, the change to the SNP based haplogroup designations was introduced by Family Tree DNA, and adopted within the community.

The ISOGG tree, the only tree that still includes the older letter/number system and creates extended letter number haplogroup names as new SNPs are added provides us with an example of how much the Y tree has grown.

You can see that the letter/number format haplogroups to the far right are 19 locations in length. The assigned SNP or SNPs associated with that haplogroup are shown as well. Those 19-digit haplogroup names are just too unwieldy, and new haplogroups are still being discovered daily.

It’s 2012 All Over Again

That’s where we are with mitochondrial DNA today, but unlike Y DNA naming, a vendor can’t just make that change to a terminal SNP based naming system because all vendors conform to the published Phylotree.

However, in this case, the vendor, Family Tree DNA has more than 6 times the number of full sequence mitochondrial results than the mitochondrial reference model Phylotree. If you look at the haplogroup projects at Family Tree DNA, you’ll notice that (some) administrators routinely group results by a specific mutation that is found within a named haplogroup, meaning that the people with the mutation form a subgroup that they believe is worthy of its own haplogroup subgroup name. The problem is that unless enough people upload their results to Phylotree, that subgroup will never be identified, so a new haplogroup won’t be added.

If the entire Family Tree DNA data base were to be uploaded to Phylotree, can you imagine how many new haplogroups would need to be formed? Of course, Family Tree DNA can’t do that, but individual testers can and should.

Challenges for Vendors

The challenge for vendors is that every time the phylotree tree is updated and a new version is produced, the vendors must “rerun” their existing tester samples against the new haplogroup defining mutations to update their testers’ haplogroup results.

In some cases, entire haplogroups are obsoleted and branches moved, so it’s not a simple matter of just adding a single letter or digit. Rearranging occurs, and will occur more and more, the more tests that are uploaded to Phylotree.

For example, in the Phylotree V17 update, haplogroup A4a1 became A1a. In other words, some haplogroups became entirely obsolete and were inserted onto other branches of the tree.

In the current version of the Phylotree, haplogroup A4 has been retired.

Keep in mind that all haplogroup assignments are the cumulative combination of all of the upstream direct haplogroups. That means that haplogroup A4a1, in the prior version, had all of the haplogroup defining mutations shown in bold in the chart below. In the V17 version, haplogroup A1a contains all of the mutations shown in bold red. You might notice that the haplogroup A4 defining mutation T16362C is no longer included, and haplogroup A4, plus all 9 downstream haplogroups which were previously dependent on T16362C have been retired. A4a1 is now A1a.

Taking a look at the mitochondrial tree in pedigree fashion, we can see haplogroup A4a1 in Build 15 from September 2012, below.

Followed by haplogroup A1a in the current Build 17.

Full Sequence Versus Chip Based Mitochondrial Testing

While Family Tree DNA tests the full sequence of their customers who purchase that level of testing, other vendors don’t, and these changes wreak havoc for those vendors, and for compatibility for customer attempting to compare between data bases and information from different vendors.

That means that without knowing which version of Phylotree a vendor currently uses, you may not be able to compare meaningfully with another user, depending on changes that occurred that haplogroup between versions. You also need to know which vendor each person utilized for testing and if that vendor’s mitochondrial results are generated from an autosomal style chip or are actually a full mitochondrial sequence test. Utilizing the ISOGG mtDNA testing comparison chart, here’s a cheat sheet.

Vendor No Mitochondrial Chip based haplogroup only mitochondrial Full Sequence mitochondrial
Family Tree DNA No Yes – V17
23andMe Yes – Build V7 No
Ancestry None
LivingDNA Yes – Build V17 No
MyHeritage None
Genographic V2 Yes – Build V16 No

Of the chip-based vendors, 23andMe is the most out of date, with V7 extending back to November of 2009. The Genographic Project has done the best job of updating from previous versions. LivingDNA entered the marketplace in 2016, utilizing V17 when they began.

Family Tree DNA’s mitochondrial test is not autosomal chip based, so they don’t encounter the problem of not having tested needed locations because they test all locations. They have upgraded their customers several times over the years, with the current version being V17.

Family Tree DNA’s mitochondrial DNA test is a separate test from their Family Finder autosomal test while the chip-based vendors provide a base-level haplogroup designation that is included in their autosomal product. However, for chip-based vendors, updating that information can be very challenging, especially when significant branch changes occur.

Let’s take a closer look.

Challenges for Autosomal Chip-Based Vendors Providing Mitochondrial Results

SNP based mitochondrial and Y DNA testing for basic haplogroups that some vendors include with autosomal DNA is a mixed blessing. The up side, you receive a basic haplogroup. The down aide, the vendor doesn’t test anyplace near all of the 16,569 mitochondrial DNA SNP locations.

I wrote in detail about how this works in the article, Haplogroup Comparisons Between Family Tree DNA and 23andMe. Since that time, LivingDNA has also added some level of haplogroup reporting through autosomal testing.

How does this work?

Let’s say that a vendor tests approximately 4000 mitochondrial DNA SNPs on the autosomal chip that you submit for autosomal DNA testing. First, that’s 4000 locations they can’t use for autosomal SNPs, because a DNA chip has a finite number of locations that can be utilized.

Secondly, and more importantly, it’s devilishly difficult to “predict” haplogroups at a detailed level correctly. Therefore, some customers receive a partial haplogroup, such as J1c, and some receive more detail.

It’s even more difficult, sometimes impossible, to update haplogroups when new Phylotree versions are released.

Why is Haplogroup Prediction and Updating so Difficult?

The full mitochondrial DNA sequence is 16,569 locations in length, plus or minus insertions and deletions. The full sequence test does exactly what that name implies, tests every single location.

Now, let’s say, by way of example, that location 10,000 isn’t used to determine any haplogroup today, so the chip-based vendors don’t test it. They only have room for 4000 of those locations on their chip, so they must use them wisely. They aren’t about to waste one of those 4000 spaces on a location that isn’t utilized in haplogroup determination.

Let’s say in the next release, V2, that location 10,000 is now used for just one haplogroup definition, but the haplogroup assignment still works without it. In other words, previously to define that haplogroup, location 9000 was used, and now a specific value at location 10,000 has been added. Assuming you have the correct value at 9,000, you’re still golden, even if the vendor doesn’t test location 10,000. No problem.

However, in V3, now there are new haplogroup subgroups in two different branches that use location 10,000 as a terminal SNP. A terminal SNP is the last SNP in line that define your results most granularly. In haplogroup J1c2f, the SNP(s) that define the f are my terminal SNPs. But if the vendor doesn’t test location 10,000, then the mutation there can’t be used to determine my terminal SNP, and my full haplogroup will be incomplete. What now?

If location 10,000 isn’t tested, the vendor can’t assign those new haplogroups, and if any other haplogroup branch is dependent on this SNP location, they can’t be assigned correctly either. Changes between releases are cumulative, so the more new releases, the further behind the haplogroup designations become.

Multiple problems exist:

  • Even if those vendors were to recalculate their customer’s results to update haplogroups, they can’t report on locations they never tested, so their haplogroup assignments become increasingly outdated.
  • To update your haplogroup when new locations need to be tested, the vendor would have to actually rerun your actual DNA test itself, NOT just update your results in the data base. They can’t update results for locations they didn’t test.
  • Without running the full mitochondrial sequence, the haplogroup can never be more current than the locations on the vendor’s chip at the time the actual DNA test is run.
  • No vendor runs a full sequence test on an autosomal chip. A full mitochondrial sequence test at Family Tree DNA is required for that.
  • Furthermore, results matching can’t be performed without the type of test performed at Family Tree DNA, because people carry mutations other than haplogroup defining mutations. Haplogroup only information is entertaining and can sometimes provide you with base information about the origins of your ancestor (Native, African, European, Asian,) but quickly loses its appeal because it’s not specific, can’t be used for matching and can’t reliably be upgraded.

The lack of complete testing also means that while Family Tree DNA can publish this type of tree and contribute to science, the other vendors can’t.

Let’s take a look at Family Tree DNA’s new tree.

Finding the Tree

To view the tree, click here, but do NOT sign in to your account. Simply scroll to the bottom of the page where you will see the options for both the Y DNA Haplotree and the mtDNA Haplotree under the Community heading.

Click on mtDNA Haplotree.

If you are a Family Tree DNA customer, you can view both the Y and mitochondrial trees from your personal page as well. You don’t have to have taken either the Y or mitochondrial DNA tests to view the trees.

Browsing the mtDNA Tree

Across the top, you’ll see the major haplogroups.

I’m using haplogroup M as an example, because it’s far up the tree and has lots of subgroups. Only full sequence results are shown on the tree.

The basic functionality of the new mitochondrial tree, meaning how it works, is the same as the Y tree, which I wrote about in the Family Tree DNA’s PUBLIC Y DNA Haplotree.

You can view the tree in two formats, countries or variants, in the upper left-hand corner. View is not the same thing as search.

When viewing the mitochondrial DNA phylotree by country, we see that haplogroup M has a total of 1339 entries, which means M and everything below M on the tree.

However, the flags showing in the M row are only for people whose full mitochondrial sequence puts them into M directly, with no subgroup.

As you can see, there are only 12: 6 people in Australia, and one in 5 other countries. These are the locations of the most distant known ancestor of those testers. If they have not completed the maternal Country of Origin on the Earliest Known Ancestor tab, nothing shows for the location.

Viewing the tree by variant shows the haplogroup defining mutations, but NOT any individual mutations beyond those that are haplogroup defining.

For each haplogroup, click on the three dots to the right to display the country report for that haplogroup.

The Country Report

The Country Report provides three columns.

The column titled Branch Participants M shows only the total of people in haplogroup M itself, with no upstream or downstream results, meaning excluding M1, M2, etc. Just the individuals in M itself. Be sure to note that there may be multiple pages to click through, at bottom right.

The second column, Downstream Participants – M and Downstream (Excluding other Letters) means the people in haplogroup M and M subclades. You may wonder why this column is included, but realize that branches of haplogroup M include haplogroups G, Q, C, Z, D and E. The middle column only includes M and subgroups that begin with M, without the others, meaning M, M10, M11 but not G, Q, etc.

Of course the final column, All Downstream Participants – M and Downstream (Including other Letters) shows all of the haplogroup M participants, meaning M and all subclades, including all other haplogroups beneath M, such as M10, G, Q, etc..

What Can I Do with This Information?

Unlike the companion Y tree DNA, since surnames change every generation for maternal lineages, there is no requirement to have multiple matching surnames on a branch to be displayed.

Therefore, every person who includes a location for a most distant known ancestor is included in the tree, but surnames are not.

I want to see, at a glance, where the other people in my haplogroup, and the haplogroups that are the “direct ancestral line” of mine are found today. Clusters may mean something genealogically or are at least historically important – and I’ll never be able to view that information any other way. In fact, before this tree was published, I wasn’t able to see this at all. Way to go Family Tree DNA!!

It’s very unlikely that I’ll match every person in my haplogroup – but the history of that haplogroup and all of the participants in that haplogroup are important to that historical lineage of my family. At one time, these people all shared one ancestor and determining when and where that person lived is relevant to my family story.

Searching for Your Haplogroup

I’m searching for haplogroup J1c2f by entering J1c2f in the “Go to Branch Name.”

There it is.

I can see that there are 17 people in Sweden, 13 in Norway, 5 in Germany, 3 in Russia, etc. What’s with the Scandinavian cluster? My most distant known ancestor was found in Germany. There’s something to be learned here that existing records can’t tell me!

The mother branch is J1c2 which shows the majority of individuals in Ireland followed by England. This probably suggests that while J1c2f may have been born in Scandinavia, J1c2 probably was not. According to the supplement to Dr. Doron Behar’s paper, A “Copernican” Reassessment of the Human Mitochondrial DNA tree from its Root, which provides ages for some mitochondrial DNA haplogroups:

Haplogroup How Old Standard Deviation Approximate Age Range in Years
J1c2 9762 2010 7,752 – 11,772
J1c2f 1926 3128 500 – 5,054

I happen to know from communicating with my matches that the haplogroup J1c2f was born more than 500 years ago because my Scandinavian mito-cousins know where their J1c2f cousin was then, and so do I. Mine was in Germany, so we know our common ancestor existed sometime before that 500 year window, and based on our mutations and the mutation tree we created, probably substantially before that 500 year threshold.

Given that J1c2, which doesn’t appear to have been born in Scandinavia is at least 7,700 years old, we can pretty safely conclude that my ancestor wasn’t in Scandinavia roughly 9,000 years ago, but was perhaps 2,000 years, ago when J1c2f was born. What types of population migration and movement happened between 2,000 and 9,000 years ago which would have potentially been responsible for the migration of a people from someplace in Europe into Scandinavia.

The first hint might be that in the Nordic Bronze Age, trade with European cultures became evident, which of course means that traders themselves were present. Scandinavian petroglyphs dating from that era depict ships and art works from as far away as Greece and Egypt have been found.

The climate in Scandinavia was warm during this period, but later deteriorated, pushing the Germanic tribes southward into continental Europe about 3000 years ago. Scandinavian influence was found in eastern Europe, and numerous Germanic tribes claimed Scandinavian origins 2000 years ago, including the Bergundians, Goths, Heruls and Lombards.

Hmmm, that might also explain how my mitochondrial DNA, in the form of my most distant known ancestor arrived in Germany, as well as the distribution into Poland.

Is this my family history? I don’t know for sure, but I do know that the clustering information on the new phylotree provides me with clustering data to direct my search for a historical connection.

What Can You Do?

  • Take a full mitochondrial DNA test. Click here if you’d like to order a test or if you need to upgrade your current test.
  • Enter your Earliest Known Ancestor on the Genealogy tab of your Account Information, accessed by clicking the “Manage Personal Information” beneath your profile photo on your personal page.

The next few steps aren’t related to actually having your results displayed on the phylotree, but they are important to taking full advantage of the power of testing.

  • While viewing your account information, click on the Privacy and Sharing tab, and select to participate in matching, under Matching Preferences.

  • Also consent to Group Project Sharing AND allow your group project administrators to view your full sequence matches so that they can group you properly in any projects that you join. You full sequence mutations will never be shown publicly, only to administrators.

Of course, always click on save when you’re finished.

  • Enter your most distant ancestor information on your Matches Map page by clicking on the “Update Ancestor’s Location” beneath the map.

  • Join a project relevant to your haplogroup, such as the J project for haplogroup J. To join a project, click on myProjects at the top of the page, then on Join Projects.

  • To view available haplogroup projects, scroll down to the bottom of the screen that shows you available projects to join, and click on the letter of your haplogroup in the MTDNA Haplogroup Projects section.

  • Locate the applicable haplogroup, then click through to join the project.

These steps assure that you’ve maximized the benefits of your mitochondrial results for your own research and to your matches as well. Collaborative effort in completing geographic and known ancestor information means that we can all make discoveries.

The article, Working with Mitochondrial DNA Results steps you through you all of the various tools provided to Family Tree DNA testers.

Now, go and see who you match, where your closest matches cluster, and on the new mtDNA Haplotree, what kind of historical ancestral history your locations may reveal. What’s waiting for you?

_____________________________________________________________________

Standard Disclosure

This standard disclosure appears at the bottom of every article in compliance with the FTC Guidelines.

I provide Personalized DNA Reports for Y and mitochondrial DNA results for people who have tested through Family Tree DNA. I provide Quick Consults for DNA questions for people who have tested with any vendor. I would welcome the opportunity to provide one of these services for you.

Hot links are provided to Family Tree DNA, where appropriate. If you wish to purchase one of their products, and you click through one of the links in an article to Family Tree DNA, or on the sidebar of this blog, I receive a small contribution if you make a purchase. Clicking through the link does not affect the price you pay. This affiliate relationship helps to keep this publication, with more than 900 articles about all aspects of genetic genealogy, free for everyone.

I do not accept sponsorship for this blog, nor do I write paid articles, nor do I accept contributions of any type from any vendor in order to review any product, etc. In fact, I pay a premium price to prevent ads from appearing on this blog.

When reviewing products, in most cases, I pay the same price and order in the same way as any other consumer. If not, I state very clearly in the article any special consideration received. In other words, you are reading my opinions as a long-time consumer and consultant in the genetic genealogy field.

I will never link to a product about which I have reservations or qualms, either about the product or about the company offering the product. I only recommend products that I use myself and bring value to the genetic genealogy community. If you wonder why there aren’t more links, that’s why and that’s my commitment to you.

Thank you for your readership, your ongoing support and for purchasing through the affiliate link if you are interested in making a purchase at Family Tree DNA, or one of the affiliate links below:

Affiliate links are limited to:

DNAPainter: Painting Leeds Method Matches

Last week, I wrote about how I utilized the Leeds Method in the article, The Leeds Method. What I didn’t say is that I was sizing up the Leeds Method for how I could use the technique to paint additional segments of my chromosomes.

The Leeds Method divides your matches into four groups, one attributable to each grandparent. That means those matches can be painted to your four sets of great-grandparents, assuming you can identify the maternal and paternal groups. Hint – Y and mitochondrial DNA matching or haplogroups may help if you have no better hints.

For genealogists who know who their grandparents are, testing close relatives and cousins is a must in order to be able to associate matches with your four grandparents’ lines.

Please note that the Leeds method generates hints for genealogists by grouping people according to common matches. We must further evaluate those matches by doing traditional genealogy and by looking for segments that triangulate. The Leeds method in conjunction with the actual match results at vendors, combined with DNAPainter helps us do just that.

Utilizing DNAPainter

Since I’ve been able to sort matches into maternal and paternal “sides” using the Leeds Method, which in essence parentally phases the matches, I can use DNAPainter to paint them. Here are my four articles I wrote about how to utilize DNAPainter.

DNAPainter – Chromosome Sudoku for Genetic Genealogy Addicts 
DNAPainter – Touring the Chromosome Garden 
DNAPainter – Mining Vendor Matches to Paint Your Chromosomes 
Proving or Disproving a Half Sibling Relationship Using DNAPainter

Combining the Two Tools

DNAPainter has the potential to really utilize the Leeds Method results, other than Ancestry matches of course. Ancestry does not provide segment information. (Yes, I know, dead horse but I still can’t resist an occasional whack.)

You’re going to utilize your spreadsheet groupings to paint the DNA from each individual match at the vendors to DNAPainter.

On the spreadsheet, if these matches are from Family Tree DNA, MyHeritage, 23andMe or GedMatch, you’ll copy the matching segments from that vendor and paint those matching segments at DNAPainter. I explained how to do that in the articles about DNAPainter.

I do not use mass uploads to DNAPainter, because it’s impossible to assign those to different sides of your tree or ancestors. I paint individual matches, including information about the match and what I know about the history of the segment itself or associated ancestor.

I only paint segments that I can identify with certainty as maternal or paternal.

Pushing Back in Time

Based on which segments of identified ancestors the Leeds matches overlap with at DNAPainter, I can push that segment information further back in time. The blessing of this is that these Leeds matches may well fill in several blanks in my chromosome that are not yet painted by people with whom I share identified ancestors.

Even if your maternal and paternal grandparents are intermarried on each side, as long as they are not intermarried across your parental lines (meaning mother & father,) then the Leeds Method will work fine for painting. Even if you think you are attributing a segment to your paternal grandmother, for example, and the person actually matches through your paternal grandfather, you’ve still painted them on the correct chromosome – meaning your paternal chromosome. As you build up that chromosome with matches, you’ll see soon enough if you have 9 matches attributed to John Doe and one to Jane Smith, the Jane Smith match is likely incorrectly attributed, those two lines are somehow interrelated or it’s a false positive match.

Because I work with only fairly large Leeds matches – nothing below 30 cM, I sometimes receive a nice gift in terms of painting large previously unpainted segments – like the one on my mother’s side, below.

Look at this large green segment on chromosome 19 that I painted thanks to one of the Leeds matches, Harold. (Note that the two long blue and brown bars at the bottom of each chromosome are my ethnicity, not matches.) Another benefit is that if a Leeds match matches on already identified segments assigned to ancestors, I’ve just identified which ancestral lines I share with that match.

The green Ferverda side match to Roland through the Leeds Method aligns partially with a segment already known to descend from Jacob Lentz and Frederica Ruhle who were born in the 1780s. I’m related to Roland somehow through that line, and by just looking at his (redacted here) surname, I *think* I know how, even though he doesn’t have a tree online. How cool is that!

Important Notes for DNAPainter

Word of caution here. I would NOT paint anyone who falls into multiple match groups without being able to identify ancestors. Multiple match groups may indicate multiple ancestors, even if you aren’t aware of that.

Each segment has its own history, so it’s entirely possible that multiple match groups are accurate. It’s also possible that to some extent, especially with smaller segments, that matches by chance come into play. That’s why I only work with segments above 30 cM when using the Leeds method where I know I’m safe from chance matches. You can read about identical by descent (IBD) and identical by chance (IBC) matches here.

What a DNAPainter Leeds Match Means

It’s very important to label segments in DNAPainter with the fact that the source was through the Leeds Method.

These painted matches DO NOT MEAN that the match descends from the grandparent you are associating with the match.

It means that YOU inherited your common DNA with this match FROM that grandparent. It suggests that your match descends from one of the ancestors of this couple, or possibly from your great-grandparents, but you don’t necessarily share this great-grandparent couple with your match.

That’s different than the way I normally paint my chromosomes – meaning only when a specific common ancestor has been identified. For someone painted from matches NOT identified through the Leeds Method, if I know the person descends from a grandparent, I paint them to the great-grandparent couple. People painted through the Leeds Method don’t necessarily share that couple, but do share an ancestor of that couple.

When I paint using the Leeds method, I’m assigning the match to a set of great-grandparents because I can’t genealogically identify the common ancestor further upstream, so I’m letting genetics tell me which genealogical quadrant they fall into on my tree. With the Leeds Method, I can tell which grandparent I inherited that DNA through. In my normal DNAPainter methodology, I ONLY paint matches when I’ve identified the common ancestor – so Leeds Method matches would not previously have qualified.

I don’t mean to beat this to death and explain it several ways – but it’s really important to understand the difference and when looking back, understand why you painted what you did.

Labeling Leeds Match Painted Segments

Therefore, with Leeds Method match painting, I identify the match name as “John Doe FTDNA Leeds-Ferverda” which tells me the matches name (John Doe,) where they tested (FTDNA) and why I painted them (Ferverda column in my Leeds spreadsheet,) even though I don’t know for sure which ancestor we actually have in common. I paint them to the parents of my Ferverda grandfather. Not John Ferverda, my grandfather, but to his parents, Hiram Ferverda and Eva Miller. I know I received my matching DNA through one of them – I just don’t know which person of that couple yet.

However, looking at who else is assigned to that segment with an identified common ancestor will tell me where in my tree that segment originated – for me. We still don’t know where in your matches tree that segment originated.

“Match To” Issues

Lastly, if you happen to select a “match to” person to represent one of your grandparent matches that just happens to be descended from two grandparent lines, you’ve had your bad luck for the month. Remember, your “match to” person is the first person (closest match) that hasn’t yet been grouped, so you don’t really select them. If you realize you’re getting goofy results, stop and undo those results, then select the next candidate as your “match to” person.

At one vendor, when I selected the first person who hadn’t yet been grouped and used them for the red column which turned out to be Bolton, about half of them overlapped with Estes segments that I’ve already painted and confirmed from several sources. Obviously, there’s a problem someplace, and I’m guessing it just happens to be the luck of the draw with the “match to” person being descended from both lines. The lines both lived in the same county for generations. I need to redo that section with someone whose tree I know positively descends from the Bolton line and does NOT intersect with another of my lines. However, I was able to identify that this issue existed because I’ve already painted multiple ancestor-confirmed cousins who carry those same segments – and I know where they came from.

These tools are just that – tools and require some level of analytical skill and common sense. In other words, it’s a good idea to stay with larger matches and know when to say “uh-oh.” If it doesn’t feel right, don’t paint it.

Breaking Down Distant Brick Walls

I’m still thinking about how to use the Leeds Method, probably in combination with DNAPainter, to break down brick walls. My brick walls aren’t close in time. Most of them are several generations back and revolve around missing female surnames, missing records or ancestors appearing in a new location with no ability to connect them back to the location/family they left.

In essence, I would need to be able to isolate the people matching that most distant ancestor couple, then look for common surnames and ancestors within that match group. The DNAGedcom.com client which allows you to sort matches by surname might well be an integral piece of this puzzle/solution. I’ll have to spend some time to see how well this works.

Solving this puzzle would be entirely dependent on people uploading their trees.

If you have thoughts on how to use these tools to break down distant brick walls, or devise a methodology, please let me know.

And if you haven’t uploaded your tree, please do.

Would I Do The Leeds Method Again?

Absolutely, at least for the vendors who provide segment information.

I painted 8 new Leeds matches from Family Tree DNA on my Ferverda grandparent side which increased the number of painted segments at DNAPainter from 689 to 704, filled in a significant number of blank spaces on my chromosomes, and took my total % DNA painted from 60 to 61%. I added the rest of my Leeds hints from Family Tree DNA of 30 cM or over, and increased my painted segments to 734 and my percentage to 62% I know that 1 or 2% doesn’t sound like a very big increase, but it’s scientific progress.

It’s more difficult to increase the number of new segments after you’ve painted much of your genome because many segments overlap segments already painted. So, a 2% increase is well worth celebrating!

Having said that, I would love for the vendors to provide this type of clustering so I don’t have to. To date, Family Tree DNA is the only vendor who does any flavor of automatically bucketing results in this fashion – meaning paternal and maternal, which is half the battle. I would like to see them expand to the four grandparents from the maternal/paternal matching they provide today.

We’ve been asking Ancestry for enhanced tools for years. There’s no reason they couldn’t in essence do what Dana has done along with provide the DNAgedcom.com search functionality. And yes…I still desperately want a chromosome browser or at least segment information.

I will continue to utilize the Leeds Method, at least with vendors other than Ancestry because it allows me to incorporate the results with DNAPainter. It’s somehow ironic that I started out grouping the Ancestry results, but wound up realizing that the results from other vendors, specifically Family Tree DNA and MyHeritage are significantly more useful due to the segment data and combined tools.

Getting the Most Bang for Your Buck

If you tested at Ancestry or 23andMe, I would strongly encourage you to download your raw data file from both of these vendors and transfer to Family Tree DNA, MyHeritage and GedMatch to get the most out of your DNA tests. Here is the step-by-step guide for how to download your DNA from Ancestry.

The uploads to those three locations are free. All tools are free at MyHeritage until December 1, 2018 when they will begin charging for more advanced tools. The upload is free at Family Tree DNA and the advanced tools, including the chromosome browser, only require a $19 unlock.

Here is the step-by-step guide for uploading to MyHeritage and to Family Tree DNA. Fishing in every pond is critically important. You never know what you’re missing otherwise!

How many segments of your DNA can you paint using the Leeds Method in combination with DNA Painter?

_____________________________________________________________________

Standard Disclosure

This standard disclosure appears at the bottom of every article in compliance with the FTC Guidelines.

I provide Personalized DNA Reports for Y and mitochondrial DNA results for people who have tested through Family Tree DNA. I provide Quick Consults for DNA questions for people who have tested with any vendor. I would welcome the opportunity to provide one of these services for you.

Hot links are provided to Family Tree DNA, where appropriate. If you wish to purchase one of their products, and you click through one of the links in an article to Family Tree DNA, or on the sidebar of this blog, I receive a small contribution if you make a purchase. Clicking through the link does not affect the price you pay. This affiliate relationship helps to keep this publication, with more than 900 articles about all aspects of genetic genealogy, free for everyone.

I do not accept sponsorship for this blog, nor do I write paid articles, nor do I accept contributions of any type from any vendor in order to review any product, etc. In fact, I pay a premium price to prevent ads from appearing on this blog.

When reviewing products, in most cases, I pay the same price and order in the same way as any other consumer. If not, I state very clearly in the article any special consideration received. In other words, you are reading my opinions as a long-time consumer and consultant in the genetic genealogy field.

I will never link to a product about which I have reservations or qualms, either about the product or about the company offering the product. I only recommend products that I use myself and bring value to the genetic genealogy community. If you wonder why there aren’t more links, that’s why and that’s my commitment to you.

Thank you for your readership, your ongoing support and for purchasing through the affiliate link if you are interested in making a purchase at Family Tree DNA, or one of the affiliate links below:

Affiliate links are limited to:

The Leeds Method

This is the first in a series of two articles. This article explains the Leeds Method and how I created a Leeds Spreadsheet in preparation for utilizing the results in DNAPainter. I stumbled around a bit, but I think I’ve found a nice happy medium and you can benefit from my false starts by not having to stumble around in the dark yourself. Of course, I’m telling you about the pitfalls I discovered.

The second article details the methodology I utilized to paint these matches, because they aren’t quite the same as “normal” matching segments with identified ancestors.

Welcome to the Leeds Method

Dana Leeds developed a novel way to utilize a spreadsheet for grouping your matches from second through fourth cousins and to assign them to “grandparent” quadrants with no additional or previous information. That’s right, this method generates groupings that can be considered good hints without any other information at all.

Needless to say, this is great for adoptees and those searching for a parent.

It’s also quite interesting for genetic genealogists as well. One of the best aspects is that it’s very easy to do and very visual. Translation – no math. No subtraction.

Caveat – it’s also not completely accurate 100% of the time, especially when you are dealing with more distant matches, intermarriage and/or endogamy. But there are ways to work around these issues, so read on!

You can click to enlarge any image.

I’ll be referring to this graphic throughout this article. It shows the first several people on my Ancestry match list, beginning with second cousins, using pseudonyms. I chose to use Ancestry initially because they don’t provide chromosome browsers or triangulation tools, so we need as much help there as we can get.

I’ve shown the surnames of my 4 grandparents in the header columns with an assigned color, plus a “Weird group” (grey) that doesn’t seem to map to any of the 4. People in that group are much more distant in my match list, so they aren’t shown here.

I list the known “Most Common Recent Ancestor,” when identified, along with the color code that so I can easily see who’s who.

All those blanks in the MCRA column – those are mostly people without trees. Just think how useful this would be if everyone who could provide a tree did!

What Does the Leeds Method Tell You?

The Leeds Method divides your matches into four colored quadrants representing each grandparent unless your genealogical lines are heavily intermarried. If you have lots of people who fall into both of two (or more) colors, that probably indicates intermarriage or a heavily endogamous population.

In order to create this chart, you work with your closest matches that are 2nd cousins or more distant, but no more distant than 4th cousins. For endogamous people, by the time you’re working in 4th cousins, you’ll have too much overlap, meaning people who fall into multiple columns, so you’ll want to work with primarily 2nd and 3rd cousins. The good news is that endogamous people tend to have lots of matches, so you should still have plenty to work with!

Instructions

In this article, I’m using Dana’s method, with a few modifications.

By way of a very, very brief summary:

  • On a spreadsheet, you list all of your matches through at least third cousins
  • Then check each match to see who you match in common with them
  • Color code the results, in columns
  • Each person what you match in common with your closest cousin, Sleepy, is marked as yellow. Dopey and I both match Bashful and Jasmine in common and are colored Red. Doc and I both match Happy and Belle and are colored blue, and so forth.
  • The result is that each color represents a grandparent

To understand exactly what I’m doing, read Dana’s articles, then continue with this article.

DNA Color Clustering: The Leeds Method for Easily Visualizing Matches  
DNA Color Clustering: Identifying “In Common” Surnames 
DNA Color Clustering: Does it Work with 4th Cousins? By the way, yes it does, most of the time.
DNA Color Clustering: Dealing with 3 Types of Overlap

Why Use “The Leeds Method”?

In my case, I wanted to experiment. I wanted to see if this method works reliably and what could be done with the information if you already know a significant amount about your genealogy. And if you don’t.

The Leeds Method is a wonderful way to group people into 4 “grandparent” groups in order to search for in-common surnames. I love being able to perform this proof of concept “blind,” then knowing my genealogy and family connections well enough to be able to ascertain whether it did or didn’t work accurately.

If you can associate a match with a single grandparent, that really means you’ve pushed that match back to the great-grandparent couple.

That’s a lot of information without any genealogical knowledge in advance.

How Low Can You Go?

I have more than 1000 fourth cousins at Ancestry. This makes the task of performing the Leeds Method manually burdensome at that level. It means I would have had to type all 1000+ fourth cousins into a spreadsheet. I’m patient, but not that patient, at least not without a lot of return for the investment. I have to ask myself, exactly what would I DO with that information once they were grouped?

Would 4th cousin groupings provide me with additional information that second and third cousin groupings wouldn’t? I don’t think so, but you can be the judge.

After experimenting, I’d recommend creating a spreadsheet listing all of your 2nd and 3rd cousins, along with about 300 or so of your closest 4th cousin matches. Said another way, my results started getting somewhat unpredictable at about 40-45 cMs, although that might not hold true for others. (No, you can’t tell the longest matching segment length at Ancestry, but I could occasionally verify at the other vendors, especially when people from Ancestry have transferred.)

Therefore, I only proceeded through third cousins and about 300 of the Ancestry top 4th cousin matches.

I didn’t just utilize this methodology with Ancestry, but with Family Tree DNA, MyHeritage and 23andMe as well. I didn’t use GedMatch because those matches would probably have tested at one of the primary 4 vendors and I really didn’t want to deal with duplicate kits any more than I already had to. Furthermore, GedMatch is undergoing a transition to their Genesis platform and matching within the Genesis framework has yet to be perfected for kits other than those from these vendors.

Let’s talk about working with matches from each vendor.

Ancestry

At Ancestry, make a list of all of your second and third cousin matches, plus as many 4th cousins as you want to work with.

To begin viewing your common matches, select your first second cousin on the list and click on the green View Match. (Note that I am using my own second kit at Ancestry, RobertaV2Estes, not a cousin’s kit in these examples. The methodology is the same, so don’t fret about that.)

Then, click on Shared Matches.

Referring to your spreadsheet, assign a color to this match group and color the spreadsheet squares for this match group. Looking at my spreadsheet, my first group would be the yellow Estes group, so I color the squares for each person that I match in common with this particular cousin. On my spreadsheet, those cousins have all been assigned pseudonyms, of course.

Your shared match list will be listed in highest match order which should be approximately the same order they are listed on your spreadsheet. I use two monitors so I can display the spreadsheet on one and the Ancestry match list on the other.

Lon is shared in common with the gold person I’m comparing against (Roberta V2 Estes), and me, so his box would be colored gold on the spreadsheet. Lon’s pseudonym is Sneezy and the person beneath him on this list, not shown, would be Ariel.

Ancestry only shows in-common matches to the 4th cousin level, so you really couldn’t reach deeper if you wanted. Furthermore, I can’t see any advantage to working beyond the 4th cousin’s level, maximum. Your best matches are going to be the largest ones that reveal the most information and have the most matches, therefore allowing you to group the most people by color.

Unfortunately, Ancestry provides the total cMs and the number of segments, but not the largest matching segment.

One benefit of this methodology is that it’s fairly easy to group those pesky private matches like the last one on the master spreadsheet, Cersei, shown in red. You’ll at least know which grandparent group they match. Based on your identified ancestors of matches in the color group, you may be able to tell much more about that private match.

For example, one of my private matches is a match to someone who I share great-great-grandparents with AND they also match with two people further on up that tree on the maternal side of that couple, shown above, in red. I may never know which ancestor I share with that private match specifically, but I have a pretty darned good idea now in spite of that ugly little lock. The more identified matches, the better and more accurate this technique.

Is the Leeds Method foolproof? No.

Is this a great tool? Yes, absolutely.

Family Tree DNA

Thankfully, Family Tree DNA provides more information about my matches than Ancestry, including segment information combined with a chromosome browser and Family Matching. I often refer to Family Matching as parental bucketing, shown on your match list with the maternal and paternal tabs, because Family Tree DNA separates your matches into parental “sides” based on common segments with others on your maternal and paternal branches of your tree when you link your matches’ results.

At Family Tree DNA, sign on and then click on Matches under Family Finder.

When viewing your matches, you’ll see blue or red people icons any that are assigned to either your maternal, paternal side, or both (purple) on your match list. If you click on the tabs at the top,  you’ll see JUST the maternal, paternal or both lists.

This combination of tools allows you to confirm (and often triangulate) the match for several people. If those matches are bucketed, meaning assigned to the same parental side, and they match on the same segment, they are triangulated for all intents and purposes if the segment is above 20 cM. All of the matches I worked with for the Leeds Method were well above 20 cM, so you don’t really need to worry about false or identical by chance matches at that level.

Family Tree DNA matches are initially displayed by the total number of “Shared cM.” Click on “Longest Block” to sort in that manner. I considered people through 30 cM and above as equivalent to the Ancestry 3rd cousin category. Some of the matching became inconsistent below that threshold.

List all of your second and third cousins on the spreadsheet, along with however many 4th cousins you want to work with.

Then, select your closest second cousin by checking the box to the left of that individual, then click on “In Common With” above the display. This shows you your matches in common with this person.

On the resulting common match list, sort your matches in Longest block order, then mark the matches on your spreadsheet in the correct colored columns.

With each vendor, you may need to make new columns until you can work with enough matches to figure out which column is which color – then you can transfer them over. If you’re lucky enough to already know the family association of your closest cousins, then you already know which colored column they belong to.

All of my matches that fell into the Leeds groups were previously bucketed to maternal or paternal, so consistency between the two confirms both methodologies. Between 20 and 28 cM, three of my bucketed matches at Family Tree DNA fell into another group using the Leeds method, which is why I drew the line at 30cM.

For genealogists who already know a lot about their tree, this methodology in essence divides the maternal and paternal buckets into half. FTDNA already assigns matches maternally or paternally with Family Matching if you have any information about how your matches fit into your tree and can link any matching testers to either side of your tree at the 3rd cousin level or closer.

If you don’t know anything about your heritage, or don’t have any way to link to other family members who have tested, you’ll start from scratch with the Leeds Method. If you can link family members, Family Tree DNA already does half of the heavy lifting for you which allows you to confirm the Leeds methodology.

MyHeritage

At MyHeritage, sign in, click on DNA and sort by “largest segment,” shown at right, above. I didn’t utilize matches below 40 cM due to consistency issues. I wonder if imputation affects smaller matches more than larger matches.

You’ll see your closest matches at the top of the page. Scroll down and make a list on your spreadsheet of your second and third cousins. Return to your closest DNA match that is a second cousin and click on the purple “Review DNA Match” which will display your closest in-common matches with that person, but not necessarily in segment size order.

Scroll down to view the various matches and record on the spreadsheet in their proper column by coloring that space.

The great aspect of MyHeritage is that triangulation is built in, and you can easily see which matches triangulate, providing another layer of confirmation, assuming you know the relationship of at least some of your matches.

The message for me personally at MyHeritage is that I need to ask known cousins who are matches elsewhere to upload to MyHeritage because I can use those as a measuring stick to group matches, given that I know the cousin’s genealogy hands-down.

The great thing about MyHeritage is that they are focused on Europe, and I’m seeing European matches that aren’t anyplace else.

23andMe

At 23andMe, sign in and click on DNA Relatives under the Ancestry tab.

You’ll see your list of DNA matches. Record 2nd and third cousins on your spreadsheet, as before.

To see who you share in common with a match, click on the person’s name and color your matches on the spreadsheet in the proper column.

Unfortunately, the Leeds Method simply didn’t work well for me with my 23andMe data, or at least the results are highly suspect and I have no way of confirming accuracy.

Most of my matches fell into in the Estes category, with the Boltons overlapping almost entirely, and none in the Lore or Ferverda columns. There is one small group that I can’t identify. Without trees or surnames, genealogically, my hands are pretty much tied. I can’t really explain why this worked so poorly at 23andMe. Your experience may be different.

The lack of trees is a significant detriment at 23andMe because other than a very few matches whose genealogy I know, there’s no way to correlate or confirm accuracy. My cousins who tested at 23andMe years ago and whose tests I paid for lost interest and never signed in to re-authorize matching. Many of those tests are on the missing Ferverda side, but their usefulness is now forever lost to me.

23andMe frustrates me terribly. Their lack of commitment to and investment in the genealogical community makes working with their results much more difficult than it needs to be. I’ve pretty much given up on using 23andMe for anything except adoption searches for very close matches as a last resort, and ethnicity.

The good news is that with so many people testing elsewhere, there’s a lot of good data just waiting!

What are the Benefits?

The perception of “benefit” is probably directly connected to your goal for DNA testing and genetic genealogy.

  • For adoptees or people seeking unknown parentage or unknown grandparents, the Leeds Method is a fantastic tool, paving the way to search for common surnames within the 4 groups as opposed to one big pool.
  • For people who have been working with their genealogy for a long time, maybe not as much, but hints may lurk and you won’t know unless you do the discovery work. If you’re a long-time genealogist, you’re used to this, so it’s just a new way of digging through records – and you can do it at home!
  • For people who have tested at Family Tree DNA, the family grouping by maternal and paternal based on people linked to your tree is more accurate and groups people further down your match list because it’s actually based on triangulated matching segments. However, the Leeds Method expands on that and adds granularity by breaking those two groups into four.
  • For people who want to paint their chromosomes using DNAPainter, the Leeds Method is the first step of a wonderful opportunity if you have tested at either Family Tree DNA, MyHeritage or 23andMe.

Unfortunately, Ancestry doesn’t provide segment information, so you can’t chromosome paint from Ancestry directly, BUT, you can upload to either Family Tree DNA, MyHeritage or GedMatch and paint Ancestry matches from there. At GedMatch, their kit numbers begin with A.

What Did I Do Differently than Dana?

Instead of adding a 5th column with the first person (Sam) who was not grouped into the first 4 groups, I looked for the closest matches that I shared with Sam who were indeed in the first 4 color groups. I added Sam to that existing color group along with my shared matches with Sam that weren’t already grouped into that color so long as it was relatively consistent. If it looked too messy, meaning I found people in multiple match groups, I left it blank or set that match aside. This didn’t happen until I was working at the 4th cousin level or between 30 and 40 cM, depending on the vendor.

Please note that just because you find people that you match in common with someone does NOT MEAN that you all share a common ancestor, or the same ancestor. It’s a hint, a tip to be followed.

There were a couple of groups that I couldn’t cluster with other groups, and one match that clustered in three of the four grandparent groups. I set that one aside as an outlier. I will attempt to contact them. They don’t have a tree.

I grouped every person through third cousin matches. I started out manually adding the 4th cousins for each match, but soon gave up on that due to the sheer magnitude. I did group my closest 4th cousins, or until they began to be inaccurate or messy, meaning matching in multiple groups. Second and third cousin matching was very consistent.

Tips

  • Don’t use siblings or anyone closer than the second cousin level. First cousins share two grandparents. You only want to use matches that can be assigned to ONLY ONE GRANDPARENT.
  • In the spreadsheet cell, mark the person you used as a “match to.” In other words, which people did you use to populate that color group. You can see that I used two different people in the Estes category. I used more in the other categories too, but they are further down in my list.
  • At Family Tree DNA, you can utilize the X chromosome. Understand that if you are a male, you will not have any X matches with your paternal grandfather. I would not recommend using X matches for the Leeds Method, especially since they are not uniformly available at all vendors and form a specific unique inheritance pattern that is not the same as the other autosomes.
  • Ancestry, MyHeritage and Family Tree DNA allow you to make notes on each match. As I group these, and as I paint them with DNAPainter I made a note on each match that allows me to identify which group they are assigned to, or if they match multiple groups.
  • Look at each match to be sure they are consistent. If they aren’t, either mark them as inconclusive or omit them entirely in the painting process. I write notes on each one if there is something odd, or if I don’t paint them.

What Did I Learn?

Almost all of my (endogamous by definition) Acadian matches are more distant, which means the segments are smaller. I expected to find more in the painted group, because I have SO MANY Acadian matches, but given that my closest Acadian ancestor was my great-great-grandfather, those segments are now small enough that those matches don’t appear in the candidate group of matches for the Leeds Method. My Acadian heritage occurs in my green Lore line, and there are surprisingly few matches in that grouping large or strong enough to show up in my clustered matches. In part, that’s probably because my other set of great-great-grandparents in that line arrived in 1852 from Germany and there are very few people in the US descended from them.

I found 4th cousin matches I would have otherwise never noticed because they don’t have a tree attached. At Ancestry, I only pay attention to closer matches, Shared Ancestor Hints and people with trees. We have so many matches today that I tend to ignore the rest.

Based on the person’s surname and the color group into which they fall, it’s often possible to assign them to a probable ancestral group based on the most distant ancestors of the people they match within the color group. In some cases, the surname is another piece of evidence and may provide a Y DNA lead.

For example, one of my matches user name is XXXFervida. They do match in the Ferverda grandparent group, and Fervida is how one specific line of the family spelled the surname. Of course, I could have determined that without grouping, but you can never presume a specific connection based solely on surname, especially with a more common name. For all I know, Fervida could be a married name.

By far the majority of my matches don’t have trees or have very small trees. That “no-tree” percentage is steadily increasing at Ancestry, probably due to their advertising push for ethnicity testing. At Family Tree DNA where trees are infinitely more useful, the percentage of people WITH trees is actually rising. By and large, Family Tree DNA users tend to be the more serious genealogists.

MyHeritage launched their product more recently with DNA plus trees from the beginning, although many of the new transfers don’t have trees or have private trees. Their customers seem to be genealogically savvy and many live in Europe where MyHeritage DNA testing is focused.

23andMe is unquestionably the least useful for the Leeds Method because of their lack of support for trees, among other issues, but you may still find some gems there.

Keeping Current

Now that I invested in all of this work, how will I keep the spreadsheet current, or will I at all?

At Ancestry, I plan to periodically map all of my SAH (Shared Ancestor Hints) green leaf matches as well as all new second and third cousin matches, trees or not.

In essence, for those with DNA matches and trees with a common ancestor, Ancestry already provides Circles, so they are doing the grouping for those people. Where this falls short, of course, is matches without trees and without a common identified ancestor.

For Ancestry matches, I would be better served, I think, to utilize Ancestry matches at GedMatch instead of at Ancestry, because GedMatch provides segment information which means the matches can be confirmed and triangulated, and can be painted.

For matches outside of Ancestry, in particular at Family Tree DNA and MyHeritage I will keep the spreadsheet current at least until I manage to paint my entire set of chromosomes. That will probably be a very long time!

I may not bother with 23andMe directly, given that I have almost no ability to confirm accuracy. I will utilize 23andMe matches at GedMatch. People who transfer to GedMatch tend to be interested in genealogy.

What Else Can I Do?

At Ancestry, I can use Blaine’s new “DNA Match Labeling” tool that facilitates adding 8 colored tags to sort matches at Ancestry. Think of it as organizing your closet of matches. I could tag each of these matches to their grandparent side which would make them easy to quickly identify by this “Leeds Tag.”

My Goals

I have two primary goals:

  • Associating segments of my DNA with specific ancestors
  • Breaking down genealogical brick walls

I want to map my DNA segments to specific ancestors. I am already doing this using Family Tree DNA and MyHeritage where common ancestors are indicated in trees and by surnames. I can map these additional Leeds leads (pardon the pun) to grandparents utilizing this methodology.

To the extent I can identify paternal and maternal matches at 23andMe, I can do the same thing. I don’t have either parents’ DNA there, and few known relatives, so separating matches into maternal and paternal is more difficult. It’s not impossible but it means I can associate fewer matches with “sides” of my genealogy.

For associating segments with specific ancestors and painting my chromosomes, DNAPainter is my favorite tool.

In my next article, we’ll see how to use our Leeds Method results successfully with DNAPainter and how to interpret the results.

_____________________________________________________________________

Standard Disclosure

This standard disclosure appears at the bottom of every article in compliance with the FTC Guidelines.

I provide Personalized DNA Reports for Y and mitochondrial DNA results for people who have tested through Family Tree DNA. I provide Quick Consults for DNA questions for people who have tested with any vendor. I would welcome the opportunity to provide one of these services for you.

Hot links are provided to Family Tree DNA, where appropriate. If you wish to purchase one of their products, and you click through one of the links in an article to Family Tree DNA, or on the sidebar of this blog, I receive a small contribution if you make a purchase. Clicking through the link does not affect the price you pay. This affiliate relationship helps to keep this publication, with more than 900 articles about all aspects of genetic genealogy, free for everyone.

I do not accept sponsorship for this blog, nor do I write paid articles, nor do I accept contributions of any type from any vendor in order to review any product, etc. In fact, I pay a premium price to prevent ads from appearing on this blog.

When reviewing products, in most cases, I pay the same price and order in the same way as any other consumer. If not, I state very clearly in the article any special consideration received. In other words, you are reading my opinions as a long-time consumer and consultant in the genetic genealogy field.

I will never link to a product about which I have reservations or qualms, either about the product or about the company offering the product. I only recommend products that I use myself and bring value to the genetic genealogy community. If you wonder why there aren’t more links, that’s why and that’s my commitment to you.

Thank you for your readership, your ongoing support and for purchasing through the affiliate link if you are interested in making a purchase at Family Tree DNA, or one of the affiliate links below:

Affiliate links are limited to:

MyHeritage Step by Step Guide: How to Upload-Download DNA Files

In this Upload-Download Series, we’ll cover each major vendor:

  • How to download raw data files from the vendor
  • How to upload raw data files to the vendor, if possible
  • Other mainstream vendors where you can upload this vendor’s files

Uploading TO MyHeritage

Upload Step 1

To upload your DNA to MyHeritage, click here and then click on the purple “Start” button.

Upload Step 1 If You Already Have an Account at MyHeritage

If you already have an account, click here to sign in and then click on the DNA tab to display the “Upload DNA Data” option which displays the graphic above. Click on the purple “Start” button. This is the same process you’ll use whether it’s the first time you’ve uploaded a kit, or you’re uploading subsequent kits to your account that you’ll be managing.

Upload Step 2

You’ll be prompted to create a free account by entering your name, e-mail and password, and from there you can upload your autosomal DNA file.

You’ll be asked whose DNA you’re uploading and prompted to read and agree to the terms of service and consent.

Click the purple upload button.

Then click done when the file is finished uploading.

You’ll be notified by e-mail within a couple days when the file is finished processing.

Downloading FROM MyHeritage

Download Step 1

Sign on to your MyHeritage account.

Click on DNA on the upper toolbar.

The dropdown menu includes “Manage DNA Kits”

Download Step 2

At the right of the kit you wish to download, click on the three small buttons which will include an option for “Download,” as shown in the graphics below from the MyHeritage blog article.

Download Step 3

You’ll be presented with a box titled “Learn more about DNA data files.” Click the purple “Continue” button.

Download Step 4

You’ll need to confirm that you want to download your data, and that you understand that the download is outside of MyHeritage and their protection. Click the purple “Continue” button.

Download Step 5

You’ll receive a confirmation e-mail. Click on “Click here to continue with download.”

This e-mail link is only valid for 24 hours.

Download Step 6

Enter your password again, and click on the purple “Download” button.

Download Step 7

Save the file as a recognizable file name on your computer.

MyHeritage File Transfers TO Other Vendors

You can upload your MyHeritage file to other vendors, as follows.

From below to >>>>>>>>>>> Family Tree DNA Accepts Ancestry Accepts 23andMe Accepts GedMatch Accepts
MyHeritage Yes No No Yes

Neither Ancestry nor 23andMe accepts uploads from any vendor.

MyHeritage File Transfers FROM Other Vendors

You can upload files from other vendors to MyHeritage, as follows:

  From Family Tree DNA From Ancestry From 23andMe From LivingDNA
To MyHeritage Yes Yes Yes Yes

Testing and Transfer Strategy

Transferring to MyHeritage is always free. You can view your ethnicity, your matches and their trees, and utilize the DNA tools, but you won’t receive the full benefit of SmartMatching and other records without a subscription. You will be limited to building a tree of 250 people for free, but you can upload a Gedcom file of any size, although you do need to subscribe to change anything in that file if it contains more than 250 individuals.

Until December 1, 2018, all DNA tools will be and remain free for anyone who uploads before that date. After December 1st, matching will remain free, but the advanced tools such as ethnicity, the chromosome browser, triangulation and more will require payment. MyHeritage has not yet indicated how that will work, so upload now to receive free DNA tools forever.

My testing/transfer recommendations are as follows relative to MyHeritage:

Have fun!

_____________________________________________________________________

Standard Disclosure

This standard disclosure appears at the bottom of every article in compliance with the FTC Guidelines.

I provide Personalized DNA Reports for Y and mitochondrial DNA results for people who have tested through Family Tree DNA. I provide Quick Consults for DNA questions for people who have tested with any vendor. I would welcome the opportunity to provide one of these services for you.

Hot links are provided to Family Tree DNA, where appropriate. If you wish to purchase one of their products, and you click through one of the links in an article to Family Tree DNA, or on the sidebar of this blog, I receive a small contribution if you make a purchase. Clicking through the link does not affect the price you pay. This affiliate relationship helps to keep this publication, with more than 900 articles about all aspects of genetic genealogy, free for everyone.

I do not accept sponsorship for this blog, nor do I write paid articles, nor do I accept contributions of any type from any vendor in order to review any product, etc. In fact, I pay a premium price to prevent ads from appearing on this blog.

When reviewing products, in most cases, I pay the same price and order in the same way as any other consumer. If not, I state very clearly in the article any special consideration received. In other words, you are reading my opinions as a long-time consumer and consultant in the genetic genealogy field.

I will never link to a product about which I have reservations or qualms, either about the product or about the company offering the product. I only recommend products that I use myself and bring value to the genetic genealogy community. If you wonder why there aren’t more links, that’s why and that’s my commitment to you.

Thank you for your readership, your ongoing support and for purchasing through the affiliate link if you are interested in making a purchase at Family Tree DNA, or one of the affiliate links below:

Affiliate links are limited to:

Ancestry 2018 Ethnicity Update

When ethnicity estimates were first produced by vendors, they tended to resemble the wild west.

Today, results are becoming more refined and hopefully, more accurate as reference populations grow and become more reliable.

The Ancestry ethnicity update has been in beta for several months, but this week, Ancestry rolled out the ethnicity update for everyone.

Checking Your New Results

To see your updated results, sign on and click on the DNA Story to the left with Ethnicity Estimates.

Ancestry then explains that while your DNA doesn’t change, the estimates (pay attention to that word) do as the science improves.

Ethnicity Estimate Aren’t Precise

I’ve said this before, and I want to say it again. Ethnicity is the least precise and the least accurate of DNA tools for genetic genealogy. Ethnicity estimates are the most accurate at a continental level. Within continents, like Europe, Asia and Africa, there has been a lot of population movement and intermixing over time making the term “ethnicity” almost meaningless.

I know, I know – ethnicity estimates are also the simplest because there isn’t much learning curve and they’re easy to understand at a glance. This deceptive “ease of use” also makes them interesting to people who have only a passing curiosity. That’s why they attract so many test takers who either love of hate their results, but never fully understand the true message or utilize any other genetic genealogy tools.

Let’s take a look at how ethnicity estimates have changed over time and if they have improved with the latest version.

Ethnicity Estimate Changes

In my case, my original Ancestry ethnicity estimate in 2012 was:

  • British Isles 80%
  • Scandinavia 12%
  • Uncertain 8%

To say it was really bad is an understatement.

In 2013, Ancestry introduced their ethnicity V2 version which provided a lot more granularity.

Version 2 was dramatically different, with the British Isles moving from 80% to a total of 6%. Like a pendulum swinging, neither was accurate.

Ancestry introduced new features and combined their Genetic Communities with their ethnicity estimates in 2017.

In this new 2018 version, Ancestry has divided and recombined the British Isles and Western Europe differently and the resulting differences are significant.

My mystery Scandinavian is entirely gone now, but sadly, so is my Native American.

The New Results

I just got really boring – but the question is whether or not the new results are more accurate as compared to my proven genealogy. Boring doesn’t matter. Accuracy does.

Various Ancestry Ethnicity Versions Compared to Proven Genealogy

I created a chart that reflects the three Ancestry ethnicity versions as compared to my proven genealogy.

For the current version, I also included the ranges as provided by Ancestry.

As you can see, generally, the results are much more accurate, but the regions are also fairly broad which makes accuracy easier to achieve.

Until this current version, Ancestry didn’t show any Germanic, but now the Germanic estimate is exact at 25%.  The Germanic range is also very tight at 24-26%, right where it should be.

The England, Wales & Northeast Europe category is somewhat high, but that could be accurate because I do have some ancestry that is unknown.

Unfortunately, my Native is proven, both through Y and mtDNA and by triangulating the Native segments to others descending from the same Native ancestors. That portion is now missing in my Ancestry ethnicity.

Ancestry V1 Test Versus the V2 Test

For the record, I’m using my Ancestry V1 test because I’ve used that test version for all previous ethnicity comparisons.  My Ancestry V2 test ethnicity results are approximately the same, as follows:

  • England, Wales and Northeast Europe – 76%
  • Germanic – 22%
  • Ireland and Scotland – 2%

The same tree is attached to both tests.

On my V2 test, which I seldom use, I had to answer a couple of question regarding my expectations about ethnicity testing changes and how accurate my previous results were perceived to be before I could access my updated results.

Regions Changed

In Ancestry’s FAQ, they provided this list of how the regions were and are defined.

Previous Region New Regions
Scandinavia Norway, Sweden
Iberian Peninsula Spain, Portugal, Basque
Europe South Italy, Greece and the Balkans, Sardinia
Europe East Baltic States, Eastern Europe and Russia
Caucasus Turkey and the Caucasus, Iran/Persia
Europe West Germanic Europe, France
Native American Native America—North, Central, South; Native America—Andean
Asia South Southern Asia, Western and Central India, Balochistan, Burusho
Asia East Japan, Korea and Northern China, China, Southeast Asia—Dai (Tai), Southeast Asia—Vietnam, Philippines

Ancestry has addressed lots of other questions in their FAQ as well, and I suggest taking a look. I particularly like their comment, “Some places are complicated.” Indeed, that’s true with population churn both in historical times along with unknown pre-history and that complexity is exactly what makes intra-continental ethnicity estimates so difficult. Of course, people whose ancestors are from Europe, for example, want as much granularity as possible.

Previous Ethnicity Versions

For the first time, Ancestry explains what happened between versions, at least at a high level.

Click on the little “i” in the upper right hand corner of your ethnicity estimate box.

You’ll see more information.

Click on “View Previous Estimate” at the bottom.

Your previous ethnicity estimate is shown.

To see how your estimate changed, click on “Compare these results to your most recent Ancestry DNA estimate.”

This display shows you the differences compared to the previous version. In my case, England, Wales and NE Europe increased by 69%, but that’s because Ancestry redefined the regions. Note the little slide box underneath the regions on the map. You can slide back and forth from previous to current (update.).

I do wish Ancestry had told us where the “Scandinavian” went, what category it fell into. Are those segments, as a group, included in another region? Was the previous estimate simply flat out wrong? Was Scandinavian a vestige of Vikings who invaded much of Europe? What happened?

New Regions and Reference Samples

By clicking on “See other regions tested” at the bottom of your Ethnicity Estimate box, you can view the locations of Ancestry’s current reference populations.

The regions tested in which you have results are colored, and the regions where you aren’t showing results are shades of grey. This is an improvement over the previous version which people routinely misinterpreted to mean that they had results in those tested regions.

Best Features

In my opinion, the best feature of the combined ethnicity and Genetic Communities is the combined mapping. For example, the screenshot below combines the ethnicity regions with the ancestors from my tree who immigrated from that region in that timeframe.

By clicking on the 1700 box, the people from that time period in my tree are displayed. I can enlarge the map to make the display larger, until finally individual “people” icons are displayed, as shown with Johann Peter Koehler, below. Clicking on the individual person pin shows that individual in the box at right.

By clicking on the “Lower Midwest and Virginia Settlers,” I see this region and Ancestry tells me where those settlers likely originated.

You can then scroll down to the bottom of the information box where you see “Ancestry DNA Members.”

Click on the 1000+ link and you will then see the people who match you in a specific region or migration.

It’s worth noting that this isn’t always accurate. My 2nd cousin match is showing as a “Lower Midwest and Virginia” match and our ancestors came from the Netherlands directly to Northern Indiana. Ironically, she shows up in three of the 4 regions I can select from. This feature is not 100%, but it’s still nice to be able to see where that match is grouped in terms of ethnicity and Genetic Communities, according to Ancestry.

Given this combined functionality, I do wonder if Ancestry’s new ethnicity isn’t simply population genetics, but a combination of population genetics, ancestors in my tree, my matches and corresponding DNA Circles with their associated history. If so, that would make sense, both in terms of what I’m seeing as my new ethnicity results and the map functionality as well. Could that be where my Germanic came from, and why it’s so precise at 25% which matches by tree exactly?

In Summary

For me, Ancestry’s ethnicity estimates are significantly improved with the exception that my Native disappeared. I’ve worked long and hard on the Native aspect of my genealogy, and I know that part of my ethnicity mix is valid. However, that is a very small percentage overall (about 2%), and the combined improvements certainly outshine that one negative.

Of course, your mileage may vary. What are you seeing in terms of your new ethnicity estimates as compared to your known genealogy? Better? Worse? Did you lose any categories that you know are valid? What about small amounts of minority heritage?

Ancestor Birthdays Mean Presents for YOU!

I’ve been wanting to celebrate my ancestors’ birthdays for some time now, and I’ve finally figured out exactly how to accomplish this goal in a really fun way.

Being reminded once a year about their birthday and the anniversary of their death reminds me to work on their genealogy, and in particular, genetic genealogy. With more people testing every single day, meaning different people at every vendor, we need to check often with specific ancestors in mind. You never know who’s going to be the person who puts the chink in that brick wall.

With this in mind, I’ve put together a spreadsheet to track what I know about each ancestor. This makes it easy to schedule those dates in my calendar, with a reminder of course, and then to check my spreadsheet to see what information might have been previously missing that might be able to be found today.

It’s like a birthday present for them, but now for me. I am, after all, their heir, along with the rest of their descendants of course! If I’m lucky, I inherited part of their DNA, and if not, their DNA is still relevant to me.

Checking the List

Here’s my spreadsheet checklist for each ancestor:

  • Birth date
  • Birth place
  • Death date
  • Death place
  • Spouse
  • Y DNA haplogroup (if male)
  • Mitochondrial DNA haplogroup
  • Autosomal confirmed
  • Ancestry Circle

New information becomes digitized every year making new information available.

Additionally, some items may change. For example, if a base haplogroup was previous known, a deeper haplogroup might be available a year later if someone has taken a more detailed test or the haplogroup name might have been updated. Yes, that happens too.

I originally had a triangulation column on the spreadsheet too, but I pretty quickly discovered that column was subject to lots of questions about interpretation. Is the actual ancestor triangulated, or the line? I decided that “autosomal confirmed” would suffice to cover whatever I decide constitutes confirmation and a comment column could hold the description. For example, my grandparents are autosomal confirmed because I match (and triangulate) with cousins who are descended from ancestors upstream of my grandparents. If my grandparent wasn’t my grandparent, I wouldn’t be related to those people either. In particular, first cousins.

I also added an “Article Link” column to paste the link to that ancestor’s 52 Ancestors article so I can quickly check or maybe even provide this spreadsheet to a family member.

Here’s an example of what the first several entries of my Ancestor Birthday Spreadsheet look like.

Ancestor Birthday Presents for You

In order to remind myself to check on my ancestors’ status, on their birth and death days, I schedule reminders in my phone calendar. Every morning when I wake, I’m greeted by my ancestor – well – at least this much of them.

  • First, I check at Family Tree DNA for new matches, haplogroups and the presence of my family lines in surname projects.
  • Then it’s off to Ancestry to see if I have any new green leaf DNA or record hints, to add or update the circle for this particular ancestor, and to see if any of my matches would be a candidate for either Y or mitochondrial DNA testing, assuming they reply to messages and agree to test at Family Tree DNA. I keep a separate spreadsheet of each person that I’ve identified as a match with an identified ancestor. I know it’s extra work, but that spreadsheet is invaluable for determining if the ancestor is autosomal proven and if the match is a candidate for Y or mtDNA testing.
  • Then I get another cup of coffee and check at MyHeritage for new record matches for that ancestor, along with new DNA SmartMatches.
  • GedMatch and 23andMe aren’t as easy to check for matches specific to ancestors, but I still check both places to see if I can find matches that I can identify as descending from that ancestor.
  • While I’m at it, sometimes I run over to FamilySearch to see if there’s anything new over there, although they don’t deal with DNA. They do, however, have many traditional genealogical records. I may add another column to track if I’m waiting for something specific to be digitized – like court minutes, for example. FamilySearch has been on a digitization binge!
  • As I go along, I add any new discovery to my genealogy software and my Ancestor Birthday Spreadsheet as well.
  • Last, I paint new segment information from Family Tree DNA, MyHeritage, GedMatch or 23andMe at DNAPainter. My three articles about how I use DNAPainter are here, here and here.

I just love ancestor birthdays.

Any day that I get to find something new is a wonderful day indeed – fleshing out the lives, history and DNA of my ancestors. With this many places to look, there’s seldom a day that goes by that I don’t discover at least something in my ancestor scavenger hunt!

Ancestor birthday presents for me😊

_____________________________________________________________________

Standard Disclosure

This standard disclosure appears at the bottom of every article in compliance with the FTC Guidelines.

I provide Personalized DNA Reports for Y and mitochondrial DNA results for people who have tested through Family Tree DNA. I provide Quick Consults for DNA questions for people who have tested with any vendor. I would welcome the opportunity to provide one of these services for you.

Hot links are provided to Family Tree DNA, where appropriate. If you wish to purchase one of their products, and you click through one of the links in an article to Family Tree DNA, or on the sidebar of this blog, I receive a small contribution if you make a purchase. Clicking through the link does not affect the price you pay. This affiliate relationship helps to keep this publication, with more than 900 articles about all aspects of genetic genealogy, free for everyone.

I do not accept sponsorship for this blog, nor do I write paid articles, nor do I accept contributions of any type from any vendor in order to review any product, etc. In fact, I pay a premium price to prevent ads from appearing on this blog.

When reviewing products, in most cases, I pay the same price and order in the same way as any other consumer. If not, I state very clearly in the article any special consideration received. In other words, you are reading my opinions as a long-time consumer and consultant in the genetic genealogy field.

I will never link to a product about which I have reservations or qualms, either about the product or about the company offering the product. I only recommend products that I use myself and bring value to the genetic genealogy community. If you wonder why there aren’t more links, that’s why and that’s my commitment to you.

Thank you for your readership, your ongoing support and for purchasing through the affiliate link if you are interested in making a purchase at Family Tree DNA, or one of the affiliate links below:

Affiliate links are limited to:

Family Tree DNA Step by Step Guide: How to Upload-Download DNA Files

In this Upload-Download Series, we’ll cover each major vendor:

  • How to download raw data files from the vendor
  • How to upload raw data files to the vendor
  • Other mainstream vendors where you can upload this vendor’s files

You can both upload autosomal data files from another vendor to Family Tree DNA, and download from Family Tree DNA.

Uploading TO Family Tree DNA

Step 1

On the main Family Tree DNA page, click on “Upload DNA Data,” at the top.

Step 2

Select either “Autosomal DNA” or Nat Geo’s Geno 2.0 data. Hint, the current version of Genographic data won’t work. The current version is processed on the Helix chip as of November 2016.

Step 3

If you are not transferring from Nat Geo, go to Step 4.

If you are transferring a Nat Geo 2.0 kit, proceed to enter your National Geographic Kit Number. If you don’t know your kit number, click on “Where do I find my Geno ID Code.”

Step 4

These instructions in Step 4 focus on uploading your autosomal DNA file from other vendors, not Genographic 2.0.

Complete the form. If you already have tested Y or mitochondrial DNA at Family Tree DNA, select “Already have a Family Tree DNA account,” so that your uploaded test can be integrated with your existing account. If you have already taken the Family Finder test at Family Tree DNA, there’s no need to upload your autosomal DNA from any other vendor.

Step 5

Next, select the vendor test that you are going to upload. Uploads accepted include:

  • 23andMe V3 and V4 – tests taken after December 2010 and before August 2017. The V5 chip, in use at 23andMe since August 2017 is not compatible.
  • Ancestry V1 – tests taken until May 2016.
  • Ancestry V2 – tests taken since May 2016
  • MyHeritage – fully compatible, Family Tree DNA is the lab that does their testing

If you select “MyHeritage,” you will be redirected to MyHeritage to log in and select your kit for transfer.

If you select either 23andMe or Ancestry DNA, you will be directed to either drag and drop your data file from that vendor or browse to upload.

Family Tree DNA provides a step by step guide for accessing your raw data files from those vendors by clicking on “How to I access my raw data files?,” above the grey transfer box.

You will be assigned a Family Tree DNA kit number. After your results are processed, you can sign in to see your matches.

Compatibility

The 23andMe V3, V4 and the Ancestry V1 kits are fully compatible, meaning that you will get the same matches at Family Tree DNA using those transfer kits as you would if you tested at Family Tree DNA. However, the Ancestry V2 kit is only partially compatible, meaning that you will only receive 20-25% of the matches by transferring a V2 test that you would receive if you tested at Family Tree DNA. Ancestry changed their chip in May of 2016.

If you want all of your possible matches, and who doesn’t, you should test at Family Tree DNA instead of uploading a V2 Ancestry test.

Step 6

The transfer to Family Tree DNA is free and so is viewing your matches along with basic tools. However, for additional tools, such as ethnicity and the chromosome browser, you’ll need to purchase the $19 unlock. This is a great value, as compared to retesting at $79, or sometimes less when on sale.

You will be prompted for the “Unlock” purchase if you click on either the Chromosome Browser button, the Ethnicity button or other advanced tools on your dashboard after your results are finished.

Downloading FROM Family Tree DNA

Step 1

To download your raw autosomal DNA file from Family Tree DNA, click on the orange “Download Raw Data” link at the bottom of your Family Finder section on your dashboard.

Alternatively, you can select the “Download Raw Data” option at the top of the page under myDNA, “Family Finder.”

Step 2

You will be given the option of downloading your Build 36 and Build 37 raw data files.

Different vendors request different types of files.

  • GedMatch – Build 36 Raw Data Concatenated
  • MyHeritage – Build 37 Raw Data Concatenated

Other vendors may request different file formats, and the above vendors may change over time.

Click the arrow beside the version you need.

Step 3

Save the file in a manner that you’ll recognize. The file name will be something like “37_R_Estes_Chrom_Autoso_20180818.gz”. I append the word FTDNA in front of the 37 so there is no question which vendor’s file this is. The last several digits are the date.

Family Tree DNA File Transfers TO Other Vendors

You can upload Family Tree DNA results to other vendors, as follows:

From below to >>>>>>>>>>> Ancestry Accepts MyHeritage Accepts 23andMe Accepts GedMatch Accepts
Family Tree DNA No Yes No Yes

Neither Ancestry nor 23andMe accept uploads from any vendor.

Family Tree DNA Transfers FROM Other Vendors

You can upload files from other vendors to Family Tree DNA, as follows:

From Ancestry  From MyHeritage  From 23andMe  From Living DNA
Family Tree DNA Accepts Yes Yes V3, V4 No

Testing and Transfer Strategy

Transferring to Family Tree DNA is always free, and you can see your matches. In order to view your ethnicity and use advanced tools like the chromosome browser, you’ll need to purchase the $19 unlock.

Remember that while Ancestry and MyHeritage both have records subscriptions to sell you, Family Tree DNA doesn’t. For Tier 1 tools, GedMatch requires a subscription. At Family Tree DNA, you pay a one time fee to unlock all of their tools. Every company needs to be profitable to stay in business and develop new tools, and each company has a different product pricing model.

My testing/transfer recommendations are as follows relative to Family Tree DNA:

An Ancestry V1 test is entirely compatible at Family Tree DNA, but with a V2 test, because the testing platform that Ancestry uses is only about 20-25% compatible with the Family Tree DNA test, you’ll only receive your closest 20-25% matches. Family Tree DNA can’t match on those smaller segments if you don’t test on a compatible platform, so please do. I wrote a step-by-step guide about how to download from Ancestry here, including how to tell if you have a V1 or V2 format test.

    • If you have Ancestry V2 results, you can transfer to MyHeritage and GedMatch but I recommend retesting at Family Tree DNA. The cost difference at Family Tree DNA between the $19 unlock and a new Family Finder test is $60, for a total of $79 when the tests aren’t on sale. When they are on sale, it’s less. You never know which match is going to break down that brick wall, and it would be a shame to miss it because you transferred rather than retested.
    • If you test at Family Tree DNA, transfer your results to MyHeritage for free. Matching and ethnicity is free with a transfer to MyHeritage, but you won’t receive the full potential benefit of tree matching with other testers without a subscription.
    • If you test at MyHeritage, transfer your results to Family Tree DNA for free.
    • If you test at 23andMe and have the V3 or V4 test, transfer to both Family Tree DNA and MyHeritage. If you have the 23andMe V5 test, retest at Family Tree DNA and transfer those results to MyHeritage and GedMatch.

Have fun!

_____________________________________________________________________

Standard Disclosure

This standard disclosure appears at the bottom of every article in compliance with the FTC Guidelines.

I provide Personalized DNA Reports for Y and mitochondrial DNA results for people who have tested through Family Tree DNA. I provide Quick Consults for DNA questions for people who have tested with any vendor. I would welcome the opportunity to provide one of these services for you.

Hot links are provided to Family Tree DNA, where appropriate. If you wish to purchase one of their products, and you click through one of the links in an article to Family Tree DNA, or on the sidebar of this blog, I receive a small contribution if you make a purchase. Clicking through the link does not affect the price you pay. This affiliate relationship helps to keep this publication, with more than 900 articles about all aspects of genetic genealogy, free for everyone.

I do not accept sponsorship for this blog, nor do I write paid articles, nor do I accept contributions of any type from any vendor in order to review any product, etc. In fact, I pay a premium price to prevent ads from appearing on this blog.

When reviewing products, in most cases, I pay the same price and order in the same way as any other consumer. If not, I state very clearly in the article any special consideration received. In other words, you are reading my opinions as a long-time consumer and consultant in the genetic genealogy field.

I will never link to a product about which I have reservations or qualms, either about the product or about the company offering the product. I only recommend products that I use myself and bring value to the genetic genealogy community. If you wonder why there aren’t more links, that’s why and that’s my commitment to you.

Thank you for your readership, your ongoing support and for purchasing through the affiliate link if you are interested in making a purchase at Family Tree DNA, or one of the affiliate links below:

Affiliate links are limited to:

Ancestry Step by Step Guide: How to Upload-Download DNA Files

In this Upload-Download Series, we’ll cover each major vendor:

  • How to download raw data files from the vendor
  • How to upload raw data files to the vendor, if possible
  • Other mainstream vendors where you can upload this vendor’s files

Uploading TO Ancestry

This part is easy with Ancestry, because Ancestry doesn’t accept any other vendor’s files. There is no ability to upload TO Ancestry. You have to test with Ancestry if you want results from Ancestry.

Downloading FROM Ancestry

In order to transfer your autosomal DNA file to another testing vendor, or GedMatch, for either matching or ethnicity, you’ll need to first download the file from Ancestry.

Step 1

Sign in to your account at Ancestry and click on the DNA Results Summary link.

Step 2

Click on the Settings gear, at the far upper right hand corner of the summary page, just beneath your Ancestry user ID.

Step 3

Click on the link for “Download Raw DNA Data.”

Step 4

Enter your password and click on “I Understand,” after reading of course.

At that point, the confirm button turns orange – click there.

Step 5

Ancestry will send an e-mail to the e-mail address where you are registered with Ancestry. Check your inbox for that e-mail.

Waiting…waiting.

Still waiting…

If the e-mail doesn’t arrive shortly, check your spam folder. If you’ve changed e-mail addresses, check to be sure your new one is registered with Ancestry. That’s on the same Settings page. If all else fails, request the e-mail again.

Step 6

Ahhh, it’s finally here.

Click on the green “Confirm Data Download” and do not close the window.

Step 7

Next, click on the green “Download DNA Raw Data.”

You’ll see the following confirmation screen.

Step 8

At the bottom of the page, above, if you’re on a PC, you’ll see the typical file download box that asks you if you want to open or save. Save the file as a name you can find later when you want to upload to another site.

The file name will be “dna-data-2018-07-31” where the date is the date you downloaded the file. I would suggest adding the word Ancestry to the front when you save the file on your system.

Most vendors want an unopened zip file, so if you want to open your file, first copy it to another name. Otherwise, you’ll have to download again.

That’s it, you’re done!

Ancestry File Transfers to Other Vendors

Ancestry testing falls into two different categories. V1 tests taken before May of 2016 and V2 tests taken after May 2016. Tests processed during May 2016 could be either version.

The difference between V1 and V2 files is that Ancestry changed the chips they use to test and different DNA positions are tested, resulting in a file of a different format.

If you don’t remember when you tested, make a copy of your Ancestry file using a different name, like, “Opened Ancestry file 7-31-2018.” Then just click to open the zip file.

The first four rows of the file will say something like this:

#AncestryDNA raw data download
#This file was generated by AncestryDNA at: 08/11/2017 07:23:49 UTC
#Data was collected using AncestryDNA array version: V1.0
#Data is formatted using AncestryDNA converter version: V1.0

This is a version 1 (V1) file.

A version 2 file will say V2.0.

Your upload results to other vendors’ sites will vary in terms of both matching and ethnicity accuracy based on your Ancestry version number, as follows:

From below to >>>>>>>>>>> Family Tree DNA Accepts ** MyHeritage Accepts*** 23andMe Accepts* GedMatch Accepts ****
Ancestry before May 2016 (V1) Yes, fully compatible Yes, fully compatible No Yes
Ancestry after May 2016 (V2) Yes, partly compatible Yes, fully compatible No Yes

*Note that 23andMe earlier in 2018 allowed a one-time transfer from Ancestry, but people who transferred results did not receive matches from 23andMe.

**Note that the transfer to Family Tree DNA and matching is free, but advanced tools including the chromosome browser and ethnicity require a $19 unlock fee. That fee is less expensive than retesting, but V2 customers should consider retesting to obtain fully compatible matching and ethnicity results. V2 tests typically receive only the closest 20-25% of matches they would receive if they tested directly at Family Tree DNA.

***MyHeritage utilizes a technique known as imputation to achieve compatibility between different vendors files. The transfer and tools are free, but without a subscription you can’t fully utilize all of the MyHeritage benefits available.

****I’m not sure exactly how GedMatch compensates for the V1 versus V2 differences, but they can handle both data file types. Most people don’t take both tests, but I was conducting an experiment and have uploaded both V1 and V2 tests.

A quick survey of GedMatch matches to my Ancestry V1 and Ancestry V2 kits shows that of my first 249 (125 V2, 124 V1) matches, I have 3 V1 tests that don’t have a corresponding match to a person on the V2 kit, and 5 V2 kits that don’t have a corresponding V1 kit match. That’s roughly a 6% nonmatch rate between Ancestry V1 and V2 kits. I would presume that as the genealogical and genetic distance increases with more distant matches, so would the percentage of non-matches because the segment size is smaller with more distant matches, so there is less matching DNA to have the opportunity to match in the first place.

Testing and Transfer Strategy

My recommendation, if you test at Ancestry, is to transfer your V1 results to MyHeritage, Family Tree DNA and GedMatch.

An Ancestry V1 test is entirely compatible at Family Tree DNA, but with a V2 test, because the testing platform that Ancestry uses is only about 20-25% compatible with the Family Tree DNA test, you’ll only receive your closest 20-25% matches. Family Tree DNA can’t match on those smaller segments if you don’t test on a compatible platform, so please do.

If you have Ancestry V2 results, transfer to MyHeritage and GedMatch but retest at Family Tree DNA. The cost difference at Family Tree DNA between the $19 unlock and a new Family Finder test is $60, for a total of $79 when the tests aren’t on sale. When they are on sale, it’s less. Right now, the tests are only $59.

You never know which match is going to break down that brick wall, and it would be a shame to miss it because you transferred rather than retested.

Matching and ethnicity is free with a transfer to MyHeritage, but you won’t receive the full potential benefit of SmartMatching without a subscription, as free trees are limited to 250 people and genealogical records aren’t included without a subscription. My subscription has been well worth the $.

_____________________________________________________________________

Standard Disclosure

This standard disclosure appears at the bottom of every article in compliance with the FTC Guidelines.

I provide Personalized DNA Reports for Y and mitochondrial DNA results for people who have tested through Family Tree DNA. I provide Quick Consults for DNA questions for people who have tested with any vendor. I would welcome the opportunity to provide one of these services for you.

Hot links are provided to Family Tree DNA, where appropriate. If you wish to purchase one of their products, and you click through one of the links in an article to Family Tree DNA, or on the sidebar of this blog, I receive a small contribution if you make a purchase. Clicking through the link does not affect the price you pay. This affiliate relationship helps to keep this publication, with more than 900 articles about all aspects of genetic genealogy, free for everyone.

I do not accept sponsorship for this blog, nor do I write paid articles, nor do I accept contributions of any type from any vendor in order to review any product, etc. In fact, I pay a premium price to prevent ads from appearing on this blog.

When reviewing products, in most cases, I pay the same price and order in the same way as any other consumer. If not, I state very clearly in the article any special consideration received. In other words, you are reading my opinions as a long-time consumer and consultant in the genetic genealogy field.

I will never link to a product about which I have reservations or qualms, either about the product or about the company offering the product. I only recommend products that I use myself and bring value to the genetic genealogy community. If you wonder why there aren’t more links, that’s why and that’s my commitment to you.

Thank you for your readership, your ongoing support and for purchasing through the affiliate link if you are interested in making a purchase at Family Tree DNA, or one of the affiliate links below:

Affiliate links are limited to:

Concepts: Anonymized Versus Pseudonymized Data and Your Genetic Privacy

Until recently, when people (often relatives) expressed concerns about DNA testing, genetic genealogy buffs would explain that the tester could remain anonymous, and that their test could be registered under another name; ours, for example.

This means, of course, that since our relative is testing for OUR genealogy addiction, er…hobby, that we would take care of those pesky inquiries and everything else. Not only would they not be bothered, but their identity would never be known to anyone other than us.

Let’s dissect that statement, because in some cases, it’s still partially true – but in other cases, anonymity in DNA testing is no longer possible.

You certainly CAN put your name on someone else’s kit and manage their account for them. There are a variety of ways to accomplish this, depending on the testing vendor you select.

If the DNA testing is either Y or mitochondrial DNA, it’s extremely UNLIKELY, if not impossible, that their Y or mitochondrial DNA is going to uniquely identify them as an individual.

Y and mitochondrial DNA is extremely useful in identifying someone as having descended from an ancestor, or not, but it (probably) won’t identify the tester’s identity to any matching person – at least not without additional information.

If you need a brush-up on the different kinds of DNA and how they can be used for genealogy, please read 4 Kinds of DNA for Genetic Genealogy.

Y and mitochondrial DNA can be used to rule in or rule out specific descendant relationships. In other words, you can unquestionably tell for sure that you are NOT related through a specific line. Conversely, you can sometimes confirm that you are most likely related to someone you match through the direct Y (patrilineal) line for males, and matrilineal mitochondrial line for both males and females. That match could be very distant in time, meaning many generations – even hundreds or thousands of years ago.

However, autosomal DNA, which tests a subset of all of your DNA for the genealogical goal of matching to cousins and confirming ancestors is another matter entirely. Some of the information you discern from autosomal testing includes how closely you match, which effectively predicts a range of relationships to your match.

These matches are much more recent in time and do not reach back into the distant past. The more closely you are related, the more DNA you share, which means that your DNA is identifying your location in the family tree, regardless of the name you put on the test itself.

Now, let’s look at the difference between anonymization and pseudonymization.

It may seem trivial, but it isn’t.

Anonymization vs Pseudonymization

Recently, as a result of the European Union GDPR (General Data Protection Regulation,) we’ve heard a lot about privacy and pseudonymization, which is not the same as anonymized data.

Anonymized data must be entirely stripped of any identifiable information, making it impossible to derive insights on a discreet individual, even by the person or entity who performed the anonymization. In other words, anonymization cannot be reversed under any circumstances.

Given that the purpose of genetic genealogy conflicts with the concept of anonymization, the term pseudonymization is more properly applied to the situation where someone masks or replaces the name of the tester with the goal of hiding the identity of the person who is actually taking the test.

Pseudonymization under GDPR (Article 4(5)) is defined as “the processing of personal data in such a way that the data can no longer be attributed to a specific data subject without the use of ‘additional information.’”

In reality, pseudonymization is what has been occurring all along, because the tester could always be re-identified by you.

However, and this important, neither anonymization or pseudonymization can be guaranteed to disguise your identity anymore.

Anonymous Isn’t Anonymous Anymore

The situation with autosomal DNA and the expectation of anonymity has changed rather gradually over the past few years, but with tidal wave force recently with the coming-of-age of two related techniques:

  • The increasingly routine identification of biological parents
  • The Buckskin Girl and Golden State Killer cases in which a victim and suspect were identified in April 2018, respectively, by the same methodology used to identify biological parents

Therefore, with autosomal DNA results, meaning the raw data results file ONLY, neither total anonymity or any expectation of pseudonymization is reasonable or possible.

Why?

The reason is very simple.

The size of the data bases of the combined mainstream vendors has reached the point where it’s unusual, at least for US testers, to not have a reasonably close match with a relative that you did not personally test – meaning third cousin or closer. Using a variety of tools, including in-common-with matches and trees, it’s possible to discern or narrow down candidates to be either a biological parent, a crime victim or a suspect.

In essence, the only real difference between genetic genealogy searching, parent searches and victim/suspect searches is motivation. The underlying technique is exactly the same with only a few details that differ based on the goal.

You can read about the process used to identify the Golden State Killer here, and just a few days later, a second case, the Cook/Van Cuylenborg double homicide cold case in Snohomish County, Washington was solved utilizing the following family tree of the suspect whose DNA was utilized and matched the blue and pink cousins.

Provided by the Snohomish County Sheriff

A genealogist discovering those same matches, of course, would be focused on the common ancestors, not contemporary people or generations.

To identify present day individuals, meaning parents, victims or suspects, the researcher identifies the common ancestor and works their way forward in time. The genealogist, on the other hands, is focused on working backwards in time.

All three types of processes, genealogical, parent identification and law enforcement depend on identifying cousins that lead us to common ancestors.

At that point, the only question is whether we continue working backwards (genealogically) or begin working forwards in time from the common ancestors for either parent identification or law enforcement.

Given that the suspect’s or victim’s name or identifying information is not known, their DNA alone, in combination with the DNA of their matches can identify them uniquely (unless they are an identical twin,) or closely enough that targeted testing or non-genetic information will confirm the identification.

Sometimes, people newly testing discover that a parent, sibling or half sibling genetic match is just waiting for them and absolutely no analysis is necessary. You can read about the discovery of the identity of my brother’s biological family here and here.

Therefore, we cannot represent to Uncle Henry, especially when discussing autosomal DNA testing, that he can test and remain anonymous. He can’t. If there is a family secret, known or unknown to Uncle Henry, it’s likely to be exposed utilizing autosomal DNA and may be exposed utilizing either Y or mitochondrial DNA testing.

For the genealogist, this may cause Pavlovian drooling, but Uncle Henry may not be nearly so enthralled.

In Summary

Genealogical methods developed to identify currently living individuals has obsoleted the concept of genetic anonymity. You can see in the pedigree chart example below how the same match, in yellow, can lead to solving any of the three different scenarios we’ve discussed.

Click to enlarge any graphic

If the tester is Uncle Henry, you might discover that his parents weren’t his parents. You also might discover who his real parents were, when your intention was only to confirm your common great-grandparents. So much for that idea.

A match between Henry and a second cousin, in our example above, can also identify someone involved in a law enforcement situation – although today those very few and far between. Testing for law enforcement purposes is prohibited according to the terms and conditions of all 4 major testing vendors; Ancestry, 23andMe, Family Tree DNA and MyHeritage.

Currently law enforcement kits to identify either victims or suspects can be uploaded at GedMatch but only for violent crimes identified as either homicide or sexual assault, per their terms and conditions.

Furthermore, both 23andMe and Ancestry who previously reserved the right to anonymize your genetic information and sell or otherwise utilize that information in aggregated format no longer can do so under the new GDPR legislation without your specific consent. GDPR, while a huge pain in the behind for other reasons has returned the control of the consumer’s DNA to the consumer in these cases.

The loss of anonymity is the inevitable result of this industry maturing. That’s good news for genetic genealogy. It means we now have lots of matches – sometimes more than we can keep up with!

Because of those matches, we know that if we test our DNA, or that of a family member, our DNA plus the common DNA shared with many of our relatives is enough to identify us, or them. That’s not news to genealogists, but it might be to Uncle Henry, so don’t tell him that he can be anonymous anymore.

You can pseudonymize accounts to some extent by masking Uncle Henry’s name or using your name. Managing accounts for the same reasons of convenience that you always did is just fine! We just need to explain the current privacy situation to Uncle Henry when asking permission to test or to upload his raw data file to GedMatch (or anyplace else,) because ultimately, Uncle Henry’s DNA leads to Uncle Henry, no matter whose name is on the account.

_____________________________________________________________________

Standard Disclosure

This standard disclosure appears at the bottom of every article in compliance with the FTC Guidelines.

Hot links are provided to Family Tree DNA, where appropriate. If you wish to purchase one of their products, and you click through one of the links in an article to Family Tree DNA, or on the sidebar of this blog, I receive a small contribution if you make a purchase. Clicking through the link does not affect the price you pay. This affiliate relationship helps to keep this publication, with more than 900 articles about all aspects of genetic genealogy, free for everyone.

I do not accept sponsorship for this blog, nor do I write paid articles, nor do I accept contributions of any type from any vendor in order to review any product, etc. In fact, I pay a premium price to prevent ads from appearing on this blog.

When reviewing products, in most cases, I pay the same price and order in the same way as any other consumer. If not, I state very clearly in the article any special consideration received. In other words, you are reading my opinions as a long-time consumer and consultant in the genetic genealogy field.

I will never link to a product about which I have reservations or qualms, either about the product or about the company offering the product. I only recommend products that I use myself and bring value to the genetic genealogy community. If you wonder why there aren’t more links, that’s why and that’s my commitment to you.

Thank you for your readership, your ongoing support and for purchasing through the affiliate link if you are interested in making a purchase at Family Tree DNA, or one of the affiliate links below:

Affiliate links are limited to:

DNAPainter – Mining Vendor Matches to Paint Your Chromosomes

This isn’t quite the same as when my mother used to talk about painting the town, but in genetic genealogy terms, it’s better.

This is the second of 4 articles that will describe how to use DNA Painter.

Today, I’d like to talk about how I utilize the various vendor testing tools combined with DNAPainter to “mine my DNA,” or better put, to mine my ancestor’s DNA which is now mine, pun intended.

To review instructions for how to set up and use the DNA Painter tool, please read DNA Painter – Chromosome Sudoku for Genetic Genealogy Addicts and then come back here to proceed.

I’m going to discuss each vendor’s tools and how I’ve used them, sometimes in combination.

57% Painted

Please note that you can click on any image to enlarge

Is this not a beautiful thing to behold? That’s my ancestors, in loving color, looking back at me, on MY chromosomes.

I’m completely thrilled that I have managed to paint 57% of my chromosomes. I’m a visual person, and while I’ve worked with spreadsheets now for years, I’ve officially abandoned them. Ok, mostly.

Yes, you heard me right – I’ve abandoned the spreadsheets in favor of DNA Painter, at least for segments where I can positively identify an ancestral couple. In other words, those segments that can be reliably mapped.

That 57% is made up of 445 segments in total, split between my maternal and paternal sides. That’s without counting my mother’s DNA. While I do utilize matching to my mother in order to be sure that a match is really a valid match, I didn’t paint her DNA. Obviously, I’m going to match her 100%, and DNA painter already breaks chromosomes into my pink maternal and blue paternal sides.

Key Elements

  1. The single best thing you can do in order to paint your chromosomes is to have known family members and cousins test. You can then paint their DNA that matches yours, attributing it to their identified family line.
  2. The second best thing you can do is to work with your matches using their trees to identify your common ancestor.

Now, you’re ready to begin painting.

I’m going to step through the process I used at each vendor to identify paintable segments.

I did not paint segments that I could not identify to an ancestral line, except for my endogamous Acadian line which I labeled simply as Acadian to mark those segments that I can identify as Acadian, but I can’t identify a specific ancestor, or ancestors. When I can identify the Acadian ancestor, I paint that segment using the ancestors’ names.

Family Tree DNA

At Family Tree DNA, I begin with my closest matches that are not immediate family – meaning not my parents, children or grandchildren. I’m looking for aunts, uncles, cousins, etc. I don’t paint siblings, but often half siblings are extremely useful because they can help you identify which paternal side other matches are related to.

In the first DNA Painter article, I explained how to utilize the Family Tree DNA chromosome browser to select an individual whose matching DNA can be displayed so that you can copy and paste that segment into the painting feature of DNA Painter.

On your results page, your “bucketed individuals” who have been assigned as maternal (pink icon above) or paternal (blue icon not shown) can be a huge clue when used in conjunction with the in-common-with (ICW) tool and the matrix.

You can also search by ancestral surname and then evaluate each match through common surnames, trees and other resources. If you’re not familiar with how to use the tools at Family Tree DNA, here’s a quick run-through.

Select the individual whose DNA you wish to paint, view in the chromosome browser, then copy and paste from the grid below to the DNAPainter tool.

I painted the matching DNA of all the people whose common ancestor with me I could positively identify before moving on to the next vendor.

Who Have I Painted?

As you begin to paint segments from multiple vendors, you may wonder if you’re finding duplicates. It’s easy to tell. At DNA Painter, click on “All segment data,” below the legend in the bottom right corner.

This displays the entire list of matches whose DNA you have painted, in spreadsheet format. You can sort by match name or simply do a browser search. (CTRL+F)

You can also download this data into a cvs (Excel compatible) file at the top left of this page.

Avoiding Duplicates

As you view and paint your matches at the various vendors, you may discover that you have already found a match with that person at another vendor, either because they tested there or uploaded their autosomal file. When possible, avoid duplicate painting. It won’t help anything and will just clutter your chromosomes. You may not always be able to identify a match as a duplicate, especially if the tester utilizes a pseudonym at various locations. Don’t’ worry though, because you can always easily delete it later and a duplicate person/segment certainly won’t hurt anything.

Ok, now to our next vendor! Let’s find more segments to paint.

MyHeritage

At MyHeritage, click on DNA matches.

At the right of the search box, fly over the little pink key (or funnel) looking thing and you’ll see the option for “Has Smart Matches.” That’s what you’re looking for.

Click on the key icon.

Smart Matches mean that your DNA matches and you have a common ancestor in your trees. Click on the purple button to review this DNA match.

For each match, scroll all the way down to the bottom where your matching chromosome segments will be colored.

At the right, above the chromosome browser, click on “advanced options” which will allow you to select “download shared DNA info.” You need to download to your system so that you can copy and paste the matching segment information to DNA Painter.

MyHeritage has a few more columns than necessary, and DNA Painter can’t utilize them. Delete the columns for Name, Match Name, RSID beginning and end, and also eliminate SNPs due to an overestimation issue. In many cases, the SNPs at MyHeritage are twice or more than the number of SNPs when comparing the same segment at other vendors.

Now that your segment is cleaned up, copy the entire group shown above, minus the yellow columns which you’ve deleted, and paste into the DNA Painter spreadsheet.

MyHeritage has recently added a triangulation feature, shown at the far right, below, indicating that these two people individually triangulate with me and Alberta. The icon at far right of “5th cousin” indicates triangulation.

By clicking on the triangulation icon, you then see how that person triangulates with both your match and you – in this case, me, Alberta, and Chandler.

You may choose to paint triangulated segments, BUT, the size of the triangulated segment is often going to be smaller than the amount of DNA than you match individually to either one or both people.

In the example above, you can see that you match the pink person on a significantly longer segment than you match the tan person. The amount of DNA where you match both the pink and tan person is smaller yet, because the area where you match the tan person extends beyond where you match the pink person and vice versa. If you were going to paint ONLY the triangulated segments, you would paint only the portion that is both pink and tan, “boxed” above.

I don’t recommend painting ONLY triangulated segments, because you’ll be depriving yourself of the ability for each person to match others on the portions of the segments on which they match you, but not the other person in question.

In this example, utilizing DNA Painter, you’ll see that people in fact match you AND the pink person on several segments. The segment shown in pink, at MyHeritage, above, is shown on chromosome 5 in DNA Painter as the long mustard colored segment. Look at how many people match you on that segment. This is why we don’t paint only the triangulated portions of the chromosome. That long mustard segment match will triangulate with many people on smaller portions of that mustard segment, as evidenced by the yellow, grey, blue, cinnamon, purple and red segment matches..

DNA Painter helps you triangulate, so there is no reason to restrict your painting to triangulated segments.

Triangulation is a great tool, but don’t mix triangulated segments with matching segments in the same profile, at least not until you get the hang of the tool and using the multiple vendor’s results.

23andMe

Unfortunately, 23andMe doesn’t have tools like tree matching (MyHeritage) or maternal/paternal phasing (Family Tree DNA,) but they do allow testers to enter common surnames.

Looking at closer matches, meaning first, second or third cousins, if they list even a few surnames, you may well be able to identify the common genealogical line, especially in conjunction with ancestral locations and the other people you match in common.

Sometimes you can glean enough information to identify your common ancestor. In this case, even if I didn’t know Cheryl, the surname would have identified the ancestor. If that didn’t do it, the “in common” list below would!

Once you’ve identified the common ancestor and decide you’re ready to paint, click on the Tools tab at the top of your page and select DNA Relatives.

On the DNA Relatives tab, click on the relative whose DNA you wish to paint. I’m selecting my cousin, Cheryl.

Click on the blue DNA Comparison, in the upper right hand corner.

On the comparison screen, you will select yourself as one person and Cheryl as the other.

At the top you’ll see the two individuals and their overlapping segments painted onto chromosomes. Scroll down and you’ll see the segment detail, below.

Highlight the rows (they’ll turn blue, like above) and right click to copy the segment information.

The next step is to drop the results into a spreadsheet, just long enough to delete the first and last columns, shown in red below, then copy the remaining rows and paste into the DNA Painter tool.

Mining Ancestry Data at GedMatch

GedMatch is somewhat of a special case, because GedMatch doesn’t do DNA testing, but provides an open sharing platform by facilitating uploads of raw autosomal files from multiple other vendors. Therefore, anyone with results at GedMatch tested elsewhere. If you tested at all of the other vendors, it’s probable that you find people at GedMatch as a match that match you at other vendors too.

Because 23andMe does not support the uploading of Gedcom files, if your match has uploaded a Gedcom file to GedMatch, or connected to Geni or WikiTree, then you may be able to identify your common ancestor at GedMatch that you were not able to identify at 23andMe.

Conversely, if you match at Ancestry, you won’t be able to paint from Ancestry, because Ancestry does not provide segment information. We will talk about Ancestry as a special case next, but for now, let’s focus on how to utilize GedMatch.

At GedMatch, you’ll work in steps after setting your account up and uploading your raw data file from either:

If you tested elsewhere, or after August of 2017 at 23andMe, you will have to upload to a special section called GedMatch Genesis. GedMatch Genesis provides a sandbox area for files other than the ones listed above that are generally incompatible with those files and with each other. Genesis files often have few SNP locations in common and not enough to match reliably.

I do not recommend DNA painting utilizing segments from GedMatch Genesis.

GedMatch is currently merging their regular GedMatch service with the Genesis service, so I’m not entirely clear how you will tell the difference between the kits known to match reliably, mentioned above, and others after the merge.

Currently, kits with T prefix (Family Tree DNA), A (Ancestry) and M (23andMe) show version levels in the type field when you match in regular GedMatch. MyHeritage kits are processed by the Family Tree DNA lab. G kits used a generic upload, so you can’t tell where they originated.

Kits uploaded in the Genesis sandbox seem to be assigned double alpha letter kit prefixes at random. Genesis includes a “Testing Company” field which does not include a version number. Today, just stay with the regular GedMatch one-to many and one-to-one matching for DNA Painter.

First, you’ll want to perform a one-to-many match.

This page shows your closest 2000 results. In my case, truncating my matches at 12.7cM. This means if I want to see my results below 12.7 cM, I must subscribe to the Tier 1 Utilities in order to be able to display over 2000 matches.

We’ll discuss how to utilize Tier 1 matching in the Ancestry portion, next, but for now, we’ll just be working with the regular one-to-many matches report.

Of course, trusty cousin Cheryl has results here as well.

In order to compare Cheryl’s results to my own, I need to do two separate things:

  • Click on the A link under the Autosomal Details column (above) and/or
  • Click on the X link under the X DNA column

These two results, both of which are paintable, do not display together so must be selected separately.

By clicking on the A or X, GedMatch will display a one-to-one comparison. I leave this page (below) at the default values and simply click submit.

Your next screen will be a match grid.

Once again, select and copy the results, then paste into DNA Painter. If you also have an X match with this individual, return to the one-to-many match page and then click on the X link to repeat the same process for the X chromosome.

Ancestry Through GedMatch

As far as I’m concerned, the best thing about Ancestry matches is DNA shared ancestor hints (SAH) – meaning those green leaves visible near the green “view match” button which indicate that you share both DNA and a common ancestor(s) in your trees.

Followed immediately by the worst thing which is that Ancestry provides no segment data. However, pairing Ancestry with GedMatch can provide you with some segment information, although you do have to dig. That digging was certainly worthwhile for me, as I found several readily identifiable matches.

When I find a green leaf shared ancestor hint at Ancestry, I record as much information about that match as I can in a spreadsheet. The reason is twofold.

  • Ancestry hints tend to come and go, rather inexplicable, and I want to have that information someplace besides at Ancestry
  • I want to be able to view how many matches I have through specific ancestors which I can do in a spreadsheet by sorting.
  • I want to be able to mine GedMatch for segment information for people at Ancestry who have uploaded to GedMatch.

Note the RJE V2 results, a 6th cousin who I match at 6.6 cM, as we’ll be using that at GedMatch.

I maintain several columns in my Ancestry Match spreadsheet, as shown above. I track people who might be good Y or mitochondrial DNA candidates, as well as GedMatch numbers or other useful information.

I don’t utilize segments smaller than 7 cM for DNA Painter, BUT, Ancestry almost always under-reports the matching segment size due to their internal process which removes some segments that do match. Therefore, I search for all Ancestry matches in GedMatch and paint them if they are 7cM or over at GedMatch. You will match at Ancestry down to 6 cM. Since 7cM is the default GedMatch threshold, that works out well. I don’t find them if they are under 7cM at GedMatch, and I don’t care.

In my case to obtain segments smaller than 12.7 cM, because that is the cutoff where the free one-to-many GedMatch tool reaches the 2000 match threshold (for me,) I need to utilize the Tier 1 subscription utilities which are well worth every dollar.

The one-to-many match looks quite different for the Tier 1 tool.

You’ll need to play with this a bit to determine how high you need to set the limit to see all of your 7cM matches. In my case, I had to set it to 20,000.

I utilize two monitors, so I display my Ancestry spreadsheet on the first monitor and the GedMatch one-to-many match table on the second monitor.

Then, utilizing the browser’s search function, I search for any identifiable portion of the information for the Ancestry match at GedMatch.

In the first example, the user’s name is RJE V2. I search at GedMatch for “RJE” using “ctrl+F” which is the browser’s find function.

You can see that the search found a total of 3 different “RJE” entries. Looking at the first 2, you can see that one is labeled V4 and one is labeled V2. Typically, I would look at this and decide that the RJE V2 is the right match based on the user name at Ancestry.

However, look closer.

The RJE V2 at GedMatch has a much higher amount of shared DNA at 3587.1 cM total than the RJE V2 at Ancestry with a total of 6.6 cM. Clearly, this is not the same person, even though the user name is the same.

For all we know, a different person may have used the same user name, which is clearly an alias, noted by the “*”. Or the same person may have multiple kits at GedMatch.

However, in this case, the RJE V2 is not the same match.

However, let’s say that it is the same person and we’ve been able to reasonably identify the match. In order to compare one-to-one, click on the highlighted blue “largest segment” in the autosomal category, shown below.

If you want to compare the X one-to-one, click on the blue largest segment in that column.

From this point, the matching will look the same as the one-to-one GedMatch matching shown in the previous section – so copy and paste as normal.

While this certainly isn’t the most effective way of working with Ancestry matches, it’s really the only hope we have, unless your match has also uploaded to either Family Tree DNA or MyHeritage.

However, in my experience, I generally stand a better chance of identifying Ancestry matches at GedMatch because their user name or the user name of the person managing their account can be found much more readily. People sometimes tend to utilize the same abbreviations, names or nicknames in multiple locations.

Summary

While each vendor has unique strengths and weaknesses today, and GedMatch provides a platform used by some but not all, the best way to effectively paint your chromosomes is to utilize all of the tools available, and sometimes together. I strongly suggest that you test at or upload to each vendor, because you will find matches at each vendor that aren’t elsewhere.

How many segments can you paint on your chromosomes, and what will those segments tell you?

In the next article, I’ll be walking through my chromosome painting gallery to take a look at the hidden messages there! I hope you’ll come along so you can find some hidden messages of your own.

Enjoy!

_____________________________________________________________________

Standard Disclosure

This standard disclosure appears at the bottom of every article in compliance with the FTC Guidelines.

Hot links are provided to Family Tree DNA, where appropriate. If you wish to purchase one of their products, and you click through one of the links in an article to Family Tree DNA, or on the sidebar of this blog, I receive a small contribution if you make a purchase. Clicking through the link does not affect the price you pay. This affiliate relationship helps to keep this publication, with more than 900 articles about all aspects of genetic genealogy, free for everyone.

I do not accept sponsorship for this blog, nor do I write paid articles, nor do I accept contributions of any type from any vendor in order to review any product, etc. In fact, I pay a premium price to prevent ads from appearing on this blog.

When reviewing products, in most cases, I pay the same price and order in the same way as any other consumer. If not, I state very clearly in the article any special consideration received. In other words, you are reading my opinions as a long-time consumer and consultant in the genetic genealogy field.

I will never link to a product about which I have reservations or qualms, either about the product or about the company offering the product. I only recommend products that I use myself and bring value to the genetic genealogy community. If you wonder why there aren’t more links, that’s why and that’s my commitment to you.

Thank you for your readership, your ongoing support and for purchasing through the affiliate link if you are interested in making a purchase at Family Tree DNA, or one of the affiliate links below:

Affiliate links are limited to: