Whole Genome Sequencing – Is It Ready for Prime Time?

Dante Labs is offering a whole genomes test for $199 this week as an early Black Friday special.

Please note that just as I was getting ready to push the publish button on this article, Veritas Genetics also jumped on the whole sequencing bandwagon for $199 for the first 1000 testers Nov. 19 and 20th. In this article, I discuss the Dante Labs test. I have NOT reviewed Veritas, their test nor terms, so the same cautions discussed below apply to them and any other company offering whole genome sequencing. The Veritas link is here.

Update – Veritas provides the VCF file for an additional $99, but does not provide FASTQ or BAM files, per their Tweet to me.

I have no affiliation with either company.

$199 (US) is actually a great price for a whole genome test, but before you click and purchase, there are some things you need to know about whole genome sequencing (WGS) and what it can and can’t do for you. Or maybe better stated, what you’ll have to do with your own results before you can utilize the information for genealogical purposes.

The four questions you need to ask yourself are:

  • Why do you want to consider whole genome testing?
  • What question(s) are you trying to answer?
  • What information do you seek?
  • What is your testing goal?

I’m going to say this once now, and I’ll say it again at the end of the article.

Whole genome sequencing tests are NOT A REPLACEMENT FOR GENEALOGICAL DNA TESTS for mitochondrial, Y or autosomal testing. Whole genome sequencing is not a genealogy magic bullet.

There are both pros and cons of this type of purchase, as with most everything. Whole genome tests are for the most experienced and technically savvy genetic genealogists who understand both working with genetics and this field well, who have already taken the vendors’ genealogy tests and are already in the Y, mitochondrial and autosomal comparison data bases.

If that’s you or you’re interested in medical information, you might want to consider a whole genome test.

Let’s start with some basics.

What Is Whole Genome Sequencing?

Whole Genome Sequencing will sequence most of your genome. Keep in mind that humans are more than 99% identical, so the only portions that you’ll care about either medically or genealogically are the portions that differ or tend to mutate. Comparing regions where you match everyone else tells you exactly nothing at all.

Exome Sequencing – A Subset of Whole Genome

Exome sequencing, a subset of whole genome sequencing is utilized for medical testing. The Exome is the region identified as the portions most likely to mutate and that hold medically relevant information. You can read about the benefits and challenges of exome testing here.

I have had my Exome sequenced twice, once at Helix and once at Genos, now owned by NantOmics. Currently, NantOmics does not have a customer sign-in and has acquired my DNA sequence as part of the absorption of Genos. I’ll be writing about that separately. There is always some level of consumer risk in dealing with a startup.

I wrote about Helix here. Helix sequences your Exome (plus) so that you can order a variety of DNA based or personally themed products from their marketplace, although I’m not convinced about the utility of even the legitimacy of some of the available tests, such as the “Wine Explorer.”

On the other hand, the world-class The National Geographic Society’s Genographic Project now utilizes Helix for their testing, as does Spencer Well’s company, Insitome.

You can also pay to download your Exome sequence data separately for $499.

Autosomal Testing for Genealogy

Both whole genome and Exome testing are autosomal testing, meaning that they test chromosomes 1-22 (as opposed to Y and mitochondrial DNA) but the number of autosomal locations varies vastly between the various types of tests.

The locations selected by the genealogy testing companies are a subset of both the whole genome and the Exome. The different vendors that compare your DNA for genealogy generally utilize between 600,000 and 900,000 chip-specific locations that they have selected as being inclined to mutate – meaning that we can obtain genealogically relevant information from those mutations.

Some vendors (for example, 23andMe and Ancestry) also include some medical SNPs (single nucleotide polymorphisms) on their chips, as both have formed medical research alliances with various companies.

Whole genome and Exome sequencing includes these same locations, BUT, the whole genome providers don’t compare the files to other testers nor reduce the files to the locations useful for genealogical comparisons. In other words, they don’t create upload files for you.

The following chart is not to scale, but is meant to convey the concept that the Exome is a subset of the whole genome, and the autosomal vendors’ selected SNPs, although not the same between the companies, are all subsets of the Exome and full genome.

I have not had my whole genome sequenced because I have seen no purpose for doing so, outside of curiosity.

This is NOT to imply that you shouldn’t. However, here are some things to think about.

Whole Genome Sequencing Questions

Coverage – Medical grade coverage is considered to be 30X, meaning an average of 30 scans of every targeted location in your genome. Some will have more and some will have less. This means that your DNA is scanned thirty different times to minimize errors. If a read error happens once or twice, it’s unlikely that the same error will happen several more times. You can read about coverage here and here.

Genomics Education Programme [CC BY 2.0 (https://creativecommons.org/licenses/by/2.

Here’s an example where the read length of Read 1 is 18, and the depth of the location shown in light blue is 4, meaning 4 actual reads were obtained. If the goal was 30X, then this result would be very poor. If the goal was 4X then this location is a high quality result for a 4X read.

In the above example, if the reference value, meaning the value at the light blue location for most people is T, then 4 instances of a T means you don’t have a mutation. On the other hand, if T is not the reference value, then 4 instances of T means that a mutation has occurred in that location.

Dante Labs coverage information is provided from their webpage as follows:

Other vendors coverage values will differ, but you should always know what you are purchasing.

Ownership – Who owns your data? What happens to your DNA itself (the sample) and results (the files) under normal circumstances and if the company is sold. Typically, the assets of the company, meaning your information, are included during any acquisition.

Does the company “share, lease or sell” your information as an additional revenue stream with other entities? If so, do they ask your permission each and every time? Do they perform internal medical research and then sell the results? What, if anything, is your DNA going to be used for other than the purpose for which you purchased the test? What control do you exercise over that usage?

Read the terms and conditions carefully for every vendor before purchasing.

File Delivery – Three types of files are generated during a whole genome test.

The VCF (Variant Call Format) which details your locations that are different from the reference file. A reference file is the “normal” value for humans.

A FASTQ file which includes the nucleotide sequence along with a corresponding quality score. Mutations in a messy area or that are not consistent may not be “real” and are considered false positives.

The BAM (Binary Alignment Map) file is used for Y DNA SNP alignment. The output from a BAM file is displayed in Family Tree DNA’s Big Y browser for their customers. Are these files delivered to you? If so, how? Family Tree DNA delivers their Big Y DNA BAM files as free downloads.

Typically whole genome data is too large for a download, so it is sent on a disc drive to you. Dante provides this disc for BAM and FASTQ files for 59 Euro ($69 US) plus shipping. VCF files are available free, but if you’re going to order this product, it would be a shame not to receive everything available.

Version – Discoveries are still being made to the human genome. If you thought we’re all done with that, we’re not. As new regions are mapped successfully, the addresses for the rest change, and a new genomic map is created. Think of this as street addresses and a new cluster of houses is now inserted between existing houses. All of the houses are periodically renumbered.

Today, typically results are delivered in either of two versions: hg19(GRVH37) or hg38(GRCH38). What happens when the next hg (human genome) version is released?

When you test with a vendor who uses your data for comparison as a part of a product they offer, they must realign your data so that the comparison will work for all of their customers (think Family Tree DNA and GedMatch, for example), but a vendor who only offers the testing service has no motivation to realign your output file for you. You only pay for sequencing, not for any after-the-fact services.

Platform – Multiple sequencing platforms are available, and not all platforms are entirely compatible with other competing platforms. For example, the Illumina platform and chips may or may not be compatible with the Affymetrix platform (now Thermo Fisher) and chips. Ask about chip compatibility if you have a specific usage in mind before you purchase.

Location – Where is your DNA actually being sequenced? Are you comfortable having your DNA sent to that geographic location for processing? I’m personally fine with anyplace in either the US, Canada or most of Europe, but other locations maybe not so much. I’d have to evaluate the privacy policies, applicable laws, non-citizen recourse and track record of those countries.

Last but perhaps most important, what do you want to DO with this file/information?

Utilization

What you receive from whole genome sequencing is files. What are you going to do with those files? How can you use them? What is your purpose or goal? How technically skilled are you, and how well do you understand what needs to be done to utilize those files?

A Specific Medical Question

If you have a particular question about a specific medical location, Dante allows you to ask the question as soon as you purchase, but you must know what question to ask as they note below.

You can click on their link to view their report on genetic diseases, but keep in mind, this is the disease you specifically ask about. You will very likely NOT be able to interpret this report without a genetic counselor or physician specializing in this field.

Take a look at both sample reports, here.

Health and Wellness in General

The Dante Labs Health and Wellness Report appears to be a collaborative effort with Sequencing.com and also appears to be included in the purchase price.

I uploaded both my Exome and my autosomal DNA results from the various testing companies (23andMe V3 and V4, Ancestry V1 and V2, Family Tree DNA, LivingDNA, DNA.Land) to Promethease for evaluation and there was very little difference between the health-related information returned based on my Exome data and the autosomal testing vendors. The difference is, of course, that the Exome coverage is much deeper (and therefore more reliable) because that test is a medical test, not a consumer genealogy test and more locations are covered. Whole genome testing would be more complete.

I wrote about Promethease here and here. Promethease does accept VCF files from various vendors who provide whole genome testing.

None of these tests are designed or meant for medical interpretation by non-professionals.

Medical Testing

If you plan to test with the idea that should your physician need a genetics test, you’re already ahead of the curve, don’t be so sure. It’s likely that your physician will want a genetics test using the latest technology, from their own lab, where they understand the quality measures in place as well as how the data is presented to them. They are unlikely to accept a test from any other source. I know, because I’ve already had this experience.

Genealogical Comparisons

The power of DNA testing for genealogy is comparing your data to others. Testing in isolation is not useful.

Mitochondrial DNA – I can’t tell for sure based on the sample reports, but it appears that you receive your full sequence haplogroup and probably your mutations as well from Dante. They don’t say which version of mitochondrial DNA they utilize.

However, without the ability to compare to other testers in a database, what genealogical benefit can you derive from this information?

Furthermore, mitochondrial DNA also has “versions,” and converting from an older to a newer version is anything but trivial. Haplogroups are renamed and branches sawed from one part of the mitochondrial haplotree and grafted onto another. A testing (only) vendor that does not provide comparisons has absolutely no reason to update your results and can’t be expected to do so. V17 is the current build, released in February 2016, with the earlier version history here.

Family Tree DNA is the only vendor who tests your full sequence mitochondrial DNA, compares it to other testers and updates your results when a new version is released. You can read more about this process, here and how to work with mtDNA results here.

Y DNA – Dante Labs provides BAM files, but other whole genome sequencers may not. Check before you purchase if you are interested in Y DNA. Again, you’ll need to be able to analyze the results and submit them for comparison. If you are not capable of doing that, you’ll need to pay a third party like either YFull or FGS (Full Genome Sequencing) or take the Big Y test at Family Tree DNA who has the largest Y Database worldwide and compares results.

Typically whole genome testers are looking for Y DNA SNPs, not STR values in BAM files. STR (short tandem repeat) values are the results that you receive when you purchase the 37, 67 or 111 tests at Family Tree DNA, as compared to the Big Y test which provides you with SNPs in order to resolve your haplogroup at the most granular level possible. You can read about the difference between SNPs and STRs here.

As with SNP data, you’ll need outside assistance to extract your STR information from the whole genome sequence information, none of which will be able to be compared with the testers in the Family Tree DNA data base. There is also an issue of copy-count standardization between vendors.

You can read about how to work with STR results and matches here and Big Y results here.

Autosomal DNA – None of the major providers that accept transfers (MyHeritage, Family Tree DNA, GedMatch) accept whole genome files. You would need to find a methodology of reducing the files from the whole genome to the autosomal SNPs accepted by the various vendors. If the vendors adopt the digital signature technology recently proposed in this paper by Yaniv Erlich et al to prevent “spoofed files,” modified files won’t be accepted by vendors.

Summary

Whole genome testing, in general, will and won’t provide you with the following:

Desired Feature Whole Genome Testing
Mitochondrial DNA Presumed full haplogroup and mutations provided, but no ability for comparison to other testers. Upload to Family Tree DNA, the only vendor doing comparisons not available.
Y DNA Presume Y chromosome mostly covered, but limited ability for comparison to other testers for either SNPs or STRs. Must utilize either YFull or FGS for SNP/STR analysis. Upload to Family Tree DNA, the vendor with the largest data base not available when testing elsewhere.
Autosomal DNA for genealogy Presume all SNPs covered, but file output needs to be reduced to SNPs offered/processed by vendors accepting transfers (Family Tree DNA, MyHeritage, GedMatch) and converted to their file formats. Modified files may not be accepted in the future.
Medical (consumer interest) Accuracy is a factor of targeted coverage rate and depth of actual reads. Whole genome vendors may or may not provide any analysis or reports. Dante does but for limited number of conditions. Promethease accepts VCF files from vendors and provides more.
Medical (physician accepted) Physician is likely to order a medical genetics test through their own institution. Physicians may not be willing to risk a misdiagnosis due to a factor outside of their control such as an incompatible human genome version.
Files VCF, FASTQ and BAM may or may not be included with results, and may or may not be free.
Coverage Coverage and depth may or may not be adequate. Multiple extractions (from multiple samples) may or may not be included with the initial purchase (if needed) or may be limited. Ask.
Updates Vendors who offer sequencing as a part of a products that include comparison to other testers will update your results version to the current reference version, such as hg38 and mitochondrial V17. Others do not, nor can they be expected to provide that service.
Version Inquire as to the human genome (hg) version or versions available to you, and which version(s) are acceptable to the third party vendors you wish to utilize. When the next version of the human genome is released, your file will no longer be compatible because WGS vendors are offering sequencing only, not results comparisons to databases for genealogy.
Ownership/Usage Who owns your sample? What will it be utilized for, other than the service you ordered, by whom and for what purposes? Will you we able to authorize or decline each usage?
Location Where geographically is your DNA actually being sequenced and stored? What happens to your actual DNA sample itself and the resulting files? This may not be the location where you return your swab kit.

The Question – Will I Order?

The bottom line is that if you are a genealogist, seeking genetic information for genealogical purposes, you’re much better off to test with the standard and well know genealogy vendors who offer compatibility and comparisons to other testers.

If you are a pioneer in this field, have the technical ability required to make use of a whole genome test and are willing to push the envelope, then perhaps whole genome sequencing is for you.

I am considering ordering the Dante Labs whole genome test out of simple curiosity and to upload to Promethease to determine if the whole genome test provides me with something potentially medically relevant (positive or negative) that autosomal and Exome testing did not.

I’m truly undecided. Somehow, I’m having trouble parting with the $199 plus $69 (hard drive delivery by request when ordering) plus shipping for this limited functionality. If I was a novice genetic genealogist or was not a technology expert, I would definitely NOT order this test for the reasons mentioned above.

A whole genome test is not in any way a genealogical replacement for a full sequence mitochondrial test, a Y STR test, a Y SNP test or an autosomal test along with respective comparison(s) in the data bases of vendors who don’t allow uploads for these various functions.

The simple fact that 30X whole genome testing is available for $199 plus $69 plus shipping is amazing, given that 15 years ago that same test cost 2.7 billion dollars. However, it’s still not the magic bullet for genealogy – at least, not yet.

Today, the necessary integration simply doesn’t exist. You pay the genealogy vendors not just for the basic sequencing, but for the additional matching and maintenance of their data bases, not to mention the upgrading of your sequence as needed over time.

If I had to choose between spending the money for the WGS test or taking the genealogy tests, hands down, I’d take the genealogy tests because of the comparisons available. Comparison and collaboration is absolutely crucial for genealogy. A raw data file buys me nothing genealogically.

If I had not previously taken an Exome test, I would order this test in order to obtain the free Dante Health and Wellness Report which provides limited reporting and to upload my raw data file to Promethease. The price is certainly right.

However, keep in mind that once you view health information, you cannot un-see it, so be sure you do really want to know.

What do you plan to do? Are you going to order a whole genome test?

_____________________________________________________________________

Standard Disclosure

This standard disclosure appears at the bottom of every article in compliance with the FTC Guidelines.

I provide Personalized DNA Reports for Y and mitochondrial DNA results for people who have tested through Family Tree DNA. I provide Quick Consults for DNA questions for people who have tested with any vendor. I would welcome the opportunity to provide one of these services for you.

Hot links are provided to Family Tree DNA, where appropriate. If you wish to purchase one of their products, and you click through one of the links in an article to Family Tree DNA, or on the sidebar of this blog, I receive a small contribution if you make a purchase. Clicking through the link does not affect the price you pay. This affiliate relationship helps to keep this publication, with more than 900 articles about all aspects of genetic genealogy, free for everyone.

I do not accept sponsorship for this blog, nor do I write paid articles, nor do I accept contributions of any type from any vendor in order to review any product, etc. In fact, I pay a premium price to prevent ads from appearing on this blog.

When reviewing products, in most cases, I pay the same price and order in the same way as any other consumer. If not, I state very clearly in the article any special consideration received. In other words, you are reading my opinions as a long-time consumer and consultant in the genetic genealogy field.

I will never link to a product about which I have reservations or qualms, either about the product or about the company offering the product. I only recommend products that I use myself and bring value to the genetic genealogy community. If you wonder why there aren’t more links, that’s why and that’s my commitment to you.

Thank you for your readership, your ongoing support and for purchasing through the affiliate link if you are interested in making a purchase at Family Tree DNA, or one of the affiliate links below:

Affiliate links are limited to:

Jacob Lentz’s Signatures: Cursive and Genetic – 52 Ancestors #216

What is a signature anyway?

A signature is defined as a mark or something that personally identifies an individual. A form of undeniable self-identification.

Of course, that’s exactly why I seek my ancestors’ signatures, both their handwriting and their genetic signature.

Jacob Lentz was born in Germany in 1783 and died in 1870 in Ohio.

Most documents of that timeframe contained only facsimiles of actual signatures. Original deeds indicate that the document was signed, but when recorded in deed books at the courthouse, the clerk only transcribed the signature. The person recorded the physical deed that they had in their hand, and then took it home with them. Therefore, the deed book doesn’t hold the original signature – the original deed does. I was crestfallen years ago when I discovered that fact. ☹

Hence, the actual physical signature of an ancestor is rare indeed.

Recently, I’ve been lucky enough to find not one, but two actual signatures of Jacob Lentz – plus part of his genetic signature as well.

Jacob’s Handwritten Signatures

When Jacob Lenz, later Lentz in the US, petitioned to leave Germany in 1817, he signed the petition document.

The original document is in the “Weinstadt City Archive”, which kindly gave permission for the reproduction and was graciously retrieved by my distant cousin, Niclas Witt. Thank you very much to both!

Here’s Jacob’s actual signature.

The story of Jacob’s life and immigration, and what a story it is, is recorded here, here, here and here.

Jacob’s life has a missing decade or so, after he completed his indentured servitude about 1820 or 1821 in Pennsylvania and before he arrived in Montgomery County, Ohio about 1830. In Ohio, he purchased land and began creating records. That’s where I found him initially.

Jacob’s youngest child, Mary Lentz, was born in May or June of 1829, before leaving Pennsylvania. She married in Montgomery County, Ohio on December 19, 1848 to Henry Overlease. That marriage document contains the signature of her father, Jacob Lentz.

This signature is slightly different than the German one from 31 years earlier, but it’s still clearly our Jacob, as the document states that the parents have signed. It looks like he’s also incorporated the “t” into the name now as well.

Jacob Lentz’s Genetic Signatures

As I was celebrating the discovery of not one, but two versions of Jacob’s written signature, I realized that I carry part of Jacob’s genetic signature too, as do others of his descendants. I just never thought of it quite like that before.

His genetic signature is every bit as personal, and even better because it’s in me, not lost to time.

There are three types of DNA that can provide genetic signatures of our ancestors; mitochondrial, Y DNA and autosomal.

Mitochondrial DNA

Mitochondrial DNA is passed from mothers to all genders of their children, but only their daughters pass it on. Therefore, it’s primarily unchanged, generation to generation.

Being a male, Jacob couldn’t pass his mitochondrial DNA on to his descendants, so we have to discover Jacob’s mitochondrial DNA by testing someone else who descends from his mother’s direct matrilineal line through all females but can be a male in the current generation.

Unfortunately, we haven’t been able to discover Jacob’s mitochondrial DNA that he inherited from his matrilineal line, meaning his mother’s mother’s mother’s line.

However, we only identified his parents a few months ago. Most of Jacob’s family didn’t immigrate, so perhaps eventually the right person will test who descends from his mother, or her matrilineal line, through all women to the current generation.

Jacob’s matrilineal line is as follows, beginning with his mother:

  • Jacob’s mother – Maria Margaretha Gribler born May 4, 1749 and died July 5, 1823 in Beutelsbach, married Jakob Lenz November 3, 1772.
  • Her mother, Katharina Nopp born April 23, 1707 and died November 27, 1764 in Beutelsbach, married Johann Georg Gribler on October 26, 1745.
  • Agnes Back/Beck born November 26, 1673 in Aichelberg, Germany, died February 10, 1752 in Beutelsbach and married Johann Georg Nopp from Beutelsbach.
  • Margaretha, surname unknown, from Magstadt who married Dionysus Beck who lived in Aichelberg, Germany.

If you descend from any of these women, or their female siblings through all females to the current generation, I have a DNA testing scholarship for mitochondrial DNA at Family Tree DNA for you! I’ll throw an autosomal Family Finder test in too!

If you’d like a read a quick article about how mitochondrial, Y DNA and autosomal DNA work and are inherited, click here.

Y-DNA

On the other hand, Jacob did contribute his Y DNA to his sons. Lentz male descendants, presuming no adoptions, carry Jacob’s Y DNA signature as their own.

We are very fortunate to have Jacob Lentz’s Y DNA signature, thanks to two male Lentz cousins. I wrote about how unique the Lentz Y DNA is, and that we’ve determined that our Lentz line descends from the Yamnaya culture in Russia some 3500 years ago. How did we do that? We match one of the ancient burials. Jacob’s haplogroup is R-BY39280 which is a shorthand way of telling us about his clan.

On the Big Y Tree, at Family Tree DNA, we can see that on our BY39280 branch, we have people whose distant ancestors were found in two locations, France and Germany. On the next upstream branch, KMS67, the parent of BY39280, we find people with that haplogroup in Switzerland and Greece.

Our ancestors are amazingly interesting.

Autosomal DNA

Jacob shares his Y and mitochondrial DNA, probably exactly, with other relatives, since both Y and mitochondrial DNA is passed intact from generation to generation, except for an occasional mutation.

However, Jacob’s autosomal DNA was the result of a precise combination of half of his mother’s and half of his father’s autosomal DNA. No one on this earth had the exact combination of DNA as Jacob. Therefore, Jacob’s autosomal DNA identifies him uniquely.

Unfortunately, Jacob isn’t alive to test, and no, I’m not digging him up – so we are left to piece together Jacob’s genetic signature from the pieces distributed among his descendants.

I realized that by utilizing DNAPainter, which allows me to track my own segments by ancestor, I have reconstructed a small portion of Jacob’s autosomal DNA.

Now, there’s a hitch, of course.

Given that there are no testers that descend from the ancestors of either Jacob or his wife, Fredericka Ruhle, at least not that I know of, I can’t sort out which of these segments are actually Jacob’s and which are Fredericka’s.

In the chart above, the tester and my mother match each other on the same segments, but without testers who descend from the parents of Jacob and Fredericka, through other children and also match on that same segment, we can’t tell which of those common segments came from Jacob and which from Fredericka. If my mother and the tester matched a tester from Jacob’s siblings, then we would know that their common segment descended through Jacob’s line, for example.

Painting Jacob’s Genetic Signature

The segments in pink below show DNA that I inherited from either Jacob or Fredericka. I match 8 other cousins who descend from Jacob Lentz and Fredericka Ruhle on some portion of my DNA – and in many cases, three or more descendants of Jacob/Fredericka match on the same exact segment, meaning they are triangulated.

As you can see, I inherited a significant portion of my maternal chromosome 3 from Jacob or Fredericka, as did my cousins. I also inherited portions of chromosomes 7, 9, 18 and 22 from Jacob or Fredericka as well. While I was initially surprised to see such a big piece of chromosome three descending from Jacob/Fredericka, Jacob Lentz and Fredericka Ruhle aren’t really that distantly removed – being my great-great-great-grandparents, or 5 generations back in time.

Based on the DNAPainter calculations, these segments represent about 2.4% of my DNA segments on my maternal side. The expected amount, if the DNA actually was passed in exactly half (which seldom happens,) would be approximately 3.125% for each Jacob and Fredericka, or 6.25% combined. That means I probably carry more of Jacob/Fredericka’s DNA that can eventually be identified by new cousin matches!

Of course, my cousins may well share segments of Jacob’s DNA with each other that I don’t, so those segments won’t be shown on my DNAPainter graph.

However, if we were to create a DNAPainter chart for Jacob/Fredericka themseves, and their descendants were to map their shared segments to that chart, we could eventually recreate a significant amount of Jacob’s genetic signature through the combined efforts of his descendants – like reassembling a big puzzle where we all possess different pieces of the puzzle.

Portions of Jacob’s genetic signature are in each of his descendants, at least for several generations! Reassembling Jacob would be he ultimate scavenger hunt.

What fun!

Resources

You can order Y and mitochondrial DNA tests from Family Tree DNA here, the only company offering these tests.

You can order autosomal tests from either Family Tree DNA or MyHeritage by clicking on those names in this sentence. You’ll need segment information that isn’t available at Ancestry, so I recommend testing with one of these two companies.

23andMe and Gedmatch also provide segment information. Some people who test at both 23andMe and Ancestry upload to GedMatch, so be sure to check there as well.

You can transfer your autosomal DNA files from one company to the other, with instructions for Family Tree DNA here and MyHeritage here, including how to transfer from Ancestry here.

You can learn how to use DNA Painter here, here and here.

Whose genetic signatures can you identify?

_____________________________________________________________________

Standard Disclosure

This standard disclosure appears at the bottom of every article in compliance with the FTC Guidelines.

I provide Personalized DNA Reports for Y and mitochondrial DNA results for people who have tested through Family Tree DNA. I provide Quick Consults for DNA questions for people who have tested with any vendor. I would welcome the opportunity to provide one of these services for you.

Hot links are provided to Family Tree DNA, where appropriate. If you wish to purchase one of their products, and you click through one of the links in an article to Family Tree DNA, or on the sidebar of this blog, I receive a small contribution if you make a purchase. Clicking through the link does not affect the price you pay. This affiliate relationship helps to keep this publication, with more than 900 articles about all aspects of genetic genealogy, free for everyone.

I do not accept sponsorship for this blog, nor do I write paid articles, nor do I accept contributions of any type from any vendor in order to review any product, etc. In fact, I pay a premium price to prevent ads from appearing on this blog.

When reviewing products, in most cases, I pay the same price and order in the same way as any other consumer. If not, I state very clearly in the article any special consideration received. In other words, you are reading my opinions as a long-time consumer and consultant in the genetic genealogy field.

I will never link to a product about which I have reservations or qualms, either about the product or about the company offering the product. I only recommend products that I use myself and bring value to the genetic genealogy community. If you wonder why there aren’t more links, that’s why and that’s my commitment to you.

Thank you for your readership, your ongoing support and for purchasing through the affiliate link if you are interested in making a purchase at Family Tree DNA, or one of the affiliate links below:

Affiliate links are limited to:

Ethnicity – Far More than Percentages!

Since ethnicity results have been in the news recently, I thought this might be a good time to talk about how to squeeze more out of your ethnicity results than just percentages.

You do know there’s more, right? You can tell a lot more about where your ethnicity came from by who you match, and how. Vendors provide that information too, but you need to know where to look. Plus, I have some tips about how to use this information effectively.

Genealogists are always trying to squeeze every last drop of information out of every DNA test, so I’d like to illustrate how I use ethnicity in combination with shared matches at Ancestry, Family Tree DNA, MyHeritage and 23andMe. Each vendor has a few unique features and tools as well, plus people in their databases that other vendors don’t have.

Come along and see what you might discover!

Ancestry

Ancestry recently introduced a new ethnicity comparison feature so let’s start there. Ancestry’s new tool:

  • Compares the ethnicity of you and a match side by side.
  • Shows Shared Migrations
  • Shows you common matches with that person.

At Ancestry, I have a V1 (older) and a V2 (newer) test, so I’m comparing my own V1 to my own V2 test for purposes of illustration.

To start, click on DNA Matches. You’ll see a new blue compare button, beneath the green View Match button, at right.

Clink on any image to enlarge

Click on the blue Compare button. You’ll see a side by side display, shown below.

My V1, at left, compared to my V2 test, at right. My V2 test results do not have a photo uploaded, so you just see my initials. It’s interesting to note that even though these are both me, just tested on different chips, that my ethnicity doesn’t match exactly, although it’s mighty close.

Next, you’ll see the shared migrations between the two people being compared. This helps determine where your common ancestor might be found.

Last, you’ll see the shared matches between you and the other person. This means that those people match both you and the person you’re comparing against, suggesting a potential common ancestor.

On your matches page, you can also sort your matches by your regions.

Where Did Your Ethnicity Come From?

Ethnicity comparisons can be helpful, especially if you’re a person who carries DNA from different continents. I do not suggest trying to compare intra-continental estimates in the same way. It’s simply too difficult for vendors to separate DNA from locations that all border each other where countries are the size of states in the US, such as the Netherlands, Germany, France and Switzerland for example.

As I’ve said before, ethnicity results are only estimates, but they are relatively accurate at the continental level, plus Jewish, as illustrated below.

To be specific, these regions are the easiest for vendors to tell apart from the other regions:

  • European
  • African
  • Native American (North American, South American, Central American and Siberian in conjunction with the Americas)
  • Asian
  • Jewish

For example, if you are 30% African, 35% Native American and 35% European, you could use this information to form a hypothesis about how you match a particular individual or group of individuals.

If the person you match is 50% Asian and 50% African, it’s most likely that the region you match them on is the common African side.

Of course, the next step would be to look at the shared matches to see if those matches include your known relatives with African heritage. This is one reason I always encourage testing of relatives. Who you and your known relative both match tells you a lot about where the common ancestor of a matching group of individuals is found in your tree. For example, if someone matches you and a first cousin, then the common ancestor of the three people is on the side of your tree that you share with the first cousin.

Not exactly sure, or dealing with smaller amounts of continental ethnicity? There’s another way to work with ethnicity.

Ethnicity Match Chart

Make an Ethnicity Match Chart that includes the ethnicity of each person in the match group, as follows.

In this example, the only category in which all people fall is African, so that’s where I’d look in my tree first for a family connection.

Keep in mind that you match person 1, and people 2-4 match both you and person 1.

That does NOT mean that:

  • Person 2, 3 or 4 match each other.
  • Any of those people share the same ancestor with each other. Yes, you can match due to different ancestors that might not have anything to do with each other.
  • These people match on any of the same segments. You can’t view segments at Ancestry. You’ll have to transfer your results to Family Tree DNA, MyHeritage or GedMatch to do that.

Next, look at the trees for each person in the common match group and see if you can discern any common genealogy or even common geography. The best hints of course, at Ancestry, are those green leaf Shared Ancestor Hints. If you find a common ancestor or line, you’re well on your way to identifying how those people are related to you and potentially your match as well.

You could also use this methodology as an adaptation of or in tandem with the Leeds Method that I wrote about here.

Comparing Segments – Yes, You’ll Need To

Ancestry doesn’t offer a chromosome browser, but Family Tree DNA, MyHeritage, 23andMe and GedMatch all do, allowing you to view segments and triangulate. I always suggest uploading Ancestry results to GedMatch, Family Tree DNA and MyHeritage. 23andMe does not accept uploads.

You’ll find instructions for downloading from Ancestry here, uploading to Family Tree DNA here, and to MyHeritage here.

Other Vendors

Each vendor offers their own version of ethnicity comparison. All vendors offer in common with (ICW) and shared match tools too, so you can create your Ethnicity Match Chart for a specific group of people from any vendor’s results – although I don’t mix vendor results on one chart. Plus, every vendor has people in their matching database that no other vendor has, so fish in every pond.

Family Tree DNA

Family Tree DNA offers shared ethnicity information on the myOrigins map. To view, click on MyOrigins, then on View MyOrigins Map.

Testers who opt in can view their ethnicity as compared to their matches’ ethnicity. You can also sort by ethnicity as well as use the pin function at bottom right to drop Y and mtDNA most distant ancestor pins on the map.

Please note that this is NOT where your match lives, but is the location of their most distant matrilineal (mtDNA) or patrilineal (surname) known individual.

If you’re looking for Native American matches, for example, you might look for someone with some percentage of Native American autosomal DNA and/or Native American Y or mitochondrial haplogroups. Click on any pin to view that person and their ethnicity that matches yours. You can also search for a specific individual to see how your ethnicity lines up.

On your match list, look for common surnames with those matches, see who you match in common and check your matches’ trees.

Linking your DNA matches to their location in your tree enables you to participate in Phased Family Matching, meaning you can then select people that are assigned to your maternal or paternal sides to view in the chromosome browser.

When viewing all maternal (red icon) or all paternal (blue icon) matches together on the chromosome browser, the segments are automatically mathematically triangulated. All you need to do is identify the common ancestor!

I love Phased Family Matches. Family Tree DNA is the only vendor to offer this feature and to incorporate Y and mitochondrial DNA.

MyHeritage

MyHeritage provides multiple avenues for comparison, allowing users to select matches by their ethnicity, country or to simply compare their ethnicity to each other. To view matches by ethnicity, click on the Filter button, but note that not all ethnicity locations are included. You can also combine options, such as looking for anyone from the Netherlands with Nigerian DNA.

To view your matches ethnicity as compared to yours, click on the match and scroll down.

Look for people you match in common as well as the triangulation icon, shown at right, below. Another feature, SmartMatches (a filter option) sort for people who have common ancestors with you in trees.

I love triangulation and DNA SmartMatches and MyHeritage is the only vendor to offer this combination of tools!

23andMe

At 23andMe, you can see your ethnicity beside that of your match by clicking on DNA Relatives, on the Ancestry tab, then click on the person you wish to compare to. In my case, I’ve also taken the V3 and V4 test at 23andMe, so I’m comparing to myself.

At 23andMe, you can view which portions of your segments are attributed to which ethnicity. Under the Ancestry tab, click Ancestry Composition and scroll down to view your Ancestry Composition Chromosome Painting.

You can see my Native American segments on chromosomes 1 and 2.

Click on Scientific Details, then scroll to the bottom to download your ethnicity raw data that includes the segment detail for the location of those specific segments.

Utilizing these chromosome and segment locations with any other vendor who supports a chromosome browser, and determining which side that ethnicity descends through allows you to identify matches who should also carry segments of that same ethnicity at that same location.

Here’s my Native segment on chromosome 2 from the download file. Remember, you have two copies of every chromosome – and in my case, only one of those copies on Chromosome 2 is Native. I know it’s from my mother, so anyone matching me on my maternal side at this location on chromosome 2 should also have a Native segment, and our common ancestor is the source of our common Native American heritage.

23andMe is the only vendor to identify ethnicity segments.

23andMe does show matches in common and common matching segments on the chromosome browser, but they don’t support trees.

Your Turn!

If you carry ethnicity from multiple continents (plus Jewish), what hints can you derive from using your ethnicity as a match tool?

_____________________________________________________________________

Standard Disclosure

This standard disclosure appears at the bottom of every article in compliance with the FTC Guidelines.

I provide Personalized DNA Reports for Y and mitochondrial DNA results for people who have tested through Family Tree DNA. I provide Quick Consults for DNA questions for people who have tested with any vendor. I would welcome the opportunity to provide one of these services for you.

Hot links are provided to Family Tree DNA, where appropriate. If you wish to purchase one of their products, and you click through one of the links in an article to Family Tree DNA, or on the sidebar of this blog, I receive a small contribution if you make a purchase. Clicking through the link does not affect the price you pay. This affiliate relationship helps to keep this publication, with more than 900 articles about all aspects of genetic genealogy, free for everyone.

I do not accept sponsorship for this blog, nor do I write paid articles, nor do I accept contributions of any type from any vendor in order to review any product, etc. In fact, I pay a premium price to prevent ads from appearing on this blog.

When reviewing products, in most cases, I pay the same price and order in the same way as any other consumer. If not, I state very clearly in the article any special consideration received. In other words, you are reading my opinions as a long-time consumer and consultant in the genetic genealogy field.

I will never link to a product about which I have reservations or qualms, either about the product or about the company offering the product. I only recommend products that I use myself and bring value to the genetic genealogy community. If you wonder why there aren’t more links, that’s why and that’s my commitment to you.

Thank you for your readership, your ongoing support and for purchasing through the affiliate link if you are interested in making a purchase at Family Tree DNA, or one of the affiliate links below:

Affiliate links are limited to:

Ancestry Displays City/State Where You Live on Map to Your DNA Matches

A new Ancestry feature, in beta mode, has been rolled out to many, if not most, users. Truthfully, I was quite surprised to discover that Ancestry is displaying the location where I currently live to my DNA matches through fourth cousins.

I never intentionally gave permission for this, meaning I never expected the location where I live to be utilized in this fashion. I’ve been an Ancestry subscriber for many years, and while I may have entered my location information originally, I certainly would never have done that today. We live in a different “privacy breach,” “identity theft” and otherwise unpleasant world than we did a few years ago.

The potential ramifications of this mapping tool are mind-boggling – both negative and positive, depending on your perspective.

For people searching for unknown parents or not terribly distant ancestors, the location information is awesome. Ancestry clearly knows this, which is why your matches to 4th cousins are shown. They are your genealogically most useful matches.

For those more concerned with privacy, this feature could open the door to a number of dangerous or at least unpleasant situations – from dangerously crazy people to family stalkers to unknown children/parent situations resulting in someone landing unexpectedly on your doorstep. I may not want to meet a previously unknown sibling, especially not at my house. And certainly not without some amount of preparation first – including a criminal background check. And yes, I’ve been there and done that, in case you were wondering.

Seeing where I live on a map, displayed to my genetic matches brought me face to face with the realization of how careful we need to be with what we choose, even inadvertently, to share. It’s also important to review your past selections to be sure they are still what you want.

So, here’s how to use the tool and how to change your location if you wish to do so.

Ancestry Matches Map

On your matches tab, beside the blue Search Matches button, click on Matches Map.

Next, you’ll see the map with what appears to only be your matches through 4th cousins, although I can’t verify that exactly. I know 4th cousin matches are included and I didn’t see any more distant.

You can see your own pin, in red.

You can click on any of these pins to view the city and state where that person lives based on the information they provided in their profile.

Here’s how to change your location.

Changing Your Location

To change the location, click on your pin on the map.

You’ll see this popup.

I tried to simply remove the information, but I was not allowed to save. A location is required in this tab, but if you go directly to your Profile, accessible from your user ID on your main page, you can remove the location entirely and save.

Before I discovered that selecting my profile directly allowed me to remove my location entirely, I entered the location where I’d love to live. I now live in Bergen, Norway:)

If you’re not comfortable with the city being displayed, but the state is fine, then you can make that modification as well. If you no longer live where you were born, your birth location might be more useful genealogically.

However, even though the new location is displayed to you on the map when you change to a new location, it is NOT CHANGED on the Ancestry map site at the same time. I signed out, signed in again, and the map pin is still displaying my previous location, even though my profile now reflects the new location. It took a few hours for the change to take effect.

Safety and Privacy Considerations

I would strongly prefer that Ancestry provide an opt-in option for people to have their location displayed to their matches, or for that matter, to anyone – especially since a location is required on the map tab when you attempt to make a change. This would avoid the surprise factor of seeing your location revealed on a map. I’m fine with ancestral locations, but not with where I currently live.

As a genealogist, I can certainly see how this feature would be useful. If you’re fine with having the city/state where you live revealed to your matches and other Ancestry users who view your profile, then this is a great tool and you don’t need to change anything.

Do be aware that your location information combined with your name and a search tool like Intellus or BeenVerified can/will reveal your address, phone, e-mail, family members names and more.

Now is a good time to review your profile. Consider what you are willing to reveal and make any changes accordingly.

Family Tree DNA’s Mitochondrial Haplotree

On September 27th, 2018 Family Tree DNA published the largest Y haplotree in the world, based on SNP tests taken by customers. Now, less than two weeks later, they’ve added an exhaustive mitochondrial DNA (mtDNA) public haplotree as well, making this information universally available to everyone.

Family Tree DNA’s mtDNA Haplotree is based on the latest version of the mtDNA Phylotree. The new Family Tree DNA tree includes 5,434 branches derived from more than 150,000 full sequence results from 180+ different countries of origin. Family Tree DNA‘s tree has SIX TIMES more samples than the Phylotree. Furthermore, Family Tree DNA only includes full sequence results, where Phylotree includes partial results.

This new tree is a goldmine! What does it provide that that’s unique? Locations – lots of locations!

The Official Phylotree

Unlike the Y DNA tree, which is literally defined and constructed by the genetic community, new mitochondrial DNA branches cannot be added to the official mitochondrial Phylotree by Family Tree DNA. Haplogroups, meaning new branches in the form of SNPs are added to the Y tree as new SNPs are discovered and inserted into the tree in their proper location. The mitochondrial DNA phylotree can’t be expanded by a vendor in that manner.

The official mitochondrial Phylotree is maintained at www.phylotree.org and is episodically updated. The most recent version was mtDNA tree build 17, published and updated in February 2016. You can view version history here.

Mitochondrial Phylogenic Tree Version 17

Version 17 of the official mitochondrial tree consists of approximately 5,400 nodes, or branches with a total of 24,275 samples uploaded by both private individuals and academic researchers which are then utilized to define haplogroup branches.

Individuals can upload their own full sequence results from Family Tree DNA, but they must be in a specific format. I keep meaning to write detailed instructions about how to submit your full sequence test results, but so far, that has repeatedly slipped off of the schedule. I’ll try to do this soon.

In a nutshell, download your FASTA file from Family Tree DNA and continue with the submission process here. The instructions are below the submission box, so scroll down.

In any case, the way that new branches are added to the phylotree is when enough new results with a specific mutation are submitted and evaluated, the tree will have a new branch added in the next version. That magic number of individuals with the same mutation was 3 in the past, but now that so many more people are testing, I’m not sure if that number holds, or if it should. Spontaneous mutations can and do happen at the same location. The Phylotree branches mean that the haplogroup defining mutations indicate a common ancestor, not de novo separate mutations. That’s why analysis has to be completed on each candidate branch.

How do Mitochondrial DNA Branches Work?

If you are a member of haplogroup J1c2f today, and a certain number of people in that haplogroup have another common mutation, that new mutation may be assigned the designation of 1, as in J1c2f1, where anyone in haplogroup J1c2f who has that mutation will be assigned to J1c2f1.

While the alternating letter/number format is very easy to follow, some problems and challenges do exist with the alternating letter/number haplogroup naming system.

The Name of the Game

The letter number system works fine if not many new branches are added, branches don’t shuffle and if the growth is slow. However, that’s not the case anymore.

If you recall, back in July of 2012, which is equivalent to the genetic dark ages (I know, right), the Y tree was also represented with the same type of letter number terminology used on the mitochondrial tree today.

For example, Y DNA haplogroup R-M269 was known as R1b1a2, and before that the same haplogroup was known as R1b1c. The changes occurred because so many new haplgroups were being discovered that a new sprout wasn’t added from time to time, but entire branches had to be sawed off and either discarded or grafted elsewhere. It became obvious that while the R1b1a2 version was nice, because it was visually obvious that R1b1a2a was just one step below R1b1a2, that long term, that format just wasn’t going to be able to work anymore. New branches weren’t just sprouting, wholesale shuffling was occurring. Believe it or not, we’re still on the frontier of genetic science.

In 2012, the change to the SNP based haplogroup designations was introduced by Family Tree DNA, and adopted within the community.

The ISOGG tree, the only tree that still includes the older letter/number system and creates extended letter number haplogroup names as new SNPs are added provides us with an example of how much the Y tree has grown.

You can see that the letter/number format haplogroups to the far right are 19 locations in length. The assigned SNP or SNPs associated with that haplogroup are shown as well. Those 19-digit haplogroup names are just too unwieldy, and new haplogroups are still being discovered daily.

It’s 2012 All Over Again

That’s where we are with mitochondrial DNA today, but unlike Y DNA naming, a vendor can’t just make that change to a terminal SNP based naming system because all vendors conform to the published Phylotree.

However, in this case, the vendor, Family Tree DNA has more than 6 times the number of full sequence mitochondrial results than the mitochondrial reference model Phylotree. If you look at the haplogroup projects at Family Tree DNA, you’ll notice that (some) administrators routinely group results by a specific mutation that is found within a named haplogroup, meaning that the people with the mutation form a subgroup that they believe is worthy of its own haplogroup subgroup name. The problem is that unless enough people upload their results to Phylotree, that subgroup will never be identified, so a new haplogroup won’t be added.

If the entire Family Tree DNA data base were to be uploaded to Phylotree, can you imagine how many new haplogroups would need to be formed? Of course, Family Tree DNA can’t do that, but individual testers can and should.

Challenges for Vendors

The challenge for vendors is that every time the phylotree tree is updated and a new version is produced, the vendors must “rerun” their existing tester samples against the new haplogroup defining mutations to update their testers’ haplogroup results.

In some cases, entire haplogroups are obsoleted and branches moved, so it’s not a simple matter of just adding a single letter or digit. Rearranging occurs, and will occur more and more, the more tests that are uploaded to Phylotree.

For example, in the Phylotree V17 update, haplogroup A4a1 became A1a. In other words, some haplogroups became entirely obsolete and were inserted onto other branches of the tree.

In the current version of the Phylotree, haplogroup A4 has been retired.

Keep in mind that all haplogroup assignments are the cumulative combination of all of the upstream direct haplogroups. That means that haplogroup A4a1, in the prior version, had all of the haplogroup defining mutations shown in bold in the chart below. In the V17 version, haplogroup A1a contains all of the mutations shown in bold red. You might notice that the haplogroup A4 defining mutation T16362C is no longer included, and haplogroup A4, plus all 9 downstream haplogroups which were previously dependent on T16362C have been retired. A4a1 is now A1a.

Taking a look at the mitochondrial tree in pedigree fashion, we can see haplogroup A4a1 in Build 15 from September 2012, below.

Followed by haplogroup A1a in the current Build 17.

Full Sequence Versus Chip Based Mitochondrial Testing

While Family Tree DNA tests the full sequence of their customers who purchase that level of testing, other vendors don’t, and these changes wreak havoc for those vendors, and for compatibility for customer attempting to compare between data bases and information from different vendors.

That means that without knowing which version of Phylotree a vendor currently uses, you may not be able to compare meaningfully with another user, depending on changes that occurred that haplogroup between versions. You also need to know which vendor each person utilized for testing and if that vendor’s mitochondrial results are generated from an autosomal style chip or are actually a full mitochondrial sequence test. Utilizing the ISOGG mtDNA testing comparison chart, here’s a cheat sheet.

Vendor No Mitochondrial Chip based haplogroup only mitochondrial Full Sequence mitochondrial
Family Tree DNA No Yes – V17
23andMe Yes – Build V7 No
Ancestry None
LivingDNA Yes – Build V17 No
MyHeritage None
Genographic V2 Yes – Build V16 No

Of the chip-based vendors, 23andMe is the most out of date, with V7 extending back to November of 2009. The Genographic Project has done the best job of updating from previous versions. LivingDNA entered the marketplace in 2016, utilizing V17 when they began.

Family Tree DNA’s mitochondrial test is not autosomal chip based, so they don’t encounter the problem of not having tested needed locations because they test all locations. They have upgraded their customers several times over the years, with the current version being V17.

Family Tree DNA’s mitochondrial DNA test is a separate test from their Family Finder autosomal test while the chip-based vendors provide a base-level haplogroup designation that is included in their autosomal product. However, for chip-based vendors, updating that information can be very challenging, especially when significant branch changes occur.

Let’s take a closer look.

Challenges for Autosomal Chip-Based Vendors Providing Mitochondrial Results

SNP based mitochondrial and Y DNA testing for basic haplogroups that some vendors include with autosomal DNA is a mixed blessing. The up side, you receive a basic haplogroup. The down aide, the vendor doesn’t test anyplace near all of the 16,569 mitochondrial DNA SNP locations.

I wrote in detail about how this works in the article, Haplogroup Comparisons Between Family Tree DNA and 23andMe. Since that time, LivingDNA has also added some level of haplogroup reporting through autosomal testing.

How does this work?

Let’s say that a vendor tests approximately 4000 mitochondrial DNA SNPs on the autosomal chip that you submit for autosomal DNA testing. First, that’s 4000 locations they can’t use for autosomal SNPs, because a DNA chip has a finite number of locations that can be utilized.

Secondly, and more importantly, it’s devilishly difficult to “predict” haplogroups at a detailed level correctly. Therefore, some customers receive a partial haplogroup, such as J1c, and some receive more detail.

It’s even more difficult, sometimes impossible, to update haplogroups when new Phylotree versions are released.

Why is Haplogroup Prediction and Updating so Difficult?

The full mitochondrial DNA sequence is 16,569 locations in length, plus or minus insertions and deletions. The full sequence test does exactly what that name implies, tests every single location.

Now, let’s say, by way of example, that location 10,000 isn’t used to determine any haplogroup today, so the chip-based vendors don’t test it. They only have room for 4000 of those locations on their chip, so they must use them wisely. They aren’t about to waste one of those 4000 spaces on a location that isn’t utilized in haplogroup determination.

Let’s say in the next release, V2, that location 10,000 is now used for just one haplogroup definition, but the haplogroup assignment still works without it. In other words, previously to define that haplogroup, location 9000 was used, and now a specific value at location 10,000 has been added. Assuming you have the correct value at 9,000, you’re still golden, even if the vendor doesn’t test location 10,000. No problem.

However, in V3, now there are new haplogroup subgroups in two different branches that use location 10,000 as a terminal SNP. A terminal SNP is the last SNP in line that define your results most granularly. In haplogroup J1c2f, the SNP(s) that define the f are my terminal SNPs. But if the vendor doesn’t test location 10,000, then the mutation there can’t be used to determine my terminal SNP, and my full haplogroup will be incomplete. What now?

If location 10,000 isn’t tested, the vendor can’t assign those new haplogroups, and if any other haplogroup branch is dependent on this SNP location, they can’t be assigned correctly either. Changes between releases are cumulative, so the more new releases, the further behind the haplogroup designations become.

Multiple problems exist:

  • Even if those vendors were to recalculate their customer’s results to update haplogroups, they can’t report on locations they never tested, so their haplogroup assignments become increasingly outdated.
  • To update your haplogroup when new locations need to be tested, the vendor would have to actually rerun your actual DNA test itself, NOT just update your results in the data base. They can’t update results for locations they didn’t test.
  • Without running the full mitochondrial sequence, the haplogroup can never be more current than the locations on the vendor’s chip at the time the actual DNA test is run.
  • No vendor runs a full sequence test on an autosomal chip. A full mitochondrial sequence test at Family Tree DNA is required for that.
  • Furthermore, results matching can’t be performed without the type of test performed at Family Tree DNA, because people carry mutations other than haplogroup defining mutations. Haplogroup only information is entertaining and can sometimes provide you with base information about the origins of your ancestor (Native, African, European, Asian,) but quickly loses its appeal because it’s not specific, can’t be used for matching and can’t reliably be upgraded.

The lack of complete testing also means that while Family Tree DNA can publish this type of tree and contribute to science, the other vendors can’t.

Let’s take a look at Family Tree DNA’s new tree.

Finding the Tree

To view the tree, click here, but do NOT sign in to your account. Simply scroll to the bottom of the page where you will see the options for both the Y DNA Haplotree and the mtDNA Haplotree under the Community heading.

Click on mtDNA Haplotree.

If you are a Family Tree DNA customer, you can view both the Y and mitochondrial trees from your personal page as well. You don’t have to have taken either the Y or mitochondrial DNA tests to view the trees.

Browsing the mtDNA Tree

Across the top, you’ll see the major haplogroups.

I’m using haplogroup M as an example, because it’s far up the tree and has lots of subgroups. Only full sequence results are shown on the tree.

The basic functionality of the new mitochondrial tree, meaning how it works, is the same as the Y tree, which I wrote about in the Family Tree DNA’s PUBLIC Y DNA Haplotree.

You can view the tree in two formats, countries or variants, in the upper left-hand corner. View is not the same thing as search.

When viewing the mitochondrial DNA phylotree by country, we see that haplogroup M has a total of 1339 entries, which means M and everything below M on the tree.

However, the flags showing in the M row are only for people whose full mitochondrial sequence puts them into M directly, with no subgroup.

As you can see, there are only 12: 6 people in Australia, and one in 5 other countries. These are the locations of the most distant known ancestor of those testers. If they have not completed the maternal Country of Origin on the Earliest Known Ancestor tab, nothing shows for the location.

Viewing the tree by variant shows the haplogroup defining mutations, but NOT any individual mutations beyond those that are haplogroup defining.

For each haplogroup, click on the three dots to the right to display the country report for that haplogroup.

The Country Report

The Country Report provides three columns.

The column titled Branch Participants M shows only the total of people in haplogroup M itself, with no upstream or downstream results, meaning excluding M1, M2, etc. Just the individuals in M itself. Be sure to note that there may be multiple pages to click through, at bottom right.

The second column, Downstream Participants – M and Downstream (Excluding other Letters) means the people in haplogroup M and M subclades. You may wonder why this column is included, but realize that branches of haplogroup M include haplogroups G, Q, C, Z, D and E. The middle column only includes M and subgroups that begin with M, without the others, meaning M, M10, M11 but not G, Q, etc.

Of course the final column, All Downstream Participants – M and Downstream (Including other Letters) shows all of the haplogroup M participants, meaning M and all subclades, including all other haplogroups beneath M, such as M10, G, Q, etc..

What Can I Do with This Information?

Unlike the companion Y tree DNA, since surnames change every generation for maternal lineages, there is no requirement to have multiple matching surnames on a branch to be displayed.

Therefore, every person who includes a location for a most distant known ancestor is included in the tree, but surnames are not.

I want to see, at a glance, where the other people in my haplogroup, and the haplogroups that are the “direct ancestral line” of mine are found today. Clusters may mean something genealogically or are at least historically important – and I’ll never be able to view that information any other way. In fact, before this tree was published, I wasn’t able to see this at all. Way to go Family Tree DNA!!

It’s very unlikely that I’ll match every person in my haplogroup – but the history of that haplogroup and all of the participants in that haplogroup are important to that historical lineage of my family. At one time, these people all shared one ancestor and determining when and where that person lived is relevant to my family story.

Searching for Your Haplogroup

I’m searching for haplogroup J1c2f by entering J1c2f in the “Go to Branch Name.”

There it is.

I can see that there are 17 people in Sweden, 13 in Norway, 5 in Germany, 3 in Russia, etc. What’s with the Scandinavian cluster? My most distant known ancestor was found in Germany. There’s something to be learned here that existing records can’t tell me!

The mother branch is J1c2 which shows the majority of individuals in Ireland followed by England. This probably suggests that while J1c2f may have been born in Scandinavia, J1c2 probably was not. According to the supplement to Dr. Doron Behar’s paper, A “Copernican” Reassessment of the Human Mitochondrial DNA tree from its Root, which provides ages for some mitochondrial DNA haplogroups:

Haplogroup How Old Standard Deviation Approximate Age Range in Years
J1c2 9762 2010 7,752 – 11,772
J1c2f 1926 3128 500 – 5,054

I happen to know from communicating with my matches that the haplogroup J1c2f was born more than 500 years ago because my Scandinavian mito-cousins know where their J1c2f cousin was then, and so do I. Mine was in Germany, so we know our common ancestor existed sometime before that 500 year window, and based on our mutations and the mutation tree we created, probably substantially before that 500 year threshold.

Given that J1c2, which doesn’t appear to have been born in Scandinavia is at least 7,700 years old, we can pretty safely conclude that my ancestor wasn’t in Scandinavia roughly 9,000 years ago, but was perhaps 2,000 years, ago when J1c2f was born. What types of population migration and movement happened between 2,000 and 9,000 years ago which would have potentially been responsible for the migration of a people from someplace in Europe into Scandinavia.

The first hint might be that in the Nordic Bronze Age, trade with European cultures became evident, which of course means that traders themselves were present. Scandinavian petroglyphs dating from that era depict ships and art works from as far away as Greece and Egypt have been found.

The climate in Scandinavia was warm during this period, but later deteriorated, pushing the Germanic tribes southward into continental Europe about 3000 years ago. Scandinavian influence was found in eastern Europe, and numerous Germanic tribes claimed Scandinavian origins 2000 years ago, including the Bergundians, Goths, Heruls and Lombards.

Hmmm, that might also explain how my mitochondrial DNA, in the form of my most distant known ancestor arrived in Germany, as well as the distribution into Poland.

Is this my family history? I don’t know for sure, but I do know that the clustering information on the new phylotree provides me with clustering data to direct my search for a historical connection.

What Can You Do?

  • Take a full mitochondrial DNA test. Click here if you’d like to order a test or if you need to upgrade your current test.
  • Enter your Earliest Known Ancestor on the Genealogy tab of your Account Information, accessed by clicking the “Manage Personal Information” beneath your profile photo on your personal page.

The next few steps aren’t related to actually having your results displayed on the phylotree, but they are important to taking full advantage of the power of testing.

  • While viewing your account information, click on the Privacy and Sharing tab, and select to participate in matching, under Matching Preferences.

  • Also consent to Group Project Sharing AND allow your group project administrators to view your full sequence matches so that they can group you properly in any projects that you join. You full sequence mutations will never be shown publicly, only to administrators.

Of course, always click on save when you’re finished.

  • Enter your most distant ancestor information on your Matches Map page by clicking on the “Update Ancestor’s Location” beneath the map.

  • Join a project relevant to your haplogroup, such as the J project for haplogroup J. To join a project, click on myProjects at the top of the page, then on Join Projects.

  • To view available haplogroup projects, scroll down to the bottom of the screen that shows you available projects to join, and click on the letter of your haplogroup in the MTDNA Haplogroup Projects section.

  • Locate the applicable haplogroup, then click through to join the project.

These steps assure that you’ve maximized the benefits of your mitochondrial results for your own research and to your matches as well. Collaborative effort in completing geographic and known ancestor information means that we can all make discoveries.

The article, Working with Mitochondrial DNA Results steps you through you all of the various tools provided to Family Tree DNA testers.

Now, go and see who you match, where your closest matches cluster, and on the new mtDNA Haplotree, what kind of historical ancestral history your locations may reveal. What’s waiting for you?

_____________________________________________________________________

Standard Disclosure

This standard disclosure appears at the bottom of every article in compliance with the FTC Guidelines.

I provide Personalized DNA Reports for Y and mitochondrial DNA results for people who have tested through Family Tree DNA. I provide Quick Consults for DNA questions for people who have tested with any vendor. I would welcome the opportunity to provide one of these services for you.

Hot links are provided to Family Tree DNA, where appropriate. If you wish to purchase one of their products, and you click through one of the links in an article to Family Tree DNA, or on the sidebar of this blog, I receive a small contribution if you make a purchase. Clicking through the link does not affect the price you pay. This affiliate relationship helps to keep this publication, with more than 900 articles about all aspects of genetic genealogy, free for everyone.

I do not accept sponsorship for this blog, nor do I write paid articles, nor do I accept contributions of any type from any vendor in order to review any product, etc. In fact, I pay a premium price to prevent ads from appearing on this blog.

When reviewing products, in most cases, I pay the same price and order in the same way as any other consumer. If not, I state very clearly in the article any special consideration received. In other words, you are reading my opinions as a long-time consumer and consultant in the genetic genealogy field.

I will never link to a product about which I have reservations or qualms, either about the product or about the company offering the product. I only recommend products that I use myself and bring value to the genetic genealogy community. If you wonder why there aren’t more links, that’s why and that’s my commitment to you.

Thank you for your readership, your ongoing support and for purchasing through the affiliate link if you are interested in making a purchase at Family Tree DNA, or one of the affiliate links below:

Affiliate links are limited to:

DNAPainter: Painting Leeds Method Matches

Last week, I wrote about how I utilized the Leeds Method in the article, The Leeds Method. What I didn’t say is that I was sizing up the Leeds Method for how I could use the technique to paint additional segments of my chromosomes.

The Leeds Method divides your matches into four groups, one attributable to each grandparent. That means those matches can be painted to your four sets of great-grandparents, assuming you can identify the maternal and paternal groups. Hint – Y and mitochondrial DNA matching or haplogroups may help if you have no better hints.

For genealogists who know who their grandparents are, testing close relatives and cousins is a must in order to be able to associate matches with your four grandparents’ lines.

Please note that the Leeds method generates hints for genealogists by grouping people according to common matches. We must further evaluate those matches by doing traditional genealogy and by looking for segments that triangulate. The Leeds method in conjunction with the actual match results at vendors, combined with DNAPainter helps us do just that.

Utilizing DNAPainter

Since I’ve been able to sort matches into maternal and paternal “sides” using the Leeds Method, which in essence parentally phases the matches, I can use DNAPainter to paint them. Here are my four articles I wrote about how to utilize DNAPainter.

DNAPainter – Chromosome Sudoku for Genetic Genealogy Addicts 
DNAPainter – Touring the Chromosome Garden 
DNAPainter – Mining Vendor Matches to Paint Your Chromosomes 
Proving or Disproving a Half Sibling Relationship Using DNAPainter

Combining the Two Tools

DNAPainter has the potential to really utilize the Leeds Method results, other than Ancestry matches of course. Ancestry does not provide segment information. (Yes, I know, dead horse but I still can’t resist an occasional whack.)

You’re going to utilize your spreadsheet groupings to paint the DNA from each individual match at the vendors to DNAPainter.

On the spreadsheet, if these matches are from Family Tree DNA, MyHeritage, 23andMe or GedMatch, you’ll copy the matching segments from that vendor and paint those matching segments at DNAPainter. I explained how to do that in the articles about DNAPainter.

I do not use mass uploads to DNAPainter, because it’s impossible to assign those to different sides of your tree or ancestors. I paint individual matches, including information about the match and what I know about the history of the segment itself or associated ancestor.

I only paint segments that I can identify with certainty as maternal or paternal.

Pushing Back in Time

Based on which segments of identified ancestors the Leeds matches overlap with at DNAPainter, I can push that segment information further back in time. The blessing of this is that these Leeds matches may well fill in several blanks in my chromosome that are not yet painted by people with whom I share identified ancestors.

Even if your maternal and paternal grandparents are intermarried on each side, as long as they are not intermarried across your parental lines (meaning mother & father,) then the Leeds Method will work fine for painting. Even if you think you are attributing a segment to your paternal grandmother, for example, and the person actually matches through your paternal grandfather, you’ve still painted them on the correct chromosome – meaning your paternal chromosome. As you build up that chromosome with matches, you’ll see soon enough if you have 9 matches attributed to John Doe and one to Jane Smith, the Jane Smith match is likely incorrectly attributed, those two lines are somehow interrelated or it’s a false positive match.

Because I work with only fairly large Leeds matches – nothing below 30 cM, I sometimes receive a nice gift in terms of painting large previously unpainted segments – like the one on my mother’s side, below.

Look at this large green segment on chromosome 19 that I painted thanks to one of the Leeds matches, Harold. (Note that the two long blue and brown bars at the bottom of each chromosome are my ethnicity, not matches.) Another benefit is that if a Leeds match matches on already identified segments assigned to ancestors, I’ve just identified which ancestral lines I share with that match.

The green Ferverda side match to Roland through the Leeds Method aligns partially with a segment already known to descend from Jacob Lentz and Frederica Ruhle who were born in the 1780s. I’m related to Roland somehow through that line, and by just looking at his (redacted here) surname, I *think* I know how, even though he doesn’t have a tree online. How cool is that!

Important Notes for DNAPainter

Word of caution here. I would NOT paint anyone who falls into multiple match groups without being able to identify ancestors. Multiple match groups may indicate multiple ancestors, even if you aren’t aware of that.

Each segment has its own history, so it’s entirely possible that multiple match groups are accurate. It’s also possible that to some extent, especially with smaller segments, that matches by chance come into play. That’s why I only work with segments above 30 cM when using the Leeds method where I know I’m safe from chance matches. You can read about identical by descent (IBD) and identical by chance (IBC) matches here.

What a DNAPainter Leeds Match Means

It’s very important to label segments in DNAPainter with the fact that the source was through the Leeds Method.

These painted matches DO NOT MEAN that the match descends from the grandparent you are associating with the match.

It means that YOU inherited your common DNA with this match FROM that grandparent. It suggests that your match descends from one of the ancestors of this couple, or possibly from your great-grandparents, but you don’t necessarily share this great-grandparent couple with your match.

That’s different than the way I normally paint my chromosomes – meaning only when a specific common ancestor has been identified. For someone painted from matches NOT identified through the Leeds Method, if I know the person descends from a grandparent, I paint them to the great-grandparent couple. People painted through the Leeds Method don’t necessarily share that couple, but do share an ancestor of that couple.

When I paint using the Leeds method, I’m assigning the match to a set of great-grandparents because I can’t genealogically identify the common ancestor further upstream, so I’m letting genetics tell me which genealogical quadrant they fall into on my tree. With the Leeds Method, I can tell which grandparent I inherited that DNA through. In my normal DNAPainter methodology, I ONLY paint matches when I’ve identified the common ancestor – so Leeds Method matches would not previously have qualified.

I don’t mean to beat this to death and explain it several ways – but it’s really important to understand the difference and when looking back, understand why you painted what you did.

Labeling Leeds Match Painted Segments

Therefore, with Leeds Method match painting, I identify the match name as “John Doe FTDNA Leeds-Ferverda” which tells me the matches name (John Doe,) where they tested (FTDNA) and why I painted them (Ferverda column in my Leeds spreadsheet,) even though I don’t know for sure which ancestor we actually have in common. I paint them to the parents of my Ferverda grandfather. Not John Ferverda, my grandfather, but to his parents, Hiram Ferverda and Eva Miller. I know I received my matching DNA through one of them – I just don’t know which person of that couple yet.

However, looking at who else is assigned to that segment with an identified common ancestor will tell me where in my tree that segment originated – for me. We still don’t know where in your matches tree that segment originated.

“Match To” Issues

Lastly, if you happen to select a “match to” person to represent one of your grandparent matches that just happens to be descended from two grandparent lines, you’ve had your bad luck for the month. Remember, your “match to” person is the first person (closest match) that hasn’t yet been grouped, so you don’t really select them. If you realize you’re getting goofy results, stop and undo those results, then select the next candidate as your “match to” person.

At one vendor, when I selected the first person who hadn’t yet been grouped and used them for the red column which turned out to be Bolton, about half of them overlapped with Estes segments that I’ve already painted and confirmed from several sources. Obviously, there’s a problem someplace, and I’m guessing it just happens to be the luck of the draw with the “match to” person being descended from both lines. The lines both lived in the same county for generations. I need to redo that section with someone whose tree I know positively descends from the Bolton line and does NOT intersect with another of my lines. However, I was able to identify that this issue existed because I’ve already painted multiple ancestor-confirmed cousins who carry those same segments – and I know where they came from.

These tools are just that – tools and require some level of analytical skill and common sense. In other words, it’s a good idea to stay with larger matches and know when to say “uh-oh.” If it doesn’t feel right, don’t paint it.

Breaking Down Distant Brick Walls

I’m still thinking about how to use the Leeds Method, probably in combination with DNAPainter, to break down brick walls. My brick walls aren’t close in time. Most of them are several generations back and revolve around missing female surnames, missing records or ancestors appearing in a new location with no ability to connect them back to the location/family they left.

In essence, I would need to be able to isolate the people matching that most distant ancestor couple, then look for common surnames and ancestors within that match group. The DNAGedcom.com client which allows you to sort matches by surname might well be an integral piece of this puzzle/solution. I’ll have to spend some time to see how well this works.

Solving this puzzle would be entirely dependent on people uploading their trees.

If you have thoughts on how to use these tools to break down distant brick walls, or devise a methodology, please let me know.

And if you haven’t uploaded your tree, please do.

Would I Do The Leeds Method Again?

Absolutely, at least for the vendors who provide segment information.

I painted 8 new Leeds matches from Family Tree DNA on my Ferverda grandparent side which increased the number of painted segments at DNAPainter from 689 to 704, filled in a significant number of blank spaces on my chromosomes, and took my total % DNA painted from 60 to 61%. I added the rest of my Leeds hints from Family Tree DNA of 30 cM or over, and increased my painted segments to 734 and my percentage to 62% I know that 1 or 2% doesn’t sound like a very big increase, but it’s scientific progress.

It’s more difficult to increase the number of new segments after you’ve painted much of your genome because many segments overlap segments already painted. So, a 2% increase is well worth celebrating!

Having said that, I would love for the vendors to provide this type of clustering so I don’t have to. To date, Family Tree DNA is the only vendor who does any flavor of automatically bucketing results in this fashion – meaning paternal and maternal, which is half the battle. I would like to see them expand to the four grandparents from the maternal/paternal matching they provide today.

We’ve been asking Ancestry for enhanced tools for years. There’s no reason they couldn’t in essence do what Dana has done along with provide the DNAgedcom.com search functionality. And yes…I still desperately want a chromosome browser or at least segment information.

I will continue to utilize the Leeds Method, at least with vendors other than Ancestry because it allows me to incorporate the results with DNAPainter. It’s somehow ironic that I started out grouping the Ancestry results, but wound up realizing that the results from other vendors, specifically Family Tree DNA and MyHeritage are significantly more useful due to the segment data and combined tools.

Getting the Most Bang for Your Buck

If you tested at Ancestry or 23andMe, I would strongly encourage you to download your raw data file from both of these vendors and transfer to Family Tree DNA, MyHeritage and GedMatch to get the most out of your DNA tests. Here is the step-by-step guide for how to download your DNA from Ancestry.

The uploads to those three locations are free. All tools are free at MyHeritage until December 1, 2018 when they will begin charging for more advanced tools. The upload is free at Family Tree DNA and the advanced tools, including the chromosome browser, only require a $19 unlock.

Here is the step-by-step guide for uploading to MyHeritage and to Family Tree DNA. Fishing in every pond is critically important. You never know what you’re missing otherwise!

How many segments of your DNA can you paint using the Leeds Method in combination with DNA Painter?

_____________________________________________________________________

Standard Disclosure

This standard disclosure appears at the bottom of every article in compliance with the FTC Guidelines.

I provide Personalized DNA Reports for Y and mitochondrial DNA results for people who have tested through Family Tree DNA. I provide Quick Consults for DNA questions for people who have tested with any vendor. I would welcome the opportunity to provide one of these services for you.

Hot links are provided to Family Tree DNA, where appropriate. If you wish to purchase one of their products, and you click through one of the links in an article to Family Tree DNA, or on the sidebar of this blog, I receive a small contribution if you make a purchase. Clicking through the link does not affect the price you pay. This affiliate relationship helps to keep this publication, with more than 900 articles about all aspects of genetic genealogy, free for everyone.

I do not accept sponsorship for this blog, nor do I write paid articles, nor do I accept contributions of any type from any vendor in order to review any product, etc. In fact, I pay a premium price to prevent ads from appearing on this blog.

When reviewing products, in most cases, I pay the same price and order in the same way as any other consumer. If not, I state very clearly in the article any special consideration received. In other words, you are reading my opinions as a long-time consumer and consultant in the genetic genealogy field.

I will never link to a product about which I have reservations or qualms, either about the product or about the company offering the product. I only recommend products that I use myself and bring value to the genetic genealogy community. If you wonder why there aren’t more links, that’s why and that’s my commitment to you.

Thank you for your readership, your ongoing support and for purchasing through the affiliate link if you are interested in making a purchase at Family Tree DNA, or one of the affiliate links below:

Affiliate links are limited to:

The Leeds Method

This is the first in a series of two articles. This article explains the Leeds Method and how I created a Leeds Spreadsheet in preparation for utilizing the results in DNAPainter. I stumbled around a bit, but I think I’ve found a nice happy medium and you can benefit from my false starts by not having to stumble around in the dark yourself. Of course, I’m telling you about the pitfalls I discovered.

The second article details the methodology I utilized to paint these matches, because they aren’t quite the same as “normal” matching segments with identified ancestors.

Welcome to the Leeds Method

Dana Leeds developed a novel way to utilize a spreadsheet for grouping your matches from second through fourth cousins and to assign them to “grandparent” quadrants with no additional or previous information. That’s right, this method generates groupings that can be considered good hints without any other information at all.

Needless to say, this is great for adoptees and those searching for a parent.

It’s also quite interesting for genetic genealogists as well. One of the best aspects is that it’s very easy to do and very visual. Translation – no math. No subtraction.

Caveat – it’s also not completely accurate 100% of the time, especially when you are dealing with more distant matches, intermarriage and/or endogamy. But there are ways to work around these issues, so read on!

You can click to enlarge any image.

I’ll be referring to this graphic throughout this article. It shows the first several people on my Ancestry match list, beginning with second cousins, using pseudonyms. I chose to use Ancestry initially because they don’t provide chromosome browsers or triangulation tools, so we need as much help there as we can get.

I’ve shown the surnames of my 4 grandparents in the header columns with an assigned color, plus a “Weird group” (grey) that doesn’t seem to map to any of the 4. People in that group are much more distant in my match list, so they aren’t shown here.

I list the known “Most Common Recent Ancestor,” when identified, along with the color code that so I can easily see who’s who.

All those blanks in the MCRA column – those are mostly people without trees. Just think how useful this would be if everyone who could provide a tree did!

What Does the Leeds Method Tell You?

The Leeds Method divides your matches into four colored quadrants representing each grandparent unless your genealogical lines are heavily intermarried. If you have lots of people who fall into both of two (or more) colors, that probably indicates intermarriage or a heavily endogamous population.

In order to create this chart, you work with your closest matches that are 2nd cousins or more distant, but no more distant than 4th cousins. For endogamous people, by the time you’re working in 4th cousins, you’ll have too much overlap, meaning people who fall into multiple columns, so you’ll want to work with primarily 2nd and 3rd cousins. The good news is that endogamous people tend to have lots of matches, so you should still have plenty to work with!

Instructions

In this article, I’m using Dana’s method, with a few modifications.

By way of a very, very brief summary:

  • On a spreadsheet, you list all of your matches through at least third cousins
  • Then check each match to see who you match in common with them
  • Color code the results, in columns
  • Each person what you match in common with your closest cousin, Sleepy, is marked as yellow. Dopey and I both match Bashful and Jasmine in common and are colored Red. Doc and I both match Happy and Belle and are colored blue, and so forth.
  • The result is that each color represents a grandparent

To understand exactly what I’m doing, read Dana’s articles, then continue with this article.

DNA Color Clustering: The Leeds Method for Easily Visualizing Matches  
DNA Color Clustering: Identifying “In Common” Surnames 
DNA Color Clustering: Does it Work with 4th Cousins? By the way, yes it does, most of the time.
DNA Color Clustering: Dealing with 3 Types of Overlap

Why Use “The Leeds Method”?

In my case, I wanted to experiment. I wanted to see if this method works reliably and what could be done with the information if you already know a significant amount about your genealogy. And if you don’t.

The Leeds Method is a wonderful way to group people into 4 “grandparent” groups in order to search for in-common surnames. I love being able to perform this proof of concept “blind,” then knowing my genealogy and family connections well enough to be able to ascertain whether it did or didn’t work accurately.

If you can associate a match with a single grandparent, that really means you’ve pushed that match back to the great-grandparent couple.

That’s a lot of information without any genealogical knowledge in advance.

How Low Can You Go?

I have more than 1000 fourth cousins at Ancestry. This makes the task of performing the Leeds Method manually burdensome at that level. It means I would have had to type all 1000+ fourth cousins into a spreadsheet. I’m patient, but not that patient, at least not without a lot of return for the investment. I have to ask myself, exactly what would I DO with that information once they were grouped?

Would 4th cousin groupings provide me with additional information that second and third cousin groupings wouldn’t? I don’t think so, but you can be the judge.

After experimenting, I’d recommend creating a spreadsheet listing all of your 2nd and 3rd cousins, along with about 300 or so of your closest 4th cousin matches. Said another way, my results started getting somewhat unpredictable at about 40-45 cMs, although that might not hold true for others. (No, you can’t tell the longest matching segment length at Ancestry, but I could occasionally verify at the other vendors, especially when people from Ancestry have transferred.)

Therefore, I only proceeded through third cousins and about 300 of the Ancestry top 4th cousin matches.

I didn’t just utilize this methodology with Ancestry, but with Family Tree DNA, MyHeritage and 23andMe as well. I didn’t use GedMatch because those matches would probably have tested at one of the primary 4 vendors and I really didn’t want to deal with duplicate kits any more than I already had to. Furthermore, GedMatch is undergoing a transition to their Genesis platform and matching within the Genesis framework has yet to be perfected for kits other than those from these vendors.

Let’s talk about working with matches from each vendor.

Ancestry

At Ancestry, make a list of all of your second and third cousin matches, plus as many 4th cousins as you want to work with.

To begin viewing your common matches, select your first second cousin on the list and click on the green View Match. (Note that I am using my own second kit at Ancestry, RobertaV2Estes, not a cousin’s kit in these examples. The methodology is the same, so don’t fret about that.)

Then, click on Shared Matches.

Referring to your spreadsheet, assign a color to this match group and color the spreadsheet squares for this match group. Looking at my spreadsheet, my first group would be the yellow Estes group, so I color the squares for each person that I match in common with this particular cousin. On my spreadsheet, those cousins have all been assigned pseudonyms, of course.

Your shared match list will be listed in highest match order which should be approximately the same order they are listed on your spreadsheet. I use two monitors so I can display the spreadsheet on one and the Ancestry match list on the other.

Lon is shared in common with the gold person I’m comparing against (Roberta V2 Estes), and me, so his box would be colored gold on the spreadsheet. Lon’s pseudonym is Sneezy and the person beneath him on this list, not shown, would be Ariel.

Ancestry only shows in-common matches to the 4th cousin level, so you really couldn’t reach deeper if you wanted. Furthermore, I can’t see any advantage to working beyond the 4th cousin’s level, maximum. Your best matches are going to be the largest ones that reveal the most information and have the most matches, therefore allowing you to group the most people by color.

Unfortunately, Ancestry provides the total cMs and the number of segments, but not the largest matching segment.

One benefit of this methodology is that it’s fairly easy to group those pesky private matches like the last one on the master spreadsheet, Cersei, shown in red. You’ll at least know which grandparent group they match. Based on your identified ancestors of matches in the color group, you may be able to tell much more about that private match.

For example, one of my private matches is a match to someone who I share great-great-grandparents with AND they also match with two people further on up that tree on the maternal side of that couple, shown above, in red. I may never know which ancestor I share with that private match specifically, but I have a pretty darned good idea now in spite of that ugly little lock. The more identified matches, the better and more accurate this technique.

Is the Leeds Method foolproof? No.

Is this a great tool? Yes, absolutely.

Family Tree DNA

Thankfully, Family Tree DNA provides more information about my matches than Ancestry, including segment information combined with a chromosome browser and Family Matching. I often refer to Family Matching as parental bucketing, shown on your match list with the maternal and paternal tabs, because Family Tree DNA separates your matches into parental “sides” based on common segments with others on your maternal and paternal branches of your tree when you link your matches’ results.

At Family Tree DNA, sign on and then click on Matches under Family Finder.

When viewing your matches, you’ll see blue or red people icons any that are assigned to either your maternal, paternal side, or both (purple) on your match list. If you click on the tabs at the top,  you’ll see JUST the maternal, paternal or both lists.

This combination of tools allows you to confirm (and often triangulate) the match for several people. If those matches are bucketed, meaning assigned to the same parental side, and they match on the same segment, they are triangulated for all intents and purposes if the segment is above 20 cM. All of the matches I worked with for the Leeds Method were well above 20 cM, so you don’t really need to worry about false or identical by chance matches at that level.

Family Tree DNA matches are initially displayed by the total number of “Shared cM.” Click on “Longest Block” to sort in that manner. I considered people through 30 cM and above as equivalent to the Ancestry 3rd cousin category. Some of the matching became inconsistent below that threshold.

List all of your second and third cousins on the spreadsheet, along with however many 4th cousins you want to work with.

Then, select your closest second cousin by checking the box to the left of that individual, then click on “In Common With” above the display. This shows you your matches in common with this person.

On the resulting common match list, sort your matches in Longest block order, then mark the matches on your spreadsheet in the correct colored columns.

With each vendor, you may need to make new columns until you can work with enough matches to figure out which column is which color – then you can transfer them over. If you’re lucky enough to already know the family association of your closest cousins, then you already know which colored column they belong to.

All of my matches that fell into the Leeds groups were previously bucketed to maternal or paternal, so consistency between the two confirms both methodologies. Between 20 and 28 cM, three of my bucketed matches at Family Tree DNA fell into another group using the Leeds method, which is why I drew the line at 30cM.

For genealogists who already know a lot about their tree, this methodology in essence divides the maternal and paternal buckets into half. FTDNA already assigns matches maternally or paternally with Family Matching if you have any information about how your matches fit into your tree and can link any matching testers to either side of your tree at the 3rd cousin level or closer.

If you don’t know anything about your heritage, or don’t have any way to link to other family members who have tested, you’ll start from scratch with the Leeds Method. If you can link family members, Family Tree DNA already does half of the heavy lifting for you which allows you to confirm the Leeds methodology.

MyHeritage

At MyHeritage, sign in, click on DNA and sort by “largest segment,” shown at right, above. I didn’t utilize matches below 40 cM due to consistency issues. I wonder if imputation affects smaller matches more than larger matches.

You’ll see your closest matches at the top of the page. Scroll down and make a list on your spreadsheet of your second and third cousins. Return to your closest DNA match that is a second cousin and click on the purple “Review DNA Match” which will display your closest in-common matches with that person, but not necessarily in segment size order.

Scroll down to view the various matches and record on the spreadsheet in their proper column by coloring that space.

The great aspect of MyHeritage is that triangulation is built in, and you can easily see which matches triangulate, providing another layer of confirmation, assuming you know the relationship of at least some of your matches.

The message for me personally at MyHeritage is that I need to ask known cousins who are matches elsewhere to upload to MyHeritage because I can use those as a measuring stick to group matches, given that I know the cousin’s genealogy hands-down.

The great thing about MyHeritage is that they are focused on Europe, and I’m seeing European matches that aren’t anyplace else.

23andMe

At 23andMe, sign in and click on DNA Relatives under the Ancestry tab.

You’ll see your list of DNA matches. Record 2nd and third cousins on your spreadsheet, as before.

To see who you share in common with a match, click on the person’s name and color your matches on the spreadsheet in the proper column.

Unfortunately, the Leeds Method simply didn’t work well for me with my 23andMe data, or at least the results are highly suspect and I have no way of confirming accuracy.

Most of my matches fell into in the Estes category, with the Boltons overlapping almost entirely, and none in the Lore or Ferverda columns. There is one small group that I can’t identify. Without trees or surnames, genealogically, my hands are pretty much tied. I can’t really explain why this worked so poorly at 23andMe. Your experience may be different.

The lack of trees is a significant detriment at 23andMe because other than a very few matches whose genealogy I know, there’s no way to correlate or confirm accuracy. My cousins who tested at 23andMe years ago and whose tests I paid for lost interest and never signed in to re-authorize matching. Many of those tests are on the missing Ferverda side, but their usefulness is now forever lost to me.

23andMe frustrates me terribly. Their lack of commitment to and investment in the genealogical community makes working with their results much more difficult than it needs to be. I’ve pretty much given up on using 23andMe for anything except adoption searches for very close matches as a last resort, and ethnicity.

The good news is that with so many people testing elsewhere, there’s a lot of good data just waiting!

What are the Benefits?

The perception of “benefit” is probably directly connected to your goal for DNA testing and genetic genealogy.

  • For adoptees or people seeking unknown parentage or unknown grandparents, the Leeds Method is a fantastic tool, paving the way to search for common surnames within the 4 groups as opposed to one big pool.
  • For people who have been working with their genealogy for a long time, maybe not as much, but hints may lurk and you won’t know unless you do the discovery work. If you’re a long-time genealogist, you’re used to this, so it’s just a new way of digging through records – and you can do it at home!
  • For people who have tested at Family Tree DNA, the family grouping by maternal and paternal based on people linked to your tree is more accurate and groups people further down your match list because it’s actually based on triangulated matching segments. However, the Leeds Method expands on that and adds granularity by breaking those two groups into four.
  • For people who want to paint their chromosomes using DNAPainter, the Leeds Method is the first step of a wonderful opportunity if you have tested at either Family Tree DNA, MyHeritage or 23andMe.

Unfortunately, Ancestry doesn’t provide segment information, so you can’t chromosome paint from Ancestry directly, BUT, you can upload to either Family Tree DNA, MyHeritage or GedMatch and paint Ancestry matches from there. At GedMatch, their kit numbers begin with A.

What Did I Do Differently than Dana?

Instead of adding a 5th column with the first person (Sam) who was not grouped into the first 4 groups, I looked for the closest matches that I shared with Sam who were indeed in the first 4 color groups. I added Sam to that existing color group along with my shared matches with Sam that weren’t already grouped into that color so long as it was relatively consistent. If it looked too messy, meaning I found people in multiple match groups, I left it blank or set that match aside. This didn’t happen until I was working at the 4th cousin level or between 30 and 40 cM, depending on the vendor.

Please note that just because you find people that you match in common with someone does NOT MEAN that you all share a common ancestor, or the same ancestor. It’s a hint, a tip to be followed.

There were a couple of groups that I couldn’t cluster with other groups, and one match that clustered in three of the four grandparent groups. I set that one aside as an outlier. I will attempt to contact them. They don’t have a tree.

I grouped every person through third cousin matches. I started out manually adding the 4th cousins for each match, but soon gave up on that due to the sheer magnitude. I did group my closest 4th cousins, or until they began to be inaccurate or messy, meaning matching in multiple groups. Second and third cousin matching was very consistent.

Tips

  • Don’t use siblings or anyone closer than the second cousin level. First cousins share two grandparents. You only want to use matches that can be assigned to ONLY ONE GRANDPARENT.
  • In the spreadsheet cell, mark the person you used as a “match to.” In other words, which people did you use to populate that color group. You can see that I used two different people in the Estes category. I used more in the other categories too, but they are further down in my list.
  • At Family Tree DNA, you can utilize the X chromosome. Understand that if you are a male, you will not have any X matches with your paternal grandfather. I would not recommend using X matches for the Leeds Method, especially since they are not uniformly available at all vendors and form a specific unique inheritance pattern that is not the same as the other autosomes.
  • Ancestry, MyHeritage and Family Tree DNA allow you to make notes on each match. As I group these, and as I paint them with DNAPainter I made a note on each match that allows me to identify which group they are assigned to, or if they match multiple groups.
  • Look at each match to be sure they are consistent. If they aren’t, either mark them as inconclusive or omit them entirely in the painting process. I write notes on each one if there is something odd, or if I don’t paint them.

What Did I Learn?

Almost all of my (endogamous by definition) Acadian matches are more distant, which means the segments are smaller. I expected to find more in the painted group, because I have SO MANY Acadian matches, but given that my closest Acadian ancestor was my great-great-grandfather, those segments are now small enough that those matches don’t appear in the candidate group of matches for the Leeds Method. My Acadian heritage occurs in my green Lore line, and there are surprisingly few matches in that grouping large or strong enough to show up in my clustered matches. In part, that’s probably because my other set of great-great-grandparents in that line arrived in 1852 from Germany and there are very few people in the US descended from them.

I found 4th cousin matches I would have otherwise never noticed because they don’t have a tree attached. At Ancestry, I only pay attention to closer matches, Shared Ancestor Hints and people with trees. We have so many matches today that I tend to ignore the rest.

Based on the person’s surname and the color group into which they fall, it’s often possible to assign them to a probable ancestral group based on the most distant ancestors of the people they match within the color group. In some cases, the surname is another piece of evidence and may provide a Y DNA lead.

For example, one of my matches user name is XXXFervida. They do match in the Ferverda grandparent group, and Fervida is how one specific line of the family spelled the surname. Of course, I could have determined that without grouping, but you can never presume a specific connection based solely on surname, especially with a more common name. For all I know, Fervida could be a married name.

By far the majority of my matches don’t have trees or have very small trees. That “no-tree” percentage is steadily increasing at Ancestry, probably due to their advertising push for ethnicity testing. At Family Tree DNA where trees are infinitely more useful, the percentage of people WITH trees is actually rising. By and large, Family Tree DNA users tend to be the more serious genealogists.

MyHeritage launched their product more recently with DNA plus trees from the beginning, although many of the new transfers don’t have trees or have private trees. Their customers seem to be genealogically savvy and many live in Europe where MyHeritage DNA testing is focused.

23andMe is unquestionably the least useful for the Leeds Method because of their lack of support for trees, among other issues, but you may still find some gems there.

Keeping Current

Now that I invested in all of this work, how will I keep the spreadsheet current, or will I at all?

At Ancestry, I plan to periodically map all of my SAH (Shared Ancestor Hints) green leaf matches as well as all new second and third cousin matches, trees or not.

In essence, for those with DNA matches and trees with a common ancestor, Ancestry already provides Circles, so they are doing the grouping for those people. Where this falls short, of course, is matches without trees and without a common identified ancestor.

For Ancestry matches, I would be better served, I think, to utilize Ancestry matches at GedMatch instead of at Ancestry, because GedMatch provides segment information which means the matches can be confirmed and triangulated, and can be painted.

For matches outside of Ancestry, in particular at Family Tree DNA and MyHeritage I will keep the spreadsheet current at least until I manage to paint my entire set of chromosomes. That will probably be a very long time!

I may not bother with 23andMe directly, given that I have almost no ability to confirm accuracy. I will utilize 23andMe matches at GedMatch. People who transfer to GedMatch tend to be interested in genealogy.

What Else Can I Do?

At Ancestry, I can use Blaine’s new “DNA Match Labeling” tool that facilitates adding 8 colored tags to sort matches at Ancestry. Think of it as organizing your closet of matches. I could tag each of these matches to their grandparent side which would make them easy to quickly identify by this “Leeds Tag.”

My Goals

I have two primary goals:

  • Associating segments of my DNA with specific ancestors
  • Breaking down genealogical brick walls

I want to map my DNA segments to specific ancestors. I am already doing this using Family Tree DNA and MyHeritage where common ancestors are indicated in trees and by surnames. I can map these additional Leeds leads (pardon the pun) to grandparents utilizing this methodology.

To the extent I can identify paternal and maternal matches at 23andMe, I can do the same thing. I don’t have either parents’ DNA there, and few known relatives, so separating matches into maternal and paternal is more difficult. It’s not impossible but it means I can associate fewer matches with “sides” of my genealogy.

For associating segments with specific ancestors and painting my chromosomes, DNAPainter is my favorite tool.

In my next article, we’ll see how to use our Leeds Method results successfully with DNAPainter and how to interpret the results.

_____________________________________________________________________

Standard Disclosure

This standard disclosure appears at the bottom of every article in compliance with the FTC Guidelines.

I provide Personalized DNA Reports for Y and mitochondrial DNA results for people who have tested through Family Tree DNA. I provide Quick Consults for DNA questions for people who have tested with any vendor. I would welcome the opportunity to provide one of these services for you.

Hot links are provided to Family Tree DNA, where appropriate. If you wish to purchase one of their products, and you click through one of the links in an article to Family Tree DNA, or on the sidebar of this blog, I receive a small contribution if you make a purchase. Clicking through the link does not affect the price you pay. This affiliate relationship helps to keep this publication, with more than 900 articles about all aspects of genetic genealogy, free for everyone.

I do not accept sponsorship for this blog, nor do I write paid articles, nor do I accept contributions of any type from any vendor in order to review any product, etc. In fact, I pay a premium price to prevent ads from appearing on this blog.

When reviewing products, in most cases, I pay the same price and order in the same way as any other consumer. If not, I state very clearly in the article any special consideration received. In other words, you are reading my opinions as a long-time consumer and consultant in the genetic genealogy field.

I will never link to a product about which I have reservations or qualms, either about the product or about the company offering the product. I only recommend products that I use myself and bring value to the genetic genealogy community. If you wonder why there aren’t more links, that’s why and that’s my commitment to you.

Thank you for your readership, your ongoing support and for purchasing through the affiliate link if you are interested in making a purchase at Family Tree DNA, or one of the affiliate links below:

Affiliate links are limited to:

MyHeritage Step by Step Guide: How to Upload-Download DNA Files

In this Upload-Download Series, we’ll cover each major vendor:

  • How to download raw data files from the vendor
  • How to upload raw data files to the vendor, if possible
  • Other mainstream vendors where you can upload this vendor’s files

Uploading TO MyHeritage

Upload Step 1

To upload your DNA to MyHeritage, click here and then click on the purple “Start” button.

Upload Step 1 If You Already Have an Account at MyHeritage

If you already have an account, click here to sign in and then click on the DNA tab to display the “Upload DNA Data” option which displays the graphic above. Click on the purple “Start” button. This is the same process you’ll use whether it’s the first time you’ve uploaded a kit, or you’re uploading subsequent kits to your account that you’ll be managing.

Upload Step 2

You’ll be prompted to create a free account by entering your name, e-mail and password, and from there you can upload your autosomal DNA file.

You’ll be asked whose DNA you’re uploading and prompted to read and agree to the terms of service and consent.

Click the purple upload button.

Then click done when the file is finished uploading.

You’ll be notified by e-mail within a couple days when the file is finished processing.

Downloading FROM MyHeritage

Download Step 1

Sign on to your MyHeritage account.

Click on DNA on the upper toolbar.

The dropdown menu includes “Manage DNA Kits”

Download Step 2

At the right of the kit you wish to download, click on the three small buttons which will include an option for “Download,” as shown in the graphics below from the MyHeritage blog article.

Download Step 3

You’ll be presented with a box titled “Learn more about DNA data files.” Click the purple “Continue” button.

Download Step 4

You’ll need to confirm that you want to download your data, and that you understand that the download is outside of MyHeritage and their protection. Click the purple “Continue” button.

Download Step 5

You’ll receive a confirmation e-mail. Click on “Click here to continue with download.”

This e-mail link is only valid for 24 hours.

Download Step 6

Enter your password again, and click on the purple “Download” button.

Download Step 7

Save the file as a recognizable file name on your computer.

MyHeritage File Transfers TO Other Vendors

You can upload your MyHeritage file to other vendors, as follows.

From below to >>>>>>>>>>> Family Tree DNA Accepts Ancestry Accepts 23andMe Accepts GedMatch Accepts
MyHeritage Yes No No Yes

Neither Ancestry nor 23andMe accepts uploads from any vendor.

MyHeritage File Transfers FROM Other Vendors

You can upload files from other vendors to MyHeritage, as follows:

  From Family Tree DNA From Ancestry From 23andMe From LivingDNA
To MyHeritage Yes Yes Yes Yes

Testing and Transfer Strategy

Transferring to MyHeritage is always free. You can view your ethnicity, your matches and their trees, and utilize the DNA tools, but you won’t receive the full benefit of SmartMatching and other records without a subscription. You will be limited to building a tree of 250 people for free, but you can upload a Gedcom file of any size, although you do need to subscribe to change anything in that file if it contains more than 250 individuals.

Until December 1, 2018, all DNA tools will be and remain free for anyone who uploads before that date. After December 1st, matching will remain free, but the advanced tools such as ethnicity, the chromosome browser, triangulation and more will require payment. MyHeritage has not yet indicated how that will work, so upload now to receive free DNA tools forever.

My testing/transfer recommendations are as follows relative to MyHeritage:

Have fun!

_____________________________________________________________________

Standard Disclosure

This standard disclosure appears at the bottom of every article in compliance with the FTC Guidelines.

I provide Personalized DNA Reports for Y and mitochondrial DNA results for people who have tested through Family Tree DNA. I provide Quick Consults for DNA questions for people who have tested with any vendor. I would welcome the opportunity to provide one of these services for you.

Hot links are provided to Family Tree DNA, where appropriate. If you wish to purchase one of their products, and you click through one of the links in an article to Family Tree DNA, or on the sidebar of this blog, I receive a small contribution if you make a purchase. Clicking through the link does not affect the price you pay. This affiliate relationship helps to keep this publication, with more than 900 articles about all aspects of genetic genealogy, free for everyone.

I do not accept sponsorship for this blog, nor do I write paid articles, nor do I accept contributions of any type from any vendor in order to review any product, etc. In fact, I pay a premium price to prevent ads from appearing on this blog.

When reviewing products, in most cases, I pay the same price and order in the same way as any other consumer. If not, I state very clearly in the article any special consideration received. In other words, you are reading my opinions as a long-time consumer and consultant in the genetic genealogy field.

I will never link to a product about which I have reservations or qualms, either about the product or about the company offering the product. I only recommend products that I use myself and bring value to the genetic genealogy community. If you wonder why there aren’t more links, that’s why and that’s my commitment to you.

Thank you for your readership, your ongoing support and for purchasing through the affiliate link if you are interested in making a purchase at Family Tree DNA, or one of the affiliate links below:

Affiliate links are limited to:

Ancestry 2018 Ethnicity Update

When ethnicity estimates were first produced by vendors, they tended to resemble the wild west.

Today, results are becoming more refined and hopefully, more accurate as reference populations grow and become more reliable.

The Ancestry ethnicity update has been in beta for several months, but this week, Ancestry rolled out the ethnicity update for everyone.

Checking Your New Results

To see your updated results, sign on and click on the DNA Story to the left with Ethnicity Estimates.

Ancestry then explains that while your DNA doesn’t change, the estimates (pay attention to that word) do as the science improves.

Ethnicity Estimate Aren’t Precise

I’ve said this before, and I want to say it again. Ethnicity is the least precise and the least accurate of DNA tools for genetic genealogy. Ethnicity estimates are the most accurate at a continental level. Within continents, like Europe, Asia and Africa, there has been a lot of population movement and intermixing over time making the term “ethnicity” almost meaningless.

I know, I know – ethnicity estimates are also the simplest because there isn’t much learning curve and they’re easy to understand at a glance. This deceptive “ease of use” also makes them interesting to people who have only a passing curiosity. That’s why they attract so many test takers who either love of hate their results, but never fully understand the true message or utilize any other genetic genealogy tools.

Let’s take a look at how ethnicity estimates have changed over time and if they have improved with the latest version.

Ethnicity Estimate Changes

In my case, my original Ancestry ethnicity estimate in 2012 was:

  • British Isles 80%
  • Scandinavia 12%
  • Uncertain 8%

To say it was really bad is an understatement.

In 2013, Ancestry introduced their ethnicity V2 version which provided a lot more granularity.

Version 2 was dramatically different, with the British Isles moving from 80% to a total of 6%. Like a pendulum swinging, neither was accurate.

Ancestry introduced new features and combined their Genetic Communities with their ethnicity estimates in 2017.

In this new 2018 version, Ancestry has divided and recombined the British Isles and Western Europe differently and the resulting differences are significant.

My mystery Scandinavian is entirely gone now, but sadly, so is my Native American.

The New Results

I just got really boring – but the question is whether or not the new results are more accurate as compared to my proven genealogy. Boring doesn’t matter. Accuracy does.

Various Ancestry Ethnicity Versions Compared to Proven Genealogy

I created a chart that reflects the three Ancestry ethnicity versions as compared to my proven genealogy.

For the current version, I also included the ranges as provided by Ancestry.

As you can see, generally, the results are much more accurate, but the regions are also fairly broad which makes accuracy easier to achieve.

Until this current version, Ancestry didn’t show any Germanic, but now the Germanic estimate is exact at 25%.  The Germanic range is also very tight at 24-26%, right where it should be.

The England, Wales & Northeast Europe category is somewhat high, but that could be accurate because I do have some ancestry that is unknown.

Unfortunately, my Native is proven, both through Y and mtDNA and by triangulating the Native segments to others descending from the same Native ancestors. That portion is now missing in my Ancestry ethnicity.

Ancestry V1 Test Versus the V2 Test

For the record, I’m using my Ancestry V1 test because I’ve used that test version for all previous ethnicity comparisons.  My Ancestry V2 test ethnicity results are approximately the same, as follows:

  • England, Wales and Northeast Europe – 76%
  • Germanic – 22%
  • Ireland and Scotland – 2%

The same tree is attached to both tests.

On my V2 test, which I seldom use, I had to answer a couple of question regarding my expectations about ethnicity testing changes and how accurate my previous results were perceived to be before I could access my updated results.

Regions Changed

In Ancestry’s FAQ, they provided this list of how the regions were and are defined.

Previous Region New Regions
Scandinavia Norway, Sweden
Iberian Peninsula Spain, Portugal, Basque
Europe South Italy, Greece and the Balkans, Sardinia
Europe East Baltic States, Eastern Europe and Russia
Caucasus Turkey and the Caucasus, Iran/Persia
Europe West Germanic Europe, France
Native American Native America—North, Central, South; Native America—Andean
Asia South Southern Asia, Western and Central India, Balochistan, Burusho
Asia East Japan, Korea and Northern China, China, Southeast Asia—Dai (Tai), Southeast Asia—Vietnam, Philippines

Ancestry has addressed lots of other questions in their FAQ as well, and I suggest taking a look. I particularly like their comment, “Some places are complicated.” Indeed, that’s true with population churn both in historical times along with unknown pre-history and that complexity is exactly what makes intra-continental ethnicity estimates so difficult. Of course, people whose ancestors are from Europe, for example, want as much granularity as possible.

Previous Ethnicity Versions

For the first time, Ancestry explains what happened between versions, at least at a high level.

Click on the little “i” in the upper right hand corner of your ethnicity estimate box.

You’ll see more information.

Click on “View Previous Estimate” at the bottom.

Your previous ethnicity estimate is shown.

To see how your estimate changed, click on “Compare these results to your most recent Ancestry DNA estimate.”

This display shows you the differences compared to the previous version. In my case, England, Wales and NE Europe increased by 69%, but that’s because Ancestry redefined the regions. Note the little slide box underneath the regions on the map. You can slide back and forth from previous to current (update.).

I do wish Ancestry had told us where the “Scandinavian” went, what category it fell into. Are those segments, as a group, included in another region? Was the previous estimate simply flat out wrong? Was Scandinavian a vestige of Vikings who invaded much of Europe? What happened?

New Regions and Reference Samples

By clicking on “See other regions tested” at the bottom of your Ethnicity Estimate box, you can view the locations of Ancestry’s current reference populations.

The regions tested in which you have results are colored, and the regions where you aren’t showing results are shades of grey. This is an improvement over the previous version which people routinely misinterpreted to mean that they had results in those tested regions.

Best Features

In my opinion, the best feature of the combined ethnicity and Genetic Communities is the combined mapping. For example, the screenshot below combines the ethnicity regions with the ancestors from my tree who immigrated from that region in that timeframe.

By clicking on the 1700 box, the people from that time period in my tree are displayed. I can enlarge the map to make the display larger, until finally individual “people” icons are displayed, as shown with Johann Peter Koehler, below. Clicking on the individual person pin shows that individual in the box at right.

By clicking on the “Lower Midwest and Virginia Settlers,” I see this region and Ancestry tells me where those settlers likely originated.

You can then scroll down to the bottom of the information box where you see “Ancestry DNA Members.”

Click on the 1000+ link and you will then see the people who match you in a specific region or migration.

It’s worth noting that this isn’t always accurate. My 2nd cousin match is showing as a “Lower Midwest and Virginia” match and our ancestors came from the Netherlands directly to Northern Indiana. Ironically, she shows up in three of the 4 regions I can select from. This feature is not 100%, but it’s still nice to be able to see where that match is grouped in terms of ethnicity and Genetic Communities, according to Ancestry.

Given this combined functionality, I do wonder if Ancestry’s new ethnicity isn’t simply population genetics, but a combination of population genetics, ancestors in my tree, my matches and corresponding DNA Circles with their associated history. If so, that would make sense, both in terms of what I’m seeing as my new ethnicity results and the map functionality as well. Could that be where my Germanic came from, and why it’s so precise at 25% which matches by tree exactly?

In Summary

For me, Ancestry’s ethnicity estimates are significantly improved with the exception that my Native disappeared. I’ve worked long and hard on the Native aspect of my genealogy, and I know that part of my ethnicity mix is valid. However, that is a very small percentage overall (about 2%), and the combined improvements certainly outshine that one negative.

Of course, your mileage may vary. What are you seeing in terms of your new ethnicity estimates as compared to your known genealogy? Better? Worse? Did you lose any categories that you know are valid? What about small amounts of minority heritage?

Ancestor Birthdays Mean Presents for YOU!

I’ve been wanting to celebrate my ancestors’ birthdays for some time now, and I’ve finally figured out exactly how to accomplish this goal in a really fun way.

Being reminded once a year about their birthday and the anniversary of their death reminds me to work on their genealogy, and in particular, genetic genealogy. With more people testing every single day, meaning different people at every vendor, we need to check often with specific ancestors in mind. You never know who’s going to be the person who puts the chink in that brick wall.

With this in mind, I’ve put together a spreadsheet to track what I know about each ancestor. This makes it easy to schedule those dates in my calendar, with a reminder of course, and then to check my spreadsheet to see what information might have been previously missing that might be able to be found today.

It’s like a birthday present for them, but now for me. I am, after all, their heir, along with the rest of their descendants of course! If I’m lucky, I inherited part of their DNA, and if not, their DNA is still relevant to me.

Checking the List

Here’s my spreadsheet checklist for each ancestor:

  • Birth date
  • Birth place
  • Death date
  • Death place
  • Spouse
  • Y DNA haplogroup (if male)
  • Mitochondrial DNA haplogroup
  • Autosomal confirmed
  • Ancestry Circle

New information becomes digitized every year making new information available.

Additionally, some items may change. For example, if a base haplogroup was previous known, a deeper haplogroup might be available a year later if someone has taken a more detailed test or the haplogroup name might have been updated. Yes, that happens too.

I originally had a triangulation column on the spreadsheet too, but I pretty quickly discovered that column was subject to lots of questions about interpretation. Is the actual ancestor triangulated, or the line? I decided that “autosomal confirmed” would suffice to cover whatever I decide constitutes confirmation and a comment column could hold the description. For example, my grandparents are autosomal confirmed because I match (and triangulate) with cousins who are descended from ancestors upstream of my grandparents. If my grandparent wasn’t my grandparent, I wouldn’t be related to those people either. In particular, first cousins.

I also added an “Article Link” column to paste the link to that ancestor’s 52 Ancestors article so I can quickly check or maybe even provide this spreadsheet to a family member.

Here’s an example of what the first several entries of my Ancestor Birthday Spreadsheet look like.

Ancestor Birthday Presents for You

In order to remind myself to check on my ancestors’ status, on their birth and death days, I schedule reminders in my phone calendar. Every morning when I wake, I’m greeted by my ancestor – well – at least this much of them.

  • First, I check at Family Tree DNA for new matches, haplogroups and the presence of my family lines in surname projects.
  • Then it’s off to Ancestry to see if I have any new green leaf DNA or record hints, to add or update the circle for this particular ancestor, and to see if any of my matches would be a candidate for either Y or mitochondrial DNA testing, assuming they reply to messages and agree to test at Family Tree DNA. I keep a separate spreadsheet of each person that I’ve identified as a match with an identified ancestor. I know it’s extra work, but that spreadsheet is invaluable for determining if the ancestor is autosomal proven and if the match is a candidate for Y or mtDNA testing.
  • Then I get another cup of coffee and check at MyHeritage for new record matches for that ancestor, along with new DNA SmartMatches.
  • GedMatch and 23andMe aren’t as easy to check for matches specific to ancestors, but I still check both places to see if I can find matches that I can identify as descending from that ancestor.
  • While I’m at it, sometimes I run over to FamilySearch to see if there’s anything new over there, although they don’t deal with DNA. They do, however, have many traditional genealogical records. I may add another column to track if I’m waiting for something specific to be digitized – like court minutes, for example. FamilySearch has been on a digitization binge!
  • As I go along, I add any new discovery to my genealogy software and my Ancestor Birthday Spreadsheet as well.
  • Last, I paint new segment information from Family Tree DNA, MyHeritage, GedMatch or 23andMe at DNAPainter. My three articles about how I use DNAPainter are here, here and here.

I just love ancestor birthdays.

Any day that I get to find something new is a wonderful day indeed – fleshing out the lives, history and DNA of my ancestors. With this many places to look, there’s seldom a day that goes by that I don’t discover at least something in my ancestor scavenger hunt!

Ancestor birthday presents for me😊

_____________________________________________________________________

Standard Disclosure

This standard disclosure appears at the bottom of every article in compliance with the FTC Guidelines.

I provide Personalized DNA Reports for Y and mitochondrial DNA results for people who have tested through Family Tree DNA. I provide Quick Consults for DNA questions for people who have tested with any vendor. I would welcome the opportunity to provide one of these services for you.

Hot links are provided to Family Tree DNA, where appropriate. If you wish to purchase one of their products, and you click through one of the links in an article to Family Tree DNA, or on the sidebar of this blog, I receive a small contribution if you make a purchase. Clicking through the link does not affect the price you pay. This affiliate relationship helps to keep this publication, with more than 900 articles about all aspects of genetic genealogy, free for everyone.

I do not accept sponsorship for this blog, nor do I write paid articles, nor do I accept contributions of any type from any vendor in order to review any product, etc. In fact, I pay a premium price to prevent ads from appearing on this blog.

When reviewing products, in most cases, I pay the same price and order in the same way as any other consumer. If not, I state very clearly in the article any special consideration received. In other words, you are reading my opinions as a long-time consumer and consultant in the genetic genealogy field.

I will never link to a product about which I have reservations or qualms, either about the product or about the company offering the product. I only recommend products that I use myself and bring value to the genetic genealogy community. If you wonder why there aren’t more links, that’s why and that’s my commitment to you.

Thank you for your readership, your ongoing support and for purchasing through the affiliate link if you are interested in making a purchase at Family Tree DNA, or one of the affiliate links below:

Affiliate links are limited to: