DNAPainter: Painting Leeds Method Matches

Last week, I wrote about how I utilized the Leeds Method in the article, The Leeds Method. What I didn’t say is that I was sizing up the Leeds Method for how I could use the technique to paint additional segments of my chromosomes.

The Leeds Method divides your matches into four groups, one attributable to each grandparent. That means those matches can be painted to your four sets of great-grandparents, assuming you can identify the maternal and paternal groups. Hint – Y and mitochondrial DNA matching or haplogroups may help if you have no better hints.

For genealogists who know who their grandparents are, testing close relatives and cousins is a must in order to be able to associate matches with your four grandparents’ lines.

Please note that the Leeds method generates hints for genealogists by grouping people according to common matches. We must further evaluate those matches by doing traditional genealogy and by looking for segments that triangulate. The Leeds method in conjunction with the actual match results at vendors, combined with DNAPainter helps us do just that.

Utilizing DNAPainter

Since I’ve been able to sort matches into maternal and paternal “sides” using the Leeds Method, which in essence parentally phases the matches, I can use DNAPainter to paint them. Here are my four articles I wrote about how to utilize DNAPainter.

DNAPainter – Chromosome Sudoku for Genetic Genealogy Addicts 
DNAPainter – Touring the Chromosome Garden 
DNAPainter – Mining Vendor Matches to Paint Your Chromosomes 
Proving or Disproving a Half Sibling Relationship Using DNAPainter

Combining the Two Tools

DNAPainter has the potential to really utilize the Leeds Method results, other than Ancestry matches of course. Ancestry does not provide segment information. (Yes, I know, dead horse but I still can’t resist an occasional whack.)

You’re going to utilize your spreadsheet groupings to paint the DNA from each individual match at the vendors to DNAPainter.

On the spreadsheet, if these matches are from Family Tree DNA, MyHeritage, 23andMe or GedMatch, you’ll copy the matching segments from that vendor and paint those matching segments at DNAPainter. I explained how to do that in the articles about DNAPainter.

I do not use mass uploads to DNAPainter, because it’s impossible to assign those to different sides of your tree or ancestors. I paint individual matches, including information about the match and what I know about the history of the segment itself or associated ancestor.

I only paint segments that I can identify with certainty as maternal or paternal.

Pushing Back in Time

Based on which segments of identified ancestors the Leeds matches overlap with at DNAPainter, I can push that segment information further back in time. The blessing of this is that these Leeds matches may well fill in several blanks in my chromosome that are not yet painted by people with whom I share identified ancestors.

Even if your maternal and paternal grandparents are intermarried on each side, as long as they are not intermarried across your parental lines (meaning mother & father,) then the Leeds Method will work fine for painting. Even if you think you are attributing a segment to your paternal grandmother, for example, and the person actually matches through your paternal grandfather, you’ve still painted them on the correct chromosome – meaning your paternal chromosome. As you build up that chromosome with matches, you’ll see soon enough if you have 9 matches attributed to John Doe and one to Jane Smith, the Jane Smith match is likely incorrectly attributed, those two lines are somehow interrelated or it’s a false positive match.

Because I work with only fairly large Leeds matches – nothing below 30 cM, I sometimes receive a nice gift in terms of painting large previously unpainted segments – like the one on my mother’s side, below.

Look at this large green segment on chromosome 19 that I painted thanks to one of the Leeds matches, Harold. (Note that the two long blue and brown bars at the bottom of each chromosome are my ethnicity, not matches.) Another benefit is that if a Leeds match matches on already identified segments assigned to ancestors, I’ve just identified which ancestral lines I share with that match.

The green Ferverda side match to Roland through the Leeds Method aligns partially with a segment already known to descend from Jacob Lentz and Frederica Ruhle who were born in the 1780s. I’m related to Roland somehow through that line, and by just looking at his (redacted here) surname, I *think* I know how, even though he doesn’t have a tree online. How cool is that!

Important Notes for DNAPainter

Word of caution here. I would NOT paint anyone who falls into multiple match groups without being able to identify ancestors. Multiple match groups may indicate multiple ancestors, even if you aren’t aware of that.

Each segment has its own history, so it’s entirely possible that multiple match groups are accurate. It’s also possible that to some extent, especially with smaller segments, that matches by chance come into play. That’s why I only work with segments above 30 cM when using the Leeds method where I know I’m safe from chance matches. You can read about identical by descent (IBD) and identical by chance (IBC) matches here.

What a DNAPainter Leeds Match Means

It’s very important to label segments in DNAPainter with the fact that the source was through the Leeds Method.

These painted matches DO NOT MEAN that the match descends from the grandparent you are associating with the match.

It means that YOU inherited your common DNA with this match FROM that grandparent. It suggests that your match descends from one of the ancestors of this couple, or possibly from your great-grandparents, but you don’t necessarily share this great-grandparent couple with your match.

That’s different than the way I normally paint my chromosomes – meaning only when a specific common ancestor has been identified. For someone painted from matches NOT identified through the Leeds Method, if I know the person descends from a grandparent, I paint them to the great-grandparent couple. People painted through the Leeds Method don’t necessarily share that couple, but do share an ancestor of that couple.

When I paint using the Leeds method, I’m assigning the match to a set of great-grandparents because I can’t genealogically identify the common ancestor further upstream, so I’m letting genetics tell me which genealogical quadrant they fall into on my tree. With the Leeds Method, I can tell which grandparent I inherited that DNA through. In my normal DNAPainter methodology, I ONLY paint matches when I’ve identified the common ancestor – so Leeds Method matches would not previously have qualified.

I don’t mean to beat this to death and explain it several ways – but it’s really important to understand the difference and when looking back, understand why you painted what you did.

Labeling Leeds Match Painted Segments

Therefore, with Leeds Method match painting, I identify the match name as “John Doe FTDNA Leeds-Ferverda” which tells me the matches name (John Doe,) where they tested (FTDNA) and why I painted them (Ferverda column in my Leeds spreadsheet,) even though I don’t know for sure which ancestor we actually have in common. I paint them to the parents of my Ferverda grandfather. Not John Ferverda, my grandfather, but to his parents, Hiram Ferverda and Eva Miller. I know I received my matching DNA through one of them – I just don’t know which person of that couple yet.

However, looking at who else is assigned to that segment with an identified common ancestor will tell me where in my tree that segment originated – for me. We still don’t know where in your matches tree that segment originated.

“Match To” Issues

Lastly, if you happen to select a “match to” person to represent one of your grandparent matches that just happens to be descended from two grandparent lines, you’ve had your bad luck for the month. Remember, your “match to” person is the first person (closest match) that hasn’t yet been grouped, so you don’t really select them. If you realize you’re getting goofy results, stop and undo those results, then select the next candidate as your “match to” person.

At one vendor, when I selected the first person who hadn’t yet been grouped and used them for the red column which turned out to be Bolton, about half of them overlapped with Estes segments that I’ve already painted and confirmed from several sources. Obviously, there’s a problem someplace, and I’m guessing it just happens to be the luck of the draw with the “match to” person being descended from both lines. The lines both lived in the same county for generations. I need to redo that section with someone whose tree I know positively descends from the Bolton line and does NOT intersect with another of my lines. However, I was able to identify that this issue existed because I’ve already painted multiple ancestor-confirmed cousins who carry those same segments – and I know where they came from.

These tools are just that – tools and require some level of analytical skill and common sense. In other words, it’s a good idea to stay with larger matches and know when to say “uh-oh.” If it doesn’t feel right, don’t paint it.

Breaking Down Distant Brick Walls

I’m still thinking about how to use the Leeds Method, probably in combination with DNAPainter, to break down brick walls. My brick walls aren’t close in time. Most of them are several generations back and revolve around missing female surnames, missing records or ancestors appearing in a new location with no ability to connect them back to the location/family they left.

In essence, I would need to be able to isolate the people matching that most distant ancestor couple, then look for common surnames and ancestors within that match group. The DNAGedcom.com client which allows you to sort matches by surname might well be an integral piece of this puzzle/solution. I’ll have to spend some time to see how well this works.

Solving this puzzle would be entirely dependent on people uploading their trees.

If you have thoughts on how to use these tools to break down distant brick walls, or devise a methodology, please let me know.

And if you haven’t uploaded your tree, please do.

Would I Do The Leeds Method Again?

Absolutely, at least for the vendors who provide segment information.

I painted 8 new Leeds matches from Family Tree DNA on my Ferverda grandparent side which increased the number of painted segments at DNAPainter from 689 to 704, filled in a significant number of blank spaces on my chromosomes, and took my total % DNA painted from 60 to 61%. I added the rest of my Leeds hints from Family Tree DNA of 30 cM or over, and increased my painted segments to 734 and my percentage to 62% I know that 1 or 2% doesn’t sound like a very big increase, but it’s scientific progress.

It’s more difficult to increase the number of new segments after you’ve painted much of your genome because many segments overlap segments already painted. So, a 2% increase is well worth celebrating!

Having said that, I would love for the vendors to provide this type of clustering so I don’t have to. To date, Family Tree DNA is the only vendor who does any flavor of automatically bucketing results in this fashion – meaning paternal and maternal, which is half the battle. I would like to see them expand to the four grandparents from the maternal/paternal matching they provide today.

We’ve been asking Ancestry for enhanced tools for years. There’s no reason they couldn’t in essence do what Dana has done along with provide the DNAgedcom.com search functionality. And yes…I still desperately want a chromosome browser or at least segment information.

I will continue to utilize the Leeds Method, at least with vendors other than Ancestry because it allows me to incorporate the results with DNAPainter. It’s somehow ironic that I started out grouping the Ancestry results, but wound up realizing that the results from other vendors, specifically Family Tree DNA and MyHeritage are significantly more useful due to the segment data and combined tools.

Getting the Most Bang for Your Buck

If you tested at Ancestry or 23andMe, I would strongly encourage you to download your raw data file from both of these vendors and transfer to Family Tree DNA, MyHeritage and GedMatch to get the most out of your DNA tests. Here is the step-by-step guide for how to download your DNA from Ancestry.

The uploads to those three locations are free. All tools are free at MyHeritage until December 1, 2018 when they will begin charging for more advanced tools. The upload is free at Family Tree DNA and the advanced tools, including the chromosome browser, only require a $19 unlock.

Here is the step-by-step guide for uploading to MyHeritage and to Family Tree DNA. Fishing in every pond is critically important. You never know what you’re missing otherwise!

How many segments of your DNA can you paint using the Leeds Method in combination with DNA Painter?

______________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

The Leeds Method

This is the first in a series of two articles. This article explains the Leeds Method and how I created a Leeds Spreadsheet in preparation for utilizing the results in DNAPainter. I stumbled around a bit, but I think I’ve found a nice happy medium and you can benefit from my false starts by not having to stumble around in the dark yourself. Of course, I’m telling you about the pitfalls I discovered.

The second article details the methodology I utilized to paint these matches, because they aren’t quite the same as “normal” matching segments with identified ancestors.

Welcome to the Leeds Method

Dana Leeds developed a novel way to utilize a spreadsheet for grouping your matches from second through fourth cousins and to assign them to “grandparent” quadrants with no additional or previous information. That’s right, this method generates groupings that can be considered good hints without any other information at all.

Needless to say, this is great for adoptees and those searching for a parent.

It’s also quite interesting for genetic genealogists as well. One of the best aspects is that it’s very easy to do and very visual. Translation – no math. No subtraction.

Caveat – it’s also not completely accurate 100% of the time, especially when you are dealing with more distant matches, intermarriage and/or endogamy. But there are ways to work around these issues, so read on!

You can click to enlarge any image.

I’ll be referring to this graphic throughout this article. It shows the first several people on my Ancestry match list, beginning with second cousins, using pseudonyms. I chose to use Ancestry initially because they don’t provide chromosome browsers or triangulation tools, so we need as much help there as we can get.

I’ve shown the surnames of my 4 grandparents in the header columns with an assigned color, plus a “Weird group” (grey) that doesn’t seem to map to any of the 4. People in that group are much more distant in my match list, so they aren’t shown here.

I list the known “Most Common Recent Ancestor,” when identified, along with the color code that so I can easily see who’s who.

All those blanks in the MCRA column – those are mostly people without trees. Just think how useful this would be if everyone who could provide a tree did!

What Does the Leeds Method Tell You?

The Leeds Method divides your matches into four colored quadrants representing each grandparent unless your genealogical lines are heavily intermarried. If you have lots of people who fall into both of two (or more) colors, that probably indicates intermarriage or a heavily endogamous population.

In order to create this chart, you work with your closest matches that are 2nd cousins or more distant, but no more distant than 4th cousins. For endogamous people, by the time you’re working in 4th cousins, you’ll have too much overlap, meaning people who fall into multiple columns, so you’ll want to work with primarily 2nd and 3rd cousins. The good news is that endogamous people tend to have lots of matches, so you should still have plenty to work with!

Instructions

In this article, I’m using Dana’s method, with a few modifications.

By way of a very, very brief summary:

  • On a spreadsheet, you list all of your matches through at least third cousins
  • Then check each match to see who you match in common with them
  • Color code the results, in columns
  • Each person what you match in common with your closest cousin, Sleepy, is marked as yellow. Dopey and I both match Bashful and Jasmine in common and are colored Red. Doc and I both match Happy and Belle and are colored blue, and so forth.
  • The result is that each color represents a grandparent

To understand exactly what I’m doing, read Dana’s articles, then continue with this article.

DNA Color Clustering: The Leeds Method for Easily Visualizing Matches  
DNA Color Clustering: Identifying “In Common” Surnames 
DNA Color Clustering: Does it Work with 4th Cousins? By the way, yes it does, most of the time.
DNA Color Clustering: Dealing with 3 Types of Overlap

Why Use “The Leeds Method”?

In my case, I wanted to experiment. I wanted to see if this method works reliably and what could be done with the information if you already know a significant amount about your genealogy. And if you don’t.

The Leeds Method is a wonderful way to group people into 4 “grandparent” groups in order to search for in-common surnames. I love being able to perform this proof of concept “blind,” then knowing my genealogy and family connections well enough to be able to ascertain whether it did or didn’t work accurately.

If you can associate a match with a single grandparent, that really means you’ve pushed that match back to the great-grandparent couple.

That’s a lot of information without any genealogical knowledge in advance.

How Low Can You Go?

I have more than 1000 fourth cousins at Ancestry. This makes the task of performing the Leeds Method manually burdensome at that level. It means I would have had to type all 1000+ fourth cousins into a spreadsheet. I’m patient, but not that patient, at least not without a lot of return for the investment. I have to ask myself, exactly what would I DO with that information once they were grouped?

Would 4th cousin groupings provide me with additional information that second and third cousin groupings wouldn’t? I don’t think so, but you can be the judge.

After experimenting, I’d recommend creating a spreadsheet listing all of your 2nd and 3rd cousins, along with about 300 or so of your closest 4th cousin matches. Said another way, my results started getting somewhat unpredictable at about 40-45 cMs, although that might not hold true for others. (No, you can’t tell the longest matching segment length at Ancestry, but I could occasionally verify at the other vendors, especially when people from Ancestry have transferred.)

Therefore, I only proceeded through third cousins and about 300 of the Ancestry top 4th cousin matches.

I didn’t just utilize this methodology with Ancestry, but with Family Tree DNA, MyHeritage and 23andMe as well. I didn’t use GedMatch because those matches would probably have tested at one of the primary 4 vendors and I really didn’t want to deal with duplicate kits any more than I already had to. Furthermore, GedMatch is undergoing a transition to their Genesis platform and matching within the Genesis framework has yet to be perfected for kits other than those from these vendors.

Let’s talk about working with matches from each vendor.

Ancestry

At Ancestry, make a list of all of your second and third cousin matches, plus as many 4th cousins as you want to work with.

To begin viewing your common matches, select your first second cousin on the list and click on the green View Match. (Note that I am using my own second kit at Ancestry, RobertaV2Estes, not a cousin’s kit in these examples. The methodology is the same, so don’t fret about that.)

Then, click on Shared Matches.

Referring to your spreadsheet, assign a color to this match group and color the spreadsheet squares for this match group. Looking at my spreadsheet, my first group would be the yellow Estes group, so I color the squares for each person that I match in common with this particular cousin. On my spreadsheet, those cousins have all been assigned pseudonyms, of course.

Your shared match list will be listed in highest match order which should be approximately the same order they are listed on your spreadsheet. I use two monitors so I can display the spreadsheet on one and the Ancestry match list on the other.

Lon is shared in common with the gold person I’m comparing against (Roberta V2 Estes), and me, so his box would be colored gold on the spreadsheet. Lon’s pseudonym is Sneezy and the person beneath him on this list, not shown, would be Ariel.

Ancestry only shows in-common matches to the 4th cousin level, so you really couldn’t reach deeper if you wanted. Furthermore, I can’t see any advantage to working beyond the 4th cousin’s level, maximum. Your best matches are going to be the largest ones that reveal the most information and have the most matches, therefore allowing you to group the most people by color.

Unfortunately, Ancestry provides the total cMs and the number of segments, but not the largest matching segment.

One benefit of this methodology is that it’s fairly easy to group those pesky private matches like the last one on the master spreadsheet, Cersei, shown in red. You’ll at least know which grandparent group they match. Based on your identified ancestors of matches in the color group, you may be able to tell much more about that private match.

For example, one of my private matches is a match to someone who I share great-great-grandparents with AND they also match with two people further on up that tree on the maternal side of that couple, shown above, in red. I may never know which ancestor I share with that private match specifically, but I have a pretty darned good idea now in spite of that ugly little lock. The more identified matches, the better and more accurate this technique.

Is the Leeds Method foolproof? No.

Is this a great tool? Yes, absolutely.

Family Tree DNA

Thankfully, Family Tree DNA provides more information about my matches than Ancestry, including segment information combined with a chromosome browser and Family Matching. I often refer to Family Matching as parental bucketing, shown on your match list with the maternal and paternal tabs, because Family Tree DNA separates your matches into parental “sides” based on common segments with others on your maternal and paternal branches of your tree when you link your matches’ results.

At Family Tree DNA, sign on and then click on Matches under Family Finder.

When viewing your matches, you’ll see blue or red people icons any that are assigned to either your maternal, paternal side, or both (purple) on your match list. If you click on the tabs at the top,  you’ll see JUST the maternal, paternal or both lists.

This combination of tools allows you to confirm (and often triangulate) the match for several people. If those matches are bucketed, meaning assigned to the same parental side, and they match on the same segment, they are triangulated for all intents and purposes if the segment is above 20 cM. All of the matches I worked with for the Leeds Method were well above 20 cM, so you don’t really need to worry about false or identical by chance matches at that level.

Family Tree DNA matches are initially displayed by the total number of “Shared cM.” Click on “Longest Block” to sort in that manner. I considered people through 30 cM and above as equivalent to the Ancestry 3rd cousin category. Some of the matching became inconsistent below that threshold.

List all of your second and third cousins on the spreadsheet, along with however many 4th cousins you want to work with.

Then, select your closest second cousin by checking the box to the left of that individual, then click on “In Common With” above the display. This shows you your matches in common with this person.

On the resulting common match list, sort your matches in Longest block order, then mark the matches on your spreadsheet in the correct colored columns.

With each vendor, you may need to make new columns until you can work with enough matches to figure out which column is which color – then you can transfer them over. If you’re lucky enough to already know the family association of your closest cousins, then you already know which colored column they belong to.

All of my matches that fell into the Leeds groups were previously bucketed to maternal or paternal, so consistency between the two confirms both methodologies. Between 20 and 28 cM, three of my bucketed matches at Family Tree DNA fell into another group using the Leeds method, which is why I drew the line at 30cM.

For genealogists who already know a lot about their tree, this methodology in essence divides the maternal and paternal buckets into half. FTDNA already assigns matches maternally or paternally with Family Matching if you have any information about how your matches fit into your tree and can link any matching testers to either side of your tree at the 3rd cousin level or closer.

If you don’t know anything about your heritage, or don’t have any way to link to other family members who have tested, you’ll start from scratch with the Leeds Method. If you can link family members, Family Tree DNA already does half of the heavy lifting for you which allows you to confirm the Leeds methodology.

MyHeritage

At MyHeritage, sign in, click on DNA and sort by “largest segment,” shown at right, above. I didn’t utilize matches below 40 cM due to consistency issues. I wonder if imputation affects smaller matches more than larger matches.

You’ll see your closest matches at the top of the page. Scroll down and make a list on your spreadsheet of your second and third cousins. Return to your closest DNA match that is a second cousin and click on the purple “Review DNA Match” which will display your closest in-common matches with that person, but not necessarily in segment size order.

Scroll down to view the various matches and record on the spreadsheet in their proper column by coloring that space.

The great aspect of MyHeritage is that triangulation is built in, and you can easily see which matches triangulate, providing another layer of confirmation, assuming you know the relationship of at least some of your matches.

The message for me personally at MyHeritage is that I need to ask known cousins who are matches elsewhere to upload to MyHeritage because I can use those as a measuring stick to group matches, given that I know the cousin’s genealogy hands-down.

The great thing about MyHeritage is that they are focused on Europe, and I’m seeing European matches that aren’t anyplace else.

23andMe

At 23andMe, sign in and click on DNA Relatives under the Ancestry tab.

You’ll see your list of DNA matches. Record 2nd and third cousins on your spreadsheet, as before.

To see who you share in common with a match, click on the person’s name and color your matches on the spreadsheet in the proper column.

Unfortunately, the Leeds Method simply didn’t work well for me with my 23andMe data, or at least the results are highly suspect and I have no way of confirming accuracy.

Most of my matches fell into in the Estes category, with the Boltons overlapping almost entirely, and none in the Lore or Ferverda columns. There is one small group that I can’t identify. Without trees or surnames, genealogically, my hands are pretty much tied. I can’t really explain why this worked so poorly at 23andMe. Your experience may be different.

The lack of trees is a significant detriment at 23andMe because other than a very few matches whose genealogy I know, there’s no way to correlate or confirm accuracy. My cousins who tested at 23andMe years ago and whose tests I paid for lost interest and never signed in to re-authorize matching. Many of those tests are on the missing Ferverda side, but their usefulness is now forever lost to me.

23andMe frustrates me terribly. Their lack of commitment to and investment in the genealogical community makes working with their results much more difficult than it needs to be. I’ve pretty much given up on using 23andMe for anything except adoption searches for very close matches as a last resort, and ethnicity.

The good news is that with so many people testing elsewhere, there’s a lot of good data just waiting!

What are the Benefits?

The perception of “benefit” is probably directly connected to your goal for DNA testing and genetic genealogy.

  • For adoptees or people seeking unknown parentage or unknown grandparents, the Leeds Method is a fantastic tool, paving the way to search for common surnames within the 4 groups as opposed to one big pool.
  • For people who have been working with their genealogy for a long time, maybe not as much, but hints may lurk and you won’t know unless you do the discovery work. If you’re a long-time genealogist, you’re used to this, so it’s just a new way of digging through records – and you can do it at home!
  • For people who have tested at Family Tree DNA, the family grouping by maternal and paternal based on people linked to your tree is more accurate and groups people further down your match list because it’s actually based on triangulated matching segments. However, the Leeds Method expands on that and adds granularity by breaking those two groups into four.
  • For people who want to paint their chromosomes using DNAPainter, the Leeds Method is the first step of a wonderful opportunity if you have tested at either Family Tree DNA, MyHeritage or 23andMe.

Unfortunately, Ancestry doesn’t provide segment information, so you can’t chromosome paint from Ancestry directly, BUT, you can upload to either Family Tree DNA, MyHeritage or GedMatch and paint Ancestry matches from there. At GedMatch, their kit numbers begin with A.

What Did I Do Differently than Dana?

Instead of adding a 5th column with the first person (Sam) who was not grouped into the first 4 groups, I looked for the closest matches that I shared with Sam who were indeed in the first 4 color groups. I added Sam to that existing color group along with my shared matches with Sam that weren’t already grouped into that color so long as it was relatively consistent. If it looked too messy, meaning I found people in multiple match groups, I left it blank or set that match aside. This didn’t happen until I was working at the 4th cousin level or between 30 and 40 cM, depending on the vendor.

Please note that just because you find people that you match in common with someone does NOT MEAN that you all share a common ancestor, or the same ancestor. It’s a hint, a tip to be followed.

There were a couple of groups that I couldn’t cluster with other groups, and one match that clustered in three of the four grandparent groups. I set that one aside as an outlier. I will attempt to contact them. They don’t have a tree.

I grouped every person through third cousin matches. I started out manually adding the 4th cousins for each match, but soon gave up on that due to the sheer magnitude. I did group my closest 4th cousins, or until they began to be inaccurate or messy, meaning matching in multiple groups. Second and third cousin matching was very consistent.

Tips

  • Don’t use siblings or anyone closer than the second cousin level. First cousins share two grandparents. You only want to use matches that can be assigned to ONLY ONE GRANDPARENT.
  • In the spreadsheet cell, mark the person you used as a “match to.” In other words, which people did you use to populate that color group. You can see that I used two different people in the Estes category. I used more in the other categories too, but they are further down in my list.
  • At Family Tree DNA, you can utilize the X chromosome. Understand that if you are a male, you will not have any X matches with your paternal grandfather. I would not recommend using X matches for the Leeds Method, especially since they are not uniformly available at all vendors and form a specific unique inheritance pattern that is not the same as the other autosomes.
  • Ancestry, MyHeritage and Family Tree DNA allow you to make notes on each match. As I group these, and as I paint them with DNAPainter I made a note on each match that allows me to identify which group they are assigned to, or if they match multiple groups.
  • Look at each match to be sure they are consistent. If they aren’t, either mark them as inconclusive or omit them entirely in the painting process. I write notes on each one if there is something odd, or if I don’t paint them.

What Did I Learn?

Almost all of my (endogamous by definition) Acadian matches are more distant, which means the segments are smaller. I expected to find more in the painted group, because I have SO MANY Acadian matches, but given that my closest Acadian ancestor was my great-great-grandfather, those segments are now small enough that those matches don’t appear in the candidate group of matches for the Leeds Method. My Acadian heritage occurs in my green Lore line, and there are surprisingly few matches in that grouping large or strong enough to show up in my clustered matches. In part, that’s probably because my other set of great-great-grandparents in that line arrived in 1852 from Germany and there are very few people in the US descended from them.

I found 4th cousin matches I would have otherwise never noticed because they don’t have a tree attached. At Ancestry, I only pay attention to closer matches, Shared Ancestor Hints and people with trees. We have so many matches today that I tend to ignore the rest.

Based on the person’s surname and the color group into which they fall, it’s often possible to assign them to a probable ancestral group based on the most distant ancestors of the people they match within the color group. In some cases, the surname is another piece of evidence and may provide a Y DNA lead.

For example, one of my matches user name is XXXFervida. They do match in the Ferverda grandparent group, and Fervida is how one specific line of the family spelled the surname. Of course, I could have determined that without grouping, but you can never presume a specific connection based solely on surname, especially with a more common name. For all I know, Fervida could be a married name.

By far the majority of my matches don’t have trees or have very small trees. That “no-tree” percentage is steadily increasing at Ancestry, probably due to their advertising push for ethnicity testing. At Family Tree DNA where trees are infinitely more useful, the percentage of people WITH trees is actually rising. By and large, Family Tree DNA users tend to be the more serious genealogists.

MyHeritage launched their product more recently with DNA plus trees from the beginning, although many of the new transfers don’t have trees or have private trees. Their customers seem to be genealogically savvy and many live in Europe where MyHeritage DNA testing is focused.

23andMe is unquestionably the least useful for the Leeds Method because of their lack of support for trees, among other issues, but you may still find some gems there.

Keeping Current

Now that I invested in all of this work, how will I keep the spreadsheet current, or will I at all?

At Ancestry, I plan to periodically map all of my SAH (Shared Ancestor Hints) green leaf matches as well as all new second and third cousin matches, trees or not.

In essence, for those with DNA matches and trees with a common ancestor, Ancestry already provides Circles, so they are doing the grouping for those people. Where this falls short, of course, is matches without trees and without a common identified ancestor.

For Ancestry matches, I would be better served, I think, to utilize Ancestry matches at GedMatch instead of at Ancestry, because GedMatch provides segment information which means the matches can be confirmed and triangulated, and can be painted.

For matches outside of Ancestry, in particular at Family Tree DNA and MyHeritage I will keep the spreadsheet current at least until I manage to paint my entire set of chromosomes. That will probably be a very long time!

I may not bother with 23andMe directly, given that I have almost no ability to confirm accuracy. I will utilize 23andMe matches at GedMatch. People who transfer to GedMatch tend to be interested in genealogy.

What Else Can I Do?

At Ancestry, I can use Blaine’s new “DNA Match Labeling” tool that facilitates adding 8 colored tags to sort matches at Ancestry. Think of it as organizing your closet of matches. I could tag each of these matches to their grandparent side which would make them easy to quickly identify by this “Leeds Tag.”

My Goals

I have two primary goals:

  • Associating segments of my DNA with specific ancestors
  • Breaking down genealogical brick walls

I want to map my DNA segments to specific ancestors. I am already doing this using Family Tree DNA and MyHeritage where common ancestors are indicated in trees and by surnames. I can map these additional Leeds leads (pardon the pun) to grandparents utilizing this methodology.

To the extent I can identify paternal and maternal matches at 23andMe, I can do the same thing. I don’t have either parents’ DNA there, and few known relatives, so separating matches into maternal and paternal is more difficult. It’s not impossible but it means I can associate fewer matches with “sides” of my genealogy.

For associating segments with specific ancestors and painting my chromosomes, DNAPainter is my favorite tool.

In my next article, we’ll see how to use our Leeds Method results successfully with DNAPainter and how to interpret the results.

______________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

MyHeritage Step by Step Guide: How to Upload-Download DNA Files

In this Upload-Download Series, we’ll cover each major vendor:

  • How to download raw data files from the vendor
  • How to upload raw data files to the vendor, if possible
  • Other mainstream vendors where you can upload this vendor’s files

Uploading TO MyHeritage

Upload Step 1

To upload your DNA to MyHeritage, click here and then click on the purple “Start” button.

Upload Step 1 If You Already Have an Account at MyHeritage

If you already have an account, click here to sign in and then click on the DNA tab to display the “Upload DNA Data” option which displays the graphic above. Click on the purple “Start” button. This is the same process you’ll use whether it’s the first time you’ve uploaded a kit, or you’re uploading subsequent kits to your account that you’ll be managing.

Upload Step 2

You’ll be prompted to create a free account by entering your name, e-mail and password, and from there you can upload your autosomal DNA file.

You’ll be asked whose DNA you’re uploading and prompted to read and agree to the terms of service and consent.

Click the purple upload button.

Then click done when the file is finished uploading.

You’ll be notified by e-mail within a couple days when the file is finished processing.

Downloading FROM MyHeritage

Download Step 1

Sign on to your MyHeritage account.

Click on DNA on the upper toolbar.

The dropdown menu includes “Manage DNA Kits”

Download Step 2

At the right of the kit you wish to download, click on the three small buttons which will include an option for “Download,” as shown in the graphics below from the MyHeritage blog article.

Download Step 3

You’ll be presented with a box titled “Learn more about DNA data files.” Click the purple “Continue” button.

Download Step 4

You’ll need to confirm that you want to download your data, and that you understand that the download is outside of MyHeritage and their protection. Click the purple “Continue” button.

Download Step 5

You’ll receive a confirmation e-mail. Click on “Click here to continue with download.”

This e-mail link is only valid for 24 hours.

Download Step 6

Enter your password again, and click on the purple “Download” button.

Download Step 7

Save the file as a recognizable file name on your computer.

MyHeritage File Transfers TO Other Vendors

You can upload your MyHeritage file to other vendors, as follows.

From below to >>>>>>>>>>> Family Tree DNA Accepts Ancestry Accepts 23andMe Accepts GedMatch Accepts
MyHeritage Yes No No Yes

Neither Ancestry nor 23andMe accepts uploads from any vendor.

MyHeritage File Transfers FROM Other Vendors

You can upload files from other vendors to MyHeritage, as follows:

  From Family Tree DNA From Ancestry From 23andMe From LivingDNA
To MyHeritage Yes Yes Yes Yes

Testing and Transfer Strategy

Transferring to MyHeritage is always free. You can view your ethnicity, your matches and their trees, and utilize the DNA tools, but you won’t receive the full benefit of SmartMatching and other records without a subscription. You will be limited to building a tree of 250 people for free, but you can upload a Gedcom file of any size, although you do need to subscribe to change anything in that file if it contains more than 250 individuals.

Until December 1, 2018, all DNA tools will be and remain free for anyone who uploads before that date. After December 1st, matching will remain free, but the advanced tools such as ethnicity, the chromosome browser, triangulation and more will require payment. MyHeritage has not yet indicated how that will work, so upload now to receive free DNA tools forever.

My testing/transfer recommendations are as follows relative to MyHeritage:

Have fun!

______________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

Ancestry 2018 Ethnicity Update

When ethnicity estimates were first produced by vendors, they tended to resemble the wild west.

Today, results are becoming more refined and hopefully, more accurate as reference populations grow and become more reliable.

The Ancestry ethnicity update has been in beta for several months, but this week, Ancestry rolled out the ethnicity update for everyone.

Checking Your New Results

To see your updated results, sign on and click on the DNA Story to the left with Ethnicity Estimates.

Ancestry then explains that while your DNA doesn’t change, the estimates (pay attention to that word) do as the science improves.

Ethnicity Estimate Aren’t Precise

I’ve said this before, and I want to say it again. Ethnicity is the least precise and the least accurate of DNA tools for genetic genealogy. Ethnicity estimates are the most accurate at a continental level. Within continents, like Europe, Asia and Africa, there has been a lot of population movement and intermixing over time making the term “ethnicity” almost meaningless.

I know, I know – ethnicity estimates are also the simplest because there isn’t much learning curve and they’re easy to understand at a glance. This deceptive “ease of use” also makes them interesting to people who have only a passing curiosity. That’s why they attract so many test takers who either love of hate their results, but never fully understand the true message or utilize any other genetic genealogy tools.

Let’s take a look at how ethnicity estimates have changed over time and if they have improved with the latest version.

Ethnicity Estimate Changes

In my case, my original Ancestry ethnicity estimate in 2012 was:

  • British Isles 80%
  • Scandinavia 12%
  • Uncertain 8%

To say it was really bad is an understatement.

In 2013, Ancestry introduced their ethnicity V2 version which provided a lot more granularity.

Version 2 was dramatically different, with the British Isles moving from 80% to a total of 6%. Like a pendulum swinging, neither was accurate.

Ancestry introduced new features and combined their Genetic Communities with their ethnicity estimates in 2017.

In this new 2018 version, Ancestry has divided and recombined the British Isles and Western Europe differently and the resulting differences are significant.

My mystery Scandinavian is entirely gone now, but sadly, so is my Native American.

The New Results

I just got really boring – but the question is whether or not the new results are more accurate as compared to my proven genealogy. Boring doesn’t matter. Accuracy does.

Various Ancestry Ethnicity Versions Compared to Proven Genealogy

I created a chart that reflects the three Ancestry ethnicity versions as compared to my proven genealogy.

For the current version, I also included the ranges as provided by Ancestry.

As you can see, generally, the results are much more accurate, but the regions are also fairly broad which makes accuracy easier to achieve.

Until this current version, Ancestry didn’t show any Germanic, but now the Germanic estimate is exact at 25%.  The Germanic range is also very tight at 24-26%, right where it should be.

The England, Wales & Northeast Europe category is somewhat high, but that could be accurate because I do have some ancestry that is unknown.

Unfortunately, my Native is proven, both through Y and mtDNA and by triangulating the Native segments to others descending from the same Native ancestors. That portion is now missing in my Ancestry ethnicity.

Ancestry V1 Test Versus the V2 Test

For the record, I’m using my Ancestry V1 test because I’ve used that test version for all previous ethnicity comparisons.  My Ancestry V2 test ethnicity results are approximately the same, as follows:

  • England, Wales and Northeast Europe – 76%
  • Germanic – 22%
  • Ireland and Scotland – 2%

The same tree is attached to both tests.

On my V2 test, which I seldom use, I had to answer a couple of question regarding my expectations about ethnicity testing changes and how accurate my previous results were perceived to be before I could access my updated results.

Regions Changed

In Ancestry’s FAQ, they provided this list of how the regions were and are defined.

Previous Region New Regions
Scandinavia Norway, Sweden
Iberian Peninsula Spain, Portugal, Basque
Europe South Italy, Greece and the Balkans, Sardinia
Europe East Baltic States, Eastern Europe and Russia
Caucasus Turkey and the Caucasus, Iran/Persia
Europe West Germanic Europe, France
Native American Native America—North, Central, South; Native America—Andean
Asia South Southern Asia, Western and Central India, Balochistan, Burusho
Asia East Japan, Korea and Northern China, China, Southeast Asia—Dai (Tai), Southeast Asia—Vietnam, Philippines

Ancestry has addressed lots of other questions in their FAQ as well, and I suggest taking a look. I particularly like their comment, “Some places are complicated.” Indeed, that’s true with population churn both in historical times along with unknown pre-history and that complexity is exactly what makes intra-continental ethnicity estimates so difficult. Of course, people whose ancestors are from Europe, for example, want as much granularity as possible.

Previous Ethnicity Versions

For the first time, Ancestry explains what happened between versions, at least at a high level.

Click on the little “i” in the upper right hand corner of your ethnicity estimate box.

You’ll see more information.

Click on “View Previous Estimate” at the bottom.

Your previous ethnicity estimate is shown.

To see how your estimate changed, click on “Compare these results to your most recent Ancestry DNA estimate.”

This display shows you the differences compared to the previous version. In my case, England, Wales and NE Europe increased by 69%, but that’s because Ancestry redefined the regions. Note the little slide box underneath the regions on the map. You can slide back and forth from previous to current (update.).

I do wish Ancestry had told us where the “Scandinavian” went, what category it fell into. Are those segments, as a group, included in another region? Was the previous estimate simply flat out wrong? Was Scandinavian a vestige of Vikings who invaded much of Europe? What happened?

New Regions and Reference Samples

By clicking on “See other regions tested” at the bottom of your Ethnicity Estimate box, you can view the locations of Ancestry’s current reference populations.

The regions tested in which you have results are colored, and the regions where you aren’t showing results are shades of grey. This is an improvement over the previous version which people routinely misinterpreted to mean that they had results in those tested regions.

Best Features

In my opinion, the best feature of the combined ethnicity and Genetic Communities is the combined mapping. For example, the screenshot below combines the ethnicity regions with the ancestors from my tree who immigrated from that region in that timeframe.

By clicking on the 1700 box, the people from that time period in my tree are displayed. I can enlarge the map to make the display larger, until finally individual “people” icons are displayed, as shown with Johann Peter Koehler, below. Clicking on the individual person pin shows that individual in the box at right.

By clicking on the “Lower Midwest and Virginia Settlers,” I see this region and Ancestry tells me where those settlers likely originated.

You can then scroll down to the bottom of the information box where you see “Ancestry DNA Members.”

Click on the 1000+ link and you will then see the people who match you in a specific region or migration.

It’s worth noting that this isn’t always accurate. My 2nd cousin match is showing as a “Lower Midwest and Virginia” match and our ancestors came from the Netherlands directly to Northern Indiana. Ironically, she shows up in three of the 4 regions I can select from. This feature is not 100%, but it’s still nice to be able to see where that match is grouped in terms of ethnicity and Genetic Communities, according to Ancestry.

Given this combined functionality, I do wonder if Ancestry’s new ethnicity isn’t simply population genetics, but a combination of population genetics, ancestors in my tree, my matches and corresponding DNA Circles with their associated history. If so, that would make sense, both in terms of what I’m seeing as my new ethnicity results and the map functionality as well. Could that be where my Germanic came from, and why it’s so precise at 25% which matches by tree exactly?

In Summary

For me, Ancestry’s ethnicity estimates are significantly improved with the exception that my Native disappeared. I’ve worked long and hard on the Native aspect of my genealogy, and I know that part of my ethnicity mix is valid. However, that is a very small percentage overall (about 2%), and the combined improvements certainly outshine that one negative.

Of course, your mileage may vary. What are you seeing in terms of your new ethnicity estimates as compared to your known genealogy? Better? Worse? Did you lose any categories that you know are valid? What about small amounts of minority heritage?

______________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

Ancestor Birthdays Mean Presents for YOU!

I’ve been wanting to celebrate my ancestors’ birthdays for some time now, and I’ve finally figured out exactly how to accomplish this goal in a really fun way.

Being reminded once a year about their birthday and the anniversary of their death reminds me to work on their genealogy, and in particular, genetic genealogy. With more people testing every single day, meaning different people at every vendor, we need to check often with specific ancestors in mind. You never know who’s going to be the person who puts the chink in that brick wall.

With this in mind, I’ve put together a spreadsheet to track what I know about each ancestor. This makes it easy to schedule those dates in my calendar, with a reminder of course, and then to check my spreadsheet to see what information might have been previously missing that might be able to be found today.

It’s like a birthday present for them, but now for me. I am, after all, their heir, along with the rest of their descendants of course! If I’m lucky, I inherited part of their DNA, and if not, their DNA is still relevant to me.

Checking the List

Here’s my spreadsheet checklist for each ancestor:

  • Birth date
  • Birth place
  • Death date
  • Death place
  • Spouse
  • Y DNA haplogroup (if male)
  • Mitochondrial DNA haplogroup
  • Autosomal confirmed
  • Ancestry Circle

New information becomes digitized every year making new information available.

Additionally, some items may change. For example, if a base haplogroup was previous known, a deeper haplogroup might be available a year later if someone has taken a more detailed test or the haplogroup name might have been updated. Yes, that happens too.

I originally had a triangulation column on the spreadsheet too, but I pretty quickly discovered that column was subject to lots of questions about interpretation. Is the actual ancestor triangulated, or the line? I decided that “autosomal confirmed” would suffice to cover whatever I decide constitutes confirmation and a comment column could hold the description. For example, my grandparents are autosomal confirmed because I match (and triangulate) with cousins who are descended from ancestors upstream of my grandparents. If my grandparent wasn’t my grandparent, I wouldn’t be related to those people either. In particular, first cousins.

I also added an “Article Link” column to paste the link to that ancestor’s 52 Ancestors article so I can quickly check or maybe even provide this spreadsheet to a family member.

Here’s an example of what the first several entries of my Ancestor Birthday Spreadsheet look like.

Ancestor Birthday Presents for You

In order to remind myself to check on my ancestors’ status, on their birth and death days, I schedule reminders in my phone calendar. Every morning when I wake, I’m greeted by my ancestor – well – at least this much of them.

  • First, I check at Family Tree DNA for new matches, haplogroups and the presence of my family lines in surname projects.
  • Then it’s off to Ancestry to see if I have any new green leaf DNA or record hints, to add or update the circle for this particular ancestor, and to see if any of my matches would be a candidate for either Y or mitochondrial DNA testing, assuming they reply to messages and agree to test at Family Tree DNA. I keep a separate spreadsheet of each person that I’ve identified as a match with an identified ancestor. I know it’s extra work, but that spreadsheet is invaluable for determining if the ancestor is autosomal proven and if the match is a candidate for Y or mtDNA testing.
  • Then I get another cup of coffee and check at MyHeritage for new record matches for that ancestor, along with new DNA SmartMatches.
  • GedMatch and 23andMe aren’t as easy to check for matches specific to ancestors, but I still check both places to see if I can find matches that I can identify as descending from that ancestor.
  • While I’m at it, sometimes I run over to FamilySearch to see if there’s anything new over there, although they don’t deal with DNA. They do, however, have many traditional genealogical records. I may add another column to track if I’m waiting for something specific to be digitized – like court minutes, for example. FamilySearch has been on a digitization binge!
  • As I go along, I add any new discovery to my genealogy software and my Ancestor Birthday Spreadsheet as well.
  • Last, I paint new segment information from Family Tree DNA, MyHeritage, GedMatch or 23andMe at DNAPainter. My three articles about how I use DNAPainter are here, here and here.

I just love ancestor birthdays.

Any day that I get to find something new is a wonderful day indeed – fleshing out the lives, history and DNA of my ancestors. With this many places to look, there’s seldom a day that goes by that I don’t discover at least something in my ancestor scavenger hunt!

Ancestor birthday presents for me😊

_____________________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

Family Tree DNA Step by Step Guide: How to Upload-Download DNA Files

In this Upload-Download Series, we’ll cover each major vendor:

  • How to download raw data files from the vendor
  • How to upload raw data files to the vendor
  • Other mainstream vendors where you can upload this vendor’s files

You can both upload autosomal data files from another vendor to Family Tree DNA, and download from Family Tree DNA.

Uploading TO Family Tree DNA

Step 1

On the main Family Tree DNA page, click on “Upload DNA Data,” at the top.

Step 2

Select either “Autosomal DNA” or Nat Geo’s Geno 2.0 data. Hint, the current version of Genographic data won’t work. The current version is processed on the Helix chip as of November 2016.

Step 3

If you are not transferring from Nat Geo, go to Step 4.

If you are transferring a Nat Geo 2.0 kit, proceed to enter your National Geographic Kit Number. If you don’t know your kit number, click on “Where do I find my Geno ID Code.”

Step 4

These instructions in Step 4 focus on uploading your autosomal DNA file from other vendors, not Genographic 2.0.

Complete the form. If you already have tested Y or mitochondrial DNA at Family Tree DNA, select “Already have a Family Tree DNA account,” so that your uploaded test can be integrated with your existing account. If you have already taken the Family Finder test at Family Tree DNA, there’s no need to upload your autosomal DNA from any other vendor.

Step 5

Next, select the vendor test that you are going to upload. Uploads accepted include:

  • 23andMe V3 and V4 – tests taken after December 2010 and before August 2017. The V5 chip, in use at 23andMe since August 2017 is not compatible.
  • Ancestry V1 – tests taken until May 2016.
  • Ancestry V2 – tests taken since May 2016
  • MyHeritage – fully compatible, Family Tree DNA is the lab that does their testing

If you select “MyHeritage,” you will be redirected to MyHeritage to log in and select your kit for transfer.

If you select either 23andMe or Ancestry DNA, you will be directed to either drag and drop your data file from that vendor or browse to upload.

Family Tree DNA provides a step by step guide for accessing your raw data files from those vendors by clicking on “How to I access my raw data files?,” above the grey transfer box.

You will be assigned a Family Tree DNA kit number. After your results are processed, you can sign in to see your matches.

Compatibility

The 23andMe V3, V4 and the Ancestry V1 kits are fully compatible, meaning that you will get the same matches at Family Tree DNA using those transfer kits as you would if you tested at Family Tree DNA. However, the Ancestry V2 kit is only partially compatible, meaning that you will only receive 20-25% of the matches by transferring a V2 test that you would receive if you tested at Family Tree DNA. Ancestry changed their chip in May of 2016.

If you want all of your possible matches, and who doesn’t, you should test at Family Tree DNA instead of uploading a V2 Ancestry test.

Step 6

The transfer to Family Tree DNA is free and so is viewing your matches along with basic tools. However, for additional tools, such as ethnicity and the chromosome browser, you’ll need to purchase the $19 unlock. This is a great value, as compared to retesting at $79, or sometimes less when on sale.

You will be prompted for the “Unlock” purchase if you click on either the Chromosome Browser button, the Ethnicity button or other advanced tools on your dashboard after your results are finished.

Downloading FROM Family Tree DNA

Step 1

To download your raw autosomal DNA file from Family Tree DNA, click on the orange “Download Raw Data” link at the bottom of your Family Finder section on your dashboard.

Alternatively, you can select the “Download Raw Data” option at the top of the page under myDNA, “Family Finder.”

Step 2

You will be given the option of downloading your Build 36 and Build 37 raw data files.

Different vendors request different types of files.

  • GedMatch – Build 36 Raw Data Concatenated
  • MyHeritage – Build 37 Raw Data Concatenated

Other vendors may request different file formats, and the above vendors may change over time.

Click the arrow beside the version you need.

Step 3

Save the file in a manner that you’ll recognize. The file name will be something like “37_R_Estes_Chrom_Autoso_20180818.gz”. I append the word FTDNA in front of the 37 so there is no question which vendor’s file this is. The last several digits are the date.

Family Tree DNA File Transfers TO Other Vendors

You can upload Family Tree DNA results to other vendors, as follows:

From below to >>>>>>>>>>> Ancestry Accepts MyHeritage Accepts 23andMe Accepts GedMatch Accepts
Family Tree DNA No Yes No Yes

Neither Ancestry nor 23andMe accept uploads from any vendor.

Family Tree DNA Transfers FROM Other Vendors

You can upload files from other vendors to Family Tree DNA, as follows:

From Ancestry  From MyHeritage  From 23andMe  From Living DNA
Family Tree DNA Accepts Yes Yes V3, V4 No

Testing and Transfer Strategy

Transferring to Family Tree DNA is always free, and you can see your matches. In order to view your ethnicity and use advanced tools like the chromosome browser, you’ll need to purchase the $19 unlock.

Remember that while Ancestry and MyHeritage both have records subscriptions to sell you, Family Tree DNA doesn’t. For Tier 1 tools, GedMatch requires a subscription. At Family Tree DNA, you pay a one time fee to unlock all of their tools. Every company needs to be profitable to stay in business and develop new tools, and each company has a different product pricing model.

My testing/transfer recommendations are as follows relative to Family Tree DNA:

An Ancestry V1 test is entirely compatible at Family Tree DNA, but with a V2 test, because the testing platform that Ancestry uses is only about 20-25% compatible with the Family Tree DNA test, you’ll only receive your closest 20-25% matches. Family Tree DNA can’t match on those smaller segments if you don’t test on a compatible platform, so please do. I wrote a step-by-step guide about how to download from Ancestry here, including how to tell if you have a V1 or V2 format test.

    • If you have Ancestry V2 results, you can transfer to MyHeritage and GedMatch but I recommend retesting at Family Tree DNA. The cost difference at Family Tree DNA between the $19 unlock and a new Family Finder test is $60, for a total of $79 when the tests aren’t on sale. When they are on sale, it’s less. You never know which match is going to break down that brick wall, and it would be a shame to miss it because you transferred rather than retested.
    • If you test at Family Tree DNA, transfer your results to MyHeritage for free. Matching and ethnicity is free with a transfer to MyHeritage, but you won’t receive the full potential benefit of tree matching with other testers without a subscription.
    • If you test at MyHeritage, transfer your results to Family Tree DNA for free.
    • If you test at 23andMe and have the V3 or V4 test, transfer to both Family Tree DNA and MyHeritage. If you have the 23andMe V5 test, retest at Family Tree DNA and transfer those results to MyHeritage and GedMatch.

Have fun!

______________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

Ancestry Step by Step Guide: How to Upload-Download DNA Files

In this Upload-Download Series, we’ll cover each major vendor:

  • How to download raw data files from the vendor
  • How to upload raw data files to the vendor, if possible
  • Other mainstream vendors where you can upload this vendor’s files

Uploading TO Ancestry

This part is easy with Ancestry, because Ancestry doesn’t accept any other vendor’s files. There is no ability to upload TO Ancestry. You have to test with Ancestry if you want results from Ancestry.

Downloading FROM Ancestry

In order to transfer your autosomal DNA file to another testing vendor, or GedMatch, for either matching or ethnicity, you’ll need to first download the file from Ancestry.

Step 1

Sign in to your account at Ancestry and click on the DNA Results Summary link.

Step 2

Click on the Settings gear, at the far upper right hand corner of the summary page, just beneath your Ancestry user ID.

Step 3

Click on the link for “Download Raw DNA Data.”

Step 4

Enter your password and click on “I Understand,” after reading of course.

At that point, the confirm button turns orange – click there.

Step 5

Ancestry will send an e-mail to the e-mail address where you are registered with Ancestry. Check your inbox for that e-mail.

Waiting…waiting.

Still waiting…

If the e-mail doesn’t arrive shortly, check your spam folder. If you’ve changed e-mail addresses, check to be sure your new one is registered with Ancestry. That’s on the same Settings page. If all else fails, request the e-mail again.

Step 6

Ahhh, it’s finally here.

Click on the green “Confirm Data Download” and do not close the window.

Step 7

Next, click on the green “Download DNA Raw Data.”

You’ll see the following confirmation screen.

Step 8

At the bottom of the page, above, if you’re on a PC, you’ll see the typical file download box that asks you if you want to open or save. Save the file as a name you can find later when you want to upload to another site.

The file name will be “dna-data-2018-07-31” where the date is the date you downloaded the file. I would suggest adding the word Ancestry to the front when you save the file on your system.

Most vendors want an unopened zip file, so if you want to open your file, first copy it to another name. Otherwise, you’ll have to download again.

That’s it, you’re done!

Ancestry File Transfers to Other Vendors

Ancestry testing falls into two different categories. V1 tests taken before May of 2016 and V2 tests taken after May 2016. Tests processed during May 2016 could be either version.

The difference between V1 and V2 files is that Ancestry changed the chips they use to test and different DNA positions are tested, resulting in a file of a different format.

If you don’t remember when you tested, make a copy of your Ancestry file using a different name, like, “Opened Ancestry file 7-31-2018.” Then just click to open the zip file.

The first four rows of the file will say something like this:

#AncestryDNA raw data download
#This file was generated by AncestryDNA at: 08/11/2017 07:23:49 UTC
#Data was collected using AncestryDNA array version: V1.0
#Data is formatted using AncestryDNA converter version: V1.0

This is a version 1 (V1) file.

A version 2 file will say V2.0.

Your upload results to other vendors’ sites will vary in terms of both matching and ethnicity accuracy based on your Ancestry version number, as follows:

From below to >>>>>>>>>>> Family Tree DNA Accepts ** MyHeritage Accepts*** 23andMe Accepts* GedMatch Accepts ****
Ancestry before May 2016 (V1) Yes, fully compatible Yes, fully compatible No Yes
Ancestry after May 2016 (V2) Yes, partly compatible Yes, fully compatible No Yes

*Note that 23andMe earlier in 2018 allowed a one-time transfer from Ancestry, but people who transferred results did not receive matches from 23andMe.

**Note that the transfer to Family Tree DNA and matching is free, but advanced tools including the chromosome browser and ethnicity require a $19 unlock fee. That fee is less expensive than retesting, but V2 customers should consider retesting to obtain fully compatible matching and ethnicity results. V2 tests typically receive only the closest 20-25% of matches they would receive if they tested directly at Family Tree DNA.

***MyHeritage utilizes a technique known as imputation to achieve compatibility between different vendors files. The transfer and tools are free, but without a subscription you can’t fully utilize all of the MyHeritage benefits available.

****I’m not sure exactly how GedMatch compensates for the V1 versus V2 differences, but they can handle both data file types. Most people don’t take both tests, but I was conducting an experiment and have uploaded both V1 and V2 tests.

A quick survey of GedMatch matches to my Ancestry V1 and Ancestry V2 kits shows that of my first 249 (125 V2, 124 V1) matches, I have 3 V1 tests that don’t have a corresponding match to a person on the V2 kit, and 5 V2 kits that don’t have a corresponding V1 kit match. That’s roughly a 6% nonmatch rate between Ancestry V1 and V2 kits. I would presume that as the genealogical and genetic distance increases with more distant matches, so would the percentage of non-matches because the segment size is smaller with more distant matches, so there is less matching DNA to have the opportunity to match in the first place.

Testing and Transfer Strategy

My recommendation, if you test at Ancestry, is to transfer your V1 results to MyHeritage, Family Tree DNA and GedMatch.

An Ancestry V1 test is entirely compatible at Family Tree DNA, but with a V2 test, because the testing platform that Ancestry uses is only about 20-25% compatible with the Family Tree DNA test, you’ll only receive your closest 20-25% matches. Family Tree DNA can’t match on those smaller segments if you don’t test on a compatible platform, so please do.

If you have Ancestry V2 results, transfer to MyHeritage and GedMatch but retest at Family Tree DNA. The cost difference at Family Tree DNA between the $19 unlock and a new Family Finder test is $60, for a total of $79 when the tests aren’t on sale. When they are on sale, it’s less. Right now, the tests are only $59.

You never know which match is going to break down that brick wall, and it would be a shame to miss it because you transferred rather than retested.

Matching and ethnicity is free with a transfer to MyHeritage, but you won’t receive the full potential benefit of SmartMatching without a subscription, as free trees are limited to 250 people and genealogical records aren’t included without a subscription. My subscription has been well worth the $.

______________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

Concepts: Anonymized Versus Pseudonymized Data and Your Genetic Privacy

Until recently, when people (often relatives) expressed concerns about DNA testing, genetic genealogy buffs would explain that the tester could remain anonymous, and that their test could be registered under another name; ours, for example.

This means, of course, that since our relative is testing for OUR genealogy addiction, er…hobby, that we would take care of those pesky inquiries and everything else. Not only would they not be bothered, but their identity would never be known to anyone other than us.

Let’s dissect that statement, because in some cases, it’s still partially true – but in other cases, anonymity in DNA testing is no longer possible.

You certainly CAN put your name on someone else’s kit and manage their account for them. There are a variety of ways to accomplish this, depending on the testing vendor you select.

If the DNA testing is either Y or mitochondrial DNA, it’s extremely UNLIKELY, if not impossible, that their Y or mitochondrial DNA is going to uniquely identify them as an individual.

Y and mitochondrial DNA is extremely useful in identifying someone as having descended from an ancestor, or not, but it (probably) won’t identify the tester’s identity to any matching person – at least not without additional information.

If you need a brush-up on the different kinds of DNA and how they can be used for genealogy, please read 4 Kinds of DNA for Genetic Genealogy.

Y and mitochondrial DNA can be used to rule in or rule out specific descendant relationships. In other words, you can unquestionably tell for sure that you are NOT related through a specific line. Conversely, you can sometimes confirm that you are most likely related to someone you match through the direct Y (patrilineal) line for males, and matrilineal mitochondrial line for both males and females. That match could be very distant in time, meaning many generations – even hundreds or thousands of years ago.

However, autosomal DNA, which tests a subset of all of your DNA for the genealogical goal of matching to cousins and confirming ancestors is another matter entirely. Some of the information you discern from autosomal testing includes how closely you match, which effectively predicts a range of relationships to your match.

These matches are much more recent in time and do not reach back into the distant past. The more closely you are related, the more DNA you share, which means that your DNA is identifying your location in the family tree, regardless of the name you put on the test itself.

Now, let’s look at the difference between anonymization and pseudonymization.

It may seem trivial, but it isn’t.

Anonymization vs Pseudonymization

Recently, as a result of the European Union GDPR (General Data Protection Regulation,) we’ve heard a lot about privacy and pseudonymization, which is not the same as anonymized data.

Anonymized data must be entirely stripped of any identifiable information, making it impossible to derive insights on a discreet individual, even by the person or entity who performed the anonymization. In other words, anonymization cannot be reversed under any circumstances.

Given that the purpose of genetic genealogy conflicts with the concept of anonymization, the term pseudonymization is more properly applied to the situation where someone masks or replaces the name of the tester with the goal of hiding the identity of the person who is actually taking the test.

Pseudonymization under GDPR (Article 4(5)) is defined as “the processing of personal data in such a way that the data can no longer be attributed to a specific data subject without the use of ‘additional information.’”

In reality, pseudonymization is what has been occurring all along, because the tester could always be re-identified by you.

However, and this important, neither anonymization or pseudonymization can be guaranteed to disguise your identity anymore.

Anonymous Isn’t Anonymous Anymore

The situation with autosomal DNA and the expectation of anonymity has changed rather gradually over the past few years, but with tidal wave force recently with the coming-of-age of two related techniques:

  • The increasingly routine identification of biological parents
  • The Buckskin Girl and Golden State Killer cases in which a victim and suspect were identified in April 2018, respectively, by the same methodology used to identify biological parents

Therefore, with autosomal DNA results, meaning the raw data results file ONLY, neither total anonymity or any expectation of pseudonymization is reasonable or possible.

Why?

The reason is very simple.

The size of the data bases of the combined mainstream vendors has reached the point where it’s unusual, at least for US testers, to not have a reasonably close match with a relative that you did not personally test – meaning third cousin or closer. Using a variety of tools, including in-common-with matches and trees, it’s possible to discern or narrow down candidates to be either a biological parent, a crime victim or a suspect.

In essence, the only real difference between genetic genealogy searching, parent searches and victim/suspect searches is motivation. The underlying technique is exactly the same with only a few details that differ based on the goal.

You can read about the process used to identify the Golden State Killer here, and just a few days later, a second case, the Cook/Van Cuylenborg double homicide cold case in Snohomish County, Washington was solved utilizing the following family tree of the suspect whose DNA was utilized and matched the blue and pink cousins.

Provided by the Snohomish County Sheriff

A genealogist discovering those same matches, of course, would be focused on the common ancestors, not contemporary people or generations.

To identify present day individuals, meaning parents, victims or suspects, the researcher identifies the common ancestor and works their way forward in time. The genealogist, on the other hands, is focused on working backwards in time.

All three types of processes, genealogical, parent identification and law enforcement depend on identifying cousins that lead us to common ancestors.

At that point, the only question is whether we continue working backwards (genealogically) or begin working forwards in time from the common ancestors for either parent identification or law enforcement.

Given that the suspect’s or victim’s name or identifying information is not known, their DNA alone, in combination with the DNA of their matches can identify them uniquely (unless they are an identical twin,) or closely enough that targeted testing or non-genetic information will confirm the identification.

Sometimes, people newly testing discover that a parent, sibling or half sibling genetic match is just waiting for them and absolutely no analysis is necessary. You can read about the discovery of the identity of my brother’s biological family here and here.

Therefore, we cannot represent to Uncle Henry, especially when discussing autosomal DNA testing, that he can test and remain anonymous. He can’t. If there is a family secret, known or unknown to Uncle Henry, it’s likely to be exposed utilizing autosomal DNA and may be exposed utilizing either Y or mitochondrial DNA testing.

For the genealogist, this may cause Pavlovian drooling, but Uncle Henry may not be nearly so enthralled.

In Summary

Genealogical methods developed to identify currently living individuals has obsoleted the concept of genetic anonymity. You can see in the pedigree chart example below how the same match, in yellow, can lead to solving any of the three different scenarios we’ve discussed.

Click to enlarge any graphic

If the tester is Uncle Henry, you might discover that his parents weren’t his parents. You also might discover who his real parents were, when your intention was only to confirm your common great-grandparents. So much for that idea.

A match between Henry and a second cousin, in our example above, can also identify someone involved in a law enforcement situation – although today those very few and far between. Testing for law enforcement purposes is prohibited according to the terms and conditions of all 4 major testing vendors; Ancestry, 23andMe, Family Tree DNA and MyHeritage.

Currently law enforcement kits to identify either victims or suspects can be uploaded at GedMatch but only for violent crimes identified as either homicide or sexual assault, per their terms and conditions.

Furthermore, both 23andMe and Ancestry who previously reserved the right to anonymize your genetic information and sell or otherwise utilize that information in aggregated format no longer can do so under the new GDPR legislation without your specific consent. GDPR, while a huge pain in the behind for other reasons has returned the control of the consumer’s DNA to the consumer in these cases.

The loss of anonymity is the inevitable result of this industry maturing. That’s good news for genetic genealogy. It means we now have lots of matches – sometimes more than we can keep up with!

Because of those matches, we know that if we test our DNA, or that of a family member, our DNA plus the common DNA shared with many of our relatives is enough to identify us, or them. That’s not news to genealogists, but it might be to Uncle Henry, so don’t tell him that he can be anonymous anymore.

You can pseudonymize accounts to some extent by masking Uncle Henry’s name or using your name. Managing accounts for the same reasons of convenience that you always did is just fine! We just need to explain the current privacy situation to Uncle Henry when asking permission to test or to upload his raw data file to GedMatch (or anyplace else,) because ultimately, Uncle Henry’s DNA leads to Uncle Henry, no matter whose name is on the account.

______________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

DNAPainter – Mining Vendor Matches to Paint Your Chromosomes

This isn’t quite the same as when my mother used to talk about painting the town, but in genetic genealogy terms, it’s better.

This is the second of 4 articles that will describe how to use DNA Painter.

Today, I’d like to talk about how I utilize the various vendor testing tools combined with DNAPainter to “mine my DNA,” or better put, to mine my ancestor’s DNA which is now mine, pun intended.

To review instructions for how to set up and use the DNA Painter tool, please read DNA Painter – Chromosome Sudoku for Genetic Genealogy Addicts and then come back here to proceed.

I’m going to discuss each vendor’s tools and how I’ve used them, sometimes in combination.

57% Painted

Please note that you can click on any image to enlarge

Is this not a beautiful thing to behold? That’s my ancestors, in loving color, looking back at me, on MY chromosomes.

I’m completely thrilled that I have managed to paint 57% of my chromosomes. I’m a visual person, and while I’ve worked with spreadsheets now for years, I’ve officially abandoned them. Ok, mostly.

Yes, you heard me right – I’ve abandoned the spreadsheets in favor of DNA Painter, at least for segments where I can positively identify an ancestral couple. In other words, those segments that can be reliably mapped.

That 57% is made up of 445 segments in total, split between my maternal and paternal sides. That’s without counting my mother’s DNA. While I do utilize matching to my mother in order to be sure that a match is really a valid match, I didn’t paint her DNA. Obviously, I’m going to match her 100%, and DNA painter already breaks chromosomes into my pink maternal and blue paternal sides.

Key Elements

  1. The single best thing you can do in order to paint your chromosomes is to have known family members and cousins test. You can then paint their DNA that matches yours, attributing it to their identified family line.
  2. The second best thing you can do is to work with your matches using their trees to identify your common ancestor.

Now, you’re ready to begin painting.

I’m going to step through the process I used at each vendor to identify paintable segments.

I did not paint segments that I could not identify to an ancestral line, except for my endogamous Acadian line which I labeled simply as Acadian to mark those segments that I can identify as Acadian, but I can’t identify a specific ancestor, or ancestors. When I can identify the Acadian ancestor, I paint that segment using the ancestors’ names.

Family Tree DNA

At Family Tree DNA, I begin with my closest matches that are not immediate family – meaning not my parents, children or grandchildren. I’m looking for aunts, uncles, cousins, etc. I don’t paint siblings, but often half siblings are extremely useful because they can help you identify which paternal side other matches are related to.

In the first DNA Painter article, I explained how to utilize the Family Tree DNA chromosome browser to select an individual whose matching DNA can be displayed so that you can copy and paste that segment into the painting feature of DNA Painter.

On your results page, your “bucketed individuals” who have been assigned as maternal (pink icon above) or paternal (blue icon not shown) can be a huge clue when used in conjunction with the in-common-with (ICW) tool and the matrix.

You can also search by ancestral surname and then evaluate each match through common surnames, trees and other resources. If you’re not familiar with how to use the tools at Family Tree DNA, here’s a quick run-through.

Select the individual whose DNA you wish to paint, view in the chromosome browser, then copy and paste from the grid below to the DNAPainter tool.

I painted the matching DNA of all the people whose common ancestor with me I could positively identify before moving on to the next vendor.

Who Have I Painted?

As you begin to paint segments from multiple vendors, you may wonder if you’re finding duplicates. It’s easy to tell. At DNA Painter, click on “All segment data,” below the legend in the bottom right corner.

This displays the entire list of matches whose DNA you have painted, in spreadsheet format. You can sort by match name or simply do a browser search. (CTRL+F)

You can also download this data into a cvs (Excel compatible) file at the top left of this page.

Avoiding Duplicates

As you view and paint your matches at the various vendors, you may discover that you have already found a match with that person at another vendor, either because they tested there or uploaded their autosomal file. When possible, avoid duplicate painting. It won’t help anything and will just clutter your chromosomes. You may not always be able to identify a match as a duplicate, especially if the tester utilizes a pseudonym at various locations. Don’t’ worry though, because you can always easily delete it later and a duplicate person/segment certainly won’t hurt anything.

Ok, now to our next vendor! Let’s find more segments to paint.

MyHeritage

At MyHeritage, click on DNA matches.

At the right of the search box, fly over the little pink key (or funnel) looking thing and you’ll see the option for “Has Smart Matches.” That’s what you’re looking for.

Click on the key icon.

Smart Matches mean that your DNA matches and you have a common ancestor in your trees. Click on the purple button to review this DNA match.

For each match, scroll all the way down to the bottom where your matching chromosome segments will be colored.

At the right, above the chromosome browser, click on “advanced options” which will allow you to select “download shared DNA info.” You need to download to your system so that you can copy and paste the matching segment information to DNA Painter.

MyHeritage has a few more columns than necessary, and DNA Painter can’t utilize them. Delete the columns for Name, Match Name, RSID beginning and end, and also eliminate SNPs due to an overestimation issue. In many cases, the SNPs at MyHeritage are twice or more than the number of SNPs when comparing the same segment at other vendors.

Now that your segment is cleaned up, copy the entire group shown above, minus the yellow columns which you’ve deleted, and paste into the DNA Painter spreadsheet.

MyHeritage has recently added a triangulation feature, shown at the far right, below, indicating that these two people individually triangulate with me and Alberta. The icon at far right of “5th cousin” indicates triangulation.

By clicking on the triangulation icon, you then see how that person triangulates with both your match and you – in this case, me, Alberta, and Chandler.

You may choose to paint triangulated segments, BUT, the size of the triangulated segment is often going to be smaller than the amount of DNA than you match individually to either one or both people.

In the example above, you can see that you match the pink person on a significantly longer segment than you match the tan person. The amount of DNA where you match both the pink and tan person is smaller yet, because the area where you match the tan person extends beyond where you match the pink person and vice versa. If you were going to paint ONLY the triangulated segments, you would paint only the portion that is both pink and tan, “boxed” above.

I don’t recommend painting ONLY triangulated segments, because you’ll be depriving yourself of the ability for each person to match others on the portions of the segments on which they match you, but not the other person in question.

In this example, utilizing DNA Painter, you’ll see that people in fact match you AND the pink person on several segments. The segment shown in pink, at MyHeritage, above, is shown on chromosome 5 in DNA Painter as the long mustard colored segment. Look at how many people match you on that segment. This is why we don’t paint only the triangulated portions of the chromosome. That long mustard segment match will triangulate with many people on smaller portions of that mustard segment, as evidenced by the yellow, grey, blue, cinnamon, purple and red segment matches..

DNA Painter helps you triangulate, so there is no reason to restrict your painting to triangulated segments.

Triangulation is a great tool, but don’t mix triangulated segments with matching segments in the same profile, at least not until you get the hang of the tool and using the multiple vendor’s results.

23andMe

Unfortunately, 23andMe doesn’t have tools like tree matching (MyHeritage) or maternal/paternal phasing (Family Tree DNA,) but they do allow testers to enter common surnames.

Looking at closer matches, meaning first, second or third cousins, if they list even a few surnames, you may well be able to identify the common genealogical line, especially in conjunction with ancestral locations and the other people you match in common.

Sometimes you can glean enough information to identify your common ancestor. In this case, even if I didn’t know Cheryl, the surname would have identified the ancestor. If that didn’t do it, the “in common” list below would!

Once you’ve identified the common ancestor and decide you’re ready to paint, click on the Tools tab at the top of your page and select DNA Relatives.

On the DNA Relatives tab, click on the relative whose DNA you wish to paint. I’m selecting my cousin, Cheryl.

Click on the blue DNA Comparison, in the upper right hand corner.

On the comparison screen, you will select yourself as one person and Cheryl as the other.

At the top you’ll see the two individuals and their overlapping segments painted onto chromosomes. Scroll down and you’ll see the segment detail, below.

Highlight the rows (they’ll turn blue, like above) and right click to copy the segment information.

The next step is to drop the results into a spreadsheet, just long enough to delete the first and last columns, shown in red below, then copy the remaining rows and paste into the DNA Painter tool.

Mining Ancestry Data at GedMatch

GedMatch is somewhat of a special case, because GedMatch doesn’t do DNA testing, but provides an open sharing platform by facilitating uploads of raw autosomal files from multiple other vendors. Therefore, anyone with results at GedMatch tested elsewhere. If you tested at all of the other vendors, it’s probable that you find people at GedMatch as a match that match you at other vendors too.

Because 23andMe does not support the uploading of Gedcom files, if your match has uploaded a Gedcom file to GedMatch, or connected to Geni or WikiTree, then you may be able to identify your common ancestor at GedMatch that you were not able to identify at 23andMe.

Conversely, if you match at Ancestry, you won’t be able to paint from Ancestry, because Ancestry does not provide segment information. We will talk about Ancestry as a special case next, but for now, let’s focus on how to utilize GedMatch.

At GedMatch, you’ll work in steps after setting your account up and uploading your raw data file from either:

If you tested elsewhere, or after August of 2017 at 23andMe, you will have to upload to a special section called GedMatch Genesis. GedMatch Genesis provides a sandbox area for files other than the ones listed above that are generally incompatible with those files and with each other. Genesis files often have few SNP locations in common and not enough to match reliably.

I do not recommend DNA painting utilizing segments from GedMatch Genesis.

GedMatch is currently merging their regular GedMatch service with the Genesis service, so I’m not entirely clear how you will tell the difference between the kits known to match reliably, mentioned above, and others after the merge.

Currently, kits with T prefix (Family Tree DNA), A (Ancestry) and M (23andMe) show version levels in the type field when you match in regular GedMatch. MyHeritage kits are processed by the Family Tree DNA lab. G kits used a generic upload, so you can’t tell where they originated.

Kits uploaded in the Genesis sandbox seem to be assigned double alpha letter kit prefixes at random. Genesis includes a “Testing Company” field which does not include a version number. Today, just stay with the regular GedMatch one-to many and one-to-one matching for DNA Painter.

First, you’ll want to perform a one-to-many match.

This page shows your closest 2000 results. In my case, truncating my matches at 12.7cM. This means if I want to see my results below 12.7 cM, I must subscribe to the Tier 1 Utilities in order to be able to display over 2000 matches.

We’ll discuss how to utilize Tier 1 matching in the Ancestry portion, next, but for now, we’ll just be working with the regular one-to-many matches report.

Of course, trusty cousin Cheryl has results here as well.

In order to compare Cheryl’s results to my own, I need to do two separate things:

  • Click on the A link under the Autosomal Details column (above) and/or
  • Click on the X link under the X DNA column

These two results, both of which are paintable, do not display together so must be selected separately.

By clicking on the A or X, GedMatch will display a one-to-one comparison. I leave this page (below) at the default values and simply click submit.

Your next screen will be a match grid.

Once again, select and copy the results, then paste into DNA Painter. If you also have an X match with this individual, return to the one-to-many match page and then click on the X link to repeat the same process for the X chromosome.

Ancestry Through GedMatch

As far as I’m concerned, the best thing about Ancestry matches is DNA shared ancestor hints (SAH) – meaning those green leaves visible near the green “view match” button which indicate that you share both DNA and a common ancestor(s) in your trees.

Followed immediately by the worst thing which is that Ancestry provides no segment data. However, pairing Ancestry with GedMatch can provide you with some segment information, although you do have to dig. That digging was certainly worthwhile for me, as I found several readily identifiable matches.

When I find a green leaf shared ancestor hint at Ancestry, I record as much information about that match as I can in a spreadsheet. The reason is twofold.

  • Ancestry hints tend to come and go, rather inexplicable, and I want to have that information someplace besides at Ancestry
  • I want to be able to view how many matches I have through specific ancestors which I can do in a spreadsheet by sorting.
  • I want to be able to mine GedMatch for segment information for people at Ancestry who have uploaded to GedMatch.

Note the RJE V2 results, a 6th cousin who I match at 6.6 cM, as we’ll be using that at GedMatch.

I maintain several columns in my Ancestry Match spreadsheet, as shown above. I track people who might be good Y or mitochondrial DNA candidates, as well as GedMatch numbers or other useful information.

I don’t utilize segments smaller than 7 cM for DNA Painter, BUT, Ancestry almost always under-reports the matching segment size due to their internal process which removes some segments that do match. Therefore, I search for all Ancestry matches in GedMatch and paint them if they are 7cM or over at GedMatch. You will match at Ancestry down to 6 cM. Since 7cM is the default GedMatch threshold, that works out well. I don’t find them if they are under 7cM at GedMatch, and I don’t care.

In my case to obtain segments smaller than 12.7 cM, because that is the cutoff where the free one-to-many GedMatch tool reaches the 2000 match threshold (for me,) I need to utilize the Tier 1 subscription utilities which are well worth every dollar.

The one-to-many match looks quite different for the Tier 1 tool.

You’ll need to play with this a bit to determine how high you need to set the limit to see all of your 7cM matches. In my case, I had to set it to 20,000.

I utilize two monitors, so I display my Ancestry spreadsheet on the first monitor and the GedMatch one-to-many match table on the second monitor.

Then, utilizing the browser’s search function, I search for any identifiable portion of the information for the Ancestry match at GedMatch.

In the first example, the user’s name is RJE V2. I search at GedMatch for “RJE” using “ctrl+F” which is the browser’s find function.

You can see that the search found a total of 3 different “RJE” entries. Looking at the first 2, you can see that one is labeled V4 and one is labeled V2. Typically, I would look at this and decide that the RJE V2 is the right match based on the user name at Ancestry.

However, look closer.

The RJE V2 at GedMatch has a much higher amount of shared DNA at 3587.1 cM total than the RJE V2 at Ancestry with a total of 6.6 cM. Clearly, this is not the same person, even though the user name is the same.

For all we know, a different person may have used the same user name, which is clearly an alias, noted by the “*”. Or the same person may have multiple kits at GedMatch.

However, in this case, the RJE V2 is not the same match.

However, let’s say that it is the same person and we’ve been able to reasonably identify the match. In order to compare one-to-one, click on the highlighted blue “largest segment” in the autosomal category, shown below.

If you want to compare the X one-to-one, click on the blue largest segment in that column.

From this point, the matching will look the same as the one-to-one GedMatch matching shown in the previous section – so copy and paste as normal.

While this certainly isn’t the most effective way of working with Ancestry matches, it’s really the only hope we have, unless your match has also uploaded to either Family Tree DNA or MyHeritage.

However, in my experience, I generally stand a better chance of identifying Ancestry matches at GedMatch because their user name or the user name of the person managing their account can be found much more readily. People sometimes tend to utilize the same abbreviations, names or nicknames in multiple locations.

Summary

While each vendor has unique strengths and weaknesses today, and GedMatch provides a platform used by some but not all, the best way to effectively paint your chromosomes is to utilize all of the tools available, and sometimes together. I strongly suggest that you test at or upload to each vendor, because you will find matches at each vendor that aren’t elsewhere.

How many segments can you paint on your chromosomes, and what will those segments tell you?

In the next article, I’ll be walking through my chromosome painting gallery to take a look at the hidden messages there! I hope you’ll come along so you can find some hidden messages of your own.

Enjoy!

______________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

DNA Painter – Chromosome Sudoku for Genetic Genealogy Addicts

Not long ago, Jonny Perl introduced the free online tool, DNA Painter, designed to paint your chromosomes. I didn’t get around to trying this right away, but had I realized just how much fun I would have, I would have started sooner.

Fittingly, Jonny, pictured above, won the RootsTech Innovation award this year for DNA Painter – and I must say, it’s quite well-deserved.

Congratulations Jonny!

  • This is the first of four articles about DNA Painter. In this article, we’ll talk about how to use the tool, and how to get started.
  • The second article talks about mining your matches at the various vendors for paintable segments with instructions for how to do that accurately with each vendor.
  • In the third article, we’ll walk through an analysis of my painted segments, so you can too – and know how to spot revelations.
  • The fourth article explains how I solved a long-standing mystery that was driving me crazy. If you have a relatively close mystery person in your DNA match list that you can’t figure out quite where they fit, this article is written just for you!

I’ll tell you right now, I haven’t had this much fun in a long time!

Want to hear the best part? You don’t have to triangulate. DNA painting is “self-triangulating.” Yes, really!

Let’s get started!

Introducing DNA Painter

To begin to use DNA Painter, you’ll need to set up a free account at www.dnapainter.com.

Read the instructions and create your profile.

Jonny provides an overview.  Don’t get so excited that you skip this, or you won’t know how to paint correctly. You don’t need to be Picasso, but taking a few minutes up front will save you mistakes and frustration later.

Blaine Bettinger recorded a YouTube video discussing how to use DNA Painter to paint your chromosomes to identify and attribute particular segments to specific ancestors. It includes a mini-lesson on chromosome matching.

I strongly suggest you take time to watch Blaine’s video from the beginning. For some reason, this link drops into the video near the end, but just slide the red bar back to the beginning.

Get Started

Here’s my blank, naked chromosomes. Notice for every chromosome, you see a blue paternal “half” and a pink maternal “half.” That’s because everyone gets half of their autosomal DNA from their father, and the other half from their mother.

Looking at my own chromosome painting today, below, it’s incredibly exciting for me to see 57% of my DNA painted, attributed to 77 couples and one endogamous group, Acadians. This took me a month or so working off and on.

At the end of the day, this is often how I rewarded myself! The only problem it that it has been difficult to go to bed.

Comparatively, I’ve been working on my DNA match spreadsheet, attributing segments to ancestors now for 5 or 6 years, and I’ve never been able to see this information visually like this before. This view of my ancestrally painted chromosomes is so rewarding!

Who To Map

DNA Painter is not the kind of tool where you upload your results, it’s a tool where you selectively paint specific segments of matches – meaning segments on which you match particular people with known common ancestors.

How do you know who is a good candidate to map?

I began with painting my closest matches with whom I could identify the common ancestor.

Not only will painting your largest matches be rewarding as you harvest low-hanging-fruit, it will help you determine if you actually have identified the correct DNA for later matches being attributed to a specific genealogical line. In other words, mapping these larger known segments will help you identify false positives when you have no other yardstick.

Your First Painting

I’m opening a new profile in DNA Painter to demonstrate the steps in painting along with hints that I’ve learned along the way.

I’m going to utilize my cousin, Cheryl, whom I match closely at Family Tree DNA. If you don’t know how to use the Family Tree DNA autosomal tools, click here.

Cheryl is my first cousin once removed, so we share a significant amount of DNA.

I’ve selected Cheryl on my match list, checked her match box, and then clicked on the Chromosome Browser in order to view our segment matching information.

You can see on the chromosome browser that I share quite a bit of DNA with Cheryl.

At the top of the chromosome browser, click on “View this data in a table.”

Highlight and copy all of the segments for Cheryl. I only use 7cM segments or higher at DNA Painter, so you don’t have to copy the data in the rows below your last match at that level. DNA Painter takes care of stripping out all the extraneous stuff.

Paint a New Match

At DNA Painter, after you have your profile set up, click on “Paint a New Match.”

Simply paste the segment data into the box in the window that pops up. DNA Painter takes care of removing the header information as well as segments that are too small.

You can click on “overlay these segments” to “test” a fit, but I haven’t really found a good use for that, because I’m only painting segments I’m confident about and I know which side, maternal or paternal, the match is on based on the known relative.

Click on “save match now” in the bottom right corner.

In the Save Match popup, shown above, I utilize the fields as follows.

I enter the name of my DNA match, followed by their relationship to me, followed by the source of the match. In this case, “Cheryl <lastname>, 1C1R, FTDNA”

In the “Segment/Match Notes” I list how the match descends from the common ancestral couple, a GedMatch ID if known, and anything else pertinent including other potential ancestral lines in common. This means that I list every generation beginning with the common ancestral couple and ending with the tester.

Hiram Ferverda and Eva Miller, Roscoe, Cheryl, GedMatch Txxxxxx

You’ll wind up eventually rethinking some of your segment assignments to particular ancestors and you’ll want as much information here about this match as possible.

Moving to the next field, in the “Ancestors Name,” I utilize the couples name, because at this point, you can’t tell which of the two people actually contributed the DNA segment, or if part is from one ancestor of the couple and part is from the other. If the male ancestor is a Sr. or Jr., or is otherwise difficult to tell apart from your other ancestors, I suggest entering a birth year by his name. This is your selection list for later painting segments from the same ancestor, so you want to be sure you can tell the generations apart.

Next, you’ll select the maternal or paternal side of your family. Change the color if you don’t like the one pre-selected to assign to segments descending from that couple. Originally, I was going to have pinks or light colors for maternal, and blues or darker for paternal, but I quickly discovered that scheme didn’t work well, and I had more ancestors than I could ever have imagined whose DNA I am be able to map and paint.

Therefore, pick contrasting colors. You can use each color on each half, meaning maternal and paternal, since the segments will be painted on different halves of the chromosome.

In the “Notes for This Group,” I add more information for the couple such as birth and death dates and location if I know or am likely to forget.

Click “save.”

Here you go!  Isn’t this fun!!!! Cheryl’s segments that match mine are painted onto my chromosomes!

At the right, your ancestor key appears with each ancestor to whom you’ve assigned a color key.

So far, I only have one!

Want to paint another group of segments?

Let’s paint Cheryl’s brother.

Following the same sequence, I paint Donald’s DNA, but this time, I select “Or link these segments to an ancestor I’ve added before.”

I select Hiram Ferverda, Eva Miller and save. The segments that I have in common with Cheryl and/or Don will now be displayed on each chromosome.

Looking at chromosome 1, you can see that I match Cheryl and Don on the same segment at the beginning of the chromosome, but received two different segments of DNA on a different portion of chromosome 1, further to the right.

As one last example, I added the DNA from two known cousins, Rex and Maxine, who descend a couple generations further back in time through more distant ancestors in the same line – one maternal and one paternal.

Click on the chromosome number to expand to see all of the painted segments

You can see, looking at chromosome 3 that Cheryl and Don match me on a significant amount of the same large pink segment plus a smaller pink segment at the end

Rex (yellow) and Maxine (blue) both match me on different parts of the chromosome. It looks like there is a small amount of overlap between Rex and Maxine which is certainly feasible, because Jacob Lentz, the ancestor that Maxine descends from is ancestral to the couple that Rex descends from.

By utilizing known matches, and mapping, we can see segments that move us back in time, telling us from which ancestor that portion of the segment descends.

For example, if the blue segment was directly aligned with one of the pink segments, then we would know that the blue portion of the pink segment descended from Jacob Lentz and Fredericka Reuhl.

This is the most awesome, extremely addictive game of ancestor Sukoku ever.

Wanna play???

Here’s how to prepare for my next article where we’ll utilize the various vendor matches to begin painting.

Download and Upload Your Autosomal Files

You’ll want to have your DNA at the most vendor locations possible so you can find all your matches that can be attributed to known relatives and ancestors. You never know who is going to test at which vendor, and the only way to find out is to have your DNA there too.

For each vendor, I’ve provided a mini-tutorial on how to maximize your testing and transfers both monetarily and for maximum matching effect, or you can read an article here that explains more.

There’s also a cheat sheet for transfer strategies at the end of this article.

A technique called imputation is mentioned below, so you may want to read about imputation here. MyHeritage’s initial offering utilizing imputation was problem plagued but has since improved significantly.

Ancestry

To Ancestry – There’s no way to transfer files TO Ancestry, so you’ll need to test there to be in their database. You will also need at least a minimum subscription ($49) to utilize all of the Ancestry DNA features. You can see a with and without subscription feature comparison chart here.

From Ancestry – There is also no chromosome browser at Ancestry. In order to use DNA Painter, chromosome segment information is required, so if you test at Ancestry and want to paint your segments, you’ll need to download your DNA file to either or all of:

  • Family Tree DNA – partially compatible with the current Ancestry test chip format – uses imputation to infer additional genetic regions
  • MyHeritage – partially compatible, but uses imputation to infer additional genetic regions
  • GedMatch

Family Tree DNA

To Family Tree DNA – You can upload the following vendor files TO Family Tree DNA.  Matching is free, but to use the advanced tools, including ethnicity and the chromosome browser, you’ll need to pay the $19 unlock fee. That’s still significantly less than retesting, especially for files that are 100% compatible.

  • Ancestry – V1 files generated from before May 2016 are entirely compatible, V2 files from after May 2016 are partially compatible, providing between 20-25% of your matches, meaning your closest matches
  • 23andMe – V3 file from Dec 2010-Nov 2013 and V4 file from November 2013-August 2017 are compatible, the V5 platform file beginning in August 2017 is not compatible
  • MyHeritage – fully compatible

From Family Tree DNA – You can upload your Family Finder results to:

MyHeritage

To MyHeritage – You can upload the following files to MyHeritage:

  • Family Tree DNA – fully compatible
  • Ancestry – partially compatible but uses imputation to infer additional genetic regions
  • 23andMe – partially compatible but uses imputation to infer additional genetic regions

From MyHeritage – If you test at MyHeritage, you can upload your files to:

23andMe

To 23andMe – You cannot transfer TO 23andMe, so you’ll need to test there if you want to be in their database.

From 23andMe – If you tested at 23andMe, you can upload your files to the following vendors:

  • Family Tree DNA – V3 file from Dec 2010-Nov 2013 and V4 file from November 2013-August 2017 are compatible, the V5 chip beginning in August 2017 is not compatible
  • MyHeritage – 23andMe – partially compatible but uses imputation to infer additional genetic regions
  • GedMatch – V3 file from Dec 2010-Nov 2013 and V4 file from November 2013-August 2017 are compatible, the V5 chip beginning in August 2017 is only compatible in the Genesis sandbox area. V5 matching is not reliable. Files from other vendors are recommended for GedMatch unless you are matching against another V5 result.

GedMatch

GedMatch is a third-party site that accepts all of these vendors’ autosomal files, with a caveat that the 23andMe V5 kit matches very poorly and requires special handling. I don’t recommend using that kit at GedMatch unless you are matching against other 23andMe V5 kits.

I upload multiple kits to GedMatch and mark all but one for research only. This allows me to use my Ancestry kit to match with other Ancestry users for more accurate matches, my Family Tree DNA kit to other Family Tree DNA kits, and so forth. Not marking multiple kits for research means that you’ll appear more than once on other people’s match lists, and only your first 2000 matches are free. Marking all kits except one as research is a courtesy to others.

Recommended Testing Strategy for New Testers

  1. Test at Ancestry and download to GedMatch.
  2. Test at Family Tree DNA and upload to MyHeritage and GedMatch.
  3. Test at 23andMe and upload to GedMatch Genesis.
  4. At GedMatch, mark all except one kit as “research,” then utilize your kits from the same vendor for one-to-one comparisons.

Recommended Transfer Strategy

Of course, where you have, and haven’t already tested will impact your transfer strategy decision. I’ve prepared the following cheat sheet to be used in combination with the information discussed above.

*Unless you can transfer a 23andMe V3/V4 or an Ancestry V1 kit to Family Tree DNA, it’s better to test at Family Tree DNA. Ancestry V2 tests are only 20-25% compatible.

A transfer from Family Tree DNA to MyHeritage is best because those vendors are on the same platform and the tools at MyHeritage are free.

In my next article, we’ll discuss how to mine your matches at the various vendors to obtain accurate segments for chromosome painting – including a strategy for how to utilize Ancestry and Gedmatch together to identify at least some Ancestry segment matches.

So, for now, get ready by transferring your matches into whichever data bases they aren’t already in. The only data base where I couldn’t identify matches that I didn’t have elsewhere was at 23andMe. The rest were all there just waiting to be harvested!

_____________________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research