A Triangulation Checklist Born From the Question; “Why NOT Use Close Relatives for Triangulation?”

One of my readers asked why we don’t use close relatives for triangulation.

This is a great question because not using close relatives for triangulation seems counter-intuitive.

I used to ask my kids and eventually my students and customers if they wanted the quick short answer or the longer educational answer.

The short answer is “because close relatives are too close to reliably form the third leg of the triangle.” Since you share so much DNA with close relatives, someone matching you who is identical by chance can also match them for exactly the same reason.

If you trust me and you’re good with that answer, wonderful. But I hope you’ll keep reading because there’s so much to consider, not to mention a few gotchas. I’ll share my methodology, techniques, and workarounds.

We’ll also discuss absolutely wonderful ways to utilize close relatives in the genetic genealogical process – just not for triangulation.

At the end of this article, I’ve provided a working triangulation checklist for you to use when evaluating your matches.

Let’s go!

The Step-by-Step Educational Answer😊

Some people see “evidence” they believe conflicts with the concept that you should not use close relatives for triangulation. I understand that, because I’ve gone down that rathole too, so I’m providing the “educational answer” that explains exactly WHY you should not use close relatives for triangulation – and what you should do.

Of course, we need to answer the question, “Who actually are close relatives?”

I’ll explain the best ways to best utilize close relatives in genetic genealogy, and why some matches are deceptive.

You’ll need to understand the underpinnings of DNA inheritance and also of how the different vendors handle DNA matching behind the scenes.

The purpose of autosomal DNA triangulation is to confirm that a segment is passed down from a particular ancestor to you and a specific set of your matches.

Triangulation, of course, implies 3, so at least three people must all match each other on a reasonably sized portion of the same DNA segment for triangulation to occur.

Matching just one person only provides you with one path to that common ancestor. It’s possible that you match that person due to a different ancestor that you aren’t aware of, or due to chance recombination of DNA.

It’s possible that your or your match inherited part of that DNA from your maternal side and part from your paternal side, meaning that you are matching that other person’s DNA by chance.

I wrote about identical by descent (IBD), which is an accurate genealogically meaningful match, and identical by chance (IBC) which is a false match, in the article Concepts – Identical by…Descent, State, Population and Chance.

I really want you to understand why close relatives really shouldn’t be used for triangulation, and HOW close relative matches should be used, so we’re going to discuss all of the factors that affect and influence this topic – both the obvious and little-understood.

  • Legitimate Matches
  • Inheritance and Triangulation
  • Parental Cross-Matching
  • Parental Phasing
  • Automatic Phasing at FamilyTreeDNA
  • Parental Phasing Caveats
  • Pedigree Collapse
  • Endogamy
  • How Many Identical-by-Chance Matches Will I Have?
  • DNA Doesn’t Skip Generations (Seriously, It Doesn’t)
  • Your Parents Have DNA That You Don’t (And How to Use It)
  • No DNA Match Doesn’t Mean You’re Not Related
  • Imputation
  • Ancestry Issues and Workarounds
  • Testing Close Relatives is VERY Useful – Just Not for Triangulation
  • Triangulated Matches
  • Building Triangulation Evidence – Ingredients and a Recipe
  • Aunts/Uncles
  • Siblings
  • How False Positives Work and How to Avoid Them
  • Distant Cousins Are Best for Triangulation & Here’s Why
  • Where Are We? A Triangulation Checklist for You!
  • The Bottom Line

Don’t worry, these sections are logical and concise. I considered making this into multiple articles, but I really want it in one place for you. I’ve created lots of graphics with examples to help out.

Let’s start by dispelling a myth.

DNA Doesn’t Skip Generations!

Recently, someone emailed to let me know that they had “stopped listening to me” in a presentation when I said that if a match did not also match one of your parents, it was a false match. That person informed me that they had worked on their tree for three years at Ancestry and they have “proof” of DNA skipping generations.

Nope, sorry. That really doesn’t happen, but there are circumstances when a person who doesn’t understand either how DNA works, or how the vendor they are using presents DNA results could misunderstand or misinterpret the results.

You can watch my presentation, RootsTech session, DNA Triangulation: What, Why and How, for free here. I’m thrilled that this session is now being used in courses at two different universities.

DNA really doesn’t skip generations. You CANNOT inherit DNA that your parents didn’t have.

Full stop.

Your children cannot inherit DNA from you that you don’t carry. If you don’t have that DNA, your children and their descendants can’t have it either, at least not from you. They of course do inherit DNA from their other parent.

I think historically, the “skipping generations” commentary was connected to traits. For example, Susie has dimples (or whatever) and so did her maternal grandmother, but her mother did not, so Susie’s dimples were said to have “skipped a generation.” Of course, we don’t know anything about Susie’s other grandparents, if Susie’s parents share ancestors, recessive/dominant genes or even how many genetic locations are involved with the inheritance of “dimples,” but I digress.

DNA skipping generations is a fallacy.

You cannot legitimately match someone that your parent does not, at least not through that parent’s side of the tree.

But here’s the caveat. You can’t match someone one of your parents doesn’t with the rare exception of:

  • Relatively recent pedigree collapse that occurs when you have the same ancestors on both sides of your tree, meaning your parents are related, AND
  • The process of recombination just happened to split and recombine a segment of DNA in segments too small for your match to match your parents individually, but large enough when recombined to match you.

We’ll talk about that more in a minute.

However, the person working with Ancestry trees can’t make this determination because Ancestry doesn’t provide segment information. Ancestry also handles DNA differently than other vendors, which we’ll also discuss shortly.

We’ll review all of this, but let’s start at the beginning and explain how to determine if our matches are legitimate, or not.

Legitimate Matches

Legitimate matches occur when the DNA of your ancestor is passed from that ancestor to their descendants, and eventually to you and a match in an unbroken pathway.

Unbroken means that every ancestor between you and that ancestor carried and then passed on the segment of the ancestor’s DNA that you carry today. The same is true for your match who carries the same segment of DNA from your common ancestor.

False positive matches occur when the DNA of a male and female combine randomly to look like a legitimate match to someone else.

Thankfully, there are ways to tell the difference.

Inheritance and Triangulation

Remember, you inherit two copies of each of your chromosomes 1-22, one copy from your mother and one from your father. You inherit half of the DNA that each parent carries, but it’s mixed together in you so the labs can’t readily tell which nucleotide, A, C, T, or G you received from which parent. I’m showing your maternal and paternal DNA in the graphic below, stacked neatly together in a column – but in reality, it could be AC in one position and CA in the next.

For matching all that matters is the nucleotide that matches your match is present in one of those two locations. In this case, A for your mother’s side and C for your father’s side. If you’re interested, you can read more about that in the article, Hit a Genealogy Home Run Using Your Double-Sided Two-Faced Chromosomes While Avoiding Imposters.

You can see in this example that you inherited all As from your Mom and all Cs from your Dad.

  • A legitimate maternal match would match you on all As on this particular example segment.
  • A legitimate paternal match would match you on all Cs on this particular segment.
  • A false positive match will match you on some random combination of As and Cs that make it look like they match you legitimately, but they don’t.
  • A false positive match will NOT match either your mother or your father.

To be very clear, technically a false positive match DOES match your DNA – but they don’t match your DNA because you share a common ancestor with your match. They match you because random recombination on their side causes you to match each other by chance.

In other words, if part of your DNA came from your Mom’s side and part from your Dad’s but it randomly fell in the correct positional order, you’d still match someone whose DNA was from only their mother or father’s side. That’s exactly the situation shown above and below.

Looking at our example again, it’s evident that your identical by chance (IBC) match’s A locations (1, 3, 5, 7 & 9) will match your Mom. C locations (2, 4, 6 8, & 10) will match your Dad, but the nonmatching segments interleaved in-between that match alternating parents will prevent your match from matching either of your parents. In other words, out of 10 contiguous locations in our example, your IBC match has 5 As alternated with 5 Cs, so they won’t match either of your parents who have 10 As or 10 Cs in a row.

This recombination effect can work in either direction. Either or both matching people’s DNA could be randomly mixed causing them to match each other, but not their parents.

Regardless of whose DNA is zigzagging back and forth between maternal and paternal, the match is not genealogical and does not confirm a common ancestor.

This is exactly why triangulation works and is crucial.

If you legitimately match a third person, shown below, on your maternal side, they will match you, your first legitimate maternal match, and your Mom because they carry all As. But they WON’T match the person who is matching you because they are identical by chance, shown in grey below.

The only person your identical by chance match matches in this group is you because they match you because of the chance recombination of parental DNA.

That third person WILL also match all other legitimate maternal matches on this segment.

In the graphic above, we see that while the grey identical by chance person matches you because of the random combination of As from your mother and Cs from your father, your legitimate maternal matches won’t match your identical by chance match.

This is the first step in identifying false matches.

Parental Cross-Matching

Removing the identical by chance match, and adding in the parents of your legitimate maternal match, we see that your maternal match, above, matches you because you both have all As inherited from one parent, not from a combination of both parents.

We know that because we can see the DNA of both parents of both matches in this example.

The ideal situation occurs when two people match and they have both had their parents tested. We need to see if each person matches the other person’s parents.

We can see that you do NOT match your match’s father and your match does NOT match your father.

You do match your match’s mother and your match does match your mother. I refer to this as Parental Cross-matching.

Your legitimate maternal matches will also match each other and your mother if she is available for testing.

All the people in yellow match each other, while the two parents in gray do not match any of your matches. An entire group of legitimate maternal matches on this segment, no matter how many, will all match each other.

If another person matches you and the other yellow people, you’ll still need to see if you match their parents, because if not, that means they are matching you on all As because their two parents DNA combined just happened, by chance, to contribute an A in all of those positions.

In this last example, your new match, in green, matches you, your legitimate match and both of your mothers, BUT, none of the four yellow people match either of the new match’s parents. You can see that the new green match inherited their As from the DNA of their mother and father both, randomly zigzagging back and forth.

The four yellow matches phase parentally as we just proved with cross matching to parents. The new match at first glance appears to be a legitimate match because they match all of the yellow people – but they aren’t because the yellow people don’t match the green person’s parents.

To tell the difference between legitimate matches and identical by chance matches, you need two things, in order.

  • Parental matching known as parental phasing along with parental cross-matching, if possible, AND
  • Legitimate identical by descent (IBD) triangulated matches

If you have the ability to perform parental matching, called phasing, that’s the easiest first step in eliminating identical by chance matches. However, few match pairs will have parents for everyone. You can use triangulation without parental phasing if parents aren’t available.

Let’s talk about both, including when and how close relatives can and cannot be used.

Parental Phasing

The technique of confirming your match to be legitimate by your match also matching one of your parents is called parental phasing.

If we have the parents of both people in a match pair available for matching, we can easily tell if the match does NOT match either parent. That’s Parental Cross Matching. If either match does NOT match one of the other person’s parents, the match is identical by chance, also known as a false positive.

See how easy that was!

If you, for example, is the only person in your match pair to have parents available, then you can parentally phase the match on your side if your match matches your parents. However, because your match’s parents are unavailable, your match to them cannon tbe verified as legitimate on their side. So you are not phased to their parents.

If you only have one of your parents available for matching, and your match does not match that parent, you CANNOT presume that because your match does NOT match that parent, the match is a legitimate match for the other, missing, parent.

There are four possible match conditions:

  • Maternal match
  • Paternal match
  • Matches neither parent which means the match is identical by chance meaning a false positive
  • Matches both parents in the case of pedigree collapse or endogamy

If two matching people do match one parent of both matches (parental cross-matching), then the match is legitimate. In other words, if we match, I need to match one of your parents and you need to match one of mine.

It’s important to compare your matches’ DNA to generationally older direct family members such as parents or grandparents, if that’s possible. If your grandparents are available, it’s possible to phase your matches back another generation.

Automatic Phasing at FamilyTreeDNA

FamilyTreeDNA automatically phases your matches to your parents if you test that parent, create or upload a GEDCOM file, and link your test and theirs to your tree in the proper places.

FamilyTreeDNA‘s Family Matching assigns or “buckets” your matches maternally and paternally. Matches are assigned as maternal or paternal matches if one or both parents have tested.

Additionally, FamilyTreeDNA uses triangulated matches from other linked relatives within your tree even if your parents have not tested. If you don’t have your parents, the more people you identify and link to your tree in the proper place, the more people will be assigned to maternal and paternal buckets. FamilyTreeDNA is the only vendor that does this. I wrote about this process in the article, Triangulation in Action at Family Tree DNA.

Parental Phasing Caveats

There are very rare instances where parental phasing may be technically accurate, but not genealogically relevant. By this, I mean that a parent may actually match one of your matches due to endogamy or a population level match, even if it’s considered a false positive because it’s not relevant in a genealogical timeframe.

Conversely, a parent may not match when the segment is actually legitimate, but it’s quite rare and only when pedigree collapse has occurred in a very specific set of circumstances where both parents share a common ancestor.

Let’s take a look at that.

Pedigree Collapse

It’s not terribly uncommon in the not-too-distant past to find first cousins marrying each other, especially in rather closely-knit religious communities. I encounter this in Brethren, Mennonite and Amish families often where the community was small and out-marrying was frowned upon and highly discouraged. These families and sometimes entire church congregations migrated cross-country together for generations.

When pedigree collapse is present, meaning the mother and father share a common ancestor not far in the past, it is possible to inherit half of one segment from Mom and the other half from Dad where those halves originated with the same ancestral couple.

For example, let’s say the matching segment between you and your match is 12 cM in length, shown below. You inherited the blue segment from your Dad and the neighboring peach segment from Mom – shown just below the segment numbers. You received 6 cM from both parents.

Another person’s DNA does match you, shown in the bottom row, but they are not shown on the DNA match list of either of your parents. That’s because the DNA segments of the parents just happened to recombine in 6 cM pieces, respectively, which is below the 7 cM matching threshold of the vendor in this example.

If the person matched you at 12 cM where you inherited 8 cM from one parent and 4 from the other, that person would show on one parent’s match list, but not the other. They would not be on the parent’s match list who contributed only 4 cM simply because the DNA divided and recombined in that manner. They would match you on a longer segment than they match your parent at 8 cM which you might notice as “odd.”

Let’s look at another example.

click to enlarge image

If the matching segment is 20 cM, the person will match you and both of your parents on different pieces of the same segment, given that both segments are above 7 cM. In this case, your match who matches you at 20 cM will match each of your parents at 10 cM.

You would be able to tell that the end location of Dad’s segment is the same as the start location of Mom’s segment.

This is NOT common and is NOT the “go to” answer when you think someone “should” match your parent and does not. It may be worth considering in known pedigree collapse situations.

You can see why someone observing this phenomenon could “presume” that DNA skipped a generation because the person matches you on segments where they don’t match your parent. But DNA didn’t skip anything at all. This circumstance was caused by a combination of pedigree collapse, random division of DNA, then random recombination in the same location where that same DNA segment was divided earlier. Clearly, this sequence of events is not something that happens often.

If you’ve uploaded your DNA to GEDmatch, you can select the “Are your parents related?” function which scans your DNA file for runs of homozygosity (ROH) where your DNA is exactly the same in both parental locations for a significant distance. This suggests that because you inherited the exact same sequence from both parents, that your parents share an ancestor.

If your parents didn’t inherit the same segment of DNA from both parents, or the segment is too short, then they won’t show as “being related,” even if they do share a common ancestor.

Now, let’s look at the opposite situation. Parental phasing and ROH sometimes do occur when common ancestors are far back in time and the match is not genealogically relevant.

Endogamy

I often see non-genealogical matching occur when dealing with endogamy. Endogamy occurs when an entire population has been isolated genetically for a long time. In this circumstance, a substantial part of the population shares common DNA segments because there were few original population founders. Much of the present-day population carries that same DNA. Many people within that population would match on that segment. Think about the Jewish community and indigenous Americans.

Consider our original example, but this time where much of the endogamous population carries all As in these positions because one of the original founders carried that nucleotide sequence. Many people would match lots of other people regardless of whether they are a close relative or share a distant ancestor.

People with endogamous lines do share relatives, but that matching DNA segment originated in ancestors much further back in time. When dealing with endogamy, I use parental phasing as a first step, if possible, then focus on larger matches, generally 20 cM or greater. Smaller matches either aren’t relevant or you often can’t tell if/how they are.

At FamilyTreeDNA, people with endogamy will find many people bucketed on the “Both” tab meaning they triangulate with people linked on both sides of the tester’s tree.

An example of a Jewish person’s bucketed matches based on triangulation with relatives linked in their tree is shown above.

Your siblings, their children, and your children will be related on both your mother’s and father’s sides, but other people typically won’t be unless you have experienced either pedigree collapse where you are related both maternally and paternally through the same ancestors or you descend from an endogamous population.

How Many Identical-by-Chance Matches Will I Have?

If you have both parents available to test, and you’re not dealing with either pedigree collapse or endogamy, you’ll likely find that about 15-20% of your matches don’t match your parents on the same segment and are identical by chance.

With endogamy, you’ll have MANY more matches on your endogamous lines and you’ll have some irrelevant matches, often referred to as “false positive” matches even though they technically aren’t, even using parental phasing.

Your Parents Have DNA That You Don’t

Sometimes people are confused when reviewing their matches and their parent’s match to the same person, especially when they match someone and their parent matches them on a different or an additional segment.

If you match someone on a specific segment and your parents do not, that’s a false positive FOR THAT SEGMENT. Every segment has its own individual history and should be evaluated individually. You can match someone on two segments, one from each parent. Or three segments, one from each parent and one that’s identical by chance. Don’t assume.

Often, your match will match both you and your parent on the same segment – which is a legitimate parentally phased match.

But what if your match matches your parent on a different segment where they don’t match you? That’s a false positive match for you.

Keep in mind that it is possible for one of your matches to match your parent on a separate or an additional segment that IS legitimate. You simply didn’t inherit that particular segment from your parent.

That’s NOT the same situation as someone matching you that does NOT match one of your parents on the same segment – which is an identical by chance or false match.

Your parent having a match that does not match you is the reverse situation.

I have several situations where I match someone on one segment, and they match my parent on the same segment. Additionally, that person matches my parent on another segment that I did NOT inherit from that parent. That’s perfectly normal.

Remember, you only inherit half of your parent’s DNA, so you literally did NOT inherit the other half of their DNA. Your mother, for example, should have twice as many matches as you on her side because roughly half of her matches won’t match you.

That’s exactly why testing your parents and close family members is so critical. Their matches are as valid and relevant to your genealogy as your own. The same is true for other relatives, such as aunts and uncles with whom you share ALL of the same ancestors.

You need to work with your family member’s matches that you don’t share.

No DNA Match Doesn’t Mean You’re Not Related

Some people think that not matching someone on a DNA test is equivalent to saying they aren’t related. Not sharing DNA doesn’t mean you’re not related.

People are often disappointed when they don’t match someone they think they should and interpret that to mean that the testing company is telling them they “aren’t related.” They are upset and take issue with this characterization. But that’s not what it means.

Let’s analyze this a bit further.

First, not sharing DNA with a second cousin once removed (2C1R) or more distant does NOT mean you’re NOT related to that person. It simply means you don’t share any measurable DNA ABOVE THE VENDOR THRESHOLD.

All known second cousins match, but about 10% of third cousins don’t match, and so forth on up the line with each generation further back in time having fewer cousins that match each other.

If you have tested close relatives, check to see if that cousin matches your relatives.

Second, it’s possible to match through the “other” or unexpected parent. I certainly didn’t think this would be the case in my family, because my father is from Appalachia and my mother’s family is primarily from the Netherlands, Germany, Canada, and New England. But I was wrong.

All it took was one German son that settled in Appalachia, and voila, a match through my mother that I surely thought should have been through my father’s side. I have my mother’s DNA and sure enough, my match that I thought should be on my father’s side matches Mom on the same segment where they match me, along with several triangulated matches. Further research confirmed why.

I’ve also encountered situations where I legitimately match someone on both my mother’s and father’s side, on different segments.

Third, imputation can be important for people who don’t match and think they should. Imputation can also cause matching segment length to be overreported.

Ok, so what’s imputation and why do I care?

Imputation

Every DNA vendor today has to use some type of imputation.

Let me explain, in general, what imputation is and why vendors use it.

Over the years, DNA processing vendors who sell DNA chips to testing companies have changed their DNA chips pretty substantially. While genealogical autosomal tests test about 700,000 DNA locations, plus or minus, those locations have changed over time. Today, some of these chips only have 100,000 or so chip locations in common with chips either currently or previously utilized by other vendors.

The vendors who do NOT accept uploads, such as 23andMe or Ancestry, have to develop methods to make their newest customers on their DNA processing vendor’s latest chip compatible with their first customer who was tested on their oldest chip – and all iterations in-between.

Vendors who do accept transfers/uploads from other vendors have to equalize any number of vendors’ chips when their customers upload those files.

Imputation is the scientific way to achieve this cross-platform functionality and has been widely used in the industry since 2017.

Imputation, in essence, fills in the blanks between tested locations with the “most likely” DNA found in the human population based on what’s surrounding the blank location.

Think of the word C_T. There are a limited number of letters and words that are candidates for C_T. If you use the word in a sentence, your odds of accuracy increase dramatically. Think of a genetic string of nucleotides as a sentence.

Imputation can be incorrect and can cause both false positive and false negative matches.

For the most part, imputation does not affect close family matches as much as more distant matches. In other words, imputation is NOT going to cause close family members not to match.

Imputation may cause more distant family members not to match, or to have a false positive match when imputation is incorrect.

Imputation is actually MUCH less problematic than I initially expected.

The most likely effect of imputation is to cause a match to be just above or below the vendor threshold.

How can we minimize the effects of imputation?

  • Generally, the best result will be achieved if both people test at the same vendor where their DNA is processed on the same chip and less imputation is required.
  • Upload the results of both people to both MyHeritage and FamilyTreeDNA. If your match results are generally consistent at those vendors, imputation is not a factor.
  • GEDmatch does not use imputation but attempts to overcome files with low overlapping regions by allowing larger mismatch areas. I find their matches to be less accurate than at the various vendors.

Additionally, Ancestry has a few complicating factors.

Ancestry Issues

AncestryDNA is different in three ways.

  • Ancestry doesn’t provide segment information so it’s impossible to triangulate or identify the segment or chromosome where people match. There is no chromosome browser or triangulation tool.
  • Ancestry down-weights and removes some segments in areas where they feel that people are “too matchy.” You can read Ancestry’s white papers here and here.

These “personal pileup regions,” as they are known, can be important genealogically. In my case, these are my mother’s Acadian ancestors. Yes, this is an endogamous population and also suffers from pedigree collapse, but since this is only one of my mother’s great-grandparents, this match information is useful and should not be removed.

  • Ancestry doesn’t show matches in common if the shared segments are less than 20cM. Therefore, you may not see someone on a shared match list with a relative when they actually are a shared match.

If two people both match a third person on less than a 20 cM segment at Ancestry, the third person won’t appear on the other person’s shared match list. So, if I match John Doe on 19 cM of DNA, and I looked at the shared matches with my Dad, John Doe does NOT appear on the shared match list of me and my Dad – even though he is a match to both of us at 19 cM.

The only way to determine if John Doe is a shared match is to check my Dad’s and my match list individually, which means Dad and I will need to individually search for John Doe.

Caveat here – Ancestry’s search sometimes does not work correctly.

Might someone who doesn’t understand that the shared match list doesn’t show everyone who shares DNA with both people presume that the ancestral DNA of that ancestor “skipped a generation” because John Doe matches me with a known ancestor, and not Dad on our shared match list? I mean, wouldn’t you think that a shared match would be shown on a tab labeled “Shared Matches,” especially since there is no disclaimer?

Yes, people can be forgiven for believing that somehow DNA “skipped” a generation in this circumstance, especially if they are relatively inexperienced and they don’t understand Ancestry’s anomalies or know that they need to or how to search for matches individually.

Even if John Doe does match me and Dad both, we still need to confirm that it’s on the same segment AND it’s a legitimate match, not IBC. You can’t perform either of these functions at Ancestry, but you can elsewhere.

Ancestry WorkArounds

To obtain this functionality, people can upload their DNA files for free to both FamilyTreeDNA and MyHeritage, companies that do provide full shared DNA reporting (in common with) lists of ALL matches and do provide segment information with chromosome browsers. Furthermore, both provide triangulation in different ways.

Matching is free, but an inexpensive unlock is required at both vendors to access advanced tools such as Family Matching (bucketing) and triangulation at Family Tree DNA and phasing/triangulation at MyHeritage.

I wrote about Triangulation in Action at FamilyTreeDNA, here.

MyHeritage actually brackets triangulated segments for customers on their chromosome browser, including parents, so you get triangulation and parental phasing at the same time if you and your parent have both tested or uploaded your DNA file to MyHeritage. You can upload, for free, here.

In this example, my mother is matching to me in red on the entire length of chromosome 18, of course, and three other maternal cousins triangulate with me and mother inside the bracketed portion of chromosome 18. Please note that if any one of the people included in the chromosome browser comparison do not triangulate, no bracket is drawn around any others who do triangulate. It’s all or nothing. I remove people one by one to see if people triangulate – or build one by one with my mother included.

I wrote about Triangulation in Action at MyHeritage, here.

People can also upload to GEDmatch, a third-party site. While GEDmatch is less reliable for matching, you can adjust your search thresholds which you cannot do at other vendors. I don’t recommend routinely working below 7 cM. I occasionally use GEDmatch to see if a pedigree collapse segment has recombined below another vendor’s segment matching threshold.

Do NOT check the box to prevent hard breaks when selecting the One-to-One comparison. Checking that box allows GEDmatch to combine smaller matching segments into mega-segments for matching.

I wrote about Triangulation in Action at GEDmatch, here.

Transferring/Uploading Your DNA 

If you want to transfer your DNA to one of these vendors, you must download the DNA file from one vendor and upload it to another. That process does NOT remove your DNA file from the vendor where you tested, unless you select that option entirely separately.

I wrote full step-by-step transfer/upload instructions for each vendor, here.

Testing Close Relatives Is VERY Useful – Just Not for Triangulation

Of course, your best bet if you don’t have your parents available to test is to test as many of your grandparents, great-aunts/uncles, aunts, and uncles as possible. Test your siblings as well, because they will have inherited some of the same and some different segments of DNA from your parents – which means they carry different pieces of your ancestors’ DNA.

Just because close relatives don’t make good triangulation candidates doesn’t mean they aren’t valuable. Close relatives are golden because when they DO share a match with you, you know where to start looking for a common ancestor, even if your relative matches that person on a different segment than you do.

Close relatives are also important because they will share pieces of your common ancestor’s DNA that you don’t. Their matches can unlock the answers to your genealogy questions.

Ok, back to triangulation.

Triangulated Matches

A triangulated match is, of course, when three people all descended from a common ancestor and match each other on the same segment of DNA.

That means all three people’s DNA matches each other on that same segment, confirming that the match is not by chance, and that segment did descend from a common ancestor or ancestral couple.

But, is this always true? You’re going to hate this answer…

“It depends.”

You knew that was coming, didn’t you! 😊

It depends on the circumstances and relationships of the three people involved.

  • One of those three people can match the other two by chance, not by descent, especially if two of those people are close relatives to each other.
  • Identical by chance means that one of you didn’t inherit that DNA from one single parent. That zigzag phenomenon.
  • Furthermore, triangulated DNA is only valid as far back as the closest common ancestor of any two of the three people.

Let’s explore some examples.

Building Triangulation Evidence – Ingredients and a Recipe

The strongest case of triangulation is when:

  • You and at least two additional cousins match on the same segment AND
  • Descend through different children of the common ancestral couple

Let’s look at a valid triangulated match.

In this first example, the magenta segment of DNA is at least partially shared by four of the six cousins and triangulates to their common great-grandfather. Let’s say that these cousins then match with two other people descended from different children of their great-great-great-grandparents on this same segment. Then the entire triangulation group will have confirmed that segment’s origin and push the descent of that segment back another two generations.

These people all coalesce into one line with their common great-grandparents.

I’m only showing 3 generations in this triangulated match, but the concept is the same no matter how many generations you reach back in time. Although, over time, segments inherited from any specific ancestor become smaller and smaller until they are no longer passed to the next generation.

In this pedigree chart, we’re only tracking the magenta DNA which is passed generation to generation in descendants.

Eventually, of course, those segments become smaller and indistinguishable as they either aren’t passed on at all or drop below vendor matching thresholds.

This chart shows the average amount of DNA you would carry from each generational ancestor. You inherit half of each parent’s DNA, but back further than that, you don’t receive exactly half of any ancestor’s DNA in any generation. Larger segments are generally cut in two and passed on partially, but smaller segments are often either passed on whole or not at all.

On average, you’ll carry 7 cM of your eight-times-great-grandparents. In reality, you may carry more or you may not carry any – and you are unlikely to carry the same segment as any random other descendants but we know it happens and you’ll find them if enough (or the right) descendants test.

Putting this another way, if you divide all of your approximate 7000 cM of DNA into 7 cM segments of equal length – you’ll have 1000 7 cM segments. So will every other descendant of your eight-times-great-grandparent. You can see how small the chances are of you both inheriting that same exact 7 cM segment through ten inheritance/transmission events, each. Yet it does happen.

I have several triangulated matches with descendants of Charles Dodson and his wife, Anne through multiple of their 9 (or so) children, ten generations back in my tree. Those triangulated matches range from 7-38 cM. It’s possible that those three largest matches at 38 cM could be related through multiple ancestors because we all have holes in our trees – including Anne’s surname.

Click to enlarge image

It helps immensely that Charles Dodson had several children who were quite prolific as well.

Of course, the further back in time, the more “proof” is necessary to eliminate other unknown common ancestors. This is exactly why matching through different children is important for triangulation and ancestor confirmation.

The method we use to confirm the common ancestor is that all of the descendants who match the tester on the same segment all also match each other. This greatly reduces the chances that these people are matching by chance. The more people in the triangulation group, the stronger the evidence. Of course, parental phasing or cross-matching, where available is an added confirmation bonus.

In our magenta inheritance example, we saw that three of the males and one of the females from three different descendants of the great-grandparents all carry at least a portion of that magenta segment of great-grandpa’s DNA.

Now, let’s take a look at a different scenario.

Why can’t siblings or close relatives be used as two of the three people needed for triangulation?

Aunts and Uncles

We know that the best way to determine if a match is valid is by parental phasing – your match also matching to one of your parents.

If both parents aren’t available, looking for close family matches in common with your match is the next hint that genealogists seek.

Let’s say that you and your match both match your aunt or uncle in common or their children.

You and your aunts or uncles matching DNA only pushes your common ancestor back to your grandparents.

At that point, your match is in essence matching to a segment that belongs to your grandparents. Your matches’ DNA, or your grandparents’ DNA could have randomly recombined and you and your aunt/cousins could be matching that third person by chance.

Ok, then, what about siblings?

Siblings

The most recent common ancestor (MRCA) of you and someone who also matches your sibling is your parents. Therefore, you and your sibling actually only count as one “person” in this scenario. In essence, it’s the DNA of your parent(s) that is matching that third person, so it’s not true triangulation. It’s the same situation as above with aunts/uncles, except the common ancestor is closer than your grandparents.

The DNA of your parents could have recombined in both siblings to look like a match to your match’s family. Or vice versa. Remember Parental Cross-Matching.

If you and a sibling inherited EXACTLY the same segment of your Mom’s and Dad’s DNA, and you match someone by chance – that person will match your sibling by chance as well.

In this example, you can see that both siblings 1 and 2 inherited the exact same segments of DNA at the same locations from both of their parents.

Of course, they also inherited segments at different locations that we’re not looking at that won’t match exactly between siblings, unless they are identical twins. But in this case, the inherited segments of both siblings will match someone whose DNA randomly combined with green or magenta dots in these positions to match a cross-section of both parents.

How False Positives Work and How to Avoid Them

We saw in our first example, displayed again above, what a valid triangulated match looks like. Now let’s expand this view and take a look more specifically at how false positive matches occur.

On the left-hand (blue) side of this graphic, we see four siblings that descend through their father from Great-grandpa who contributed that large magenta segment of DNA. That segment becomes reduced in descendants in subsequent generations.

In downstream generations, we can see gold, white and green segments being added to the DNA inherited by the four children from their ancestor’s spouses. Dad’s DNA is shown on the left side of each child, and Mom’s on the right.

  • Blue Children 1 and 2 inherited the same segments of DNA from Mom and Dad. Magenta from Dad and green from Mom.
  • Blue Child 3 inherited two magenta segments from Dad in positions 1 and 2 and one gold segment from Dad in position 3. They inherited all white segments from Mom.
  • Blue Child 4 inherited all gold segments from Dad and all white segments from Mom.

The family on the blue left-hand side is NOT related to the pink family shown at right. That’s important to remember.

I’ve intentionally constructed this graphic so that you can see several identical by chance (IBC) matches.

Child 5, the first pink sibling carries a white segment in position 1 from Dad and gold segments in positions 2 and 3 from Dad. From Mom, they inherited a green segment in position 1, magenta in position 2 and green in position 3.

IBC Match 1 – Looking at the blue siblings, we see that based on the DNA inherited from Pink Child 5’s parents, Pink Child 5 matches Blue Child 4 with white, gold and gold in positions 1-3, even though they weren’t inherited from the same parent in Blue Child 4. I circled this match in blue.

IBC Match 2 – Pink Child 5 also matches Blue Children 1 and 2 (red circles) because Pink Child 5 has green, magenta, and green in positions 1-3 and so do Blue Children 1 and 2. However, Blue Children 1 and 2 inherited the green and magenta segments from Mom and Dad respectively, not just from one parent.

Pink Child 5 matches Blue Children 1, 2 and 4, but not because they match by descent, but because their DNA zigzags back and forth between the blue children’s DNA contributed by both parents.

Therefore, while Pink Child 5 matches three of the Blue Children, they do not match either parent of the Blue Children.

IBC Match 3 – Pink Child 6 matches Blue Child 3 with white, magenta and gold in positions 1-3 based on the same colors of dots in those same positions found in Blue Child 3 – but inherited both paternally and maternally.

You can see that if we had the four parents available to test, that none of the Pink Children would match either the Blue Children’s mother or father and none of the Blue Children would match either of the Pink Children’s mother or father.

This is why we can’t use either siblings or close family relatives for triangulation.

Distant Cousins Are Best for Triangulation & Here’s Why

When triangulating with 3 people, the most recent common ancestor (MRCA) intersection of the closest two people is the place at which triangulation turns into only two lines being compared and ceases being triangulation. Triangle means 3.

If siblings are 2 of the 3 matching people, then their parents are essentially being compared to the third person.

If you, your aunt/uncle, and a third person match, your grandparents are the place in your tree where three lines converge into two.

The same holds true if you’re matching against a sibling pair on your match’s side, or a match and their aunt/uncle, etc.

The further back in your tree you can push that MRCA intersection, the more your triangulated match provides confirming evidence of a common ancestor and that the match is valid and not caused by random recombination.

That’s exactly what the descendants of Charles Dodson have been able to do through triangulation with multiple descendants from several of his children.

It’s also worth mentioning at this point that the reason autosomal DNA testing uses hundreds/thousands of base pairs in a comparison window and not 3 or 6 dots like in my example is that the probability of longer segments of DNA simply randomly matching by chance is reduced with length and SNP density which is the number of SNP locations tested within that cM range.

Hence a 7 cM/500 SNP minimum is the combined rule of thumb. At that level, roughly half of your matches will be valid and half will be identical by chance unless you’re dealing with endogamy. Then, raise your threshold accordingly.

Ok, So Where are We? A Triangulation Checklist for You!

I know this has been a relatively long educational article, but it’s important to really understand that testing close relatives is VERY important, but also why we can’t effectively use them for triangulation.

Here’s a handy-dandy summary matching/triangulation checklist for you to use as you work through your matches.

  • You inherit half of each of your parents’ DNA. There is no other place for you to obtain or inherit your DNA. There is no DNA fairy sprinkling you with DNA from another source:)
  • DNA does NOT skip generations, although in occasional rare circumstances, it may appear that this happened. In this situation, it’s incumbent upon you, the genealogist, to PROVE that an exception has occurred if you really believe it has. Those circumstances might be pedigree collapse or perhaps imputation. You’ll need to compare matches at vendors who provide a chromosome browser, triangulation, and full shared match list information. Never assume that you are the exception without hard and fast proof. We all know about assume, right?
  • Your siblings inherit half of your parents’ DNA too, but not the same exact half of your parent’s DNA that you other siblings did (unless they are identical twins.) You may inherit the exact same DNA from either or both of your parents on certain segments.
  • Your matches may match your parents on different or an additional segment that you did not inherit.
  • Every segment has an individual history. Evaluate every matching segment separately. One matching segment with someone could be maternal, one paternal, and one identical by chance.
  • You can confirm matches as valid if your match matches one of your parents, and you match one of your match’s parents. Parental Phasing is when your match matches your parent. Parental Cross-Matching is when you both match one of each other’s parents. To be complete, both people who match each other need to match one of the parents of the other person. This rule still holds even if you have a known common ancestor. I can’t even begin to tell you how many times I’ve been fooled.
  • 15-20% (or more with endogamy) of your matches will be identical by chance because either your DNA or your match’s DNA aligns in such a way that while they match you, they don’t match either of your parents.
  • Your siblings, aunts, and uncles will often inherit the same DNA as you – which means that identical by chance matches will also match them. That’s why we don’t use close family members for triangulation. We do utilize close family members to generate common match hints. (Remember the 20 cM shared match caveat at Ancestry)
  • While your siblings, aunts, and uncles are too close to use for triangulation, they are wonderful to identify ancestral matches. Some of their matches will match you as well, and some will not because your close family members inherited segments of your ancestor’s DNA that you did not. Everyone should test their oldest family members.
  • Triangulate your close family member’s matches separately from your own to shed more light on your ancestors.
  • Endogamy may interfere with parental phasing, meaning you may match because you and/or your match may have inherited some of the same DNA segment(s) from both sides of your tree and/or more DNA than might otherwise be expected.
  • Pedigree collapse needs to be considered when using parental phasing, especially when the same ancestor appears on both sides of your family tree. You may share more DNA with a match than expected.
  • Conversely, with pedigree collapse, your match may not match your parents, or vice versa, if a segment happens to have recombined in you in a way that drops the matching segments of your parents beneath the vendor’s match threshold.
  • While you will match all of your second cousins, you will only match approximately 90% of your third cousins and proportionally fewer as your relationship reaches further back in time.
  • Not being a DNA match with someone does NOT mean you’re NOT related to them, unless of course, you’re a second cousin (2C) or closer. It simply means you don’t carry any common ancestral segments above vendor thresholds.
  • At 2C or closer, if you’re not a DNA match, other alternative situations need to be considered – including the transfer/upload of the wrong person’s DNA file.
  • Imputation, a scientific process required of vendors may interfere with matching, especially in more distant relatives who have tested on different platforms.
  • Imputation artifacts will be less obvious when people are more closely related, meaning closer relatives can be expected to match on more and larger segments and imputation errors make less difference.
  • Imputation will not cause close relatives, meaning 2C or closer, to not match each other.
  • In addition to not supporting segment matching information, Ancestry down-weights some segments, removes some matching DNA, and does not show shared matches below 20cM, causing some people to misinterpret their lack of common matches in various ways.
  • To resolve questions about matching issues at Ancestry, testers can transfer/upload their DNA files to MyHeritage, FamilyTreeDNA, and GEDmatch and look for consistent matches on the same segment. Start and end locations may vary to some extent between vendors, but the segment size should be basically in the same location and roughly the same size.
  • GEDmatch does not use imputation but allows larger non-matching segments to combine as a single segment which sometimes causes extremely “generous” matches. GEDmatch matching is less reliable than FamilyTreeDNA or MyHeritage, but you can adjust the matching thresholds.
  • The best situation for matching is for both people to test at the same vendor who supports and provides segment data and a chromosome browser such as 23andMe, FamilyTreeDNA, or MyHeritage.
  • Siblings cannot be used for triangulation because the most recent common ancestor (MRCA) between you and your siblings is your parents. Therefore, the “three” people in the triangulation group is reduced to two lines immediately.
  • Uncles and aunts should not be used for triangulation because the most recent common ancestors between you and your aunts and uncles are your grandparents.
  • Conversely, you should not consider triangulating with siblings and close family members of your matches as proof of an ancestral relationship.
  • A triangulation group of 3 people is only confirmation as far back as when two of those people’s lines converge and reach a common ancestor.
  • Identical by chance (IBC) matching occurs when DNA from the maternal and paternal sides are mixed positionally in the child to resemble a maternal/paternal side match with someone else.
  • Identical by chance DNA admixture (when compared to a match) could have occurred in your parents or grandparent’s generation, or earlier, so the further back in time that people in a triangulation group reach, the more reliable the triangulation group is likely to be.
  • The larger the segments and/or the triangulation group, the stronger the evidence for a specific confirmed common ancestor.
  • Early families with a very large number of descendants may have many matching and triangulated members, even 9 or 10 generations later.
  • While exactly 50% of each ancestor’s DNA is not passed in each generation, on average, you will carry 7 cM of your ancestors 10 generations back in your tree. However, you may carry more, or none.
  • The percentage of matching descendants decreases with each generation beyond great-grandparents.
  • The ideal situation for triangulation is a significant number of people, greater than three, who match on the same reasonably sized segment (7 cM/500 SNP or larger) and descend from the same ancestor (or ancestral couple) through different children whose spouses in descendant generations are not also related.
  • This means that tree completion is an important factor in match/triangulation reliability.
  • Triangulating through different children of the ancestral couple makes it significantly less likely that a different unknown common ancestor is contributing that segment of DNA – like an unknown wife in a descendant generation.

Whew!!!

The Bottom Line

Here’s the bottom line.

  1. Don’t use close relatives to triangulate.
  2. Use parents for Parental Phasing.
  3. Use Parental Cross-Matching when possible.
  4. Use close relatives to look for shared common matches that may lead to triangulation possibilities.
  5. Triangulate your close relatives’ DNA in addition to your own for bonus genealogical information. They will match people that you don’t.
  6. For the most reliable triangulation results, use the most distant relatives possible, descended through different children of the common ancestral couple.
  7. Keep this checklist of best practices, cautions, and caveats handy and check the list as necessary when evaluating the strength of any match or triangulation group. It serves as a good reminder for what to check if something seems “off” or unusual.

Feel free to share and pass this article (and checklist) on to your genealogy buddies and matches as you explain triangulation and collaborate on your genealogy.

Have fun!!!

_____________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Products and Services

Books

Genealogy Research

Haplogroup Matching: What It Does (and Doesn’t) Mean

“Our haplogroups (sort of) match, so that means we’re related, right?”

Well, maybe.

It depends.

Great Question

This is an oft-asked great question. Of course, the answer varies depending on the context of the question and what is meant by “related.” A haplogroup match may or may not be a valid match for genealogy. A “match” or a “not match” can mean different things.

The questions people often ask include:

  • Does a haplogroup have to match exactly in order for another person to either be considered a match to you?
  • If they don’t match exactly, can they still be considered as a possible match?
  • Conversely, can we rule someone out as a match on a specific genealogical line based on haplogroup alone?

These questions often arise in relation to DNA testing at Family Tree DNA, sometimes when people are trying to compare results to people who have haplogroup estimates, either at FamilyTreeDNA or from testing elsewhere.

In other words, if one person is haplogroup J and someone else is J1, either at the same vendor or at another, what does that tell us? This question pertains to both Y DNA and mitochondrial DNA tests no matter where you’ve tested.

Family Tree DNA offers different levels of Y DNA testing. Interpreting those match results can sometimes be confusing. The same is true for mitochondrial DNA, especially if your matches have not taken the full mitochondrial sequence (mtFull) test.

You might be comparing apples and oranges, or you might be comparing a whole orange (detailed test) with a few slices (haplogroup estimate.) How can you know, and how can you make sense of the results?

If you’re comparing a haplogroup between sources, such as a partial haplogroup determined by testing through a company like 23andMe or LivingDNA to complete tests taken at FamilyTreeDNA, the answer can be less than straightforward.

I discussed the difference between autosomal-based haplogroup assignments and actual testing of both Y DNA and mitochondrial DNA which result in haplogroup assignments, here. In a nutshell, both LivingDNA and 23andMe provide a high-level (base) haplogroup estimates based on a few specific probes when you purchase an autosomal test, but that’s not the same as deeper testing of the Y chromosome or mitochondrial DNA.

The answer to whether your haplogroup has to match is both “yes”, and “no.” Don’t you hate it when this happens?

Let’s look at different situations. But to begin with, there is at least one common answer.

Yes, Your Base Haplogroup Must Match

To even begin to look further for a common ancestor on either your Y DNA line (direct patrilineal) or direct mitochondrial matrilineal line (your mother’s mother’s mother’s line on up the tree), your base haplogroup much match.

In other words, you and your matches must all be in the same base haplogroup. Haplogroups are defined by the presence of specific combinations of mutations which are called SNPs (single nucleotide polymorphisms) in the Y DNA.

Click to enlarge images

All of these men on the Y DNA matches page are a branch of haplogroup R as shown under the Y-DNA Haplogroup column. There are more matches on down the page (not shown here) with more and different haplogroups. However, you’ll notice that all matches are a subset of haplogroup R, the base haplogroup.

The same is true for mitochondrial DNA haplogroups. You can see in this example that people who have not tested at the FMS (full mitochondrial sequence) level have a less specific haplogroup. The entire mitochondria must be tested in order to obtain a full haplogroup, such as J1c2f, as opposed to haplogroup J.

The Y DNA Terminal SNP Might Not Match

For Y DNA testers, when looking at your matches, even to close relatives, you may not have the same exact haplogroup because:

  • Some people may have tested at different levels
  • Some people in recent generations may have developed a SNP specific to their line.

In other words, haplogroups, testing level, and known genealogy all need to be considered, especially when the haplogroups are “close to each other” on the tree.

For Y DNA, FamilyTreeDNA:

  • Provides all testers with base haplogroup estimates based on STR tests, meaning 12-111 marker panels. These are very accurate estimates, but are also very high level.
  • Offers or has offered in the past both individual SNP tests and SNP packs or bundles that test individual SNPs indicating their presence or absence. This confirms a SNP or haplogroup, but only to that particular level.
  • Offers the Big Y-700 test, along with upgrades to previous Big Y test levels. There have been 3 versions of the Big Y test over time. The Big Y reads the entire gold standard region of the Y chromosome, reporting the known (named) SNP mutations customers do and don’t have. Additionally, the test reports any unnamed SNPs which are considered private variants until multiple men on the same branch of the Y DNA tree test with the same mutation. At that point, the mutation is named and becomes a haplogroup.

That’s why the answer is “no,” your haplogroup does not have to match exactly for you to actually be a match to each other.

A father and son could test, with one having an estimated haplogroup of R-M269 and the other taking the Big Y-700 resulting in a very different Terminal SNP, quite distant on the tree. Conversely, both men could take the Big Y and the son could have a different terminal SNP than the father because a mutation occurred between them. An autosomal DNA test would confirm that they are in fact, father and son.

However, a father and son who test and are placed in different base haplogroups – one in haplogroup I, and the other in haplogroup R, for example, has a very different situation. Their autosomal test would likely confirm that they are not father and son.

Having said this about paternity, especially if haplogroups are estimated and specific Y DNA SNP testing has not been done, don’t have a premature freak-out moment. Look at autosomal DNA, assuming you DO want to know. Y DNA alone should never be used to infer paternity without autosomal testing.

Let’s look at some examples.

Matches and Haplogroups

In the example shown above, you can see that several people have taken the Big Y test, so their SNP will be shown on further down the haplotree than those testers who have not. These are a leaf, not a branch.

You can see by looking at the Terminal SNP column, at far right, that people who have either taken the Big Y, or had any positive SNP test will have a value in the Terminal SNP column.

Anyone who has NOT taken the Big Y or taken a SNP test will have their base haplogroup estimated based on their STR tests. In this case, that estimate is R-M269. People with estimated haplogroups will not show anything in the Terminal SNP column.

It’s possible that if all of these men took the Big Y test that at least some would share the same Terminal SNP, and others might be closely related, only a branch or so different on the tree.

These men in this example are all descendants of Robert Estes born in England in 1555. All have Estes surnames, except for one man who is seeking the identity of his paternal line.

Let’s Look at the Tree

Our tester in the screenshot is haplogroup R-ZS3700 and matches men in the following haplogroups:

  • R-M269
  • R-L21
  • R-BY490
  • R-BY154784

There are a few additional haplogroups not shown because they are further down on his match list, so let’s just work with these for now.

After determining that these men are on the same branch of the Y tree, haplogroup R, the real question is how closely they are related and how close or far distant their terminal SNPs are located. More distance means the common ancestor is further back in time.

However, looks can be deceiving, especially if not everyone has tested to the same level.

The haplogroup furthest up in the tree, meaning the oldest, is R-M269, followed by the man who took the single SNP test for R-L21. Notice that R-M269 has more than 15,000 branches, so while this haplogroup could be used to rule out a match, R-M269 alone isn’t useful to determine genealogical matching.

There are a lot of branches between R-L21 and the next haplogroup on the tree.

Finally, here we go. Our tester is haplogroup R-ZS3700 that has one descendant branch. R-ZS3700 is a branch of R-BY490 that has 2 branches.

R-BY154784 is the last SNP on this branch of the tree. Our tester matches this man too.

Another way of viewing these matches is on the Block Tree provided for Big Y testers.

In this view, you can see that the Estes men all match back to about 18 “SNP generations” ago according to the legend at left, but they don’t match men further back in time who have taken the Big Y test.

Notice the up-arrow where haplogroups R-L21 and R-M269 are shown across the top of the display.

If you click on R-L21, you’ll see that that it appears about 61 SNP generations back in time.

Haplogroup R-M269 appears even further back in time, about 174 SNP generations.

The only reason you will match someone at either the R-L21 or R-M269 level is because you both descend from a common long-ago ancestral branch, hundreds to thousands of years in the past. You and they would both need to take either the Big Y-700 test for Y DNA, or the full sequence mitochondrial DNA test in order to determine your full haplogroup and see your list of matches based on those full sequences.

Public Trees

You can view FamilyTreeDNA‘s extensive public Y DNA tree by haplogroup, here.

You can view their public mitochondrial DNA tree by haplogroup, here.

And the Answer Is…

As you can see, there is no single answer to the question of haplogroup relationships. The answer is also partly defined by the context in which the question is asked.

  1. For two men to be “related” on the Y DNA patrilineal line, yes, minimally, the base haplogroup does have to match. Base haplogroups are defined by the leading letter, like “R” in the examples above.
  2. “Related” based on base haplogroup only can be hundreds or thousands of years back in time, but additional testing can resolve that question.
  3. “Related” can mean before the advent of surnames. However, a match to a man with the same surname suggests a common ancestor with that surname in the past several hundred years. That match could, however, be much closer in time.
  4. For two men to be closely related, assuming they have taken the same version of Big Y test, their haplogroup branches need to be fairly closely adjacent on the haplotree. FamilyTreeDNA will be introducing haplogroup aging soon, meaning SNP/haplogroup branch dates on their haplotree. At that time, the “distance” between men will be easier to understand.
  5. You can exclude a genealogical relationship on the direct paternal line if the two men involved have a different base haplogroup. This question often occurs when people are trying to understand if they “might match” with someone whose haplogroup has been estimated.
  6. This holds true as well for mitochondrial DNA haplogroups and matching.

And there you have it, six answers about what haplogroup matching does and does not mean.

The bottom line is that haplogroups can be a great starting point and you can sometimes eliminate people as potential matches.

However, to confirm genealogical matches, you’ll always need more granular testing that includes actual Y DNA or mitochondrial DNA matching based on marker mutation results, not just haplogroups.

_____________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Products and Services

Genealogy Research

Books

Genetic Genealogy at 20 Years: Where Have We Been, Where Are We Going and What’s Important?

Not only have we put 2020 in the rear-view mirror, thankfully, we’re at the 20-year, two-decade milestone. The point at which genetics was first added to the toolbox of genealogists.

It seems both like yesterday and forever ago. And yes, I’ve been here the whole time,  as a spectator, researcher, and active participant.

Let’s put this in perspective. On New Year’s Eve, right at midnight, in 2005, I was able to score kit number 50,000 at Family Tree DNA. I remember this because it seemed like such a bizarre thing to be doing at midnight on New Year’s Eve. But hey, we genealogists are what we are.

I knew that momentous kit number which seemed just HUGE at the time was on the threshold of being sold, because I had inadvertently purchased kit 49,997 a few minutes earlier.

Somehow kit 50,000 seemed like such a huge milestone, a landmark – so I quickly bought kits, 49,998, 49,999, and then…would I get it…YES…kit 50,000. Score!

That meant that in the 5 years FamilyTreeDNA had been in business, they had sold on an average of 10,000 kits per year, or 27 kits a day. Today, that’s a rounding error. Then it was momentous!

In reality, the sales were ramping up quickly, because very few kits were sold in 2000, and roughly 20,000 kits had been sold in 2005 alone. I know this because I purchased kit 28,429 during the holiday sale a year earlier.

Of course, I had no idea who I’d test with that momentous New Year’s Eve Y DNA kit, but I assuredly would find someone. A few months later, I embarked on a road trip to visit an elderly family member with that kit in tow. Thank goodness I did, and they agreed and swabbed on the spot, because they are gone today and with them, the story of the Y line and autosomal DNA of their branch.

In the past two decades, almost an entire generation has slipped away, and with them, an entire genealogical library held in their DNA.

Today, more than 40 million people have tested with the four major DNA testing companies, although we don’t know exactly how many.

Lots of people have had more time to focus on genealogy in 2020, so let’s take a look at what’s important? What’s going on and what matters beyond this month or year?

How has this industry changed in the last two decades, and where it is going?

Reflection

This seems like a good point to reflect a bit.

Professor Dan Bradley reflecting on early genetic research techniques in his lab at the Smurfit Institute of Genetics at Trinity College in Dublin. Photo by Roberta Estes

In the beginning – twenty years ago, there were two companies who stuck their toes in the consumer DNA testing water – Oxford Ancestors and Family Tree DNA. About the same time, Sorenson Genomics and GeneTree were also entering that space, although Sorenson was a nonprofit. Today, of those, only FamilyTreeDNA remains, having adapted with the changing times – adding more products, testing, and sophistication.

Bryan Sykes who founded Oxford Ancestors announced in 2018 that he was retiring to live abroad and subsequently passed away in 2020. The website still exists, but the company has announced that they have ceased sales and the database will remain open until Sept 30, 2021.

James Sorenson died in 2008 and the assets of Sorenson Molecular Genealogy Foundation, including the Sorenson database, were sold to Ancestry in 2012. Eventually, Ancestry removed the public database in 2015.

Ancestry dabbled in Y and mtDNA for a while, too, destroying that database in 2014.

Other companies, too many to remember or mention, have come and gone as well. Some of the various company names have been recycled or purchased, but aren’t the same companies today.

In the DNA space, it was keep up, change, die or be sold. Of course, there was the small matter of being able to sell enough DNA kits to make enough money to stay in business at all. DNA processing equipment and a lab are expensive. Not just the equipment, but also the expertise.

The Next Wave

As time moved forward, new players entered the landscape, comprising the “Big 4” testing companies that constitute the ponds where genealogists fish today.

23andMe was the first to introduce autosomal DNA testing and matching. Their goal and focus was always medical genetics, but they recognized the potential in genealogists before anyone else, and we flocked to purchase tests.

Ancestry settled on autosomal only and relies on the size of their database, a large body of genealogy subscribers, and a widespread “feel-good” marketing campaign to sell DNA kits as the gateway to “discover who you are.”

FamilyTreeDNA did and still does offer all 3 kinds of tests. Over the years, they have enhanced both the Y DNA and mitochondrial product offerings significantly and are still known as “the science company.” They are the only company to offer the full range of Y DNA tests, including their flagship Big Y-700, full sequence mitochondrial testing along with matching for both products. Their autosomal product is called Family Finder.

MyHeritage entered the DNA testing space a few years after the others as the dark horse that few expected to be successful – but they fooled everyone. They have acquired companies and partnered along the way which allowed them to add customers (Promethease) and tools (such as AutoCluster by Genetic Affairs), boosting their number of users. Of course, MyHeritage also offers users a records research subscription service that you can try for free.

In summary:

One of the wonderful things that happened was that some vendors began to accept compatible raw DNA autosomal data transfer files from other vendors. Today, FamilyTreeDNA, MyHeritage, and GEDmatch DO accept transfer files, while Ancestry and 23andMe do not.

The transfers and matching are free, but there are either minimal unlock or subscription plans for advanced features.

There are other testing companies, some with niche markets and others not so reputable. For this article, I’m focusing on the primary DNA testing companies that are useful for genealogy and mainstream companion third-party tools that complement and enhance those services.

The Single Biggest Change

As I look back, the single biggest change is that genetic genealogy evolved from the pariah of genealogy where DNA discussion was banned from the (now defunct) Rootsweb lists and summarily deleted for the first few years after introduction. I know, that’s hard to believe today.

Why, you ask?

Reasons varied from “just because” to “DNA is cheating” and then morphed into “because DNA might do terrible things like, maybe, suggest that a person really wasn’t related to an ancestor in a lineage society.”

Bottom line – fear and misunderstanding. Change is exceedingly difficult for humans, and DNA definitely moved the genealogy cheese.

From that awkward beginning, genetic genealogy organically became a “thing,” a specific application of genealogy. There was paper-trail traditional genealogy and then the genetic aspect. Today, for almost everyone, genealogy is “just another tool” in the genealogist’s toolbox, although it does require focused learning, just like any other tool.

DNA isn’t separate anymore, but is now an integral part of the genealogical whole. Having said that, DNA can’t solve all problems or answer all questions, but neither can traditional paper-trail genealogy. Together, each makes the other stronger and solves mysteries that neither can resolve alone.

Synergy.

I fully believe that we have still only scratched the surface of what’s possible.

Inheritance

As we talk about the various types of DNA testing and tools, here’s a quick graphic to remind you of how the different types of DNA are inherited.

  • Y DNA is inherited paternally for males only and informs us of the direct patrilineal (surname) line.
  • Mitochondrial DNA is inherited by everyone from their mothers and informs us of the mother’s matrilineal (mother’s mother’s mother’s) line.
  • Autosomal DNA can be inherited from potentially any ancestor in random but somewhat predictable amounts through both parents. The further back in time, the less identifiable DNA you’ll inherit from any specific ancestor. I wrote about that, here.

What’s Hot and What’s Not

Where should we be focused today and where is this industry going? What tools and articles popped up in 2020 to help further our genealogy addiction? I already published the most popular articles of 2020, here.

This industry started two decades ago with testing a few Y DNA and mitochondrial DNA markers, and we were utterly thrilled at the time. Both tests have advanced significantly and the prices have dropped like a stone. My first mitochondrial DNA test that tested only 400 locations cost more than $800 – back then.

Y DNA and mitochondrial DNA are still critically important to genetic genealogy. Both play unique roles and provide information that cannot be obtained through autosomal DNA testing. Today, relative to Y DNA and mitochondrial DNA, the biggest challenge, ironically, is educating newer genealogists about their potential who have never heard about anything other than autosomal, often ethnicity, testing.

We have to educate in order to overcome the cacophony of “don’t bother because you don’t get as many matches.”

That’s like saying “don’t use the right size wrench because the last one didn’t fit and it’s a bother to reach into the toolbox.” Not to mention that if everyone tested, there would be a lot more matches, but I digress.

If you don’t use the right tool, and all of the tools at your disposal, you’re not going to get the best result possible.

The genealogical proof standard, the gold standard for genealogy research, calls for “a reasonably exhaustive search,” and if you haven’t at least considered if or how Y
DNA
and mitochondrial DNA along with autosomal testing can or might help, then your search is not yet exhaustive.

I attempt to obtain the Y and mitochondrial DNA of every ancestral line. In the article, Search Techniques for Y and Mitochondrial DNA Test Candidates, I described several methodologies to find appropriate testing candidates.

Y DNA – 20 Years and Still Critically Important

Y DNA tracks the Y chromosome for males via the patrilineal (surname) line, providing matching and historical migration information.

We started 20 years ago testing 10 STR markers. Today, we begin at 37 markers, can upgrade to 67 or 111, but the preferred test is the Big Y which provides results for 700+ STR markers plus results from the entire gold standard region of the Y chromosome in order to provide the most refined results. This allows genealogists to use STR markers and SNP results together for various aspects of genealogy.

I created a Y DNA resource page, here, in order to provide a repository for Y DNA information and updates in one place. I would encourage anyone who can to order or upgrade to the Big Y-700 test which provides critical lineage information in addition to and beyond traditional STR testing. Additionally, the Big Y-700 test helps build the Y DNA haplotree which is growing by leaps and bounds.

More new SNPs are found and named EVERY SINGLE DAY today at FamilyTreeDNA than were named in the first several years combined. The 2006 SNP tree listed a grand total of 459 SNPs that defined the Y DNA tree at that time, according to the ISOGG Y DNA SNP tree. Goran Rundfeldt, head of R&D at FamilyTreeDNA posted this today:

2020 was an awful year in so many ways, but it was an unprecedented year for human paternal phylogenetic tree reconstruction. The FTDNA Haplotree or Great Tree of Mankind now includes:

37,534 branches with 12,696 added since 2019 – 51% growth!
defined by
349,097 SNPs with 131,820 added since 2019 – 61% growth!

In just one year, 207,536 SNPs were discovered and assigned FT SNP names. These SNPs will help define new branches and refine existing ones in the future.

The tree is constructed based on high coverage chromosome Y sequences from:
– More than 52,500 Big Y results
– Almost 4,000 NGS results from present-day anonymous men that participated in academic studies

Plus an additional 3,000 ancient DNA results from archaeological remains, of mixed quality and Y chromosome coverage at FamilyTreeDNA.

Wow, just wow.

These three new articles in 2020 will get you started on your Y DNA journey!

Mitochondrial DNA – Matrilineal Line of Humankind is Being Rewritten

The original Oxford Ancestor’s mitochondrial DNA test tested 400 locations. The original Family Tree DNA test tested around 1000 locations. Today, the full sequence mitochondrial DNA test is standard, testing the entire 16,569 locations of the mitochondria.

Mitochondrial DNA tracks your mother’s direct maternal, or matrilineal line. I’ve created a mitochondrial DNA resource page, here that includes easy step-by-step instructions for after you receive your results.

New articles in 2020 included the introduction of The Million Mito Project. 2021 should see the first results – including a paper currently in the works.

The Million Mito Project is rewriting the haplotree of womankind. The current haplotree has expanded substantially since the first handful of haplogroups thanks to thousands upon thousands of testers, but there is so much more information that can be extracted today.

Y and Mitochondrial Resources

If you don’t know of someone in your family to test for Y DNA or mitochondrial DNA for a specific ancestral line, you can always turn to the Y DNA projects at Family Tree DNA by searching here.

The search provides you with a list of projects available for a specific surname along with how many customers with that surname have tested. Looking at the individual Y DNA projects will show the earliest known ancestor of the surname line.

Another resource, WikiTree lists people who have tested for the Y DNA, mitochondrial DNA and autosomal DNA lines of specific ancestors.

Click on images to enlarge

On the left side, my maternal great-grandmother’s profile card, and on the right, my paternal great-great-grandfather. You can see that someone has tested for the mitochondrial DNA of Nora (OK, so it’s me) and the Y DNA of John Estes (definitely not me.)

MitoYDNA, a nonprofit volunteer organization created a comparison tool to replace Ysearch and Mitosearch when they bit the dust thanks to GDPR.

MitoYDNA accepts uploads from different sources and allows uploaders to not only match to each other, but to view the STR values for Y DNA and the mutation locations for the HVR1 and HVR2 regions of mitochondrial DNA. Mags Gaulden, one of the founders, explains in her article, What sets mitoYDNA apart from other DNA Databases?.

If you’ve tested at nonstandard companies, not realizing that they didn’t provide matching, or if you’ve tested at a company like Sorenson, Ancestry, and now Oxford Ancestors that is going out of business, uploading your results to mitoYDNA is a way to preserve your investment. PS – I still recommend testing at FamilyTreeDNA in order to receive detailed results and compare in their large database.

CentiMorgans – The Word of Two Decades

The world of autosomal DNA turns on the centimorgan (cM) measure. What is a centimorgan, exactly? I wrote about that unit of measure in the article Concepts – CentiMorgans, SNPs and Pickin’ Crab.

Fortunately, new tools and techniques make using cMs much easier. The Shared cM Project was updated this year, and the results incorporated into a wonderfully easy tool used to determine potential relationships at DNAPainter based on the number of shared centiMorgans.

Match quality and potential relationships are determined by the number of shared cMs, and the chromosome browser is the best tool to use for those comparisons.

Chromosome Browser – Genetics Tool to View Chromosome Matches

Chromosome browsers allow testers to view their matching cMs of DNA with other testers positioned on their own chromosomes.

My two cousins’ DNA where they match me on chromosomes 1-4, is shown above in blue and red at Family Tree DNA. It’s important to know where you match cousins, because if you match multiple cousins on the same segment, from the same side of your family (maternal or paternal), that’s suggestive of a common ancestor, with a few caveats.

Some people feel that a chromosome browser is an advanced tool, but I think it’s simply standard fare – kind of like driving a car. You need to learn how to drive initially, but after that, you don’t even think about it – you just get in and go. Here’s help learning how to drive that chromosome browser.

Triangulation – Science Plus Group DNA Matching Confirms Genealogy

The next logical step after learning to use a chromosome browser is triangulation. If fact, you’re seeing triangulation above, but don’t even realize it.

The purpose of genetic genealogy is to gather evidence to “prove” ancestral connections to either people or specific ancestors. In autosomal DNA, triangulation occurs when:

  • You match at least two other people (not close relatives)
  • On the same reasonably sized segment of DNA (generally 7 cM or greater)
  • And you can assign that segment to a common ancestor

The same two cousins are shown above, with triangulated segments bracketed at MyHeritage. I’ve identified the common ancestor with those cousins that those matching DNA segments descend from.

MyHeritage’s triangulation tool confirms by bracketing that these cousins also match each other on the same segment, which is the definition of triangulation.

I’ve written a lot about triangulation recently.

If you’d prefer a video, I recorded a “Top Tips” Facebook LIVE with MyHeritage.

Why is Ancestry missing from this list of triangulation articles? Ancestry does not offer a chromosome browser or segment information. Therefore, you can’t triangulate at Ancestry. You can, however, transfer your Ancestry DNA raw data file to either FamilyTreeDNA, MyHeritage, or GEDmatch, all three of which offer triangulation.

Step by step download/upload transfer instructions are found in this article:

Clustering Matches and Correlating Trees

Based on what we’ve seen over the past few years, we can no longer depend on the major vendors to provide all of the tools that genealogists want and need.

Of course, I would encourage you to stay with mainstream products being used by a significant number of community power users. As with anything, there is always someone out there that’s less than honorable.

2020 saw a lot of innovation and new tools introduced. Maybe that’s one good thing resulting from people being cooped up at home.

Third-party tools are making a huge difference in the world of genetic genealogy. My favorites are Genetic Affairs, their AutoCluster tool shown above, DNAPainter and DNAGedcom.

These articles should get you started with clustering.

If you like video resources, here’s a MyHeritage Facebook LIVE that I recorded about how to use AutoClusters:

I created a compiled resource article for your convenience, here:

I have not tried a newer tool, YourDNAFamily, that focuses only on 23andMe results although the creator has been a member of the genetic genealogy community for a long time.

Painting DNA Makes Chromosome Browsers and Triangulation Easy

DNAPainter takes the next step, providing a repository for all of your painted segments. In other words, DNAPainter is both a solution and a methodology for mass triangulation across all of your chromosomes.

Here’s a small group of people who match me on the same maternal segment of chromosome 1, including those two cousins in the chromosome browser and triangulation sections, above. We know that this segment descends from Philip Jacob Miller and his wife because we’ve been able to identify that couple as the most distant ancestor intersection in all of our trees.

It’s very helpful that DNAPainter has added the functionality of painting all of the maternal and paternal bucketed matches from Family Tree DNA.

All you need to do is to link your known matches to your tree in the proper place at FamilyTreeDNA, then they do the rest by using those DNA matches to indicate which of the rest of your matches are maternal and paternal. Instructions, here. You can then export the file and use it at DNAPainter to paint all of those matches on the correct maternal or paternal chromosomes.

Here’s an article providing all of the DNAPainter Instructions and Resources.

DNA Matches Plus Trees Enhance Genealogy

Of course, utilizing DNA matching plus finding common ancestors in trees is one of the primary purposes of genetic genealogy – right?

Vendors have linked the steps of matching DNA with matching ancestors in trees.

Genetic Affairs take this a step further. If you don’t have an ancestor in your tree, but your matches have common ancestors with each other, Genetic Affairs assembles those trees to provide you with those hints. Of course, that common ancestor might not be relevant to your genealogy, but it just might be too!

click to enlarge

This tree does not include me, but two of my matches descend from a common ancestor and that common ancestor between them might be a clue as to why I match both of them.

Ethnicity Continues to be Popular – But Is No Shortcut to Genealogy

Ethnicity is always popular. People want to “do their DNA” and find out where they come from. I understand. I really do. Who doesn’t just want an answer?

Of course, it’s not that simple, but that doesn’t mean it’s not disappointing to people who test for that purpose with high expectations. Hopefully, ethnicity will pique their curiosity and encourage engagement.

All four major vendors rolled out updated ethnicity results or related tools in 2020.

The future for ethnicity, I believe, will be held in integrated tools that allow us to use ethnicity results for genealogy, including being able to paint our ethnicity on our chromosomes as well as perform segment matching by ethnicity.

For example, if I carry an African segment on chromosome 1 from my father, and I match one person from my mother’s side and one from my father’s side on that same segment – one or the other of those people should also have that segment identified as African. That information would inform me as to which match is paternal and which is maternal

Not only that, this feature would help immensely tracking ancestors back in time and identifying their origins.

Will we ever get there? I don’t know. I’m not sure ethnicity is or can be accurate enough. We’ll see.

Transition to Digital and Online

Sometimes the future drags us kicking and screaming from the present.

With the imposed isolation of 2020, conferences quickly moved to an online presence. The genealogy community has all pulled together to make this work. The joke is that 2020’s most used phrase is “can you hear me?” I can vouch for that.

Of course while the year 2020 is over, the problem isn’t and is extending at least through the first half of 2021 and possibly longer. Conferences are planned months, up to a year, in advance and they can’t turn on a dime, so don’t even begin to expect in-person conferences until either late in 2021 or more likely, 2022 if all goes well this year.

I expect the future will eventually return to in-person conferences, but not entirely.

Finding ways to be more inclusive allows people who don’t want to or can’t travel or join in-person to participate.

I’ve recorded several sessions this year, mostly for 2021. Trust me, these could be a comedy, mostly of errors😊

I participated in four MyHeritage Facebook LIVE sessions in 2020 along with some other amazing speakers. This is what “live” events look like today!

Screenshot courtesy MyHeritage

A few days ago, I asked MyHeritage for a list of their LIVE sessions in 2020 and was shocked to learn that there were more than 90 in English, all free, and you can watch them anytime. Here’s the MyHeritage list.

By the way, every single one of the speakers is a volunteer, so say a big thank you to the speakers who make this possible, and to MyHeritage for the resources to make this free for everyone. If you’ve ever tried to coordinate anything like this, it’s anything but easy.

Additonally, I’ve created two Webinars this year for Legacy Family Tree Webinars.

Geoff Rasmussen put together the list of their top webinars for 2020, and I was pleased to see that I made the top 10! I’m sure there are MANY MORE you’d be interested in watching. Personally, I’m going to watch #6 yet today! Also, #9 and #22. You can always watch new webinars for free for a few days, and you can subscribe to watch all webinars, here.

The 2021 list of webinar speakers has been announced here, and while I’m not allowed to talk about something really fun that’s upcoming, let’s just say you definitely have something to look forward to in the springtime!

Also, don’t forget to register for RootsTech Connect which is entirely online and completely free, February 25-27, here.

Thank you to Penny Walters for creating this lovely graphic.

There are literally hundreds of speakers providing sessions in many languages for viewers around the world. I’ve heard the stats, but we can’t share them yet. Let me just say that you will be SHOCKED at the magnitude and reach of this conference. I’m talking dumbstruck!

During one of our zoom calls, one of the organizers says it feels like we’re constructing the plane as we’re flying, and I can confirm his observation – but we are getting it done – together! All hands on deck.

I’ll be presenting an advanced session about triangulation as well as a mini-session in the FamilySearch DNA Resource Center about finding your mother’s ancestors. I’ll share more information as it’s released and I can.

Companies and Owners Come & Go

You probably didn’t even notice some of these 2020 changes. Aside from the death of Bryan Sykes (RIP Bryan,) the big news and the even bigger unknown is the acquisition of Ancestry by Blackstone. Recently the CEO, Margo Georgiadis announced that she was stepping down. The Ancestry Board of Directors has announced an external search for a new CEO. All I can say is that very high on the priority list should be someone who IS a genealogist and who understands how DNA applies to genealogy.

Other changes included:

In the future, as genealogy and DNA testing becomes ever more popular and even more of a commodity, company sales and acquisitions will become more commonplace.

Some Companies Reduced Services and Cut Staff

I understand this too, but it’s painful. The layoffs occurred before Covid, so they didn’t result from Covid-related sales reductions. Let’s hope we see renewed investment after the Covid mess is over.

In a move that may or may not be related to an attempt to cut costs, Ancestry removed 6 and 7 cM matches from their users, freeing up processing resources, hardware, and storage requirements and thereby reducing costs.

I’m not going to beat this dead horse, because Ancestry is clearly not going to move on this issue, nor on that of the much-requested chromosome browser.

Later in the year, 23andMe also removed matches and other features, although, to their credit, they have restored at least part of this functionality and have provided ethnicity updates to V3 and V4 kits which wasn’t initially planned.

It’s also worth noting that early in 2020, 23andMe laid off 100 people as sales declined. Since that time, 23andMe has increasingly pushed consumers to pay to retest on their V5 chip.

About the same time, Ancestry also cut their workforce by about 6%, or about 100 people, also citing a slowdown in the consumer testing market. Ancestry also added a health product.

I’m not sure if we’ve reached market saturation or are simply seeing a leveling off. I wrote about that in DNA Testing Sales Decline: Reason and Reasons.

Of course, the pandemic economy where many people are either unemployed or insecure about their future isn’t helping.

The various companies need some product diversity to survive downturns. 23andMe is focused on medical research with partners who pay 23andMe for the DNA data of customers who opt-in, as does Ancestry.

Both Ancestry and MyHeritage provide subscription services for genealogy records.

FamilyTreeDNA is part of a larger company, GenebyGene whose genetics labs do processing for other companies and medical facilities.

A huge thank you to both MyHeritage and FamilyTreeDNA for NOT reducing services to customers in 2020.

Scientific Research Still Critical & Pushes Frontiers

Now that DNA testing has become a commodity, it’s easy to lose track of the fact that DNA testing is still a scientific endeavor that requires research to continue to move forward.

I’m still passionate about research after 20 years – maybe even more so now because there’s so much promise.

Research bleeds over into the consumer marketplace where products are improved and new features created allowing us to better track and understand our ancestors through their DNA that we and our family members inherit.

Here are a few of the research articles I published in 2020. You might notice a theme here – ancient DNA. What we can learn now due to new processing techniques is absolutely amazing. Labs can share files and information, providing the ability to “reprocess” the data, not the DNA itself, as more information and expertise becomes available.

Of course, in addition to this research, the Million Mito Project team is hard at work rewriting the tree of womankind.

If you’d like to participate, all you need to do is to either purchase a full sequence mitochondrial DNA kit at FamilyTreeDNA, or upgrade to the full sequence if you tested at a lower level previously.

Predictions

Predictions are risky business, but let me give it a shot.

Looking back a year, Covid wasn’t on the radar.

Looking back 5 years, neither Genetic Affairs nor DNAPainter were yet on the scene. DNAAdoption had just been formed in 2014 and DNAGedcom which was born out of DNAAdoption didn’t yet exist.

In other words, the most popular tools today didn’t exist yet.

GEDmatch, founded in 2010 by genealogists for genealogists was 5 years old, but was sold in December 2019 to Verogen.

We were begging Ancestry for a chromosome browser, and while we’ve pretty much given up beating them, because the horse is dead and they can sell DNA kits through ads focused elsewhere, that doesn’t mean genealogists still don’t need/want chromosome and segment based tools. Why, you’d think that Ancestry really doesn’t want us to break through those brick walls. That would be very bizarre, because every brick wall that falls reveals two more ancestors that need to be researched and spurs a frantic flurry of midnight searching. If you’re laughing right now, you know exactly what I mean!

Of course, if Ancestry provided a chromosome browser, it would cost development money for no additional revenue and their customer service reps would have to be able to support it. So from Ancestry’s perspective, there’s no good reason to provide us with that tool when they can sell kits without it. (Sigh.)

I’m not surprised by the management shift at Ancestry, and I wouldn’t be surprised to see several big players go public in the next decade, if not the next five years.

As companies increase in value, the number of private individuals who could afford to purchase the company decreases quickly, leaving private corporations as the only potential buyers, or becoming publicly held. Sometimes, that’s a good thing because investment dollars are infused into new product development.

What we desperately need, and I predict will happen one way or another is a marriage of individual tools and functions that exist separately today, with a dash of innovation. We need tools that will move beyond confirming existing ancestors – and will be able to identify ancestors through our DNA – out beyond each and every brick wall.

If a tester’s DNA matches to multiple people in a group descended from a particular previously unknown couple, and the timing and geography fits as well, that provides genealogical researchers with the hint they need to begin excavating the traditional records, looking for a connection.

In fact, this is exactly what happened with mitochondrial DNA – twice now. A match and a great deal of digging by one extremely persistent cousin resulting in identifying potential parents for a brick-wall ancestor. Autosomal DNA then confirmed that my DNA matched with 59 other individuals who descend from that couple through multiple children.

BUT, we couldn’t confirm those ancestors using autosomal DNA UNTIL WE HAD THE NAMES of the couple. DNA has the potential to reveal those names!

I wrote about that in Mitochondrial DNA Bulldozes Brick Wall and will be discussing it further in my RootsTech presentation.

The Challenge

We have most of the individual technology pieces today to get this done. Of course, the combined technological solution would require significant computing resources and processing power – just at the same time that vendors are desperately trying to pare costs to a minimum.

Some vendors simply aren’t interested, as I’ve already noted.

However, the winner, other than us genealogists, of course, will be the vendor who can either devise solutions or partner with others to create the right mix of tools that will combine matching, triangulation, and trees of your matches to each other, even if you don’t’ share a common ancestor.

We need to follow the DNA past the current end of the branch of our tree.

Each triangulated segment has an individual history that will lead not just to known ancestors, but to their unknown ancestors as well. We have reached critical mass in terms of how many people have tested – and more success would encourage more and more people to test.

There is a genetic path over every single brick wall in our genealogy.

Yes, I know that’s a bold statement. It’s not future Jetson’s flying-cars stuff. It’s doable – but it’s a matter of commitment, investment money, and finding a way to recoup that investment.

I don’t think it’s possible for the one-time purchase of a $39-$99 DNA test, especially when it’s not a loss-leader for something else like a records or data subscription (MyHeritage and Ancestry) or a medical research partnership (Ancestry and 23andMe.)

We’re performing these analysis processes manually and piecemeal today. It’s extremely inefficient and labor-intensive – which is why it often fails. People give up. And the process is painful, even when it does succeed.

This process has also been made increasingly difficult when some vendors block tools that help genealogists by downloading match and ancestral tree information. Before Ancestry closed access, I was creating theories based on common ancestors in my matches trees that weren’t in mine – then testing those theories both genetically (clusters, AutoTrees and ThruLines) and also by digging into traditional records to search for the genetic connection.

For example, I’m desperate to identify the parents of my James Lee Clarkson/Claxton, so I sorted my spreadsheet by surname and began evaluating everyone who had a Clarkson/Claxton in their tree in the 1700s in Virginia or North Carolina. But I can’t do that anymore now, either with a third-party tool or directly at Ancestry. Twenty million DNA kits sold for a minimum of $79 equals more than 1.5 billion dollars. Obviously, the issue here is not a lack of funds.

Including Y and mitochondrial DNA resources in our genetic toolbox not only confirms accuracy but also provides additional hints and clues.

Sometimes we start with Y DNA or mitochondrial DNA, and wind up using autosomal and sometimes the reverse. These are not competing products. It’s not either/or – it’s *and*.

Personally, I don’t expect the vendors to provide this game-changing complex functionality for free. I would be glad to pay for a subscription for top-of-the-line innovation and tools. In what other industry do consumers expect to pay for an item once and receive constant life-long innovations and upgrades? That doesn’t happen with software, phones nor with automobiles. I want vendors to be profitable so that they can invest in new tools that leverage the power of computing for genealogists to solve currently unsolvable problems.

Every single end-of-line ancestor in your tree represents a brick wall you need to overcome.

If you compare the cost of books, library visits, courthouse trips, and other research endeavors that often produce exactly nothing, these types of genetic tools would be both a godsend and an incredible value.

That’s it.

That’s the challenge, a gauntlet of sorts.

Who’s going to pick it up?

I can’t answer that question, but I can say that 23andMe can’t do this without supporting extensive trees, and Ancestry has shown absolutely no inclination to support segment data. You can’t achieve this goal without segment information or without trees.

Among the current players, that leaves two DNA testing companies and a few top-notch third parties as candidates – although – as the past has proven, the future is uncertain, fluid, and everchanging.

It will be interesting to see what I’m writing at the end of 2025, or maybe even at the end of 2021.

Stay tuned.

_____________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Products and Services

Genealogy Research

Books

Y DNA Resources and Repository

I’ve created a Y DNA resource page with the information in this article, here, as a permanent location where you can find Y DNA information in one place – including:

  • Step-by-step guides about how to utilize Y DNA for your genealogy
  • Educational articles and links to the latest webinars
  • Articles about the science behind Y DNA
  • Ancient DNA
  • Success stories

Please feel free to share this resource or any of the links to individual articles with friends, genealogy groups, or on social media.

If you haven’t already taken a Y DNA test, and you’re a male (only males have a Y chromosome,) you can order one here. If you also purchase the Family Finder, autosomal test, those results can be used to search together.

What is Y DNA?

Y DNA is passed directly from fathers to their sons, as illustrated by the blue arrow, above. Daughters do not inherit the Y chromosome. The Y chromosome is what makes males, male.

Every son receives a Y chromosome from his father, who received it from his father, and so forth, on up the direct patrilineal line.

Comparatively, mitochondrial DNA, the pink arrow, is received by both sexes of children from the mother through the direct matrilineal line.

Autosomal DNA, the green arrow, is a combination of randomly inherited DNA from many ancestors that is inherited by both sexes of children from both parents. This article explains a bit more.

Y DNA has Unique Properties

The Y chromosome is never admixed with DNA from the mother, so the Y chromosome that the son receives is identical to the father’s Y chromosome except for occasional minor mutations that take place every few generations.

This lack of mixture with the mother’s DNA plus the occasional mutation is what makes the Y chromosome similar enough to match against other men from the same ancestors for hundreds or thousands of years back in time, and different enough to be useful for genealogy. The mutations can be tracked within extended families.

In western cultures, the Y chromosome path of inheritance is usually the same as the surname, which means that the Y chromosome is uniquely positioned to identify the direct biological patrilineal lineage of males.

Two different types of Y DNA tests can be ordered that work together to refine Y DNA results and connect testers to other men with common ancestors.

FamilyTreeDNA provides STR tests with their 37, 67 and 111 marker test panels, and comprehensive STR plus SNP testing with their Big Y-700 test.

click to enlarge

STR markers are used for genealogy matching, while SNP markers work with STR markers to refine genealogy further, plus provide a detailed haplogroup.

Think of a haplogroup as a genetic clan that tells you which genetic family group you belong to – both today and historically, before the advent of surnames.

This article, What is a Haplogroup? explains the basic concept of how haplogroups are determined.

In addition to the Y DNA test itself, Family Tree DNA provides matching to other testers in their database plus a group of comprehensive tools, shown on the dashboard above, to help testers utilize their results to their fullest potential.

You can order or upgrade a Y DNA test, here. If you also purchase the Family Finder, autosomal test, those results can be used to search together.

Step-by-Step – Using Your Y DNA Results

Let’s take a look at all of the features, functions, and tools that are available on your FamilyTreeDNA personal page.

What do those words mean? Here you go!

Come along while I step through evaluating Big Y test results.

Big Y Testing and Results

Why would you want to take a Big Y test and how can it help you?

While the Big Y-500 has been superseded by the Big Y-700 test today, you will still be interested in some of the underlying technology. STR matching still works the same way.

The Big Y-500 provided more than 500 STR markers and the Big Y-700 provides more than 700 – both significantly more than the 111 panel. The only way to receive these additional markers is by purchasing the Big Y test.

I have to tell you – I was skeptical when the Big Y-700 was introduced as the next step above the Big Y-500. I almost didn’t upgrade any kits – but I’m so very glad that I did. I’m not skeptical anymore.

This Y DNA tree rocks. A new visual format with your matches listed on their branches. Take a look!

Educational Articles

I’ve been writing about DNA for years and have selected several articles that you may find useful.

What kinds of information are available if you take a Y DNA test, and how can you use it for genealogy?

What if your father isn’t available to take a DNA test? How can you determine who else to test that will reveal your father’s Y DNA information?

Family Tree DNA shows the difference in the number of mutations between two men as “genetic distance.” Learn what that means and how it’s figured in this article.

Of course, there were changes right after I published the original Genetic Distance article. The only guarantees in life are death, taxes, and that something will change immediately after you publish.

Sometimes when we take DNA tests, or others do, we discover the unexpected. That’s always a possibility. Here’s the story of my brother who wasn’t my biological brother. If you’d like to read more about Dave’s story, type “Dear Dave” into the search box on my blog. Read the articles in publication order, and not without a box of Kleenex.

Often, what surprise matches mean is that you need to dig further.

The words paternal and patrilineal aren’t the same thing. Paternal refers to the paternal half of your family, where patrilineal is the direct father to father line.

Just because you don’t have any surname matches doesn’t necessarily mean it’s because of what you’re thinking.

Short tandem repeats (STRs) and single nucleotide polymorphisms (SNPs) aren’t the same thing and are used differently in genealogy.

Piecing together your ancestor’s Y DNA from descendants.

Haplogroups are something like our pedigree charts.

What does it mean when you have a zero for a marker value?

There’s more than one way to break down that brick wall. Here’s how I figured out which of 4 sons was my ancestor.

Just because you match the right line autosomally doesn’t mean it’s because you descend from the male child you think is your ancestor. Females gave their surnames to children born outside of a legal marriage which can lead to massive confusion. This is absolutely why you need to test the Y DNA of every single ancestral line.

When the direct patrilineal line isn’t the line you’re expecting.

You can now tell by looking at the flags on the haplotree where other people’s ancestral lines on your branch are from. This is especially useful if you’ve taken the Big Y test and can tell you if you’re hunting in the right location.

If you’re just now testing or tested in 2018 or after, you don’t need to read this article unless you’re interested in the improvements to the Big Y test over the years.

2019 was a banner year for discovery. 2020 was even more so, keeping up an amazing pace. I need to write a 2020 update article.

What is a terminal SNP? Hint – it’s not fatal😊

How the TIP calculator works and how to best interpret the results. Note that this tool is due for an update that incorporates more markers and SNP results too.

You can view the location of the Y DNA and mitochondrial DNA ancestors of people whose ethnicity you match.

Tools and Techniques

This free public tree is amazing, showing locations of each haplogroup and totals by haplogroup and country, including downstream branches.

Need to search for and find Y DNA candidates when you don’t know anyone from that line? Here’s how.

Yes, it’s still possible to resolve this issue using autosomal DNA. Non-matching Y DNA isn’t the end of the road, just a fork.

Science Meets Genealogy – Including Ancient DNA

Haplogroup C was an unexpected find in the Americas and reaches into South America.

Haplogroup C is found in several North American tribes.

Haplogroup C is found as far east as Nova Scotia.

Test by test, we made progress.

New testers, new branches. The research continues.

The discovery of haplogroup A00 was truly amazing when it occurred – the base of the phylotree in Africa.

The press release about the discovery of haplogroup A00.

In 2018, a living branch of A00 was discovered in Africa, and in 2020, an ancient DNA branch.

Did you know that haplogroups weren’t always known by their SNP names?

This brought the total of SNPs discovered by Family Tree DNA in mid-2018 to 153,000. I should contact the Research Center to see how many they have named at the end of 2020.

An academic paper split ancient haplogroup D, but then the phylogenetic research team at FamilyTreeDNA split it twice more! This might not sound exciting until you realize this redefines what we know about early man, in Africa and as he emerged from Africa.

Ancient DNA splits haplogroup P after analyzing the remains of two Jehai people from West Malaysia.

For years I doubted Kennewick Man’s DNA would ever be sequenced, but it finally was. Kennewick Man’s mitochondrial DNA haplogroup is X2a and his Y DNA was confirmed to Q-M3 in 2015.

Compare your own DNA to Vikings!

Twenty-seven Icelandic Viking skeletons tell a very interesting story.

Irish ancestors? Check your DNA and see if you match.

Ancestors from Hungary or Italy? Take a look. These remains have matches to people in various places throughout Europe.

The Y DNA story is no place near finished. Dr. Miguel Vilar, former Lead Scientist for National Geographic’s Genographic Project provides additional analysis and adds a theory.

Webinars

Y DNA Webinar at Legacy Family Tree Webinars – a 90-minute webinar for those who prefer watching to learn! It’s not free, but you can subscribe here.

Success Stories and Genealogy Discoveries

Almost everyone has their own Y DNA story of discovery. Because the Y DNA follows the surname line, Y DNA testing often helps push those lines back a generation, or two, or four. When STR markers fail to be enough, we can turn to the Big Y-700 test which provides SNP markers down to the very tip of the leaves in the Y DNA tree. Often, but not always, family-defining SNP branches will occur which are much more stable and reliable than STR mutations – although SNPs and STRs should be used together.

Methodologies to find ancestral lines to test, or maybe descendants who have already tested.

DNA testing reveals an unexpected mystery several hundred years old.

When I write each of my “52 Ancestor” stories, I include genetic information, for the ancestor and their descendants, when I can. Jacob was special because, in addition to being able to identify his autosomal DNA, his Y DNA matches the ancient DNA of the Yamnaya people. You can read about his Y DNA story in Jakob Lenz (1748-1821), Vinedresser.

Please feel free to add your success stories in the comments.

What About You?

You never know what you’re going to discover when you test your Y DNA. If you’re a female, you’ll need to find a male that descends from the line you want to test via all males to take the Y DNA test on your behalf. Of course, if you want to test your father’s line, your father, or a brother through that father, or your uncle, your father’s brother, would be good candidates.

What will you be able to discover? Who will the earliest known ancestor with that same surname be among your matches? Will you be able to break down a long-standing brick wall? You’ll never know if you don’t test.

You can click here to upgrade an existing test or order a Y DNA test.

Share the Love

You can always forward these articles to friends or share by posting links on social media. Who do you know that might be interested?

_____________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Products and Services

Genealogy Research

Books

Most Popular Articles of 2020

We all know that 2020 was a year like no other, right? So, what were we reading this year as we spent more time at home?

According to my blog stats, these are the ten most popular articles of 2020.

2020 Rank Blog Article Name Publication Date/Comment
1 Concepts – Calculating Ethnicity Percentages Jan 11, 2017
2 Proving Native American Ancestry Using DNA December 18, 2012
3 Ancestry to Remove DNA Matches Soon – Preservation Strategies with Detailed Instructions Now obsolete article – July 16, 2020
4 Ancestral DNA Percentages – How Much of Them is in You? June 27, 2017
5 Full or Half Siblings? April 3, 2019
6 442 Ancient Viking Skeletons Hold DNA Surprises – Does Your Y or Mitochondrial DNA Match? September 18, 2020
7 Migration Pedigree Chart March 25, 2016
8 DNA Inherited from Grandparents and Great-Grandparents January 14, 2020
9 Optimizing Your Tree at Ancestry for More Hints and DNA ThruLines February 22, 2020
10 Phylogenetic Tree of Novel Coronavirus (hCoV-19) Covid-19 March 12, 2020

Half of these articles were published this year, and half are older.

One article is now obsolete. The Ancestry purge has already happened, so there’s nothing to be done now.

Let’s take a look at the rest and what messages might be held in these popular selections.

Ethnicity

I’m not the least bit surprised by ethnicity being the most popular topic, nor that Concepts – Calculating Ethnicity Percentages is the most popular article. Not only is ethnicity a perennially favorite, but all four major vendors introduced something new this year.

By the way, my perennial caveat still applies – ethnicity is only an estimate😊

While Genetic Groups isn’t actually ethnicity, per se, it’s a layer on top of ethnicity that provides you with locations where your ancestors might have been from and migrated to, based on genetic clusters. Clusters are defined by the locations of ancestors of other people within that genetic cluster.

There’s actually good news at 23andMe. Since this article was published in October, 23andMe has indeed updated the V3 and V4 kits with new ethnicity updates. 23andMe had originally stated they weren’t going to do that, clearly in the hope that people would pay to retest by purchasing the V5 Health + Ancestry test. I’m so glad to see their reversal.

Viewing the older V2 kits, the “updated” date at the bottom of their Ancestry Composition page says they were updated on December 9th or 10th, but I don’t see a difference and they don’t have the “updated” icon like the V3 and V4 kits do.

23andMe made another reversal too and also restored the original matches. They had reduced the number of matches to 1500 for non-Health+Ancestry testers who don’t also subscribe. If you wanted between 1500 and 5000 matches, you had to retest and subscribe for $29 per year. (It’s worth noting that I have over 5000 matches at all of the other vendors.)

To date, 23andMe has restored previous matches and also restored some but not all of the search functionality that they had removed.

What isn’t clear is whether 23andMe will continue to add to this number of matches until the tester reaches the earlier limit of 2000, or whether they have simply restored the previous matches, but the match total will not increase unless you have a subscription.

Consumer feedback works – so thanks to everyone who provided feedback to 23andMe.

Native American Ancestry

The article, Proving Native American Ancestry Using DNA, written 8 years ago, only 5 months after launching this blog, has been in the top 10 every year since I’ve been counting.

I created a Native American reference and resource page too, which you can find here.

I’ll also be publishing some new articles after the first of the year which I promise you’ll find VERY INTERESTING. Something to look forward to.

Understanding Autosomal DNA

2020 has seen more people delving into genealogy + DNA testing which means they need to understand both the results and the concepts underlying their results.

Whooohooo – more people in the pool. Jump on in – the water’s fine!

The articles Ancestral DNA Percentages – How Much of Them is in You? and DNA Inherited from Grandparents and Great-Grandparents both explain how DNA is passed from your ancestors to you.

These are great basic articles if you’re looking to help someone new, and so is First Steps When Your DNA Results are Ready – Sticking Your Toe in the Genealogy Water.

I always look forward to the end of January because there will be lots of matches from holiday gifts being posted. Feel free to forward any of these articles to your new matches. It’s always fun helping new people because you just never know when they might be able to help you.

Surprises

With more and more people testing, more and more people are receiving “surprises” in their results. Need to figure out the difference between full and half-siblings? Then Full or Half Siblings? is the article for you.

Trying to discern other relationships? My favorite tool is the Shared cM Project tool at DNAPainter, here.

Vikings

Who doesn’t want to know if they are related to the ancient Vikings??? You can make that discovery in the article, 442 Ancient Viking Skeletons Hold DNA Surprises – Does Your Y or Mitochondrial DNA Match?. Not only is this just plain fun, but I snuck in a little education too.

Of course, you’ll need to have your Y DNA or mitochondrial DNA results, which you can easily order, here. If you’re unsure and would like to read a short article about the different kinds of DNA and how they can help you, 4 Kinds of DNA for Genetic Genealogy is perfect.

Do you think your DNA isn’t Viking because your ancestors aren’t from Scandinavia? Guess again!

Those Vikings didn’t stay home, and they didn’t restrict their escapades to the British Isles either.

This drawing depicts Viking ships besieging Paris in the year 845. Vikings voyaged into Russia and as far as the Mediterranean.

Have a child studying at home? This might be an interesting topic!

Migration Pedigree Chart

Another just plain fun idea is the Migration Pedigree Chart.

I created this migration pedigree chart in a spreadsheet, but you can also create a pedigree chart in genealogy software with whatever “names” you want. This will also help you figure out the estimated percentages of ethnicity you might reasonably expect.

Another idea for helping kids learn at home and they might accidentally learn about figuring percentages in the process.

ThruLines

ThruLines is the Ancestry tool that assists DNA testers with trees connect the dots to common ancestors with their matches. There are ways to optimize your tree to improve your connections, both in terms of accuracy and the number of Thrulines that form.

Optimizing Your Tree at Ancestry for More Hints and DNA ThruLines provides step by step instructions, which reminds me – I need to write a similar article for MyHeritage’s Theories of Family Relativity. I keep meaning to…

Covid

You know, it wouldn’t be 2020 if I didn’t HAVE to mention that word.

I’m glad to know that people were and hopefully still are educating themselves about Covid. Phylogenetic Tree of Novel Coronavirus (hCoV-19) Covid-19 reflected early information about the novel virus and our first efforts to sequence the DNA. Of course, as expected, just like any other organism, mutations have occurred since then.

Goodness knows, we are all tired of Covid and the resulting safety protocols. Keep on keeping on. We need you on the other side.

Stay home, mask up when you must leave, stay away from other people outside your family that you live with, wash your hands, and get vaccinated as soon as you can.

And until we can all see each other in person again, hopefully, sooner than later, keep on doing genealogy.

Locked in the Library

Be careful what you ask for.

Remember that dream where you’re locked in a library? Remember saying you don’t have enough time for genealogy?

Well, now you are and now you do.

The library is your desk with your computer or maybe your laptop on a picnic table in the yard.

DNA results, matches, and research tools are the books and you’re officially locked in for at least a few more weeks. Free articles like these are your guide.

Hmmm, pandemic isolation doesn’t sound so bad now, does it??

We’ll just rename it “genealogy library lock-in.”

Happy New Year!

What can you discover?

_____________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Products and Services

Genealogy Research

Books

Free Y DNA Webinar at Legacy Family Tree Webinars

I just finished recording a new, updated Y DNA webinar, “Wringing Every Drop out of Y DNA” for Legacy Family Tree Webinars and it’s available for viewing now.

This webinar is packed full of information about Y DNA testing. We discuss the difference between STR markers, SNPs and the Big Y test. Of course, the goal is to use these tests in the most advantageous way for genealogy, so I walk you through each step. There’s so much available that sometimes people miss critical pieces!

FamilyTreeDNA provides a wide variety of tools for each tester in addition to advanced matching which combines Y DNA along with the Family Finder autosomal test. Seeing who you match on both tests can help identify your most recent common ancestor! You can order or upgrade to either or both tests, here.

During this 90 minute webinar, I covered several topics.

There’s also a syllabus that includes additional resources.

At the end, I summarized all the information and show you what I’ve done with my own tree, illustrating how useful this type of testing can be, even for women.

No, women can’t test directly, but we can certainly recruit appropriate men for each line or utilize projects to see if our lines have already tested. I provide tips and hints about how to successfully accomplish that too.

Free for a Limited Time

Who doesn’t love FREE???

The “Squeezing Every Drop out of Y DNA” webinar is free to watch right now, and will remain free through Wednesday, October 14, 2020. On the main Legacy Family Tree Webinar page, here, just scroll down to the “Webinar Library – New” area to see everything that’s new and free.

If you’re a Legacy Farmily Tree Webinar member, all webinars are included with your membership, of course. I love the great selection of topics, with more webinars being added by people you know every week. This is the perfect time to sign up, with fall having arrived in all its golden glory and people spending more time at home right now.

More than 4000 viewers have enjoyed this webinar since yesterday, and I think you will too. Let’s hope lots of people order Y DNA tests so everyone has more matches! You just never know who’s going to be the right match to break down those brick walls or extend your line back a few generations or across the pond, perhaps.

You can view this webinar after October 14th as part of a $49.95 annual membership. If you’d like to join, click here and use the discount code ydna10 through October 13th.

_____________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Products and Services

Genealogy Research

Concepts: Inheritance

Inheritance.

What is it?

How does it work?

I’m not talking about possessions – but about the DNA that you receive from your parents, and their parents.

The reason that genetic genealogy works is because of inheritance. You inherit DNA from your parents in a known and predictable fashion.

Fortunately, we have more than one kind of DNA to use for genealogy.

Types of DNA

Females have 3 types of DNA and males have 4. These different types of DNA are inherited in various ways and serve different genealogical purposes.

Males Females
Y DNA Yes No
Mitochondrial DNA Yes Yes
Autosomal DNA Yes Yes
X Chromosome Yes, their mother’s only Yes, from both parents

Different Inheritance Paths

Different types of DNA are inherited from different ancestors, down different ancestral paths.

Inheritance Paths

The inheritance path for Y DNA is father to son and is inherited by the brother, in this example, from his direct male ancestors shown by the blue arrow. The sister does not have a Y chromosome.

The inheritance path for the red mitochondrial DNA for both the brother and sister is from the direct matrilineal ancestors, only, shown by the red arrow.

Autosomal DNA is inherited from all ancestral lines on both the father’s and mother’s side of your tree, as illustrated by the broken green arrow.

The X chromosome has a slightly different inheritance path, depending on whether you are a male or female.

Let’s take a look at each type of inheritance, how it works, along with when and where it’s useful for genealogy.

Autosomal DNA

Autosomal DNA testing is the most common. It’s the DNA that you inherit from both of your parents through all ancestral lines back in time several generations. Autosomal DNA results in matches at the major testing companies such as FamilyTreeDNA, MyHeritage, Ancestry, and 23andMe where testers view trees or other hints, hoping to determine a common ancestor.

How does autosomal DNA work?

22 autosomes

Every person has two each of 22 chromosomes, shown above, meaning one copy is contributed by your mother and one copy by your father. Paired together, they form the two-sided shape we are familiar with.

For each pair of chromosomes, you receive one from your father, shown with a blue arrow under chromosome 1, and one from your mother, shown in red. In you, these are randomly combined, so you can’t readily tell which piece comes from which parent. Therein lies the challenge for genealogy.

This inheritance pattern is the same for all chromosomes, except for the 23rd pair of chromosomes, at bottom right, which determined the sex of the child.

The 23rd chromosome pair is inherited differently for males and females. One copy is the Y chromosome, shown in blue, and one copy is the X, shown in red. If you receive a Y chromosome from your father, you’re a male. If you receive an X from your father, you’re a female.

Autosomal Inheritance

First, let’s talk about how chromosomes 1-22 are inherited, omitting chromosome 23, beginning with grandparents.

Inheritance son daughter

Every person inherits precisely half of each of their parents’ autosomal DNA. For example, you will receive one copy of your mother’s chromosome 1. Your mother’s chromosome 1 is a combination of her mother’s and father’s chromosome 1. Therefore, you’ll receive ABOUT 25% of each of your grandparents’ chromosome 1.

Inheritance son daughter difference

In reality, you will probably receive a different amount of your grandparent’s DNA, not exactly 25%, because your mother or father will probably contribute slightly more (or less) of the DNA of one of their parents than the other to their offspring.

Which pieces of DNA you inherit from your parents is random, and we don’t know how the human body selects which portions are and are not inherited, other than we know that large pieces are inherited together.

Therefore, the son and daughter won’t inherit the exact same segments of the grandparents’ DNA. They will likely share some of the same segments, but not all the same segments.

Inheritance maternal autosomalYou’ll notice that each parent carries more of each color DNA than they pass on to their own children, so different children receive different pieces of their parents’ DNA, and varying percentages of their grandparents’ DNA.

I wrote about a 4 Generation Inheritance Study, here.

Perspective

Keep in mind that you will only inherit half of the DNA that each of your parents carries.

Looking at a chromosome browser, you match your parents on all of YOUR chromosomes.

Inheritance parental autosomal

For example, this is me compared to my father. I match my father on either his mother’s side, or his father’s side, on every single location on MY chromosomes. But I don’t match ALL of my father’s DNA, because I only received half of what he has.

From your parents’ perspective, you only have half of their DNA.

Let’s look at an illustration.

Inheritance mom dad

Here is an example of one of your father’s pairs of chromosomes 1-22. It doesn’t matter which chromosome, the concepts are the same.

He inherited the blue chromosome from his father and the pink chromosome from his mother.

Your father contributed half of his DNA to you, but that half is comprised of part of his father’s chromosome, and part of his mother’s chromosome, randomly selected in chunks referred to as segments.

Inheritance mom dad segments

Your father’s chromosomes are shown in the upper portion of the graphic, and your chromosome that you inherited from you father is shown below.

On your copy of your father’s chromosome, I’ve darkened the dark blue and dark pink segments that you inherited from him. You did not receive the light blue and light pink segments. Those segments of DNA are lost to your line, but one of your siblings might have inherited some of those pieces.

Inheritance mom dad both segments

Now, I’ve added the DNA that you inherited from your Mom into the mixture. You can see that you inherited the dark green from your Mom’s father and the dark peach from your Mom’s mother.

Inheritance grandparents dna

These colored segments reflect the DNA that you inherited from your 4 grandparents on this chromosome.

I often see questions from people wondering how they match someone from their mother’s side and someone else from their father’s side – on the same segment.

Understanding that you have a copy of the same chromosome from your mother and one from your father clearly shows how this happens.

Inheritance match 1 2

You carry a chromosome from each parent, so you will match different people on the same segment. One match is to the chromosome copy from Mom, and one match is to Dad’s DNA.

Inheritance 4 gen

Here is the full 4 generation inheritance showing Match 1 matching a segment from your Dad’s father and Match 2 matching a segment from your Mom’s father.

Your Parents Will Have More Matches Than You Do

From your parents’ perspective, you will only match (roughly) half of the DNA with other people that they will match. On your Dad’s side, on segment 1, you won’t match anyone pink because you didn’t inherit your paternal grandmother’s copy of segment 1, nor did you inherit your maternal grandmother’s segment 1 either. However, your parents will each have matches on those segments of DNA that you didn’t inherit from them.

From your perspective, one or the other of your parents will match ALL of the people you match – just like we see in Match 1 and Match 2.

Matching you plus either of your parents, on the same segment, is exactly how we determine whether a match is valid, meaning identical by descent, or invalid, meaning identical by chance. I wrote about that in the article, Concepts: Identical by…Descent, State, Population and Chance.

Inheritance on chromosomes 1-22 works in this fashion. So does the X chromosome, fundamentally, but the X chromosome has a unique inheritance pattern.

X Chromosome

The X chromosome is inherited differently for males as compared to females. This is because the 23rd pair of chromosomes determines a child’s sex.

If the child is a female, the child inherits an X from both parents. Inheritance works the same way as chromosomes 1-22, conceptually, but the inheritance path on her father’s side is different.

If the child is a male, the father contributes a Y chromosome, but no X, so the only X chromosome a male has is his mother’s X chromosome.

Males inherit X chromosomes differently than females, so a valid X match can only descend from certain ancestors on your tree.

inheritance x fan

This is my fan chart showing the X chromosome inheritance path, generated by using Charting Companion. My father’s paternal side of his chart is entirely blank – because he only received his X chromosome from his mother.

You’ll notice that the X chromosome can only descend from any male though his mother – the effect being a sort of checkerboard inheritance pattern. Only the pink and blue people potentially contributed all or portions of X chromosomes to me.

This can actually be very useful for genealogy, because several potential ancestors are immediately eliminated. I cannot have any X chromosome segment from the white boxes with no color.

The X Chromsome in Action

Here’s an X example of how inheritance works.

Inheritance X

The son inherits his entire X chromosome from his mother. She may give him all of her father’s or mother’s X, or parts of both. It’s not uncommon to find an entire X chromosome inherited. The son inherits no X from his father, because he inherits the Y chromosome instead.

Inheritance X daughter

The daughter inherits her father’s X chromosome, which is the identical X chromosome that her father inherited from his mother. The father doesn’t have any other X to contribute to his daughter, so like her father, she inherits no portion of an X chromosome from her paternal grandfather.

The daughter also received segments of her mother’s X that her mother inherited maternally and paternally. As with the son, the daughter can receive an entire X chromosome from either her maternal grandmother or maternal grandfather.

This next illustration ONLY pertains to chromosome 23, the X and Y chromosomes.

Inheritance x y

You can see in this combined graphic that the Y is only inherited by sons from one direct line, and the father’s X is only inherited by his daughter.

X chromosome results are included with autosomal results at both Family Tree DNA and 23andMe, but are not provided at MyHeritage. Ancestry, unfortunately, does not provide segment information of any kind, for the X or chromosomes 1-22. You can, however, transfer the DNA files to Family Tree DNA where you can view your X matches.

Note that X matches need to be larger than regular autosomal matches to be equally as useful due to lower SNP density. I use 10-15 cM as a minimum threshold for consideration, equivalent to about 7 cM for autosomal matches. In other words, roughly double the rule of thumb for segment size matching validity.

Autosomal Education

My blog is full of autosomal educational articles and is fully keyword searchable, but here are two introductory articles that include information from the four major vendors:

When to Purchase Autosomal DNA Tests

Literally, anytime you want to work on genealogy to connect with cousins, prove ancestors or break through brick walls.

  • Purchase tests for yourself and your siblings if both parents aren’t living
  • Purchase tests for both parents
  • Purchase tests for all grandparents
  • Purchase tests for siblings of your parents or your grandparents – they have DNA your parents (and you) didn’t inherit
  • Test all older generation family members
  • If the family member is deceased, test their offspring
  • Purchase tests for estimates of your ethnicity or ancestral origins

Y DNA

Y DNA is only inherited by males from males. The Y chromosome is what makes a male, male. Men inherit the Y chromosome intact from their father, with no contribution from the mother or any female, which is why men’s Y DNA matches that of their father and is not diluted in each generation.

Inheritance y mtdna

If there are no adoptions in the line, known or otherwise, the Y DNA will match men from the same Y DNA line with only small differences for many generations. Eventually, small changes known as mutations accrue. After many accumulated mutations taking several hundred years, men no longer match on special markers called Short Tandem Repeats (STR). STR markers generally match within the past 500-800 years, but further back in time, they accrue too many mutations to be considered a genealogical-era match.

Family Tree DNA sells this test in 67 and 111 marker panels, along with a product called the Big Y-700.

The Big Y-700 is the best-of-class of Y DNA tests and includes at least 700 STR markers along with SNPs which are also useful genealogically plus reach further back in time to create a more complete picture.

The Big Y-700 test scans the entire useful portion of the Y chromosome, about 15 million base pairs, as compared to 67 or 111 STR locations.

67 and 111 Marker Panel Customers Receive:

  • STR marker matches
  • Haplogroup estimate
  • Ancestral Origins
  • Matches Map showing locations of the earliest known ancestors of matches
  • Haplogroup Origins
  • Migration Maps
  • STR marker results
  • Haplotree and SNPs
  • SNP map

Y, mitochondrial and autosomal DNA customers all receive options for Advanced Matching.

Big Y-700 customers receive, in addition to the above:

  • All of the SNP markers in the known phylotree shown publicly, here
  • A refined, definitive haplogroup
  • Their place on the Block Tree, along with their matches
  • New or unknown private SNPs that might lead to a new haplogroup, or genetic clan, assignment
  • 700+ STR markers
  • Matching on both the STR markers and SNP markers, separately

Y DNA Education

I wrote several articles about understanding and using Y DNA:

When to Purchase Y DNA Tests

The Y DNA test is for males who wish to learn more about their paternal line and match against other men to determine or verify their genealogical lineage.

Women cannot test directly, but they can purchase the Y DNA test for men such as fathers, brothers, and uncles.

If you are purchasing for someone else, I recommend purchasing the Big Y-700 initially.

Why purchase the Big Y-700, when you can purchase a lower level test for less money? Because if you ever want to upgrade, and you likely will, you have to contact the tester and obtain their permission to upgrade their test. They may be ill, disinterested, or deceased, and you may not be able to upgrade their test at that time, so strike while the iron is hot.

The Big Y-700 provides testers, by far, the most Y DNA data to work (and fish) with.

Mitochondrial DNA

Inheritance mito

Mitochondrial DNA is passed from mothers to both sexes of their children, but only females pass it on.

In your tree, you and your siblings all inherit your mother’s mitochondrial DNA. She inherited it from her mother, and your grandmother from her mother, and so forth.

Mitochondrial DNA testers at FamilyTreeDNA receive:

  • A definitive haplogroup, thought of as a genetic clan
  • Matching
  • Matches Map showing locations of the earliest know ancestors of matches
  • Personalized mtDNA Journey video
  • Mutations
  • Haplogroup origins
  • Ancestral origins
  • Migration maps
  • Advanced matching

Of course, Y, mitochondrial and autosomal DNA testers can join various projects.

Mitochondrial DNA Education

I created a Mitochondrial DNA page with a comprehensive list of educational articles and resources.

When to Purchase Mitochondrial DNA Tests

Mitochondrial DNA can be valuable in terms of matching as well as breaking down brick walls for women ancestors with no surnames. You can also use targeted testing to prove, or disprove, relationship theories.

Furthermore, your mitochondrial DNA haplogroup, like Y DNA haplogroups, provides information about where your ancestors came from by identifying the part of the world where they have the most matches.

You’ll want to purchase the mtFull sequence test provided by Family Tree DNA. Earlier tests, such as the mtPlus, can be upgraded. The full sequence test tests all 16,569 locations on the mitochondria and provides testers with the highest level matching as well as their most refined haplogroup.

The full sequence test is only sold by Family Tree DNA and provides matching along with various tools. You’ll also be contributing to science by building the mitochondrial haplotree of womankind through the Million Mito Project.

Combined Resources for Genealogists

You may need to reach out to family members to obtain Y and mitochondrial DNA for your various genealogical lines.

For example, the daughter in the tree below, a genealogist, can personally take an autosomal test along with a mitochondrial test for her matrilineal line, but she cannot test for Y DNA, nor can she obtain her paternal grandmother’s mitochondrial DNA directly by testing herself.

Hearts represent mitochondrial DNA, and stars, Y DNA.

Inheritance combined

However, our genealogist’s brother, father or grandfather can test for her father’s (blue star) Y DNA.

Her father or any of his siblings can test for her paternal grandmother’s (hot pink heart) mitochondrial DNA, which provides information not available from any other tester in this tree, except for the paternal grandmother herself.

Our genealogist’s paternal grandfather, and his siblings, can test for his mother’s (yellow heart) mitochondrial DNA.

Our genealogist’s maternal grandfather can test for his (green star) Y DNA and (red heart) mitochondrial DNA.

And of course, it goes without saying that every single generation upstream of the daughter, our genealogist, should all take autosomal DNA tests.

So, with several candidates, who can and should test for what?

Person Y DNA Mitochondrial Autosomal
Daughter No Y – can’t test Yes, her pink mother’s Yes – Test
Son Yes – blue Y Yes, his pink mother’s Yes – Test
Father Yes – blue Y Yes – his magenta mother’s Yes – Test
Paternal Grandfather Yes – blue Y – Best to Test Yes, his yellow mother’s – Test Yes – Test
Mother No Y – can’t test Yes, her pink mother’s Yes – Test
Maternal Grandmother No Y – can’t test Yes, her pink mother’s – Best to Test Yes – Test
Maternal Grandfather Yes – green Y – Test Yes, his red mother’s – Test Yes – Test

The best person/people to test for each of the various lines and types of DNA is shown bolded above…assuming that all people are living. Of course, if they aren’t, then test anyone else in the tree who carries that particular DNA – and don’t forget to consider aunts and uncles, or their children, as candidates.

If one person takes the Y and/or mitochondrial DNA test to represent a specific line, you don’t need another person to take the same test for that line. The only possible exception would be to confirm a specific Y DNA result matches a lineage as expected.

Looking at our three-generation example, you’ll be able to obtain a total of two Y DNA lines, three mitochondrial DNA lines, and 8 autosomal results, helping you to understand and piece together your family line.

You might ask, given that the parents and grandparents have all autosomally tested in this example, if our genealogist really needs to test her brother, and the answer is probably not – at least not today.

However, in cases like this, I do test the sibling, simply because I can learn and it may encourage their interest or preserve their DNA for their children who might someday be interested. We also don’t know what kind of advances the future holds.

If the parents aren’t both available, then you’ll want to test as many of your (and their) siblings as possible to attempt to recover as much of the parents’ DNA, (and matches) as possible.

Your family members’ DNA is just as valuable to your research as your own.

Increase Your Odds

Don’t let any of your inherited DNA go unused.

You can increase your odds of having autosomal matches by making sure you are in all 4 major vendor databases.

Both FamilyTreeDNA and MyHeritage accept transfers from 23andMe and Ancestry, who don’t accept transfers. Transferring and matching is free, and their unlock fees, $19 at FamilyTreeDNA, and $29 at MyHeritage, respectively, to unlock their advanced tools are both less expensive than retesting.

You’ll find easy-to-follow step-by-step transfer instructions to and from the vendors in the article DNA File Upload-Download and Transfer Instructions to and from DNA Testing Companies.

Order

You can order any of the tests mentioned above by clicking on these links:

Autosomal:

Transfers

_____________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Products and Services

Genealogy Research

Y DNA: Step-by-Step Big Y Analysis

Many males take the Big Y-700 test offered by FamilyTreeDNA, so named because testers receive the most granular haplogroup SNP results in addition to 700+ included STR marker results. If you’re not familiar with those terms, you might enjoy the article, STRs vs SNPs, Multiple DNA Personalities.

The Big Y test gives testers the best of both, along with contributing to the building of the Y phylotree. You can read about the additions to the Y tree via the Big Y, plus how it helped my own Estes project, here.

Some men order this test of their own volition, some at the request of a family member, and some in response to project administrators who are studying a specific topic – like a particular surname.

The Big Y-700 test is the most complete Y DNA test offered, testing millions of locations on the Y chromosome to reveal mutations, some unique and never before discovered, many of which are useful to genealogists. The Big Y-700 includes the traditional Y DNA STR marker testing along with SNP results that define haplogroups. Translated, both types of test results are compared to other men for genealogy, which is the primary goal of DNA testing.

Being a female, I often recruit males in my family surname lines and sponsor testing. My McNiel line, historic haplogroup R-M222, has been particularly frustrating both genealogically as well as genetically after hitting a brick wall in the 1700s. My McNeill cousin agreed to take a Big Y test, and this analysis walks through the process of understanding what those results are revealing.

After my McNeill cousin’s Big Y results came back from the lab, I spent a significant amount of time turning over every leaf to extract as much information as possible, both from the Big Y-700 DNA test itself and as part of a broader set of intertwined genetic information and genealogical evidence.

I invite you along on this journey as I explain the questions we hoped to answer and then evaluate Big Y DNA results along with other information to shed light on those quandaries.

I will warn you, this article is long because it’s a step-by-step instruction manual for you to follow when interpreting your own Big Y results. I’d suggest you simply read this article the first time to get a feel for the landscape, before working through the process with your own results. There’s so much available that most people leave laying on the table because they don’t understand how to extract the full potential of these test results.

If you’d like to read more about the Big Y-700 test, the FamilyTreeDNA white paper is here, and I wrote about the Big Y-700 when it was introduced, here.

You can read an overview of Y DNA, here, and Y DNA: The Dictionary of DNA, here.

Ok, get yourself a cuppa joe, settle in, and let’s go!

George and Thomas McNiel – Who Were They?

George and Thomas McNiel appear together in Spotsylvania County, Virginia records. Y DNA results, in combination with early records, suggest that these two men were brothers.

I wrote about discovering that Thomas McNeil’s descendant had taken a Y DNA test and matched George’s descendants, here, and about my ancestor George McNiel, here.

McNiel family history in Wilkes County, NC, recorded in a letter written in 1898 by George McNiel’s grandson tells us that George McNiel, born about 1720, came from Scotland with his two brothers, John and Thomas. Elsewhere, it was reported that the McNiel brothers sailed from Glasgow, Scotland and that George had been educated at the University of Edinburgh for the Presbyterian ministry but had a change of religious conviction during the voyage. As a result, a theological tiff developed that split the brothers.

George, eventually, if not immediately, became a Baptist preacher. His origins remain uncertain.

The brothers reportedly arrived about 1750 in Maryland, although I have no confirmation. By 1754, Thomas McNeil appeared in the Spotsylvania County, VA records with a male being apprenticed to him as a tailor. In 1757, in Spotsylvania County, the first record of George McNeil showed James Pey being apprenticed to learn the occupation of tailor.

If George and Thomas were indeed tailors, that’s not generally a country occupation and would imply that they both apprenticed as such when they were growing up, wherever that was.

Thomas McNeil is recorded in one Spotsylvania deed as being from King and Queen County, VA. If this is the case, and George and Thomas McNiel lived in King and Queen, at least for a time, this would explain the lack of early records, as King and Queen is a thrice-burned county. If there was a third brother, John, I find no record of him.

My now-deceased cousin, George McNiel, initially tested for the McNiel Y DNA and also functioned for decades as the family historian. George, along with his wife, inventoried the many cemeteries of Wilkes County, NC.

George believed through oral history that the family descended from the McNiel’s of Barra.

McNiel Big Y Kisumul

George had this lovely framed print of Kisimul Castle, seat of the McNiel Clan on the Isle of Barra, proudly displayed on his wall.

That myth was dispelled with the initial DNA testing when our line did not match the Barra line, as can be seen in the MacNeil DNA project, much to George’s disappointment. As George himself said, the McNiel history is both mysterious and contradictory. Amen to that, George!

McNiel Big Y Niall 9 Hostages

However, in place of that history, we were instead awarded the Niall of the 9 Hostages badge, created many years ago based on a 12 marker STR result profile. Additionally, the McNiel DNA was assigned to haplogroup R-M222. Of course, today’s that’s a far upstream haplogroup, but 15+ years ago, we had only a fraction of the testing or knowledge that we do today.

The name McNeil, McNiel, or however you spell it, resembles Niall, so on the surface, this made at least some sense. George was encouraged by the new information, even though he still grieved the loss of Kisimul Castle.

Of course, this also caused us to wonder about the story stating our line had originated in Scotland because Niall of the 9 Hostages lived in Ireland.

Niall of the 9 Hostages

Niall of the 9 Hostages was reportedly a High King of Ireland sometime between the 6th and 10th centuries. However, actual historical records place him living someplace in the mid-late 300s to early 400s, with his death reported in different sources as occurring before 382 and alternatively about 411. The Annals of the Four Masters dates his reign to 379-405, and Foras Feasa ar Eirinn says from 368-395. Activities of his sons are reported between 379 and 405.

In other words, Niall lived in Ireland about 1500-1600 years ago, give or take.

Migration

Generally, migration was primarily from Scotland to Ireland, not the reverse, at least as far as we know in recorded history. Many Scottish families settled in the Ulster Plantation beginning in 1606 in what is now Northern Ireland. The Scots-Irish immigration to the states had begun by 1718. Many Protestant Scottish families immigrated from Ireland carrying the traditional “Mc” names and Presbyterian religion, clearly indicating their Scottish heritage. The Irish were traditionally Catholic. George could have been one of these immigrants.

We have unresolved conflicts between the following pieces of McNeil history:

  • Descended from McNeil’s of Barra – disproved through original Y DNA testing.
  • Immigrated from Glasgow, Scotland, and schooled in the Presbyterian religion in Edinburgh.
  • Descended from the Ui Neill dynasty, an Irish royal family dominating the northern half of Ireland from the 6th to 10th centuries.

Of course, it’s possible that our McNiel/McNeil line could have been descended from the Ui Neill dynasty AND also lived in Scotland before immigrating.

It’s also possible that they immigrated from Ireland, not Scotland.

And finally, it’s possible that the McNeil surname and M222 descent are not related and those two things are independent and happenstance.

A New Y DNA Tester

Since cousin George is, sadly, deceased, we needed a new male Y DNA tester to represent our McNiel line. Fortunately, one such cousin graciously agreed to take the Big Y-700 test so that we might, hopefully, answer numerous questions:

  • Does the McNiel line have a unique haplogroup, and if so, what does it tell us?
  • Does our McNiel line descend from Ireland or Scotland?
  • Where are our closest geographic clusters?
  • What can we tell by tracing our haplogroup back in time?
  • Do any other men match the McNiel haplogroup, and what do we know about their history?
  • Does the Y DNA align with any specific clans, clan history, or prehistory contributing to clans?

With DNA, you don’t know what you don’t know until you test.

Welcome – New Haplogroup

I was excited to see my McNeill cousin’s results arrive. He had graciously allowed me access, so I eagerly took a look.

He had been assigned to haplogroup R-BY18350.

McNiel Big Y branch

Initially, I saw that indeed, six men matched my McNeill cousin, assigned to the same haplogroup. Those surnames were:

  • Scott
  • McCollum
  • Glass
  • McMichael
  • Murphy
  • Campbell

Notice that I said, “were.” That’s right, because shortly after the results were returned, based on markers called private variants, Family Tree DNA assigned a new haplogroup to my McNeill cousin.

Drum roll please!!!

Haplogroup R-BY18332

McNiel Big Y BY18332

Additionally, my cousin’s Big Y test resulted in several branches being split, shown on the Block Tree below.

McNIel Big Y block tree

How cool is this!

This Block Tree graphic shows, visually, that our McNiel line is closest to McCollum and Campbell testers, and is a brother clade to those branches showing to the left and right of our new R-BY18332. It’s worth noting that BY25938 is an equivalent SNP to BY18332, at least today. In the future, perhaps another tester will test, allowing those two branches to be further subdivided.

Furthermore, after the new branches were added, Cousin McNeill has no more Private Variants, which are unnamed SNPs. There were all utilized in naming additional tree branches!

I wrote about the Big Y Block Tree here.

Niall (Or Whoever) Was Prolific

The first thing that became immediately obvious was how successful our progenitor was.

McNiel Big Y M222 project

click to enlarge

In the MacNeil DNA project, 38 men with various surname spellings descend from M222. There are more in the database who haven’t joined the MacNeil project.

Whoever originally carried SNP R-M222, someplace between 2400 and 5900 years ago, according to the block tree, either had many sons who had sons, or his descendants did. One thing is for sure, his line certainly is in no jeopardy of dying out today.

The Haplogroup R-M222 DNA Project, which studies this particular haplogroup, reads like a who’s who of Irish surnames.

Big Y Match Results

Big Y matches must have no more than 30 SNP differences total, including private variants and named SNPs combined. Named SNPs function as haplogroup names. In other words, Cousin McNeill’s terminal SNP, meaning the SNP furthest down on the tree, R-BY18332, is also his haplogroup name.

Private variants are mutations that have occurred in the line being tested, but not yet in other lines. Occurrences of private variants in multiple testers allow the Private Variant to be named and placed on the haplotree.

Of course, Family Tree DNA offers two types of Y DNA testing, STR testing which is the traditional 12, 25, 37, 67 and 111 marker testing panels, and the Big Y-700 test which provides testers with:

  • All 111 STR markers used for matching and comparison
  • Another 589+ STR markers only available through the Big Y test increasing the total STR markers tested from 111 to minimally 700
  • A scan of the Y chromosome, looking for new and known SNPs and STR mutations

Of course, these tests keep on giving, both with matching and in the case of the Big Y – continued haplogroup discovery and refinement in the future as more testers test. The Big Y is an investment as a test that keeps on giving, not just a one-time purchase.

I wrote about the Big Y-700 when it was introduced here and a bit later here.

Let’s see what the results tell us. We’ll start by taking a look at the matches, the first place that most testers begin.

Mcniel Big Y STR menu

Regular Y DNA STR matching shows the results for the STR results through 111 markers. The Big Y section, below, provides results for the Big Y SNPs, Big Y matches and additional STR results above 111 markers.

McNiel Big Y menu

Let’s take a look.

STR and SNP Testing

Of Cousin McNeil’s matches, 2 Big Y testers and several STR testers carry some variant of the Neal, Neel, McNiel, McNeil, O’Neil, etc. surnames by many spellings.

While STR matching is focused primarily on a genealogical timeframe, meaning current to roughly 500-800 years in the past, SNP testing reaches much further back in time.

  • STR matching reaches approximately 500-800 years.
  • Big Y matching reaches approximately 1500 years.
  • SNPs and haplogroups reach back infinitely, and can be tracked historically beyond the genealogical timeframe, shedding light on our ancestors’ migration paths, helping to answer the age-old question of “where did we come from.”

These STR and Big Y time estimates are based on a maximum number of mutations for testers to be considered matches paired with known genealogy.

Big Y results consider two men a match if they have 30 or fewer total SNP differences. Using NGS (next generation sequencing) scan technology, the targeted regions of the Y chromosome are scanned multiple times, although not all regions are equally useful.

Individually tested SNPs are still occasionally available in some cases, but individual SNP testing has generally been eclipsed by the greatly more efficient enriched technology utilized with Big Y testing.

Think of SNP testing as walking up to a specific location and taking a look, while NGS scan technology is a drone flying over the entire region 30-50 times looking multiple times to be sure they see the more distant target accurately.

Multiple scans acquiring the same read in the same location, shown below in the Big Y browser tool by the pink mutations at the red arrow, confirm that NGS sequencing is quite reliable.

McNiel Big Y browser

These two types of tests, STR panels 12-111 and the SNP-based Big Y, are meant to be utilized in combination with each other.

STR markers tend to mutate faster and are less reliable, experiencing frustrating back mutations. SNPs very rarely experience this level of instability. Some regions of the Y chromosome are messier or more complicated than others, causing problems with interpreting reads reliably.

For purposes of clarity, the string of pink A reads above is “not messy,” and “A” is very clearly a mutation because all ~39 scanned reads report the same value of “A,” and according to the legend, all of those scans are high quality. Multiple combined reads of A and G, for example, in the same location, would be tough to call accurately and would be considered unreliable.

You can see examples of a few scattered pink misreads, above.

The two different kinds of tests produce results for overlapping timeframes – with STR mutations generally sifting through closer relationships and SNPs reaching back further in time.

Many more men have taken the Y DNA STR tests over the last 20 years. The Big Y tests have only been available for the past handful of years.

STR testing produces the following matches for my McNiel cousin:

STR Level STR Matches STR Matches Who Took the Big Y % STR Who Took Big Y STR Matches Who Also Match on the Big Y
12 5988 796 13 52
25 6660 725 11 57
37 878 94 11 12
67 1225 252 21 23
111 4 2 50 1

Typically, one would expect that all STR matches that took the Big Y would match on the Big Y, since STR results suggest relationships closer in time, but that’s not the case.

  • Many STR testers who have taken the Big Y seem to be just slightly too distant to be considered a Big Y match using SNPs, which flies in the face of conventional wisdom.
  • However, this could easily be a function of the fact that STRs mutate both backward and forwards and may have simply “happened” to have mutated to a common value – which suggests a closer relationship than actually exists.
  • It could also be that the SNP matching threshold needs to be raised since the enhanced and enriched Big Y-700 technology now finds more mutations than the older Big Y-500. I would like to see SNP matching expanded to 40 from 30 because it seems that clan connections may be being missed. Thirty may have been a great threshold before the more sensitive Big Y-700 test revealed more mutations, which means that people hit that 30 threshold before they did with previous tests.
  • Between the combination of STRs and SNPs mutating at the same time, some Big Y matches are pushed just out of range.

In a nutshell, the correlation I expected to find in terms of matching between STR and Big Y testing is not what I found. Let’s take a look at what we discovered.

It’s worth noting that the analysis is easier if you are working together with at least your closest matches or have access via projects to at least some of their results. You can see common STR values to 111 in projects, such as surname projects. Project administrators can view more if project members have allowed access.

Unexpected Discoveries and Gotchas

While I did expect STR matches to also match on the Big Y, I don’t expect the Big Y matches to necessarily match on the STR tests. After all, the Big Y is testing for more deep-rooted history.

Only one of the McNiel Big Y matches also matches at all levels of STR testing. That’s not surprising since Big Y matching reaches further back in time than STR testing, and indeed, not all STR testers have taken a Big Y test.

Of my McNeill cousin’s closest Big Y matches, we find the following relative to STR matching.

Surname Ancestral Location Big Y Variant/SNP Difference STR Match Level
Scott 1565 in Buccleuch, Selkirkshire, Scotland 20 12, 25, 37, 67
McCollum Not listed 21 67 only
Glass 1618 in Banbridge, County Down, Ireland 23 12, 25, 67
McMichael 1720 County Antrim, Ireland 28 67 only
Murphy Not listed 29 12, 25, 37, 67
Campbell Scotland 30 12, 25, 37, 67, 111

It’s ironic that the man who matches on all STR levels has the most variants, 30 – so many that with 1 more, he would not have been considered a Big Y match at all.

Only the Campbell man matches on all STR panels. Unfortunately, this Campbell male does not match the Clan Campbell line, so that momentary clan connection theory is immediately put to rest.

Block Tree Matches – What They Do, and Don’t, Mean

Note that a Carnes male, the other person who matches my McNeill cousin at 111 STR markers and has taken a Big Y test does not match at the Big Y level. His haplogroup BY69003 is located several branches up the tree, with our common ancestor, R-S588, having lived about 2000 years ago. Interestingly, we do match other R-S588 men.

This is an example where the total number of SNP mutations is greater than 30 for these 2 men (McNeill and Carnes), but not for my McNeill cousin compared with other men on the same S588 branch.

McNiel Big Y BY69003

By searching for Carnes on the block tree, I can view my cousin’s match to Mr. Carnes, even though they don’t match on the Big Y. STR matches who have taken the Big Y test, even if they don’t match at the Big Y level, are shown on the Block Tree on their branch.

By clicking on the haplogroup name, R-BY69003, above, I can then see three categories of information about the matches at that haplogroup level, below.

McNiel Big Y STR differences

click to enlarge

By selecting “Matches,” I can see results under the column, “Big Y.” This does NOT mean that the tester matches either Mr. Carnes or Mr. Riker on the Big Y, but is telling me that there are 14 differences out of 615 STR markers above 111 markers for Mr. Carnes, and 8 of 389 for Mr. Riker.

In other words, this Big Y column is providing STR information, not indicating a Big Y match. You can’t tell one way or another if someone shown on the Block Tree is shown there because they are a Big Y match or because they are an STR match that shares the same haplogroup.

As a cautionary note, your STR matches that have taken the Big Y ARE shown on the block tree, which is a good thing. Just don’t assume that means they are Big Y matches.

The 30 SNP threshold precludes some matches.

My research indicates that the people who match on STRs and carry the same haplogroup, but don’t match at the Big Y level, are every bit as relevant as those who do match on the Big Y.

McNIel Big Y block tree menu

If you’re not vigilant when viewing the block tree, you’ll make the assumption that you match all of the people showing on the Block Tree on the Big Y test since Block Tree appears under the Big Y tools. You have to check Big Y matches specifically to see if you match people shown on the Block Tree. You don’t necessarily match all of them on the Big Y test, and vice versa, of course.

You match Block Tree inhabitants either:

  • On the Big Y, but not the STR panels
  • On the Big Y AND at least one level of STRs between 12 and 111, inclusive
  • On STRs to someone who has taken the Big Y test, but whom you do not match on the Big Y test

Big Y-500 or Big Y-700?

McNiel Big Y STR differences

click to enlarge

Looking at the number of STR markers on the matches page of the Block Tree for BY69003, above, or on the STR Matches page is the only way to determine whether or not your match took the Big Y-700 or the Big Y-500 test.

If you add 111 to the Big Y SNP number of 615 for Mr. Carnes, the total equals 726, which is more than 700, so you know he took the Big Y-700.

If you add 111 to 389 for Mr. Riker, you get 500, which is less than 700, so you know that he took the Big Y-500 and not the Big Y-700.

There are still a very small number of men in the database who did not upgrade to 111 when they ordered their original Big Y test, but generally, this calculation methodology will work. Today, all Big Y tests are upgraded to 111 markers if they have not already tested at that level.

Why does Big Y-500 vs Big Y-700 matter? The enriched chemistry behind the testing technology improved significantly with the Big Y-700 test, enhancing Y-DNA results. I was an avowed skeptic until I saw the results myself after upgrading men in the Estes DNA project. In other words, if Big Y-500 testers upgrade, they will probably have more SNPs in common.

You may want to contact your closest Big Y-500 matches and ask if they will consider upgrading to the Big Y-700 test. For example, if we had close McNiel or similar surname matches, I would do exactly that.

Matching Both the Big Y and STRs – No Single Source

There is no single place or option to view whether or not you match someone BOTH on the Big Y AND STR markers. You can see both match categories individually, of course, but not together.

You can determine if your STR matches took the Big Y, below, and their haplogroup, which is quite useful, but you can’t tell if you match them at the Big Y level on this page.

McNiel Big Y STR match Big Y

click to enlarge

Selecting “Display Only Matches With Big Y” means displaying matches to men who took the Big Y test, not necessarily men you match on the Big Y. Mr. Conley, in the example above, does not match my McNeill cousin on the Big Y but does match him at 12 and 25 STR markers.

I hope FTDNA will add three display options:

  • Select only men that match on the Big Y in the STR panel
  • Add an option for Big Y on the advanced matches page
  • Indicate men who also match on STRs on the Big Y match page

It was cumbersome and frustrating to have to view all of the matches multiple times to compile various pieces of information in a separate spreadsheet.

No Big Y Match Download

There is also no option to download your Big Y matches. With a few matches, this doesn’t matter, but with 119 matches, or more, it does. As more people test, everyone will have more matches. That’s what we all want!

What you can do, however, is to download your STR matches from your match page at levels 12-111 individually, then combine them into one spreadsheet. (It would be nice to be able to download them all at once.)

McNiel Big Y csv

You can then add your Big Y matches manually to the STR spreadsheet, or you can simply create a separate Big Y spreadsheet. That’s what I chose to do after downloading my cousin’s 14,737 rows of STR matches. I told you that R-M222 was prolific! I wasn’t kidding.

This high number of STR matches also perfectly illustrates why the Big Y SNP results were so critical in establishing the backbone relationship structure. Using the two tools together is indispensable.

An additional benefit to downloading STR results is that you can sort the STR spreadsheet columns in surname order. This facilitates easily spotting all spelling variations of McNiel, including words like Niel, Neal and such that might be relevant but that you might not notice otherwise.

Creating a Big Y Spreadsheet

My McNiel cousin has 119 Big Y-700 matches.

I built a spreadsheet with the following columns facilitating sorting in a number of ways, with definitions as follows:

McNiel Big Y spreadsheet

click to enlarge

  • First Name
  • Last Name – You will want to search matches on your personal page at Family Tree DNA by this surname later, so be sure if there is a hyphenated name to enter it completely.
  • Haplogroup – You’ll want to sort by this field.
  • Convergent – A field you’ll complete when doing your analysis. Convergence is the common haplogroup in the tree shared by you and your match. In the case of the green matches above, which are color-coded on my spreadsheet to indicate the closest matches with my McNiel cousin, the convergent haplogroup is BY18350.
  • Common Tree Gen – This column is the generations on the Block Tree shown to this common haplogroup. In the example above, it’s between 9 and 14 SNP generations. I’ll show you where to gather this information.
  • Geographic Location – Can be garnered from 4 sources. No color in that cell indicates that this information came from the Earliest Known Ancestor (EKA) field in the STR matches. Blue indicates that I opened the tree and pulled the location information from that source. Orange means that someone else by the same surname whom the tester also Y DNA matches shows this location. I am very cautious when assigning orange, and it’s risky because it may not be accurate. A fourth source is to use Ancestry, MyHeritage, or another genealogical resource to identify a location if an individual provides genealogical information but no location in the EKA field. Utilizing genealogy databases is only possible if enough information is provided to make a unique identification. John Smith 1700-1750 won’t do it, but Seamus McDougal (1750-1810) married to Nelly Anderson might just work.
  • STR Match – Tells me if the Big Y match also matches on STR markers, and if so, which ones. Only the first 111 markers are used for matching. No STR match generally means the match is further back in time, but there are no hard and fast rules.
  • Big Y Match – My original goal was to combine this information with the STR match spreadsheet. If you don’t wish to combine the two, then you don’t need this column.
  • Tree – An easy way for me to keep track of which matches do and do not have a tree. Please upload or create a tree.

You can also add a spreadsheet column for comments or contact information.

McNiel Big Y profile

You will also want to click your match’s name to display their profile card, paying particular attention to the “About Me” information where people sometimes enter genealogical information. Also, scan the Ancestral Surnames where the match may enter a location for a specific surname.

Private Variants

I added additional spreadsheet columns, not shown above, for Private Variant analysis. That level of analysis is beyond what most people are interested in doing, so I’m only briefly discussing this aspect. You may want to read along, so you at least understand what you are looking at.

Clicking on Private Variants in your Big Y Results shows your variants, or mutations, that are unnamed as SNPs. When they are named, they become SNPs and are placed on the haplotree.

The reference or “normal” state for the DNA allele at that location is shown as the “Reference,” and “Genotype” is the result of the tester. Reference results are not shown for each tester, because the majority are the same. Only mutations are shown.

McNiel Big Y private variants

There are 5 Private Variants, total, for my cousin. I’ve obscured the actual variant numbers and instead typed in 111111 and 222222 for the first two as examples.

McNiel Big Y nonmatching variants

In our example, there are 6 Big Y matches, with matches one and five having the non-matching variants shown above.

Non-matching variants mean that the match, Mr. Scott, in example 1, does NOT match the tester (my cousin) on those variants.

  • If the tester (you) has no mutation, you won’t have a Private Variant shown on your Private Variant page.
  • If the tester does have a Private Variant shown, and that variant shows ON their matches list of non-matching variants, it means the match does NOT match the tester, and either has the normal reference value or a different mutation. Explained another way, if you have a mutation, and that variant is listed on your match list of Non-Matching Variants, your match does NOT match you and does NOT have the same mutation.
  • If the match does NOT have the Private Variant on their list, that means the match DOES match the tester, and they both have the same mutation, making this Private Variant a candidate to be named as a new SNP.
  • If you don’t have a Private Variant listed, but it shows in the Non-Matching Variants of your match, that means you have the reference or normal value, and they have a mutation.

In example #1, above, the tester has a mutation at variant 111111, and 111111 is shown as a Non-Matching Variant to Mr. Scott, so Mr. Scott does NOT match the tester. Mr. Scott also does NOT match the tester at locations 222222 and 444444.

In example #5, 111111 is NOT shown on the Non-Matching Variant list, so Mr. Treacy DOES match the tester.

I have a terrible time wrapping my head around the double negatives, so it’s critical that I make charts.

On the chart below, I’ve listed the tester’s private variants in an individual column each, so 111111, 222222, etc.

For each match, I’ve copy and pasted their Non-Matching Variants in a column to the right of the tester’s variants, in the lavender region. In this example, I’ve typed the example variants into separate columns for each tester so you can see the difference. Remember, a non-matching variant means they do NOT match the tester’s mutation.

McNiel private variants spreadsheet

On my normal spreadsheet where the non-matching variants don’t have individuals columns, I then search for the first variant, 111111. If the variant does appear in the list, it means that match #1 does NOT have the mutation, so I DON’T put an X in the box for match #1 under 111111.

In the example above, the only match that does NOT have 111111 on their list of Non-Matching Variants is #5, so an X IS placed in that corresponding cell. I’ve highlighted that column in yellow to indicate this is a candidate for a new SNP.

You can see that no one else has the variant, 222222, so it truly is totally private. It’s not highlighted in yellow because it’s not a candidate to be a new SNP.

Everyone shares mutation 333333, so it’s a great candidate to become a new SNP, as is 555555.

Match #6 shares the mutation at 444444, but no one else does.

This is a manual illustration of an automated process that occurs at Family Tree DNA. After Big Y matches are returned, automated software creates private variant lists of potential new haplogroups that are then reviewed internally where SNPs are evaluated, named, and placed on the tree if appropriate.

If you follow this process and discover matches, you probably don’t need to do anything, as the automated review process will likely catch up within a few days to weeks.

Big Y Matches

In the case of the McNiel line, it was exciting to discover several private variants, mutations that were not yet named SNPs, found in several matches that were candidates to be named as SNPs and placed on the Y haplotree.

Sure enough, a few days later, my McNeill cousin had a new haplogroup assignment.

Most people have at least one Private Variant, locations in which they do NOT match another tester. When several people have these same mutations, and they are high-quality reads, the Private Variant qualifies to be added to the haplotree as a SNP, a task performed at FamilyTreeDNA by Michael Sager.

If you ever have the opportunity to hear Michael speak, please do so. You can watch Michael’s presentation at Genetic Genealogy Ireland (GGI) titled “The Tree of Mankind,” on YouTube, here, compliments of Maurice Gleeson who coordinates GGI. Maurice has also written about the Gleeson Y DNA project analysis, here.

As a result of Cousin McNeill’s test, six new SNPs have been added to the Y haplotree, the tree of mankind. You can see our new haplogroup for our branch, BY18332, with an equivalent SNP, BY25938, along with three sibling branches to the left and right on the tree.

McNiel Big Y block tree 4 branch

Big Y testing not only answers genealogical questions, it advances science by building out the tree of mankind too.

The surname of the men who share the same haplogroup, R-BY18332, meaning the named SNP furthest down the tree, are McCollum and Campbell. Not what I expected. I expected to find a McNeil who does match on at least some STR markers. This is exactly why the Big Y is so critical to define the tree structure, then use STR matches to flesh it out.

Taking the Big Y-700 test provided granularity between 6 matches, shown above, who were all initially assigned to the same branch of the tree, BY18350, but were subsequently divided into 4 separate branches. My McNiel cousin is no longer equally as distant from all 6 men. We now know that our McNiel line is genetically closer on the Y chromosome to Campbell and McCollum and further distant from Murphy, Scott, McMichael, and Glass.

Not All SNP Matches are STR Matches

Not all SNP matches are also STR matches. Some relationships are too far back in time. However, in this case, while each person on the BY18350 branches matches at some STR level, only the Campbell individual matches at all STR levels.

Remember that variants (mutations) are accumulating down both respective branches of the tree at the same time, meaning one per roughly every 100 years (if 100 is the average number we want to use) for both testers. A total of 30 variants or mutations difference, an average of 15 on each branch of the tree (McNiel and their match) would suggest a common ancestor about 1500 years ago, so each Big Y match should have a common ancestor 1500 years ago or closer. At least on average, in theory.

The Big Y test match threshold is 30 variants, so if there were any more mismatches with the Campbell male, they would not have been a Big Y match, even though they have the exact same haplogroup.

Having the same haplogroup means that their terminal SNP is identical, the SNP furthest down the tree today, at least until someone matches one of them on their Private Variants (if any remain unnamed) and a new terminal SNP is assigned to one or both of them.

Mutations, and when they happen, are truly a roll of the dice. This is why viewing all of your Big Y Block Tree matches is critical, even if they don’t show on your Big Y match list. One more variant and Campbell would have not been shown as a match, yet he is actually quite close, on the same branch, and matches on all STR panels as well.

SNPs Establish the Backbone Structure

I always view the block tree first to provide a branching tree structure, then incorporate STR matches into the equation. Both can equally as important to genealogy, but haplogroup assignment is the most accurate tool, regardless of whether the two individuals match on the Big Y test, especially if the haplogroups are relatively close.

Let’s work with the Block Tree.

The Block Tree

McNIel Big Y block tree menu

Clicking on the link to the Block Tree in the Big Y results immediately displays the tester’s branch on the tree, below.

click to enlarge

On the left side are SNP generation markers. Keep in mind that approximate SNP generations are marked every 5 generations. The most recent generations are based on the number of private variants that have not yet been assigned as branches on the tree. It’s possible that when they are assigned that they will be placed upstream someplace, meaning that placement will reduce the number of early branches and perhaps increase the number of older branches.

The common haplogroup of all of the branches shown here with the upper red arrow is R-BY3344, about 15 SNP generations ago. If you’re using 100 years per SNP generation, that’s about 1500 years. If you’re using 80 years, then 1200 years ago. Some people use even fewer years for calculations.

If some of the private variants in the closer branches disappear, then the common ancestral branch may shift to closer in time.

This tree will always be approximate because some branches can never be detected. They have disappeared entirely over time when no males exist to reproduce.

Conversely, subclades have been born since a common ancestor clade whose descendants haven’t yet tested. As more people test, more clades will be discovered.

Therefore, most recent common ancestor (MRCA) haplogroup ages can only be estimated, based on who has tested and what we know today. The tree branches also vary depending on whether testers have taken the Big Y-500 or the more sensitive Big Y-700, which detects more variants. The Y haplotree is a combination of both.

Big Y-500 results will not be as granular and potentially do not position test-takers as far down the tree as Big Y-700 results would if they upgraded. You’ll need to factor that into your analysis if you’re drawing genealogical conclusions based on these results, especially close results.

You’ll note that the direct path of descent is shown above with arrows from BY3344 through the first blue box with 5 equivalent SNPS, to the next white box, our branch, with two equivalent SNPs. Our McNeil ancestor, the McCollum tester, and the Campell tester have no unresolved private variants between them, which suggests they are probably closer in time than 10 generations back. You can see that the SNP generations are pushed “up” by the neighbor variants.

Because of the fact that private variants don’t occur on a clock cycle and occur in individual lines at an unsteady rate, we must use averages.

That means that when we look further “up” the tree, clicking generation by generation on the up arrow above BY3344, the SNP generations on the left side “adjust” based on what is beneath, and unseen at that level.

The Block Tree Adjusts

Note, in the example above, BY3344 is at SNP generation 15.

Next, I clicked one generation upstream, to R-S668.

McNiel Big Y block tree S668

click to enlarge

You can see that S668 is about 21 SNP generations upstream, and now BY3344 is listed as 20 generations, not 15. You can see our branch, BY3344, but you can no longer see subclades or our matches below that branch in this view.

You can, however, see two matches that descend through S668, brother branches to BY3344, red arrows at far right.

Clicking on the up arrow one more time shows us haplogroup S673, below, and the child branches. The three child branches on which the tester has matches are shown with red arrows.

McNiel Big Y S673

click to enlarge

You’ll immediately notice that now S668 is shown at 19 SNP generations, not 20, and S673 is shown at 20. This SNP generation difference between views is a function of dealing with aggregated and averaged private variants on combined lines and causes the SNP generations to shift. This is also why I always say “about.”

As you continue to click up the tree, the shifting SNP generations continue, reminding us that we can’t truly see back in time. We can only achieve approximations, but those approximations improve as more people test, and more SNPs are named and placed in their proper places on the phylotree.

I love the Block Tree, although I wish I could see further side-to-side, allowing me to view all of the matches on one expanded tree so I can easily see their relationships to the tester, and each other.

Countries and Origins

In addition to displaying shared averaged autosomal origins of testers on a particular branch, if they have taken the Family Finder test and opted-in to sharing origins (ethnicity) results, you can also view the countries indicated by testers on that branch along with downstream branches of the tree.

McNiel Big Y countries

click to enlarge

For example, the Countries tab for S673 is shown above. I can see matches on this branch with no downstream haplogroup currently assigned, as well as cumulative results from downstream branches.

Still, I need to be able to view this information in a more linear format.

The Block Tree and spreadsheet information beautifully augment the haplotree, so let’s take a look.

The Haplotree

On your Y DNA results page, click on the “Haplotree and SNPs” link.

McNIel Big Y haplotree menu

click to enlarge

The Y haplotree will be displayed in pedigree style, quite familiar to genealogists. The SNP legend will be shown at the top of the display. In some cases, “presumed positive” results occur where coverage is lacking, back mutations or read errors are encountered. Presumed positive is based on positive SNPs further down the tree. In other words, that yellow SNP below must read positive or downstream ones wouldn’t.

McNIel Big Y pedigree descent

click to enlarge

The tester’s branch is shown with the grey bar. To the right of the haplogroup-defining SNP are listed the branch and equivalent SNP names. At far right, we see the total equivalent SNPs along with three dots that display the Country Report. I wish the haplotree also showed my matches, or at least my matching surnames, allowing me to click through. It doesn’t, so I have to return to the Big Y page or STR Matches page, or both.

I’ve starred each branch through which my McNiell cousin descends. Sibling branches are shown in grey. As you’ll recall from the Block Tree, we do have matches on those sibling branches, shown side by side with our branch.

The small numbers to the right of the haplogroup names indicate the number of downstream branches. BY18350 has three, all displayed. But looking upstream a bit, we see that DF97 has 135 downstream branches. We also have matches on several of those branches. To show those branches, simply click on the haplogroup.

The challenge for me, with 119 McNeill matches, is that I want to see a combination of the block tree, my spreadsheet information, and the haplotree. The block tree shows the names, my spreadsheet tells me on which branches to look for those matches. Many aren’t easily visible on the block tree because they are downstream on sibling branches.

Here’s where you can find and view different pieces of information.

Data and Sources STR Matches Page Big Y Matches Page Block Tree Haplogroups & SNPs Page
STR matches Yes No, but would like to see who matches at which STR levels If they have taken Big Y test, but doesn’t mean they match on Big Y matching No
SNP matches *1 Shows if STR match has common haplogroup, but not if tester matches on Big Y No, but would like to see who matches at which STR level Big Y matches and STR matches that aren’t Big Y matches are both shown No, but need this feature – see combined haplotree/ block tree
Other Haplogroup Branch Residents Yes, both estimated and tested No, use block tree or click through to profile card, would like to see haplogroup listed for Big Y matches Yes, both Big Y and STR tested, not estimated. Cannot tell if person is Big Y match or STR match, or both. No individuals, but would like that as part of countries report, see combined haplotree/block tree
Fully Expanded Phylotree No No Would like ability to see all branches with whom any Big Y or STR match resides at one time, even if it requires scrolling Yes, but no match information. Matches report could be added like on Block Tree.
Averaged Ethnicities if Have FF Test No No Yes, by haplogroup branch No
Countries Matches map STR only No, need Big Y matches map Yes Yes
Earliest Known Ancestor Yes No, but can click through to profile card No No
Customer Trees Yes No, need this link No No
Profile Card Yes, click through Yes, click through Yes, click through No match info on this page
Downloadable data By STR panel only, would like complete download with 1 click, also if Big Y or FF match Not available at all No No
Path to common haplogroup No No, but would like to see matches haplogroup and convergent haplogroup displayed No, would like the path to convergent haplogroup displayed as an option No, see combined match-block -haplotree in next section

*1 – the best way to see the haplogroup of a Big Y match is to click on their name to view their profile card since haplogroup is not displayed on the Big Y match page. If you happen to also match on STRs, their haplogroup is shown there as well. You can also search for their name using the block tree search function to view their haplogroup.

Necessity being the mother of invention, I created a combined match/block tree/haplotree.

And I really, REALLY hope Family Tree DNA implements something like this because, trust me, this was NOT fun! However, now that it’s done, it is extremely useful. With fewer matches, it should be a breeze.

Here are the steps to create the combined reference tree.

Combo Match/Block/Haplotree

I used Snagit to grab screenshots of the various portions of the haplotree and typed the surnames of the matches in the location of our common convergent haplogroup, taken from the spreadsheet. I also added the SNP generations in red for that haplogroup, at far left, to get some idea of when that common ancestor occurred.

McNIel Big Y combo tree

click to enlarge

This is, in essence, the end-goal of this exercise. There are a few steps to gather data.

Following the path of two matches (the tester and a specific match) you can find their common haplogroup. If your match is shown on the block tree in the same view with your branch, it’s easy to see your common convergent parent haplogroup. If you can’t see the common haplogroup, it’s takes a few extra steps by clicking up the block tree, as illustrated in an earlier section.

We need the ability to click on a match and have a tree display showing both paths to the common haplogroup.

McNiel Big Y convergent

I simulated this functionality in a spreadsheet with my McNiel cousin, a Riley match, and an Ocain match whose terminal SNP is the convergent SNP (M222) between Riley and McNiel. Of course, I’d also like to be able to click to see everyone on one chart on their appropriate branches.

Combining this information onto the haplotree, in the first image, below, M222, 4 men match my McNeill cousin – 2 who show M222 as their terminal SNP, and 2 downstream of M222 on a divergent branch that isn’t our direct branch. In other words, M222 is the convergence point for all 4 men plus my McNeill cousin.

McNiel Big Y M222 haplotree

click to enlarge

In the graphic below, you can see that M222 has a very large number of equivalent SNPs, which will likely become downstream haplogroups at some point in the future. However, today, these equivalent SNPs push M222 from 25 generations to 59. We’ll discuss how this meshes with known history in a minute.

McNiel Big Y M222 block tree

click to enlarge

Two men, Ocain and Ransom, who have both taken the Big Y, whose terminal SNP is M222, match my McNiel cousin. If their common ancestor was actually 59 generations in the past, it’s very, very unlikely that they would match at all given the 30 mutation threshold.

On my reconstructed Match/Block/Haplotree, I included the estimated SNP generations as well. We are starting with the most distant haplogroups and working our way forward in time with the graphics, below.

Make no mistake, there are thousands more men who descend from M222 that have tested, but all of those men except 4 have more than 30 mutations total, so they are not shown as Big Y matches, and they are not shown individually on the Block Tree because they neither match on the Big Y or STR tests. However, there is a way to view information for non-matching men who test positive for M222.

McNiel Big Y M222 countries

click to enlarge

Looking at the Block Tree for M222, many STR match men took a SNP test only to confirm M222, so they would be shown positive for the M222 SNP on STR results and, therefore, in the detailed view of M222 on the Block tree.

Haplogroup information about men who took the M222 test and whom the tester doesn’t match at all are shown here as well in the country and branch totals for R-M222. Their names aren’t displayed because they don’t match the tester on either type of Y DNA test.

Back to constructing my combined tree, I’ve left S658 in both images, above and below, as an overlap placeholder, as we move further down, or towards current, on the haplotree.

McNiel Big Y combo tree center

click to enlarge

Note that BY18350, above, is also an overlap connecting below.

You’ll recall that as a result of the Big Y test, BY18350 was split and now has three child branches plus one person whose terminal SNP is BY18350. All of the men shown below were on one branch until Big Y results revealed that BY18350 needed to be split, with multiple new haplogroups added to the tree.

McNiel Big Y combo tree current

click to enlarge

Using this combination of tools, it’s straightforward for me to see now that our McNiel line is closest to the Campbell tester from Scotland according to the Big Y test + STRs.

Equal according to the Big Y test, but slightly more distant, according to STR matching, is McCollum. The next closest would be sibling branches. Then in the parent group of the other three, BY18350, we find Glass from Scotland.

In BY18350 and subgroups, we find several Scotland locations and one Northern Ireland, which was likely from Scotland initially, given the surname and Ulster Plantation era.

The next upstream parent haplogroup is BY3344, which looks to be weighted towards ancestors from Scotland, shown on the country card, below.

McNiel Big Y BY3344

click to enlarge

This suggests that the origins of the McNiel line was, perhaps, in Scotland, but it doesn’t tell us whether or not George and presumably, Thomas, immigrated from Ireland or Scotland.

This combined tree, with SNPs, surnames from Big Y matches, along with Country information, allows me to see who is really more closely related and who is further away.

What I didn’t do, and probably should, is to add in all of the STR matches who have taken the Big Y test, shown on their convergent branch – but that’s just beyond the scope of time I’m willing to invest, at least for now, given that hundreds of STR matches have taken the Big Y test, and the work of building the combined tree is all manual today.

For those reading this article without access to the Y phylogenetic tree, there’s a public version of the Y and mitochondrial phylotrees available, here.

What About Those McNiels?

No other known McNiel descendants from either Thomas or George have taken the Big Y test, so I didn’t expect any to match, but I am interested in other men by similar surnames. Does ANY other McNiel have a Big Y match?

As it turns out, there are two, plus one STR match who took a Big Y test, but is not a Big Y match.

However, as you can see on the combined match/block/haplotree, above, the closest other Big Y-matching McNeil male is found at about 19 SNP generations, or roughly 1900 years ago. Even if you remove some of the variants in the lower generations that are based on an average number of individual variants, you’re still about 1200 years in the past. It’s extremely doubtful that any surname would survive in both lines from the year 800 or so.

That McNeil tester’s ancestor was born in 1747 in Tranent, Scotland.

The second Big Y-matching person is an O’Neil, a few branches further up in the tree.

The convergent SNP of the two branches, meaning O’Neil and McNeill are at approximately the 21 generation level. The O’Neil man’s Neill ancestor is found in 1843 in Cookestown, County Tyrone, Ireland.

McNiel Big Y convergent McNeil lines

I created a spreadsheet showing convergent lines:

  • The McNeill man with haplogroup A4697 (ancestor Tranent, Scotland) is clearly closest genetically.
  • O’Neill BY91591, who is brother clades with Neel and Neal, all Irish, is another Big Y match.
  • The McNeill man with haplogroup FT91182 is an STR match, but not a Big Y match.

The convergent haplogroup of all of these men is DF105 at about the 22 SNP generation marker.

STRs

Let’s turn back to STR tests, with results that produce matches closer in time.

Searching my STR download spreadsheet for similar surnames, I discovered several surname matches, mining the Earliest Known Ancestor information, profiles and trees produced data as follows:

Ancestor STR Match Level Location
George Charles Neil 12, 25, match on Big Y A4697 1747-1814 Tranent, Scotland
Hugh McNeil 25 (tested at 67) Born 1800 Country Antrim, Northern Ireland
Duncan McNeill 12 (tested at 111) Married 1789, Argyllshire, Scotland
William McNeill 12, 25 (tested at 37) Blackbraes, Stirlingshire, Scotland
William McNiel 25 (tested at 67) Born 1832 Scotland
Patrick McNiel 25 (tested at 111) Trien East, County Roscommon, Ireland
Daniel McNeill 25 (tested at 67) Born 1764 Londonderry, Northern Ireland
McNeil 12 (tested at 67) 1800 Ireland
McNeill (2 matches) 25 (tested Big Y-  SNP FT91182) 1810, Antrim, Northern Ireland
Neal 25 – (tested Big Y, SNP BY146184) Antrim, Northern Ireland
Neel (2 matches) 67 (tested at 111, and Big Y) 1750 Ireland, Northern Ireland

Our best clue that includes a Big Y and STR match is a descendant of George Charles Neil born in Tranent, Scotland, in 1747.

Perhaps our second-best clue comes in the form of a 111 marker match to a descendant of one Thomas McNeil who appears in records as early as 1753 and died in 1761 In Rombout Precinct, Dutchess County, NY where his son John was born. This line and another match at a lower level both reportedly track back to early New Hampshire in the 1600s.

The MacNeil DNA Project tells us the following:

Participant 106370 descends from Isaiah McNeil b. 14 May 1786 Schaghticoke, Rensselaer Co. NY and d. 28 Aug 1855 Poughkeepsie, Dutchess Co., NY, who married Alida VanSchoonhoven.

Isaiah’s parents were John McNeal, baptized 21 Jun 1761 Rombout, Dutchess Co., NY, d. 15 Feb 1820 Stillwater, Saratoga Co., NY and Helena Van De Bogart.

John’s parents were Thomas McNeal, b.c. 1725, d. 14 Aug 1761 NY and Rachel Haff.

Thomas’s parents were John McNeal Jr., b. around 1700, d. 1762 Wallkill, Orange Co., NY (now Ulster Co. formed 1683) and Martha Borland.

John’s parents were John McNeal Sr. and ? From. It appears that John Sr. and his family were this participant’s first generation of Americans.

Searching this line on Ancestry, I discovered additional information that, if accurate, may be relevant. This lineage, if correct, and it may not be, possibly reaching back to Edinburgh, Scotland. While the information gathered from Ancestry trees is certainly not compelling in and of itself, it provides a place to begin research.

Unfortunately, based on matches shown on the MacNeil DNA Project public page, STR marker mutations for kits 30279, B78471 and 417040 when compared to others don’t aid in clustering or indicating which men might be related to this group more closely than others using line-marker mutations.

Matches Map

Let’s take a look at what the STR Matches Map tells us.

McNiel Big Y matches map menu

This 67 marker Matches Map shows the locations of the earliest known ancestors of STR matches who have entered location information.

McNiel Big Y matches mapMcNiel Big Y matches map legend

My McNeill cousin’s closest matches are scattered with no clear cluster pattern.

Unfortunately, there is no corresponding map for Big Y matches.

SNP Map

The SNP map provided under the Y DNA results allows testers to view the locations where specific haplogroups are found.

McNiel Big Y SNP map

The SNP map marks an area where at least two or more people have claimed their most distant known ancestor to be. The cluster size is the maximum amount of miles between people that is allowed in order for a marker indicating a cluster at a location to appear. So for example, the sample size is at least 2 people who have tested, and listed their most distant known ancestor, the cluster is the radius those two people can be found in. So, if you have 10 red dots, that means in 1000 miles there are 10 clusters of at least two people for that particular SNP. Note that these locations do NOT include people who have tested positive for downstream locations, although it does include people who have taken individual SNP tests.

Working my way from the McNiel haplogroup backward in time on the SNP map, neither BY18332 nor BY18350 have enough people who’ve tested, or they didn’t provide a location.

Moving to the next haplogroup up the tree, two clusters are formed for BY3344, shown below.

McNIel Big Y BY3344 map

S668, below.

McNiel Big Y S668 map

It’s interesting that one cluster includes Glasgow.

S673, below.

McNiel Big Y S673 map

DF85, below:

McNiel Big Y DF85 map

DF105 below:

McNiel BIg Y DF105 map

M222, below:

McNiel Big Y M222 map

For R-M222, I’ve cropped the locations beyond Ireland and Scotland. Clearly, RM222 is the most prevalent in Ireland, followed by Scotland. Wherever M222 originated, it has saturated Ireland and spread widely in Scotland as well.

R-M222

R-M222, the SNP initially thought to indicate Niall of the 9 Hostages, occurred roughly 25-59 SNP generations in the past. If this age is even remotely accurate, averaging by 80 years per generation often utilized for Big Y results, produces an age of 2000 – 4720 years. I find it extremely difficult to believe any semblance of a surname survived that long. Even if you reduce the time in the past to the historical narrative, roughly the year 400, 1600 years, I still have a difficult time believing the McNiel surname is a result of being a descendant of Niall of the 9 Hostages directly, although oral history does have staying power, especially in a clan setting where clan membership confers an advantage.

Surname or not, clearly, our line along with the others whom we match on the Big Y do descend from a prolific common ancestor. It’s very unlikely that the mutation occurred in Niall’s generation, and much more likely that other men carried M222 and shared a common ancestor with Niall at some point in the distant past.

McNiel Conclusion – Is There One?

If I had two McNiel wishes, they would be:

  • Finding records someplace in Virginia that connect George and presumably brothers Thomas and John to their parents.
  • A McNiel male from wherever our McNiel line originated becoming inspired to Y DNA test. Finding a male from the homeland might point the way to records in which I could potentially find baptismal records for George about 1720 and Thomas about 1724, along with possibly John, if he existed.

I remain hopeful for a McNiel from Edinburgh, or perhaps Glasgow.

I feel reasonably confident that our line originated genetically in Scotland. That likely precludes Niall of the 9 Hostages as a direct ancestor, but perhaps not. Certainly, one of his descendants could have crossed the channel to Scotland. Or, perhaps, our common ancestor is further back in time. Based on the maps, it’s clear that M222 saturates Ireland and is found widely in Scotland as well.

A great deal depends on the actual age of M222 and where it originated. Certainly, Niall had ancestors too, and the Ui Neill dynasty reaches further back, genetically, than their recorded history in Ireland. Given the density of M222 and spread, it’s very likely that M222 did, in fact, originate in Ireland or, alternatively, very early in Scotland and proliferated in Ireland.

If the Ui Neill dynasty was represented in the persona of the High King, Niall of the 9 Hostages, 1600 years ago, his M222 ancestors were clearly inhabiting Ireland earlier.

We may not be descended from Niall personally, but we are assuredly related to him, sharing a common ancestor sometime back in the prehistory of Ireland and Scotland. That man would sire most of the Irish men today and clearly, many Scots as well.

Our ancestors, whoever they were, were indeed in Ireland millennia ago. R-M222, our ancestor, was the ancestor of the Ui Neill dynasty and of our own Reverend George McNiel.

Our ancestors may have been at Knowth and New Grange, and yes, perhaps even at Tara.

Tara Niall mound in sun

Someplace in the mists of history, one man made a different choice, perhaps paddling across the channel, never to return, resulting in M222 descendants being found in Scotland. His descendants include our McNeil ancestors, who still slumber someplace, awaiting discovery.

_____________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Products and Services

Genealogy Research

Big Y News and Stats + Sale

I must admit – this past January when FamilyTreeDNA announced the Big Y-700, an upgrade from the Big Y-500 product, I was skeptical. I wondered how much benefit testers would really see – but I was game to purchase a couple upgrades – and I did. Then, when the results came back, I purchased more!

I’m very pleased to announce that I’m no longer skeptical. I’m a believer.

The Big Y-700 has produced amazing results – and now FamilyTreeDNA has decoupled the price of the BAM file in addition to announcing substantial sale prices for their Thanksgiving Sale.

I’m going to discuss sale pricing for products other than the Big Y in a separate article because I’d like to focus on the progress that has been made on the phylogenetic tree (and in my own family history) as a result of the Big Y-700 this year.

Big Y Pricing Structure Change

FamilyTreeDNA recently anounced some product structure changes.

The Big Y-700 price has been permanently dropped by $100 by decoupling the BAM file download from the price of the test itself. This accomplishes multiple things:

  • The majority of testers don’t want or need the BAM file, so the price of the test has been dropped by $100 permanently in order to be able to price the Big Y-700 more attractively to encourage more testers. That’s good for all of us!!!
  • For people who ordered the Big Y-700 since November 1, 2019 (when the sale prices began) who do want the BAM file, they can purchase the BAM file separately through the “Add Ons and Upgrades” page, via the “Upgrades” tab for $100 after their test results are returned. There will also be a link on the Big Y-700 results page. The total net price for those testers is exactly the same, but it represents a $100 permanent price drop for everyone else.
  • This BAM file decoupling reduces the initial cost of the Big Y-700 test itself, and everyone still has the option of purchasing the BAM file later, which will make the Big Y-700 test more affordable. Additionally, it allows the tester who wants the BAM file to divide the purchase into two pieces, which will help as well.
  • The current sale price for the Big Y-700 for the tester who has taken NO PREVIOUS Y DNA testing is now just $399, formerly $649. That’s an amazing price drop, about 40%, in the 9 months since the Big Y-700 was introduced!
  • Upgrade pricing is available too, further down in this article.
  • If you order an upgrade from any earlier Big Y to the Big Y-700, you receive an upgraded BAM file because you already paid for the BAM file when you ordered your initial Big Y test.
  • The VCF file is still available for download at no additional cost with any Big Y test.
  • There is no change in the BAM file availability for current customers. Everyone who ordered before November 1, 2019 will be able to download their BAM file as always.

The above changes are permanent, except for the sale price.

2019 has been a Banner Year

I know how successful the Big Y-700 has been for kits and projects that I manage, but how successful has it been overall, in a scientific sense?

I asked FamilyTreeDNA for some stats about the number of SNPs discovered and the number of branches added to the Y phylotree.

Drum roll please…

Branches Added This Year Total Tree Branches Variants Added to Tree This Year Total Variants Added to Tree
2018 6,259 17,958 60,468 132.634
2019 4,394 22.352 32,193 164,827

The tests completed in 2019 are only representative for 10 months, through October, and not the entire year.

Haplotree Branches

Not every SNP discovered results in a new branch being added to the haplotree, but many do. This chart shows the number of actual branches added in 2018 and 2019 to date.

Big Y 700 haplotree branches.png

These stats, provided by FamilyTreeDNA, show the totals in the bottom row, which is a cumulative branch number total, not a monthly total. At the end of October 2019, the total number of individual branches were 22,352.

Big Y 700 haplotree branches small.png

This chart, above, shows some of the smaller haplogroups.

Big Y 700 haplotree branches large.png

This chart shows the larger haplogroups, including massive haplogroup R.

Haplotree Variants

The number of variants listed below is the number of SNPs that have been discovered, named and placed on the tree. You’ll notice that these numbers are a lot larger than the number of branches, above. That’s because roughly 168,000 of these are equivalent SNPs, meaning they don’t further branch the tree – at least not yet. These 168K variants are the candidates to be new branches as more people test and the tree can be further split.

Big Y 700 variants.png

These numbers also don’t include Private Variants, meaning SNPs that have not yet been named.

If you see Private Variants listed in your Big Y results, when enough people have tested positive for the same variant, and it makes sense, the variants will be given a SNP name and placed on the tree.

Big Y 700 variants small.png

The smaller haplogroups variants again, above, followed by the larger, below.

Big Y 700 variants large.png

Upgrades from the Big Y, or Big Y-500 to Big Y-700

Based on what I see in projects, roughly one third of the Big Y and Big Y-500 tests have upgraded to the Big Y-700.

For my Estes line, I wondered how much value the Big Y-700 upgrade would convey, if any, but I’m extremely glad I upgraded several kits. As a result of the Big Y-700, we’ve further divided the sons of Abraham, born in 1747. This granularity wasn’t accomplished by STR testing and wasn’t accomplished by the Big Y or Big Y-500 testing alone – although all of these together are building blocks. I’m ECSTATIC since it’s my own ancestral line that has the new lineage defining SNP.

Big Y 700 Estes.png

Every Estes man descended from Robert born in 1555 has R-BY482.

The sons of the immigrant, Abraham, through his father, Silvester, all have BY490, but the descendants of Silvester’s brother, Robert, do not.

Moses, son of Abraham has ZS3700, but the rest of Abraham’s sons don’t.

Then, someplace in the line of kit 831469, between Moses born in 1711 and the present-day tester, we find a new SNP, BY154784.

Big Y 700 Estes block tree.png

Looking at the block tree, we see the various SNPs that are entirely Estes, except for one gentleman who does not carry the Estes surname. I wrote about the Block Tree, here.

Without Big Y testing, none of these SNPs would have been found, meaning we could never have split these lines genealogically.

Every kit I’ve reviewed carries SNPs that the Big Y-700 has been able to discern that weren’t discovered previously.

Every. Single. One.

Now, even someone who hasn’t tested Y DNA before can get the whole enchilada – meaning 700+ STRs, testing for all previously discovered SNPs, and new branch defining SNPs, like my Estes men – for $399.

If a new Estes tester takes this test, without knowing anything about his genealogy, I can tell him a great deal about where to look for his lineage in the Estes tree.

Reduced Prices

FamilyTreeDNA has made purchasing the Big Y-700 outright, or upgrading, EXTREMELY attractive.

Test Price
Big Y-700 purchase with no previous Y DNA test

 

$399
Y-12 upgrade to Big Y-700 $359
Y-25 upgrade to Big Y-700 $349
Y-37 upgrade to Big Y-700 $319
Y-67 upgrade to Big Y-700 $259
Y-111 upgrade to Big Y-700 $229
Big Y or Big Y-500 upgrade to Big Y-700 $189

Note that the upgrades include all of the STR markers as yet untested. For example, the 12-marker to Big Y-700 includes all of the STRs between 25 and 111, in addition to the Big Y-700 itself. The Big Y-700 includes:

  • All of the already discovered SNPs, called Named Variants, extending your haplogroup all the way to the leaf at the end of your branch
  • Personal and previously undiscovered SNPs called Private Variants
  • All of the untested STR markers inclusive through 111 markers
  • A minimum of a total of 700 STR markers, including markers above 111 that are only available through Big Y-700 testing

With the refinements in the Big Y test over the past few years, and months, the Big Y is increasingly important to genealogy – equally or more so than traditional STR testing. In part, because SNPs are not prone to back mutations, and are therefore more stable than STR markers. Taken together, STRs and SNPs are extremely informative, helping to break down ancestral brick walls for people whose genealogy may not reach far back in time – and even those who do.

If you are a male and have not Y DNA tested, there’s never been a better opportunity. If you are a female, find a male on a brick wall line and sponsor a scholarship.

Click here to order or upgrade!

______________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

Mitochondrial DNA Resources – Everything You Need to Know

Mitochondrial DNA Resources

Recently, I wrote a multi-part series about mitochondrial DNA – start to finish – everything you need to know.

I’ve assembled several articles in one place, and I’ll add any new articles here as well.

Please feel free to share this resource or any of the links to individual articles with friends, genealogy groups or on social media.

What the Difference Between Mitochondrial and Other Types of DNA?

Mitochondrial DNA is inherited directly from your matrilineal line, only, meaning your mother’s mother’s mother’s mother – on up your family tree until you run out of direct line mothers that you’ve identified. The great news is even if you don’t know the identities of those people in your tree, you carry their mitochondrial DNA which can help identify them.

Here’s a short article about the different kinds of DNA that can be used for genealogy.

Why Mitochondrial DNA?

Let’s start out with why someone might want to test their mitochondrial DNA.

After you purchase a DNA test, swab, return the kit and when the lab finishes processing your test, you’ll receive your results on your personal page at FamilyTreeDNA, the only company that tests mitochondrial DNA at the full sequence level and provides matching with tens of thousands of other testers.

What About Those Results?

People want to understand how to use all of the different information provided to testers. These articles provide a step-by-step primer.

Mitochondrial DNA personal page update

Sign in to your Family Tree DNA account and use these articles as a guideline to step through your results on your personal page.

We begin with an overview. What is mitochondrial DNA, how it is inherited and why is it useful for genealogy?

Next, we look at your results and decode what all the numbers mean. It’s easy, really!

Our ancestors lived in clans, and our mitochondrial DNA has its own versions of clans too – called haplogroups. Your full haplogroup can be very informative.

Sometimes there’s more than meets the eye. Here are my own tips and techniques for more than doubling the usefulness of your matches.

You’ll want to wring every possible advantage out of your tests, so be sure to join relevant projects and use them to their fullest extent.

Do you know how to utilize advanced matching? It’s a very powerful tool. If not, you will after these articles.

Mitochondrial DNA Information for Everyone

FamilyTreeDNA maintains an extensive public mitochondrial DNA tree, complete with countries of origin for all branches. You don’t need to have tested to enjoy the public tree.

However, if you have tested, take a look to see where the earliest known ancestors of your haplogroup matches are located based on the country flags.

Mitochondrial resources haplotree

These are mine. Where are yours?

What Can Mitochondrial DNA Do for You?

Some people mistakenly think that mitochondrial DNA isn’t useful for genealogy. I’m here to testify that it’s not only useful, it’s amazing! Here are three stories from my own genealogy about how I’ve used mitochondrial DNA to learn more about my ancestors and in some cases, break right through brick walls.

It’s not only your own mitochondrial DNA that’s important, but other family members too.

My cousin tested her mitochondrial DNA to discover that her direct matrilineal ancestor was Native American, much to her surprise. The great news is that her ancestor is my ancestor too!

Searching for Native American Ancestors?

If you’re searching for Native American or particular ancestors, mitochondrial DNA can tell you specifically if your mitochondrial DNA, or that of your ancestors (if you test a direct matrilineal descendant,) is Native, African, European, Jewish or Asian. Furthermore, your matches provide clues as to what country your ancestor might be from and sometimes which regions too.

Did you know that people from different parts of the world have distinctive haplogroups?

You can discover your ancestors’ origins through their mitochondrial DNA.

You can even utilize autosomal segment information to track back in time to the ancestor you seek. Then you can obtain that ancestor’s mitochondrial DNA by selectively testing their descendants or finding people who have already tested that descend from that ancestor. Here’s how.

You never know what you’re going to discover when you test your mitochondrial DNA. I discovered that although my earliest known matrilineal ancestor is found in Germany, her ancestors were from Scandinavia. My cousin discovered that our common ancestor is Mi’kmaq.

What secrets will your mitochondrial DNA reveal?

You can test or upgrade your mitochondrial DNA by clicking here.

______________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research