Ancient DNA Matches – What Do They Mean?

The good news is that my three articles about the Anzick and other ancient DNA of the past few days have generated a lot of interest.

The bad news is that it has generated hundreds of e-mails every day – and I can’t possibly answer them all personally.  So, if you’ve written me and I don’t reply, I apologize and  I hope you’ll understand.  Many of the questions I’ve received are similar in nature and I’m going to answer them in this article.  In essence, people who have matches want to know what they mean.

Q – I had a match at GedMatch to <fill in the blank ancient DNA sample name> and I want to know if this is valid.

A – Generally, when someone asks if an autosomal match is “valid,” what they really mean is whether or not this is a genealogically relevant match or if it’s what is typically referred to as IBS, or identical by state.  Genealogically relevant samples are referred to as IBD, or identical by descent.  I wrote about that in this article with a full explanation and examples, but let me do a brief recap here.

In genealogy terms, IBD is typically used to mean matches over a particular threshold that can be or are GENEALOGICALLY RELEVANT.  Those last two words are the clue here.  In other words, we can match them with an ancestor with some genealogy work and triangulation.  If the segment is large, and by that I mean significantly over the threshold of 700 SNPs and 7cM, even if we can’t identify the common ancestor with another person, the segment is presumed to be IBD simply because of the math involved with the breakdown of segment into pieces.  In other words, a large segment match generally means a relatively recent ancestor and a smaller segment means a more distant ancestor.  You can readily see this breakdown on this ISOGG page detailing autosomal DNA transmission and breakdown.

Unfortunately, often smaller segments, or ones determined to be IBS are considered to be useless, but they aren’t, as I’ve demonstrated several times when utilizing them for matching to distant ancestors.  That aside, there are two kinds of IBS segments.

One kind of IBS segment is where you do indeed share a common ancestor, but the segment is small and you can’t necessarily connect it to the ancestor.  These are known as population matches and are interpreted to mean your common ancestor comes from a common population with the other person, back in time, but you can’t find the common ancestor.  By population, we could mean something like Amish, Jewish or Native American, or a country like Germany or the Netherlands.

In the cases where I’ve utilized segments significantly under 7cM to triangulate ancestors, those segments would have been considered IBS until I mapped them to an ancestor, and then they suddenly fell into the IBD category.

As you can see, the definitions are a bit fluid and are really defined by the genealogy involved.

The second kind of IBS is where you really DON’T share an ancestor, but your DNA and your matches DNA has managed to mutate to a common state by convergence, or, where your Mom’s and Dad’s DNA combined form a pseudo match, where you match someone on a segment run long enough to be considered a match at a low level.  I discussed how this works, with examples, in this article.  Look at example four, “a false match.”

So, in a nutshell, if you know who your common ancestor is on a segment match with someone, you are IBD, identical by descent.  If you don’t know who your common ancestor is, and the segment is below the normal threshold, then you are generally considered to be IBS – although that may or may not always be true.  There is no way to know if you are truly IBS by population or IBS by convergence, with the possible exception of phased data.

Data phasing is when you can compare your autosomal DNA with one or both parents to determine which half you obtained from whom.  If you are a match by convergence where your DNA run matches that of someone else because the combination of your parents DNA happens to match their segment, phasing will show that clearly.  Here’s an example for only one location utilizing only my mother’s data phased with mine.  My father is deceased and we have to infer his results based on my mother’s and my own.  In other words, mine minus the part I inherited from my mother = my father’s DNA.

My Result My Result Mother’s Result Mother’s Result Father’s Inferred Result Father’s Inferred Result
T A T G A

In this example of just one location, you can see that I carry a T and an A in that location.  My mother carries a T and a G, so I obviously inherited the T from her because I don’t have a G.  Therefore, my father had to have carried at least an A, but we can’t discern his second value.

This example utilized only one location.  Your autosomal data file will hold between 500,000 and 700,000 location, depending on the vendor you tested with and the version level.

You can phase your DNA with that of your parent(s) at GedMatch.  However, if both of your parents are living, an easier test would be to see if either of your parents match the individual in question.  If neither of your parents match them, then your match is a result of convergence or a data read error.

So, this long conversation about IBD and IBS is to reach this conclusion.

All of the ancient specimens are just that, ancient, so by definition, you cannot find a genealogy match to them, so they are not IBD.  Best case, they are IBS by population.  Worse case, IBS by convergence.  You may or may not be able to tell the difference.  The reason, in my example earlier this week, that I utilized my mother’s DNA and only looked at locations where we both matched the ancient specimens was because I knew those matches were not by convergence – they were in fact IBS by population because my mother and I both matched Anzick.

ancient compare5

Q – What does this ancient match mean to me?

A – Doggone if I know.  No, I’m serious.  Let’s look at a couple possibilities, but they all have to do with the research you have, or have not, done.

If you’ve done what I’ve done, and you’ve mapped your DNA segments to specific ancestors, then you can compare your ancient matching segments to your ancestral spreadsheet map, especially if you can tell unquestionably which side the ancestral DNA matches.  In my case, shown above, the Clovis Anzik matched my mother and me on the same segment and we both matched Cousin Herbie.  We know unquestionably who our common ancestor is with cousin Herbie – so we know, in our family line, which line this segment of DNA shared with Anzick descends through.

ancient compare6

If you’re not doing ancestor mapping, then I guess the Anzick match would come in the category of, “well, isn’t that interesting.”  For some, this is a spiritual connection to the past, a genetic epiphany.  For other, it’s “so what.”

Maybe this is a good reason to start ancestor mapping!  This article tells you how to get started.

Q – Does my match to Anzick mean he is my ancestor?

A – No, it means that you and Anzick share common ancestry someplace back in time, perhaps tens of thousands of years ago.

Q – I match the Anzick sample.  Does this prove that I have Native American heritage? 

A – No, and it depends.  Don’t you just hate answers like this?

No, this match alone does not prove Native American heritage, especially not at IBS levels.  In fact, many people who don’t have Native heritage match small segments?  How can this be?  Well, refer to the IBS by convergence discussion above.  In addition, Anzick child came from an Asian population when his ancestors migrated, crossing from Asia via Beringia.  That Eurasian population also settled part of Europe – so you could be matching on very small segments from a common population in Eurasia long ago.  In a paper just last year, this was discussed when Siberian ancient DNA was shown to be related to both Native Americans and Europeans.

In some cases, a match to Anzick on a segment already attributed to a Native line can confirm or help to confirm that attribution.  In my case, I found the Anzick match on segments in the Lore family who descend from the Acadians who were admixed with the Micmac.  I have several Anzick match segments that fit that criteria.

A match to Anzick alone doesn’t prove anything, except that you match Anzick, which in and of itself is pretty cool.

Q – I’m European with no ancestors from America, and I match Anzick too.  How can that be?

A – That’s really quite amazing isn’t it.  Just this week in Nature, a new article was published discussing the three “tribes” that settled or founded the European populations.  This, combined with the Siberian ancient DNA results that connect the dots between an ancient population that contributed to both Europeans and Native Americans explains a lot.

3 European Tribes

If you think about it, this isn’t a lot different than the discovery that all Europeans carry some small amount of Neanderthal and Denisovan DNA.

Well, guess what….so does Anzick.

Here are his matches to the Altai Neanderthal.

Chr Start Location End Location Centimorgans (cM) SNPs
2 241484216 242399416 1.1 138
3 19333171 21041833 2.6 132
6 31655771 32889754 1.1 133

He does not match the Caucasus Neanderthal.  He does, however, match the Denisovan individual on one location.

Chr Start Location End Location Centimorgans (cM) SNPs
3 19333171 20792925 2.1 107

Q – Maybe the scientists are just wrong and the burial is not 12,500 years old,  maybe just 100 years old and that’s why the results are matching contemporary people.

A – I’m not an archaeologist, nor do I play one…but I have been closely involved with numerous archaeological excavations over the past decade with The Lost Colony Research Group, several of which recovered human remains.  The photo below is me with Anne Poole, my co-director, sifting at one of the digs.

anne and me on dig

There are very specific protocols that are followed during and following excavation and an error of this magnitude would be almost impossible to fathom.  It would require  kindergarten level incompetence on the part of not one, but all professionals involved.

In the Montana Anzick case, in the paper itself, the findings and protocols are both discussed.  First, the burial was discovered directly beneath the Clovis layer where more than 100 tools were found, and the Clovis layer was undisturbed, meaning that this is not a contemporary burial that was buried through the Clovis layer.  Second, the DNA fragmentation that occurs as DNA degrades correlated closely to what would be expected in that type of environment at the expected age based on the Clovis layer.  Third, the bones themselves were directly dated using XAD-collagen to 12,707-12,556 calendar years ago.  Lastly, if the remains were younger, the skeletal remains would match most closely with Native Americans of that region, and that isn’t the case.  This graphic from the paper shows that the closest matches are to South Americans, not North Americans.

anzick matches

This match pattern is also confirmed independently by the recent closest GedMatch matches to South Americans.

Q – How can this match from so long ago possibly be real?

A – That’s a great question and one that was terribly perplexing to Dr. Svante Paabo, the man who is responsible for producing the full genome sequence of the first, and now several more, Neanderthals.  The expectation was, understanding autosomal DNA gets watered down by 50% in every generation though recombination, that ancient genomes would be long gone and not present in modern populations.  Imagine Svante’s surprise when he discovered that not only isn’t true, but those ancient DNA segmetns are present in all Europeans and many Asians as well.  He too agonized over the question about how this is possible, which he discussed in this great video.  In fact he repeated these tests over and over in different ways because he was convinced that modern individuals could not carry Neanderthal DNA – but all those repeated tests did was to prove him right.  (Paabo’s book, Neanderthal Man, In Search of Lost Genomes is an incredible read that I would highly recommend.)

What this means is that the population at one time, and probably at several different times, had to be very small.  In fact, it’s very likely that many times different pockets of the human race was in great jeopardy of dying out.  We know about the ones that survived.  Probably many did perish leaving no descendants today.  For example, no Neanderthal mitochondrial DNA has been found in any living or recent human.

In a small population, let’s say 5 males and 5 females who some how got separated from their family group and founded a new group, by necessity.  In fact, this could well be a description of how the Native Americans crossed Beringia.  Those 5 males and 5 females are the founding population of the new group.  If they survive, all of the males will carry the men’s haplogroups – let’s say they are Q and C, and all of the descendants will carry the mitochondrial haplogroups of the females – let’s say A, B, C, D and X.

There is a very limited amount of autosomal DNA to pass around.  If all of those 10 people are entirely unrelated, which is virtually impossible, there will be only 10 possible combinations of DNA to be selected from.  Within a few generations, everyone will carry part of those 10 ancestor’s DNA.  We all have 8 ancestors at the great-grandparent level.  By the time those original settlers’ descendants had great-great-grandparents – of which each one had 16, at least 6 of those original people would be repeated twice in their tree.

There was only so much DNA to be passed around.  In time, some of the segments would no longer be able to be recombined because when you look at phasing, the parents DNA was exactly the same, example below.  This is what happens in endogamous populations.

My Result My Result Mother’s Result Mother’s Result Father’s Result Father’s  Result
T T T T T T

Let’s say this group’s descendants lived without contact with other groups, for maybe 15,000 years in their new country.  That same DNA is still being passed around and around because there was no source for new DNA.  Mutations did occur from time to time, and those were also passed on, of course, but that was the only source of changed DNA – until they had contact with a new population.

When they had contact with a new population and admixture occurred, the normal 50% recombination/washout in every generation began – but for the previous 15,000 years, there had been no 50% shift because the DNA of the population was, in essence, all the same.  A study about the Ashkenazi Jews that suggests they had only a founding population of about 350 people 700 years ago was released this week – explaining why Ashkenazi Jewish descendants have thousands of autosomal matches and match almost everyone else who is Ashkenazi.  I hope that eventually scientists will do this same kind of study with Anzick and Native Americans.

If the “new population” we’ve been discussing was Native Americans, their males 15,000 year later would still carry haplogroups Q and C and the mitochondrial DNA would still be A, B, C, D and X.  Those haplogroups, and subgroups formed from mutations that occurred in their descendants, would come to define their population group.

In some cases, today, Anzick matches people who have virtually no non-Native admixture at the same level as if they were just a few generations removed, shown on the chart below.

anzick gedmatch one to all

Since, in essence, these people still haven’t admixed with a new population group, those same ancient DNA segments are being passed around intact, which tells us how incredibly inbred this original small population must have been.  This is known as a genetic bottleneck.

The admixture report below is for the first individual on the Anzick one to all Gedmatch compare at 700 SNPs and 7cM, above.  In essence, this currently living non-admixed individual still hasn’t met that new population group.

anzick1

If this “new population” group was Neanderthal, perhaps they lived in small groups for tens of thousands of years, until they met people exiting Africa, or Denisovans, and admixed with them.

There weren’t a lot of people anyplace on the globe, so by virtue of necessity, everyone lived in small population groups.  Looking at the odds of survival, it’s amazing that any of us are here today.

But, we are, and we carry the remains, the remnants of those precious ancestors, the Denisovans, the Neanderthals and Anzick.  Through their DNA, and ours, we reach back tens of thousands of years on the human migration path.  Their journey is also our journey.  It’s absolutely amazing and it’s no wonder people have so many questions and such a sense of enchantment.  But it’s true – and only you can determine exactly what this means to you.

______________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

One Chromosome, Two Sides, No Zipper – ICW and the Matrix

ZipperThe questions I’ve received most often since the release of the new Family Finder Matrix from Family Tree DNA has to do with matches.  Specifically, what the “In Common With” feature is telling you versus what the Family Finder “Matrix” is telling you and how to utilize all of this information together.  At the bottom of this confusion is often a fundamental lack of understanding of how matching occurs and what it means in different contexts.

Let’s talk about this, step by step.

The “in common with” function (called triangulation for a few weeks, but now labeled “run common matches” ) shows you every person that you and one of your matches, match with in common.  I’ll be running this option for my matches with cousin David, shown below.

zipper 1

Here’s an example of my matches in common with my cousin, David.

Zipper 2

The Family Finder Matrix takes this information a bit further and shows you whether or not the people involved with this match, match each other as well.

In this case, I happen to know that my cousins Harold, Carl and Dean will match each other on my father’s side, as will my cousin David.  Warren doesn’t have firm genealogy, but from this, we can tell that he is indeed connected to this family group because he matches me, David, Harold and Carl, but not Dean and not Nova.  We have no idea how Nova connects to this line, if she does.  Notice that Nova does not match any of the other people in this group in the matrix below.  That means that my and David’s common ancestor with her is likely not from this same ancestral line shared by Harold, Carl and Dean.

zipper 3

From this point forward, I would drop back to my trusty downloaded full match spreadsheet that I maintain to see if indeed any of these people match me and my known cousins on the same segments.  If so, that confirms a family/ancestor relationship.   On the snipped from my spreadsheet below, you can see that Warren indeed matches both Buster and David and I, but not on the same segments.  Nova didn’t match any grouping on the same segments.  However, Buster and David both match me on the same portion of chromosome 19, so this confirms that we do share a common ancestor.  In this case, we also know, from our genealogy that the common ancestor is Lazarus Estes and wife, Elizabeth Vannoy.  Based on our multiple cousin matches, we can say that Warren is somehow connected to this line, but we can’t say how.

Zipper 4

I’ve had comments like “I have everything I need on my spreadsheet – I can see where all of my matches match me.”  And indeed, you can, but it’s not everything you need.  Here’s why.

Without additional information, you can’t tell, by just looking at your spreadsheet whether two people who match you on the same segment are matching on your Mom or Dad’s side.  For example, above, I know that both David and Buster are from my Dad’s line, but if I didn’t know that, one of them could be from Mom’s line and one could be from Dad’s, and while they are both related to me, on the same chromosome, they would, in that case, not be related to each other.  So, my spreadsheet of matches tells me clearly THAT people match me, and where, but it doesn’t tell me HOW or on which side.  For that, I need additional tools like ICW, the Matrix and plain old genealogy research.

This is the fundamental concept of matching and in a nutshell, why it’s so difficult.

Every Chromosome Has Two Sides

There are two sides to every chromosome, Mom’s side and Dad’s side.  Except nature has played a cruel trick on us and not installed a zipper.  There are no Mom and Dad labels.  There is no dividing that DNA or those matches in half magically, except by determing who they match, and how they do or don’t match each other.

When we match ourselves against our parents, for example, we then know immediately which half of our DNA came from which parent, but if you don’t have any parents available to match against, then you have to use genealogy or cousin matches to figure that out.

I talk about that in the Chromosome Mapping aka Ancestor Mapping article.

I’m going to use spreadsheets as examples here.  It think they are easier to see and understand, plus, I can manipulate them easily to reflect different situations.

Example 1 – The Very Basics of Matching

At each DNA location, or address, you have two alleles, one from each parent.  These alleles can have one of 4 values, or nucleotides, at each location, represented by the abbreviations T, A, C and G, short for Thymine, Adenine, Cytosine and Guanine.  That’s it, you’re done with all the science words now, so keep reading:)

On any given chromosome, from locations 1-20, you have the following DNA, in our example.

From Mom, you received all As and from Dad, all Cs.  You know that because I’m telling you, but remember, the matching software doesn’t know that because there is no zipper in your DNA.  All the software sees are that you have both an A and an C in location 1 and either an A or C is considered a match.

Zipper 5

In fact, this is what the software sees.  Be aware that in this case, AC=CA.

Zipper 6

Easy so far, right?

Example Two – Mom’s Known Cousin and Dad’s Known Cousin

Now you have two cousins, Mary and Myrtle.  You know, from having known them all of your life and sharing lots of Thanksgiving turkey that they are your family and you know clearly which side of your family they descend from.  Both of your cousins, Mary and Myrtle match you at the same locations on this chromosome, from 5-15.

But Mary is your mother’s cousin, and Myrtle is your Dad’s cousin.  So even though they both match you on the same exact chromosome and the same location, they do not match each other.  Well, let’s put it this way, if they also match each other, then you have an entirely different family genetic genealogy problem, called endogamy, and yes, you might be your own grandpa…but I digress.  But we’re going to assume for this discussion that your mother and father are not related to each other and do not share common ancestors.

Zipper 7

Still easy, right?

Example Three – An Unknown Cousin

Next, we have Martha.  You don’t know Martha, and you don’t know how she is related, but she obviously is.  Martha matches you, but she does not match Myrtle at all, and she doesn’t match Mary on enough overlapping chromosomes to be considered a match to her.  You can see their common match here between Mary and Martha in location 5.  In this case, as it turns out, Martha IS a cousin to Mary on Mom’s side, but we can’t tell that from this information because they don’t match in enough common locations to be above the matching threshold.  With this information, you can’t draw any conclusions.  You will have to wait to see who else Martha matches and look on your spreadsheet to see if Martha matches any of your known cousins and you on common segments which would confirm a common ancestor.  Your download spreadsheet will contain much more detailed information because once you match on any segment above the match threshold of about 7.7cM (plus a few other factors,) all matching segments of 1cM or above are downloaded – so you have a lot of information to work with.

But using both the ICW and matrix tools, Mary might cluster with other cousins on Mom’s side which would provide us with clues as to her relationship.  In fact, the first thing I’d do is to run an ICW with Mary and then utilize the Matrix tool to further define those relationships.

Zipper 8

Still not difficult.

Example Four – A “False Match”

Next we have Jeremy who is also a match to you.

Zipper 9

If you look at how Jeremy matches, you can see that he is actually matching on both sides, Mom’s and Dad’s side, but randomly.  Technically, he is a match to you, because he does match one or the other of your nucleotides at each location, A or C, but without a zipper, we have no idea HOW that DNA is divided in you between Mom and Dad.  In other words, the software doesn’t know that Mom was all A and Dad was all C, unless we’ve phased the data against your parents AND the software knows how to utilize that information.

However, if your parents are one of your matches, you can immediately see which side the match falls on, if either.  In this case, Jeremy doesn’t fall on either side because he is simply a circumstantial match, also known as a match my convergence or a false match.  This is also called IBS, or identical by state, as opposed to IBD, identical by descent.  The smaller the segment you show as a match, especially if there is no clustering, the more likely the match is to be IBS instead of the genealogically desirable IBD.

When people ask how someone can match a child but not a parent, this is the answer.  He matches you on 11 segments, circumstantially, but he only matches your parents on 5 and 6 segments, respectively, which often (but not always) puts him under the matching threshold.  Jeremy may also match Mary, depending on the thresholds.

This is also how someone can match in the “in common with” tool, but not be a match to anyone on the match list in the Matrix.  In fact, this is the power of these multiple tools.

This also doesn’t mean this match is entirely useless, because you DO match.  It may simply not be relevant genealogically.  In “The Autosomal Me” series, I’ve utilized very small match segments that in fact very probably ARE reflective of a common population and not of recent ancestry.  In my Native American research, this is exactly what I was looking for.  You may not be able to utilize this information today, but don’t entirely discount it either.  Just set it aside and move on to a more productive match.

Example Five – Common Matches, Different Ancestors

This situation provides clues, but no proof.

Mary and Joyce both match me on Mom’s segments, but they do not match each other.  They don’t match me on the same segments, so this indicates that they are probably from different ancestors in my Mother’s lines.  As more matches appear, the clusters of people and their genealogy will make this more apparent.

In order to determine which ancestors, I’ll need to work on the genealogy of both Mary and Joyce and see who else they also match on the same segments.  Sometimes the secret of the genealogy match is in the genealogy research or descent of your matches.

Zipper 10

Example Six – Clusters of Cousins

In this example, no one matches Dad, so he’s just out for now.  Susie and Mary match mom on the same segment, which proves that the three of these people share a common ancestor.  Mom and Joyce match each other too, but Joyce doesn’t match Mary and Susie, so they won’t cluster together on the matrix.  However, on the ICW tool, all three women, Joyce, Mary and Susie will match me and Mom.

Using the ICW tool if I were to ICW with Mom, you would see this list:

  • Joyce
  • Mary
  • Susie

The question then becomes, are Joyce, Mary and Susie related to each other, or not.  If so, and to me and Mom, then that indicates a common ancestor within the match group, like me, Joyce and Mom.  The second group doesn’t match the first group – me, Mary, Mom and Susie.  Using these tools together, these people clearly fall into two match groups, the green and blue on the spreadsheet below.  But remember, the match routine doesn’t know which side your As and Cs came from.  All it knows is that you match these people.  But based on these groups and my download spreadsheet common segment matches, I can tell that I’m working with two ancestral lines.

Zipper 11

My matrix for these people would look like this:

Zipper 12

My master matching spreadsheet would now look like this.

zipper 13

When we started, all I would have been able to see is that all of these people matched Mom and Dad and I on the same segments. By utilizing the various tools, I was able to sort into groups and eventually, subgroups.

In fact, you can see below that within Mom’s pink group, there is also the smaller cluster of Mary, Susie, me and Mom.

Zipper 14

For Jeremy and Martha, we can’t do any more right now, so I’ve recorded what we do know and set them aside.

Here, you can see the matches sorted by chromosome, start and end segment.

zipper 16

It looks a lot different than where we started, shown below, when all we had was a list of people who matched each other with no additional information.  We’ve added a lot!

zipper 17

In Summary – Creating the Zipper

So, where are we with this?

By utilizing all of the tools at your disposal, including the ICW tool, the Family Finder Matrix, your matching spreadsheet and your genealogical information, you’re in essence creating that zipper that divides half of your DNA into Mom’s side and Dad’s side.  Then into grandma’s and grandpa’s side, and on up the pedigree chart.

Each of these tools can tell you something unique and important.

The ICW tool tells you who matches you and another person, in common.  It doesn’t tell you if they also match each other.  This tool can provide extremely important clustering information.  For example, if I see unknown cousin Martha clustered with a whole group of known Estes descendants, then that’s a pretty good clue about how I’m related to Martha.  If, on the other hand, I find Martha clustered with people from both sides of my family, well, my Mom and Dad just might be related to each other or their ancestors went to or came from the same places.

By utilizing the Matrix tool, I can tell which of my matches are actually matching each other too, so that puts Martha in a much smaller group, or maybe eliminates her from certain groups.

By then utilizing my downloaded match spreadsheet, on which I record every known tidbit of genealogy information, even generalities like, “family from NC” if that’s the best I can get, I can then see where Martha matches me and others on the same segments, and based on the information in the ICW and the Matrix and my genealogy info, I may be able to slot Martha into a family group.  On a great day – I’ll be able to be more specific and tell her which family group – like we were able to do with my newly found cousin, Loujean.

So, I hope you’ve enjoyed learning how to install a chromosome zipper.  Now you can happily go about unzipping all of that genealogy information held in your DNA, that piece by piece, we’re slowing revealing.

zipper final

______________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

Chromosome Mapping aka Ancestor Mapping

This article really should be called “Identifying Prodigal Great-Grandpa by Ancestor Mapping Your Chromosomes,” because that’s what we’re going to be doing.  It’s fun to map your ancestors to your chromosomes, but there is also a purpose and benefit to be derived.  So you can have guilt-free fun because you’re being productive too!  Oh, and yes, you can work on finding Prodigal Great-Grandpa.

I constantly receive questions similar to this:

“How can I find the identity of my mother’s mother’s father?  My great-grandmother went to her grave with this secret.  That’s one eighth of my ancestry.  What can I do?  How can I find out?”

The answer is that it’s not easy, but it is sometimes possible.  Note the word sometimes.  A good part of the definition of “sometimes” is how willing you are to do the requisite work and if you are lucky or not.  Luck favors those who work hard.  And let’s face it, you’ll never know if you don’t try.  I mean, Prodigal Great-Grandpa is not going to text you from the other side with his name and date of birth.

What we’re going to do is basically work through a process of elimination.  The term for what we are going to do is called chromosome mapping your ancestors or more simply, chromosome mapping or ancestor mapping. In essence, you are going to map your own chromosomes based on which ancestor contributed that part of your DNA.

I have simplified this process greatly in order to explain the concept in a way you can easily follow.  I’m going to use my own pedigree chart as an example.  We’ll pretend we don’t know the identity of Curtis Benjamin Lore.  And yes, for those of you wondering, all of these people are deceased.

Mapping pedigree chart

I realize that you are going to have more than the 32 autosomal matches shown on my example spreadsheet.  You’re also not going to be able to find common ancestry with many of your matches due to things like dead ends, incorrect ancestry, segments identical by state (IBS) or DNA that comes from older ancestors that is not recognizable today after name changes in many generations when descended through females.  There are lots of reasons why you might not be able to find genealogy matches.  It’s the other matches, the ones where you can decipher and determine your common ancestor that help a great deal, and that is where we’ll focus.  These are the ones that matter and the keys to identifying Prodigal Great-Grandpa.

In my example here, we live in a perfect world.  We are looking to map the DNA of my 8 great-grandparents in order to figure out the identity of mother’s mother’s father.  Of course, there is no Y-DNA to test in this instance, so we must rely on autosomal DNA.  Ok, so maybe it’s not such a perfect world.  In a perfect world, you’d be a male trying to find the identity of your father’s father’s father and you could test your own Y-DNA – but then we wouldn’t have a good story nor would we need autosomal DNA.  And most people aren’t that fortunate.

Three generations isn’t that far back – or four – if you count yourself as the first generation.  If you’re quite lucky, you can test one or both of your parents, and maybe even a grandparent or great-aunt or uncle.  Failing that, you should be able to find some cousins from your various lines to test.  This entire exercise will be much MUCH easier for you if you can test multiple people descended from each of the 4 couples involved because you’ll be able to tell which lines your matches do, and don’t, match based on which cousins they also match.  Take DNA test kits to family reunions!

Obviously, you won’t be able to test anyone directly descended from your unknown great-grandfather, except perhaps his children.  The more of his children you can test, either directly or through their children, if deceased, the better your chance of identifying your Prodigal Great-Grandfather because each child inherits some different DNA from their parents.  In my case, we’re going to presume that there are no other known children, other than my grandmother.  So how do we find Prodigal Great-Grandpa?

First, download all of your matches with corresponding segment data from your testing vendor, either 23andMe or Family Tree DNA, into a spreadsheet.  Ancestry does not allow you to do this, which is a significant drawback in terms of testing at Ancestry.  You can do this today at 23andMe and at Family Tree DNA most easily by utilizing www.dnagedcom.com download software.  You can also do this directly at Family Tree DNA on the Chromosome Browser page.

Your spreadsheet will look something like this, but without the colors.  That’s what you’ll be adding, along with the Common Ancestor column.

Mapping spreadsheet

Step 1 – Identify a common ancestor with those individuals you match on common DNA segments.  This is really two steps, the common ancestor part, and the common DNA segment part.  If these people are on your match list, we already know you have a common DNA segment over the vendor’s match threshold.  The presumption here is that if you have 3 people that match on the same segment from the same ancestor, that’s a confirmed “yes” that this particular DNA segment is descended from that ancestor.  You can also label these with only two confirmed descendants from the same ancestor, but I like to see three to be sure, especially if here is any doubt whatsoever that you’re dealing with the same ancestral family.  For example, if you are dealing with 2 people who carry the same surname from the same location, but you can’t quite find the common ancestor – you’ll need 3 matches to identify this segment.

In this case, I was able to test cousins so I know that on chromosome 1, Sue, Joe and John all match me on the same segment and they are all descended from Lazarus Estes.  I know this because one of them descends from Lazarus Estes and his wife, Elizabeth Vannoy, but the other two, Joe and John descend from an Estes upstream of Lazarus, let’s say, his father, John Y. Estes, through another child, which allows me to positively identify this segment as coming not just from the couple, Lazarus Estes and Elizabeth Vannoy, but from Lazarus specifically.

I’ve colored this segment mustard to represent Lazarus and so that you can visually see the difference between the 8 ancestors we’re working with.

2.  Repeat the same process with your other matches, hopefully utilizing cousins, to identify DNA segments of your other ancestors.  I’m only showing a very small subset of all of my DNA on my spreadsheet, and all matches are the exact same 10,000 cM blocks and only on one chromosome, for illustration purposes, but as you work through your matches, you’ll be able to color more and more of your DNA and assign it to different ancestral couples.  Each of your chromosomes will have different colors as different parts of each chromosome come from different ancestors.

Kitty Cooper released a tool to utilize AFTER you do this hard grunt-work part that paints a pretty picture of your ancestors mapped on to your various chromosomes.  Here’s her example.  Notice that each chromosome has 2 sides, Mom’s and Dad’s inheritance side.  We’re going to use that to our advantage and it’s one facet of how we’re going to find Prodigal Great-Grandpa .

mapping kitty cooper

In my case (not this example), I have several segments that I can’t identify to a particular couple, but I can assign it to a group.  This is my Acadian group and is terribly admixed because of extensive intermarriage.  I also have a “Mennonite” segment labeled in the same way for the same reason.  So while I don’t know specifically who, I do know where and that helps a lot too.  But in our perfect world in our example, we don’t have any of that.

3. Now that I have most of my genome colored in and assigned to ancestors, except for Prodigal Great-Grandpa, I can see where all new matches fall.  Let’s say I get a new match on chromosome 1 in the segments between 10,000 and 20,000 and they also match Sue, Joe and John.  Even if the new match is an adoptee and has no genealogy, I can tell them which line they descend from.  And let me tell you, there is no greater gift.  This is exactly how we told new cousin Loujean she descended from the Younger line.

However, if someone matches me on this chromosome 1 segment but NOT Sue, Joe and John, since Sue,Joe, John and I all match on the entire segment from 10,000-20,000, then the new match has to be matching me on my other parent’s side (or is IBS – identical by state, a circumstantial match.)  Never forget that you have two “sides” to each chromosome – Mom’s and Dad’s (except for the X chromosome in males which we are not addressing here.)

4.  The only part of my match spreadsheet left uncolored, since this is a perfect world, would be the part that would probably come from my Prodigal Great-Grandfather.  So let’s look at chromosome 8 and map it.

What we don’t know, and have to determine, is whether or not some of these parts of chromosome 8 really belong to ancestors identified in color above.  However, remember that we are dealing with fairly close matches, only 3 generations, and in some cases, only 2 generations, depending on which cousins tested.  So let’s say you found several cousins to test because grandma had a large family.  Based on the test results of several of your aunts and uncles along with other people descended from great-grandma’s ancestral lines, you are able to map most of the DNA of your great-grandmother.  In this case, we mapped this segment of chromosome 8 to my three cousins, Derrell, Darrell and Daryl.  (Yes, I really do have those cousins.)

The result is that now I have 8 matches that do match me, and based on other cousin matches, do descend from Great-Grandma/Great-Grandpa but don’t match the Derrell trio indicating Great-Grandma’s line.  What this tells me is that the people who aren’t assigned, because they don’t match my cousins Derrell, Daryl and Darrel, or any other distant groups, must then be from Prodigal Great-Grandpa’s side or are “problem matches.”  Problem matches are those that are IBS (Identical by State) or have a technical issue and we’re not going to deal with that here, because this is a perfect world and we’re only concerned with people whose genealogy we have and that match each other.  By this definition, problem matches are automatically eliminated.  So let’s look at the 8 people above who don’t match me or the Derrell cousins on Great-Grandma’s/Great-Grandpa’s side, beginning with Bobbi and ending with Isabel.

5.  Now we turn to genealogy.  We know that these 8 people all share a common ancestral line with Prodigal Great-Grandpa, we just don’t know who that is.  Let’s say that of this group, we discover that Bobbi, Harold and Buster are all related to each other, and glory be, they all know who their common ancestor is, or at least the common ancestral line.  Let’s say that Bobbi and Buster are first cousins in the Lore line and that Harold matches them closely as well, but he is descended from a Lore ancestor further upstream from Bobbi and Buster.  Therefore, we can now say, positively, that Prodigal Great-Grandpa descended somehow from the Lore line.

We still don’t know how Sarah, Ronald, Garret, Nina and Isabel connect to Prodigal Great-Grandpa, and that’s OK.  We can simply leave them uncolored for now.  We can select a color for Bobbi, Harold and Buster and assign then to Prodigal Great-Grandpa who descends from the Lore line.

Mapping PGG Lore

6.  Now it’s time for that luck to kick in.  We don’t know that Prodigal Great-Grandpa carried the surname Lore.  His mother could have been a Lore, or any of his ancestors.  All we have is a common surname and a common ancestor between three people who all match me on the same segment.  So, let’s assemble a tree of our cousins to see if we can narrow the scope of maybe who and where and then let’s get busy with the census and other records.  Geography is important.  Begatting requires proximity and many times, we can find the begatter in the neighborhood.  Also, check your genealogy software data base for this surname.  You may find the surname in an allied line.  Remember, families married their neighbors and often intermarried as well.

Sure enough, look there, in our perfect world, we discover that Nora Kirsch is working in her parents inn named the Kirsch House on the Ohio River in 1880.  The Kirsch House was also a boarding house, and a restaurant and pub.  One of their boarders in 1880 was none other than Benjamin Lore.  Hmmm.  Surely makes you wonder.  Further research on Benjamin Lore shows that he was a wildcat oilfield well driller working in the county where Nora lived and became something of a local legend for discovering the “Blue Lick” water well.  Well, now we have a name, proximity and maybe an opportunity.

7.  Well, peachy, but what next?  Further research on Benjamin Lore shows that he was married in the census, but where was his wife?  In previous census records, we find Benjamin Lore in Warren County, PA with his parents.  In the Warren County records, we find that he married Mary Bills, and additional research shows in 1880 a Mary Lore with 2 children, but no husband.  Court records show they later divorced, with 4 children.  Find those children!!!  They are the key to confirming the identity of Benjamin Lore as Prodigal Great-Grandpa.  If Benjamin’s other children had children about the same time as grandmother, each line should have 3 generations between Benjamin and the current generation.  Benjamin’s great-grandchildren through his first wife would be half-second cousins to me which would be the same as second cousins once removed.  They of course would be a generation closer to my mother whose DNA I also happen to have.

ISOGG has a wonderful Autosomal DNA Statistics page, and here you can see that second cousins once removed would share about 1.5% of their DNA in what is hopefully a large enough segment to match some of the cousins that have already tested.   My mother’s generation, first cousins once removed would share approximately 6.25%.

Mapping cousin chart

Benjamin’s descendants through his first wife may not match all of my cousins, but they will, hopefully, match some of the descendants of Prodigal Great-Grandpa, confirming, as best we can, that Benjamin Lore was grandmother’s father.  The best litmus test of course is how closely they would match the closest generations, like mother or great-aunts/uncles, if they were living.

Full Disclosure Note:  I used my own ancestors for purposes of illustration, even though Curtis Benjamin Lore (shown at right) was not prodigal in quite the way I portrayed in this article, well, at least not from my family’s perspective.  However, he was no saint Lore, CBeither and he may well have other descendants looking for him in this exact situation.  Aside from what we do know, there is the rumor of an illegitimate son showing up on his widow’s doorstep looking for him, albeit, a little too late.  We know that Curtis Benjamin (known as C.B.) Lore did marry Nora Kirsch in Dearborn County, Indiana, in 1888.  These photos are their “wedding photos” but interestingly, there is no photo of them together.

We also know that Curtis Benjamin Lore married Mary Bills in Warren County, PA., had four Lore children, 3 males (Sid, John Curtis and Herbert Judson Lore) and one female (Maud who married a Hendrickson), none of whom we have never been able to find.  Also, Curtis Benjamin Lore was not divorced from Mary until, ahem, after he was married to Nora Kirsch when Mary filed for divorce on the grounds of desertion.Kirsch, Nora

Apparently, his marriage to Nora Kirsch (pictured at right) fell, literally, according to the secret family story, into the “shotgun” category, so one has to understand that his choice of marriage versus death was fairly defensible.  I’m sure Nora’s father, a crusty old Civil War veteran, had no idea that he was already married or Curtis Benjamin would have been on the business end of that shotgun and marriage would not have been a choice.

The family took great care that this “uncomfortable” shotgun marriage situation never be discovered, to the point of falsifying the marriage date in the family Bible and also by “adjusting” the birth of the child by a year, also recorded incorrectly in the family Bible.  Were it not for the fact that I checked the church records in Dearborn County, I would never have discovered the discrepancy.  A child cannot be baptized months before it is born.  I might note that it was only AFTER this discovery that my mother was forthcoming with the “family secret” about the shotgun wedding.  Birth certificates were not issued at that time and my grandmother’s delayed birth certificate was issued based on the falsified family Bible information.

Benjamin probably would not have been bothered by this revelation at all, given what we know about him, but I’m sure Nora’s parents rolled over in their grave once or twice when I made the discovery and now that I’m, ugh, discussing it, and publicly at that.

Rogues and handsome scoundrels.  They are colorful and interesting aren’t they and provide a great amount of spice for family stories.  Hopefully these tools will help you find yours!!!

______________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

First Iceland, Now the Faroe Islands

Faroe island capital

Sometimes it takes a crisis or a tragedy to spur a revolution.  That’s what has happened in the Faroe Islands.

In the 1990s, deCODE Genetics began the process of creating the world’s first population-wide biobank of genetic information by collecting the DNA of all residents of a confined geographic population.  They approached the Faroe Island, which at that time declined, and deCODE went on to proceed with the population of Iceland.  Unfortunately, deCODE eventually declared bankruptcy and was recently purchased, but the genomics revolution had begun and continues, ironically, in the Faroe Islands.

In Discovery Magazine’s recent article, “Faroe Island Aim to Sequence Genes of Entire Country,” they detail the plans for sequencing the genes of the entire population of 50,000 Faroe Islanders.

Faroe islands

Because of the isolation of the island, in the north Atlantic between Norway and Iceland, the residents have been marrying each other for generations, creating a highly endogamous population.  With few new genes being introduced, the existing genes get passed around, and around, and around.  This causes a very high incidence of some genetically transmitted diseases, and little known CTD, or carnitine transporter deficiency, is among them.

This genetic timebomb is also what spurred the Faroes to action, after the death of a young man, Edmund Jensen, and his family members, from this genetic mutation.

Termed FarGen, this project is leading the way on many fronts.  Questions of ethics, of responsibility, of liability and of privacy will all have to be addressed as this project unfolds, but the project holds the potential for life-changing discoveries on the medical front that will benefit not only Faroe Islanders, but many of the rest of us too.

______________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research